Text
                    Finite Elements: An Introduction
J. Tinsley Oden
Finite elements; perhaps no other family of approximation methods has had
a greater impact on the theory and practice of numerical methods during the
twentieth century. Finite element methods have now been used in virtually every
conceivable area of engineering that can make use of models of nature characterized
by partial differential equations. There are dozens of textbooks, monographs,
handbooks, memoirs, and journals devoted to its further study; numerous
conferences, symposia, and workshops on various aspects of finite element
methodology are held regularly throughout the world. There exist easily over one
hundred thousand references on finite elements today, and this number is growing
exponentially with further revelations of the power and versatility of the method.
Today, finite element methodology is making significant inroads into fields in which
many thought were outside its realm; for example, computational fluid dynamics. In
time, finite element methods may assume a position in this area of comparable or
greater importance than classical difference schemes which have long dominated the
subject.
Why finite elements?
A natural question that one may ask is: why have finite element methods been so
popular in both the engineering and mathematical community? There is also the
question, do finite element methods possess properties that will continue to make
them attractive choices of methods to solve difficult problems in physics and
engineering?
In answering these questions, one must first point to the fact that finite element
methods are based on the weak, variational, formulation of boundary and initial
value problems. This is a critical property, not only because it provides a proper
setting for the existence of very irregular solutions to differential equations (e.g.
distributions), but also because the solution appears in the integral of a quantity over
HANDBOOK OF NUMERICAL ANALYSIS, VOL. II
Finite Element Methods (Part 1)
Edited by P.G. Ciarlet and J.L. Lions
© 1991. Elsevier Science Publishers B.V. (North-Holland)
3


4 J. T. Oden a domain. The simple fact that the integral of a measurable function over an arbitrary domain can be broken up into the sum of integrals over an arbitrary collection of almost disjoint subdomains whose union is the original domain, is a vital observation in finite element theory. Because of it, the analysis of a problem can literally be made locally, over a typical subdomain, and by making the subdomain sufficiently small one can argue that polynomial functions of various degrees are adequate for representing the local behavior of the solution. This summability of integrals is exploited in every finite element program. It allows the analysts to focus their attention on a typical finite element domain and to develop an approximation independent of the ultimate location of that element in the final mesh. The simple integral property also has important implications in physics and in most problems in continuum mechanics. Indeed, the classical balance laws of mechanics are global, in the sense that they are integral laws applying to a given mass of material, a fluid or solid. From the onset, only regularity of the primitive variables sufficient for these global conservation laws to make sense is needed. Moreover, since these laws are supposed to be fundamental axioms of physics, they must hold over every finite portion of the material: every finite element of the continuum. Thus once again, one is encouraged to think of approximate methods defined by integral formulations over typical pieces of a continuum to be studied. These rather primitive properties of finite elements lead to some of its most important features: A) Arbitrary geometries. The method is essentially geometry-free. In principle, finite element methods can be applied to domains of arbitrary shape and with quite arbitrary boundary conditions. B) Unstructured meshes. While there is still much prejudice in the numerical analysis literature toward the use of coordinate-dependent algorithms and mesh generators, there is nothing intrinsic in finite element methodology that requires such devices. Indeed, finite element methods by their nature lead to unstructured meshes. This means, in principle, analysts can place finite elements anywhere they please. They may thus model the most complex types of geometries in nature and physics, ranging from the complex cross-sections of biological tissues to the exterior of aircraft to internal flows in turbo machinery, without strong use of a global fixed coordinate frame. C) Robustness. It is well known that in finite element methods the contributions of local approximations over individual elements are assembled together in a systematic way to arrive at a global approximation of a solution to a partial differential equation. Generally, this leads to schemes which are stable in appropriate norms, and, moreover, insensitive to singularities or distortions of the mesh, in sharp contrast to classical difference methods. There are notable exceptions to this, of course, and these exceptions have been the subject of some of the most important works in finite element theory. But, by and large, the direct use of Galerkin or Petrov-Galerkin methods to derive finite element methods leads to conservative and stable algorithms, for most classes of problems in mechanics and mathematical physics.
Introduction 5 D) Mathematical foundation. Because of the extensive work on the mathematical foundations done during the seventies and eighties, finite elements now enjoy a rich and solid mathematical basis. The availability of methods to determine a priori and a posteriori estimates provides a vital part of the theory of finite elements, and makes it possible to lift the analysis of important engineering and physical problems above the traditional empiricism prevalent in many numerical and experimental studies. These properties are intrinsic to finite element methods and continue to make these methods among the most attractive for solving complex problems. They represent the most desirable properties of any numerical scheme designed to handle real-world problems. Moreover, the basic features of finite element methodology provide an ideal setting for innovative use of modern supercomputing architectures, particularly parallel processing. For these reasons, it is certain that finite element concepts will continue to occupy an important role in applications and in research on the numerical solution of partial differential equations. The early history When did finite elements begin? It is difficult to trace the origins of finite element methods because of a basic problem in defining precisely what constitutes a "finite element method". To most mathematicians, it is a method of piecewise polynomial approximation and, therefore, its origins are frequently traced to the appendix of a paper by Courant [1943] in which piecewise linear approximations of the Dirichlet problem over a network of triangles is discussed. Also, the "interpretation of finite differences" by Poly a [1952] is regarded as embodying piecewise polynomial approximation aspects of finite elements. On the other hand, the approximation of variational problems on a mesh of triangles goes back much further: 92 years. In 1851, Schellbach [1851] proposed a finite-element-like solution to Plateau's problem of determining the surface S of minimum area enclosed by a given closed curve. Schellbach used an approximation Sk of S by a mesh of triangles over which the surface was represented by piecewise linear functions, and he then obtained an approximation to the solution to Plateau's problem by minimizing Sh with respect to the coordinates of hexagons formed by six elements (see Williamson [1980]). Not quite the conventional finite element approach, but certainly as much a finite element technique as that of Courant. Some say that there is even an earlier work that uses some of the ideas underlying finite element methods: Gottfried Leibniz himself employed a piecewise linear approximation of the Brachistochrone problem proposed by Johann Bernoulli in 1696 (see the historical volume, Leibniz [1962]). With the help of his newly developed calculus tools, Leibniz derived the governing differential equation for the problem, the solution of which is a cycloid. However, most would agree that to credit this work as a finite element approximation is somewhat stretching the point. Leibniz had no intention of approximating a differential equation; rather, his purpose was to derive one. Two and a half centuries later it was realized that useful approximations of differential equations could be determined by not necessarily
6 J.T. Oden taking infinitesimal elements as in the calculus, but by keeping the elements finite in size. This idea is, in fact, the basis of the term "finite element". There is also some difference in the process of laying a mesh of triangles over a domain on the one hand and generating the domain of approximation by piecing together triangles on the other. While these processes may look the same in some cases, they may differ dramatically in how the boundary conditions are imposed. Thus, neither Schellbach nor Courant, nor for that matter Synge who used triangular meshes many years later, were particularly careful as to how boundary conditions were to be imposed or as to how the boundary of the domain was to be modeled by elements, issues that are now recognized as an important feature of finite element methodologies. If a finite element method is one in which a global approximation of a partial differential equation is built up from a sequence of local approximations over subdomains, then credit must go back to the early papers of Hrennikoff [1941], and perhaps beyond, who chose to solve plane elasticity problems by breaking up the domain of the displacements into little finite pieces, over which the stiffnesses were approximated using bars, beams, and spring elements. A similar "lattice analogy" was used by McHenry [1943]. While these works are draped in the most primitive physical terms, it is nevertheless clear that the methods involve some sort of crude piecewise linear or piecewise cubic approximation over rectangular cells. Miraculously, the methods also seem to be convergent. To the average practitioner who uses them, finite elements are much more than a method of piecewise polynomial approximation. The whole process of partitioning of domains, assembling elements, applying loads and boundary conditions, and, of course, along with it, local polynomial approximation, are all components of the finite element method. If this is so, then one must acknowledge the early papers of Gabriel Kron who developed his "tensor analysis of networks" in 1939 and applied his "method of tearing" and "network analysis" to the generation of global systems from large numbers of individual components in the 1940s and 1950s (Kron [1939]; see also Kron [1953]). Of course, Kron never necessarily regarded his method as one of approximating partial differential equations; rather, the properties of each component were regarded as exactly specified, and the issue was an algebraic one of connecting them all appropriately together. In the early 1950s, Argyris [1954] began to put these ideas together into what some call a primitive finite element method: he extended and generalized the combinatoric methods of Kron and other ideas that were being developed in the literature on system theory at the time, and added to it variational methods of approximation, a fundamental step toward true finite element methodology. Around the same time, Synge [1957] described his "method of the hypercircle" in which he also spoke of piecewise linear approximations on triangular meshes, but not in a rich variational setting and not in a way in which approximations were built by either partitioning a domain into triangles or assembling triangles to approximate a domain (indeed Synge's treatment of boundary conditions was clearly not in the spirit of finite elements, even though he was keenly aware of the importance of
Introduction 7 convergence criteria and of the "angle condition" for triangles, later studied in some depth by others). It must be noted that during the mid-1950s there were a number of independent studies underway which made use of "matrix methods" for the analysis of aircraft structures. A principal contributor to this methodology was Levy [1953] who introduced the "direct stiffness method" wherein he approximated the structural behavior of aircraft wings using assemblies of box beams, torsion boxes, rods and shear panels. These assuredly represent some sort of crude local polynomial approximation in the same spirit as the Hrennikoff and McHenry approaches. The direct stiffness method of Levy had a great impact on the structural analysis of aircraft, and aircraft companies throughout the United States began to adopt and apply some variant of this method or of the methods of Argyris to complex aircraft structural analyses. During this same period, similar structural analysis methods were being developed and used in Europe, particularly in England, and one must mention in this regard the work of Taig [1961] in which shear lag in aircraft wing panels was approximated using basically a bilinear finite element method of approximation. Similar element-like approximations were used in many aircraft industries as components in various matrix methods of structural analyses. Thus the precedent was established for piecewise approximations of some kind by the mid-1950s. To a large segment of the engineering community, the work representing the beginning of finite elements was that contained in the pioneering paper of Turner, Clough, Martin and Topp [1956] in which a genuine attempt was made at both a local approximation (of the partial differential equations of linear elasticity) and the use of assembly strategies essential to finite element methodology. It is interesting that in this paper local element properties were derived without the use of variational principles. It was not until 1960 that Clough [1960] actually dubbed these techniques as "finite element methods" in a landmark paper on the analysis of linear plane elasticity problems. The 1960s were the formative years of finite element methods. Once it was perceived by the engineering community that useful finite element methods could be derived from variational principles, variational^ based methods significantly dominated all the literature for almost a decade. If an operator was unsymmetric, it was thought that the solution of the associated problem was beyond the scope of finite elements, since it did not lend itself to a traditional extremum variational approximation in the spirit of Rayleigh and Ritz. From 1960 to 1965, a variety of finite element methods were proposed. Many were primitive and unorthodox; some were innovative and successful. During this time, a variety of attempts at solving the biharmonic equation for plate bending problems were proposed which employed piecewise polynomial approximations, but did not provide the essentials for convergence. This led to the concern of some as to whether the method was indeed applicable to such problems. On the other hand, it was clear that classical Fourier series solutions of plate problems were, under appropriate conditions, convergent and could be fit together in an assemblage of rectangular components (Oden [1962]) and, thus, a form of "spectral finite element methods"
8 J.T. Oden was introduced early in the study of such problems. However, such high-order schemes never received serious attention in this period, as it was felt that piecewise polynomial approximations could be developed which did give satisfactory results. It was not until the mid- to late 1960s that papers on bicubic spline approximations by Bogner, Fox, and Schmit [1966] and Birkhoff, Schultz, and Varga [1968] provided successful polynomial finite element approximations for these classes of problems. Many workers in the field feel that the famous Dayton conferences on finite elements (at the Air Force Flight Dynamics Laboratory in Dayton, Ohio, USA) represented landmarks in the development of the field (see Przemienieckj et al. [1966]). Held in 1965, 1968, 1970, these meetings brought specialists from all over the world to discuss their latest triumphs and failures, and the pages of the proceedings, particularly the earlier volumes, were filled with remarkable and innovative accomplishments from a technical community just beginning to learn the richness and power of this new collection of ideas. In these volumes one can find many of the premier papers of now well-known methods. In the first volume alone one can find mixed finite element methods (Herrmann [1966]), Hermite approximations (Pestel [1966]), C^-bicubic approximations (Bogner, Fox and Schmit [1966]), hybrid methods (Pian [1966]) and other contributions. In later volumes, further assaults on nonlinear problems and special element formulations can be found. Near the end of the sixties and early seventies there finally emerged the realization that the method could be applied to unsymmetric operators without difficulty and thus problems in fluid mechanics were brought within the realm of application of finite element methods; in particular, finite element models of the full Navier-Stokes equations were first presented during this period (Oden [1969], Oden and Somogyi [1968], Oden [1970]). The early textbook by Zienkiewicz and Cheung [1967] did much to popularize the method with the practicing engineering community. However, the most important factor leading to the rise in popularity during the late 1960s and early 1970s was not purely the publication of special formulations and algorithms, but the fact that the method was being very successfully used to solve difficult engineering problems. Much of the technology used during this period was due to Bruce Irons, who with his colleagues and students developed a multitude of techniques for the successful implementation of finite elements. These included the frontal solution technique (Irons [1970]), the patch test (Irons and Razzaque [1972]), isoparametric elements (Ergatoudis, Irons and Zienkiewicz [1966]), and numerical integration schemes (Irons [1966]) and many more. The scope of finite element applications in the 1970s would have been significantly diminished without these contributions. The mathematical theory The mathematical theory of finite elements was slow to emerge from this caldron of activity. The beginning works on the mathematical theory of finite elements were
Introduction 9 understandably concerned with one-dimensional elliptic problems and used many of the tools and jargon of Ritz methods, interpolation, and variational differences. An early work in this line was the paper of Varga [1966] which dealt with "Hermite interpolation-type Ritz methods" for two-point boundary value problems. We also mention in this regard the paper of Birkhoff, de Boor, Schwartz and Wendroff [1966] on "Rayleigh-Ritz approximation by piecewise cubic polynomials". This is certainly one of the first papers to deal with the issue of convergence of finite element methods, although some papers on variational differences yielded similar results but did not focus on the piecewise polynomial features of finite elements. The work of Kang Feng [1965], published in Chinese (a copy of which I have not been able to acquire for review) may fall into this category and is sometimes noted as relevant to the convergence of finite element methods. The mathematical theory of finite elements for two-dimensional and higher- dimensional problems began in 1968 and several papers were published that year on the subject. One of the first papers in this period to address the problem of convergence of a finite method in a rigorous way and in which a priori error estimates for bilinear approximations of a problem in plane elasticity are obtained, is the often overlooked paper of Johnson and McLa^ [1968], which appeared in the Journal of Applied Mechanics. This paper correctly developed error estimates in energy norms, and even attempted to characterize the deterioration of convergence rates due to corner singularities. In the same year there appeared the first of two important papers by Ogenesjan and Ruchovec [1968,1969] in the Russian literature, in which "variational difference schemes" were proposed for linear second-order elliptic problems in two-dimensional domains. These works dealt with the estimates of the rate of convergence of variational difference schemes. Also in 1968 there appeared the important mathematical paper of Zlamal [1968] in which a detailed analysis of interpolation properties of a class of triangular elements and their application to second-order and fourth-order linear elliptic boundary value problems is discussed. This paper attracted the interest of a large segment of the numerical analysis community and several very good mathematicians began to work on finite element methodologies. The paper by Zlamal also stands apart from other multidimensional finite element papers of this era since it represented a departure of studies of tensor products of polynomials on rectangular domains and provided an approach toward approximations in general polygonal domains. In the same year, Ciarlet [1968] published a rigorous proof of convergence of piecewise linear finite element approximation of a class of linear two-point boundary value problems and proved V° estimates using a discrete maximum principle. We also mention the work of Oliveira [1968] on convergence of finite element methods which established correct rates of convergence for certain problems in appropriate energy norms. A year later, Schultz [1969] presented error estimates for "Rayleigh-Ritz- Galerkin methods" for multidimensional problems. Two years later, Schultz [1971] published L2 error bounds for these types of methods. By 1972, finite element methods had emerged as an important new area of numerical analysis in applied mathematics. Mathematical conferences were held on
10 J.T. Oden the subject on a regular basis, and there began to appear a rich volume of literature on mathematical aspects of the method applied to elliptic problems, eigenvalue problems, and parabolic problems. A conference of special significance in this period was held at the University of Maryland in 1972 and featured a penetrating series of lectures by Ivo BabuSka (see Babuska and Aziz [1972]) and several important mathematical papers by leading specialists in the mathematics of finite elements, all collected in the volume edited by Aziz [1972]. One unfamiliar with aspects of the history of finite elements may be led to the erroneous conclusion that the method of finite elements emerged from the growing wealth of information on partial differential equations, weak solutions of boundary value problems, Sobolev spaces, and the associated approximation theory for elliptic variational boundary value problems. This is a natural mistake, because the seeds for the modern theory of partial differential equations were sown about the same time as those for the development of modern finite element methods, but in an entirely different garden. In the late 1940s, Laurent Schwartz was putting together his theory of distributions around a decade after the notion of generalized functions and their use in partial differential equations appeared in the pioneering work of S.L. Sobolev. A long list of other names could be added to the list of contributors to the modern theory of partial differential equations, but that is not our purpose here. Rather, we must only note that the rich mathematical theory of partial differential equations which began in the 1940s and 1950s, blossomed in the 1960s, and is now an integral part of the foundations of not only partial differential equations but also approximation theory, grew independently and parallel to the development of finite element methods for almost two decades. There was important work during this period on the foundations of variational methods of approximation, typified by the early work of Lions [1955] and by the French school in the early 1960s; but, while this work did concern itself with the systematic development of mathematical results that would ultimately prove to be vital to the development of finite element methods, it did not focus on the specific aspects of existing and already successful finite element concepts. It was, perhaps, an unavoidable occurrence, that in the late 1960s these two independent subjects, finite element methodology and the theory of approximation of partial differential equations via functional analysis methods, united in an inseparable way, so much so that it is difficult to appreciate the fact that they were ever separate. The 1970s must mark the decade of the mathematics of finite elements. During this period, great strides were made in determining a priori error estimates for a variety of finite element methods, for linear elliptic boundary value problems, for eigenvalue problems, and certain classes of linear and nonlinear parabolic problems; also, some preliminary work on finite element applications to hyperbolic equations was done. It is both inappropriate and perhaps impossible to provide an adequate survey of this large volume of literature, but it is possible to present an albeit biased reference to some of the major works along the way. An important component in the theory of finite elements is an interpolation theory: how well can a given finite element method approximate functions of a given
Introduction 11 class locally over a typical finite element? A great deal was known about this subject from the literature on approximation theory and spline analysis, but its particular- ization to finite elements involves technical difficulties. One can find results on finite element interpolation in a number of early papers, including those of ZlAmal [1968], Birkhoff [1969], Schultz [1969], Bramble and ZlAmal [1970], BabuSka [1970,1971], and BabuSka and Aziz [1972]. But the elegant work on Lagrange and Hermite interpolations of finite elements by Ciarlet and Raviart [1972a] must stand as a very important contribution to this vital aspect of finite element theory. A landmark work on the mathematics of finite elements appeared in 1972 in the remarkably comprehensive and penetrating memoir of BabuSka and Aziz [1972] on the mathematical foundations of finite element methods. Here one can find interwoven with the theory of Sobolev spaces and elliptic problems, general results on approximation theory that have direct bearing on finite element methods. It was known that Cea's lemma (Cea [1964]) established that the approximation error in a Galerkin approximation of a variational boundary value problem is bounded by the so-called interpolation error; that is, the distance in an appropriate energy norm from the solution of the problem to the subspace of approximations. Indeed, it was this fact that made the results on interpolation theory using piecewise polynomials of particular interest in finite element methods. In the work of BabuSka [1971] and BabuSka and Aziz [1972], this framework was dramatically enlarged by BabuSka's introduction of the so-called "INF-SUP" condition. This condition is encountered in the characterization of coerciveness of bilinear forms occuring in elliptic boundary value problems. The characterization of this "INF-SUP" condition for the discrete finite element approximation embodies in it the essential elements for studying the stability in convergence of finite element methods. Brezzi [1974] developed an equivalent condition for studying constrained elliptic problems and these conditions provide for a unified approach to the study of qualitative properties, including rates of convergence, of broad classes of finite element methods. The fundamental work of Nitsche [1970] on L°° estimates for general classes of linear elliptic problems must stand out as one of the most important contributions of the seventies. Strang [1972], in an important communication, pointed out "variational crimes", inherent in many finite element methods, such as improper numerical quadrature, the use of nonconforming elements, improper satisfaction of boundary conditions, etc., all common practices in applications, but all frequently leading to exceptable numerical schemes. In the same year, Ciarlet and Raviart [1972b, c] also contributed penetrating studies of these issues. Many of the advances of the 1970s drew upon earlier results on variational methods of approximation based on the Ritz method and finite differences; for example the fundamental Aubin-Nitsche method for lifting the order of convergence to lower Sobolev norms (see Aubin [1967] and Nitsche [1963]; see also Ogenesjan and Ruchovec [1969]) used such results. In 1974, the important paper of Brezzi [1974] mentioned earlier, used such earlier results on saddle point problems and laid the groundwork for a multitude of papers on problems with constraints and on the stability of various finite element procedures. While
12 J. T. Oden convergence of special types of finite element strategies such as mixed methods and hybrid methods had been attempted in the early 1970s (e.g. Oden [1972]), the Brezzi results, and the methods of BabuSka for constrained problems, provided a general framework for studying virtually all mixed and hybrid finite elements (e.g. Raviart [1975], Raviart and Thomas [1977], BabuSka, Oden and Lee [1977]). The first textbook on mathematical properties of finite element methods was the popular book of Strang and Fix [1973]. A book on an introduction to the mathematical theory of finite elements was published soon after by Oden and Reddy [1976] and the well-known treatise on the finite element method for elliptic problems by Ciarlet [1978] appeared two years later. The penetrating work of Nitsche and Schatz [1974] on interior estimates and Schatz and Wahlbin [1978] on L°° estimates and singular problems represented notable contributions to the growing mathematical theory of finite elements. The important work of Douglas and Dupont (e.g. [1970, 1973]; Dupont [1973]) on finite element methods for parabolic problems and hyperbolic problems must be mentioned along with the idea of elliptic projections of Wheeler [1973] which provided a useful technique for deriving error bounds for time-dependent problems. The 1970s also represented a decade in which the generality of finite element methods began to be appreciated over a large portion of the mathematics and scientific community, and it was during this period that significant applications to highly nonlinear problems were made. The fact that very general nonlinear phenomena in continuum mechanics, including problems of finite deformation of solids and of flow of viscous fluids could be modeled by finite elements and solved on existing computers was demonstrated in the early seventies (e.g. Oden [1972]), and, by the end of that decade, several "general purpose" finite element programs were in use by engineers to treat broad classes of nonlinear problems in solid mechanics and heat transfer. The mathematical theory for nonlinear problems also was advanced in this period, and the important work of Falk [1974] on finite element approximations of variational inequalities should be mentioned. It is not too inaccurate to say that by 1980, a solid foundation for the mathematical theory of finite elements for linear problems had been established and that significant advances in both theory and application into nonlinear problems existed. The open questions that remain are difficult ones and their solution will require a good understanding of the mathematical properties of the method. The works collected in this volume should not only provide a summary of important results and approaches to mathematical issues related to finite elements, but also they should provide a useful starting point for further research.
References Argyris, J.H. A954), Energy theorems and structural analysis, Aircraft Engrg. 26,347-356; 383-387; 394. Argyris, J.H. A955), Energy theorems and structural analysis, Aircraft Engrg. 27,42-58; 80-94; 125-134; 145-158. Argyris, J.H. A966), Continua and discontinua, in: Proceedings Conference on Matrix Methods in Structural Mechanics, Wright-Patterson AFB, Dayton, OH, 11-190. Aubin, J.P. A967), Behavior of the error of the approximate solutions of boundary-value problems for linear elliptic operators by Galerkin's method and finite differences, Ann. Scuola Norm. Pisa C) 21, 599-637. Aziz, A.K., ed. A972), The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York). Babuska, I. A970), Finite element methods for domains with corners, Computing 6, 264-273. Babuska, I. A971), Error bounds for the finite element method, Numer. Math. 16, 322-333. Babuska, I. and A.K. Aziz A972), Survey lectures on the mathematical foundation of the finite element method, in: A.K. Aziz, ed. The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 5-359. Babuska, I., J.T. Oden and J.K. Lee A977), Mixed-hybrid finite element approximations of second-order elliptic boundary-value problems, Comput. Methods Appl. Mech. Engrg. 11, 175-206. Birkhoff, G. A969), Piecewise bicubic interpolation and approximation in polygons, in: IJ. Schoenberg, ed., Approximations with Special Emphasis on Spline Functions (Academic Press, New York) 85-121. Birkhoff, G., C. de Boor, M.H. Schultz and В. Wendroff A966), Rayleigh-Ritz approximation by piecewise cubic polynomials, SI AM J. Numer. Anal. 3, 188-203. Birkhoff, G., M.H. Schultz and R.S. Varga A968), Piecewise Hermite interpolation in one and two variables with applications to partial differential equations, Numer. Math. 11, 232-256. Bogner, F.K., R.L. Fox and L.A. Schmit Jr A966), The generation of interelement, compatible stiffness and mass matrices by the use of interpolation formulas, in: Proceedings Conference on Matrix Methods in Structural Mechanics, Wright-Patterson AFB, Dayton, OH, 397-444. Bramble, J.H. and M. Zlamal A970), Triangular element in the finite element method, Math. Сотр. 24 A12), 809-820. Brezzi, F. A974), On the existence, uniqueness, and approximation of saddle-point problems arising from Lagrange multipliers, Rev. Francaise d'Automat. Inform. Reck Oper. 8-R2, 129-151. Cea, J. A964), Approximation variationnelle des problems aux limites, Ann. Inst. Fourier (Grenoble) 14, 345^44. Ciarlet, P.G. A968), An 0(h2) method for a non-smooth boundary-value problem, Aequationes Math. 2, 39-49. Ciarlet, P.G. A978), The Finite Element Method for Elliptic Problems (North-Holland, Amsterdam). Ciarlet, P.G. and P.A. Raviart A972a), General Lagrange and Hermite interpolation in R" with applications to the finite elment method, Arch. Rational Mech. Anal. 46, 177-199. Ciarlet, P.G. and P.A. Raviart A972b), Interpolation theory over curved elements with applications to finite element methods, Comput. Methods Appl. Mech. Engrg. 1, 217-249. Ciarlet, P.G. and P.A. Raviart A972c), The combined effect of curved boundaries and numerical integration in isoparametric finite element methods, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 409-^74. 13
14 J.T. Oden Clough, R.W. A960), The finite element method in plane stress analysis, in: Proceedings 2nd ASCE Conference on Electronic Computation, Pittsburgh, PA. Courant, R. A943), Variational methods for the solution of problems of equilibrium and vibration, Bull. Amer, Math. Soc. 49, 1-23. Douglas, J. and T. Dupont A970), Galerkin methods for parabolic problems, SI AM J. Numer. Anal. 1, 575-626. Douglas, J. and T. Dupont A973), Superconvergence for Galerkin methods for the two-point boundary problem via local projections, Numer. Math. 21, 220-228. Dupont, T. A973), L2 -estimates for Galerkin methods for second-order hyperbolic equations, SI AM J. Numer. Anal. 10, 880-889. Ergatoudis, I., B.M. Irons and O.C. Zienkiewicz A966), Curved isoparametric quadrilateral finite elements, Internat. J, Solids Structures 4, 31-42. Falk, S.R. A974), Error estimates for the approximation of a class of variational inequalities, Math. Сотр. 28, 963-971. Herrmann, L.R. A966), A bending analysis for plates, in: Proceedings Conference on Matrix Methods in Structural Mechanics, Wright-Patterson AFB, Dayton, OH, 577. Hrennikoff, H. A941), Solutions of problems in elasticity by the framework method, J. Appl. Mech., A169-175. Irons, B. A966), Engineering applications of numerical integration in stiffness methods, AIAA J. 4, 2035-3037 Irons, B. A970), A frontal solution program for finite element analysis, Internat, J. Numer. Methods Engrg. 2, 5-32. Irons, B. and A. Razzaque A972), Experience with the patch test for convergence of finite elements, in: A.K. Aziz, Ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 557-587. Johnson Jr, M.W. and R.W. McLay A968), Convergence of the finite element method in the theory of elasticity, J. Appl. Mech. E, 35, 274-278. Rang, Feng A965), A difference formulation based on the variational principle, Appl. Math. Comput. Math. 2, 238-162 (in Chinese). Kron, G. A939), Tensor Analysis of Networks (Wiley, New York). Kron, G. A953), A set of principles to interconnect the solutions of physical systems, J. Appl. Phys. 24, 965-980. Leibniz, G. A962), G.W. Leibniz Mathematische Schriften, С Gerhardt, ed. (G. Olms Verlagsbuchhand- lung) 290-293. Levy, S. A953), Structural analysis and influence coefficients for delta wings, J. Aeronaut. Set 20. Lions, J. A955), Problemes aux limites en theorie des distributions, Acta Math. 94, 13-153. McHenry, D. A943), A lattice analogy for the solution of plane stress problems, J. Inst. Civ. Engrg. 21, 59-82. Nitsche, J.A. A963), Ein Kriterium fur die Quasi-Optimalitat des Ritzschen Verfahrens, Numer. Math. 2, 346-348. Nitsche, J.A. A970), Lineare Spline-Funktionen und die Methoden von Ritz fur elliptische Randwert- probleme, Arch. Rational Mech. Anal. 36, 348-355. Nitsche, J.A. and A.H. Schatz A974), Interior estimates for Ritz-Galerkin methods, Math. Сотр. 28, 937-958. Oden, J.T. A962), Plate beam structures, Dissertation, Oklahoma State University, Stillwater, OK. Oden, J.T. A969), A general theory of finite elements, II: Applications, Internat. J. Numer. Methods Engrg. 1, 247-259. Oden, J.T. A970), A finite element analogue of the Navier-Stokes equations, J. Engrg. Mech. Div. ASCE 96 (EM 4). Oden, J.T. A972), Finite Elements of Nonlinear Continua (McGraw-Hill, New York). Oden, J.T. and J.N. Reddy A976), An Introduction to the Mathematical Theory of Finite Elements (Wiley-Interscience, New York). Oden, J.T. and D. Somogyi A968), Finite element applications in fluid dynamics, J. Engrg. Mech. Div. ASCE 95 (EM 4), 821-826.
References 15 Ogenesjan, L.A. and L.A. Ruchovec A968), Variational-difference schemes for linear second-order elliptic equations in a two-dimensional region with piecewise smooth boundary, U.S.S.R. Comput. Math, and Math. Phys. 8 A), 129-152. Ogenesjan, L.A. and L.A. Ruchovec A969), Study of the rate of convergence of variational difference schemes for second-order elliptic equations in a two-dimensional field with a smooth boundary, U.S.S.R. Comput. Math, and Math. Phys. 9 E), 158-183. Oliveira, E.R. de Arantes e. A968), Theoretical foundation of the finite element method, Internat. J. Solids Structures 4, 926-952. Pestel, E. A966), Dynamic stiffness matrix formulation by means of Hermitian polynomials, in: Proceedings Conference on Matrix Methods in Structural Mechanics, Wright-Patterson AFB, Dayton, OH, 479-502. PiAN, T.H.H. A966), Element stiffness matrices for boundary compatibility and for prescribed stresses, in: Proceedings Conference on Matrix Methods in Structural Mechanics, Wright-Patterson AFB, Dayton, OH, 455-478. Polya, G. A952), Sur une interpretation de la methode des differences finies qui peut fournir des bornes superieures ou inferieures, C.R. Acad. Sci. Paris 235, 995-997. Przemieniecki, J.S., R.M. Bader, W.F. Bozich, J.R. Johnson and W.J. Mykytow, eds. A966), Proceedings Conference on Matrix Methods in Structural Mechanics, Wright-Patterson AFB, Dayton, OH. Raviart, P.A. A975), Hybrid methods for solving 2nd-order elliptic problems, in: J.H.H. Miller, ed., Topics in Numerical Analysis (Academic Press, New York) 141-155. Raviart, P.A. and J.M. Thomas A977), A mixed finite element method for 2nd-order elliptic problems, in: Proceedings Symposium on the Mathematical Aspects of the Finite Element Methods, Rome. Schatz, A.H. and L.B. Wahlbin A977), Interior maximum norm estimates for finite element methods, Math. Сотр. 31, 414-442. Schatz, A.H. and L.B. Wahlbin A978), Maximum norm estimates in the finite element method on polygonal domains, Part I, Math. Сотр. 32, 73-109. Schellbach, K. A851), Probleme der Variationsrechnung, J. Reine Angew. Math. 41, 293-363. Schultz, M.H. A969a), L"-multivariate approximation theory, SIAM J. Numer. Anal. 6, 161-183. Schultz, M.H. A969b), Rayleigh-Ritz-Galerkin methods for multi-dimensional problems, SIAM J. Numer. Anal. 6, 523-538. Schultz, M.H. A971), L2 error bounds for the Rayleigh-Ritz-Galerkin method, SIAM J. Numer. Anal. 8, 737-748. Strang, G. A972), Variational crimes in the finite element method, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York). Strang, G. and G. Fix A973), An Analysis of the Finite Element Method (Prentice-Hall, Englewood Cliffs, NJ). Synge, J.L. A957), The Hypercircle Method in Mathematical Physics (Cambridge University Press, Cambridge). Taig, I.C. A961), Structural analysis by the matrix displacement method, English Electrial Aviation Ltd. Report, S-O-17. Turner, M.J., R.W. Clough, H.C. Martin and L.J. Topp A956), Stiffness and deflection analysis of complex structures, J. Aero. Sci. 23, 805-823. Varga, R.S. A966), Hermite interpolation-type Ritz methods for two-point boundary value problems, J.H. Bramble, ed., Numerical Solution of Partial Differential Equations (Academic Press, New York). Wheeler, M.F. A973), A-priori L2-error estimates for Galerkin approximations to parabolic partial differential equations, SIAM J. Numer. Anal. 11, 723-759. Williamson, F. A980), A historical note on the finite element method, Internat. J. Numer. Methods Engrg. 15, 930-934. Zienkiewicz, O.C. and Y.K. Cheung A967), The Finite Element Method in Structural and Continuum Mechanics (McGraw-Hill, New York). Zlamal, M. A968), On the finite element method, Numer. Math. 12, 394-409.
Basic Error Estimates for Elliptic Problems P.G. Ciarlet Analyse Numerique, Tour 55-65 Universite Pierre et Marie Curie 4, Place Jussieu 75005 Paris, France HANDBOOK OF NUMERICAL ANALYSIS, VOL. II Finite Element Methods (Part 1) Edited by P.G. Ciarlet and J.L. Lions © 1991. Elsevier Science Publishers B.V. (North-Holland)
Contents Preface 21 Chapter I. Elliptic Boundary Value Problems 23 Introduction 23 1. Abstract minimization problems, variational inequalities and the Lax-Milgram lemma 24 2. The Sobolev spaces Hm(Q) and Green's formulae 30 3. Examples of second-order boundary value problems: The membrane problem, the boundary value problem of linearized elasticity and the obstacle problem 35 4. Examples of fourth-order boundary value problems: The biharmonic problem and the plate problems 53 Chapter II. Introduction to the Finite Element Method 59 Introduction 59 5. The three basic aspects of the finite element method 60 6. Examples of simplicial finite elements and their associated finite element spaces 65 7. Examples of rectangular finite elements and their associated finite element spaces 75 8. Examples of finite elements with derivatives as degrees of freedom and their associated finite element spaces 83 9. Examples of finite elements for fourth-order problems and their associated finite element spaces 87 10. Finite elements as triples (T, P, S) and their associated /Vinterpolation operators Пт 93 11. Affine families of finite elements 97 12. General properties of finite element spaces 102 13. General considerations on the convergence of finite element methods and Cea's lemma 112 Chapter III. Finite Element Methods for Second-Order Problems: The Basic Error Estimates 115 Introduction 115 14. The Sobolev spaces Wm-P(Q) and the quotient space Wk+1-p(Q)/Pk(Q) 118 15. Estimate of the seminorms \v — IITv\mT for polynomial-preserving operators Пт 121 16. Estimate of the interpolation errors \v — IlTv\m T for an affine family of finite elements 126 17. Interpolation and approximation properties of finite element spaces 131 18. Estimate of the error ||« —u»|| t D when the solution и is smooth and sufficient conditions for limt_0 ||«-u»||liD=0 when ue H\Q) 137 19. Estimate of the error \u — uh\0 n when и is smooth and the Aubin-Nitsche lemma, first estimate of the error \u — uh\0 ш D 140 20. Discrete maximum principle in finite element spaces 144 21. Estimates of the error \u — uh\0 ш n when ue W1,r(D), n<p, or when ue W2-P(Q), n<2p, when the discrete maximum principle holds 150 22. Estimates of the errors \u — uh\0 M_n and \u — uj, fl when ue W2,a>(Q) and Nitsche's method of weighted norms 155 23. Estimate of the error ||и —uh||j n for the obstacle problem and Falk's method 173 24. Additional references 180 19
20 P.G. Ciarlet Chapter IV. The Effect of Numerical Integration for Second-Order Problems 183 Introduction 183 25. The effect of numerical integration and examples of numerical quadrature schemes 184 26. Abstract error estimate and the first Strang lemma 192 27. Uniform JVellipticity of the approximate bilinear forms 193 28. Consistency error estimates and the Bramble-Hilbert lemma 196 29. Estimate of the error || и — Mj, || j fi 204 Chapter V. Nonconforming Finite Element Methods for Second-Order Problems 209 Introduction 209 30. Nonconforming methods 210 31. Abstract error estimate and the second Strang lemma 212 32. An example of a nonconforming finite element: Wilson's brick 214 33. Consistency error estimate and the bilinear lemma 219 34. Estimate of the error {£twJm--"<iIi,t}1/2 for Wilson's brick and the patch test 221 Chapter VI. Finite Element Methods for Second-Order Problems Posed over Curved Domains 227 Introduction 227 35. Isoparametric families of finite elements 228 36. Examples of isoparametric finite elements 232 37. Estimates of the interpolation errors || v—Пт v \\ m T for an isoparametric family of finite elements 237 38. Approximation of a domain with a curved boundary with isoparametric finite elements 250 39. Isoparametric numerical integration 254 40. Abstract error estimate 256 41. Uniform Kft-ellipticity of the approximate bilinear forms 258 42. Interpolation and consistency error estimates 260 43. Estimate of the error flu—ujj n 265 Chapter VII. Finite Element Methods for Fourth-Order Problems 273 Introduction 273 44. Conforming methods for fourth-order problems: Almost-affine families of finite elements 274 45. Examples of polynomial finite elements of class Ч1 276 46. Examples of composite finite elements of class #! 279 47. Examples of singular finite elements of class 'i1 288 48. Estimates of the error ||u —uj|2 n for finite elements of class cel 295 49. A nonconforming finite element for the plate problem: The Adini-Clough-Melosh rectangle 298 50. Estimate of the error {£ТеГ\и — uh\22 T}1/2 for the Adini-Clough-Melosh rectangle 302 References 313 Glossary of Symbols 335 Subject Index 343
Preface The objectives of this article are to give a thorough mathematical description of the finite element method applied to linear elliptic problems of the second and fourth order that typically arise in linearized elasticity (including the "almost linear" problems modeled by variational inequalities) and to establish the corresponding "global" error estimates; the "interior" error estimates are established in the next article by Wahlbin. The prerequisites consist essentially in a good knowledge of analysis, functional analysis, and a certain familiarity with Sobolev spaces and linear elliptic partial differential equations. Apart from these, the article is essentially self-contained. The main topics covered are the following (more detailed informations are provided in the introductions to each chapter): - Description and mathematical analysis of various problems of linearized elasticity, such as the membrane and plate problems, the boundary value problem of three-dimensional elasticity, the obstacle problem (Chapter I). - Description of the conforming finite elements currently used for approximating second- and fourth-order problems, including composite and singular elements in the latter case (Chapters II and VII). - Derivation of the fundamental error estimates in the ff ^norm, L2-norm, and L°°-norm, for conforming finite element methods applied to second-order problems, including detailed analyses of the discrete maximum principle, and of the method of weighted norms of Nitsche (Chapter III). - Derivation of the error estimates in the Ях-погт for the obstacle problem (Chapter III). - Description of finite element methods with numerical integration for second- order problems, and derivation of the corresponding error estimates in the Я^погт (Chapter IV). - Description of nonconforming finite element methods for second- and fourth- order problems, and derivation of the corresponding error estimates in the discrete Я1- and Я2-погт (Chapters V and VII). - Description of the combined use of isoparametric finite elements and of isoparametric numerical integration for second-order problems posed over domains with curved boundaries, and derivation of the corresponding error estimates in the Я^погт (Chapter VI). - Derivation of the error estimates in the Я2-погт for the polynomial, composite, and singular, finite elements used for solving fourth-order problems (Chapter VII). 21
22 P.G. Ciariet In addition, various relevant extensions, refinements, etc., of these estimates have been mentioned, with appropriate references to the existing literature. This article is a revised, updated, and enlarged edition of those parts of my book, The Finite Element Methodfor Elliptic Problems (North-Holland, Amsterdam, 1978) that are relevant here. Although I have added more than 230 items to the 316 references that I have kept from this book, I have made no attempt to compile an exhaustive bibliography. I however hope that the present bibliography is "reasonably complete". The various finite elements described in this article have been named according to the most common usages in the engineering literature. As a result, the terminology adopted in this article often departs strikingly from that used in my book of 1978. For instance, what I then called a "triangle of type A)" or a "rectangle of type B)" are now called a "linear triangle" or a "biquadratic rectangle". Since the readers of this article are perfectly aware that a triangle may be isoceles, but certainly not genuinely "linear", or that a rectangle may be square, but certainly not genuinely "biquadratic", I have blithely committed these serious abus de langage, which clearly convey more information than the names that I had originally chosen.
Chapter I Elliptic Boundary Value Problems Introduction Many problems in linearized elasticity are modeled by a minimization problem of the following form: The unknown u, which is the displacement of a mechanical system, satisfies и e U and J(u) = inf J(v), veU where the set U of admissible displacements is a closed convex subset of a Hilbert space V, and the energy J of the system takes the form J(v) = ^a(v,v)-l(v), where a{',-) is a symmetric bilinear form and I a linear form, both defined and continuous over the space V. In Section 1, we first prove a general existence result (Theorem 1.1) for such minimization problems, the main assumptions being the completeness of the space V and the V-ellipticity of the bilinear form. We also describe other formulations of the same problem (Theorem 1.2), which are its variational formulations. When the bilinear form is not symmetric, these formulations make up variational problems on their own. For such problems, we give an existence theorem when U=V (Theorem 1.3), which is the celebrated Lax-Milgram lemma. All these problems are called abstract problems inasmuch as they represent an "abstract" formulation which is common to many examples, such as those examined in this chapter. The analysis made in Section 1 shows that a candidate for the space V must have the following properties: It must be complete on the one hand, and it must be such that the expression J(v) is well defined for all functions ve V on the other hand (V is a "space of finite energy"). The Sobolev spaces fulfill these requirements. After briefly mentioning in Section 2 some of their basic properties (other properties will be introduced in later sections as needed), we examine in Sections 3 and 4 specific examples that fit in the abstract setting of Section 1, such as the membrane problem, the obstacle problem, the clamped plate problem, and the boundary value problem of linearized elasticity, which is by far the most important example. Indeed, even though throughout this article we will often find it convenient to work with the 23
24 P.G. Ciarlet Chapter I simpler looking problems described at the beginning of Section 3, it must not be forgotten that these are essentially convenient model problems for the boundary value problem of linearized elasticity. For each of the examples, we establish in particular the V-ellipticity of the associated bilinear form, and, using various Green's formulae in Sobolev spaces, we show that when solving these problems, one solves, at least formally, elliptic boundary value problems of the second and fourth order. 1. Abstract minimization problems, variational inequalities and the Lax-Milgram lemma All functions and vector spaces considered in this article are real. Let there be given a normed vector space V with norm || • ||, a continuous bilinear form a(','):Vx V-+M, a continuous linear form /: V-+U and a nonempty subset U of the space V. With these data we associate an abstract minimization problem: Find an element и such that ueU, J{u) = inf J(v), A.1) veV where the functional J: V-*U is defined by J:veV^J{v)=^a(v,v)-l(v). A.2) As regards existence and uniqueness properties of the solution of this problem, the following result is essential. Theorem 1.1. Assume in addition that (i) the space V is complete, (ii) U is a closed convex subset of V, (iii) the bilinear form a( •, •) zs symmetric and (iv) the bilinear form is V-elliptic, in the sense that there exists a constant a such that a>\ (i.3) a || и || ^ a(v, v) for all veV. Then the abstract minimization problem A.1) has one and only one solution. Proof. The bilinear form a( •, •) is an inner product over the space V, and the associated norm is equivalent to the given norm || • ||. Thus the space V is a Hilbert space when it is equipped with this inner product. By the Riesz representation theorem, there exists an element ale V such that l(v) = a(al,v) for all veV, so that, taking again into account the symmetry of the bilinear form, we may rewrite the functional as J(v) = %a{v, v) — a(al, v) = ja(v — ol,v — al) — \a{al, ol). Hence solving the abstract minimization problem amounts to minimizing the
Section 1 Elliptic boundary value problems 25 distance between the element al and the set U, with respect to the norm ч/й( •, •). Consequently, the solution is simply the projection of the element al onto the set U, with respect to the inner product a(v)- By tne projection theorem, such a projection exists and is unique, since U is a closed convex subset of the space V. □ Next, we give equivalent formulations of this problem. Theorem 1.2. An element и is the solution of the abstract minimization problem A.1) if and only if it satisfies the relations ueU, A.4) a(u,v — u)^l(v—u) for all veil, in the general case, or ueU, e(«, »)>/(») for alive U, A.5) a(u, u) = l{u), if U is a closed convex cone with vertex 0, or ueU, (L6) a(u, v) = l(v) for all veU, if U is a closed subspace. Proof. The projection и of al onto U is completely characterized by the relations ueU, A.7) a{al—u,v — u)^0 for all veU, the geometrical interpretation of the last inequalities being that the angle between the vectors (al — u) and (t> — u) is obtuse (Fig. 1.1) for all v e U. These inequalities may Fig. 1.1. In a Hilbert space with inner product a( •, •), the projection и of al onto the closed convex set U is characterized by the variational inequalities a(rsl — u,v — u)<0 for all veU.
26 P.G. Ciarlet Chapter I be written as a(u, v—u)^-a(ai, v — u) = l(v — u) for all veU, which proves relations A.4). Assume next that U is a closed convex cone with vertex 0. Then the point (u + v) belongs to the set U whenever the point v belongs to the set U (Fig. 1.2). Therefore, Fig. 1.2. In a Hilbert space with inner product a{ •, •), the projection и of al onto a closed convex cone U with vertex 0 is characterized by the equation a(u — al, u)=0 and by the variational inequalities a(u — al,!>)>0 for all veU. upon replacing v by (u + v) in inequalities A.4), we obtain the inequalities a(u,v)^l(v) for all veU, so that, in particular, a(u, u)^l{u). Letting v = 0 in A.4), we obtain a{u, u)^l(u), and thus relations A.5) are proved. The converse is clear. If U is a subspace (Fig. 1.3), then inequalities A.5) written with v and —v yield Fig. 1.3. In a Hilbert space with inner product a(-,-), the projection и of <r( onto a closed subspace U is characterized by the variational equations a(al—u, d) = 0 for all vs U.
Section 1 Elliptic boundary value problems 27 a(u, v) ^ l(v) and a{u, v) =$ l(v) for all veU, from which relations A.6) follow. Again the converse is clear. □ Remark 1.1. Since the projection mapping is linear if and only if the subset U is a subspace, it follows that problems associated with variational inequalities are generally nonlinear, the linearity or nonlinearity being that of the mapping le V'-me V, where V is the dual space of V, all other data being fixed. One should not forget, however, that if the resulting problem is linear when one minimizes over a subspace, this is also because the functional is quadratic, i.e., it is of the form A.2). The minimization of more general functionals over a subspace would correspond to nonlinear problems. The various equivalent formulations of the minimization problem A.1) given in Theorems 1.1 and 1.2 may be also interpreted from the point of view of differential calculus, as follows. We first observe that the functional J is differentiable at every point ueV, the action of its Frechet derivative J'(u) e V on an arbitrary element v e V being given by J'(ify = a(u,v)-l(v). A.8) Let then и be the solution of the minimization problem A.1) and let v = u + w be any point of the convex set U. Since the points (u + 6w) belong to the set U for all 6e [0,1] (Fig. 1.4), we have, by definition of the derivative J(u), 0^J(u + 6w)-J(u) = ej'(u)w + 6\\w\\s(e) Fig. 1.4. If и belongs to a convex set U and if J(u)=infytsUJ{v), then for each v = {u + w)el), J(u + 0w)-J(uK=O for all Os£0s: 1. for all 0e[0,1], with Итв^0е(в) = 0. As a consequence, we necessarily have J'(u)w>0, A.9) since otherwise the difference J(u + 6w)—J(u) would be <0 for в small enough. Using A.8), we may rewrite inequality A.9) as J'(u)w — J'(u)(v — u) = a(u, v — u) — l(v—u)^ 0,
28 P.G. Ciariet Chapter I which is precisely A.4). Conversely, assume that we have found an element ueU such that J'(u)(v-u)^Q for all ve U. A.10) The second derivative 5"{и)&У?2(У\ Щ of the functional J is independent of ueV and its action on arbitrary elements veV and we V is given by J"(u)(v,w) = a(v,w). A.11) Thus, for any point v = u + w belonging to the set U, an application of Taylor's formula yields J(u + w) - J{u) = J'(«)(w) + ie(w,w)^ia || w ]|2, A.12) which shows that и is a solution of problem A.1). We have J(v) — J(u)>0 unless v = u so that we see once again that the solution is unique. Arguing as in the proof of Theorem 1.2, we then easily verify that inequalities A.10) are equivalent to the relations J'(u)v^Q, J'(u)u=0 for all veU, A.13) when U is a convex cone with vertex 0, and that they reduce to J'(u)v = 0 for all uel/, A.14) when U is a subspace. Notice that relations A.13) coincide with relations A.5), and that relations A.14) coincide with relations A.6). When U=V, relations A.14) reduce to the familiar condition that the first variation of the functional J, i.e., the first-order term J'(u)w in the Taylor expansion A.12), vanishes for all w e V when the point и is a minimum of the function /: V-*M, this condition being also sufficient if the function J is convex, as is the case here. By means of the equivalent relations A.10), A.13), and A.14), relations A.4), A.5), and A.6) thus appear as generalizations of the previous condition, the expression a{u, v—u)—f{v-u)~ J'(u)(v — u) playing in the present situation the role of the first variation of the functional J relative to the convex set U. It is in this sense that the formulations of Theorem 1.2 are called variational. More precisely, the characterizations A.4), A.5), and A.6) are called variational formulations of the original minimization problem, the equations A.6) are called variational equations, and the inequalities of A.4) and A.5) are called variational inequalities. Without making explicit reference to the functional J, we can also define various abstract variational problems: Find an element и such that U€U> A.15) a(u, v — u)^l(v — u) for all ve U,
Section 1 Elliptic boundary value problems 29 in the general case, or, find an element и such that ueU, a(u,v)>l(v) for all eel/, A.16) a(u, u) = l(u), if U is a cone with vertex 0, or, finally, find an element и such that ueU, (U7) a(u, v) = l(v) for all veV, if U is a subspace. By Theorem 1.1, each of these problems has one and only one solution if the space V is complete, if the subset U of V is closed and convex, and if the bilinear form is F-elliptic, continuous, and symmetric. // the assumption of symmetry of the bilinear form is dropped, the above variational problems still have one and only one solution if the space V is a Hilbert space, but there is no longer an associated minimization problem. Here we shall confine ourselves to the case where U=V. Theorem 1.3. (Lax-Milgram lemma). Let Vbea Hilbert space, leta(- ,-):Vx V~* Ш. be a continuous V-elliptic bilinear form, and let l:V-*U be a continuous linear form. Then the abstract variational problem: Find an element и such that ueV> A.18) a(u,v)=l(v) for all veV, has one and only one solution. Proof. Let M be a constant such that |а(и,о)|<М||и|||И forallu,ueF. A.19) For each ueV, the linear form ve V-*a(u, v) is continuous and thus there exists a unique element AueV (V is the dual space of V) such that a(u, v) = Au(v) for all v e V. A.20) Denoting by || • ||' the norm in the space V', we have Mu|r = sup!^f^M||u|!. veV И Consequently, the linear mapping A: V-+V is continuous, with MII*(K;K',<M. A.21) Let t.V'->V denote the Riesz mapping which is such that, by definition, l(v)=((zl,»)) for all /e V and all ve V, A.22) where ((•,•)) denotes the inner- product in the space V. Then solving the variational problem A.18) is equivalent to solving the equation xAu = xl. We will show that this
30 P.G. Ciariet Chapter I equation has one and only one solution by showing that, for appropriate values of a parameter p > 0, the affine mapping veV-*v-p{xAv-xl)eV A.23) is a contraction. To see this, we observe that \\v- pxAv\\2 = \\v\\2 -2p{(xAv,v))+p2\\xAv\\2 ^{\-2pa + p2M2)\\v\\2, since, by inequalities A.3) and A.21), {(tAv, v)) = Av(v) = a(v, v) > a || v \\2, ||ту1«|| = ||А»||'<||Х||||о||<М||»||. Therefore the mapping defined in A.23) is a contraction whenever the number p belongs to the interval ]0,2a/M2[ and the proof is complete. □ Remark 1.2. It follows from the previous proof that the mapping A: V-+V is onto. Since 0£||и||2<в(и,и) = /(и)<||/|Г||и||, the mapping A has a continuous inverse A'1, with Therefore the variational problem A.18) is well-posed in the sense that its solution exists, is unique, and depends continuously on the data f (all other data being equal). More generally, one can show that, if Wj and u2 are solutions of problem A.15) corresponding to linear forms lx and l2, then ll«i-«2K-l|fi-M'. a The original reference of the Lax-Milgram lemma is Lax and Milgram [1954]. Our proof follows the method of Lions and Stampacchia [1967], where it is applied to the general variational problem A.15), and where the case of semipositive- definite bilinear forms is also considered; Stampacchia [1964] had the original proof in this case. For constructive existence proofs and additional references, see also Glowinski, Lions and Tremolieres [1976a]. We also mention that BabuSka (Babuska and Aziz [1972, Theorem 5.2.1]) has extended the Lax-Milgram lemma to the case of bilinear forms defined on a product of two distinct Hilbert spaces. 2. The Sobolev spaces Hm(Q) and Green's formulae For treatments of differential calculus with Frechet derivatives, the reader may consult Avez [1983], Cartan [1967], Dieudonne [1967], Schwartz [1967]. For the theory of distributions and its applications to partial differential equations, see
Section 2 Elliptic boundary value problems 31 Schwartz [1966]. Other references are Treves [1967], Shilov [1968], Vo-Khac Khoan [1972a, 1972b], Choquet-Bruhat [1973], Hormander [1983]. The Hilbertian Sobolev spaces Hm(Q) are studied in Lions and Magenes [1968, Chapter 1], Dautray and Lions [1984, Chapter 4] (references on the more general Sobolev spaces Wm,p(Q) are given in Section 14). Let us first briefly recall some results from differential calculus. Let there be given two normed vector spaces X and 7 and a function v: A^ Y, where A is a subset of X. If the function is к times differentiable at a point as A, we shall denote by Dkv(a), or simply by Dv(a) if к = 1, its kih Frechet derivative. Note that we also use the alternate notations Dv(a) = v'(a) and D2v(a) = v"(a). The fcth derivative Dkv(a) is a symmetric element of the space Л?к(Х; Y), and its norm is given by \\Dkv(a)\\= sup \\Dkv{a)(huh2,...,hk)l lUisik In the special case where X = R" and Y= R, let eh l^i^n, denote the canonical basis vectors of №. Then the usual partial derivatives are given by biV(a) = Dv{a)ei, dijv(a) = D2v(a)(ehej), dijkv(a) = D3v(a)(et, eh ek), etc., and occasionally, we use the notation Vu(a), or Sv{a), to denote the gradient of the function v at the point a, i.e., the vector in R" whose components are the partial derivatives 8,i?(a), 1 ^ i ^ n. We also use the multi-index notation for denoting the partial derivatives: Given a multi-index a=(a!,a2,.. .,a„)eN", we let |а| = Е"=1аг. Then the partial derivative d*v(a) is the result of the application of the |a|th derivative £)|а|и(а) to any |a|-vector of (R"I"' where each vector et occurs at times, 1 <i<n. For instance, if n — 3, we have Э1»(в) = ЭA,0-0)ф), 6123ф) = 8A'1'1)ф), d111v(a) = di3'°-0)v(a),... Clearly, there exist constants C(m, n) such that for any partial derivative d"v(a) with |a| = m and any function v, \dxv(a)\^ ||Dmv(a)|| <C(m,n) max \dxv(a)\ (unless otherwise specified, it is understood that the space W is equipped with the Euclidean norm). As a rule, we represent by symbols such as Dkv,v",Qtv,dxv,..., the functions associated with any derivative or partial derivative. When h1 = h2 = --- = hk = h, we simply write Dkv(a)(h1,h2,...,hk)=Dkv(a)hk. Thus, given a real-valued function v, Taylors formula of order к is written as * 1 1 v(a + h) = v(a)+ X -Dlv(a)hl + -—-Dk+1v(a + 6h)hk+1, for some 0e]0,1[ (whenever such a formula applies).
32 P.G. Ciarlet Chapter I Given a bounded open subset Q in W, the space &>(Q) consists of all indefinitely differentiable functions v:Q-*U with compact support. For each integer m ^ 0, the Sobolev space Hm{Q) consists of those functions v e L2(Q) for which the partial derivatives d*v in the distributional sense with |a| ^ m, belong to the space L2(Q) for all |a| 5%m, i.e., for each multi-index a with |a| ^m, there exists a function dav e L2{Q) that satisfies dVdx = (~l)|a| Ue^dx iord\\<t>e9(Q). B.1) n a Equipped with the norm MML.n=( I [|6ar]2dxI/2, \|«|<m J / the space Hm(Q) is a Hilbert space. We shall also make frequent use of the semi-norm \|a[=m J / П We define the Sobolev space the closure being understood in the sense of the norm || • ||тЯ. When the set Q is bounded, the Poincare-Friedrichs inequality holds: there exists a constant C(Q) such that No.n<C(fi)Hli0 foraU»eflJ(Q). B.2) Therefore, w/геи t/ге set Q is bounded, the seminorm | • |m,n is a norm over the space Hq(Q), equivalent to the norm || • ||mifi (see also Theorem 3.1). Following NeCas [1967], we say that an open set Q has a Lipschitz-continuous boundary Г if the following conditions are fulfilled: There exist constants a>0 and /?>0, and a finite number of local coordinate systems and local maps ar, 1 ^r<K, that are Lipschitz-continuous on their respective domains of definitions {xreRn;|x,'|sS<x}, such that (Fig. 2.1): R T=U {{x\,xr);x\=ar(xr),\xr\<a}, r=l {{x\,xr);ar{xr)<x\<aT{xr) + P, |xr|<a}c:£2, 1<г^Д, {(xr1,xr);ar(xr)-li<xr1<ar(xr),\xr\<a}c:W-Q, l^r^R, where xr = (x;>,.. .,xr„), and |xr|<a stands for |xf|<a,l^i^n. Occasionally, we shall also need the following definitions: A boundary is of class S£ if the functions a/. \xr\ <<x~>R are of class Ж (such as <€m or свт'% and a boundary
Section 2 Elliptic boundary value problems 33 Fig. 2.1. A domain in U2 is an open, bounded, connected subset with a Lipschitz-continuous boundary. is said to be sufficiently smooth if it is of class <€m, or ^""'a, for sufficiently high values of m, or m and a. A domain in W is an open, bounded, connected subset with a Lipschitz- continuous boundary. This definition is particularly well suited for our subsequent purposes, in that it allows the consideration of domains with "corners" or "edges", such as polyhedra. In the remaining part of this section, it will be always understood that Q is a domain in R". In particular then, a superficial measure, which we shall denote dy, can be defined along the boundary, so that it makes sense to consider the spaces Ь2(Г), whose norm shall be denoted || • ||£,2(r). Then it can be proved that there exists a constant C(O) such that NL*(r)*£C(G)N|1(o for all ve^iQ). B.3) Since {^"(fl)} ~ =H1(Q) if О is a domain, the closure being taken with respect to the norm I| • [|lrn, there exists a continuous linear mapping tr:veHx(Q)-*tr veL2(r), which is called the trace operator. Note however that, when no confusion should arise, we shall simply write trt; = y. The following characterization of the space
34 P.O. Ciarlet Chapter I Hh{Q) holds: Hh(Q)={veH Щ; v=0 on Г}. Since the unit outer normal vector v = (v1(..., v„) (Fig, 2.1) exists almost everywhere along Г, the (outer) normal derivative operator, n is defined almost everywhere along Г for smooth functions. Extending its definition to 8V = S"=! V; tr 8; for functions in the space H2(Q), we obtain the following characterization of the space Яо(£2): Hg(fl) = {ueH2(fi);u = 8vt;=0 on Г}. Given two functions u, veH1(Q), the fundamental Green formula udtv dx = — btu v dx + \u vvt dy B.4) holds for any ie [1, n]. From this formula, other Green's formulae are easily deduced. For example, replacing и by 8,u and summing from 1 to n, we get Y, 8;tt3;t)dx=— Auvdx+ 8vui;dy B.5) n г for all ueH2(Q), veH\Q). As a consequence, we obtain by subtraction: (uAv — Auv)dx = n {ubvv — dvuv)dy B.6) for all u, veH2(Q). Replacing и by Au in formula B.6), we obtain Mu/li>dx = A2uvdx— \dvAuvdy+ AudYvdy B.7) J -J J л q а г г for all ueH\Q), veH2{Q). As another application of formula B.4), let us prove the relation \Mo.a = \v\2,a &>га11»еЯ§@), B.8) which implies that, over the space Hq(Q), the seminorm v-*\Av\0i{} is a norm, equivalent to the norm || • ||2,д: We have, by definition, \v\l,n = Z@i;vJ+Y@ijvJ\^ i iitj J
Section 3 Elliptic boundary value problems 35 Clearly, it suffices to prove relations B.8) for all functions v e 3>{Q). But for such functions, we have (8yi?J dx = divQijjV dx - dijvdjjv dx, as two applications of Green's formula B.4) show, and thus B.8) is proved. For n = 2, let т = (тх, т2) denote the unit tangential vector along the boundary Г, oriented in the usual way. In addition to the normal derivative operator 3V, we introduce the differential operators 8t, Svr, 6„ defined by 2 dzv{a)=Dv{a)x= ]T тД-i^a), i=i 2 8vti;(a)=D2t;(a)(v,T)= £ v^Syufa), 2 3„ф) = /Jф)(т,т) = ^ т(т;8ур(а). u=i Note that 8zzv does not coincide in general with the second derivative of the function v considered as a function of the curvilinear abscissa along the boundary. Then one can show that another Green's formula, viz., {2d12ud12V-dilud22V-d22ud11v}dx'= {-dztudvv + dvtudrv} dy, B.9) r holds for all functions ueH\Q), veH2{Q). 3. Examples of second-order boundary value problems: The membrane problem, the boundary value problem of linearized elasticity and the obstacle problem We next proceed to examine several examples of minimization problems that fall in the abstract setting of Section 1. According to the analysis made in this section, we thus need to specify for each example the space V, a subset U of the space V, a bilinear form a(-,-):Vx F->R, and a linear form /: V-*R. In all the examples, the set Q is a domain in W. Example 3.1. The first example corresponds to the following data: V=U = Hh(Q), a(u,v)= I Y, SjuSji) + buv I dx, C.1) я /(d) = \fvdx, Q
36 P.G. Ciarlet Chapter I and the following assumptions are made on the functions b and /: beU°{Q), b^O a.e. опй, feL2(Q). C.2) To begin with, it is clear that the symmetric bilinear form a( •, •) is continuous since for all u,veHl(Q), n \a(u,V)\^ £ |6;М|о,я|9;1>|0,й+|Ь|о,а,,йМо,яМо,Я i=l <max{l,]b|0,oo.o}Nli,oNli,fl. where j • \0if, and | • |0o0 fi denote the norms of the space L\Q) and L°°(fi) respectively. Further, it is Ho(£2)-elliptic since, for all veH1(Q), a(v, v) > Q Y @ivJ dx = \v\la (by the Poincare-Friedrichs inequality B.2), the seminorm | • 11>fi is a norm over the space Ho(Q), equivalent to the norm || • || li0). Next, the linear form I is continuous since for all veH1(Q), \m\<\f\o,o\v\0.o^\f\ o.nlMli.n- Therefore, by Theorem 1.1, there exists a unique function неЯо(Й) that minimizes the functional J:»-»J(») = i £ (diVJ + bv2]dx- fv dx C.3) Q over the space Ho(Q), or equivalently, by Theorem 1.2, that satisfies the variational equations 1 Г £ Э,иЭ,1> + buv Ydx = fv dx for all v e Hh(Q). C.4) ;=i J J q n Using these equations, we now show that we are also solving a partial differential equation in the distributional sense. More specifically, let @)'(Q) denote the space of distributions over the set Q, i.e., the dual space of the space @{Q\ equipped with the Schwartz topology, and let <•,•> denote the duality pairing between the spaces 2'{Щ and 3>{Q). As usual we identify a function g that is locally integrable over Q with the distribution g: феЩ£2)-+§Рдфйх. Since the inclusion 9{Q)aV-=Hl(Q) holds, the variational equations C.4) are satisfied for all functions ce§(fl). Therefore, by definition of the differentiation of distributions, we may write n a(u,ф) = £ <S/",б,Ф> + <Ъи,ф} = (-Аи + Ьи,фУ for all фе®{Q). ;=i
Section 3 Elliptic boundary value problems 37 Since 1(ф) = {/, ф) for all ^e^(fl), it follows from the above relations that и is a solution of the partial differential equation — Au + bu=fin <%'(Q). To sum up, the solution и of the minimization (or variational) problem associated with the data C.1) is also a solution of the problem: Find a distribution ue@)'{Q) such that ueHh№ C.5) -Au + bu=f in 9>\Q), and conversely, if a distribution и satisfies C.5), it is a solution of the original problem. To see this, we observe that the equalities a(u,ф) = (-Аи + Ьи,ф} = (/,ф) = 1ф) for all фе2{Q) hold in fact for all functions ф е Hq@) since 3>(Q) is a dense subspace of the space Hh@). Remembering that the functions in the space Hq(Q) have a vanishing trace along Г, we shall say that we have formally solved the associated boundary value problem — Au + bu=f in Q, u = 0 on Г. C.6) The terminology "boundary value problem" refers to the fact that the values of the unknown function и (as in this example) or of some differential operator acting on the unknown function и (as in some of the subsequent examples) are specified along the boundary of the set Q. Problem C.6) is called a homogeneous Dirichlet problem for the operator u-* — Au-\-bu, since it is formally posed exactly as in the classical sense where, typically, one would seek a solution in the space #°(£2)п#2(£2). Actually, when the data are sufficiently smooth, it can be proved (but this is not easy) that the solution of C.4) is also a solution of C.6) in the classical sense. Nevertheless, one should keep in mind that, in general, nothing guarantees that the partial differential equation — Au + bu =f in Q can be given a sense otherwise than in the space 3i'(Q). Likewise the boundary condition и=0 on Г cannot be understood in general otherwise than in the sense of a vanishing trace, or even in no sense at all if the set Q were "only" supposed to be bounded. A modification of the linear form of C.1) will lead us to the second example. Example 3.2. Let the space V and the bilinear form be as in C.1), and let the linear form be defined by l(v) = \fvdx-a{u0,v), C.7) n where the functions / and b satisfy assumptions C.2) and u0 is a given function in the space H1(Q). Proceeding as before, we could likewise show that these data correspond to the formal solution of the nonhomogeneous Dirichlet problem for the
38 P.G. Ciariet Chapter I operator u-+ —Au + bu, viz., — Au + bu~f in Q, u = u0 on Г. C.8) When b=0 and n — 2, this problem is called the membrane problem: It arises in linearized elasticity when one considers the problem of finding the equilibrium position of an elastic membrane, with tension т, under the action of a vertical force of density F = xf, and whose vertical displacement и is equal to a known function u0 along the boundary Г (cf. Fig. 3.1). Fdx Fig, 3.1. The membrane problem: the unknown function u: Q <= U2 ->R represents the vertical displacement of a membrane subjected to a vertical force of density F per unit area. Example 3.3. The third example corresponds to the following data: a{u, v) = £ diudiv + buv )dx, l(v) = fvdx+ \gv dy, о г and the following assumptions are made on the functions a, f and g: beL^iQ), b^bo>0 a.e. on Q, feL2(Q), gsL2(D C.9) C.30)
Section 3 Elliptic boundary value problems 39 for some constant b0. Hence both the space V and the linear form / differ from those of Example 3.1. The bilinear form is Hl{Q)-e\liptic since a(v, v) ^ min{l,b0} ||«||ii0 for all veHl(Q). The linear form veH1(Q) -* $rgvdy is continuous since by inequality B.3), gvdy <Ыьнг)Ытп =* C(Q)\\g\\LHn\\v\\uo. Therefore there exists a unique function ue H1(Q) that minimizes the functional J:»-»J(») = i Z(d,vJ + bv2\dx fvdx — gvdy, over the space H\Q) or, equivalently, such that Y, diUdiV + buv>dx fv dx + gv dy for all v e H1 (Q). C.11) Because @{Q) is a subspace of the space Hi{Q), an argument similar to the one used in Example 3.1 shows that и is also a solution of the partial differential equation — Au + bu = f in 3i'(Q). Hence we have (—Au + bu)vdx = a(u, v)~ \gvdy for all veHi(Q). To sum up, the solution u of the minimization (or variational) problem associated with the data C.9) is also a solution of the problem: Find a distribution и е !3'{Q) such that ueH\Q), -Au + bu = f in ЩП), Г C-12) ( — Au + bu)vdx = a(u,v)— gvdy for all vsH1(Q), and, conversely, if a distribution и is a solution of problem C.12), it is clearly a solution of the variational equations C.11). If we assume additional smoothness on the solution, the second relations in C.12) shows that the solution satisfies a boundary condition: If the solution и is in the space H2{Q), for example, an application of Green's formula B.5) shows that, for all veH'iQ), a(u, v) = (—Au + bu)vdx + dvuvdy fvdx + gvdy. C.13)
40 P.G. Ciarlet Chapter I Therefore the conjunction of relations C.12) and C.13) implies that 9vMt>dy= \gvdy for all veH1(Q), C.14) г г and from these, one deduces that dyu=g on Г. Consequently, we shall say that we have formally solved the associated boundary value problem: — Au + bu = f in Q, dvu = g on Г, C.15) which is called a nonhomogeneous Neumann problem if g ф 0, or a homogeneous Neumann problem if g = 0, for the operator и ~* ~-Au + bu. Remark 3.1. Without using differentiation of distributions, we may use Green's formula B.5) to obtain the partial differential equation since а{и,ф)= ( — Аи + аи)фйх — < — Ли + аи, ф) for all ф е 3)(Q). о This is not a coincidence: The definition of differentiation for distributions is precisely based upon the fundamental Green formula B.4). If b = 0, the bilinear form a(- ,•) of'C.9) is no longer Hl(Q)-elliptic. We can however circumvent this difficulty by using the following result (which is a special case of a result proved later; cf. Theorem 14.1): Let P0{Q) denote the space of all constant functions over Q. Then the seminorm I'li.o is a norm over the quotient space H1(Q)/P0(Q), equivalent to the quotient norm. This observation leads us to our fourth example, Example 3.4. Let we Hl(£2)/P0(Q) denote the equivalence class of an arbitrary element wsH1{Q), and let: 5/=[/ = Я1@)/Ро(А), а(й, v) = Y, 9,u9;t)dx, C.16) /(«5) = fv dx + gv dy. г The bilinear form a( •, •) is well defined and continuous over the quotient space V=H1(Q)/P0(Q), and further, it is now V-elliptic by the aforesaid result. If (and only if) fdx+ \gdy = 0, C.17)
Section 3 Elliptic boundary value problems 41 the linear form I is also well defined and continuous over the quotient space. Arguing as before, we find that the variational problem associated with the data C.16) corresponds to the formal solution of the associated boundary value problem — Ли-f in Q, dvu=g on Г, C.18) which is another nonhomogeneous Neumann problem for the operator —A. Remark 3.2. The necessity of the relation C.17) also follows a posteriori from C.18) and from the formula \qAu Ax = \Tbyu dy, obtained by letting v = 1 in Green's formula B.5). Example 3.5. In the fifth example, we extend in two directions the previous examples: First the associated partial differential equation will have nonconstant coefficients and secondly, the bilinear form will not be necessarily symmetric so that the Lax-Milgram lemma (Theorem 1.3) will be needed for proving the existence of a solution. The data are the following: У=и = {оеНЩ-^ = 0 on Г0}, a(u, v)- < £ aifiiudjV + buvfdx, C.19) l(v)= \fvdx + gvdy, a ri where Г0 = Г—Г t is a dy-measurable subset of the boundary Г whose dy-measure is >0, and the functions a,7, b and / satisfy the following assumptions: flyeL^Q), KiJ^n, beL°°(fl), b>0 a.e. on O, C.20) feL2(Q), geLHTil and there exists a constant ft such that л n j3>0, £ ayZtZj^fi £ & for all &, Ui^ a.e. on Q. C.21) The F-ellipticity of the bilinear form of C.19) will be a consequence of the following result. Theorem 3.1. Let Qbea domain in Ш". Then the space V defined in C.19) is a closed subspace of Hl(Q). If the dy-measure of Г0 is >0, the seminorm |*ti,n iS a norm over the space V, equivalent to the norm || * lli.n- Proof. Let (vk) be a sequence of functions in the space V that converges to an element ve H1(Q). Since the sequence (trut) converges to try in the space Ь2(Г) (cf. inequalities B.3)), it contains a subsequence that converges almost everywhere
42 P.G. Ciarlet Chapter I to tr v and thus tr v = 0 a.e. on Г0, This implies that the function v belongs to the space V. Next, let us show that | • |1>fl is a norm over the space V. Let v be a function in the space V that satisfies |t>|1>n=0. Then it is a constant function by virtue of the connectedness of the set Q; thus its trace is a constant function that takes the same value, and this value is zero since the trace vanishes on the set F0, whose dy-measure is >0. Finally, assume that the two norms | • |ljC and || • |i 1,0 are not equivalent over the space V. Then there exists a sequence (vk) of functions vke V such that llf*IIi,o=l for all k, lim Ы1>п = 0. fc->oo By Rellich's theorem, any bounded sequence in the space H1(Q) contains a subsequence that converges in L2(Q), so that there exists a sequence (t>() of functions d, e V that converges in the space L2{Q). Since Нт^иМ^я = 0, on the other hand, the sequence {vt) is a Cauchy sequence in the complete space V, and therefore it converges with respect to the norm || ■ ||1>я to an element veV. Since |y| 1>fi = limi_ ao\vi\i,n — 0>we deduce that v = 0, which is in contradiction with the equalities ||t^||1;fi=l for all k. □ From this theorem, we infer that the bilinear form of C.19) is F-elliptic since we have a(v,v)^P\v\ln for all veHl{Q) by inequalities C.20) and C.21). By the Lax-Milgram lemma (Theorem 1.3), there exists a unique function ueV that satisfies the variational equations 1 I] uijdiudjV + buvfdx — fvdx + gv dy for all v e V. C.22) Referring once again to formula B.4), we obtain another Green's formula: n Г п Г п Y, aijdiudjvdx= — ]T 5j(aij6iu)vdx+ ^ a^dtuvjdy, C.23) ij= 1 J Uj= 1 J Uj= 1 a n г valid for all functions ue H\Q),ve H1(Q), provided the functions atJ are smooth enough, e.g., in the space <^'1(Q), so that the functions ауЭ(н belong to the space Hl(Q). Using C.23) we conclude that we have formally solved the associated boundary value problem n ~ Yj dj(aijdiu) + bu=f in Q, u=i и = 0 on Г0, C.24) n £ aijyfiiU = g on Гъ
Section 3 Elliptic boundary value problems 43 which is called a homogeneous mixed problem ifg=Q, or a nonhomogeneous mixed problem if g=£0, for the operator n "-"- E Э;(ауЭ,и) + Ьм. C.25) Notice that condition C.21) is the classical ellipticity condition for an operator such as that of C.25), which is accordingly called an elliptic operator when this condition is fulfilled. The boundary operator и u-> £ atjVjdiU u=i is called the conormal derivative operator associated with the operator of C.25). If Г=Г0, or if Г=Г!, we have formally solved a homogeneous Dirichlet problem, or a homogeneous or a nonhomogeneous Neumann problem respectively, for the operator of C.25) (in the second case, we would require an inequality such as b > b0 > 0 a.e. on Q to get existence). We now come to the sixth example, which is by far the most important. Example 3.6. Let Q be a domain in U3. We define the space V=U={v = (Vl,v2,v3)e(H\Q)K; »,=0 on Г0, Ui<3}, C.26) where Г0 is a dy-measurable subset of Г, whose dy-measure is >0. The space V, which is a closed subspace of (Hl(Q)K, is equipped with the product norm / 3 N1/2 r = (»i,»2,»3)-IMIi,o = ( E ЫЫ C-27) For any у=(г1,у2,из)е(Я1(А)K, let Ф) = tyW = i(9y», + 3,-оД 1 ^ i,j ^ 3, C.28) and ffyW==^W = -l( I ^(")Vy + 2/iey(v), l^UO, C.29) where Ey is the Kronecker's symbol, and A and /^ are two constants that satisfy A>0, ц>0. C.30) We then define the symmetric bilinear form a(u,v) = з £ ffy(»)£y(v)d* i,j=l j A div и div v + 2ц £ ey(»Ny(v) i dx, C.31)
44 P.O. Ciarlet Chapter I and the linear form l(v)= \f-vdx + g-vdy з = £ f,vtdx+ \ Y,9iVidy, C.32) where/=(/1J2J3N(L2(fi)K andg = igug2,g3)e(L2{r1)K, with Г^Г~Г0. It is clear that these bilinear and linear forms are continuous over the space V. To prove the P-ellipticity of the bilinear form (Theorem 3.2), one needs Korn's inequality: There exists a constant C(Q) such that, for all v = (v1,v2, v3)e{H1(Q)K, Ыип^СЩ £ |ey(r)|§>0+ £ ЫЬУ'2. C.33) \«,j=i ;=i / This inequality is difficult to establish; its proof may be found in Duvaut and Lions [1972, Chapter 3, Section 3.3], or in Fichera [1972a, Section 12]. Theorem 3.2. Let Q be a domain in R3, and let Г0 denote a dy-measurable subset of Г, whose dy-measure is >0. Then the mapping С 3 -11/2 v=(i>i,t>2,i>3)-{Z М")|Ц C-34) is a norm over the space V of C.26), equivalent to the product norm C.27). Proof. The proof is similar to that of Theorem 3.1, and for this reason will be only sketched (for details, see e.g. Duvaut and Lions [1972] or Ciarlet [1988, Section 6.3]). One first shows that a function ve (H1{Q)K that satisfies ey(v)=0 in Q, 1 ^i,j<3, is of the form v: x-+v(x) = a+bx Ox, for some constant vectors aeU3 and be U3. Using Korn's inequality C.33), one then shows that the norm defined in C.34) is equivalent to the product norm. □ The F-ellipticity of the bilinear form C.31) is therefore a consequence of inequalities C.30), since by C.31): з a(v,v)^2fi X |£y(v)|giQ. ij=l We thus conclude that there exists a unique function us V that minimizes the functional J{v) = i \ JA(div(v)J + 2/i £ (£;>)J1 dx- f-vdx + g-vdy) C.35)
Section 3 Elliptic boundary value problems over the space V or, equivalently, that satisfies 45 Г 3 £ ffv(ii)ey(»)dx = f-vdx + g-vdy for all ve V. C.36) Since relations C.36) are satisfied by all functions v e {ЩО)K, they could yield the associated partial differential equations. However, as was pointed out in Remark 3.1, it is equivalent to use Green's formulae, which in addition have the advantage of simultaneously yielding the associated boundary conditions (cf. C.40)). Using Green's formula B.4), we obtain, for all ue {H2(Q)f and all ve (НЩK: <rij(ufijVidx = - \(QJaij(H))vidx + ои{и)ь^йу, so that, using definitions C.28) and C.29) we have proved that the following Green's formula holds: * з n = [ £ (- £ W)W+ f £ ( £ ffu(Wkdy> C-37) J i=l \ j=l / J i=l \j=l / A Г for all functions us{H2{Q)f and vs{H\Q)f. Arguing as in the previous examples, we find that we are formally solving the equations These equations are sometimes written in vector form as — \xAu—{k+^)grad div и =/ in Q, C.38) C.39) which is obtained from C.38) simply by using relations C.29). Taking equations C.38) into account, we find that the variational equations C.36) reduce to •3/3 \ /* 3 E £ ffij(u)vJ\vidy= Y,9tVidy forallveF, J t=l \j=l / J i=l since v = 0 on Г0 = Г — Г%. To sum up, we have formally solved the following associated boundary value
46 P.G. Ciarlet Chapter I problem: —цЛи—(Х + /x)grad div и =f in Q, u = 0 on Г0, C.40) з X Oij(u)Vj=gt on Л, f ^ i s$ 3, ;=i which is called the boundary value problem of linearized elasticity, it is a linearization of the boundary value problem of nonlinear three-dimensional, elasticity that describes the equilibrium of an elastic homogeneous, isotropic body (Fig. 3.2) that occupies the set Q in the absence of forces; the vector и denotes the displacement of the points of Q under the influence of given forces (as usual, the scale for the displacements is distorted in the figure). The body Q cannot move along Г0, and along Г и surface forces of density g are given. In addition, a volumic force, of density / is prescribed inside the body Q. For details about three-dimensional elasticity and the validity of this linearization, see e.g. Ciarlet [1988]. Fig. 3.2. The boundary value problem of linearized elasticity: the unknown vector field u:Q->R3 represents the displacement vector of a homogeneous, isotropic, elastic body occuping the set U in the absence of forces, and subjected to volumic forces of density /: Q-+R3 and to surface forces of density Then we recognize in (еу(и)) the linearized strain tensor while (<Ху(и)) is the linearized stress tensor, the linear relationship C.29) between the linearized tensors being known in linearized elasticity as Hooke's law. The constants X and fi are the Lame coefficients of the material that constitutes the body; experimental evidence indicates that they satisfy inequalities C.30) for actual materials.
Section 3 Elliptic boundary value problems 47 Remark 3.3. It can be shown that the bilinear form of C.31) remains elliptic under the weaker assumptions 3A + 2//>0, /i>0. The variational equations C.36) represent in linearized elasticity the principle of virtual work, valid for all kinematically admissible displacements v, i.e., that satisfy the boundary condition v=0 on Г0. Remark 3.4. It is interesting to notice that the strict positiveness of the dy-measure of Г0 has a physical interpretation: It is intuitively clear that if the dy-measure of Г0 were zero, the body would be "free" and therefore there could not exist an equilibrium position in general. The functional J of C.35) represents in linearized elasticity the total energy of the body. It is the sum of the strain energy: ijb(divvJ + 2^ i(£ij(v)J\ux, and of the potential energy of the exterior forces: (J f-vdx + g-vdy Example 3.6 is the most crucial one, not only because it has obviously many applications, but also because its variational formulation, described here, is basically responsible for the invention of the finite element method by engineers. We conclude this section by studying various problems posed in terms of variational inequalities, i.e., where the set U is not a vector space. To begin this second series of examples, we describe an interesting variant of the membrane problem. Example 3.7. The obstacle problem consists in finding the equilibrium position of an elastic membrane, with tension z, which, as before (cf. Fig. 3.1), passes through the boundary Г of an open set Q of the horizontal plane and is subjected to the action of a vertical force of density F = zf but which, in addition, must lie over an "obstacle" which is represented by a function %: П-*М, as illustrated in Fig. 3.3. This seventh example is thus naturally associated with the following data: V=Hh(Q\ О с U2, U = {veHl0{Q);v^x a.e. in Q}, C.41) a{u, v) = Vu-Vrdx, Kv) = fvdx. For definiteness, we shall make the following assumptions on the functions % and /: XeH2(Q), x^O onf, feL2(Q). C.42)
P.G. Ciarlet Chapter I Unknown contact zone Fig. 3.3. The obstacle problem: the membrane must lie over an "obstacle", which is represented by a function x- U-*R. The set U, which is not empty by virtue of the second assumption of C.42), is easily seen to be convex. To show that it is closed, it suffices to observe that every convergent sequence in the space L2(Q) contains an a.e. pointwise convergent subsequence. Thus we may apply Theorem 1.1: There exists a unique function ueU that minimizes the membrane energy J:v-*J(v)=$ ||Vi;|l2dx- fvdx C.43) over the set U, and it is also the unique solution of the variational inequalities Vu • V(t> — u) dx > f(v — u) dx for all veU, C.44) by Theorem 1.2. By contrast with the solution of the membrane problem, the solution of the obstacle problem is not smooth in general, even if the data are very smooth. To be convinced of this phenomenon, consider the one-dimensional analog with f = 0, as shown in Fig. 3.4. In this case, the solution is affine in the region where it does not touch the obstacle and consequently, whatever the smoothness of the function x, the second derivatives will have discontinuities at points such as £ and n. Therefore the solution и is "only" in the space H2(I). These results carry over to the two-dimensional case, but they are of course much less easy to prove. For example, it is known that if the function x satisfies the assumptions of C.42),/ = 0, and Q is a convex polygon, the solution и belongs to the space Hq(Q) n H2(Q). If the set Q is convex with a boundary of class tf2 and
Section 3 Elliptic boundary value problems 49 Fig. 3.4. The one-dimensional analog of the obstacle problem, with /=0. assumptions C.42) hold then we have again и е Ho(Q) n H2(Q). In both cases, the norm ||и||2,о can be estimated in terms of the norms Hxll2.fi and \f\0,n of the data. These results are proved in Brezis and Stampacchia [1968] and Lewy and Stampacchia [1969]. In order to interpret the variational inequalities C.44) as a boundary value problem, assume that the solution и is in the space H2(Q). Hence the function (u—%) is in the space 4>°{Q), since #еЯ2@) by the first assumption of C.42), and H2{Q)c(e0(Q) when n = 2. Let xe Q be such that u(x)>x(x). Then there exist <5>0 and r>0 such that Br(x)={yeU2;\\y-x\\^r}cQ, Ф)>Х(у) + & for all yeBr(x). Consequently, if q>e 3)(Q) is such that supp cpczBr(x), there exists eo(<p)>0 such that v£ — (u + вер) 6 U for all |е|^е0(<р). Using the functions vE, |e| ^e0{(p), in the variational inequalities C.44), we find that Vw • Vq> dx — f(p dx > ^0 for all |s| <£0{(p)- n Therefore, Vu • V<p dx — \ fcp dx = 0 for all <p e 2> (Q) such that supp q> с Br(x). From Green's formula B.5), we thus infer that — Au = f'm the open ball (ye R2; \\y — x\\ <r). Hence — Au — f in the open set Q+ = {xe Q; м(х)>/(х)}. Using the same Green formula and the relation — Au = f in Q+,v/e next find that
50 P.G. Ciarlet Chapter I the variational inequalities C.44) reduce to {-Au-f)(v-x)dx^0 for all «e f/, and thus — Au^f on the set {xeQ;u(x) = x(x)}- To sum up, we have formally solved the associated boundary value problem: — Au—f in Q+= {xeQ;u(x)>x{x)}, -Au^f mQ° = {xeQ;u{x) = x{x)}, C.45) и > x in Д и = 0 on Г. As was already observed (cf. Remark 1.1), this is a nonlinear problem, since the set U of C.41) is not a vector space. In the same vein, notice that the region where the membrane touches the obstacle, i.e., the set Q°, is not known in advance. We may also view the boundary value problem C.45) as &free boundary problem, where the free boundary Г* = Ш+r\bQ° is one of the unknowns. In such a formulation, it is customary to adjoin to C.45) two transmission conditions: ^frl^t , J on Г*, C.46) trav(uifi*) = tr6v(u|0o)j which simply reflect here the assumption that и e H2(Q). Observe, however, that these transmission conditions make sense only if Г* is smooth enough. Example 3.8. This example, which is closely related to Example 3.7, corresponds to the following data: V=H\Q\ !/={1)€Я'(й);^0 dy-a.e. on Г}, a{u, v) = r 100= fi < Y, 9;u6jP + bui?>dx, ;dx, C.47) where the functions b and / satisfy the same assumptions as in C.10). Arguing as before, we easily see that the corresponding variational problem amounts to formally solving the associated boundary value problem: — Au + bu = f in Q, u^O on Г, C.48) 3vu>0 on Г, u9vu=0 on Г. This problem falls in the class of Signorinfs problems, named after Signorini [1933], i.e., problems where some boundary conditions take the form of inequalities, such as
Section 3 Elliptic boundary value problems 51 those appearing in C.48). For extensive discussions of such problems, see Duvaut and Lions [1972b], Fichera [1972], Necas [1975], Glowinski, Lions and Tremolieres [1976a, 1976b], Necas and Hlavacek [1981], Glowinski [1984]. Example 3.9. Another problem modeled by variational inequalities is the elastic- plastic torsion problem, which arises in the following situation: Consider a cylindrical thin rod with a simply connected cross section ficl2, subjected to a torsion around the axis supporting the vector e3. The torsion angle т per unit length is assumed to be constant throughout the length of the rod (cf. Fig. 3.5, where the vertical scale should be considerably increased). ^=^ Fig. 3.5. The elastic-plastic torsion problem: A cylindrical rod is subjected to a constant torsion angle т per unit length along the vertical direction. Let us first assume that this problem is modeled by linearized elasticity. Then certain simplifying assumptions (the weight of the rod is neglected among other things) imply that the components <т1Ь <т12, o21 and <x33 of the stress tensor vanish everywhere in the rod, while the components al3 and а2Ъ, which are functions of x ± and x2 only, are such that ег13 = 2/га32м, cr23= — 2jUt6xM,
52 P.O. Ciarlet Chapter I where ц is the second Lame coefficient of the constitutive material of the rod, and the stress function и satisfies ~Au—\ in Q, u — Q on Г. Therefore the function и minimizes the functional J: v-*J{v)- Wvfdx- fvdx, with /=1, over the space Hq(Q). If the plasticity of the material is also taken into account, the stresses cannot take arbitrary large values. A particular mathematical model of this restriction is the von Mises criterion, which reduces in this case to the condition that the quantity (к1з12 + к2з12I/2> and consequently the norm ||Vu||, cannot exceed a certain constant. Notice, however, that by contrast with linearized elasticity, it is no longer straightforward to relate the displacement field to the stress field, as shown for instance by the discussion in Duvaut and Lions [1972, Chapter 5, Section 6]. This problem thus corresponds to the following data (for definiteness, the upper bound on ||Vm|| has been set equal to one): V=Hh@), n = 2, U^{veHi{Q); ||Vi?||<1 a.e. in Q), a(u,v) = Vu-Vvdx. C.49) /(i?)= fvdx, feL2(Q), n with / = 1 in this case. Since U is again a nonempty closed convex subset of the space V, the variational problem associated with the data C.49) has a unique solution u, which can be shown to be in the space W2'"(Q)r\Ho(Q) for 1 <p< go if feLp(Q) and the boundary Г is smooth enough; cf. Brezis and Stampacchia [1968]. Using Green's formula B.5), we then infer that we have formally solved the associated boundary value problem: -Au=f inG* = {xefi;||V«(x)||<l}, ||V«|| = 1 in £2-0*, C.50) и = 0 on Г. As in the case of the obstacle problem, we may view C.50) as a free boundary value problem and accordingly, adjoin to C.50) appropriate transmission conditions at the interface between the sets Q* and (Q — Q*). More details about the elastic- plastic torsion problem are found in Lanchon [1972], Glowinski and Lanchon [1973]. In Necas and Hlavacek [1981], more general elastic-plastic problems are also considered. Thorough treatments of the variational formulations of linear elliptic boundary
Section 4 Elliptic boundary value problems 53 value problems are given in Lions [1962], Agmon [1965], NeCas [1967], Lions and Magenes [1968], Vo-Khac Khoan [1972b]. Shorter accounts are given in Aubin [1972], Babuska and Aziz [1972], Oden and Reddy [1976a]. More specialized treatments, particularly for general nonlinear problems, are LadyZenskaja and Ural'CEva [1968], Lions [1969], Ekeland and Темам [1974]. For regularity results, see Grisvard [1976,1985,1987], Kondratev [1967], Blum and Rannacher [1980]. For more classically oriented treatments, see for example Bers, John and Schechter [1964], Courant and Hilbert [1953, 1962], Gilbarg and Trudinger [1983], Miranda [1970], Stakgold [1968]. As introductions to classical linearized elasticity, see Landau and Lifchitz [1967], Germain [1972], Fraeijs de Veubeke [1979], Valid [1977], Bamberger [1981], Germain [1986a, 1986b]. Mathematical treatments of problems of linearized elasticity, including their variational formulations, are given in Knops and Payne [1971], Gurtin [1972], Fichera [1972a, 1972b], Duvaut and Lions [1972], Villagio [1972], Oden and Reddy [1976b], Necas and HlavAcek [1981]. The derivation of linearized elasticity from "genuine", nonlinear, elasticity, is described in Ciarlet [1987, Section 6.2]. For detailed expositions of variational inequalities, and their applications to linearized elasticity, see in particular Duvaut and Lions [1972], Fichera [1972b], Glowinski, Lions and Tremolieres [1976], Baiocchi and Capelo [1978], Kinderlehrer and Stampacchia [1980], Necas and HlavAcek [1981], Glowinski [1984], Panagiotopoulos [1985], Rodrigues [1987]. 4. Examples of fourth-order boundary value problems: The biharmonic problem and the plate problems Whereas in the preceding examples the spaces V were subspaces of the Sobolev space H1(il), we consider in the last examples Sobolev spaces that involve second-order derivatives. To begin with, we consider the minimization problem that corresponds to the following data (Q is again a domain in W): V=V = H$(Q), a(u, v) = AuAvdx, D.1) /(e) = \fvdx, feL2(Q). Since the mapping v -»• \Av\0,n is a norm over the space Hl{Q), as we showed in B.8), the bilinear form is Ho(£?)-elliptic. Thus there exists a unique function ueHl(Q) that minimizes the functional J: v^J{v)=- U\Av\2dx- \fvdx D.2)
54 P.G. Ciarlet Chapter I over the space Hl(Q) or, equivalently, that satisfies the variational equations AuAvdx = fvdx for all veHl(Q). D.3) j a Using Green's formula B.7): AuAvdx = A2uvdx dvAuvdy + ]Audvvdy, we find that we have formally solved the following homogeneous Dirichlet problem for the biharmonic operator A2: A2u = f in Q, m = 8vu = 0 on Г. D.4) This problem is, in particular, a mathematical model for a specific class of problems in fluid mechanics: It can be shown (see e.g. Ciarlet [1978, p. 280ff.]) that the solution of the Stokes problem for an incompressible viscous fluid in a simply connected domain QcU2 may be reduced to the solution of D.4), where the unknown и is an appropriate stream function. Finally, we let, for n = 2, V=U=HUO), a(u,v)= | {AuAv + (l—v)Bd12ud12v — d11ud22v — d22ud11v)}dx D.5) {vAuAv + A — v)(Si 1ud11v + 822 ud2i v + 2d12ud12v)} dx, a l{v)=\fvdx, feL2(Q). These data correspond to the variational formulation of the clamped plate problem in linearized elasticity: The unknown и represents the vertical displacement of a plate of constant thickness e under the action of a transverse force, of density F = T^£e3//(l-cr2) per unit area. The constants E = /iCA + 2/i)/(A + fi) and v = i*/ (A + fi) are respectively the Young modulus and the Poisson coefficient of the elastic material constituting the plate, X and \i being the Lame coefficients of the same material. When /=0, the plate lies in the plane of coordinates (xl7x2) (cf. Fig. 4.1). The condition u = 8„u=0 contained in the definition of the space U\{Q) take into account the fact that the plate is clamped. It is worth noticing that the expressions found in the variational formulation D.5) of the clamped plate problem can be obtained by applying a "limit" analysis (when the thickness of the plate approaches zero) to the variational formulation of the boundary value problem of three-dimensional elasticity (cf. C.31) and C.32)).
Section 4 Elliptic boundary value problems 55 fc Fdx / / 1-i— 7/x г Fig. 4.1. The clamped plate problem: the unknown «:fJc=R2->R represents the vertical displacement of a clamped plate occupying the set Q in the absence of applied forces, and subjected to a vertical force of density F per unit area. This process results in a simpler problem, in that there are now only two independent variables. However, this advantage is compensated by the fact that second partial derivatives are now present in the bilinear form. This will yield a fourth- order partial differential equation; see D.8). The Poisson coefficient v satisfies the inequalities 0<v<2; hence the bilinear form is Ho(£2)-elliptic, since we have a(o,p) = vM»|§,0 + (l-v)|»|i,0 for all veH2(Q). Thus, there exists a unique function ueHl(Q) that minimizes the total energy of the plate: J(»)=i \Av\2+2(l-v)((dl2vJ-d11vd22v)}dx- fvdx, D.6) over the space #o(fi) or, equivalently, that solves the variational equations {jMZlt>-l-(l—v)B512w812i> —З^мб^и—322«9iitf)}dx= \fvdx n forall»eiJg(fl). Using Green's formulae B.7) and B.9): D.7) AuAvdx= A2uvdx- bvAuvdy + Audvvdy, {2312u812d — d11uc>22V — 322M611t;}dx = {— 8„u8Tt> + d„uQzv} dy,
56 P.G. Ciarlet Chapter I we find that we have again formally solved the homogeneous Dirichlet problem for the biharmonic operator A2: A2u = f in 0, w = c\,w=0 onf. D.8) Therefore, in spite of a different bilinear form, we eventually find the same problem as in the previous example. This is so because, in view of the second Green formula which we used, the contribution of the integral A — v){2812w812t; — 611u622f--S22iii;}dx я is zero when the functions v are in the space Щ0), and consequently in its closure Hl(Q). Thus, the partial differential equation is still A2u — fin Q. However different boundary conditions would result from another choice for the space V. To distinguish the two problems, we shall refer to a fourth-order problem corresponding to the functional of D.2) as a biharmonic problem, while we shall refer to a fourth-order problem corresponding to the functional of D.6) as a plate problem. Other plate problems are also of interest. For instance, if we let V={veH2(Q);v = dvv = 0 on Г0, v = Oonr1} D.9) where Г0 and Tj are two disjoint dy-measurable subsets of Г, with dy-meas Г0 >0, and the bilinear and linear form are as in D.5), we find that the plate is subjected to a boundary condition of simple support along f\, and to a boundary condition of free edge along Г — {Гои/^}. For details about such boundary conditions, which are quite commonly encountered in practical situations, see, e.g., Germain [1986b, p. 86], Ciarlet [1990]. Remark 4.1. The membrane problem, which we have already described (cf. Fig. 3.1), and the plate problems, which we just described, can be derived from the boundary value problem of three-dimensional linearized elasticity described in Section 3 through a systematic "limit" process, which has recently received considerable attention; see in particular Ciarlet and Destuynder [1979], Ciarlet and Kesavan [1980], Ciarlet and Rabier [1980], Destuynder [1981, 1986], Raoult [1985], and the book by Ciarlet [1990]. In particular, the problem is reduced by this process to a problem in two variables with only one unknown function (the "vertical" displacement). To sum up, we have examined in Sections 3 and 4 various minimization or variational problems whose associated bilinear form is F-elliptic on an appropriate Hilbert space V. For this reason these minimization or variational problems, as well as their associated boundary value problems, are called elliptic boundary value problems. In the same spirit such problems are said to be second-order problems, or fourth-order problems, when the associated partial differential equation is of order two or four, respectively. Finally, one should recall that even though the association between the two
Section 4 Elliptic boundary value problems 57 formulations may be formal, it is possible to prove, under appropriate smoothness assumptions on the data, that a solution of any of the variational problems considered here is also a solution in the classical sense of the associated boundary value problem. Remark 4.2. In this article, one could conceivably omit all reference to the associated boundary value problems, inasmuch as the finite element method is based only on the variational formulations. Note that, by contrast, finite difference methods are most often derived from the boundary value problems themselves.
Chapter II Introduction to the Finite Element Method Introduction This chapter is an introduction to the finite element method, which is a very powerful means of approximating the solutions of second-order or fourth-order problems posed in variational form over a space V. A well-known approach for approximating such problems is Galerkirfs method, which consists in defining similar problems, called discrete problems, over finite-dimensional subspaces Vh of the space V. Then a conforming finite element method, i.e., a finite element method in its simplest form, is a Galerkin method characterized by three basic aspects in the construction of the space Vh: First, a triangulation 3~h is established over the set Q, i.e., the set О is written as a finite union of polyhedra Te$~h. Secondly, the functions vhe Vh are piecewise polynomials, in the sense that for each Те ZTh, the spaces PT = {vh\T; vh e Vh} consist of polynomials. Thirdly, there should exist a basis in the space Vh whose functions have small supports. These three basic aspects are discussed in Section 5, where we also give simple criteria that insure the validity of inclusions such as VkcH1(Q), or VhcH2{Q) (Theorems 5.1 and 5.2). We also briefly indicate how these three basic aspects still pervade the more general finite element methods that will be subsequently described. In Sections 6-9 we describe various examples of finite elements, which are either n-simplices (simplicial finite elements) or n-rectangles (rectangular finite elements), in which either all degrees of freedom are point values (Lagrange finite elements), or some degrees of freedom are directional derivatives (Hermite finite elements). These finite elements yield either the inclusion Vh <=Hl(Q) (finite elements of class её°) or the inclusion Vh с H2(Q) (finite elements of class 4>x) when they are assembled in a finite element space Xh. In Sections 10-12j\\ve give general definitions pertaining to finite elements and finite element spaces, and we discuss their various properties. Of particular importance are the notion of an affine family (T, PT,LT) of finite elements (Section 11), where all the finite elements in the family can be obtained as images through affine mappings of a single reference finite element, and the notion of PT-interpolation operators; in particular, a basic relationship between these two notions is proved in Theorem 11.1. The Pr-interpolation operator and its global counterpart, the X^-interpolation operator, both play a fundamental role in the interpolation theory in Sobolev spaces that will be developed in the next chapter. We also show how to 59
60 P.G. Ciarlet Chapter II impose boundary conditions on functions in finite element spaces. In Section 12, we also briefly indicate why a particular finite element should be preferred to another one in practical computations. Finally, in Section 13, we define the convergence and the order of convergence for a family of discrete problems. In this respect, Ceds lemma (Theorem 13.1) is crucial: The error \\u—uh \\, i.e., the distance (measured in the norm of the space V) between the solution и of the original problem and the solution uh of the discrete problem, is, up to a constant independent of the space Vk, bounded above by the distance 1пСй6г„ II" — vh || between the function и and the subspace Vh. This is a particularly important result, since all subsequent convergence results will be essentially variations on this theme! The finite element method was first conceived in a paper by Courant [1943], but the importance of this contribution was ignored at that time. Then the engineers independently re-invented the method in the early fifties: The earliest references generally quoted in the engineering literature are those of Argyris [1954-1955], Turner, Clough, Martin and Topp [1956]. The name of the method was proposed by Clough [1960]. Historical accounts on the development of the method from the engineering point of view are given in Oden [1972], and Zienkiewicz [1973], and in the introductory article of J.T. Oden. Many books have been written on the manifold aspects of the numerical implementation of the finite element method. We mention in particular the successive editions of the celebrated book by Zienkiewicz [1971], the pioneering book of Oden [1972] (whose value was probably not fully recognized when it appeared), and the other successful books of Norrie and de Vries [1973], Gallagher [1975], Irons and Ahmad [1979], Zienkiewicz and Morgan [1983], Bathe [1982], Dhatt and Touzot [1984], Kardestuncer and Norrie [1987], Argyris and Mlejnek [1986-1988]. Nice blends of computational and mathematical aspects are found in the pioneering books of Strang and Fix [1973] and in the later books of Oden and Carey [1981-1984], Axelsson and Barker [1984], Wait and Mitchell [1985], and Hughes [1987]. Mathematical aspects are the main themes in Oden and Reddy [1976a], Raviart and Thomas [1983], Johnson [1987], and, at a more advanced level, in Babuska and Aziz [1972] and Ciarlet [1978]. Among the many more specialized books on this method, we mention in particular George [1986] for the fascinating topic of automatic generation of tri- angulations, Bernadou and Boisserie [1982] for shell problems, Thomasset [1981], Темам [1984] and Girault and Raviart [1986] for the Navier-Stokes equations, and Glowinski [1984] for nonlinear variational problems. 5. The three basic aspects of the finite element method Consider the linear abstract variational problem: Find ueV such that a(u,v) = l(v) for all ve V, E.1)
Section 5 Introduction to the finite element method 61 where the space V, the bilinear form a(v), and the linear form / are assumed to satisfy the assumptions of the Lax-Milgram lemma (Theorem 1.3). Then the Galerkin method for approximating the solution of such a problem consists in defining similar problems in finite-dimensional subspaces of the space V. More specifically, with any finite-dimensional subspace Vh of V, we associate the discrete problem: Find uh e Vh such that a{uh,vh) = l(vh) for all vheVh. E.2) Applying the Lax-Milgram lemma, we infer that such a problem has one and only one solution uh, which we shall call a discrete solution. If the bilinear form is symmetric, the discrete solution is also characterized by the property (Theorem 1.2): J(uh)= inf J(vh), E.3) where the functional J is given by J(v)=$a(v,v)—f(v). This alternate definition of the discrete solution is known as the Ritz method. Let us henceforth assume that the abstract variational problem E.1) corresponds to a second-order or to a fourth-order elliptic boundary value problem posed over an open subset Q of W, with a Lipschitz-continuous boundary Г. Typical examples of such problems have been studied in Sections 3 and 4. In order to apply the Galerkin, or the Ritz, method, we face, by definition, the problem of constructing finite-dimensional subspaces Vh of spaces V such as H})(Q),H1{Q),Hq(Q}, The finite element method in its simplest form к a specific process of constructing subspaces Vh, which are then called finite element spaces: This construction is characterized by three basic aspects, which for convenience shall be recorded as (FEM1), (FEM2) and (FEM3), respectively, and which shall be described in this section. (FEM1) The first basic aspect, and certainly the most characteristic, is that a triangulation 2Th is established over the set Q, i.e., the set U is subdivided into a finite number of subsets T, in such a way that the following properties are satisfied: BTh 1) For each Те 3~h, the set T is closed and its interior T is nonempty and connected. E~л2) For each Te^h, the boundary б Г is Lipschitz-continuous. (fk3) G=\)TefbT. (<rh4) For each distinct TuT2e&~h, one has f1nf2=0. Remark 5.1. A fifth condition (^,5), relating "adjacent" sets Г, will be later introduced. Once a triangulation 3Th is established over the set Q, one defines a finite element space Xh through a specific process, which will be illustrated by many examples in this chapter. All we need to know for the moment is that Xh is a finite-dimensional space of functions defined over the set Q. Hence we ignore at this stage instances of finite element spaces whose "functions" may have two definitions across "adjacent" sets T (see Section 12).
62 P.G. Ciarlet Chapter II Given a finite element space Xh, we define the finite-dimensional spaces PT = {vh\T;vheXh}, spanned by the restrictions vh\T of the functions vh e Xh to the sets Те 2Th. Since our aim is to approximate the solutions of problems posed in spaces such as H1(Q) or H2(Q), our first task consists in obtaining sufficient conditions that guarantee that the inclusion XhczH1(Q), or the inclusion XhcH2(Q), holds (converses of these results hold, as we shall show in Theorems 30.1 and 49.1). Remark 5.2. Here and subsequently, we shall systematically use the improper notation Hm(T) in lieu of Hm(f), for the sake of notational brevity. Theorem 5.1. Assume that the inclusions PTc:Hl(T)for all Те3~hand Xhc^iQ) hold. Then the inclusions XhczH\Q), X0h = {vHeXh; vh = 0 on Г}сHh(Q), hold. Proof. Let a function v e Xh be given. Since Xk <=<g0(Q), v is in the space L2(Q). Therefore, by definition of the space H1(Q), it remains to find functions vte L2(Q) such that v;4>ux-- vd^dx forall</>e^(fi). Q П For each i, a natural candidate is the function whose restriction to each finite element Г is the function 9,(i;|r). Since by assumption each set T has a Lipschitz- continuous boundary dT, we may apply Green's formula B.4): For each Te$~h, д:(ю\т)фйх-- v\Td;4>dx + i>|r<Krdv, where viS is the rth component of the unit outer normal vector along ЭГ. By summing over all finite elements, we obtain tv >dx = юд(фйх+ £ »\тфч1.тйу, Te^h J 8Г and the proof follows since the sum I TeSTh ь\тфуитйу vanishes: Either a portion of 9 T is a portion of the boundary Г of Q in which case ф = 0 along this portion, or the contribution of adjacent elements is zero.
Section 5 Introduction to the finite element method 63 The boundary Г being Lipschitz-continuous by assumption, the second inclusion follows from the characterization (cf. Section 2) Hh(O) = {veH1(Q);v = 0oar}. □ If a space Xh fulfills the assumptions of Theorem 5.1, we shall use the finite element space Vh — X0h if we are solving a second-order homogeneous Dirichlet problem, or the space Vh — Xh if we are solving a second-order homogeneous or non- homogeneous Neumann problem. The proof of the next theorem is similar to that of Theorem 5.1 and, for this reason, is omitted. Theorem 5.2. Assume that the inclusions Рт сH2(T) for all Ге 3~h, and Xhc'i^1 (Q), hold. Then the inclusions 1»сЯ2(Й), XOh = {vheXk;vh = 0 on Г}сH2(Q)nHl0(Q), Xooh = {vh eXh;vh = dvvh = 0 on Г} с Hl(Q), hold. Thus if we are to solve a simply supported plate problem, or a clamped plate problem, we shall use the finite element space Vh = X0h, or the finite element space Vk = X00h, respectively, as given in the previous theorem. Let us return to the description of the finite element method. (FEM2) The second basic aspect of the finite element method is that the spaces Рт, Те 3~h, contain polynomials, or, at least, contain functions that are "close to" polynomials. At this stage, we cannot be too specific about the underlying reasons for this aspect of the method but at least, we can say that (i) it is the key to all convergence results as we shall see and (ii) it yields simple computations of the coefficients of the resulting linear system E.4). Let us now briefly examine how the discrete problem E.2) is solved in practice. Let (wfc)f=i be a basis in the space Vh. Then the solution uh = Y$=llkwk of problem E.2) is such that the vector (d, £2,..., См) is solution of the linear system м £ a(wk,wJC = l(wm), l<ms£M, E.4) k = l whose matrix is always invertible, since the bilinear form, being assumed to be F-elliptic, is a fortiori Vh -elliptic. By reference to the data associated with the boundary value problem of linearized elasticity (Section 3), the matrix (a(wk, wm)) and the vector (l{wm)) are often called the stiffness matrix and the load vector, respectively. In the choice of the basis (wt)£i i, it is of paramount importance, from a numerical standpoint, that the resulting matrix possess as many zeros as possible,
64 P.G. Ciarlet Chapter II For all the examples that were considered in Section 3 the coefficients a(wk, wm) are integrals of a specific form: For instance, in the case of Example 3.1, one has a(wk,wm)= ( £ diwkdiwm + bwkwm)dx, n so that a coefficient a(wk, wm) vanishes whenever the dx-measure of the intersection of the supports of the basis functions wk and wm is zero. As a consequence, we next state: (FEM3) The third basic aspect of the finite element method is that there exists at least one "canonical" basis in the space Vh whose corresponding basis functions have supports that are as "small" as possible, it being implicitly understood that these basis functions can be easily described. Note that when the bilinear form is symmetric, the matrix {a(wk, wm)) is symmetric and positive-definite, a fact of considerable practical importance for the numerical solution of the linear system E.4). By contrast, this is not usually the case for standard finite difference methods, except for domains with special geometries, such as rectangular domains. Since another noticeable practical feature of the matrix (a(wk, wm)) is its sparsity, one could conceivably, assuming again the symmetry of the bilinear form, start out with any given basis, and, using the Gram-Schmidt orthonormalization procedure, construct a new basis (w? )£L x that is orthonormal with respect to the inner product a{ •, •). This is indeed an efficient way of getting a sparse matrix since the corresponding matrix a{wf,wf) is the identity matrix! However, this process is not recommended from a practical standpoint: For comparable computing times, it yields worse results than the solution by standard methods of the linear system corresponding to the "canonical" basis. It was mentioned at the beginning of this section that the three basic aspects were characteristic of the finite element method in its simplest form. Indeed, there are more general finite element methods: (i) One may start out with more general variational problems, such as variational inequalities (see Section 3) or different variational formulations such as mixed, or hybrid, formulations (see the article by Roberts and Thomas in this volume), or boundary formulations (see the article by Nedelec in a later volume), etc. (ii) The space Vh, in which one seeks the discrete solution, may no longer be a subspace of the space V. This may happen when the boundary of the set Q is curved, for instance. Then it cannot be exactly triangulated in general by standard finite elements and thus it is replaced by an approximate set Qh (see Section 38). This also happens when the functions in the space Vh lack the proper continuity across adjacent finite elements (see the "nonconforming" methods described in Section 30 and Section 49). (iii) Finally, the bilinear form and the linear form may be approximated. This is the case for instance when numerical integration is used for computing the coefficients of the linear system E.4) (see Sections 25 and 39).
Section 6 Introduction to the finite element method 65 Nevertheless, it is characteristic of all these more general finite element methods that the three basic aspects, possibly in some more elaborate form, constitute their underlying principles. In order to establish a distinction with these more general methods, we shall reserve the terminology conforming finite element methods for the finite element methods described at the beginning of this section, i.e., for which the space Vh is a subspace of the space V, and the bilinear form and the linear forms of the discrete problem are identical to the original ones. Let us therefore assume, as in the rest of this chapter, that we are using a conforming finite element method for solving a second-order or a fourth-order boundary value problem. We first summarize the various requirements that a finite element space Xh must satisfy, according to the considerations that we have made so far: (i) Such a space is associated with a triangulation 2Th of the set Q, i.e., Q = Ureirh T (FEM1), and for each Те $~h, the space PT = {vh\T;vheXh} E.5) should consist of functions that are polynomials or "nearly polynomials" (FEM2). (ii) By Theorems 5.1 and 5.2, inclusions such as Xh <= "W°(Q) or Xh <= Я?1 (Q) should hold, depending upon whether we are solving a second-order or a fourth-order problem. For the time being, we shall ignore boundary conditions, which we shall take into account in Section 12. (iii) Finally, we must check that there exists one canonical basis in the space Xk, whose functions have "small" supports, and which are easy to describe (FEM3). In this chapter we shall describe various finite elements that are all polyedra in IR", and which are for this reason sometimes called straight finite elements. By virtue of the relation & = {]тег„Т, we shall therefore restrict ourselves in this chapter to problems that are posed over a set Q that is itself a polyhedron, in which case we shall say that the set Q is polygonal. 6. Examples of simplicial finite elements and their associated finite element spaces We begin, in Sections 6-8, by examining examples of finite element spaces Xh that satisfy the inclusion Xk <= <£°(Q), and which are the most commonly used by engineers for solving second-order problems arising in linearized elasticity with conforming finite element methods. As shown in Sections 3 and 4 such problems are posed in open subsets of either IR2 or R3. Hence the value to be assigned in practice to the dimension и in the forthcoming examples is either 2 or 3. We equip the space W with its canonical basis (e;)"=i- For each integer fc^O, we denote by Pt the space of all polynomials of degree </c in the variables x1,x2,.-., xn, i.e., a polynomial pe Pk is of the form p:x = (x1,x2,...,x„)eW-+p(x) = £ Уг1«2-х„х?х?---х?г, for appropriate coefficients уЯ1а2...Хп, or equivalently, if the multi-index notation is
66 P.G. Ciarlet Chapter II used, p;xeU"->p(x) = £ yax*. The dimension of the space Pk is given by к dim Pk - F.1) If Ф is a space of functions defined over W, and if A is any subset of U", we denote by Ф (A) the space formed by the restrictions to the set A of the functions in the space Ф. Thus, for instance, we let Pk(A) = {p\A;pePk}. F.2) Notice that the dimension of the space Pk(A) is the same as that of the space Pk = Pjt(R") if the interior of the set A is nonempty. An n-simplex in 0?" is the convex hull Г of (n +1) points a, = (ау)"= i 6 W, which are called the vertices of the n-simplex, and which, by definition, are such that the matrix flll fl12 '" al,n+l \ д., 1 a^-i '°° a~) „ i A = a, \1 nl "n2 1 a, F.3) n,n + 1 1 / is invertible (equivalently, the {n+1) points a,- are not contained in a hyperplane). Thus, one has F.4) T=lx= Y, V,;0<A^ 1,1 <./<»+1, Z ^ = 1 t j'=i j=i Notice that a 2-simplex is a triangle and that a 3-simplex is a tetrahedron. Remark 6.1. If the matrix A of F.3) is singular, the set T defined by F.4) is still the convex hull of the points ap 1 ^;'<n+1, but it is not necessarily an m-simplex for some m < n. For any integer m with 0 ^ m ^ n, an m-face of the n-simplex T is any m-simplex whose (m+1) vertices are also vertices of Г. In particular, any (n — l)-face is simply called a face, any 1-face is called an edge, or a side. The barycentric coordinates А. = ЛДх), 1<;"<п+1, of any point x = (x;)"= x e R" with respect to the (n+1) vertices o7- are the (unique) solutions of the linear system n+1 n+1 Z аиЛ=х.' ^'^ Z^=1> j=i ;=i F.5) whose matrix is precisely the matrix A of F.3). By inspecting the linear system F.5),
Section 6 Introduction to the finite element method 67 one sees that the bar у centric coordinates of a point xeU" are affine functions of the coordinates xl,x2,...,x„ of x (i.e., they belong to the space Pj): n A<= EMj + ^i. U»<n + 1, F.6) j=i where the matrix B=(b,7) is the inverse of the matrix A. Remark 6.2. In the engineering literature, the barycentric coordinates are often called area coordinates if n = 2, or volume coordinates if и = 3. The barycenter, or center of gravity, of an n-simplex T is the point of T whose all barycentric coordinates are equal (to l/(n +1)). To describe our first finite element, we need to prove that a polynomial p: х-^Е^-лУаХ0 of degree 1 is uniquely determined by its values at the (n +1) vertices aj of any n-simplex in №. To see this, it suffices to show that the linear system £ ya(aif = ni, К^и+1, 1*1 <i has one and only one solution (j„, |<x| < 1) for all right-hand sides pp 1 <j'^n+1. Since dimPx = card I (J {a,-} 1 =и + 1, the matrix of this linear system is square, and therefore it suffices to prove either uniqueness or existence. In this case, the existence is clear: The barycentric coordinates verify Аг(а^) = 5у, l^i,j^n + l, and thus the polynomial ; = i has the desired interpolation property. As a consequence, we have proved the identity P= 1р(«Д' for all pe Px. F.7) i=l Although we shall not repeat this kind of argument in the sequel, it will be often implicitly used. Having thus completely determined a polynomial pe Pt by its values р(а,), К i< n +1, we can now define the simplest "paradigm of all finite elements", which we shall call linear n-simplex: The set T is an n-simplex with vertices at, 1 < i'^n +1, the space PT is the space P^T), and the degrees of freedom of the finite element, i.e., those parameters that uniquely define a function in the space Pr, consist of the values at the vertices. Denoting by ST the corresponding set of degrees of freedom, we shall write symbolically ZT = {p(ai):l^i^n + l}.
68 P.O. Ciarlet Chapter II linear triangle, or CouranVs triangle, dimPr = 3 linear n-simplex PT=Pl(T), dimPT=(n + l) ST = {p{aiy.l^i^n + l} Fig. 6.1 In Fig. 6.1, we have recorded the main characteristics of this finite element for arbitrary n, along with the figures in the special cases n = 2 and 3. Note that the linear triangle (и = 2) is also known as Courant's triangle, after Courant [1943]. Let us call ay = i(a,- + a,-), l<i</<n + l, the midpoints of the edges of the n- simplex Г. Observing that 1к{аи) = %Fki + 5kj), 1 < i < ; < n + 1,1 ^ ^ ^ " + 1» and that dimP2(r) = card((j{e,})u( \J {atJ} we obtain the identity (here and subsequently, indices i, j,k,..., are always assumed to take all possible values in the set {1,2,..., n) whenever this fact is not specified): р = £да*-Мвг)+14МР(«ц) forallpeP2. F.8) This identity allows us to define a finite element, called the quadratic n-simplex: the space PT is P2(T), and the set ST consists of the values at the vertices and at the midpoints of the edges (Fig. 6.2). Let auj=iBaf + a,) for i Ф], and am = $[at + a}+ak) for i <j < k. From the identity />=£HCA;- 1)CA( -2)р(ад+ E !WA,- l)p(e<u) + I 27Я1-ЯД,р(аик) i 1Фз i<j<k forallpeP3, F-9) we deduce the definition of the cubic n-simplex (Fig. 6.3). linear tetrahedron dimP7.=4
Section 6 Introduction to the finite element method 69 quadratic triangle dimPT=6 quadratic tetrahedron dimPT = 10 quadratic n-simplex Pj. = p2(D, dimPT=|(n + l)(n + 2) ZV = {p(a,):l<Kn+l;p(ey):l«i<./«n + l} Fig. 6.2. зэ a331 з113 a. cubic triangle dimPT = 10 cubic tetrahedron dimPT = 20 cubic n-simplex Pr=P3(T), dimPr=£(n+l)(n+2)(n + 3) 2'7. = {р(о,-):1<1<и+1;р(ощ):1<и<п + 1,Мл р(а,д): l<!<;'<fe<n+l} Fig. 6.3.
70 P.G. Ciarlet Chapter II One may define analogous finite elements with polynomials of higher degree, but they are not often used. In this respect, we leave to the reader the proof of the following theorem (for a proof, see e.g. Nicolaides [1972]), from which the definition of the n-simplex of degree к can be derived for any integer k^l. Theorem 6.1. Let T be an n-simplex with vertices aj, 1 ^_/<n+1. Then for a given integer fe> 1, any polynomial pePk is uniquely determined by its values on the set f n+1 "+1 { 1 k— 1 ") 1 Lk{T) = \x = £ Xja/, £ A, = l,A7ejO,-,...,—,lkl<Kn + a F.10) Remark 6.3. The set Lk(T) is called the principal lattice of order к of the n-simplex T. Let us now examine how the degrees of freedom p(aijk) can be eliminated in the definition of the cubic n-simplex. This elimination yields a new finite element, which is often preferred by engineers in actual computations, and whose definition is based on the following result: Theorem 6.2. For each triple (i,j, k) with i<j<k, let <Mp) = 12p(ay*) + 2 X Р(аг)-3 £ р{ацт). F.11) l = i,j,k l,m = itj,k 1Фт Then any polynomial in the space P'3 = {peP3;<t>ijk(P) = 0,l^i<j<k^n + l} F.12) is uniquely determined by its values at the vertices ab 1 ^i^n+1, and at the points aujb l^i,j^n+ l,i¥=j- In addition, the inclusion Рг^Р'ъ F.13) holds. Proof. The ("t1) degrees of freedom фт are linearly independent (since фцк{р) = I2p(aijk)+ •••) and thus, the dimension of the space P'3 is dimF3=dimP3-r^J = (n+lJ, i.e., precisely the number of degrees of freedom. Using identity F.9), we thus obtain the identity P = I (Wa, - 1)CA, - 2) - # £ W* W + E(fWA(-l) + ¥ £ X,kjh)pfoy) forallpeP's, F.14) and the first part of the theorem is proved.
Section 6 Introduction to the finite element method 71 To prove that the inclusion F.13) holds, let p be a polynomial of degree ^2 and let Л£^2(К"; Щ be its second derivative (which is constant). From the expansions p(al) = piaijk) + Dp(aijk)(ai -aijk) + jA(a, -aiJkJ, lei, valid for any triple I = {i,j,k] with i<j<k, we deduce £ Piai) = Мацк) + 2 £ A(ai~aijkJ, lei Ш since П;е/(аг— aijk) = 0. Likewise, from the expansions P(fliim)=Piflijk) + Dp(aiJk)(allm - aijk) + jA(allm - aijkf, l,meJ, 1фт, we deduce £ Р(ацт) = 6р{ацк)+ 2 £ A(allm-aijkJ, l,mel l,mel taking into account that а.'д = 2 (««у + fl*«) = г(%* + fli.k) = 2 iflm + %;)• Because Л is a bilinear mapping, and because aum ~ aijk = iB(a, - aiJk)+(am - ayt)), we can write E -4(a( - aijkf -§E л(а"т - aijkf = - f Л( £ (а, - агд)) = О, ie/ i.me/ Vis/ / 1фт and the proof is complete. □ From Theorem 6.1, we deduce the definition of the reduced cubic n-simplex (Fig. 6.4). We next describe the construction of the finite element spaces associated with anyone of the finite elements that have been described so far. Since an n-simplex has a nonempty interior and a Lipschitz-continuous boundary, conditions (^~hl) and E^,2) of Section 5 are automatically satisfied. In order that a triangulation 3~h made up of n-simplices Те 3~h also satisfy conditions (^*3) and (^~j,4), we shall assemble them in such a way that Q= (JTe^hT and that distinct n-simplices have piecewise disjoint interiors. Then the first basic aspect (FEM1) of Section 5 is satisfied. In view of satisfying inclusions such as It c1°(fi) and 1лс^'(й), we shall however impose a fifth condition on a triangulation made up of n-simplices, viz. (cf. Theorems 5.1 and 5.2, and Theorem 6.3 below) (•^*5) Any face of any n-simplex Тг in the triangulation is either a subset of the boundary Г, or a face of another n-simplex T2 in the triangulation.
72 P.O. Ciarlet Chapter II reduced cubic reduced cubic triangle tetrahedron dimPT = 9 dimPr = 16 reduced cubic n-simplex PT = P'3(T) (cf. F.12)), dimPT =(n+1J Хт = {р(а,): 1 «i«n +1;р[аи1): Uy^n+1,!#j) Fig. 6.4. In the second case, the n-simplices Т\ and T2 are said to be adjacent. An example of a triangulation for n = 2 is given in Fig. 6.5 while Fig. 6.6 shows an example of a triangulation that violates condition E"fc5), since the intersection of 7\ and T2 is not an edge of T2. We then associate with any triangulation S~h that satisfies conditions (&~hl) to (^"л5) and with each type of finite element, a finite element space Xh, whose functions vh:Q-*U are constructed as follows: With linear n-simplices, a function vh e Xh is (i) such that each restriction vh\T is in the space PT =Pi(T) for each Те ^ and (ii) completely determined by its values at all the vertices of the triangulation. Likewise, with quadratic n-simplices, a function of Xh is (i) in the space PT = P2(T) for each Те 3Th and (ii) completely determined by its values at all the vertices and all the mid-points of the edges of the triangulation. Similar constructions hold for cubic, or reduced cubic, n-simplices. In all cases, a function vh in the space Xh is thus determined by degrees of freedom b-+vh(b), which make up a set of the form Zh = {vh(b):beNh}, F.15) where Nh is a finite subset of Q. The set Zh is the set of degrees of freedom of the finite element space Xh.
Section 6 Introduction to the finite element method 73 Fig. 6.5. An example of a triangulation of a polygonal set in R2. Fig. 6.6. An example of a situation that violates condition (^,5). One should observe that if there is no ambiguity in the definition of the degrees of freedom across adjacent finite elements, it is precisely because we have required that triangulations satisfy condition {3~hb\ This requirement also plays a crucial role in the proof of the following result. Theorem 6.3. Let Xh be a finite element space associated with n-simplices of degree к for some integer к ^ 1 or with reduced cubic n-simplices. Then the inclusion Xhcz^°(Q)nHx{Q) holds. Proof. We shall give the proof in the case n = 2 and for quadratic triangles; the other cases are similarly treated. Given a function vh in the space Xh, consider the
74 P.O. Ciarlet Chapter II two functions vh\Tl and vh\Tl along the common side T = [Ь,-,Ь,] of two adjacent triangles Тг and T2 (Fig. 6.7). Let t denote an abscissa along the axis containing the segment T. Considered as functions of t, the two functions vh\Tl and i;fc|T2 are quadratic polynomials along T", whose values coincide at the three points bhbj,bij =%(bi + bj). Therefore these polynomials are identical, and the inclusion Fig. 6.7. Quadratic triangles assembled in a triangulation. Xhczcg0(Q) holds. Hence the inclusion XhcH1(Q) is a consequence of Theorem 5.1. □ Since requirement (FEM2) is clearly satisfied (in each case, the spaces Рт, Те 2Th, consist of polynomials), it remains to verify requirement (FEM3), i.e., that each one of these finite element spaces Xh possesses a "canonical" basis whose functions have small supports. In each case, the set Zh of degrees of freedom of the spaces is of the form (cf. F.15)): Ih = {v(bk):l^k^M]. F.16) If we define functions wk, 1 ^ к ^ М, by the conditions wkeXh, wk(bt) = dk„ l^k,l^M, F.17) it is seen that (i) such functions form a basis of the space Xh and that (ii) they have "small" supports. In Fig. 6.8, we have represented the three types of supports that are encountered when cubic triangles are employed, for instance.
Section 7 Supp(w;) Supptw;) Supp (wk) Fig. 6.8. Supports of the basis functions associated with cubic triangles. 7. Examples of rectangular finite elements and their associated finite element spaces Before we turn to a second category of finite elements, we need a few definitions. For each integer к > 0, we denote by Qk the space of all polynomials that are of degree <fc with respect to each one of the n variables x i, X2,. •., xn, i.e., a polynomial p e Qk is of the form р:х=(хьх2,...,х„)£К"->-р(л;)= £ yXia2-inxVx¥---Xn", for appropriate coefficients уагаг...ап. The dimension of the space Qk is given by dimQt = (fc+l)", G.1) and the inclusions Pk^&cPa G.2) hold. Notice that the dimension of the space Qk(A) is the same as that of the space Q,k = Qk(№") as long as the interior of the set А с R" is not empty. Theorem 7.1. A polynomial peQk is uniquely determined by its values on the set Mk = \x = r^,lj,.../AeUn;ije{0,l,...,k},l<j^n\. G.3)
76 P.G. Ciarlet Chapter II Proof. It suffices to use the identity - - i - ;'—/' I \ к к к Oiij^k 7=1 \ i} = 0 h h I \ "- K K p= I П П Щ^Ы'Ьт^т) {1A) oaij^k 7=1 \ i}=o 1 «7<n \ij*h forallpegit. П Remark 7.1. To prove Theorem 7.1, one could also show that if a polynomial of Qk vanishes on the set Mk, it is identically zero. In W, an n-rectangle, or simply a rectangle if n = 2, is a set of the form в Т=П iahbi] = {x = (xl,X2,...,xn); a,-<Xi<bj, l<i<n}, G-5) !=1 with — oo < a; < fc,- < + oo for each г, i.e., it is a product of compact intervals with nonempty interiors; in particular, the unit hypercube [0,1]" is an n-rectangle. A face of an n-rectangle T is any one of the sets n л Ы x П [««. M or {bj} x П [a«,b,], 1 </■<«, ;=i t=i while an edge of T, also called a side, is any one of the sets n i=l ■#7 with Cj = a( or bi; 1 < i< n, г =£/', 1 <j^ n. A uertex of T is any point x=(xt, x2, ■ ■ ■, x„) of T with X; = a; or bh 1 ^ i ^ n. Given an n-rectangle Г, there exists a (nonuniquely determined) diagonal affine mapping, i.e., of the form FT(x) = BTx + bT, where BT is an n x n diagonal matrix and for is a vector in R", such that T=FT(w,m Observing that the set Mk of G.3) is a subset of the unit hypercube [0,1]", we then infer from Theorem 7.1 that a polonomial pe Qk is also uniquely determined by its values on the subset Mk(T) = FT(Mk) G.6) of the n-rectangle T; from this observation, we deduce the definition of finite elements called n-rectangles of degree k. Just as in the case of n-simplices, the values k= 1,2 or 3 are the most commonly encountered. In Figs. 7.1, 7.2 and 7.3, the corresponding elements are represented for n = 2 and 3, their most common names are indicated, and the numbering of the nodes, i.e., of the points occurring in the sets of degrees of freedom, is indicated for
Section 7 Introduction to the finite element method 77 ut т a* *a / bilinear, or A-node, rectangle dimPT=4 3-rectangle of degree 2 dimPT = 8 n-rectangle of degree 2 PT = Ql(T), dimFr = 2B 'Sr^ipiay.asMdT)}^. G.6)) Fig. 7.1. . a7 a T—т—T <>- -f-? 41 О • 1 a,,. e, biquadratic, or 9-node, rectangle dimPr = 9 ^^ i-rectangle of degree 3 dimPr=27 n-rectangle of degree 3 Pt = Qi(T), dimPr = 3" IT = {p(a):asM2(T)} (cf. G.6)) Fig. 7.2.
78 P.G. Ciarlet Chapter II bicubic rectangle 3-rectangle dim P T = 16 of degree A dim PT = 64 n-rectangle of degree 4 Pt = Q3(T), dimP = 4" Г7. = {р(Я):а£М3(Г)}(сГ.G.6)) Fig. 7.3. и = 2, according to the following rule: Assuming, without loss of generality, that the set Г is the unit square [0, l]2, we number four points consecutively if they are the vertices of a square centered at the point (£, j). This rule allows for particularly simple expressions of the four corresponding functions pt appearing in identities of the form Р = 1>(я.')Р.- for all peQ», i which are special cases (for k= 1,2, 3 and n-2) of the identity G.4): we first notice that the coordinates of a given point with respect to the four vertices at, 10'^4, of the unit square are (x1,x2), (x2,l-Xl), (l-x1;l-x2), {l-x2,x1), respectively. Then, if we introduce the variables x3 = l—xb x4=l— x2, G.7) it can be easily checked that the four functions p, corresponding to such four consecutively numbered points at are obtained through circular permutations of the variables x1,x2,x3,x4 (such permutations correspond to rotations of + jn around the point (j, i)). Let us illustrate this fact by means of examples:
Section 7 79 If we consider the biline^. L0,1]2), we have the identity 4 V= Z P(ai)Pi for all peSi, with p1=(l-x1)(l-x2), p2 = x1{l-x2), p3 = x1x2, P4 = (l-x1)x2. We may thus condense these expressions as Pi=x3x4, ... G.8) Likewise, if we consider the biquadratic unit square, we have the identity 9 Р=£рО;)Р; for all ре<?2, with Pi = x3Bx3 - l)x4Bx4 -1), p5 = -4x3(x3-l)x4Bx4-l), ... G.9) Pg == 10X5X2X3X4., using the above rule. Finally, if we consider the bicubic unit square, we have Pi = ЬсзCхз - l)Cx3 - 2)x4Cx4 - l)Cx4 - 2), ... Р5=-4Хз(Зх3-1)(хз-1)х4(Зх4-1)(Зх4-2), ••■ ,7щ p9 = |x3Cx3 - 2)(x3 - l)x4Cx4 - l)Cx4 - 2), P13 = ¥^зCх3 - l)(x3 - l)x4Cx4-l)(x4 -1), Remark 7.2. The inconsistency for the notations ah 5^i^9, between the biquadratic and bicubic rectangles avoids the introduction of a new letter. We now define finite elements similar to the biquadratic or bicubic rectangles, but in which the values at the interior nodes are no longer degrees of freedom (for simplicity, we restrict ourselves to the^ case и = 2). The existence of these finite elements is a consequence of the following two theorems. Theorem 7.2. Let the points ah l^i^9,beas in Fig. 7.2. Then any polynomial in the space Q'2 = \peQ2; 4p(a9)+ £ p(a;)-2 £ p(a;) = ol G.11) (. 1=1 1=5 J
80 P.G. Ciarlet Chapter II is uniquely determined by its value at the points ah 1 < i ^ 8. In addition, the inclusion Pz^Q'i G.12) holds. Proof. The first part of the proof is similar to the first part of the proof of Theorem 6.2. In particular we have the identity 8 P = E P(a'}P' for allpeQ'2, i = l with Pi=x3x4Bx3+ 2x4-3), ... р5 = -4хзХ4(хз-1), To prove the inclusion G.12), let p be a polynomial of degree 2, and let A denote its (constant) second derivative. From the expansions P(ad=P(a9) + Dp{a9)(ai—a9)Jr\A{al—a9I, KK8, we deduce 4 4 E p(a;)=4p(a9) + | X A(ai-agJ, i=l 1=1 8 8 E p(fli) = 4p(a9)+ i £ A(at-a9J, ;=5 i = 5 since 4 8 £ (fl»-fl9)= E (a~a9) = 0. i=l i=5 Because the mapping A is bilinear, and because a5 = г(а1 + Яг)> • • • > we obtain 8 4 E Л(а;-а9J=4 E Mfl{-a9f. i = 5 J=l Combining the previous relations, we deduce that 4 8 4p(a9)+ E Р^гЬ2 E P(«.') = 0, 1=1 i=5 and the proof is complete. □ Theorem 7.3. Let the points ah 1 ^ i < 16, be as in Figf. 7.3. Define the space Q'3 = {peQ3; MP) = 0,1^<4}, G.14)
Section 7 where Introduction to the finite element method 81 ф1(р)=9р{а13) + 4р{а1)+2р(а2) + р{а3)+2р{а4) ~6p(as)-3p(a6)-3p(a11)-6p{ai2), G.15) and ф2{р), Фг(р), and ФЛр) are derived by circular permutations in the sets U(a;}, (JW. U W, 0D 1=1 i = 5 i = 9 «=13 Then any polynomial in the space Q'3 is uniquely determined by its values at the points ah 1 ^z'^ 12. In addition, the inclusion Ps^Q's G.16) holds. Proof. The proof is similar to that of Theorem 7.2 and for this reason, is omitted. We shall only record here the identity P = Z P(ai)Pi fOT a11 PE б'з> with p1=x3x4(l + ^x3(x3-l) + ^xi(x4.-l)), Ps=-jx3(x3-l){3x3-l)x^, ••• р9=|хз(х3-1)Cх3-2)х4, G.17) From these two theorems we derive the definition of the reduced, or S-node, biquadratic rectangle (Fig. 7.4) and of the reduced, or Yl-node, bicubic rectangle (Fig. 7.5); they are also called the biquadratic, and bicubic, rectangles of the Serendipity family, as a reminder of the ingenuity that their discovery indeed required! Other a8(» e,,L a? -•- A3 i>a6 reduced, or 8-node, biquadratic rectangle PT = Q'2(T)(ci.G.U)), dimPT-- Fig. 7.4.
82 P.G, Ciarlet Chapter II e," a4 аЧ> a,,«» Эц a? I <» a9 reduced, or Yl-node, bicubic rectangle PT = Q'3(T)(d.(l.U)), Гг={р(аг): 1$К12} dimPT=12 Fig. 7.5. examples of Serendipity finite elements are found in Zienkiewicz [1971, pp. 108, 121, 126], in particular for n = 3. Their basis functions (and those of other rectangular finite elements) are found in El-Zafrany and Cookson [1986]. See also ZlAmal [1973c] for an interesting approach to such finite elements. If it happens that the set Qc W is rectangular, i.e., it is either an n-rectangle or a finite union of n-rectangles, it can be conveniently "triangulated" by finite elements which are themselves n-rectangles: The fifth condition ($~h5) imposed on such a triangulation now reads: (#"й5) Any face of any n-rectangle Ty in the triangulation is either a subset of the boundary Г, or a face of another n-rectangle T2 in the triangulation. In the second case, the n-rectangles 7\ and T2 are said to be adjacent. An example of a triangulation made up of rectangles is given in Fig. 7.6. With such a triangulation, we may associate in a natural way a finite element space Xh with each type of the rectangular finite elements that we just described. Since the discussion is almost identical to the one concerning n-simplices, we shall be very brief. In particular, one can prove the following analog of Theorem 6.3. Theorem 7.4. Let Xh be a finite element space associated with n-rectangles of degree к for some integer k^l,or with reduced biquadratic, or reduced bicubic, rectangles. Then the inclusion XiC^fljnH'P G.18) holds.
Section 8 Introduction to the finite element method 83 Fig. 7.6. A triangulation made up with rectangles. Finally, as in the case of simplicial finite elements (Section 6), it is easily seen that such finite element spaces possess bases whose functions have "small" supports. 8. Examples of finite elements with derivatives as degrees of freedom and their associated finite element spaces So far, the degrees of freedom of each finite element have been "point values", i.e., of the form p{a), for some points aeT. We now introduce finite elements in which some degrees of freedom are partial derivatives, or, more generally directional derivatives, i.e., expressions such as Dp{a)b, D2p(a)(b, c),..., where b, с are vectors in R". The first example of this type of finite element is based on the following theorem. Theorem 8.1. Let T be an n-simplex with vertices at, l^i^n+l,andletaiJk=^(ai + aj + ak\ Kkj'<fc<n+1. Then any polynomial in the space P3 is uniquely determined by its values at the vertices ab 1 ^ i ^ n +1, by the values of its n first partial derivatives at the vertices ah l^i<n+l, and by its values at the points aiik, l^i<j<k^n+l. Proof. The argument relies as usual on an identity: P=T,(-2X.! + 3X?-7b £ XjXk\p(ai) + 27 £ Mjbpiatjb) 1 \ }<k ] i<j<k + £ Я,Я,BЯ, + Я>- 1)Ор(а,)(ъ-а1) for all peP3. (8.1) To prove this identity, one needs in particular an expression for the derivatives of the barycentric coordinates, in order to show that Dp(at) = Dpia,), l<z'<n + l, where p denotes the right-hand side of (8.1). By differentiating the polynomial p, we
84 P.G. Ciarlet Chapter II obtain Hence, to show that the above expression is equal to Dp(at), it is equivalent to show that Dp{ai){ak—ai) = Dp{ai){ak — ai), l^k^n + l, кфй These last relations are in turn consequences of the relations Dlj(ak~ai) = 5jk—A/(a;), l^k^n+l, кф1, which clearly hold, since each function A,- is affine. П From this theorem, we deduce the definition of a finite element, which is called the cubic Hermite n-simplex (Fig. 8.1), where the directional derivatives Ор(щ)(ал—а{) are degrees of freedom. Of course, the knowledge of these n directional derivatives at a vertex at is equivalent to the knowledge of the first derivative Dp(at). Such a knowledge is indicated graphically by one small circle, or sphere, centered at the point at. Since the first derivative Dp(at) is equally well determind by the partial derivatives Qjp(at), 1 ^j ^ n, another possible set of degrees of freedom for this element is the set FT indicated in Fig. 8.1. cubic cubic Hermite triangle Hermite tetrahedron dim?r=10 dim?T = 20 PT-- IT- rT- =ЫТ) = {p(a,Y- Dpia, = {p(e.)= djp(a cubic Hermite n-simplex dimPr = £(n + l)(n + 2)(n + 3) lsSisSn+1; p(aijk): Ki<j<k^n Щ-а,): l^Uj^n + lJ^i} ): Kisjn + l, K;'<n} + 1; + 1; Fig. 8.1
Section 8 Introduction to the finite element method 85 By the same method that led us to the reduced cubic п-simplex, we can eliminate the degrees of freedom p(aiJk), i<j<k. This "reduction" relies on the following theorem, whose proof is similar to that of Theorem 6.2: Theorem 8.2. For each triple (i,j, k) with i<j<k, let ^;д(Р) = 6Р(Яи*)-2 £ p(al)+ £ Dpia^a, - aijk). l = i,j,k l = i,j,k Then any polynomial in the space P'i = {peP3; фик(р) = 0, Ui</<k<n + l} (8.2) (8.3) is uniquely determined by its values and the values of its n first partial derivatives at the vertices ah l<i<n + l. In addition, the inclusion holds. From this theorem, we deduce the definition of the reduced cubic Hermite n-simplex which, for и = 2, is also called the Zienkiewicz triangle (Fig. 8.2), after Bazeley, Cheung, Irons and Zienkiewicz [1965]. Given a triangulation &~h made up of n-simplices, we associate in a natural way a finite element space Xh with either type of finite elements. For instance assume we Zienkiewicz triangle, or reduced cubic Hermite triangle dimPT = 9 reduced cubic Hermite tetrahedron dimPT=16 PT= ZT = I'T-- reduced cubic Hermite = P(T)(cf. (8.3)), dimPr = ( •{рЫ- = {Pfa): l^i^n+l;Dp(a,)(aj- l«i«n + l; dtp(aty.l^ n-simplex n+lf e,):l<i Kn+1, j<n+l, l^j<n} ■*}}, Fig. 8.2
86 P.G. Ciarlet Chapter II are using cubic Hermite n-simplices, the case of reduced cubic Hermite n-simplices being similar. Then a function vh is in the space Xh if (i) each restriction vh\T is in the space PT = P3(T) for each Те &~h, and (ii) it is defined by its values at all the vertices of the triangulation, by its values at the centers of gravity of all triangles found as 2-faces of the n-simplices Те 3~п, and by the values of its n first partial derivatives at all the vertices of the triangulation. The corresponding set of degrees of freedom of the space Xh is thus of the form Sh ={vh(b):beNvuNc, djvh(b), beNy, l<j<n}, (8.4) where Ny denotes the set of all the vertices of the n-simplices of the triangulation and Nc denotes the set of all centers of gravity of all 2-faces of the n-simplices found in the triangulation. Note that requirement ($~h5) (cf. Section 6) again insures that the degrees of freedom are unambiguously defined across adjacent finite elements; this requirement is also the basis for the following theorem. Theorem 8.3. Let Xh be the finite element space associated with cubic, or reduced cubic, Hermite n-simplices. Then the inclusion Xhc<g°(u)r\H\0) (8.5) holds. Proof. As in Theorem 6.3, it suffices to prove the inclusion Xhcz<g°(Q). Assume n = 2, then along any side common to two adjacent triangles, there is a unique polynomial of degree 3 in one variable that takes on prescribed values and prescribed first derivatives at the end points of the side. This argument easily extends to the n-dimensional case. □ To verify that the third basic aspect (FEM 3) (cf. Section 5) is satisfied, let us assume, to fix ideas, that we are considering cubic Hermite triangles, so that the associated set of degrees of freedom of the space is of the form Zh={v(bk), dlV(h), 82t;(bt): 1^/c^J; фк): J+l^k^L}. (8.6) Then if we define functions wk, wk,wke Xh by the conditions w*(bi) = <5iu, l^kJ^L, d1wk(b,) = 82wk(bl) = 0, l^k^L, l^l^J, wk1(fei) = 0, l^k^J, 1</<L, diwl(b,) = 5kh d2w1k(bl) = 0, l^kJ^J, wfcbl) = 0, l<k<J, 1^/<L, ЗХ(Ь,)=0, d2wi(bl)=ekl, l^k,KJ, it is easily seen that these functions form a basis of the space Xh and that they have "small" supports.
Section 9 Introduction to the finite element method 87 9. Examples of finite elements for fourth-order problems and their associated finite element spaces Finally, we examine some examples of finite elements whose associated finite element spaces satisfy the inclusion Xh c^iQ), and which may therefore be used for solving fourth-order problems if the inclusions PT<=H2(T), Те STh, also hold; cf. Theorem 5.2. It is legitimate to restrict ourselves to the case where n = 2, in view of the examples given in Section 4. Our first example is based on the following result. Theorem 9.1. Let T be a triangle with vertices a;,l^i<3, and let ау=|(а; + а^), l<i</<3, denote the midpoints of the sides. Then any polynomial p of degree 5 is uniquely determined by the following set of 21 degrees of freedom: Гг = {6*р(а,.):|а|^2,1<^3; 8vpfflu): l<i<./<3}, (9.1) where 9„ denote the normal derivative operator along the boundary of T. Proof. Given a set of degrees of freedom, finding the corresponding polynomial of degree 5 amounts to solving a linear system with a square matrix, for which existence and uniqueness for all right-hand sides are equivalent properties, as we already observed. We shall prove the latter property, i.e., that any polynomial peP5 such that 6ар(а,) = 0, |а|<2, Ui<3, 8vp(fly) = 0, Ui</<3, is identically zero. Let t denote an abscissa along the axis that contains the side T = [a1,a2']. Then the restriction p\r, considered as a function q of t, is a polynomial of degree 5, and q satisfies q(a1) = q'(a1) = q"(a1) = q(a2) = q'(a2) = q"(a2) = 0, since, if т is a unit vector on the axis containing the side T, we have 9'(fli) = 9tP(fliX q"(a1) = dz%p(a1), ...; hence q=0. Likewise, considered as a function r of t, the normal derivative 3vp along T is a polynomial of degree 4, and r satisfies г(а1)=/-'(а1)=ф12) = г(а2) = г'(а2)=0, since r(a1) = dvp(a1), r'(a1) = 8vtp(a1), r(a12) = 6vp(a12), ...; hence r = 0. Since we have 8rp = 0 along T'(p = 0 along T), we have proved that p and its first derivative Dp vanish identically along T. We now show that this implies that
88 P.G, Ciarlet Chapter II the polynomial X\ is a factor of p: After using an appropriate affine mapping if necessary, we may assume without loss of generality that i3(x1,x2) = x1. We can write 5 p(xi,x2)= £4p.-(*2) ! = 0 where рь0^^5, are polynomials of degree E —£) in the variable x2. Therefore p@,x2)=p0(x2)=0 forallx2eR, 51p@,x2)=Pi(x2)=0 forallx2eR, which proves our assertion. Similar arguments hold for the other sides, and we find that the polynomial (A? X\X\) is a factor of p. Since the A,- are polynomials of degree 1 that do not reduce to constants, it necessarily follows that p=0. □ With Theorem 9.1 we can define a finite element, the ^-quintic triangle, also called the 21-degree of freedom triangle, or the Argyris triangle (Fig. 9.1), after Argyris, Fried and Scharpf [1968]. Its basis functions may be found in Bernadou Argyris triangle, or ^-quintic triangle, or 21-degree of freedom triangle PT = P5(T), dimPr = 21 ^T = {p(ai),9ip(a,),82p(e,),Sup(Oi),812p(a,))e22P(Oi): KK3; Э,р(ау):1$1<;<3} r; = {p(ai): l^i^DpiaMaj-a,): Ui,j<3,;Vi; D2p(a,)(aj — ahak— a,): 1 < i, j, к г£ 3, )ф i, к ф i; SvPK): Ki<;<3} I? = {p(fl,.),Dp(fl|.)(<i1..1 -af),Dp(a,)(a,.+ 1 -a,): l«i«£3; £>2p(a,)K'+1 - a;J: 1 < ', i < 3; Dp(ati)vt: {i, j, k} = {1,2,3}, i < j}. Fig. 9.1
Section 9 Introduction to the finite element method 89 and Boisserie [1982, p. 71]. We also note that Theorem 3 of ZlAmal [1968] yields an alternate proof to Theorem 9.1. Figure 9.1 is self-explanatory as regards the graphical symbols used for representing the various degrees of freedom. We observe that at each vertex ab the first and second derivatives Dp(at) and D2p[at) are known. With this observation in mind, we see that other possible definitions for the set of degrees of freedom are the sets E'T and 1"T indicated in Fig. 9.1. In the expression of the set I"T, the indices are numbered modulo 3, and each vector vb 1 < i < 3, is the height originating at the point at. It may be desirable to dispose of the degrees of freedom Э„р(ау), 1<г'<;<3. This reduction will be a consequence of the following result. Theorem 9.2. Any polynomial in the space P'5(T) = {pe P5(T); 8vpe Р3(Г) for each side Г of T} (9.2) is uniquely determined by the following set of IS degrees of freedom: ZT = {&'p(ai):\a\^2,l^i^3}. (9.3) The space P'5(T) satisfies the inclusion P4(T)c=F5(T). (9.4) Proof. By writing dvpeP3(T) in definition (9.2), it is of course meant that, considered as a function of an abscissa along an axis containing the side T", the normal derivative dvp is a polynomial of degree 3. The inclusion (9.4) being obvious, it remains to prove the first part of the theorem. To begin with, we prove a preliminary result: Let T = [ah af] be a segment in W, with midpoint a^, and let v be a function such that v\T,e P4(T'). Then we have v\T.eP3(T') if and only if £y(p)=0, where Zy(») = Щщ) + viaj)) - Щаи) + Dv(ai)(aj -at) + Ща^ - а,.). (9.5) To see this, let, for any xe T',aA = D4v(x)t4, where т is a unit vector along T, so that a4 is a constant. Then we have Ф,) = Фу) + Щац){а{ - ау)+\D2 фу)(а,- - ауJ + iD3v{aij)(ai -ауK + £а41| a^a^ ||4, v(aj) = фу) + Dv(au)(aj - ay) + \D2 v[au){aj - ayJ +iD3v(aij)(aj-aijK + £а41| а3-аи ||4, from which we deduce (al—aij= — (a; — ay)): ф;) + v(aj) = 2фу) + \{D2 фу)(а, - ayJ + D2 фу)(а; - ayJ} + 2S^{\\ai-aij\\4 + \\aJ-aij\\4}.
90 P.G. Ciarlet Chapter II Likewise, Dv(ai)(ai - au) = D2v(aij)(ai - аиJ + jD3 «(ау)(а,- - ayK + £a4 Ц a( - ay |[4, Dviaj){aj - otj) = £>2 фу)(а^ - ayJ + i£>3 фу)(й/ - ayK + ia41| as - ay ||4, and therefore, £>2фу)(а,- - auJ +D2 ю(ац){а} - auJ = Dv(ai)(ai-aij) + Dv(aj)(aj-aij)-&4{\\ai-aij\\4+\\aj-aijf}. Combining our previous relations, we get 2v(au) = v(fli) + v{a})+i {Dvia^aj - а,-)+£>ф;)(а; - a_,-)} + <fea41| a; - a^ ||4, and the assertion is proved. As a consequence of this preliminary result, the space P'S(T) may be also defined as Р'5(Г) = {pe PS(T); Zy(SvP) = 0» Ui<7<3}. (9-6) i.e., we have characterized the space PH^) by the property that each normal derivative Svp(a;j) is expressed as a linear combination of the parameters 9°tp(aI), dxp(aj), | a [ = 1,2. Then the proof is completed by combining the usual argument with the result of Theorem 9.1. Q From Theorem 9.2, we deduce the definition of a finite element, called the reduced ^-quintic triangle, or the 18-degree of freedom triangle, or the Bell triangle, after Bell [1969]; see Fig. 9.2, where we have indicated three possible sets of degrees of freedom that parallel those of the Argyris triangle. Remark 9.1. The Argyris and Bell triangles should be also attributed to Felippa [1966], who described them for the first time. Given a triangulation made up of triangles, we associate a finite element space Xh with either type of finite elements. We leave it to the reader to derive the associated set of degrees of freedom of the space Xh and to check that the canonical basis is again composed of functions with "small" support. We shall only prove the following result. Theorem 9.3. Let Xh be the finite element space associated with Argyris triangles or Bell triangles. Then the inclusion Xhc<el(U)r\H2(Q) (9.7) holds. Proof. By Theorem 5.2, it suffices to show that the inclusion 1,с^'(й) holds. Let Ti and T2 be two adjacent triangles with a common side 7" = \bh bf\ (Fig. 9.3) and let vh be a function in the space Xh constructed with Argyris triangles. Considered as functions of an abscissa t along an axis containing the side T, the
Section 9 Introduction to the finite element method 91 Bell triangle, or reduced ^-quintic triangle, or li-degree of freedom triangle PT = F}(T) (cf. (9.2)), dimPT = 18 Zt = {Pfe), Sip(e,), Э2р(а;), St iPiai), Э12р(а,), 622р(а,): 1 =S К 3} £'r = Wa.-): 1<1<3;Вр(а,-)(а;-в|): l<i,j<3,jiH; 1Jр(а,-Ж—fli,eic-ai):Ui,7',fc«3,j#i>fe9ti} J'f = {р(а,)(Вр(в()(а1-_1 -a1-),Dp(fli)(aI+1 -a,): 1 *£i<3; D2p(a,.)(aJ+1-a/:Ui,^3} Fig. 9.2. Fig. 9.3. Two adjacent Argyris triangles. functions vh\Ti and гл|Гг are along V polynomials of degree 5 in the variable t. Call these polynomials </, and q2. Since, by definition of the space Xh, we have q(bi} = q'(bi) = l"(bi) = q(bj) = q'(bJ) = q"(bj) = 0, with q = qx—q2, it follows that q=0 and hence the inclusion Vhcz<g°(Q) holds.
92 P.G. Ciariet Chapter II Likewise, call r1 and r2, the restrictions to the side T" of the functions dvvh\Ti and —bvvh\Tl. Then both rt and r2 are polynomials of degree 4 in the variable t and, again by definition of the space Xh, we have Kb,) = r'(^)=КЬу) = Ф}) = r'(hj)=0, with r~r1—r2, so that r=0. We have thus proved the continuity of the normal derivative which, combined with the continuity of the tangential derivative (q=0 along T implies q' = 0 along T"), shows that the first derivatives are also continuous on Q. If the space Xh is constructed with Bell triangles, the argument is identical for the difference q = qi~q2- The difference r — rl—r2 vanishes because it is a polynomial of degree 3 in the variable t which is such that г(Ь() = г'(Ь£) = г(^.) = г'(^) = 0. □ To conclude, we give one instance of a rectangular finite element which may be used for solving fourth-order problems posed over rectangular domains. Its existence depends upon the following theorem, whose proof offers no difficulties. Theorem 9.4. Let T denote a rectangle with vertices at, 1 ^ i < 4. Then a polynomial peQ3 is uniquely determined by the following set of degrees of freedom: ST = {p(ai),d1p(aild2p(ai),d12p(ai):l^i^4}. (9.8) The resulting finite element is the Bogner-Fox-Schmit rectangle named after Bogner, Fox and Schmit [1965]; it is also called the У?1-bicubic rectangle; see Fig. 9.4, which is again self-explanatory for the graphical symbols. е„и Bogner-Fox-Schmit rectangle or W1 -bicubic rectangle PT = Qi, dimfV=16 IT = {p(ai), dlP(o,), e2p(fli), е12р(а,): 1 $ К 4} Fig. 9.4.
Section 10 Introduction to the finite element method 93 The proof of the next result is similar to that of Theorem 9.3. Theorem 9.5. Let Xh be the finite element space associated with Bogner-Fox-Schmit rectangles. Then the inclusion Xhc=Vl(Q)nH2{Q) (9.9) holds. Finally, one easily checks, using the standard construction, that a finite element space constructed with any one of the last three finite elements indeed possesses canonical bases whose functions have "small" supports. Remark 9.2. Other examples of finite elements yielding the inclusion Xh c<g'1@)n H2(Q) will be given in Sections 45-47. Whereas it is fairly easy to conceive finite element spaces contained in %>°(Q), the construction of finite element spaces contained in <^1(fl) is less obvious, as shown by the three examples described in this section (and by the even more intricate examples that will be given in Chapter VII; see also the discussion in Zienkiewicz [1971, Section 10.3]). This observation has been justified by the beautiful result of ZeniSek [1973,1974], who has proved the following: Let n = 2, let Xh be a finite element space where all the sets Tare triangles, and where the spaces PT are spaces of polynomials, i.e., there exists some integer / such that the inclusions Рт с Pt(T) hold for all Те 2Th (therefore finite elements of class Ш1 using "singular functions", or of "composite" type, as described in Chapter VII are excluded from the present analysis). Then, for any integer m>0, the inclusion ХксЯ?т(П) implies that, at each vertex b of the triangulation, the linear forms vh->d"vh(b) are degrees of freedom of the space Xh for all | а К 2m. As a corollary, the inequality / ^ 4m +1 holds (the proof of the corollary is simple, but the proof of the first result is by no means trivial). Thus for instance the particular choice m = 1 shows that the Bell triangle is optimal for fourth-order problems, since the dimension A8) of the space P'S(T) of (9.2) is the smallest possible for conforming finite element methods using piecewise polynomial spaces and triangles. Note that 2eniSek [1972] has also extended his results to the case of higher dimensions. See also Le Mehaute [1984] for further extensions. Finite element spaces whose functions are piecewise polynomials, and which are contained in ^""(£2), have also been studied by Barnhill and Gregory [1975b], Deleze and Goel [1976], Morgan and Scott [1975], Scott [1974], Strang [1973, 1974]. 10. Finite elements as triples (T,P,Z) and their associated Pr-interpolation operators ПТ Motivated by the previous examples, we are now in a position to give a general definition, first proposed by Ciarlet [1975, p. 61]: A finite element in № is a triple
94 P.G. Ciarlet Chapter II (£?,£) where: (i) Г is a closed subset of W with a nonempty interior and a Lipschitz- continuous boundary; (ii) P is a finite-dimensional space of real-valued functions defined over the set T; we let iV=dimP; (iii) Z is a set of N linear forms фи К i < N, defined over the space P and, by definition, it is assumed that the set Z is P-unisolvent, in the following sense: given any real scalars x^l^i^N, there exists a unique function peP that satisfies ф1(р) = а1, l^i^N. A0.1) Of course, this implies that the N linear forms ф^ are linearly independent. In particular, there exist N functions p,e P, 1 sSi^iV, that satisfy Ф}{рд = Ьф l^j^N, A0.2) and the following identity holds: Р=1Ф,(Р)Р1 for all peP. A0.3) The linear forms 0,, 1 ^ i ^ N, are called the degrees of freedom of the finite element, and the functions pt,l^i^N, are called the basis functions of the finite element. Whenever we find it convenient, we shall use the notations PT, ZT, ф1Т and pt, in lieu of Р,Е,ф( and pt. Remark 10.1. The basis functions are also called the shape functions in the engineering literature. Remark 10.2. We shall see later that, in practice, the linear forms ф1 are in fact defined over spaces that are larger than P, but we ignore this fact for the time being. Remark 10.3. The set T itself is often called a finite element; this is clearly an abus de langagel Remark 10.4. The P-unisolvence of the set Z is equivalent to the fact that the N linear forms ф{ form a basis in the dual space of P. One may then view the bases {<tpi)?= i and (Pi$= i as being dual bases, in the algebraic sense (cf. in particular identity A0.3)). In the light of the definition of a finite element, let us briefly review the examples given in the previous sections. We have seen examples for which the set Г is either an n-simplex, in which case the finite element is said to be simplicial, or triangular iin — 2, or tetrahedral if n = 3, or an n-rectangle in W, in which case the finite element is said to be rectangular. As we already mentioned, these are all special cases of straight finite elements, i.e., for which the set T is a polyhedron in W. Other polygonal shapes are found in practice, such
Section 10 Introduction to the finite element method 95 as quadrilaterals (see Sections 36 and 46) or "prismatic" finite elements (cf. Fig. 12.1). We shall also describe (Section 36) "curved" finite elements, i.e., whose boundaries are composed of "curved" faces. The main characteristic of the various spaces P encountered in the examples is that they all contain a "full" polynomial space Pk(T) for some integer fc^l, a property that will be shown in the subsequent chapters to be crucial as far as convergence properties are concerned. In all the previously described examples, the degrees of freedom were of some of the following forms: p-*p(af), p->Dp(amk, (Ю.4) p->D2p(amik,ti), where the points a',r=0,1,2, belong to the finite element, and the nonzero vectors £», £?ь £?i are either "attached to the geometry" of the finite element (as in Dp(ai)(aj — aj), 6vp(ai;),...) or are fixed vectors of R" (as in 9,р(аД 80р(ак)). The points a\,r=0,1,2, are called the nodes of the finite element. Whereas only directional derivatives of order 1 or 2 occurred in the examples, one could conceivably consider degrees of freedom that would be partial derivatives of arbitrarily high order, but these are seldom used in practice. As we shall see later (cf. Sections 32 and 50) there are however practical instances of degrees of freedom that are not attached to nodes: They are instead averages (over the finite element or over one of its faces) of some partial derivative. When all the degrees of freedom of a finite element are of the form р-+р(аг), we say that the associated finite element is a Lagrange finite element, while if at least one directional derivative occurs as a degree of freedom, the associated finite element is said to be a Hermite finite element. As the examples in the previous sections have shown, there are essentially two methods for proving that a given set I of degrees of freedom is P-unisolvent: After it has been checked that dim P = card(X), one either exhibits the basis functions, or one shows that if all the degrees of freedom are set equal to zero, then the only corresponding function in the space P is identically zero. Note that we have used the first method for all the examples, except for the Argyris triangle. Given a finite element (T,P,E) and a function v: T-*U sufficiently smooth, so that the degrees of freedom 4>i(v), 1 =$ i < N, are well defined, we let m= £ ФШ (Ю.5) denote the P-interpolant of the function v; the P-interpolant is also denoted nTv if necessary. Since the set I is P-unisolvent, the P-interpolant is also the unique function that satisfies UveP, ф;(т) = ф&), l^i^N. A0.6) Whenever the degrees of freedom are of the form A0.4), let s denote the maximal
96 P.O. Ciariet Chapter II order of derivatives occurring in the definition of the set 1. Since, for all the finite elements described so far, the inclusion Pcz<£s(T) holds, we shall consider that the domain dom П of the P-interpolation operator П is the space dom П = ЩТ). A0.7) This being the case, it follows that the F'-interpolation operator reduces to the identity over the space P с dom П, i.e., Пр=р for all pe P. A0.8) In order that the P-interpolation operator be unambiguously defined, it is therefore necessary that the forms фь which are a priori only defined on the space P, be also defined on the space ^S(T). To see this, assume again that the space P is contained in the space <ё'(Т). Then if a degree of freedom were only defined over the space P, it would have infinitely many extensions to the space ^(T). Let us give one simple example of such a phenomenon: Let T be an n-simplex with barycenter a. Then the linear form pe^TD——^ lpdx meas(T) j т is one possible extension of the form pe P1(T)->'p(a). Of course, these considerations are usually omitted: when a degree of freedom such as dipiuj) is considered for instance, it is implicitly understood that this linear form is the usual one, i.e., it is defined over the space ^(T), but it is not any one of its other possible extensions from the space P to the space <^1(T). For another illustration of this circumstance, see the description of Wilson's brick in Section 32. Whereas for a Lagrange finite element, the set of degrees of freedom is unambiguously defined (indeed, it can be conveniently identified with the set of nodes), there are always several possible definitions for the degrees of freedom of a Hermite finite element which correspond to the "same" finite element. More precisely, we shall say that two finite elements (T, P, I) and (S, Q, E) are equal if we have T=S, P = Q, ПТ = П8. A0.9) To illustrate this point, consider the reduced cubic Hermite n-simplex. Two possible sets of degrees of freedom for this element are (cf. Fig. 8.2): Z = {p{al): 1^г'<п + 1;£)р(а,.)(а7.~а;): l^i,j^n+ U#;'}, E' = {p{ai): l^!'sSn + l;c\p(a;): ls£i^n+l,l*£fcs£n}. Let us denote by П and П' the corresponding P3(T)-interpolation operators. Then, for any function ve сё1 (Г) = dom Я = dom П', we have, with self-explanatory
Section 11 Introduction to the finite element method 97 notations, IIv = £ via^Pi + £0ф,)(ау - aL)pip Я'» = £ viajp't + ^кФдРш, i i,k and n i=l for appropriate coefficients fiijk for each pair (i,j). To conclude that П = П', it suffices to observe that for each polynomial ре Р'з(Т), one also has Dp(ai)(aj — ai) = 4t=i /4>AP(a>) with the same coefficients цт. 11. Affine families of finite elements We now come to an essential idea, which we shall first illustrate by means of an example. Suppose that we are given a family of quadratic triangles (T,PT,ET), and that we want to describe such a family as simply as possible. Let f be a triangle with vertices d;, and midpoints of the sides aij = j(ai + aJ), 1 < i<J< 3, and ЫЕ ={p(ai): l^i^3;p{du): 1<i'<j'<3}, so that the triple (T,P,E) with P=P2(f) is also a quadratic triangle. Then for each finite element (T, PT, ET) in the family (Fig. 11.1), there exists a unique invertible affine mapping FT: xe U2->FT(x) — BTx + bT, Fig. 11.1. Two quadratic triangles are affine-equivalent.
98 P-G. Ciarlet Chapter II where BT is an invertible 2x2 matrix and bT is a vector of U2, such that Рг(а,.) = а,-, l^i<3. Then it automatically follows that *V(«y) = ey> l^i<;<3, since the property for a point to be the midpoint of a segment is preserved by an affine mapping (likewise, the points that we called a{iJ or aijk keep their geometrical property through an affine mapping). Once we have established a bijection xe T-+x = FT(x)e T between the points of the sets f and T, it is natural to associate the space QT = {p:T->U;p=poFT1,peP} with the space P. Then it automatically follows that Qt = Pi(T) = Pt, because the mapping FT is affine. In other words, rather than prescribing such a family by the data T, PT and ZT, it suffices to give one referencefinite element (f, 1, P) and to specify the affine mappings FT. Then the generic finite element (T,PT,ZT) in the family is such that T=FT(f), PT = {p--T-+R;p=poF-1,peP}, ZT = {p(FTto)): U»'<3;p(fг(йц)): Ui</<3}. With this example in mind, we are in a position to give the general definition: Two finite elements (f, P, 2) and (T, P, Z), with degrees of freedom of the form A0.4) are said to be affine-equivalent if there exists an invertible affine mapping: F:x€U"-*F(x) = Bx + beU", A1.1) such that the following relations hold: T=F(T), A1.2) P = {p:T^R;p=p°F-\peP}, A1.3) a\ = F{a'd, r = 0,l,2, A1.4) &=B&, a = B&k, Й = ВЙ, A1.5) whenever the nodes а,, respectively c-, and vectors £», £JL £jj, respectively <fa» I*. &h occur in the definition of the set Z, respectively Z. Remark 11.1. The justification of the relations A1.5) will be found in the proof of Theorem 11.1. Remark 11.2. Let (Г, Р, Z) be a finite element with degrees of freedom of the form A0.4), let F: R"-»Rn be an invertible affine mapping and let Г, Р and Z be defined
Section 11 Introduction to the finite element method 99 through relations A1.2)-A1-5). Then it is easily seen that the triple (T, P, I) is a finite element. This observation thus provides a systematic means of constructing finite elements. For instance, let {t,P,£) be an n-rectangle of degree k. Then finite elements whose associated sets Г are parallelograms for и = 2, parallelepipeds for n = 3,..., can be constructed in this fashion (such finite elements are seldom used in practice however). The real merit of this observation lies in that mappings F more general than affine mappings may be allowed: This is the basis for defining an important class of "curved" finite elements (cf. Section 35). With this definition of affine-equivalent finite elements in mind, let us review the examples given so far (the reader should check the details of the various assertions that follow). To begin with, it is clear that two n-simplices of the same degree к ^ 1 are affine equivalent, and that this is also the case for reduced cubic и-simplices, in view of the definition F.12) of the associated space PT. Likewise, two п-rectangles of the same degree к ^ 1, or two reduced biquadratic, or bicubic rectangles are affine equivalent through diagonal affine mappings. In other words, two Lagrange finite elements of any one of the types considered so far are affine-equivalent. As regards Hermite finite elements, the situation is less simple. Consider for example two cubic Hermite n-simplices with sets of degrees of freedom of the form ET (Fig- 8.1). Then it is clear that they are affine-equivalent because the relations a^-a;=JF(aJ.)-F(ai) = B(aJ.-d;), ls%ij's£n + l, j^i, hold among other things. However, had we chosen sets of degrees of freedom in the form Z'T, it would not have been clear to decide whether the two finite elements were affine-equivalent, and yet these two sets of degrees of freedom correspond to the same finite element, as we already pointed out. The same analysis and conclusion apply to the reduced cubic Hermite n-simplex, or to the Bogner-Fox-Schmit rectangle. In the latter case, it suffices to observe that this rectangular finite element can also be defined by the following set of degrees of freedom (the index i being counted modulo 4) X'T = {p(ai),Dp(ai){ai-1 -ai),Dp{ai){ai+1 -at), D2p{ai)(ai-1-ai,ai+1-aiy. l<i<4}, A1.6) whose degrees of freedom satisfy relations A1.4) and A1.5). There are counter-examples. For instance, consider a finite element in R2 where some degrees of freedom are normal derivatives at some boundary nodes. Then two such finite elements are not in general affine equivalent: The property for a vector to be normal to a side is not in general preserved through an affine mapping. Thus two Argyris triangles are not affine-equivalent in general, except (for instance) if they happen to be both equilateral triangles. In the same vein, it can be shown that two Bell triangles are not affine-equivalent in general, because condition A1.3) is violated. Whenever two finite elements are affine-equivalent, we shall systematically use the correspondences
100 P.G. Ciarlet Chapter II xef-*x = F{x)eT, A1.7) peP-tp^poF-teP, A1.8) between the points x e t and xeT, and the functions peP and peP. Notice that these correspondences imply that p(x)=p{x) for all J?ef, peP. A1.9) We next prove a crucial relationship between the P-interpolation operator П and the P-interpolation operator П associated with two affine-equivaient finite elements (T, P, f) and (T, P, E). This relationship is a consequence of the fact that the basis functions are also in the correspondence A1.8), as we now show: Theorem 11.1. Let (T,P,E) and (T,P,E) be two affine-equivaient finite elements with degrees of freedom of the form A0.4), and let ph l^i^JV, denote the basis functions of the finite element (T,P,E). Then the functions Pi=pi°F~1, Ki<N, are the basis functions of the finite element (T, P, E). In addition, the associated P-interpolation operator П and the P-interpolation operator П satisfy {nvf = m, A1.10) for all functions yedom П and uedorn П associated in the correspondence u6domi7-*u = tJoF-1Gdomn. A1.11) Proof. The P-interpolation operator П is of the following form (the notations are self-explanatory): IIv = liv(a?)p? + ll{Dv(ai)Zhl}phc+ £ {DMafML ti)}ph. i i,k itk,l Using the chain rule, we obtain Dv(al)a=Dv(F(a} ))B& = Dv{F{a\ ))DF{a\ )& =D(vF){u})& = m&NL and D2v(af)(a, ti) = D2v(F(d?)№?k, BU) = D2v(F(al))(DF(a?)£fk, DF{af)&) =D2(vF)(aMfk, U) = D2v(afMfk, 8), since D2F = 0. Thus we also have n»=£«(a?)p?+IP»(#)&}pi+ I {D2v(af)(ifk,U)}pii i Uk i,k,l from which we deduce, using the correspondence A1.8), {nvT=Ylvia?)p?+Z{msl)ik}pk+1 {D2wmL&)}*&«■
Section 11 Introduction to the finite element method 101 Using functions v e P in this relation, we infer that the functions pf, pfk, pfk are the basis functions of the finite element (f,P,I), by virtue of identity A0.8). Using this result, we conclude that the function {Ilv}" is equal to the function Ш, by definition of the P-interpolation operator П. □ Remark 11.3. The basis functions of each simplicial finite element (T,P,E) described in Sections 6 and 8 are polynomial functions of the barycentric coordinates attached to the n-simplex T. The invariance of the barycentric coordinates by the affine mapping F shows directly in this case that the basis functions are in the correspondence A1.8). It is easy to verify that the conclusions of Theorem 11.1 hold in the more general situation where the sets of degrees of freedom are given as £ = {<f>t: l^i^N} and 1={фг. 1 <i^N}, provided the degrees of freedom satisfy ^,@)=M»), ККЛГ, foralluedomn, A1.12) and, in essence, the proof of Theorem 11.1 consisted in showing that these relations are satisfied when the degrees of freedom are of the form A0.4). Remark 11.4. In Section 32, we shall encounter an instance of a degree of freedom, which is not of the form A0.4) but which satisfies relations A1.12). A family of finite elements is called an affine family if all its finite elements are affine-equivalent to a single finite element (T, P, I), which is called the reference finite element of the family (the finite element (t, Р, t) need not belong itself to the family). If an affine family consists of simplicial finite elements, a customary choice for the set t is the unit n-simplex, whose vertices are fii=(l,0,...,0), d2=@,l,0,..,,0), d„=@,...,0,l), a„+1=@,0,...,0). In this case, the barycentric coordinates take the simple form n li = xi, Ui^n, Ля+1=1-£x(. If an affine family consists of rectangular finite elements, customary choices for the set t are the unit hypercube [0,1]", or the hypercube [— 1,1]". The concept of an affine family of finite elements is of crucial importance, for the following reasons: (i) In practical computations, most of the work involved in the computation of the coefficients of the linear system E.4) is performed on a reference finite element,
102 P.G. Ciarlet Chapter II not on a generic finite element. This point will be further illustrated in Sections 25 and 39. (ii) For such affine families, a fairly elegant interpolation theory can be developed (Section 16), which is in turn the basis of convergence theorems. (iii) Even when a family of finite elements of a given type is not an affine family, it is generally associated in a natural way with an affine family whose "intermediate" role is essential. For example, when we shall study in Section 45 the interpolation properties of the Argyris triangle, an important step will consist in introducing a slightly different finite element, the ^"-quintic Hermite triangle, which can be imbedded in an affine family. In the same fashion, we shall consider in Section 37 the "isoparametric" families of curved finite elements essentially as perturbations of affine families. 12. General properties of finite element spaces Our next task is to give a precise description of the construction of a finite element space with finite elements (T,PT,IT), Те &~h. For the sake of simplicity, we shall restrict ourselves to the case where the finite elements are all polygonal, so that the set & = U Te^ii T *s necessarily polygonal, and to the case where the finite elements are all of Lagrange type. These restrictions will allow us to avoid technical difficulties, such as appropriately defining a "face" of a nonpolygonal finite element, or explicitly stating the compatibility conditions that degrees of freedom of adjacent Hermite finite elements should satisfy. There are indeed polygonal finite elements that are used in actual computations, and which are neither n-simplices nor и-rectangles. Of course, such finite elements are not just arbitrary polygonal domains. Rather they are adapted to special circumstances: Thus, if the domain Q is a cylindrical domain in M3, it might be appropriate to use prismatic finite elements, an example of which is given in Fig. 12.1: Fig. 12.1. An example of a prismatic finite element.
Section 12 Introduction to the finite element method 103 The space P is the tensor product of affine polynomials in x1, x2 by the space of affine polynomials in x3, i.e., a function p in the space P is of the form p(x1,x2,x3) = y1+y2x1+y3x2+ytlx3+y5x1x3+y6x2x3. Returning to the general case, we shall assume that each polygonal set T has a nonempty interior, and that the interiors of distincts sets Те 2Th are pairwise disjoint, so that requirements {&~h\)-{!Fh4) of Section 5 are satisfied (a polygonal domain has a Lipschitz-continuous boundary). A portion T of the boundary of T is a face if it is a connected subset of an affine hyperplane of R" with a nonempty interior relatively to this hyperplane. In order to unambiguously define the functions of the finite element space, we ask that the finite elements satisfy the following requirement, which generalizes condition (^"/,5) already required for п-simplices and п-rectangles (Section 6 and 7): (■^"/,5) Any face of a finite element I1! is either a face of another finite element T2, in which case the finite elements 7\ and T2 are said to be adjacent, or it is a portion of the boundary of the set Q. Finally, we ask that the sets of degrees of freedom of adjacent finite elements shall be related as follows: Whenever (Tl,Pl,Zl), with 2;' = {p(a|): 1 ^i^N1}, 1=1,2, are two adjacent finite elements, then (Й{а!-})пТ2=(У{а,?})пГ2- A2Л) We define the set Nh= (J NT A2.2) TePh where NT denotes the set of nodes of the finite element (T,PT, IT). For each b e Nh, we let T(X), Ae Л{Ь), denote all those finite elements for which b is a node. Then the associated finite element space Xh is defined by Xh = lv=(vT)Teyhe J] рт\ (- Te3Th VTw(b) = vT(ll)(b) for all be Nh and all l,fie A{b) and a function in the space Xh, is uniquely determined by the set Zh = {v(b):beNh}, A2.3) which is called the set of degrees of freedom of the finite element space. Since Xh is a priori defined as a subspace of the product space ITr6^rhPr, it is thus
104 P.G. Ciarlet Chapter II realized that an element veXhis not in general a function defined over the set Д since it need not have a unique definition along faces common to adjacent finite elements. It is nevertheless customary to say that the elements of the space Xh are "functions" over П, which, by virtue of assumption A2.1) are at least continuous at all nodes common to adjacent finite elements (the inclusions Ртасё0{Т), Те 3~h, usually hold in practice). It is also a usual practice to consider the functions vT, Те 3~h, as being the restrictions to the finite elements Г of the function ye Xh, just as if v were an "ordinary" function defined over the set U. This is why we shall use the alternate notation v\T = vT. If it happens, however, that for each function ve Xh, the restrictions v\Tl and v\T2 coincide along the face common to any pair of adjacent finite elements 7\ and T2, then the function v can indeed be identified with a function defined over the set U. Although this last property is satisfied by all the finite element spaces that we have constructed so far, it is by no means necessary. Following Crouzeix and Raviart [1973], let us consider for example the finite element space constructed with the following finite element (T,P,S): The set T is an n-simplex with vertices Qj, 1 «$ j < n +1, the space P is the space P x (Г) and the set of degrees of freedom is the set IT = (p(Cj): 1 ^ i ^ n +1}, where for each i the point c; is the barycenter of the face that does not contain the point a;,i.e., Cj= - £ dj, ligi<n+l. Fig. 12.2. The nonconforming Crouzeix-Raviart linear triangle and tetrahedron. The resulting finite element is called the nonconforming Crouzeix-Raviart linear n-simplex; cf. Fig. 12.2 for the special cases n — 2 and 3. To show that the set ET is P^TJ-unisolvent, it suffices to observe that the points сi = (Cj-i)"= i, К i < и +1, are also the vertices of an n-simplex: If we Jet С denote the
Section 12 Introduction to the finite element method 105 (n +1) x (n +1) matrix defined by C = u '12 -21 ^22 ■■"nl Ы2 -l,n+l \ un,n +1 1 v i ■ 1 / it is easily verified that det C=( — l/n)"det Л, where A is the matrix of F.3) and thus detC#0. One may also notice that the functions Pi = l — nXu l^i^n+1, are the associated basis functions. Then it is clear that the functions of the corresponding finite element space generally have two definitions along faces common to adjacent finite elements, except at the centroids of these faces. Remark 12.1. This triangular finite element has no "rectangular counterpart"! For let T be the unit square and let the points a,-, 5 < i < 8, be as in Fig. 7.2. Then the set {p{cii): 5^is*8} is not 6i(T)-unisolvent (consider the polynomial p(x1,x2) = (xi-i)(x2-i)). In the same vein, let T be a triangle with vertices flj,l<i<3, and let aUi = £Ba,-+ a,-). Then the set {р(ащ): l^i,j^3,i=£j} is not P2(T)-unisolvent. All the previous considerations can be extended so as to include the case of finite element spaces constructed with Hermite finite elements: The only difficulties are technical; for instance, extending the compatibility conditions A2.1) to general Hermite finite elements is easy in its principle, yet it involves cumbersome notational devices. We shall simply point out that it is usually necessary to choose between various possible sets of degrees of freedom corresponding to the same finite element, so as to unambiguously define a set Ih of degrees of freedom of the corresponding finite element space. These considerations have been illustrated at various places in Sections 8 and 9. When the degrees of freedom of all the finite elements encountered in a finite element space are of the forms A0.4), the degrees of freedom of the finite element space are of the following forms: v->v(b% v->Dv{b})rj}k, v-^D2tib]){ri]k,nll A2.4) where the points Ц,г — 0,1,2, are called the nodes of the finite element space. If we write the set Ih as ^ = {^:1<J'<M}, A2.5) then the basis functions wp l^j^M, of the finite element space are defined by the
106 P.G. Ciarlet Chapter II relations It is easy to verify on each example that the basis functions of the finite element space are constructed by appropriately assembling the basis functions of the finite elements. More specifically, let фн е Eh be a degree of freedom of one of the forms A2.4), let b be the associated node, and let Tx , A e A(b), denote all the finite elements of ^~h that contain b as a node (see Fig. 12.3, where bilinear rectangles are considered). For each Ae A(b), let pk denote the basis function of the finite element Тл Fig. 12.3. Construction of a basis function of the finite element space associated with bilinear rectangles. associated with the restriction of фн to Тд. Then the function we Xh defined by fpA over Тх,ЛеЛ(Ь), 0 elsewhere, w = A2.7) is the basis function of the space Xn associated with the degree of freedom фи. As a practical consequence, requirement (FEM3) (existence of a basis of Xh whose functions have "small supports") is always satisfied by the examples. The reader should refer to Fig. 6.8 where it was shown on an example that the basis functions constructed in this fashion have indeed "small" supports. The "worst" case concerns a basis function attached to a vertex, say b, of the triangulation. In this case, the corresponding support is the union of those finite elements that have b as a vertex. In most commonly encountered triangulations in the plane, the number of such finite elements is very low (six or seven, for example). Let there be given a finite element space Xh with a set of degrees of freedom of the form A2.5). With any function v:U-+U, sufficiently smooth in order that the degrees of freedom <£j,A(t>), l^j'^M, be well defined, we associate the function nhV= £ ФзА^}, A2.8) j=i
Section 12 Introduction to the finite element method 107 where the functions Wj are the basis functions defined in A2.6). The function IIkv, which is called the Xk-interpolant of the function v, is equivalently characterized by the conditions A2.9) nhve Xk, If s denotes the maximal order of directional derivatives found in the finite elements {T,PT,IT), e3~k, we shall consider, in accordance with A0.7), that the domain dom Лл of the X^-interpolation operator Пк is the space dom nh = T(Q). A2.10) It might be helpful to keep in mind the following tableau (Fig. 12.4), where we have listed the main "global" (i.e., on Q) versus "local" (i.e., on a generic finite element (T,P,I)) notations, definitions and correspondences. "Global" definitions S3 Boundary of the set Q: Г Triangulation of the set U: 0~ъ Finite element space: Xh Generic function of Xh: v or vh Set of degrees of freedom of Xh ■ £ь Degrees of freedom of Xh: Фн or <t>Jih,l*:j^M Basis functions of Xh: v/s,l^j^M Nodes of X/,: bj ^-interpolation operator: TIh "Local" definitions (T,P,I): finite element ST: boundary of T; T: side, or face, of T P ot PT = {v\T;veXh] p or pT: generic function of PT Г or IT: set of degrees of freedom of (T, P, Г) ф, or фит, l^i^JV: degrees of freedom of (T, P, Г) Pt or piT, 1 ^i^N:basis functions of (T,P, 1) а,,аф.'..:nodes of(T,P,S) П or Пт: PT-interpolation operator Fig. 12.4. "Global" definitions, pertaining to a finite element space Xh, and "local" definitions, pertaining to a finite element (T, P, £). We next state a relationship of paramount importance between the "global" interpolation operator nh and the "local" interpolation operators Пт. Theorem 12.1. Let v be any function in the space domIlh. Then the restrictions v\T belong to the spaces dom Пт, and we have {nhv)\T = nT(v\T) for all Те Гк. A2.11) Proof. These relations are direct consequences of the way in which the set Ik is derived from the sets IT, Те !Fk. □ It has always been assumed thus far that all the finite elements (T, PT, ST), Те $~к, used in the definition of a finite element space are all of the same type. By this, we mean that, for instance, the finite elements are all quadratic n-simplices, or that the
108 P.O. Ciarlet Chapter II finite elements are all Argyris triangles, etc. If this is the case, we shall say that any finite element (T,PT,ET), Те 3~h, is the generic finite element of the finite element space. The next two definitions are of particular importance, in view of Theorems 5.1 and 5.2. We say that a finite element (T, PT, ET) is of class #° if (i) the inclusion Рт с <g°(T) holds and (ii) whenever it is the generic finite element of a triangulation and Tt and T2 are two adjacent finite elements, the restrictions vh\Tl and vh\Tj coincide along the face common to Tx and T2 for any function vh in the corresponding finite element space. Hence the inclusion Xhc<^°(Q) holds in this case. Likewise, we say that a finite element (T,PT,IT) of a given type is of class ^l if (i) the inclusion PT cg^CT) holds and (ii) whenever it is the generic finite element of a triangulation and T: and T2 are two adjacent finite elements, for any function vh in the corresponding finite element space the restrictions vh\Tl and vh\Tl coincide along the face T common to Tt and T2 and the outer normal derivatives satisfy 3v^!r, +5v^lr2=0 along T". Hence the inclusion Xh<=.<$x(U) holds in this case. Thus all the finite elements seen in Sections 6-8 are of class <&°, the Argyris and Bell triangles and the Bogner-Fox-Schmit rectangle (Section 9) are of class (ё1. There are also finite elements that are not of class сё°, such as the nonconforming Crouzeix-Raviart linear n-simplex (cf. Fig. 12.2). One may also combine finite elements of different types, provided some compatibility conditions are satisfied along faces that are common to adjacent finite elements, in such a way that a function in the resulting finite element space Xh is still unambigously defined on the one hand, and an inclusion such as Xhcz<&°(Q) (for example) holds on the other hand: For instance, one may combine n-simplices of degree к with n-rectangles of degree k; another example is considered in Fig. 12.5. •— •— reduced о biquadratic rectangle i • Fig. 12.5. Combining finite elements of different types in a finite element space. The last topic we wish to examine in this section is: How are boundary conditions taken into account in a finite element space? Again, we shall essentially concentrate on examples. Let Xh be a finite element space whose generic finite element is one of the following: и-simplex of degree k,k^l, reduced cubic n-simplex, «-rectangle of degree k,k^\, reduced biquadratic or bicubic rectangle. Then the inclusion Xh с <£°{йIлН1{П) holds (Theorems 6.3 and 7.4) and thus the inclusion XOh = {vkeXh;vh\r = 0}^Hh(Q) A2.12) holds. In each case, it is easily verified that a sufficient (and clearly necessary)
Section 12 Introduction to the finite element method 109 condition for a function vh e Xh to vanish along Г is that it vanishes at all the boundary nodes, i.e., those nodes of the space Xh that are on the boundary Г. In other words, if we let Nh denote the set of nodes of the space Xh, the finite element space X0k of A2.12) is simply given by X0h = {vhe X„;vh(b)=0 for all be Ы„пГ}. A2.13) When Hermite finite elements are used, the situation is less simple. Let us consider for example cubic, or reduced cubic, Hermite и-simplices. For each boundary node be Nhr\r, we let ry(b),ye Г(Ь), denote a maximal set of linearly independent vectors in W with the property that the points (b + ty{b)),ye Г(Ь), belong to the boundary Г. Then the space X0h of A2.12) is given in this case by X0h = {vhe X„; i>„(b)=0 for all be Ы„пГ, Dvh(b)ty{b) = 0 for all be Ы„пГ and all ye Г(Ь)}. A2.14) Fig. 12.6. When cubic, or reduced cubic, Hermite triangles are used, the values vk(b), vh(b*), and the directional derivatives Dvh{b)z1,Dvh{b*)xl, Dvh(b*)x2 must vanish in order that vk=0 on Г. As an illustration, we have indicated in Fig. 12.6 the directional derivatives that must be set equal to zero along a specific portion of the boundary of a polygonal set in U2, when cubic or reduced cubic, Hermite triangles are used. In particular, one should observe that at a corner, such as b*, the directional derivatives Dvh(b*)fi3 and Dvh(b*)iiA must necessarily vanish. If we next assume that the inclusion Xh c<^,1(iQ)nH2(Q) holds, it follows that we have the inclusions X0k = {vheXh; vh\r=0}czH2(G)nHtiQ), A2.15) XOOh = {vheXh;vh\r=dvvh\r = 0}cHtiQ), A2.16) so that we are facing the problem of constructing such spaces X0k and X00k. Again they are obtained by canceling appropriate values and directional derivatives at boundary nodes. As an example, we have indicated in Fig. 12.7 all the directional derivatives that must be set equal to zero when Argyris triangles are used and the second inclusion A2.16) is needed. It should be realized that at a boundary node such
110 P.G. Ciarlet Chapter II у(Ьг) = Э,у(Ьг)=Э^(Ьг> = a„v(b,8) = o v(b,) = a,v(b,) = avv(b1)- ~f~fk j = aI1v(b,) = a„v(b1)=a^v(b,)=o b, V. x Fig. 12.7. Directional derivatives that must vanish in order that t> = 5vi>=0 on Г when Argyris triangles are used. as b2, the only "free" degree of freedom is Qvvvh(b2) while all degrees of freedom are zero at a corner such as bt. We shall also record for subsequent uses the following crucial properties: (i) All finite elements of class <^° and of class <^1 described in Sections 6-9 are such that vedomIIh and v\r = 0 => IIhveX0h, A2.17) where the finite element space X0h is defined as in A2.12), or A2.15) if finite elements of classЧ?1 are employed. (ii) All finite elements of class её1 described in Sections 6-9 have the property that veuom.TIh and v\r = 5vv\r = 0 => IIhveX00k, A2.18) where the finite element space X00h is defined as in A2.16). Notice that it is clearly possible to extend the previous considerations so as to include the case where boundary conditions are imposed only over a portion Г0 of the boundary Г, provided such a portion Г0 is exactly the union of some faces of the finite elements found in the triangulation. Remark 12.2. Let us briefly show how some "global" properties, i.e., of a finite element space, may be in fact derived from a "local" property, i.e., of a generic finite element. For simplicity, we restrict ourselves to Lagrange finite elements, leaving the case of Hermite finite elements to the reader. Let (T, P, I) be a Lagrange finite element, with ЛГ as its set of nodes; hence the set of degrees of freedom is of the form E = {p{a): ae N}.lfT is any face of the set T, we let 2|r = {p(a):aeWn7"}, РG") = {р|г;Г->Ш:рбР}.
Section 12 Introduction to the finite element method 111 Then all the Lagrange finite elements heretofore described, except the nonconforming Crouzeix-Raviart linear triangle, have the property that for each one of their faces T, the set E\r is P(T')-unisolvent. This crucial "local" property has the following easily established consequences: (i) The Pr-interpolant of a function v e dom Пт which vanishes along a face T also vanishes along T. As a consequence, the "global'' property A2.17) holds. (ii) Let ф: ре P-*p{a) be one of the degrees of freedom of the finite element and let p be the associated basis function. Then the basis function p vanishes along any face that does not contain the node a. The wa_y in which the basis functions of the space Xh are constructed from the basis functions of each finite element (cf. A2.7)) then implies the "global" property that the basis functions of the space Xh have indeed small supports (FEM3). (iii) Assume in addition that, for each pair {Т^^г) of adjacent finite elements found in a triangulation, one has PTl \T = РТг\т along the common face T", and that the inclusions Рт с %°(Т), Те Ph, hold. Then the "global" inclusion Xh c<e°{Q) holds. The choice of a finite element for solving a given problem usually relies on the following considerations: (i) The finite element must be well adapted to the geometry of the problem, and yet be easy to assemble. For example, assembling tetrahedra is not an easy task and thus prismatic finite elements (see Fig. 12.1 for an example) are usually preferred whenever possible. In this respect, see the discussion in Zienkiewicz [1971, Chapter 6]. Geometrical considerations may also justify the choice of curved finite elements (cf. Chapter VI) instead of straight finite elements for some "particularly curved" domains. (ii) The finite element must satisfy appropriate continuity requirements: They must be of class <ё°, or of class Ш1, if we are to solve a second-order, or a fourth-order, problem by a conforming method. (iii) We shall see that a mathematical proof of convergence requires (among other things) the inclusions Pt{T)<=.PT> Те УА, for second-order problems, and the inclusions P2(T)^PT,Te$~h, for fourth-order problems. Incidentally, the engineers were well aware of these conditions, which they discovered empirically, long before the mathematicians undertook the numerical analysis of the finite element method! (iv) The coefficients of the resulting linear system E.4) should be easy to compute on the one hand, and the linear system should be as easy as possible to solve on the other hand. We shall very briefly discuss here these fundamental practical aspects of the finite element method, by simply recording two simple guidelines that tend to reduce certain computational difficulties: A first guideline is that, if possible, the sets of degrees of freedom associated with a given node in the triangulation be all alike, so as to avoid different instructions depending on which node is considered. This explains for instance why reduced cubic Hermite n-simplices may be preferred to cubic Hermite n-simplices, or why Bell triangles may be preferred to Argyris triangles, even though there is in both
112 P.G. Ciarlet Chapter II cases a decrease of one in the order of convergence, as we shall see; in addition, such choices slightly reduce the dimension of the resulting linear system. A second guideline is that each node of the finite element space should be common to the greatest possible number of finite elements. For example, it is easily realized that, for a given triangulation, cubic Hermite triangles lead to a smaller linear system than cubic triangles; yet the asymptotic rates of convergence are the same, as we shall see. (v) In addition, miscellaneous aspects may be considered. For instance, one may argue that cubic Hermite triangles introduce artificial "constraints" (the continuity of the first derivatives at the vertices) on the one hand, but on the other hand, this is an advantage when solving the boundary value problem of linearized elasticity, if one needs to compute the stresses (cf. C.29)) at the vertices. Likewise, one may argue that the use of Argyris triangles for solving a plate problem introduces artificial "constraints" (the continuity of the second derivatives at the vertices and "extra" boundary conditions as shown in Fig. 12.7) on the one hand, but on the other hand, this is an advantage if one needs to compute the "bending moments" at the vertices (the bending moments are linear expressions involving the second partial derivatives of the solution), etc. For more details, the reader is referred to the books listed in the Introduction of this chapter. 13. General considerations on the convergence of finite element methods and Cea's lemma Whereas up to now, our discussion has been concerned with one discrete problem, we shall now consider families of discrete problems. More specifically, assume that we are approximating the solution и of the variational equations a(u,v) = l{v) for all ve V, A3.1) where the space V, the bilinear form a(v), and the linear form /, satisfy the assumptions of the Lax-Milgram lemma (Theorem 1.3). Confining ourselves to the case of conforming finite element methods, we consider a family (Vh) of subspaces of the space V, where it is understood that h is the defining parameter of the family and that h approaches zero (the parameter h will be given a specific meaning in Section 17). With each finite element space Vh is associated the discrete solution uh, which satisfies a{uh,vh) = l(vh) for all u,,e K„. A3.2) Then we shall say that the associated family of discrete problems is convergent or, equivalently, that convergence holds, if, for any problem of the form A3.1) posed in the space V, we have lim||u-Mj|=0, A3.3) ft-0 where || • || denotes the norm in the space V.
Section 13 Introduction to the finite element method 113 We are therefore interested in giving sufficient conditions for convergence and, as a first result in this direction, we have the following basic abstract error estimate. Theorem 13.1 (Cea's lemma). There exists a constant С independent of the sub- space Vh such that \\u-uh\\^C inf \\u-vj. A3.4) VheVh Consequently, a sufficient condition for convergence is that there exists a family (Vh) of subspaces of the space V such that, for each ueV, lim inf || и-и,, ||=0. A3.5) k-0 vheVh Proof. Let wh be an arbitrary element in Vh. It follows from A3.1) and A3.2) that a{u-uh,wh)=0. Using the same constants a,M as in A.3) and A.19), we have, for any vh e Vh, oc\\u-uh\\2 ^a{u-uk,u-uh) = a(u-u„,u-vh)^M\\u-uh\\ \\v-vh\\, and the conclusion follows with C = M/ot. □ Cea's lemma is named after Сёа [1964, Proposition 3.1], who proved it in the symmetric case. It was independently rediscovered by Varga [1966a], and extended to the nonsymmetric case by Birkhoff, Schultz and Varga [1968, Theorem 13]. Note that, when the bilinear form is symmetric, there is a remarkable interpretation of the discrete solution. Since a(u—uh,wh) = 0 for all whe Vh, it follows that uh is the projection over Vh of the exact solution u, with respect to the inner product a(v)- Therefore, we have in this case: a(u — uh,u — uh)= inf a(u — vh,u — vh). Using the K-ellipticity and the continuity of the bilinear form, we deduce ||и-мЛ<У(М/а) inf ||u-i>J|. Thus we have obtained a "better" constant than in the proof of Theorem 13.1, since we necessarily have M^tx. The simple, yet crucial, inequality A3.4) shows that the problem of estimating the error \\u — uh|| is reduced to a problem in approximation theory, i.e., to evaluate the distance d(u, Vh)= inf„,,eKh \\u — vh || between a function ue V and a subspace Vh с V. This explains why this problem will be a central theme of the next chapter, where we shall essentially prove results of the following type: Assuming appropriate smoothness on the function u, we shall show that the distance d(u, Vh) is itself bounded by a constant (which usually involves norms of higher order derivatives of the function u) times he, for some exponent /? > 0. We shall therefore obtain the
114 P.G. Ciarlet Chapter II additional information that, for a smooth enough solution u, there exists a constant C(u) independent of h such that ||и-илКС(и)Л'. A3.6) If this is the case, we shall say that the order of convergence is /?, or equivalently, that we have an 0(hp) convergence, and we shall simply write ||u-«J| = 0(h'). A3.7) Using more elaborated techniques, we shall also evaluate the difference {u — uh) in other norms than the norm of the space V (which is either the || • || li0 or the || • ||2>fi norm), such as the |-|0iO or |*|o,oo,n norms (cf. Sections 19 and 21 respectively), and we shall also call errors the corresponding norms \u — М),|о,д or |и—ujJo,x,a- Whereas a mathematician is generally satisfied with a sufficient condition for convergence such as that of Theorem 13.1, this condition rightly appears as a philosophical matter to many an engineer, who is much more concerned in getting even a rough estimate of the error for a given space Vh: For practical problems, one chooses often one, sometimes two, seldom more, subspaces Vk, but certainly not an infinite family. In other words, the parameter h never approaches zero in practice]
Chapter III Finite Element Methods for Second-Order Problems: The Basic Error Estimates Introduction In this chapter, we estimate various norms of the difference («—uh), where и е V is the solution of a second-order boundary value problem and uh e Vh is the discrete solution obtained in a subspace Vh of V. The first error estimate is based on Cea's lemma (Theorem 13.1): it consists in estimating the distance inf„heKl> \\u — vh || 1>n by means of the ZA-interpolant Пни of the solution u. This gives the error estimate ll«-W)illi,o<C||u-nA«||lin. Since we shall assume in this chapter that the set U is polygonal, it can be written as a union Q= (Jre^h T of polygonal sets T and straight finite elements (T, PT, XV), Te$~h, can thus be used for constructing subspaces Vh с V, as indicated in Chapter II. Hence the corresponding finite element method is conforming. Taking into account that we are using the norm || • ||li0 and that (ПИи)\т = Пти for all Te$~h (Theorem 12.1), we can write ll«-n*H||ll0 = | Z 1|и-Лги|||,г Therefore, the problem of finding an estimate of the error ||м—h4||1iD is reduced to the problem of estimating the "local" interpolation errors ||и —Яги||11Г. The solution of such "local" interpolation problems is the object of Sections 15 and 16, where in view of other future needs, we shall also estimate more general norms or seminorms of the difference (и—Пти). A typical, and crucial, result in this direction is that, for a finite element (Г, PT, IT) that can be imbedded in an affine family and whose PT-interpolation operator leaves invariant all polynomials of degree^k (equivalently, the inclusions Pk(T) с Рт hold), there exists a constant С independent of T such that hk+1 l»-tfr0|«.r<C-ZJrM* + i.r> O^m^k+l for alive Hk+i(T), Рт 1/2 115
116 P.G. Ciarlet Chapter III where hT— diameter of T, pT = supremum of the diameters of the spheres inscribed in Г, Such a result is proved (in a more general form) in Theorem 16.1. One key idea for obtaining such an estimate consists in first obtaining it over a reference finite element and then to convert it into an estimate valid for any affine- equivalent finite element. Another key idea is to use a basic result about Sobolev spaces, due to Deny and Lions, which pervades the mathematical analysis of the finite element method: Over the quotient space Hk+i(Q)/Pk(Q), the seminorm |*|t+i,n is a norm equivalent to the quotient norm. This result is proved in Theorem 14.1, for the more general Sobolev spaces Wm-P(Q). In practice, one often considers a regular family of finite elements, in the sense that the diameters hT approach zero, and that there exists a constant a independent of Г such that hT^apT. For such a regular family, the interpolation error estimate becomes (Theorem 16.2) \v-nTv\m>T = 0(hkT+1-m), O^m^k + 1, ifveHk+1{T). Hence, using Cea's lemma, we obtain in this fashion the error estimate (Theorem 18.1) \\u~uh\\un^C\\u~nhu\\ua = 0(hk), with h=maxTrfh h,ifueHk + 1 (Q). We also mention in passing the inverse inequalities (Theorem 17.2), a "technical" property of finite element spaces that is often useful in the process of getting error estimates. The range of applicability of the above error estimate is however limited since it requires that the solution и be smooth enough in order that its Xft-interpolant be well defined and that it be in the space Hk+1(Q). Fortunately, we show in Theorem 18.2 that, under the minimal assumptions that the solution и is in the space H1(Q) and that the spaces PT contain the space Pi(T), T e &~h) convergence still holds, i.e., one has limfc-0l|w —w*lli,n=0. Next, using a method due to Aubin and Nitsche (cf. the Aubin-Nitsche lemma (Theorem 19.1)), we show that there is in most cases an improvement in the estimate of the error \u—Н/,|о,п, in the sense that (Theorem 19.2) 1«-и„|0,п = О(йк+1). We then turn to various ways of estimating the error \u—«л1о,а>,я- After a first estimate, based on the estimate of the error \u — uh\0,n combined with an inverse inequality (cf. Theorem 19.3), we consider in Section 20 the situation where the discrete problem satisfies a discrete maximum principle, in the sense that /^0 => maxuA(x)^max{0, maxu^x)},
Introduction Basic error estimates 117 where / denotes the function appearing in the right-hand side of the partial differential equation of the associated boundary value problem. In the case of the operator (—А и + au) with a > 0 and n=2, it is shown (Theorem 20.1) that the discrete maximum principle holds for h small enough if there exists e>0 such that all the angles of all the triangles found in all the triangulations are ^(гтс — е); if fl = 0, it suffices that the angles of the triangles be ^j% (Theorem 20.2). Returning to the general case, we show that when the discrete problems satisfy a maximum principle, one has (Theorem 21.5): lim\u—uh\0 0,0 = 0, if ue W1'P(Q) with p>n, й->0 1"-"а1о,оо,я = 0(й), if и е W2-P{Q) with 2p>n. We then follow in Section 22 the penetrating method of weighted norms ofNitsche, who has shown that, if и e Wk+Ux(Q), |и-"/.1о,со,«- \глгик + 1 0{h2'e) for any e>0, if fc=l, 0(hk+1), if k>2, _@(h1~E) for any £>0, if k=l, \\u~~uh 1, oo,fi~ \r\/ik\ -с i^i @(hk), if k^2, where Но.со.я and || • \\li00yp represent the norms of the spaces L°°(f2) and IF1'00 (£2), respectively. These error estimates are established in Theorem 22.7 for fc= 1. It is worth pointing out that all the error estimates found in this chapter are optimal in the sense that, with the same regularity assumptions on the function u, one gets the same asymptotic estimates (or "almost" the same for the norms [•|0oo-ftand || • || i,oo,« when /c= 1) when the discrete solution щ е Vh is replaced by the ZA-interpolant Пьи £ Vh. In Section 23, we consider the finite element approximation of problems posed as variational inequalities, and in particular, we consider the approximation of the obstacle problem: Following an ingenious method due to Falk, we show in this case that the discrete solutions obtained with linear triangles satisfy (Theorem 23.2) \\u-uh\\ua=0{h). This result is itself a consequence of an abstract error estimate (Theorem 23.1), valid for general variational inequalities. Finally, we give in Section 24 a review of various extensions of the basic error estimates established in this chapter. It is only in the sixties that mathematicians, notably Mikhlin [1964,1971], showed real interest in the analysis of the Galerkin and Ritz methods. Although they were not aware of the engineers' contributions, it is interesting to notice that the approximate methods that they studied resembled more and more the finite element method, as exemplified by the basic contributions of Сёа [1964], Varga [1966b] (for the one-dimensional case), Birkhoff, Schultz and Varga [1968] (for the multidimensional, but still tensor product, case), Friedrichs and Keller [1966], where a finite difference method was analyzed as a finite element method on special
118 P.G. Ciarlet Chapter III triangulations. Then the outbreak came with the paper of Zlamal [1968], which is generally regarded as the first mathematical error analysis of the "general" finite element method as we know it today. Other "historical" references on the numerical analysis of the method are found in the introductory article of Oden. 14. The Sobolev spaces W"*@) and the quotient space Wk+1'p(Q)/Pk(Q) For general references on Sobolev spaces, see Adams [1975], Lions [1962], NeCas [1967]. An excellent introduction is given in Brezis [1983, Chapter 9]. Throughout this section, Q denotes a domain in U", i.e., a bounded open connected subset of W with a Lipschitz-continuous boundary. For any integer m^O, and any number p satisfying 1 ^p^ oo, the Sobolev space Wm,p{Q) consists of those functions ve LP(Q) for which all partial derivatives d*v (in the distribution sense) with |a| =%m belong to the space LP(Q). Equipped with the norm \S*v\pdx\ , if l<p<oo, INL.,.0=«r' о A4.1) max < ess sup |5si;(x)j >, if p = со, the space Wm,p(Q) is a Banach space. We shall also use the seminorms I |a =m J Up v\pdx> , if ls£p<co, «>1».р.о = < я A4.2) max < ess sup \d*v{x)\ >, if p = oo. ,М=» (. «я j The Sobolev space W$-P(Q) is the closure of the space 3>(Q) in the space Wm,p(Q). Given a subset A of W and given a function ие^%4), the notations IHL,,»,^ and \v\m<aDiA also denote the norm max|a|<m sup^^S'i^x)! and the seminorm rnax|a|=m supxe^|9aD(x)|, respectively. Notice that Wm'2{Q) = Hm{Q), \У%л{П) = Щ@), II ' llm,2,G= II ' \\m,D> \'\m,2,Q=\'\m,Q- We now record some basic properties of the Sobolev spaces that will often be used. In what follows, the notation Xo,Y indicates that the normed linear space X is contained in the normed linear space Y with a continuous injection, and the notation X с Y indicates that this injection is compact. Finally, for any integer m ^ 0 and any number a e ]0,1], <^m-'\Q) denotes the space of all functions in <<gm(Q) whose mth derivatives satisfy a Holder's condition with exponent a. Equipped with the
Section 14 norm Basic error estimates 119 N*™.«=l|tf|L,«,,6+ max sup |0| -m x,yeQ хфу \&Щх)-&х(у)\ \\х-У\\ where || • || denotes the Euclidean norm in R", the space Cm,"(U) is a Banach space. By the Sobolev imbedding theorems, the following inclusions hold, for all integers m^O and for all numbers p with l<p^ oo, f LP\Q) with 1 1 m p* p n L"(Q) forallge[l,oo[, ^•"(Q) forallO<a<l, if m < - , P ., и it m = - , P if- <m< - + 1, P P if m=- + l, P if - +1 < m. P A4.3) By the Kondrasov theorems, the compact injections Г 11m L«(Q) for all l^q<p* with — = ---, P* P n if m<- if m= -, P ., и if-<m, ^•'(fijc^ L«(fl) for all 9e[l, oo[, ifw=^ A4.4) hold for all l^p^oo. The compact injection which holds for any n by the Kondrasov theorems, is known as RellicKs theorem. Notice that analogous inclusions can be derived by "translating" the orders of derivations. Thus for instance, one has Wm+r,p(Q)c+ W'P*(Q) if m<(n/p), etc. We also note that, for l^p<oo, one has the important density property: {^x(Q)}' = Wm-p(Q). Remark 14.1. The assumption that Q is a domain in № is not necessarily needed for proving some of the above properties. For example, one can derive the compact inclusion W1,p{Q)<gLq(Q) for all l^q<p*, or the above density property, under weaker assumptions.
120 P-G. Ciarlet Chapter III Since an open set Q with a Lipschitz-continuous boundary is bounded, the space Pk(Q) is contained in the space Wk+i,p{Q), and thus it makes sense to consider the quotient space Wk+1'p(Q)/Pk(Q). This space is a Banach space when it is equipped with the quotient norm veWk+1'p(Q)/Pk(Q)-+\\v\\k+Up,n = inf ||»+p||t+1.,l0, A4.5) where v = {weWk+Up{Q);(w-v)ePk(Q)} A4.6) denotes the equivalence class of the element ve Wk+1,P(Q). Then the mapping ve Wk+1-p(Q)/Pk(Q)^\v\k+Up,a = \v\k+Up>a A4.7) is a priori only a seminorm on the quotient space Wk + 1,p(Q)/Pk(Q), which satisfies the inequality l4+i,„.fl^Nn.i.p.o fotd\veWk+1-p(Q)/Pk(Q). A4.8) To see this, observe that, for any polynomial pe Pk{Q), \\v + p\\k+Up,Q = {\v\k+l,p,Q+\\v + p\\iP,!2}ilP>\v\k+l,p,Q, with the standard modification for p = oo. It is a fundamental result that the mapping defined in A4.7) is in fact a norm over the quotient space, equivalent to the quotient norm A4.5). This result was first proved by Deny and Lions [1953-1954]. Theorem 14.1. Let k^O be an integer, and let pe [1, oo]. There exists a constant C(Q) such that inf ||o + p||lk + 1.^<C(Q)|»|t+liP.fl forallveWk+l'"(Q) A4.9) pePk(fi) and consequently, such that ll»llt+i.*o<C(G)l4+i.,.n forallveWk + 1'p(Q)/Pk(Q). A4.10) Proof. Let N = dim Pk{Q) and let f, 1 < i < N, be a basis of the dual space of Pk{Q). Thus, by the Hahn-Banach extension theorem, there exist continuous linear forms over the space Wk+ Up(Q), again denoted fhl^i^N, such that /,-(p) = 0, Ki^JV, implies p = 0 if pe Pk(Q). We will show that there exists a constant C(Q) such that \\v\\k+l,P,B^C(Q)(\v\k + Up,n+ Y\m\) foral\veWk + 1-p(Q). A4.11) Inequality A4.9) will then be a consequence of inequality A4.11): Given any function се Wk+Up{Q), let qePk{Q) be such that fi{v + q) = Q, lsgisSAT. Then by A4.11), inf \\v + p\\k + ltPiQ^\\v + q\\k + UPin^C(Q)\v\k+Up>i2, psPk(fi)
Section 15 Basic error estimates 121 which proves A4.9). If inequality A4.11) is false, there exists a sequence (t?,) of functions t?,e Wk+i-p{Q), such that NI* + i.,.o = l for all/>1, , N 1 A4.12) ИтЫк+ир,п+ Zl/»(o,)lHa Since the sequence (t?,) is bounded in W11+1,p@), there exists a subsequence, again denoted (t?,) for notational convenience, that converges in the space Wk,p{Q) (this follows from the Kondrasov or Rellich theorems for 1 ^p< со and from Ascoli's theorem for p = со). Since lim|t?,|t + 1,p,fi = 0, /-♦oo by A4.12), and since the space Wk+1,P{Q) is complete, the sequence (t?,) converges in the space Wk+Up{G). The limit t? of this sequence is such that |S°4o>p,g = lim |8"t?,|ofp,o = 0 for all a with |a| = к +1, (-►CO and thus 6at? = 0 for all multi-indices a with |a| = k+1. Since a domain is connected, by assumption, it follows from distribution theory (see Schwartz [1966, p. 60] that the function v is a polynomial of degree ^k. Using A4.12), we have /,(»)= lim/,(»,)=0; i->co hence we conclude that v=0 since vePk(£2). But this contradicts the equality IMI* + i,p,n = l for all/. □ 15. Estimate of the seminorms ||v—nTv\m^T for polynomial-preserving operators Пт One of our main objectives in this chapter is to estimate the interpolation errors \v — nTv\m<4iT and ||t?—nrt»||m>eiT, where Пт is the PT-interpolation operator associated with a finite element (T, PT, ZT). At other places, however, we shall need similar estimates, but in situations where Пт is a more general polynomial- preserving operator, i.e., Пт is not necessarily an interpolation operator. This is why we shall develop an error theory valid also for such general operators; we follow here Ciarlet and Raviart [1972a]. To begin with, we need a definition: We say that two open subsets Q and U of W are affine-equivalent if there exists an invertible affine mapping F:xeU"^F(x) = Bx + beU" A5.1) such that Q = F{U). A5.2)
122 P.G. Ciariet Chapter III As in the case of affine-equivalent finite elements (compare with A1.1) and A1.3)), the correspondences xeti-*x = F(x)eQ, A5.3) (v:fi-+U)^(v = v°F-1:Q->U), A5.4) hold between the points xeU and xefi, and between functions defined over the set U and the set Q. Notice that we have v(x) = v(x) A5.5) for all points x and x in the correspondence A5.3) and all functions v, v in the correspondence A5.4). Remark 15.1. If the functions v and v are defined only almost everywhere (as in the next theorem for instance), it is understood that relation A5.5) needs to hold for almost all points x e U, and thus for almost all points xeQ. We first show that, if Q and U are affine-equivalent, any Sobolev seminorm | ■ |mj)>0 (cf. A4.2)) is bounded above and below by the corresponding seminorm |*|т,р,й, times factors that depend on the matrix В of A5.1), and on the numbers m and p. Here and subsequently, || • || stands for both the Euclidean norm in W and for the associated matrix norm. Theorem 15.1. Let Q and U be two affine-equivalent open domains in W. If a function v belongs to the space Wm'p(Q)for some integer m ^ 0 and some number p e [1, oo], the function v = vF belongs to the space Wm'p(Q); in addition, there exists a constant С = С(т, n) such that \v\m,„,a^C\\B\\m\uttBr^\v\n^ forallveWm-p(Q), A5.6) where В is the matrix o/A5.1). Analogously, one has Ыт.г*^С\\В-1\Г\де1В\1»Щя,р.а forallveWm-"(Q). A5.7) Proof, (i) Let us first assume that the function v belongs to the space ^""(O), so that the function v belongs to the space ^тф). Since, for any multi-index a. with |a| = m, one has d*v(x) = Dmv{x)(eXi,eX2,,..,eJ where the vectors eai, 1 ^ i^ m, are some of the basis vectors of R", we deduce that |8«i5(x)|<||D"iS(Jc)||= sup \Dmv(x){^^2,,..,U\. Consequently, we obtain |«|=FB Up Up Dmv{x)\\pdx} , A5.8) я
Section 15 Basic error estimates 123 where the constant Ci{m,n) may be chosen as C1(m,n) = sup(card{ae Nm: \u\ = m}I/p. 1*SP Using the chain rule, we note that, for any vectors &e R", 1 <i<m, ОтЩШг, {2,..., U = Dmv(x№u B£2,..., B£J, so that \\Dmv{x)\\^\\Dmv(x)\\\\B\r, and, therefore, ' \\Dmv\ Dmv(x)\\pdx^\\B\\mp \\Dmv(F(x))\\" dx. A5.9) a Using the formula of change of variables in multiple integrals, we get ||/)mi;(F(x))||pdJc = |det£-1| \\Dmv(x)\\p dx, A5.10) й й and since there exists a constant C2(m, n) such that |[£>mi;(x)|| < C2(m, n) max |6афс)|, |«|=m we obtain \Dmv{x)\\p dx}'\c2{m,n)\v\m,pM. A5.11) Inequality A5.6) is then a consequence of inequalities A5.8)—A5.11). (ii) To complete the proof when p Ф со, it remains to use the continuity of the linear operator г: ve^m(Q)-^^e Wm-p(U) with respect to the norms || • \\т,р,а and II' II m.p.tf, the denseness of the space ^m(Q) in the space Wm-"(Q), and the definition of the (unique) extension of the mapping i to the space Wm'p(Q). (iii) Let us finally consider the case p= со. A function ve Wm,co(Q) belongs to the spaces Wm'p{Q) for all p < со (recall that a domain is bounded). Therefore, by (ii), the function v belongs to the spaces Wm'p{U) for all p< со, and there exists a constant C(m, n) such that |ЗЖр,й^1%1,р,й ^C(m, n)||B||W sup Met £Г1/р1Ит,р,я up for all p> 1 and for all multi-indices aeNm such that |a| <m. Since this upper bound on the seminorm |8ау|0,р,й is independent of the number p, it follows that, for each |a|<m, the function d"v is in the space L°°(i5) for each |a|<m. Consequently, the function v belongs to the space И"",С0(Й). То conclude, it suffices to use inequality
124 P.G. Ciarlet CHAPTER III A5.7), which holds for all p^l, in conjunction with the property that, for any function weL°°(£7}, w o.oo.n = Hm Mo,p,fi. p-* 00 (iv) Inequality A5.7) is proved in a similar fashion. □ In order to apply Theorem 15.1 it is desirable to evaluate the norms \\B\\ and || B~x || in terms of simple geometric quantities attached to the sets Q and U. This is the object of the next theorem, where we use the following notations: fc = diam(G). £=diam(d), A5.12) p = sup{diam(S); S is a ball contained in Q}, .. /5 = sup{diam(£); § is a ball contained in $}. Theorem 15.2. Let F:xe W-+F(x) = Bx + beU"bean affine mapping, and let U and Q = F(U) be two affine-equivalent bounded open subsets ofW. Then the upper bounds \\B\\^h/p, WB-'W^H/p A5.14) hold. Proof. We may write ||B||=4 sup ||B£||. P 11411=? Given a vector £ satisfying ||£|| =p, there exist two points y,ze{U)~~ such that у - z = £, by definition of p (Fig. 15.1). Since Bf = F( j>) - F{z) with F{y) e Q, F{z) e Q, we deduce that \\B£\\^h, and thus the first inequality of A5.14) is proved. The other inequality is proved in a similar fashion. □ We are now in a position to prove an important property of polynomial-preserving operators, i.e., that leave invariant some space Pk(U) (cf. A5.16)). Theorem 15.3. Assume that, for some integers k^O and m^O and some numbers p,qe [1, oo], Wk+1'P{Q) and Wm,\U) are two Sobolev spaces that satisfy Wk + 1*{U)<^Wm«{u), A5.15) and let ne^C(Wk + hp{U); Wm<\U)) be a mapping that satisfies Пр=р for allpePk{U). A5.16) For any open set Q that is affine-equivalent to the set U, let the mapping TlQ be defined by {navf = ne, A5.17)
Section 15 Basic error estimates 125 Fig. 15.1. Examples of affine-equivalent open subsets in R2. Two open subsets U and Q are affine- equivalent if there exists an affine mapping F:xe W->(Bx + b)eU"'S\xcb that Q = F(U). for all functions ve Wk+1,P(G) and ve Wk+1'"(Q) in the correspondence A5.4). Then there exists a constant С(П, Q) such that, for all affine-equivalent sets Q, \v-nQv\m,q^C{n,Q){m^{Q)Ylq~llp ~\v\k+1,P,n, A5.18) for alive Wk+1-'(Q\ where h and p are defined as in A5.12) and A5.13). Proof. Using the polynomial invariance A5.16), we obtain the identity v-fiv = (I-fi){v+p) ior&\\veWk+Up(U), pePk{U), where / denotes the identity mapping from Wk+1-"{U) into Wm,q(U), which is continuous by A5.15). From this identity we deduce that 1^-Ли|т>д,й^||/-Я||^(^+1,Р(й);^».,«(й)) inf ||t5+p|U+i,p,fl <С(П,й)Щк+ир#, A5.19) by Theorem 14.1. It follows from relation A5.17) that v — Uv ={v—IJnv}~, and therefore an application of Theorem 15.1 yields Iv-n^L^^CWB-'ridetBl^lv-nvU^. A5.20)
126 P.G. Ciarlet Chapter III By the same theorem, l%+i.p.ft<C||B||* + 1|detBr1"|»|i+i.,l0, A5.21) and thus, to obtain inequality A5.18), it suffices to combine inequalities A5.19), A5.20) and A5.21), the upper bounds \\B\\^h/p and ||В_1|| ^h~/p (Theorem 15.2), and, finally, to observe that |det B\ = meas@)/meas(O). □ Remark 15.2. A similar result can be established for polynomial-preserving operators that leave invariant the space Qk(Q) instead of the space Pk(Q). In this case, the seminorm \v\k+UPiD appearing in the right-hand side of inequality A5.18) can be replaced by the seminorm p ")i/p dx> . 84' This result follows from an abstract generalization of Theorem 14.1, which is due to Luc Tartar, and which may be found in Brezzi and Marini [1975], or in Ciarlet [1978, Exercise 3.1.1]. 16. Estimate of the interpolation errors |v—nTv\mqT for an affine family of finite elements In Theorem 15.3, let us now choose as a particular polynomial-preserving operator Пт the PT-interpolation operator associated with a finite element (Г, PT, ET) that is affine-equivalent to a finite element (f, P, Z). We obtain in this fashion the following result, which is essentially Theorem 6 of Ciarlet and Raviart [1972a]. Theorem 16.1. Let (t,P,t) be a finite element, and let s denotes the greatest order of the partial derivatives occurring in the definition oft. Assume that for some integers m^O and /c:>0 and for some numbers p,qe [1, со], the following inclusions hold: Wk+1'"(f)^^s(f), A6.1) Wk+1'"(f)c+ Wm'q{t), A6.2) Pk(T) с Р <= Wm'q(t). A6.3) Then there exists a constant C(T, P, t) such that, for all affine-equivalent finite elements (T, P, £), and all functions v e Wk + 1'P(T), Iv-nTvL^r^CifJ^^meas^y^-^^-M^^r, A6.4)
Section 16 Basic error estimates 127 where TITv denotes the PT-interpolant of the function v, and mesLs(T) = dx-measure of T, hT = dmm(T), A6.5) /9T = sup{diam(S); S is a ball contained in T}. Proof. The inclusion Pk(t) с Р in conjunction with the fact that the P-interpolation operator П reduces to the identity over the space P (cf. A0.8)) implies that Пр = р tora\\pePk{T). A6.6) Let v be a function in the space Wk+1,P(T). Then v belongs to the space dom77 = ^s(f) (cf. A0.7)) since the inclusion Wk+1-P(f) a <<?s(f) holds. For definiteness, let us assume that s = 2 (recall that in practice, s = 0,1 or 2) so that the P-interpolant of the function v takes the form nv = Y^f)P? + I{Dv(^)Uk}pk+ Z {D2v(af)(HU)}pfkl. A6.7) i i,k i,kj We now show that the linear mapping П: Wk+Up(T)->Wm'q{f) denned in this fashion (recall that fiv s P and that P с Wm-q{t) by A6.3)) is continuous. To see 4his, we observe that, by A6.7), ii^»iL.,.f<ii^?iiip?iimf„f+2:i{D^/)a}iiiPiL.,.f i i,k + ^\{D,v{aMl,m\\\PlAm,q,T <С(||р?||и,в,г, \\Ш\ \\Pk\\m,g,t, \\Ш \\Ш \\Ph\Ut.TWh.*>.t, and thus, by A6.1) \\fi4m*.f<C(t,P,£)\\v\\k+itP,t. Since the PT- and P-interpolation operators are related through the correspondence {nTv}~ = Ш for all v e dom IJT (cf. A1.10)), we may apply Theorem 15.3. Hence inequality A6.4) is just inequality A5.18) adapted to the present situation. □ Remark 16.1. The factor {meas(T)}1/?"/p found in A6.4) may also be expressed in terms of the parameters hT and pT by means of the inequalities {nTv} = Ш for all v £ dom Пт where /?„ denotes the dx-measure of the unit ball in R". Theorem 16.1 thus gives us an estimate of the interpolation error \v — IITv\m^T, which is valid for all finite elements that are affine-equivalent to the same finite
128 P.G. Giarlet Chapter III element (T, P, t). In other words, the estimate of Theorem 16.1 is valid for all finite elements of an affine family, according to the definition given in Section 11. We now show that it is possible to dispose of the parameter pT in the estimate A6.4) provided we restrict ourselves to finite elements that do not become "fiat". More specifically, we shall say that a family of finite elements (T, PT, ET) is regular if the following two conditions are satisfied (for notational brevity, Г is viewed here as the parameter that defines the family): A) there exists a constant a such that —<<r for all T; A6.8) Pt B) the family (hT) is bounded and 0 is its unique accumulation point; by an abus criant de notation, we shall record the latter properties as /гг-0. A6.9) When an affine family is regular, the interpolation error estimate of Theorem 16.1 can be immediately converted into simple estimates of the norms \\v — nTv\\m<qtT, Theorem 16.2. Let there be given a regular affine family of finite elements (T, PT, ET) whose reference finite element (T, P, t) satisfies assumptions A6.1), A6.2) and A6.3). Then there exists a constant C(t, P, I) such that, for all finite elements (T, PT, IT) in the family and all functions v e Wk+l'p(T), \\v-nTv\Uq,T<C(f,Pj){m™s(T)}ll'l-UphkT+1-m\v\k+Up,T. A6.10) Remark 16.2. Only the boundedness of the diameters hT is used here, in conjunction with inequality A6.8). For a family of triangular finite elements, condition A6.8) is equivalent to Zlamal's condition (Zlamal [1968]) that there exists a constant 60 such that вт^во>0 for all Г, A6.11) where for each triangle T, 6T denotes the smallest angle of T. In order to get a more concrete understanding of the estimates of Theorem 16.2 we have recorded in Fig. 16.1 estimates of the interpolation error \\v — nTv\\m:T (p = q = 2) tor various finite elements that can be imbedded in regular affine families. Note that, if the function v lacks the "optimal" regularity assumed in Fig. 16.1 ("optimal", in the sense that it yields the highest possible exponent of hT in the interpolation error estimate), weaker estimates may still hold, provided the PT-interpolant is still defined. Assume for instance that we are considering cubic triangles, or cubic tetrahedra, and that the function v is only in the space H2(T). Since H2(T) c+ <£°(T) for n^3, Theorems 16.1 and 16.2 can still be applied with /c=l; this yields ||u — nTv\\m)T =0{hr~m) for 0<ms?2. If we were considering Hermite cubic triangles, or Hermite cubic tetrahedra, however, these theorems could not be applied, since the space H2(T) is not contained in the space <5?1(Г) (except for n=l).
Section 16 Basic error estimates 129 !lv-llTvilm/r 0<m<2 (k = 1) 0(h^-m) 0«m«3 (k = 2) 0(h^--m) 0<m «4 (k = 3) Regularity of the function v H2(T) hj(T) H"m Upper bound on the dimension n, to insure that Hk+1(T)CtS(TI n «3 (s = 0) n« 5 (s = 0) n<3 <s= 1) n « 7 (s = 0) n< 5 (s= II h<3 (s = 2) Simplicial finite elements Rectangular finite elements Ш I'M I_! ' " Fig. 16.1. Examples of interpolation error estimates ||» — nTv\\mT for finite elements that are imbedded in regular affine families. It is worth mentioning at this point that a different, and more constructive, approach to interpolation theory for affine-equivalent finite elements can be also developed, along the following lines: Let (T, P, E) be a Lagrange finite element such that the inclusion Pk(T) с Р с 4f*(T) A6.12) holds for some integer fc>0, and let there be given a function v е <gk+1(T). If a and x are two points in the set T (assumed here to be convex), Taylor's formula with integral remainder gives us: v(a) = v(x) + Dv(x)(a-x)+ ■■■ +—D"v(x)(a-xf+ Rk(v;a,x), A6.13)
130 P.G. Ciarlei Chapter III where Rk(v;a,x) = - (l-tfDK+1v{ta+(\-t)x)(a-x)k+1dt. A6.14) Let Г = {р(а(): l^i^N}, and let p(, l^i^N, denote the associated basis functions. It can then be shown (cf. Ciarlet and Raviart [1972a]) that the associated P-interpolation operator satisfies JV Dm(nv-v)(x}= ^Rk(v;ai,x)Dmpi(x), O^m^k, for all xeT. A6.15) i = l Note that, for m=0, relation A6.15) reduces to a multipoint Taylor formula N N v(x)= £ o(e()p,(x)- X Rk{%ai,x)pi{x), i=i i=i that Ciarlet and Wagschal [1971] already used for estimating the interpolation errors associated with certain types of finite elements. In the same spirit, Coatmelec [1966] had earlier made an in-depth study of polynomial approximation of Taylor fields, viewed as constructive solutions of Whitney's extension theorem. From A6.14) and A6.15), it then follows that sup||D"(nu-»Xx)|| xeT ^_!_SUp||D* + 1o(x)|| sup{£ k-xir^supHZTpiMli- A6.16) (fe+1)! xeT *eT [i=1 J xeT We also note that there exists a constant C(m, n), which can be estimated, such that l<a>,r^up]|£mt<x)KC(m,n)Mm)tc,T for all i;e<r(T), A6.17) xeT and that, if (f, P, 1) denotes an affme-equivalent finite element, we have \\Dmpi(x)\\^\\Dmpi(x)\\\\B-1\r A6.18) (cf. the proof of Theorem 15.1). Hence A6.16H16.18) yield a sharp estimate of the constant C{T, P, I) that appears in the inequality \v-nv\m^,T < C{f, P, t)(hkT+ l/pT) \v\k+ 1>00,r, A6.19) which is nothing but the special case p = q = ao of inequality A6.4). As shown in Ciarlet and Raviart [1972a], this analysis can be also extended to affine-equivalent Hermite finite elements of the type considered in Theorem 16.1. Using estimates of the norms \Rk{v; а, ')\0,р,т, Arcangeli and Gout [1976] and Gout [1977] have likewise obtained sharp estimates of the constant C(T, P, I) that appears in the inequality A6.4) when p = q. For instance, they have obtained the following estimates: For linear triangles, 1«-Лг»Цг^З(Л?./р50М2.г for m=0,1;
Section 17 Basic error estimates 131 for quadratic triangles, \v-nTv\o,T^2hT\v\3iT, \v-nTv\UT^6(h\lpT)\v\XT, \v-nTv\2,T <9(hl/p$)\v\3,T. 17. Interpolation and approximation properties of finite element spaces We are given a polygonal domain BcW, a family (&~h) of triangulations Q = [jreshT and an associated family (Xh) of finite element spaces made up of finite elements (Г, Рт, £т)те!гь- We shall consistently make three basic assumptions about the triangulations and the finite elements, according to the following definitions: (HI) A family of triangulations $~h is regular if there exists a constant a such that — <<r for all Ге (J*^i, A7.1) Рт and if the quantity h = maxhT A7.2) TsSi, approaches zero. In other words, the family formed by the finite elements (T, PT, ET), Te{Jh$~h, is a regular family of finite elements, in the sense understood in Section 16. Remark 17.1. There is of course an abus de notation about h, which was first considered as a defining parameter of both families (J~h) and {Xh), and which is now specifically defined in A7.2). (H2) All the finite elements (T, Рт, Ет), Те {Jh&~h, ore affine-equivalent to a single reference finite element (T, P, I). In other words, the family (T, Рт, Ет), Те \Ji,&~h, is an affine family of finite elements, in the sense understood in Section 11. (H3) All the finite elements (T,PT,ZT), Te(J;,,^"k, are of class <ti>°, in the sense understood in Section 12. We first prove an approximation property of the associated family of finite element spaces Vh (Theorem 17.1), from which we will later derive an estimate for the error || и — ил|| 1,д (Theorem 18.1). Whenever no confusion should arise, we use in the sequel the same letter С to denote various constants which are not necessarily the same at their various occurrences. It is an easy exercise, which is left to the reader, to derive more general interpolation error estimates, obtained by replacing A7.3)— A7.4) by the more general assumptions A6.1)—A6.3).
132 P.G. Ciarlet Chapter III Theorem 17.1. In addition to (HI), (H2) and (H3), assume that there exist integers к and I with 0^1 <fc, such that the following inclusions are satisfied: Pk(T)^PczHl(T), A7.3) Hk+\T)c><$s(T), A7.4) where s is the maximal order of partial derivatives occurring in the definition of the set 2. Then there exists a constant С independent of h such that, for all functions veHk+\Q)nV, \\v-nkv\\mM^Chk + 1-m\v\k+US2, 0</nsSmin{U}, A7-5) ")l/2 E \\v-nhv\\i,T\ ^Chk+1~m\v\k+un, 2<m<min{k+U}, A7.6) where nhve Vh is the Xh-interpolant of the function v. Proof. Applying Theorem 16.2 with p = <j = 2, we obtain \\v-nTv\\m,T^ChkT+1-m\v\k+UT, 0<m^min{fc+U}. Using the relations (IIhv)\T = nT{v\T), Те $~h (cf. A2.11)) and the inequalities hT^h, Te9~h{d. A7.2)), we get I 1/2 Г 1 1>2 { X ||с-Л»<г}1/2<СЙ* + 1-"{ £ \v\2k + 1,T = Chk+1-m\v\k+USi, (Kms£min{fc+U}. Thus inequalities A7.6) are proved, and inequalities A7.5) likewise follow, since ")l/2 E \\v-nkv\\i,A = И»-ВД„.о, ~ТеГ„ J for m = 0 and for m = l (when f^l), and since the inclusions Рсй'(Г) and Xfcc^°@) imply ХнсЯНО) (Theorem 5.1). □ Remark 17.2. Analogous interpolation error estimates hold if the function v is only in the spaces <es(Q)rJlTe^hHk+1(T)nV. It suffices to replace the seminorm |i4+i,a by the seminorm {Хгея, Mi?+1 г}1/2 m the right-hand sides of inequalities A7.5) and A7.6). While the approximation properties of finite element spaces obtained in Theorem 17.1 rely on the interpolation theory developed in Section 16, a different, "interpolation-free", approximation theory is also possible. Let us describe such an approach, due to Clement [1975], and further generalized by Bernardi [1986]: The objective is to construct an operator rh:ve Hl(Q)-*rhve Xh, whose approximation properties are similar to those of the .^-interpolation operator Пн, even when the X,,-interpolant TIhv is not defined (for example, when /=0). To fix ideas, consider a finite element space XH made up of linear triangles (the
Section 17 Basic error estimates 133 analysis can be extended to triangles of degree k). With each vertex bt, 1< i<M, of the triangulation, we associate as usual the basis function wt e Xh defined by the relations wi(bJ) = 5ij, 1 <j^M, and we let S; = suppw,-. A7.7) Given a function veL2(Q), let Ptv denote the L2(S;)-projection of v on the subspace Pi(S,), l^i^M, i.e., Ptv satisfies РреРШ A7.8) (v-Ptv)pdx = 0 for all pgPi№), S,- and let the operator rh be defined by м rh:veL2(Q)^rhv = £ Рф^е Xh. A7.9) Then the operators rh: L2(Q)-+Xh associated with a regular family of triangulations satisfy the following approximation properties (as usual, С denotes various constants independent of h): For all functions v e L2(Q), lim\v-rhv\0ji = 0, \v-rhv\o,a**Qv\0,al A7.10) A->0 for all functions veH1(Q), lim|»-r4»|lfl=0, Ic-rjrUo^Cft1-"!»!^, m = 0,l; A7.11) for all functions v e H2{Q), \v-rHv\m,n^Ch2-m\v\2,Q, m=0,l, { £ |i;-r^|ifij1/2^C|t;|2,fl. A7.12) (гея, J There are other ways of defining an "interpolation-free" approximation theory for finite element spaces; see Hilbert [1973], Pini [1974], and, in particular, Strang [1972a], who cleverly adapts to finite element spaces the regularization by convolution procedure. See also the interesting "иои/оса/" finite element proposed by Scott [1976c]. There has been a considerable interest in interpolation theory and approximation theory in several variables during the past decades, one reason behind this interest being the need of such theories for studying the convergence of finite element methods. Special mention must be made of the pioneering works of Polya [1952] and Synge [1957], who considered what we call here bilinear rectangles and linear triangles, respectively. The "classical" approach consists in obtaining error estimates in ^-norms. In this direction, see the contributions of Barnhill and Gregory [1976b], Barnhill and Whiteman [1973], Birkhoff [1971, 1972], Birkhoff, Schultz and Varga [1968], Carlson and Hall [1973], Ciarlet and Raviart [1972a], Ciarlet and
134 P.G. Ciarlet Chapter III Wagschal [1971], Coatmelec [1966], Leaf and Kaper [1974], Nicolaides [1972, 1973], Nielson [1973], Schultz [1969, 1973], Strang [1971, 1972a], ZenJsek [1970, 1972, 1973], ZlAmal [1968, 1970]. Although in most cases a special role is played by the canonical Cartesian coordinates, a more powerful coordinate-free approach, using Frechet derivatives, can be developed, as in Coatmelec [1966], Ciarlet and Wagschal [1971], Ciarlet and Raviart [1972a], where, as noted earlier in Section 16, the interpolation error estimates are obtained as corollaries of multipoint Taylor formulae. See also le Mehaute [1981, 1984], Apprato, Arcangeli and Manzanilla [1987], for further extensions, Another frequently used tool is the kernel theorem of Sard [1963]. Some authors have obtained realistic estimates of the constants that appear in the interpolation error estimates. See Arcangeli and Gout [1976], Atteia [1977], Barnhill and Whiteman [1973], Gout [1977], Meinguet [1975,1977,1978,1979, 1981, 1984], Meinguet and Descloux [1977], Sanchez and Arcangeli [1984]. The approach in Sobolev spaces that we have followed here has been given much attention. In this respect, we quote the pioneering contributions of Bramble and Hilbert [1970, 1971] and Bramble and ZlAmal [1970]. Other relevant references are Aubin [1967a, 1967b, 1968a, 1968b], Babuska [1970b, 1972b], Birkhoff, Schultz and Varga [1968], Bramble [1970], Ciarlet and Raviart [1972a], Fix and Strang [1969], di Guglielmo [1970], Hedstrom and Varga [1971], Koukal [1973], Nitsche [1969, 1970], Schultz [1969], Varga [1971]. Interesting connections between interpolation theory and spline theory can be found in Atteia [1975], Mansfield [1972b], Nielson [1973], Sablonniere [1987], Boattin [1988] and, especially, Duchon [1976, 1977]. The dependence of the interpolation error estimates upon the geometry of the element (through the parameters hT and pT) generalized Zlamal's condition, as given in ZlAmal [1968,1970], and the "uniformity condition" of Strang [1972a]. Jamet [1976a] has also shown that, for some finite elements at least, the regularity condition given in A6.8) can be replaced by a less stringent one; the same condition has been simultaneously and independently found in a special case by Babuska and Aziz [1976]. If triangles are considered, this condition states that no angle of the triangle should approach % in the limit while by the present analysis no angle should approach 0 in the limit. Incidentally, this was already observed by Synge [1957]. The classical Jackson-Bernstein-Zygmund theory for trigonometric approximation has been extended to finite element approximations by Widlund [1977], who has shown that approximation error estimates of a certain order with respect to h imply conversely that the approximated function has some specific smoothness. Another constructive approach to approximation theory, based on appropriately averaged Taylor expansions, has been used by Dupont and Scott [1978,1980], who notably obtained error estimates in fractional order Sobolev spaces; see also Sanchez and Arcangeli [1984]. The Banach space interpolation method of Lions and Peetre [1964] has been used by Bramble and Scott [1978] and Scott [1979], who have shown that approximation estimates in the different norms of a same "Banach scale" may be all derived from the estimates expressed in the "highest-order" norm.
Section 17 Basic error estimates 135 To conclude this section, we record another basic assumption on triangulations, which was not needed thus far, but which is crucial for establishing some refined error estimates (see in particular Theorem 19.3 and Section 22): (H4) A family of triangulations 2Th satisfies an inverse assumption if there exists a constant v such that -^-<v for all Te\J3Th. A7.13) nT h A regular family of triangulations that satisfies an inverse assumption (i.e., (HI) and (H4) are satisfied) is called quasi-uniform. For such families, we are able to estimate the equivalence constants between Sobolev seminorms (we recall that a is the constant that appears in the regularity assumption; cf. A7.1)). Theorem 17.2. Let there be given a family of triangulations that satisfies hypotheses (HI), (H2) and (H4), and let there be given two pairs (I, r) and (m, q) with l,m^0 and {r,q)€ [1, oo] such that l^m and P<=Wl'r(f)nWm'q(T). A7.14) Then there exists a constant С — C(a, v, /, r, m, q) such that ]i/e £ ( yir I Ыт,в,Г j < (fcB)m,{0,l/r-l/«}ft»-i 1 E Nk.r j M M Vh £ *h, A7.15) ifp,q<oo, with max^L^r in lieu of \ £ \vh\qm,q,T} , if <? = <», max Ы,оо,г in lieu of < £ М^Л , if r = со. 1/r Te2Th Proof. Given a function vh e Xh and a finite element Г e ^, we have by assumption (H2) and by Theorem 15.1, l»rl/.,.f <C||Br||l|detBr|-1/'|f*li.r,7-, I^L^r^CIIBr^ridetBrl^lPrL.,.^ l ' where the function vT is in the standard correspondence with the function vh\T. Define the space N = {peP:\p\lrt = 0}J{V> , if' = °' xy imr,r ' 1РпР,_,(Г), if/^1.
136 P-G. Ciarlet Chapter HI Since l^m by assumption, the implication peN => \р\ял,т = 0 holds, and therefore the mapping ре?/Я^\\Р\\тЛ.т=ШР-5\тл,т seN is a norm over the quotient space Pj$. Since this quotient space is finite- dimensional, this norm is equivalent to the quotient norm IMkr.r and therefore there exists a constant С = C(l, r, m, q) such that \PL,q,T = \\nm,q,T^C\\p\\i,r,T = C\P\i,r,i forallpeP. A7.17) Taking into account assumptions (HI) and (H4), we obtain from inequalities A7.16), A7.17), and from Theorem 15.2, /•LnU/e-l/r |f»L.,.T<C(ff, v) ~^r- кЬ,г,т. A7.18) Assume first that q = со, so that there exists T0e$~h such that (й")~1/г (hnyllr ™мК|м,со,г = |ра|И|Я,,Го<С-^гт-|р>|,,г>Го<С fem_; |i>t|,„ by A7.18). Assume next that <j<co. We deduce from inequality A7.18) that )l/4 (WU/e-l/r С )l/q Then we distinguish three cases: (i) r^q, so that ) 1/8 f ") 1/r TeSTh J (.Ts^h J by Jensen's inequality; (ii) q<r< oo, so that I Ы?,„г ^l"-1" I k|I,,r[ with JTfc = cardjr^C((j,v)//in, by Holder's inequality; (iii) r= со, in which case "j i/e Z KliU.il ^Jr^'maxlonlj^.r, and inequality A7.15) is proved in all cases. □
Section 18 Basic error estimates 137 Inequalities of the form A7.15) may be immediately converted into inequalities involving the seminorms |*|m>e>n or \-\i,r>n if it so happens that the inclusions Xh с Wm'q(Q) or Xh с Wlr(Q) hold. For example, let us assume that assumption (H3) is also satisfied and that the inclusion P<^Hv{t) holds, so that the inclusion XhcV°{Q)r\H\Q) holds. Then we have С Klo,«,,a<^7iklo,G for all «„б**, A7.19) С Ni,fi*S ^|»*1ол for all v„eXh, etc. A7.20) If hypothesis (H3) is satisfied and if the inclusion PcWlcc(T) holds, then we get similarly С Ni.oo,c*$^No.c0,fl forall^eA"*, etc. A7.21) Clearly, similar inequalities between norms can be also derived from these inequalities. For instance, we deduce from A7.20) that Ыкя^Ыся ioT&llvk€Xh. A7.22) Inequalities such as A7.19)—A7.22), and the more general inequality A7.15), bear the generic name of inverse inequalities (whence the terminology "inverse assumption" in (H4)). Inverse inequalities can be likewise established which involve other seminorms or norms: For instance, let there be given a family of triangulations that satisfies (H2) and (H4), and assume that Рс<ё°{Т). Then for each pe [1, со], there exists a constant C(p) such that HLp(n^-j^jf\vh\o,P,i2, etc. A7.23) Inverse inequalities have appeared at many places; see notably Descloux [1973]. Remark 17.3. Usually, inverse inequalities have no "continuous" counterpart (consider e.g. inequalities A7.19)—A7.23)), as reflected by a factor in the right-hand sides that tends to infinity as h tends to zero. 18. Estimate of the error ||и—«*|Imj when the solution и is smooth and sufficient conditions for limA^01| и—иА [[ 1>ft=0 when ueH1(ii) Let there be given a second-order boundary value problem, posed over a space V that satisfies the usual inclusions Ho(Q)c V = V сЯ'(й). A basic assumption throughout the remainder of this chapter will be that the set Q is polygonal, essentially because such an assumption allows us to exactly cover the set Q with polygonal finite elements. Then with any such finite element and with any triangulation &~h of Q, we associate a finite element space Xh, and we define an
138 P.G. Ciarlet Chapter III appropriate subspace Vh of Xh which takes into account the boundary conditions contained in the definition of the space V (cf. Section 12). Since Vh is then included in the space V, we are using a conforming finite element method. Another basic assumption will be that the space Vh contains the X h-interpolant of the solution и of the boundary value problem; in this respect, see Section 12, where the special cases Vh=Xk^V = H1(Q) and Vh — Xoh^V=Hi{Q) have been thoroughly discussed. If the space V is of the form F={D€H1(fl);c = Oonf0}, and if the subset Г0 of Г can be exactly covered by a union of faces of finite elements, it is easily seen that the space Vh = {vheXh;vh = 0 on Г0} still contains the XA-interpolant of u. By contrast, this is not true if we are approximating a nonhomogeneous Dirichlet problem; such a problem requires specific approximations, which will be briefly described in Section 24. We now prove our first convergence result; we recall that assumptions (HI), (H2), (H3) have been defined in Section 17. Theorem 18.1. In addition to (HI), (H2) and (H3), assume that there exists an integer к > 1 such that the following inclusions are satisfied: Pk{T)dPc:Hl{T), A8.1) Як+1(Г)с^(Г), A8.2) where s is the maximal order of partial derivatives occurring in the definition of the set t. Then if the solution ueV of the variational problem is also in the space Hk+ 1(Q), there exists a constant С independent of h such that \\u-uh\\1<n^Chk\u\k+UQ, A8.3) where uhe Vh is the discrete solution. Proof. It suffices to use inequality A7.5) with v = u and m= 1, in conjunction with Cea's lemma (Theorem 13.1); this yields ll«-«*lli,n^C inf \\u-vh\\UQ^C\\u-nhu\\un. □ The error estimate || и — uh ||: Q < Chk\u\k +1 n of Theorem 18.1 has been established under the assumptions that the solution и is sufficiently smooth (ue Hk+ 1(Q) for some /c^l) and that the Xh-interpolant TIhu exists (cf. the inclusion Hk+l(T)c+<tf\T), which is satisfied if к > \n — 1 + s). If these hypotheses are not valid, it is still possible to prove the convergence of the method if the solution и "only" belongs to the space H1(Q) and if the "minimal" inclusions A8.4) below hold. One should notice that neither the inclusions A8.4) nor the assumption s^l in the next theorem are restrictive in practice for second-order problems.
Section 18 Basic error estimates 139 Theorem 18.2. In addition to (HI), (H2) and (H3), assume that the inclusions PiCftcPcH^t) A8.4) are satisfied, and that there are no directional derivatives of order ^ 2 in the set t. Then lim||u-»A[|ljfi=0. A8.5) /i-»0 Proof. Define the space Г =W2'x{Q)r\V. A8.6) Since the inclusions A8.1) and W2'">(K)c+<g%g), s=0orl, W2'<°(K)c+H\K), hold, we may apply Theorem 16.2 with fc = l, p = oo, m = \, q—2: There exists a constant С such that \\v-IlTv\\UT^C{meas(T)}ll2hT\v\2,x,T for all сетГ. Therefore, \\v-nhv\\un = \ Y Iv-nMlX'^ChimeasiQ)}1'2^^, and thus we have proved that lim||!>-n»i>||liO = 0. A8.7) fi-0 For all h and all nef", we can write 1|и-ад1>0 < ll«-»||i,o+ \\v-nhv\\uii. A8.8) Given the solution ueV and any number e>0, there exists a function p£ef that satisfies the inequality ||и—vell i,n^ie» smce tne space 'V is dense in the space V. Then by A8.7) there exists an /i0(e) such that \\ve — nhve\\li(i^js for all h^h0(e). Hence ||m—nhvE\\1<n^s by A8.8), and thus lim inf ||u — vh\\=0. Л-0 vheVh Then the conclusion again follows from Cea's lemma (Theorem 13.1). □ Remark 18.1. An inspection of the above proof shows that the choice A8.6) for the space "У is the result of the following requirements: On the one hand -f" must to be dense in the space V; on the other hand the value к = 1 is needed in order to apply Theorem 16.2 so as to obtain property A8.7) with the assumption Р1(Г)<=Р. Therefore the space У must contain derivatives of order 2 (this condition limits in turn the admissible values of s to 0 and 1) and consequently one is naturally led to the choice A8.6). In fact, any space of the form У = W2 'P(Q) n V with p sufficiently large, would have also been acceptable, as one may verify.
140 P.G. Ciarlet Chapter III Remark 18.2. While we have used here a "density argument", based on the interpolation theory of Section 16, an "interpolation-free" approach is also possible, ■ since the estimates of A7.11), used in conjunction with Cea's lemma, provide another proof of Theorem 18.2. Remark 18.3. (i) Using a priori estimates in various norms on the solution (cf. e.g. NeCas [1967] and Kondratev [1967]), it is possible to get error estimates that depend solely on the data of the problem. See Bramble and Zlamal [1970], Nitsche [1970], Oganesjan and Rukhovets [1969]. (ii) In the case of the equation - Ли = f over a rectangle, Barnhill and Gregory [1976a] obtain theoretical values for the constants that appear in the error estimate, and these values are realistic, as shown in Barnhill, Brown, McQueen and Mitchell [1976]. (iii) Error estimates expressed in mesh-dependent norms have been obtained by Babuska and Osborn [1980]. (iv) Using the theory of n-widths, BabuSka and Aziz [1972, Section 6.4] have discussed whether the estimate of the error \\u — uA||lifi is indeed the best possible. 19. Estimate of the error \u—«Aj0,n when и is smooth and the Aubin-Nitsche lemma, first estimate of the ИТОГ I" н*1о,оо,Я In Theorem 18.1 we have given assumptions that insure that ^u — uh\\ua = 0(hk); hence the error | и — щ 10;fl is at least of the same order. We now show that, under mild additional assumptions, one has in fact \u — uh\Q<il = 0{hk +1). To this end, we begin by defining an abstract setting adapted to this type of improved error estimates: Let there be given a normed vector space V, with norm |j • ||, and a Hilbert space with norm | • | and inner product (•, •), such that Р=Я and Vc+H (in the applications we have in mind, we shall have typically Hq(Q)c VczH1(Q) and H = L2(Q)). Our first observation is that, if the space H is identified with its dual space, the space H may then be identified with a subspace of the dual space V of V. To see this, let / e Я be given. Since Vc H with a continuous injection г, we have l(/,»KI/IM<IMII/INI for alb e К and thus the mapping v e V-+(fi v) defines an element Je V. The mapping fe H-tJe V is an injection for if (/, v) = 0 for all veV, then (/, i>) = 0 for all v e H since V is dense in Я, and thus / = 0. We may therefore identify / and /, i.e., we shall write (/») = /(») for all fe H, veV. A9.1) Remark 19.1. Some care should be exercised when making such identifications, in this respect see the discussion given by Brezis [1983, p. 81].
\u-uh\^M\\u-uh\\[ sap {— inf; ||q>e-q>h\\ \), A9.5) Section 19 Basic error estimates 141 We next prove an abstract error estimate. Making the same assumptions as in the Lax-Milgram lemma (Theorem 1.3), we let as usual ueV and uhe Vh denote the solutions of the variational problems a(u, v) = l{v) for all veV, A9.2) a(uh, »»)= l(vh) forall^e^. A9.3) respectively. We recall that M denotes an upper bound for the norm of the bilinear form a(v)(cf. A.19)). Theorem 19.1 (Aubin-Nitsche lemma). Let the assumptions and notations be as in the Lax-Milgram lemma. In addition let Я be a Hilbert space, with norm | • | and inner product (% •), such that V=H and Vc*H. A9.4) Then one has '_1_ ygeH l\g\ <PheVh ' where, for any ge H, cpge V is the unique solution of the variational problem: a(v,q>g) = (g,v) forallveV. A9.6) Proof. To estimate \u — uh\, we shall use the characterization \u - uh\ = sup \{g, и - uh)\/\g\. A9.7) Using the identification A9.1) we can solve problem A9.6) for all ge H (the proof is exactly the same as that of the Lax-Milgram lemma). Since (u — uh) is an element of the space V, we have in particular Ф~Щ,(рд) = {д, u-uh) on the one hand, and we have a(u — uh, (ph)=0 for all q>he Vh on the other. Combining these relations, we obtain (g,u-uh) = a(u-uh,cpg-(ph) for all cphв Vh, and therefore, |@,и-и»)|<М||и-и*|| inf \\<pt-<ph\\. A9.8) PheVh Inequality A9.5) is thus a consequence of the characterization A9.7) and of inequality A9.8) □ An inspection of the above proof shows that cpg has to be the solution of A9.6), i.e., of a problem where the arguments are interchanged in the bilinear form. Problem
142 P.G. Ciarlet Chapter III A9.6) is a special case of the following variational problem: Given any element g e V, find an element cpeV such that a(v, cp)=g(v) for all veV. A9.9) Such a problem is called the adjoint problem of problem A9.2). When solving the variational problem A9.2) amounts to solving, at least formally, a second-order boundary value problem (i.e., when V с Hl(Q)), it is easily verified that solving its adjoint problem A9.9) also amounts to solving a second-order boundary value problem, which is in general different, however (unless of course the bilinear form is symmetric). As we shall see, the abstract error estimate of Theorem 19.1 will yield an improvement of the order of the estimate of the error |w —uh\0,Q, but only for a restricted class of second-order problems, which we now define: A second-order boundary value problem whose variational formulation is A9.2), or A9.9), is said to be regular if, for any f e LZ(Q), or for any g e L2(Q), the corresponding solution uf, or ug, is in the space H2(Q)r\V, It then follows from the closed graph theorem (for a proof, see e.g. Brezis [1983, p. 20]) that there exists a constant С such that ll«/ll2.o<C|/|0jO forall/eL2@), A9.10) l<Pgh,Q^C\g\0M forallgreL2^). A9.11) Remark 19.2. Consider problem A9.2) for instance. Without the assumption of regularity, we simply know that (use Remark 1.2 and the identification A9.1)): «IIM^^H/r-sup^supi^l^l/lo, veV || f|| i,o veV |И i,fi for all feL2(Q). ; For problems with smooth data, the assumption of regularity is essentially ш restriction on the possibility of "mixing" different boundary conditions. For instance, while the Dirichlet (Г0 = Г) and Neumann (Го = 0) problems associated with the idata of C.19) are regular if Q is convex and if the functions a^ and a are sufficiently ismooth, the mixed problem associated with these data is not regular in general. We are now in a position to estimate the error |w — uh\0,a- Theorem 19.2. In addition to (HI), (H2) and (H3) (cf. Section 17), assume that s = 0, that the dimension nis <3, and that there exists an integer k^l such that the solution и is in the space Hk+l(Q) and such that the inclusions PAT)cPcH\T) A9.12) hold. Then if the adjoint problem is regular, there exists a constant С independent of
Section 19 Basic error estimates 143 h such that \u~uh\0M^Chk+1\u\k+ua. A9.13) Proof. Since n^3, the inclusion Я2(Т)<^<^°(Г) holds (if s = l, the inclusion H2(T) с» ^(Г) holds only if n = 1; this is why we have restricted ourselves to the case s = 0). Applying Theorem 17.1 and inequality A9.11) we obtain, for each geH=L\Q), inf \\<Pg-9h\\i,n^\\q>g-nh<pg\\ltQ^Ch\<pg\lwQ^Ch\g\0tQ. Combining this inequality with inequality A9.5) then gives \u-uh\o,Q^Ch\\u-uh\\l,a, and it remains to use inequality A8.3) of Theorem 18.1. □ The results of Theorems 19.1 and 19.2 were established independently by Aubin [1967b] and Nitsche [1968], and also by Oganesjan and Rukhovets [1969]. See also Kikuchi [1975c] for a generalization. The asymptotic error estimates obtained in Theorems 18.1 and 19.2 are optimal, in the sense that the orders of convergence are the same as if the discrete solution uh were replaced by the XA-interpolant of the function и: Compare A8.3) and A9.13) with A7.5) for m= 1 and m=0. Consequently, Fig. 16.1 is also useful for getting a practical appraisal of the upper bounds of Theorems 18.1 and 19.2. For instance, one gets \\u—uh\\m# = 0(h2~m), w = 0,l, with linear и-simplices or with bilinear rectangles, ||u-—M/j||m,fi = m = 0,l, with quadratic, or reduced cubic, n-simplices or with biquadratic, or reduced biquadratic, rectangles, etc. Nevertheless, the higher the order of convergence, the higher the assumed regularity of the solution, and this observation limits considerably the practical value of such estimates. For example, let us assume that we are using cubic и-simplices but that the solution is "only" in the space H2(Q): Then the application of Theorems 18.1 and 19.2 with fe = l only implies that ll"-"Jlm,« = 0(fo2-m), т=0Д. If the solution is not very smooth, special techniques may be applied that yield "local" error estimates which may nevertheless be of a high order. This fascinating aspect of the error analysis for finite element methods is treated in depth in the next article by Wahlbin. We finally show that, if the family of triangulations satisfies an inverse assumption, the estimate of the error |и — и»|0,я established in Theorem 19.2 may be in turn used to derive a first estimate of the error |« —M/,j0,cc,g (other estimates of the same error will be given in Sections 21 and 22). Theorem 19.3. In addition to (HI), (H2), (H3), and (H4) (cf. Section 17), assume that s = 0, that the dimension n is =%3, and that there exists an integer k^l such that the
144 P.G, Ciarlet Chapter III solution и is in the space Hk+1(Q) and such that the inclusions P^fjcPc^'ff) A9.14) hold. Then if the adjoint problem is regular, there exists a constant С independent of h such that \u-uh\0iCC,n^Chk+i-^\u\k + i,n- A9.15) Proof. Write 1"-«А|о1о0>п^|и/,-^м[0,а),о + |м-Ялм|о,0о,о. A9.16) We first infer from Theorem 16.2 that (by assumption (H3), we also have P с LQ0(f)): \u-nhu\0^T^ChkT+1~nl2\u\k+UT for all Te\J!Th; h hence |и-Я»н|0100>а<СЛ*+1-"/2М1+11О. A9.17) We then infer from the inverse inequality A7.15) with l = m = 0, q=x>,r = 2, from the error estimate A9.13), and from Theorem 16.2, that \uh-nhu\0^M^Ch~m\uh-nhu\Q,n ^Ch-"'2{\u-uh\0,a + \u-nhu\0,Q} A9.18) ^C/zt+1-"/2|u|k+1,n, and the error estimate A9.15) follows by combining inequalities A9.16), A9.17), and A9.18). П 20. Discrete maximum principle in finite element spaces Let Q be a polyhedral domain in IR", with boundary Г, and consider the model problem ~Au + bu = f in Q, u = u0 on Г, B0.1) where b, f and u0 are given functions, which are assumed to be sufficiently smooth for the time being (specific smoothness assumptions will be given later on). We also assume that b^O in Г2. B0.2) If we approximate the solution и of B0.1) by using finite element spaces made up of linear n-simplices, it follows from Theorem 1У.З that jO(fc) if us H\Q), и = 2, ,_ \u-uh\0^tD = < ,_ B0.3) lO(y/h) ifueH2(Q), n = 3,
Section 20 Basic error estimates 145 if the adjoint problem is regular and if the associated family of triangulations is quasi-uniform. Our next objective is to show that, for a general class of second-order Dirichlet problems (cf. B0.16)), which include B0.1) as a special case, one can improve the estimates B0.3) if the finite element spaces satisfy a discrete maximum principle as introduced by Ciarlet [1970a, 1971]; it is the discrete analog of the well-known "continuous" maximum principle (cf. Protter and Weinberger [1967], Sperb [1981]). More specifically, we shall show in Section 21 that, if this is the case, lim\u-uh\o^n=0 if u€W^(Q), n<p, B0.4) A-»0 |и-и*1о.».о = 0(Л) ifueW2-p(Q), n<2p. B0.5) The discrete maximum principle has been extensively used for proving convergence and error estimates for classical finite difference schemes (cf. for example Varga [1966a], Bramble, Hubbard and Thomee [1969], Ciarlet [1970b], and the references therein). In this case the solution of a problem such as that of B0.1) is supposed to be classical, in the sense that it has continuous derivatives of sufficiently high order. If the continuous problem is now approached from a variational viewpoint, then a maximum principle for the continuous problem can be derived, as in Aronszajn and Smith [1957], and a parallel approach can also be taken up for the finite element method (cf. Ciarlet [1970a]). We use here the discrete analog of another approach, due to Stampacchia [1965], where the essential step consists in obtaining a priori bounds in L°°(£2) for the solution of the discrete problem (cf. Theorem 21.4). A similar approach has been used in Lebaud [1969], and Lebaud and Raviart [1969] for finite difference schemes of variational type. We also mention that Nitsche [1970] has also obtained, by a different approach, an estimate of the form B0.5) when Q is convex and n = p = 2. Therefore, it is somehow a satisfactory situation to realize that for three well-known approximation schemes for solving second-order Dirichlet problems (classical finite differences, variational finite differences, finite element methods), the existence of a maximum principle for the discrete problem implies the possibility of obtaining uniform convergence of the approximate solutions to the exact solution. We now describe the variational problem that we will be approximating: Let Q denote as before a polyhedral domain in U" with Г = dQ. We are given functions akleLm(Q), l<fc,/^n, a^L^Q), К/с^и, B0.6) beLx(Q), we let V=HlQ{Q) B0.7) and we consider the bilinear form
146 P.G. Ciarlet Chapter III a(u,v) = ' ( « « ") < Y, akAudkV+ Y, aAu v + buv>dx, B0.8) u,i=i t=i J By B0.6), this bilinear form is continuous over the space V xV. Furthermore, we shall assume that the bilinear form is ^elliptic, i.e., that there exists a constant a > 0 such that a{v,v)^a\\v\\lQ for all» e К B0.9) For instance, this is the case if the functions akl satisfy the condition C.21), if ak = 0 andifb^O. Let V' = H~1(Q) denote the dual space of the space V. Given any function u0eH1{Q) and any element leV, there exists, by the Lax-Milgram lemma (Theorem 1.3), one and only one function us H1(Q) such that a(u,v) = l{v) for all v eV, B0.10) u-~u0eV, B0.11) Suppose that we are given functions fkeLp{Q), O^k^n, B0.12) u0eW^{Q), B0.13) with 2^n<p, B0.14) so that the inclusion Wl'p(Q)c+<g°(Q) holds, in particular. Then the linear form I defined by UveV-+W= |/0i> + £/t_ldx B0.15) Q belongs to the space V since condition B0.14) implies that fkeL2(Q), O^k^n. Moreover, the function u0 belongs to the space #°@) so that its restriction to Г (still denoted u0) is well defined as a function in the space ^°(Г). Using Green's formula, we find that solving the variational problem B0.10)- B0.11) amounts to formally solving the boundary value problem - £ 2*КАЦ+ Е аДм + 6и=/0- £ЭкЛ in Д M=l k=\ Jc=l U = U0 ОП Г, B0.16) which is a nonhomogeneous Dirichlet problem for the operator n n L:u-*Lu=— £ Qk(akldiu)+ £ akdku + bu. B0.17)
Section 20 Basic error estimates 147 With all the above assumptions, it can then be proved (cf. Stampacchia [1965, Theorems 4.1 and 4.2]) that the solution u of B0.10)-B0.11) lies in the space НЩпЬ°°@), and that No.oo,a<ll"ollt»(o + Cj E I/*Io,p,oK B0.18) for some constant С independent of the particular functions u0 e WX'P(Q), fk e LP{Q), Osjfc^n. Remark 20.1. It would be simpler to let fk=0, Kfe<n, but we shall need later on (cf. the proof of Theorem 21.5) the possibility of having nonzero functions fk. With a triangulation 3~h of the set Д made up of и-simplices Те £~h, we associate as usual the finite element spaces Xh = {vheV°(Q); i^eP^r) for all Те<Г„}, I/ / Y П Г\ B0Л9) Vh = {vhe Xh;vh = 0 on П. We let bt, l^i^N, and bh N+l^i^N + M, denote the vertices of the triangulation &~h that belong to Q, and to Г, respectively, and we let wt, l^i^N + M, denote the functions of Xh that satisfy Wi(bj) = 5U, l^i,j^N + M, B0.20) i.e., the functions wt, 1 ^ i < N, or wb 1 ^ i<N + M, form a basis of Vh, or of Xh. Given the function u0 of B0.13), we let N + M "o* = Z uo{bi)wh B0.21) i = N+l so that u0h e Хл; then the discrete problem consists of finding a function uh e Xft such that a(uh,vh) = l(vh) for all^eF,, B0.22) uh-u0heVh. B0.23) Clearly, this problem has a unique solution. If we look for a solution uh of B0.22M20.23) in the form N "а = "о* + £ <J,w, with {, = м4(ЬД ККЛГ, B0.24) i=l condition B0.22) is automatically satisfied, and the vector £=(<J1; £2,..., £w) is the solution of the linear system A£ = p, B0.25) with
148 P.G. Ciarlet Chapter III /4=(ay), atj^aiwpWi), t^i^N, l^j^N+M, B0.26) n + M P=(PlP2,.--Jn), Pt = №)- I Mbj)au l^i<N, B0.27) J = JV+1 We then say that the discrete problem B0.22)-B0.23) is of nonnegative type if the matrix A of B0.26) is irreducibly diagonally dominant (Varga [1962, p. 23]), and the coefficients аи defined in B0.26) satisfy ey<0 tor 1Ф), Ki^JV, l^j^N + M, B0.28) N + M £ ву>0, l^i^N. B0.29) j=i Note that these conditions are reminiscent of similar conditions for discrete operators associated with finite difference schemes for solving B0.16). To be more specific assume that f0^0 and fk=0, l<fc<n, in B0.15) so that l(wt) = jn/oWidx^O, l^i<N, since each basis function wt is^O. Then we have (cf. Ciarlet [1970b, Theorem 3]): max{&;l<is$AT}<max{0, тах{и0(Ь,); N + l^j^N + M}}, B0.30) an inequality that can be equivalently written as тахиАг*тах{0, тах{м0л}}. B0.31) Following Ciarlet [1970a], we shall then say that the discrete problem B0.22)-B0.23) satisfies a discrete maximum principle, in the sense that inequality B0.31) holds when /(w;)<0, l^i^N; note that this inequality is nothing but the discrete analog of the maximum principle for the continuous problem (cf. Stampacchia [1965, Theorem 3.6]). Following Ciarlet [1971], we next give an example of a discrete problem that satisfies the discrete maximum principle. Let a(u,v)= < £ 5kudkv + buv>dx, B0.32) a with beL°°(Q), b>0 infi, B0.33) so that the operator L of B0.17) reduces to that of B0.1), viz., Lu=-Au + bu. B0.34) Given an n-simplex T of the triangulation S~h, let ar, Kr^n+1, denote its vertices and let Ar, 1<г<и+1, denote the barycentric coordinates of a point xeT with respect to the points ar. With each n-simplex Г, we associate the parameter a T = max {cos(DAP, DXsj}, B0.35)
Section 20 Basic error estimates 149 with ШГ = (81ЯГ,82ЯГ,...,ЭЯХ l^r^n+l, and where • and || • || respectively denote the Euclidean scalar product and Euclidean norm in W, Finally, with each triangulation 2Th, we associate the parameter ah=maxaT. B0.36) TePh Theorem 20.1. Let the bilinear form be as in B0.32)-B0.33) with b^O, and consider the finite element approximation B0.22)-B0.33) of the corresponding variational problem B0.10)-B0.11). Given a sequence of triangulations 3~h for which h = maxhT TeFh approaches 0, the discrete problem satisfies the discrete maximum principle for h small enough if there exists a constant <т0 =<70(Ь) independent ofh such that <7л<(То<0 for all h. B0.37) If b=0, it suffices that <тл<0 for all h. B0.38) Proof. Consider an п-simplex T&3~h and a basis function w;. Then either Гс supp wh which implies that the point bt is one of the vertices, say ar, of Г, so that Wi\T =ЯГ; or T^supp W;, in which case w;|r = 0. Therefore, for j=£i, the coefficient ai} = a(wj, wt) reduces to a finite sum of integrals of the form fc=i г :=1 J and the indices r and s are always different when they occur in the sum. Since the functions Xt are affine and satisfy 0<lr^l, l^r^n, we have a,^(W,"Ws + |fo|0,oc,fl)meas(r)< fe + |b|0.eJ, jmeas(T), since l/h^\\DXr\\,l^r^n+l. Thus, we have ars<0 for all r^s, for h small enough if condition B0.37) is satisfied, or for all h if fe = 0; this shows that conditions B0.28) are satisfied. From the definition of the basis functions, it follows that T,^=^wj = 1 over Д and therefore
150 P.G. Ciarlet Chapter III iv+м г Y atj= I Kdx^O, l^i^N, n since b ^ 0 by B0.33). Hence conditions B0.29) are also satisfied. Since the matrix (oy) is irreducibly diagonally dominant, the proof is complete. □ As observed by Ciarlet [1971], there is a simple geometrical intepretation of condition B0.37) when n = 2: Theorem 20.2. Let the assumptions be as in Theorem 20.1. When n — 2, the discrete problem satisfies the discrete maximum principle for h small enough if there exists e > 0 such that for all h, all the angles of the triangles of 2Th are ^|я —e; if b = 0, it suffices that all the angles of the triangles of $~h be <2Я- Further information on the discrete maximum principle in finite element spaces are found in Lorenz [1977], Hohn and Mittelmann [1981], Ruas Santos [1982]. 21. Estimates of the error \u—н*1о,со,й when «e Wl'p(Q),n<p, or when и e W1,r(Q), n < 2p, when the discrete maximum principle holds We follow here Ciarlet and Raviart [1973]. To begin with, we prove various technical results (Theorems 21.1, 21.2, 21.3). Theorem 21.1 Let T be an n-simplex in W, with vertices ar,l^r^n + l, and let a function ve Pi(T) be >0 on T. Then for any p with p> 1, there exists a constant С i >0, independent of T and v, such that d meas(T) £ {v{ar)Y^\v|gtP>r. B1.1) r=l Proof. Let A,(x) denote the barycentric coordinates of a point xeT with respect to the points ar, 1<г^и + 1; observe that Яг^0 over T. Since the function v = 1?tl v{ar)Xr is ^0 on T, we have, by Jensen's inequality, Iv \po,P,t = I {"l v(ar)K(x)V dx >"% {v(ar)Y T T {K(x)}p dx. Now let f be a fixed n-simplex of W with vertices йг, 1 ^r<n +1. Then there exists an invertible matrix В of order n and a vector beW such that the set T is the image of the set t through the mapping F: x->F(x) = Bx + b, and this mapping can be so chosen that F(dr) = ar, 1 ^ r < n +1. Therefore, denoting by Xr, 1 «S r < n +1, the barycentric coordinates of a point x e t with respect to the points ar, we obtain
Section 21 Basic error estimates 151 {Ar(x)}"dx = |{Ar(Bx+fe)}p|detDF(x)|dJc=^^2 j {£(*)}'dx, J meas(T) J т f t since the barycentric coordinates are invariant through the affine mapping F. Thus, inequality B1.1) holds, with C1 = {meas(f)}"min Ar(x)pdx;l<r<n+1>. □ Theorem 21.2. Let there be given a discrete problem of nonnegative type (cf B0.28)~B0.29)). Let £,-, К i < N + M, be given real numbers and letaeU be given such that тах{0,тах{^; N+l^i ^ N+M}} ^oc. B1.2) Then if we let (IrminKa, l^i^N + M, B1.3) we have N N + M E E a^-nMj^- B1.4) Proof. Let / = {ls$KiV; £;>a} and J = {l^i<iV + M; &^а}. From B1.2), it follows that /uJ= {1,2,..., iV+M}. Next, it is easily verified that JV N + M N + M E E ^rii)ij=a[Kr«) E au+E E %(£••-a)(£/-a)> and thus this quantity is ^ 0: the first term is $s 0 since a ^ 0 by B1.2), (£; — a) > 0 for г e 7 and Z^M ay > 0 by B0.29), and the second term is also ^ 0 since for i e / and je J, we have ay<0 by B0.28) (Jn J= 0), (^-a)>0 for ieI and (£;-a)sS0 for jeJ- D Theorem 21.3. Let there be given a discrete problem of nonnegative type. Given any function uh e Xh and any real number a, let uhA denote the function ofXh that satisfies uhJbi) = min{<x,uh(bi)}, l^i^N + M. B1.5) Then, if max{0,max{ufc(fe,);iV+l^i^iV + M}}<a, B1.6) the function vh,* = 4-4,« B1-7) belongs to Vh, and Фи,*, vhJ^a(uhya, vh<a). B1.8)
152 P.G. Ciarlet Chapter III Proof. We have uh(bl) = uhJlDi) for N +1 ^i^N + M by B1.6); hence vhtX belongs to Vh. Next, we may write Фй, f*,a) = Фи,*, Ча) + Ф*,«> Uh ~ %,J, and thus, we have to prove that Ф*,«ц-«*,»)>о. If we let £( = ил(Ьг) and »/, = им(Ь;), 1 ^ i^ N + M, the last expression can be rewritten as N N + M and this quantity is ^0 by Theorem 21.2, □ Following Ciarlet and Raviart [1973, Theorem 1], we now prove that the norms |w*lo,a>,o are bounded independently of h. This crucial step towards estimating the error JM — Mftlo, oo,n is nere nothing but the discrete analog of the a priori estimate B0.18) for the continuous problem. A similar estimate has been established by Schatz [1980]. Theorem 21.4. Let the assumptions on the continuous problem be as in Section 20. Then there exists a constant C, which is the same for all the discrete problems of nonnegative type, such that n oo.fi < II «ок»(П+С £ \fk B1.9) where uh e Xh is the solution of the discrete problem B0.22)-B0.23). Proof. Let a0h-max{0,maxuoh} = max{0,тах{ий(Ь;); N+l^i^N + M}}, and let a be any real number with By Theorem 21.3, the function vKa—uh — uh^ belongs to Vh, and moreover Фм> vhJ sS a(uh, vhJ = (/, vhJ. B1.10) Since n^e^cfl1 (Q), the functions vht0L and dk v hA, 1 < к < n, belong to If' (£2) with p' denned by l/p + 1/p' = 1 ip' < 2 by B0.14)); thus, by Holder's inequality, we obtain П Kvh,«X\fo\o.p,i}\Vh,*\o,p,,n+ £ \fk\o,P,n\dkVh,o:\o,p;s2- B1.11) Let E(u) = {xeQ;vhA(x)>0}. B1.12)
Section 21 Basic error estimates 153 Then vhyX = Qkvhwe = 0,1 < к ^ n, in Q — £(a), since £(a) is a union of n-simplices of 9~h (Vh,« ^ 0 over Q); hence using again Holder's inequality, we obtain I vh,x \o,p;a = I t>*,« |o,P',£(a) < \vh,a |0,£(a) {meas(£(a))}A/2)" (i/p) From this inequality, similar inequalities for bkvhA, l^k^n, and inequality B1.11), we get K.)<("+1){ £ |Л1о.р>01 II ofc.e ||1.в{шеа8(£(а))}<1^-<1/'>; B1.13) from inequalities B0.9), B1.8), B1.10), and B1.13), we obtain «KJIi.o^(« + l){£ |/t|o,P,«j{meas(£(a))}A/2>-A«. B1.14) From the Sobolev imbedding theorem A4.3), we infer that ,"l/2*=i-l/n, if n>2, НЧОЬьХЩ with■.„,,, , , ., B*= any real number >1, if и = 2; hence inequality B1.14) implies that there exists a constant C2 independent of h such that K«lo.2.,D <C2|£ |/,|0.n„|{meas(£(a))}<1/2>-<1^. B1.15) Next, let /? be any real number that satisfies /?>a, and let £(/?) be defined as in B1.12). Using Theorem 21.1, we obtain ГеВД J Г ^Ci E {^,a(bi)}2*meas(suppw;) B1.16) Ь(е£(я) JsCiQS — aJ* £ meas(suppwI) = C1(^—aJ*meas(£(/J)). Inequalities B1.15) and B1.16) imply that the function ф:а^а.он -»■#*)=meas(£(a)) is >0 and nondecreasing on the interval [a0ft, + oo [, and that, for any f5>a^a0h, with 2* С3=СГ1<|С2 £ |Л|0.,4 , v = 2*(d/2)-(l/p)). * = o
154 P.G. Ciarlet Chapter III Therefore, applying a result of Stampacchia [1965, Lemma 4.1], we obtain #Хо^2»«'-1>С3(#*о*)Г1}1/2*)=0, which means that, for all xeQ, П ^max{0,maxM0} + C Y \fa\a,p,a, B1.17) r k = 0 with C = СГ1/2* C22v/2*(v- 4(meas(fi)J*<v-J (since the piecewise affine function uoh interpolates the function u0 on Г, we have aQh «S max{0, тахги0 })• The conclusion of the theorem follows by observing that we can similarly prove an inequality opposite to that of B1.17). □ We are now in a position to prove the main result of this section, due to Ciarlet and Raviart [1973, Theorem 2]. Theorem 21.5. Let the assumptions on the continuous problem be as in Section 20, and consider a family of discrete problems of nonnegative type, associated with a regular family of triangulations. Then lim|M-uh|o,oo,fi=0, if ue Wl-'{Q\ n<p, B1.18) h->0 |и-и*1о.о0.о<СЛ|и|2>р.0, if ueW^iQ), n<2p. B1.19) Proof. For each h, let Пни denote the Xft-interpolant of the solution u, i.e., the unique function in Xh that satisfies Ялм(Ь;) = м(Ь;), l^i^N + M. Since the function (щ ~Пни) belongs to the space Vh, we infer from B0.10) and B0.22) that а(ин — Пии, vh) = a(u—nhu, vh) for all vh e Vh. Hence we may write a{u„-nhu,vh)=- I f0vh+ Y fAvh} dx for all vhe V„, Q with n f0 = b(u-nhu)+ Y акдк{и-П„и), fk= Y ак1д,{и-П„и), l^k^n. 1=1 The assumptions B0.6) and the fact that ueWUp(Q) imply that fkeW(Q), O^k^n. Hence we may apply Theorem 21.4; this shows that there exists a constant
Section 22 Basic error estimates 155 С independent of h and и such that \uh -nhu|0>Oo,Q <C|| u-HhM || liP,fi, and thus 1"-"а1о,оо,й^A + С)||м-Лам||11Р,й. B1.20) If ue W2,p(fl) with n<2p, then 1|и-Л»н||,,Р@<СА|и|2>л0 B1.21) by Theorem 16.2, and inequality B1.19) is proved when n<2p. Assume next that n<p. By Theorem 16.2 again, there exists a constant С independent of h such that 11»-Яко||1.Р1й^С|р|1.л0 for all ve W^iQ), n<p, and thus №«;|li.,.D^(l+C)Nli.„.a for all re W^{Q), n<p. B1.22) Let e>0 be given. Since the space W2,P(Q) is dense in the space Wi,p(Q) and since inequality B1.22) holds, there exist uee W2'P{Q) such that ll«-Mi.p.o<ie, ||n*(M-«,)lli.,l0<k B1.23) Next, by B1.21), there exists /i0(e) such that II ««-#*«« II i.i».o<ie ГогаНй<Мв). B1.24) Hence B1.23) and B1.24) combined imply that l|M-tfhu||1>p,fl^£ for all h^h0(e), B1.25) and B1.18) follows from B1.20) and B1.25). □ 22. Estimates of the errors \u—н*|о,<»,п and |н—«*|iH0,a when ue W^iQ) and Nitsche's method of weighted norms Our objective in this section is again to study the uniform convergence of the finite element method when linear n-simplices are used. We shall however use a technique quite different from that of the previous section, which relied on a discrete maximum principle. We follow instead the particularly penetrating method of weighted norms of Nitsche, which yields optimal orders of convergence if the solution is sufficiently smooth, i.e., if we W2,co{Q) in the present case. For ease of exposition, we shall simply consider the homogeneous Dirichlet problem for the operator —AinQ, whose variational formulation corresponds to the following data: V=Hh(Q), Г Г B2.1) a(u, v)= VvVvdx, l(v) = \fv dx, fe L2 (O). n a
156 P.G. Ciarlet Chapter III Assuming that Q is a convex polygonal subset of IR2, we shall restrict ourselves to finite element spaces Xh which are made up of linear triangles; hence the corresponding discrete problems are posed in the spaces Vk ~{vh e Xh; vh = 0 on Г}. Extensions to triangles of higher degree and to higher dimensions are indicated at the end of this section. We shall assume once and for all that we are given a regular family of triangulations of the set Q that also satisfies an inverse assumption, i.e., that there exist two constants a and v such that hT/pT^(x, ,222- h/hT^v for all Гe (J 5;. h The main tool that Nitsche used in his study of the errors \u-uh\0ta0!„ and l« —и* 11,00,я is the consideration of appropriate weighted norms and seminorms. Accordingly, the first part of this section will be devoted to the study of those properties of such seminorms that are of interest for our subsequent analysis; these properties are the object of Theorems 22.1-22.4. Given a weight function ф, i.e., a function that satisfies феЬ<°@), ф^О а.е. on Q, B2.3) we define, for each integer m^O, the weighted seminorms veHm(Q)^\vU;m,Q = \ \ф £ \d»v\2dx\ . B2.4) Я To begin with, we observe that, if the function ф '1 exists and is also in the space П°{0), an application of Cauchy-Schwarz inequality gives а(и^)^\и\ф..иа\ь\ф-и.1>0 for all u^eH^Q), oteR. B2.5) Departing from the general case, we shall in fact concentrate our subsequent study on weighted seminorms of the particular type | • |^«;m>fi, ae U, where the function ф is of the form B2.7) below. Our first task is to extend to such weighted seminorms the property that there exists a constant clt solely dependent upon the set Q, such that Mlo^c^AvIoj, for all ve HtiQ)r>H2(Q). B2.6) We recall that such an inequality follows from the H2(£2)-regularity of the solution и of problem B2.1) when feL2(Q) and Q is convex. Theorem 22.1. There exists a constant Cx = Cx (Q) such that, for all functions ф of the form 1 ф: хе и-*ф(х) — ,x-x||2 + 02' B2.7) >0, x = (x!,Jc2NlR2,
Section 22 Basic error estimates 157 we have M^2,fi^C1(M^-1;o,fl+Mb) forallveHl{Q)nH2{Q). B2.8) Proof. Let v be an arbitrary function in the space Hl{Q)nH2(Q). Then the function W = (Xi— Xi)v also belongs to the space Hj(fl)nH2(fl), and (x1—x1)d11v = d11w — 2d1v, (xl-x1)d12V = d12W-d2v, (x1-x1)822u = 822w, Aw=(x1 — x1)Av + 2d1v. Using these relations and inequality B2.6), we find that there exists a constant c2 such that (x.-x,J X №dx Q <2cfHw|gpfl + 8|p|fi0 ^c2j (x,-x1J(J»Jdx+|c|f,0l. Since we have likewise f fi (x2-x2J £ ISVdx^c, 1^1 = 2 (х2-х2J(Лг;)Мх + И1^, we eventually obtain Нф->;2,П: ((x1-x1J + (x2-x2J + 02) X |9<42dx 101 = 2 <max{2c2,c?}(Ht;||-i;o,0 + |i;|f,n), and the proof is complete. □ As exemplified by the above computations, we shall depart in this section from our usual practice of letting the same letter С denote various constants, not necessarily the same in their various occurrences. This is due not only to the unusually large number of such constants which we shall come across, but also, and essentially, to their sometimes intricate interdependence. Therefore, constants will be numbered and, in addition, their dependence on other quantities will be made explicit when necessary. However the possible dependence on the set Q and on the constants a and v of B2.2) will be systematically omitted. While we shall use capital
158 P.G. Ciarlet Chapter III letters Q, ijs 1, for constants occurring in important inequalities, lower-case letters C{,i^l, will rather be reserved for intermediate computations. In the next two theorems, we examine the relationships between the weighted seminorms |#Ц«;т,п (when the function ф is as in B2.7)) and the standard seminorms hlm.oo.o- Such relationships will play a crucial role in the derivation of the eventual error estimates. Theorem 22.2. For each number a> 1 and each integer m^Q, there exists a constant C2{a., m) such that, for all functions ф of the form ф: xe 0-*ф(х) = 9>0, xeQ, 1 i|x-xl|2 + 02' B2.9) we have Nr;m,^C2(a, m) -rrloL.,,,0 for all ve Wm^{Q). B2.10) For each number ft e ]0,1[, and each integer m^O, there exists a constant C3(P, m) such that, for all functions ф of the form B2.9), we have МФ;т,^С3(р,т)\\пв\1'Мт,о>,п forallveW^iQ), в^р. B2.11) Proof. Clearly, one has Иф';т,П^С3{т) "|l/2 фяйх\ \v\miXi£i. Let next 5 = diam Q, so that ф"йх^ Q B(x;5) If a>l, we write idr ф"dx = 2k zdx r2 i Д2« tdt (т2 + в2)' (t2 + 92f J (r2 + B2f 2(u-l)B2i'-1)' and inequality B2.10) is proved with C2(a, m) = c3 (m) n/(a —1I/2. If a = 1, we have for tdt т2 + в2-\\пв\ + ^п(в2 + д2) \1пв\ + ^ЫA + 52)^с^Р)\Ы9\,
Section 22 Basic error estimates 159 with and inequality B2.11) is proved with C3{ji,m) = сг{т)BпсА{Р))г12. □ We next obtain inequalities in the opposite direction. In order that they be useful for our subsequent purposes, however, we shall establish these inequalities only for functions in the finite element space Xh, and further, we shall restrict ourselves to weight functions of the form ф or ф2, with ф as in B2.9) for which (i) the parameter в cannot approach zero too rapidly when h approaches zero (cf. B2.13)), and for which (ii) the points x depend upon the particular function vh e Xh under consideration (cf. B2.14) and B2.16)). Theorem 22.3. Assume that for each h, we are given a function фн of the form фк:хвй-^фн{х)=- 2 2, xheQ, B2.12) || x—хй || +uh and that there exists a number у such that 7>0, 0h^yh for all h. B2.13) Then there exist constants C4(y) and C5 (y) with the following properties: Let vh e Xh be given. If the point xheQ in B2.12) is chosen in such a way that M^)l = |o*lo.a,,o, B2.14) then Ыо,^^С4(у)(в2/И)К\ф1;0,п. B2.15) If the point xheQ in B2.12) is chosen in such a way that тах{\д^„(хк)\, \d2Vk{xh)\} = \Vh\i,«>,Q, B2.16) then IVkLco^CsiyrnhMw B2.17) Proof, (i) Let vh be an arbitrary function in the space Xh, and let the point xh be chosen as in B2.14). Then there exists a constant c5 such that Ыо.оо.п- |f*(x)| = \vh(xh)\ - \v„(x)\ ^ \vh(xh)-vh(x)\ ^VZKli.ao.siWx-XbW^ics/hyVb^acsiWx-Xhl for all xeQ (in the last inequality we have used the fact that the family of triangulations satisfies an inverse assumption; cf. Theorem 17.2). In other words, К(хIИ1-(с5/й)||х-хА||)|р,,|о,0о,о for a11 xeQ,
160 P.O. Ciarlet Chapter III and consequently (B(a;r) — {xe R2; ||x — a\\ ^r}), finBixh;h/2c5) The set Q being polygonal, there exists a constant c6 such that f_ / й M / Л 2 meas<finBl xA; ~— I >^c6. and we also have с / h l-~\\x-xh\\^j for all xeB\xh; — h \ 2c5 and 1 1 1 , „ / h by assumption B2.13). Combining the previous inequalities, we obtain an inequality of the form B2.15), with (ii) Let vh be an arbitrary function in the space Xh, let the point xh be chosen as in B2.16), and let The Z!~h denote a triangle that contains the point xh. Since the gradient Vvh is constant over the set Th, we deduce 2 2 f dx j ||X — Xj,|| +4>h Th Combining this inequality and the inequalities meas(Th)>c7(G, v)h2, 1 1 we obtain an inequality of the form B2.17) with c5(y)=v/(T+T/y)/C7. □ To conclude this analysis of weighted seminorms, we examine in the next theorem the interpolation error estimates expressed with the seminorms |*|ф«;т,д, where, for each h, the function фк is of the form B2.12). The conclusion (cf. B2.20)) is that the error estimates are exactly the same as in the case of the usual seminorms | • |m>n,
Section 22 Basic error estimates 161 provided the parameter 6h does not approach zero too rapidly with h (cf. B2.19)). Notice, however, that if the behavior of the function 6h can be "at best" linear as in the previous theorem, the constant that appears in inequality B2.19) is not arbitrary, by contrast with the constant у which appeared in inequality B2.13). Finally, observe that no restriction is imposed upon the points xh. Theorem 22.4. Assume that for each h, we are given a function фк of the form <f>h:xen-+<ph(x)=- 2 2, xheQ. B2.18) Then there exists a constant C6, and for each aeR — {0}, there exist constants C7(a)>0 and C8(a) such that, if 04>C7(«)A for all h, B2.19) the following estimates hold: \ь-ПАф^т,^С6п2'тЦФ1,2<о, m=0,l, for all ve H2(Q), B2.20) and \Ф&и-Л„(ф&к)\фГл.я <C8(#A)(|f(,l«tl;o,B + |!'Jfti,fl) M all vheXh. B2.21) Proof, (i) There exists a constant c8 such that \v-nTv\m,T^c8hz-m\v\2iT, m=0,l, foralbe#2(G), TeSr„. Next, we have \ь-ПТр\Ф1,т,Т^{ф1{хТ)I12\ь-П^\тЛ, Ы2,т<№(хт)Г1!2Ыфы,т, if, for each Те 2ГН, the points xTe T and xTeT are chosen in such a way that 0 < ф1(хТ) = inf ф1(хт), ф1(хт) = sup фЦх). Since 9;(<ffl(*) (*.—*ы) ._ фЦх) М\\х-хн\\2 + вГ ' l'A we obtain |6^)(x)| J|J(#)(*)IL„, \\x-xh\\ >| xeQ Фь(х) хеП 4>h(x) xeO Wx~xh II + &и Vh and therefore Ф1{хт)^Ф'н(хт) + \Ф/вкЖ(хт).
162 P.O. Ciarlet Chapter III Consequently, if we let C7(a) = 2|«|, so that 0h^C7(a)h implies ФЦхтУФЪхт)<2 for all Те STh, the conjunction of the above inequalities yields inequality B2.20) with C6 = -Jlcs. (ii) Since the function ф&ь is in the space ^°{П) = dom TIh and since the restrictions ф%ьь\т belong to the space H2(T) for all TeJ~h, the same argument as in (i) shows that ,1/2 I TeSrh We have (recall that vh\TePX{T) for all Те 3Th) ЫФн^)НЬцФ1^нН^Фн)Ь^ + {^Фь)^н in any Te^„, and 9y(^*)(x) , , , п(*<-хм)(х;-хц) <50- ^j(x) v '(l|x-x*ll2+02J \\x-Xhf+ei for all xefi. Hence, using the inequalities llx-xj2 ^,фА1/2(х) 1 ^Ф112(х) Aх-хл||2 + 02J^ 2в, ' ||x-xJ2 + 02 ^ dh we deduce that \ди(ф1)(х)\^B/в„)(\а2 + а\ + \а\Ж+ ш(х) for all xeQ. Using the above inequalities and the inequality (cf. (i)) |ЬШ)(х)\ ^(\а\/в„Шх) for all xeQ, we conclude that there exists a constant cg(a) such that, in any Te3Th, «S ^r (<^+ Нх)М*I2 + 0!(x) I |в|«*(хI2) for all x e T,
Section 22 Basic error estimates 163 and thus, Z \ФЫк';2,Т TeSTh I [фГ I |8^№^)(х)|Чх<^(|^г>;0,я+Ы,2«;1,о). T Therefore we have proved inequality B2.21), with С8(а) = С7УМ4 □ After the above preliminaries, we now come to the central object of this section, i.e., the estimate of the errors | u — uh \ 0, ж >fi and \u — uh \ 1 y w ;fi via the method of weighted norms ofNitsche. The analysis will comprise three stages. In the first stage (cf. the next theorem), we consider for each h the projection operator Ph: v 6 Hh@)-+Phv e Vh B2.22) associated with the inner product a( •, •) of B2.1), which is therefore defined for each v e Ho(Q) by the relations PhveVh> a{v-Phv,wh) = 0 for all whe Vk. B2.23) Thus we have in particular uh = Phu, where uh is the discrete solution found in the space Vh and и is the solution of the problem defined in B2.1). We shall then show that for an appropriate choice of the parameters 6h in the functions фк (cf. B2.25) and B2.26) below), the mappings Ph are bounded independently of h when both spaces Hq(Q) and Vh are equipped with the weighted norm »-»(l»IJj,-o.D + l»l«I1;i.oI/2. B2.24) Theorem 22.5. Assume that for each h, we are given a function фк of the form ф„:хей^фк(х)= „г.аг' xheQ- B2-25) Then there exist three constants h0 e ]0,1[, C9 >0, and C10, such that, if eh = Cgh\lnh\1/2 for all h, B2.26) the following inequalities hold for all h^h0: \Pk^ifijt +\Р0\Ф^# ^С10(\и\ф2,ом +|»|^;1>0) for allveHHQ). B2.27) Proof. For convenience, the proof will be divided in four steps.
164 P.G. Ciarlet Chapter III (i) There exist two constants Cu and C12 such that, if вн>Сцк for all h, B2.28) then 1Л,<;1,о ^ С12(\Р„и\2ф2,0М + \v\2K,un) for all v e Hh{Q). B2.29) For brevity, let vh=Phv. Since \щ\ф„-л.п =Фи,Фь^ь) + 2 Ыфь^ьйх and Аф„(х) = 4f,l|X~X*„ll2 ~f}*»W^4#M for a0 * 6 й> llx-Xfcir+e^ we deduce that N^,;i,fi <Ф*><Ь.ил) + 2Ы^;о,п- B2.30) Using relations B2.23) we can write a{vh,(t>hvh) = a(vh-v,<j)hvh-nh((t)hvh)) + a{v,<t>hvh), B2.31) and an application of inequality B2.5) with a = 1 shows that |a(wfc —и,0луь —iTfc(^fcUh))|<(|r|^h;lin + ji?h|^h;1,n)|0hy* —Г7л@ьил)|0-1.1>я. We next infer from Theorem 22.4 that, if Bh > c10h with c10 = C7(l) for all ft, B2.32) then (cf. inequality B2.21) with a= 1) l<M* - П11(фк1к)\ф-киа^с11(^в11){\и11\ф1.0^ + ^11\ф11.и{2} with cn = C8(l). Combining the previous inequalities, we find that, for eh^c10h, \a(vh-v^hvh-IIh№hvh))\ + ЫФнл,п). B2.33) By another application of inequality B2.5) with a— 1, we obtain ф, «MfcKM^i.nl^f.lfcr'ji.fl- B2.34)
Section 22 Basic error estimates 165 Since £ \di(f>h(x)\2^m(x) forallxefl, we find that there exists a constant c12 such that l^*f*k-1;l.n<Cl2(|u*|^;0,n + |l'*k;l,n)- B2-35) Combining relations B2.30)-B2.35), we find that, for Qh^cl0h, l»*lJhil.O^2|»»|J>.0>o + C12|0|^;i.o(|»ji|^;0fo + l»*l*b;l,o) + с1ЛЩ8к)(Ыфн;1,п + \1>и\фь;1,п)(ЫФЪо,п + \»ь\филЛ i.e., we have found an inequality of the form A2^2C2 + c12B(A + C) + c11(h/eh)(A + B)(A + C). Assuming that eh>2clxh for all h, B2.36) we get A2^4C2+(l + 2c12)BC + A((l+2c12)B+Q ^4C2 + (i+c12)(B2 + C2)+i42 + (l + 2c12J£2 + C2, and therefore step (i) is proved with (cf. B2.32) and B2.36)) Ci^maxfoo^cn) B2.37) in relation B2.28), and C12=max{ll+2c12, (l+2c12)C+4c12)} B2.38) in relation B2.29). (ii) Assume that в^СцИ, where the constant CX1 has been determined in step (i). Then there exists a constant C13 such that ^ CMk;o,v+Ml,i,n + h2№h\l^;2,Q) forallveHh(Q), B2.39) where, for each h, фн = ф/,(и) is the solution of the variational problem: ФнеНКО), Vi/vVwdx= \(j>l{Phv)w&x forallwsHl{Q). B2.40) Notice that, because the set Q is assumed to be convex, the function фъ is in the space H2(Q), and therefore it is legitimate to consider the seminorm \'\ф-'-2,я m inequality B2.39).
166 P.G. Ciarlet Chapter III Using the definition of the function \j/h, and letting again vh = Phv, we can write Ыф>;о,а = Фн-и,Фь-Щ\1/н)+ \<f>fahvdx. B2.41) n By applying inequality B2.5) with a = 1 and inequality B2.20) with a = — 1 (this is possible because we assume О^СцП and Cu ^c10 = C7(—1); cf. B2.32) and B2.37)), we obtain \Фь-^^н-Пнфн)\^С1Ы.Мфы1га + ^\фи;иоМь\ф-1-,2,а- B2.42) Next we have ^»»od*^|ffcUio.ol,'k:04i<i(l»*lfe;o,o+l»l*i;;o.o). B2.43) li so that, by combining relations B2.41), B2.42), and B2.43), we obtain the inequality l»J5^o,o^C7A(|»»|^;1,n + |cUIi;i.o#*|#-'i2,o + i(|i'iil5j;o,o + l»lfe;o.oX which in turn implies the inequality l»*l*^o.o<^l»»lLi.o+lwl*;:o,o + l»lLi.o + A + 1/5)C?fc2|^|J-l;2@ forany<5>0. B2.44) Let then 8 = 1/CC12) in inequality B2.44), where C12 is the constant appearing in inequality B2.29). Then inequality B2.44) added to inequality B2.29) times the factor 2/CC12) yields iWb.fl+(i/CC12)Mwl>0 ^Mfeio.0 + fK;i.o + (l + 3C12)C?fc2|^|J-i:2iD, i.e., an inequality of the form B2.39). (iii) Given any number в0е]0,1[, t/iere exists a constant Ci4@o) such tnai Ш2Ф; ';2,о < C14@O) ^ \PHv\Jlo,n for all в„ e ]0,0O]. B2.45) Since —A\\ih — ^\vh (recall that гл = Рй1)), we have И</'*1^-1;о>о = 1"л1^;о.п. and consequently, by Theorem 22.1, l^l*h-l;2.n<C1(N23.0>n + |^|f.o). B2.46) Since фк(х)^1/в% for all xe Д we first find that Mko.n^pMfoo.o- B2-47)
Section 22 Basic error estimates 167 To take care of the other term that appears in the right-hand side of inequality B2.46), we shall prove that, for each number 60 e ]0,1 [, there exists a constant с13(в0) such that, for all functions ф of the form ф:хе 0->ф(х) = 1 ||x-x||2 + 02' xeQ, O<0<0O, we have \Ф\\^с1Ъ(в0)Щ^\Аф\2- ;0,Si for all ф е HKQ) n H2(Q); B2.48) B2.49) the proof given here of inequality B2.49) is due to Rannacher [1977]. Taking into account that \ДФи\ф;2;о,а=\»ь\ф1;о,а> and applying inequalities B2.49) with ф = фь and ф = фи, B2.46), and B2.47), we then find an inequality of the form B2.45), with Ci4.(#o) = Ci 1 |ln<90 + c13@o) B2.50) It therefore remains to prove relation B2.49) (another method for proving the same relation is given in Nitsche [1977]). Given an arbitrary function ф е Яо(£2) n H2(Q), we have mu=- ^|ln0|. ./|2 фЛф Лх^\Лф\ф^.оа\ф\ф1.оп в2 ;0,Si + 2|in e\ \Ф\ф2;0,П- B2.51) Let then G denote the Green function associated with the operator — Л in Q and the boundary condition v=0 on Г, so that \Ф\р-,о,а = Ф2Ы Si О(х,0Лф@^ dx ^\ф-2^)\Аф@\2Пф2(х)О(х,0 Si Si ф2{п)С{х,ц)йп W dl B2.52) There exists a constant c14 such that (cf. for example Stakgold [1968, p. 143]) (KG(x,j;)s$c14(l-|-|ln||x-j;||| for all x,уеО, хфу. B2.53) Using this inequality, we proceed to show that for arbitrary points x,xeQ and for
168 P.O. Ciarlet Chapter III any number в with 0<в^в0<1, there exists a constant c15@o) such that 4>2(rj)G(x,rj)dri = G(x, rj) ,„ ч |1п в\ (h-xp+e2J B2.54) a To see this, write m\\x — rj\\ (||»/-x||2 + 02) d^= x |ln||x-f?|| (||4-x||402) d?y, n where Ял Q1 = fl1(x,0) = {^6f2;||^-xK0}, О2 = 02(х,0) = {чеС;в<||»/-хК1}, Оз=Оз(х) = {^бО;1<||^-х||}. We then obtain the following inequalities (observe that the last two inequalities make sense only if the sets Q2 and Q3 are not empty, and that we have diam £2 ^ 1 if the set Q3 is not empty): Mlx-glll H ^ 04 1 \n\\rj-x\\dri=-Q4 In II£II d£ г(£-1п0Хя 1 + В@;в) 1 \ |1П 0| 2|ln0o|/ в2 ' ln\\x-t]\\ 2 , o2\2 J (||,,-x||402) «2 <|ln0| drj dr\ 2 , Д2\2 П2 <|ln0| (ll»?-x||2+02) d?7 |ln0| d£ (||т7-х||Ч02J 02 J A + ||^||2J ln||x — rj = 71- |ln0| Нч-хцче2J"^ «з < ln(diam £2) drj (||77-x|!2 + 0 2 , Д2ч2 <ln(diam Q) drj (||4-jc||2 + fl2> 2 , Д2\2 Qi = —^ ln(diam О) ^ Ttlnfdiamfl)^ |ln 0| |lnflol
Section 22 Basic error estimates 169 Consequently, inequality B2.54) is proved, with f 3 1 „ l + 21n(diamO) Cl5@o) = rcC14<-—— + ... 2 + 2+ yin&ol 2|ln0ol 2|ln в01 Using this inequality, it is easy to verify that there exists a constant c15F0) such that 02(i7)G(x,ff)di7<cls@o)^. Then inequalities B2.51)-B2.54) together imply inequality B2.49) with C13@o)=4(l+C?5@o))- (iv) It remains to combine the results of steps (ii) and (iii): We have determined constants C11,C13 and C14@O) for each 0oe]O, 1[ such that (cf. inequalities B2.39) and B2.45)) Cnh^eh^90<l <C13(Nfco.0 + l<ii,D) + C13C14(eo) ^^ liVlfco*. B2-55) Let for example 0O = i and let 0„ = C9/i|lnft|1/2 withC9 = 2(C13C14(i)I/2. B2.56) Then there exists a number h0e ]0,1[ such that Л<йо ^ |С11Л<в4<Ьво, ^ ° l|lnfl4|^2|lnfc|. l ; This being the case, we have found an inequality of type B2.27) with C10 = 2V/C^7. □ B2.58) We next develop the second stage of this analysis. Using the inequalities (cf. Theorems 22.2 and 22.3) relating the seminorms |'L,co,n> m = 0,l, and the weighted seminorms that appear in inequality B2.27) we show in the next theorem that the projection operators Ph of B2.22), considered as acting from the subspace H0(Q)n W1,m(Q) of the space Hq(£2) onto the space Vh, are bounded independently ofh when the space Ho(Q)nWl'°°(Q) is equipped with the norm + fc|lnfc||»|lf00(O B2.59) and the space Vh is equipped with the norm »-»|1пЛГ1/2Мо.<0,0 + ЛМ1.(в,0. B2.60) Remark 22.1, Such norms may be viewed as "weighted W1,00(Q)-like" norms.
170 P.G, Ciarlet Chapter III Theorem 22.6. Let the constant h0>0 be as in Theorem 22.5. Then there exists a constant C15 such that, for all h^h0, |lnftr1/2|PAi;|o,a),fi + /j!F^|1,o0,fi<C15(|t;[o,QO,n+fe|ln/j||D|1,00,n) for all »ей}@)п№"'ю@). B2.61) Proof. Let there be given a function v in the space Яо@)пИ/1'С10(й). For each h^h0, we define the function 1 фоь:Х€П~*ф0„(х)= vOi|2.fl2' B162) ||X —Xfc|| + vh with \PHv(x°h)\ = \Phv\o,^a, eh = C9h\lnh\112, B2.63) where /i0>0 and C9 are the constants found in Theorem 22.5. Since eh^-Cxxh for h ^ h0 (cf. B2.57)), we may apply inequality B2.15): We find in this fashion that there exists a constant ci6 = Q(Cn) B2.64) such that in*>koo,^c16@»^U,,;o,«. B2.65) By inequality B2.27), and by inequalities B2.10) and B2.11), there exists a constant {в,,^e0-^iorh^h0; cf. B2.57)) c17 = max{C2B,0), C3(i,l)} B2.67) such that N«J,.:o.o + l»l#„l>;i.n<c17^-|i;|o.„,o + |ln0»|1'2|»|1,oo,oj. B2.68) Combining inequalities B2.65) to B2.68), we find that ip i <r (в>ч i , №g»l1/2, i ]•» ftyl0,co,n^:<--10Cl6c17l T" Mo,оо,Я Н 7 It'll,oo.fi Using the relations 0„ = C9h\ln h\112 (cf. B2.26)) and the inequality |ln в„\**2|ln h\ (cf. B2.57)), we eventually find that, for all h^h0, |lnh|-1/2|Pfci;|o,a>.o<c18(|i;|o,e,l0 + fc|lnfc||»|1,00.0), B2.69) with Ci8 = C10c16c17max{C9!V/2Ci}. B2.70)
Section 22 Basic error estimates 171 Likewise, for each h^h0, define the function ф1„:хеа-,ф,к(х)={{х_х^2 + щ, B2.71) with max{|81PJk«(xjl)|,|82Fikc(xi|)|} = |»*li.x,o, eh = C9h\lnh\1/2. B2.72) Then inequality B2.17) shows that there exists a constant Ci9 = C5(C11) B2.73) such that \PkV\u«,.0<c19(eh/h)\PkvUlllilfi; B2.74) moreover, by inequality B2.27), l^»UIfc:i^<C1o(N^io.fl + Klb:i.0)- B2-75) Then, arguing as before, we find that for all h^h0, Л|Р*»|1..л<с20(|р|о,».п + Л|1пЛ||»|1><0,0) B2.76) with C2o = C10Ci7C19max{l,x//2C9}. B2.77) The conjunction of inequalities B2.69) and B2.76) implies that inequality B2.61) holds, with Cl5=C18+C20' □ Remark 22.2. In Theorem 22.5, the behavior of 6h as a function of h is bounded below by a constant times Gi|ln h\112). The key to the success of the present argument is that such a function of h tends nevertheless sufficiently rapidly vers zero with h so as to produce the right factors (as functions of h) in inequalities B2.69) and B2.76). In the third, and final, stage of this study, we show that the uniform boundedness of the projection mappings Ph (which we just established) yields in turn the desired error estimates (recall that the discrete solution uh is nothing but the projection Phu of the solution u). Theorem 22.7. Assume that the solution ueHo(Q) of the boundary value problem associated with the data B2.1) is also in the space W2,X(Q). Then there exists a constant С independent ofh such that |и-и*1о...о<СЙ2|1пй|3/2|и|2>в>.0> B2.78) <СЛ|1пА|М2.00.о. B2.79) Proof. The norm of the identity mapping acting from the space Hq{Q)c\ Wl'x(Q)
172 P.G. Ciarlet Chapter HI equipped with the norm of B2.59) into the same space, but equipped with the norm of B2.60), is bounded above by [In h0| ~1/2 for all h ^ h'0 = min{h0,1/e}. We next have the identity u — uh = u — Phu = {I~Ph)(u — vh) for all vhe Vh, so that we infer from Theorem 22.6 that, for all h^h'0, |ln/ir1/2|M~MA!o,oo,J3 + %~W»,ll,oc.fi sc(|ln/i0r1/2 + C15) inf (\u-vh\0t<xl,Q + h\lnh\\u-vh\ltao,Q). vt,eVh Since there exists a constant c2i such that inf (\u-vh\0,x,n + h\lnh\\u-vh\lra0,a)^c2lhz\\nh\\u\2iO:)tl}, VheVh inequalities B2.78) and B2.79) follow with C = c21(|ln/j0r1/2 + c15). D In fact, the error estimate of B2.78) is not optimal: Nitsche [1977] has obtained the improved error bound l«-M»loI„,n<Ch2|lnft|M2iX>0, B2.80) at the expense of a technical refinement in the argument, special to linear triangles. Observe, however, that the discrepancy between the error estimates B2.78) and B2.80) is in a sense insignificant: Both estimates yield an 0(hz~e) convergence for any e>0. To conclude, it is worth pointing out that all the essential features of Nitsche's method of weighted norms have been presented: Indeed, the extension to more general cases proceeds along the same lines. In particular, the use of higher-order polynomial spaces (i.e., PT = Pk(T) for some к > 2, n arbitrary) yields a simplification in that the "|ln h\" term present for к — 1 disappears in the norms then considered. Thus inequality B2.61) is replaced by an inequality of the simpler form (cf. Nitsche [1975]) + h\Phv\l,x,a^C(\v\0,*,o + h\v\Ux,i2). B2.81) Such inequalities are obtained after inequalities reminiscent of that of B2.27) have been established for appropriate weighted norms of the form i*|^+1;o,fi + N^;i,fi> jn<a<jn + l, with functions фн again defined as in B2.25). The "|ln h\ factor" in the estimate of the error \u — U/,|ol0o,fi for linear triangles has been the subject of various conjectures as to whether it should appear or not, although Claes Johnson (cf. Scott [1976a, p. 684]), Jesperssen [1978], Fried [1980] had pointed out that it is unavoidable, the reason being that piecewise linear polynomials cannot approximate sufficiently well a Green's function. The controversy was settled by Haverkamp [1984], who showed by means of an example that the estimate no longer holds if (In h\ is replaced by o(|ln h\). The subject of uniform convergence of finite element methods has a long story. In
Section 23 Basic error estimates 173 one dimension, an 0(h2) uniform convergence for piecewise linear approximations was first obtained by Ciarlet [1968]. Further results were then established by Nitsche [1969], Ciarlet and Varga [1970], Douglas and Dupont [1973,1976b], Douglas, Dupont and Wahlbin [1975b], Natterer [1977]. For special types of triangulations in higher dimensions, see Bramble, Nitsche and Schatz [1975], Bramble and Schatz [1976], Bramble and Тномёе [1974], Douglas, Dupont and Wheeler [1974b], Natterer [1975b]. The first contribution to the general case is that of Nitsche [1970]. Then Ciarlet and Raviart [1973] improved the analysis of Nitsche by using a discrete maximum principle, as shown in Sections 20 and 21, where various references on this approach are found. Then Natterer [1975a], Nitsche [1975, 1976b, 1977] and Scott [1976a] obtained simultaneously optimal (or nearly optimal) orders of convergence. The greatest generality is achieved by the method of weighted norms of Nitsche [1975] (which we have followed in this section; see also Nitsche [1977,1981b]), and in the work of Scott [1976a]. While weighted Sobolev norms are also introduced by Natterer, Scott's main tool is a careful analysis of the approximation of the Green's function. Nitsche's method has been combined with a duality argument using a regularized Green's function, by Frehse and Rannacher [1976, 1978], and Rannacher and Scott [1982]. Nitsche's method has also been extended to more general second-order boundary value problems, by Rannacher [1976b]; to the obstacle problem (cf. the next section); to higher-order triangles, by Raugel [1978b]; to Orlicz normes, by DurAn [1987]; to plate problems, by Rannacher [1976a]. The uniform boundedness in appropriate norms of particular Hilbertian projections, on which Nitsche's argument is essentially based, was also noticed by Douglas, Dupont and Wahlbin [1975a], who have established through a different approach the boundedness in the norms |"|0,4,й, К<7< °o, of the L2(£2)-projections onto certain finite element spaces; see also Crouzeix and Thomee [1987]. The boundedness in L°°(Q) of the Hl(&)-projections have been established by Schatz [1980], Schatz and Wahlbin [1982], Suzuki and Fuiita [1986]; and in Wl'p(Q), by Rannacher and Scott [1982], who then obtained, by an interpolation argument, estimates of the error |н—мй|0,р,я, 2<p<oo, and \u — uft|1)P,0, 1 < p ^ oo. 23. Estimate of the error ||н—h*||i,0 for the obstacle problem and Falk's method To conclude this chapter, we consider the finite element approximation of problems posed as variational inequalities. Following an analysis due to Falk [1974], we shall first prove an abstract error estimate (Theorem 23.1), which is valid for a general class of approximation schemes for variational inequalities of the form B3.1) below, and then we shall apply this result to the finite element approximation of the obstacle problem (Theorem 23.2). The abstract setting is the following: Let V be a Hilbert space, with norm |[ • \\, let a(% •): Vx V-+U be a continuous, symmetric and F-elliptic bilinear form (with the
174 P.G. Ciarlet Chapter HI usual F-ellipticity and continuity constants a and M), let /: V-+U be a continuous linear form, and let U be a nonempty closed convex subset of V. Then there is a unique element и that satisfies (cf. Theorem 1.2) ue U, B3.1) a(u, v — u)^l(v — u) for all veU. Let Vh be a finite-dimensional subspace of the space V and let Uh be a nonempty closed convex subset of Vh. Observe that, in general, the set Uh is not a subset of U. Then, quite naturally, the discrete problem consists in finding an element щ such that uhe Uk, B3 2) a{uh,vh-uh)^l{vh-uh) for all vheUh, and, again by Theorem 1.2, this problem has a unique solution uh. In the proof of the next theorem, we shall use the mapping A e У(¥; К') defined by the relations Av(w) = a(v, w) for all v,weV, B3.3) and which was already used in the proof of Theorem 1.3. Notice that in the present situation we do not have Au = I in general, as in the case of the linear problem (Lr=l/). Also, we shall consider a Hubert space H, with the norm |-| and inner product (•, •), such that V=H and V^H. B3.4) The space H will be identified with its dual, so that it may be in turn identified with a subspace of the dual space of V, as we showed in Section 19. We now estimate the error \\u — uh\\, Theorem 23.1. Assume that l(v) = (f,v) for some fsH and that AueH. B3.5) Then there exists a constant С independent of the subspace Vh and of the set Uh such that ||и-и»КС{ inf {\\u-vh\\2 + \Au-f\\u-vh\} + \Au-f\M\uh-v\}112. B3.6) Proof. We have a || и — uh ||2 ^ a(u — uh, и — uh) = a(u, u) + a(uh ,uh)~ a(u, uh) — a(uh, u),
Section 23 Basic error estimates 175 and thus, by B3.1) and B3.2), a(u, u) ^ a(u, v) + f(u - v) for all v e 17, a(uh, uh)^a(uh, vh) + f(uh-vh) for all vke Uh. Therefore we deduce that, for all ve U and all vheUh, cc\\u-uh\\2^a(u,v-uh) + a(uh,vh-u) + f(u-v)+f(uh-vh) = a{u, v—uh)—f(v — uh)+a(u, vh — u)—f(vh — u) + a{uh — u,vh — u) =(f—Au,u — vh)+(f —Au, uh — v) + a(u — uh,u — vh). We thus have, for all ve U and all vheUh, ct\\u-uh\\2^\f-Au\{\u-vh\ + \uh-v\} + M\\u-uh\\ ||u-»»||. Since 1 fa M ) I|w-mJ| \\u-vh\\^-<—\\u-uh\\2 + ~\\u-vh\\2\, we obtain, upon combining the two previous inequalities, U\\u~uh\\2^\f-Au\{\u-vh\ + \uh-v\}+UM2M\\u-vh\\2, B3.7) from which inequality B3.6) follows. □ Several comments are in order about this theorem: (i) The proof has been given in such a way that it includes the case where the bilinear form is not symmetric. (ii) If 17 = V then Au—f=0, so that, with the natural choice Uh=Vh, the error estimate of B3.6) reduces to the familiar error estimate of Cea's lemma (Theorem 13.1). (iii) If the inclusion UhcU holds, then the term mfveV\uh — v\ (which can be expected to be the harder to evaluate) vanishes in the error estimate. This is not the case, however, of the finite element approximation of the obstacle problem which we shall describe. (iv) Had we not introduced the space H in our argument, we would have found, instead of inequality B3.7), the inequality fr\\u-uH\\2^\\Au-l\\'{\\u-vh\\ + \\uh-v\\} + №2M\\u-vh\\\ B3.8) where || • ||' denotes as usual the norm of the dual space of V. However inequality B3.8) is likely to yield a poorer order of convergence, since the term mivltSVyi\u — vh\ can be reasonably anticipated to be of a higher order than the term infPFi6£;b \\u — vh ||. This observation will be confirmed in the proof of Theorem 23.2. We next apply the abstract error estimate of Theorem 23.1 to the finite element approximation of the obstacle problem; this problem corresponds to the following
176 P.G, Ciarlet Chapter III data (cf. C.41)) V=Hl{Q), QcU2, U~{ve Ho(Qy,V^x a.e. in Q}, Vu-Vvdx, fvdx, feL2(Q). XeH2(Q\ *s£0 onf, a(u, v) -■ n Г l(v) = ft B3.9) We shall assume that О is a polygonal domain (so that Q can be exactly covered by triangulations made up of triangles; cf. Remark 23.1 for the case of a domain with a curved boundary), and that the solution и is in the space H2(Q). Notice that while this is a reasonable regularity assumption, it would be unrealistic to assume that и е H3(Q) for instance (cf. the discussion given in Section 3). Finally, observe that since H2(Q)с%>°(П), the point values of the function x are well defined. With a triangulation 2Th of the set Q— {Jth^T, we associate the finite element space Xh whose generic finite element is the linear triangle, and we let Vh = {vheXh;vh = Oonr}=X Oft> and Uh = {vhe Vh;vh(b)>x(b)foT all ЬеЛ] B3.10) B3.11) where Jfh denotes the set of nodes of the space Xh. Notice that the set Uh is not in general contained in the set U, as shown in Fig. 23.1 in the one-dimensional case. Let us now apply the abstract error estimate of Theorem 23.1. Theorem 23.2. Assume that the solution и is the space H2(Q). Then, for any regular family of triangulations, there exists a constant C{u, f, x) independent of h such that \u-Uh\\i,a<C(u,f,x)h. B3.12) Fig. 23.1. The set Uh = {vh e Vk; vk(b)^x{b) for all be Л\) is not in general a subset of the set 17.
Section 23 Basic error estimates 111 Proof. We apply Theorem 23.1 with H = L2(Q), so that we need to verify that AueL2(Q) (feL2(Q) by assumption). Since the solution и is assumed to be in the space H2(Q), we have Au(v) = Vm • Vv dx = — Auvdx for all v eV, ft Q and thus \Au(v)\^\Au\0,Q\v\0<a foralluEF. Hence Au is indeed an element of the space H. Let FIhu denote as usual the AVinterpolant of the function u, which is in the space Vh. Since nhu(b) = u(b) 7z x(b) for all beJfh, it is also an element of the set Uh. Thus, inf {||M-t)fc||f,n+|^u-/|0,n|u-t)j|o.n} ^||и-Лли|||,я + {Ми|о,о+|/|о.о}|м-и*|о,о ^C(\u\l,n+{\Au\Q,n + \f\0,n})u\2,n)h2. B3.13) In order to estimate the term mtveV\uh — v\0<n, it is convenient to introduce the function (Fig. 23.2) ut = raax{uh,x}. B3.14) Both functions uh and % being in the space HX{Q), their maximum uf is also in H1{Q) (this is a nontnvial fact, whose proof may be found in Lewy and Stampacchia [1969, p. 169]), and the assumption x<0 on Г further implies that и$еНЦО). Finally, uf^xxaQ (by construction; cf. B3.14)) and thus ufieU. Let Ah = {xeQ;uh<x}, Fig. 23.2. The discrete solution uh, the obstacle x, and the function u* = max{uh,x}-
178 P.G. Ciarlet Chapter III so that I"a-«*Io,o= \uh-%\2dx, Ah since uh-ut = 0 on Q-Ah. Since the Xrinterpolant П\x of the function x satisfies uk(b)>X(b)~nhX(b) for all be Jfh, it follows that uh-nhx^0 infl, since we are using piecewise affine functions. Consequently 0<\(х-и„){х)Ых-щ)(хН(х-Па)(х)=--\(х-Пьх)(х)\ for all хеЛ„, and thus г i Х-Пих\26х^\х-Пнх\1я. !и*-"*1о,п= \uk-x\2dx^ Ah Ah Therefore, inf \uh-v\0tQ^\uh-u^\OM^\x--nhx\o,a^C\x\2,Qh2, B3.15) and the conclusion follows from inequalities B3.13) and B3.15). □ Remark 23.1. As shown by Falk [1975], the error estimate B3.12) of Theorem 23.2 holds unchanged in the following situation: The set Q is convex with a sufficiently smooth boundary (so that и e H2(Q)). Let Qh — \JTeS-h T denote a triangu- lation made up of triangles, in such a way that all the vertices of 3Th that are on the boundary of the set Qh are also on Г. Let then Xh denote the associated finite element space, whose generic element is the linear triangle, and let X0h denote the subspace of Xh whose functions vanish on the boundary of the set Qh. The space Vh then consists of the functions in the space X0h extended by zero on the set Q — Qh. In this fashion, the functions in the space Vh are defined over the set 0. The proof then relies on an inequality analog to B3.15), adapted to the present situation, and on the inequalities !Ит,п-^С/12-т||ц||2,„, т = 0,1, $ueH2{Q)r^Hl(Q). The abstract error estimate of Theorem 23.1 has been independently rediscovered by Roux [1976] in the study of the numerical approximation of a two-dimensional compressible flow problem which can be reduced to variational inequalities by the method of Brezis and Stampacchia [1973]. Incidentally, the functional setting for
Section 23 Basic error estimates П9 this problem is interesting in itself in that the corresponding space К is a weighted Sobolev space, and the domain of definition of its functions is unbounded. The same problem is similarly studied via variational inequalities by Ciavaldini and Tournemine [1977], who have extended the abstract error estimate of Theorem 23.1 so as to include the case where the bilinear and linear forms are approximated, through the process of numerical integration. The related problem of a linearly elastic body resting on a support has been thoroughly studied by Fremond [1971, 1972]. The error estimate of Theorem 23.2 has also been established by Mosco and Strang [1974]; see also Mosco and Scarpini [1975]. Brezzi, Hager and Raviart [1976] have given another proof of Theorem 23.2. They have also shown that \\u~uh\\i,B-O(h3l2~e),E>0 arbitrarily small, when quadratic triangles are used. Natterer [1975a] has studied the error in the norm | - lo,o> using an argument based on the Aubin-Nitsche lemma. For another approach, see Berger [1976]. Nitsche [1977] has been able to apply his method of weighted norms to this problem. In this fashion, he obtains an estimate of the form l«-«tk».o<Ch2|lnfc|(||H||2>0OiO+||^||2,e).o). However, the corresponding discrete solution uf is found in the subset Ujf = UnX0h, instead of the present subset Uh. Then various estimates of the error I" — и*1о,со,я have been obtained, by Baiocchi [1977], Finzi-Vita [1982], Cortey- Dumont [1983]. The elastic-plastic torsion problem, which we described in Section 3, is another instance of a problem modeled by variational inequalities: The space V, the bilinear form a{-,•), and the linear form I, are as in B3.9), while the set U is of the form U={veHh(Q); \\Wv\\ <1 a.e. in Q}. B3.16) Assuming again that U is a polygonal domain, we let Uh = {vheVh; ||Vi>»Kl a.e. in Q}, B3.17) the space Vh being defined as in B3.10). Hence the inclusion UhcU B3.18) is satisfied in this case. This simplification is compensated, however, by the fact that the Xh-interpolant of a function of the set U п#°(й) is not necessarily contained in the set Uh. It can be shown, however, that if ue W2,P(Q) for some p>2, there exist numbers e(/i)>0 such that Шпб(й)-0, {\+г{К))-'П„иеи„. /i->0 This property implies in turn that (cf. Glowinski [1984, p. 52]): IlK-^lk^O^2-1"). The elastic-plastic torsion problem has been extensively studied in Lanchon [1972]. Using techniques from duality theory, Falk and Mercier [1977] have
180 P.G. Ciarlet Chapter III constructed a finite element method that directly yields an approximation of the stresses cr13 and с2з with an 0{h) convergence in the norm |#|о,д, a particularly appropriate estimate for this problem, where a direct knowledge of the stresses is more important than a knowledge of the stress function. For related results, see Mercier [1975a, 1975b], Gabay and Mercier [1976], and Brezzi, Johnson and Mercier [1977], where elasto-plastic plates are considered. A third type of problem that reduces to variational inequalities occurs with sets U of the form U = {veH\Q); v^ij/ a.e. on Г}. Such problems with unilateral constraints occur in particular in elasticity; they are then called Signorini problems (cf. C.47)-C.48)). A finite element approximation of such problems is studied in Scarpini and Vivaldi [1977]. An extension of the present setting consists in looking for the solution и of variational inequalities of the form (see Duvaut and Lions [1972]): a(u,v — u) +j{v) —j(u)>/(t; — u) for all veU, where j: V->M is a nondifferentiable functional. Such problems are found in particular in the study of Bingham flows, with _/(u) = Jn || Vi? || dx. Their finite element approximations have been analyzed in Bristeau [1975, chapter 2], Fortin [1972], Glowinski [1975]. Extensive treatments of variational inequalities and of their approximations are found in Glowinski, Lions and Tremolieres [1976a, 1976b] and in Glowinski [1984]. 24. Additional references There is an enormous literature on the numerical analysis and implementation of the finite element method; our purpose in this section is simply to list some relevant references in various domains that shall not be discussed in this article. We first mention that error analyses have often been performed on variants of the finite element method that are interesting by themselves. In this direction, see notably Aubin [1967b, 1972], Babuska [1970b, 1971a, 1971b, 1972b, 1974a], Fix and Strang [1969], di Guglielmo [1971], Strang [1971], Strang and Fix [1971], Rose [1975], Mock [1976]. More specifically, "nonuniform" error estimates have been obtained by Babuska and Kellogg [1975] and Helfrich [1976]; indefinite bilinear forms are considered by Clement [1974], Schatz [1974]; Hoppe [1973] has suggested using piecewise harmonic polynomials, and his idea has been justified by Rabier [1977]. We also mention the very challenging and promising field of study opened by Werschulz [1982], who has analyzed the finite element method from the viewpoint of computational optimality. There are various ways of treating nonhomogeneous Dirichlet boundary conditions. The most straightforward method consists in interpolating the boundary condition at the boundary nodes: See Aubin [1972], Strang and Fix [1973, Section 4.4],
Section 24 Basic error estimates 181 Тномёе [1973а]. Lagrange multipliers may also be used, as in Babuska [1973a], PitkAranta [1979], Bramble [1981], as well as penalty techniques, as in Babuska [1973b], Utku and Carey [1982], Shi [1984f]. Neumann problems are considered by Molchanov and Galba [1985], while truly mixed problems are treated by Снои and Wang [1979], ZenIsek [1987]. The computation of gradients of solutions is often of great importance, for example in the stress analysis of elastic structures. For this aspect, and the related question of superconvergence, see Douglas and Dupont [1973, 1974], Douglas, Dupont and Wheeler [1974a, 1974b], Bramble and Schatz [1976, 1977], Тномёе [1977], ZlAmal [1977, 1978], Lesaint and ZlAmal [1979], Louis [1979], Krizek and NeittaanmAki [1984], Babuska and Miller [1984a, 1984b], Barker [1984], Ciavaldini and Crouzeix [1985], Levine [1985], Wheeler and Whiteman [1987], and the article of Wahlbin. Superconvergence may be also obtained by applying the Richardson extrapolation procedure to an appropriate asymptotic expansion of the finite element solution. After the pioneering work of Lin, Lu and Shen [1983], such asymptotic expansions have been studied by Lin and Lu [1984a, 1984b], Lin and Zhu [1984,1985,1986], Blum, Lin and Rannacher [1986], Nakao [1987]. Since the solution of realistic boundary value problems usually presents singularities at some portions of the domain Q, "interior" or "local" error estimates have proved to be an invaluable tool for handling such situations, where the "global" error estimates obtained in this chapter are of no avail. In this respect, see the pioneering contributions of Nitsche [1972a], Nitsche and Schatz [1974], Bramble and Тномёе [1974], Bramble, Nitsche and Schatz [1975], Descloux [1975, 1977], Descloux and Nassif [1977], Douglas and Dupont [1976], Schatz and Wahlbin [1977, 1978, 1979, 1981], and the article of Wahlbin. More generally, for problems where the solutions present singularities due to corners, to changes in boundary conditions, to singular right-hand sides (such as the Dirac distribution), or to coefficients that present singularities, see Fix [1969], Ciarlet, Natterer and Varga [1970], Wait and Mitchell [1971], Babuska [1970a, 1972a, 1974b, 1976], Dailey and Pierce [1972], Babuska and Rosenzweig [1972], Fried and Yang [1972], Veidinger [1972], Barnhill and Whiteman [1973, 1975], Crouzeix and Thomas [1973], Fix, Gulati and Wakoff [1973], Strang and Fix [1973, Chapter 8], Scott [1973b], Nitsche [1976a], Schatz and Wahlbin [1976], Thatcher [1976], Jesperssen [1978], Raugel [1978a, 1978b], Schreiber [1980], Whiteman and Akin [1980], Bendalli [1981], Mercier and Raugel [1982], Whiteman [1982], Blum and Dobrowolski [1983], Eriksson and Тномёе [1984], Wahlbin [1984], Eriksson [1985], Casas [1985a, 1985b, 1985c], Babuska and Osborn [1986], Li [1986], French [1987], Li, Mathon and Sermer [1987], and the article of Wahlbin.
Chapter IV The Effect of Numerical Integration for Second-Order Problems Introduction Up to now, we have considered finite elements methods that are conforming, in the sense that the space Vh is a subspace of the space V, and the bilinear form and the linear form used in the definition of the discrete problem are identical to those of the original problem. In this and the next two chapters, we study various violations of this "conformity", which are constantly used in everyday computations. To begin with, we examine in this chapter the effect of numerical integration. Assuming as before that the domain Q is polygonal and that the inclusion VhaV still holds, we use in addition a quadrature scheme for computing the coefficients of the resulting linear system. Each such coefficient being of the form £ L(x)dx, TeFh J T the integrals \Tcp{x)dx, Te^~h, are approximated by finite sums of the form ^=1Со1>гф(Ь1>7-), whose weights ft)ijT and nodes bltTe T are derived from a single quadrature formula defined over a reference finite element. Examples of useful quadrature formulae are given in Section 25. This approximation thus yields an approximate bilinear form ah(', •) and an approximate linear form /,,(•), which are defined over the space Vh, but not over the space V. Our study of this kind of approximation follows a general pattern that is also common to the two other methods described in Chapters V and VI. First, we prove an abstract error estimate, the first Strang lemma (cf. Theorem 26.1), which relies on the crucial assumption that the approximate bilinear forms are uniformly Vh-elliptic, in the sense that there exists a constant <x>0 independent of h such that a-h(vh,vh)^S\\vh\\2 for all vheVh. This is why we next examine (Theorem 27.1) under which assumptions on the quadrature scheme over the reference finite element this property is true. The abstract error estimate of Theorem 26.1 generalizes Cea's lemma: In the right-hand side of the inequality, there appear two additional consistency errors, 183
184 P.G. Ciarlet Chapter IV which measure the quality of the approximation of the bilinear form and of the linear form, respectively. We are then in a position to study the convergence of such methods. More precisely, we shall essentially concentrate on the following problem: Find sufficient conditions that insure that the order of convergence in the absence of numerical integration is unaltered by the effect of numerical integration. Restricting ourselves for simplicity to the case where PT = Pk(T) for all Te$~h, our main result in this direction (Theorem 29.1) is that one still has Ци-и*111.о = 0(й*), provided the quadrature formula is exact for all polynomials of degree Bk — 2). The proof of this result depends in particular on the Bramble-Hilbert lemma (Theorem 28.1), which is a useful tool for handling linear functional that vanish on polynomial subspaces. In the present case, it is repeatedly used in the derivation of the consistency error estimates (Theorems 28.2 and 28.3). 25. The effect of numerical integration and examples of numerical quadrature schemes Throughout this chapter, we shall assume that we are solving the second-order boundary value problem that corresponds to the following data: V=Hh(Q), a(u, v) = ]T uijdiudjvdx, B5.1) u=i l(v) fvdx, where Q is a polygonal domain in W, and the functions ay e L°°(£2) and / e L2(Q) are assumed to be everywhere defined over Q. We shall also assume that the ellipticity condition is satisfied i.e., that there exists fi such that /?>0, B5.2) П П X aijMtiZjT*P £ Z? for all xeQ and all ^Ui^n. U=i i = i Hence the bilinear form of B5.1) is Ho@)-elliptic. Solving this problem amounts to formally solving (cf. C.24)) the homogeneous Dirichlet problem for the operator u~*— E"J=19,-(ajy9;M), viz., л м = 0 on Г.
Section 25 Numerical integration 185 Indications for handling more general operators, such as м-+ — Z"J=i ЭДа^м) + bu, are given in Section 29. We are given a family of finite element spaces Xh made up of finite elements (T, Рт, Ет) Те $~h, where 3~h are triangulations of the set Q (because the set Q is assumed to be polygonal, it can be exactly covered by triangulations), and we define as usual the spaces Vh = {vhe Xh; vh=0 on Г}. The assumptions made throughout this chapter about the triangulations and the finite elements are the same as in Section 17. Let us briefly record these assumptions for convenience: (HI) The associated family of triangulations is regular. (H2) All the finite elements (T, PT,IT), Te{Jh$~h, are affine-equivalent to a single reference finite element (f, P, t). (H3) All the finite elements (T, PT, ZT), Те \)h3Th, are of class <$°. As a consequence, the inclusions XkcH1(Q) and VhcHo(Q) hold, as long as the inclusion PcH\T) holds. Given a finite element space Vh, solving the corresponding discrete problem amounts to finding the coefficients £k, 1 ^ к ^ M, of the expansion uh = ££i t Ck wfc of the discrete solution uh over the basis functions wk, 1 ^k^M, of the space Vh. These coefficients are solutions of the linear system (cf. E.4)) м £ a(wk,wm)(k = l(wm), l^m^M, B5.4) where, according to B5.1), fl(wbwm)= £ £ dijdiWbdjWndx, B5.5) TeFh fwm dx. B5.6) In practice, even if the functions a{j, f have simple analytical expressions, the integrals Jr--dx that appear in B5.5) and B5.6) are seldom computed exactly. Instead, they are approximated through the process of numerical integration, which we now describe: Let jY^*) dx denote any one of the integrals appearing in B5.5) or B5.6), and let FT:xe T->FT(x) = BTx + bTbe the invertible affine mapping that maps f onto T. Assuming, without loss of generality, that the (constant) Jacobian of the mapping FT is > 0, we have cp(x) dx = (det BT) (p(x)dx, B5.7) T T where the functions ц> and ф are in the usual correspondence, i.e., <p(x) = ф(х) for all
186 P.G. Ciarlet Chapter IV x = FT(x), xef. Therefore, computing the integral\T<P(X) dx amounts to computing the integral \тф(х)йх. Then a numerical quadrature scheme over the set T consists in approximating the integral $тф{х) dx by a finite sum of the form Ef= i c&i<p(ftj), an approximation that we shall symbolically record as U(x)di~ £>(ф(£,). B5.8) J (=1 t The numbers <S; are called the weights, and the points Si are called the nodes, of the quadrature formula 2fi 1 й,ф(£(). For simplicity, we shall only consider in the sequel examples for which the nodes belong to the set t and the weights are >0. Nodes outside the set Г and negative weights are not excluded in principle, but, as expected, they generally result in quadrature schemes that behave poorly in actual computations. Hence these situations will not be considered here. From B5.7) and B5.8), we infer that the quadrature scheme over the set f automatically induces a quadrature scheme over the set T, namely, L <p(x)dx~ X <»1,тФ1,т), B5.9) where the weights coliT and nodes bliT are defined by coljT=(detBr)dib bhT = FT(Bl), 1^ЫЬ. B5.10) Accordingly, we introduce the quadrature error functionals ET(<p)= \(p{x)dx- £ аI<тфит), B5.11) j (=i т Ё(ф)=Ш)йх~^оЬ1ф(£1), B5.12) f which are related by Ет(ср) = (й(ЛВт)Ё(ф). B5.13) Let us now give a few examples of often used quadrature formulae, which, by B5.10), need only be defined over the reference set T. Notice that each scheme preserves some space of polynomials; this polynomial invariance will subsequently play a crucial role in the error estimates. More precisely, given a space Ф of functions ф defined over the set f, we shall say that the quadrature scheme is exact for the space Ф, or exact for the functions феФ,И £(<p) = 0for all феФ. Before we turn to the examples, let us mention a useful relation: Let T be an n-simplex, and let A;(x), 1 ^i^n +1, denote the barycentric coordinates of a point x with respect to the vertices of the п-simplex. Then for any integers a;>0,
Section 25 Numerical integration 187 1 =*;'</* +1, one has XV(x)Xf(x)-X^i(x)dx= "l!g'!-l"+l!W' -,meas(D. B5.14) т Let 74 be an n-simplex with vertices ah 1 <;'^n +1, and barycenter Г Fig. 25.1. The numerical quadrature scheme Jfip(x)dx~meas(T)(p(d) is exact if феР^Т). (Fig. 25.1). Then the quadrature scheme <p(x)dx~meas{T)(p(a) B5.15) is exact for polynomials of degree <1, i.e., <p(x)djc-meas(f)<p(d)=0 for all феР^Т). B5.16) To see this, let <p= S-^/^d,)^- be any polynomial of degree <1. Using the relations (n+1) %i (x) Ax = meas( f), К i s£ n +1 (cf. B5.14)), we obtain meastf) n+1 <p(x)di = —— £ <p(d;) = meas(f)<p(d). n + i 1 = 1 f Let n = 2 and let Г be a triangle with midpoints of the sides dy, 1 <z^_/<3 (Fig.
188 P.G. Ciarlet Chapter IV Fig. 25.2. The numerical quadrature scheme jV<p(x)dx~£meas(f) Llei<jS3<p(a,j)is exact if феР2(Т). 25.2). Then the quadrature scheme <p(x)dx~4meas(f) £ Ф(&ц) B5.17) f is exact for polynomials of degree =%2, i.e., U(x)dx-imeas(f) £ Ф{аи) = 0 for all феР2(Т). B5.18) J 1«;<./чз f To see this, let П denote the P2(T)-interpolation operator associated with the set t = {p[di): Ю'<3; р(аи): Kkj'^3}. Then using B5.14), one can show that imeas(f) £ 0(atj) = Пф(х) dx, and the assertion is proved. Finally, let n = 2 and let f be a triangle with vertices ah 1 < i < 3, with midpoints of the sides au, Ki'</<3, and with barycenter d123 (Fig. 25.3). Then the quadrature scheme U(S)d£~&meas(f){3£.p(a() + 8 £ <p(ay) + 270(a123)l B5.19) f is exact /or polynomials of degree < 3, i.e., (p(x)dx-^meas(f) <3 £ <p(d,) + 8 £ ф(ау) + 27ф(а123)> = 0 t i=l l5i<j«:3 J f ГогаИфеРз(Т'). B5.20) To see this, one first observes that the set £ = {p(a,): Ui^3;^u): 1<*<7<3; £(d123)} is P-unisolvent, where i3=p2(f)©V{21l233}.
Section 25 Numerical integration 189 a, Fig. 25.3. The numerical quadrature scheme <p«)dx~6omeas(f)]3 £ф(а;) + 8 £ ф{йи) + 27ф(й123) I i=l l«i<j«3 f is exact if феР3(Т). This fact, combined with B5.14), then implies that the quadrature scheme of B5.19) can also be written <p(x) dx ~ Я <p(x) dx, where П is the P-interpolation operator. From this, one deduces that the quadrature scheme of B5.19) is exact for the space P3{T), but not for the space P^{t). Let us finally consider the case where f is an n-rectangle. It is well known that for each integer /c^O, there exist (k+1) points bte[0,1] and (fc + 1) weights a>i > 0, l<j^/c +1, such that the quadrature scheme k + l <p(x)dx~ £ o)i(p(bi) B5.21) ;= l [0.1 J is exact for the space P2k+1([0,1]) (see e.g. Crouzeix and Mignot [1984, Theorem 2.9]). This particular quadrature formula is known as the Gauss-Legendre formula. Then it is easy to infer from this result that the numerical quadrature scheme t + i <p(x)dx~ £ (coitcoh ■■•coin)<p(bil,bh,...,bin) B5.22) [0,1]" Kj"<ii is exact for the space £>2k+i([0> 1]"). This result thus provides examples of quadrature schemes over n-rectangles. Examples of numerical quadrature schemes used in actual finite element computations are found in the book of Zienkiewicz [1971, Section 8.10]. For general introductions to the subject of numerical integration, also known as:
190 P.G. Ciariet Chapter IV numerical quadrature, approximate integration, approximate quadrature, see the survey of Haber [1970], and the books of Davis and Rabinowitz [1975], Stroud [1971]. Existing quadrature schemes over triangles are reviewed in Dunavant [1985]; over tetrahedra, in Keast [1986]; over the unit cube, in Dunavant [1986]. Quadrature schemes over arbitrary compact subsets T of U" that are exact for spaces intermediate between the spaces Pk{T) and Qk(T) are studied in Guessab [1986]. The problem of finding quadrature schemes with a minimum number of nodes that are exact for the space Pk(t) is studied in Schmid [1978], and, for the space Qk(T), in Gout and Guessab [1986a, 1986b]. See also Guessab [1987] for further extensions. Let us return to the description of the discrete problem. Instead of solving the linear system B5.4) with the coefficients B5.5) and B5.6), all integrals JT • ■ ■ dx will be computed using a quadrature scheme given on the set f. In other words, we are solving the modified linear system м I ah(wk,wm)Ck = h(wm), l^m^M, B5.23) k=l where (compare with B5.5) and B5.6) respectively) L n ah{wk,wm)= Z £ Щ.Т Z (aiAwkdjWm)(bUT), B5.24) TeSTh (=1 i,j=l L '»(*.)= Z Z Щ.тФ«№1.т). B5.25) ТеУу, (=1 Remark 25.1. Conceivably, different quadrature formulae could be used for approximating the coefficients a(wk, wm) on the one hand, and the coefficients l(wm) on the other hand. However, our final result (Theorem 29.1) will show that this is not necessary. For our subsequent analysis, it will be more convenient to consider the following equivalent formulation of the discrete problem, represented by the linear system B5.23). We seek a discrete solution uh e Vh that satisfies dh (Щ, vh) = lh {vh) for all vh e Vh, B5.26) where, for all functions uh,vheVh, the bilinear form ah and the linear form fh are respectively given by L n ah{uh,%)= £ £ o),,T £ {aijdiUkdjVh)(blfT), B5.27) L h(v„)= Z Z а>,.т№Жт). B5.28) TeiTh 1=1 Note that the expressions B5.27) and B5.28) show why the functions au andf need
Section 25 Numerical integration 191 to be defined everywhere over the set Q, since the nodes bh T may be arbitrarily located in Q. Also, in order that definition B5.27) make sense, it is necessary that, over each set T, the first partial derivatives of the functions in the space Xh should have unambiguously defined extensions to the boundary of T, should some node blT be situated on the boundary of T. If this node coincides with a node bUT* belonging to an adjacent set T*, it should be clear that the values to be assigned to the derivatives ^iVh(bi,T> and 8jDft(bijT») are thus generally different. Since the definition of the discrete problem requires the knowledge of the values of the functions ay and /only at a finite number of points of Q, the discrete problem is in this sense quite reminiscent of a discrete problem obtained by a finite difference method. In fact, this is even true to the extent that most classical finite difference schemes can be exactly interpreted as finite element methods with specific finite element spaces and specific quadrature schemes. To be more specific, let Q = [0, /p] x [0, Jp~\ where / and J_are integers and p is a strictly positive number, let STh be a triangulation of the set Q made up of bilinear rectangles, with sets Г of the form [V,(' + 1)P]XUP,0'+1)P]. 0<K/-1, O^J^J-l, and let Uij denote the unknown, so far denoted £*, corresponding to the fcth node (ihjh), l<i^/-l, 1^_/<J—1. We further assume that the bilinear form is of the form a(u,v)= £ dtudivdx, n i.e., that the corresponding partial differential equation is the Poisson equation — Au =f in Q, and we only consider nodes (ipjp) that are at least two squares away from the boundary of the set Q, i.e., for which 2<г'</ —2, 2^j^J — 2. Then, in the absence of numerical integration, the expression Hf=la(wk,wm)l,k corresponding to the mth node (ipjp) is, up to a constant factor, given by the expression Wij—(Ui+lij+Ui+1j+i + Uij+1 + Ui-ij+l + Ui-1j Assume next that we are using the following numerical quadrature scheme over the reference square f =[0, l]2: ф(х)йх~Цф@,0) + ФФ, 1) + ФA, D+0A,0)), ЮЛ]2 which is exact for the space Qi{T). Then the equality £ttia/,(wt, wm)Ct = ^(wm) becomes 4Uij-(Ui+lJ+ UiJ+1 + U^1J+UiJ^)^p2f(ip,jp),
192 P.O. Ciarlet Chapter IV i.e., it is exactly the standard five-point difference approximation to Poisson's equation. It is interesting to notice that it is generally impossible to derive this scheme from a finite element method without numerical integration (Birkhoff and Gulati [1974]). More general comparisons between finite element methods, with or without numerical integration, and finite difference methods, are found in Birkhoff and Gulati [1974], Tomlin [1972], Walsh [1971]. 26. Abstract error estimate and the first Strang lemma To sum up, we started out with a standard variational problem: Find ueV such that, for all veV, a(u, v) - l(v), where the space V, the forms a{ •, •) and l( •) satisfy the assumptions of the Lax-Milgram lemma. Then given a finite-dimensional subspace Vh of the space V, the discrete problem consists in finding uh e Vh such that, for all vh,eVh, ak{uh,vh) = lh(vh), where ah{- ,•) is a bilinear form defined over the space Vh and /й( •) is a linear form defined over the space Vh. Notice that, in the present case, the forms ah( *, •) and lh( •) are not defined on the space V, since the point values are not defined in general for functions in the space H\Q). Our first task is to prove an abstract error estimate adapted to the above abstract setting, but first we need some definitions: We shall refer to ah( •, •) as an approximate bilinear form and to lh( •) as an approximate linear form. Denoting by || • || the norm of the space V, we shall say that approximate bilinear forms ah(-,-):Vhx Vh->M, associated with a family of subspaces Vh of the space V, are uniformly Vh-elliptic if there exists a such that S>\ B6.1) a || vh |j2 < ah(vh, vh) for all vh e Vh and all h. In particular then, the constant 5. is independent of the subspace Vh. Notice that such an assumption implies in particular the existence of the discrete solutions. The following error estimate is due to Strang [1972b]. Theorem 26.1 (First Strang lemma). Consider a family of discrete problems whose associated approximate bilinear forms are uniformly Vh-elliptic. Then there exists a constant С independent of the space Vh such that и и^Л ■ f Iii и , \a(Vh,Wh)-ah(vh,wh)\ llu-M^C inf <\\u-vh\\+ sup — \1Ы~1н(щ)\\ + sup - ,;_ "v "" 1. B6.2) Proof. Let vh be an arbitrary element in the space Vh. Using the assumption of
Section 27 Numerical integration 193 uniform ^-ellipticity, we may write: a II uh - vh ||2 ^ ah(uh -vh,uh-vh) = a(u-vh,uh-vh)+{a(vh,uh-vh)-ah(vh,uh-vk)} + {lk(uh-v„)-l(uh-vh)}, and thus, by the continuity of the bilinear form a{ •, •), ll^,.jl ,, , \a{Vh,uh-vh)-ah{vh,uh-vh)\ \lh(uh-vh)-l(u„-vh)\ a uh~vh ^M u-vh\\+ +- ——— II «A — f A fl \)Uh-Vh\\ ^,, \a(Vh,wh)-ah(vh,wh)\ !Wwfc) —/(wA)| ^M u-vk + sup — + sup — . wheVh IIWAII WhzV„ 11**11 Combining the above inequality with the triangular inequality \\u-u„\\^\\u-vh\\ + \\uh-vh\\ and taking the infimum with respect to vheVh, we find inequality B6.2). □ Note that the abstract error estimate B6.2) generalizes the abstract error estimate established in Cea's lemma (Theorem 13.1) for conforming finite element methods, since, in the absence of numerical integration, we would have ah( •, •) = a( •, •) and '*(•) = '(■)■ Remark 26.1. One can similarly obtain an abstract error estimate that generalizes the Aubin-Nitsche lemma (Theorem 19.1) in the present setting. Let Я be a Hilbert space such that V=H with Vc+H. Then one can show that |u-u,,|s:sup— inf {M\\u-uh\\\\(pe-<Ph\\ geH \9\ 9heVh + 1Ф*. <Ph)-ah(Uh, <Ph)\ + \K(Ph)-lk(<Ph)\}, where | • | denotes the norm in the space H, and for each geH, the function q>ge Vis the unique solution of the variational problem a(v, <pg) = (g, v) for all veV, where (•,•) denotes the inner product in H. 27. Uniform Fj-ellipticity of the approximate bilinear forms We now give sufficient conditions on a quadrature scheme that insure that the approximate bilinear forms are uniformly Ffc-elliptic: Notice in particular that in the next theorem assumptions B7.2) and B7.3) exhibit the relationship that should exist between the reference finite element (t, P, t) and the quadrature scheme defined on T. The next proof is based on, and generalizes, an idea of Strang (cf. Strang and Fix [1973, Section 4-3]).
( = 1 or 194 P.G. Ciarlet Chapter IV Theorem 27.1. Let there be given a quadrature scheme L 0(x)dx~ £ <S|<p(£j), ш,>0, l^l^L, f over the reference finite element (t, P, 1), and assume that there exists an integer k"^l such that: P^PAT) B7.1) on the one hand, and either L \J {61} contains a Pr_x(T)-unisolvent set, B7.2) = i |<K*)di=£<M(£«) forall<p<sP2k^2(f), B7.3) f or both B7.2) and B7.3) hold, on the other hand. Then there exists a constant a. > 0 independent of h such that, for all approximate bilinear forms of the form B5.27) and all spaces Vh, &\Vk\i.s2<ah(Vh,vh) forallvheVh. B7.4) Proof, (i) Let us first assume that the union (Jf=1{£j contains a Pf-^Tyum- solvent subset. Using the positivity of the weights, we find that L n peP and £югХ(Э;р(£г)J=0 1=1 !=1 => dip(bi)=0, Ki^n, l^I^L. For each ie[l,и], the function S;p is in the space Pk^x{t) by assumption B7.1) and thus it is identically zero since it vanishes on a P^-iCfj-unisolvent subset, by assumption B7.2). As a consequence, the mapping ( L n -jl/2 p- Z^L is^)!2 U=l i=l J defines a norm over the quotient space P/P0(t). Since the mapping p -* \p\imf is also a norm over this space and since this space is finite-dimensional, there exists a constant С > 0 such that C\p\b^ £ <Si£ \QtP(€,)\2 for all p€F. B7.5) i=l !=1 If we instead assume that the quadrature scheme is exact for the space Ргк-гС?), inequality B7.5) becomes an equality with С = 1, since the function E"=1|S,p|2 belongs to the space Piv-iiT) for all peP and since Ef=1c&,£"=1F(р(?,)J is
Section 27 Numerical integration 195 precisely the quadrature formula that corresponds to the integral fiiwdjc-ipif.f. r (ii) Let us next consider the approximation of one of the integrals и £ aijQiVhdjVhdx. T Let vh\T = pT, and let preP be the function associated with pT through the usual correspondence xet-yFT(x) = BTx + bT=xeT. Using the ellipticity condition B5.2) and the positivity of the weights, we can write L n £ ф1.т Z (aiAvhdjVh)(bliT) 1 = 1 i,j~l L n = Z <°i,t Z (ъАРт^РтККт) B7.6) 1=1 i,j=l 1=1 i=l Observe that E"= j |S,pr(b,,r)|2 is the square of the Euclidean norm || • || of the vector DpT(bLT). Since \DpT(b,)\\^\\BT\\\\DpT(bitT)\\ (for all ^R", we have Dp(b,)£ = Dp{bitT){BT£)), we can write, using relations B5.10) and B7.5) and Theorem 15.1, L и Z ^'.r Z |9;Рг(Ьг, = i >=i >||5Г1Г22>, i=i =(detBr)|| 5r ||- ^C(detBT)\\BT\ t)\2 t z \шь)\2 1=1 2 £ й, i № ;=i (=i -2|pr|?,f^C{ r$)l2 Вг IIII Bf Ml}- 2|Prlf,r B7.7) Since the family of triangulations is regular by assumption, there exists a constant С independent of T eS~h and h such that llfirllllBF1!!^—<C. B7.8) P Pt Combining inequalities B7.6), B7.7) and B7.8), we find that there exists a constant a > 0 independent ofT^3Th and h such that L n Z <°i,t £ (aiAvi$jVh){bi,T)>5.\vb\2UT for a\lvheVh. B7.9) 1=1 U=i
196 P.G. Ciarlet Chapter IV (iii) It is then easy to conclude: Using inequalities B7.9) for all Те 3~h, we obtain L n Te^h l-l i,j=l >& Z b*li,r = «|fi,li,c for all vheVh. Q Те Л Remark 27.1. The expressions £f=1 £5(Е"=1|9;рг(£,)|2 are precisely the approximations we get when we apply the quadrature scheme to the integrals |prli,f> which in turn correspond to the model problem — Ли =f in Q, и — 0 on Г. This observation is the basis for assumptions B7.2) or B7.3), which in essence insure that the mapping Г L я "I 1/2 P- I Щ I |8,-р(йI2 u=i ;=i J is a norm over the quotient space P/P0(T). Let us apply this theorem to the examples of quadrature schemes given in Section 25: - If (T, P, Г) is a linear n-simplex (P = Pi(T) and thus k' = 1), we may use the quadrature scheme of B5.15) since {a} is a P0(T)-unisolvent set. - If (f , P, t) is a quadratic triangle (i3 = P2{T) and thus fc' = 2), we may use the quadrature scheme of B5.17) since U;<j{dy} is a P^T)-unisolvent set. Notice that in both cases, both assumptions B7.2) and B7.3) are satisfied. - If (t, P, S) is a cubic or reduced cubic triangle (P с Р3(Г) and thus k' = 3), we may use the quadrature scheme of B5.19) since the set of numerical integration nodes (strictly) contains the PjCTJ-unisolvent subset (и,-{я;})и(уг<;-{ац}). Note however that the quadrature scheme is not exact for the space P^(T); hence assumption B7.3) is not satisfied in this case. 28. Consistency error estimates and the Bramble-HHbert lemma Now that the question of uniform Fh-ellipticity has been taken care of, we can turn to the problem of estimating the various terms appearing in the right-hand side of inequality B6.2). For the sake of clarity, we shall essentially concentrate on one special case (which nevertheless displays all the characteristic properties of the general case), namely the case where P = Pk(T) B8.1) for some integer fc> 1. Indications for handling the cases where Рк(Т)сРс:Рк(Т) or where Pk(t)czPdQk{t) are given at the end of Section 29. If the solution и belongs to the space Hk+1{Q) and if its JsVinterpolant is well
Section 28 Numerical integration 197 defined, we infer from B8.1) that (cf. Theorem 17.1) inf ||и-»»||1.о<||и-Л»«||1.0<Сй*|и|»+1,я- B8.2) Thus we have an 0(hk) convergence in the absence of numerical integration. Our basic objective is to give sufficient conditions on the quadrature scheme that insure that the effect of numerical integration does not decrease this order of convergence. Remark 28.1. This criterion for appraising the required quality of the quadrature scheme is to some extent arbitrary, but at least it is unambiguously defined. Surprisingly, the results that shall be obtained in this fashion are nevertheless quite similar to the conclusions that the engineers had empirically drawn from their numerical experience. Let us assume that the approximate bilinear forms are uniformly FA-elliptic, so that we may apply the abstract error estimate of Theorem 26.1. In view of B6.2) and B8.2), our aim is to obtain consistency error estimates of the form \а(Пки, wfc) - ah(nhu, wfc)| k :z —шг0—^С{аи'ф * B8J) sup i*(y-M»»)i<cW B84) wheVh II Wh 111, О Notice that, in the usual terminology of numerical analysis, the uniform ellipticity condition appears as a stability condition, while the conditions (implied by the above error estimates) \а(Пьи, wh) ~ ah(nhu, wh)\ hm sup — = 0, ИО »heKh II WA lll.fi hm sup —-— = 0, h-+0 wheVh IIWhHl.fi appear as consistency conditions. This is why we call consistency errors the two terms of the form supWh€l/h{ • • •} appearing in the left-hand sides of inequalities B8.3) and B8.4). By definition of the quadrature error functionals ET(-) of B5.11), we have, for all wheVk, a(Ilhu,wh)-ah{nhu,wh)= £ Ет{ Ё Яу0*№«)8/И>Д B8.5) /(**)-/»(*»)= S ET(fwh). B8.6) TeSTh It turns out that we shall obtain (Theorems 28.2 and 28.3) "local" quadrature
198 P.G. Ciarlet Chapter IV error estimates of the form |£г(в,у8;р'Э;рI <c(flylr;S<P')fcrl9yPlo.r for all p',pePT, B8.7) |£r(/p)l*Wlr)A*r||p||1>T for all pePT, B8.8) from which the "global" consistency error estimates B8.3) and B8.4) are deduced by an application of the Cauchy-Schwarz inequality; this is possible only because the constants C(a0-|r;S;p') and C(f\T) appearing in the above inequalities are of an appropriate form. To begin with, we prove a preliminary result, due to Bramble and Hilbert [1970]. It is a very useful tool for getting error estimates, not only in numerical integration, but also in interpolation theory, where it provides an alternate approach to that based (as here) on Theorem 15.3. Theorem 28.1 (Bramble-ffilbert lemma). Let Q be a domain in W; let k^O be an integer, let pbea number satisfying 1 ^p< oo, and let I be a continuous linear form on the space Wk+1,P(Q) with the property that J(p) = 0 forallpePk(Q). B8.9) Then there exists a constant C(Q) such that \l(v)\^C(Q)\\l\\'k+i,P.a\v\k + 1,p,a forallveWk+l-"{Q), B8.10) where \\ • ||i + i,p,n is the norm in the dual space of Wk+1'P(Q). Proof. Let v be any function in the space Wk + l'p(Q). Since by assumption, l(v) = l(v + p) for all pePk{Q), we may write |/(»)| = |/(»+p)|^||/||i+i.pl0||c + p||in.i,p,o forallpePfc@), and thus №)!<ll'IIUi,P,n inf ||» + p||it+i.p.o. pePk(Q) The conclusion follows by Theorem 14.1. □ In the sequel, we shall often use the following result: Let the functions q> e Wm'q{Q), and we (f",B(fl) be given. Then the function <pw belongs to the space Wm'4(Q), and m B8.11) j = o for some constant С solely dependent upon the integers m and n; thus С is in particular independent of the set Q. To prove this, we use the formula m dx{q>w) = £ £ d^wd^'q) for all a with |a| = m, J"=0 |/J|=7 0 + 0'=<x
Section 28 Numerical integration 199 in conjunction with inequalities of the form л л 0,q,O- 1=1 0,?,fi Л=1 From now on in this chapter, we essentially follow Raviart [1972] and Ciarlet and Raviart [1972c]. Theorem 28.2. Assume that, for some integer /c> 1, P = Pk(f), Ё(ф) = 0 /ога11феР2к-2(Т). Then there exists a constant С independent of Te3~h and h such that \ET(adip'djp)\*;ChkT\\a\\Ko0:T\\dip'\\k-UT\djp\0!T ^CbrNkoo.rllp'Ikrbkr for all aeWk^{T),pePk{T),p'ePk(T). B8.12) B8.13) B8.14) Proof. We shall get an error estimate for the expression ET(avw) with a e Wk- °°(Г), vePb-^T), wePk-!(T). From B5.13), we infer that ET(avw) = {detBT)E{avw), B8.15) with aeW^it), vePk-i{T), w£Pt_i(f). For any wePbl(f) and any фе W^CT), we have (\Ук'х(Т)^^0{Т) since k&l) \Ё{фЩ = <pvv dx — Y, <*>i{<pw)(f>i) /=i where, here and subsequently, the letter С represents various constants solely dependent upon the reference finite element. Since |<p|o,a>,f < II <P II *,»,?. and since all norms are equivalent on the finite-dimensional space Pk-i(T), we deduce that \Ё{фЩ^С\\ф\\К(Х1,тЩо,т- Thus, for a given wePk-i(T), the linear form фе\¥к'х(Т)->Ё(ф\Ь) is continuous with norm <C|w|0,f on the one hand, and it vanishes over the space Pk-X{t) on the other hand, by assumption B8.13). Therefore, using the Bramble-Hilbert lemma (Theorem 28.1), there exists a constant С such that \ЩЩ^С\ф\к,п,тЩоЛ for all фе Wk^{t), wePk.x{t). Next, let ф = а€ with aeWk-x{t), uePt_i(f). Using B8.11) and taking into
200 P.G. Ciarlet Chapter IV account that |гЗ|к,ю.г = 0, we get \4>\k,o0,T = \av\k,x,,T k-1 k~l in the last inequality, we have again used the equivalence of norms over the finite-dimensional space P^-iCf). Therefore, we obtain \Ё(аЩ\^С< £ \a\k-laD,f\v\jA\w\0,T w-° } B8.16) forallde^^f), veP^^T), weP^^f). Then it suffices to use the inequalities (cf. Theorems 15.1 and 15.2) \u\k-j.n,t^ChkT-J\a\k.Jie)<T, 0^j^k-l, \v\j,t^ChJT{det Вт)-1'2^^, (Kj^fc-1, |wlo,r<C(detBr)-1/2|w|0,T, in conjunction with relations B8.15) and B8.16). We obtain in this fashion: \ET(avw)\^ChU £ N)c-j;co,rNj,rHw|o,r ^ChkT\\a\\k,XiT\\v\\k-UT\w\0iT, for all ae W-^T), veP^^T), wePk-x(T), and we conclude the proof by replacing v by btp' and w by djp in the last inequality. Q Remark 28.2. Let us indicate why a direct application of the Bramble-Hilbert lemma to the quadrature error functionals ET( •) does not yield the proper estimate. Let us assume that, for some integer /^0, £(ф) = 0 for all феРг(Т), and let r e [1, oo] be such that the inclusion W'+Ut(f )<+<g°(T) holds; hence we have \Ё(Ф)\^С\ф\0^т^С\\ф\\1+иг,т forall<peW"+14f), and thus assumption B8.13), together with the Bramble-Hilbert lemma, implies that \Ё{ф)\^С\ф\1+1>ГшТ forall(p6W!+1'r(f). Let us then replace ф by the product avw, with a sufficiently smooth function a, vePk-1(T),wePk-1(t). Using inequalities of the form B8.11) and the equivalence of norms over the space Р^-хСТ), we automatically get all the seminorms \w\j,t> 0^/'<min{/ +1, k— 1}, in the right-hand side of the final inequality, whereas only the seminorm |w|o,r should appear.
Section 28 Numerical integration 201 The reader should notice that the ideas involved in the proof of the previous theorem are quite reminiscent of those involved in the proof of Theorem 15.3. In both cases, the central idea is to apply the fundamental result of Theorem 14.1 (in the disguised form of the Bramble-Hilbert lemma in the present case) over the reference finite element and then to use the standard inequalities to go from T to T, and back. The same analogies also hold for our next result. Theorem 28.3. Assume that, for some integer /c> 1, P=Pk(t), B8.17) Ё(ф) = 0 forallcpeP2k-2(T), B8.18) and let qe{\, со] be a number that satisfies the inequality k~(n/q)>0. B8.19) Then there exists a constant С independent of Te^l and h such that \ЕтШ^Ыкт{ты5(Т)У'2-^М\\к^т\\р\\ит for all feWk-\T), pePk(T). B8.20) Proof. For any /e WKq(T) and any pePk(T), we have ET(fp) = (dctBT)E(fp), B8.21) with feWk'q{T),pePk(f). Let us write E(fp) = £(fnp) + £(f(p~np)), B8.22) where 77 is the orthogonal projection in the space L2(T) onto the subspace Pi{T). (i) Let us first estimate Ё(/Йр). For all ^e^'ff), we have l^)l«Wlo.».r<Wl|t.,.r, since inequality B8.19) implies that the inclusion Wk'q{T)z+<$°(T) holds, and, in addition, Ё(ф)=0 for all феРъ-^Т), by virtue of assumption B8.18) (therefore, this assumption is not fully used at this stage, unless /c= 1). Using the Bramble-Hilbert lemma, we obtain \Ё(Ф)\^С\ф\к,9,т forall06H*<(f). In particular, let \ji=ffip with /e Wk-q{f),pePk(T). Using inequality B8.11), we find: \fUp\t,,,T<C{\f\k,qMfip\o,«.T+\f\k-i.,,t\Up\i.«>.t}, since all seminorms |i?p|/,a,,t vanish for 1^2 (fipeP^f)). Using the equivalence of norms over the finite-dimensional space Py(t), we get \fnp\k,q,T<c{\f\ktq,r\np\o,T+\f\k-uq,T\iip\uT}-
202 P.G. Ciarlet Chapter IV Further we have \fip\o.T^\P\o,T> since Я is a projection operator, and \fip\i,t^\P-fip\i,T + \P\i,T- Since the mapping Я leaves the space P0{t) invariant, there exists (cf. Theorem 15.3) a constant С such that \P~np\ut^c\p\liT. Thus, upon combining all our previous inequalities, we have found a constant С such that \E(fnp)\^C{\f\Kq,t\p\o,T + \f\k-i,qMP\uT} for all jWk-«(f), pePt(T). { ' (ii) Let us next estimate E(f(p — np)). We observe that if k~l, the difference (p —Яр) vanishes and therefore, we may henceforth assume that /c^2. This being the case, there exists a number pe [1, + oo] such that the inclusions Wk«(t)c>Wk-'i"'>(t)i*<g%t) hold. To see this, consider first the case where 1 < q < n, and define a number p by letting 1/p-l/q — l/n, so that the inclusion Wl'q{T)<^,lS{T) (and consequently the inclusion Wk-q(T)<+ Wk-Up{t)) holds. Then the inclusion Wk'u"{t)c»<Jf0(f) also holds because k-l-(n/p) = k-(n/q)>Q by B8.19). Consider next the case where n^q. Then either n<q and the inclusion Wl,q(T)<^Lp(T) holds for all pe[l, oo], or n=q and the same inclusion holds for all (finite) p 3s 1, so that in both cases the inclusion Wk-\tК Wk~ Up{T) holds for all p ^ 1. Since in this part (ii) we assume к ^ 2, it suffices to choose p large enough so that k-l-(n/p)>0 and then the inclusion W*-1-p(f)c>«,0(f) holds. Using now familiar arguments, we eventually find that |£(/(p-^P))l<C|/(p-^)lo.».t<^l/lo.».t|p-rfplo.«,,r ^C||/!l*-i,,,flp-tfplo,oo,f for all /еГ-'Щ pePk(t), Thus for a given pePk(T), the linear form /6W*-i.*(f )->£(/(£-#>)) is continuous, with norm ^С\р—Пр\0<ао^, and it vanishes over the space Pk-2(f) (notice that, by contrast with step (i), the "full" assumption B8.18) is used here). Another application of the Bramble-Hilbert lemma shows that \E{f{p-rtp))\^C\f\k-Up,T\p-np\0,T toTaHfeWk-1-"{t), pePk(T).
Section 28 Numerical integration 203 Since the operator Я leaves the space P0(T) invariant, we have, again by Theorem 15.3, |p-tfp|o,f<£|Pkt- We also have \g\o,P,t^C{\g\0,q,f + \g\Uq,t} for all ge Wl-\t), since the inclusion WUq(T)c+IJ{T) holds, and thus, l/k-i.„.r<£{lA-1.,,r + l/k,.r} for all /e Wk-'(t). Combining all our previous inequalities, we obtain: \E(f{p-np))\^C{\f\^Uq,T+\flktq,T}\p\ut for all /e иг*'«(?), pePk(T). ( ' ' (iii) The proof is completed by combining inequalities B8.21), B8.22), B8.23), B8.24), and \f\k-j.,.T<ChkT-1{<tetBTr1"\f\k-u.T, J=0,1, \p\j,T<Chir(detBT)-ll2\p\j,T, ; = 0,1. П Remark 28.3. Several comments are in order about this proof. (i) There always exists a number q that satisfies inequality B8.19). In particular, the choice q = oo is possible in all cases. (ii) As in Theorem 28.2, a direct application of the Bramble-Hilbert lemma yields unwanted norms in the right-hand side of the final inequality, which should be of the form \ET(fp)\ ^ ••• ||p]| lyT (cf. Remark 28.2). (iii) Why did we have to introduce the projection Я? Arguing otherwise as in part (ii) of the proof, we would find either \E(fp)\ <C|/[k_ ltPtf |pj0i t, or \E(fp)\ < ^l/U- i,p,f IIPII i,f: In both cases, there would be a loss of one in the exponent of hT. (iv) Since in both steps (i) and (ii) of the proof, only the invariance of the space P0(f) through the projection operator is used, why is it not enough to consider the orthogonal projection Я0 from the space L2(T) onto the subspace P0(fO It is true that, if к > 2, the whole argument holds with Я0 instead of Я. If fc= 1 however, part (i) of the proof yields the inequality \Ё[/П0р)\ =*£!/|*,4,? 1р!о,г, which is perfectly admissible; but then part (ii) of the proof is needed, since it is necessary to estimate the quantity Ё(/{р—П0р)) for peP^t). It is however impossible to find a space W°-P{T) = LP(T) that would be contained in the space <ё°{Т). Thus it is simply to avoid two distinct proofs (one with Я if k= 1, another one with Я0 if к >2) that we have used the single mapping Я. (v) Why is it necessary to introduce the intermediate space Wk~1,p(T)l For all p e Pk(T), the function (p—lip) is also a polynomial of degree < k. Since, on the other hand, the quadrature scheme is exact for polynomials of degree <Bfc —2), the application of the Bramble-Hilbert lemma to the linear form f-*E{f{p — Up)) necessitates that the function / be taken in a Sobolev space that involves derivatives up to and including the order (& — 1), and no more.
204 P.G. Ciarlet Chapter IV For studies of numerical integration along the lines developed here, see also Mansfield [1971,1972a]). In Arcangeli and Gout [1976] and Meinguet [1975], the constants appearing in the quadrature error estimates are evaluated. 29. Estimate of the error ||« — и^Ц^д Combining the previous theorems, we obtain the main result of this chapter, which is due to Raviart [1972] and Ciarlet and Raviart [1972c]. Theorem 29.1. In addition to (HI), (H2) and (H3), assume that there exists an integer /c>l such that the following relations are satisfied: P = Pk{T), B9.1) Ык+\Т)с>ЩТ), B9.2) where s is the maximal order of partial derivatives occurring in the definition of the set 1, that the weights of the quadrature schemes are > 0, and that Ё(ф) = 0 forall(j>6P2k~2(T). B9.3) If the solution ueHo{Q) of the variational problem corresponding to the data B5.1) belongs to the space Hk + Щ, if atj e Wk~ °°(й), 1 s$ i, j ^ n, and if fe WKq{Q) for some number q^2 with к > n/q, there exists a constant С independent of h such that ll«-uJ|li0<CA*||u|it, + liD+ £ Цвц1к».о11и|1*+1.п+11Я1м.п[- B9.4) Proof. The assumptions B9.1) and B9.2) imply that (Theorem 17.1) \\u-nhu\\un^Chk\u\k+un, where, here and subsequently, С stands for a constant independent of h. Using B8.5), Theorem 28.2, and the Cauchy-Schwarz inequality, we obtain, for any wheVh, \a(Ilhu,wh)-ah(nhu,wh)\ < I t !ЕгК-9г(П,,и|г)9>,,|г))| n ^C £ h\ £ ||ayИ*,„.гII^a"Ht.rfWfcl 1>r ^chk( £ ikii^oV z \\nhu\\iTVl2\wh\1M.
Section 29 Numerical integration 205 By Theorem 17.1 we have S ЦЯлМ||,2,г}1/2^11и,я + { £ \\и~Лни\Цт}112 <1|и|и,о + С/1|м|А + 1,я<С||н|и+1,05 and thus, \a{Vk,wh)-ah(vk,wh)\] inf <||«-t;J|1>n+ sup i>heFh I w„eVh II wfc lll.fi .,, „ ,, , |a(nfc«,wA)-a/l(n»«,vi'i)| < и-Ялм lin+ sup — WheVh II W Jl.fi <СЙЧ|м|л + 1,л+ Ё Hflylkco.nlMlft + l,fl I i,j=l Combining likewise B8.6) and Theorem 28.3, we obtain |/(wk)-«w»)|< I |£r(M)| <C £ ^{measCD^-^H/IU.e.rllWfclli.r ^ Chk{meas{Q)}1/2 /? || /1| M,„ || w„ || lifl. In the last inequality, we have used the inequality r -ji/» г -ji/^r -)i/y ЦвгМгК ZW ]ЦЬгГ {Ikrl4 , г (.г J (. r J (. t J valid for any numbers a^l, /?^ 1, y> 1 that satisfy 111 a jS 7 Here, l/a=i—1/я, /? = <?, у = 2 (this is why the assumption <j>2 was needed). Consequently, we obtain sup ■|/G)"ft(w>)l<Cfc*{mea.@)}^-^||/lk<,o. WheKh II wj|i,fi To complete the proof, it remains to use the abstract error estimate of Theorem 26.1; we may indeed apply this result since, by virtue of assumptions B9.1) and B9.3), the approximate bilinear forms are uniformly ^-elliptic, by Theorem 27.1. □ Remark 29.1. When P = Pk(t), the condition that the quadrature scheme be exact for the space P2k-2(T) has a simple interpretation: It means that all integrals \TaifiiUhftjVh&x are exactly computed when all the functions a^ are constant. To
206 P.G. Ciarkt Chapter IV see this, notice that diP'djpdx = (det Вт){Ъ,р'У{д}р}" dx for all p',pe PT, T f with det BT = constant, {Э,р'}"еР*-1(?'). {djpyeP^dt). Let us now illustrate the error estimate of Theorem 29.1 by some applications: - If we use linear n-simplices, we still get \\u — uh\\ltQ = 0(h) provided we use a quadrature scheme exact for constant functions, such as that of B5.15). - If we use quadratic triangles, we still get ||u—uh\\ i,n = 0(h2) provided we use a quadrature scheme exact for polynomials of degree <2, such as that of B5.18). - If we use cubic triangles, it is necessary to use a quadrature scheme exact for polynomials of degree ^4, in order to preserve the error estimate \\u—uj|i Q = 0(h3), etc. Let us next briefly mention various possible extensions of Theorem 29.1. For example, one might wish to analyze the effect of numerical integration for the homogeneous Neumann problem corresponding to the following data: V=H1(Q), a(u, v) = j Q < Yj dijdiudjv + buvydx, B9.5) /(») = fvdx, where the functions а^еЬ°°@) and fel!{Q) satisfy the same assumptions as in B5.1), the function b is defined everywhere over the set Q and there exists b0 such that b0>Q, b(x)^bo>0 for ailxeQ. Thus the discrete problem corresponds to the approximate bilinear form l n M"i,,vh)= X £ (u,,r £ (ау8,-и*Э;РЛ)(Ь,>г) Tefh 1=1 i,j=i + 1 I о»«.г(Ьи»»»КЬ«.т). B9-7) ТеГн 1=1 With the same assumptions as in Theorem 27.1, one can then show that there exists a constant a>0 such that a II vh || i,Q<a„(vh, vh) for all vhe Vh and all h,
Section 29 Numerical integration 207 i.e., that the approximate bilinear forms are uniformly ^-elliptic. Assume next that, for some integer k^ 1, Ё(ф)=0 ior all феР2к-2(Т). Then, the same arguments as in the proof of Theorem 28.2 imply that there exists a constant С independent of TeiFh and of h such that \Ет(ар'р)\^Сккт\\а\\к,ао,т\\р'\\к,тЫит forallaeH^r), реРк(Т), р'еРк(Т). From these results the analogue of Theorem 29.1 then easily follows. A second extension due to Ciarlet and Raviart (unpublished) consists in studying the case where the space P satisfies the inclusions Pk(T)czPcpkiT). B9.8) instead of the equality P = Pk(T) assumed thus far. We first observe that the question of uniform Fft-ellipticity is already settled by Theorem 27.1. Theorems 28.2 and 28.3 can then be extended to this case if the quadrature scheme is exact for the space Pk+k'-2(T); these results then imply that the analogue of Theorem 29.1 holds if all the weights are >0, the union {Jf=i{Et} containsaPt_1(f)-unisolvent subset and the quadrature scheme is exact for the space Pk+k'-z(T)- This shows in particular that reduced cubic triangles may be used in conjunction with the quadrature scheme of B5.19). Finally, consider the case where the space P satisfies the inclusions Pk(T)czPciQk(f), B9.9) as is the case for rectangular finite elements. Assuming again the positivity of the weights, one can show that the approximate bilinear forms are uniformly K^-elliptic if the union (Jf=1{£j contains a Qk(T)n P„fc_1('f)-unisolvent subset; the analogues of Theorem 28.2 and 28.3 likewise hold if the quadrature scheme is exact for the space Qzk-iiT). These results then imply that the analogue of Theorem 29.1 holds if all the weights are >0, if the union U^=i(^'} contains a Qk{T)r\Pnk_ jCfj-unisolvent subset, and if the quadrature scheme is exact for the space Q2k-i{T) (an example of such a scheme has been given in B5.22)). As a consequence, and contrary to the case where P = Pk(t) (cf. Remark 29.1), it is no longer necessary to exactly compute the integrals §TaijdiUhdjVkdx when the coefficients atj are constant functions. This last extension is again due to Ciarlet and Raviart. Using the abstract error estimate mentioned in Remark 26.1, Ciarlet and Raviart have also established that, if the assumptions of Theorems 19.2 and 29.1 simultaneously hold, then |M-w„|o,fi = 0(/i'c+1)
208 P.G. Ciarlet Chapter IV if the quadrature scheme is exact for the space Р2к-г{Т) if fc>2, or if the quadrature scheme is exact for the space Р1(Г) if fc= 1. An interesting extension of Theorem 29.1, based on appropriate discrete Poincare-Friedrichs inequalities (cf. ZeniSek [1981a]), has been given by ZenISek [1987], who has established convergence under the "minimal" assumption that the solution и is in the space H1(Q). Quadrature schemes of order lower than those indicated by the theory are often employed in practice, in order to reduce computational costs: This procedure of under integration sometimes leads to various numerical instabilities; in particular, hourglass instabilities often develop, in that "spurious oscillations" are superposed on an otherwise smooth solution. This interesting phenomenon is described and studied in Belytschko and Ong [1984], Jacquotte and Oden [1984], Jacquotte [1985], Jacquotte, Oden and Becker [1986], Кон and Kikuchi [1987], Wissmann, Becker and Moller [1987]. For other references concerning the effect of numerical integration, see BabuSka and Aziz [1972, Chapter 9], Fix [1972a, 1972b], Herbold [1968] where this problem was studied for the first time, Herbold, Schultz and Varga [1969], Herbold and Varga [1972], Oden and Reddy [1976a, Section 8.8], Schultz [1972], Strang and Fix [1973, Section 4.3].
Chapter V Nonconforming Finite Element Methods for Second-Order Problems Introduction We study in this chapter a first instance of finite element methods for which the spaces Vh are not contained in the space V. This violation of the inclusion VhcV results of the use of finite elements that are not of class <^°, i.e., that are not continuous across adjacent finite elements, so that the inclusion Vh с Hl(Q) is not satisfied (Theorem 30.1). The terminology "nonconforming finite element method" is specifically reserved for this type of "nonconformity" (likewise, nonconforming methods for fourth-order problems use finite elements that are not of class У?1; cf. Section 49). For definiteness, we assume throughout this chapter that we are solving a homogeneous Dirichlet problem posed over a polygonal domain Q. Then the discrete problem consists in finding a function uh e Vh such that, for all vhe Vk,ah(uh,vh) — l(vh), where the approximate bilinear form aA(v) is defined by a*(v)= £ {•••}dx, ТеГъ . T the integrand {• • •} being the same as in the bilinear form that is used in the definition of the original problem. The linear form /(•) need not be approximated since the inclusion Vh с L2(Q) holds. Assuming that the mapping is a norm over the space Vh, we prove an abstract error estimate, the second Strang lemma (Theorem 31.1), where a consistency error term is added to the expected term infVheVh\\u — vh \\h. Just as in the case of numerical integration (cf. Section 26), this result holds under the assumption that the approximate bilinear forms are uniformly Vh-elliptic, in the sense that there exists a constant a > 0 independent of h such that ah(vh>vh)>u\\vh Ил for all vheVh. We then proceed to describe in Section 32 a three-dimensional "nonconforming" 209
210 P.G. Ciarlet Chapter V finite element, known as Wilson's brick, which has gained some popularity among engineers for solving the boundary value problem of linearized elasticity. Apart from being nonconforming, this finite element presents the added theoretical interest that some of its degrees of freedom are of a form not yet encountered. This is why we need to adapt to this finite element the standard interpolation error analysis (Theorem 32.1). Next, using a "bilinear lemma" which extends the Bramble-Hilbert lemma to bilinear forms (Theorem 33.1), we analyze the consistency error (Theorem 34.1); we show in this fashion that ll"-«**ll* = J E l«-«**li.rf =0(h), I TeSTh ) if the solution и is in the space H2(Q). We also establish the connection between the convergence of such nonconforming finite element methods and the patch test of Irons. 30. Nonconforming methods Assume, as in Chapter IV, that we are solving the second-order boundary value problem that corresponds to the following data: V = Hh(Q), Г " a(u,v)= Yj ciifiiUdjVdx, C0.1) a l(v) - fa dx. Q At this descriptive stage, the only assumptions that we need to record are that ay e I," (G), l^ij^n, feL2(Q), C0.2) and that the set Q is a polygonal subset of W. As in the preceding chapter, this last assumption insures that the set Q can be exactly covered with triangulations. Assume next that, with any triangulation Q = []тегъТ of the set Q, we associate a finite element space Xh whose generic finite element is not of class Я>°. Then the space Xh will not be contained in the space H1(Q), as we now show (note that the next theorem is the converse of Theorem 5.1). Theorem 30.1. Assume that the inclusions Рт с <^°(Т) for all Te^~h and Xh a Hi(Q) hold. Then the inclusion holds.
Section 30 Nonconforming finite element methods 211 Proof. Assume that the conclusion is false. Then there exists a function veXh, there exist two adjacent finite elements T(l) and TB), and there exists a nonempty open set О с ГA)и ГB) such that (for instance) (»lr(l) - »ll42)) > ° а1оПё Г'П0> C0-3) where T is the face common to T{\) and TB). Let q>e£#(Q), <p > 0 with supp <p с 0. Using Green's formula B.4), we obtain: 6itxpdx= X divcpdx J A=l,2 . q та) = - E vd,<pdx+ £ A=l,2 J Л=1,: "Irw'pVi.T.^dy ,2 ГЦ) 8Г(А) (f|r(i)-^ vdi(pdx+ (f|r(i)-^lrB))<pvi,r(i)dy, я т and thus we reach a contradiction since the integral along T should be >0 by C0.3). □ More specifically, let us assume that the inclusions Рт с я1 (Г) for all Te^h, C0.4) hold; hence at least the inclusion Xh с L2(Q) C0.5) holds. We then define a subspace X0h of Xh that takes as well as possible into account the boundary condition v = 0 along the boundary Г of Q. For example, if the generic finite element is a Lagrange element, all degrees of freedom are set equal to zero at the boundary nodes. But, again because the finite element is not of class Я>° (cf. Remark 12.2), the functions in the space X0h will in general vanish only at the boundary nodes. In order to define a discrete problem over the space Vh = X0h, we observe that, if the linear form / is still defined over the space Vh by virtue of the inclusion C0.5), this is not the case of the bilinear form a( •, •). To obviate this difficulty, we define, in view of C0.1) and C0.4), the approximate bilinear form ah(Uh,vh)= £ £ aifiiUhb}vh dx, C0.6) and the discrete problem consists in finding a function uh e Vh such that ak(uh,vh) = l(vh) for all vheVh. C0.7) We shall call any such finite element approximation of a second-order boundary
212 P.G. Ciarlet Chapter V value problem a nonconforming finite element method. By extension, we shall call any generic finite element used in such a method a nonconforming finite element. 31. Abstract error estimate and the second Strang lemma To begin with, we must equip the space Vh with a norm. Since | • |liG is a norm over the space V— Hl0{Q), a natural candidate is the mapping ^->K!L = { S klf.rV'2 Ci.i) which is a priori only a seminorm over the space Vh. Thus, given a specific nonconforming finite element, our first task is to check that the mapping of C1.1) is indeed a norm on the space Vh. After this, we shall be interested in finding sufficient conditions that guarantee that the approximate bilinear forms of C0.6) are uniformly Vh-elliptic, in the sense that there exists a constant a such that a>0> C1.2) «IIvh\\h ^ah(vh,vh) for all vheVh and all h. This condition, which clearly implies the existence and uniqueness of the solution of the discrete problem, is also a crucial assumption for obtaining the abstract error estimate (Theorem 31.1). It follows from the definition C1.1) of the norm || • \\h that the approximate bilinear forms C0.6) are uniformly Vh-elliptic if the functions a^ satisfy the usual ellipticity condition, i.e., if there exists a constant /? such that j3>0, £ay(x)^/?£tf C1.3) for all (£j,£2,..., £n)eM" and for almost all xeQ. From now on, we shall consider that the domain of definition of both the approximate bilinear form of C0.7) and the seminorm of C1.1) is the space (Vh + V). This being the case, notice that ak(v, v) = a(v, v), \\v\\h = \v\un forallueK C1.4) Also, the first assumptions C0.2) imply that there exists a constant M independent of the space Vh such that \ah(u,v)\^M\\u\\h\\v\\h for all u,ve(Vh + V). C1.5) The next result is due to Strang [1972b]. Theorem 31.1 (Second Strang lemma). Consider a family of discrete problems whose associated approximate bilinear forms are uniformly V^-elliptic.
Section 31 Nonconforming finite element methods 213 Then there exists a constant С independent of the subspace Vh such that \\u-uh\\h<c( inf Uu-vhh+ sup ^>-^>l\ Cl.6) \vheVh whsVh \\wh\\h / Proof. Let vh be an arbitrary element in the space Vh. Using the uniform JVellipticity and the continuity of the bilinear forms ah (cf. C1.2) and C1.5)) and the definition C0.7) of the discrete problem, we may write a IK - vh\\l < ah(uh - vh,uh - vh) = ah(u - vh, uh - vh) + {l(uh - vh) - ah{u, uh - vh)}, from which we deduce «\\uh-vk\\h<M\\u-vJh + — -— Wuh-Vh\\h \l(wh)-ah(u,wh)\ KM\\u-vh\\h+ sup jr—■т ■ WheVh II "'ft lift Inequality C1.6) then follows from this inequality and the triangular inequality II и - «»||k < II и - vj„ + || uh - v„ \\h. П Note that the error estimate C1.6) indeed generalizes the error estimate established in Cea's lemma (Theorem 13.1) for conforming methods, since the difference {/(wA) — ah(u, wh)} vanishes for all wh e Vh when the space Vh is contained in the space V. Remark 31.1. An abstract error estimate can also be established that eventually yields an estimate of the error \u — uh\0_n: Let Я be a Hilbert space such that V=H, Vz+H and VhczH for all h, and let, for all u,ve(Vh+V), Dh(u,v) = ah(u,v)- f{v). Finally, assume that the bilinear form is symmetric. Then the abstract error estimate of the Aubin-Nitsche lemma (Theorem 19.1) is replaced in the present situation by the following inequality: |u-MA|^sup— inf {M||w-hJ| \\q>g-<Ph\\ + \Dh(u,(pg-(ph)\ + \Dh(cpg,u-uk)\}, geH \g\ VheVh where |• | denotes the norm in H, and for each geH, q>ge V denotes the unique solution of the variational problem a(v, q>g) = (g, v) for all veV, where (%•) denotes the inner product in H. This abstract error estimate is found in Nitsche [1974] and Lascaux and Lesaint [1975].
214 P.O. Ciarlet Chapter V 32. An example of a nonconforming finite element: Wilson's brick We now describe an example of a nonconforming finite element, named Wilson's brick after Wilson, Taylor, Doherty and Ghabussi [1973]. It is used for approximating the solution of linearized elasticity problems posed over three- dimensional rectangular domains. Remark 32.1. A two-dimensional analog of this finite element, known as Wilson's rectangle, is used for approximating problems in "plane elasticity". This finite element is studied in Lesaint [1976]. Wilson's brick is an example of a rectangular finite element in !R3: The set Г is a 3-rectangle, whose vertices will be denoted at, l^i<8 (Fig. 32.1). Wilson's brick, n = 3 Pt=Qi(T)® V{xj;l^j<l}, dim(Pr)=ll Ir = ip(<ii): 1«(<8; L— е„рс!х:14У«£3> Г *,Мз J " 3 т Fig. 32.1. The space PT is the space P2(T) to which are added linear combinations of the function (x1x2x3); equivalently, the space PT is the space Qi(T) to which have been added linear combinations of the three functions x), l<_/<3. We shall therefore record this definition by writing Fr = P2(r)©V{x1x2x3} = e1(r)©V{x?;l<j<3}. C2.1) Notice that the inclusions F2(T)c:Fr, Q1(T)czPT C2.2)
Section 32 Nonconforming finite element methods 215 hold and that dim(Pr)=ll. C2.3) We next show that the values р[а{), К i< 8, at the vertices, together with the values of the (constant) second derivatives дпр, 1 <j < 3,form a Pj-unisolvent set. To see this, it suffices to check the validity of the following identity: For all functions pePf, with f=[_l, +l]3, one has p=Ul +x1)(l+x1){(l+x3)p(a1)+{l -x3)p(a5)} + Ы1-х1)A + х2){A+х3)р(а2) + A-х3)Р{&б)} + i(l-x1)(l-x2){(l+x3)p(a3)+(l-x3)p(fl7)} C2.4) + Ш+х1)A-х2){A+х3)р(а^) + A-хъ)р(а8)} + i(x?-l)S11p+i(xi-l)822p+i(x§-l)a33p. Therefore if we denote by с = £Xf= j at the center of the finite element T, we are naturally tempted to define the following set of degrees of freedom: ST = {p(a;): Ui<8; Ъир(с): 1 ^3}, C2.5) whose degrees of freedom are all in a familiar form. Of course, nothing obliges us to attach the last three degrees of freedom to the particular point с (except perhaps an aesthetical reason of symmetry), since the second derivatives djjp, 1 ^y < 3, are constant for any function pe PT. Keeping this last property in mind, we may also choose for degrees of freedom the averages §тдпрйх, 1г%у'<3, and we shall indeed show that this choice is more appropriate for our subsequent purposes. For the time being, we observe that such degrees of freedom are of a new type, although they are still linear forms over the space #2(T) as indeed they should be, to comply with the general definition of a degree of freedom given in Section 10. Notice that since any function pePT satisfies «c)=8dbJ8^dx' lw' C16) т where 2hj, 1 <j < 3, denote the lengths of the sides as indicated in Fig. 32.1, the two types of degrees of freedom are interchangeable over the space PT. However, relations C2.6) do not hold in general for arbitrary functions in the space ^2(T); in other words we obtain in this fashion two different finite elements (cf. Remark 32.2 below; also, this is an instance of a phenomenon that was mentioned in Section 10). Let us then equip Wilson's brick with degrees of freedom of the form C2.6). Our next objective is to extend the definition of affine-equivalence so that Wilson's bricks can be imbedded in an affine family, the reference finite element being in this case the hypercube f = [-1, +1]3. To do this, it suffices, according to equations A1.12), to write the degrees of freedom in such a way that if we have the identity 8 3 P= TP(ui)Pi+ S Ф)Ш} forallpePf, C2.7) ■=i j=i
216 P.G. Ciarlet Chapter V then we also have the identity 8 3 P= I PiadPi+1 ФАР)Ъ for all? 6 Рг, C2.8) where the basis functions p,- and p(, respectively 4; and qjt are in the usual correspondence A1.9), and ^and <f>j, l</<3, denote the degrees of freedom of the form JT6^pdx, attached to the sets f and Г, respectively. Using C2.4), we easily deduce that any function p in the space PT satisfies the following identity, where c,-, 1 ^ i < 3, denote the coordinates of the point c: T After inspecting C2.8) and C2.9), we find that the proper choices for <j>j and q} are: 2 1 / !x, — с 2 8,,pdx, q^^-Jj-l} _i), K;<3. C2.10) These choices insure that the following relations hold: Pi(flt) = <5at, l<i,fc<8, fc(a,) = 0, Ui^8, l</<3, C2.11) ^(Pi) = 0, Ui<8, 1^3, ФА<1}) = &ф Kj,/<3. Consequently, we shall henceforth consider that t/ie set of degrees of freedom of Wilson's brick is 1г = |р(аг): Ui^e;^-^- Ujpdx: 1^3 j. C2.12) г Notice that we could drop the multiplicative factors hj/(hlh2h3)m the last degrees of freedom without changing the definition of the finite element. According to definition A0.6), the associated PT-interpolation operator Пт is such that, for any sufficiently smooth function v : T-»R, the function nTv belongs to the space PT and is uniquely determined by the conditions IITv(ai) = v{ai), ls£i<8, C2.13) ф;{Пть) = ф&), U;^3.
Section 32 Nonconforming finite element methods 217 Notice that the last three conditions can also be written as дл(Птр) dx = дп v dx, 1 <K 3. C2.14) T T By construction, the Pr-interpolation operator satisfies {nTv}~=nfv C2.15) for functions v and v in the usual correspondence t>-»r = £ • F- \ Also, by virtue of the first inclusion C2.2), we have Птр=р for all peP2(f). C2.16) Remark 32.2. According to definition A0.9), the finite elements {T,PT,ST) and (T, PT, IT) (cf. C2.5) and C2.12)) are not identical since the associated interpolation operators do not coincide over the space ^2(T) (we momentarily ignore that the domain of the interpolation operator corresponding to the set IT is wider; see below). We are now in a position to explain the definite advantage of choosing the forms ф} as degrees of freedom, rather than the point values дпр(с). On the one hand, the basic properties C2.15) and C2.16) of the interpolation operator are unaltered, but on the other hand, the interpolation operator Пт has a wider domain: Whereas in the first case, one is led to assume that the function v : T-* U is twice differentiable over T in order to define its Pr-interpolant, in the second case the Pr-interpolant is well defined for functions "only" in the space H2(T) (which is contained in the space <^°(T) for и = 3). This property will later avoid unnecessary restrictions on the smoothness of the solution и of our original problem (cf. Theorem 34.1). Although the larger Sobolev space over which the Pr-interpolant is defined is the space W2'P(T) for p>§, we shall consider for simplicity that dom ПТ = Н2(Т). C2.17) In the next theorem, we estimate the interpolation errors |u — UTv\m<T. The notations hT and pT represent the usual geometrical parameters (cf. A6.5)). Theorem 32.1. There exists a constant С such thatjor all Wilson's bricks (Г, PT, ZT\ |»-tfr»L.r<C-4>|,.r> (Kms$/, / = 2,3, for all veH\T). C2.18) Рт Proof. By an argument similar to that used in the proof of Theorem 16.1, it can be checked that the mapping Пт: H\T) сH2(T) = dom Пт-+Нт{Т) is continuous for O^m^l, 1=2 or 3. It then suffices to combine this fact with relations C2.15) and C2.16), and to apply Theorem 15.3. □
218 P.G. Ciarlet Chapter V Let Q be a rectangular domain in R3, so that it may be covered by triangulations &~h composed of 3-rectangles. We then let Xh denote the finite element space whose functions vk have the following properties: (i) For each Te$~h, the restrictions vk\T belong to the space PT defined in C2.1). (ii) Each function vk e Xh is defined by its values at all the vertices and by the averages bnvh\Tdx, U/^3, Te$-h. Since the basis functions qs given in C2.10) do not vanish on the boundary of Wilson's brick, this element is not of class Я>°; hence the space Xh is not contained in the space H1(Q), by Theorem 30.1. Continuity is however guaranteed at the vertices of the triangulations, since the functions qj vanish at all nodes of Wilson's brick (cf. C2.11)). Finally, we let Vh = X0h, where X0h denotes the space of all functions vh e Xh that vanish at the boundary nodes. For the same reasons as before, the functions in the space X0h do not vanish along the boundary Г, but they vanish at the boundary nodes. According to the discussion made in Section 31, we need first to verify that the mapping || • \\h defined in C1.1) is indeed a norm over the space Vk. Theorem 32.2. The mapping f*->ll»»IU = { I Ы1т\Ш C2.19) is a norm over the finite element space Vk constructed with Wilson's bricks, Proof. Let vk be a function in the space Vh that satisfies hl=\ I Krilr}1/2=o. Then each polynomial vk\T is a constant function. Hence djj(vh\T) = 0, l^y'^3, in each Те 9~к, on the one hand. On the other, the function ък.й^>Ш must be equal to the same constant over all Te3~k, since it is continuous at all the vertices and thus, it is identically zero since it vanishes at the boundary nodes. □ In order to simpify the exposition, we shall henceforth assume that the bilinear form of C0.1) is a{u,v)— Y, 8juS(i;dx, C2.20) J ; = i n i.e., the corresponding boundary value problem is a homogeneous Dirichlet problem for the operator —A. The extension of the subsequent analysis to more
Section 33 Nonconforming finite element methods 219 general bilinear forms, such as a(u, v) — \< Y, aijdiUdjV+buv>dx, « offers no particular difficulties (other than technical). With the choice C2.20), the uniform Vh-ellipticity of the approximate bilinear forms is a consequence of the identity \\Vh\\h=ah(vh,vh) for all vhe V„. C2.21) This being the case, we may apply the abstract error estimate of Theorem 31.1. The first term, MVheVJ\u — vh\\h, is easily taken care of: Assuming that we consider a family of discrete problems associated with a regular family of triangulations, and assuming that the solution и is in the space H2(Q), we deduce from Theorem 32.1 that inf ||H-ty»<J £ |и-Лги|?,г1 <СЛ|и|2>0. C2.22) vheVh (TeXh J Notice that the derivation of this interpolation error estimate makes a crucial use of the familiar implication (cf. A2.17)) ve dom Пк — H2(Q) and v\r = 0 => ПниеХ0Н, where Пн is the Xh -interpolation operator. Remark 32.3. Of course, we could assume that ueH3{Q), thus getting an 0(h2) estimate instead of C2.22). However the eventual gain is nil because the other term in the right-hand side of inequality C1.6) is of order h, whatever the additional smoothness of the solution may be. Besides, we recall that the weaker assumption и € H2(Q) is more realistic: One does not have a smoother solution in general on convex polygonal domains. 33. Consistency error estimate and the bilinear lemma The other term, Ыи,щ)-1(щ)\ SUP и—л , wheVh \\Wh\\h which appears in inequality C1.6), is a consistency error term due to the "nonconformity" of the method. Consequently, a sufficient condition for convergence is the consistency condition: lim sup l^b^Lo, Our next objective is thus to estimate this consistency error term, through a careful
220 P.C. Ciarlet Chapter V analysis of the difference Dh{u,wh) = ah{u,wh)-l(wh), w„eVh C3.1) Since — Au = f, we can write for any function wheVh, Г з Dh(u,wh)= £ X SiuSjWfcdx Te^i, i=l fw„ dx. T — X ) Z SiMSiWfcdx + zJuWfcj-dx, C3.2) TePhJ U=l J Г i.e., we have obtained one decomposition of the form Dh(u,wh)= £ &тЫт,Щ\А C3.3) TeSTh where, for each Те 2Th, the mapping DT(%') is a bilinear form defined over the space H2{T) x Pr. Ignoring for the time being that such a decomposition is not unique (we shall return to this crucial point later), let us assume that, for one decomposition of the form C3.3), we can show that there exists a constant С independent of T e STh and h such that \DT(v,p)\^ChT\v\2<T\p\lwT for all ve H2{T), pePT. C3.4) Then an application of Cauchy-Schwarz inequality yields \Dh(u,wh)\^Ch\u\2,n\\wh\\h> C3.5) and therefore we obtain \ah{u,wh)-l(wh)\ sup — ^Ch\u\2,Q, C3.6) wheVh \\wh\\h i.e., an estimate of the same order as that of C2.22). For proving estimates such as those of C3.4), the following result turns out to be useful. It plays with respect to bilinear forms the role played by the Bramble-Hilbert lemma (Theorem 28.1) with respect to linear forms. For this reason, we shall refer to this result, due to Ciarlet [1974a], as the "bilinear lemma". Theorem 33.1. Let Q be a domain in W, letk^OJ^Obe integers,letp,qe [1, +oo], let W be a space that satisfies the inclusions P,(fl)cIfcH"+1(fi), C3.7) and finally, let bbea continuous bilinear form over the space Wk+1'P(Q) x W(the space W is equipped with the norm ||*|li+i,«,o) that satisfies b(p,w) = 0 for allpePk(Q), weW, C3.8) b(v,q) = 0 for alive Wk+1'p{Q), qe P,{Q). C3.9)
Section 34 Nonconforming finite element methods 221 Then there exists a constant C{Q) such that |b(»,w)|^C(C)||b||Ni+1.Pl0K+ll,f0 forallveWk+1-*(Q), weW, C3.10) where \\b\\ denotes the norm of the bilinear form b in the space ^£2{WkJel'p{Q) x W; U). Proof. For each function we W, the linear form b(% w):ve Wk+1'p(Q)-*b(v, w) is continuous and it vanishes over the space Pk(Q), by C3.8). Thus, by the Bramble-Hilbert lemma, there exists a constant C^O) such that \b{v,w)\^C1(Q)\\b(;w)\\'k + i.p.a\v\k+l.P,a for all veWk + 1-"(Q). C3.11) Using C3.9),we may write b(v, w) = b(v, w+q) for all qeP^Q), and thus \Hv,w)\ = \b(v,w + q)\^\\b\\\\v\\k + 1,Pia\\w + q\\l + UqM. Therefore, by Theorem 14.1, \b(v,w)\^\\b\\\\v\\k+Up,n inf ||w + 9lli+i.,.o <C2(Q)\M ||»||*+llP>0|w|,+1>,^ for all ve Wk+1'"(Q), we W, Consequently, l|b(%w)lli+i.,.o= sup 'У'^1 ^Сг@)\\Ц\М,+1Лл, C3.12) veWk+l.P{B) llplllt + l,p,0 and inequality C3.10) follows from inequalities C3.11) and C3.12). □ 34. Estimate of the error {£r6*-J«—»*li,r}1/2 for Wilson's brick and the patch test We now prove the main result of this chapter. Theorem 34.1. Assume that the solution и is in the space H2(Q). Then, for any regular family of triangulations, there exists a constant С independent of h such that ll«-u*ll» = { I \u-uh\\X'\ch\u\2,Q. C4.1) Proof. The central idea of the proof is to apply the bilinear lemma to each term DT(u, wh) occurring in a decomposition of the expression Dh(u, wh) of the form C3.3). Some care has to be exercised, however: From C3.2), an obvious choice for the bilinear forms DT is (v,p)eH2(T)xPT-+\ -JX divdip + Avp\dx = т е ire 9Vi T denotes the normal derivative operator г set T. It is quickly realized, however, that there are not "enough" polynomial dv.rvpdy, т er where 9v_r denotes the normal derivative operator along the boundary 9 Г of the
222 P.G. Ciarlet Chapter V invariances at our disposal in such bilinear forms DT in order to eventually obtain estimates of the form C3.4). Fortunately, there is another possible decomposition of the form C3.3) which will yield the right estimates. The key idea is to obtain the desired additional "local" polynomial invariances from a "global" polynomial invariance, as we now show. Let Yh denote the finite element space whose generic finite element is the bilinear rectangle. In other words, for each Te$~h, the restrictions vh\T span the space Qi (T), and each function vhe Yh is defined by its values at all the vertices of the triangulation. Next, let Wh = Y0h denote the space of all functions vh e Yh that vanish at all the boundary nodes, so that the inclusion Whc<e0{Q)nHh{Q) holds. Notice also that the second inclusion of C2.2) implies that the inclusions YhcXk, Yoh = W„cX0h = Vh C4.2) hold. Finally, let Dh\H2(Q)xXh-*U denote from now on the function defined according to the second expression found in C3.2), viz., Dh(v,wh)=Y [iJ^QivdtWk + AvwXdx. C4.3) Ts.Th J U=l J T Then Dh(v,wh)=Q foi all veH2(Q), whe Wh. C4.4) For any function whe Xh, let Ahwh denote the unique function in the space Yh that takes the same values as wh at all the vertices of the triangulation. Then, for each Te$~h, Ahwh\T—AT(wh\T), where Лт denotes the corresponding <2i(T)-interpola- tion operator, and the function Ahwh belongs to the space Wh = Y0h if the function wh belongs to the space Vh-X0h. Using the definition C4.3), we deduce that Dh(v,wh) = Dh{v,wh-Ahwh) for all veH2{Q), whe Vh, C4.5) so that another possible decomposition of the difference Dh(',-) of C4.3) consists in writing Dh(v,wh)= X DT(v,wh) for all veH2(Q), wheVh, C4.6) ТеГн where the bilinear forms DT(-, •) are now given by (compare with C3.2)): DT{v,p)= \dVtTv(p-ATp)dy for all ve H2(T), pePT. C4.7) We observe that, by definition of the operator AT, we have DT{v,p) = 0 for alive H\T), peQ^T), C4.8) and thus we get a first polynomial invariance. To obtain a polynomial invariance with respect to the first argument v, assume
Section 34 Nonconforming finite element methods 223 that the function v belongs to the space Px (T). Then the expression DT(v, p) is a sum of three terms, each of which is, up to a constant multiplicative factor, the difference between integrals of the expression (p - Л тр) over opposite faces. Consider one such term, say (with the notation of Fig. 32.1): <5i= \(Р~Лтр)йх2йх3- Up-ATp)dx2dx3. C4.9) r, reusing the properties of the interpolation operator Лт, the identity C2.9), and the equations 8л(Ллр) = 0, К;'<3, we deduce that г Since the function ({(xi—Cx)/^}2 —1) vanishes along the faces T\ and Tj,and since the functions {{(x}—Cj)/hj}2 — 1), j=2,3, take on the same values at the points P i and P i (cf. Fig. 32.1), we conclude that dt =0. The other analogous terms vanish for the same reasons. Consequently, we obtain a second polynomial invariance: DT(v,p) = 0 for allveP^T), pePT- C4.10) Each expression DT(v, p) found in C4.6) is of the form з DT(v,p)= £ AlT(v,p), C4.11) where ^i,r(u.p)= Sit;(p-/lrp)dx2dx3- 81u(p-/lrp)dx2 dx3, C4.12) and the expressions A2T{v,p) and A3T(v,p) are analogously defined. Using the standard correspondences v-*v between the functions v: T-+M and v: T->R, we note that ^.r(»,P) = (*2M«iMi.r(M). etc. C4.13) Since each function Ahf, je {1,2,3}, satisfies the following polynomial in- variances: Aj,T(v,P)=0 foraUveH2(T), peP0(T), Ajs(v,p) = 0 for all veP^f), pePT, we deduce from the bilinear lemma (Theorem 33.1) that there exists a constant С such that \Alt(v,p)\^C\v\2tT\p\ltT for all veH2(T), pePT. C4.15) Using Theorem 15.1 and the regularity of the family of triangulations, we find that
224 P.G. Ciarlet Chapter V there exist constants С such that |j5|2,f^C||Br||2|detBrr1/2|t;|2,r^C^^2|t;|2,r, C4.16) Ipli.r^CHBrllldetBrr^lpli.r^Cfcf^lpli.r, C4.17) so that, upon combining C4.11), C4.13), C4.15), C4.16), and C4.17), we eventually find that there exists a constant С such that \DT(v,p)\^ChT\v\2tT\p\UT for all Те [Jhrh, veH2(T). This last inequality is of the desired form C3.4) and therefore the proof is complete. □ Remark 34.1. Loosely speaking, one may think of the space Wh introduced in the proof as representing the "conforming" part of the otherwise "nonconforming" space Vh (cf. C4.4)). In analyzing the consistency error, we have followed the method described in Ciarlet [1974a] for studying nonconforming methods: The main idea consists in obtaining two polynomial invariances in the functions DT(-, •), in such a way that an application of the bilinear lemma yields the desired consistency error estimates. For the specific application of this method to Wilson's brick, we have extended to the three-dimensional case the analysis that Lesaint [1976] has made for Wilson's rectangle. As already mentioned (cf. Remark 32.1), Lesaint has considered the use of this element for approximating problems of "plane elasticity", for which he was able to show the uniform ellipticity of the corresponding approximate bilinear forms. In this fashion, Lesaint obtains an 0(h) convergence in the norm || • ||й. Also, the idea of introducing the degrees of freedom jr8^pdx is due to Lesaint. More recently, Shi [1988] has established that the 0(h) convergence is optimal. Using the abstract error estimate mentioned in Remark 31,1, Lesaint [1976] has also shown that, if the solution и is in the space H2(Q), \u-uh\0,Q^Ch2\u\2,o- C4.18) It is worth pointing out that, by contrast with C4.14), the "full" available polynomial invariances are used in the derivation of the error estimate C4.18). The second polynomial invariance C4.10) is related to the patch test, a famous milestone in the history of the finite element method. It follows from C4.6) and C4.10) that Dh{p,w„) = 0 for all pe P^), whsVh. Hence if the function wh is a basis function w( e V, whose support is thus a patch 3PU i.e., a union of finite elements TeSTh, it follows that A,(p,w;) = 0 for all pe PX (<?*,■) C4.19) This is an instance of the celebrated Irons patch test, which Irons (cf. Bazekey, Cheung, Irons and Zienkiewicz [1965], Irons and Razzaque [1972a]) was the first
Section 34 Nonconforming finite element methods 225 to empirically recognize as a condition for getting convergence of a nonconforming finite element method. The theoretical importace of the patch test was immediately recognized in the pioneering work of Gilbert Strang on the mathematical analysis of nonconforming methods (cf. Strang [1972b], and Strang and Fix [1973, Section 4.2]). The patch test and, more generally, the assessment of nonconforming methods have received considerable attention in the engineering literature, as illustrated by the contributions of Wilson, Taylor, Doherty and Ghabussi [1973], Fraeijs de Veubeke [1974], Carey [1976], Taylor, Bersford and Wilson [1976], Oliveira [1977], Sander and Beckers [1977], Samuelsson [1979]. Stummel [1979, 1980a] has shown that the following generalized patch test is a necessary and sufficient condition for the convergence to 0 of the consistency error term arising in the approximation of a 2mth-order elliptic problem over Q с W by a nonconforming method: vheVhmd ||»Ь||<М for all Л => Km £ \ <pd'vhvUTdy = 0 C4.20) 6Г for any M>0, for all |«|^m—1, for all <pe${W), and for all i = l,2,...,rc, where V; x denotes as usual the rth component of the unit outer normal vector along ST. This generalized patch test is thus independent of the particular operator considered (in particular, nonconstant coefficients are allowed), it holds for operators of arbitrary orders, and, finally, it yields convergence (without orders, however). At the same time, Stummel [1980b] has questioned the validity of the "traditional" patch test, and since then, a controversy has developed about which patch test should be preferred! In this direction, see Robinson [1982], Irons and Loikkanen [1983], Shi [1984c, 1984d, 1987a, 1989], Razzaque [1986], Taylor, Simo, Zienkiewicz and Chan [1986]. Among the other studies of nonconforming methods, we mention Rachford and Wheeler [1974], Nitsche [1974], Cea [1976], Kang [1979], Shi [1985]. The nonconforming Crouzeix-Raviart linear triangle (cf. Fig. 12.2), proposed and analyzed by Crouzeix and Raviart [1973], has proved to be very useful for studying two-dimensional incompressible flows; this has prompted Fortin and Soulie [1983] and Fortin [1985] to propose related nonconforming quadratic triangles and tetrahedra. Estimates of the error |w —uAj0i00>i5 have been obtained by Gastaldi and Nochetto [1987]. A "quadrilateral version" of Wilson's brick, called the quadrilateral Wilson's element, has been proposed by Taylor, Bersford and Wilson [1976], and its convergence has been studied by Lesaint and ZlAmal [1980] and Shi [1984a]. Another nonconforming quadrilateral element, introduced by Sander and Beckers [1977], has been studied by Shi [1984b]. Finally, we mention that nonconforming finite elements may be also viewed as special cases of "primal-hybrid" finite elements. See Section 19 of the article by Roberts and Thomas.
Chapter VI Finite Element Methods for Second-Order Problems Posed over Curved Domains Introduction We studied in Chapter V a first violation of the inclusion Vh с V. Another violation of the inclusion Vh с V occurs in the approximation of a boundary value problem posed over a domain Q with a curved boundary Г (i.e., the set Q is no longer assumed to be polygonal). In this case, the set Q is usually approximated by two types of finite elements: The finite elements of the first type are straight, i.e., they have plane faces, and they are typically used "inside" Q. The finite elements of the second type have at least one "curved" face, and they are typically used so as to approximate "as well as possible" the boundary Г. In Sections 35 and 36, we describe one way of generating finite elements of the second type, the isoparametric finite elements, which are quite commonly used in actual computations. The key idea underlying their conception is the generalization of the notion of affine-equivalence: Let there be given a Lagrange finite element (T, P, {р(а{): 1 ^ i s£ JV}) in W and let F: x e T-*F(x) = (F*(x))"= x e W be a mapping such that FteP,l^i^n. Then the triple (T = F(t),P = {p=poF-';peP}, {р(а^(а;)): l^i^N}) is also a Lagrange finite element (Theorem 35.1), and two cases can be distinguished: (i) The mapping F is affine, i.e., FieP^t), 1 <i^n; therefore the finite elements (Г, P, 27) and (f, P, 27) are affine-equivalent. (ii) Otherwise, the finite element (Г, Р, 27) is said to be isoparametric, and isoparametrically equivalent to the finite element (t, P, £). If (T, P, t) is a standard straight finite element, it is easily seen in the second case that the boundary of the set T is curved in general. This fact is illustrated by several examples. We then develop an interpolation theory adapted to this type of finite element. In our analysis, however, we shall restrict ourselves to the isoparametric quadratic n-simplex: it is one of the simplest isoparametric finite element, yet it retains all the characteristic features of more general such elements. For an isoparametric family {T,PT,ZT) of quadratic n-simplices, we show (Theorem 37.3) that the nr-inter- 227
228 P.G. Ciarlet Chapter VI polant of a function v satisfies inequalities of the form \v-nTv\m,T^Ch3T-m\\v\\3,T, 0O»<3, where hT — diam(T). This estimate, which is of the same form as in the case of affine families (cf. Section 16) is established under the crucial assumption that the ''isoparametric" mappings FT do not deviate too much from affine mappings; in addition, the family is also assumed to be regular, in a sense that generalizes the regularity of affine families. Even if we use isoparametric finite elements Te^to "triangulate" a set Q, the boundary of the set Qh = \Jtbsth T is usually not identical to the boundary F, even though it may be much closer to Г than if we used straight finite elements. Since the domain of definition of the functions in the resulting finite element space Vh is the set Qh, the space Vh is thus not contained in the space V; therefore both the bilinear form and the linear form need to be approximated. In order to be in as realistic a situation as possible, we then study in Section 39 the simultaneous effects of such an approximation of the domain Q and of isoparametric numerical integration. As in Chapter IV, this last approximation amounts to use a quadrature formula over a reference finite element T for computing all the integrals |гф(х) dx appearing in the linear system, via the isoparametric mappings FT: t-* T,Te$~h. Restricting ourselves again to isoparametric quadratic w-simplices for simplicity, we show (Theorem 43.1) that, if the quadrature formula over the set f is exact for polynomials of degree 2, we have ]|S-Mi,nh = 0(/i2), where й is an extension of the solution of the given boundary value problem to the set Qh (in general йьфй), and /i=maxrg^h/jr. This error estimate is obtained through the familiar process: We first prove an abstract error estimate (Theorem 40.1), valid under a uniform Vh-ellipticity assumption of the approximate bilinear forms. Then we use the interpolation theory developed in Section 37 for evaluating the term ||й—Пьй\\ 1>0h (Theorem 42.1) and finally, we estimate the two consistency errors (Theorems 42.2 and 42.3; these results largely depend on related results of Chapter IV). It is precisely in these last estimates that a remarkable conclusion arises: In order to retain the 0(h2) convergence, it is not necessary to use more sophisticated quadrature schemes for approximating the integrals when isoparametric, instead of straight, finite elements are used. 35. Isoparametric families of finite elements Our first task consists in extending the notions of affine-equivalence and affine families that we discussed in Section 11. There, we saw how to generate finite elements through affine maps, a construction that will now be generalized in Theorem 35.1 for more general mappings F. We begin by considering Lagrange finite elements.
Section 35 Second-order problems posed over curved domains 229 Theorem 35.1 Let (T, P, E) be a Lagrange finite element in W with t = {p(di): 1 =% i<;N}, let there be given a one-to-one mapping F:xe Г-*-(РДх))]= t e R" that satisfies FjeP, l</<n, C5.1) and let T=F{T), P = {p:T-+U;p=p°F-1,peP}, C5.2) Z = {p(F(U,)):l^i^N}. Then the set I is P-unisolvent. Consequently, if T is a closed subset of W with a nonempty interior, the triple (T, P, I) is a Lagrange finite element. Proof. Let us establish the natural correspondences xeT-+x = F(x)eT, peP-*p=p^F~1eP, between the points in the sets T and T, and between the functions in the spaces P and P. If pi, 1 < i^ N, denote the basis functions of the finite element (t, P, £), we have for all p e P and all x e T, N N p(x) = p(x)= Y P{al)pi(x)= £ p(ai)pi(x), i.e., N P= E P(ai)Pi for all peP. ;=i The functions pt, l^i^N, are linearly independent since Е^=1/1;Р;=0 implies £f=i A;p; = 0 and thus kt=0,l^i^N.ln other words, we have shown that the set Г is P-unisolvent, which completes the proof. □ Remark 35.1. If the mapping F is in addition assumed to be continuous, the set T=F(T) has a nonempty interior: Since Thas a nonempty interior (by definition of a finite element; cf. Section 10), this follows from the invariance of domain theorem (cf. Dieudonne [1982, Theorem 24.8.7]). We shall henceforth use the following notation: To indicate that a mapping F:xef-»F(x) = (Fj(x))?=1eRn satisfies relations C5.1), we write: Fe(P)" *> FjeP, l<j'<n. Notice that the construction of Theorem 35.1 is indeed a generalization of the construction that led to affine-equivalent finite elements, because the inclusion Pi{T)cP is satisfied by all the finite elements hitherto considered.
230 P.G. Ciarlet Chapter VI With Theorem 35.1 in mind, we now give several definitions: First, any finite element (T,P, Z) constructed from another finite element (t,P,Z) through the process described in this theorem is called an isoparametric finite element, and the finite element (Г, P, Z) is said to be isoparametrically equivalent to the finite element (t, P, t). Observe, however, that the finite elements (t, P, f) and (Г, P, Z) cannot in general be interchanged in this definition, by contrast with the definition of affme-equivalence. Next, we say that a family of finite elements (T, PT, ZT) is an isoparametric family if all its elements are isoparametrically equivalent to a single finite element (T, P, t), called the reference finite element of the family. In other words, for each T, there exists an isoparametric mapping FT: T-+W, i.e., that satisfies FTe{P)" and FT is one-to-one, C5.3) such that T=FT(T), PT = {p:T^U;p = p°FT1,peP}, C5.4) ZT = {p(FT(ai)):l^i^N}. As exemplified by the special case of affine-equivalent finite elements, one may consider a family of isoparametric finite elements whose associated mappings FT all belong to the same space (Q)", where б is a strict subspace of the space P. Such finite elements are called subparametric finite elements. Examples are given in Fig. 36.4. Remark 35.2. The prefix "iso" in the adjective "isoparametric" refers to the fact that the same space P is used for defining both the finite element (f, P, Z) and the isoparametric mapping FTe(P)". It is worth pointing out that, by contrast with the space PT corresponding to an affine-equivalent finite element, the space PT defined in C5.4) generally contains functions that are not polynomials, even when the space P consists of polynomials. However this complication is ignored in practical computation, inasmuch as all the computations are performed on the set t, not on the set T. All that is needed is the knowledge of the mapping FT, as we shall notably see in Section 39. In practice, the isoparametric finite element found in Theorem 35.1 is not determined by a mapping F but, rather, by N distinct points at,l^i^N, which in turn uniquely determine a mapping F satisfying Fe(P)m, F(ui) = ah l^i^N. C5.5) Such a mapping is given by N F:xebF(x)=^pi(i)ai, C5.6) as it is readily verified, and it is uniquely defined since the set Z is P-unisolvent and
Section 35 Second-order problems posed over curved domains 231 since, for each je {1,2,..., n}, we must have FjeP, FjiaJ^aj,, l^i^N, where we have let a; = (%)"=i- However, in the absence of additional assumptions, nothing garantees that the mapping F: T-+F(T) is invertible, and indeed this property will require a verification in each instance. Finally, notice that the points at are precisely the nodes of the finite element (T, Р, I). The main interest of isoparametric finite elements lies in the freedom in the choice of the points ah which in turn yields more general geometric shapes of sets T than the polygonal shapes considered up to now. As we shall show in Section 38, this property is crucial for getting a good approximation of a curved boundary. Isoparametric Hermite finite elements can be likewise defined. Their construction relies on the following theorem whose proof, quite similar to that of Theorem 35.1, is omitted. Theorem 35.2. Let (T, P, 1) be a Hermite finite element in R", and assume that the order of directional derivatives occurring in the set £ is one, i.e., £ = {cpf:l^i^N0;(pfk:l^k^di,l^i^N1}, C5.7) and the degrees of freedom are of the following form: ф?:р^р(а?), <plk:p->Dp(al)a. C5.8) Let there be given a differentiable one-to-one mapping F: T-+IR" that satisfies FeP", C5.9) and let T = F(f), P = {p:T->R;p=p°F-1,peP}, Z={q>?:l^i^N0;<pjk:lz:k^di,l^i^N1}, C5.10) (p?:p-*p(af), q>\k:p-*Dp{a})^k, where a? = F(df), UKW0, al = F(d}), l^i^Nu C5.11) & = DF(a})&, Ufc^4, l&^Ni. Then the set I is P-unisolvent. Consequently, if T is a closed subset of № with a nonempty interior, the triple (T, P, Z) is a Hermite finite element. The finite element (T, P, Z), which is called an isoparametric Hermite finite element, shares the same properties as the isoparametric Lagrange finite element constructed in Theorem 35.1: In particular, it generalizes the notion of affine- equivalent Hermite finite element; as in C5.6), the mapping F can be easily expressed in terms of the basis functions of the finite element (T, P, £); the finite element (T, P, I) is completely determined once the finite element (Г, Р, £), the points df,
232 P.G. Ciarlet Chapter VI and the vectors £,}k are known (note that the points a} cannot be arbitrarily chosen, however). For more details, cf. Ciarlet and Raviart Г1972Ь]. 36. Examples of isoparametric finite elements Let us now describe several instances of commonly used isoparametric finite elements. For brevity, we shall give a detailed discussion only for our first example, the isoparametric quadratic n-simplex, for which the finite element (f, P, t) is the quadratic п-simplex. Such an isoparametric finite element is uniquely determined once we are given (n+1) points denoted a£, 1 ^i^n+1, and |n(n+1) points denoted atJ, Ki</^nH-l, since there exists a unique mapping F such that (cf. Theorem 35.1) Fe(P2(T))", F{ai)-al, KisSw+l, Р(аи) = а^, Ukj^n+1, This mapping is given by (cf. F.8) and C5.6)): F: xe f-»F(x) = ]T A,(x)BA,(x)- l)at+ £ Щх)Цх)аи. C6.1) Observe that, if it so happens that the points atj are exactly the midpoints Kfli + Oj), then (by virtue of the uniqueness of the mapping F) the mapping F is "degenerate" and becomes afline. This consideration is illustrated in Fig. 36.1 for n = 2, i.e., in the case of the isoparametric quadratic triangle. (Affine case) a (Isoparametric case) Fig. 36.1. Isoparametric quadratic triangle. It is only later (Theorem 37.2) that we shall give sufficient conditions that guarantee the invertibility of the mapping F of C6.1), but at least we can already indicate that these conditions proceed from a natural idea: When n—2 (cf. Fig. 36.1), let us assume that the three vertices a{, 1 <i<3, are the vertices of a nondegenerate triangle T. Then the mapping F: f->T is invertible if the points ay are not "too far"
Section 36 Second-order problems posed over curved domains 233 from the actual midpoints i(a;+a,) of the triangle T (for a counterexample, see Remark 36.1). The boundary of the set T=F{f) is composed of faces, which are the images F(T') of the faces t' of the n-simplex f. Since each basis function ф of the quadratic и-simplex (f, P2(t), t) vanishes along any face of f that does not contain the node associated with ф (cf. Remark 12.2), we conclude that each face of the isoparametric quadratic n-simplex is solely determined by the nodes through which it passes. This property, which is common to all the isoparametric finite elements considered in this chapter (as the reader may check) allows the construction of triangulations made up of isoparametric finite elements (cf. Section 38). Remark 36.1. When n = 2, these considerations can be made more specific: Let a,- and uj, i ф], be two "vertices" of an isoparametric quadratic triangle. Then the curved "side" joining these two points is an arc of parabola uniquely determined by the following conditions: It passes through the points ah aj, atJ and its asymptotic direction is parallel to the vector ay—^(a,- + a,-). With this result, it is easy to construct examples where the mapping F is not one-to-one. For instance, the mapping F corresponding to the following nodes is not invertible: fl!=@,0), fll2=(l,0), e2 = B,0), fll3=(U), a3=@,2), «23 =@,1). Remark 36.2. Let us verify that our description indeed coincides with the one used by the engineers; consider for instance the quadratic isoparametric triangle as described by Felippa and Clough [1970, p. 224]: Given six points ai = (au,a2i), Ki<6, in the plane (the points a4, a5 and a6 play momentarily the role of the points that we usually call a12, a23 and a13, respectively), a "natural" coordinate system is defined, whereby the following relation (written in matrix form) should hold between the Cartesian coordinates xt and x2 describing the finite element and the "new" coordinates Ab X2 and A3: /Ai^-lA fxA /au a12 fli3 a14. a15 a16\ x2 = a21 a22 a23 a24. a25 a26 l/ V i i i i i i Mi^-l) A2Bi2-1) ЯзBА3-1) AX^2 4/l2A3 . 4A3Ai Then we observe that the first two lines of this matrix equation precisely represent relation C6.1), with F{x) = (F1(x), F2(x)) now denoted (x1,x2). The last line implies either At + X2 + A3 = 1 or Ax + A2 + A3 = —^, hence the solution At + X2 + A3 = 1 is the only one that is acceptable if we impose the restriction that A,->0, l</<3. Therefore, the "natural" coordinates Xy, X2 and X3 are nothing but the barycentric
234 P.G. Ciariet Chapter VI coordinates with respect to a fixed triangle T, and the isoparametric finite element associated with the points ab 1 ^i<6, is in this formulation the set of those points (x1,x2) given by the first two lines of the above matrix equation when the "natural" coordinates Xb also known as "curvilinear" coordinates, satisfy the inequalities 0 s£ Xt «S1, 1 =$ i *S 3, and the equality Sf= x kt -1. A general description of isoparametric finite elements along these lines is found in ZffiNKJEWicz [1971, Chapter 8]. The first references where such finite elements were found in the engineering literature are Argyris and Fried [1968] and Ergatoudis, Irons and Zienkiewicz [1968]. (Affine case) (Isoparametric case) Fig. 36.2. Isoparametric cubic triangle. We next consider the isoparametric cubic n-simplex (cf. Fig. 36.2 for n = 2), whose associated mapping F is given by (cf. F.9)): F:JC6f'-+F(x) = j;iAi(x)<3A1E)-l)CA,(jc)-2)(ai) + 1!Я,(х)А^)(ЗА,(х)-1)(а,у)+ £ 2П£хЩх)Хк(х)ат. i*j i<j<k C6.2) Observe that, if the points aiJk play no role in the definition of the boundary of the set T, the space PT depend on their positions. We could similarly define the isoparametric reduced cubic n-simplex and, more generally, the isoparametric n-simplex of degree к for any integer fe^l. All these isoparametric finite elements are instances of isoparametric simplicial, or triangular if и = 2, or tetrahedral if n = 3, finite elements in the sense that they are isopara- metrically equivalent to a finite element for which the set T is an n-simplex.
Section 36 Second-order problems posed over curved domains 235 As an instance of isoparametric Hermite triangular element, we finally mention the isoparametric cubic Hermite triangle which, according to Theorem 35.2, is defined by three "vertices" ah 1 ^ i ^ 3, two directions <*,-, nt at each point ah 1 < i < 3, and a point a123 (cf. Fig. 36.3). Fig. 36.3. Isoparametric cubic Hermite triangle. We next describe some examples of isoparametric rectangular finite elements, in the sense that they are isoparametrically equivalent to a finite element for which the set t is an n-rectangle, for example the unit hypercube Г=[0,1]". In this fashion we obtain the isoparametric bilinear n-rectangle, also known for n = 2 as the isoparametric 4-node rectangle, or as the 4-node quadrilateral (cf. Fig. 36.4). Fig. 36.4. The isoparametric bilinear rectangle, also called the isoparametric 4-node rectangle, or the 4-node quadrilateral. For и = 2, this is an example of a true isoparametric finite element whose sides are not curved! This is so because the functions in the space Qi([0, l]2) are affine in the direction of each coordinate axis. However, this is special to dimension 2. If n = 3 for instance, the faces of the set T are portions of hyperbolic paraboloids and are therefore generally curved. Another instance of isoparametric rectangular finite element is the isoparametric n-rectangle of degree 2, also known as the isoparametric biquadratic rectangle when n = 2. This last element is represented in Fig. 36.5, where various subparametric cases of interest are also indicated, notably the biquadratic quadrilateral, which
236 P.G. Ciarlet Chapter VI Isoparametric case Fig. 36.5. The isoparametric biquadratic rectangle, also called the biquadratic quadrilateral if Fe(QAT)Y, corresponds to FeFi(r)J. If we replace, again when n = 2, the reference finite element by the reduced biquadratic, or 8-node rectangle (cf. Fig. 7.4), we likewise obtain the isoparametric %-node rectangle, and the 8-node quadrilateral when Fe(QAT)J. The biquadratic quadrilateral and the 8-node quadrilateral provide instances of quadrilateral finite elements, that is, finite elements for which the set T is a quadrilateral. When a finite element (T, P, I) is isoparametrically equivalent to a finite element (f, P, t) through a mapping F, we shall systematically use the correspondences xet->x = F(x)eT, peP->p = poF~1 eP, C6.3) C6.4) between the points in the sets t and Г, and between the functions in the spaces P and P, respectively. We shall extend the correspondence C6.4) to more general
Section 37 Second-order problems posed over curved domains 237 functions defined over the sets f and T by letting (v: f-*U)-*{v=voF~1:T^R). C6.5) Then the associated /-interpolation and /"-interpolation operators П and П are such that {ту = т foralb"€domi7=^0(f), C6.6) provided vsAomtl=>v = v°F_1 edomП — Я>°{Т)\ hence this condition excludes situations where the mapping F~l would not be continuous. 37. Estimates of the interpolation errors \\v—nTv\\m^T for an isoparametric family of finite elements This section, which is based on Ciarlet and Raviart [1972b], is devoted to the derivation of an interpolation theory for isoparametric finite elements, i.e., we shall estimate the interpolation errors \\v—ПTv||m>4iT for finite elements (T,P,E) that are isoparametrically equivalent to a reference finite element (f, P, £). This analysis is carried out in three stages, which parallel those used for affine-equivalent finite elements: (i) Assuming the P-interpolation operator П leaves the space Pk(t) invariant, we obtain by an argument similar to that used in Theorem 15.3 inequalities of the form \v~nvUq,T^C{T,P,£)\v\k + 1^f, C7.1) Thus this step is the same as before. (ii) We then examine how the seminorms occurring in C7.1) can be bounded above by analogous seminorms defined on T, and vice versa. Recall that for affine families, we found inequalities of the form (cf. Theorem 15.1) |BL.,,r<C||B|r|detB-1|1"'|«L,P.r, C7.2) l«U,T^CHB-1ndetB!1/"|t5UP,f, C7.3) with F:xe f->F(x) = Bx + b. In the present case, we shall find (Theorem 37.1) that the seminorms \v\m:PtT are bounded above not only in terms of the seminorm \v\m,p,T, but instead in terms of all the seminorms \v\itP,t, 1 <i<m. (iii) As in the affine case, where the quantities ||B||, |detB_1|, ||B-11|, |detB|, which appear in C7.2) and C7.3), were eventually estimated in terms of the geometrical parameters meas(T), hT and pT (cf. Theorem 15.2), we shall also estimate (Theorem 37.2) analogous quantities appearing in the more general inequalities found in step (ii) in terms of simple geometrical parameters attached to the finite element (T, P, ZT). Thus there are essentially two new steps ((ii) and (iii)) to develop. For ease of exposition, we shall detail these new steps about one example only, the isoparametric quadratic n-simplex. Indications for handling more general isoparametric elements will be also given.
238 P.G. Ciarlet Chapter VI We shall use the following notations: J(F)(x) = detCjF;(x)) = Jacobian of F at jc, j(F-i)(x) = {j(F-1)(x)}_1=Jacobian of F at x, C7'4) |F|,,co,r = sup||i),F(x)||^(R«;Rn), *«!■ C7.5) |F-1h,».rsssup||D'F-1(x)||iP,(R";R"), whenever F: f cz№-*T=F{T)ciW is a sufficiently smooth mapping with a sufficiently smooth inverse F_1: Т-^Т. Notice that when the mapping F is of the form F:x-*F(x) = Bx + b, then J(F) = detB, J(F~1) = detB-1, |i4i,».f=IIB||, |F"Mi,».t = IIB-1II- Since we are considering isoparametric quadratic n-simplices, we shall apply inequality C7.1) with the values m = 0,1,2,3 and к +1 = 3 only; thus we shall restrict ourselves to the seminorms | • ]i,P,o with 0 =% / s$ 3 in the next theorem. Notice that the following result is valid for general mappings F, i.e., it is irrelevant here that the mapping F be in the space (i3)", with P attached to some finite element (T,P,t). Theorem 37.1. Let Q and Q be two bounded open subsets of W such that Q = F(U), where F is a sufficiently smooth one-to-one mapping with a sufficiently smooth inverse F~l:Q^U, Then if a function v:Q-+U belongs to the space Wi,p(Q) for some integer 1^0 and some number pe[l, со], the function v = v°F~1;Q->U belongs to the space Wl'v(Q) and, in addition, there exist constants С such that \v\o.P.o<\J(F)\h%,a\v\o.p,u forallvsU{Q), C7.6) l«li.p.o<C|J(F)|i('e)ifi|F-1|i.a,.nl»li.p.fl forallveW^(U), C7.7) l»l2.p.fl<C|J(F)|^e)>u{|F-1|f...0|i5|2.P.fl+|F-1|2.«,.Dl»li.p./>} for alive W2'p{£2), C7.8) |о|з.,,0^Си(Л1^.й{1^_1|!.».о1»1з.р.й + \F~l\Uaa,n\F-1\2^S2\v\2,P,a+\F~1\3,*,a\v\i,P,u} for alive W3-P(Q). C7.9) Proof. Let 1 =*p< oo. As in Theorem 15.1, it suffices to prove inequalities C7.6)- C7.9) for smooth functions. Using the formula for change of variables in multiple integrals, we obtain \v\o,p,n- \v(x)\pdx~- v(F-1(x))\pdx= \J(F)(x)\\v(x)\pdx.
Section 37 Second-order problems posed over curved domains 239 from which we deduce inequality C7.6). Since v = v°F~1, we next infer that Dv(x) = Dv(x)oDF"'M for all x = F(x), and thus, ||JMx)K|F-1|i.«,flllW(x))ll forallxefl. Consequently, \Dv(x)\\pdx^\F-1\p1,OD,n =\F-up 1,oo.fi |J(F)(x)|||jDt;(x)||pdx ^\F |1,«,,о|^(^)|о,оо,й \Dv(x)\\pdx, and inequality C7.7) follows from this inequality and the equivalence between the seminorms (cf. A5.8) and A5.11)) «->M».,.n and c-< \\Dmv(x)Vdx Vp We likewise have for all x 6 Q, ^ e W, £2 e R", i>2»(x)«i.f2) = ^(x)(D2F-1(x)K1,f2)) + D2»(Jc)(DF-1(x)f1,2)F-1(*)«2); hence we obtain, for all x = F(x)ef2, ||Z>2r(x)|| = sup \D2v(x)(^^2)\^\F-1\2^Q\\Dv(m + iF'1\lx,i2\\D2vm lift К1 г=1,2 Therefore, | D2i;(x) || "dx^ ^IF i/p ID^F-^^rdx i/p + l^1|2,oo,fi \D2v{F~ ^W^dx Up
240 Since P.G. Ciarlet Chapter VI Dlv{F-\x))\\pAx i/p \j{F)(m\D%m^ Ax\'\\j{F)\y^J\{\\Div{xw uxl\tP for any integer /, inequality C7.8) is proved. Inequality C7.9) is proved analogously by using the following inequality: ||D3»WKIi7l3...flllD»(J£)ll+3|F-1li,».D|F-1l2.e,.0l|B2^)ll + \F-1\U,Q\\D3v(x)\\ which holds for all x = F(x) eQ. □ To apply the previous theorem, we must next obtain estimates of the following quantities: |J(F)lo...J-, l-W'^lo.-.r, \F\l,ao,T> 1=1,2,3, \F- 1=1,2, for an isoparametric quadratic н-simplex. To this end, the key idea is the following: Since the affine case is a special case of the isoparametric case, we may expect the same type of error bounds as in the affine case, provided the mapping F is not "too far" from the unique affine mapping F that satisfies F(di) = ai, Ki'^n + 1. Therefore we are naturally led to introduce the n-simplex T=F(T) and the points (cf. Fig. 37.1 for n — 2) aij = F(aij), ls£i<Kn+l. C7.10) C7.11) C7.12) Fig. 37.1. The quadratic triangle (f, P, Ё) associated with an isoparametric quadratic triangle: Г has the same vertices as T; P = P2{T), and f = {p(a,): Ui«3; p{ai}): l^i<j<3}.
Section 37 Second-order problems posed over curved domains 241 An inspection of Fig. 37.1 suggests that the vectors (ay —5y) serve as a good measure of the discrepancy between the mappings F and F: To see that this is indeed the case, let p0- denote the basis functions attached to the node a;j- in the quadratic п-simplex (f, P2(T), Ё), where Then we can write F=¥+Y,PiA*t}-uiS), C7.13) since the mapping satisfies the relations Ge(P2(T)r, G(di)=ah l^z'<n + l, G(fly)=ay, l<i</s£n + l, which precisely characterize the unique isoparametric mapping F. Let there be given an isoparametric family of quadratic n-simplices (T, PT, IT), each of which is determined by the data of vertices aiiT, 1<г<и+ 1, and points aij, r, 1 <' </^ w +1. As in C7.10) and C7.11), we let FT denote for each T the unique affine mapping that satisfies ГГ(а;) = а;.Г, 1 ^i^n+1, and we define the n-simplex T=FT(T). Finally, we let, for each (f), nr = diam(T), C7.14) pT — diameter of the sphere inscribed in T. C7.15) We shall then say that such an isoparametric family of quadratic n-simplices is regular if the following three conditions are simultaneously satisfied: (i) There exists a constant a such that hT/pT^a for all T. C7.16) (ii) The quantities hT approach zero, (iii) We have К,7-А-.г11=0(й£), ki<y<«+l, C7.17) where, for each T, aijt T — Рт(йи). Remark 37.1. If the family is affine, condition C7.17) is automatically satisfied since aiUT = aiUT. In this case, the above definition thus reduces to that of a regular affine family as given in Section 16. Although it is clear that condition C7.17) does insure that the mappings FT and FT do not differ too much (cf. C7.13)), the condition that the vectors {aijtT—aijtT) have
242 P.G. Ciarlet Chapter VI to be precisely of order 0(hT) may seem arbitrary at this stage. As we shall show later (cf. Theorem 37.3), the basic justification of this assumption is that it yields the same interpolation error estimates as in the affine case. To begin with, we show that this assumption allows us to obtain upper bounds of the various quantities found in the inequalities of Theorem 37.1 and that it also implies that the mappings FT are invertible for hT small enough (recall that the invertibility of the mapping FT is part of the definition of an isoparametric family). Theorem 37.2. Let there be given a regular isoparametric family of quadratic n-simplices. Then, provided hT is small enough, the mappings FT:T~*T=FT(T) are one-to-one, their Jacobians J(FT) do not vanish, and there exist constants С such that \FT\Uoo,f^ChT, \FT\2ta0tT^ChT, \FT\3,^f=0, C7.18) |ff1li,».T^C/fcr, l^kc.r^C/ftr, C7.19) |J(Fr)|o,cc,f<Cmeas(T), |J(Ff ^о^г^С/тамСТ). C7.20) Proof. For notational convenience, we shall drop the index T throughout the proof. Using the decomposition C7.13) of the mapping F, we deduce that, for all xet, DF{x)=DF{x) + E(x)=B + E(x) with £(x)= £ (ay-аи)Ори(х). C7.21) Since assumption C7.17) holds and since the basis functions ри are independent of T, we thus find that sup|!£(x)KC7!2 C7.22) xet (as usual the same letter С stands for various constants). Therefore |F|i.c0.r = sup||DF(x)K||B||+sup||£(x)KCA, xeT xsT since i| В || ^Ch (cf. Theorem 15.2); likewise, D2F{x)=DE(x), since D2F = 0; hence arguing as before, we find that snp \\DE{x)\\^Ch2, xet and thus m2,oo,? = sup||D2F(x)KC/i2. xeT Hence all relations C7.18) are proved (the last one is clear, since Fe(P2(f))n). Considered as a function of its column vectors djF(x), 1 ^j^n, the determinant J(F)(x) = det(DF(x)) is a continuous multilinear mapping; therefore there exists
Section 37 Second-order problems posed over curved domains 243 a constant C=C(n) such that л J(F)(x) < С £ II djF(x) || for all xeT. j=i Since the inequality | F | lf ю>j- < Cft proved earlier implies that sup*,,? || djF(x) || < C7i, 1 <7<и, it follows that |J(^')lo.«,.r = sup|J(F)(Jc)|<Cfc"<Cmeas(T'), xeT and the first inequality of C7.20) is proved. Because of assumption C7.16), the matrices В are all invertible; hence C7.21) may be rewritten as DF(x)=B(I + B-1E(x)). Using the inequality ||B_1 \\^C/h (cf. Theorem 15.2 and assumptions C7.16)) and inequality C7.22), we conclude that s\ipst£?\\B~1E(x)\\ ^Ch, Let then у be a fixed number in the interval ]0,1[. There exists /i0>0 such that sup||B_1£(x)||<y for all А^й0; xeT hence for h^h0, the operator (I + B~1E(x)) is invertible for each xeT, and sxxpHI + B-'Eix))-1!^-^—. ' C7.23) xeT 1~У This shows that the derivative DF(x) is invertible for all xeT, with (DF{x)) ~J = (/ + В ~J E(x)) -1В ~x. C7.24) We next prove that the mapping F: T-+T is invertible. By the implicit function theorem, we can only deduce that the mapping F is locally invertible, i.e., in a sufficiently small neighborhood of each point of f; this is why the global invertibility requires an additional analysis. Let then x, yet be such that F{x) = F(y). Since the set t is convex, Taylor formula yields F(y) = F(x) + DF(x)(y-x)+iA(y-xJ, where A eiS^R"^") is the constant second derivative of the mapping F; hence DF(x)(y-x)= -U(y~xJ= -U(x-yJ = DF(y)(x-y), and consequently, (DF(x) + DF(My-*) = 0. Since each component Ft of the mapping F is in the space P2(T), and since (Ux + y))et, djF^PAt) => bjF№+djFt($)=2djFt(№+p)), i<»,;<«,
244 P.G. Ciarlet Chapter VI i.e., 0 - (DF(x)+DF(y))(y -x) = 2 DF&x + у))(У -*)■ Since the derivative DF(j(x+y)) is an invertible mapping in i?(R"), we conclude that x = y. We can now turn to the proof of inequalities C7.19) and C7.20). To begin with, we have {DF{x)}~l^DF~\x) for each xef and thus, by inequalities C7.23) and C7.24), IF-Mi.oo.r-suplflV-'MKC/A, xsT which proves the first inequality of C7.19). Next, let there be given smooth enough functions F: R"-+Rn and G: R"-»-R"; then the function H = G°F: W^W is such that D2H(x)^1,^2) = JDG(x)(ZJF(x)(^,^)) + ^2G(x)(DF(xK1,DF(x)^) foralUbkeR". If we apply this formula with G = F'\ in which case Я = /, we obtain D2F-\x){DF{x)^, DF(x)£2) = -Df \x){D2F{x){^, £2)) forallx = F(x)ef. Since the mapping DF(x): W-*W is invertible for each x = F(x)e Г, we infer that D2F-1W(f?1,J72)=-DF-1(x)(ZJF(x)(DF-1(x)Ol,i)F~1W'72)) for all tjur}ze№, and thus, ||D2F-1(x)H= sup IID^-^K^b^lKll^f^ini^F-1^)!!3- llmKl 1=1,2 Hence, using the second inequality of C7.18) and the first inequality C7.19), we conclude that ii7|2.cc,r = sup||D2F-1(x)K|F|2,00,f!F-1|lOo,r^C//I, xeT and the second inequality of C7.19) is proved. Using C7.24), we can write B = DF(x){I + B~1E(x))-1 for all xef, and thus, by C7.23), \detB\ = \J(F%x)\\dst(I + B-xE(x))-1\^}J* for all xef.
Section 37 Second-order problems posed over curved domains 245 Therefore, we deduce that 1 1 r=inf J(f)(x) ^A — y)"|detB\^C meas(f), and the second inequality of C7.20) is proved. □ The elegant proof of the invertibility of the mapping FTeP2(f) given here is due to Annie Raoult. In more general cases, it can often be derived from the following result, due to Meisters and Olech [1963]: Let tbea compact subset of R" with a connected boundary df, and let there be given a mapping F: tс R"->R" that satisfies the following assumptions: (i) The mapping F can be extended to an open subset Q containing T in such a way that its extension (still denoted) F is in the space (e1(U; W). (ii) J{F){x)>0 for all xef. (iii) The restriction of F to df is one-to-one. Then the mapping F: f—>W is one-to-one. Since it is usually straightforward to prove that the Jacobians J(F){x) > 0 for all xef, checking that the mapping F is one-to-one on T thus amounts to checking that it is one-to-one on df; when и = 2 for instance, this is particularly easy since the boundary 9 Г is a union of arcs of simple algebraic curves (cf. Remark 36.1). Other sufficient conditions of injectivity for such mappings F are given in Ciarlet [1987, Section 5.5]. Let us now return to our discussion of the isoparametric quadratic triangle. Combining Theorems 37.1 and 37.2, we are in a position to prove an important result (compare with Theorem 16.2). Theorem 37.3. Let there be given a regular isoparametric family of quadratic n-simplices (T,PT,ST), and let there be given an integer m^O and two numbers p, <je[l, oo] such that the following inclusions hold: W3-P(f)c>%0{f), C7.25) W3-p(f)c+Wm-%f), C7.26) where (f, P, I) is the reference quadratic n-simplex of the family. Then, if the diameters hT are small enough, there exists a constant С such that, for all finite elements in the family, and all functions ve W3,P(T), \\v-nTv\\m,q,T^C{meas(T)yi«-1'ph3T-m{\v\2,p,T + \v\Xp,T}, C7.27) where, for each T, T denotes the n-simplex that has the same vertices as T. Proof. The inclusion C7.25) guarantees the existence of the P- and Pr-inter- polation operators Й and Пт, which satisfy relation C6.6). Combining the
246 P.G. Ciarlet Chapter VI inequalities established in Theorems 37.1 and 37.2, we obtain \v-nTv\0,t,T^\J{FT)\h%,T\e-Uv\o.<.t <C{meas(T)}1'«|0-.tfp|o.e.i-, \v-nTv\1,q,T^C\J(FT)\h{\a,T\Fj1\1,a),T№-Uv\i.<.T ^С{теа.&(Т)}114^\е-Пю\ичЛ, |о-Яг|;|2.,.г^Си(^г)|^в,.1-{|^11?...г1<5-Ле|2,,,г + |ff1|2,oo,rl«-i?«5|l,«,t} Г ] 1 ^с{те^(Т)}^1-^\д-пе\2>йгт+—\е~т[ичЛ The relation P = P2(T) and the inclusions C7.25) and C7.26) next imply, by Theorem 15.3, that there exists a constant С depending only on the set f such that, for all ve W3'P(T), l«5-^lu,r<C|u|3>p,f, l^m, and thus, after combining the above inequalities, we obtain \v-nTvUg,T^C{mcas(f)}^--\v\3^t. Another application of Theorems 37.1 and 37.2 then yields: |pi3,p,^C|J(Ff1)|^,r{|Fr|f.00,f|t;|3,P,r + + l*Yll,<o,f \Ft\i,«>,t\v\2,p,T + |frkoo,?Ml,p,T} ^C{msas(T)}'^h3T{\v\2,p,T+\v\3,PtT}, and thus inequality C7.27) is proved for the values m — 0,1,2. The proof for m — 3 proceeds along the same lines and, for this reason, is omitted. □ It is interesting to compare the estimate of the above theorem with the analogous estimates obtained for a regular affine family of quadratic n-simplices (cf. Theorem 16.2; in this case, Г=Г): li» - ВД«.«.г < C{meas(T)}^- ШЬ3Т~"\v\3tPtT. We conclude that the two estimates coincide except for the additional seminorm Иг.р.г (which appears when one differentiates a function composed with other than an affine function; cf. the end of the proof of Theorem 37.1). Also, the present estimates have been established under the additional assumption that the diameter hT are sufficiently small, basically to insure the invertibility of the derivatives DFT(x), xet (cf. the proof of Theorem 37.2). Remark 37.2. As in the case of affine families (cf. Remark 16.1), the parameter
Section 37 Second-order problems posed over curved domains 247 meas(f) can be replaced by h\ in inequality C7.27), since it satisfies (cf. C7.16)) the inequalities P„a~"hnT ^ meas(f) < PJTT, where /?„ denotes the dx-measure of the unit ball in R". Similar analyses can be carried out for other isoparametric finite elements. Consider for instance the isoparametric cubic n-simplex (cf. Fig. 36.2 for n = 2). If we introduce as in C7.10) the unique affine mapping FT that satisfies FT(a;) = at, 1 ^ i: ^ n + 1, we can obtain by an argument similar to that of Theorem 37.3 interpolation error estimates similar to those of the affine theory, viz., H»-nr«;|L.f.r^C{meas(f)}1"-1'^-"ll»ll4.p.r» 0^m<4, C7.28) provided we consider a regular isoparametric family of isoparametric cubic и-simplices, in the following sense (compare with C7.17)): \\aiij,T-aiihT\\=0{hl), l^i,j^n+l, гФ}, C7.29) l|e№r-fly».rll = 0(ArX Ui<j<k«+1, C7.30) where aiijiT = FT(aUJ) and aiJk<T = FT{dijk). It is clear however that if the points aiij<T are taken from an actual boundary (as they would be in practice), condition C7.29) cannot be satisfied since in this situation one has at best ||агу>г — aUj,T || = 0(/г|). There is nevertheless one case where this difficulty can be circumvented: Assume that n — 2 and that (cf. Fig. 37.2, where the Fig. 37.2. The nodes an2 and a221 of the isoparametric cubic triangle may be chosen in such a way that II «ii2 — ^i i2 II = 0(/i2), ||a22i — «22i II = 0(A2); yet if veH*(T), the interpolation error estimate is still II v- Tlv||m,r = 0(/!*-m), 0 s= m < 4.
248 P-G. Ciarlet Chapter VI indices Г have been dropped for convenience) а331,Г=5з31,Г> flUJ,T = fl113J> fl332,T =Й332,Г> а223,Г = «223,Tj C7.31) I|ail2,r-flll2,rll =0(fcr), || «221,7- ~ 5221,7-11 = 0(/#), II(йц2,г - flii2,r) - (fl22i,r ~ 522i,r) II = 0(h\), C7.32) И(а123,г ~ Й123,г) - (ЯЦ2,Г - «ИМ) - (Я221.Г - Й221,г) II = O(fcj-). Note that assumptions C7.32) are now realistic, in the sense that the points fli 12,r and 022iir can be actually chosen along a smooth boundary so as to fulfill the above conditions. Then it can be shown that the interpolation error estimates of C7.28) still hold if the assumptions C7.29)-C7.30) are replaced by C7.31)-C7.32). For details, see Ciarlet and Raviart [1972b], where the isoparametric cubic Hermite triangle (Fig. 36.3) is also considered. If we turn to isoparametric rectangular finite elements, the situation is less simple. Of course, we could again consider this case as a perturbation of the affine case. But, as exemplified by Fig. 36.5, this would reduce the possible shapes to "nearly parallelograms". Hopefully, a new approach can be developed whereby the admissible shapes correspond to mappings FT that are perturbations of mappings FT in the space (Qi(f)f, instead of the space (P^f))", Accordingly, a new theory, valid in particular for the 4-node quadrilateral, has to be developed, along the following lines: (i) Let Q be a domain in W. Then for any integer fcjs 1 and any pe [1, <x>], the seminorm v-+[v] k,p,Q: i = l Up £ \Dkv{x)(eiff dx\ C7.33) is a norm over the quotient space Wk + 1'p(Q)/Qk(Q), equivalent to the quotient norm (this fact was already noted in Remark 15.2). Hence, let there be given two Sobolev spaces Wk+1-'(Q)aad W"*(fi)with Wk+1'P{Q)^ Wm>p{Q) and let Пе £(Wk+Up{Q); Wm'q{Q)) be a mapping that satisfies Tlq = q foralUe&@). . C7.34) Then (compare with Theorem 15.3) there exist a constant C{Q, П) such that \v-nv\llhqJ1^C{Q,n)[v]k + i,p,0 foi alive Wk+l"(Q). C7.35) (ii) Let F: Q с К"-» R" be a sufficiently smooth mapping. For any integer l^ 1, we let га,=о,я= max sup||Z?'F(Jc)(e()'||. C7.36) lsSi4n xetl Then, if the same assumptions as in Theorem 37.1 hold, there exist constants С such that
Section 37 Second-order problems posed over curved domains 249 for all ve W2*{Q), M3,P,ft^C|J(F-1)|i^,«{[F]f,00,6|t;|3,P,n+[F31,to,6[F]2,00,d^2,P.fi + [^3.c0.fil»li,r.O} for all г6 W^{U), C7'38) (iii) Consider an isoparametric family of 4-node quadrilaterals (cf. Fig. 36.4); and for each Г, let hT = dmm(T), h'T=smallest length of the sides of T, C7.39) yj-^maxllcosKaj+i-aJ^aj-i-aj)}!, l<i<4(mod4)}. Such a family is said to be regular if all the sets T are convex, if there exist constants a' and у such that hT/h'T^a', yr<y<l for all T. C7.40) and if the quantity hT approaches zero. Note that condition C7.40) implies that the ratios hT/pT are bounded but that the converse is clearly false. Then given a regular family, the mappings FT: T =[0,1]2-*T are one-to-one, and the following estimates hold (compare with Theorem 37.2) LFrli.cf^ChT, [FT]2.».T<Cftf, C7.41) [Frll^,t^C/hT, C7.42) \J(FT)\o.oo,T<Ch2T, \J(FT%,x,T^C/h2T. C7.43) (iv) All the above results combined then imply that the following interpolation error estimate holds: Assume that W2-p(f)o,<g0(T) and that W2'p(f) c» Wm-q(t). Then there exists a constant С such that, for all ve W2,P(T), \\v-nTv\\m«,T*ZC{hl}1i'-1"h$-m\\v\\2,p,T, m = 0,l. C7.44) (v) Consider next an isoparametric family of biquadratic quadrilaterals (cf. Fig. 36.5). For each T = F(t), the mapping FT e (Q2 (t)J is uniquely determined by nine points aitT, l^i^9. Let then FT denote the mapping uniquely determined by the conditions Рт£ШП\ FAd^a^, UK 4. C7.45) Then we say that the family is regular if the family of quadrilaterals T—Fr(T) is regular, if conditions C7.40) hold, and if (compare with C7.17)): Нг-3,,г11=0(й& 5<К9, C7.46) where aUT = FT(dt), 5j$z'<9. If the family is regular, the mappings FT: f=[0,1]2-*T are one-to-one for hT small enough, and the following estimates hold (compare again with Theorem 37.2): [Fr]i.-.J-<Cfcr, [Fr]2,co,T<C#, [Fr]3jOO,f = 0, C7.47)
250 P.G. Ciarlet Chapter VI IffMi.co.r^C/Ar, Iff'kco.r^C/Ar, C7.48) \JiFTb.*.t**Ch2T, lAF^h.^T^C/hl, C7.49) and, finally, the following interpolation error estimates hold: Assume that W3,p(t)c+ ^°(T) and that W3-P(TК PFm'«(f). Then, there exists a constant С such that, for all teff3''(r), \\v-nMm«.T^C{h2}ll*-l*h3T-m\\vh,P,T, « = 0,1,2. C7.50) A general interpolation theory for isoparametric rectangular finite elements, which comprises the above results as special cases, is given in Ciarlet and Raviart [1972b]. Significant improvements have then been obtained by Jamet [1976b], who obtained interpolation errors for quadrilateral elements degenerating into triangles. Curved finite elements of other than isoparametric type have also been considered, notably by ZlAmal [1970, 1973a, 1973b, 1974] and Scott [1973a]. Both authors begin by constructing a curved face T" by approximating a smooth surface through an (и— i)-dimensional interpolation process. This interpolation serves to define a mapping FT which in turn allows to define a finite element with T" as a curved face. Then the corresponding interpolation theory follows basically the same pattern as here. In particular, Scott constructs in this fashion a curved finite element that resembles the isoparametric cubic triangle and for which an interpolation theory can be developed which requires weaker assumptions than those indicated here. See also Bernardi [1988]. Wachspress [1971, 1973, 1975] has devised a clever class of rational finite elements, where the sets T are quadrilaterals, or more generally, polygons, and where "the spaces PT consist of rational functions. These finite elements of a new type have then been studied along the lines developed here by Apprato and Arcangeli [1979], Apprato, Arcangeli and Gout [1979a, 1979b], Gout [1979a, 1979b], Laydi and Lesaint [1985]. Curved finite elements based on blending function interpolation have been advocated by Gordon and Hall [1973]; in this direction, see also Cavendish, Gordon and Hall [1976], Barnhill [1975a, 1975b], Zacharias and Subba Rao [1982]. Other relevant references are McLeod and Mitchell [1972, 1975], Lukas [1974], Mitchell [1976], Mitchell and Marshall [1975], Arcangeli and Gout [1976], Leaf, Kaper and Lindeman [1976]. 38. Approximation of a domain with a curved boundary with isoparametric finite elements As in Chapters IV and V, we consider the homogeneous second-order Dirichlet problem that corresponds to the following data: V=Hh(Q) a(u,v)= 1 Yj a^diudjudx, Hp)= \fvdx, C8.1) a a
Section 38 Second-order problems posed over curved domains 251 where Q is a bounded open subset of W with a curved boundary Г (the main novelty), and the functions ai} e ЬЮ(А) and / e L2(Q) are everywhere defined over the set Q. We shall assume that the ellipticity condition holds, i.e., that there exists a constant /? such that P>0, » я C8 2) £ OijixfaZj^p X tf for all xeQ, fo)e R". In this section, we describe in detail how to construct a triangulation of the set Q with isoparametric finite elements, a process that will lead us to the definition of a discrete problem without numerical integration; the definition of a discrete problem with numerical integration will be given in Section 39. Our objective is to assemble isoparametric finite elements (T, PT> ZT), Te$~h, in such a way that the union Qh = {JTs&-hT is a "good" approximation of the set U. We assume that these elements form an isoparametric family, i.e., that they are all constructed from a single reference finite element (T, P, t) through isoparametric mappings FT e (P)" (cf. Section 35). Since we shall assume for ease of exposition that (f, P, t) is a Lagrange element, each mapping FT is thus uniquely determined once the nodes of the finite element (T,PT,ET) are specified. These nodes, which will always be assumed to belong to the set Q, must thus be chosen in such a way that 3~h is a triangulation of the set Qh, in the sense that conditions (^1)-(^"л4) set up in Section 5 hold, and that the boundary of the set Q~h "approximates at best" the boundary of Q. Finally, we shall restrict ourselves to finite elements that possess the following property: Each basis function ф of the reference finite element (T, P, t) vanishes along any face of the set f that does not contain the node associated with ф. C8.3) As shown by the examples given in Section 36, this is not a restrictive assumption. Of course, we shall take advantage of the isoparametric mappings FT for getting a good approximation of the boundary Г: By an appropriate choice of nodes along Г, we construct finite elements with (at least) one curved face which should be "close" to Г, at any rate closer than a straight face would be. Let us assume for definiteness that we are using simplicial finite elements. We then distinguish two cases: either the mapping FT is affine, i.e., FT e (P: (f))", or the mapping FT is "truly" isoparametric, i.e., FTe (P)" but FT$(P1(t))n. The latter case applies in particular to "boundary" finite elements, while the former rather applies to "interior" finite elements. These considerations are illustrated in Fig. 38.1, where we consider quadratic triangles. For computational simplicity, it is desirable to keep to a minimum the number of curved faces, and this is why, in general, only the "boundary" finite elements will have one curved face. However, all the subsequent analysis applies equally well to all
252 P.G. Ciarlet Chapter VI Fig. 38.1 "Affine" and "truly isoparametric" quadratic triangles. possible cases, including those in which all the finite elements (Г, Рт, Ет), Те 2Th, are "truly" isoparametric. Hence we must check in particular that the intersection of "adjacent" finite elements is indeed a face for both of them. In other words, there should be no holes and no overlaps. This is true because, by C5.6), the finite elements that satisfy C8.3) are such that any one of their faces is solely determined by the nodes that are on it (of course, the nodes that define a common face are assumed to be the same for two adjacent finite elements). As an illustration, we have represented in Fig. 38.2 three
Section 38 Second-order problems posed over curved domains 253 isoparametric quadratic tetrahedra "just before assembly": The face T is completely determined by the nodes au a2,a3, a12, a23, an, and the arc s/ is completely defined by the nodes aua2 and a12. Returning to the general case, we shall assume that all the nodes that are used in the definition of the faces that approximate the boundary Г are on Г and that there are no other nodes on Г. Because each face V of an isoparametric finite element is necessarily of the form T = FT(f') with FT e (J3)" and f' a face of t, it is clear that the boundary Гн of the set Qh=\Jres-h T does not coincide in general with the boundary Г of the set Q. Nevertheless, we shall call ^i, a triangulation of the set Q, even though it should more appropriately be called a triangulation of the set Qh. We then let Xh denote the finite element space whose functions vh:Qh->-U are defined as follows: (i) For each Te$~h, the restrictions vh\T span the space Рт = {р:Т->П;р=р°Рт\реР}. (ii) Over each Те 3~h, the restrictions vh\T are defined by their values at the nodes of the finite element (T, PT, IT). If the functions of the space P are smooth enough, such a space Xh is contained in the space ^°(Qh) (by C8.3), a function in the space P is solely determined along a face T of Tby its values at the nodes that lie on T'\ and consequently the inclusion ХксН1{Оь) holds by Theorem 5.1, where Qh denotes the interior of the set Qh. We let X0h denote the subspace ofXh whose functions vanish at the boundary nodes, i.e., those nodes that are on the boundary Г. We recall that, by construction, these nodes are precisely those that are on the boundary Гк. Therefore another application of property C8.3) shows that the functions in the space Xoh vanish along the boundary ГА, and thus the inclusion Vh = X0h^Hh(Qh) C8.4) holds. Since we expect that the boundaries Fh and Г are closer and closer as /i->0, we shall henceforth assume that there exists a bounded open set Q such that QcQ and QhcQ C8.5) for all the triangulations 2Th that we shall consider. Then the most natural definition of a discrete problem associated with the space Vh consists in finding a function Uh e Vh such that Г " Г £ aijdiHhdjvhdx= Jvhdx for all vhe Vh, C8.6) J " ,j = l я», ah where the functions atj and / are some extensions of the functions ay and / to the set Q.
254 P.G. Ciarlet Chapter VI 39. Isoparametric numerical integration In spite of the simplicity and of the natural character of definition C8.6), several questions immediately arise: How should one choose between all possible extensions? How should one construct such extensions in practice? What is the dependence of the discrete solution uh upon these extensions? Surprisingly, it turns out that these ambiguities will be circumvented by taking into account the effect of isoparametric numerical integration: As in Section 25, we assume that we have at our disposal a quadrature scheme over the set T: L <p(x)dx~ X (Ь1ф{В1), ш,еН, dtet, 1*=/<L. C9.1) i = i т Given two functions ф: T->Umd(p : T=FT(T)->U in the usual correspondence ф-*(р = фор~1^ we have <p(x) dx = (p(x)J(Fr)(x) dx, where the Jacobian J{FT) of the mapping FT may be assumed without loss of generality to be >0 over the set f. Therefore, the quadrature scheme C9.1) over the reference element t automatically induces a quadrature scheme over the finite element T (compare with B5.9) and B5.10)), viz., L <p(x)dx~ £ ojliT(p{bl<T), C9.2) with weights coiiT and nodes blT defined by oi,,T = cbiJ(FT)(Si), bKT = FT(Sl), 1</<L. C9.3) Accordingly, we define the quadrature error junctionals ET{(p) = (р(х)йх-^(°итФ1Л C9.4) £(ф)Лф(х)йх-^ mftPil C9.5) J i=i т which are related through the equation ET(<p)=E(<pJ(FT)). C9.6) Let us examine how isoparametric numerical integration affects the definition of the discrete problem C8.6). Assuming that the extensions ay and J are defined everywhere over the set {Q}~, we now seek a discrete solution uhe Vh that satisfies
Section 39 Second-order problems posed over curved domains 255 (compare with B5.27) and B5.28)): L n L S Z т1.т Z (uiAuhdjVh)(bl<T)= £ £ ®1,тG"*)(^,т) for allufteKh. C9.7) We then show that the extensions atJ and J are not needed in the definition of the discrete problem C9.7) if all the quadrature nodes blT, 1 ^/<L, Te3~h, belong to the set Q. To show that this is indeed a common circumstance, let us consider one typical example. Let и = 2 and assume that we are using'isoparametric quadratic triangles and that each node of the quadrature scheme over the set t either coincides with a node of the quadratic triangle T, or is in the interior int f of the set T; as shown by the examples given in Section 25 (cf. Figs. 25.1, 25.2 and 25.3), this is a realistic situation. To prove our assertion, it suffices to consider only a "boundary" finite element and, at this point, it becomes necessary to indicate how the boundary nodes are actually chosen. With the notations of Fig. 39.1, the point allyT is chosen at the Fig. 39.1. The boundary node a12 T of an isoparametric quadratic triangle is chosen as the intersection of the boundary Г with the perpendicular bissector of the segment [al T, a2 r]. intersection between the boundary Г and the line perpendicular to the segment [fli,r>02,r] passing through its midpoint ai2,T = i(fli,r + fl2,r)- This choice has three important consequences: First, if the boundary Г is smooth enough, we automatically have I|fli2,r-a12,rll=0(^), C9.8) where hT is the diameter of the triangle with vertices aUT, 1< i < 3. This estimate will insure that a family made up of such isoparametric quadratic triangles is regular in the sense understood in Section 37; we shall use this property in Theorem 42.1. Secondly, the image bT — F-j(S) of any point Se int T belongs to the set fin T provided hT is small enough. Thirdly, it is clear that there exists a bounded open set Q such that the inclusions C8.5) hold.
256 P.G. Ciarlet Chapter VI Remark 39.1. This construction can be easily extended to an open set with a piecewise smooth boundary, i.e., a Lipschitz-continuous boundary composed of a finite number of smooth arcs, provided each intersection of adjacent arcs is a "vertex" of at least one isoparametric quadratic triangle. Remark 39.2. When n = 3, a node such as a12 (cf. Fig. 38.2) may be chosen in such a way that the distance between the points a12 and a12 is equal to the distance between the point a12 and the boundary F. Returning to the general case, we are therefore justified in assuming from now on that the relations bUT = FTFl)eQ, U^L, for all Tef„, C9.9) hold for all the triangulations S~h to be considered. This being the case, the discrete problem C9.7) consists in finding a discrete solution uhe Vh such that ah{uh,vh) = lh{vh) foralbfc6Kk, C9.10) where, for all functions uh,vhe Vh, the approximate bilinear form ah(-,-) and the approximate linear form lh(-) are given by L n ah(uh,vh)= £ £ (oUT X {aifiiUi$jVh){KT), C9.11) Te$-h 1 = 1 U=l lh(v„) = £ £ ^Afv^b.r). C9.12) ГеЛ, ( = 1 In other words, thanks to the effect of numerical integration, the discrete problem can be defined without any reference to possible extensions of the functions atj and f, and this observation is of course of great practical value (by contrast, extensions explicitly appear in the final error estimate; cf. Theorem 43.1). Conceivably, several quadrature schemes over the reference finite element could be used, which vary according to which finite element is considered in the triangulation. In particular, one would naturally expect that more sophisticated schemes are needed for dealing with the "truly" isoparametric finite elements. Since our final result (Theorem 43.1) shows however that this is not necessary, we shall deliberately ignore this possibility (which, at this stage, would simply require notational modifications in C9.11) and C9.12)). 40. Abstract error estimate The remainder of this chapter is based on Ciarlet and Raviart [1972c]. Given a family of discrete problems of the form C9.10), we shall say that the approximate bilinear forms ah{-,-) of C9.11) are uniformly Vh-elliptic if there exists a constant a such that *>0'2 D0.1) &\\v>,\\i,nh^ah{vh,"и) for all vhe Vh and all h.
Section 40 Second-order problems posed over curved domains 257 As usual, we first prove an abstract error estimate. The arbitrariness of the functions 5y and й appearing in the next theorem is temporary: When this error estimate is actually applied, these functions will be extensions of the functions atJ and и (cf. Theorem 43.1). Theorem 40.1. Given an open set £2 that contains all the sets Qh, and given functions aueL°°(&), let ah(v,w)= £ uijdivdjwdx for all v,we Hl(Qh), D0.2) J u=i On and consider a family of discrete problems of the form C9.10), whose associated approximate bilinear forms are uniformly Vh-elliptic. Then there exists a constant С independent of the space Vh such that \\u-uhL^c( inf \\\u-vh]\uah+ sup l*<*.**>-*<*.".) + sup hy] » hK *"- , D0.3) vv/iere u is any function in the space Я^Й), аш/ нй denotes for each h the solution of the discrete problem C9.10). Proof. The assumption of uniform ^-ellipticity insures in particular that each discrete problem has a unique solution uh. Also, there exists a constant M independent of h such that \ah(v,w)\^M\\v\\US2Jw\\UQh for all v,weH\Qh). D0.4) Let then vh denote an arbitrary element in the space Vh. We have &\\Uh-vh\\2Uah**ah(uh-vh,uh-vh) = ah(u-vh,uh-vh)+{ah(vh,uh-vh)-ah{vh,uh-vh)} + {lh(uh-vh)- ah(u, uh -vh)}, so that, using D0.4), &\\Uh-vh\\i,ah .,-,,. „ ,\ah(vh,uh-vh)-ah(vh,uh-vh)\ + \\uk-vh\lUOh \ah(u,u„-vh)-lh(uh-vh)\ \\uh-vh\\1A ..->„- „ , \ah(vh,Wh)-ak(vh,wh)\ \ah{u,wh)-lh(wh)\ ^M\\u-vh\\UQh+ sup — + sup WhsKh l|W(,||lf0h WheVh \\Wh\\l„ iih
258 P.G. Ciarlet Chapter VI Combining the above inequality with the triangular inequality \\u-uh\\i,By,**\\u-vh\\liBh+\\uh-vh\\iiBh, and taking the infimum with respect to vke Vh, we obtain inequality D0.3). □ The remainder of this chapter will thus be devoted to giving sufficient conditions that imply the uniform FA-ellipticity of the approximate bilinear forms (Theorem 41.1) and to estimating the various terms that appear in the right-hand side of inequality D0.3). To keep the development within reasonable limits, we shall however restrict ourselves to finite element spaces made up of isoparametric quadratic n-simplices. In Section 37, we have defined a regular family of isoparametric quadratic n-simplices (cf. C7.16) and C7.17)). Since this regularity will be a pervading assumption in the remainder of this chapter, it is crucial to notice that condition C7.17) is perfectly compatible with the construction of boundary elements (cf. C9.8) and Remark 39.2). 41. Uniform Frellipticity of the approximate bilinear forms It is remarkable that the following sufficient conditions for the uniform F^-ellipticity are the same as for straight quadratic n-simplices (cf. Theorem 27.1 with fc' = 2). Theorem 41.1. Let (Vh) be a family of finite element spaces made up of isoparametric quadratic n-simplices forming a regular family, and let there be given a quadrature scheme L cp(x)dx~ Yj <*>i<P{b~i)> ю(>0, 1</<L, J ;=i f such that either the union (Jf=1{^} contains a P\(t)-unisolvent subset, or the quadrature scheme is exact for the space Pz{f), or both these assumptions hold. Then the associated approximate bilinear forms are uniformly Vh-elliptic, i.e., there exists a constant a such that a>0, D1.1) a||f*lli,fib<a*A'*. vh) far all vhe Vh and all h. Proof, (i) Arguing as in part (i) of the proof of Theorem 27.1 we find that there exists a constant C>0 such that C|pli.r<X>« t !S«M)I2 forallp6P = P2(f). D1.2) 1=1 1=1 (ii) Given a finite element (T,PT,ZT), Те &~h, and a function vhe Vh, let p = vh\T.
Section 41 Second-order problems posed over curved domains 259 Using the ellipticity condition C8.2), we obtain £ щ,т t (ауЗ^Э^ХМ^ I e>/.r t 1М2>,,гI2, D1.3) 1 = 1 i,j=l i = l i=l and we recognize in the expression £"= j |5,p(b,,r)|2 the square of the Euclidean norm || • || of the vector Dp{bUT). Since FT is invertible for h small enough by Theorem 37.2, we have Dp(x) = DP{x)DFt1{x) for all x = FT(x), xet; here Dp(x) and Dp(x) are identified with the row vectors (9j p{x),..., 9„p(x)) and F\ p(x),..,, d„p(x)) respectively, and DFj1 (x) is identified with the Jacobian matrix of the mapping Ff * at x. Using the inequality ^AAr£,T^- \\A~1\\ ~2££T valid for any invertible matrix A and any row vector £ (the superscript T denotes transposition), we obtain |, ^~В^°^ЩШ> ,!>'** D1.4) for allx = Fr(x)eT. Since ю|>г = Ш|/(^г)(Я,) (cf. C9.3)) and since the weights d), are positive, we deduce from D1.2) and D1.4) that: Z щ,т £ №iP(bi,T)\2 (=1 i=l 1Л2 ^ , * 2 Iplb, D1.5) ККГ )kco,TKrll,os,r where, here and subsequently, we use the notations introduced in C7.5). By Theorem 37.1, there exists a constant С such that IPli.r^C 12 |p|li7. forallp = />oFf\ peP. D1.6) ккгЛо;«,гкг и,»,г Hence, combining inequalities D1.3), D1.5) and D1.6), we obtain L n Z mi,T Z (aiAvhQjVh){bi,T) 1 = 1 U=l >BCC2 MlLr ш j) "^ Wff^lo.oo.rl^rJIo.oc.rdFrlLoo.flfr-Ml.ocr}2' У'' If we next make use of the estimates established in Theorem 37.2, which we may apply since the isoparametric family is assumed to be regular, we find that the denominators appearing in the right-hand side of inequality D1.7) are uniformly bounded for all Те STh, all vhe Vh and all h. Therefore we have shown that there
260 P.G. Ciarlet Chapter VI exists a' such that oc'>0, L n Z ш1,т Z (aiAVhdjVk)(bi,T)>a'\Vh\i,T 1=1 u-i D1.8) for all Te^~h, all vheVh, and all ft. (iii) Using inequality D1.8), we obtain L n Teirh 1 = 1 i,j=l >«' Z Ni,r = a'kli,oh for all d,6 Randall ft. D1.9) Since all the sets Qh are contained in a single bounded open set Q (cf. C8.5)), there exists a constant С independent of ft such that IMIia^CIpIlc foralbe#i(fl*). D1.10) To see this, it suffices to apply the Poincare-Friedrichs inequality over the set Q to the function ve Hl{Q) that equals an arbitrary function ve Hi(Q) on Qh and that otherwise vanishes on Q—Qh. Inequality D1.1) is then a consequence of inequalities D1.9) and D1.10). □ 42. Interpolation and consistency error estimates With the finite element space Xh constructed in Section 38, we associate the X^-interpolation operator Tlh, whose definition is the natural extension of the definition given in Section 12 in the case of straight finite elements: The Xh-interpolant FIhv of a function vedomnh = ^0(Qh) is the unique function that satisfies nhveXh, Ялр(а;,г) = ф,-,г), UiO + 1, for all ТеГк, D2.1) Пн v(aUi T) = v(aih T), 1 s$ i < j s$ n +1, for all T e Fh. In this fashion, it is clear that the relations nhv\T = nTv for all Те Fh D2.2) again hold. We now estimate various norms of the difference {v — JJhv). In particular, these estimates will subsequently allow us to obtain, for a specific choice of function й, an estimate of the first term in the right-hand side of the error estimate D0.3) when we choose vh = nhu. As usual, the same letter С stands for various constants independent of ft and of the various functions involved.
Section 42 Second-order problems posed over curved domains 261 Theorem 42.1. Let (Xh) be a family of finite element spaces made up of isoparametric quadratic n-simplices forming a regular family and assume that rc<5. Then there exists a constant С independent of h such that ll»-nht>|L,0h<C*3-»||»||3>ai, m = 0,l, for all veH\Q), D2.3) E 11«-Пг»||*>гГ/%СЛ3-"||»||з.011, m = 2,3, for all veH\U), where h = max hT, D2.5) TePh and Q is any open set such that the inclusions C8.5) hold. We also have the implication veH3(Q)andv = 0onr => TIhveX0h. D2.6) Proof. Since n ^ 5, the inclusion H3(T)of'£0(T) holds and thus we may apply Theorem 37.3: For all functions vbH3(T), \\v-nTv\\m,T <CPT-m{\v\2,T + \v\3,T} ^Chrm\\v\\3,T, 0<m^3, and inequalities D2.3) and D2.4) follow from these inequalities and property D2.2). If a function vanishes on Г, its AVinterpolant vanishes at all the nodes situated on Гк by construction; therefore it vanishes on the boundary Th of the set @h= {Jrerh T, and implication D2.6) is thus proved. □ As in Chapter IV, the consistency errors \ah{Vh,wh) - a„(vh,wh)\ \ah(u,w„) - l„(wh)\ sup — withrA6Kh, sup WhtVt, I W*||i,с WheV„ !|Wfc|ll,flh which appeared in the abstract error estimate D0.3), will be estimated after similar, but "local", terms have been themselves estimated. Such "local" estimates are the object of the next two theorems (compare with Theorems 28.2 and 28.3). The quadrature error functionals ET(-) and £(•) have been defined in C9.4) and C9.5). Theorem 42.2. Let there be given a regular isoparametric family of quadratic n-simplices (T, PT, IT), and let the quadrature scheme over the reference finite element be exact for the space P2(T), i.e., Ё{ф) = 0 for all феР2(Т). D2.7) Then there exists a constant С independent of T such that |£т(аЭ(Р'8;р)|<СА?-||а||2>ао,г||р'||2.г1р11.г ,42 8) for all aeW2<x{T), pePT, p'ePT. Proof. For notational convenience, the indices T are dropped throughout the proof.
262 P.G. Ciarlet Chapter VI (i) To begin with, we record some consequences of Theorem 37.2: Inequalities C7.18) imply that ia«FJo,.,r<Cfc, l<i,fc*Sn, D2.9) |3yf*lo.»,r ^Ch2, l^i,j,k^n, D2.10) and inequalities C7.20) imply that \W\o,*,,T^Ch", |J(F-1)|0,ao,T = |J(F)-1|o.=o,T^C//j''. D2.11) Next, we show that IMfJkoo.rsSC^1, l<i<n, D2.12) |SyJ(f)lo.»,r<C/i" + 2, KU^n. Let us denote by 3(F(x) and 3(j.F(x) the column vectors with components 6,^к(х), 1 ^ к < n, and 3;j-Ffe(x), 1 ^ /с ^ n, respectively. To prove the first inequalities of D2.12), we observe that, for any xeT, we have n Э, J(F)(x) = X detC1 F(x),..., Qj.! F(x), 8yJF(x), 87+1 F(x),..., 6„F(x)), j=i and we use inequalities D2.9) and D2.10). The second inequalities of D2.12) are proved in a similar fashion (since Fe(P2(t)f, the partial derivatives dijkF vanish identically), (ii) The expression to be estimated can be written as £г(аЭ,р'ЭД = £(а{Э,р'Н8;рр(Л). D2.13) Then it is clear that, by contrast with the affine case, the functions C;p'}л and {Qjp} A no longer belong to the space Pi{f) in general. This is why our first task is to determine the nature of these functions: Denoting by e, the jth basis vector of R", we have {bjPY(x) = djp(x) = Dp(x)ej=Dp(x)DF-1(x)ej=Dp{x)(DF(i))-1ej. Noting that the vector fj = (DF(jt))~1ej is the solution of the linear system DF(x)fj - ej, we find that {SJ.p}"(x)={J(F)(x)}-1 x £ 3^)detC1F(x),...,8t-1F(x),eJ-,3t+1F(x),...,3nF(x)). fc= l D2.14) Consequently, the expression {Э;р}~(х) J(F)(x) is a finite sum of terms of the form ±dkP{x)Y[dlFjm(x), and likewise the expression {Эгр'}~(х) is a finite sum of terms of the form ±{ЛЛ(х)}-1ал'(*)Пал.>(*)-
Section 42 Second-order problems posed over curved domains 263 Using D2.13), we thus find that ET(adiP'djP)= Y ±E({J(F)}-1alldsFMnd,Fmdrp\p), D2.15) к,Щ),1фк \ s*r 1Фк / r,j(s),s^r where the symbol £' simply reminds that the indices ./G) and j(s) do not take all possible values 1,2,..., n. (iii) We now take crucial advantage of the fact that the functions {8;p'}~ and {djp}~ can be expressed in terms of the functions dkp, 1 </c<n, which do belong to the space Pi(t): Any one of the terms occurring in the sum D2.15) can be written as E {J(F)} -'аИ Wm П WkiAP'\p ) = ЁFЩ, \ sitr Хфк D2.16) D2.17) with s^r l^k v = drp'eP1(T), w = dkpeP1(T), and consequently, inequality B8.16) with fc = 2 yields \E{Svw)\ «S C{|^|2,«„rl«5|o,J- + l^li,a,,f \v\i,t}\w\o,t It thus remains to express the various seminorms found in this inequality in terms of appropriate norms defined over the set T. Using Theorems 37.1 and 37.2, we obtain: D2.18) |р|,.г<СА-"'2Л'|р|/.г, I = 0,1, \p'\ut^Ch-"'2hl\\p'\\UT, /=1,2. Next, we have (cf. B8.11)) |£|l,co,r {J(F)}-iaUasFmYld,Fm s*r l^k 1,00,? D2.19) ^(•/(FrMo.co.rlaL.oo.r П dsFm П diFjw s#r l^k 0,co,r +\т-1\о,оо,т\й\о,*,т S5*r 1фк № l.OO.T + \m-1w.*>.t\&\o.*>.t Пэ,^,Пэ^«) s^r 1Фк 0,аэ,Т D2.20) and we likewise have an analogous inequality for the seminorm \S\2,ao,t- Using
264 P.G. Ciarlet Chapter VI inequalities D2.9) and D2.10), we obtain nasF,(s)ns,F„ jw 11 ui' m 1Фк ^Ch2"-2 + \ 1 = 0,1,2, D2.21) X,a>,T and, using inequalities D2.11) and D2.12), we obtain \J(F)~l\b*,.t<CV-', м = 0,1,2. D2.22) Therefore, combining inequalities D2.20), D2.21), D2.22) with the inequalities (cf. Theorems 37.1 and 37.2) l<2|v.e,.r<Cfcv||a||¥,0O,r> v = 0,1,2, D2.23) we eventually find that Ifli.x.r^C/f-Mlfllli.co.r- D2.24) By a similar analysis, we would find that \6\2,oo,T^Ch"\\a\\M. D2.25) Then the inequalities D2.18), D2.19), D2.24), D2.25) together with D2.16) imply that <Cfc2||fl||2,0O.r||p'|l2.rlpli.7- D2-26) Ё {/(F)} "МП bsFm П ^Fmdrp'dkp \ s#r 1Фк Adding up inequalities D2.26), we find that the expression £г(аЭ(р'б,р) (cf. D2.15)) satisfies an inequality similar to that of D2.26), and the proof is complete. □ Rem ark 42.1. With the same assumptions as in Theorem 42.2, one can show that the estimates \ET{adip'djp)\^ChT\\a\\2,a:,iT\\p'\\1,T\p\UT hold. From these, another proof of the uniform Ffc-ellipticity of the approximate bilinear forms can be deduced (this type of argument was used notably by Zlamal [1974]). Theorem 42.3. Let there be given a regular isoparametric family of quadratic n-simplices (T,PT,IT), let the quadrature scheme over the reference finite element be such that Ё(ф) = 0 forall<peP2{t), D2.27) and finally, let qe\_l, oo] be any number that satisfies the inequality 2-n/q> 0. D2.28) Then there exists a constant С independent of T such that |£r(/p)l<CfcHmeasE!')}1/2-1/« Ц/112.,,г||р||1>г for all fW2'*(r), pePT, D2.29)
Section 43 Second-order problems posed over curved domains 265 where T denotes for each T the n-simplex with the same vertices as those of T. Proof. For all feW2'"(T) and all pePT, we have ET(fp) = E(fpJ(F)), D2.30) and it follows from the proof of Theorem 28.3 (cf. B8.23) and B8.24)) that there exists a constant С such that \Щр)\ ^ С{{\0\1л,т + \9\2,q,t}\P\ut + l^l2*r|plo.r) for al\ geW2«(T), peP2(T). D2.31) Letting g=fJ(F) D2.32) in inequality D2.31) and making use of inequalities D2.11), D2.12), and |/|M,r *c C{meas(?)}"l'W|| / ||м>г, ц = 0,1,2 (cf. Theorems 37.1 and 37.2), we obtain ^CimeasCfy-vw+'WfW^T, 1=1,2. These last inequalities, combined with relations D2.30), D2.31), D2.32) and the first inequalities of D2.19) with / = 0,1, yield inequality D2.29). □ 43. Estimate of the еГГОГ ] | Й — H/, 11 m We are now in a position to prove the main result of this chapter, which should be compared with Theorem 29.1. We recall that и is the solution of the variational problem corresponding to the data C8.1). For references concerning the existence of extensions such as u and a;j below, see Lions [1965, Chapter 2], NeCas [1967, Chapter 2]. Theorem 43.1. Let n ^ 5, let (Vh) be a family of finite element spaces made up of isoparametric quadratic n-simplices forming a regular family, and let there be given a quadrature scheme on the reference finite element that satisfies Ё(ф) = 0 for all феР2(Т). D3.1) Let Q be an open set such that the inclusions Q<=Q, Qhc:Q for all h D3.2) hold, and such that the functions и and aijt 1 < i,j ^ n, possess extensions й and
266 P.G. Ciariel Chapter VI 5y, 1 ^i,j^n, that satisfy ueH3(Q), uijeW^iQ), l^i,j^n, D3.3) 7= E S;(aye,-u)eW2'«(G) for some q>2>n/q. D3.4) u= i T/jen t/геге exists a constant С independent of h such that l|fi-«»lliA^ СЛ2|||и||э.й+ _t ||ау||2,да1й||й||з,а + ||7|12*й|, D3.5) w/iere ft = max 7-^ toT. Proof. By Theorem 41.1, the approximate bilinear forms are uniformly Vh -elliptic and therefore, we can use the abstract error estimate D0.3) of Theorem 40.1. (i) Since n «S 5, the inclusion H3{Q) c, <g°({U}') holds, and by Theorem 42.1, the function Пкй belongs to the space X0h since и = и = 0 on the boundary Г. We may thus let vh = Пнй in the term vaiVheVh{...} that appears in the abstract error estimate. In this fashion, we obtain l|w-Wfcili,fi1,^C^II""i7/1ulli,fih+ sup (. wheKh II Wh II l.Ch, + sup ■' fcV *' hK нИ\, D3.6) and, by Theorem 42.1, we know that I! й - nhu || lifti < Ch21! Й j| 3,fih ^ С/г21| Й || з,й. D3.7) (ii) To estimate the two consistency errors, a specific choice must be made of the functions йц that appear in the bilinear form ah( •, •): We shall choose precisely the functions given in D3.3). Notice that, since the inclusion W2-'X'{U) c*. <<f *({$} ~) holds, the functions au are in particular defined everywhere on the set U Then we have, for all wheVh, ah{nhu,wh)-ah{nhu,wh) Г" n L n E йи9A"8;wftdx - E E шг.г Е (ЯгА-Л^^/Жг). and, since all the quadrature nodes b,,r belong to the set Q, we have аи(Ъит) = йиФит)- Consequently, we can rewrite the above expression as ah(nhu,wh)-ah(nhu,wh)= £ £ £г(й,78(Ялй8^л).
Section 43 Second-order problems posed over curved domains 267 Using the estimates of Theorem 42.2 and Cauchy-Schwarz inequality, we obtain \ah{nhu,wh)-ah{nhu,wh)\ tl <C £ h2T Y, \\aij\\2,ao,AnTU\\2,T\WhW,T ^Ch2\ £ llSyll^flj j X \\Птй\\1т\\щ\иок- Another application of Theorem 42.1 yields 11/2 Г ■} i/2 Z \\Птй\\1т\ ^\\й\\2Л, + 1 Z || м — ГГги || i.y }> <11«1|2.011 + СЛ||й||з,01><С||й||з1Д, and thus, we have shown that \ah(Ilhu,wh) - ah(Ilhu,wh sup WheKh , „ <Ch2 £ ||fly||2.«,,fll|fi||3.fl. lWAlll,fih i,j=l D3.8) (iii) Let us next estimate the numerator of the second consistency error. First it is easily verified that assumptions D3.3) imply in particular that the functions (ауб.й) belong to the space ЯХ(Й). Therefore Green's formula yields ah(u, wh) ■■ Oh Z aijdiuejwb dx Z 9j(ay8,fi)w» dx = u=i fwhdx for all wheVhcz Hi{Qh). Oh lib Since - Z 3/aiA") = - Z 8/ау8гм)=/ on Д U=l i,j=l the function f given in D3.4) is an extension of the function f. Besides, using once again the fact that all integration nodes bt-T belong to the set Q, we infer that f(bl:T) =/(b;,r); consequently, we can write ah(u,wh)-lh(wh) = fwh dx - Z Z wi,r(M)(b,,r) Te$-h 1=1 Oh = Z ETGwh),
268 P.G. Ciarlet Chapter VI and the estimates of Theorem 42.3 imply \ah(u,wh)-lh(wh)\^C £ h2T{m^{t)}^-^\\J\\2^T\\wh\\uT TeSTh ( ")I/2-l/e ^Ch4 £ meas(f» li/lk^J! wJ|1A. t TeFh J By construction, the interiors of the n-simplices do not overlap. Hence the quantity £ meas(r) = meas| [J Г] TeSTh is bounded independently of h. Thus, we have shown that sup ^^-^Uch^n^ D3.9) and inequality D3.5) follows from inequalities D3.6), D3.7), D3.8) and D3.9). □ We have therefore reached a remarkable conclusion: In order to retain the same order of convergence as in the case of a polygonal domain {when only straight finite elements are used), the same quadrature scheme should be used, whether it be for straight or for isoparametric finite elements. Thus, if n = 2 for instance, we can use the quadrature scheme of B5.18), which is exact for polynomials of degree ^2. Remark 43.1. As expected, the order of convergence in the absence of numerical integration is the same, i.e., one has \\u — uh\\iMh = 0{h2), where uh is now the solution of the discrete problem C8.6). The corresponding error analysis, which relies again on the abstract error estimate of Theorem 40.1, was carried out in Scott [1973a]. To complete this study, it remains to show that for a given domain with a curved boundary (irrespectively of whether or not numerical integration is used), isoparametric quadratic n-simplices do yield better estimates than their straight counterparts! More specifically, assume that the set О is a bounded convex domain in U2 (Fig. 43.1). Given a triangulation 3~h made up only of "straight" triangles as indicated in Fig. 43.1(a), let Xh denote the finite element space whose generic finite element is the quadratic triangle, and let Vh = {vheXh; vh = 0 on rh}, where Fh is the boundary of the set [JTer.T. Then one can show (cf. Strang and Berger [1971], Тномёе [1973b]; see also Strang and Fix [1973, chapter 4]) that \\u-uhU,ah=0(h3/2),
Section 43 Second-order problems posed over curved domains 269 (a) (fa) Fig. 43.1. Approximation of a domain in R3 with straight (a) or isoparametric (b) quadratic triangles. where uh e Vh now satisfies E aiAuhQjvhdx = fa* dx for all vhe Vh 3=1 J i,j=l (} Q (note that in this case, the X^-interpolant of the solution и does not belong to the space Vh). This shows that finite element spaces constructed with isoparametric quadratic triangles, as in Fig. 43.1(b), yields better error estimates. Remark 43.2. By contrast with the case of straight finite elements (cf. Remark 29.1) the integrals f,ray3,-ufc8jt>4 dx are no longer computed exactly when the coefficients atJ are constant functions. To see this, let (T,PT,IT) be an isoparametric quadratic n-simplex; then we have diP'djpdx-- J(F){Qip'Y{Qjpy d* for a11 Р',РеРт, and (cf. D2.14)), J(F)(x){diP'Y(x)= £ dkp'(x)det(9,F(x),...,Э*_^(jc),et, dk+1F(x),....8„F(x)) k=l = {polynomial of degree ^n in x}, {djpY(x)={J(F)(x)}~1 x {polynomial of degree ^n in x}. Since J(F)(x) = det(81F(x),...,S„F(x)) = { polynomial of degree ^n in x}, it eventually follows that {polynomial of degree ^ 2n in x} 6ip'5]pdx = {polynomial of degree < n in x} dx. The exact computation of such integrals would thus require a quadrature scheme exact for rational functions of the form N/D with NeP2„{T), DePn(T), and such
270 P.G. Ciarlet Chapter VI schemes are not known, except for the 4-node quadrilateral (cf. Babu and Finder [1984], Mizukami [1986]). Remark 43.3. Approximate quadrature used in conjunction with the 4-node quadrilateral often leads to hourglass instabilities (cf. Liu and BelyTschko [1984]), already noted at the end of Section 29 about the general phenomenon of underintegration. In this direction, see also Girault [1976], who has nevertheless shown that 4-node quadrilaterals may be used in conjunction with a one-node quadrature scheme, even though the uniform FA-ellipticity no longer holds. As already mentioned, the error analysis developed in this chapter follows the general approach set up in Ciarlet and Raviart [1972c] (however it was thought at that time that more accurate schemes were needed for isoparametric elements), where an estimate of the error in the norm | • |g ^ was also obtained. See also Nedoma [1979] and Lenoir [1986], for further results in the same spirit, and, especially, Wahlbin [1978], who establishes the uniform convergence. An analogous study was made by Zlamal [1974], who showed that, for two-dimensional curved elements for which P = Pk{t), к even, it is sufficient to use quadrature schemes exact for polynomials of degree ^ 2k — 2, in order to retain the 0(hk) convergence in the norm || • || 1>flh. Zlamal [1973b] has also evaluated the error in the absence of numerical integration; for complementary results, see Veidinger [1975]. Likewise, Scott [1973a] has shown that quadrature schemes of higher order of accuracy are not needed when curved finite elements are used. However, the finite elements considered by Zlamal and Scott are not of the isoparametric type as understood here. For such elements, a general theory is yet to be developed, in particular for quadrilateral finite elements. Zlamal's method has been further improved by ZenISek [1981a, 1981b, 1987], who was able in particular to handle various types of nonhomogeneous boundary conditions, then by Lenoir [1986], who extended it to an arbitrary space dimension, and finally by Bernardi [1986, Chapter 6], who was in addition able to handle nonconforming curved elements. See also Cermak [1983a, 1983b] for related ideas. Alternate ways of handling Dirichlet problems posed over domains with curved boundaries have been proposed, which rely on various alterations of the bilinear form of the given problem. In this direction, we notably mention: (i) penalty methods, as advocated by Aubin [1969] and Babuska [1973b], and later improved by King [1976]; (ii) methods where the boundary condition is considered as a constraint and as such is treated via techniques from duality theory, as in Babuska [1973a]; (iii) least square methods as proposed and studied in Bramble and Schatz [1970, 1971], Bramble and Nitsche [1973], Baker [1973]; (iv) methods where the domain is approximated by a polygonal domain, as in Bramble, Dupont and Thomee [1972]; (v) various methods proposed by Nitsche [1971, 1976c]. Finite element approximations of Neumann problems posed over domains with curved boundaries have been studied by Barrett and Elliott [1984, 1987],
Section 43 Second-order problems posed over curved domains 271 Molchanov and Galba [1985]. References about nonconforming isoparametric quadrilateral elements have been given at the end of Section 34. For additional references for the finite element approximation of boundary value problems posed over domains with curved boundaries, see Babuska [1971b], Berger [1973], Berger, Scott and Strang [1972], Blair [1976], Bramble [1975], Nitsche [1972b], Scott [1975], Shah [1970], Strang and Berger [1971], Strang and Fix [1973, Chapter 4], Тномёе [1973а, 1973b]. We finally mention that, following the terminology of Strang [1972b], we have perpetrated in the last three chapters three variational crimes, by using numerical integration, nonconforming finite elements, and approximate boundaries.
Chapter VII Finite Element Methods for Fourth-Order Problems Introduction In this chapter, we study two commonly used finite element approximations of the plate problem on polygonal domains. To begin with, we consider various conforming methods, which thus require straight finite elements of class c€l. Although such finite elements cannot be imbedded in affine families in general, we show that, under very reasonable assumptions, they form almost-ajfine families (Section 44), in the sense that if the associated .P^interpolation operator Пт leaves invariant the space Pk(T), there exists a constant С independent of T such that \v-nTv\m,T^ChkT+1-m\v\k+1,T for all veHk+l(T), for all integers m^k+1 for which PTcHm(T). This property is shared not only by the finite elements of class W1 introduced in Section 9, such as the Argyris triangle (Section 45), but also by composite finite elements, such as the Hsieh-Clough-Tocher triangle (Section 46), and by singular finite elements, such as the singular Zienkiewicz triangle (Section 47). For finite element spaces made up of such almost-affine families, we obtain (Theorem 48.1) error estimates of the form \\u-uhh,n^C\\u-nhu\\2tn = 0(hk~1), h = maxhT, by an application of Cea's lemma. We also show (Theorem 48.2) that the minimal assumptions "ueH2(fi)" and "P2{T)<=:PT, Те Jt" insure convergence, i.e., Iimfc-,0|l"-«*ll2.n = 0. The actual implementation of conforming methods offers serious computational difficulties: Either the dimension of the "local" spaces PT is fairly large (at least 18 for triangular polynomial elements) or the structure of the space PT is complicated (cf. the Hsieh-Clough-Tocher triangle or the singular Zienkiewicz triangle for example). The basic source of these difficulties is of course the required continuity of the first-order partial derivatives across adjacent finite elements. It is therefore tempting to relax this continuity requirement, and this results in 273
274 P.G. Ciarlet Chapter VII nonconforming methods: One looks for a discrete solution in a finite element space Vh that is no longer contained in the space H2(Q), or not even in the space H1(Q) in some cases. The discrete solution then satisfies ah{uh,vh)-=l(vh) for all vhe Vh, where Tefh J T the integrand {■•■} being the same as in the bilinear form of the original problem. The analysis of such nonconforming methods follows exactly the same pattern as that of nonconforming methods for second order problems (cf. Chapter V). We concentrate our study here on one example, where the generic finite element is the Adini-Clough-Melosh rectangle, in which case we show that (Theorem 50.1) I \u-uh\lTV'2 = 0(h), if the solution и is in the space H3(Q). The very important mixed and hybrid finite element methods for approximating plate problems, or other fourth-order problems, are treated in depth in the article by Roberts and Thomas. There are numerous publications in the engineering literature about the various finite elements that can be employed for solving plate problems. Rather than giving a necessarily incomplete list, we refer the reader to the two "review" papers of Batoz, Bathe and Ho [1980] and Hrabok and Hrudey [1984]. 44. Conforming methods for fourth-order problems: Almost-affine families of finite elements To begin with, we study several types of conforming finite element methods commonly used for approximating the solution of plate problems. For definiteness, we consider the clamped plate problem, which corresponds to the following data (cf. Section 4): V^H20{Q), QczU2, a(u, v) = {AuAv + {l — v)BQ12ud12v—d11ud22v — ^22ud11v)}dx, D4.1) l(v) fvdx, /eL2@), where the constant v (the Poisson coefficient of the material constituting the plate) lies in the interval ]0, ^[. The conforming methods that we shall describe apply equally well to any fourth- order boundary value problem posed over a space V such as Hl(Q), H2(Q)r\Hi{Q), H2(Q), whose data a(-, •) and /(■) satisfy the assumptions of the Lax-Milgram lemma.
Section 44 Fourth-order problems 275 For instance, we could likewise consider the simply supported plate or the biharmonic problem. By contrast, the nonconforming methods studied at the end of this chapter are specifically adapted to plate problems (cf. Remark 49.1). We assume that the set Q is polygonal, so that it may be covered by triangulations composed of polygonal sets. In order to develop a conforming method, we face the problem of constructing subspaces of the space H2{Q). Since the functions found in standard finite element spaces are "locally regular" (PTc=H2(T) for all Те $~h), this construction amounts in practice to finding finite element spaces Xh that satisfy the inclusion Xhczc^1(Q) (cf. Theorem 5.2), i.e., whose finite elements are of class c€1. Remark 44.1. A thorough discussion about the use of finite elements of class 4>l from the engineering viewpoint is given in Zienkiewicz [1971, Chapter 10]. There, finite elements of class Я>1 are called compatible, while finite elements that are not of class Я>1 are called incompatible. We have already described three finite elements that are of class (€1, the Argyris triangle (cf. Fig. 9.1), the Bell triangle (cf. Fig. 9.2), and the Bogner-Fox-Schmit rectangle (cf. Fig. 9.4). As we pointed out in Section 11, Argyris triangles or Bell triangles cannot be imbedded in affine families in general, because normal derivatives at some nodes are used either as degrees of freedom (for the Argyris triangle) or in the definition of the space PT (for the Bell triangle). This is in general the rule for finite elements of class W1, but there are exceptions. For instance, the Bogner-Fox-Schmit rectangle is a rectangular finite element of class <^1 that can be imbedded in an affine family. Nevertheless, if most finite elements of class W1 do not form affine families, we shall show that their interpolation properties are quite similar to those of affine families, and it is this similarity that motivates the following definition (compare with Theorem 16.1). Definition 44.1. Consider a family of finite elements (T, PT, IT) of a given type, and let s denote the greatest order of partial derivatives occurring in the definition of the set Xr. Then such a family is said to be almost-affine if, for any integers к, т ^ 0 and any numbers p,qe[l, oo] compatible with the following inclusions: Wk+1-P{T)o><#"(T), D4.2) Wk+l>p{T)c+Wm'q{T), D4.3) Pk{T)cPTc:Wm'q(T), D4.4) there exists a constant С independent of T such that \\v~nTv\\m,gtT^C{mcas(T)}1"'-inkT+l-'"\v\k + liPiT foralli;eW'* + 1>,'(r), where Пт denotes as usual the associated PT-interpolation operator, and hT = diam(T).
276 P.G. Ciarlet Chapter VII In order to simplify the exposition, we shall consider in the subsequent examples only the highest possible value of the integer к for which the inclusions Wk + 1,p(T)t+ <g%T) and Pk(T)cPT are satisfied, but it is implicitly understood that any lower value of к compatible with these two inclusions is also admissible. By definition, a regular affine family is almost-affine. In particular then, a regular family of Bogner-Fox-Schmit rectangles (T, PT, ZT) (cf. Fig. 9.4) is almost-affine. We recall that the set Г is a rectangle with vertices ah 1 <i«t4, Рт=0,ъ{Т), and IT = {p(ai), dip(a;), д2р(а{), 812p(a;): 1 s£i<4}. Hence, for all pe]l,oo] (the inequality p>\ implies the inclusion WA'p{T)o> <^2(T) = domПт) and all pairs (m, q) with mS=0 and qe[1, со] compatible with the inclusion W*-t{T)^Wm-\T), D4.6) there exists a constant С independent of T such that \\v-~nTv\\miq,T^C{mQas{T)}llq-lll'h^-m\v\^T for all veW*"(T). D4.7) Remark 44.2. The Bogner-Fox-Schmit rectangle is not the only rectangular finite element of class (el that may be used in practice. See for instance Gopalacharyulu [1973]. 45. Examples of polynomial finite elements of class <el Let us next examine the Argyris triangle (cf. Fig. 9.1). We recall that this finite element is a triple (T, PT, ET) where the set T is a triangle with vertices ah 1 <i<3, and midpoints а0=у(а; + яу), l^i</'^3, of the sides, the space PT is the space P5(T), and the set IT (whose P5(T)-unisolvence has been proved in Theorem 9.1) can be chosen in the form 2г = {Э«р(а.-): 1<»<3,|«1<2; 8vp(ay): l<i<^3}. The basis functions of the Argyris triangle are explicitly computed in Bernadou and Boisserie [1982, pp. 69ff]. The first interpolation error estimates for the Argyris triangle are due to Zlamal [1968], who obtained estimates of the norms |ji>—Яу^Ц^т-); the results and methods of Zlamal were later extended by ZeniSek [1970] to finite element spaces that satisfy the inclusion XhCWm(Q). Bramble and ZlAmal [1970] then obtained estimates in Sobolev norms, which are contained in the next estimates. The method of proof that we follow here is due to Ciarlet [1976]. Theorem 45.1. A regular family of Argyris triangles is almost-affine: For all ре£1, со] and all pairs (m, q) with m^O and <?e[l, со] compatible with the inclusion W6-P{T)<+ W""-«(T), D5.1)
Section 45 Fourth-order problems 277 there exists a constant С independent of T such that !к-ЯгИ|т.9,т^С{теа5(Г)}1/«-1^6г-т1^1б,р.г for all veW6-"(T), D5.2) where Пт denotes the associated P5(T)-interpolation operator. Proof. The key idea is to introduce a finite element similar to the Argyris triangle, but which can be imbedded in an affine family, and which will play a crucial intermediary role in obtaining the interpolation error estimate. Since the degrees of freedom 8vp(a,7), l<i<;<3, are those that prevent the property of affine equivalence, we naturally introduce the (tf0-quintic Hermite triangle, as described in Fig. 45.1. For notational convenience, we shall henceforth denote by b( the midpoint of the side that does not contain the vertex ah 1 ^г'^3. Arguments similar to those used in the proof of Theorem 9.1 then show that the set ET is P5(T)-unisolvent and that this is a finite element of class W0, but not of class её1. t?°-quintic Hermite triangle PT = P5(T), dimPj- = 21 ET = {d"p(at):l*ZiiZ3,\a\!Z2;Dp(bi)(ai-bi):l^i^3} Fig. 45.1. In addition, it is clear that two arbitrary ^°-quintic Hermite triangles are affine- equivalent. Therefore, if AT denotes the associated P5(T)-interpolation operator, for all pe[l, oo] and all pairs (m,q) with О^га^б and qe[\, oo] such that W/6,p(T)c> Wm-q(T), there exists a constant С independent of T such that ^-^г^1т,9,к<С{теа8(Г)}1/«-1/рйГт1Иб,Р,г for all veW6'"(T). D5.3) It thus remains to estimate the seminorms \HTv —Arv\m^r, where v is any function in W6'P{T). The difference Л =nTv — ATv D5.4)
278 P.G. Ciarlet Chapter VII is a polynomial of degree < 5 that satisfies ЭМ(аг)=0, |а|<2, l<i<3, D5.5) since d*nTv{ai) = d'xATv(ai) = dxv(ai),|a|s$2, l<i^3, and ev4(ft,) = Sv(»->lr»)(bj), l^i<3, D5.6) since bvnTv(bi) = dvv(bi), l^i<3. For 1 ^i<3, let Vf and т,- be the unit outer normal and tangential vectors along the side opposite to the vertex at. Denoting by • the Euclidean inner product in U2, we can write, for 1 ^ i < 3, DJ(bi)(ai-bI-) = 8v4(fei){(fl{-b£)-vi}, D5.7) since on the one hand D A{bi)vi = bvA(bi), and since on the other DA(bi)ti = 0 as a consequence of relations D5.5), which imply that the difference A vanishes along each side of the triangle. Combining relations D5.6) and D5.7), we obtain Dzl(bi)(ai-b!-) = 8v(t;-^Tu)(bi){(a,--b1)-vi}, l^i<3. D5.8) Let qt denote the basis functions of the <^°-quintic Hermite triangle that are associated with the degrees of freedom Орф{){а,—bt), 1 ^i^3. Then using relations D5.5) and D5.8), we can write з A=nTv-ATv= £ {DA№at-b,)}qi ( = i = £ Э,(»-Лт1>ХЬ,){(аг-Ь,)-*}<й. D5.9) Applying Theorem 16.1 with m=l, q = co and fc = 5, we obtain |а¥(ю-Лг1>)(Ь,)| ^s/2\v-ATv\Ua>tT ^ClmeastT)}-1"'^^!^^, l<i<3. D5.10) Pt Next, it is clear that |(a,-bf)-Vil<hr, Ki<3. D5.11) Finally, let qL be the basis functions of a reference ^°-quintic Hermite triangle, associated in the usual correspondence with the basis functions qt. From Theorems 15.1 and 15.2, we infer that k«U«.r<ci^5?P^|4lL.t.t. D5.12) Pt Relations D5.9)-D5.12) then imply that \nTv-ATv\m^T^C{mws{T)yi«-^ -^TTl46,p,r Pt ^C{mt^{T)Ytq-llph6T-n\v\6iPiT, D5.13)
Section 46 Fourth-order problems 279 since we are considering a regular family, and inequality D5.2) follows from inequalities D5.3) and D5.13). □ By a similar method, we could likewise show that a regular family of Bell triangles (cf. Fig. 9.2) is almost-affine, with the value k = 4 in the corresponding inequality D4.5). The basis functions of the Bell triangle are explicitly computed in Mitchell [1973], Barnhill and Farin [1981]. 46. Examples of composite finite elements of class <4!x For the first time in this article, we leave the realm of "purely polynomial" finite elements. As we already pointed out in Section 9, Bell's triangle is optimal among triangular polynomial finite elements of class (€l, since dim PT^ 18 for such finite elements, as a consequence of ZeniSek's result. Therefore, a smaller dimension of the space PT for triangular finite elements of class сё1 requires that functions other than polynomials be used. For example, one can use piecewise polynomials inside the set T, a process that is the basis for constructing composite finite elements, also named macro-elements. Or one can add some judiciously selected rational functions to a space of polynomials, a process that is the basis for constructing singular finite elements (singular in the sense that some functions in the space PT or some of their derivatives become infinite, or are not defined at some points of T). We shall describe and study in detail one example of each type, and then briefly mention other examples of composite and singular finite elements. The Hsieh-Clough-Tocher triangle, sometimes abbreviated as the HCTtriangle, is defined as follows: The set Г is a triangle subdivided into three triangles Tt with vertices a, ai+i, ai + 2, 1«*;<3 (Fig. 46.1), where a is an arbitrary point in the interior of the set T (here and subsequently, the indices are counted modulo 3 when necessary). The space PT and the set ET are indicated in Fig. 46.1. For convenience, we again denote by bh 1 ^i^3, the midpoint of the side that does not contain the vertex a,. The Hsieh-Clough-Tocher triangle appeared in Clough and Tocher [1965]. It is also named after Hsieh who was the first to conceive in 1962 the idea of matching three polynomials in order to get a finite element of class сё1. Our first task is as usual to prove the Pr-unisolvence of the set ZT. Since dim P3(Ti) = 10, it is necessary to find 30 equations to define the three polynomials p\Ti, 1 ^r<3. First, it is easily seen that the degrees of freedom of the set ZT provide 21 equations. To see that the condition "ре^^Г)" yields 9 additional equations, it suffices to express that the functions p, 8xp and d2p are continuous at the point a F equations) and that the normal derivatives are continuous across the midpoints of the sides [a, a,] C equations). It therefore remains to show that the 30 x 30 matrix of the corresponding linear system is invertible, and this is the object of the next theorem. The first proof of unisolvence that did not use the basis functions was given by
P.G. Ciarlet Chapter VII Hsieh-Clough-Tocher triangle PT = {pe<e\T);p\TleP3(Td, UK3}, dimPT=12 rr = {p(ai),elP(ai),82p(fl(),avp(bi):Ui«3} Fig. 46.1. Ciarlet [1974c]. The proof given here is due to Percell [1976]; see also Douglas, Dupont, Percell and Scott [1979]. Another proof consists in explicitly computing its basis functions, as in Mitchell [1973], Bernadou and Hassan [1981], Bernadou and Boisserie [1982, pp. 75ff]. Theorem 46.1. Let the triple {T,PT,ST) be as in Fig. 46.1. Then the set IT is PT-unisolvent. The resulting Hsieh-Clough-Tocher triangle is a finite element of class <ё1. Proof. It suffices to show that a function p in the space PT vanishes if р(а;)-51Р(а;) = 82р(а;) = 8,р(Ь;)-0, Ui*$3. D6.1) For 1 =% / =$ 3, let Hi denote the unique function that satisfies HiSP^Ti), jUj(a)=l, iii{ai+1) = ni(ai + 2) = 0. Hence the function jx: T-+M defined by is continuous. Since the function p | T. is a polynomial of degree < 3 over each triangle T;, assumptions D6.1) imply that there exist functions v,- such that vieP^T), p\Tt = yifif, Ui<3. Since the functions p: T-+R and ц: Т-+М are continuous, the function v: Т-+Ш
Section 46 Fourth-order problems 281 defined by is also continuous (the function ц does not vanish in the interior T of T). On each segment [a, ai+2], the gradient Vp, which is unambiguously defined since the function p is continuously differentiable, is given by either expression [a,a,+2] [Bv,.+l№+1Vjui+1 + /u,?+iVvj+1)|[a>ei.+2], and thus 2vV(Ju, + 1-/i;) + ^V(vi+1-vI-) = 0 along [a,aI + 2], since ^0 in f. Since /i(ai+2) = 0 and V{ni+1—fii)¥:0 (otherwise the lines /i,= 0 and /ii + 1 = 0 would be parallel), we conclude that v(ai+2) = v;(a,+2) = 0. A similar argument shows that V;(ai+1) = 0. Consequently, each function vieF1(ri) is of the form Vi = CiHi, Ci = constant, and the continuity of the function v then implies that v{a) = vi(a) = Ci, U/O. Denoting by С the common values of the constant C;, we conclude that v|r,=V; = C,"b 1^!<3, and therefore that P\n = Cnf, whence Wp\Tt = 3CfifVfih Ui<3. Then the constant С is necessarily zero for otherwise the function p would not be continuously differentiable along the segment [_a,ai+2] since VjUj^Vju,-+1. That the Hsieh-Clough-Tocher triangle is of class #* follows by an argument analogous to the proof of Theorem 9.3. Q Remark 46.1. The normal derivatives at the midpoint of the sides can be eliminated by requiring that the normal derivative vary linearly along the sides. This elimination results in a finite element of class <^1 with dimPr = 9, which will be described at the end of this section. Remark 46.2. Triangular finite elements of class <€1 analogous to the Hsieh- Clough-Tocher triangle, but which use polynomials of higher degree, may be also defined: For instance, let T= |Jf=1 Tt denote a similar subdivision of the triangle T,
282 P.G. Ciarlet Chapter VII and let P7. = {petf1(K);Plr(eP4(r,),U»'<3}, ^r = {p(ai),S1p(ai),82p(a£): l<i<3;p(a0): l^i</<3; 8vp(a,y):l^iJ<3, i^j; р(а),Э1р(а),Э2р(а)}, where al7=4(a, + a,), а(и =4Ba; + a,-). Then the set ZT is PT-unisolvent (cf. Percell [1976]). Two reasons prevent the Hsieh-Clough-Tocher triangle from being imbedded in an affine family: As for the Argyris triangle, one reason is the occurrence of the normal derivatives 9vp(b;) as degrees of freedom; the other reason is that the point a may be allowed to vary inside the set T. This is why we must adapt to this element the notion of a regular family: We shall say that a family of Hsieh-Clough-Tocher triangles (T, PT, IT) is regular if the following three conditions are simultaneously satisfied: (i) There exists a constant a such that hT/pT =$ a for all T. (ii) The quantities hT approach zero. (iii) Let t be a fixed triangle with vertices at, 1 < i < 3. For each triangle T with vertices aUT, 1 ^ i ^ 3, let FT denote the unique affine mapping that satisfies FT(&i) = aUT, 1 ^ i ^ 3. Then (Fig. 46.2) all the points aT = Ff 1(aT) must belong to a fixed compact subset В of the interior of the triangle f (the compact subset В may vary from one regular family to another). Fig. 46.2. In a regular family of Hsieh-Clough-Tocher triangles, all the points uT = FT1(aT) belong to a fixed compact subset В of int T, Remark 46.3. Conditions (i) and (ii) are those that define a regular family of finite elements (cf. Section 16). Condition (iii) expresses precisely in which sense the points aT may vary inside the triangle T, so as to guarantee that the family under consideration is almost-affine (cf. the next theorem).
Section 46 Fourth-order problems 283 The next interpolation error estimates are due to Ciarlet [1974c]. Theorem 46.2. A regular family of Hsieh-Clough-Tocher triangles is almost-affine: For all pe[l,oo] and all pairs (m,q) with m^O and qe[l,oo] such that W^(T)o>Wm«(T), D6.2) Рт с Wm'q(T), there exists a constant С independent of T such that \\v-nTv\\^T < С{т^{Т)У^^пАт-тЩ^т for all veW*'"(T). D6.3) Proof. We first observe that the inclusion W4'"(T) c> q1 (T) = dom Пт holds for all p > 1. The proof of the theorem comprises three steps. A) As in the proof of Theorem 45.1, we shall introduce an "intermediary" finite element which is similar to the Hsieh-Clough-Tocher triangle, but which can be imbedded in an affine family. To this end, we replace in the set ET the normal derivatives by appropriate directional derivatives, and we restrict the position of the points aT, as follows: With each Hsieh-Clough-Tocher triangle (T,PT,ST), we associate the finite element (Т,РТ,БТ), where ET = {р{ад, Dpiai)(ai+1 - ad, Dp(a,)(c,_i - аД Dp(bt)(a - b,): 1 < i < 3}D64) (the proof of the Pr-unisolvence of the set ET is similar to that of the Pr-unisolvence of the set IT as given in Theorem 46.1). We denote by AT the PT-interpolation operator associated with each finite element (T,PT,ST). For each point aeB, let 3~{a) denote the (possibly empty) subfamily of Hsieh-Clough-Tocher triangles for which aT = FT{a). Then, for each deE, the subfamily (T,PT, Ет), Те3~{a\ is affine. Consequently, for all pairs {m, q) compatible with the inclusions D6.2), the inclusion P3(T)czPT D6.5) implies that there exists a constant C{a, t) such that I» - Лть\тл,т ^ C(d, f){meaS(T)}llq-llpK-m\v\4,p,T for all ve W*-P(T) and all Te^(d). D6'6) B) We next show that, when the points a vary in the compact set Ё, the constants C(d, f) appearing in the last inequality are bounded. To prove this, we recall that in the proof of Theorem 15.3, we found that these constants are of the form (cf. inequality A5.19)): C(a, t) = C(t) || I - A\a) Www'.ptfy.wn.nT)), D6.7) where, for each deB, A(d) denotes the Pj-interpolation operator associated with the corresponding reference finite element (T,P(a),E(a)). The P j-interpolant A(a)v of a function ve W4'P{T) is given by (the notation should
284 P.G. Ciarlet Chapter VII be self-explanatory): з з A(d)v = X ЩШ&,')+ E {ОЩЩ - &i)}q^a,•) + £{Л^)(в-&«)}ЭДИ, D6.8) i= 1 and there exist constants C(T) independent of a such that №;)l *S |tJ|o,=o,r ^ C(f)||t5||4,p,t, !Я-"ад}^*п<^--'<оЛ-'- D6.9) Let us then consider the norm || • \\m,qjT of any one of the basis functions р((а,-), qi}{&, •) and r;(d, •). On each one of the triangles t;(d), 1 ^ i ^ 3, that subdivide the triangle t, the restriction of any one of these basis functions is a polynomial of degree ^3, whose coefficients are obtained through the solution of a linear system with an invertible matrix (the set 2(d) is ^(d)-unisolvent as long as the point a belongs to the interior of the set t). This matrix depends continuously on the point a since its coefficients are polynomial functions of the coordinates of the point a. Consequently, each basis function is in turn a continuous function of the point a and there exists a constant С such that sup{||ft(£-)L,«.r, \\qi}(a,-)\\m,q,t, Hr,-D-)L.,.t} < C, D6.10) aeB since the set В is compact. Hence it follows from relations D6.7) to D6.10) that supC(d,f) = C(B,f)<oo. Лей Combining this result with inequality D6.6), we obtain \v-ATv\m,q,T ^ C(B,T){meas(T)yi*-llPhrm\v\4,P,T for ail ve W4-P{T) and all Т. D6Л1) C) By an argument similar to that used in the proof of Theorem 45.1 (cf. D5.9)), we find that nTv-ATv = £ Qy{v-ATv){bi){(a-biyvi}ri, D6.12) where the functions rt, 1 ^ j < 3, are the basis functions associated with the degrees of freedom {Dp(bt)(a — bf)} in the finite element (T,PT,ET). Applying Theorem 16.1 with m = 1, q = oo and к = 3, we find that |8,(» - Лго)(Ь,)| < y/2\v- 'Mi.co.r ^ C{meas(r)}-1/p—|»|4.p.r. D6.13) Pr
Section 46 Fourth-order problems 285 We next have \{(a-bt)-vt}\^hT, \rtL«.T<C {meas(T)}1/e Рт D6.14) D6.15) and we deduce from relations D6.12) to D6.15) that kr n/q-l/p ,lT h5 r---r-im,e,^C{meas(T)}1/"-1/',--^T|,|4>p>T \Ilrv — ATv\, v. Pt < C{meas(T)}ll9-llpki.-m\vU,Ptr. D6.16) Then the proof is completed by combining this inequality with inequality D6.11). □ When q = 2, it is easily seen that the highest admissible value for the integer m compatible with the inclusion Рт а Hm(T) is m = 2 (that the integer m is at least 2 follows from an application of Theorem 5.2 to the partitioned triangle T=\Jf=i, Ti, that the integer m cannot exceed 2 can be seen by an argument that will be used later; cf. Theorem 49.1). Hence this is the first instance of a restriction on the possible inclusions Рт а Wm'q(T) (the other finite elements described in this and the next sections provide further instances of this restriction). Fortunately, the inclusion Рт с H2(T) is precisely that which is needed to insure convergence for fourth-order problems (cf. Section 48). Let us now describe the "reduced" finite element that was announced in Remark 46.1. The reduced Hsieh-Clough-Tocher triangle is a triangular finite element whose corresponding space PT and set IT are indicated in Fig. 46.3. Combining the reduced Hsieh-Clough-Tocher triangle PT = {ps<g\T);p\r.eP3(T,\ 1<г<3,ЭуреР,(Г) for each side T of T} dimPr=9 ZT = {p(a,), 51p(ai), Ь2р(щ): 1 <i<3} Fig. 46.3.
286 P.G. Ciarlet Chapter VII methods found in the proofs of Theorems 9.2,46.1, and 46.2, we could then show that the set LT is Pr-unisolvent, and that a regular family of reduced Hsieh-Clough- Tocher triangles is almost-affine, with the value fe = 2 in the corresponding interpolation error estimates D4.5); the definition of a regular family is verbatim that of Hsieh-Clough-Tocher triangles. The basis functions of the reduced Hsieh-Clough-Tocher triangle are explicitly computed in Kikuchi [1975c], Bernadou and Hassan [1981], Bernadou and Boisserie [1982, pp. 77ff]. The Hsieh-Clough-Tocher triangle, the reduced one, or more general composite triangular finite elements, also play a crucial role for constructing smooth surface interpolants with specified degrees of freedoms at arbitrary scattered points in the plane; in this direction, see notably Powell and Sabin [1977], Sablonniere [1980, 1984, 1985], Barnhill and Farin [1981]. A three-dimensional Hsieh-Clough- Tocher tetrahedron has been proposed by Alfeld [1984]. Composite finite elements are not necessarily triangular. Consider for instance the Fraeijs de Veubeke-Sander quadrilateral (T, PT, ET): The set Г is a convex non- degenerate quadrilateral, and the space PT and ZT are as in Fig. 46.4, where T\ Fraeijs de Veubeke-Sander quadrilateral РТ = Л1(Г) + Л2(Г) Ri(T)={pe<gl(T);p\TieP3(Ti),p\r^r,eP,(T-Tl)}, i = l,2 dimPr=16 IT = {р(а;), 6,p(q,.), 82р(а,.),avp(b,): UK4} Fig. 46.4. denotes the subtriangle with vertices al,a2,a4r,T1 denotes the subtriangle with vertices au a2, a3, and bt denotes the midpoint of [a,-, ai+1]. The Pr-unisolvence of the set ZT has been established by Ciavaldini and Nedelec [1974]. We shall say that a family of Fraeijs de Veubeke-Sander quadrilaterals is regular if it is a regular family of finite elements in the usual sense and if, in addition, the
Section 46 Fourth-order problems 287 following condition is satisfied: For each quadrilateral T in the family, let FT denote the unique affine mapping that satisfies FT@) = aT, FT(d1) = a1:T and FT(d2) = a2T, where aT is the intersection of the two diagonals of the quadrilateral T, and where d1=(l,0), d2 = @, l)(cf. Fig. 46.5). Then there must exist compact intervals/3 and/4 contained in the half-axes {(x1,x2)e R2;x!<0, x2 = 0} and {{x1,x2)e U2; xt=0, x2<0} respectively, such that all the points dJj = Ff1{aj,T) belong to the intervals Ij, for j=3 and 4. In other words, the quadrilaterals Ft 1 (T) must lie "in between" the two "extremal" quadrilaterals t0 and Ti indicated in Fig. 46.5. Ciavaldini and Nedelec [1974] have shown that a regular family of Fraeijs de Fig. 46.5. In a regular family of Fraeijs de Veubeke-Sander quadrilaterals, all the quadrilaterals Ff '(T) must lie "in between" two fixed quadrilaterals T0 and Tt, where FT denotes the unique affine mapping that satisfies F7@) = aT,FJ-(a|.) = ei,T, i'=l, 2.
288 P.G. Ciariet Chapter VII Veubeke-Sander quadrilaterals is almost qffine, with the value fe=3 in the corresponding interpolation error estimates D4.5). A similar analysis (unisolvence, interpolation error estimates) can be carried out for the reduced Fraeijs de Veubeke-Sander quadrilateral, whose characteristics are indicated in Fig. 46.6 (the spaces Ri(T) and R2(T) are denned in Fig. 46.4). reduced Fraeijs de Veubeke-Sander quadrilateral PT={peR1(T) + R2(T); d,.pe Р^Г) along each side Г of T] dimPT=12 I7. = {p(e,-).S1p(fli))82p(fli):UK4} Fra. 46.6. 47. Examples of singular finite elements of class #' We next describe an example of a triangular finite element of class (£l, where the <^1-continuity is obtained by adding appropriate rational functions to a familiar space of polynomials. This element is found in Section 10.10 of Zienkiewicz [1971], where other "singular" finite elements are also described. The singular Zienkiewicz triangle is defined as follows (Fig. 47.1): The set T is a triangle with vertices ah l^i^3, the space PT is the space P{T) of the Zienkiewicz triangle (cf. Fig. 8.2) to which are added linear combinations of the three functions q(: T->R, Ui^3, called singular shape functions by Zienkiewicz [1971], and defined by 4Ai/.;+1A;+2 forO«£Af<l, <КЛ,+ 1,Л1+2<1, D7.1) (A; + li + i) (A; + Л; + 2 ) qi{ai+i) = qi(ai + 2) = 0, where the functions Xh KK3, are the barycentric coordinates with respect to the vertices at of the triangle T. Notice that the function given in the first line of definition
Section 47 Fourth-order problems 289 singular Zienkiewicz triangle рг=Гз(Г)е v dimPT = 12 ^r = {p(e,-)»SiP(a f 4Я(Я?+1А?+2 ^ [(А,-+Я,+1)(Я,-+А/+2)] i),S2p(ai),Svp(bi):l^ • (cf. (8.3) and D7.1)) ^3} Fig. 47.1. D7.1) is not defined for A,- + Ai+1=0 or for А; + Аг+2 = 0, i.e., for 2I+2 = l or Ai+1 = l; this is why values must be assigned to the function qt at the vertices ai+1 and a, + 2. Finally, the set IT is the same as the set IT of the Hsieh-Clough-Tocher triangle. As usual, we begin by examining the question of unisolvence. Observe that this finite element is an instance where the validity of the inclusion Ргс<^1(Г), which is part of the definition of a finite element of class сё1, and of the inclusion Рт с Н2(Т) requires a proof. Theorem 47.1. Let the triple (T,PT,ET) be as in Fig. 47.1. Then the set ZT is PT-unisolvent. The resulting singular Zienkiewicz triangle is a finite element of class (€x, and the inclusion PTcH2(T) holds. Proof. A) To begin with, we verify that the inclusions РТ<=^^Т) and PT <^H2{T) hold. Since such inclusions are invariant through affine transformations, we may consider the case where the set Tis the unit triangle with vertices dt =A,0), a2 = @,1), and d3 = @,0). Then it suffices to study the behavior of the function qy: T-+U in a neighborhood of the origin in T. We have <fiM = 2 X^X2 (x1+x2) fix) forx#0, D7.2) 4i@) = 0, where the function fix) = 4il—x1 — x2J/il—x2) and its derivatives have no singularity at the origin. Since limXbX2^0+^1^2/(^1 +^г) = 0, we deduce that
290 P.G. Ciarlet Chapter VII МтХиХг-,0+ij1(x)=0. Therefore the function qt is continuous at the origin. For x1,x2^0 and x#0, we have: 3 2 8i4i(x)= ~r~72 №+ -^~ a1/(x), D7.3) x1x2Bx1+x2) хгх2 -=— Дх) H (Xi+X2f Xj+X2 а2 ^! (х) = -^—-г^-^— Д*) + тгттг s2 /(*)> and thus we conclude that lim 3^j(x) = 0 = 6,-4!@), j = 1,2, D7.4) which proves that the function qx is continuously differentiable at the origin. Arguing analogously with the vertex a2, and next with the functions q2 and q3, we conclude that the inclusion Ргс<Г(Т) holds. This inclusion implies the inclusion PT<=:Hl{T) and thus, in order to obtain the inclusion PT<=H2(T), it remains to show that the second partial derivatives of the function q± are square integrable around the origin. For x^O, we can write 2x3 {xi+x2Y xlCxi+x2) (xx+x2K S1^1(x)= Г ГГэ /M + 0i2«, D7.5) 2x3 9224j.(x)= -—■ /(x) + 022(x), pC!+X2J where the functions gu,gi2 and #22 are continuous around the origin. Since the three functions factoring the function f(x) are bounded on the set f, the inclusion РтсЯ2(Г) thus follows. B) The inclusion PT с <й'1(Т) proved in A) guarantees that the degrees of freedom of the set ET are well-defined for the functions in the space PT. The Pr-unisolvence of the set IT will be a straightforward consequence of the PT-unisolvence of the set ST = {р(ад,Dpiadidj-ai), Орфд^-Ъй U»J<3, \j-i\ = 1}, D7.6) which we first establish. Let ph l^i^3, and pu, l^i,j^3, \j—i\ = l, denote the basis functions of the space Р'з(Г) as given in (8.3). By definition, they satisfy Pi(at) = <5flb Dpi(ak)(a,-ak) = 0, D7.7)
Section 47 Fourth-order problems 291 for l«Si,fc,/s$3, |fc-/| = l, and Ру(а*) = 0, Dpij(ak)(al-ak) = 5ikdjh D7.8) for l<ij",fc,/^3, |j'-j| = |fc-/| = 1. We shall show that they also satisfy DPi(bk)(ak-bk) = -i+iub 1^M3, D7.9) ^A)fe-^)=-Ht<5ft+l<5#, Ki,M<3, |j-i| = l. D7.10) To this end, we first compute the directional derivatives Dp(fe,-)(fli — fe,-) of a function p: T-*U expressed in terms of barycentric coordinates. Let р(х1,х2) = q(Xu Я2, Я3) be such a function. Denoting by В — (fr^) the inverse matrix of the matrix A of F.3) for n-2, we find that з з k=l k=l Let us compute for example the quantity 2 3 Dp(b1)(a1-b1)= X I bkjdkq@AA)(ah-Haj2 + aJ3)), where а,;, у = 1,2, denote the coordinates of the vertex at. By definition of the matrices В and A, 2 ^ h]aji = 5ki-bk3, l«Sfc,i<3, and thus Bp(b1)(fl,-b1) = 9ie@,i,i)-i{S29@,i,i) + 83e@,i,i)}. D7.11) Then relations D7.9) and D7.10) follow from this relation and analogous relations for Dp(bi)(ai — bi), i = 2,3, combined with the following expressions of the basis functions Pi and ptj, which are easily derived from relations (8.1) and (8.2): pt=-21? + 3ti+ 21^2*3, D7.12) Рг;=иЯ,(Я,-Я;+1). D7.13) On the other hand, the functions qt as defined in D7.1) satisfy qi{ak) = 0, l^i,fc<3, ^«,-(я*)(в£-в») = 0, l**i,k,l^3, \k-l\ = l, D7.14) D(li{bk){ak-bk) = 5ik, l<i,fc^3. The second relations follow from D7.4); the last ones follow from another application of relation D7.11) and similar ones.
292 P.O. Ciarlet Chapter VII Then relations D7.7)-D7.14) imply together that the functions (which all belong to the space PT): (Pij-U9i + 4-21i}l 1«ЗД^З, {У,/} = {1,2,3}, D7.15) qh Ki<3, form a basis of the space PT that corresponds to the degrees of freedom of the set ST of D7.6). Hence ST is Pr-unisolvent. It remains to establish that the set ET is also PT-unisolvent. To prove this, we make the following observation: Along each side T" of the triangle T, the restrictions p\r,pe PT, are polynomials of degree ^3 in one variable, while the restrictions Dp(')^\T-,pePT, of any directional derivatives are polynomials of degree ^2 in one variable. This is clearly true for the functions in the space РЦТ), and it is a straightforward consequence of the definition for the functions q{. Notice that this property implies in particular that this finite element is of class c€l. Let then p e PT be a function that satisfies p(ai) = d1p{ai) = d2p(ai) = dvp(bi) = 0, l s*i<3. The conjunction of these relations and of the above property implies that the normal derivative and the tangential derivative vanish along any side of the triangle T. Consequently, the directional derivatives Dp(bi)(at~bi)t 1<i'<3, vanish, and therefore the function p is identically zero since the set ET is Pr-unisolvent, □ Remark 47.1. The second partial derivatives of the basis function 4i as given in D7.5) are not defined at the origin. More specifically, an easy computation shows that, for each slope t > 0, 0*3 Sim dllq1(x) = This phenomenon is also noted by Zienkiewicz [1971, p. 199], who observes that "second-order derivatives have non-unique values at nodes." While this observation does not prevent the function q± from being in the space (i1(T)r\H2(T), it may nevertheless cause computational difficulties; in particular, very accurate quadrature schemes should be used in conjunction with this element. Irons and Razzaque [1972b] and Razzaque [1973] obviate this difficulty by "smoothing" the second derivatives. We next establish the interpolation error estimates. Theorem 47.2. A regular family of singular Zienkiewicz triangles is almost-affine: For
Section 47 Fourth-order problems 293 all pe]1, oo] and all pairs (m,q) with m>0 and qe [1, oo] such that W^{T)^W*<«{T), D7.16) PTcWm-q(T), there exists a constant С independent of T such that \\v-nTv\Uq,T^C{meas(T)y">'^h3T-m\v\3,p,T for all ve W3'?(T). D7.17) Proof. We simply give some indications. The inequalities D7.17) are first established for the "intermediary" finite element (T, PT, ST) with ST as in D7.6), which can be imbedded in an affine family, and which satisfies P2(T)<=PT (the inequality p > 1 guarantees that the inclusion W3-P{T) q. ^1{T) = dom Пт holds). These estimates are then extended to the Zienkiewicz triangle by the same device as in the proofs of Theorems 45.1 or 46.2. □ The normal derivatives at the midpoints of the sides can be again eliminated from the set of degrees of freedom, by requiring that the normal derivative vary linearly along each side. We obtain in this fashion the reduced singular Zienkiewicz triangle, whose corresponding space PT and set ET are indicated in Fig. 47.2. Arguing as in Theorems 47.1 and 47.2, we could then show that the set IT is /V-unisolvent and that a regular family of reduced singular Zienkiewicz triangles is almost-affine, with the value /c = 2 in the corresponding interpolation error estimates D4.5). We next describe yet another way of adding rational functions to a polynomial space so as to obtain a singular finite element of class 4>l. reduced singular Zienkiewicz triangle PT={pe P'i(T)® V {gj;6vpe Pt(Г) for each side T of T} (cf. (8.3) and D7.1)) dimPr = 9 ZT = {p(a,), dtpiui), e2p(a,): 1«: i < 3} Fig. 47.2.
294 P,G. Ciarlet Chapter VII Let T be a triangle, and let R3{T) denote the space of tricubic polynomials, i.e., polynomials whose restrictions along each parallel to any side of Г are polynomials of degree О in one variable. One can then show that the space R3(T) coincides with the space Р3(Г) to which are added linear combinations of the three functions Af Я2Я3, kx к\къ and kx k2k\ (which are not linearly independent, however) and that dimR3(T)=12. This space was introduced by Birkhoff [1971]. Following Birkhoff and Mansfield [1974], we then define the Birkhoff- Mansfield triangle as indicated in Fig. 47.3 (as usual, 8vrp(fe;) = D2p(b,)(v, т) where r is the unit tangential vector at the point bt). The following properties can then be established: Birkhoff-Mansfield triangle Рг = К3(ТH V fi^iii-}, dimPr = 15 Zt = {p(fl,Q, Sip(a,), 82р(а;), 6vp(bj),Svtp(bi): 1 <i$3} Fig. 47.3. The functions in the space PT are polynomials of degree ^ 3 in one variable along each side of the triangle T, and any directional derivative Dp(-)^, where £ is any fixed vector in U2, is also a polynomial of degree ^ 3 in one variable along each side of the triangle T; the set IT is Pr-unisolvent; the resulting Birkhoff-Mansfield triangle is of class e#1, and the inclusion PTczH2{T) holds. Finally, a regular family of Birkhoff-Mansfield triangles is almost-affine, with the value к = Ъ in the corresponding interpolation error estimates D4.5). A similar analysis (unisolvence and interpolation error estimate) can be carried out for the reduced Birkhoff-Mansfield triangle, whose characteristics are indicated in Fig. 47.4. Other ways of adding rational functions are mentioned in Mansfield [1974, 1978], Dupuis and Goel [1970]. Boolean sum interpolation theory can also be used to derive blending polynomial interpolants, which interpolate a function v e ^""(T) and all its derivatives of order ^ m on the (possibly curved) boundary of
Section 48 Fourth-order problems 295 reduced Birkhoff-Mansfield triangle PT = {PeR3(T)@ V I-—1-} Dp(-)ZeP2(K') i~1 IM + M+i) for each side T of T and all feR2} dimPr = 12 Zr = {p(ai\ SiPfe), SzPfa), d.Pfe): 1 <K3} Fig. 47.4. a triangle T. In this direction, see Barnhill [1975a, 1975b], Barnhill, Birkhoff and Gordon [1973], Barnhill and Gregory [1975a, 1975b]. 48. Estimates of the error || и—»* II 2,0 for finite elements of class <ёх Let us now return to the finite element approximation of a fourth-order problem, such as the clamped plate problem D4.1), to fix ideas. We consider a family of finite element spaces Xk, made up with finite elements (Г, PT, ET) of a given type, which satisfy the following assumptions: (H2') The family (T,PT,ZT), Те (J Fh, is almost-affine. h (H3') All finite elements (Г, PT, IT), Те (J &~h, are of class <el. h Note that (H2') and (H3') parallel the assumptions (H2) and (H3) made in Section 17. If we assume, as in the subsequent theorems, that the inclusions Рт с Н2{Т) hold, the inclusion Xh<^H2(Q) then follows from hypothesis (H3'). This being the case, let Vh = X00h = {vheXh;vh = dyvh = 0 on Г}. D8.1) Note further that the ^-interpolation operator associated with any one of the finite elements of class &1 that we have so far considered satisfy the implication vedomIIh, v = dvv = 0 on Г => IlhveX00k, D8.2)
296 P.G. Ciarlet Chapter VII which will accordingly be an implicit assumption in the remainder of this section. To begin with, we obtain an estimate of the error \\u~uh\\2,n- As usual, the same letter С represents various constants which may not be the same in their various occurrences. We also recall that assumption (HI) means (cf. Section 17) that the spaces Vh are associated with a regular family of triangulations. Theorem 48.1. In addition to (HI), (H2'), (H3'), assume that there exists an integer fc^2 such that the following inclusions are satisfied: Pk(T)^PTcH2(T), D8.3) Нк+1(Т)о,ЩТ), D8.4) where s is the maximal order of partial derivatives occuring in the definition of the set ET, and let uhe Vh denote the discrete solutions. Then if the solution и е Ho{Q) of the clamped plate problem is also in the space Hk+1(Q), there exists a constant С independent ofh such that \\u~uh\\2,n^Chk-l\u\k+un- D8.5) Proof. Using Cea's lemma (Theorem 13.1), inequality D4.5), and relation D8.2), we obtain f V2 VhsVh <C||«-/T*«||2.0 = C-j X \\u-nTu\\iT I TeFh J From this theorem, the least assumptions that insure an 0(/i) convergence of the error ||u — uj2,n are the inclusions P2(T)<=.PT on the one hand, and the H3{Q)- regularity of the solution и of the plate problem on the other hand. Notice that it is remarkable that this regularity result holds precisely if the right-hand side / is in the space L2{Q), and if О is a convex polygon, an assumption often satisfied by plates. Therefore, since one cannot expect better regularity in general, the choice PT — P2(T) seems optimal from the point of view of convergence. However, by ZeniSek's result (Section 9), this choice is not compatible with the inclusion Xh a ^l(U) for triangular "polynomial" finite elements, and thus either higher-degree polynomials must be used, or one should use "nonpolynomial" finite elements such as singular or composite elements in order to decrease the dimension of the space PT. These considerations are illustrated in the following tableau (Fig. 48.1), where we have summarized the application of Theorem 48.1 to various finite elements of class <€l. One should notice that, if the reduced Hsieh-Clough-Tocher triangle and the reduced singular Zienkiewicz triangle are optimal in that the dimension of the corresponding spaces PT is the smallest, this reduction in the dimension of the spaces
Section 48 Fourth-order problems 297 Finite element Argyris triangle Bell triangle Bogner-Fox-Schmit rectangle Hsieh-Clough-Tocher triangle Reduced Hsieh-Clough- Tocher triangle Singular Zienkiewicz triangle Reduced singular Zienkiewicz triangle dim PT 21 18 16 12 9 12 9 Рк(Т)сЯг Ps(T) = PT PAT)^PT P3(T)<zPT P3(T)<=PT P2(T)czPT P2(T)czPT P2(T)cPT Ци-Mftlko 0(A«) 0(Л3) Oih2) 0(h2) 0(h) 0(h) 0№) Assumed regularity of the solution ueH6(Q) И£й5(й) ueH*(Q) ueH*(Q) ueH\Q) ueH\Q) ueH3(Q) Fig. 48.1. Orders of convergence obtained with various finite elements of class #'. PT is obtained at the expense of an increased complexity in the structure of the functions psPT. Notice that in order to get an 0(hk+1) estimate of the error \u — W/,|о,й based on the Aubin-Nitsche lemma (Theorem 19.1), it would be necessary to assume that, for any g e L2(Q), the corresponding solution (pg of the plate problem belongs to the space H\Q)c\Hl(Q) and that there exists a constant С such that || (pg\\4.,n^C\g\0<n for all gsL2(Q). However, this regularity no longer holds on convex polygonal domains in general; cf. Kondratev [1967], Grisvard [1985]. It is true only if the boundary Г is sufficiently smooth: For example, this is the case if the boundary Г is of class сёА. But then this regularity of the boundary becomes incompatible with our assumption that Q be a polygonal set! We next give minimal assumptions (cf. D8.6)) that guarantee convergence. Theorem 48.2. In addition to (HI), (H2'), and (H3')> assume that the inclusions P2(T)cPTc:H2(T) D8.6) are satisfied, and that the maximal order я of partial derivatives found in the set XT is <2. Then lim||«-K»||2.fl = 0. D8.7) A->0 Proof. The argument is the same as in the proof of Theorem 18.2 and, for this reason, will be only sketched. Using inequality D4.5) with к=2, p=oo, m-2 and q = 2, one first shows that the space Г = W3^(Q)nHl(Q)
298 P.G. Ciariet Chapter VII is dense in the space Hl{Q), and this result is used in conjunction with the inequality inf ||u-uh||2ifl^||u-i;i|2,o+||t)-nii;||2i„ Vh E Vh which holds for any function ve'f. □ There are further aspects of the finite element approximation of the plate problem or of more general fourth-order problems that will not be covered here. Let us simply mention some relevant references: Rannacher [1976a] has obtained estimates of the error \u — и»1о,а>,д- The effect of numerical integration is analyzed in Bernadou and Ducatel [1978]. The approximation of fourth-order problems on domains with curved boundaries is considered by Mansfield [1978], who handles in addition the effect of numerical integration. Her approach parallels that given in Ciarlet and Raviart [1972c] for second-order problems (cf. Chapter VI). Curved isoparametric finite elements of a new type are suggested by Robinson [1973]. In the case of the simply supported plate problem, we mention the BabuSka paradox (cf. BabuSka [1963]; see also Birkhoff [1969]): Contrary to second-order problems, no convergent approximation may be found if the curved boundary is replaced by a polygonal domain. For, assume Q is a circular domain; then the exact solutions on a sequence of polygonal domains approaching the circle do not converge to the solution on the circle! Additional references concerning curved boundaries, or different boundary conditions for the plate problems, are Nitsche [1971, 1972b], Chernuka, Cowper, Lindberg and Olson [1972], and the survey of Scott [1976b]. Finite element approximation of variational inequalities of order four are considered by Glowinski [1975,1984], Glowinski, Marini and Vidrascu [1984]. See also Glowinski, Lions and Tremolieres [1976b, Chapter 4]. 49. A nonconforming finite element for the plate problem: The Adini-Clough-Melosh rectangle The general approach followed in this section is that of Ciarlet [1974a, 1974b]. In Lascaux and Lesaint [1975], a thorough study is made not only of the Adini- Clough-Melosh rectangle, but of other nonconforming finite elements for the plate problem, such as the Zienkiewicz triangle, Morley's triangle, and various instances of Fraeijs de Veubeke triangles (these last finite elements are briefly described at the end of this section); the convergence of the Adini-Clough- Melosh rectangle has also been studied by Kikuchi [1975b, 1976] and Mivoshi [1972]. Curved nonconforming elements have been considered by Barnhill and Brown [1975]. Nonconforming elements for the plate problem are extensively discussed from an engineering viewpoint in Zienkiewicz [1971, Chapter 10]. Although the references given in Chapter V were more specifically concerned with second-order
Section 49 Fourth-order problems 299 problems, some of them are also relevant in the present situation, notably Сёа [1976], Nitsche [1974], Oliveira [1977]. We first give the general definition of a nonconforming method for solving the clamped plate problem, which corresponds to the data D4.1). Assuming that the set Q is polygonal, so that it may be exactly covered with triangulations, we construct a finite element space Xk whose generic finite element (T, PTt ET) is not of class <€1. Using the same arguments as in Theorem 30.1, one can then prove the following result, which shows that in this case, the space Xh cannot be a subspace of H2(Q) (note that this theorem is the converse of Theorem 5.2). Theorem 49.1 Assume that the inclusions РтсЯ>1{Т) for all T' e2Th and XhaH2(Q) hold. Then the inclusion holds. Let us henceforth assume that we are using finite elements (T, PT, IT) that satisfy PTcH2{T) for all Te^;. D9.1) Then we have in particular XhcL2(Q). D9.2) After defining an appropriate subspace Xook of Xh, which takes the boundary conditions f=Svt>=0 along Г into account as well as possible (this will be illustrated on one example), we define the approximate bilinear form: ah{uh,vh)= Y, {AuhAvh-\-{l-v){2d12uhd12Vh-'diiUhb22vh-'d22uhdllvh)}&x Te<rh J = Y. {v^M»* + (l-v)(911Mj811i;A + 822MA922^ + 2812MA9i2l'A)}dx. TeSrh J D9.3) Observe that this definition is justified by the inclusions D9.1). Then the discrete problem consists in finding a function uheVh = X00ll such that a*(«*, Vh) = 1Ы for all vh e Vh D9.4) (the linear form need not be approximated in view of the inclusion D9.2)). Since \'\z,a is a norm over the space V=Hl(Q), we are naturally led to introduce the seminorm »*-KII* = { I I»*||.t}1/2 D9.5) over the space Vh. We also extend the domains of definition of the mappings ah( •, •) and || • ||л to the space {Vh + V). Thus there exists a constant M independent of the
300 P.G. Ciarlet Chapter space Vh such that | a„(u, v)\^ M || и || h || vh fotauu,ve{Vh+V). D In the ensuing analysis, we shall essentially concentrate our study on one exarr of a nonconforming finite element method for solving the plate problem, which ma use of the Adini-Clough-Melosh rectangle, sometimes abbreviated as the AC rectangle. This element, due to Adini and Clough [1961] and Melosh [1963], correspo to the following data: The set R is a rectangle whose vertices a„ 1 ^ i'^4, are coun Ф- аг a3 т" Ft t 3l f; a4 r: © 2h, Adini-Clough-Melosh rectangle /,T = i>3(T)®V{x1xLx?x2}, dimPr=12 2;T = {p(el),81p(e,}.8,p(e,):l<i<4} a, T, e 2h Fig. 49.1. as in Fig. 49.1. The space PT consists of all polynomials of the form />:x = (xbx2)-»p(x)= X Уа,»;,*!1*:!2+ 713*1*2+Уз1*1*2> i.e., Рт^Ръ{Т)®У{х1х1,х\х2}; D
Section 49 Fourth-order problems 301 hence the inclusion P3(T)CPT D9.8) holds, and dimPr=12. D9.9) Finally, the set Гг is given by 1т^{р(ад,д1Р(ад,д2Р(ад- 1^К4}. D9.10) Let us assume without loss of generality that the set T is the square t= [— 1, +1]2, in which case it is easily verified that P=YP{ai)Pi+ I D№№j-&dPtj iorallpePf, D9.11) i=l |j'-<| = l(mod4) with PiM=4(i +*iXi +*2)(i+Mxi +x2)-Uxl+xD), Рг2(х) = Ш+х1)A+х2JA-х2), D9.12) Pi4W = 8(l + ^2)(l+^iJ(l--^i), etc. Therefore this identity implies that the set IT is PT-unisolvent. Let us assume that the set Q is rectangular, so that it may be covered by triangulations made up of rectangles. With such a triangulation 3^, we associate a finite element space Xh whose functions vh are defined as follows: (i) for each rectangle Te^l, the restrictions vh\T span the space PT of D9.7); (ii) each function vh e Xh is defined by its values and the values of its first derivatives at all the vertices of the triangulation. Along each side 7" of a rectangle Г, the restrictions p\T',pePT are polynomials of degree ^ 3 in one variable. Since such polynomials are uniquely determined by their values and the values of their first derivative at the end-points of T", we conclude that the Adini-Clough-Melosh rectangle is a finite element of class <€a. It is not of class c£1, however: Along the side T\ = [a4, ax] (for instance; cf. Fig. 49.1), the normal derivative is a polynomial of degree < 3 in the variable x2 on the one hand, and on the other the only degrees of freedom that are available for specifying the normal derivative along the side T\ are its values at the two end-points. We then let Vh = X0oh, where X00h denotes the space of all functions vheXh that satisfy vh{b)=biVh{b)—b2vh{b) = Q at all the boundary nodes b. Then the functions vh e Vh vanish along the boundary Г; their derivatives dvvh vanish at all the boundary nodes, but not along the entire boundary Г, in general. To sum up, we have constructed a finite element space Vh whose functions vh satisfy: vheHh(Q)nC°(Q),vh\TeH2(T) for all Te^, ^ „v л , , , . D9.13) cvvh{b)=0 at the boundary nodes. Observe that the associated ^-interpolation operator Uh is such that v<EHl{Q)n&omTlh => nhveX00h=Vh. D9.14)
302 P.G. Ciarlet Chapter VII Hence this implication holds in particular for functions in the space Ho(Q)r\H3(Q), since H3(Q)c^\Q) = domnh. Prior to the error analysis, we must examine whether the mapping || ■ ||A of D9.5) is indeed a norm. Theorem 49.2. The mapping Vh-+\\vh\\h = \ £ \ьь\\Л is a norm over the space Vh. Proof. Let vh be a function in the space Vh such that || vh || h — 0. Then the functions 8j(«ii I r)> J=U 2, are constant over each rectangle Te^j,. Since they are continuous at the vertices, the functions djvh, j= 1,2, are therefore constant over the set Q, and since they vanish at the boundary nodes, they are identically zero. Thus the function vheVh is identically zero, as a consequence of the inclusion К),сЯо@)п^0@). □ Notice that the approximate bilinear forms ah(-,') are uniformly Vh-elliptic, since (l~v)\\vh\\2h^ah(vh,vh) for аЛ о»б И», D9.15) by D9.3), and the Poisson coefficient v lies in the interval ]0, i[. Remark 49.1. Had we tried to use nonconforming finite element methods for the biharmonic problem, in which case the approximate bilinear form reduces to ^re^fr^A^dx, the uniform FA-ellipticity is no longer automatic, and this is essentially why we restrict ourselves to plate problems. By contrast, the conforming methods described in the previous section apply equally well to any fourth-order elliptic boundary value problem. 50. Estimate of the error {T.Te^\u-uh\l T}1'2 for the Adini-Clough-Melosh rectangle We shall assume in what follows that the spaces Vh are associated with a regular family of triangulations, and that the solution и is in the space Н3(Ц|пЯо@); this is и ti ^Л ■ r n ii , |a*(",Wft)-fK)l\ ,,Л1, Ци-ил||л<С mf l\u-vjh+ sup — . E0.1) V^hsKh WheVh \\Wh\\h J We shall assume in what follows that the spaces Vk are associated with a regular family of triangulations, and that the solution и is in the space H3(Q)r~\Ho(Q); this is a reasonable regularity assumption, which holds in particular if /e L2(Q) and Q is a convex polygon, i.e., a rectangle in the present case. Since any family of
Section 50 Fourth-order problems Adini-Clough-Melosh rectangles is affine, we obtain inf \\u-vh\\h^\ £ \и-Пти\22Л <C/i|«|3jfl, 303 E0.2) and this estimate takes care of the first term in the right-hand side of inequality E0.1). The estimate of the second term, i.e., the consistency error estimate, rests on a careful decomposition of the difference Dh{u,wh) = ah{u,wh)-f{wh), wheVh, E0.3) Let us first show that the term l(wh) = \Q fwhdx can be rewritten as l(wh) = V(Au)-Vwhdx for all whe Vh E0.4) (this equality clearly holds if и e H\Q)r\Hl{Q), in which case l(wh)= \nA2u wh dx, but we only assume here that и е H*{Q)nHl(Q)). To see this, let wh e Vh be given, and let (w\) be a sequence of functions wkh e ЩО) such that limt^ „, || wkh — wh\\ 1>fi = 0 (recall that whe VhcHi{Q)). By making use of Green's formulae B.5) and B.9), we obtain for all integers k, AuAwkhdx = V(Au)-Vwkhdx, {2812«912.wJ-811M822wX-822«anwJ}dx = 0> since 8„wJ = 8twj; = 0 along Г, and thus, by definition of the abstract problem (cf. D4.1)), fwkhdx=-\ V(Au)-Vwkhdx. n a Therefore, fwh dx = lim fw\ dx = lim I AuAwl dx k-> oo = lim <(- | V(Au)-Vwkhdx}=- k~* oo V(Au)-Vwhdx, and equality E0.4) is proved.
304 P.G. Ciarlet Chapter VII Using the same Green formulae as above, we obtain with the same notation as in B.9), {JuJwfc + (l-v)B812u812Wft-3uu822H'fc-822u911w^}dx V(Au)-4whdx + Audv,Twhdy dT + (l-v) {-dZZtTudv,Twh + dVTrTudZtTwh}dy for all Te$~h and all whe V„. When these expressions are added as in the approximate bilinear form of D9.3), we find that Y i - VDu) • Vw„ dx i = - V(Au)• Vwh dx = J(w„), г n using the inclusion Vhc:H\Q) and relation E0.4). Let us next show that £ Svt,r"St,r(wft|r)dy=0- 7e55, J er To this end, consider separately the situation where T a 8 Г is a side common to two adjacent rectangles Tx and T2, and the situation where T сЭГ is a portion of the boundary Г. In the first case the two corresponding integrals cancel because и e H3(Q) and wh € C°(Q), and in the second case the integral vanishes because wh = 0 along Г. To sum up, we have found that Dk(u,wh) = ah{u,wh)-l(wh) = X I (Л«-A - v)8„,r«)8TfT(wfc,r) dy for all whe V„, E0.5) ar i.e., we have obtained one decomposition of the expression Dh(u, wh) as Dk(u,wh)= Y DT(u\T,wh\T), Te$-h where each mapping DT(-, •) is a bilinear form over the space H3(T) x PT. As in the proof of Theorem 34.1, the key idea consists in obtaining another decomposition of the form E0.5) (cf. E0.8)), which takes into account the "conforming" part of the first-order partial derivatives of the functions in the space Vh (for a related idea, cf. Remark 34.1). This will in turn allow us to obtain appropriate estimates of the difference Dh{u, wh), as we now show.
Section 50 Fourth-order problems 305 Theorem 50.1. Assume that the solution и of the plate problem is in the space Ho@)r\H3(Q). Then,for any regular family of triangulations, there exists a constant С independent ofh such that \\u-uh\\h = < X \и-иИ\22Л ^Ch\u\3yD, E0.6) ITe&h J where uh denotes the discrete solution found with Adini-Clough-Melosh rectangles. Proof. After inspecting the decomposition E0.5), we are naturally led to study the bilinear form Dh{-,-):{v,wh)eH\Q)xVh -+Dh(v,wh)= £ {Av~(l-v)dt%Tv)dVtT{wyT)dy Teffi, J dT =D1h(v,d1wh)+D2h(v,d2wh), E0.7) with Dl(v,djwh)= £ Tesrh T'j {Av-(l-v)dxl,Tv)dj(whlT)dy {Av-{l-v)druTv)dj(whlT)dy\, j = l,2, Tj where, for each Те ^rh, the sides T} and T'j,j=l, 2, are defined as in Fig. 49.1. For each triangulation &~h, let Yh denote the finite element space whose generic finite element is the bilinear rectangle, and let Zh = Y0h denote the space of all functions wh e Yh that vanish at the boundary nodes. Clearly, the inclusion Z„ <= V°(i!)nHh(Q) implies that D{(v,zh) = 0, 7 = 1,2, forallreH3(fi), zheZ„, where Di(v,zh)= £ j TeSTh T'j (Av-(l-v)dtliTv)zhdy (Av-{l-v)dzuTv)zhdy\, 7=1,2. Tj Consequently, if AT denotes for each Te^h the g^THnterpolation operator
306 P.G. Ciarlet associated with the bilinear rectangle, we can also write Ai(u,Wi)= £ DT(v,wh) foiall(v,wh)eH3(Q)xVh, TeSTh where, for each Те 3~h, the bilinear form DT(', •) is given by DT(v,p)^AUT(v,diP) + A2iT(v,d2p) for all (v,p)e H3(T)xPT, with Chapter VII <4j,rMjP): (Av-(l- v)drZtTv)(djp - ATdjp) dy Tj E0.8) E0.9) (Av - A - v)8„,To)(ajP - ЛгЭ,.р)ё7, j = 1,2. E0.10) Using the definition of the operator Лт, we find a Jirsr polynomial invariance: AlT(v,q) = 0, j=l,2, for all г;еЯ3(Г), geQ^T), E0.11) with Aj,t(v> l) = (Av-(l~ vN„iTv)(q - ATq) dy (Ли - A - vMZZiTv){q - ATq) dy, j = 1,2. We next establish the second polynomial invariance: AlT(v,q) = 0, j = 1,2, for all veP2(T), qedjPT, where the spaces ejPT = {djp;pePT}, j= 1,2, both contain the space Qi(T). To see this, it suffices to show that E0.12) E0.13) (q-ATq)dy = \{q-ATq)dy, ./=1,2 for all qedjPT. E0.14) Tj Tj Let us prove this equality for,/ = 1, for instance. Each function qedj^Pj is of the form q = y0(*i) + ri(*i)*2 + y2*i + Уз*1 wherey0 andyj are polynomials of degree ^ 2 in the variable xt. Given any function
Section 50 Fourth-order problems 307 r defined on a side T, let XT{r) denote the linear function along T" that assumes the same values as the function r at the end points of T. Then we have (q - ATq)\Ti{x2) = y2x\ + уъх\ - kT\{y2x\ + y3x2), and therefore, (q-ATq)\ri(x2) = (q-ATq)\r;(x2), which proves E0.14). Consequently, the polynomial invariance of E0.12) holds. To estimate the quantities AjiT(v, djp) of E0.10), it suffices to estimate the similar expressions hA<P,<l)= \9(q-ATq)dy- \q>(q-ATq)dy E0.15) Tj Г] for cpeH1(T),qed)PT,) = \,2. Using the standard correspondences between the functions v: T-> R and v: Г-> Ш, we obtain <5i,r(<7>,4) = Mi,f(<M), <52,r(<P,«) = M2,7-(<M); E0.16) we shall also take into account the fact that a function q belongs to the space QjPt when the function q belongs to the space 5jPT- Corresponding to the polynomial in variances E0.11) and E0.12), we now have: «5,,H<M) = 0 for all феН\Т), qeP0(t), <5;,r(<M) = 0 for all феР0(Т), qedjPt. Hence if we equip the spaces djPT with the norm |j • ||1>r, we obtain \ht@>4)\^^\\0\\LHbt)M\\LHst)<^^\\i.TH\\i.t for all феН\Т), qedjPT, and thus each bilinear form fy, j- (•, •) is continuous over the space H1 (t) x djPf. The bilinear lemma (Theorem 33.1) then implies that there exists another constant С such that \31Т(ф,ф\^С\ф\иТ\4\иТ for all феН^Т), qed}PT. E0.18) By Theorem 15.1 and the regularity assumption, there exists a constant С such that \Ф\1,т^С\ср\ит, \qh,T^C\q\UT. E0.19) Combining relations E0.16), E0.18) and E0.19), we conclude that \Sj,T(<P><l)\<Ch\(p\liT\q\UT foiallcpeH'iT), qedjPT, ./=1,2. E0'20) Let then veH3(T) and pePT be two given functions; hence the functions cp = Av — A — v)b22vm&q = 6\p belong to the spaces H1^) and SjPy, respectively.
308 P.G. Ciariet Chapter VII Then we have \Al,T(V,p)\=\e1,T(Av-(l--v)d22V,d1p)\^ChT\v\3tT\p\2iT. Thus, after analogously estimating the term \A2tT(v,p)\, we obtain \DT(v,p)\^ £ \AJtT(v,d}p)\^ChT\v\3<T\p\2,T ;=i for all veHb{T), pePT. In this fashion we are able to estimate the second term in the abstract error estimate E0.1): We find that, for all wheVk, \ah{u,w„)-l{wh)\^ £ \DT(u,wh)\^Ch\u\3^\\wh\\h, TeFh and the proof is complete. □ Using the abstract error estimate mentioned in Remark 31.1, Lascaux and Lesaint [1975] have shown that the error estimate expressed with the norm || • || 1>n is II и - u„ \\ua <С/г2|и|3,„, i.e., it is of a higher order, as expected. The same authors have also shown that the error estimate E0.6) can be improved when all the rectangles Te$~H are equal, in the sense that in this case, \\u-uh\\h^Chz\u\^Q, if the solution и is in the space HA(Q). Another popular nonconforming finite element for solving the plate problem is the Zienkiewicz triangle (cf. Bazeley, Cheung, Irons and Zienkiewicz [1965]) which was described in Section 8 (cf. Fig. 8.2). Through a refinement of the argument used in the proof of Theorem 50.1, Lascaux and Lesaint [1975] have shown that the needed polynomial invariances hold if and only if all sides of all the triangles found in the triangulation are parallel to three directions only, in which case one gets \\u-uh ||ft ^ Ch\u\3tC), \\u~uh ||1>0 < Ch2\u\xn, if the solution и is the space H3(Q). This is therefore an answer to the Union Jack problem: As pointed out in Zienkiewicz [1971, pp. 188-189], the engineers had empirically discovered that the configuration of Fig. 50.1(a) systematically yields poorer results than the configuration of Fig. 50.1(b). The reason why the degree of freedom p[a123) (which is normally found in the cubic Hermite triangle) should be eliminated is that the presence of the associated basis function AiA2A3 (cf. (8.1)) destroys the required polynomial invariances. See also Shi [1984e, 1987b] for a study of the Zienkiewicz triangle from the viewpoint of Stummel's generalized patch test. A modified Zienkiewicz triangle has also been proposed by Hanssen, Syvertsen and Bergan [1978], and further developed by Argyris, Haase and Mlejnek [1980,
Section 50 Fourth-order problems 309 Я (a) Fig. 50.1. The Union Jack problem: If the nonconforming Zienkiewicz triangles are arranged as in (a), the method does not converge; if the triangles are arranged as in (b), the method converges. 1982]. Then Shi [1987] has shown that the associated nonconforming finite element method is convergent, without any geometrical restriction imposed on the triangulations. Whereas the Adini-Clough-Melosh rectangle and the Zienkiewicz triangle yield finite element spaces that satisfy the inclusion Vhc<e°(Q)r\Ho(Q), there exist convergent nonconforming finite elements for the plate problem that are not even of class <ё°. Here we shall simply describe two such elements and mention their convergence properties, referring the reader to Lascaux and Lesaint [1975] for complete proofs. See also Ruas [1988] for extensions to biharmonic equations in R", n>3. The first element is named Morley triangle, after Morley [1968]; its characteristics are indicated in Fig. 50.2. Details about its implementation are found in Peisker and Braess [1987]. The second element, named Fraeijs de Veubeke triangle after Fraeijs de Veubeke [1974]), is an example of a finite element where some degrees of freedom are averages Morley triangle PT = PZ{T), dim PT = 6 IT = {р(а,у. 1 =S i < 3; Svp(alV): 1 < i <j s: 3} Fig. 50.2.
310 P.G. Ciarlet Chapter VII Fraeijs de Veubeke triangle Рт = {реР3(Т);ф(р)^0}, dimPr = 9 M0 = 27p(a1M)- l p(fl,)-8 I pK')+3 ^ -L IDpi-)(a12i-ai)dy ST = \ pfa): 1 $ is* 3; p(ay): 1 =S i < J =S 3; —- 6vp dy: 1 sj i ^ 31. Fig. 50.3, (another similar instance is Wilson's brick; cf. Section 32). Its characteristics are indicated in Fig. 50.3 where for each i, | T\ | denotes the length of the side T\. One can then prove that each set IT is Pr-unisolvent, and that, for regular families and for any veH3(T) с dom Пт, \v~nTv\miT^Ch3~m\v\3,T, 0^m^3, i.e., regular families of Morley triangles or Fraeijs de Veubeke triangles are almost-affine (the space PT corresponding to the Fraeijs de Veubeke triangle contains the space P2(T)). Let next Xh denote the associated finite element space, and let Vh = X00h, where X00h is composed of the functions in Xh whose degrees of freedom vanish along the boundary Г. Although neither element is of class <ё°, the averages of the first-order partial derivatives are the same across any side common to two adjacent finite elements, and the same averages vanish along a side included in Г in both cases; these last facts play a critical role in the error analysis. It can then be established that, in each case, the seminorm || ■ \\h of D9.5) is a norm over the space Vh, and that, if the solution и belongs to the space H4(Q), the error estimate It и — "л Ik ^ C(fc|u|3>n + /г2|иЦ,о) holds. Note that, contrary to the Zienkiewicz triangle, no restriction need to be imposed here on the geometry of the triangulations in order to get convergence.
Section 50 Fourth-order problems 311 The decomposition E0.5) is here replaced by Dh(u,wh) = ah(u,wh) ttu-(l~v)dzt,TU)dv,T(WlllT}dy = z TeSTH ЭГ - Z {{КтМЫт) + A - v)8vtjr«8t,r(wft|r)} dy ТеЛ J dT for all WheVh, and the key idea is again to subtract off appropriate "conforming" parts in the above expression. Then it is possible to apply the bilinear lemma, one side at a time rather than one element at a time, as in the case of Wilson's brick or the Adini-Clough- Melosh rectangle. There are alternate ways of defining nonconforming methods for plate problems. For example, let us assume that we are given a finite element space Vh that satisfies only the inclusion Vh с Я?°(й)г\Н1{0). If we assume as usual that the functions in the spaces PT are smooth, the conformity would require the additional conditions that 8v(t>j,|rA)) + 8v(ty|rB))=0 along any side T" common to two adjacent finite elements T(l) and TB), and that dvvh = 0 along Г. If these conditions cannot be exactly fulfilled, they may be considered as constraints, and accordingly, they may be dealt with either by a penalty method or by duality techniques (for a general introduction to these techniques, see e.g. Ciarlet [1983]). In the first approach, one minimizes a functional of the form J*(Vh) = iahivh, vh) - l(vh) + — 0(vh), where ФЮ= Z f {8vNr,i,) + 6vNr<2,)}2dy+ f(8v.J2dr, ГA)#ГB) ГA)оГB) Г and e( •) is a function of h that approaches zero as h -»0, e.g., e(/i) = Ch", С > 0, where the exponent a > 0 is chosen so as to maximize the order of convergence. A method of this type has been studied by Babuska and Zlamal [1973], who showed that cubic Hermite triangles used in this fashion yield the error estimates 1|и-"Л*<Сч/й||м||з,й, ||M-MjJi,fl^Cfe||M||3,0, if ueH3{Q) and s(h) = Ch2, and II" - "* II* <Ch\\u||4in, ||и - uh ||lifl ^ Ch\\и ||4>й, if ueHA(Q) and s{h) = Ch3. These authors have applied this penalty method to the
312 P.G. Ciarlet Chapter VII biharmonic problem instead of the plate problem. Such techniques are actually used in practice: see Zienkiewicz [1974]. The second approach consists in introducing an appropriate Lagrangian. This is advocated for example by Harvey and Kelsey [1971], who also use the cubic Hermite triangle, but for solving the plate problem. The uniform convergence of nonconforming finite element methods for plate problems is studied in Rannacher [1977].
References Adams, R.A. A975), Sobolev Spaces (Academic Press, New York). Adini, A. and R.W. Clough A961), Analysis of plate bending by the finite element method, NSF Rept. G.7337. Agmon, S. A965), Lectures on Elliptic Boundary Value Problems (Van Nostrand, Princeton, NJ). Alfeld, P. A984), A trivariate Clough-Tocher scheme for tetrahedral data, Comput. Aided Geom. Design 1, 169-181. Apprato, D. and R. Arcangeli A979), Approximation d'un probleme aux limites elliptique d'ordre deux par elements finis rationnels de Wachspress avec integration numerique, RAIRO Anal. Numer. 13, 3-20. Apprato, D., R. Arcangeli and J.L. Gout A979a), Sur les elements finis rationnels de Wachspress, Numer. Math. 32, 247-270. Apprato, D., R. Arcangeli and J.L. Gout A979b), Rational interpolation of Wachspress error estimates, Comput. Math. Appl. 5, 329-336. Apprato, D., R. Arcangeli and R. Manzanilla A987), Sur la construction de surfaces de class C* a partir d'un grand nombre de donnees de Lagrange, Modelisation Math. Anal. Numer. 21, 529-555. Arcangeli, R. and J.L. Gout A976), Sur revaluation de l'erreur d'interpolation de Lagrange dans un ouvert de R", Rev. Francaise Automat. Informal. Recherche Operationnelle Ser. Rouge Anal. Numer. 10, 5-27. Argyris, J.H. A954-1955), Energy theorems and structural analysis, Part I: General theory, Aircraft Engrg. 26, 347-356, 383-387, 394; 27,42-58, 80-94, 125-134 (also published as a book, Butterworths Scientific Publications, London, 1960). Argyris, J.H. and I. Fried A968), The LUMINA element for the matrix displacement method (Lagrangian interpolation), Aero. J. Roy. Aero. Soc. 72, 514-517. Argyris, J.H., I. Fried and D.W. Scharpf A968), The TUBA family of plate elements for the matrix displacement method, Aero. J. Roy. Aero. Soc. 72, 701-709. Argyris, J.H., M. Haase and H.P. Mlejnek A982), On an unconventional but natural formation of a stiffness matrix, Comput. Methods Appl. Mech. Engrg. 22, 1-22. Argyris, J.H., M. Haase and H.P. Mlejnek A980), Some considerations on the natural approach, Comput. Methods Appl. Mech. Engrg. 30, 335-346. Argyris, J.H. and H.P. Mlejnek A986-1988), Die Methode der Finiten Elemente, Band 1: Verschiebungs- methode in der Statik; Band 2: Kraft und gemischte Methoden, Nicht-linearitaten; Band 3: Einfiihrung in die Dynamite (Vieweg, Braunschweig) (English translation in preparation; North-Holland, Amsterdam). Aronszajn, N. and K.T. Smith A957), Characterization of positive reproducing kernels; Application to Green's/unctions, Amer. J. Math. 79, 611-622. Atteia, M. A975), Fonctions "spline" et methode d'elements finis, Rev. Francaise Automat. Informat. Recherche Operationnelle Ser. Rouge Anal. Numer. R-2, 13-40. Atteia, M. A977), Evaluation de l'erreur dans la methode des elements finis, Numer. Math. 28, 295-306. Aubin, J.P. A967a), Approximation des espaces de distributions et des operateurs differentiels, Memoire 12, Bull. Soc. Math. France. Aubin, J.P. A967b), Behavior of the error of the approximate solutions of boundary value problems for linear elliptic operators by Galerkin's and finite difference methods, Ann. Scuola Norm. Sup. Pisa 21, 599-637. 313
314 P.O. Ciarlel Aubin, J.P. A968a), Evaluation des erreurs de troncature des approximations des espaces de Sobolev, J. Math. Anal. Appl. 21, 356-368. Aubin, J.P. A968b), Interpolation et approximations optimales et "spline functions", J. Math. Anal. Appl. 24, 1-24. Aubin, J.P. A969), Approximation des problemes aux limites non homogenes et regularite de la convergence, Calcolo 6, 117-139. Aubin, J.P. A972), Approximation of Elliptic Boundary-Value Problems (Wiley-Interscience, New York). Avez, A. A983), Calcul Differentiel (Masson, Paris). Axelsson, O. and V.A. Barker A984), Finite Element Solution of Boundary Value Problems: Theory and Compulation (Academic Press, New York). Babu, D.K. and G.F. Pinder A984), Analytical integration formulae for linear isoparametric finite elements, Internal. J. Numer. Methods Engrg. 20, 1153-1166. BabuSka, I. A963), The theory of small changes in the domain of existence in the theory of partial differential equations and its applications, in: Differential Equations and Their Applications (Academic Press, New York) 13-26. BabuSka, I, A970a), Finite element methods for domains with corners, Computing 6, 264-273. BabuSka, I. A970b), Approximation by hill functions, Comment. Math. Univ. Carolin. 11, 787-811. BabuSka, I. A971a), The rate of convergence for the finite element method, SI AM J. Numer. Anal. 8, 304-315. BabuSka, I. A971b), Error bounds for the finite element method, Numer. Math. 16, 322-333. BabuSka, I. A972a), A finite element scheme for domains with corners, Numer. Math. 20, 1-21. BabuSka, I. A972b), Approximation by hill functions II, Comment. Math. Univ. Carolin. 13, 1-22. BabuSka, I. A973a), The finite element method with Lagrangian multipliers, Numer. Math. 20, 179— 192. BabuSka, I. A973b), The finite element method with penalty, Math. Сотр. 27, 221-228. BabuSka, I. A974a), Method of weak elements, Technical Note BN-809, University of Maryland, College Park, MD. BabuSka, I. A974b), Solution of problems with interfaces and singularities, in: С de Boor, ed., Mathematical Aspects of Finite Elements in Partial Differential Equations, (Academic Press, New York) 213-277. BabuSka, I. A976), Singularities problem in the finite element method, in: K.-J. Bathe, J.T. Oden and W. Wunderlich, eds., Formulation and Computational Algorithms in Finite Element Analysis (MIT Press, Cambridge, MA) 748-792. BabuSka, I. and A.K. Aziz A972), Survey lectures on the mathematical foundations of the finite element method, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 3-359. BabuSka, I. and A.K. Aziz A976), On the angle condition in the finite element method, SI AM J. Numer. Anal. 13, 214-226. BabuSka, I. and R.B. Kellogg A975), Nonuniform error estimates for the finite element method, SI AM J. Numer. Anal. 12, 868-875. BabuSka, I. and A. Miller A984a), The post-processing approach in the finite element method, Part 1: Calculation of displacements, stresses and other higher derivatives of the displacements, Internal. J. Numer. Methods Engrg. 20, 1085-1109. BabuSka, I. and A. Miller A984b), The post-processing approach in the finite element method, Part 2: The calculation of stress intensity factors, Internat. J. Numer. Methods Engrg. 20, 1111-1129. BabuSka, I. and J. Osborn A980), Analysis of finite element methods for second order boundary value problems using mesh dependent norms, Numer. Math. 34, 41-62. BabuSka, I. and J. Osborn A986), Finite element methods for the solution of problems with rough input data, in: P. Grisvard, W. Wendland and J.R. Whiteman, eds., Singularities and Constructive Methods for Their Treatment, Lecture Notes in Mathematics 1121 (Springer, Berlin) 1-18. BabuSka, I. and М.В. Rosenzweig A972), A finite element scheme for domains with corners, Numer. Math. 20, 1-21. BabuSka, I. and M. Zlamal A973), Nonconforming elements in the finite element method with penalty, SI AM J. Numer. Anal. 10, 863-875.
References 315 Baiocchi, С. A977), Estimations d'erreur dans L°° pour les inequations a obstacle, in: I. Gallioani and E. Magenes, eds., Mathematical Aspects of Finite Element Methods, Lecture Notes in Mathematics 606 (Springer, Berlin) 27-34. Baiocchi, С and A. Capelo A978), Disequazioni Variazionali e Quasi Variazionali: Applicazioni a Problemi di Frontiera Libera B Vols.) (Pitagora, Bologna). English translation: Variational and Quasivariational Inequalities: Applications to Free Boundary Problems (Wiley-Interscience, New York, 1984). Baker, G.A. A973), Simplified proofs of error estimates for the least squares method for Dirichlet's problem, Math, Сотр. 27, 229-235. Bakker, M. A984), One-dimensional Galerkin methods and superconvergence at interior nodal points, SI AM J. Numer. Anal. 21, 101-110. Bamberger, Y. A981), Mdcanique de I'Ingenieur II: Milieux Diformables (Hermann, Paris). Barnhill, R.E. A975a), Blending function finite elements for curved boundaries, in: J.R. Whiteman, ed., Proceedings Conference on the Mathematics of Finite Elements and Applications (Academic Press, London) 59-66. Barnhill, R.E. A975b), Blending function interpolation: A survey and some new results, in: Numerische Methoden der Approximationstheorie, Band 3 (Oberwolfach) 43-89. Barnhill, R.E., G. Birkhoff and W.J. Gordon A973), Smooth interpolation in triangles, J. Approximation Theory 8, 114-128. Barnhill, R.E. and J.H. Brown A975), Curved nonconforming elements for plate problems, Rep. No. 8, University of Dundee. Barnhill, R.E., J.H. Brown, N. McQueen and A.R. Mitchell A976), Computable finite element error bounds for Poisson's equation, lnternat. J. Numer. Methods Engrg. 11, 593-597. Barnhill, R.E. and G. Farin A981), C1 quintic interpolation over triangles: Two explicit representations, lnternat. J. Numer. Methods Engrg. 17, 1763-1778. Barnhill, R.E. and J.A. Gregory A975a), Compatible smooth interpolation in triangles, J. Approximation Theory 15, 214-225. Barnhill, R.E. and J.A. Gregory A975b), Polynomial interpolation to boundary data on triangles, Math. Сотр. 29, 726-735. Barnhill, R.E. and J.A. Gregory A976a), Sard kernel theorems on triangular domains with application to finite element error bounds, Numer. Math. 25, 215-229. Barnhill, R.E. and J.A. Gregory A976b), Interpolation remainder theory from Taylor expansions on triangles, Numer. Math. 25, 401^*08. Barnhill, R.E. and J.R. Whiteman A973), Error analysis of finite element methods with triangles for elliptic boundary value problems, in: J.R. Whiteman, ed., The Mathematics of Finite Elements and Applications (Academic Press, London) 83-112. Barnhill, R.E. and J.R. Whiteman A975), Error analysis of Galerkin methods for Dirichlet problems containing boundary singularities, J. Inst. Math. Appl. 15, 121-125. Barrett, J.W. and CM. Elliott A984), A finite element method for solving elliptic equations with Neumann data on a curved boundary using unfitted meshes, IMA J. Numer. Anal. 4, 309-325. Barrett, J.W. and CM. Elliott A987), A practical finite element approximation of a semi-definite Neumann problem on a curved domain, Numer. Math. 51, 23-36. Bathe, K.-J. A982), Finite Element Procedures in Engineering Analysis (Prentice-Hall, Englewood Cliffs, NJ). Batoz, J.-L., K.-J. Bathe and L-W. Ho A980), A study of three-node triangular plate bending elements, lnternat. J. Numer. Methods Engrg. 15, 1771-1812. Bazeley, G.P., Y.K. Cheung, В.М. Irons and O.C. Zienkiewicz A965), Triangular elements in bending: Conforming and nonconforming solutions, in: Proceedings Conference on Matrix Methods in Structural Mechanics, Wright Patterson A.F.B., Dayton, OH, 547-576. Bell, K. A969), A refined triangular plate bending element, lnternat. J. Numer. Methods Engrg. 1, 101-122. Belytschko, T. and J.S.-J. Ong A984), A consistent control of spurious singular modes in the 9-node Lagrange element for the Laplace and Mindlin plate equations, Comput. Methods Appl. Mech. Engrg. 44, 269-295.
316 P.G. Ciarlet Bendalli, A. A981), Approximation of a degenerate boundary value problem by a finite element method, RAIRO Anal. Numer. 15, 87-99. Berger, A.E. A973), L2-error estimates for finite elements with interpolated boundary conditions, Numer. Math. 21, 345-349. Berger, A.E. A976), The truncation method for the solution of a class of variational inequalities, Rev. Francaise Automat. Informal. Recherche Operationnelle Sir. Rouge Anal. Numer. 10, 29-42. Berger, A.E., R. Scott and G. Strang A972), Approximate boundary conditions in the finite element method, in: Symposia Mathematica 10 (Academic Press, New York) 295-313. Bernadou, M. and J.M. Boisserie A982), The Finite Element Method in Thin Shell Theory: Application to Arch Dam Simulation (Birkhaiiser, Boston, MA). Bernadou, M. and У. Ducatel A978), Methodes d'elements finis avec integration numerique pour des problemes elliptiques du quatrieme ordre, RAIRO Anal. Numer. 12, 3-26, Bernadou, M. and K. Hassan A981), Basis functions for general Hsieh-Clough-Tocher triangles, complete or reduced, Internal. J. Numer. Methods Engrg. 17, 784-789, Bernardi, С A986), Contributions а ГAnalyse Numerique de Problemes Non Lineaires, Doctoral Dissertation, Universite Pierre et Marie Curie, Paris, France. Bernardi, C. A988), Optimal finite element interpolation on curved domains, Technical Rept. R88008, Laboratoire d'Analyse Numerique, Universite Pierre et Marie Curie, Paris, France. Bers, L., F. John and M. Schechter A964), Partial Differential Equations (Wiley, New York). Birkhoff, G. A969), Piecewise bicubic interpolation and approximation in polygons, in: I.J. SCHOENBERG, ed., Approximation with Special Emphasis on Spline Functions (Academic Press, New York) 185-221. Birkhoff, G. A971), Tricubic polynomial interpolation, Proc. Nat. Acad. Sci. U.S.A. 68, 1162-1164. Birkhoff, G, A972), Piecewise analytic interpolation and approximation in triangulated polygons, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 363-385. Birkhoff, G. and S. Gulati A974), Optimal few-point discretizations of linear source problems, SI AM J. Numer. Anal. 11, 700-728. Birkhoff, G. and L. Mansfield A974), Compatible triangular finite elements, J. Math. Anal. Appl. 47, 531-553. Birkhoff, G, M.H. Schultz and R.S. Varga A968), Piecewise Hermite interpolation in one and two variables with applications to partial differential equations, Numer. Math. 11, 232-256. Blair, J.J. A976), Higher order approximations to the boundary conditions for the finite element method, Math. Сотр. 30, 250-262. Blum, H. and М. Dobrowolski A983), On finite element methods for elliptic equations on domains with corners, Computing 28, 53-63, Blum, H, Q. Lin and R. Rannacher A986), Asymptotic error expansion and Richardson extrapolation for linear finite elements, Numer. Math. 49, 11-37. Blum, H. and R. Rannacher A980), On the boundary value problem of the biharmonic operator on domains with angular corners, Math. Methods Appl. Sci. 2, 556-581. Boattin, М. A988), Approximation en coordonnees barycentriques generalisees, These, Universite Paul Sabatier, Toulouse, France. Bogner, F.K., R.L. Fox and LA. Schmit A965), The generation of interelement compatible stiffness and mass matrices by the use of interpolation formulas, in: Proceedings Conference on Matrix Methods in Structural Mechanics, Wright Patterson A.F.B., Dayton, OH, 397-444. Bramble, J.H. A970), Variational Methods for the Numerical Solution of Elliptic Problems (Chalmers Institute of Technology, Goteborg). Bramble, J.H. A975), A survey of some finite element methods proposed for treating the Dirichlet problem, Adv. Math. 16, 187-196. Bramble, J.H. A981), The Lagrange multiplier method for Dirichlet's problem, Math. Сотр. 37, 1-12. Bramble, J.H., T. Dupont and V. Thomee A972), Projection methods for Dirichlet's problem in approximating polygonal domains with boundary-value corrections, Math. Сотр. 26, 869-879. Bramble, J.H. and S.R. Hilbert A970), Estimation of linear functional on Sobolev spaces with application to Fourier transforms and spline interpolation, SI AM J. Numer. Anal. 7, 113-124.
References 317 Bramble, J.H. and S.R. Hilbert A971), Bounds for a class of linear functional with applications to Hermite interpolation, Numer. Math. 16, 362-369. Bramble, J.H., B.E. Hubbard and V. Thomee A969), Convergence estimates for essentially positive type discrete problems, Math. Сотр. 23, 695-710. Bramble, J.H. and J.A. Nitsche A973), A generalized Ritz-least-squares method for Dirichlet problems, S1AM J. Numer. Anal. 10, 81-93. Bramble, J.H., J.A. Nitsche and A.H. Schatz A975), Maximum-norm interior estimates for Ritz-Galerkin methods, Math. Сотр. 29, 677-688. Bramble, J.H. and A.H. Schatz A970), Rayleigh-Ritz-Galerkin methods for Dirichlet's problem using subspaces without boundary conditions, Comm. Pure Appl. Math. 23, 653-675. Bramble, J.H. and A.H. Schatz A971), Least squares methods for 2mth order elliptic boundary-value problems, Math. Сотр. 25, 1-32. Bramble, J.H. and A.H. Schatz A976), Estimates for spline projections, Rev. Francaise Automat. Informal. Recherche Operationneile Sir. Rouge Anal. Numer. 10, 5-37. Bramble, J.H. and A.H. Schatz A977), Higher order local accuracy by averaging in the finite element method, in: C. de Boor, ed., Mathematical Aspects of Finite Elements in Partial Differential Equations (Academic Press, New York), 1-14. Bramble, J.H. and R. Scott A978), Simultaneous approximation in scales of Banach spaces, Math. Сотр. 32, 947-954. Bramble, J.H. and V. Thomee A974), Interior maximum norm estimates for some simple finite element methods, Rev. Frangaise Automat. Informat. Recherche Operationneile Ser. Rouge Anal. Numer. R-2, 5-18. Bramble, J.H. and M. Zlamal A970), Triangular elements in the finite element method, Math. Сотр. 24, 809-820. Brezis, H. A983), Analyse Fonctionnelle (Masson, Paris). Brezis, H. and G. Stampacchia A968), Sur la regularite de la solution d'inequations elliptiques, Bull. Soc. Math. France 96, 153-180. Brezis, H. and G. Stampacchia A973), Une nouvelle methode pour l'etude d'ecoulements stationnaires, C.R. Acad. Sci. Paris Ser A 276, 129-132. Brezzi, F., W.W. Hager and P.-A. Raviart A977), Error estimates for the finite element solution of variational inequalities, Part 1: Primal theory, Numer. Math. 28, 431—443. Brezzi, F., С Johnson and В. Mercier A977), Analysis of a mixed finite element method for elasto-plastic plates, Math. Сотр. 31, 809-817. Brezzi, F. and L.D. Marini A975), On the numerical solution of plate bending problems by hybrid methods, Rev. Francaise Automat. Informat. Recherche Operationneile Ser. Rouge Anal. Numer. R-3, 5-50. Brezzi, F. and P.-A. Raviart A976), Mixed finite element methods for 4th order elliptic equations, Rapport interne No. 9, Centre de Mathematiques Appliquees, Ecole Polytechnique, Palaiseau. Bristeau, M.-O. A975), Application de la methode des elements finis a la resolution numerique d'inequations variationnelles d'evolution de type Bingham, Doctoral Thesis (Зыпе Cycle), Universite Pierre et Marie Curie, Paris, France. Carey, G.F. A976), An analysis of finite element equations and mesh subdivisions, Comput. Methods Appl. Mech. Engrg. 9, 165-179. Carlson, R.E. and C.A. Hall A973), Error bounds for bicubic spline interpolation, J. Approximation Theory 4, 41-47. Cartan, Н. A967), Calcul Differentiel (Hermann, Paris). Casas, E. A985), L2 estimates for the finite element method for the Dirichlet problem with singular data, Numer. Math. 47, 627-632. Cavendish, J.C., W.J. Gordon and C.A. Hall A976), Ritz-Galerkin approximations in blending function spaces, Numer. Math. 26, 155-178. СЁА, J. A964), Approximation variationnelle des problemes aux limites, Ann. Inst. Fourier (Grenoble) 14, 345^44. Cea, J. A976), Approximation variationnelle; convergence des elements finis; un test, in: Journies Elements Finis (Universite de Rennes, Rennes).
318 KG. Ciaflel CermAk., L. A983a), The finite element solution of second order elliptic problems with the Newton boundary condition, Apl. Mat. 28, 430-456. CermAk, L. A983b), A note on a discrete form of Friedrichs' inequality, Apl. Mat. 28, 457-466. Chernuka, M.W., G.R. Cowper, G.M. Lindberg and M.D. Olson A972), Finite element analysis of plates with curved edges, Internat. J. Numer, Methods Engrg. 4, 49-65. Choquet-Bruhat, Y. A973), Distributions, Theorie et Problemes (Masson, Paris). Chou, S.-I. and С.-С. Wang A979), Error estimates of finite element approximations for problems in linear elasticity, Part 1: Problems in elastostatics, Arch. Rational Mech. Anal. 72, 41-60. Ciarlet, P.G. A968), An 0(h2) method for a non-smooth boundary value problem, Aequationes Math. 2, 39-49. Ciarlet, P.G. A970a), Discrete variational Green's function, I: Aequationes Math. 4, 74-82. Ciarlet, P.G. A970b), Discrete maximum principle for finite-difference operators, Aequationes Math. 4, 338-352. Ciarlet, P.G. A971), Fonctions de Green discretes et principe du maximum discret, Doctoral Dissertation, Universite de Paris VI, Paris, France. Ciarlet, P.G. A974a), Conforming and nonconforming finite element methods for solving the plate problem, in: G.A. Watson, ed., Conference on the Numerical Solution of Differential Equations, Lecture Notes in Mathematics 363 (Springer, Berlin) 21-31. Ciarlet, P.G. A974b), Quelques methodes d'elements finis pour le probleme d'une plaque encastree, in: R. Glowinski and J.L. Lions, eds., Computing Methods in Applied Sciences and Engineering, Lecture Notes in Computer Science 10 (Springer, Berlin) 156-176. Ciarlet, P.G. A974c), Sur I'element de Clough et Tocher, Rev. Francaise Automat. Informal. Recherche Op'erationnelle Ser. Rouge Anal. Numer. R-2, 19-27. Ciarlet, P.G. A975), Lectures on the finite element method, Tata Institute of Fundamental Research, Bombay. Ciarlet, P.G. A976), Numerical Analysis of the Finite Element Method, Seminaire de Mathematiques Superieures (Presses de fUniversite de Montreal, Montreal, Que.). Ciarlet, P.G. A978), The Finite Element Method for Elliptic Problems (North-Holland, Amsterdam). Ciarlet, P.G. A983), Introduction a TAnalyse Numerique Matricielle et a I'Optimisation (Masson, Paris); English translation: Introduction to Numerical Linear Algebra and Optimization (Cambridge University Press, Cambridge 1989). Ciarlet, P.G. A988), Mathematical Elasticity I: Three-Dimensional Elasticity (North-Holland, Amsterdam). Ciarlet, P.G. A990), Mathematical Elasticity II: Lower-Dimensional Theories of Plctes and Rods (North-Holland, Amsterdam). Ciarlet, P.G. and P. Destuynder A979), A justification of the two-dimensional line?' - .te model, J. Mecanique 18, 315-344. Ciarlet, P.G. and S. Kesavan A980), Two-dimensional approximation of three-dimei..,onal eigenvalue problems in plate theory, Comput. Methods Appl. Mech. Engrg. 26, 149-172. Ciarlet, P.G., F. Natterer and R.S. Varga A970), Numerical methods of high-order accuracy for singular nonlinear boundary value problems, Numer. Math. 15, 87-99. Ciarlet, P.G. and P. Rabier A980), Les Equations de von Karman, Lecture Notes in Mathematics 826 (Springer, Berlin). Ciarlet, P.G. and P.-A. Raviart A972a), General Lagrange and Hermite interpolation in R" with applications to finite element methods, Arch. Rational Mech. Anal. 46, 177-199. Ciarlet, P.G. and P.-A. Raviart A972b), Interpolation theory over curved elements, with applications to finite element methods, Comput. Methods Appl. Mech. Engrg. 1, 217-249. Ciarlet, P.G. and P.-A. Raviart A972c), The combined effect of curved boundaries and numerical integration in isoparametric finite element methods, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 409-474. Ciarlet, P.G. and P.-A. Raviart A973), Maximum principle and uniform convergence for the finite element method, Comput. Methods Appl. Mech. Engrg. 2, 17-31.
References 319 Ciarlet, P.G. and R.S. Varga A970), Discrete variational Green's function, II: One dimensional problem, Numer. Math. 16, 115-128. Ciarlet, P.G. and С Wagschal A971), Multipoint Taylor formulas and applications to the finite element method, Numer. Math. 17, 84-100. Ciavaldini, J.F. and M. Crouzeix A985), A finite element method scheme for one dimensional elliptic equations with high super convergence at the nodes, Numer. Math. 46, 417-427. Ciavaldini, J.F. and J.C. Nedelec A974), Sur l'element de Fraeijs de Veubeke et Sander, Rev. Francaise Automat. Informed. Recherche Operationnelle Sir. Rouge Anal. Numer. R-2, 29-45. Ciavaldini, J.F. and G. Tournemine A977), A finite element method to compute stationary steady state flows in the hodograph plane, J. Indian Math. Soc. 41, 69-89. Clement, P. A974), Methode des elements finis appliquee a des problemes variationnels de type indefini. Doctoral Thesis, Ecole Polytechnique Federale de Lausanne, Lausanne, Switzerland. Clement, P. A975), Approximation by finite element functions using local regularization, Rev. Francaise Automat. Informal. Recherche Operationnelle Ser. Rouge Anal. Numer. R-2, 77-84. Clough, R.W. A960), The finite element method in plane stress analysis, in: Proceedings Second ASCE Conference on Electronic Computation, Pittsburgh, PA. Clough, R.W. and J.L. Tocher A965), Finite element stiffness matrices for analysis of plates in bending, in: Proceedings Conference on Matrix Methods in Structural Mechanics, Wright Patterson A.F.B., Dayton, OH, 515-545. Coatmelec, С A966), Approximation et interpolation des fonctions differentiables de plusieurs variables, Ann. Sci. Ecole Norm. Sup. 83, 271-341. Cortey-Dumont, P. A983), Contribution a I'approximation des inequations variationelles en norme L", C.R. Acad. Sci. Paris Ser. I. Math. 296, 753-756. Courant, R. A943), Variational methods for the solution of problems of equilibrium and vibrations, Bull. Amer. Math. Soc. 49, 1-23. Courant, R. and D. Hilbert A953), Methods of Mathematical Physics, I (Interscience, New York). Courant, R. and D. Hilbert A962), Methods of Mathematical Physics, II (Interscience, New York). Crouzeix, M. and A. Mignot A984), Analyse Numerique des Equations Dijferentielles (Masson, Paris). Crouzeix, M. and P.-A. Raviart A973), Conforming and nonconforming finite element methods for solving the stationary Stokes equations I, Rev. Francaise Automat. Informal. Recherche Operationnelle Ser. Rouge Anal. Numer. R-3, 33-76. Crouzeix, M. and J.M. Thomas A973), Elements finis et problemes elliptiques degeneres, Rev. Francaise Automat. Informat. Recherche Operationnelle Ser. Rouge Anal. Numer. R-3, 77-104. Crouzeix, M. and V. Тномёе A987), The stability in Lp and Wlp of the L2 projection onto finite element function spaces, Math. Сотр. 48, 521-532. Dailey, J.W. and J.G. Pierce A972), Error bounds for the Galerkin method applied to singular and nonsingular boundary value problems, Numer. Math. 19, 266-282. Dautray, R. and J.-L. Lions A984), Analyse Mathematique et Calcul Numerique pour les Sciences et les Techniques, Collection du Commissariat a l'Energie Atomique (Masson, Paris). Davis, P.J. and P. Rabinowitz A975), Methods of Numerical Integration (Academic Press, New York). Deleze, M. and J.-J. Goel A976), Tetraedre comme element fmi de classe C1, a seize parametres, contenant les polynomes de degre deux, Report, Institut de Mathematiques, Universite de Fribourg, Fribourg, France. Deny, J. and J.L. Lions A953-1954), Les espaces du type de Beppo Levi, Ann. Inst. Fourier (Grenoble) 5, 305-370. Descloux, J. A973), Two basic properties of finite elements, Report, Departement de Mathematiques, Ecole Polytechnique Federale de Lausanne, Switzerland. Descloux, J. A975), Interior regularity and local convergence of Galerkin finite element approximations for elliptic equations, in: J.J.H. Miller, ed., Topics in Numerical Analysis II (Academic Press, New York) 27-41. Descloux, J. A977), Interior L" estimates for Galerkin approximations of elliptic equations, in: Proceedings Symposium on the Mathematical Aspects of the Finite Element Methods, Rome, Italy. Descloux, J. and N. Nassif A977), Interior Lm estimates for finite element approximations of solutions
320 P.G. Ciarlet of elliptic equations, in: I. Galligani and E. Magenes, eds., Mathematical Aspects of Finite Element Methods, Lecture Notes in Mathematics 606 (Springer, Berlin) 56-63. Destuynder, P. A981), Comparaison entre les modeles tridimensionnels et bidimensionnels de plaques en elasticite, RAIRO Anal. Numer. 15, 331-369. Destuynder, P. A986), Une Thiorie Asymptotique des Plaques Minces en Elasticite Lineaire (Masson, Paris). Dhatt, G. and G. Touzot A984), The Finite Element Method Displayed (Wiley, New York). Dieudonne, J. A967), Fondements de ('Analyse Moderne (Gauthier-Villars, Paris). Dieudonne, J. A982), Elements d'Analyse 9 (Gauthier-Villars, Paris). di Guglielmo, F. A971), Resolution approchee de problemes aux limites elliptiques par des schemas aux elements finis a plusieurs fonctions arbitraires, Calcolo 8, 185-213. Di Guglielmo, F. A971), Resolution approchee de problemes aux limites elliptiques par des schemas aux elements finis a plusieurs fonctions arbitraires, Calcolo 8, 185-213. Douglas Jr, J. and T. Dupont A973), Superconvergence for Galerkin Methods for the two point boundary problem via local projections, Numer. Math. 21, 270-278. Douglas, Jr, J. and T. Dupont A974), Galerkin approximations for the two point boundary problem using continuous, piecewise polynomial spaces, Numer. Math. 22, 99-109. Douglas Jr, J. and T. Dupont A976), Interior penalty procedures for elliptic and parabolic Galerkin methods, in: R. Glowinski and J.L. Lions, ed., Computing Methods in Applied Sciences, Lecture Notes in Physics 58 (Springer, Berlin) 207-216. Douglas Jr, J., T. Dupont, P. Percell and R. Scott A979), A family of C1 finite elements with optimal approximation properties for various Galerkin methods for 2nd and 4th order problems, Rev. Francaise Automat. Informal. Recherche Operationelle Sir. Rouge Anal. Numer. R-13, 227-255. Douglas Jr, J., T. Dupont and L. Wahlbin A975a), The stability in L" of the L2 -projection into finite element function spaces, Numer. Math. 23, 193-197. Douglas Jr, J., T. Dupont and L. Wahlbin A975b), Optimal La error estimates for Galerkin approximations to solutions of two point boundary value problems, Math. Сотр. 29, 475-483. Douglas Jr, J., T. Dupont and M.F. Wheeler A974a), A Galerkin procedure for approximating the flux on the boundary for elliptic and parabolic boundary value problems, Rev. Francaise Automat. Informat. Recherche Opirationnelle Sir. Rouge Anal. Numer. R-2, 47-59. Douglas Jr, J., T. Dupont and M.F. Wheeler A974b), An L°° estimate and a super-convergence result for a Galerkin method for elliptic equations based on tensor products of piecewise polynomials, Rev. Francaise Automat. Informat. Recherche Opirationnelle Sir. Rouge Anal. Numer. R-2, 61-66. Duchon, J. A976), Interpolation des fonctions de deux variables suivant le principe de la flexion des plaques minces, Rev. Francaise Automat. Informat. Recherche Opirationnelle Sir. Rough Anal. Numer. 10 A2), 5-12. Duchon, J. A977), Splines minimizing rotation: invariant semi-norms in Sobolev spaces in: Constructive Theory of Functions of Several Variables, Lecture Notes in Mathematics 571 (Springer, Berlin) 85-100. Dunavant, D.A. A985), High degree efficient symmetrical Gaussian quadrature rules for the triangle, Internat. J. Numer. Meth. Engrg. 21, 1129-1148. Dunavant, D.A. A986), Efficient symmetrical cubature rules for complete polynomials of high degree over the unit cube, Internat. J. Numer. Meth. Engrg. 23, 397-407. Dupont, T. and R. Scott A978), Constructive polynomial approximation in Sobolov spaces, in: C. de Boor and G. Golub, eds., Recent Advances in Numerical Analysis (Academic Press, New York) 31^44. Dupont, T. and R. Scott A980), Polynomial approximation of functions in Sobolov spaces, Math. Сотр. 34, 441-463. Dupuis, G. and J.-J. Goel A970), Finite elements with a high degree of regularity, Internat. J. Numer. Methods Engrg. 2, 563-577. DurAn, R.G. A987), Quasi-optimal estimates for finite element approximations using Orlicz norms, Math. Сотр. 49, 17-23. DUVAUT, G. and J.L. Lions A972), Les Iniquations en Mecanique et en Physique (Dunod, Paris). EKELAND, 1. and R. Темам A974), Analyse Convexe et Problemes Variationals (Dunod, Paris). El-Zafrany, A. and R.A. Cookson A986), Derivation of Lagrangian and Hermitian shape functions for quadrilateral finite elements, Internat. J. Numer. Meth. Engrg. 23, 1939-1958.
References 321 Ergatoudis, I., B.M. Irons and O.C. Zienkiewicz A968), Curved, isoparametric, "quadrilateral" elements for finite element analysis, Internat. J. Solids and Structures 4, 31-42. Eriksson, K. A985a), Improved accuracy by adapted mesh-refinements in the finite element method, Math. Сотр. 44, 321-343. Eriksson, K. A985b), Finite element methods of optimal order for problems with singular data, Math. Сотр. 44, 345-360. Eriksson, K. A985c), Higher order local rate of convergence by mesh refinement in the finite element method, Math. Сотр. 45, 109-142. Eriksson, K. and V. Тномёе A984), Galerkin methods for singular boundary value problems in one space dimension, Math. Сотр. 42, 345-367. Falk, R.S. A974), Error estimates for the approximation of a class of variational inequalities, Math. Сотр. 28, 963-971. Falk, R.S. A975), Approximation of an elliptic boundary value problem with unilateral constraints, Rev. Francaise Automat. Informal. Recherche Operationnelle Ser. Rouge Anal. Numer. R-2, 5-12. Falk, R.S. and B. Mercier A977), Error estimates for elasto-plastic problems, Rev. Francaise Automat. Informal. Recherche Operationnelle Sir. Rouge Anal. Numer. 11, 135-144. Felippa, C.A. A966), Refined finite element analysis of linear and nonlinear two-dimensional structures, Doctoral Thesis, University of California, Berkeley, CA. Felippa, C.A. and R.W. Clough A970), The finite element method in solid mechanics, in: G. Birkhoff and R.S. VARGA, eds., Numerical Solution of Field Problems in Continuum Mechanics (American Mathematical Society, Providence, RI) 210-252. Fichera, G. A972a), Existence theorems in elasticity, Handbuch der Physik VIa/2 (Springer, Berlin) 347-389. Fichera, G. A972b), Boundary value problems of elasticity with unilateral constraints, Handbuch der Physik VIa/2 (Springer, Berlin) 391-424. Finzi-Vita, S. A982), L50-error estimates for variational inequalities with Holder continuous obstacle, RA1RO Anal. Numer. 16, 27-37. Fix, G J. A969), Higher-order Rayleigh-Ritz approximations, J. Math. Mech. 18, 645-658. Fix, G.J. A972a), On the effects of quadrature in the finite element method, in: J.T. Oden, R.W. Clough and Y. Yamamoto, eds., Advances in Computational Methods in Structural Mechanics and Design (The University of Alabama Press, Huntsville, AL) 55-68. Fix, G.J. A972b), Effects of quadrature errors in finite element approximation of steady state, eigenvalue and parabolic problems, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 525-556. Fix, G.J., S. Gulati and G.I. Wakoff A973), On the use of singular functions with finite element approximations, J. Comput. Phys. 13, 209-228. Fix, G.J. and G. Strang A969), Fourier analysis of the finite element method in Ritz-Galerkin theory, Stud. Appl. Math. 48, 265-273. Fortin, M. A972), Calcul numerique des ecoulements des fluides de Bingham et des fluides Newtoniens incompressibles par des methodes d'elements finis, Doctoral Thesis, Universite de Paris VI, France. Fortin, M. A985), A three-dimensional quadratic nonconforming element, Numer. Math. 46, 269-279. Fortin, M. and M. Soulie A983), A non-conforming piecewise quadratic finite element on triangles, Internat. J. Numer. Methods Engrg. 19, 505-520. Fraeus de Veubeke, B. A979), A Course in Elasticity (Springer, New York). Fraeus de Veubeke, B. A984), Variational principles and the patch test, Internat. J. Numer. Methods Engrg. 8, 783-801. Frehse, J. and R. Rannacher A976), Eine L'-Fehlerabschatzung fur diskrete Grundlosungen in der Methode der finite elemente, Bonner Math. Schriften 89, 92-114. Frehse, J. and R. Rannacher, A978), Asymptotic L°°-error estimates for linear finite element approximations of quasilinear boundary value problems, SI AM J. Numer. Anal 15, 418—431. Fremond, M. A971), Etude de structures visco-elastiques stratifees soumises a des charges harmoniques, et de solides elastiques reposant sur ces structures, Doctoral Thesis, Universite Pierre et Marie Curie (Paris VI), Paris, France. Fremond, M. A972), Utilisation de la dualite en elasticite. Complements sur les energies de Reissner.
322 P.G. Ciariet Equilibre d'une dalle elastique reposant sur une structure stratifiee, Annaies de Flnstitut Technique du Batiment et des Travaux Publics, Supplement au No. 294, 54-66. French, D.A. A987), The finite element method for a degenerate elliptic equation, SI AM J. Numer. Anal. 24, 788-815. Fried, I. A980), On the optimality of pointwise accuracy of the finite element solution, Internat. J. Numer. Methods Engrg. 15, 451-456. Fried, I., Yang, S.K. A972), Best finite elements distribution around a singularity, AIAA J. 10, 1244-1246. Friedrichs, K.O. and H.B. Keller A966), A finite difference scheme for generalized Neumann problems, in: J.H. Bramble, ed., Numerical Solution of Partial Differential Equations (Academic Press, New York) 1-19. Gabay, D. and B. Mercier A976), A dual algorithm for the solution of nonlinear variational problems via finite element approximation, Comput. Math. Appl. 2, 17—40. Gallagher, H. A975), Finite Element Analysis: Fundamentals, (Prentice-Hall, Englewood Cliffs, NJ). Gastaldi, L. and R. Nochetto A987), Optimal L°°-error estimates for nonconforming and mixed finite element methods of lowest order, Numer. Math. 50, 587-611. George, P.L. A986), MODULEF: Generation automatique de maillages, Collection Didactique, INRIA, Rocquencourt, France. Germain, P. A972), Mecanique des Milieux Continus I (Masson, Paris). Germain, P. A986a), Mecanique I, Ecole Polytechnique, Palaiseau, and Ellipses, Paris, France. Germain, P. A986b), Mecanique II, Ecole Polytechnique, Palaiseau, and Ellipses, Paris, France. Gilbarg, D. and N.S. Trudinger A983), Elliptic Partial Differential Equations of Second Order (Springer, New York, 2nd ed.). Girault, V. A976), Nonelliptic approximation of a class of partial differential equations with Neumann boundary conditions, Math. Сотр. 30, 68-91. Girault, V. and P.-A. Raviart A986), Finite Element Methods for Navier-Stokes Equations (Springer, Berlin). Glowinski, R. A984), Numerical Methods for Nonlinear Variational Problems (Springer, New York). Glowinski, R. and H. Lanchon A973), Torsion elasto-plastique d'une barre cylindrique de section multi-connexe, J. Mecanique 12, 151-171. Glowinski, R., J.-L. Lions and R. Tremolieres A976a), Analyse Numerique des Inequations Variationnelles 1: Theorie Generate, Premieres Applications (Dunod, Paris). Glowinski, R., J.-L. Lions and R. Tremolieres A976b), Analyse Numerique des Inequations Variationnelles 2: Applications aux Phenomenes Stationnaires et d'Evolution (Dunod, Paris). Glowinski, R., L.D. Marini and M. Vidrascu A984), Finite-element approximations and iterative solutions of a fourth-order elliptic variational inequality, IMA J. Numer. Anal. 4, 127-167. Gopalacharyulu, S. A973), A higher order conforming, rectangular plate element, Internat. J. Numer. Methods Engrg. 6, 305-309. Gordon, W.J. and C.A. Hall A973), Transfinite element methods: Blending-function interpolation over arbitrary curved element domains, Numer. Math. 21, 109-129. GOUT, J.L. A977), Estimation de l'erreur d'interpolation d'Hermite dans R". Numer. Math. 28,407-429. Gout, J.L. A979a), Interpolation error estimates on Hermite rational "Wachspress type" 3rd-degree finite element, Comput. Math. Appl. 5, 249-257. Gout, J.L. A979b), Construction of an Hermite rational interpolation Wachspress type finite element, Comput. Math. Appl. 5, 337-347. Gout, J.L. and A. Guessab A986a), Sur les formules de quadrature numerique a nombre minimal de noeuds d'integration, Numer. Math. 49, 439-^55. Gout, J.L. and A. Guessab A986b), Exemples de formules de quadrature numerique a nombre minimal de noeuds sur des domaines a double symetrie axiale, Modelisation Math. Anal. Numer. 20, 287- 314. Grisvard, P. A976), Behavior of the solutions of an elliptic boundary value problem in a polygonal or polyhedral domain, in: B. Hubbard, ed., Numerical Solution of Partial Differential Equations, III (SYNSPADE 1975) (Academic Press, New York) 207-274. Grisvard, P. A985), Elliptic Problems in Nonsmooth Domains (Pitman, Boston, MA).
References 323 Grisvard, P. A987), Singularities in elasticity theory, in: P.G. Ciarlet and M. Roseau, eds., Applications of Multiple Scaling in Mechanics (Masson, Paris). Guessab, A. A986), Cubature formulae which are exact on spaces P intermediate between Pk and Qk, Numer. Math. 49, 561-576. Guessab, A. A987), Sur les formules de quadrature numerique dans R" avec certains noeuds ayant une composante connue, Appl. Anal. 26, 129-144. Gurtin, M.E. A972), The linear theory of elasticity, in: S. FlOgge and C. Truesdell, eds., Handbuch der Physik VIa/2 (Springer, Berlin) 1-295. Haber, S. A970), Numerical evaluation of multiple integrals, SI AM Rev. 12, 481-526. Hanssen, L., T.G. Syvertsen and P.G. Bergan A978), Stiffness derivation based on element convergence requirements, in: J.R. Whiteman, ed., The Mathematics of Finite Elements and Applications (Academic Press, London) 83-96. Harvey, J.W. and S. Kelsey A971), Triangular plate bending elements with enforced compatibility, AIAA J. 9, 1023-1026. Haverkamp, R. A984), Eine Aussage zur L^-Stabilitat und zur genauen Konvergenzordnung der Hj-Projektionen, Numer. Math. 44, 393-405. Hedstrom, G.W. and R.S. Varga A971), Application of Besov spaces to spline approximation, J. Approximation Theory 4, 295-327. Helfrich, H.-P. A976), Charakterisierung des K-Funktionales zwischen Hilbertfaumen und nicht- uniforme Fehlerschranken, Bonner Math. Schriften 89, 31-41. Herbold, R.J. A968), Consistent quadrature schemes for the numerical solution of boundary value problems by variational techniques, Doctoral Thesis, Case Western Reserve University, Cleveland, OH. Herbold, R.J., M.H. Schultz and R.S. Varga A969), The effect of quadrature errors in the numerical solution of boundary value problems by variational techniques, Aequationes Math. 3, 247-270. Herbold, R.J. and R.S. Varga A972), The effect of quadrature errors in the numerical solution of two-dimensional boundary value problems by variational techniques, Aequationes Math. 7, 36-58. Hilbert, S. A973), A mollifier useful for approximations in Sobolev spaces and some applications to approximating solutions of differential equations, Math. Сотр. 27, 81-89. Hohn, W. and H.D. Mittelmann A981), Some remarks on the discrete maximum principle for finite elements of higher order, Computing 27, 145-154. Hoppe, V. A973), Finite elements with harmonic interpolation functions, in: J.R. Whiteman, ed., The Mathematics of Finite Elements and Applications (Academic Press, London) 131-142. HOrmander, L. A983), The Analysis of Partial Differential Equations I, Grundlehren der Mathematischen Wissenschaften 256 (Springer, Berlin). Hrabok, M.M. and T.M. Hrudey A984), A review and catalogue of plate bending finite elements, Comput. & Structures 19, 479-495. Hughes, T.J.R. A987), The Finite Element Method: Linear Static and Dynamic Finite Element Analysis (Prentice-Hall, Englewood Cliffs, NJ). Irons, B. and S. Ahmad A979), Techniques of Finite Elements (Ellis Horwood, Chichester). Irons, B.M. and M.J. Loikkanen A983), An engineers' defence of the patch test, Internat. J. Numer. Methods Engrg. 19, 1391-1401. Irons, B.M. and A. Razzaque A972a), Experience with the patch test for convergence of finite elements, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 557-587. Irons, B.M. and A. Razzaque A972b), Shape function formulations for elements other than displacement models, presented at the International Conference on Variational Methods in Engineering, Southampton, UK. Jacquotte, O.-P. A985), Stability, accuracy, and efficiency of some underintegrated methods in finite element computations, Comput. Methods Appl. Meek Engrg. 50, 275-293. Jacquotte, O.-P. and J.T. Oden A984), Analysis of hourglass instabilities and control in underintegrated finite element methods, Comput. Methods Appl. Mech. Engrg. 44, 339-363. Jacquotte, O.-P., J.T. Oden and E.B. Becker A986), Numerical control of the hourglass instability, Internat. J. Numer. Methods Engrg. 22, 219-228.
324 P.G. Ciarlet Jamet, P. A976a), Estimation d'erreur pour des elements finis droits presque degeneres, Rev. Francaise Automat, Informal. Recherche Operationnelle Ser. Rouge Anal. Numer. 10, 43-61, Jamet, P. A976b), Estimation de l'erreur d'interpolation dans un domaine variable et application aux elements finis quadrilateraux degeneres, in: Methodes Numeriques en Mathematiques Appliquees (Presses de l'Universite de Montreal, Montreal, Que.) 55-100. Jesperssen, D. A978), Ritz-Galerkin methods for singular boundary value problems, SIAM J. Numer. Anal 15, 813-834. Johnson, C. A987), Numerical Solutions of Partial Differential Equations by the Finite Element Method (Cambridge University Press, Cambridge). Kang, F. A979), On the theory of discontinuous finite elements, Math. Numer. Sinica 1, 378-385. Kardestuncer, H. and D.H. Norrie, eds. A987), Finite Element Handbook (McGraw-Hill, New York). Keast, P. A986), Moderate-degree tetrahedal quadrature formulas, Comput. Methods Appl. Mech. Engrg. 55, 339-348. Khoan, V.-K. A972a), Distributions, Analyse de Fourier, Opirateurs aux Derivees Partielles 1 (Vuibert, Paris). Khoan, V.-K. A972b): Distributions, Analyse de Fourier, Operateurs aux Derivees Partielles 2 (Vuibert, Paris). KlKUCHI, F. A975a), Approximation in finite element models, Rept. No. 531, Institute of Space and Aeronautical Science, University of Tokyo. Kikucki, F. A975b), Convergence of the ACM finite element scheme for plate bending problems, Publ. Res. Inst. Math. So'., Kyoto Univ. 11, 247-265. Kikuchi, F. A975c), On a finite element scheme based on the discrete Kirchoff assumption, Numer. Math. 24, 211-231. Kikucki, F. A976), Theory and examples of partial approximation in the finite element method, Internal. J. Numer. Methods Engrg. 10, 115-122. Kinderlehrer, D. and G. Stampacchia A980), An Introduction to Variational Inequalities and Their Applications (Academic Press, New York). King, J.T. A976), New error bounds for the penalty method and extrapolation, Numer. Math. 23, 153-165. Knops, R.J. and L.E. Payne A971), Uniqueness Theorems in Linear Elasticity, Springer Tracts in Natural Philosophy 19 (Springer, Berlin). Кон, B.C. and N. Kikuchi A987), New improved hourglass control for bilinear and trilinear elements in anisotropic linear elasticity, Comput. Methods Appl. Mech. Engrg. 65, 1-46. Kondratev, V.A. A967), Boundary value problems for elliptic equations in domains with conical or angular points, Trudy Moskov. Mat. Obshch. 16, 209-292. Koukal, S. A973), Piecewise polynomial interpolations in the finite element method, Apt. Math. 18, 146-160. KrIZEK, M. and P. NeittaanmAki A984), Superconvergence phenomenon in the finite element method arising from averaging gradients, Numer. Math. 45, 105-116. LadyZenskaja, O.A. and N.N. Urauceva A968), Linear and Quasilinear Elliptic Equations (Academic Press, New York). Lanchon, H. A972), Torsion elastoplastique d'un arbre cylindrique de section simplement ou multiplement connexe, Doctoral Thesis, Universite Pierre et Marie Curie (Paris VI), France. Landau, L. and E. Lifchitz A967), Theorie de lElasticite (Mir, Moscow). Lascaux, P. and P. Lesaint A975), Some nonconforming finite elements for the plate bending problem, Rev. Francaise Automat. Informat. Recherche Operationnelle Ser. Rouge Anal. Numer R-l, 9-53. Lax, P.D. and A.N. Milgram A954), Parabolic equations, in: Annals of Mathematics Studies 33 (Princeton University Press, Princeton, NJ) 167-190. Laydi, M. and P Lesaint A985), Sur les elements finis rationnels de Serendip de degre quelconque, Numer. Math. 46, 175-187. Leaf, G.K. and H.G. Kaper A974), L°°-error bounds for multivariate Lagrange approximation, SIAM J. Numer. Anal. 11, 363-381. Leaf, G'K., H.G. Kaper and A.J. Lindeman A976), Interpolation and approximation properties of rational coordinates over quadrilaterals, J. Approximation Theory 16, 1-15.
References 325 Lebaud, G. A969), Sur I'approximation de l'equation Au= -2,Ui Di(\Diu\p~2Diu)= T, Rend. Mat. 2, 443-471. Lebaud, G. and P.-A. Raviart A969), Sur I'approximation du probleme de Dirichlet pour les operateurs elliptiques d'ordre 2, Rend. Mat. 2, 507-562. le Mehaute, A. A981), Taylor interpolation of order n at the vertices of a triangle. Applications to Hermite interpolation and finite elements, in: Approximation Theory and Applications (Academic Press, New York) 171-185. le Mehaute, A. A984), Interpolation et approximation par des fonctions polynomials par morceaux dans R", Doctoral Dissertation, Universite de Rennes, France. Lenoir, M. A986), Optimal isoparametric finite elements and error estimates for domains involving curved boundaries, SIAM J. Numer. Anal. 23, 562-580. Lesaint, P. A976), On the convergence of Wilson's nonconforming element for solving the elastic problem, Comput. Methods Appl. Mech. Engrg. 7, 1-16. Lesaint, P. and M. Zlamal A979), Superconvergence of the gradient of finite element solutions, RAIRO Anal. Numer. 13, 139-166. Lesaint, P. and M. Zlamal A980), Convergence of the nonconforming Wilson element for arbitrary quadrilateral meshes, Numer. Math. 36, 33-52. Levine, N. A985), Superconvergent recovery of the gradient from piecewise linear finite element approximations, IMA J. Numer. Anal. 5, 407-427. Lewy, H. and G. Stampacchia A969), On the regularity of the solution of a variational inequality, Comm. Pure Appl. Math. 22, A969) 153-188. Li, Z.-C. A986), A nonconforming combined method for solving Laplace's boundary value problems with singularities, Numer. Math. 49, 475-497. Li, Z.-C, R. Mathon and P. Sermer A987), Boundary methods for solving elliptic problems with singularities and interfaces, SIAM J. Numer. Anal. 24, 487-498. Lin, Q. and T. Lu A984a), Asymptotic expansions for finite element approximation of elliptic problems on polygonal domains, in: R. Glowinski and J.L. Lions, eds., Computing Methods in Applied Sciences and Engineering VI (North-Holland, Amsterdam) 317-321. Lin, Q. and T. Lu A984b), Asymptotic expansions for finite element eigenvalues and finite element solutions, Bonner Math. Schriften 158, 1-10. Lin, Q., T. Lu and S. Shen A983), Asymptotic expansion for finite element approximations, Research Rept. IMS-11, Chengdu Branch of Academia Sinica. Lin, Q. and Q. Zhu A984), Asymptotic expansion for the derivative of finite elements, J. Comput. Math 2, 361-363. Lin, Q. and Q. Zhu A985), Linear finite elements with high accuracy, J. Comput. Math. 3, 115-133. Lin, Q. and Q. Zhu A986), Local asymptotic expansion and extrapolation for finite elements, J. Comput. Math. 4, 263-265. Lions, J.L. A962), Problemes aux Limiles dans les Equations aux Derivees Partielles (Presses de I'Universite de Montreal, Montreal, Que.). Lions, J.L. A965), Problemes aux Limites dans les Equations aux Derivees Partielles (Presses de I'Universite de Montreal, Montreal, Que.). Lions, J.L. A969), Quelques Methodes de Resolution des Problemes aux Limites Non Lineaires (Dunod, Paris). Lions, J.L. and E. Magenes A968), Problemes aux Limites non Homogenes et Applications, 1 (Dunod, Paris). Lions, J.L. and J. Peetre A964), Sur une classe d'espaces d'interpolation, Publ. Math. Inst. Hautes Etudes Set 19, 5-68. Lions, J.L. and G. Stampacchia A967), Variational inequalities, Comm. Pure Appl. Math. 20,493-519. Liu, W.K. and T. Belytschko A984), Efficient linear and nonlinear heat conduction with a quadrilateral element, Internat. J. Numer. Methods Engrg. 20, 931-948. Lorenz, J. A977), Zur Inversmonotonie diskreter Probleme, Numer. Math. 27, 227-238. Louis, A. A979), Acceleration of convergence for finite element solutions of the Poisson equation, Numer. Math. 33, 43-53. LukAs, I.L. A974), Curved boundary elements: General forms of polynomial mappings, in: J.T. Oden et
326 P.G. Ciarlet al., eds., Computational Methods in Nonlinear Mechanics (Texas Institute for Computational Mechanics, Austin, TX) 37-46. Mansfield, L.E. A971), On the optimal approximation of linear functional in spaces of bivariate functions, SI AM J. Numer. Anal. 8, 115-126. Mansfield, L.E. A972a), Optimal approximation and error bounds in spaces of bivariate functions, /. Approximation Theory 5, 77-96. Mansfield, L.E. A972b), On the variational characterization and convergence of bivariate splines, Numer. Math. 20, 99-114. Mansfield, L.E. A974), Higher order compatible triangular finite elements, Numer. Math. 22, 89-97. Mansfield, L.E. A978), Approximation of the boundary in the finite element solution of fourth order problems, Si AM J. Numer. Anal. 15, 568-579. McLeod, R. and A.R. Mitchell A972), The construction of basis functions for curved elements in the finite element method, J. Inst. Math. Appl. 10, 382-393. McLeod, R. and A.R. Mitchell A975), The use of parabolic arcs in matching curved boundaries in the finite element method, J. Inst. Math. Appl. 16, 239-246. Meinguet, J. A975), Realistic estimates for generic constants in multivariate pointwise approximation, in: J J.H. Miller, ed., Topics in Numerical Analysis II (Academic Press, New York) 89-107. Meinguet, J. A977), Structure et estimations de coefficients d'erreur, RAIRO Anal. Numer. 11,355-368. Meinguet, J. A978), A practical method for estimating approximation errors in Sobolev spaces, in: D.C. Handscomb, ed., Multivariate Approximation (Academic Press, London) 169-187. Meinguet, J. A979), A convolution approach to multivariate representation formulas, in: W. Schempp and K. Zeller, eds. Multivariate Approximation Theory (Birkhauser, Basel) 198-210. Meinguet, J. A981), From Dirac distribution to multivariate representation formulas, in: Z. Ziegler, ed. Approximation Theory and Applications (Academic Press, New York) 225-248. Meinguet, J. A982), Sharp "a priori" error bounds for polynomial approximation in Sobolev spaces, in: W. Schempp and K. Zeller, eds. Multivariate Approximation Theory II (Birkhauser, Basel) 255-274. Meinguet, J. A984), A practical method for obtaining a priori error bounds in point-wise and mean-square approximation problems, in: Approximation Theory, Spline Functions, and Applications (Reidel, Dordrecht, Netherlands) 97-125. Meinguet, J. and J. Descloux A977), An operator-theoretical approach to error estimation, Numer. Math. 27, 307-326. Meisters, G.H. and C. Olech A963), Locally one-to-one mappings and a classical theorem on Schlicht functions, Duke Math. J. 30, 63-80. Melosh, R.J. A963), Basis of derivation of matrices for the direct stiffness method, AIAA J. 1,1631-1637. Mercier, B. A975a), Approximation par elements finis et resolution, par un algorithme de penalisation- dualite, d'un probleme d'elasto-plasticite, C.R. Acad. Sci. Paris, Sir. A 280, 287-290. Mercier, B. A975b), Une methode de resolution du probleme des charges Hmites utilisant les fiuides de Bingham, C.R. Acad. Sci. Paris, Ser. A 281, 525-527. Merqer, B. and G, Raugel A982), Solution of a boundary value problem in an axisymmetric domain by finite elements in r, z and Fourier series in в, RAIRO Anal. Numer. 16, 405-461. Mikhlin, S.G. A964), Variational Methods in Mathematical Physics (Pergamon, Oxford) (original Russian edition: 1957). Mikhlin, S.G. A971), The Numerical Performance of Variational Methods (Wolters-Noordhoff, Groningen, Netherlands). Miranda, С A970), Partial Differential Equations of Elliptic Type (Springer, New York). Mitchell, A.R. A973), An introduction to the mathematics of the finite element method, in: J.R. Whiteman, ed. The Mathematics of Finite Elements and Applications (Academic Press, London) 37-58. Mitchell, A.R. A976), Basis functions for curved elements in the mathematical theory of finite elements in: J.R. Whiteman, ed. The Mathematics of Finite Elements and Applications II (Academic Press, London) 43-58. Mitchell, A.R. and J.A. Marshall A975), Matching of essential boundary conditions in the finite element method, in: J.J.H. Miller, ed. Topics in Numerical Analysis II (Academic Press, New York) 109-120.
References 327 Miyoshi, Т. A972), Convergence of finite elements solutions represented by a non-conforming basis, Kumamoto J. Sci. {Math.) 9, 11-20. Mizukami, A. A986), Some integration formulas for a four-node isoparametric element, Computer Methods Appl. Meek Engrg. 59, 111-121. Mock, M.S. A976), Projection methods with different trial and test spaces, Math. Сотр. 30, 400-416. Molchanov, I.N. and E.F. Galba A985), On finite element methods for the Neumann problem, Numer. Math. 46, 587-598. Molchanov, I.N. and E.F. Galba A985), On finite element methods for the Neumann problem, Numer. Math. 46, 587-598. Morgan, J. and R. Scott A975), A nodal basis for C1 piecewise polynomials of degree и > 5, Math. Сотр. 29, 736-740. Morley, L.S.D. A968), The triangular equilibrium element in the solution of plate bending problems, Aero. Quart. 19, 149-169. Mosco, U. and F. Scarpini A975), Complementarity systems and approximations of variational inequalities, Rev. Francaise Automat. Informal. Recherche Operationnelle Sir. Rouge Anal. Numer. R-l, 5-8. Mosco, U. and G. Strang A974), One-sided approximation and variational inequalities, Bull. Amer. Math. Soc. 80 A974), 308-312. Nakao, M.T. A987), Superconvergence of the gradient of Galerkin approximations for elliptic problems, Modelisation Math. Anal. Numer. 21, 679-695. Natterer, F. A975a), Uber die punktweise Konvergenz finiter Elemente, Numer. Math. 25, 67-77. Natterer, F. A975b), Berechenbare Fehlerschranken fur die Methode der Finiten Elemente, in: International Series of Numerical Mathematics 28 (Birkhauser, Basel) 109-121. Natterer, F. A977), Uniform convergence of Galerkin's method for splines on highly nonuniform meshes, Math. Сотр. 31, 457-468. Necas, J. A967), Les Methodes Directes en Theorie des Equations Elliptiques (Masson, Paris). NeCas, J. A975), On regularity of solutions to non-linear variational inequalities for second order elliptic systems, Rend. Mat. 8, 481-498. Necas, J. and I. HlavAcek A981), Mathematical Theory of Elastic and Elasto-Plastic Bodies: An Introduction (Elsevier, Amsterdam). Nedoma, J. A979), The finite element solution of elliptic and parabolic equations using simplicial isoparametric elements, RAIRO Anal. Numer. 13, 257-289. Nicolaides, R.A. A972), On a class of finite elements generated by Lagrange interpolation, SI AM J. Numer. Anal. 9, 435-445. Nicolaides, R.A. A973), On a class of finite elements generated by Lagrange interpolation II, SI AM J. Numer. Anal. 10, 182-189. Nielson, G.M. A973), Bivariate spline functions and the approximation of linear functionals, Numer. Math. 21, 138-160. Nitsche, J.A. A968), Ein Kriterium fur die Quasi-Optimalitat des Ritzchen Verfahrens, Numer. Math. 11, 346-348. Nitsche, J.A. A969), Orthogonalreihenentwicklung nach linearen Spline-Funktionen, J. Approximation Theory 2, 66-78. Nitsche, J.A. A970), Linear Spline-Funktionen und die Methoden von Ritz fur elliptische Randwert- probleme, Arch. Rational Mech. Anal. 36, 348-355. Nitsche, J.A. A971), Uber ein Variationsprinzip zur Losung von Dirichlet-Problemen bei Verwendung von Teilraumen, die keinen Randbedingungen unterworfen sind, Abh. Math. Sem. Univ. Hamburg 36, 9-15. Nitsche, J.A. A972a), Interior error estimates of projection methods, in: Proceedings EquaDiff 3, Purkyne University, Brno, 235-239. Nitsche, J.A. A972b), On Dirichlet problems using subspaces with nearly zero boundary conditions, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 603-628. Nitsche, J.A. A974), Convergence of nonconforming methods, in: С de Boor, ed., Mathematical Aspects of Finite Elements in Partial Differential Equations (Academic Press, New York) 15-53.
328 P.G. Cianei Nitsche, J.A. A975), L„ -convergence of finite element approximation, in: Proceedings Second Conference on Finite Elements, Rennes, France. Nitsche, J.A. A976a), Der Einfiuss von Randsingularitaten beim Ritzschen Verfahren, Numer. Math. 25, 263-278. Nitsche, J.A. A976b), Ober L^-Abschatzungen von Projektionen auf finite Elemente, Bonner Math. Schriften 89, 13-30. Nitsche, J.A. A977), Lw -convergence of finite element approximations, in: I. Galugani and E. Magenes, eds., Mathematical Aspects of Finite Element Methods, Lecture Notes in Mathematics 606 (Springer, Berlin) 261-274. Nitsche, J.A. A981a), On Korn's second inequality, RAIRO Anal. Numer. 15, 237-248. Nitsche, J.A. A981b), Schauder estimates for finite element approximations of second order elliptic boundary value problems, in: Proceedings, Special Year on Numerical Analysis, College Park, MD, 290-343. Nitsche, J.A. and A.H. Schatz A974), Interior estimates for Ritz-Galerkin methods, Math. Сотр. 28, 937-958. Norrie, D.H. and G. de Vries A973), The Finite Element Method: Fundamentals and Applications (Academic Press, New York). Oden, J.T. A972), Finite Elements of Nonlinear Continua (McGraw-Hill, New York). Oden, J.T. and G.F. Carey A981-1984), Finite Elements I: An Introduction (with E.B. Becker); II: A Second Course; III: Computational Aspects; IV: Mathematical Aspects; V: Special Problems in Solid Mechanics (Prentice-Hal!, Englewood Cliffs, NJ). Oden, J.T. and J.N. Reddy A976a), An Introduction to the Mathematical Theory of Finite Elements (Wiley-Interscience, New York), Oden, J.T. and J.N. Reddy A976b), Variational Methods in Theoretical Mechanics (Springer, Heidelberg). Oganbsjan, L.A. and P.A. Rukhovets A969), Investigation of the convergence rate of variational- difference schemes for elliptic second order equations in a two-dimensional domain with a smooth boundary, Zh. Vychisl. Mat. i Mat. Fiz. 9, 1102-1120 (in Russian). Oliveira, E.R. de Arantes e A977), The patch test and the general convergence criteria of the finite element method, Internal. J. Solids Structures 13, 159-178. Panagiotopoulos, P.D. A985), Inequality Problems in Mechanics and Applications (Birkhauser, Boston, MA). Peisker, P. and D. Braess A987), A conjugate gradient method and a multigrid algorithm for Morley's finite element approximation of the biharmonic equation, Numer. Math. 50, 567-586. Percell, P. A976), On cubic and quartic Clough-Tocher finite elements, SIAM J. Numer. Anal. 13, 100-103. Pini, F. A974), Approximation by finite element functions using global regularization, Report, Departement de Mathematiques, Ecole Polytechnique Federate de Lausanne, Switzerland. PitkAranta, J. A979), Boundary subpaces for the finite element method with Lagrange multipliers, Numer. Math. 33, 273-289. Poly a, G. A952), Sur une interpretation de la methode des differences finies qui peut fournir des bornes superieures ou inferieures, C.R. Acad. Sci. Paris 235, 995-997. Powell, M.J.D. and M.A. Sabin A977), Piecewise quadratic approximations on triangles, ACM Trans. Math. Software 3, 316-325. Protter, M. and H. Weinberger A967), Maximum Principles in Differential Equations, (Prentice-Hall, Englewood Cliffs, NJ), also (Springer, Berlin, 1984). Rabier, P. A977), Interpolation harmonique, Rev. Francaise Automat. Informal. Recherche Opera- tionnelle Ser. Rouge Anal. Numer. 11, 159-180. Rachford Jr, H.H. and M.F. Wheeler A974), An Я "' -Galerkin procedure for the two-point boundary value problem, in: C. de Boor, ed., Mathematical Aspects of Finite Elements in Partial Differential Equations (Academic Press, New York) 353-382. Rannacher, R. A976a), Zur punktweisen Konvergenz der Methode der finiten Elemente beim Plattenproblem, Manuscripta Math. 19, 401-416. Rannacher, R. A976b), Zur Lm -Konvergenz linearer finiter Elemente beim Dirichlet problem, Math. Z. 149, 69-77.
References 329 Rannacher, R. A977), Lm-Fehlerabschatzung fur ein nicht konforme Finite-Elemente-Methode beim Plattenproblern, Z. Angew. Math. Mech. 57, 247-249. Rannacher, R. and R. Scott A982), Some optimal error estimates for piecewise linear finite element approximations, Math. Сотр. 38, 437-445 Raoult, A. A985), Construction d'un modele devolution de plaques avec terme d'inertie de rotation, Ann. Mat. Рига Appl. 139, 361-400. Raugel, G. A978a), Resolution numerique de problemes elliptiques dans des domaines avec coins, C.R. Acad. Sci. Paris Sir. A-B 286, 791-794. Raugel, G. A978b), Resolution numerique de problemes elliptiques dans des domaines avec coins, Doctoral Dissertation, Universite de Rennes, France. Raviart, P.-A. A972), Mithodes des Elements Finis, Lecture Notes, Laboratoire d'Analyse Numerique, Universite Pierre et Marie Curie, Paris, France. Raviart, P.-A. and J.M. Thomas A983), Introduction а Г Analyse Numerique des Equations aux Dirivees Partielles (Masson, Paris). Razzaque, A. A973), Program for triangular bending elements with derivatives smoothing, Internal. J. Numer. Methods Engrg. 6, 333-343. Razzaque, A. A986), The patch test for elements, Internat. J. Numer. Methods Engrg. 22, 63-71. Robinson, J. A973), Basis for isoparametric stress elements, Comput. Methods Appl. Mech. Engrg. 2, 43-63. Robinson, J. A982), The patch test: Is it or isn't it?, Finite Element News 1, 30-34. Rodrigues, J.F. A987), Obstacle Problems in Mathematical Physics (North-Holland, Amsterdam). Rose, M.E. A975), Weak-element approximations to elliptic differential equations, Numer. Math. 24, 185-204. Roux, J. A976), Resolution numerique d'un probleme d'ecoulement subsonique de fluides compressibles, Rev. Francaise Automat. Informat. Recherche Opirationnelle Sir. Rouge Anal. Numer. 10, A2), 31- 50. Ruas, V. A988), A quadratic finite element method for solving biharmonic problems in R", Numer. Math. 52, 33-43. Ruas Santos, V. A982), On the strong maximum principle for some piecewise linear finite element approximate problems of non-positive type, J. Fac. Sci. Univ. Tokyo, Sec. IA 29, 473-491. Sablonniere, P. A980), Interpolation d'Hermite par des surfaces de classe C1 quadratiques par morceaux, in: Actes du Deuxieme Congr'es International sur les Methodes Numeriques de Vlngenieur (Dunod, Paris) 175-185. Sablonniere, P. A984), Composite finite elements of class C\ J. Comput. Appl. Math. 12-13, 541-550. Sablonniere, P. A987), Error bounds for Hermite interpolation by quadratic splines on an a-triangulation, IMA J. Numer. Anal. 7, 495-508. Samuelsson, A. A979), The global constant strain condition and the patch test, in: R. Glowinski, E.Y. Rodin and O.C. Zienkiewicz, eds., Energy Methods in Finite Element Analysis (Wiley, Chichester) 49-68. Sanchez, A.M. and R. Arcangeli A984), Estimations des erreurs de meilleure approximation polynomiale et d'interpolation de Lagrange dans les espaces de Sobolev d'ordre non entier, Numer. Math. 45, 301-321. Sander, G. and P. Beckers A977), The influence of the choice of connectors in the finite element method, Internat. J. Numer. Methods Engrg. 11, 1491-1505. Sard, A. A963), Linear Approximation, Mathematical Surveys 9 (American Mathematical Society, Providence, RI). Scarpini, F. and Vivaldi, M.A. A977), Error estimates for the approximation of some unilateral problems, Rev. Francaise Automat. Informat. Recherche Opirationnelle Sir. Rouge Anal. Numer. 11, 197-208. Schatz, A.H. A974), An observation concerning Ritz-Galerkin methods with indefinite bilinear forms, Math. Сотр. 28, 959-962. Schatz, A.H. A980), A weak discrete maximum principle and stability of the finite element method in L" on plane polygonal domains, Math. Сотр. 34, 77-91. Schatz, A.H. and L.B. Wahlbin A976), Maximum norm error estimates in the finite element method for
330 P.G. Ciarlet Poisson equation on plane domains with corners, in: C.K. Chui, G.G. Lorentz and L.L. Schumaker, eds., Approximation Theory II (Academic Press, New York) 541-547. Schatz, A.H. and L.B. Wahlbin A977), Interior maximum norm estimates for finite element methods, Math. Сотр. 31, 414-442. Schatz, A.H. and L.B. Wahlbin A978), Maximum norm estimates in the finite element method on plane polygonal domains, Part 1, Math. Сотр. 32, 73-109. Schatz, A.H. and L.B. Wahlbin A979), Maximum norm estimates in the finite element method on plane polygonal domains, Part 2: Refinements, Math. Comp, 33, 465-492. Schatz, A.H. and L.B. Wahlbin A981), On a local asymptotic error estimate in finite elements and its use: numerical examples, in: R. Vichnevetsky and R.S. Stepleman, eds., Advances in Computer Methods for Partial Differential Equations IV (IMACS, New Brunswick, NJ) 14-17. Schatz, A.H. and L.B. Wahlbin A982), On the quasi-optimality in !„ of the H1 projection into finite element spaces, Math. Сотр. 38, 1-22. Schmid, H.J. A978), Cubature formulas with a minimal number of knots, Numer. Math. 31, 281-297. Schreiber, R. A980), Finite element methods of high-order accuracy for singular two-point boundary value problems with nonsmooth solutions, SIAM J. Numer. Anal. 17, 547-566. Schultz, M.H. A969), L™ -multivariate approximation theory, SIAM J. Numer. Anal. 6, 161-183. Schultz, M.H. A972), Quadrature-Galerkin approximations to solutions of elliptic differential equations, Proc. Amer. Math. Soc. 33, 511-515. Schultz, M.H. A973), Error bounds for a bivariate interpolation scheme, J. Approximation Theory 8, 189-194. Schwartz, L. A966), Theorie des Distributions (Hermann, Paris). Schwartz, L. A967), Cours d'Analyse (Hermann, Paris). Scott, R. A973a), Finite element techniques for curved boundaries, Doctoral Thesis, MIT, Cambridge, MA. Scott, R. A973b), Finite element convergence for singular data, Numer. Math. 21, 317-327. Scott, R. A974), C1 continuity via constraints for 4th order problems, in: С de Boor, ed., Mathematical Aspects of Finite Elements in Partial Differential Equations (Academic Press, New York) 171-193. Scott, R. A975), Interpolated boundary conditions in the finite element method, SIAM J. Numer. Anal. 12, 404-427. Scott, R. A976a), Optimal L" estimates for the finite'element method on irregular meshes, Math. Сотр. 30, 681-697. Scott, R. A976b), A survey of displacement methods for the plate bending problem, in: K.-J. Bathe, J.T. Oden and W. Wunderlich, eds., Proceedings V'.S.-Germany Symposium on Formulations and Computational Algorithms in Finite Element Analysis (MIT Press, Cambridge, MA) 855-876. Scott, R. A976c), A hexagonal finite element with one degree-of-freedom per hexagon, Internat. J. Numer. Methods Engrg. 10, 958-959. Scott, R. A979), Applications of Banach space interpolation to finite element theory, in: M.Z. Nashed, ed., Functional Analysis Methods in Numerical Analysis, Lecture Notes in Mathematics 701 (Springer, Berlin) 298-318. Shah, J.M. A970), Two-dimensional polynomial splines, Numer. Math. 15, 1-14. Shi, Z. A984a), A convergence condition for the quadrilateral Wilson element, Numer. Math. 44,349-361. Shi, Z. A984b), On the convergence properties of the quadrilateral elements of Sander and Beckers, Math. Сотр. 42, 493-504. Shi, Z. A984c), An explicit analysis of Stummel's patch test examples, Internat. J. Numer. Methods Engrg. 20, 1233-1246. Shi, Z. A984d), Difficulties with Irons' patch test, Z. Angew. Math. Mech. 64, T314-T315. Shi, Z. A984e), The generalized patch test for Zienkiewicz's triangles, J. Comput. Math. 2, 279-286. Shi, Z. A984f), On the convergence rate of the boundary penalty method, Internat. J. Numer. Methods Engrg. 20, 2027-2032. Shi, Z. A985), Convergence properties of two nonconforming finite elements, Comput. Methods Appl. Mech. Engrg. 48, 123-137. Shi, Z. A986), The optimal order of convergence of Wilson's nonconforming element, Math. Numer. Sinica 8, 159-163.
References 331 Shi, Z. A987a), The F-E-M-test for convergence of nonconforming finite elements, Math. Сотр. 49, 391-405. Shi, Z. A987b), Convergence of the TRUNC plate element, Comput. Methods Appl. Meek Engrg. 62, 71-88. Shi, Z. A989), On Stummel's examples to the patch test, Comput. Meek 5, 81-87. Shilov, G.E. A968), Generalized Functions and Partial Differential Equations (Gordon and Breach, New York). Signorini, A. A933), Sopra alcune questioni di elastostatica, Atti Soc. Ital. Prog. Sci. Sperb, R. A981), Maximum Principles and Their Applications (Academic Press, New York). Stakgold, I. A968), Boundary Value Problems of Mathematical Physics II (MacMillan, New York). Stampacchia, G. A964), Formes bilineaires coercitives sur les ensembles convexes, C.R. Acad. Sci. Paris Ser. A. 258, 4413-4416. Stampacchia, G. A965), Le probleme de Dirichlet pour les equations elliptiques du second ordre a coefficients discontinus, Ann. Inst. Fourier {Grenoble) 15, 189-258. Strang, G. A971), The finite element method and approximation theory, in: B.E. Hubbard, ed., Numerical Solutions of Partial Differential Equations II (Academic Press, New York) 547-583. Strang, G. A972a), Approximation in the finite element method, Numer. Math. 19, 81-98. Strang, G. A972b), Variational crimes in the finite element method, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 689-710. Strang, G. A973), Piecewise polynomials and the finite-element method, Bull. Amer. Math. Soc. 79, 1128-1137. Strang, G. A974), The dimension of piecewise polynomials, and one-sided approximation, in: G.A. Watson, ed., Conference on the Numerical Solution of Differential Equations, Lecture Notes in Mathematics 363 (Springer, New York) 144-152. Strang, G. and A. Berger A971), The change in solution due to change in domain, in: Proceedings AMS Symposium on Partial Differential Equations, Berkeley, CA, 199-205. Strang, G. and G.J. Fix A971), A Fourier analysis of the finite element method, in: Proceedings CIME Summer School, Cremonese, Rome. Strang, G. and G.J. Fix A973), An Analysis of the Finite Element Method (Prentice-Hall, Englewood Cliffs, NJ). Stroud, A.H. A971), Approximate Calculation of Multiple Integrals (Prentice-Hall, Englewood Cliffs, NJ). Stummel, F. A979), The generalized patch test, SIAM J. Numer. Anal. 16, 449^171. Stummel, F. A980a), Basic compactness properties of nonconforming and hybrid finite element spaces, RAIRO Anal. Numer. 4, 81-115. Stummel, F. A980b), The limitations of the patch test, Internat. J. Numer. Methods Engrg. 15, 177-188. Suzuki, T. and H. Fujita A986), A remark on the L" bounds of the Ritz operator associated with a finite element approximation, Numer. Math. 49, 529-544. Synge, J.L. A957), The Hypercircle in Mathematical Physics (Cambridge University Press, Cambridge). Taylor, R.L., P.J. Bersford and E.L. Wilson A976), A non-conforming element for stress analysis, Internat. J. Numer. Methods Engrg. 10, 1211-1220. Taylor, R.L., J.C. Simo, O.C. Zienkiewicz and A.C.H. Chan A986), The patch test: A condition for assessing FEM convergence, Internat. J. Numer. Methods Engrg. 22, 39-62. Темам, R. A984), Navier-Stokes Equations: Theory and Numerical Analysis (North-Holland, Amsterdam, 3rd rev. ed.). Thatcher, R.W. A976), The use of infinite grid refinements at singularities in the solution of Laplace's equation, Numer. Math. 25, 163-178. Thomasset, F. A981), Implementation of Finite Element Methods for Navier-Stokes Equations (Springer, Heidelberg). ТномЁЕ, V. A973a), Approximate solution of Dirichlet's problem using approximating polygonal domain, in: J.J.H. Miller, ed., Topics in Numerical Analysis (Academic Press, New York) 311-328. ТномЁЕ, V. A973 b), Polygonal domain approximation in Dirichlet's problem, J. Inst. Math. Appl. 11, 33^14.
332 P.G. Ciarlei Тномёе, V. A977), High-order local approximations to derivatives in the finite element method, Math. Сотр. 31, 652-660. Tomlin, G.R. A972), An optimal successive overrelaxation technique for solving second order finite difference equations for triangular meshes, Internat. J. Numer. Methods Engrg. 5, 25-39. Treves, F. A967), Topological Vector Spaces, Distributions and Kernels (Academic Press, New York). Turner, M.J., R.W. Clough, H.C. Martin and L.J. Topp A956), Stiffness and deflection analysis of complex structures, J. Aero. Sci. 23, 805-823. Utku, М. and G.F. Carey A982), Boundary penalty techniques, Comput. Methods Appl. Meek. Engrg. 30, 103-118. Valid, R. A977), La Mecanique des Milieux Continus et le Calcul des Structures (Eyralles, Paris). English translation: Mechanics of Continuous Media and Analysis of Structures (North-Holland, Amsterdam, 1981). Varga, R.S. A962), Matrix Iterative Analysis (Prentice-Hall, Englewood Cliffs, NJ). Varga, R.S A966a), On a discrete maximum principle, SIAM J. Numer. Anal. 3, 355-359. Varga, R.S. A966b), Hermite interpolation-type Ritz methods for two-point boundary value problems, in: J.H. Bramble, ed., Numerical Solution of Partial Differential Equations (Academic Press, New York) 365-373. Varga, R.S. A971), Functional Analysis and Approximation Theory in Numerical Analysis, Regional Conference Series in Applied Mathematics (SIAM, Philadelphia, PA). Veidinger, L. A972), On the order of convergence of the Rayleigh-Ritz method with piecewise linear trial functions, Acta Math. Acad. Sci. Hungar. 23, 507-517. Veidinger, L. A975), On the order of convergence of a finite element method in regions with curved boundaries, Acta Math. Acad. Sci. Hungar. 26, 419-431. Villagio, P. A972), Energetic bounds in finite elasticity, Arch. Rational Mech. Anal. 45, 282-293. Wachspress, E.L. A971), A rational basis for function approximation, J. Inst. Math. Appl. 8, 57- 68. Wachspress, E.L. A973), A rational basis for function approximation, II: Curved sides, J. Inst. Math. Appl. 11, 83-104. Wachspress, E.L. A975), A Rational Finite Element Basis (Academic Press, New York). Wahlbin, L.B. A978), Maximum norm error estimates in the finite element method with isoparametric quadratic elements and numerical integration, RAIRO Anal. Numer. 12, 173-202. Wahlbin, L.B. A984), On the sharpness of certain local estimates for Я1 projections into finite element spaces: influence of a reentrant corner, Math. Сотр. 42, 1-8. Wait, R. and A.R. Mitchell A971), Corner singularities in elliptic problems by finite element methods, J. Comput. Phys. 8, 45-52. Wait, R. and A.R. Mitchell A985), Finite Element Analysis and Applications (Wiley, New York); rev. ed. of The Finite Element Method in Partial Differential Equations (Wiley, New York 1977). Walsh, J. A971), Finite-difference and finite-element methods of approximation, Proc. Roy. Soc. London Ser. A 323, 155-165. Werschulz, A.G. A982), Optimal error properties of finite element methods for second order elliptic Dirichlet problems, Math. Сотр. 158, 401-413. Wheeler, M.F. and J.R. Whiteman A987), Superconvergent recovery of gradients on subdomains from piecewise linear finite-element approximations, Numer. Methods Partial Differential Equations 3, 65-82. Whiteman, J.R. A982), Finite elements for singularities in two- and three-dimensions, in: J.R. Whiteman, ed., The Mathematics of Finite Elements and Applications IV (Academic Press, London) 35-51. Whiteman, J.R. and J.E. Akin A980), Finite elements, singularities, and fracture, in: J.R. Whiteman, ed., The Mathematics of Finite Elements and Applications III (Academic Press, London) 35-51. WlDLUND, O. A977), On best error bounds for approximation by piecewise polynomial functions, Numer. Math. 27, 327-338. Wilson, E.L., R.L. Taylor, W,P. Doherty and J. Ghabussi A973), Incompatible displacement models, in: S.T. FENVES et al, eds., Numerical and Computational Methods in Structural Mechanics (Academic Press, New York) 43-57.
References 333 Wissmann, J.W., Т. Becker and H. MOller A987), Stabilization of the zero-energy modes of under-integrated isoparametric finite elements, Comput. Mech. 2, 289-306. Zacharias, С and V. Subba Rao A982), Curved elements with polynomials of varying degree, J. Math. Phys. Sci. 16, 223-230. ZenISek, A. A970), Interpolation polynomials on the triangle, Numer. Math. 15, 283-296. ZenISek, A. A972), Hermite interpolation on simplexes in the finite element method, in Proceedings EquaDiff3, Purkyne University, Brno, 271-277. ZenISek, A. A973), Polynomial approximation on tetrahedrons in the finite element method, J. Approximation Theory 7, 334-351. Zeni'Sek, A. A974), A general theorem on triangular finite C'^-elements, Rev. Francaise Automat. Informal. Recherche Operationnelle Ser. Rouge Anal. Numer. R-2, 119-127. ZenISek, A. A981a), Discrete form of Friedrichs' inequalities in the finite element method, RAIRO Anal. Numer. 15, 265-286. ZenISek, A. A981b), Nonhomogeneous boundary conditions and curved triangular finite elements, Apt Mat. 26, 121-141. ZenISek, A. A987), How to avoid the use of Green's theorem in the Ciarlet-Raviart theory of variational crimes, Modelisation Math. Anal. Numer. 21, 171-191. Zienkiewicz, O.C. A971), The Finite Element Method in Engineering Science (McGraw-Hill, London). Zienkiewicz, O.C. A973), Finite elements: The background story, in: J.R. Whiteman, ed., The Mathematics of Finite Elements and Applications (Academic Press, London) 1-35. Zienkiewicz, O.C. A974), Constrained variational principles and penalty function methods in finite element analysis, in: G.A. Watson, ed., Conference on the Numerical Solution of Differential Equations, Lecture Notes in Mathematics 363 (Springer, New York) 207-214. Zienkiewicz, O.C. and K. Morgan A983), Finite Elements and Approximation (Wiley, New York). Zlamal, M. A968), On the finite element method, Numer. Math. 12, 394-409. Zlamal, M. A970), A finite element procedure of the second order of accuracy, Numer. Math. 16, 39<M02. ZlAmal, M. A973a), The finite element method in domains with curved boundaries, Internat. J. Numer. Methods Engrg. 5, 367-373. Zlamal, M. A973b), Curved elements in the finite element method, I. SI AM J. Numer. Anal. 10,229-240. Zlamal, M. A973c), A remark on the "Serendipity family", Internat. J. Numer. Methods Engrg. 7,98-100. Zlamal, M. A974), Curved elements in the finite element method, II, SI AM J. Numer. Anal. 11,347-362. Zlamal, M. A77), Some superconvergence results in the finite element method, in: I. Galligani and E. Magenes, eds., Mathematical Aspects of Finite Element Methods, Lecture Notes in Mathematics 606 (Springer, Berlin) 353-362. Zlamal, M. A978), Superconvergence and reduced integration in the finite element method, Math. Сотр. 32, 663-685.
Glossary of Symbols General notation {xeA;R(x)}: general notation for a set. ?;(•), v( •,*),...: function v of one variable, two variables, v(',b): partial mapping x-+v{x,b). supp v = с1{хеЛГ; фс) # 0}: support of a function v. osc(v;A)= sup \v{x) — v(y)\. X,J!€A vA or v\A: restriction of a function v to the set A. P(A) = {p\A; peP}, where P is any space of functions defined over a domain that contains the set A. tr v or simply v. trace of a function v. C(a), C(a, b),...: any "constant" that depends solely on a, a and b,.... c\A or A: closure of a set A. intv4 or A: interior of a set A. dA: boundary of a set A. card(/l): number of elements of a set A. diam(j4): diameter of a set A. X — A: complement set of the subset A of the set X. =>: implies. Derivatives and differential calculus Dv(a) or v'(a): first (Frechet) derivative of a function v at a point a. £>2y(a) or t/'(fl): second (Frechet) derivative of a function v at a point a. D^a) or t/fc,(a): fcth (Frechet) derivative of a function г; at a point a. Dkv{a)hk = Z)*^)^!,й2>..., /ij if hx = h2 = ■■■ = hk = A. diV(a) = Dv(a)ei ~) 8yi^fl) = £Jг;(а)(е;,е^) > also used for vector-valued functions. bijkv{a) = Div{a){ei,ej,ek) J 335
336 P.G. Ciarkt J(F)(x) = det(8/,.(x)): Jacobian of a mapping F:xer-»F(x) = (fi(*))?=ieR''at x. n div v = Yi ^iv- 4v(a) = (8(p)f = i, also denoted Vp(a) or grad v(a). n !=1 |ot| = £ a;, for a multi-index a = (al5... ,a„)el4l". aj times a2 times a„ times 9"г;(с) = Z)wt;(a)(e;,.. .,e1,e2,...,e2>. ,,,e„,...,e„). v = (vl5 v2,..., v„): unit outer normal vector. 9V = £ VjSji (outer) normal derivative operator. i=l т = (t15t2): unit tangential vector along the boundary of a plane domain. 2 9Tu(a) = Dv{a)x = £ тД.у(а). i=i 2 6vtu(o) = D2D(a)(v>T)= £ у.-тД.ф). U=l 2 8„г;(а) = £2г;(а)(т,т)= J] т£тДс(а). Genera! notation for vector spaces II' II or || • \\x- norm (in the space X). |-|: seminorm. Br(a) = {xeX; \\x-a\\ O). Br = ВД0). S£(X\ Y): space of continuous linear mappings from X into Y.
Glossary of symbols 337 J?k(X; Y): space of continuous /c-linear mappings from Xk into Y. X': dual of a space X. || • ||': norm in the space X'. <•,•>: duality pairing between a space and its dual. x+Y={x+y;ysY}. X+Y={x + y;xeX,yeY}. X®Y={x + y,xeX, yeY} whenXn7={0}. X/Y: quotient space of X by Y. У{еА,ХеЛ}: vector space spanned by the vectors ел,ЯеЛ. I: identity mapping. e*: inclusion with continuous injection. €: inclusion with compact injection. dim X: dimension of the space X. KerA = {xeX;Ax = 0}. ImA = {AxeY;xeX]. Notation for specific vector spaces (u, v) = («,v) = uvdx: inner product in L2(Q). wvdx: inner product in (L2 (Q))n. ^m{A): space of real-valued, m times continuously differentiable functions on a subset A of №. V«>(A)= f) V*(A). <€m*{A) = \ve<$m{Q); for each $ with fi = m, sup |8У 8У' < + oo >>. I x,yeA \\X — y\\ хфу IMI*~-M) = II"L.»m + max SUP ' F^ FT^- \l>\=™ x,yeA \\X-y\\ хфу ЩО) = {ve4>x(Q); suppt; is a compact subset of Q}. @'(Q); space of distributions over Q.
338 P.C. Ciarlet Hm{Q) = {veL2(Q); b*veL2{Q) for all a with |a| «S ra}. Щ@): closure of 3>{Q) in Hm(Q). . |z|€m 1 1/2 |3ay|2dxf , . |a|=m j Й 1/2 " IL.o = i I II0/ lllo \ for functions f = (Vi)U г т (HQ)f, 1/2 j v ко H .|>( l-.fi \ for functions v = (Vi)U i in (Я^))'1. W™-p@) = {veLP{Q); d"v€Lp(Q) for all a with |a| ^ m}. WZ'P(Q): closure of 0(G) in Wm-"(Q), v\\m.P.a = l I |Э"«|'<ЬсУ , U?< i/p oo. If IL,ao,n = max ^esssup|Sat;(x)|>. IIf ll'm,p,fi: norm in the dual space of Wm,p(Q). f Г 1 1/p I»L.p.oH I |8«»|'dx[ , Up<co. Mm,co,o = max ^esssup|3*u(x) v = {weWk+1-!'(Qy,(w-v)ePk(Q)} \\v\\k+i,p,n = inf \\v + p\\k+Up,a, vei реРк<Я) |i5|t+i,p,n = |fk+i,p,n, ^e*5 1/2 notation in the quotient space Wk+1-p(Q)/Pk(Q). v\cp;m,Q=) \<P Z |9^u|2dx> : weighted seminoma \v\m,*>.T = sup \\Dmv{x)\\Xm{U„,R) for v.Ti
Glossary of symbols 339 I F |M>0O.r = sup \\DmF(x)\\<?miStn;Kn) for F:fc K"^K". Ит,оо,т = max sup ||Z)mF(x)(e,.)m || for F: fa W -►R". Ят(Т),И""-"(Г), ||-L,p,r,...: alternate notation for Hm(f), W""'p(f), || • j|m,Pj/,..., where T is a subset of U" with interior t. Elasticity X, ц: Lame's coefficients of a material. £ = : Young s modulus. Я Poisson's coefficient. 2{k + iiY Eo(H)= 2$jui + diuj)'- components of the linearized strain tensor. Gу(и): components of the stress tensor. Spaces of polynomials Pk: space of all polynomials in xlt..., x„ of degree </c. Qk: space of all polynomials in xx,..,, x„, of degree =$/c with respect to each variable xt, 1 < г^ n. Notation special to U" eh 1 < i ^ n: canonical basis of №. ( n ] i/2 IIv II = ) Z I ui I f : Euclidean norm of the vector v = («,.)"= t. Wl IIВ || = sup - : norm of the matrix B, induced by the Euclidean veK" II II vector norm. a • b: Euclidean scalar product in W of the vectors a and b. ax b: vector product of the vectors a and b. det B: determinant of a square matrix B. meas(yl) = dx: dx-measure of a set A <= W. A
340 P.G. Ciarlet dy: superficial measure along a Lipschitz-continuous boundary of an open subset of R". Xj = kj(x): barycentric coordinates of a point xeIR", l^j^n+1. dij^Hai + aj), i<j. = iBa; + aj), i Ф). = i(af 4- a• + ak), 1Ф), )Фк, кФи Finite elements (most common notation) (T, P, Z) or (T, PT, ZT): finite element. P = jPt: space of functions p or pT: T-* U. Z = ZT: set of degrees of freedom of a finite element. 4>i — 9i,T, 1 < i < AT: degrees of freedom of a finite element. Pi = Put, 1 < i < N: basis functions of a finite element. r\T: set of nodes of a finite element. s = sT: maximal order of directional derivatives found in the set Z. Uv = nTv: P- or FT-interpolant of a function v. dom П = <Р{Т). /iT = diam(T). pr = sup{diam(S): S is a ball contained in Г}. xebx=F(x)sT: corresponding points of f and T= F{T) (F: bijection). (v: f-+№)^(v = v-F-1: T-R): corresponding functions defined over f and T = F{T) (F: bijection). FeiPfoFteP, l^i^n T = F(T), where Рб(Р1{Т)У>, F(di) = ai, l^t<rt+l v for isoparametric ' simplicial elements. hT = diam(f) pT: diameter of the sphere inscribed in f j L q>(x)ux~ Y, °W(&i): quadrature formula with weights a>l and nodes bt. т Г . L Е(ф)= ф(х)йх~ Y щфкРгУ- quadrature error functional on T. J 1 = 1 f
Glossary of symbols 341 £r(<?)= U>(x)dx- Ёй!.гАг>: T quadrature error functional on T = FT(T), where a)i,T=tt>iJ(FT)(h), bhT = FT(b,). Finite element spaces (most common notation) STh: triangulation of a set Q. Xh: finite element space without boundary conditions. X0h = {v„eXk; vh =0 on Г}. Xooh = {vheXh; vh = dvvh = 0 on Г). Vh: finite element space with boundary conditions, Ih: set of degrees of freedom of a finite element space Xh. cph or (pkh, 1 < к ^ M: degrees of freedom of a finite element space Xh. {wk)ff=1: basis in a finite element space Xh or Vh. t]h: set of nodes of a finite element space Xh. nhv: Xfc-interpolant of a function v. dom ПА = ^"(Q), s = max sT. Various sets of hypotheses concerning the finite element method (FEM1) The set Q is triangulated. (FEM2) The spaces PT, T&2Th, contain polynomials or "nearly polynomials". (FEM3) There exists a basis in the finite element space Vh whose functions have "small" supports. {$~h 1) Each set T is closed, t Ф 0, and f is connected. (^~h2) Each boundary 6T is Lipschitz-continuous. {3rh2)Q={JTe^T. (Fh4) Т,ФТ2 => ftnf2=0. {3~ьЬ) Condition on adjacent finite elements.
342 P.G. Ciarkl (HI) Regularity of a family of triangulations. (H2) All finite elements (T,PT,ET), Ts\Jh$~h, are affine-equivalent to a single reference finite element. (H3) All finite elements (T,PT,ZT), Te\JhFh, are of class %°. (H4) The family of triangulations satisfies an inverse assumption. (H2') The family (Г, PT, IT), Те {jh^h, is almost-affine, (H3') All finite elements (T,PT,IT), Te{Jh2Th, are of class <%K
Subject Index An asterisk in the left margin indicates a specific finite element Abstract error estimate, 113, 117, 141, 183, 192, 193, 213, 228, 257, 302 Abstract minimization problem, 23, 24 Abstract variational problem, 23, 28, 29 *ACM rectangle, 300 *Adini-Clough-Melosh rectangle, 274,298,300, 302 Adjacent finite elements, 73, 86, 103 Adjacent isoparametric finite elements, 252 Adjacent n-rectangles, 82 Adjacent n-simplices, 72, 73 Adjoint problem, 142 Admissible displacement, 23, 47 Affine-equivalent finite elements, 97,98,100,215, 227, 229, 277 Affine-equivalent open subsets of R", 99,121,125 Affine family of finite elements, 59, 97,101,115, 129, 276 Affine mapping, 76, 97, 98, 121, 125, 227 Almost-affine family of finite elements, 273,275, ' 276, 277, 283, 286, 288, 292, 293, 294 Approximate bilinear form, 183, 192, 206, 209, 211,219,224,256,299 Approximate integration, 190 Approximate linear form, 183, 192, 256 Approximate quadrature, 190 Approximation without interpolation, 132,133, 140 Area coordinate, 67 *Argyris triangle, 88, 90, 91, 99, 108, 109, 111, 112,273,275,276,297 Basis functions of the —, 88, 89, 276 Asymptotic expansion of the discrete solution, 181 Aubin-Nitsche lemma, 116, 141, 179, 193 Automatic generation of triangulations, 60 Averages, as degrees of freedom, 93, 215, 216 224, 309 BabuSka's paradox, 298 Banach space interpolation method, 134 Barycenter of an «-simplex, 67 Barycentric coordinate, 66 Basic aspects of the finite element method, 59,61, 63, 64, 65 Basis function of a finite element 94, 100, 106, 107 Basis function of a finite element space, 63,64,74, 86, 105, 106 *Bell triangle, 90, 91, 99, 108, 275, 279, 297 Basis functions of the —, 90, 91, 279 •Bicubic rectangle, 78, 79 ♦<Г—, 92 •Reduced —, 81, 82, 99 Biharmonic operator, 54, 56 Biharmonic problem, 56, 275, 302, 309, 312 Bilinear form, 23 Indefinite —, 180 F-elliptic —, 23, 24 Bilinear lemma, 210, 219, 220, 221, 224, 307 ♦Bilinear rectangle, 77, 222 ♦Isoparametric —, 235, 236, 249 ♦Biquadratic rectangle, 77, 79 ♦Isoparametric —, 235, 236 ♦Reduced —, 81, 82, 99 ♦Birkhoff-Mansfield triangle, 294 ♦Reduced —, 294, 295 Blending function interpolation, 250, 294 ♦Bogner-Fox-Schmit rectangle, 92,99,108,275, 297 Boundary, — node, 109, 253, 255 — of class X, 32 Curved —, 250, 251, 268, 270, 271, 298 Free —, 50 Lipschitz-continuous —, 32 Piecewise smooth —, 256 Sufficiently smooth —, 33 Boundary condition, 39 — in a finite element space, 60, 108, 209, 218, 301 343
344 P.G. Ciarlet Boundary value problem, 37, 56 — formally associated with a variational problem, 37, 40, 42, 45 —of linearized elasticity, 23,46,56,63,65,210, 214 Elliptic —, 24, 56 Fourth-order —, 56, 65, 273 Nonlinear —, 46, 47, 48, 50, 52, 53 Regular second-order —, 142 Regularity of the solution of a —, 53, 142,297 Second-order —, 56 Bramble-Hilbert lemma, 184, 198, 220, 221 Cea's lemma, 60, 113, 193 Center of gravity of an n-simplex, 67 Choice of a finite element, 111 Clamped plate problem, 23, 54, 55, 56, 63, 274, 295, 296, 298, 299 Compatible finite elements, 275 Composite finite element, 279, 286 Conforming finite element method, 59, 65, 115, 138 — for fourth-order problems, 274 — for plate problems, 274 — for second-order problems, 65, 115, 138 Conormal derivative operator, 43 Consistency condition, 197, 219 Consistency error, 183, 197, 209, 219, 261 — estimate, 197, 199, 201, 221, 261, 26ч 303 Convergence, 60, 112, 114 Convergent family of discrete problems, 112 Coordinate, Area —, 67 Barycentric —, 66 Curvilinear —, 234 Volume —, 67 Corner singularity, 181 ♦Courant's triangle, 68 *Crouzeix-Raviart linear n-simplex, 104 *Crouzeix-Raviart linear tetrahedron, 104 *Crouzeix-Raviart linear triangle, 104, 105, 225 *Cubic Hermite n-simplex, 85, 86 ♦Reduced —, 85, 86 *Cubic Hermite tetrahedron, 84 ♦Reduced —, 72, 85 "Cubic Hermite triangle, 84 ♦Isoparametric —, 235, 248 ♦Reduced —, 85 ♦Cubic n-simplex, 68, 69 ♦Isoparametric —, 234, 247 ♦Isoparametric reduced —, 234 ♦Reduced —, 71, 72 ♦Cubic triangle, 69, 74, 196 ♦Isoparametric —, 234 ♦Cubic tetrahedron, 69 ♦Reduced —, 72, 85 Curved boundary, curved domain, 227,250,251, 268, 270, 271, 298 Curved finite element, 95, 250, 298 Nonconforming —, 274, 298 Curved nonconforming finite element, 270, 298 Curvilinear coordinate, 234 Definition of a finite element, 93 Degrees of freedom, — of adjacent finite elements, 73, 86, 103, 105 — of a finite element, 59, 67, 83, 94, 95, 216, 309 — of a finite element space, 59, 72, 105 — which are averages, 95, 215, 216, 224, 309 Set of— of a finite element, 67,94,95,96,103, 105 Set of— of a finite element space, 72, 86,103, 105 Derivative, 31 Diagonal affine mapping, 76 Differential calculus, 27, 30, 31 Directional derivative, 59, 83, 84 Dirichlet problem, Homogeneous — 37, 43, 54, 56, 63, 184, 185, 209, 218, 250 Nonhomogeneous —, 37, 146, 180 Discrete maximum principle, 116, 145, 148,150, 173 Discrete Poincare-Friedrichs inequality, 208 Discrete problem, 59,61,112,147,185, 206,209, 211,253,255,256,299 — of nonnegative type, 148, 151 Discrete solution, 61, 116, 190, 254, 256 Asymptotic expansion of the —,181 Displacement, 23, 46 Distribution, 30, 36 Domain in R", 33, 118 Domain with corners, 33, 53 Duality techniques, duality theory, 179, 270, 311 Edge of an n-rectangle, 76 Edge of an и-simplex, 66 ♦8-node rectangle, 81
Subject Index 345 *18-degree of freedom triangle, 90, 91 Elasticity, Linearized —, 23,46,51,53,56,63,65,210,214 Nonlinear —, 46, 53 Elasto-plastic torsion problem, 51, 52, 179 Elliptic boundary value problem, 24, 56 Ellipticity condition, 43, 184 Energy, 23 — of a plate, 55 potential — of the exterior forces, 47 Strain —, 47 Total —, 47 Equality of two finite elements, 96, 215 Error, 60, 113, 114 Error estimate, Abstract —, 113, 117, 141, 183, 192,193, 213, 228, 257, 302 — in mesh-dependent norms, 140 — in the norm |-|0 „, 143, 179, 207, 224, 270, 297 — inthenomi||ilo,or(Zr^J-|Jr)"M38, 176, 204, 221, 266, 270, 308, 311 — in the norm || • |i2 D, or BT«^N| Tf'\ 273, 274, 296, 297, 302, 305, 308, 311 — in the norm |-|0oo„, 144,154,171,179, 298 — in the norm INIlf00>o, 171 — in the norm No.p.o' 2<p<oo, 173 — in the norm ll'llliPin, l<p<oo, 173 Interior —,181 Local —, 143, 181 Nonuniform —, 180 Estimate of the constant, — in the error estimate, 113, 140 — in the interpolation error estimate, 134 — in the quadrature error estimate, 204 Extension operator, 257 Extrapolation procedure, Richardson's —,181 Face, — of an isoparametric finite element, 252 — of an isoparametric quadratic и-simplex, 232, 233 — of an и-rectangle, 76 — of an n-simplex, 66 Finite difference method, 57, 64, 191, 192 Finite element, Adjacent —s, 73, 86, 103 Adjacent isoparametric —s, 252 Affine-equivalent —s, 97, 98, 100, 215, 227, 229, 277 Affine family of —s, 59, 97, 101, 115, 129, 276 Almost-affine family of —s, 273,275,276,277, 283, 286, 288, 292, 293, 294 Basis function of a —, 94, 100, 106, 107 Choice of a —, 111 Compatible —s, 275 Composite —, 279, 286 Curved —, 95, 250, 298 Curved nonconforming —, 270, 298 Definition of a —, 93 Degree of freedom of a —, 59, 67, 83, 94, 95, 216, 309 Degrees of freedom of adjacent —s, 73,86,103, 105 Equality of two —s, 96, 215 Face of an isoparametric —, 252 — not of class «*>, 210, 218, 309 — of a given type, 107, 108 — of class «*>, 59, 93, 108,110 — of class че1, 59, 93,108,110, 275, 276, 279, 280, 281, 288, 289, 292, 293, 296 —s of different types, 108 Generic — of a finite element space, 102,105, 106 Hermite —, 59, 95, 96 Incompatible —, 275 Isoparametric family of —s, 230 Isoparametric —, 227, 230, 234, 270, 271 Isoparametric Hermite —, 231 Isoparametric Lagrange —, 229, 230 Isoparametrically equivalent —s, 227, 230 Lagrange —, 59, 95, 96, 99, 227, 228 Node of a—, 76, 95, 231 Nonconforming —,212 Nonconforming — for second-order problems, 214 Nonconforming—for the plate problem, 274, 298,299,300,308,311 Nonlocal —, 133 Parallelepiped —, 99 Parallelogram —, 99 Prismatic —, 95, 102 Quadrilateral —, 95, 235, 236, 250, 271 Rational —, 250 Rectangular —, 59, 75, 82, 94, 101, 214 Reference — of a family, 98, 101, 230 Regular family of —s, 116, 128, 131, 134 Serendipity —, 82 Set of degrees of freedom of a —, 67,94,95,96, 103, 105 Set of nodes of a —, 95 Simplicial —, 59, 65, 94, 101 Simplicial isoparametric —, 234 Singular —, 273, 279, 288, 293 Straight —, 65, 94, 227, 273
346 P.G. Ciarlet Subparametric —, 230 Tetrahedral —, 94 Tetrahedral isoparametric —, 234 Triangular —, 94, 128 Triangular isoparametric —, 234 Finite element method, 47, 57, 59, 60, 61, 64 Basic aspects of the —, 59, 61, 63, 64, 65 Computational optimality of the —, 180 Conforming —, 59, 65, 115, 138 Conforming—for fourth-order problems, 274 Conforming — for plate problems, 274 Conforming — for second-order problems, 65, 115, 138 Hybrid —, 225, 274 Linear system found in the —, 63,101,185,190 Mixed —, 274 Nonconforming —, 209, 210, 212, 270, 271, 274, 275 Nonconforming — for plate problems, 274 Nonconforming — for second-order problems, 210, 214 Numerical implementation of the —, 60, 111, 180 Variants of the —, 180 Finite element space, Basis function of a —, 63, 64, 74, 86, 105, 106 Boundary condition in a —-, 60, 108, 209, 218, 301 Degrees of freedom of a —, 59, 67, 83, 94, 95, 216, 309 — made up of isoparametric finite elements, 253 Generic finite element of a —, 102, 105, 108 Node of a —, 105 Requirements for a —, 65 Set of degrees of freedom of a — 72, 86, 103, 105 Set of nodes of a —, 105 Support of a basis function in a —, 59, 64, 74, 106 First Strang lemma, 183, 192 Flow problem, 178, 180 Formal solution of a boundary value problem. 37, 40, 42, 45 *4-node quadrilateral, 235, 249, 270 *4-node rectangle, 77 Fourth-order boundary value problem, 56, 65, 273 *Fraeijs de Veubeke-Sander quadrilateral, 286 * Reduced —, 288 *Fraeijs de Veubeke triangle, 298, 309 Fractional order Sobolev space, 138 Frechet derivative, 27, 30, 31 Free boundary, 50 — problem, 50, 52 Free edge of a plate, 56 Galerkin's method, 59, 61 Gauss-Legendre formula, 189 Generalized patch test, 225, 308 Generic finite element of a finite element space, 102, 105, 106 Gradient, Finite element approximations of—, 181 Green's formula, 24, 34, 35, 42, 62 Green's function, Finite element approximation of —, 172, 173 Harmonic polynomial, 180 *HCT triangle, 279 Hermite finite element, 59, 95, 96, 99 Homogeneous Dirichlet problem, 37, 43, 54, 56, 63, 184, 185, 209, 218, 250 Homogeneous mixed problem, 43 Homogeneous Neumann problem, 40, 43, 63, 206 Hooke's law, 46 Hourglass instability, 208, 270 *Hsieh-Clough-Tocher tetrahedron, 286 *Hsieh-Clough-Tocher triangle, 273, 279, 280, 281, 282, 283, 286, 296, 297 Basis functions of the —, 280 ♦Reduced —, 285, 286, 297 Hybrid finite element method, 225, 274 Incompatible finite element, 275 Indefinite bilinear form, 180 Instability, Hourglass —, 208, 270 Interior error estimate, 181 Interpolation error, interpolation error estimate, 115, 121, 126, 129, 161, 245, 246, 259, 261 Interpolation-free approximation theory, 132, 133, 140 Interpolation method, Banach space —, 134 Inverse assumption, 135, 156 Inverse inequality, 116, 137 Irons patch test, 210, 224 "Isoparametric bilinear rectangle, 235 "Isoparametric biquadratic rectangle, 235, 236 Isoparametric cubic Hermite triangle, 235, 248
Subject Index 347 ♦Isoparametric cubic n-simplex, 234, 241 •Isoparametric cubic triangle, 234 Isoparametric family of finite elements, 102, 230 Isoparametric finite element, 227,230,270,271 Adjacent —s, 64, 252 Face of an —, 252 Finite element space made up of —s, 253 Nonconforming —, 271 Quadrilateral —, 236, 271 Simplicial —, 234 Tetrahedral —, 234 Triangular —, 234 Triangulation made up of—s, 233, 251, 253 'Isoparametric 4-node quadrilateral, 235 Isoparametric Hermite finite element, 231 Isoparametric Lagrange finite element, 229, 230 Isoparametric mapping, 228, 230 "Isoparametric n-rectangle of degree 2, 235 ♦Isoparametric n-simplex of degree k, 234 Isoparametric numerical integration, 228, 254 ♦Isoparametric quadratic n-simplex, 227, 232, 241, 246, 258 ♦Isoparametric quadratic triangle, 232, 252, 255 ♦Isoparametric reduced cubic n-simplex, 234 ♦Isoparametrically equivalent finite elements, 227, 230 Kinematically admissible displacement, 47 Kondrasov theorem, 119 Korn's inequality, 44 Lagrange finite element, 59, 95, 96, 99, 227, 228 Lagrange multiplier, 181 Lame's coefficient, 46 Lax-Milgram lemma, 23, 29 Least square method, 270 ♦Linear n-simplex, 67, 68, 196 Linear system found in the finite element method, 63, 101, 185, 190 ♦Linear tetrahedron, 68 ♦Linear triangle, 68, 156 Linearized elasticity, 23,46,51,53,56,63,65,210, 214 Lipschitz-continuous boundary, 32 Load vector, 63 Local error estimate, 143, 181 Macroelement, 278 Maximum principle, 145 Discrete — 116, 145, 148, 150, 173 Membrane problem, 23, 38, 47, 56 Mesh-dependent norms, Error estimate in —, 140 Method of weighted norms of Nitsche, 117,155, 163, 172, 179 m-face of an n-simplex, 66 Minimization problem, 23, 24, 27, 35, 56 Abstract —, 23, 24 Variational formulation of a —, 23, 28 Mixed finite element method, 274 Mixed problem, 43, 181 ♦Morley triangle, 298, 309 Multi-index notation, 31 Multi-point Taylor formula, 130, 134 Navier-Stokes equations, 60 Neumann problem, Homogeneous —, 40, 43, 63, 206 Nonhomogeneous —, 40, 41, 43, 63 ♦9-node rectangle, 77 Nitsche's method of weighted norms, 117, 155, 163, 172, 179 Node of a finite element, 76, 95, 231 Node of a finite element space, 105 Node of a quadrature formula, 183, 186, 254 Nonconforming finite element, 212 — for curved domains, 270, 298 — for second-order problems, 210, 214 — for the plate problem, 274, 298, 299, 300, 308, 311, 312 — of isoparametric type, 271 Nonconforming finite element method, 209,212, 274 — for second-order problems, 210, 214 — for the plate problem, 274, 298, 299, 300, 308, 311, 312 Nonhomogeneous Dirichlet problem, 37, 146, 180 Nonhomogeneous mixed problem, 43 Nonhomogeneous Neumann problem, 40, 41, 43,63 Nonlinear elasticity, 46, 53 Nonlinear problem, 27, 46, 47, 48, 50, 52, 53, 60 Nonnegative type, Discrete problem of—, 148, 151 Nonuniform error estimate, 180 Normal derivative operator, 128 n-rectangle, 59, 76, 82 Adjacent —s, 82 ♦n-rectangle of degree к, 76, 77, 78, 82, 99
348 P.G. Ciarlet n-simplex, 59, 66 Adjacent n-simplices, 72, 73 *Crouzeix-Raviart linear —, 104 ♦Cubic Hermite —, 84, 86 •Cubic —, 68, 69 * Linear —, 67, 68, 200 *— of degree k, 70, 99 ♦Quadratic —, 68, 69, 245 Reduced cubic Hermite —, 85, 86 Reduced cubic —, 71, 72, 99 Numerical implementation of the finite element method, 60, 111, 180 Numerical integration, 64, 183, 185, 192, 204, 208, 254, 298 Isoparametric —, 254 Numerical Quadrature, 186, 190 n-width, 140 0(/г") convergence, 114 Obstacle problem, 23, 47, 48, 173, 175 Optimality of the finite element method, Computational —, 180 Order of convergence, 60, 114 Parallelepiped finite element, 99 Parallelogram finite element, 99 Patch test, Generalized — of Stummel, 225, 308 Irons —, 210, 224 Penalty method, 181, 270, 311 Piecewise harmonic polynomial, 180 Piecewise smooth boundary, 256 P-interpolant, 95 P-interpolation operator, 59, 96, 100, 107, 216, 237 Plasticity, 52 Plate, Clamped —, 23, 54, 55, 274, 295, 296, 298 Energy of a —,55 Free edge of a —, 56 Simply supported —, 56, 63, 275, 298 Plate problem, Clamped —, 23, 54, 55, 56, 63, 274, 296, 298, 299 Conforming finite element method for the —, 274 Nonconforming finite element method for the —, 274, 298, 299, 300, 308, 311, 312 Simply supported —, 56, 63, 275, 298 Poincare-Friedrichs inequality, 32 Discrete —, 208 Poisson's coefficient, 54 Polygonal subset of R", 65, 137 Polynomial preserving operator, 121, 124 Potential energy of the exterior forces, 47 Principal lattice of order k, 70 Principle of virtual work, 47 Prismatic finite element, 95, 102 Problem, Abstract minimization —, 23, 24 Abstract variational —, 23, 28, 29 Adjoint —, 142 Biharmonic —, 56, 275, 302, 309, 312 Boundary value —, 37, 56 Clamped plate —, 23, 54, 55, 56, 63, 274, 295, 296, 298, 299 Elasticity —, 23, 46, 56, 63, 65, 210, 214 Elasto-plastic torsion —, 51, 52, 179 Elliptic boundary value —, 24, 56 Fourth-order boundary value —, 56, 65, 273 Free boundary —, 50, 52 Homogeneous Dirichlet ■—, 37, 43, 54, 56, 63, 184, 185, 209, 218, 250 Homogeneous mixed —, 43 Homogeneous Neumann —, 40, 43, 63, 206 Membrane —, 23, 38, 47, 56 Minimization —, 23, 24, 27, 35, 56 Mixed —, 42, 181 Navier-Stokes —, 60 Nonhomogeneous Dirichlet —, 37, 146, 180 Nonhomogeneous mixed —, 43 Nonhomogeneous Neumann —, 40,41,43,63 Nonlinear — 27, 46, 47, 48, 50, 52, 53, 60 Obstacle —, 23, 47, 48, 173, 175 Plate —, 23, 54, 55, 56, 63, 274, 275, 296, 298, 299 Second-order boundary value —, 56 Shell —, 60 Signorini —, 50, 180 Simply supported plate —, 56, 63, 275, 298 Stokes —, 54 Union Jack —, 308, 309 Variational —, 23 Well-posed —, 30 Quadratic functional, 27 ♦Quadratic n-simplex, 68, 69 ♦Isoparametric —, 227 ♦Quadratic tetrahedron, 69 ♦Quadratic triangle, 69, 74, 87, 196, 225 ♦Isoparametric —, 232 Quadrature error estimate, 198, 261 Quadrature error functional, 186, 254, 261 Quadrature formula, 183, 186 Example of a—, 187, 188, 189 Node of a—, 183, 186,254
Subject Index 349 — exact for a given function space, 186 Weight of a — 183, 186, 254 Quadrilateral, ♦Biquadratic —, 235, 236, 249 *4-node —, 235, 249, 270 *Fraeijs de Veubeke-Sander —, 286 ♦Reduced Fraeijs de Veubeke-Sander —, 288 ♦Wilson's — 225 Quadrilateral finite element, 95, 235, 236, 250, 271 ♦Quintic Hermite triangle, «*- —, 102, 277 Quintic triangle, ♦<Г- —, 88 ♦Reduced <Г- —, 90, 91 Rational finite element, 250 Rational function, 250, 269, 288, 294 Rectangle, ♦ACM —, 300 ♦Adini-Clough-Melosh —, 274, 298, 300, 302 ♦Bicubic —, 78, 79 ♦Bilinear —, 77, 222 ♦Biquadratic —, 77, 79 ♦Bogner-Fox-Schmit —, 92, 99, 108, 275, 297 ♦«"-bicubic — 92 ♦8-node —, 81 ♦4-node —, 77 ♦Isoparametric biquadratic —, 236 ♦9-node — 77 *n— of degree 1, 77 *n— of degree 2, 77 *n-— of degree 3, 78 ♦Reduced bicubic —, 81, 82, 99 ♦Reduced biquadratic —, 81, 82, 99 ♦12-node —, 81, 82 ♦Wilson's —, 214, 224 Rectangular subset of R", 82 Rectangular finite element, 59, 75, 82, 94, 101, 214 ♦Reduced biquadric rectangle, 81, 82, 99 ♦Reduced Birkhoff-Mansfield triangle, 294, 295 ♦Reduced cubic tetrahedron, 72, 85 ♦Reduced Fraeijs de Veubeke-Sander quadrilateral, 288 ♦Reduced Hsieh-Clough-Tocher triangle, 285, 297 Basis function of the —, 286 ♦Reduced singular Zienkiewicz triangle, 293, 297 Reference finite element of an affine family, 59, 98, 101 Reference finite element of an isoparametric family, 230 Regular family — of biquadratic quadrilaterals, 249 — of finite elements, 116, 128, 131, 134 — of 4-node quadrilaterals, 249 — of Fraeijs de Veubeke-Sander quadrilaterals, 286 — of Hsieh-Clough-Tocher triangles, 282 — of isoparametric quadratic n-simplices, 241, 246, 258 — of isoparametric cubic n-simplices, 241 — of triangulations, 131 — of triangulations made up of quadratic n-simplices, 241, 258 Regularization by convolution, 133 Regular second-order boundary value problem, 142 Regularity of the solution of a boundary value problem, 53, 142, 297 Rellich's theorem, 42, 119 Richardson's extrapolation procedure, 181 Ritz method, 61 Second-order boundary value problem, 56 Second Strang lemma, 209, 212, 213 Serendipity finite elements, 82 Set of degrees of freedom of a finite element, 67, 94, 95, 96, 103, 105 Set of degrees of freedom of a finite element space, 59, 72, 105 Set of nodes of a finite element, 95 Set of nodes of a finite element space, 105 Shell problem, 60 Side of an n-rectangle, 76 Side of an n-simplex, 66 Signorini problem, 50, 180 Simplicial finite element, 59, 65, 94, 101 Simplicial isoparametric finite element, 234 Simply supported plate, 56, 63, 275, 298 ♦Singular Zienkiewicz triangle, 273, 288, 289, 297 ♦Reduced —, 293, 297 Singular finite element, 273, 279, 288, 293 Singular right-hand side, 181 Singularities in the solution of a boundary value problem, 181 Singularity, Corner —,181 Sobolev imbedding theorem, 119 Sobolev space, 23, 31, 32, 116, 118
350 P.G. Ciarlet Fractional order —, 134 Spline theory, 134 Stability condition, 197 Stiffness matrix, 63 Stokes problem, 54 Straight finite element, 65, 94, 227, 273 Strain energy, 47 Strain tensor, 46 Stress function, 54 Stress tensor, 46 Stammers generalized patch test, 225, 308 Subparametric finite element, 230 Sufficiently smooth boundary, 33 Superconvergence, 181 Support of a basis function of a finite element space, 59, 64, 74, 106 Taylor formula, 31 Multipoint —, 130, 134 Tetrahedral finite element, 94 Tetrahedral isoparametric finite element, 234 Tetrahedron, ♦Crouzeix-Raviart linear —, 104 *Cubic Hermite —, 84 ♦Cubic —, 69 ♦Hsieh-Clough-Tocher —, 286 "Linear —, 68 ♦Quadratic —, 69, 225 ♦Reduced cubic —, 72, 85 *3-rectangle of degree 1, 77 *3-rectangle of degree 2, 77 *3-rectangle of degree 3, 79 Total potential energy, 47 Trace operator, 33 Transmission conditions, 50, 52 Triangle, ♦Argyris —, 88, 90, 91, 99, 108, 109, 111, 112, 273, 275, 276, 297 •Bell —, 90, 91, 99, 108, 275, 279, 297 ♦Birkhoff-Mansfield —, 294 *Courant's —, 68 ♦•if-quintic Hermite — 102, 277 ♦^-quintic—, 88 ♦Crouzeix-Raviart linear —, 104, 105, 225 ♦Cubic Hermite —, 84 ♦Cubic —, 69, 74, 196 ♦Fraeijs de Veubeke —, 298, 309, 310 ♦HCT —, 279 ♦Hsieh-Clough-Tocher —, 273,279,280,281, 282, 283, 286, 296, 297 ♦Linear —, 68, 156 ♦Morley — 298, 309 ♦Quadratic —, 69, 74, 97, 196, 225 ♦Reduced Birkhoff-Mansfield —, 298, 299 ♦Reduced «''-quintic, 90, 91 ♦Reduced cubic Hermite —, 85 ♦Reduced cubic —, 72, 196 ♦Reduced singular Zienkiewicz —, 293, 297 ♦Singular Zienkiewicz —, 273, 288, 289, 297 ♦21-degree of freedom —, 88 ♦Zienkiewicz —, 85, 298, 308 Triangular finite element, 94, 128 Triangular isoparametric finite element, 234 Triangulation, 59, 61, 65, 73, 74, 82, 83, 251, 253 Automatic generation of —s, 60 Quasi-uniform family of—s, 135 Regular family of—s, 131 Regular family of —s made up of quadratic n-simplices, 241, 258 Tricubic polynomial, 294 Trigonometric approximation, 134 ♦12-node rectangle, 81, 82 ♦21-degree of freedom triangle, 88 Underintegration, 208, 270 Uniform convergence, 144, 154, 155, 171, 179, 270, 298, 312 Uniform Kj-ellipticity of approximate bilinear forms, 183,192,209,212,219,224, 228,256, 258, 264, 302 Unilateral constraint, 180 Union Jack problem, 308, 309 Unisolvence of a set of degrees of freedom, 94,95 Unit hypercube in R", 76 Unit n-simplex, 101 Unit square, 78 Variational crime, 271 Variational equations, 28 Variational formulation of a minimization problem, 23, 28 Variational inequalities, 24, 27, 28, 64, 117,173, 180 — of order four, 298 Variational problem, Abstract —, 23, 28, 29 Nonlinear —, 60 Well-posed —, 30 K-elliptic bilinear form, 23, 24 Vertex of an isoparametric quadratic n-simplex, 232 Vertex of an n-rectangle, 76 Vertex of an n-simplex, 66 Volume coordinate, 67
Subject Index 351 Von Mises criterion, 52 *Wachspress rational finite elements, 250 Weight function, 156 Weight of a quadrature formula, 183, 186 Weighted norm, 156, 163 Weighted seminorm, 156 Well-posed problem, 30 ♦Wilson's brick, 210, 214, 216 •Wilson's quadrilateral, 225 *Wilson's rectangle, 214, 224 AVinterpolant, 107, 260 .^-interpolation operator, 59, 107, 260 Young's modulus, 54 Zenisek's result, 93, 279, 296 "Zienkiewicz triangle, 85, 298, 308 ♦Modified —, 308 •Singular —, 273, 288, 289, 297 ZlamaFs condition, 128, 134
Local Behavior in Finite Element Methods Lars B. Wahlbin Department of Mathematics and Center for Applied Mathematics White Hall Cornell University Ithaca, NY 14853, USA Supported by the National Science Foundation, U.S.A. and by the Army Research Office through the Mathematical Sciences Institute at Cornell. HANDBOOK OF NUMERICAL ANALYSIS, VOL. II Finite Element Methods (Part 1) Edited by P.G. Ciarlet and J.L. Lions © 1991. Elsevier Science Publishers B.V. (North-Holland)
Contents Chapter I. Introduction 357 1. Foreword 357 2. Perspective: Local behavior in Fourier series 359 3. Prologue: Local behavior in one-dimensional spline-L2 projections. Exponential decay 365 4. Prologue continued: Local behavior in one-dimensional elliptic projections. Negative norms 372 5. Prologue concluded: Superconvergence and local averaging in one-dimensional elliptic projections 381 6. Overview: Capsule summaries of each section 389 Chapter II. Local Behavior in Finite Element L2 Projections 395 7. Locally quasi-uniform meshes 395 8. Less regular meshes in two-dimensional Lagrangian elements 399 Chapter III. Local Estimates in the Finite Element Method for Second-Order Elliptic Problems, with Applications 403 9. Local estimates in energy and L2 norms 403 10. Local pointwise error estimates 409 11. Problems with a rough right-hand side 416 12. A model problem on a plane polygonal domain with a reentrant corner: Unrefined meshes 421 13. Sharpness of the results of Section 12: Pollution 425 14. A priori and adaptive mesh refinements 428 15. Stress intensity factors, and better a posteriori approximations via the Trigonometric Fitting Method 436 16. Local behavior when singular functions are added 442 17. Local behavior in problems with rough coefficients: Pollution 446 18. An example of global pollution with numerical integration 449 Chapter IV. A Singularly Perturbed Elliptic to Elliptic Model Problem 451 19. The model problem 451 20. The finite element method and its local behavior in energy 452 21. Local pointwise error estimates 457 22. Some concrete estimates using Theorem 21.2 463 Chapter V. A Singularly Perturbed Convection-Dominated Model Problem 467 23. The model problem 467 24. The streamline diffusion finite element method 471 25. Numerical crosswind and downwind spread 476 26. The numerical crosswind spread: An alternative proof 480 27. Pointwise local error estimates 483 355
356 LB. Wahlbin Chapter VI. Time-Localized Behavior in Second-Order Parabolic Problems 491 28. A linear model problem: Full advantage of the smoothing effect 491 29. A nonlinear model problem: Restricted advantage of the smoothing effect 496 Chapter VII. Superconvergence 501 30. Difference quotients on translation-invariant meshes for approximating derivatives 501 31. Higher-order local accuracy by averaging: The K-operator 506 32. Miscellaneous results 508 References 513 Subject Index 521 Note added in proof This article reflects my knowledge as of December 1986. Only minor corrections have been made during proof-reading.
Chapter I Introduction 1. Foreword The purpose of this article is to survey what is mathematically known about local behavior in finite element projection methods. The purpose of this introductory chapter is to display some relevant questions, sidelights, and techniques, in very simple one-dimensional cases. When facing various types of singularities one is often forced to consider the local behavior of an approximation method. This situation is eloquently summed up in the last section of Oliger [1978, Section 5, pp. 105-106], in his article from an AMS short course in numerical analysis (italics mine): In closing, I want to point out an area of research activity which involves detailed mathematical analysis of the solutions of approximate methods. In our discussions up to this point we have only considered approximations of smooth solutions. However, many problems of interest do not have smooth solutions—they have contact discontinuities, shocks, etc. It is necessary to know how our methods behave in the neighborhood of such discontinuities and whether or not the resulting effects are local or global. That is, if something "funny" happens near a discontinuity, will this pollute our answers everywhere, or not? Most of the studies regarding these questions are carried out on simple model equations This is an area of research activity where the tools of classical and modern analysis are providing significant results, where questions are abundant and answers are rare. An archetypical question is the following: The problem at hand contains isolated singularities and we know, a priori, that our approximation method cannot resolve these singularities (to resolve them may be too costly, we do not know where the singularities are, or even what is) their nature. Assuming then that we have given up on resolving the singular behaviors, can we at last assert how good our approximation is in regions of smooth behavior? Can we precisely account for the spread of errors emanating at the singularities into smooth regions? And, can the analysis indicate an efficient way of resolving the singularities, e.g. by mesh refinement, inclusion of special functions mimicking a singularity, tracking of its unknown location, or by a posteriori processing? Regrettably, "... questions are abundant and answers are rare." 357
358 L.B. Wahlbin Chapter I The term "pollution" may be given the following meaning: On a subdomain A of the basic domain, the function sought is smooth and, in principle, approximable to some optimal order 0{hr). However, the projection method under consideration gives a lower rate of convergence on A. Some theory is known. As an example the reader may ponder the following: Why is it that finite element solutions with unrefined meshes of a two-dimensional cracked torsion problem lead only to first order accuracy in function values away from the crack, regardless of the order of the finite elements used, whereas when approximating the Green's function in an uncracked problem (with a smooth boundary), while still not refining the meshes, we have essentially the best possible order of accuracy that the finite elements allow away from the point of singularity. After all, the Green's function singularity is worse than the crack singularity! This is a question that present theory can elucidate. Furthermore, present theory can tell how to do an economical mesh refinement, e.g. in the cracked torsion problem, either a priori or adaptively a posteriori. The refinement suggested by theory often looks "dramatic" and therefore many an eyeball refinement ends up being not daring enough and, in the end, uneconomical. (Cf., e.g., Figs. 14.2 and 14.3 below.) Another, rather distinct, question about local behavior is that of "supercon- vergence": Are there identifiable points where the rate of convergence can be proven to be better than in general? Or, is there some simple a posteriori processing that gives better rate of convergence at selected points? Again, answers are rare. I now remind the reader that "most of the studies regarding these questions are carried out on simple model equations". Amplifying this point, I quote from Kreiss and Oliger [1973, Introduction]: We have developed many concepts and carried out many analyses by studying model equations. We find that most computational difficulties are linear effects and can be studied in simple situations where a detailed analysis is possible. We stress the importance of this technique. An adequate, rigorous analysis is usually practically impossible for the large nonlinear models; computational difficulties are apt to be wrongly ascribed. This can easily lead to a large and incorrect folklore which can steer future research in the wrong direction. There are certainly many examples of this in the past. This is not to say that there are no pitfalls inherent in this technique. Great care must be used in selecting model equations and in extrapolating conclusions to more complicated phenomena. However, this is an invaluable tool to isolate and analyze phenomena. The impracticality of a complete, rigorous analysis in complicated situations is easily understood: You have to combine, say, four model situations (sharp front not resolved, numerical integration, approximate treatment of boundary conditions and approximate, fast, solution of matrix equations, e.g.). Each of the basic papers in model situations cover, say, 30 pages. A fully combined proof will end up being excessive in length.
Section 2 Introduction 359 When wishing to apply a typical result to a specific situation, then, the First Law of Applied Mathematics is in effect: Nothing Fits. The treatment of boundary conditions at curved boundaries is different, numerical integration is not taken into account, the result is proven for triangular elements but not for quadrilaterals, for strictly polynomial shape functions but not for isoparametric ones, the result is proven only for strictly interior subdomains but not for subdomains contiguous to the boundary of the basic domain, for Runge-Kutta time stepping methods but not for linear multistep methods, etc. (Sometimes results are given in a model situation which contains generically conflicting hypotheses, "for simplicity". A common case is that of a basic polygonal domain where the solution is as smooth as required, affectionately known as the "smooth polygonal" case. Another common pitfall is that of assuming min \\u-x\\i.2.Q^Chr~1\\u]\r,2,o for general r, general curved boundaries and imposed essential boundary conditions.) It is important to understand the basic techniques of proofs concerning local behavior. Once the underlying ideas are comprehended, one may extend the known results to a model situation which contains the salient features of interest. It is therefore my intention to elucidate the techniques of proof in this survey. The rest of this Introduction will be devoted to a brief overview of results and techniques for Fourier series and spline projections with respect to L2 and Я1 norms in one-space dimension. The aim is to contrast the three cases in order to appreciate the local nature of the spline projections, and also to give an introduction to important techniques for proving local results about finite element projections. We end with a section on identifiable superconvergence points in one dimension, and also an overview of the article. I will, no doubt, commit errors of omission of relevant results, errors of misattributing credit, as well as factual errors in giving results and proofs. I would appreciate having such errors brought to my attention so that I can correct them in future editions of the Handbook. 2. Perspective: Local behavior in Fourier series The finite element methods we shall consider are of Petrov-Galerkin, most often Galerkin, type. Thus, in simple cases, they are methods of projection with respect to some norm. To put the present survey in perspective, we shall briefly consider the local convergence in Fourier series; in Section 3 we shall contrast the results with those for spline-L2 projections and, in Section 4, with Ht projections. The well-known controversy in the eighteenth century as to whether a plucked string may be represented by its Fourier series is, perhaps, the first example of a typical local problem in projection methods. For our present purposes it is appropriate to pose three questions:
360 L.B. Wahlbin Chapter I Can the plucked string be represented by any trigonometric series? B.1) How does the Fourier series behave near the singularity? B.2) How does the Fourier series behave in smooth regions? B.3) In the finite element context the analogue of question B.1) is frequently easy to settle. Various interpolants or quasi-interpolants are of a highly local nature which can or cannot be fitted to a particular singularity and, if they can, the question is answered in the positive. If the interpolant cannot fit the singularity, it is often easy enough to show that no function from the finite element space employed can. (Of course the word "fit" above implies questions of asymptotic convergence.) The analogues of questions B.2) and B.3) are often much harder in the finite element context. In the Fourier series context a high point in elucidating the local behavior came with Riemann's Habilitationsschrift, Riemann [1854], also known by his charming prelude that, although he knew that the Fourier coefficients were given by integrals, he did not quite know what an integral was, and hence desired to discuss that point. His localization principle asserts that, provided the function is globally in L1 (in modern terms), convergence at a point is determined only by the behavior of the function approximated in any open (periodic) neighborhood of that point. The questions B.1) - B.3) are not asked with sufficient precision for modern purposes. It is necessary to specify what error measure is used and, at least, what is then the asymptotic rate of convergence. Otherwise, the results would give little practical guidance to a computer user. (Some researchers impose more rigorous standards of precision, in particular, they take issue with the very concept of asymptotic rate of convergence. As an example I quote from Fichera [1978, p. 10] in his critique of the investigations of Bramble and Osborn [1973]: Bramble and Osborn can only claim to have "ultimately" estimated the rate of convergence of /4B) towards ц. But the concept of an ultimate estimate (i.e. valid starting from some unspecified n0) is, indeed, meaningless in numerical analysis! In order not to end this article prematurely, asymptotic estimates with constants that are impossible to realistically secure, except by numerical experimentation, are admitted in this article. Such asymptotic estimates constitute an invaluable debugging tool, for example.) We shall next consider the selection of an error measure in the context of local error behavior near a singularity. We shall do so in the situation of a simple example of a stepfunction. Let thus u(x) denote the 27t-periodic function defined by JO for -^7С<Х<^71, uix) — < „ , , , B.4) [1 for —я<х<—£я; or jn<x^n. The Fourier series is 1 2 (cos x—-jcos Зх+ycos 5x—•■•) B.5) 2 n
Section 2 Introduction 361 and we let u„(x) stand for the nth truncated Fourier series. Let further V„ denote the trigonometric polynomials of degree n. Let now A denote any open interval of Q=(—п, к) containing the two rough points, x = + £n. Assuming that we desire to measure the error as the maximum over A, i.e., as \\u—un\\XfA, it is clear that we do not have convergence, since even inf \\u-v„\\0,<»,A>l B.6) v„eV„ We try instead an error measure more forgiving to approximating discontinuous functions by continuous ones, let us say the Hausdorff graph measure. For the convenience of the reader I will recall its definition. First, for the discontinuous function such as u(x) of B.4), we define its augmented plane graph AGu by vertically filling in the jump discontinuities. ~>x (AGu) -i^t о iit For a continuous function, the augmented graph is the ordinary one. For points P = (xP,yP), Q=(xQ, yQ) in the plane, set dist(P,Q)=meLx(\xP-XQ\,\yP-yQ\). The Hausdorff graph measure of the distance between и and v over the interval A is then H{u,v;A):=max.( max min dist(P, Q), max min dist(P, Q)) B.7) \ Pel/ QeV QeV PeU J where U denotes the portion of AGu over the x-interval A, and similarly for V. From Sendov [1969] it is known that there exists vneV„ such that H(u,vn;Q)=o(^\. B.8) However, for the Fourier series u„ the classical Gibbs phenomenon tells us that for any open interval A containing — j% or %n, H (u,un; A) = 0A) B.9) and not better. Thus, approximation of this step function is in principle possible in the Hausdorff measure to order (In n)/n but the Fourier series does not converge in Hausdorff measure near ±iir. Various error measures in which the Fourier series for the step function и does converge near the singular points include any U measure for p < oo, any negative norm We are now advised that results concerning local behavior in regions including singularities may dramatically differ according to the error measure chosen. In our finite element situation a more typical problem, at present, is to consider the error in smooth regions, away from singularities. In the Fourier series context, Riemann's localization principle tells us that we have pointwise convergence in such smooth regions (provided the function approximated is globally in L1). However, consider the step function u(x) of B.4) and its Fourier series given in B.5). The
362 LB. Wahibin Chapter I interval A>=[—i,i] is certainly an interval of smoothness for u; «=0 there! But, as is easily seen from B.5), 1|м-"л|1о,^ = 0(п-1) B.10) and not better. In fact, it is easy to construct examples for which, in regions where the approximating function vanishes identically, its Fourier series converges arbitrarily slow, see Wahlbin [1985, Example 6.1]. It is now a challenge to determine the underlying principles of how the global deportment of a function approximated determines the rate of convergence of its Fourier series in smooth regions. We shall first give an almost trivial result, cast in a form that can be readily compared with the corresponding result for spline-L2 and Я1 projections (to be given in Sections 3 and 4). To fix notation, let u{x) denote a 2n:-periodic function to be approximated and let u„(x) denote its nth degree Fourier series. Let Xn £ Vn denote an arbitrary trigonometric polynomial of degree at most n. Further, introduce the Dirichlet kernel Dn(r)=~sin((n + iW/sini£. B.11) Then, fixing a point x0 of interest, (u - un)(x0) = (u-x„){x0) + (Xn - un)(x0). B.12) Since the nth Fourier approximation of x„ e V„ is itself, we have as is well known, iln - wB)(x0) = Яп(*о ~ t)(Xn ~ u)(t) dt. B.13) At this point another question of precision enters: we wish to explain the rate of convergence of the Fourier series for u, at x0, in terms of the local behavior of и around x0, and the global (periodic) behavior of u. (Certainly, as argued above, both these factors need to be taken into account.) To make those two factors precise, introduce a parameter d, where (always interpreted in the periodic sense), Ad = [x0-d,x0 + d] B.14) will be the interval around xQ where local behavior of и will be taken into account. Assume for the moment that d^n'1. B.15) Then from B.13) (again in the periodic sense), \(Xn~u„){x0)\ \D„(x0-t)(In-u)(t)\dt ^ \Dn{x0-t)(xn-u)(t)\dt + + J \D„(x0-t)(x„-u)(t)\dt. B.16)
Section 2 Introduction 363 From B.11) it is easy to estimate the Dirichlet kernel in the various regions involved in B.16). In the first integral, |DB(x0 —f)|<C«, for the second, d ~dx^Cln{nd). x \D„(x0-t)\dt^C л~'«|хо-(|«<1 л Finally, in the third integral of B.16), we use the estimate \D„(t-x0)\^C/d in that region. Thus, using the L00 norm for %„-u in the first two integrals in B.16) and the L1 norm in the third, and combining with B.12), and letting fi = [—я, я] denote the global interval, l("-"n)(*o)l ^ + C\in(nd)\)\\U-Xn\\o,cc,Ad+(C/d)\\u-x„\\o,un, any XneVn. B.17) (It is trivial to extend this result to the case d^n-1.) For future reference we state B.17) in precise form. Theorem 2.1. Let и be a In-periodic function and let un be its nth-order Fourier approximation. There exists a constant C, independent of n, such that if Ad = [x0 — d,x0 + d~\ {in the periodic sense), and Q = [—%b%],for any %„ a trigonometric, polynomial of degree at most n, |(«-M,)(xo)l<(l + C|ln(nd)|)||«-Zll||o.e).^ + (C/d)||ii-Zl,||o.1jj. B.17') We have here a trivial result which gives the rate of convergence for the Fourier series at a point x0 in terms of how the basic function can be approximated by any trigonometric polynomial of degree n, viz., %n. We point out that the estimate B.17) involves the local approximability of и on the neighboring domain Ad in a more severe error measure (viz., the maximum norm) than that which measures the global influence (viz., the global L1 deviance). Another, most important, feature of B.17) is the following. If we try to be more lenient about taking the local behavior of н into account, that is, we decrease d, the more prominently the global L1 term enters due to the factor Cd'1. This balance between local and global influences to convergence is typical also of the finite element situation. To use B.17') in order to predict the rate of convergence at x0 in any specific situation, one must select a suitable trigonometric polynomial %„■ In order to give a simple result, introduce the pth modulus of continuity in a Banach space В with step t, cop(u, t;B) = sup || J?u\\B B.18) where ^f is the pth undivided difference, A?u(x) = {Tx-iyu(x), 7>(х)=ф + т). B.19)
364 L.B. Wahtbin Chapter I Theorem 2.2. Given natural numbers p and q, there exist a constant С such that the following holds: There exists %„ e Vn such that Wu~Xn\\o,a,,Ad <Cffl'U^^;L»(^+c(ta.w-)) + C«"f(ll«llo.a,.^+ll«|lo.i4») С2-20) and \u-Xnh,i,Q^CcoPl u,^^-;L1{Q) ) + Cn-«||M||0.i.n- B.21) Sketch of Proof. The proof follows traditional lines. We shall use a very localized trigonometric kernel k„(t) due to Noble [1954], cf. Bary [1964, Lemma, p. 270]. The kernel has the following properties: With a uniform constant С = C(q), \k„(t)\^Cn/\nn, all t, |fc„(r)| s=Cn~« tor C{\nn)/n^\t\^Ti, B.22) B.23) fe„(£)df = l, B.24) \k„(t)\dt^C. Define then for any p, B.25) Xn(x)-- kn(t)Y Ljuix + ltjdt B.26) which is a trigonometric polynomial of degree n, cf. Lorentz [1966, p. 58]. Then Я Л u(x)-x„(x)= kH(t)A?u(x)dt. B.27) ~K For xeAd, \(u-Xn)(x)\< \kn(t)Ufu{x)\dt+ \kn(t)\\^u{x)\dt |г|5=СAпп)/и |f|JsC(Inn)/n which by use of B.22) and B.23), B.25) gives B.20).
Section 3 Introduction 365 For B.21), Iw-xJi.fi^ \k„(t)Afu{x)\dxdt ■•• + |l|«C(lnn)/n |t|>C(lnn)/n which gives B.21) by use of B.22), and B.23). □ Combining Theorems 2.1 and 2.2, we see that in order to guarantee a rate of convergence of almost order p ((In ri)p+ 1n~p to be exact) for the Fourier series at Xq, it is necessary that u has essentially p derivatives in the L°° sense on an interval around x0, but also that и has essentially p derivatives globally in the L1 sense. The reader may readily apply the two theorems to the stepfunction of B.4), taking x0 =0 for instance. The prediction is, with p = 1, q = 2, that the convergence is as 0((ln ri)jn) which is sharp apart from the logarithmic factor. 3. Prologue: Local behavior in one-dimensional spline-L2 projections. Exponential decay Theorem 2.1 gives a result on how the local behavior in Fourier series is influenced by the mixture of local and global effects in a "best possible" approximation by trigonometric series. Our goal in this section is to give two "analogues" of Theorem 2.1. One, Theorem 3.1, is a rather straight analogue, but does display that spline-L2 approximation is appreciably more local than Fourier series approximation is. Then, Theorem 3.3 gives a further localization, of considerable interest for our future purposes. (The "analogue" of Theorem 2.2, referring to straight approximation theory, is well known in the spline case and stated as Theorem 3.2 below merely for completeness; again, though, it is considerably more local than its Fourier counterpart.) To amplify this point, in every comparison spline-L2 projection will turn out to be much more local than Fourier series, i.e., L2 projection into trigonometric polynomials. We shall use the fundamental techniques of Nitsche and Schatz [1972] and since some essential ideas will recur throughout this article we shall give a fair amount of detail. Our aim is not, however, to give the best possible results; this would swell the exposition to unmanageable bounds. We first need to establish some notation. Let Q = [ — 1,1], say, and let Th denote a family of interval subdivisions of O, U= U Tlt Т, = [х„х,+ 1], i = 0,..., N, with x0= —1, xN+1 — l. The parameter h can be thought of as
366 L.B. Wahlbin Chapter I (proportional to) N \ Let Vh denote spline spaces on Th. With integers ц and r, -U/*<r-2, C.1) we define Vh = {veWiH): v is a polynomial of degree r— 1 on each interval 7}}. C.2) Given a function м on Q we let uh=Phue Vh denote its L2 projection into Vh so that (u-Phu,v) = 0 (oweVh, C.3) where (f,g)= $ofg. As a first indication that convergence may be more locally determined than in the case of Fourier series, consider the case //= — 1 so that Vh consists of unconnected piecewise polynomials. Clearly then Phu is completely determined on each subinterval Tt by the behavior of и only on that T{. At present we do not wish to consider local properties on a scale smaller than that of a mesh interval. The properties of L2 projection into full polynomials on an interval, that is, Legendre series, have been extensively studied; a facile investigation of their local properties from the present points of view is contained in Wahlbin [1985]; in the case of discontinuous splines this amounts to a "microlocal" analysis. (We shall return to such microlocal, or subscale, analysis only in connection with superconvergence points.) For piecewise polynomials that are at least continuously connected across mesh points xh i.e., fi^O, Pku suffers some influence from и from outside intervals. To simplify our analysis of the situation (cf. references at the end of this section) we assume that our family of triangulations Yh is quasi-uniform, i.e., that there exist positive constants с and С independent of h such that c/i^measTj^C/i, i = 0,...,N. C.4) The assumption C.4) implies the following inverse property. Its proof is well known and can be found in any basic text on finite element methods. Lemma 3.1 (Inverse property). There exists a constant С independent ofh such that for any mesh interval Th l|o|L.,.r(^Cfc---<1/«-1/rt||»||oAri forveV„, l^q^p^n. C.5) The next result represents the basic technical tool introduced by Nitsche and Schatz [1972]. It is often referred to as "superapproximation" and is still the most important general tool. To describe it we need some notation. For intervals AqSA^Q set 8<(^0M1) = distF^0\6f2,6^1\3fi) C.6) i.e., the minimum distance between endpoints of A0 and Ax, not counting endpoints
Section 3 Introduction 367 coinciding with dQ. Let further «,?(I) = {t)e^co(l):a<(suppz;,/l)>0} C.7) and V^(A)^{veVh:d<(snpvv,A)>0}. C.8) Lemma 3.2 ("Superapproximation"). There exist constants с and С such that the following holds. Let A0^AX ЕЙ be intervals with d = d<(A0,A1)^ch. Let о>е<^5(Л0) with Nlltco.Ao^^"'» 1 = 0,..., Г. C.9) Then for any xeVh there exists феУ^А^ such that \\ыХ-Ф\\о,2,а1^СЛ№)\\х\\о,2,а0- C.10) Proof. We first consider the case of %?° splines, i.e., ц — 0 in C.2). We may then for any function ve^iTi) construct its {r — l)st-degree polynomial interpolant TIhv on T, using r equispaced interpolation points, including both endpoints of Тг. By the Bramble-Hilbert lemma, Bramble and Hilbert [1970, 1971], 11»-Я4»||§>2>Г|^СЛ2г||рм||§.2,г,. C.11) For юеЩА0) the combination of those local interpolants result in a function in Vh(^i) provided с is large enough; we denote it still by TIhv. Further with 1Н1*,Ло=( I ИГ2.2.Т,У/2 C.12) denoting the piecewise Hr(A0) norm, C.11) gives W^nhvU,2,Ai^Ch2lv\\2h;r,Ao. C.13) To prove C.10) we take \jj — IIh{oix); it remains to calculate h2r\\oix\\l;r,A0- By Leibnitz' lemma and since Dr%=0 on each Tt, this can be estimated by a sum of terms of the form /22l(Da<a)(Z>^)|lb,rj, a + /J = r, /J<r-1. By C.9) and the inverse property of Lemma 3.1 each such term is estimated by CA2d-2W-t>)\\X\\l2,Ti and for h/d^l, this quantity is bounded by CA2(h/dJ1| x II Ь,2.тг Summing over all Tt involved gives C.10) in the case ц = 0. The case of general ц is similar. An analogue of C.13) is needed and to construct it we first recall the quasi-interpolation operator of de Boor and Fix [1973]: Set Li — LXmax@,i-r+l), ^min(W+l,i + r)] • C.14)
368 LB. Wahlbin Chapter I There exists a constant С and a family of operators IIh:W-*Vh such that \\v-nhv\\l2<Ti^Ch2'\\v^\\l2,ft. C.15) Taking v = a>x we would need to convert the norm on the right of C.15) into a piecewise norm. This is done by the following trick from Douglas, Dupont and Wahlbin [1975a]: With each interior mesh point xt associate the functions Vik(x)—(x — xtf+, к = ц + 1,..., r— 1. Starting at left we determine a linear combination w of the vik so that a>x + wee&r~1r\Hr. Set then \ji=nh(o)x + w) — w which is clearly in Vh. By C.15), \\Q>X-^\\i2,Ti=\\(oiX + w)-nh((ox + w)\\lZrT( < Ch2'\\ (ox + хГ Ilk- = Ch2'\\ ((охГ Who and we have obtained a proper estimate involving only the piecewise r derivatives. The proof is concluded as before. □ We now wish to consider the error (u—Phu)(x0). Stressing the analogy with the Fourier case we proceed as follows: For any #ле Vh, (u-Phu){x0) = (u-Xh)(xo) + Ph{Xh-u)(x0). C.16) Assume that x0e %. Let 6° be a polynomial of degree r— 1 on Tt such that vd° dx = t^x0) for any v a polynomial of degree r— 1. C.17) Tt Regard 5° as extended by zero to Q. It is clear from simple considerations involving mapping Ti to a unit interval that |5°(x)I^Cfc_1 forxei;. C.18) Consider now the second term on the right of C.16). We have Рн{Хн-и)Ы = {Рь{Хь-и),60) = (Хи-и,Рьд% C.19) Thus, dl-=Ph8°eVh here takes the role of the Dirichlet kernel in the Fourier case. We shall next give an estimate for 3% ■ For future purposes we state a preliminary result in greater generality. Lemma 3.3. There exist positive constants c0, c1 and С such that the following holds. Let A^AdsQ with d = d<(A,A<i)'5::Coh. Assume that vheVh is such that (»*,Z*)=0 forallXheVZ{Ad). C.20) Then IMo^^Ce-'^IIMo^,. C-21) Proof. With the constant с from Lemma 3.2, let ch-3; let A£A6s.A2S with 8 = d<(A,Ai)=d<(Ai,A2g). Let a>e'e<(As) be a nonnegative function with ||<»||,i00<^-', 1 = 0,..., r, C.22)
Section 3 Introduction 369 and with cos 1 on A. C.23) Then \\vh\\2o,2,A^(vh,covh). C.24) Assuming now A2d GAd, we have (vh, ф)-0 for any %e V^(A2g) and thus \\vh\\o,2,A^(vh,o)vh-x)- C.25) Using Lemma 3.2, \\vh\\2o,2,A<C(h/S)\\vh\\l2,A26. C.26) The argument can clearly be repeated with domains A2g £ A3S £ А4E £ • ■ • £ Л 2JW as long as they are inside Д,, i.e., since <5 = ch, N = 0(d/h). C.27) Assuming, as we may, that Ch/5 < 1, C.26) leads to the result C.21). □ Remark 3.1. The quasi-uniform condition C.4) is demanded in Lemma 3.3 only for those Г( that meet Ad. Applying C.21) to vh = db, using that, by the inverse property of Lemma 3.1, 15A°(x) | <C/T1/21| <5л° || о,2,г,, xeTj C.28) and that by the stability in L2 and C.18), 11^0|1о,2,й<11<5°||о,2,й = СЙ-1/2, C.29) we have Lemma 3.4. Assume that C.4) holds. Then there exist constants с and С such that \5°h(x)\^Ce-c^'xoVhh-1, C.30) or, after a change of constants, .С/Г1, \x-x0\^h, №)l=< с . lft {331) -e-el*-«l/*, |х-х0|^й. \x-x0\ Thus the kernel in spline-L2 projection is considerably more localized than the Dirichlet kernel in Fourier series, for which the estimate corresponding to C.31) is (Cn , |x-x0|=$n_1, |£„ (х-х0)|<{ С i i^-i |x — x0| We note incidentally the following consequence of Lemma 3.4.
370 L.B. Wahlbin Chapter I Lemma 3.5. Assume that C.4) holds. Then the L2 projection is stable in any If norm, Proof. From C.30) it follows that || <5° ||o, i,n *S С with С independent of x0. Thus, |P*«(xo)| = IM?)|< || и ||0l„.oC, and Ph is stable in L°°. By duality \\PhU\\Un= sup (Phu,v) and since (Phu,v) = (Phu,Phv) = (u,Phv), the already established result gives CP*M,i>K||u||0,i>nC so that Ph is also stable in L1. By Riesz-Thorin's interpolation lemma it is then stable in any LP space. □ We return now to C.19) and use Lemma 3.4. Let Ad = \_x0 — d,x0+d]nQ; we obtain for d ^ h, \Ръ(Хь-и)(х0)\^1>\х„-и\\50н\+ \ \xh-u\\d°h\ Ad fi\-4d <C||M-Z*||0i00,Ail+(C/d)e-a'/*||«-z*||o.i,o- Combining this with C.16) we have the following analogue of Theorem 2.1. Theorem 3.1 (Exponential decay in spline-L2 projection). Assume that the family of subdivisions of Q is quasi-uniform, C.4), and let Vh be given as in C.2). There exist positive constants с and С such that the following holds. If Ad = [x0 — d, x0 + d~\r\Q and uh denotes the spline-L? projection of a function u, then for ih arbitrary in Vh, |(«-u*)(x0)|^C||H-ZJ|0,..^ + (C/d)e-'**||tt-ZA||o,il0. C.32) Thus, in spline-L2 projection the global influence is weighted down by an additional factor e~cdlh compared to the global influence in Fourier series. The analogue of Theorem 2.2 is well known for splines. The result of de Boor and Fix quoted in C.15) holds in greater generality with respect to norms and rates of convergence less than r, and one may derive the following by use of well-known smoothing techniques, cf. Strang [1972] and Hilbert [1973]. Theorem 3.2. Given a natural number p^r there exists a constant С such that the following holds. There exists XheVh such that 11«-Х»11о.«,.^<СА'||ы^||0>00^га, C.33) ll«-X*llo.i.o<C||u||0.li0. C.34)
Section 3 Introduction 371 We next give a result which "completely" localizes Theorem 3.1 to a subdomain. It assumes the quasi-uniformity condition C.4) only for elements T; in that subdomain; often this criterion is enough to meet practical situations involving systematic mesh refinements towards a known singularity. Although the technique is by no means the best possible in many situations for one-dimensional spline-L2 projections (cf. the references at the end of this section) it is the way in which we shall proceed in multidimensional Galerkin methods for partial differential equations and so we take this opportunity to illustrate the technique in a situation where all the technical work is already done. Assume thus that x0 is given, Ad = [x0 — d,x0 + d}r\Q with d^cyh (cx conveniently large) and assume that the family of subdivisions is quasi-uniform on Ad, i.e., corresponding to C.4), di^meas Ti^Ch for those Tt that intersect Ad. C.35) Assume also that (u~uh,X) = 0 forzePJW C36) Theorem 3.3 (Further localization of exponential decay in spline-L2 projection). Assuming C.35) and C.36) there exist constants c, cx and С such that the following holds for any %heVh: If d~z-cxh, then |(Ii-uft)(x0)|<C||u-xA||o,0o,^ + Cd-1/2e-c<i/A||u-u)lt|0,2,^. C.37) Thus when applying this result the quantity ||м — щ\\ол,ла has to be estimated. This is frequently easy to do to one's satisfaction, by standard techniques. Proof. Let Ph denote the L2 projection into V^(Ad). We have already seen, combining Remark 3.1 and Lemma 3.5, that Ph is stable in L00. Thus, (u - uh){x0) = (u(x0) ~ РнФо)) + (РиФо) ~ РнФо))- C.38) Here, \u(x0)-Phu(xo)\^C\\u\\0,KtAil C.39) while Phu—Phu satisfies C.20) and hence by Lemma 3.3, cf. again Remark 3.1, after combining with the inverse estimate, \Phu(x0)-Phu(xu)\^Ch-1i2e-cd'h\\Phu-Phu\\0t2Md <Cd-v2e-«"hilPhu-Phulk2,A< C-40) (after a change of constants). Next, \\Phu-Phu\\0<2<A<1 <[lFA«-u|!o,2,^+il^«-"llo,2,xd < Н»11о,2,А,+ 11Л."-М11о,2,Л„^1/2||"Но,оо,Л„+ \\PhU~u\\o,2,Ad-
372 L.B. Wahlbin Chapter I Combining this with C.38)-C.40), |(M-M(l)(xo)|^C||u||0.ooM(, + Cd-1/2e-^',||u-«/1||0,2,/4<i. Since PhXk = Xh f°r апУ lhtVh we may add and subtract Xh to arrive at C.37). □ Our treatment above has followed the fundamental paper of Nitsche and Schatz [1972]. In particular, Lemma 3.2 is basic. In the one-dimensional spline situation one may proceed via a careful analysis of the inverse of a banded matrix, cf. Demko [1977], to obtain similar results. The decay estimates given above are frequently implicit in investigations concerning the Lp stability of the L2 projection, also under relaxed mesh conditions as compared to quasi-uniformity. As examples we mention Descloux [1972], Douglas, Dupont and Wahlbin [1975a], and Demko's paper. We shall give Descloux's argument, as adapted by Crouzeix and Тномёе [1987а], in Section 8. For a survey of Lp stability of the L2 projection under relaxed mesh conditions, see de Boor [1979] where references to original contributions in this regard are given. 4. Prologue continued: Local behavior in one-dimensional elliptic projections. Negative norms Our goal in this section is not to prove the best possible results for one-dimensional elliptic spline projections but rather to introduce, in a simple setting, techniques and results that generalize to problems in more dimensions. Nevertheless, the results are often sharp also in one dimension, cf. Section 17. (To prove "better", i.e., more localized, results in one dimension, the reader may contemplate integrating the results for the L2 projection in Section 3, or, in the case of continuous splines, ^ = 0, skip forward to Theorems 5.1 and 5.3 to "deduce" that the elliptic projection lies within 0(ftr+1) of a completely local interpolant on each mesh interval where the solution is smooth. Care is advised in carrying through such a program, to account for outside singularities, in particular for roughness emanating from unsmooth coefficients, cf. again Section 17.) We shall first give an analogue of Theorem 3.3 (Theorem 4.1 below) and contrast the two. We then proceed to weaken the global influence, in Theorem 4.2. The reader not familiar with negative norms is urged to contemplate Theorem 4.2, Lemma 4.4 and their interplay. Let x0e£2 = [—1,1] be a point of interest and let Ad = [x0— d,x0 + d]. For simplicity assume Ad s Int Q. Let Vh be a spline space as in C.1), C.2) with ц ^ 0 and let for A<qQ, Vh(A) = {ve Vh: dist(supp v, A)>0}, D.1) corresponding to C.8) (for notational simplicity we do not consider the case when A, or Ad, may abut on the boundary of Q). Let now u(x) be a function and uh e Vh be such that (u'-u'h,x') = 0 for all х*К(АЛ D.2)
Section 4 Introduction 373 We first observe that D.2) makes sense also if и is merely continuous. For, integrating by parts over each element Ti intersecting Ad, (m'-M;,x')= I - ("- Tic\Ad¥4 L J uh)x"dx + (u-uh)x'\sTl D.3) Thus we may, perhaps, expect to obtain a result valid if и is merely continuous on Ad. Let now Ph denote a local elliptic projection into Vh, i.e., PhueVh with (u'-(P„u)',x') = 0 forallxeVV Let further с«е<^(Л,/2) with co(x) = l on AdjA.. Then (u-uh)(x0)={cou-Ph(ou)(x0)+{Pha>u-uh)(x0). D.4) D.5) D.6) We first argue that Ph is stable in L°° for functions with support in Ad/2, d^ch. Assume for this that the family of triangulations is quasi-uniform on Ad, i.e., that C.35) holds. Lemma 4.1. There exists a constant С such that for v with s\nppv^Ad/2, II Phv Wo,oo,Adi2 ^C\\v Цо.оо.Ллг- Proof. We have llA.f|lo,oo„4d/2= SUp (P„V,w). ws^oi.Ada) l|w|ll.^d/2=l For each such w, let W be the solution of -W" = w mAd, W=0 ondAd. Further, set Wh = PhW. Then (Phv,w)=((Phvy,W')=((Phv)',W'k) = (v',W'h) = (v',(Wh-W)') + (v,w) D.7) D.8) D.9) *I v(Wh- W)" dx + v(Wk- W)' \dTi } + (v, w). D.10) Ti A simple trace inequality establishes that (since the mesh is quasi-uniform on Ad), for xeT,., \mHc\\r\\0,UTi+ch-i\\f\\0>UT, D.ii) (Use a cutoff function that isolates Г; and then the Fundamental Theorem of
374 L.B. Wahlbin Chapter I Calculus.) Thus from D.10), using D.11) m\hf=(Wh-W)', l(A», w)| <CMo,*.Art{MWh-WrUil,A„ + h-4(Wh-W)'\\o.1,At/2 +1} D-12) where the subscript h in the norm for the second derivatives of Wh — W denotes that we are operating piece wise on each element intersecting Aij2, Letting TIhW be, say, the piece wise linear interpolant of W, we have since then {nhW)" = 0 piecewise, Wh~Wy\;UAd/^\\W^UAd+\l(Wk~nhW)l\kiUAd/2. D.13) Note that \\W"\\0,UAd^L D.14) By inverse properties, assuming as we may here that AAI1 is a mesh domain for notational simplicity, \\(Wh-nhW)"\\h;UAd/2 ^ch-4iwh-nhw)'\\0A<Am ^Ch-4{Wh-W)'\\^UAd^ + Ch--'\\{W-nhW)'\\0A<Adn, D.15) and since, as is well known, cf. Theorem 3.2, \W-nhwy\\0A ^Ch\\W"\\o,i,Ad/2^Ch, we have from D.13)—D.15), \\(Wh-Wy\\h;UAa/2*:C + Ch-l\\(Wh~Wy\\0,UAdl2. D.16) Thus, combining with D.12), \(Phv,w)\<C\\v\\0taiAm{l+h-4(Wh-W\\0iltAJ. D.17) Under the circumstances, since (W'h — W, 1)=0, W'h is the L2 projection of W into a spline space with ц and r lowered by one. Thus by Lemma 3.5 (easily localized), for i in that lower spline space, \\W'h-W'\\0^Ad/2^Cmm \\W'-x\\o,uAdll D-18) x and using again a variation of Theorem 3.2 for approximation, min W-%\\oA,Ad^Ch\\W"\\0A,Ad^Ch. D.19) i By D.17)-D.19) then, l(At>,w)i<ciMi0i00iAV2, which in light of D.8) proves D.7). □
Section 4 Introduction 375 Returning now to D.6) we have, using Lemma 4.1 with v = a>u, \(u-uh)(xo)\^C\\u\\0^Ad/2 + C\(Phcau-uh)(x0)\. D.20) We next come to a crucial part of our argument. Let Wh:=Ph0)U-Uk. D.21) Then wheVh and satisfies K,x') = 0 forallZe^04-/4), D.22) since o)= 1 on Ad/4.. Following Nitsche and Schatz [1974] we may call wh "discrete harmonic" in AdjA. Letting now & with d>(x0)= 1 be a suitable cutoff function isolating Adji, hence HcS'lloo^Crf-1, we have by Cauchy-Schwarz' inequality, KCx0)| = {&wh)'^Cd-^2\\whU^Adl^Cd^\\Wh\\o,2,A^ D.23) The following is a fundamental result from Nitsche and Schatz [1974]. Again "superapproximation", cf. Lemma 3.2, will figure in its proof but we shall merely note so at the relevant place. Lemma 4.2. Assuming that the family of meshes is quasi-uniform on Ad, that d > ch, and that wheVh satisfies D.22), we have llwillo.a.^.^ci-'HwJIo.a.^. D.24) Proof. For simplicity in writing we set |d = E, id=25. Let here й)е<^о>(Лзв/2) be such that <y = l on As, D.25) «о^Ио.со^г-', 1=0,...,r. D.26) We have II < || l,2,A6 < || (OW'„ || Ъа,Лм13 = (W'h, CO2 W'h) ={Wh,{<o2wh)')-2{(oWh,(o'wh). D.27) Here, @)Wi, CO'W„) ^ С5 ~ 1 || 0)Wi || 0,2,^3,5,2 II Wh II 0,2,Лз,/2 ^\\coK\\l2,A^ + CS-2\\wh\\l2,A}6l2. D.28) Further, by D.22), (W'h,(co2whY) = {w'h,(co2wh-xy) iov any XeVh(A2d). D.29) An easy modification (left to the reader) of the superapproximation result of
376 L.B. Wahlbin Chapter I Lemma 3.2 gives that there is a % in th{AiSI2) (slightly enlarged) such that ll(«2wh-x)'||o,2.^,/2<C(^)(||w;||0,2,^/2 + r4|wfc||o,2.^/2). D-30) Thus, from D.29), IK, (u>2w„)')| ^ C(h/3)(\\ w'„ ||g,2,^/2+^-1 II w'h Цо.2^3^ || wh \\0,2,Aim <C(h/3)\\ Wh\\la,Aiil2 + C3-21| Wk \\i2,Aw. D.31) Combining D.27) with D.28) and D.31), II wi \\2o,2,A6 ^ C(h/3)\\ w'„ || 20a,Aim + C3 - 21| щ || la,A3il2 and repeating the argument once more with §<5 and 23 taking the role of 3 and f 3, respectively, II Wfc \Ц2,Аб < C(fc2/<52)|| w'h \\l2,A26 + C3~21| wh \\l2iAu. D.32) Assuming as we may (modulo shrinking 23 somewhat) that A2d is a mesh domain we have by the inverse property Lemma 3.1 applied locally, h2\\w'h\\l2>A2i^C\\wh\\l2M„. Inserting this into D.32) we obtain the desired result D.24). □ We now use Lemma 4.2 in equation D.23). Thus, |wfc(x0)|^Cd-1/2||wft||o,2,^. D.33) Next use this with wh = Pha>u — uh in D.20) so that |«-M»(x0)I^C||M||o,<0.^I + Cd-1/2||J,»fl)M-M*llo.2.^4- D-34) To further elaborate the last term of D.34), since cos 1 on Ал/Л, \\PhCOU-Uh\\o,2,AdM^\lPh(OU-0}u\\0i2:Adli + \\u-Uh\\0,2,Aal4- D-35) By Cauchy-Schwarz' inequality and Lemma 3.1, ||Phoiu-ani||o.2,^/4<Cd1121|Phcou-coul\0,wA,IA ^Cd1'21| и\\0,w,Adl2. D.36) We next combine D.34)-D.36), add and subtract a general xeK, and have then proven the following result (changing the constant slightly). Recall that Ad = \_x0-d,x0+d']. Theorem 4.1. Assume that the family of triangulations is quasi-uniform on Ad, cf C.35). There exist constants с and С such that the following holds. Let d^ch and uhe Vh be such that ((u — uh)', ty') = 0for all фе V^Aj). Then for any xe Vh, \(u-uh)(x0)\^C\\u-X\\o,oo,Ad + Cd-ll2\\u-uh\\oa,Ad. D.37) Comparing Theorem 4.1 with Theorem 3.3 one immediately notices the absence of the factor e~cdlh in C.37). The form \\u — uh\\2iAi contains the global influences
Section 4 Introduction 377 affecting the error at x0; in light of the quasi-interpolant of de Boor and Fix [1973], cf. C.15), the first term on the right of D.37) is a very adequate representation of local influences to the error at x0. It is then clearly desirable to weaken the global term. This can be done by replacing the L2 norm by a still weaker norm, viz., a negative norm in H~s(Ad), the dual of Й*{АЛ) over the pivot space L2. Any s may be taken. We follow here the development of Nitsche and Schatz [1974]. Thus define IMI~s>2,^= sup (v,w). D.38) we«SoMd) l|w|U,2,^|<1=l If a function is appreciably smaller in a negative norm than in, say, the L2 norm, this is frequently due to oscillations. As an example the reader may contemplate the functions vn(x) = sin(nx), n= 1,2,... on [0, ri]. Here, IKII-^ClT'. We need to extend the inverse estimates of Lemma 3.1 somewhat to negative norms. Lemma 4.3. There exists a constant С such that for A any mesh domain and vs Vh, \\v\\o,2,A^Ch-s\\v\\-s,2,A. D.39) Proof. Let A = (J Tb T; mesh intervals. Then i A standard scaling argument establishes that IMIo.T.^Cr^lMI-^j,. The proof is then concluded by noting that for the negative norms we are considering, W\2-S,2,T^\M2-s,2,A. П i Remark 4.1. We do not claim the extended estimate IMI-,,2„4<C7rs+t|M|_s,2w4, O^t^s, with С independent of A. Actually this is true in one dimension but we know of no proof in many dimensions except by the theory of interpolation spaces in which case С will depend on A. To weaken the global term in D.37) proceed as follows: Break into the proof of Theorem 4.1 at the point D.34) and consider the term \\PhOM-uh\\0i2iAdlA.
378 L.B. Wahlbin Chapter I Now wh = Phtou — uh is a function in Vh which is discrete harmonic, i.e., satisfies (п'к,ф')=0{отфеУ„(А2а) D.40) (we change domain notation slightly at this point for simplicity). Lemma 4.4. For wheVh satisfying D.40), !|w*||o.2.x^Cd-s||wA||_s,2,^. D.41) Proof. Consider first the case of d of unit size. Set A0 = Ad, Al = A3dl2, A2 = A2d. Let coe^oiAzdii) with со = 1 on Ad. Then, for any nonnegative integer /, \\Wh\\-i.2.A0^\\oiv/h\\-i,2.Al= SUP (<»Wk,v). D.42) ve4t$(Ai) II f II 1.2=1 For any such v, let V solve -V" = ° on A,, D.43) F = 0 on ЪАу. Then \\V\\! + 2,2,a^M,,2,Ai. D-44) Now (coWh,v) = ((cowhY, V') = (to'wh, V') + (cow'h, V) = (co'Wh,V') + (w'h,(coV)'-co'V) = ((o'w„, V')-{co'W'h, V) + (w'h,(toV)') = {wh,2w'V' + a>HV) + (w'i.,(a>V)'-ili'h) for any \j/h e V(A0). Thus, by approximation theory, {coWh,vHC\\Wh\\^^,Al\\V\\l+2,Ai + \\wh\\i,A2Ch>\\V\\l+2,Al where у = min(/+1, r— 1), or, in light of D.44) and D.42), l|w/,||-Wo<C||Wfc||-/_ljXl + CF||wft||1^2. Now recalling Lemma 4.2 we obtain (changing notation for the domains), II w* II _ M„ < С || wh || _, _, ,A2 + Ch> || щ || 0,Al ■ D.45) For /=0, using Lemma 4.3, we have since у = 1 then (assuming A2 a mesh domain for notational simplicity), НЩ|о,Ло<С|К11-1,л2- The argument now proceeds by induction. In the case of general d, transform the situation to unit size d. The mesh parameter
Section 4 Introduction 379 for the spline space will then be replaced by h/d, and the factor d s in D.41) results from this transformation. □ We now return to D.34) and apply the above lemma, D.41), to obtain (again changing notation) \(u-uh)(x0)\^C\\u\\0ja>jAdl2 + Cd~ll2-s\\Ph(ou-uh\\-s,Adl4 ^С||и|[0,хЛт + Cd"ll2-S]|Phcou-cou|| _s,Ad/4 + Cd-ll2-s\\u-uh\\-SrAdl4. D.46) By Friedrich's inequality, \\v\\0<Ad/4^Cds\\v\\s,Adl4 for ve^^A^ and hence by duality, d~s~1/2\\Ph(ou-a)u\\-.SiAdlA ^Cd-lt2\\Ph(ou-wu\\0tAdl4 ^C\\Pha)u-a>u\\0^Ad/4^C\\u\\0^Adl2. D.47) We thus obtain the following further elaboration of Theorem 4.1. Theorem 4.2. Under the general hypotheses of Theorem 4.1, for any s^O, for any |(и-«»)(х0)|<С||«-х11о,..^ + С«/-1/2-*||и-и»||_м, D.48) (where С now depends also on s). Theorem 4.2 holds if the elliptic projection is taken with respect to an equation with variable coefficients, i.e., (a(x)(u-uh)', ф') + (Ь(х)(и-щ)', ф) + {с(х)(и-и„),ф) = 0 for ikeV„(Ad), provided the coefficients are sufficiently smooth; we shall not give the arguments but refer to Thomee and Wahlbin [1983, Lemma 3.4] for the counterpart of Lemma 4.1, i.e., Ьх stability of the elliptic projection. (Although given there in a global setting it is not hard to localize the arguments.) Theorem 4.2 is a one-dimensional version of a multidimensional result of Schatz and Wahlbin [1977] which will be given later in this article. Some essential points of proof will recur, e.g. the L00 stability of a localized projection (very simple in the one-dimension situation) and the use of the energy-norm-based techniques of Nitsche and Schatz [1974]. In the one-dimensional spline-L2 projection there was room for improvement of the results with respect to constraints on the triangulations, mainly to reduce the quasi-uniformity constraint. Such is also the situation for one-dimensional elliptic projections. We refer to Natterer [1977] and GOsman [1981] for results which are very much related to Theorem 4.1. Finally we remark on the use of Theorem 4.2. Pose the following question: What
380 L.B. Wahlbin Chapter I does it take in order to guarantee optimal rate of convergence, 0(hr), at a point x0l For the first term on the right of D.48) it takes e.g. that ue^At). We then ask for \\u-uh\\-SiAd^Chr, somes. D.49) (We disregard the factor d~s~112 here.) A simple answer is as follows; we assume for simplicity that the meshes are globally quasi-uniform although this is by no means necessary. Lemma 4.5. For l<s<r, ||«-«fc||-(r-2,.0<Cfc,-1min||ii-Z||1.2.0<Cfc'+f-2||«||I,2j,. D.50) Proof. We have ||«-И*||-(г-2),П= S"P (U-Uh,v). re«J(G) Wr-2,D=1 For each such v, let -w" = v in Q, w(-l) = w(l) = 0. Then with wh the projection of w, (u-uh,v) = ((u-uh)',w') = ((u-uhy,(w-why) = (("-x)',(w-wJ')<||M-zl!i,2,«C^-1||w||r,2.fl which leads to the desired result. □ Hence D.49) and local optimal order follows if ueH2(Q). The result of this lemma can often be, and needs to be, improved. E.g., let u = G1, the Green's function centered at Xj for our two-point boundary value problem and consider x0 ф x^. Then G1 is not in H2(Q) so that our theory does not predict optimal order at x0. If Xj is a mesh point and ^=0 this is disturbing since, in our simple case, then G1 — Gl =0. A slightly different result is as follows. Lemma 4.6. ll"-"JI-(r-i),fi<C/ir-1min||u-zll1,lif3. D.51) Proof. As in the proof of the previous lemma we have with || v ||r_ 1>2>fi = 1 now, (u-uh,v) = ((u-xy,(w-why)^\\u~x\\uuii\\w-wh\\Ux>Si. We know (for quasi-uniform meshes) that the elliptic projection is stable in L°°; Lemma 4.1 is simple to adapt to the case of the whole of Q. Using then inverse estimates and approximation theory it is easy to see that the projection is stable in
Section 5 Introduction 381 WUco. Hence l|w-wJlie)>0<Cmin llw-x||li00iO<C/i,~1|N|rie)iO x <C^-1||w||r+1>1>fi = C/i-1. The desired result D.51) is thus proven. Q Going back to the Green's function G1 above, it is easily derived that ram\\Gl-x\\ulM*kCh X^h and thus Gl approximates to optimal order hr at x0 фхх. (Since we cannot expect better for variable smooth coefficients and we have a proof that clearly generalizes to that case, we stop.) Negative norm estimates play important roles in finite element theory. As we have seen above they help explain how local roughness spreads into regions of smooth behavior. They are essential for results about convergence in finite element approximations for eigenvalues, see Bramble and Osborn [1973]. In the next section they will play a prominent role in connection with averaging to obtain higher rate of convergence. (In our multidimensional investigations later they will also be an important tool, but, alas, the counterparts of Lemmas 4.4 and 4.5 are seldom perfectly valid in practice, either because the domain or coefficients do not admit perfect duality estimates or, in smooth situations, because of slight "imperfections" in approximating curved boundaries.) 5. Prologue concluded: Superconvergence and local averaging in one-dimensional elliptic projections In a smooth problem the best general pointwise approximation of function values is 0(hr), with r as in C.1), C.2). In this section we consider two problems: (i) Can we find a priori identifiable points with higher order of accuracy, and (ii) can we obtain higher order of accuracy via (simple) postprocessing? For problem (i), that of super- convergent points, we first consider the case of continuous splines in two-point boundary value problems (^ = 0 in C.1), C.2)). Here identifiable superconvergence points for function values are found; the mesh points themselves are 0(Jt2r~2) and some r—2 specified interior points on each mesh interval are of order 0{hr+1). This result also holds for nonuniform meshes. A second result about superconvergent points concerns Hermite cubics (/г = 1, r = 4) for two-point boundary value problems. Here there are two identifiable superconvergent points on each mesh interval of 0(/i5) accuracy, but only on strictly interior subintervals and only for completely uniform meshes. As we look on mesh intervals Tt approaching the boundary, or we consider nonuniform meshes, we can still assert that §Tj error = 0 (in a very simple situation, see E.18)) so that we know that the error vanishes at least at one point; however, no such point has ever been
382 L.B. Wahlbin Chapter I identified. This may illustrate that the problem of finding superconvergent points is challenging. In each case mentioned above we first find superconvergent points for the derivatives and then the result is integrated. Since the derivative of the elliptic projection is the L2 projection into a lower-order spline space we actually give results for L2 projections too. (These latter results are classical for the case \x — 0, that is, for L1 projections into full polynomial spaces.) For problem (ii) above, that of a posteriori processing, we describe the salient points about the "X-operator" of Bramble and Schatz [1977]. This easily implemented method gives 0(h2r~2) accuracy at points where the solution is smooth and the mesh is completely uniform in a neighborhood (provided the solution is somewhat smooth globally). We shall later return to the iC-operator in several dimensions; it is the one practical method of some generality. Let now 12 = [ — 1,1] be a basic domain, let Vh denote a spline space with vh(—1) = 0 for all vheVh and consider the problem of finding и s.t. ~(a(x)u')'=f(x) infl, ы(-1) = н'A) = 0. E.1) We assume a(x) ^ a0 > 0 and that a(x) and / are as smooth as required. (The form of E.1) considered is for simplicity only.) The projection (Galerkin approximation) uhevh to E.1) is given by Ж"-"/,,*) s (Ф)(и-"/,)', Z') = 0 ioxxeK E.2) Consider first the case of continuous splines, д = 0 in C.1), C.2). Since E.1) has a unique solution for every/(x), it has a Green's function G°(x) = G(x,x0) centered at x0 such that A(v,G°) = v(x0) E.3) for each veH1 with t>(—1) = 0. In particular, (u-uh)(x0)~A(u-uh,G°). E.4) Letting G°heVh be the projection of G°, by E.2) and E.4), {u-uh){x0) = A{u-uh,G°~G°h). E.5) Thus, \{u~uh)(x0)\^C\\u-uh\\ul^\\G0~G0h\\U2<Q, E.6) Since и is smooth we have, as is well known, \\и-Ч\\иг,0^Ск'-К E.7) Further, as is a nice observation from Douglas and Dupont [1974], since G° has a break in its first derivative at x0 but is smooth on both sides if x0 is a mesh point, the continuous splines can mimic that break so that l|G°-G?||i,2.D<C inf ||G°-z||1,2>e<CA'-1, E.8)
Section 5 Introduction 383 as is easily seen. Combining this with E.6) and E.7), we have the following result of Douglas and Dupont. Theorem 5.1 (Mesh point superconvergence in "if0 splines). Assume the problem E.1) is smooth and that ц=0 in C.1), C.2). Then for xt a mesh point, |(и-м»)(х,)|<СЙ2г-2. E.9) We continue with the case of continuous splines and search for surperconvergent point(s) inside each element 7V (The simple argument appears to be first published in Chen [1979], cf. also Barker [1982, 1984].) Let e(x) = (u — uh)(x) denote the error. Let p(x) denote any function in Pr_ 2(T;), the polynomials of degree r—2 on Г( = [хг,х1+1]. Set with p(x) = 0 outside Th p jp(t)dt, x*xt+1, EШ) -1 ^"\xi+lh X>Xi + 1. Then P(x)e Vh. By E.2), i a(x)e'{x)P'(x) = 0, -l or ae'p=0 foranypePr_2G;.). E.11) We next Taylor expand ae' on Tt around the midpoint xi+lj2. Writing the Taylor expansion in terms of Legendre polynomials L;(x) we have ae' = c0L0{x) + c1L1(x)+--- + cr~1Lr-1{x) + O(hr), and, by E.11), c0 = c1 = --- = cr_2=0 so that ae'(x) = cr_1Lr_1(x) + 0(hr). E.12) Since a(x) ^ a0 > 0 we have the following superconvergence result for derivatives, one order better than the global result. Theorem 5.2 (Interior superconvergence for derivatives in ^° splines). Assume that the problem E.1) is smooth and that \i = 0 in C.1), C.2). Then for x a zero of the (r — l)th Legendre polynomial on T(, (M-u„)'(Jc) = 0(/ir). E.13)
384 L.B. Wahlbin Chapter I Finally, integrating from xt, ae = {ae)' - a!e — ae(x) — ae(xt) — a e or by E.9) and E.12), assuming \e{x)\^Chr E.14)' (as is known in many cases), and that r 3=3 so that 2r —2^r+l, ae(x) = Q(hr + 1) + cr_1 Lr-i(t)to. Using Legendre's differential equation one sees that the inside zeros of the integral on the right coincides with the inside zeros of Ь'г-г. Thus we have the following superconvergence result for the function values, again one order better than globally. Theorem 5.3 (Interior superconvergence in 4>° splines). Assume that the problem E.1) is smooth, and that ц = 0 in C.1), C.2). Further assume E.14) and that r ^ 3. Then for x a zero of the derivative of the (r-l)th Legendre polynomial on Th (u-uh)(x) = 0(hr+1). E.15) Hence, in the case of merely continuous splines, we have identifiable super- convergence points on all mesh intervals, also for nonuniform meshes. Without attempting to be systematic we next consider the example of Hermite cubics as an illustration. Further, the arguments will only be sketched. Assume for simplicity that a = \ in E.1). With e{x)=(u — uh)(x), we shall first investigate e'(x). Then u'h is the L2 projection P°u' of u' into <^° quadratics, essentially. Set, for notational simplicity, v=u', vh =P°v. At a point x0, let further vh be a function in the ^° quadratics that interpolates v at x0. Assume also a uniform mesh. Then with 5° an approximate Dirac delta for quadratics centered at x0, and with <5? its L2 projection into the continuous quadratics, {v-vh)(x0) = (vh-vh)(xo) = (vh-vh,6%) = (Vh-v,&h)- E.16) Lemma 3.4 gives us very fine control over the behavior of <5°. In particular, if x0 is interior to Q the exponential decay suggests that, provided the mesh is completely uniform, 5% is (almost) symmetric if x0 is either a mesh point or halfway between two mesh points. If then vh —1> = const • (x — x0K 4—, the third-order term will be cancelled out in E.16) so that (v-vh)(x0) = O(h% x0 interior mesh point or midpoint, uniform mesh. E.17) This is a superconvergence result for derivatives in Hermite cubic approximations
Section 5 Introduction 385 to two-point boundary value problems. The approximation is one order better than globally at the above-mentioned points. To translate E.17) into a superconvergence result for function values, note first that e= ex = 0. E.18) T, Tt For, if — w" = l|r. or x|Ti, 0 outside Th vv(-l) = w'(l) = 0, then we Vh in the Hermite cubic case. Thus, i e=~\ ew"=(e',w') = 0. T, -1 (We have assumed a== 1 for simplicity.) This shows E.18). Writing now for Tj = [x—\h, хЛ-\\х\ with x the midpoint, e(x)=A+B(x-x) + C(x~xJ+D(x-xK+E(x-xL + 0(h5) E.19) we know that В = 0(h4) since e'(x) = 0(fc4). Further, E.18) leads to, when integrating E.19) against x-x, that D = 0(h2). Thus, e(x) = A + C{x - xJ + E(x - JcL + 0(hs). E.20) Integrating this against 1, E.18) gives M + fC(i>iK+f£(i/iM = 0(/i6). E.21) Further e'(x)=0(h*) at endpoints gives 2C-i/i + 4£(i/03 = O(h4), or, C^-$Eh2+0(h3). E.22) Reporting E.22) in E.21), A-%Eh2\h2+lEijh* = 0(h5) so that
386 L.B. Wahtbin Chapter I Thus, inserting this and E.22) in E.20), e(x) -= Eh4-?— - \Eh2{x ~- xJ + E(x - xL + Q(h5) 15* 16 = £(йоЛ4-Мх-хJ+(х-хL) + 0(й5). We see from this that the superconvergent points for function values are given at the inside roots of the polynomial multiplying E, i.e., at (x-x)/7i = ±(i- 1/V30I/2 = ±0.2596648112.... E.23) We summarize the above investigation of superconvergence in Hermite cubics. The result extends to variable coefficients but, as numerical experimentation shows, the restriction to interior intervals and completely uniform meshes is essential. (This is in spite of the fact that, by E.18), the error e(x) changes sign on each Tt also for nonuniform meshes and Tt near ends. The point(s) where this happens in general have never been identified.) Thus we are dealing with a fairly fickle phenomenon here. Theorem 5.4 (Superconvergence in Hermite cubics). Assume that the problem E.1) is smooth and that ц = 1, r=4 in C.1), C.2). Further assume that the mesh is uniform, i.e., measG])=h, all i. Let AuQbea fixed subinterval of Q. Then for any Tt inside A, (и-мА)'(х) = 0(й4), x = x1-,x,+1/2,xj+i, E.24) and (m-u„)(x) = 0(A5), x = xi+1/2±/1(i-l/v/30I/2. E.25) The argument given extends to other ^l splines, since essentially only the symmetry (almost) of <5£ and E.18) were used. In the remainder of this section we describe, in a simple case, the so-called K-operator of Bramble and Schatz [1977]. This method extends to several dimensional problems. It applies to a very general class of locally uniform meshes and sometimes furnishes a pointwise accuracy of 0(h2r~2) by use of a local averaging operator which is a priori computed and depends only on the finite elements being used, not on h or on the variable coefficients of the second-order elliptic problem considered. A point to note is that sometimes somewhere (in a smooth problem) an order of accuracy 0(h2r~2) is lurking, viz., if one measures the error in a negative norm Я~(г~2), see Lemma 4.5. Higher-order accuracy in a negative norm suggests oscillatory behavior of the error. Bramble and Schatz found a local, simple and systematic way of averaging out those oscillations to obtain 0(h2'~2) convergence in smooth problems. Consider, for simplicity in one part of the argument, the problem -u"=f in[-l, 1] = 0, m(-1) = u'A) = 0. E.26) Assume that the mesh is uniform (throughout the domain, for simplicity at present),
Section 5 Introduction 387 i.e., that meas(Ti)==/t. E.27) For each of the finite element spaces Vh considered, we have for interior points, xeAuQ, for h sufficiently small, a local basis: There exist functions <Pi,..., <p„ independent of h which are piecewise polynomials with compact support such that any %{x)e Vh is of the form j=l heZ E.28) A particular case is that of smoothest splines, fi = r — 2, in which we have the "classical" В splines. Since the construction of the X-operator is based on those, we describe them in more detail: For t real, define m= and for / an integer, \l/i(t)=ift*ij/*---*il/, convolution I times. E.29) E.30) Then ij/i is the В spline basis function, a piecewise polynomial of degree I. In this case, /i=r—2 = 1— 1, any %eVh is of the form X(x)= ^apil/iix/h-P). fisZ E.31) Consider now a kernel Kh defined in terms of the above В spline basis function as 0—2) Kh{x)= X h-lWT-2(xlh-fi. P=-(r-2) E.32) The coefficients kfi are solutions of a certain system of linear equations and tables are given in Bramble and Schatz [1977]. The choice of kfi can be summed up in Kh being an approximate identity of order 2r—2. Here we shall merely point out certain properties of Kh at the points of our development where they are used. The idea is then to replace uh, the Galerkin approximation, by Kh*uh(x)-=h = b-i K„(x-y)u(y)dy. If, cf. E.28), щ = T,j IiXaiq>j(x/h — a), if x = hy is a mesh point in А С Q (so that resulting integrals never abut on the boundary on Q), then {Kh*uh){hy)= x Eai Y,h Wr-2(y-P-x-y)<Pj(y)&y
388 L.B. Wahlbin Chapter I where di=l,kfLr-2E-p-y)<pj(y)dy. Thus the values of Kh*uh at mesh points are given via a finite linear combination of the coefficients a{ for uh in the basis (pj. The finitely many nonzero d{ may be computed a priori; they are independent of h, of the particular mesh point considered and depend only on the particular finite element function space used. We proceed to give the salient points about why u(x0)-K„*uh(x0) = O(h2r-2), x0eA, E.33) in smooth situations. Write и-Кк*ик=(и-К„*и) + Кк*(и-и„). E.34) The choice of coefficients kfi in Kh has been made so that it is an approximate identity (with a highly localized kernel). Then, for x0elnt A0<qA, \(u-Kh*u)(x0)\^Ch2'-2\\u\}2r-Mo. E.35) It remains to consider Kh*(u — uh) in E.34). For a general v, one has for Aq^A^, see Bramble and Schatz [1977, Lemma 4.2], IMIo.*„<C £ l|£J't>ll-,r-2),2M,- E.36) 0«j<r-2 A first crucial fact is that &(Kh*v)(x) = R\»*biv(x) E.37) where d{ is the y'th centered divided difference. Here RV>(x)=h~l *£ k»ii,,-2-j(x/h-p). />=-<r-2) as is easily seen. It follows that ||Я^*^-<г-2>,2.л1<С||8И-<г-2),2,,42 for A,€A2. Based on the above ideas and after some minor further technicalities involving Sobolev's inequality, Bramble and Schatz show in their Lemma 6.1 that r-l r-2 |(X»*i;)(Xo)|^C £ ||6(t;||-(r_2),2Mo + ^-2 £ Ш„,Ао. E.38) j=0 J=0 It remains to apply this with v = e = и — uk. A second crucial aspect can now be appreciated. Namely, that on uniform meshes, difference quotients of e satisfies "the same" approximate equation as e does. In the present simple situation, i.e., (e',X') = 0 (orXePh(A0) one has
Section 6 Introduction 389 (Не',х')=(е'Л(Х'))=(е',Ш')=0 since (if v40€ff), dlxeVh, cf. E.28). (For variable coefficients there are some further technicalities.) Returning to E.38) and taking v=e, we have then for the two terms on the right, for AquAx, l|8ie II -(,- 2),2.л0 < Ch2(\\ и \\r+l2,Ai + || и ||r.2i0) E.39) (after localization of Lemma 4.4) and, using the local result of Theorem 4.2, 1|8|е||о,оо^^С||9|и~х11оо.л1/2 + С||^е||_(г_2),2,Л1 ^Ch'(\\u\\r+j,K,Al + \\ula,n). E.40) Inserting E.39) and E.40) into E.38), and combining with E.35) and then E.34), we have the following theorem. Theorem 5.5. Assume that x0eInt/lcQ, and that the mesh is uniform on A. There exists a constant С such that with r as in C.1), C.2), \(u-Kh*uJ(x0)\^Ch2r-2qu\\2r-i.2.A + Mr.2.a)- E-41) We may summarize the most important points in the above derivation via the following mnemonic notation. e(x0) = Local approximability +1| e || _(r_2)>2>й (Theorem 4.2), D\Kh*e)=^*dle, дЦик) = FJ,u)h for locally translation-invariant finite element spaces, |[e || ~(r-2),2,n = 0(/г2г_2) in smooth problems (Lemma 4.5). Note that, as a consequence of E.40), difference quotients are better approximations to derivatives than the differentiated approximate solution itself. We shall return to this point in several dimensions in Section 30. 6. Overview: Capsule summaries of each section In this section we give a capsule summary of each section of this article, except the Foreword and this section itself. Chapter L Introduction 2. Perspective: Local behavior in Fourier series Local behavior in Fourier series is determined by local approximability and global influences. The global influences may be measured by best global L1 approximation and are, in comparison to the finite element situation, quite severe.
390 LB. Wahlbin Chapter I 3. Prologue: Local behavior in one-dimensional spline-L2 projections. Exponential decay Local behavior in spline-L2 projections is determined by local approximability and exponentially decaying outside influences. 4. Prologue continued: Local behavior in one-dimensional elliptic projections. Negative norms Local behavior in approximating second-order two-point boundary value problems is determined by local approximability and global influences. The global influences may be measured by the error in any negative norm, a "weak" measure. 5. Prologue concluded: Superconvergence and local averaging in one-dimensional elliptic projections Superconvergence without postprocessing is a fickle phenomenon. Only the case of continuous splines is reasonably understood. With Hermite cubics, as an example, superconvergence occurs on uniform meshes but only away from the boundary. On uniform meshes the K-operator averages out oscillations and recovers the best error, hidden in negative norms. Chapter II. Local behavior in finite element L2 projections 7. Locally quasi-uniform meshes As in the one-dimensional situation, local behavior is determined by local approximability and exponentially decaying outside influences. 8. Less regular meshes in two-dimensional Lagrangian elements A more precise accounting of the outside influences is given. Chapter III. Local estimates in the finite element method for second-order elliptic problems, with applications 9. Local estimates in energy and L2 norms Basic estimates for the local error in the above-mentioned norms are given. They depend on local approximability and global influences; the latter are measured in negative norms. 10. Local pointwise error estimates The estimates again depend on local approximability and global influences, the latter measured in negative norms. The theory is not mature when estimating the pointwise error close to the boundary. The rest of this chapter consists of applications of the basic estimates of this section. 11. Problems with a rough right-hand side The theory of Section 10 is applied away from rough spots. Careful duality estimates for the global influences show that, in smooth domains, the error in the finite element
Section 6 Introduction 391 solution conforms to what one would expect from elementary approximation theory, 12. A model problem on a plane polygonal domain with a reentrant corner: Unrefined meshes The basic theory of Section 10 is applied. On interior domains, suboptimal order estimates result, due to a failure of the duality argument in this case. This is in contrast to the results of Section 11. 13. Sharpness of the results of Section 12: Pollution The results of Section 12 are shown to be sharp. 14. A priori and adaptive mesh refinements The basic theory of Section 10 is applied to the case of a model problem on a polygonal domain with a reentrant corner. It is shown that mesh refinements guided by elementary approximation theory give optimal results. Thereafter the results of Eriksson and Johnson [1988] for an adaptive algorithm are described. 15. Stress intensity factors, and better a posteriori approximations via the Trigonometric Fitting Method In the case of a model problem on a polygonal domain, the basic theory of Section 10 gives guidelines as how to recover the parameters in the singular expansion around a corner. 16. Local behavior when singular functions are added Still in the model problem on a polygonal domain, it is shown how the local behavior when singular functions are added may be reduced to the "same" question without singular functions, already solved in Section 10. 17. Local behavior in problems with rough coefficients: Pollution One-dimensional examples are given and, following the inverse technique of Section 13, severe pollution effects are exhibited. 18. An example of global pollution with numerical integration Systematic integration errors lead to severe global pollution. Chapter IV. A singularly perturbed elliptic to elliptic model problem 19. The model problem Salient features of the singularities entering are described. 20. The finite element method and its local behavior in energy A preliminary investigation, to be used in earnest in the next section.
392 L.B. Wahlbin Chapter I 21. Local pointwise error estimates The pollution effects are not severe in this problem; influences of the singularities are locally confined. 22. Some concrete estimates using Theorem 21.2 The estimates of Section 21 are further elucidated in the neighborhood of typical singularities. Chapter V. A singularly perturbed convection-dominated model problem 23. The model problem The singularly features are elucidated, via proofs that will later be used in the finite element situation. 24. The streamline diffusion finite element method The ordinary Galerkin method is worthless. Instead, the streamline diffusion method is introduced. It is modified with artificial crosswind diffusion. 25. Numerical crosswind and downwind spread While downwind spread is always restricted to Chln(l/h), numerical crosswind spread is harder to analyze. We give an analysis and choose crosswind diffusion as to minimize crosswind spread according to these results. (According to numerical experiments these results are not sharp.) 26. The numerical crosswind spread: An alternative proof We explain in more transparent detail how our minimizing crosswind result of Section 25 was arrived at. Also, we further localize the result. 27. Pointwise local error estimates The tight control of numerical crosswind spread gained in Sections 25 and 26 allows decent pointwise error estimates. (They are, however, probably not sharp in practice.) Chapter VI. Time-localized behavior in second-order parabolic problems 28. A linear model problem: Full advantage of the smoothing effect Solving a second order parabolic problem with rough or incompatible initial data, but otherwise smooth data, the solution is smooth for positive time. Using the L2 projection into Vh to approximate initial data, the semidiscrete time continuous Galerkin approximation is of optimal order for positive time in a linear problem, provided integrations in evaluating the L2 projection are faithfully carried out. Systematic errors in evaluating L2 projections or time-discretization in the presence of variable coefficients may drastically alter these optimal order results.
Section 6 Introduction 393 29. A nonlinear model problem: Restricted advantage of the smoothing effect Still the solution is smooth for positive time but, due to nonlinear aliazing, the Galerkin solution does not take "full" advantage of this. The restricted result is shown to be sharp. Chapter VII. Superconvergence 30. Difference quotients on translation-invariant meshes for approximating derivatives On domains where the solution is smooth, difference quotients of the finite element solution provide an optimal 0{hr) order approximation to derivatives of the continuous solution. (This is in contrast to what the derivatives of the finite element solution provide.) The result is subject to sufficient-order approximation globally in negative norms, which may be hampered by singularities in right-hand sides, coefficients, reentrant corners, or, more subtly, deficient approximation of smooth boundaries. 31. Higher-order local accuracy by averaging: The K-operator On locally translation invariant meshes, a simple postprocessing of the finite element solution provides a superapproximating approximation (up to order h2r~2), subject again to global influences measured in negative norms (which may again be subtly obstructed). 32. Miscellaneous results Various identifiable superconvergence points for functions or gradients are described, in special situations. Various postprocessing methods, local and global, are also given. Again the results are subject to subtle obstructions, in particular from approximations of smooth boundaries.
Chapter II Local Behavior in Finite Element L2 Projections 7. Locally quasi-uniform meshes When seeking finite element solutions of partial differential equations the L2 projection frequently appears. E.g., in Lagrangian formulations the mesh moves and may, after some time, deteriorate so that it is necessary to rectify it. Then, how does one transfer data from the old mesh to the new? One possibility is to use the L2 projection, cf. Chavez [1983]. Another example occurs in the solution of parabolic problems where it has been found that taking the approximate initial data as the L2 projection of the given data into a finite element space has some interesting properties, cf. Section 28, when initial data is not smooth, or incompatible. Also, a right-hand side of an equation is, in many finite element methods, replaced by its L2 projection. In this section we shall give the multidimensional counterpart of Theorem 3.3. Again our development follows Nitsche and Schatz [1972] with some ideas from Schatz and Wahlbin [1983] thrown in to show exponential decay. In the one-dimensional case our spline spaces were characterized by two parameters, ц and r in C.1) and C.2). (Clearly, we could have let these parameters vary across, respectively on, each mesh interval.) In several dimensions there is a plethora of possible finite element spaces and the only practical way is to nail down a priori assumptions, letting the user of a particular finite element space worry about verifying these assumptions. For the reader not familiar with applying (or deriving) local results, we point out that deciding on what these a priori assumptions should be is not easy: one may err on the side of giving too specific hypotheses which, say, only apply to Lagrange elements, or one may err in giving very general hypotheses which are almost impossible to verify for a nonexpert. In the present section we shall formalize the superapproximation Lemma 3.2, as translated to the multidimensional case, as our basic assumption. (Various comments about this will be given later.) We now need to introduce notation. Let Q be a basic compact domain in UN and for A0sAx zQ set 8<(^o,^i) = dist(e/40\8fl,6/41\6f2). G.1) Thus for instance d<(A0fA1)=d in the following situation 395
396 LB. Wahlbin Chapter II Fig. 7.1. Further, let <$™(A)^{ve%x(A):d<(supvv,A)>0} and V£(A) = {veVh:d<(suppv,A)>0} G.2) where Vh is some basic family of finite element spaces on Q. Assumption 7.1 (Superapproximation). There exist constants с and С and a number L such that the following holds: Let Aq^A^^Q with rf = 3<(/i0,/i1)^c/i. Let further we#5(A)) with M\i*,A0^Ad~l, ! = 0,.,.,L. G.3) Then for any %eVh there exists феУ^А^) such that \\cox-i>\\o,2,A^CAh/d\\X\\o,2,Ao- G-4) We have seen the basic ingredients necessary for verifying Assumption 7.1 in Lemma 3.2: It takes a local approximation operator with error governed by the local || ||r>2 norm, it takes that Drx=0 for xzVh (e.g., for isoparametric elements, D'x is not exactly zero), and it takes a locally quasi-uniform mesh. In the case of tensor products, e.g., one needs to have a more precise error functional, cf. Bramble and Hilbert [1970,1971]. It is my experience that it is quite easy to verify Assumption 7.1 in any situation with a locally quasi-uniform mesh family. The next result corresponds to Lemma 3.3. The proof is virtually the same but repeated here to bring out essential features. Lemma 7.1. Assume that Assumption 7.1 holds. There exist positive constants c0,c1 and С such that the following holds. Let А0^Аг with d = d<(A0,A1)^c0h. Assume further that vh e Vh is such that (vh,x„) = 0, all taeVfiAt). G.5) Then \\vh\\0,2,A0^C^d/h\\vh\\0,2,Al. G.6) Proof. Let 5 = ch, with с as in Assumption 7.1. Let further A0^AS^A2S with
Section 7 Finite element L2 projections 397 d = d<(A0,Ai)=d<(Ait2i)- Let ше^?(Лй) be a nonnegative function with Mkoo^AS'1, /=0,...,L, G.7) and with (o = l onA0. G.8) Then \MlAo^(VH,COVh). G.9) Assuming that A2»^Ai, we have from G.5), .II»*IIomo<(»*.«b»*-z)» GЛ0) for #e Vh(A2s). Thus, via Assumption 7.1, 11»*11§.ло<С(Ь/5)||»4||§Ми. Repeat the argument with domains A2s£ •■• ZA2NS as long as they remain inside Л1; i.e., since <5=ch, for N = 0(d/h). Since Ch/3<1 may be assumed, we obtain the result upon setting Ch/S = e "cons!. П Now consider (и — uh)(x0) for x0e£2. Assume that (u-u„,x) = 0 forZeK», G.11) i.e., that щ is the L2 projection of и into Vk. We next make an assumption that is trivial to verify in most situations. Assumption 7.2. There exists a constant С such that for any x0 e Q, there exists a function 5° with supp(<5°)eBch(x0) and for Kp^oo, insert1*-™, and such that (vh,e°) = vh(x0) forvheVh. G.12) Now, for any xeVh, (u - uh)(x0) = (" - x)(x0) + (x~ uh)(x0). G.13) Here, (Z-«*)(xo) = (z-«*,50)d-(Z-m*.^0)o G-14) where <5° denotes the L2(Q) projection of E° into Vh. We continue to parallel the development in Section 3. We are aiming for local results expressed in a pointwise fashion. Since the L2 projection is inherently based on the L2 inner product, cf. Lemma 7.1, it becomes necessary to have means of going between various "LP norms. We, therefore, formalize the inverse property of Lemma 3.1 in the multidimensional situation. Assumption 7.3 (Inverse assumption). There exists a constant С independent of h
398 L.B. Wahlbin Chapter II such that for any element. Th Ыо.р,тЛСЪ~т*~шЫол.т, fi>rveVh, K^p^oo. G.15) We can now give the counterpart of Lemma 3.4. Lemma 7.2. Assume Assumptions 7.1, 7.2, and 7.3. Then there exist positive constants с and С such that \6°h(x)\^Ce~cix'xol/h/hN. G.16) Proof. By Assumption 7.3, №I<СЙ-*/2||й2||о>7., G.17) where xe 7}. For \x — xQ\^Ch we have from Lemma 7.1 that ll^llcr^Ce-^-^H^IIoj, G-18) and then by Assumption 7.2 and the obvious stability in L2 of the L2 projection, ll#||o.0^ll*0||o,0<Ch-"'2. G-19) The desired result G.16) obtains. □ As in Lemma 3.5 we have the LP stability of the L2 projection. The proof is exactly the same. Lemma 7.3. Let Assumptions 7.1, 7.2, and 7.3 hold. Then the L2 projection is stable in any LP norm, 1 ^p^oo. The counterpart of Theorem 3.1 is now the following. Again the proof is the same. Theorem 7.1 (Exponential decay in finite element L2 projections). Let Assumptions 7.1, 7.2, and 7.3 hold. There exist positive constants с and С such that the following holds. IfAd — B(x0, d)nQ and ukeVhis the L2 projection into Vh, then for %h arbitrary in Vh, \(u~uh)(x0)\ <C\\u-Xh\\o,o0,Ad + Cd-Nc-c',lh\\u-xh\\o,i,n- G-20) To utilize this result one has to make assumptions about local approximability, cf. Theorem 3.2. Since this is well known in standard finite element analysis, we leave it to the reader. Finally in this section we will further localize Theorem 7.1 as in Theorem 3.3. Again let Ad = B(x0, d)r\Q and let Assumptions 7.1, 7.2, and 7.3 hold whenever we are on Ald. Assume also that {u-u„,x) = 0 forxeV?(Ad). G.21)
Section 8 Finite element L2 projections 399 Theorem 7.2 (Further localization of exponential decay in finite element L2 projection). There exists a constant ct such that if d^c1h and Assumptions 7.1, 7.2, and 7.3 hold on A2i, and G.21) holds, then with positive с and С independent of h, for any xheVh, \(u-uh)(x0)\^C\\u~XH\\o,a,,Ad + Cd-N'2e-c<"h\\u-uh\\o,A<,- G-22) Sketch of proof. The proof is so close to that of Theorem 3.3 that we shall merely sketch it. Let Ph denote the L2 projection into Vh and Ph the L2 projection into Vb{Ad). Then (u-uh)(x0) = {u(xo)-Phu(x0))+(Phu(x0)-Phu(x0)). G.23) By an obvious localization of Lemma 7.3, \u(x0)-Phu(x0)\^C\\u\\0 G.24) The quantity {Ph — Ph)u satisfies G.5) and hence from Lemma 7.1 (localized), combined with the inverse assumption, KPk-PkHxo)l<Ch-«>2e-cd/k\\(Pk-Pl,)ul\0.A< <Cd-^2e-c"/A||(PA-PA)u||o,^, G.25) after a change of constants. Then use the triangle inequality, and Cauchy-Schwarz' inequality, and again Lemma 7.3 (localized), obtaining \\Phu-Phu\\0,Ad^{\u-Phul0,Ad+\\u-Phu{{0,Ad <CdN'2\\u-Phu\\0^Ad+\\u-Phu\\0rAd <,CdNI2\\u\\0^,Ad+\\u-PhuU,Ad. G-26) The combination of G.23)-G.26) leads to G.22) for #/, = 0. Since PhXh = Xh f°r Xh e Vh we may add and subtract %h to arrive at G.22). □ 8. Less regular meshes in two-dimensional Lagrangian elements In Section 7 we gave Theorem 7.2, a result pertaining to the case when the family of meshes is quasi-uniform on the subregion of the full region Q considered. Considering its proof, or the proofs of LP stability of the L2 projection as in Descloux [1972], Douglas, Dupont and Wahlbin [1975b], or the one-dimensional cases surveyed in de Boor [1979], one is led to consider relaxing the assumption of local quasi-uniformity of the meshes; exponential decay as d/h should be replaced by decay according to the number of elements which are removed from the basic point. Here we shall review the careful investigation of Crouzeix and Тномёе [1987а, Section 2] which is based on Descloux's technique. We consider a bounded polygonal domain Q in the plane. The family of triangulations ^ is into disjoint triangles Tt such that no vertex of any triangle lies
400 L.B. Wahlbin Chapter II on the interior of a side of another triangle. (This technically rules out many situations of a posteriori, or adaptive, mesh refinement.) The finite element spaces Vh will be of Lagrangian, i.e., <<?0, type: Vh = {ve<e°(Q):v\T. is a polynomial of degree r — 1}. (8.1) We introduce the relevant, notation of Crouzeix and Thomee. Let T0 be a fixed triangle. Let Rj{T0) be the set of triangles which are; triangles away from T0, defined recursively as R(T0)= T0 and, for j> 1, Rj(T0) is the union of closed triangles in 3~h which are not in [JkjRi but which have at least one vertex in K,-_j. For TeRj(T0) we set the generalized distance between T and T0 as ЩТ0, T):=j, (8.2) Thus, for T ф T0,S>{T0,T) is the least integer; such that there exists a sequence of vertices vhi=l,...J, with vx a vertex of T0, Vj a vertex of Г, and u,- and vi + 1 are vertices of the same triangle for 1 <is£;. It follows that 2{T, Т0) = ^(Г0, Т). Let further ar = area of triangle T. (8.3) The following is the basic result of Descloux, as further amended by Crouzeix and Thomee. Theorem 8.1. Let 1 ^p^ со. There exist positive constants с and C, depending only on r {the polynomial degree), and r and p, respectively, such that if supptHST0, if DJ=[)Rl{T0)= U T (8-4) l>j ЩТо, T)>j and if Phv0 is the L2 projection into Vh, then II P*v0 II 0.2.D, < Genial2 ~ *» \\ v01| „,,. (8.5) Proof. We shall show that for some к> 0, \\РЫ\Ъл^к\\Рм\\ЬшЯ, for;>l. (8.6) Assuming this for the moment, since Rj^Dj^^Dj, we have Qj^Wj-i-Qj), (8.7) where Qj=\\Phv0\\lDj. (8.8) Thus G^(K/(l+K))Gj-i<(K/(K+l)yeo<e-e«||P4i;ollS, (8.9)
Section 8 Finite element 1} projections 401 where е~с=(к/A + к)I/2. Since supp %eT0, ||PA,0||o = max^|Zo x*v* Hz Ho ^ mH ITU^ II y0 II 0,p, Го maX ^7~Ц qellr-iiTo) JJ9tlO.ro яеПг-i И\\о,Т0 where p' is the conjugate index to p. By a standard transformation to a reference triangle (as used in proving most inverse assumptions), with С depending only on r, Ml O^To^f-1'2 II? |1о,Го whence || Phv01| о < Catf-vr || v01! 0.Р.ГО- (8-Ю) Using this in (8.9), and taking square roots, ИЛ»о11о,1)^е-еЧо2/,,11''о11о.лго, which would show (8.5). It remains to prove (8.6). We have (P*vo,X) = 0 for x in Ц, (8.П) with support in Dj-i—DjURj,}^ 1. Let w=PAu0 and define, for any we J^, a new function Wj in Ул by setting Wj=w on £>,, w, = 0 on Q\Dj-x- (8.12) To define w,- on the triangles Г in Rj = Dj\DJ-1, introduce for such a triangle Г the Lagrangian nodes, with barycentric coordinates (i'i/(r— 1), i2/(r — l), i3/(r — l)) for i\,ii,h nonnegative numbers. Now set w;-=w on all nodes that are interior to Rj (i.e., wj is cut down to zero on nodes on or inside the closure of \Ji<jRi(T0)). Then setting # = w,- in (8.11) we have with w = Phv0, (w, wj) = || w || §л + (w, w^.=0, (8.13) so that 1Иб>,.= --(и',^)кг (8.14) In order to estimate the quantity on the right of (8.14), consider again a triangle T in Rj. Note that T has either only one vertex or one whole edge on Rj-x- For qeIIr-x(T) we let qTenr-x(T) be the polynomial that vanishes at the Lagrangian nodal points of T in Rj-1 and that agrees with q at other Lagrangian nodal points. Then ~(w, Wj)T~ — (w, wT)T. (8.15) We now have a localized problem: What happens on one triangle if we cut down
402 L.B. Wahlbin Chapter II a polynomial w to vanish on one vertex or one edge? Transformation to a reference triangle is then possible, Jacobian determinants cancel, etc. and we easily find that |(w,wT)rl^K||w||o,T, (8-16) where к only depends on r, the polynomial degree. Thus from (8.14), (8.15), (8.16), after summation, iMiiUj^KiMiiW This proves (8.6) and thus, as we have noted, completes the proof of the theorem. □ It should now be clear that we can select v0 = S° as a polynomial of degree r — 1 on T0 э x0 such that (%,<50)г0 = х(хо), МхеПг-ЛТо). (8.17) Further, 11<5011од,г„<С (8.18) so that, cf. Assumption 7.2, ll^llo.^CaV?-1', (8.19) as follows by transforming the situation to a reference triangle. Then as before, for uh the L2 projection into Vh of u, for x0 e T0, %h e Vh, (w - uh)(x0) = {u- Xh)(x0) + (Xh ~ Щ){х0) (8.20) where (Xh - uh){x0) = (Хь ~ uh, <5°) = (Xh ~ м, <5л )д (8.21) with ё® the L2 projection of «5°. By use of (8.19) and Theorem 8.1 results about the local behavior of u — uh can then be obtained. Clearly the precise expression of such results will depend (if phrased in norms other than L2) on assumptions concerning the mesh family, e.g., triangle size on Rj{T) as compared to T etc. We refrain from writing them up; the reader should have no problem in a situation where the mesh is, say, orderly refined. In Crouzeix and Тномёе [1987а] conditions are given that assure that the L2 projection is stable in LP(Q) or W1,P(Q), thus opening the possibility for further localization, cf. Theorem 7.2. The conditions given there will give the reader sufficient clues as how to proceed in a specific case.
Chapter III Local Estimates in the Finite Element Method for Second-Order Elliptic Problems, with Applications 9. Local estimates in energy and L2 norms In this section we shall give local error estimates in energy and L2 norms. We follow, by and large, the treatment in Nitsche and Schatz [1974], cf. also Nitsche [1972a] and Descloux [1976]. Modifications are introduced as in Schatz and Wahlbin [1977, 1982]. In particular, our statements follow these two later papers. The results are not only models for later pointwise local error estimates but also provide basic technical tools in deriving these. Indeed, the local estimates in energy based norms give a fundamental tool in proving any U or W1,p error estimate as witnessed by the proofs in the following papers devoted to global estimates: Natterer [1975], Nitsche [1975, 1977], Frehse and Rannacher [1976], Scott [1976], Rannacher and Scott [1982], and Schatz and Wahlbin [1982]. The investigation in Nitsche and Schatz [1974] is quite general in that it treats variable coefficient problems and a general set of finite element spaces, given via assuming suitable hypotheses. We wish to keep the present exposition in a less general setting as not to overburden it. The interested reader is referred to the original paper for the general case. On the other hand, the results of Nitsche and Schatz are not given for subdomains abutting on the boundary of the basic domain. We shall include this case, very important in practice, in our simplified setting. To fix thoughts, let Q € UN be a basic domain and consider as a basic problem that of finding и such that — Au~f on Q и=0 on dQ. (9.1) To approximate (9.1), let тА = (J Г,- be a family of triangulations of Qh into simplicial elements, possibly modified in the isoparametric fashion near the boundary. Although somewhat questionable in practice, we assume for simplicity that Qh^Q. (9.2) On these triangulations let Vh denote Lagrangian elements of order r—1 which vanish on 5Qh. 403
404 L.B. Wahlbin Chapter III Let next B<=QhsQ (9.3) be a basic domain inside of which almost all our considerations will take place. Most often, В can be thought of as a disc intersected with Qh. Further, with notation as in G.1) with Q now replaced by Qh, V^(B) = {vsVh:8<(^PPV,B)>0}, (9.4) and similarly for <^5(B). Consider then any function uhe Vh such that Щи - uh, x) s V(« - uh)Vx = 0 for xe V£(B). (9.5) ft. In particular, щ could be the finite element solution to (9.1). Furthermore, we asume that the family of triangulations is quasi-uniform on B, i.e., that there exist positive constants с and С such that cfc<pT<diam(T)^Ch for ТглВФф, (9.6) where pT denotes the radius of the largest inscribed ball of Г. In this setting and seeing to it that the possible isoparametric modifications at the boundary are well behaved, cf. Ciarlet [1978,4.3], it is frequently easy to verify the following two properties which will be basic in our development. Since we do not wish to go into details about the isoparametric modifications the properties are stated as assumptions. (The properties are the analogues of Assumptions 7.1 and 7.3, but we restate them here since there are some minor changes.) Assumption 9.1 (Superapproximation). Let A0zAl^B with d = d<(A0,Ai)'^ch. Let (oe^^iAo). Then for any xeVh there exists \j/eV^(Ai) with \\<ox-*!>\\i,2,A,^Ch\\xh,2,A0 (9.7) where C = C(co). Assumption 9.2 (Inverse assumption). There exists a constant С independent of h such that for any element Tt meeting B, WkP,T^Ch~^^-m^^\\v\\m,q,T, (9.8) for veVh, l^q^p^co, l^m. Further let \\v\\-s,a= sup (v,w). we<e™(A) М.,л = 1 Then for veVh and A a mesh domain \Ho,A<Ch-s\H-s,A where С does not depend on A or v.
Section 9 Local estimates 405 For this, cf. Lemma 4.3; the proof there easily extends to the case when A abuts on the boundary. The following is our basic local energy estimate. Many ideas in its proof were sketched in a simple situation in Section 4. Theorem 9.1. Let Assumptions 9.1 and 9.2 hold on B^Qh and let e=u — uh satisfy (9.5). Let A^B with d<(A,B)>0. Then {for h sufficiently small), ||Ve||0M<Cmin \\u-x\\i,B + C\\e\\0iB (9.9) where С depends on ЬК(А, В) and the constants of the assumptions. The proof will rely on the following lemma. Lemma 9.1. Let wheVh be such that D(wh,X) = 0 forxeV^A,). (9.10) ThenforAcAuQ<(A,A1)>0, \\Wh\\i.A^C\\w„\\0,Al. (9.11) The proof of this lemma is postponed for the moment. Proof of Theorem 9.1. Let AzA1cA2^B with 3<(/l,y41), d<(A1,A2)>0. Let (ое<ё^(А2) with cosl on Au and let Px denote the H^Q) projection into Vh. Then on A, u-uh = lcou~P1(cou)'] + lP1(cou)-uh]. (9.12) Here, as is well known, ||V(©«-/'1(fl)H))||0li<C||(B«||1.Oli<C||H||ltB. (9.13) Set next wh = P1(cou) — uh. Then wh satisfies (9.10) and so by Lemma 3.1, \Ы\1,л<С\\щ\\А1^\\<т-Р1{аш)\\А1 + \\е\\Л1. (9.14) By Poincare's inequality (note that сои can be assumed to vanish on a major portion of 5\4i even if cou^O on dQh), and using (9.13), ||ow-P1((Bii)L1^||V(aw-P1((BH))||0li<C||M||1,J,. (9.15) Combining (9.12)—(9.15) proves (9.9) with x = 0. Now add and subtract a /e Vh. □ It remains to verify the lemma. Proof of Lemma 9.1. We first consider ||VwA||o,^- Let соe<€<(Ax) with co= 1 on A. Then, by (9.10), l|VwJ|gM<(Vwfc) coVwh) = (Vwh, V(cowh)) + (Vwh, (Vco)wh) = (VwA, V(cowh-x)) + ?(wh, (Aco)wh).
406 LB. Wahlbin Chapter III Thus, by the superapproximation hypothesis, UVwJg.^CfcllwJf.^ + illwJg,^ and so \\y>k\\l.A^Chll2\\wh\\liAi+C\\Wh\\0,Ar The argument may be repeated for ||wJliAl on the right to yield with AzA2^B, \\wh\\UA^Ch\\Wh\\ltAj + C\\w„\\0tA2. (9.16) In light of the inverse hypothesis this proves (9.11) after a change of notation. D It is easy to trace through the proof how the constant in Theorem 9.1 varies if d = 5<(A, B) becomes small. Another way of seeing the same thing is to consider domains of size comparable to S<(/4, B), scale such up to unit size and then note (or, in general, make an assumption) that after scaling, the previous assumptions hold with h replaced by h/d. In the case of variable coefficients one would then have to be careful about how constants associated with them enter into the proof, cf. Schatz and Wahlbin [1977, Lemma 3.1]. Corollary 9.1. With d = d<{A, B)^const-h, (9.9) is replaced by ||Ve||0ii4^Cmin(||V(«-Z)||0il, + d-1||«-zllo,B) + Cd-1l|e|lo.B- (9-17) XtVh We next want to replace the L2 norm on the right-hand side of (9.9) by a weaker, negative, norm, ||е||-,,в= sup (e,v). (9.18) ds«?(B) М«.в=1 For this we shall need three additional assumptions. The first one amounts in practice to a very weak regularity condition on the boundary of Q and is not needed if we consider only completely interior subdomains. It is needed for a duality argument. Assumption 9.3. The families of mesh boundaries SQh are uniformly Lipschitz so that there exists a universal extension operator E (cf., e.g. Stein [1970]). Assumption 9.4 (Locally smooth boundary). Let Л3 еВ, Э<(ЛЬ B)>0. There exists iiEfl such that 9<(Л1пВ, В)>0 and such that if — AV=v in Ay. n я I (9Л9) u = 0 on oAu then W\\1 + 2,a^CMlm> l = 0,...,s-l. (9.20)
Section 9 Local estimates 407 In order to verify Assumption 9.4 one would adjoin a surface smoothly to 60, taking care that it goes between Ax and В in C2h. We shall also need a very weak approximation hypothesis. Assumption 9.5 (Weak approximation). There exists a constant С and a positive number у such that the following holds. With At as in Assumption 9.4, for V vanishing on dA~i, min W-xWiM^^Ch^lVW^. XeV<(AinQh) In practice this is clear with y=i. We can then proceed essentially as in Section 4. Theorem 9.2. With assumptions as in Theorem 9.1 and with Assumptions 9.3, 9.4, and 9.5, HVelk^Cmin \\и-Х\\ив+С\\е\\-й,в (9.21) xeVh From the proof of Theorem 9.1 it is clear that Theorem 9.2 would follow from the following result which extends Lemma 9.1. Lemma 9.2. Let wheVh satisfy (9.10). Then KI|^C||w»||_1>i4l. (9.22) Proof. Let A^A0sA1 and let coe^<{A0) with w = l on A. Then KII-m= sup (wh,v). (9.23) For each fixed such v, let V solve (9.19) on a domain A0, where v is replaced by Ev via the universal extension operator. Then (щ, v) = {cowh, v) = D(awh, V) = whBVa>VV-(Aco)V)+ \VwhV(coV-x) for arbitrary %eV^(А0глОн). Consequently by Assumption 9.5, |(w»,»)I^C(||w4||-/_1,i4o+||wik||1,>(ofc'')||^||I + 2Mo so that by Assumptions 9.3 and 9.4, \\Wb\\-i,A^C\\wh\\-i-uA0 + Cby\\»dLA0. (9.24) By Lemma 9.1 then, taking 1=0, and changing notation for domains, l|w*||oM^C||w»||_lii4o + CA'||w»||0.^.
408 L.B. Wahlbin Chapter III The argument may now be repeated until the inverse assumption can be applied to the last term on the right. This would prove (9.22) for s=l. The argument then proceeds via induction in (9.24). □ Again the mapping as described for Corollary 9.1 gives: Corollary 9.2. With d = d<(A,B)^const-h, (9.21) is replaced by l|Ve||0fi4<Cmm(||V(M-Z)||0iB + d-1||u-Z||0tB) + ОГ1-' ||e||_, B. (9.25) XeVh We now turn to local error estimates in the L2 norm. We cannot expect a "straight" analogue of Theorem 9.1 (or Theorem 9.2) since the Я1 projection is not in itself stable in L2 for merely continuous elements, see Babuska and Osborn [1980, p. 58]. The traditional path to L2 error estimates is via a duality argument and we shall follow this path. Note that in our previous duality arguments in this section, namely in the proof of Lemma 9.2, we worked opposite a function in Vh and thus the "skin layer" Q\Qh never entered our considerations. In particular, all our results were stated in terms of domains which are subdomains of Qh. Since the H1 projection into Vh does not see и in the skin layer this is correct. In the argument to follow shortly the behavior of и in the local skin layer will enter through a boundary integral term. We shall make the following modification of Assumption 9.5. Assumption 9.6. There exists a constant С such that the following holds: With Ax as in Assumption 9.4, with V vanishing on dQr\A1, and with A0zAt where Э<(Ло,Л1)>0, min \\V-x\\UAo^Ch\\V\\2rAi. (9.26) In practice, this is clear provided dist(Sfl, 5Qh)^Ch2 locally. Theorem 9.3. Let Assumptions 9.1-9.6 hold onB^Qh and let e = u-uh satisfy (9.5). Let A^B with d<(A,B)>0. Then (for h sufficiently small) \\e\\o,A^Chmin ||м —xIIi,b +^lul-i/2,BnBQ), + Q\e\\_SfB. (9.27) XSVh Proof. Let A be as in Assumption 9.4 and AS:A0^Ar\Qh with their 6< distances positive (as can always be arranged). Let а»б^5(/40) with a> = 1 onl We have ||е||-1М= sup (coe,v). (9.28) \\vh.A=i- For each fixed v, let -AV = Ev on A, F = 0 опйЛ. (9.29)
Section 10 Local estimates 409 Then, as in the proof of Lemma 9.2, and since uh-0 on dQh, for any tsVh, (coe,v)annh)=- (oeAV = V(coe)VV+ cou(dV/dn) AnQh Anf)h ЪЯг\д£)н VeV(a)V-x) + (ou(dV/dn). eBVcoVV-(Ao})V) + Antih AnQ„ ЪАпдПь Thus by our Assumptions 9.3 and 9.5 and by a trace inequality, IklN^CIIell-j.^ + CAIIVelloMo + M- 1/2,ал0пЭпй- The desired result now follows in the obvious fashion by use of Theorem 9.1 and induction. It is left to the reader to formulate the obvious analogue of Corollary 9.1. Remark 9.1. As the reader has undoubtedly noticed, we have gone through some contortions in order to keep the domains entering in our statements inside Qh. Apart from the intellectual honesty of this (since only и as seen on Qh enters in the elliptic projection) it is of some practical importance. A well-known example is that of quadratics in the plane where the triangulation is straight-edged with no isoparametric modifications at the boundary. Then for smooth functions vanishing on 80 on a domain В abutting on a smooth boundary bQ, the error in Я^В) is typically 0(/i) at best whereas the error in H1(Bn£2ll) is 0(/i3/2), see Strang and Fix [1973, p. 195]. Likewise, for isoparametric modifications of polynomial order r — 1, the H1{B) error is typically of order 0(hrl2) whereas the error in H1{BnQh) is of order 0(hr~x). (In certain cases it does happen that the error in H1(B) is of order 0(hr~1); the reader may, as an elementary but interesting calculus exercise, verify that this happens if quadratic plane isoparametrics are used on Q the unit ball in U2; further, in this case Qh s Q when the usual halfpoint interpolation procedure is used. These two facts are connected: if Qh я Q should occur, then dist(£2, Qh) = 0(/i4) while if the boundaries of Qh and Q interweave, then dist(Q,Qh) = 0(h3) in general. In the latter case, however, the undershoot and overshoot areas cancel to order 0(ft4).) The case of natural (Neumann) boundary conditions is considerably easier to treat. We leave it to the reader and shall later freely use the relevant anologues of the results above. For local error estimates in mixed methods, see Douglas and Milner [1985]. 10. Local point wise error estimates We shall first consider interior estimates, following Schatz and Wahlbin [1977]. In order not to overburden the exposition we shall be somewhat cavalier in giving
410 LB. Wahlbin Chapter III hypotheses and refer to the paper mentioned for exactness. (By now the reader should have appreciated the role of superapproximation.) Let x0 be a point in Q, and let B€ Q be a ball of radius d around x0. With A{u, v) the bilinear form associated with a uniformly elliptic second-order operator with smooth coefficients, assume that uheVh(B) satisfies Л(«-и*,х)=0, fOTXeVh(B), A0.1) where Vh(B) denotes the functions in Vh restricted to B, and Vh(B) = {xeVh(B):snppX^B}. A0.2) Note that in this setting the boundary conditions on dQ do not enter. Assume further that the family of meshes is quasi-uniform on В and that the highest possible (the optimal) approximation order in IF is 0(hr), r^-2. Theorem 10.1. Under the above hypotheses, there are constants с and С such that for d^ch, for s^O, |(tt-u,)(xo)I^C(ln^rmin|!U-x!|o,oo.B + C^-iV/2-sl!u-u/J||_s,B, A0.3) where A for r = 2. For the necessity of the logarithmic factor, see Haverkamp [1984]. The result of Theorem 10.1 was known earlier than 1977 in various cases of perfectly regular interior meshes (uniform, translation invariant). We refer to Bramble and Thomee [1974], Bramble, Nitsche and Schatz [1975] and Bramble and Schatz [1976] for such results, often based on local Fourier analysis. We proceed to describe the salient features in the proof. We consider only the case d = 1; the case of d< 1 follows by mapping В to a unit ball and replacing h by h/d. In a sense one relies on global L^ stability results for elliptic projections in a localized boundary value problem. For simplicity assume that the form A is coercive on B, cf. Schatz and Wahlbin [1977, Appendix 1]. We then take the Neumann problem over В as our basic localized problem: Find veH1(B) such that A(v,q>)={fcp) for <peH\B). A0.5) Let then Pt denote the elliptic projection into Vh(B). We note that ||»-PiP||I.B<Cfc»-'||»||liB A0.6) for 1 = 0,1, 2=%s<r. This follows by extending v over dB and using ordinary approximation theory and duality arguments. Let next coe^gQB) with co(x)=l on \B. Then (u-uh)(x0) = {cou-Pl{a)u)){x0) + {P1(a)u)-u„){x0). A0.7)
Section 10 Local estimates 411 The two essential steps are the following: flow-ЛМПо. „.я/4 <C(ln l/hf \\(ou\\0^B A0.8) and for the function w„ = P1(cou)—uheVh which is "discrete harmonic" on \B, K(*o)l<CK||1(Jve. A0.9) One then uses Lemmas 9.1 and 9.2 to obtain K(Xo)I*SCIKI!-s,b/4 ^C\\¥Am)-ym\_l%m + C\u-uh\_^. A0.10) Here, by A0.8), ||«^-Р1(ви*)||_лВ/4<С||а)и-Р1(й)м)||01вI1,/4<СAп1/йУ||и||0>00|Я. A0.11) Combining the above would give A0.3) for %=0; then write u — uh=(u~x) + ix—«*) to complete the proof. The two basic building blocks are thus A0.8) and A0.9). The stability result A0.8) is reminiscent of global stability results such as those referred to in the second paragraph of Section 9. We shall follow the ideas of Frehse and Rannacher [1976] and Scott [1976], in particular. A nice thing now happens: The results on the error in Li-based norms for smoothed Green's functions that are basic in the global investigations just mentioned also give a way of proving A0.9). Let us state this fundamental result: Lemma 10.1. Let TciB be an element. Let (pe%?o(T) and let v and vhe Vh(B) satisfy А(ф,и) = (ф,ср) foril/eH^B), A0.12) A(x,v-v„) = 0 forXeVh(B). A0.13) Then lk-^lli,i,B<C^2+1(lnl//iri|(?||o,r, A0.14) and for Bt CB, II о-ИлН^гл.в, ^Cftw/2(ln 1/Л)'|| ф ||o.r- (Ю.15) Here wh'2'l'h is the piecewise W2'1 norm. Before indicating the proof of Lemma 10.1 in a simple case, let us show how A0.8) and A0.9) would follow. For A0.8), let Xj be a point in ^B and denote й=сои. If xteTx we have by the inverse property, \Plu(x1)\^Ch-N'2\\Plu\\0,Ti «ЭДо.оо.г, +С/Г»'2||й-Р1й||0,г1. A0.16)
412 L.B. Wahlbin Chapter III Here ll«-.Pifi|lo.T1= sup (й-Р^ср). A0.17) For each such q>, let и and vh satisfy A0.12), A0.13). Then {u — P1u,(p) = A(u — P1u,v) = A(u — P1u,v — vh) = A{u,v — vh). A0.18) With L* the adjoint elliptic operator associated with A, and with 9/5n denoting the conormal derivative, A(u,v-vk)=Z |fiL*(«-p»)+I J u^fv-vh). A0.19) T, ЭГ; The summation here is only over elements meeting supp й and thus they are quasi- uniform. A simple trace inequality gives that I/Il.cbt,, < C(|| V/||ll(r|) + h~ > \\f\\LliTl)). A0.20) Thus, using Lemma 10.1, A{u,v-vh)^C\\u\\0tO0{\\v-vl,\\h.2tUB^+h\\v-vh\\1AyB/A) ^СAп1/Л)'Л№/2||й||0,оо. (Ю.21) Combining the above A0.8) obtains. For A0.9), let t]he Vh be such that wh^r]h on jgB while supp r\h<^\B and ||r\h|| t < C||Wfc||1>B/8. Such a cut-down is easily constructed, see Schatz and Wahlbin [1977, Proposition 2.2]. Then if xoeT0, \Мхо)\ = Ых0)\^Ск~^2\Ш\0гТо = Ск-т sup (r,h,cp). A0.22) №1о,т0=1 Again, for each such q> let у and uft satisfy A0.12), A0.13). Then {rjh,(p) = A(r]h,v) = A(rjh,vh). A0.23) Let now хь = v>, on ^B and supported in fB, while || u,, - ih || i,W8)\(B/ie) < C|]yJI i,(b/8)\(b/i6)- Then since /y/, = wA on r^B and wfc is discrete harmonic, (r\h, cp) = A(rih, vh~Xh)^C\\ wh\\ ив/а || vh || 1i(b/8)\(b/i6)- A0.24) The important point here is that we now only need to estimate vh on an annulus A — ^B\-^B away from the "singular" point x0. Then vh is "discrete harmonic" on this annulus and thus by (an easy extension of) Lemmas 9.1 and 9.2, KIIi,^CIKIIi,i,b- A0.25)
Section 10 Local estimates 413 Here, by Lemma 10.1, iMii,i,B^iif-^iii,i,B+iMii,i,ij <CAw'2 + 1(ln W+Nli,i,B. A0.26) It is easily seen, cf. Schatz and Wahlbin [1977, Lemma 4.2], that II и II I.!.» <C|| cp ||0ЛрВ <C^'21| <p ||0,B = ChN'2. Collecting the above, the estimate A0.9) follows. To complete this informal description of the proof of Theorem 2.1, one should indicate the proof of Lemma 10.1. First note that A0.15) follows from A0.14) by introducing a suitable interpolant to v, by using inverse properties, and by using well-known results about the continuous problem. (The use of inverse properties accounts for the restriction to Bt <g# since elements at дБ may be very irregular.) The result A0.14) is, for TV = 2, contained in e.g. Scott [1976]. While being essentially a global result and thus outside the scope of this article, the proof is a very nice application of the local Я1 estimates of Section 9 and thus we shall take this opportunity to give a very brief sketch in the case of TV = 2 and piecewise linear elements (r = 2). Sketch of Proof of Lemma 10.1, A0.14), in a simple case. Let the element Г contain the center of the unit circle B, taken as the origin for notational simplicity. Let Aj denote the annuli and let J be the largest integer such that 2~j^h. Set dj = 2~i,Ah = B\\Jj=0Aj so that TcAh, and let A^Aj-iUAjuAj+i. Setting e = v—vh we have IHIi,i,b= t \\*h.i.Aj + \Hh.i.Ah. (Ю.27) By Cauchy-Schwarz' inequality and A0.6), \\eh,i.A,^Ch\\e\\ltAb^Ch2\\v\\2,B^Ch2\\(p\\Q,T. A0.28) Further, again using Cauchy-Schwarz, by the local H1 estimates and approximation theory, \\e\\i.i.Aj<Cdj\\e\\UAj^Cdj{hM2.A-J+dr1\\e\\o.Aj}- (Ю-29) From well-known properties of the continuous problem, since v satisfies a homogeneous equation away from T, Wvh.Aj^Cd^Wvh.s^CdJ-ihMo.T.
414 L.B. Wahlbin Chapter III Hence, 1И1,1>^СЛ2||<рЦо.г + С|М|0,4 so that combining with A0.28), and using again A0.6), 1И1.1,*<С£Л2||ф||о,г + с£|Иом} j=0 j = 0 ^C(lnl/h)h2\\<pi0,T + C(\nl/h)\\e\\Q,B ^Ch2(lnl/h)\\(p\\0,T which is A0.14) in the present case. □ Remark 10.1. One may inquire about the counterpart of Theorem 10.1 in VF1'00. The proof above can rather easily be adapted to cover that case. The crucial point is to estimate Vwh(x0) for wh discrete harmonic. Proceeding as in A0.22) one ends up with sup(»/fc, (px) (or q>y) on the right. For the duality argument, let -Av-q>x. Then proceed to A0.24) essentially as before. It may be assumed that <p behaves like a finite element function so that || V<p |l о ^С/Г^ф Но- Then on the right of A0.24) H«l,lll,(B/8)\(B/16)<C||tjJ|o,i^||D-uJ|o,l + l|u|lo,l ^ Chm + x(In l/fc)'|| q>x\\о + ChN'21| ф || о ^Chm(lnl/h)f. This leads to the appropriate analogue of Theorem 10.1 in W1,a0. Rannacher and Scott [1982] have shown that the logarithmic factor is not necessary for r = 2. It is now natural to ask about the analogue of Theorem 10.1 in case В abuts on the boundary. To my knowledge no systematic general investigation has been performed in the literature, not even in the Neumann case assuming Qh = Q. The problem, apparently, lies with constructing a localized boundary value problem (analogous to the interior Neumann problem A0.5)) for which one can prove all that is needed. The problem should have (at least) H2 regularity. (In the Neumann case, adjoin a surface smoothly to 60 and consider a local Neumann problem.) However, for essential boundary conditions on dQ, the boundary conditions cannot, with present technology of proofs, be taken of Dirichlet type on the rest of the boundary of the localized problem since, as of today, proofs of L°° stability of elliptic projections require that the mesh boundary is within 0(h2) of the exact one, cf. Schatz and Wahlbin [1982]; such would not be the case on the rest of the boundary. In a very special case a mixed problem was used in Schatz and Wahlbin [1978]. In extending Theorem 10.1 up to the boundary one is then reduced to applying various ad hoc methods. We shall give two such special methods in the following examples.
Section 10 Local estimates 415 Example 10.1 (Reflection over a straight boundary). Consider the Dirichlet form, in the plane for simplicity, D(u,v)= \uxvx + uyvy. Let Г be the boundary x = 0 (note that rotations do not change the Dirichlet form). Let Vh be Lagrangian elements on triangles which vanish on Г, and assume also that и = 0 on Г. Assume D(u-uh,x) = 0 forXeV?(B) where В is a semidisc centered on Г. We wish to estimate u — uh on A£B, d<(A,B)>0, say A a semidisc concentric with B. If one extends и oddly over Г to й, щ to щ, and of course the mesh similarly, one has (for <ё° or <&l finite elements) Z)(tf-ufc,x)=0 iori€Vh{B) since this is automatic for even functions in Vh(B). Thus, e.g. 11"-й,Но,оо,лСAп W" min ||й-х11о,«,л + С||ы-йл||о.^ xeVh(B) so that, restoring now a general d~d<(A, B), \\u-uh\\^A^C{\ndlh)fmm\\u-x\\ao,B + Cd-m\\u-uh\\OB. A0.30) xeVh The example clearly generalizes to more dimensions, to Neumann problems, Example 10.2 (A two-dimensional technique). Let A(u-uh,x) = 0 iorXeV£(B). A0.31) Assume now that one can find some boundary value problem containing the region of interest such that the elliptic projection P1 thereon is (almost) stable in L00. This problem could e.g. be the basic problem with coefficients changed outside В as in Schatz and Wahlbin [1983], or it could be the basic problem with the mesh changed so as to be globally quasi-uniform if this can be done, cf. Schatz and Wahlbin [1982]. With со a cutoff function isolating A^AiZB, i.e. w = l on At, while 6<(suppw, B)>0, (u - uh)\A =(cou-P1 (сои)) + (P,, (сои) - uh). A0.32) Then by assumption, \\cou-Pi(cm)\\0^ <C(ln 1/ЛЛ1©и||о.» <C(ln 1/АЛ|и||о,».*. (Ю.ЗЗ) Now by A0.31), wh = P1 (сои) — uh is "discrete harmonic" on A i. In the two-dimensional case with quasi-uniform meshes one has the following variant of Sobolev's
416 L.B. Wahlbin Chapter III inequality, llxllo.«.^ain l/AI/2Hzlli.2.^ for any XeVh, A0.34) see Wendland [1979, Theorem 8.3.3] and Schatz, Thomee and Wahlbin [1980, Lemma 1.1]. (To prove A0.34), express jr(x0) in terms of a suitable Green's function, after extension over dQh by zero.) Here A^A' ^At. Next use the discrete harmonic estimates of Lemmas 9.1 and 9.2, assuming the boundary is sufficiently smooth. Thus w < cam/ftI'2 и wj|_IiAl. (Ю.35) к\Ю,ю,А -^^U" Ч'Ч II "(ill-«.Л Since llwJI-^^llwW-^lCMIUs^. + llw-Wftll-M! and ||<mm-F1(com)||_Mi <C(ln 1Л)г"Ц»11о,»,в. the combination of the above give, upon writing и-иА=(и-х) + (Х-«(■)> with general ^ = 9<(Л,В), il« —«fcllo.oo,^ ^C(lnl//ir"+1/2min!|M-Z||0,t>3,B + C(ln^)I/2d-I-s||M-ufcj|_s,B. A0.36) Although a result lacking in elegance it is, at present, the best we can do in many situations close to the boundary. In the rest of this chapter we shall give applications of the basic estimate A0.3). 11. Problems with a rough right-hand side Here we assume that the basic second-order problem is uniformly elliptic on a smooth domain and has smooth coefficients. We first consider the (much simpler than essential boundary conditions) case of homogeneous natural Neumann conormal conditions. Thus, Lu=f in Q, bu AU) —- = 0 on QQ (n conormal direction) on where / may be rough. We assume furthermore that the associated bilinear form A is coercive over H1(Q) so that A1.1) has a unique solution. Let now Vh be finite element spaces and let x0 be a point interior to Q, x0elntfl. A1.2) Let further В be a ball of radius d centered at x0 with В <£ Q. (The case when x0 is close to the boundary is left to the reader, cf. Examples 10.1 and 10.2.) Assuming that the meshes are quasi-uniform on В we have thus from Theorem 10.1, for щ the
Section 11 Local estimates All elliptic projection into Vh, \(u-uh)(xJ\*:C(lnl/hymin\\u-x\\0^B + Cd-»'2-°\\u-uhl\_S!B. A1.3) Here the influences on the error in the elliptic projection at x0 are neatly separated into a local approximability term and a term in a weaker norm which accounts for all outside influences. Note that if one attempts to make the local term more local, i.e., decrease d, then the factor in front of the global influence term increases. Assume now further that / is smooth in a neighborhood B' of B; then so is и and we assume, as is not too unreasonable in practice, that тш||м-х11о.„.в<СА,||ы|| j,.. A1.4) xeVh To estimate the global influence in A1.3) one most often resorts to a duality argument. We give a very simple one here. In the present Neumann situation we assume (as is again not totally unreasonable in practice) that for Px the global projection into Vh, lk-filli.fl<CA'-1||»ll,,0 for2<s<r. A1.5) Now l|M-uJ|_(r_2)ifl= sup (u-uh,<p). A1.6) Mlr,n=l A1.7) For each such <p, let v solve L*v = q> in Q, dv — = 0 on dQ. an Then with vh = P1v, by use of A1.5), (u-uh,<p) = A(u-uh,v-vk) = A(u-x,v-vh) ^ C|| tt —X || 1>я j| 17 —ий || 1>n «SCft'-MlM-xlli.o. A1.8) Provided feL2(Q) we have for a suitable ^ in Vh, ll«-Zlli.D^CA|N2jO<Cft||/||0>o. A1.9) We have thus proven the following local error estimate of optimal order, apart from the logarithmic factor. Theorem 11.1. Consider A1.1) with L and dQ smooth. Let x0 be interior to Q and the ball В of radius d also interior to Q. Assume further that the meshes are quasi-uniform on В and that A1.4) and A1.5) hold.
418 LB. Wahlbin Chapter III Let ueWr<<°{B) andfeL2{Q). Then \(u-uh)(xo)\<Chr(\nd/hY\\u\\r^B + Ch'\\f\\0,Q. A1.10) It is sometimes necessary to perform the duality argument in A1.8) with considerably more care. As an example we consider interior approximation of the Green's function in the setting of A1.1). We continue to assume A1.4) and A1.5). Let thus yelntQ A1.11) and the Green's function Giy) centered at у be defined by A(Gi3,),v) = v{y) foTveWl-x(Q). A1.12) The approximate Green's function Giy) e Vh is given as A(Gi>\z) = X(y) ioTXeVk. A1.13) We have the following result, Schatz and Wahlbin [1977, Theorem 6.1]. Theorem 11.2. Let x,yeQx<gQ and assume in particular A1.1), A1.4) and A1.5). Then for \x—y\^Ch, \G(y\x)~G^{x)\^Chr(ln\x-y\/hY/\x-y\N'2+r, A1.14) and for \x — y\^Ch, Thus, for interior situations at least, the error in the Green's function is, apart from the logarithmic factor if f = 2, exactly as one would guess from pure approximation theory, A1.4). In the case of | x — у | ^ Ch the approximate Green's function behaves no worse than the continuous one. As noted in Bramble and Schatz [1976], a use for such refined error estimates in the Green's function is as follows. For uh the elliptic projection of и one has (u-uh)(y)= I ((P\x)-GP(x))f(x)dx. A1.16) j supp/ Thus, provided supp fig Q (otherwise one would have to do additional work, cf. Examples 10.1 and 10.2), the estimates of Theorem 11.2 will furnish error estimates for u — uh under very weak regularity assumptions on/ In this connection see also Scott [1973]. Proof of Theorem 11.2. We shall content ourselves with showing A1.14) and refer to Schatz and Wahlbin [1977, Theorem 6.1] for the estimate A1.5) (which has a less interesting proof). The proof of A1.14) involves a precise variant of the duality argument. Set d — \x—y\.
Section 11 Local estimates 419 Since, as is well known, \\G(y)\\rimtB^Cd~N~r+2 we have by use of (an obvious modification of) Theorem 10.1, with eiy) = Gly) - Gtf\ with В of radius id centered at x: \eM(x)\^Chr(lnd/hyd-N-r+2 + Cd-N-r+2\\eM\\2-r<UB A1.17) where I|e&)ll2-,,i..= sup (<P\q>). A1.18) ll«Pllr-2,OT,B~ 1 For each cp as above, let v be such that A(w, v) = (w, cp) for w e H * (Q). Then v is smooth and (e<y\cp) = A(e<y\v) = v(y)-vh(y) A1.19) with yA the adjoint projection of r into Vh. Since у is interior to Q we may apply Theorem 10.1 and A1.4) so that \(v-vh)(y)\^C{hr(lnd/hY\\vl,a0yS+d-NI2\\v-vh\\0,il} (П-20) where В is centered at у with radius \d. Thus В and В are separated by |d and one may derive by use of the continuous Green's function that NU,*<CIMU2,»,B = c. (n.21) Further, by A1.5), and straight duality, <C/fd*'2||H,-2...i> = Cftrdw/2. Hence, by A1.18)-A1.22), \\ewh-r.i.B<Chr(lnd/hy. Reporting this into A1.17) the desired result A1.14) obtains. □ Remark 11.1. Note that the proof of Theorem 11.2 still holds if the mesh is refined in a "systematic" fashion towards the point y. We refer to Eriksson [1985a,b] for details. We lastly consider the counterpart of Theorem 11.1 for essential boundary conditions, i.e., Lu=f in Я и = 0 on8fi. A1.23) Now there are severe difficulties in performing even the simple duality argument given in the proof of Theorem 11.1. One may for simplicity assume that Qh<=,Q and the functions in Vh vanish on dQh. Typically, for isoparametric elements of order r — 1 (after an imaginary shift of Q as to be a superset of Qh), the distance between dQh and Зй is 0(hr). We amplify one difficulty:
420 L.B. Wahlbin Chapter III Remark 11.2. In the above situation the counterpart of A1.5), i.e., for veH1{Q), infllf-xllu^Cft'-MMU xeVh does not in general hold. Indeed, Q needs to be replaced by Qh on the left, since, in general, Nkoxo^OfA) and not better. This leads to grief in the duality argument. The correct assumptions to make are now to keep the interior high-order assumption A1.4), and to replace A1.5) by: For veH1{Q)nHs{Q), min||i>-z||li0li<Cfc'-1|MU0 (П.24) xeVh for l^s^r'. E.g., for plane quadratic elements on straight-edged triangles, r' =§ (r = 3 in A1.4) but globally up to the boundary, r = 2 only). For isoparametric elements of basic polynomial order r—1, if Qh^Q (not always practical!), one has the distance between dQh and 30 of order 0{hr) and r' =r in A1.24). Furthermore one needs to formalize that distFflb6i2) = 0(^) A1.25) and that dQh is uniformly Lipschitz. In the counterpart of Theorem 11.1 to be given we consider only d of unit size. Theorem 11.3. Consider A1.23) with globally quasi-uniform meshes such that A1.4), A1.24), and A1.25) {with Qh^Q) hold. Then for x0 interior to Q, if ие\У'°° in a neighborhood В of x0 and feL2{Q), then \{u-uh){x0)\^Chr{ln l/hY\\u\\r^B + C{h^ + ¥)\\f\\0,n. A1.26) Note that the term hq describing the size of the boundary perturbation should enter. Consider e.g. what happens even in the continuous problem if the boundary is perturbed by 0{hq) in one direction; cf. Remark 11.3 below for cases where the boundaries 60 and QQh interweave. Proof. We have by Theorem 10.1 (d = l), |(«-«fc)(xo)|<C(Inl/h)r"min||u-z||0i00,B/2+C||«-tifc||_,,Bj A1.27) where BzQh. From A1.4), !|w-zllol0o,B/2^C^||u|lo,a>,B^C/ir. A1.28) Next we apply a duality argument, ||м —илЦ _s,jj= sup {u-uh,q>), A1.29) ll9li«,B = l
ди Section 12 Local estimates 421 For each such cp, let veH1(Q) with A(w,v) = (w,q>) for weHl{Q). A1.30) Then having uh extended by zero to Q, (u-uh,(p)=A(u-uh,v) = A(u-uh,v-vh) = A{u-x,v-vh) any*eF„, A1.31) where vh is the elliptic projection of v. In obvious notation, since u = v=0 on 8£2, (u-uh,(p) = AQJu -x,v-vh) + An\Bh(u, v) = Anh(u-X>v~vh)- fv~ a\nh afih Thus, from A1.24) and A1.25), for s sufficiently large, |(м-«*,9)|<СЛ||м||2>0СЙ''-1||»Нл0 + Сй«(||/||о,0 + |и|1>в0к) which, in case dQk is uniformly Lipschitz, is bounded (by well-known trace inequalities and regularity in A1.23)), by C(fcr' + WH0iO. From A1.27), A1.28) and the above the desired result A1.26) obtains. □ Remark 11.3. There are some special cases in which the negative norm estimates in the proof of Theorem 11.3 can be appreciably improved on. This happens e.g. on square domains in the plane, and, for quadratic elements with a special technique for the essential boundary conditions, in Scott [1975]; cf. also e.g. Nitsche [1972b] and Bramble [1975]. In this connection, cf. Remark 9.1. We may summarize the present investigations in saying that, in case singularities are introduced by a rough right-hand side only, the elliptic projection is very locally determined (at least in the interior). 12. A model problem on a plane polygonal domain with a reentrant corner: Unrefined meshes Let Q be a bounded polygonal domain in the plane (see Fig. 12.1), and consider the model problem of finding и on Q such that — Au= f in Q, A21) ы=о onr=aa v Assume (to isolate effects) that /е^°°(й). Let xh = \J Ti be a family of quasi-uniform (i.e., unrefined) triangulations of Q (Qh = Q is natural for a polygonal domain). That is, with p; the radius of the largest inscribed disc of Th
422 L.B. Wahlbin Chapter III Fig. 12.1. ch^Pi^diamiT^Ch A2.2) where the positive constants с and С are independent of h. Let a.j denote the angle interior to Q at each vertex vj of Q. We designate as the "worst" vertex that with a,- maximal, i.e., Pj=n/iXj, а,#я, A2.3) minimal. Set that worst vertex as v0 and /? = тг/а0<1, A2.4) i.e., the worst corner is reentrant. As is well known, cf. Kondratev [1967], Kellogg [1971, 1972] and Grisvard [1976, 1985], with polar coordinates г, в centered at a fixed vertex vJf u(r,0) = ao,/^ sin(ft0) + ■■• A2.5) as r tends to zero. In general a0J is not zero and thus the worst singularity we have to contend with occurs at v0. Consider now an interior fixed domain А С Q. By the results of Section 10, for uh the obvious projection into a suitable finite element space Vh on this unrefined mesh family, under rather standard assumptions, we have with A<g A' <gQ, \\u-uh\\0^tA^C(lnl/hYmm\\u-x\\o,^,A- + C\\u--uh\\0tn. A2.6) *eVh Since, if / is smooth, и is also smooth on A', with standard notation, тт||и-х||0>00,л^С/1'. A2.7) xeVh To estimate the second term on the right of A2.6) we shall use a duality argument. Thus write II u-щ ||0>o= sup (u-uh,v). A2.8) !*=1
Section 12 Local estimates 423 For each such fixed v, let -Aw = v infi, w=0 on Г. A2.9) Then, cf. the papers referred to above, for /?< 1, IMI/, + i-,ifl<C||!?||o.0, A2.10) where now fractional order Sobolev spaces are employed. Hence, {u-uh,v) = D(u~uh,w) = D{u-uh,w-wh) ,<„,,, „ „ „ 12.11) ^ll"-"*lli,fll|w-wj|1>n and by interpolation in standard approximation analysis, and by A2.10), |(M-«»,o)|<Cfc2',-2'||M||/,+1_..„||w||, + 1-^0<C/i2'-2*. A2.12) Thus, noting that 2/?<r in standard situations, we have: Theorem 12.1. Let Q be a polygonal domain with worst corner a0>ir and set /? = 7t/a0<l. Then on globally quasi-uniform meshes with standard finite element spaces (r^2), for the solution и of A2.1) with smooth f and its elliptic projection uh, on A<£Q, A2.13) we have for any 8 > 0, Н«-и»||о.о0,^Сгй2'-'. A2-14) We remark that, by use of Example 10.1, A2.14) holds for Ac Q as long as A is a fixed domain away from any vertex of Q. Or, one may use Example 10.2 and the maximum norm stability of Schatz [1980b]. We shall next consider the corresponding estimate fine-tuned somewhat in that we consider the error at a point x0 as it approaches the worst vertex v0. We remind the reader that the mesh family is globally quasi-uniform, i.e., unrefined. Set dist(x0,t>0) = ^const-ft. A2.15) Assume still that /?< 1, r^2. Letting Ad = {x:\d^\x-x0\^2d} A2.16) we have from Section 10, cf. Example 10.1, \(u-uh)(x0)\^C(lnd/hYmm\\u-x\\o,a0,Ad + Cd-i\\u-uh\\0,Ad. A2.17) xeVh For / smooth it is known, cf. A2.5) and our general references above, that |DrU(x)|<C|x-t;ol^r- A2-18)
424 L.B. Wahlbin Chapter III Thus, by standard approximation theory, for the first term on the right in A2.17), (Ind/hf mm \\u~x\\o » Ad^C{\ndlh)fhrd^r. A2.19) xeVh For the second term on the right of A2.17), we employ a duality argument as in A2.8), A2.9) and A2.11). However, to obtain the precise dependence on din our final result (Theorem 12.2 below) we shall (cf. the Green's function estimation in Section 11) need to perform a rather sophisticated duality argument. The argument is based on the fact that for p < B/B — /?)), IM!2,„,^CpNlo,P,«, A2.20) cf. Merigot [1971, 1972, 1974a,b]. An easy variation of the Bramble-Hilbert Lemma gives, for 1 <p<2, min!|V(x-Z)||0,fi^Cpli2-^|!w!|2,p,„. A2.21) x*vh Thus, proceeding as in A2.8), A2.9), A2.11), for the second term in A2.17), for v in *гШ with || о ||o,^ = l, «Г1 II"-и* Псы, <d-1D(«-Hfc,w-w»)^Cd-1ft2B-2">||«||2iP,0||w||2,Pf0 ^C{f)d-1h2<2-2^\]v\\0,p,Ad ^C(f)d-lh2^^2lp)d^-p)lp\\v\\0tAd^C(J)h2{2-2tp)d-B-2lp). A2.22) Since p<2/B — /?), i.e., 2 —2/p</?, is allowed, we get for any e>0, d-iWu-UbWo^Ctf'-'d-'-'. A2.23) (Recall that /e^°°@).) Comparing now the two terms involved on the right of A2.17), i.e., the two terms estimated in A2.19) and A2.23), respectively, for d^h, since fi< 1 and r^-2, we see that (for e small enough) the term A2.23) is dominant. We have thus the following result: Theorem 12.2. Let Q be a polygonal domain with worst corner a0>7t and set /? = тг/а0<1. Then on globally quasi-uniform meshes with standard finite element spaces (r^2), for the solution и of A2.1) with smooth f and its elliptic projection uh, for \x0— v0\=d^const'h, for any s>0 we have \(u-uh)(x0)\^Ceh2l>-4-e-\ A2.24) Note that for d^ 1, Theorems 12.1 and 12.2 coincide (upon taking into account Example 10.1). We note that, in the basic estimates A2.6) and A2.17), the second terms measuring global influences (pollution effects) dominate over the first terms, the influence of local approximability. This is in decided contrast to the situation in Section 11 where
Section 13 Local estimates 425 roughness was introduced by nonsmooth right-hand sides in otherwise smooth problems. 13. Sharpness of the results of Section 12: Pollution Considering unrefined (globally quasi-uniform) meshes, we found in Section 11 that, in model situations, the error in problems where roughness is introduced by an unsmooth right-hand side only, the error away from the singularity is almost of optimal order. In the situation of Section 12, where roughness is introduced by a reentrant corner while the right-hand side is smooth, the error estimate is decidedly not optimal, since h2^d~f>hrd^~r. The estimates of Section 12 were dominated by the estimations of global influences. In this section we show that the estimates of Section 12 are sharp in general for globally quasi-uniform meshes. This is often referred to as a "pollution effect". The arguments of this section are taken from Wahlbin [1984], cf. Dobrowolski [1981, Theorem 7.1]. We place ourselves in the situation of a plane bounded polygonal domain Q with, for the worst vertex, P = n/a<l. A3.1) Assume that with polar coordinates centered at the worst vertex, u(r,0) = ar" sinQS0) + --- A3.2) where the "stress-intensity factor" is nonzero, a#0. A3.3) Assume further that there is an element T0 of "size" h at the worst vertex. Then since, by a simple scaling argument, min \\г^тт-х\\иг,т0>с^ A3.4) X polynomial of degree ^r- 1 we are justified in practical unrefined situations with A3.3) to assume that with c> 0, min||«-Z||1>2,0>c/i'. A3.5) X<sV„ Our first result shows that the global influence terms considered in Section 12 can never be better than 0(h2l>). Thus, they dominate over the local approximability influence. Precisely, we have as follows. Theorem 13.1. Assume A3.1), A3.3) and A3.5). Let A^Q be any subdomain of Q and s any nonnegative number. Then for uh the elliptic projection of и into Vh, \\u-uh\\-s,A>ch2l> A3.6) where с = c(a, A, s) > 0.
426 L.B. Wahlbin Chapter III The result of Dobrowolski referred to above gives A3.6) for s=0 and A = Q. Proof. We shall give only the essential details and refer to Wahlbin [1984, Theorem 1.1] for the full technical details. Consider any specific u0 =a0rpsin(e (!) + ■■ ■ with аОт^0. Setting w — aaQ 1u0—u then w is somewhat smoother than u0 or и is, and by a standard duality argument, the error in the elliptic projection in w is h2^o{\). Thus it suffices to consider any specific u0. To construct our basic u0, proceed as follows. Let x0 e Int A, and let A0 € A be an annulus centered at x0. Denote A0 = B1\B0 where jB0 €Bt € A are concentric discs. Let coe^OQ) with Jl outside Bb JO inside B0. Let further G0(x) be the Green's function for A2.1) with singularity at x0 and set u0~coG0. Then clearly u0е'ё^'{IntQ),u0 = 0 on dQ, and supp(zlM0)£^o£'4- A3.7) An essential fact is that for this uQ, we have a0 # 0. To see this, use a conformal map zp to locally straighten the boundary around the worst vertex v0. Since u0 is harmonic near v0 in Q, the transformed function й0 is harmonic in the localized halfplane and vanishes on a piece of the real axis. Hence it is smooth and harmonic in a neighborhood of the origin by Schwarz' reflection principle and Weyl's lemma (or, by Schauder estimates). Thus, in new polar coordinates, QO  = E AiPl Sin(^) i=l for p small. Note now that by the maximum principle, u0 is positive (for p small). Hence, Av is positive by the orthogonality of the sin(i0)-functions on [0, я]. Transform back to original coordinates, where a0 corresponds to Ax. With E0=u0—Phu0, Ph the elliptic projection, we have by A3.5), ch2* < || £ || b, = (V£0, V(m0 - Phu0)) = (V£0, Vm0) £о(Ли0)<||£о1и,лМио1Ь j л so that \\E0\\-..A>(c/\\Au0\\,,A)h2l'. □
Section 13 Local estimates ATI Our next result in this section is that Theorem 12.2 is sharp in the situation of unrefined (globally quasi-uniform) meshes. We refer to Wahlbin [1984, Remark 1.1] for exact hypotheses. Theorem 13.2. Assume A3.1), A3.3) and A3.5). Let, with v0 the worst corner, Ad = {x: d^\x — v0\^2d}nQ. Then for any d>0, there exist positive constants с and d0 such that for h1~d^d^:d0, Ци-и*11о.«.^сй2'<Г'. A3.8) Sketch of Proof. See Wahlbin [1984, Theorem 1.2] for all details. We shall only give the essential points. Consider first a specific и = щ depending on d. Let са^^^ф) with [\ ior\x-x0\^d, [0 for \x — x0\^2d. Then s\ipp(Vcod)sAd and we may assume (by a scaling argument) that KlU^Qrf-*. A3.10) Define then a specific ud, uj=<i>y sin(j?0). A3.11) Then supped M,)c: A,. A3.12) Set next Ed--=ud—Phud, with Ph the elliptic projection. Since d P h it is easy to see from A3.5) that \\ЕЛиа>ск'. A3.13) Thus, as in the proof of Theorem 13.1, using A3.12), ch^ *S || £„« \,a = (VEd, V(ud - Phud)) = (VE„ Vud) Ed{Aud)^\\Ed\\0^iAd\\Aud\\0il:Ad. By A3.10) and the general form of ud, we easily have IM«„||o.i.^Cd'. A3.14) Hence, \\Ели.*,.лл>сН2Ч-'. A3.15) The rest of the proof consists in showing that the elliptic projection error on Ad in wd—aud — и
428 L.B, Wahlbin Chapter III (for и = ar^ sin(/?0) + • ■ •) is of higher order than 0(h2pd'^). This follows by a variant of the arguments of Theorem 12.2, since wd is "smoother"; the essential singularity at the worst vertex is knocked out. The reader can easily supply the arguments or consult Wahlbin [1984] for details. □ The result of this section and Section 12 are easily extended to the situation where Dirichlet and Neumann conditions are mixed. The lasting message when compared to the results of Section 11 is, that for unrefined meshes, singularities introduced by rough domains (or abrupt change in boundary conditions) pollute throughout the domain whereas singularities introduced by rough right-hand sides are nicely confined to a small neighborhood of the singularity. The results given here and proven in full detail in Wahlbin [1984] have various precedents. The fact that the error is not better, anywhere, than h2p on an unrefined mesh has been computational folklore for about twenty years. Attempts to furnish a rigorous result were given in Nitsche and Schatz [1974, Section 7, Example 4] for an L-shaped domain (the example is originally due to BabuSka and Bramble), and in the already mentioned result by Dobrowolsky [1981, Theorem 7.1]. Another extremely interesting result on the structure of the error is given in Nitsche [1976a]. The result cannot (apparently) explain the full pollution effect from reentrant corners; it is more successful in one-dimensional problems where singularities are introduced by degenerating coefficients, see Nitsche [1976b], and Schreiber [1980, Section 6.2]. (Cf. also Jespersen [1978] and Eriksson and Тномёе [1984].) (We shall not touch the question of degenerate elliptic problems in two dimensions. We refer to French [1987] for state of the art results.) Let us also point out that in the case of finite difference methods for the problem A2.1), results analogous to Theorem 12.1 were given in Laasonen [1967]. 14. A priori and adaptive mesh refinements We start by treating systematic a priori refinements. We consider the model problem -Au=f in QuU2, A4.1) м = 0 on r=9fi, where Q is a bounded polygonal domain in the plane. We use polynomial finite elements of degree r— 1 with Qh = Q, as is natural on a polygonal domain. (For concreteness, the reader may consider Lagrangian triangular elements.) The pointwise results of Section 10 suggest that the resulting error in the finite element solutions are, by and large, determined by local approximability, provided we can handle the global influences. We assume that /e#°°(fl). Then as we approach a corner of interior angle a, и behaves as u(r,e) = a0rl>sm(p6)+--- A4.2)
Section 14 Local estimates 429 as the polar coordinate r centered at that vertex approaches zero, where /? = я/а. Now think of h as the diameter of elements that are unit size away from vertices. As we approach a vertex v, on Qj={x:2-j~1^\x-v]^2-J}, A4,3) dj = 2~J, A4.4) we have min l|«~zllo,oo,o,<c^4"r A4.5) where hj is the mesh parameter on Qj. We would like to make this local approximability equal to hr, viz., what rules a unit distance away from the corners. For technical reasons we shall need a slight "over-refinement". We proceed to detail our assumptions. Consider a neighborhood of any corner v with interior angle a, and set /? = я/а. Let Qj,j=ja,..., Ja denote the domains in A4.3). With h the "interior h" A4.5) suggests for the local hj on Qj, we should take hj = hdxj~9lT. We assume that the family is quasi-uniform on Qj with, for some 6>0, hJ = kd)-l"r+>. A4.6) Then the right-hand side in A4.5) can be replaced by 0{hr), h being the interior mesh diameter. This annular refinement is broken off at some point, viz. Jx, and setting for the "innermost" domain Qi<a = {x:\x-v\tZ2-J'} A4.7) we take on this domain a quasi-uniform mesh with hi = hla where h,~hrll>+s. A4.8) Note that for h small enough, we have with any preassigned constant c0, for e' >0 small enough compared to <5, /T£' hj/dj^c0. A4.9) Here;'=/ is allowed, with dI = 2~J". The following is the main result of this section. Theorem 14.1. With the above assumptions, for any e>0 there exists a constant С = Сг such that \\u-uh\\o,<»^Ch"\ A4.10) Before proving Theorem 14.1 we wish to investigate the total number of degrees of freedom asymptotically involved in such a mesh refinement. On each Qj, since we assume quasi-uniformity, we have by A4.6), # elements on Qj = 0(Area(Qj)/(hj)) = 0(dj/h]) = h-20(dj»lr~2d).
430 L.B. Wahlbin Chapter III We may assume that 2/?/r — 2<5>0 and thus, summing a geometric series, the total number of elements involved is 0(h~2), i.e., not worse (in the asymptotic sense) than that involved in a globally quasi-uniform mesh family. The argument in proving Theorem 14.1 below is based on Schatz and Wahlbin [1978,1979]. However, considerable simplifications are introduced which shorten the proof to about a quarter of its original length. (These simplifications have, apparently, never been published before. The main point is to use the estimates of Section 10 in a careful manner.) The original papers by Schatz and Wahlbin [1978, 1979] also contain various "bells and whistles": What happens if one underrefines; how to get 0(hr) accuracy close to one corner while having a minimal refinement at other corners to keep the pollution effect under control, etc. We shall not treat those questions with our simplified technique but the reader wishing to do so should have no particular difficulties. Naturally, before the question of mesh refinement was treated in the pointwise situation, it was considered in energy-based norms. We refer to Babuska [1970], Babuska and Rosenzweig [1973], Thatcher [1976], and Raugel [1978] for a representative survey. In the pointwise situation we also note Eriksson [1985b]. It is somewhat technical to construct mesh refinements as above while keeping quasi-uniformity on each Qj, cf. Eriksson [1985a, Section 6]. In Schatz and Wahlbin [1978, 1979] it is also shown how the simpler case when elements are merely required to satisfy a maximum angle conditions works. Then the basic approximation theory of Babuska and Aziz [1976] and Jamet [1976] is applied. Tracing this case through the present development is easy. The description of our mesh refinement is further elaborated in the one- dimensional situation in Rice [1969]. Proof of Theorem 14.1. Consider the error on Qj close to the generic vertex v of angle a, and set /?=n/oc. If Qj does not include the vertex, then we use Theorem 10.1 and Example 10.1 to obtain that \\U~ uh\\o,cc,!2f ^C(ln 1/hfmin llM-zllco^+CdJ1 ||и-мй||о.0> A4.11) where ffj = Dj-1uQjuOJ+1. A4.12) Assuming further that Q'j does not contain the vertex, we have by A4.6) and straightforward approximation theory, miii||U-zllo,.,n><Cfcr. A4.13) In case Qj abuts on the vertex, which happens only if J — I or / — 1, we proceed as follows. We have by Sobolev's inequality on Vk, cf. Example 10.2, ll"-«*llo,co.«><ll"i!o,a,,n>+l|M*||o,oo,C><Cfe,'+C|luh||l,o>'
Section 14 Local estimates 431 with Q'j—(Q'j)'. Thus, in this case of being very close to a vertex, by the triangle inequality, 1|и-и*11о.».«*<СЛг+||и-|У1@з. A4.14) We now proceed to consider the case when Q'j does not contain a vertex. By A4.11) and A4.13), for any £>0, ll«-«*llo.e,.o><CA'-4-d71|l«-M*llo.o>. A4-15) Next in store is a duality argument. We have ||и-и*Ио,п>= sup (u-uh,v). |[0||o.n>=l For each such v, let w solve — Aw = v in Q, w = 0 опГ = бй. Then (u-uh,v) = (V(« - щ), Vw) = (V(u - uh), V(w - x)) for any iB Vh. Thus, by Theorem 9.1, and Cauchy-Schwarz' inequality, for any djl\(u-uh,v)\^djl £ IIV(«—«*)||0.QjIIV(w—x)IIo.ej i ^CdJ1 E(IIV(w-^)llo,fi} + ^1||«-%llo,fl})^|w|2,2,«} j + CdJ1\\V(u~uh)\\0,Q~\MW-X)\\o,a'i- Note that, for Q'j not touching a vertex, and j=£J±l, \w\2,2,n'<CdJx\w\uz,n) and since II Vw || g,0 = (Vw, Vw)=(w, v) = (w, i>)a/ <Н»||о,,||о||о.,.о/. l/p+l/q=l we have by Sobolev's inequality, applied with pc^co,q^\, l|Vw||0,0<C||«;||,.0j<c4-'||c||o.0/ = Cd)-. Hence, A separate argument easily establishes this also for j = J± 1. Thus, noting also that
432 L.B. Wahlbin Chapter III for a suitable ф, mU~m°'n'^{chr, if Qj close to the vertex, we have from A4.15) et seq., for any e>0, using also Cauchy-Schwarz' inequality, and a simple estimate for w — % on Q'i, \\u~ uh llo.oo.nj <CA,-' + dJeXM72Hu-M*llo.ov + C|N-«J|lf0j j ^Chr~e + djEYJhjdj1\\u-uh\\0<oo^j + C\\u-uh\\un'i. j Combine this with A4.14). With 5=^/г^71!|ы~мА||0,0О,п. + 1|м-мй||1>Й1, A4Л6) j which includes corresponding sums at each vertex, we have for any e' > 0, \\u-uh\\o.a>.aj^Chr-t + h-''S. A4.17) Multiplying A4.17) (and A4.14)) by hjdj1 and summing, by use of A4.9), for any c0 < 1, for h small, S^Chr~c + c0S so that S^Chr~c. Thus from A4.17) !|w-"J!o,=o,fi/<c^-E which proves Theorem 14.1. □ We proceed to consider a posteriori adaptive mesh refinements following Eriksson and Johnson [1988]. For energy-based investigations, see e.g. Babuska, Miller and Vogelius [1983]. Considering piecewise linear elements one first guesses that on a triangulation xh = \jTt, with ht =diam Tb II e || lj00,n <K max К\и\г^,т.. A4.18) We thus consider gradient errors (in which case for piecewise linears there is no pollution effect). The extent to which A4.18) is true is, of course, a main point. It is further assumed that the constant К can be estimated; it may depend on interpolation error estimates and estimates for discrete Green's functions which would then have to be calculated and estimated.
Section 14 Local estimates 433 We next need to compute approximations to D"u, |a| = 2. Consider difference operators ВЫх) = <Х±УН^{Х) A4.19) where у = A,0) or @,1). Here Я = Сг hT if xe T and Cx is a sufficiently large constant. This type of difference quotient is then applied to (uh)x or (uh)y. If x is close to the boundary x±yh is chosen so as to belong to Qh. Specifying an error tolerance S, the algorithm proceeds as follows: Step 1. Start with an initial mesh xh = ^i which is quasi-uniform with cd^ch^hT^Ch. Step 2. Given a mesh zh compute the corresponding finite element solution uheVh. Step 3. Compute the following quantity for each xeTexh: D2Huh(x) = max{\D},D*uh(u)\,\y\ = \a\ = l,\x-y\^C2h} where h = minTethhT and C2 is a sufficiently large constant. Set Dj[(uh; T) = max D%uh(x). xeT Step 4. If for all Keih we have KhKD2H(uh;T)^d, then stop and accept the finite element solution uh. If not, construct a new mesh ih by "minimally" refining the old mesh xh so that for each Kerh, KhfDUuk, T)^S for all feih with f = T. Then redefine Th = fh and return to Step 2. Eriksson and Johnson proceed to show that their algorithm works in a model case which "mimics" a boundary singularity, except it is placed inside a smooth convex plane domain Q. More exactly, let the origin be inside Q and with Dmu{x) = max{\Dyu(x)\, |y| = m}, for с and С positive, c\x\ll-2^D2uix)^C\x\11-2, D3u(x)^C\xf'3 A4.20) where 1 < /? < 2. Using local error estimates for gradients in an essential way (we refer to their paper for details) they then prove: Theorem 14.2. Suppose the exact solution и satisfies A4.20). Then the adaptive algorithm with initial quasi-uniform mesh E=d will produce a sequence of meshes T%, и = 0,1,2,... with corresponding finite element solutions u\ such that IIм UA lll,co,«h^(-^
434 LB. Wahlbin Chapter III Further, the mesh T\ will be ''correctly refined" in the region {\x j ^ Ch„_,} where hn_ x is the minimal stepsize of elements in Tnh~l and given by h,-i=cd\ 0„ =A - B-/?)")/(/?-1) and the region {|x|< Chn_l} will have a quasi-uniform mesh of size hn, We conclude this section with a numerical illustration courtesy of Eriksson and Johnson. They applied the method to the mixed Dirichlet-Neumann problem Au = 0 in Q, m = 0 on Г^иГ2, 3w — =м0 опГь on where Q is the semidisc {|x|=$l,x2 >0}, Ff = {хеГ:х2—0,х1 >0}, ff = {хеГ: х2 = 0,х1 <0}, Г2={хеГ:х2>0}. The exact solution was taken as u(x) = r2/3 sin(f 0) and since then Vu is infinite at the origin, error control of the form IML,fi**niax/t?.|u|2 TsS<5 was used. Taking 5 = 0.01 the illustrated initial and final mesh and actual error distributions were obtained (see Figs. 14.1-14.4). Fig. 14.1. Initial mesh.
Section 14 Local estimates 435 Fig. 14.2. Final mesh. Fig. 14.3. Final mesh magnified 15 times. 0.004 0.003 £ 0.002. Ш 0.001 0.000- 0.000 0.125 0.250 0.375 0.500 0.625 0.750 0.875 1.000 Radius vector Fig. 14.4. Total error distribution.
436 LB. Wahlbin Chapter HI 15. Stress intensity factors, and better a posteriori approximations via the Trigonometric Fitting Method In this section we continue to consider the model problem -Au=f in A A5.1) u = 0 on F=dQ A5.2) where £2 is a plane bounded polygonal domain and where /e #"(£!)• The purpose of the present section is to illustrate how our previous local error estimates shed light on various computational methods for finding "stress- intensity" factors and also on the problem of obtaining, via a simple postprocessing, an approximation that is more accurate close to a vertex in the unrefined mesh situation. Thus our aim is not to describe all of the multitude of methods extant for finding stress intensity factors but rather to give applications of local error estimates. We shall first need to describe in some detail the structure of и near a vertex, cf. Grisvard [1985]. In terms of local polar coordinates (r,0) centered at that vertex, к where with j3=n/«, a the interior angle, yk(r,0) = rk!)sin(kpe), A5.4) \RKAr^)\^CtrM~\ апуг>0, A5.5) M = min((K + l)ftL + 3) A5.6) and where vL can be constructed a priori according to a recipe which we proceed to give. The idea is to construct a function ф much that near the vertex, _^ = /+0(rI,+ 1), Ф=0 onf. A5.7) Assuming that we know how to Taylor expand f, it suffices to consider — Лфц — х1у' near v, Vj У A5.8) фц = 0 on Г near v. Setting N = (' +j we first attempt фу as a homogeneous polynomial of degree N + 2, фи{х,у)= £ c„xV+2~"; A5.9) и = 0 this will work except in certain exceptional cases. Equating —Лф^ to x'yJ termwise gives i+j+l = N+l linear conditions and the boundary conditions on the two legs emanating from the vertex give two more. Hence, to show \j/i} exists as above it suffices to show uniqueness in the problem —Аф = 0 near v, ф = 0 on Г near v. In
Section 15 Local estimates 437 polar coordinates ij/ = rN+2w(9) and thus (N + 2fw(e) + w"(e) = 0 with w@) = w(a) = 0. Thus uniqueness follows unless N + 2 is an eigenvalue for the two-point boundary value problem just described, i.e., unless N + 2 = SP, S integer. A5.10) If such is the case first seek (assume the 9 = 0 axis is the positive x-axis) Ри=У2 I c„xy-" A5.11) n = 0 so that —АРц = х1уК This does have a unique solution since it corresponds, with Pij = rN+2w(e), to the initial value problem (N + 2Jw(9) + w"(9) = 0, w@) = w'@) = 0. But Pu only satisfies the boundary condition on 6=0, not necessarily on 9 = a. To adjust Ptj, let <pw===r"+2(mrsin((N + 2H)+0cos((iV~+2H)). A5.12) This q>N is harmonic (being the real part of zN+2 In z), satisfies the Dirichlet condition for 9-0, while for 9 = cc one has (S as in A5.10)), (pN = rN+2{-l)sa*0, 9 = a. A5.13) Thus adding a suitable multiple of q>N to Ptj will furnish a desired фи for A5.8) also in the exceptional cases. Setting w = u — vL we have near the vertex v, -Aw = t near v, A514) w=0 on Г near i>, with |£|^Ol+1. A5.15) We shall next indicate how this leads to A5.3). Let us assume /?< 1, this being the case of most interest. Straightening the boundary via a Z = zfi transformation, z = x + iy = re,e, Z = £ + iri — p&v, since Axy = 4-A^\(zp)'\2, we have -A&w=t iorrj^O, \Z\^p0, 516> w(£,0)=0, where w is the transformed function and \DlJ\^Cp(L+3)al:t~2~M A5.17) so that Г belongs to the Holder class <g'<I-+3>a/'1-2 on the closed half-disc. By classical Schauder estimates then, $e<#(L + 3w*-^ any£>o, A5.18)
438 L.B. Wahlbin Chapter III and hence w has a Taylor expansion *= X CyfV + fi A5.19) where Jf = [(L + 3)а/тг-e], and R£?(t+3W*-1 with |J)yjRKC/>A+3W*"t-ijl. A5.20) In polar coordinates p, q>, w= X PkU<P) + R A5.21) SsSJf and so ~Aw= £ /-2[Tt(^)/c2 + n'(^)] + 0(^<L+3>a/,,"-£). A5.22) By A5.16), A5.17) this equals ?=0(p(L+3)a/,t) and hence Гк(<р)/с2 + THk(<p) = 0, or, T^cp) = ak sin(fc<p) + bk cos(k(p). By the boundary condition w(£, 0) = 0, bt = 0 so that Tt(co) = ak sin(fc<p). A5.23) Going back to original coordinates shows that A5.3) is satisfied. From now on we assume that the function vL in A5.3) is known in advance, for given L. Note that L= — 1 is allowed, which means setting vL = 0. We now consider finite element solutions щ in some family of finite element spaces Vh which satisfy the homogeneous Dirichlet conditions on Г. We con- centrate attention at the worst vertex, i.e., that of lowest B. We assume that near the vertex, \(u-uh){r,e)\^CE{h,r) forr^r0, CK0=$tc A5.24) where E(h, r) is a known function. As examples, in the situation where the mesh is globally quasi-uniform, we saw in Section 12 that E{h, r) = h2f ~*r~^ ~£ may be taken, and we saw in Section 13 that this estimate is sharp. In a situation where suitable mesh refinements are performed we found in Section 14 that E{h,r) = hs~\s being the order of the finite element space and h the interior element diameter. The coefficient a% in A5.3), the "stress intensity factor" in linear elastic fracture mechanics, is often of some interest and we first consider how to recover it. This problem has an enormous literature associated with it and we can only give a very small sample of articles where further references are given: Gallagher [1975], Hellen [1980], and many articles in the conference report Grisvard, Wendland and Whiteman [1985]. The simplest method for finding ax goes under names such as the "Direct Method" or "Equation Substitution". By A5.3) we have for 9=\ct, /?< 1 and L— — 1 (so that no preliminary evaluation of vL needs be done), a1=u(r,ia)r"^+£, A5.25)
Section 15 Local estimates 439 where £(r)<Cmax(r^,r2~^(lnl/r)) (since ^2(a/2) = 0 and v-t = 0(r2(ln l/r))). Thus with A5.24), ai^UbirJay-'' + {E + E(h,r)r~>}. A5.26) This relation gives some indication of what point r to choose to evaluate at. E.g., in an unrefined situation £(/i,r)s;ft2/V_/' (modulo e's) and if 3/9<2, then r^h1'2 should be taken. The resulting accuracy is 0(//). The low accuracy of the "Direct Method" is well known in practice and the "Extrapolation Method" is one example of a method suggested to improve the accuracy. From A5.3) we have with v3 known, for в—^ас, u-r3=a1^-a3r^ + 0(r5^) A5.27) so that if F(r0) = (u-v3)(r0,i«)H, A5.28) we have F(r0)=a1-a3r20i, + O(rt11), FBr0) = ai -a322"r^ + 0(^), A5.29) so that extrapolating away a3, 22»F(ro)-FBr0) 22/?-l = ai+0(rtfi). A5.30) Replacing и by uh in F(r0) it is obvious how to use the local error estimate A5.24) to obtain a rational choice for r0. Other methods proposed for finding ax include the "Energy Release Method", the "./-Integral Method of Rice" and various methods based on the Volkov-Masja- Plamenevski formula, «1=BД) f/(r-'sin(j80) + fl) where H is harmonic with H= — r~fsm((№) on Г. With apologies to many researchers for giving only a brief summary of the field we point out that in all cases local error estimates can be used to evaluate the resulting accuracy in approximating ax. We proceed now to consider methods in which, assuming vL known, a fair number of coefficients ak in A5.3) are identified so as to furnish a high-order a posteriori approximation to и close to the worst vertex, say, even if uh was originally found only on an unrefined mesh with then only hp accuracy close to the vertex. Clearly, we may go on with the extrapolation method based on distances r0,2r0,3r0, etc., in the obvious analogues of A5.29), A5.30). However, in practice one then often finds oneself (after some preliminary analysis based on local error estimates to see what r0 should be) using points outside Ql A more satisfactory method in practice is to stick with an optimal distance r0 away from the vertex and try to get hold of approximations toa1,a2,a3...by using values
440 L.B. Wahlbin Chapter III uh{r0, в) on the part of the circle of radius r0 around v meeting Q. We call this method "Trigonometric Fitting". It was introduced in Rosser [1975,1977], cf. also Schatz and Wahlbin [1981]. We proceed to describe this method. We wish to arrange so that ^ a№k(r0,e)*(uh-vL)(r0,9), A5.31) * = i in some sense, in order to find approximations to a\. To motivate our method, note that from A5.3) for any I^K, £ ak^(ro,0) = ((«-«t)-R/iL)(ro>0) A5.32) i=l where |K/iZ.|^C/#-« with M = min((/+l)j3,L + 3). Introduce now the notation Ak=akrf and similarly for A\. Also set (ok(d) = sm(kep). Then from A5.32), by well-known relations for trigonometric functions, . 2 (u-vL-RIiL){ro,ff)M0)^. A5.33) A5.34) A5.35) A5.36) A5.37) Employing the trapezoidal rule on / + 2 points including the endpoints, we are led to defining 2 ' ^=7TtI^~ yiXro, 6t) а>к(вд, к = 1,... ,1, 1 + 1 i=i A5.38) 0,= ia/(/+l), ;=!,...,/. We first wish to consider the error in Ak. Setting TF=mT)im (i5-39) we have from the above (since the trapezoidal rule is exact on (ока),), Ak-Ahk = T{(u - uh){r0,- )wk(-)) - T(RIiLcok). A5.40)
Section 15 Local estimates 441 Consequently, with M as in A5.34) and E(h,r0) as in A5.24), for any г>0, \At-Akk\^CE(h,r0) + Cer§-'. A5.41) Note that M depends on the number of interpolation points used, not on the number of coefficients desired to be found. Assuming next, for simplicity, that we form the a posteriori approximation ut--=Za^>,+4 A5-42) * = i (with К = / so that we use all our identified coefficients), then (н-м?)(г,0)= 1К-4)^(-)+Д^. A5.43) *=i \ro/ Thus by A5.41), \(u-ut)(r,e)\^C(E(h,r0) + Cj§-s)(jfJ + C,rfi-«. A5.44) It is now clear that A5.41) gives a clue as to how to choose a suitable r0, based on local error estimates and M; note that M includes some work expanded in identifying vL if M is high. E.g., if (modulo s in our expressions below) we equalize the two terms on the right of A5.41), then in the unrefined mesh situation £(M£*2/,«*-',ro = A2,"(*+w, so that e.g. |a1-a?|<CA2«1-2W(*+») A5.45) and, to be used only for r ^ r0 of course, |(u-uj»(r,0)|<CA2'A-'/(*+«)^Y. A5.46) Thus, for M high, this trivial postprocessing in the unrefined situation gives an approximation to и that is almost 0{h2?) also as we approach the corner. Note that 0(h2fi) is the inherent best possible order of accuracy, by the results of Section 13. Various numerical experiments with the Trigonometric Fitting Method as compared to the Direct Method and the Extrapolation Method were reported in Schatz and Wahlbin [1981] in the case of the L-shaped domain (and with an embarrassing misprint for vL). For uniform h = ^o, the Direct Method (with optimized r0) gave a relative error in ax of 17% (it gave 8% unoptimized, i.e., with r0 = h, indicating that asymptotic error estimates are suspect in inaccurate methods), the Extrapolation Method gave 1%, and the Trigonometric Fitting Method 0.2% for / = 2,0.1 % for I = 3 and higher. The predicted optimal r0 's from the theory were used, setting all unknown constants to 1 (and e to 0). It thus appears that our local error estimates not only give a formidable insight in the general behavior of the approximations but can be quite useful in actual computations. The brief description and analysis of the Trigonometric Fitting Method above is based on an unpublished investigation by Schatz, Whiteman and myself. The reader
442 L.B. Wahlbin Chapter III who wishes to do so should have no trouble in extending the method to nonhomogeneous boundary data, Neumann conditions, or mixed Dirichlet- Neumann conditions. 16. Local behavior when singular functions are added We continue to consider the model problem A5.1), A5.2) on a plane polygonal domain. Concentrating attention at the corner v of maximum interior angle a, we have with /? = я/а, in polar coordinates, u(r, в) = а1&'1 +w, where ^{r,9) = rllsm{p9)(oir) A6.1) A6.2) and where со is a cutoff function isolating the corner. In particular, У\ = 0 on dQ. A natural and well-known finite element procedure is to adjoin singular functions to the basic finite element space. For simplicity we shall only consider the case of adjoining one, the first, singular function. The generalization to inclusion of more singular functions (at various corners) is obvious. Let thus Vh be a basic finite element space, satisfying the homogeneous Dirichlet conditions A5.2), and based on a quasi-uniform mesh family. Set and seek uh e Vh so that D(u-uh,x) = V(M-m„)-Vx = 0 forallZ€Fft A6.3) A6.4) Remark 16.1. To solve for uh, one naturally utilizes the theory of bordered matrices. In fact, including к singular functions, Vh = Ух ф У2 Ф ■ • ■ Ф У Ф Vh, so that uh = a\y1 + - + ahkyk + wh, wheVh, and letting a\,..,, a\ be the last variables adjoined to the (typically nodal) variables involved in whe Vh, one has with S the stiffness matrix in Vh a matrix equation involving a matrix of the form
Section 16 Local estimates 443 By the theory of bordered matrices this is solved by fe+1 application of S_1 (perhaps, once and for all LU decomposed or utilizing some other fast method) and then solving a (small) kxk system. The basic fact which reduces the local behavior in и — uh as above to previously known cases is a representation of uh due to Schatz [1980a], cf. Dobrowolski [1981] for an expose. Note that uh=a\SPx+wh A6.5) where wheVh. Theorem 16.1 (Representation). Let Ph denote the H1(Q) projection into Vh, without singular functions. Then «-Hfc=(a,-fl?)(^1-P4yi) + (w-PAw). A6.6) Proof. We first show that uh=Phu + a\(^-Ph^), A6.7) For, with w":=Mfc-a^l6Fft, A6.8) we have D(Wh,x) = D(uh,X)~ah1D(^1,x). A6.9) Considering only xe Vh here, D(uh,X) = (f,x) = D(u,x) = D(Phu,x) A6-10) and D(^1,x) = D(Ph^1,x). A6.11) Thus, by A6.9), for all *e Vh, D{wh-Phu+a\Ph^,x)^ A6.12) meaning that wh = Phu-ah1Phyi. A6.13) Hence uh = wh + a\У1=РИи + a\(^ -Ph^) A6.14) which proves A6.7).
444 LB. Wahlbin Chapter III Continuing, by A6.7) = alSr1 + w-(Ph(a1Sr1 + w) + a\{Sfi1-PhS>>1)) = aiSf1 +w-a1Ph£f1 -Phw-a\<fx + ah1Ph^1 = (a1-aA1)(y1-Ph^1)+(w-PhW) proving A6.6). D By our previous results we know how У — Ph£^i behaves, and also, under suitable assumptions on w, how w — Phw behaves locally. Thus, the next order of business is clearly to get an estimate for ax — a\. Theorem 16.2. WM-PbSfJWl -a" = —иT7/re—„■ re ч 2—-. A6.16) Proof. By A6.6), since D(u — uh,x) = 0 for any x&Vh, we have D(w_P/,w_(fll -a\Wx ~Pft^1),x) = 0 for any ieVh. A6.17) Take here x = ^i ~Ph^ie Vh to show A6.16). □ Since we have assumed a quasi-uniform mesh family, we know, for /?<1, that IIV^-P^llg^CA2'. A6.18) For the numerator NU in A6.16) we have with % any approximation to w in Vh, NU = D(w-x,Sr1-Ph&'l). A6.19) To get any further we have to make assumptions about w. A natural one (which can be bettered in many situations) is to assume, still with fi< 1, w = Cr2"sinO30) + --. A6.20) with corresponding behavior of derivatives. Further, assume piecewise linear elements and 2/?> 1. To further estimate the numerator NU one should not merely estimate it as ||w — xllill^i-Ph^iWi', this is too rough. Instead, a more refined argument is called for; thus consider domains Q}= {x: 2~i~1 =% \x — v\ ^2~J} towards the worst corner. Assuming suitable behavior at the other corners (we leave the details to the reader), with dj=2~j, NU^ £ IIV^-xMo^JV^-P^lkfl,- j <EAdJ/,-1(IIV(^i-Qlloj4 + d71ll^i--P*^illo.oi) A6.21)
Section 16 Local estimates 445 by the local Я1 estimates of Section 9, and where Q'j=Qj-i\jQjuQj+i. Thus, j ^Ch^dj'^ + Ch^df'-'d^m-Pk^hjrj. A6-22) j j A local duality argument as in Section 12, A2.22) et seq. gives di^^-P^Wo^C^'-'d]-'-; A6.23) so that from A6.22), NU^Cfc^f^ + C^2*-^?-1-6. A6.24) j j Summing this (to the highest J s.t. 2~J~h, some technical details are left to the reader in this sketch), since /J<1, ,TTT {Ch^~e for /Kf, ш<{а>- fbr?>i A6-25) Inserting this and A6.18) into A6.16), we have: Theorem 16.3. For /?< 1 wit/i quasi-uniform piecewise linear elements, and assuming A6.20) for the behavior ofw, h,Jcy-' for fi^l Ь-™{С#-»- for W>i A6-26) We comment on the above result. Of course, as /? tends to 1 the solution becomes smooth and we would expect no identification of the nonexistent stress intensity factor. Otherwise, the identification of it is rather "flaky" in that it is growing in accuracy up to j8 = § and then decreasing in accuracy. For /? < § the accuracy is the same as in the "Direct Method" of Section 15, i.e., worse than in the "Trigonometric Fitting Method". This flakiness led to the introduction of the so-called Dual Singular Function method, using Vh®Sf-u £f-1=r~0 sin(/?0) the dual singular function to ifi, as test functions in order to pick out a\ in a sharper fashion, see Dobrowolski [1981]. We have now (under certain assumptions) an estimate for ax — a\. Using this in A6.6) of Theorem 16.1 the question of the local behavior of и — щ reduces to the already investigated case of the local behavior in the elliptic projection into Vh, the finite element space without singular function. The result will obviously depend on assumptions for w, and what occurs at the other corners. Again a rather careful local duality argument needs to be performed to estimate the last term in \(w-Phw)(r,e)\ < СЙ2г2*-2 + ОТ1 ||w-P»w||o.^r but we leave this to the reader.
446 L.B. Wahlbin Chapter III 17. Local behavior in problems with rough coefficients: Pollution Very little is known concerning the global effects of rough coefficients in a problem. To my knowledge no systematic investigations have been undertaken. Therefore, the present section will treat only simple two-point boundary value problems. The typical situation is that of a jump discontinuity in a coefficient at a point, and the globally quasi-uniform mesh not matched to that discontinuity. As a first example, consider that of seeking u(x), 0^x<2, such that — u"=f in @,1), ~m" + u=/ in A,2), A7.1) u@) = uB)=0, with и demanded to be сё1 across x = 1. We give an example. Let /0€@,1) and x0el0. Let v be linear from 0 to x0 (where it assumes the value (e—e_1) + (l—x0)(e + e_1), then of the form u(x) = (e-e-1) + (l+x)(e + e_1), x0=*x<l, A7.2) and finally y(x)=e2_x-e1-2, l^x^2. A7.3) Thus v has a sharp break at x0 but is (ё1 across x— 1. Let further A outside I0, ,._ .. [0 in a neighborhood oi x0. Set u—cov. A7-5) Then, since the singularity at x0 in v has been killed off, и satisfies A7.1) above with / smooth and supp/£/(>. A7.6) Assume now that we are using uniform (or, quasi-uniform) meshes, but x = 1 is not a mesh point but, say, always halfways between. For Vh any ordinary finite element space then, \\u-x\\2u2>ch\ anyxe^, A7.7) as is trivial to see since и has a sharp break across x = 1. Let 2 2 a(u, v)= u'v'+ luv A7.8) о i and E-.=u~uh A7.9)
Section 17 Local estimates 447 the error in the corresponding finite element solution. By A7.7) and A7.6), ch3^a{E,E) = a(E,u)=-(E,f)Io ^ II£11 Any norm on /0 II/IIDual norm- A7.10) Thus, ll«-"/,|JAnynormon/„^C(/0)/23. A7.11) Hence, for r > 4, there is no chance that the Galerkin solution will approach optimal order on I0, which is a domain away from the rough spot, x = 1. We next wish to match this negative result with a positive result. From the results of Section 4 we have for the general error e — u~ uh in A7.1), |e(*o)I^Cmin||H-Z||o..j0 + Cd-1||e||_1/2./o. A7.12) Assume now for simplicity that r>3. A7.13) Employing the interpolation spaces Я*'00, based on the Sobolev spaces Hk=Wk'2 via the real interpolation method []е.оо, cf. Bergh and Lofstrom [1976], we prove the following: Claim 17.1. IffeH112'00, thenueH25-00. A7.14) Proof. Let x„ with t >0, be as follows: {0 forOO^l-t, linear in 1-t^x^l, A7.15) 1 forl<x<2. Let -m" = x,u + /; A7.16) then with Хцл] the characteristic function of [1,2], ("-«)" = fe-Xu,2])«, A7.17) so that llu-uha^MkJlh-XiLvh^CtV2. A7.18) Next, split / into / = /o+/i A7.19) where f0£L2, f1eiix. Correspondingly, let й=й0 + Й1 A7.20)
448 where -йо = -«'; = Then = /o> ■-Xtu+fi- l|floll2,2<C||/0||0, and since -«!" = llfiilla. Thus, II"IIh^ = {l,)'u + ltu !'+/i, 2^cr^ + \\f1h >.«,=■ sup 0<(<1 K(u, t: l 1,2- Я3 Л/2 L.B. Wahlbii ■ a2) Chapter III A7.21) A7.22) A7.23) A7.24) A7.25) A7.26) where for the Lions-Petree K-functional, K(u,t) Цм-й-йоЦнг + гЦйЛнз fl/2 - fl/2 Ct^ + CII/oll^ + Ct^ + Ctll/JIg, tl/2 „ „K{f,t;H\L2) ^C + C 1/2 —- A7.27) and Claim 17.1 obtains. □ We may then employ an obvious duality argument in A7.12) to prove that (for r > 3, remember), \e(x0)\^Ch3 A7.28) so that the lower bound A7.11) is complemented with an upper bound. We next consider a pollution effect when a higher-order coefficient is rough. Let [\ for 0*£x<l ФОН-, г -,' A7.29) (_2 for l<x<2 and consider the problem (au')'=f in0^x^2, и@) = иB)=0. A7.30) Then since (for / smooth), u' has a jump discontinuity across x = 1, we have ch^\\u-uh\\\,2 A7.31) if x = 1 is halfway between mesh points for a quasi-uniform mesh. Following the
Section 18 Local estimates 449 procedure in the first example above, it is easy to show that, for an example, c/i^||£||Anynormon/o. A7.32) Thus we have a rather severe pollution effect in this example. We leave it to the reader to give a positive result in this case. 18. An example of global pollution with numerical integration In general the influence of numerical integration of coefficients, right-hand sides, etc., is easy to trace through in smooth problems. (In my experience, the simplest way is to first use results concerning u — uh where uh is a fictitious projection into Vh without numerical integration. Then consider the difference uh — u% where ut is obtained with numerical integration. The general principle should be clear from Wahlbin [1978].) Consider the problem of finding u(x) such that with 0<cr< 1, u"=f = x\, -Юс<1, A8Л) м(-1)=0 = иA), where Use piecewise linear elements on a quasi-uniform mesh. To get a total numerical method we interpolate x + linearly (with a mesh point at 0) and use that in an exact fashion. (This is equivalent to some numerical integration.) Let fj denote this linearly interpolated /=x+. Clearly, this is h2 accurate away from the origin. To elucidate the pollution ensuing from the above procedure, we need not even consider the finite element projection: it is enough to consider what would happen in the continuous problems when / = x + is replaced by //. Thus let й be given by ur=f„ -Ux^l, tf(-l) = 0 = u(l). A8.3) For the error e = и — й then, e"=f~fi, -Kx«a e(-l) = e(l)=0. A8.4) Since x = 0 is a mesh point it is clear from the geometry of the situation that f-fi>0foT0<e<l. Now let x0 be near —\, where, after all, both u and й are linear. Then with G° the Green's function for the problem centered at x0, e(x0) = G°(y)(f-fi)(y)dy. A8.5)
450 L.B. Wahlbin Chapter III Note that if x0=* — \, G° is positive for O^x^l. Thus, since, as already noted, f—fi^0, the error in f—fj rules e(x0). By the well-known error formula for the trapezoidal rule, for xbxi+1 mesh points, i^l, (/-//) = ( j /J-ift[/«+1 +/i]= -hh3f'4) A86) Xi Thus, summing, е(хо)>СЛ'+1, xo^-i, A8.7) giving a systematic suboptimal error away from the problematic point, i.e., a pollution effect.
Chapter IV A Singularly Perturbed Elliptic to Elliptic Model Problem 19. The model problem Consider the problem of finding и — u(x; e) such that Ltu==-s2Au + a(x)u=f(x) in Q, A9.1) where О is a plane convex polygon with straight edges. In general, Dirichlet boundary conditions lead to sharper boundary layers than Neumann conditions and hence we take u=0 on dQ A9.2) as our boundary conditions, selecting the harder case. We wish to avoid the problem of turning points and assume therefore that with positive constants а0,аг, 0<ao^a(x)^a1. A9.3) The limiting case of A9.1) is u=f/a. This is singularly perturbed in L°° but, e.g., regularly perturbed in L2. In the case that dQe^2-" for some я>0, the precise behavior of boundary layers as £->0 in A9.1), A9.2) is known, provided A9.3) holds and a(x), f(x)eW2. Let 9(x) denote the distance from x to dQ along the normal to dQ, let x' denote the point where the normal meets SO, and let \j/(x) be a smooth cutoff function isolating a layer around dQ. Set , , /(x) Z(x') ( Э(х) r—\ (iQA^ «.(*) == -T-T - zr^exP v a(x) m*)- A9-4) a{x) a(x) \ г J The first term on the right is called the "regular" expansion, the second term the "boundary layer correction". The theory of such expansions can be found in Besjes [1975], Eckhaus [1973, Section 2.5.3; 1979, Section 7.1.1], and Lions [1973, Chapter II]. Using a "normal-tangential" coordinate system it is easy to see that 451
452 L.B. Wahlbin Chapter IV Le(u — «c)=0(s). The maximum principle then gives 1|и-и.||о...о<в^10(е) = Сб, A9.5) so that, within O(e), in this case A9.4) describes the boundary layer behavior as £-»0. The thickness of the layer is thus 0(elnl/£), cf. Baranger [1979]. In the case of less smooth boundary or coefficients, e.g. our basic case of a convex polygon, we cannot ascertain this precise behavior. For our future local error estimates in the finite element method we need some basic estimates for the continuous problem. For convenience we collect them here. These proofs utilize well-known regularity theory from, e.g., Grisvard [1985]. The details are given in Schatz and Wahlbin [1983, Section 2 and Appendix]; that paper is the basis for this whole chapter. Lemma 19.1. There exists a constant С such that NIco^CII/Hcu,, A9.6) INu^Ce-Ml/llo-fl, A9-7) and Нкя<С£-2||Я|0,я. A9-8) The next result concerns exponential decay in functions which satisfy the homogeneous equation in a subset of Q. Lemma 19.2. There exist positive constants с and С such that the following holds. Let As В with d = d<(A,B)>0 and let Lev = 0 on B. Then for 0<es$l, NIo^ + dllVrHo^Ce-^Hk IIVDllo^e-^HViicB, and в2И2,л<С(£2Г1|^|о,в+1Ио,в)- A9-11) 20. The finite element method and its local behavior in energy We consider the problem A9.1), A9.2) on a convex polygonal domain with a uniformly positive a{x), A9.3). Let Vh be a finite element family satisfying the homogeneous Dirichlet conditions and based on a quasi-uniform family of partitions. For concreteness we may take Qh = Q and Lagrangian elements of order r — 1 on triangular subdivisions. Then the properties listed in Chapters II and III, such as approximation, superapproximation, and inverse properties, hold. Let Ac(v,w) = e2{Vv,Vw) + (av,w). B0.1) A9.9) A9.10)
Section 20 An elliptic to elliptic model problem 453 Theorem 20.1. There exist positive constants c1,c2 and С such that the following holds. Let A&B^Q with d = d<(A,B)^c1h. Let veH^Q) and vheV„ be such that A£(v-vh,X) = 0 forXeVf(B). B0.2) Then ll^-^lli^^Cmin(l|V(!;-x)llo,B + rfllI'-Zllo,B) XtVh + Cd-1e-c^+l»\\v-vh\\o,B- B0.3) The result B0.3) should be compared with exponential decay result in the L2 projection, Theorem 7.2; note that e = 0 "formally" corresponds to the L2 projection. (In the next section we will give the pointwise result which is more perfectly analogous to Theorem 7.2.) For s of unit size B0.3) is exactly Corollary 9.2 for s = 0. Thus the whole present chapter is, in a formal sense, intermediate between the limiting cases of the L2 projection and an ordinary elliptic projection. In light of Chapters II and III, Sections 9 and 10, it is not surprising that we manage to give rather satisfactory results. In order to prove Theorem 20.1 we shall need the following discrete version of Lemma 19.2, A9.9). Lemma20.1. With notation as in Theorem 20.1, if AE(vk,x) = 0for%еУь (В), then \\»Jo,A + d\\VvJ0,A*ZCe-''4'l'+h)\\vJOJl. B0.4) Proof. Let Dp and Dp+6 be two concentric discs of radii p and p + 5, respectively, with center in A, and let coe'tfoiDp+s) be such that B0.5) B0.6) B0.7) B0.8) (The dependence of с on various approximation and inverse constants is easy to trace but left to the reader.) We have now for any xe^h(Bp+6), £2\\(DVvh\\l+a0\\covh\\i ^s2(Vvh,aJVvh) + (avh,w2vh) = Ae{vh, aJvh)-e2(wVvh, 2(Vco)vh) = A,(v„, cd2 vh-X)- e2(coVvh, 2(Vo>K). B0.9) co = l on 1МЦв„ч Let Bt = Qr\Dr MVh,X.) = and that S^ch. D„ ,^С<Г' and assume that 0 for xeV?(Bp + a)
454 L.B. Wahlbin Chapter IV By "superapproximation" we find thus that e2\\coVvh\\%+a0\\o>vhU + C\\vh\\0tBpJh2e-i\\Vvh\\0,Bpt6h2e-2\lvhl\o,BpJ + CE2<T4|fflv4lio|[t>ft||<uw B0.10) Since 5 ^ ch, e2h5-2\\Vvh\\0,BpJvh\\0,Bptd ^СгЧё-1\\Ък\\2о,вР^ + Се28~2Ы\1ар,5. B0.11) Further, by the inverse property, for 6 ^ ch with с large, h2S~* ||vh \\0iBpti||Wvh \\0iBptt ^ChS'11| v„ \\lB^16. B0.12) Also, Ce2e-i\\c»Vvdo\\vdo,B^^z\\o)Vvd20 + Cs2e~2\MlB^ B0.13) Using BQ.11)-B0.13) in B0.10), je2\\Vvh\\20tBp+a0\\vh\\lBp ^Ce'he-'WVvbWls^ + C^d-' + hd-'nvbWl^. B0.14) Iterating the gradient term on the right once more and using inverse properties, ^h8-1lCe2hd-i\\Vvdi*p.2^a'?t-2 + h6-lnvJlBl,,u} ^с(^^Ш2о,Вр^ B0.15) Similarly, iterating the last part of the l? term on the right of B0.14), and using B0.15), ^-хЬЛ%^^сы~^гь-2ь&-х)ЬЛ\Лр^ ^с(Е~^-)\Ш\1в^- B0.16) \ <*2 / Using B0.15) and B0.16) in B0.14), and changing notation slightly, h2 II Vvh \\1вр+ао II vh \\1ве^с(~~^\ || v„ \\20,Bpt6. B0.17) Consequently, with К independent of s, h, p or 6, IIVBjo.^Ka-MMo.iv, fore^ft, B0.18) and v„h,B.^K*—s IKikw B0.19)
Section 20 An elliptic to elliptic model problem 455 (In the case s^h the inequality B0.18) has to be separately established. This is easy from B0.19) and inverse properties.) We now choose 6 so that /f2 + h2 KV ^e. B0.20) S Then, iterating, Ыо,Вр<е-"Шо.вр^б. B0.21) We may do this for NS^d, B0.22) or, from B0.20) N^d/S = c2d/(s + h). B0.23) By B0.21) (and squaring and summing) this proves half of B0.4) and combining with B0.18) (8=d taken), the lemma is proven. □ We can now prove Theorem 20.1. Proof of Theorem 20.1. It suffices to verify the result with A and В replaced by Bp and Bp+6, S^d^ch, and then squaring and summing. Let юе^оФр+гз) witn ш=1 onDp+s, WcoWvHD^^Cd''. B0.24) With v — cov, vh = Pch(a>v), the projection w.r.t. the form AE, we have \\v-vh\\lrBp^\\v-vh\\i + \\vh-vh\\UBp. B0.25) Here, by Lemma 20.1, since Ae(vh-vh,x)=0 forxeVZ(Bp+i), we have ^C5-^-al^hK\\v-vh\\o,Bpt6 + \\v-vh\\o,BpJ. B0.26) Hence, \\v-vk\\ltBp^\\S-Skh+6-1\\€-Uo + C6-1s-a^+^||u-Pillojw B0.27) It remains to estimate the first two terms on the right of B0.27). For this we use the following lemma. Lemma 20.2 (Global energy estimates). There exists a constant С such that HV(p-PJp)|lo<{^I"llfl' B0.28) [Ch\\v\\2,
456 L.B. Wahlbin Chapter IV and II ог II ^.jC/jN!l> nn-)Q\ ||у-ад^!с№ц2. (Ж29) Admitting this lemma for the moment, llu-oJIi^CHcIl! and, for S^d^ch, Since \\S\\^C\\\/v\\0^16 + cS-l\\v\\Q,Bpt26, we have from B0.27), ll»-i'*l!i.j.^C(||V»||o,V(, + ^-1||»|lo^„) + Cr1e-^<E+">b-t;J|o,Bp+M B0.30) which, upon squaring and summing, shows B0.3) with % = 0. The whole of Theorem 20.1 now follows by writing v — vh — (v — x) — (fy — x)- П It remains to show Lemma 20.2. Proof of Lemma 20.2. Set e = v — P\v. Then e2\\Ve\\20 + a0\\e\\l^A£(e,e) = Ae(e,v~x) ioi aay xeVh. B0.31) Hence, e2||Ve||§ + fl0||e||2^C{e2||V(c-Z)||§ + ||»-Z||g}. B0.32) We first treat the case s^ch. Taking x = Plv in B0.32) it is well known that II V(t7 — x)IIi ^СЛЦю||! and llc-zllo^ChllolUsothat l|Ve||§<C{||»||?+hV2||»||?}<C||i;||?. Hence, the first inequality of B0.28) obtains. Using instead that || V(t> — y) \\ x ^ Ch\\ v || 2 and || и — xllo^Ci2IMl2 the second part of B0.28) follows too. We continue with a duality argument. Let w solve Lcw = e in Q, w = 0 on dQ. Then, for any #e Vh, \\e\\o=AAw-X,e) so that with a suitable %, IMlNC{E2||Ve||0fc||w||2 + C||e||0fc2||vvi|2}
Section 21 An elliptic to elliptic model problem 457 and by use of Lemma 19.1, \\e\\20^C{h\\VeUe\\0 + h2£-2\\e\\20}. Hence for s^ch,c large, ||e||o ^Сй|| Ve||o and B0.29) follows. It remains to consider the case of e^ch. By B0.32) with x = P\v we have also ||e||g<C(e2+fc2)||i>||?<CA2||»||?, or ||e||g<C(e2+A2)A2|Mli<CA*|M|§, establishing B0.29) in this case of e^ch. For B0.28), use B0.29), a suitable intermediate #e vh, and the inverse property. This proves Lemma 20.2. □ 21. Local pointwise error estimates As remarked in Section 10, to prove local pointwise estimates present technology of proofs requires a "related" projection on a domain, including the one of interest, for which one knows (almost) stability in L00. In the situation of Section 10 that problem was a local Neumann problem on a disc (and the approximation properties at the boundary of its domain never really "entered" since all functions could be extended to the outside and then approximated on full elements). Thereafter, we gave some examples of how to extend this to domains abutting on the boundary. In the present setting we shall use the global Peh projection itself as the fundamental stable process. Thereafter, localization will be done by cutoff functions and (the cheap way out!) use of the two-dimensional Sobolev's inequality on Vh. Clearly, this is extremely inelegant, leads to loss of unnecessary log l/h factors, is restricted to two dimensions, etc., but it does, at present-day technology of proofs, lead to a manageable and readable proof. As usual at this stage of our survey, we shall be cavalier about exact assumptions. Consult Chapters II and III for more exact details, or, consult Schatz and Wahlb[n [1983]. Suffice it here to have in mind Lagrangian elements on a quasi-uniform family of triangulations so that all the relevant formalized assumptions listed in previous chapters hold. We start by proving the global almost best approximation property of the P% projection into Vh based on the form AE, see B0.1). Theorem 21.1 (Global almost stability in L00). Assume a quasi-uniform mesh family and that и is continuous. Then ||м-^и|1о.».о<1п(С + Б/А)тт ||u-xll0.».a- B1.1)
458 L.B. Wahlbin Chapter IV Proof. We write uh — Pehu, Let x0 be a point in Q, x0 e Г0 an element. By the inverse property, \(u-u„)(x0)\ < |и(хо)! + Ых0)\ < |ц(х0)| + Ch~11| u» ||0,r0 <С||и||0(в).0 + Сй-1||н-нЛо,Го- B1-2) Here, /1_1||м-мй||0,г0 = й~1 sup (и-ик,ф). B1.3) !l«o = l For each such fixed ф, let L£u = 0 in Q, v = 0 опЭО, B1.4) and set vh = Pehv. The following are simple results that we shall need: Nlo+lkllo< lCCh/E B1-5) ||Vp||o+||V»»||o^lie-2ln1/a(C + 8/ft). B1.6) To see these results, note first that s2\\Vv\\20 + a0\\v\\HAc{v,v) = W,v), B1.7) and similarly for vh, so that the first inequality in B1.5) obtains. For the second piece of B1.5), consider first v itself. Letting Lew = v/\\v\\o, w = 0 on dQ, we have |М1о = Ле(в,*)=.((М). B1-8) Sobolev's inequality in the form Мо.со^СМ^Мо'2 leads to W.wXII^IIo.illwIli'lllwll^^Cftll^lloCe-^Cte-1 by use of Cauchy-Schwarz' inequality and Lemma 19.1. This shows half of the second part of B1.5) and since, by Lemmas 20.2 and 19.1, ||t> —i>fc||0^Cfc||u|| i =$Сй/е all of B1.5) follows. For B1.6), by the analogue B1.7), е2\М\Н(Ф^иН\\Ф\\0л,твЫ0,о>,т0- B1-9) Let Q1 denote the intersection of a disc of radius max( C/i, e) around x0. By Cauchy- Schwarz' inequality and by Sobolev's lemma on Vh, see A0.34), and also by use of B1.5), e2||eh||?^Chln1/2(C + e/fc)(||i;fc||1.0. + e-1||»*llo.ff) <Cfcta1/2(C + 6/fc)||»Ji + Cfe2e-2ln1/2(C + e/fe). B1.10) By this, lluJU^ Che" 2ln1/a(C + e/fc).
Section 21 An elliptic to elliptic model problem 459 Since also, by Lemmas 20.2 and 19.1, \\v-vh\\Hi^Ch\\v\\H2^ChE~2 we have completed the proof of B1.6). Return now to B1.3): (и-ии,ф) = Ae{u~ uh, v) = Ae(u -uh,v-vh) = Ae(u, v - vh) Ti \ uA(v-vh)+ ф u — (v-vh) ]+(au,v-vh) Ti BTt <C||M||o.00{e2l»-»*l2,i;* + 82fc||V(»-»»)||o.i + ll»-o*llo.i}, B1.11) where we used a trace inequality on ST,-. The notation |'|2,i;a means a piecewise norm. Here, adding and subtracting a suitable %, \v-vh\2,uh^C\\v\\2yl + Ch''1\\v-vh\\1,1 so that \к-\и-ик,ф)\ ^C\\u\\0^{E2h^\\v\\2tl+E2h-2\\v-vh\\1A+h-'\\v-vh\\0A B1-12) Let now Qj = {xeQ:2-j ^\x~x0\^2'j+1} B1.13) and Q'j = Qj-i vQjKjQj+i, Q"j = Q'j- j uQjuQ'j+1. Assume for simplicity in writing that 0=[j?=1 Qj. Let J* be such that 2'J* = C^h with C* sufficiently large, to be determined, and let Q* = {xeQ: \x—x0\ <2~J*} so that G=((j£i Gj^Q*. Set further dj=2~i and dif = 2~\ and J, E Nl*,p,n,= E llfllk.p.flj+ll^L.p.fi,- B1.14) Returning now to B1.12) we estimate the three terms on the right. For the first we have by use of Cauchy-Schwarz' inequality, and Lemma 19.1, A9.8) and Lemma 19.2, A9.11), ^С+СеЧ-1 £ \\Vv\\0in, + Ch-1 £ 4N|o.OJ. B1.15) ;=i j=i
460 L.B. Wahlbin Chapter IV Note next that j, \ 1/2 £ e-M,/. ^lnl/2(C + g/^ B1.16) 7=1 / for, since exp(—2c2_7£) is increasing with x we may estimate the sum inside the square root by I = J{*+Jexp(-2c2~x/e)dx. Substituting 2~x/s = y, since 2~J*=ё С*/г, l/2e /*Slog2 | e~2c^ C»/i/4c and the result readily follows. By the exponential decay results of Lemma 19.2, A9.10), for C„ large, using also B1.6), B1.16), e2ft_1 Z IIV»||o.o; 7=1 «jCe2*. £ e-^||V»||o.0v 7=1 / J* \l/2 <Сб2Л-1f£ е-*-'"] ||V»||0^ln(C + 6/fc). B1.17) Introducing the notation ^ = fc-1I*^(||t>||0.0,+ ||Ph||o.flj), B1.18) we have from B1.15) and B1.17), fi2fc-1||o|li.2<C62A-1£*d;||p||2i2iDj<ln(C + E/fc) + C^. B1.19) For the term h~1\\v — vh\\1 in B1.12) it is immediately bounded by СУ. Hence, by B1.12) and B1.19), \к-1(и-икф)\ ^С\\и\\01Х(Ы(С + ф) + У + е211-2\\ь-Ьь\\1Л), B1.20) For the last term on the right we use the local energy norm error estimate of Section 19 and find, for any %eVh, e2A~2||»-»*||i,i = e2ft-2£*||»-»h||ljlp0y <Ce2h-22;*(dy||C-Z||1>2f0;+||»-z||o.0;) + Ce2fc-2X*e-^№+*)|l»-»*llo.oj ^Zt + Z*- ' B1.21)
Section 21 An elliptic to elliptic model problem 461 Here, by local approximation theory and by B1.19), n<C£2fc_1£4#lk2,fi;^ln(C + £//i) + C^. B1.22) Further, by Lemma 20.2, B0.19) and Lemma 19.1, A9.8), Z^Cs4-2(y\-2^+»y12 \\v-vh\\o ^CE2lnll2(C + (£ + h)/h)\\v\\2,2 <С1п1/2(С+е//г). B1.23) Inserting B1.22) and B1.23) into B1.21), and then the result into B1.20), |й->-и*,0|<СИо.вAп(С + е/Л) + П B1.24) It remains to prove that S? = h-1Z*dj(\\vh,oj+H\\o.0jHC. B1.25) Admitting this for the moment, \h-\u-uh, 0)|<С||и||о.„ ln(C + e/ft) and so II« - "л II o, «> «S hi(C + e//i) N1 о, со and the theorem would follow upon adding and subtracting %eVh. To verify B1.25) proceed as follows. Since, by B1.5), ||tf|| + ||t>A|| <C it suffices to estimate A^I^flNo^+IIMo,,,,). For C^ large enough we invoke the exponential decay results of Lemmas 19.2 and 20.1. Thus, У^С + Ch-1 £ ^-«""IMIo.oj + CA-1 f ^e-^*+,!)||fJlo,ni j=i j=i ^c+ch-hf^ ^ye-2^"Y/2 win ;=i \e + С/Г1(б + й)(| (-~%f е~2Ы'11£+И) W; (illO <С + СА_1е||17||0 + СЛ-1(е + Л)||»л||0. B1.26) Applying B1.5) and separating the cases e^/i and e^/i we deduce B1.25). This completes the proof of Theorem 21.1. □ Note that the global estimate of Theorem 21.1 is of no use whatsoever if e<^h and
462 LB. Wahlbin Chapter IV boundary layers are present; it then merely says that uh is bounded! Hence, in this case, it is imperative to localize the result. For the result to be stated in Theorem 21.2, we consider a disc D of radius d around a point x0, and, if abutting on the boundary, its intersection with Q. We assume that the mesh family is quasi-uniform on the whole of Q, but, as is easily seen, it suffices that a mesh family quasi-uniform on D can be extended to a mesh family quasi-uniform on Q. (However, this problem has not been given much thought in the literature.) Likewise, we assume that on the whole of Q, 0<ао<й(х)<а1, xeQ; B1.27) if this holds for xeD it is quite easy to extend a(x) "virtually" to the whole of Q so that B1.27) is satisfied. Thus, under various circumstances, Theorem 21.2 can be further localized. Theorem 21.2 (A local pointwise error estimate). Let the mesh family be quasi- uniform on the convex polygonal domain Q, and let 0<ао<а(х)<а1, xeQ. There exist positive constants cx, c2 and С such that the following holds. Let и he a continuous function with u — 0 on 60. Let uheVh be such that Ae(u-uh,X)=0 forXeVf(Qd) B1.28) where Qd = DnQ with d^c^h. Then |(u-w,)(x0)|^Cln1/2(rfA){ln(C + eA)min||M-Z||0.QO,^ r.eVh + d-1e-^+»\\u-uh\\0,[2d}. B1.29) Proof. Let D" and D' be discs concentric with D of radii \d and \d, respectively, and set Q" = QnD", Q' = QnD'. Let cae^(D) be such that @ = 1 onD', INIo,»^!. B1-30) Set й = (ои and uh = Pl(cou). Then |(н-нл)(х0)|<|(м-цА)(х0)| + |(нл-ил)(х0)|. B1.31) Here by Theorem 21.1, |(« — fifc) (x0)|< ln(C + s/fc) || й || 0, „ <ln(C + fi/fc)||M||0,w,Ol,. B1.32) For the function uh — uh in Vh we have by B1.30), At(uh-uh,X) = 0 foixeV^(ff). B1.33) By Sobolev's inequality in Vh, \(uh-uh){x0)\ <Cln1/2(d/ft){||fi»-UJ|lfn.. + d-1||fi*-«fcHo,0"}. B1.34)
Section 22 An elliptic to elliptic model problem 463 Hence by Lemma 20.1, and by B1.30), |(й*-иА)(х0I <Cln1/2(rf/^-1{e-c^+'I,||u(l-Mft||o,fi-} ^C]^l2{d/h)d-y\\uh-u\\0,n. + C\nll2(d/h)d-1e-c^+h)\\u-uh\\0,ni. B1.35) By Theorem 20.1 and Cauchy-Schwarz' inequality, Cln1'2(d/fc)d-1||H»-fi||o.0. ^Cln^id/hWb-uWo,^ ^Cln(C+e/h)lnll2(d/h)\\u\\0t00}ad. B1.36) Combining the above we have proven B1.29) with / = 0. Writing u—uh=(u — %) — (и — х)и completes the proof of Theorem 21.2. □ 22. Some concrete estimates using Theorem 21.2 The reader may easily extend the result of Theorem 21.2 to the case when Q is a convex domain with a smooth boundary, and the approximating domain Qh £ Q has a boundary within 0(h2) distance of the boundary of Q. In this section we shall consider the case when the nonsmoothness in the homogeneous Dirichlet problem A9.1), A9.2) is occurring only in a boundary layer. Furthermore, we concentrate attention on the case of piecewise linears on a quasi- uniform straight-edged family of triangulations. Thus, we make the following assumptions: Local behavior of u. B2.1) There exist positive constants С and c, independent of e, such that with d(x) = dist(x,8GX B2.2) we have -cdUJ/Л < CI 1 + 5— ) for l«l < 2- B2-3) If data in the problem are smooth enough, this follows from Besjes [1975, Theorem 13]. A priori assumptions. B2.4) There exists a constant С such that ll/IL.o«C. B2.5) Finally, we make an a priori approximation assumption appropriate to the
464 L.B. Wahlbin Chapter IV piecewise linear case. Local approximation. B2.6) There exist constants с and С such that the following holds. Given a function v with u=0 on Эй there is j_e Vh such that for 7] an element, Wv-xIUt^CVWvWw, B2.7) where T'i = (\jTj)r\Qh with the union taken over elements 7} within a ch distance of Г,. Note that by the maximum principle, cf. Schatz and Wahlbin [1983, Lemma A.1] for less smooth cases, with B2.5), IMIo,»,o<flolll/llo.e,.fl<C- B2.8) Let now e(x) — u(x) — uh(x). Theorem 22.1. Assume that the local result of Theorem 21.2 holds. Assume furthermore B2.1), B2.4) and B2.6) above. There exist positive constants cb c2, c3 and C, independent of h and e, such that the following holds. Recall that d(x) = dist(x,8fl). B2.9) (a) If e^h, then |e(x)i^Cln1/2(<i//i)/i2 for d{x)>c2hln(l/h), B2.10) and |e(x)|<C for d(x)^c2hln{l/h). B2.11) If furthermore е^с^\п{1/К), then ]e(x)\^Clnll2(d/h)(h2 + c'C3ilh) for d(x)^c2h. B2.12) (b) If z>h, then ie(x)|^ln1/2(d//i)ln(C + e//i)/i2 for d(x)^c2eln(l/e), B2.13) and \e(x)\^\n(C + e/h)h2E for any d(x). B2.14) Proof. The easier estimates are B2.11) and B2.14). From the global almost best approximation property of Theorem 21.1, ]e(x)|<ln(C + £//i)min ||u-Z||0>e>. B2.15) xeVh For B2.11), simply take # = 0 and use B2.8), and that e/h^ 1.
Section 22 An elliptic to elliptic model problem 465 For B2.14), using B2.7) and B2.3), min||u-xllo,»<C/z2||M||2>00<C/iV2 B2.16) XeVh and the desired results follow. The remaining results are based on the local estimate B1.29) of Theorem 21.2. Taking there d equalling the present d(x) = dist(x, 60), we have for d ^ c'h, \e(x)\ «= С In1'V/>0{ln(C + e/h) min ||«-z||0p00,O|f XeV" B2.17) + Crf-1e-^+*>||u-u,I||o>„d}, where dist(Qd, dQ)^id'd. Here, by B2.6) and B2.1), min \\u~x\\o,*A^Ch2(l+e-cd*e-2HClnll2(d/h)h2 B2.18) XsVh under our assumptions on the relative sizes of £, h and d = d(x). Similarly, d-'Wu-UHWo^CWu-UbWo^^MC + s/h). This proves B2.10). Similarly, easy arguments establish B2.12) and B2.13), see Schatz and Wahlbin [1983, Theorem 10.1] for details. □ Remark 22.1. Finally, we remark that the basic paper Schatz and Wahlbin [1983] comes equipped with various bells and whistles, such as treating nonlinear problems, and giving very sharp estimates for || и — uh || under minimal assumptions on the smoothness of coefficients, and under minimal smoothness or compatibility assumptions on the right-hand side /. Also, Neumann conditions and nonhomo- geneous conditions of Dirichlet of Neumann type are treated there. The present chapter is merely an excerpt introducing the reader to the basic ideas. For one-dimensional results we refer the reader to Niijima [1980a,b] and references therein. Numerical illustrations can be found in the papers of Niijima and Schatz- Wahlbin. We give one here showing how boundary layer pollution abates. With e = 0.001, let —s2Au + u=0 in0^x,y<l, u(x, y) — e " x/c + e ~y,t on the boundary, which has the obvious exact solution. With triangular linear elements the mesh was essentially the product of a uniform subdivision in the ^-direction with h = 0.05 and a subdivision {xJ+ j}2,0 in the x-direction, xj+1 =0.017 +0.002/2. Thus the x-mesh was slightly refined towards the origin. There were 361 interior nodes and 800 elements. We display the errors at mesh points in a subregion of the unit square
466 L.B. Wahlbin Chapter IV (see Fig. 22.1, reproduced with the kind permission of the American Mathematical Society). / ® f 4 I ? • o.i . * 1 1 i / / / SCALE FOR ERRORS: / =0.1 0.1 Fig. 22.1.
Chapter V A Singularly Perturbed Convection-Dominated Model Problem 23. The model problem Consider the problem of finding и = u{x, y; <5, s) such that Ld:t=—euxx—euyy + ux + u=f in Q, и = 0 on 9Д B3.1) where Q is a plane convex domain. Clearly this problem has a unique solution for any (reasonable) / and it has a maximum principle so that (cf. as general references Eckhaus [1972, 1973, 1979], Eckhaus and de Jaeger [1966], and Lions [1973]) NL<C||/|L B3.2) where С does not depend on ё or e. We are interested in the case when e, 5 >0 and small. B3.3) The formal limiting case is the "hyperbolic" problem of finding м° such that u°x + u°=f infl, и° = 0 опГ_, B3.4) where Г_ is the inflow part of the convex boundary Г = 30 (see Fig. 23.1) r_=closure{xer:vx<0} B3.5) with v = {vx,vy) the outward normal.
468 L.B. Wahlbin Chapter V Clearly, at least as measured in L°°, the problem is singularly perturbed as e, 5 tend to zero in the sense that I|m°-"IIo,.,o-M> as<5,e-+0+. B3.6) Under various hypotheses the behavior in boundary layers of и is well described in the general references listed above. In order to highlight a comparison with the finite element situation we shall state and sketch an elementary proof of the following result. For simplicity of proof we assume that 5 and e are not completely independent, but that with some p, e^5p or 5^ep. B3.7) Lemma 23.1. Let s>0 be any number. There exist constants С and К such that the following holds. For any (x0,y0)eQ, let Q0 = {{x,y):x-x0^K8lnl/d,\y-yo\<Ky/Blnl/e}r\Q. B3.8) Then for и the solution of B3.1), М*о,)'оI^С|1Якао,о„ + С(<55 + е5)Шо,Я. B3.9) Thus, Lemma 23.1 rather sharply delineates how / influences the solution и at a point. Note that the region Q0 extends all the way upwind from (x0, y0). We say that the downwind spread is 8 In 1/8 and the crosswind spread ^/eln 1/e in the continuous problem. For the proof of Lemma 23.1 we shall use an auxiliary result in L2. Since the proof is very analogous to the finite element situation, for further future elucidation and comparison, we state this result separately. We denote by L* the adjoint to L. Lemma 23.2. Let L*jCu=0 in £2'1 = O\£2'0,Q'0 as in B3.8) with К replaced by 2K. Then with О1 = й\О0Э^'ь setting l№¥D = Slvx\\lD + e\\vy\\lD+ ||v||§.D, B3.10) we have U.vT}0^C(d' + ^)[M]0l. B3.11) Proof. With p and a small quantities to be determined, set Пко = {(х,уУ-х-х0^2К5ЩЩ-кр, |y-j;ol^2Kv/^ln(l/8)-fca}nO. B3.12) Let Q\ = Q\Qk0. B3.13) Note that Qi = f2?. Introduce now suitable cutoff functions to isolate Q\. In this "rectangular"
Section 23 A convection-dominated model problem situation we can find фк(х, у) such that фк = 1 on Q\, фк=0 on Oq+1, ■фк\<Ср \ 9j ^Cff, ox and such that 8 Let further B(v, cp)- 3{vx, cpx) + s(vy, cpy)+(vx, cp) + (v, <p), and with Нк=Ио,я?, П=ЬЫ1+Фу\\1+Ы\1 Then, by B3.15), fk < 3{vx, фкvx) + s(vy, фкиу) + (v, (M + \ ЛФк)х nr+1\fif Here, фх,фких)=ё{юх,{фки)х)-5(юх,(фк)хр), Фу, Фк У) = Фу, (Фк »)у) ~ Фу, (Фк)у»), and и2(Фк)х=~ »*(#*)• Thus, ■tl^Biv, фки)-д(их, (фк)хи)-е(юу, (фк)ую) and since B(v, ф^)=0 by assumption, fl^ -3{vx, (фк)^)-фу, №k)yv). 469 B3.14) B3.15) B3.16) B3.17) B3.18) B3.19) By Cauchy-Schwarz' inequality and the arithmetic-geometric mean inequality, and by B3.14), ^х,(Ф^)^б\\у/(}ргкТ^х\\0\\у/(фк)х1\\0 <Cdp-1,2\\vJk + Jy/bfk}xv\\o,o^\G'! Cd e p (Фи)У- Q? + 1\Q? Further, by B3.14), £ £ Ф^Д^К - IK II *+1 + c~i И f +1 ■ B3.20) B3.21)
470 L.B. Wahlbin Chapter V Select now C<5/p=i B3.22) so that the term /о^Чя?^*)* cancels back in B3.18). If further Ce/ff2 = e-1, B3.23) then it follows that n<e-'fl+i. B3.24) Iterating N times, СИ]01<е-*[М]0». B3.25) Since pcze and ff~ y/s, by B3.21) and B3.22), ifK = K{s) is large enough we can do this iteration a number of times N which is logarithmic in the minimum of ln(l/E), ln(l/£). Thus, B3.11) follows. □ Proof of Lemma 23.1. Let /=/o+/i where /0 is supported on Q0, and correspondingly m = u0 + Hi. By the maximum principle, 1|ио11о.=0<С||/о||о>0О<С||/||о.„,Яо. B3.26) For Ui, with G the Green's function centered at {x0,yo)> so that u1(x0,y0) = B(u1,G) = (/1,G), B3.27) since G satisfies the conditions of Lemma 23.2 on the support of /j, we have l«1(x0,}'o)l^ll/1|lo,o!|G|lo)«1^C||/||o,fi(<5s + £s)[[G]L- B3-28) By the simplifying assumption B3.7) and standard elliptic estimates, for some /, [[G]]o^Cmin[$-',e-']||G||0.o and the desired result would, clearly, by B3.26) through B3.28), follow if we can show that IIGIIo.oSSCe-1'4. B3.29) To prove B3.29), let G be the Green's function on the whole of U2 for Ls,e. By the maximum principle , G>G$= 0 so that it suffices to estimate ||G||R2. But and using Parseval's formula and elementary estimations the result obtains. (Considering e.g. the first quadrant in (£,>/), subdivide it into six regions by t?<e/2,4>e~1/2,^l, £>1, S^8-\ fed'1. In each region, estimate \&G{i,r\)\2 accordingly as 1, Г2, S~2r\ e'2n-\ «2+eV)_1, ((J2r + eV)_1 and integrate.) This proves Lemma 23.1. □
Section 24 A convection-dominated model problem 471 24. The streamline diffusion finite element method Let Vh denote a typical family of finite element spaces, based on quasi-uniform mesh families and vanishing on the boundary of dQk. Our interest is in the singularly perturbed case and we assume that 8,e^h. B4.1) Hence we do not seek to resolve boundary layers or other singularities but rather aim for a numerical method in which singularities do not pollute into regions where the solution is smooth. The ordinary Galerkin method, i.e., seeking uh e Vh such that Щик)х,хх) + Фн)у,Ху)+(Ых,х)+(ин,хЫЯх) foTX€Vh, B4.2) is well known for its severe pollution. This may be understood already from the one-dimensional analogue, -euxx + ux+u=f on/ = [0,1], u@)=u(l)=Q, ( • } and the case of piecewise linear functions on a uniform mesh. Then the equations for nodal values correspond, essentially, to finite difference equations where ux has been replaced by a centered difference quotient. Thus, in terms of nodal values Uj, the significant terms on the left are -^/j_1("J4-i-2uJ + «j_1)-i(MJ+1-^_1) + h$uj+1+liUj+frj-1) = ---. B4.4) The corresponding homogeneous three-term recurrence relation has fundamental solutions of the form r{,r% where fr + S/h+k lih and 5/h are small, one of the roots is close to — 1 and hence contributes a highly oscillatory piece to the solution of B4.4). In the simple one-dimensional situation, one well-known fixup is that of using upstream differencing for approximating ux. In the finite element situation, a similar effect, in the one-dimensional problem B4.3), comes about by using a Petrov-Galerkin method, i.e., a method with different trial and test spaces. To whit, we seek щ е Vk in the same trial space as before but use test functions x+i^&t» Xе Vh. Thus, S(wh,x'+W)+Wh,x+jhx')+(uh,x+W)=(fa+k') for^n. B4.6) In the one-dimensional piecewise linear situation, the equations for nodal values are now -(8 + ih)h~l(uJ+1-2uj + Uj-1)~(l-$h)(uJ+1-Uj-1) + h&uJ+1+$Uj + tuJ-1)=-. B4.7)
472 L.B. Wahlbin Chapter V (We have discarded the form 5h(u'h,x'h) which does not make sense. Its formal order is Sh^h2.) Rewriting B4.7) to make it look like upwinding, -5h~ \uJ+ l-2uj + Uj- i) + (Mj-_ i -uj) + h&+i)uj+i+buj + &-touj-i)=- B4.8) and we recognize the classical upwinding. (In case the test functions were taken as X + const • h/, with const > 0, we would see that the method gets more information from upstream than from downstream.) In the one-dimensional situation, note that if we can make sense of the terms &h(u'h, x"), as is the case for "^-splines, e.g., we may keep them. Taking testfunctions X + h/ for simplicity, by rearrangement, the equation analogous to B4.6) becomes (d + h)(u'h,x') + (l~h){u'h,x) + (uh,x) + Sh(u'h,x") = (f,X + hx') B4.9) which exhibits the feature of having diffusion of strength (<5 + h) while being exactly satisfied for the real solution to the continuous problem B4.3). In short, we have a method of upwinding which is, at least formally, of any desired order of accuracy. More elaboration on the one-dimensional situation can be found in Wahlbin [1974a,b], where the rationale for using test functions x + h%' was to obtain optimal order error estimates in hyperbolic problems, cf. Dupont [1973]. As for other articles, we mention only Christie, Griffiths, Mitchell and Zienkiewicz [1976], and Leonard [1979]. The literature on numerical solution of one-dimensional convection-dominated flows is enormous. Now back to the two-dimensional situation of the model equation B3.1). As is well known, the ordinary Galerkin method fails badly in practice as wild oscillations spreading into most of the domain occur when boundary layers are present,, cf. the one-dimensional situation described above. A "classical" method of introducing artificial diffusion, or damping, is to find uhe Vh such that for /e Vh, B4.10) This method is, in practice, discarded due to excessive damping, in particular in the crosswind direction. Note also that the solution и of B3.1) would satisfy B4.10) only to within 0{h) accuracy in general, i.e., the higher diffusion is completely artificial, and the method is at best of first order. The streamline diffusion method is a finite element method which combines formal high accuracy with decent stability properties. The method was introduced in the case of stationary problems by Hughes and Brooks [1979], cf. Raithby and Torrance [1974] for the finite difference situation. The idea is to introduce extra diffusion only in the wind direction ( = positive x-direction in our model). Following the ideas of the one-dimensional case already described, this can be done without changing the (formally) optimal order of the scheme, namely by using a Petrov-Galerkin method with test functions x + hxx. Note that, in contrast, any crosswind (y-direction) diffusion has to be artificial.
Section 24 A convection-dominated model problem 473 Thus, in a formal sense, we consider the problem of finding ще Vh such that <KK)x, (x + hxx)x) + Фи)у, (X + hxx)y) + ((uh)x,X + hxx) + (u„,X + hxx) = (f,X + hxx)- B4.11) For simplicity below we shall only consider the case of Vh piecewise linears on quasi-uniform triangulations. B4.12) Then the terms Sh((uh)x, xxx) and eh((uh)y, xxy) do not make sense and we shall simply discard them. Furthermore, we shall allow artificial crosswind diffusion so that e is, possibly, replaced by £той^0. Thus, introducing the notation (motivated by some integration by parts in B4.11)), B(v,(p) = C + h)(vx, <px) + £mod(vy,<p,) + (l-h)(vx,<p) + (v, q>) B4.13) and ф = ср + 1крх, B4.14) we seek uhe Vh such that B(uh,X) = (f,x) f°rxeVh. B4.15) It is clear that B4.15) has a unique solution for any/. For, the homogeneous equation enjoys uniqueness, since if B(uh,x) = 0 we have upon taking x = uh, 0 = B(uh,uh)^e + h)\\(uh)x\\20 + £moJ(uh)y\\20 + \\uh\\20 B4.16) so that uk = 0. The modified crosswind diffusion gmod will be given as follows in the sequel. Let 0^£co^h be a crossover point, typically dependent on h, and set emo(i4eco' JJS:;eco' B4.17) [s, if e^£co. Thus in the "classical" method B4.10), eco = h, while for the "pure" streamline diffusion method, £c0 = 0. In our later theoretical development there will be a theoretically optimal ес0~й3/2, which may, however, not be optimal in computational practice. We next summarize the deviations made from B3.1) in B4.15). Differentiating B3.1) with respect to x, multiplying by -h, adding to B3.1) itself, and integrating some by parts, it is easily found that the solution и of B3.1) satisfies В(и,(р) = (/,ф) + Рет(и,(р) foT(peE\Q), B4.18) with the perturbation form given as Per(u, <p) = h(Suxx + suyy,<px) + £per(uyy, cp) = Per^u, qy) + Per2(w, cp) B4.19)
474 L.B. Wahlbin Chapter V where fO for s^£ (eco-e) for e<Eco. Formally, these perturbation terms are of order he + hE + EpeT. We next give general references to the literature on streamline diffusion methods. As already mentioned, the germs are found in Raithby and Torrance [1974] and Wahlbin [1974a,b], cf. also Dendy [1974]. The paper by Hughes and Brooks [1979] in the stationary situation is seminal. The problem of putting this computationally promising method on a sound mathematical basis was started in Johnson and NAvert [1981] and continued with extensions to, e.g. time-dependent problems and the zero diffusion limit of hyperbolic problems in NAvert [1982], Johnson, Navert and Pitkaranta [1984], and Johnson and Saranen [1986]. For computational aspects of the method in various situations, see Brooks and Hughes [1982] and also the series of papers summarily entitled "A new finite element formulation for computational fluid dynamics", Hughes, Franca, Mallet and Mizukami (in various combinations) [1986]. In particular, we mention number IV of that series where an interesting nonlinear method for pouring on more streamline diffusion in regions of high gradients (but moderate streamline derivative for щ so that in the case of downstream layers the numerical solution is clearly amiss) is given. Our analysis in upcoming sections is based on Johnson, Schatz and Wahlbin [1987], with due thanks to predecessors mentioned above. A local analysis for a hybrid-upwind finite element method is contained in Risch [1986]. In the papers mentioned above, time dependent problems and the limiting hyperbolic case are sometimes treated, with local error estimates. The techniques of analysis are quite similar in the singularly perturbed convection-dominated case and in the pure hyperbolic case. However, in the pure hyperbolic case one also has recourse to "time-stepping" methods such as the discontinuous Galerkin method. We refer to Lesaint and Raviart [1974] for the inception of the method, to e.g. Richter and Falk [1984] for further considerations and references, and to Johnson and PitkAranta [1986] for an elucidation of the local behavior in the discontinuous Galerkin method. Ending this section on a somewhat loose note, I wish to point out the problem of crosswind spread in the model problem B3.1) and its numerical analogue B4.15). At present, the question of numerical downwind spread of very sharp fronts is answered in a rather satisfactory fashion: It is limited to 0{h In l/h). (Naturally, this is an asymptotic result and oscillations may not please the practical eye.) However, the numerical spread crosswind of fronts following characteristics (in the limiting zero diffusion hyperbolic case) is virtually unknown; we shall give some theoretical results, optimized by playing around with the artificial crosswind diffusion £mod. We warn the reader that the theoretical results do not match computational experience. (Of course, the theoretical results are too conservative.) We end this section by displaying some graphs depicting crosswind spread (see Fig. 24.1). These graphs were computed by William Semper and are included here with his kind permission.
Section 24 A convection-dominated model problem 475 Fig. 24.1. Here, <5 and e in B3.1) are both 10 5, and the real solution given by и(х>У) = ((х +1J -exp((x-1)/<5))A -exp(-^)) B4.21) with £2 = [0,1] x [0,1]. (Thus, boundary values are inhomogeneous.) The method B4.15) was applied with h = ^ with four-node quadrilaterals on a uniform mesh. A series of emod were taken in B4.15), starting with £mod =\h and then quartering it in each successive computation. The graphs depict a cut гХх—\, with u(%,y) and uh(j, y) sketched versus у for each modified £mod. (Of course, и is the same.)
476 L.B. Wahlbin Chapter V 25. Numerical crosswind and downwind spread Let Vh denote piecewise linears on quasi-uniform triangulations of Qh £ Q with dQk and Sfi,, at most 0(h2) removed. Let the form В be as in B4.13), and let uhe Vh satisfy В(»ьХ) = (/.х) forZen, B5Л) where Z = X + bXx- B5-2) The object of this section is to elucidate how the values of uh are influenced by /, cf. Lemma 23.1. However, we start with a result in L2 -based norms. Let again [Mb = ft1/2ll»xll§,i> + e$&KH2.D + IMiw>. B5-3) Theorem 25.1. For any s>0 there exists a constant К such that the following holds. Let uhe Vh satisfy B5.1) and let Q0 = {x^A,B1^y^B2}r\Q, B5.4) and Q^ ^{x^A + plnil/h), Bl~aln(l/h)^y^B2+ff\n{l/h)}nQ, B5.5) where the downwind spread is p = Kh B5.6) and the crosswind spread a is given by (кеЦ2й for h*lz^mod^h, <j = < Kh3/2e~U2 for h2 ^£raod ^h3'2, B5.7) [Kh1'2 for smod^h2. Then CWLo^Wllo^+WIU- B5-8) The crosswind spread described in Theorem 25.1 is probably not sharp, in general. In various situations of completely uniform meshes following characteristic directions, better results are known, see NAvert [1982]. For a discussion of this point, see again Johnson, Schatz and Wahlbin [1987]. Also, in Section 26 we shall give an alternative proof for the crosswind spread which, we hope, explains how the expression B5.7) "naturally" enters. The rest of the present section is devoted to a proof of Theorem 25.1. We note that the case of a = Khllz is contained in NAvert [1982], as is the fact that the downwind spread is p = Kh in all cases. Thus, the contribution of Johnson, Schatz and Wahlbin [1987] is to fine-tune the situation when fo2 <£mod=%/i. (This fine- tuning is, however, crucial in order to obtain pointwise estimates that are somewhat realistic.)
Section 25 A convection-dominated model problem All For notational simplicity in the proof of Theorem 25.1 we let u = uh, £=smod and replace the form В of B4.13) by B(v, cp) = h(vx, q>x) + s(vy, cpy) + (vx, (p) + (v, q>). B5.9) Following Johnson and NAvert [1981] and Navert [1982] we start by introducing a suitable cutoff function. Let g(s)e<£2(— oo, oo) with #(s) = ]s| for |s|^l and set <p(t) = exp(-<?(s))ds. B5.10) Then, as is easily checked, there exist positive constants с and С such that c^cp(t)^C forf^l, B5.11) <p(r) = e"' forol, B5.12) <p\t) ^0 all t, B5.13) \<p'(t)\ + \<Pn(t)\<C\<p{t)\ all г, B5.14) \q>"(t)\^-C<p'(t) all t. B5.15) Further, with the relative oscillation on a domain D defined as RO(D,u) = max|p(x)|/min|i)(x)|, B5.16) xbD xeD on any interval / of length 1, RO{l,q>) + RO{I,<p')^C. B5.17) Define then сф,,)^(^(^),(^). B5.18) From the properties B5.11)-B5.17) above it follows that cox^0, B5.19) and \rvnf |< jCP"^"'10 for a + j?<2, B5.20) 1 * у(°Ы\-Ср-с,+1о-1!а)х for «T^l, a + /?<2, B5.21) and that RO(T, со) and RO(T,cox) are bounded independently of h on any element T. B5.22) From the stringently given properties above it is clear that Theorem 25.1 would follow from the following: Let L= Гтах((/?//г)!/2,/т3/2/((Тб1/2),£/а2) for h2^e^h, B5.23) ~ (тах((/г//?I/2,/г1/2<х_1) for e^h2. B5.24)
47S LB. Wahlbin Chapter V Further set Q(u)=(h\\coux\\2+4couy\\2 нЧ|ож||2чЧ1(а>Ы1/2«112I/2- B5.25) Then for L sufficiently small, 0(иКС|!ю/||о. B5.26) In order to show B5.26) we rely on the following customized result on "super- approximation" in the present situation. Let Е = со2и~Ы(со2и) B5.27) where u — uhe Vh, со is as above, and Int denotes the piecewise linear interpolant into Vh. Lemma 25.1. There exists a constant С such that /j||co-1V£||0 + i|u)_1£||o<C/i1/2L6(M). B5.28) We postpone the proof of this lemma until the end of this section and proceed with the main business, viz. to prove B5.26). Note first that 0 = {{cou)x, сои) = (оохи, сои) + (ux, со2 и). B5.29) Hence, since B(uk, x)=W, %) with j£ = Int(cyzu)e Vh, Q2(u) = h{coux, coux) + e(couy, couy) + (сои, сои) — (coxu, coy) = h(ux, (со2 u)x — 2cocoxu) + e(uy, (co2u)y — 2cocoyu) + (u, co2u) + (ux,co2u) = B(u, ca2u) — 2h(coux, coxu)—2e(eouy, coyu) — B(u, E) — (/,£) + (f, (со2 и))—2h(coux, coxu) - 2e(couy, coyu) = Ii+-+Is. B5.30) Here, by Lemma 25.1 and since e^h, I1=h(aux,Q)~i Ex) + s(couy,co~1Ei,) + (coux,co'1E) + (cou,co''1E) ^[||0)Mje||o+F/ft)||o««y||o + ||CBM||o]x[fc||£B-1V£||0 + ||(B-1£||o] ^[h1'2||rauxl|0+e1/2||(«M,||o(s/hI'2+^1/2||cou||o]CL<2(M) ^CLQ2(u). B5.31) Again using Lemma 25.1, h^WcoflUWco-'EWo+hWco-'VEWo^Wcoffo + ChL'Q2^). B5.32) Further, by B5.20) and since p^h, h < II<»f\\oU\w" IIо +Ш(oxu ||0 +h\\ coux||0] ^\\cof\\0lC\\cou\\0+h\\coux\\0-]^Q2(u) + C\\cof\\l B5.33)
Section 25 A convection-dominated model problem 479 Next, by B5.21), U^2h\\a)ux\\0\\coxu\\0^h\\coux\\l + Ch\\o)xu\\l ^ + CL)Q2(u). B5.34) Finally, again using B5.20), /5 < Ceff "u || сон, || о ||ew || о ^s\\a)uyU + Cea~2\\cou\\^(i + CL)Q2(u). B5.35) It is now clear that B5.26) will follow if L is small enough, and, as we have said, so would then Theorem 25.1 with К large enough. It remains to prove Lemma 25.1. Proof of Lemma 25.1. On any triangle T we have by well known approximation theory, hl\VE\\0,T + \\E\l0iT^Ch2 £ ||^(й>2«)Но,г- B5.36) Since D2u = 0 it follows by use of B5.20) that \D2(co2u)\^\(w2)yyu\ + 2\(co2)yuy\ ^Ca-2\(o2ul + Ca'1\oJuyi. B5.37) For the second mixed derivative, DxDy(co2u) = 2atxa>yu + 2акохуи + 2a>coxuy + 2a>a>yux, B5.38) we use B2.20) and B2.21) to arrive at \DxDy(co2u)\^ С(т~ 1\соа)хи\ + С\й)сохиу\ + Сс~ х\ш2их\. B5.39) Similarly, \Dx(aJu)\ ^ Cp~l \cocoxu\ + C\coa)xuJ. B5.40) Inserting the above in B5.36) and employing B5.22), fc||u)-1V£||o,r + ||©-1£||0ir ^Clh2a-2\\cou\\0!T+h2a'1\\couy\\0iT + h2a-1\\coxu\\0tT + hz\\coxuy\\0tT+h2a-1\\cDuJ0tT+h2p-1\\coxu\\0,T + h2lcDxuJ0,T]. B5.41) We proceed to operate further on the last five terms. By B5.20) and since с > h, ft2ff-1|l©x"llo.r<CA^-Ip/2IINo)»lI/2«llo>r <СЙ/Г1/2||(«>|Юх|)>/2и|1о.г- B5-42) By inverse properties, and as above, A2||fl),«,|lo.r<CA||a>xu||0,r<CAp-1/2||(fi)|a,:e|I/2a||0ir. B5.43)
480 LB. Wahlbin Chapter V Further, since a^p, h2o~l \\couJ0_T^Ch2p-1 \\cozux\\0iT. B5.44) Finally, as for the first term treated above, Л2/о-1||й),«||0.г^СЛр-1'2||И(в,|I'2и||01Г B5.45) and by B5.20), Thus, and we consider now the case e ^h2, by B5.41) and the above, hlltB-'VEIIo.j. + Hw-^Ho,,. ^Clh2p-l\\coux\\0iT + h2a'x\\(auy\\0:T + h2a~2\\(au\\QiT + hp-V2\](co\ojx\I'2u\\0,T-} ^ Ch^Kh/pW2 i| coux ||0i7. +(^>£1/2)£1/21| amy ||0>r + (h^2/a2)\\ сои ||0,r + {h/pI12 IIM o)JI/2h ||0,r]. B5.47) Since L^/i3/2cr-2, in the case of e^h2, we obtain Lemma 25.1 upon squaring and summing over all elements. In the case of s^h1, we replace the second term on the right of B5.41) via the inverse property, ft2ff-1||(»M,||o.r<Cftff-1||u>M|lo.r B5.48) and continue as before. П 26. The numerical crosswind spread: An alternative proof In this section we will give another proof of the numerical crosswind spread result in Theorem 25.1. That theorem suffers from two deficiencies: The proof is rather hard to see through, in particular, the customized superapproximation property of Lemma 25.1 obscures how the expressions for the crosswind spread in B5.7) were arrived at. Secondly, in B5.8), the global term frs||/j|fi, although multiplied by an arbitrary power h\ may not be easy to estimate in very rough flows. On the other hand, there is a reason why we presented the proof of Theorem 25.1: It is short, comparatively speaking. Thus, in the present section, we shall only consider the question of crosswind spread and shall do so only if /=0 on the relevant domain. This brings the more elucidating and localized proof within manageable proportions. Again we consider the notationally simplified situation of и = uhe Vh, e ~ emod, and B(u, x) = h(ux, xx) + e(uy, xy) + (ux, x) + {u, x). B6.1) As remarked above, for pedagogical reasons we present the following result elucidating the role of the crosswind spread parameters in B5.7).
Section 26 A convection-dominated model problem 481 Theorem 26.1. For any s>0 there exists a constant К such that the following holds. Let Q0 = {Bl^y^B2}nQ B6.2) and Qt ={Bi-oW/hHy^B2+obi(W}r\a, B6.3) where с is as in B5.7). Let ueVh be such that B(u, x)=0for ^e Vh with support in Qq . B6.4) Then [МЪ^Кй'СМЗог. B6.5) Remark 26.1. The techniques employed give, when combined with a downwind spread analysis, that for и and Qq as in Theorem 25.1, The details for doing the complete analysis of this are lengthy, but not hard. Proof of Theorem 26.1. Let со =. 1 on Q0, while co( у) depends only on y, is supported within a cr-layer, and \cQU)(y)\^Ca~j, ./ = 0,1,2. B6.6) Then, since ((cou)x, сои) — 0 in our case, [ [«] ]Qo = h || ux\\ l0o + г || uy || 1По + || и || g,Qo < h((oux, coux) + s(couy, couy) + (сои, сои) + ({сои)х, сои) = /г(мх, (со2 и)х) + е(цу, (co2u)j,) + (их, со2 м) +(м, со2и)—2е{иу, сосоуи) = 5(и, со2и) — 2е(и , cocovu) =В(и,со2и — х) — 2е(иу,сосоуи). B6.7) Thus, taking x=Int(co2M), with Д|, the support of со, [МЪ„ *£C(h\\ ux Ho.no + £|| "y llo.ob + й|| u llo.ob) -h(<7~2||M ||o,flo + °~ ^ II "* llo.fli + <7" * II "у II O.flb) + |2е(му, йкоун)|. B6.8) We consider now the case of h2 < s < /г; in the case of г < h2 the y-derivatives of и occurring on the right of B6.8) are all replaced by и via inverse properties. (We leave the easy argument to the reader.) Multiply out the parentheses in B6.8) to obtain ten terms to estimate. We retire
482 them in order. L.B. Wahlbin G 04 ff h h , Ch\\ ux || о.яё,—II ux II о,яо < С-Щ ux || t,ab- о <т h e_1 Ch3 C/i||Mx||o,Q6-||My||o,n6^-^-ft||"«||§ini + -2-Flluyllo,o6) h c_ e/t Cs||MyПо.яь—2II" 11о,ой<-ZT"eH">I'2 - + C —H«llo,n6- С 04 0,йо С Cell м> llo.fii-11 их |1о,о^-7Ге11 ыу Ио.об +—-Г"(ЙИ "* По.дб)- 04 "■ Сей, Се || му Ио,до~е11 "у У о,об < С-£|| Щ Ц2,^. с <т Chapter V B6.9) B6.10) B6.11) B6.12) B6.13) B6.14) The next three terms are easily combined as (assuming as we may that Q'0 is a mesh domain so we can use inverse properties), 1 Сй2||и||о,п„ — ||u||o,Qi + -||«xl|0>ob + -|l"yllo,flc <C-|Nlo.n<0 a ° Finally, \Ъ{щ,@(оуи)\<:—Ё\\иу\\1<а-0+-^\\и\\о,а'0. Employing the above in B6.8), noting that й2/<т4 «S/i/c2 and ей/а2«$е/сг2, [M]£o< [[«]]&,■ e_1 Ch Ch3 Ch3 e" ^ + — + ТГГ + Т4-+С-2 B6.15) B6.16) B6.17) a a e а о We now desire to choose a so that B6.17) becomes [Wl^e-1». B6.18) Then the argument could be iterated with Q0 replaced by Q'0, Q'0 by Ql, etc., a number N times, where 7V = 0(ln l/й). Clearly, B6.5) would follow. To obtain B6.18), by B6.17) we need h/a small h3/a2e small /i3/G* small e/o-2 small, B6.19) B6.20) B6.21) B6.22)
Section 27 A convection-dominated model problem 483 or uP/h, c^/h^e-1'2, ap/h3/\ ар/г1'2. B6.23) An elementary investigation of these inequalities leads to the condition B5.7) in the case of h2^B^h. This proves Theorem 26.1 and has, we hope, clarified how the crosswind spread a was chosen. □ 27. Pointwise local error estimates Compared with the situations in Chapters III and IV, in the convection-dominated case there are no results in general asserting Lx stability of the streamline diffusion method (or, any relevant localization of it). Thus, obtaining localized pointwise error estimates will not follow the pattern of Chapters III and IV. (Certain first- order accurate methods do enjoy L00 stability, cf. e.g. Risch [1986].) The investigations of Johnson and Navert [1981] and NAvert [1982] give localized L2 estimates of the following type. Let Q0 = {x^A,B1^y^B2}nQ B7.1) and Qt = {x^A + Khln(l/h), B1-KJh\n{\/h)^y^B2+KsJh ln(l/A)} nfl. B7.2) Then if the streamline diffusion method is used with piecewise polynomials of degree r— 1 and if и is smooth on Q% (and the terms hS(uxx, Хх) + ЩЩу> Хх) are suitably retained in the form B, see the papers mentioned above for details), and /e L2(Q), ll«-"Jo,flo^C/f-1/2. B7.3) In two-space dimensions it would then follow by Sobolev's lemma on Vh that, if и is smooth on Oj, l!"-"Jo,TO,fio^C(ml//01/2/f-3/2. B7.4) As already remarked, in the convection-dominated problem we have no known L00 stable projections to use. The only theoretical results improving those of B7.4) are those of Johnson, Schatz and Wahlbin [1987] which we proceed to describe. The results are based on a careful investigation of how the approximate Green's function behaves. For this it is essential to have tight control over crosswind spread. Looking back at the expression B5.7) for the crosswind spread, we shall only consider the choice of emod that minimizes <r. Thus, choose en,od = maxfok3/2X B7.5)
484 L.B. Wahlbin Chapter V so that a<max(e1/2,ft3/4). B7.6) Of course, in making the choice B7.5) we have introduced artificial crosswind diffusion when e < ft3/2 and thus destroyed any formal accuracy properties above 0(/i3/2). We shall only consider the case of Vh being piecewise linears (r = 2) on a family of quasi-uniform triangulations. We point out that our results are probably not sharp, but they are the best that can be proven for general mesh families at the present level of technology of proof. Let (x0,y0) be a point of Q, let eco = h3/2, cf. B7.5), and set Оо=ф^о + 2КИпA//т), |y-jo|<2Ke!2dln(l/fc)}nfi, B7.7) where К is as in Theorem 26.1 with s = 6. Let further Nk2№0>+ \\duxx + suyy\\Lim+\\Vu\\LHa)+ \\fhHQ)^Q- B7.8) Then: Theorem 27.1. |(м-и»)(х0,УоI<СЙ5'41п3'2A/й), B7.9) where С depends on Q of B7.8) and parameters in the quasi-uniformity of the triangulations. Remark 27.1. Clearly, the result B7.9) misses being of optimal order by a factor of about h~3/4. Compared to the L2 result of 0(fo3/2) it misses being "optimal" by a factor of about h~1/4. This missing factor h~1/4 will be further elucidated below. We proceed to prove Theorem 27.1. Proof of Theorem 27.1. Let G = G{h°'yo)e Vh be the discrete Green's function, B(LG) = x(x0,y0) fovXeVh. B7.10) With P0u the L2 projection into Vh we have, see B4.18) et seq. for notation, (uh-P0u)(xo,y0) = B(uh-Pou, G) = {f, G)-B(P0u, G) = {f,G) + Per(«, G) - B(P0u, G) - Рег(и, G) = B(M-F0u,G)-Per(u,G). B7.11) Let now Q'0^Q0 be as in B7.7) with 2K replaced by K. We claim that l|G||ll00,ONO1,<a.3. B7.12) To see this let eh be a linear function on the element Г containing (x0, y0) such that (<5*> x)= х(хо> Уо) f°r X linear on T, with 5h extended to be zero outside T. Then B7.10) amounts to B{x, G) = (Sh, x) for xe F/, so that by the counterpart of Theorem 25.1 (with the wind direction reversed),
Section 27 A convection-dominated model problem 485 \\G\\0w0<Ch6\\eh\\o- B7.13) Since the dimensions involved are much greater than h and since G vanishes outside Qh, we may assume that Q\Q'0 is a mesh domain. Since j|<5J < Ch'l, B7.12) follows by inverse properties. Let now BD(v, (p) denote that the integrations in the form В are extended only over the domain D. Then, Яйчяь(и-Р0м,е)<(||М||1,1 + ||,Р0«||1.1)||С||1,00.ОЧОо. B7.14) Since by assumption \\u\\ltl^C and since (by inverse assumptions) ||PoM||ia<q|i>||lj2^C/r4|i>|lo^C/ri«!|0<C/2-\ B7.15) we get from B7.12), Bnxab(u-P0u,G)^Ch2. B7.16) For the remaining part of В we have Bnb(u~P0,G) [{h + 8){u- P0u)xGx + emod(u - P0u)yGy - (и - P0u)Gx Q'oniih + (u-P0u)G] ^CmW(u~P0u)\\0^Qbnnh+\\u-P0u\\0^n,onQJ-I B7.17) where I=\\GJL^b) + e^h-4G,\\Lt(m)+\\G\\L>my B7.18) Since meas(fio) ^ C^Uou ln(l/A) we obtain by Cauchy-Schwarz' inequality (this is the point where it is important to have tight control over crosswind spread!), ^СеГа1п1/2A//0/, B7.19) with /=l|Gx|lo + emodA-1||C?JP|lo + IIG|lo- B7.20) Hence, using the highly local properties of the L2 projection from Chapter II, Bnb{u-P0u, G)<CA2eJ& ln^2(l/h)f. B7.21) We next estimate the perturbation form Рег(и, G) = Регх(и, G) + Per2(u, G) in B7.11). For the first part we have again using B7.12), Per^u, G) = H5uxx + myy, Gx) ^Ch2\\u\\^(nJGx\\Li(no) + Ch\\duxx + euyy\\Li(n)h3 ^Ch2s^d In'lHl/hnGJo + Ch* B7.22)
486 L.B. Wahlbin Chapter V and for the second, after integration by parts over Q\Q0, Per2(u,G) = eper(uw, G) (Щу>0)а0- uyGy+ uyG n\Q0 Mir, ^ C£pererd ln^2(l/ft)|| G||0 + Ceptrh3. B7.23) Collecting the above into B7.11), \(u-uh)(x0,y0)\ ^CA^^In^l/fcKHG.IIo + e^/.-MlG^lo + IIGIIo] + Ce^ejfi, ln1/2(l//r)|| G || о + Ch2. B7.24) We shall next use the following lemma, the proof of which will be postponed. Lemma 27.1. II Gx!|0 < СЛ " 3/*e^* 1пA/й), B7.25) \\Gy\\0^Ch-llAs~Jtln(l/h), B7.26) ||0||0^СЛ-1/4е^41пA/й). B7.27) Admitting this lemma for the moment, we have from B7.24) that \(u~uh)(x0,y0}\ ^ ChhUt, ln3'2(l/h)[h- 3>Ae-0T + Л~ 5'4eJ& + h-^e-BTl + Свреге^п3/2A/Ь)Г1/4е^ ^Ch5/*ln3!2(l/h), B7.28) since h'sl4e^d^h~3,4'eUad and since eper^/i3/2 in our present case. This would conclude the proof of Theorem 27.1. □ It remains to prove Lemma 27.1, and for this we shall need the following variant of Sobolev's inequality. Lemma 27.2. For ve H&Q), 1Ио.р^Ыо!1КИ#Ир (meas^I"). B7.29) Proof. Our proof is a minor modification of a standard proof of Sobolev's inequality. Let we <&${Q). Then IM*. JOI <i | |wX'(x', y)\ dx' or \w(x, yM\ \wr(x, y')\ dy'. Thus,
Section 27 Л convection-dominated model problem 487 \w(x, y)\2^i \wx{x', y)\ dx' \wy.(x, y')\ dy'. Integrating (and removing the primes), \w(x, y)\2 dx dy^U \wx(x, y)\ dx dyj( \wy{x, y)\ dx d Л Apply this to w=|d|p so that Dw=p\v\p~1sgn(Dv). Using also Cauchy-Schwarz' inequality, \v\2p^p2 vrl\v> krV ^>JolMo M2'*' By Holder's inequality, N2(p~14N|2.£~1,meas(£I/'' and hence Mll^vJol^Uvf^^-ip'meeLsiQ)^ or, \\vfL2p^\\vx\\0\\vy\\0-ip2meas(Q)l">. Changing 2p to p completes the proof of Lemma 27.2. □ We proceed now to prove Lemma 27.1. Proof of Lemma 27.1. We have A||Gx||g + enod||G,||§ + ||G||5 = B(G,G) = G(xo)j'o). By well-known inverse estimates, and then using Lemma 27.2, G{x0,y0HCh-2i'\\Gh,P^Ch-^p\\GJh'2W,\\b'2. Choosing р = 1пA//г) and A = (h/emodI/4 below, Gix0,y0)^Cln(l/h)\\Gx\\h'2\\Gyn12 ^Cln(l/h){A\}Gx\\0 + A-l)\Gy\\0) ^CMlMAh-^h^GJo + A-'B^ih^GyU) ^hWGJl+^E^WG.Wb + Cl^iyh^h^ + s-'A-2). B7.30) B7.31) B7.32)
488 L.B. Wahlbin Chapter V Hence, from B7.30), fe||Gje|Je + eimKl||G,t|g+l|G||g^CIii2(l/h)(Aemod)-1-/2 B7.33) which proves Lemma 27.1. □ We wish to end this chapter by pointing out the feeble nature of the results above. What we have stated above, for general meshes not uniform and not aligned to the characteristics of the limiting hyperbolic problem, exhausts the present knowledge of crosswind spread. Comparing the results of Section 23 for the continuous problem with those of the discrete problem, one is humbled. Compare e.g. B7.27) with B3.29). Numerical experiments suggest that the theoretical results are way too conservative. As an example of the sorry state of affairs, let us return to the maximum principle B3.2) in the continuous problem. The best known result for the numerical scheme (for piecewise linears and when smod = min(fo3/2, e)) is as follows, displaying a loss of ft-1/\ essentially. Theorem 27.2. Mo.oo^C/r^ln^l/Wllo,^ B7.34) Proof. We have with G the discrete Green's function, \uh{x0,yo)\ = \B{ukiG)\ = \[f,G)\^\{f,5)o0\ + \(f,0)^oo\ with Q0 as in Theorem 27.1. Thus, by inverse estimates, assuming Q0 is a mesh domain, o,oo,fi0 Halloa,no + 11/11 We have seen that HG|lo.i.o\no<Cfc', while by Cauchy-Schwarz' inequality and Lemma 27.1, 1|С||0,1,0о<Се^1п1/2A/Л)||С||о^СЙ-1'41п3/2A/Л). This proves the theorem. □ Finally, we have already remarked that the present investigations do not follow the typical pattern from Chapters III and IV, in that there is no basic problem having pointwise stability to start from. Thus, perhaps, it is not surprising that in order to give even a weak global L1 estimate in the presence of singularities one needs to use two local tools developed above. As an example the reader may consider the global L1 estimate of order almost 0{h112) given under realistic assumptions in Johnson, Schatz and Wahlbin [1987, Corollary 3.6].
Section 27 A convection-dominated model problem 489 Note added in proof. In the paper "Pointwise error estimates for a streamline diffusion finite element scheme" by K. Niijima (Preprint, Kyushu University, Fukuoka, 1988) the author replaces hs/4 in B7.9) by h11/s. The key is an improvement of the estimates of Lemma 27.1.
Chapter VI Time-Localized Behavior in Second-Order Parabolic Problems 28. A linear model problem: Full advantage of the smoothing effect We shall consider only the simplest model problem and shall be content with merely giving references to relevant extensions. Thus, let Q be a bounded domain in UN with smooth boundary and consider the problem of finding u(x, t), xeQ, t^O, such that u, = Au, xeQ, £>0, u(x,t)=0, xedQ, B8.1) u(x,0) = lix), xeQ. In the case that v is smooth and compatible enough so that u{t) is sufficiently smooth on the closed interval T, then for uh(t) a semidiscrete finite element solution, with r the optimal order of approximation in Vh, ||M(t)-«»(t)||o<CAr||»||r. teT, foTveD((—A)rl2) which thus requires smoothness and the compatibility conditions A}v\aB = 0 for j<\r. The object of this section is to elucidate what happens if initial data v are not smooth or compatible enough for the above to hold. As is well known, even if initial data v(x) is rough, or incompatible, the solution u(x, t) for f > 0 is smooth, viz., as smooth as 80 allows. Assume in the rest of this section that ЭЙ is infinitely differentiable, for simplicity. The question now is whether the finite element solution to B8.1) takes advantage of this smoothing property, i.e., whether in the case of rough (or incompatible) initial data v, the error in the Galerkin finite element solution is small for positive (and bounded) time. Our analysis (indicated in a simple case) will be given only for the semidiscrete (continuous in time) finite element solution. The analysis is based on having a method for approximating the associated elliptic problem. Let T denote the solution operator to the problem -Av=f in Q, t;=0 on dQ B8.2) 491
492 L.B. Wahlbin Chapter VI so that v- Tf. Let {A;}f and {q>j}? be the eigenvalues (in nondecreasing order) and L2 orthonormal eigenfunctions for this problem. We can then introduce the associated HS = H%Q) spaces, /oo \l/2 Nl.=(l*5#) > Pj=(">><pj)- B8-3) Since for the solution of B8.1), CO u(x,t)= £ Pfi-^'vjix), P] = (v,<Pj), B8.4) j=i an exact statement of the smoothing property is that with E(t) the solution operator to B8.1) so that u(x,t)=(E(t)v)(x), ll£@llp,e*scr(p-«)/2, p>9- B8-5) We assume now that we have a family of approximations Th: L2-> Vh to the inverse of (—A) such that Th is self-adjoint, positive-semidefinite on L2, and positive-definite on Vh, B8.6) and there is an integer r > 2 such that !ir,-T||0i^C^+2 for0<^r-2. B8.7) The integer r is, of course, the optimal L2 error. Note that equation B8.1) may be written as £,7m + u = 0, h@) = i>. B8.8) Correspondingly, we let the semidiscrete finite element solution be given by DtThuh + uh=0, uk@) = vh B8.9) with vhe Vh given. We shall assume that with P0 the L2 projection into Vh, vh = P0v. B8.10) We then have the following estimate showing that for positive time the semi- discrete error in approximating B8.1) is of optimal order in the linear problem. Here, eh(t) = {uh-u)(t). Theorem 28.1. Assume B8.1), 80 smooth, B8.6), B8.7) and B8.9), B8.10). There exists a constant C(t*) such that for 0<t^t*, i|eft@||o^Cfcrr'/2N!o. B8.11) Before proving Theorem 28.1 we give references to the literature. The fact that numerical solutions to parabolic problems take advantage of the smoothing
Section 28 Second-order parabolic problems 493 property was noticed in the context of finite difference methods in Juncosa and Young [1957]. The theory for difference schemes was further developed in Kreiss, Тномёе and Widlund [1970], LOfstrom [1970] and Тномёе and Wahlbin [1974], cf. Brenner, Thomee and Wahlbin [1975, Chapter 4] for a summary. In the finite element situation Theorem 28.1 is due to Helfrich [1974], cf. Blair [1970] for earlier thoughts in this direction. Our write up (with the formalism of the operators Tand Th) is taken from Bramble, Schatz, Тномёе and Wahlbin [1977]. Extensions (with identical statement) of Theorem 28.1 to situations with time- dependent coefficients and non-self-adjoint elliptic forms are given in the basic paper of Helfrich and also in Huang and Тномёе [1981], Luskin and Rannacher [1982a] and Sammon [1982]. The extension to nonhomogeneous equations with a right- hand side smooth near the time of interest is in Тномёе [1980]. The papers referenced above all give error estimates in spatial L2 norms at a given time. For pointwise error estimates the translation technique of Bramble, Schatz, Тномёе and Wahlbin [1977, Section 4] is easy to apply also for non-self-adjoint problems where the elliptic operator has time-dependent coefficients. Also, the estimates can be further localized in space, see Bramble, Schatz, Тномёе and Wahlbin [1977, Section 5], cf. also Nitsche [1979] and Тномёе [1979]. Further, and sharper, pointwise error estimates with the smoothing property are given in Schatz, Тномёе and Wahlbin [1980] and in Тномёе and Wahlbin [1983]. For the analogues of Theorem 28.1 with time discretization, see Huang and Тномёе [1982], Sammon [1983], which in particular contains a very thought- provoking observation concerning higher-order schemes, Luskin and Rannacher [1982b], and for pointwise estimates, Schatz, Тномёе and Wahlbin [1980, Section 4]. An analogous use of a smoothing property in the case of the Euler-Poisson- Darboux equation is found in Genis [1982]. We remark that in all references above {and in Theorem 28.1) it is assumed that the L2 projection of initial data, P0v, is evaluated exactly. The effect of numerical integration brings the situation close to that of finite differences; this is thoroughly elucidated in Wahlbin [1980], and shows that in order to retain the beneficial effects of the smoothing property one should, if possible, subtract off singularities in initial data v and treat the evaluation of the L2 projection of those separately and accurately. I.e., in practice one needs a certain amount of data preparation to fully benefit from the parabolic smoothing effect. In the same spirit is the counterexample in Sammon [1983] just referred to. He gives simple model numerical examples with time-dependent coefficients where smooth initial data leads to fourth-order convergence but nonsmooth initial data only gives second-order convergence for positive time. This again indicates that the semidiscrete results in the presence of perfect integration may not always carry over in practice. Further, the reason why the choice vh = P0v in B8.10) is appropriate in order to achieve the smoothing gain is explained in Тномёе [1972] and Wahlbin [1981]. We next give the proof of Theorem 28.1, following Helfrich [1974] and Bramble, Schatz, Тномёе and Wahlbin [1977].
494 L.B. Wahlbin Chapter VI Proof of Theorem 28.1. Let /x^—A/1 and (pj denote the eigenvalues and L2 orthonormalized eigenfunctions of T=(—A)"l. Then with fij = {v, <pj) the Fourier coefficients of initial data v, u(t)=^e-^^j(pj(x). B8.12) J Introducing the resolvent Rz(T) = (z — T)~l we shall use that the L2 projection onto the eigenspace corresponding to the eigenvalue \i can be expressed as X Hj(Pj(x)=^Ux{T)vdz, B8.13) where у is a curve in the complex plane enclosing only this eigenvalue. It follows that J ~ I V e-tlzRz{T)vdz, B8.14) where Fj is a curve enclosing only the eigenvalues \ix,.,. \l} (for fiJ+1 Фц/). Taking limits, for u(t) = E(t)v the solution of B8.1), u{t) = E(t)v = — 2m e-"zRz(T)vdz, B8.15) with Г the positively oriented curve defined by |argz| = in. We note that since we have for |argz| = i7t, IIR^DIU^IImzr^v^lzr1, B8.16) and 1|Л,(ТI1„+2^л/2. B8.17) Corresponding to the eigenvalues and eigenfunctions fij and q>j for T, let njih and <Pjj be those for the approximate elliptic solution operator Th. Hence, for uh(t) the solution of the semidiscrete problem B8.9), uh(t)= £ e-vwPj.HVjj, B8.18) where /?,-,* = (и*, <Pj,h)- In particular, since vh = P0v, the L2 projection of v, we have Pj,h={v> <Pj,k) and we obtain as above, with Eh(t) the discrete solution operator,
Section 28 Second-order parabolic problems 495 e-,lzR2(T„)vdz. B8.19) uh(t) = E,,(t)v= — Since Th is self-adjoint on L2 we have as in B8.16), ИВДЛко^гГ1. B8.20) Combining B8.15) and B8.19) we have for eh(t) = uh(t)-u(t), eh(t) = Fh{t)v=Eh{t)v-E{t)v B8.21) where the error operator Fh(t) is given as e-nR,(Th)-K(T))dz. B8.22) 2m We next derive the following lemma. Lemma 28.1. For 0^q^r—2, \\Rz(Th)-Rz(Tn0,q^Ch"+2\zr2 forzer. B8.23) Proof. As a preliminary step we prove that if Th satisfies B8.6), B8.7), then \\Rt(Th)-R(T)\\o,,^Ch,\z\-1 foiO^s^r. B8.24) To see this we use the resolvent formula to write Rz{Th) - RZ(T) = Rz(Th)(Th - T)RZ(T) B8.25) so that by B8.20), B8.7) and B8.17), ||К,(Гл)-Л,(ГI1о, <1|Л,(Г4)||о,о1|Г*-Г||о.г-2||Яж(Г)||г-2^ «гС/гНгГ1 B8.26) which is the desired result B8.24) for s = r. Also, by B8.20) and B8.16), ||Л,(Тл)-Л1(ЛНо.о <1|Л,(Г»)||01о + ||Л,(ГI1р,о<С|гГ1. B8.27) Our spaces defined by eigenfunction expansions are easily seen to be interpolation spaces (Bergh and Lofstrom [1976]) and thus B8.26) and B8.27) give B8.24) by interpolation. In order to prove B8.23), we write, again using the resolvent formula, Rz(Th)-Rz(T) = Rz(T)(Th - T)Rz(T) + Rz(T)(Th - T)(Rz(Th)-Rz(T)). B8.28) Hence, by B8.16) and the preliminary result B8.24),
496 L.B. Wahlbin Chapter VI \\RE{Th)-Rt(T)\\o« ^1|Л,(Г)||о.оНГй-Г||о.в||Яг(Г)||м + 1|Лг(Т)Но,о1|Г»-Г||о.о1|Я,(Т*)-Я2(Г)||о., ^Chq + 2\z\-2 B8.29) which proves the desired result B8.23). □ Continuing with the proof of Theorem 28.1, we have Fh(t) = ~L-nRATh)-Rz(T))dz г so that by the above Lemma 28.1, \Fh(t)\\o,9^Ch^2 |2|-2e-c,/|z|dj.| = c^ + 2rl £8,30) Writing next F„(t) = Fh$t)E&) + E{it)Fk{h)-(Fktit)J B8.31) and noting that it suffices to consider h2t~2 < 1, we have for the first term on the right, by B8.30) and B8.5), Н^(^)£(^I1о,о<11^Й011о.,-2 1|£Й011г-2.о<СЛ'1-г'2. B8.32) Since the second term on the right of B8.31) is the adjoint of the one just treated, its 1} norm is the same. For the last term on the right of B8.31), by B8.30) for q = 0, II(F„№2 llo.o < II F*(i0 ll§.o *SCh2f1 \\FSt)llo.o- B8.33) Thus, from B8.31)-B8.33), II Fh(t) ||o.o 4Ch't~ + Cihh-^W F„Ut) Ho.o. B8.34) By repeated application s times, IIF„(t) ||o.o <CWt' + C(h2rj|| Fh((iyt)||o,o B8.35) from which the desired result B8.11) follows if we take s^^r. □ 29. A nonlinear model problem: Restricted advantage of the smoothing effect In this section we consider the semilinear problem of finding и = и(х, t) such that u,-Au=f(u) infix/, / = @,f*], м = 0 onQflx/. B9.1) u@) = v in Q,
Section 29 Second-order parabolic problems 497 Here Q is a bounded domain in UN with a sufficiently smooth boundary. As always when solving a nonlinear problem we want some guarantee that the solution does not blow up in finite time, at least not inside / in our case. We assume thus, cf. Smoller [1983, Chapter 14, Section B], that there is an a priori bound В so that \u(x,t)\^B, xeQ, tel. B9.2) In many situations, following the theory of invariant regions just referred to in Smoller's book, В is a priori computable. Thus we may artificially change f(y) for \y\ >B and assume that / is a smooth function for ye R with \f(y)U\fb)\<B forj/eR. B9.3) (Certain finite element situations where this a priori change is not necessary are given in THOMEEand Wahlbin [1975].) For spatial discretization of B9.1) let Vh<zEl(Q) be such that Theorem 28.1 holds in the linear homogeneous case. For simplicity in this outline, assume further that the basic method for the elliptic problem is the ordinary Galerkin method, i.e., that the semidiscrete solution щ: T-+Vh is defined by Kr,z) + (VMft,Vz) = (/("ft),Z) foTxeV„, uh@) = Pov = vh, where P0v is the L2 projection of v into Vh. In the case that v is smooth and compatible enough so that u(i) is sufficiently smooth on the closed interval T, then \\uh(t)-u(t)\\0^C{u,B)hr forte/, B9.5) where r is the optimal order of approximation, cf. Wheeler [1973]. The object of this section is to elucidate the cases when initial data v is not smooth enough, or incompatible, so that B9.5) cannot be guaranteed to hold. We shall merely give an outline of the most surprising results of Johnson, Larsson, Тномёе and Wahlbin [1987]. The main result is the following. Theorem 29.1. Let \u(x, t)\ ^B, xe Q, teT; let vh = P0v and let uh be given by B9.4). Then with С depending on B, t* and approximation parameters for Vh, ||«»(r)-«(t)||o<C/i2(r4|log(A2/t)l)> tel. B9.6) Proof. Simple energy arguments show that u{t) and uh{t) are bounded in L2 so that B9.6) trivially holds for t^h2. With E(t) the solution operator to the linear homogeneous problem B8.1) and Eh(t) its semidiscrete counterpart, we have by Duhamel's principle, u(t) = E(t)v + E(t~s)f(u(s))ds, B9.7)
498 and L.B. Wahlbin Chapter VI u„(t) = Eh(t)vh + Eh{t - s)P0f(uh(s)) ds. B9.8) Hence with Fh{t) = Eh(t)P0 — E(t) the error operator for the linear homogeneous equation, for eh(t) = uh{i) — u(t) the error in the semilinear problems, Fh(t-s)f(u(S))ds. B9.9) eh(t) = Fh(t)v + Eh(t - s)[/(u„(s)) -/(u(s))] ds + From the proof of Theorem 28.1, see B8.30), it is seen that, regardless of the optimal order r^2, IFtiMo^Chh-^vWo^CBtfr1, B9.10) and !|£ft@w||o^C||w||o. B9.11) Thus, since | /' |< B, see B9.3), and since the theorem holds for t < h2 as noted above, \\eh(t)\\o^Chh-l + C л* t о л2 |e*(s)||0ds t-h2 + J )(\\Fh(t-s)f(u(s))\\0)ds 0 l-h2 l-\fl ds t~~s ^Ch2t~l + Ch2 + C \\eh{s)\\0ds + Ch2 ^Ch2t_1 + Cfc2log(t/fc2) + C | ||eh(s)||0ds. й2 Letting <р(г) = Д21|eA(s)|| о we thus have (p'{t)~-C(p{t)^Chh-l + Ch2\og(tlh2) for ft2<t<t*, <p(fc2) = 0 and hence, t </>(£)< С eC(t~s)(^s-J + /i2 log(s//i2)) ds< Ch2 log(t/h2). Inserting this into B9.12) completes the proof. □ + Ch2 B9.12) B9.13) B9.14)
Section 29 Second-order parabolic problems 499 Note that if the optimal order of approximation is r>2, this is not reflected in Theorem 29.1. The result of Theorem 29.1 is sharp in the following sense: An equation can be exhibited such that the following holds: If for all и with \u(x, t)\ ^B for xe Q, te I, we have for a fixed t0, 0<to^t*, \\uh(t0)-u(t0)\\0^C(t0,B)h°, B9.15) then a cannot be greater than 2. This should be contrasted with the linear situation where, provided numerical integration is used carefully enough in evaluating the L2 projection of initial data, Wahlbin [1980], the error is of optimal order for positive time. A complete counterexample in spline spaces is given in Johnson, Larsson, Тномёе and Wahlbin [1987, Section 6]. However, it is easy to understand why the order a is restricted to 2 in B9.15), regardless of the optimal approximation order of the finite element spaces Vh. For, note that Theorem 29.1 would hold, with /i = (n — l)-1, if trigonometric polynomials of degree (n —1) were used on [ — л, я] with periodic boundary conditions and vh the Fourier series of v. (The proof of this is exactly as the proof of Theorem 29.1.) Further note that trigonometric approximation has unbounded optimal order r\ Finally, note that Theorem 29.1 holds as well for systems. Then take м = (м15 и2) as Ui,, = UliXX+f{u2), ,t = ,xx B9.16) on [—я, я] x @, oo) where / is smooth with f(y) = 4y2 for|v|^l. B9.17) Further, take Hj@) = 0, u2@) = cos(nx) B9.18) so that by elementary checking, u2{x, t)=exp(-n2t) cos(nx), 1— exp(—2n2t) Ml(x, t)= -s [1 + exp(-2,A)cosBnx)]. Using for approximation, trigonometric polynomials of degree <(и — 1) and, with Fourier series for initial data, the approximate solutions are identically zero. Thus, B9.19) describes the error which is, indeed, exactly of order n~z = 0(h2) for t positive. Note that in B9.19), a high Fourier mode (cos(nx)) has aliazed itself by nonlinear interaction into a low, indeed constant, mode (viz. 1/w2). Also note that the example is in complete contrast to the linear situation, regardless of whether the equation B9.16) is regarded as an analogue of the linear homogeneous or inhomogeneous case.
500 L.B. Wahlbin Chapter VI Further results are given in Johnson, Larsson, THOMEEand Wahlbin [1987]. In particular, results when initial data v are somewhat smooth and compatible are derived, and also pointwise results in space, and results for gradients. For further investigations concerning the smoothing effect in nonlinear problems we refer to Heywood and Rannacher [1988] for the Navier-Stokes equations; they also noted an upper bound in their proof for the rate of convergence with incompatible data similar to our counterexample. For time discretizations, see Crouzeix and Thomee [1987b]. Interesting uses of Theorem 29.1 to the long time behavior of Galerkin approximations are given in Larsson [1985] and Hale, Lin and Raugel [1988].
Chapter VII Superconvergence 30. Difference quotients on translation-invariant meshes for approximating derivatives This section is based on the fundamental investigation in Nitsche and Schatz [1974, Section 6]. For simplicity we consider the model problem -Au=f in £3, u = 0 ondQ, C0.1) In two or more space dimensions, results on superconvergence are restricted to uniform, or nearly uniform, meshes, cf. Section 5. In the present situation we consider an interior subdomain AuQ and let with Vh finite element spaces, Vh(A) = {xeVh:suppx^A}. C0.2) The approximation uk e Vh we are seeking is required to satisfy (V(M-u„)(VX) = 0 xePMi), C0.3) where A&Ay^Q. We let as usual r denote the optimal order of approximation in LP spaces. The mesh is uniform and translation-invariant on a neighborhood of A in the sense that with Tlv{x) = v(x + vh), v a multi-integer, with A^A, TlXeVh(A,) forxetAA). C0.4) Remark 30.1. Of course, the mesh parameter h may be reparametrized. Thus for this mesh configuration (see Fig. 30.1), we have translation invariance if h is replaced by 2/z in the translation operators. Also, the translation invariance need not be with respect to the coordinate axes; in any direction with translation invariance we may consider corresponding difference quotients. To approximate Dxu, a. a multi-integer, consider a difference operator Diu= X CJh)Tiu, C0.5) for some finite M so that C0.4) holds for |v|<M. We assume that \(D«u -Dlu){x0)\ *£ Chr\\u\\ |e| +r^S{xoM), C0.6) 501
502 LB. Wahlbin Chapter VII Fig. 30.1. where S{x0,M) = {x:\xi-x0i\^Mh}, C0.7) so that the order of accuracy of the difference approximation matches that of the finite element space. Set now e = u-uh. C0.8) The basic observation in Nitsche and Schatz [1974] is that (VDSe,VZ) = 0 iorXeVh(A). C0.9) For the form on the left equals (Ve, V(Df x)) and, by C0.4), £f *e ^Hi) so that C0.3) applies. The development in Nitsche and Schatz' paper now proceeds via local L2 or energy estimates. In this presentation we shall consider pointwise estimates and make the assumption, The pointwise local estimate of Theorem 10.1 holds. C0.10) This assumption C0.10) is verified in Schatz and Wahlbin [1977] for a large class of finite element spaces. It follows from C0.9) with x0 in A, \Dte{x0)\^C\n(\lhYmm\\Dlu-zU^A + C\\Dlu-x\\--s,u. C0.11) xeVh
Section 30 Superconvergence 503 Assuming that uecer+'l'l\A1), by approximation theory, and by C0.6), rnin||Z>£u-xllo,oo„4 XsVh ^\№-D*)u\\0taa,A + niin\\D*u-xh,*,A xeVh <Сйг1|н||о,0О.|«|+,.л1- C0.12) Hence, again using C0,6), \(D*u-Dluh)(x0)\ ^\(D«-D°h)u(x0)\ + \D*he(x0)\ ^Cln(l/hyhr + C\\D«he\\-s.iA, C0.13) For the last term on the right of C0.13), |№||-5<,л= sup (Dfccp). C0.14) Ыж'.Л = 1 For each such <p, ^Q\e\\-s>Al\\<p\\s;Al. e.g. for s' = s + \a\+^N + 1. Thus, from C0.13) and C0.14), we have the following theorem. Theorem 30.1. Assume C0.3), C0.4), C0.6), C0.10) and that ue Wr+M{Ax). For any s, \(D*u-Dtuh)(x0)\ ; ^C\n{l/hy}f\\u\\r+lxUo0iAl + C\\e\\^,Al. C0.15) A similar estimate was given in Bramble, Nitsche and Schatz [1975]. The upshot of this is that, in the case of locally translation-invariant meshes, one should take for an approximation of a derivative D"u a suitable difference quotient, not a derivative, of the finite element solution uh. The error is then of the order governed by the minimum of the local approximability (hr if ue Я?г+М locally) and и — uh in any negative norm, the latter accounting for global influences such as e.g. pollution effects in problems with reentrant corners. Note that in general, say for first derivatives, Vu — Vma is only of order hr ~J. Thus, the result above is "superconvergent", the global effects permitting. How to estimate |[« — uh\\ -s,Al in various situations has been clarified in Chapter III. Clearly, extreme care is called for. As in the one-dimensional case of Chapter I, there may be singularities in data, such as right-hand sides or coefficients, preventing the negative norm from being of sufficiently high order to entail superconvergence. In the multidimensional case, the domain may also contain reentrant corners, i.e., be singular, so that superconvergence in the difference approximation to derivatives is obstructed, see Section 13. A more subtle point is that even if all data in the problem
504 L.B. Wahlbin Chapter VII are smooth, the negative norm may not be of order hr because a curved boundary is not fitted well enough. In the next section this problem is even more pronounced, since there higher-order approximations are involved, and we refer the reader forward to the first paragraph of Section 31 for further amplification of this point. We further remark that the investigation in Nitsche and Schatz [1974, Section 6] also covers variable smooth coefficients in the basic elliptic operator. It is easy to modify the above pointwise error estimate to cover that case. A frequently rediscovered special case of Theorem 30.1 occurs in the plane case when the mesh is triangular and uniform on A, and translation-invariant (see Fig. 30.2). Consider piecewise linear finite elements on this mesh. To approximate ux and uy we take forward difference quotients, ux ~ dx,ku = (u(x + h,y)- u{x, y))/h, uy~dy,hu =(u{x, y + h)- u(x, y))/h. C0.16) x A Fig. 30.2. These should be considered as approximations for ux(x + ^h, y) and uy(x,y + %h), respectively, so that they are of second-order accuracy. Thus, if ue^iAi) we have by Theorem 30.1, {ux{x0 + ^h, y0)-bXthuh{x0, y0))\ ^ С ЫA/h)h2 + C\\u-uh\\ C0.17) and \(uy(x0, y0 + ^h)~dythuh{x0, y^l^Clnil/fyh2 + C\\u-uk\\ -SiAl. C0.18) In particular, consider two adjoining triangles such as, e.g. in Fig. 30.3, and take (xo,yo)i in C0.17) and (x0, у oh in C0.18) as indicated.
Section 30 Superconvergence 505 <xO'Voh P midpoint of common diagonal Since in the piecewise linear situation, 9*,(.ил((х0,3^0I) =(«*(*(> + ^ У о) ~ щ(*о, УоШ = («л(х0 + й, д>0) ~ Уи(*о + jh, y0))/h + (uh(x0 + \h, y0) - uh(x0, y0))/h =Ы*Т-+%и„)хТ+ C0.19) and similarly for the y-difference with (x0,у 0J, we have the following. Corollary 30.1. Let the triangular meshes be uniform and translation-invariant with piecewise linears on the plane interior domain Ax. Let ue ^3(АУ). Then the average of VwA taken over two neighboring triangles approximates Vu at the midpoint of their common diagonal to order C\n(l/h)h2MMl + C\\u-uh\\-s,Al. for any s. C0.20) We leave it to the reader to account for how the distance between x0 and QA x enters; the guidelines of Chapter III should suffice. Extensions of the result of Corollary 30.1 to, for example, meshes which are "almost uniform", that is, perturbed 0(/i2) from the uniform situation, occur, e.g. in Lin, Lu and Shen [1983] and Lin and Xu [1985]. In the latter paper it is shown how any domain can be triangulated into piecewise almost uniform meshes and how the averaging techniques apply in that case. Remark 30.2. Note that the method of taking difference quotients works up to straight boundaries in two dimensions, cf. Example 10.2, or for Poisson's equation in more dimensions, Example 10.1, for tangential derivatives provided the mesh is translation-invariant in the tangential direction. For approximation of the normal derivative at the boundary, one may try reflection but very few mesh configurations would work, cf. Remark 30.1 for one of the few.
506 L.B. Wahlbin Chapter VII 31. Higher-order local accuracy by averaging: The JT-operator The K-operator, introduced in Bramble and Schatz [1977], is a fairly general method of generating higher-order approximations on subdomains where the mesh is translation-invariant, the solution is locally sufficiently smooth, and provided the global error in a low negative norm |[e||_Si0 is of sufficiently high order. We have described the method, and its analysis, in Section 5 in a one-dimensional case. We point out that in the multidimensional case, as seen in Chapter III, many things can go wrong in estimating the negative form ||e|| _SiJJ to order higher than 0(hr). As in the one-dimensional case there may, of course, be singularities in data away from the point of interest which prevents the negative norm term from being of higher order. However, in the multidimensional case the boundary of the domain may also be "singular", cf. Section 13. The most subtle point is, however, that even if all data in the problem are smooth, ||e|| _s>n may still not be better than 0(/f) because a curved boundary SO may not have been fitted well enough. A common pitfall is to assume that mm\\u-x\\1^Chr-1\\ul.a. C1.1) xeVh Then, in a smooth problem, a standard duality argument would give ||e||_(r_2),0*SCft2<'-1>, C1.2) where for r^3, 2(r—l)>r. However, C1.1) does not hold in general. Thus, when applying the results below concerning the K-operator one should be very careful. We consider the problem ~Au = f in OCR*, n яп CL3) м = 0 onoQ for simplicity; the paper by Bramble and Schatz contains the case of variable smooth coefficients. We assume that locally for x e A € Q, m X(x)= £ X afajQi-ix-a), XeVh, C1.4) Thus, q>i,..., cpm are the basic building blocks for the finite element space, and in adjoining elements the building blocks are merely translated. Let again r be the optimal order of approximation in IP spaces. As in Section 5, consider smoothest splines of degree r—\ and continuity r — 2. With Ф1-2 the one-dimensional basis function, let M / r-2 \ Kh(x)=U{ E h-%tf-2(h-lxm-P)\, C1.5) m=l \0=-(r-2) / where the kp are as in Section 5, so that we have an approximate identity of order 2r — 2. (Tables are given in Bramble and Schatz [1977].)
Section 31 Superconvergence 507 As in Section 5 it is easily seen that for uh developed as in C1.4), Kh*uh at mesh points is given by a finite linear combination of a^ with coefficients d{ that only depend on the finite element space used, not on h, or on the particular mesh point considered (or, on the coefficients of the elliptic operator). The idea is now to consider u — Kh*uh. We have with e — u — uh, u~Kh*uh = (u-Kh*u) + Kh*e. C1.6) For x0e A, since Kh is an approximate identity and localized, \{u-Kh*u)(x0)\^Ch2r'2\\u\\2r^^A. C1.7) The rest of the argument now proceeds as in E.36) et seq.: Kh*e may be estimated in terms of negative norms of derivatives of Kh * e; derivatives of Kh * e may be bounded by difference quotients of e. Thus Bramble and Schatz [1977, Lemma 6.1], \(Kh*e)(x0)\ Щ £ Whe\\2-r,A + hr~2 X 11ЗДо.«А C1.8) where N0 = [^/V]-l-l. By localized duality estimates, cf. Section 9, and by Section 30 (assuming these results may be applied), we arrive at: Theorem 31.1. Under the above assumptions, for x0e A<qQ and for any s, \(u-Kh*uh)(x0MCh2r-2\\u\\2r-2+No,z,A + C\\e\\-s,A. C1.9) Thus, for r^3, we have an easily calculated candidate for a superconvergent approximation to и at interior nodes. Again we point out that one must be careful in estimating ||e||_Sij4. The technique above in deriving Theorem 31.1 includes a "Sobolev-loss" from C1.8). In the special case of tensor products of smoothest splines, it was shown in Bramble and Schatz [1976, Theorem 8] how the first term on the right of C1.9) can be replaced by Ch2r~2 ln(l//i) ||м||2г-2,а>,л> а more satisfying result one would believe holds in general. The approaches of Sections 30 and 31 can clearly be combined so that approximations to derivatives of higher order than 0(hr) can be obtained by postprocessing, see e.g. Тномёе [1977]. Similarly, both approaches can readily be adapted to parabolic problems, hyperbolic problems, Euler-Poisson-Darboux, Sobolev equations, etc., all time-dependent problems containing a spatial second-order elliptic operator. In this respect see e.g. Bramble, Schatz, Тномёе and Wahlbin [1977, Sections 6 and 7]. Later advancements in the theory of approximations for time-dependent problems have, of course, yielded better results, see e.g. Bales [1984], Genis [1984], and for the parabolic situation, Тномёе [1984].
508 L.B. Wahlbin Chapter VII 32. Miscellaneous results In this section we shall describe some superconvergence results, frequently derived for special type of elements. A nice survey of superconvergence exists in Krizek and NeittaanmAkj [1987], which has a rather complete bibliography, including one- dimensional cases. However, our survey in this chapter has a different emphasis, in that I have stressed two systematic techniques in Sections 30 and 31 for translation- invariant meshes. Furthermore, the results were always given pointwise, always localized, although not systematically carried up to the boundary or done for piecewise almost uniform (or more general) meshes. The results we are about to describe are frequently phrased so that they are dependent on global smoothness assumptions, and they are sometimes given in less satisfying error measures than the pointwise measure. Sometimes, they do apply up to boundaries. We saw in Corollary 30.1 how the quite general theory for approximating derivatives by difference quotients lead to the fact that, in the uniform piecewise linear triangular situation, the average of the gradients of щ in two neighboring triangles is an 0(h2 In(l/h))||M||3i0Oii4l + И _J>i4l C2.1) approximation to Vu at the midpoint of the common side. As noted by Levine [1985], this implies an averaging technique for approximating the gradient of и at centroids of triangles: Simply take the three edge-midpoint approximations already found and average them. In the uniform piecewise linear situation, to approximate the gradient at a nodal point, by the theory of Section 30 simply use two suitable (following edges) centered difference quotients. Due to cancellations this method can be expressed as averaging the approximate gradient in all six surrounding elements, see Krizek and NeittaanmAki [1984], where also a technique up to the boundary is given. Clearly, the results of Sections 30 and 31 provide an almost inexhaustible source for deriving special formulae in special situations. Frequently, however, such special formulae were not first derived by the difference quotient or K-operator techniques. Indeed, in e.g. the piecewise linear situation a result by Oganesyan and Rukhovets [1969, C.18)], namely that for u{ the piecewise linear interpolant to u, \\uh-u[\\mnh)^Ch2\\u\\H4n), C2.2) appears to have served as inspiration to many researchers. Having C2.2) one may sink one's teeth into the explicitly and locally given u[ and come up with some averaging formula A such that Vu—AVu{ is of higher order at desired points. It is then automatic that Vu — AVuh is of order h2, at last in energy. In the results mentioned so far the meshes are either completely uniform, a higher order perturbation of such a mesh, see Lin and Xu [1985], or a smooth transformation of a uniform mesh, see Levine [1985]. (Levine's case thus falls into that of Lin and Xu.)
Section 32 Superconvergence 509 We shall next give two results which identify superconvergent points for the function и itself Following first Douglas, Dupont and Wheeler [1974a], consider — Au = f on C = the plane rectangle [0,1] x [0,1] м=0 on dQ. Let Vh consist of tensor products of continuous piecewise polynomials of any order r— 1; the meshes are quasi-uniform and rectangular but not necessarily uniform, They show that if u is globally smooth enough, then at the mesh points K, \(u-uh)(K)\^Chr+1 C2.4) which is superconvergent to one extra order. At the "opposite end of the scale", consider tensor products of smoothest splines, i.e., tensor products of one-dimensional splines with polynomial degree r — 1 and continuity degree r—2. Then from Bramble and Schatz [1976, Theorem 10] we have for the problem C2.3) (also generalized to UN) provided the meshes are uniform, and provided r^3 is odd, for К an interior mesh point, with e — u — uk, Jl!4ln(l//1)|H||4,M + ||e||.M for /- = 3, N>\ П JKjfcr+1l!"L+i,°o,, + M-M otherwise, ^ which is again superconvergent to one extra order, the global influences permitting. Turning now to identifiable superconvergence points for the gradients, we first describe the results of Zlamal [1977]. He considers also variable smooth coefficients in C2.3). The finite elements are (almost) rectangular quasi-uniform (but not necessarily uniform) eight-node quadrilaterals and he shows that under appropriate global smoothness assumptions, with G denoting the four Gauss points on each rectangle (i.e., the maps of (± 1/^/3, ± 1/^/3) to each rectangle), У/2 YMu-uh)(G)\2hj ^Ch3. C2.6) Hence, in this global discrete energy norm we have superconvergence for gradients at Gauss points to one further order. This result was generalized in Lesaint and Zlamal [1979] to any degree continuous (almost) rectangular tensor product of polynomials, still showing (in an average sense) one order of superconvergence at the appropriate Gauss points for gradients. Further, Zlamal [1978] gives similar results for quadratic and linear triangular elements. Andreev and Lazarov [1988] show in the quadratic triangular case on almost uniform meshes that the Gauss points on triangle edges are points of superconvergence for the derivative in the direction of the side. Based on this they give an averaging scheme to produce superconvergence of gradients at element vertices. Nakao [1987] has shown how to obtain, via a certain local postprocessing, one order higher convergence at nodes for the gradient in the case of continuous tensor products on rectangular elements, in the case that the polynomial degree r— 1 is odd.
510 L.B. Wahlbin И;" if Chapter VII Fig. 32.1. Consider the situation shown in Fig. 32.1. With аг = /г; /hf,a.j = hj /hf, set Dixuh(pij) = 7—-} \ ^" «/,(*.- -, У,) + af T- »*(*.- +. ^j) 1 + <X; 8x Sx C2.7) Djyuh(Pi})= j-^ |— гфс,., у,-) + а$ ^ и„(х;, y, + )j. Letting К denote the knots Py, set VuA(K) = F1-JC«A,6^uJ. If u is sufficiently smooth (и е Wp+ 2(Q), p>2, for example) then \(Vu-Vuh)(K)\^Chr, C2.8) C2.9) again exhibiting one order of superconvergence. We conclude this section by describing two more computationally complicated methods for obtaining higher-order approximations. The first one is a method due to Wheeler [1973] for obtaining a better approximation to the flux at the boundary of a domain (in the case of Dirichlet conditions). To motivate the method, with y = Fu/8n)|r we have by integration by parts in C2.3) yv = 8fi П (fv — Vu ■ Vv), any v smooth enough. C2.10) Thus, it makes sense to approximate the normal flux у by yh, where yh is given by
Section 32 Superconvergence 511 (and uh is the already computed finite element approximation) -I< VkX=\(fX-Vuk'Vx) brXBVh, C2.11) where Vh does not satisfy the essential boundary conditions. Solving for yh amounts to solving for an L2 projection on dQh into Vh\UOh, once щ is computed. In Douglas, Dupont and Wheeler [1974b] it is shown that (for Q a rectangle, for и globally smooth enough, and for continuous tensor product elements), the approximation yh defined by C2.11) to у = фи/дп)\г is superconvergent in the sense that l|y*-ylk„.«><CA', C2.12) which is one order superconvergent. We remark that methods such as the Lagrange multiplier method of BabuSka [1973] incorporate a separate approximation to у in the basic finite element formulation. We refer to Bramble [1981] for some results about this approximation. Our second computationally more complicated method should probably be applied only if one desires a good approximation to и at only a few points inside Q. It applies, however, to any mesh but requires exact knowledge of a fundamental solution. We follow the particular presentation of Louis [1979]. Let x0 be a fixed point and B(x0,r)={x:\x-x0\^r}czQ. C2.13) Let co(z)=-~ln\z\+il,(z), C2.14) where ф(г) e #a> is chosen so that ео(г) = Э(в(г)/Эп = 0. C2.15) For example, Louis gives the choice 1 By Green's second formula, "AW=-^^(l^l4-4'-2kl2 + C-41nr)r4). C2.16) "(*o)= (ЯуМхо~у) + и(у)Аф(х0~у)Nу. C2.17) B(xo,r) Having then uh e Vh a basic finite element solution, define fi*(*o)= (f(yMx0-y) + uh(y)Ai/,(x0-y))dy. C2.18) B(x0,r)
512 L.B. Wahlbin Chapter VII It is clear that (if the integral in C2.18) is evaluated exactly), l(M-fi*)(*o)l<C|||u-U»|||_IiB(xo.r), C2.19) where ||| ||| denotes the dual norm to HS(B). Thus the approximation щ picks up whatever negative norm accuracy the finite element solution uh enjoys. (As we have remarked at many places, one must be careful in evaluating negative norms. Boundary approximations may cause grief even on smooth domains.)
References Andreev, А.В. and R.D. Lazarov A988), Superconvergence of the gradient for quadratic triangular finite elements. Numer. Meths. Partial Differential Equations 4, 15-32. BabuSka, I. A970), Finite element methods for domains with corners, Computing 6, 264-273. BabuSka, I. A973), The finite element method with Lagrangian multipliers, Numer. Math. 20,179-192. BabuSka, I. and А.К. Aziz A976), On the angle condition in the finite element method, SIAM J. Numer. Anal. 13, 214-226. BabuSka, I., A. Miller and M. Vogelius A983), Adaptive methods and error estimation for elliptic problems of structural mechanics, in: I. BabuSka, J. Chandra and J.E. Flaherty, eds., Adaptive Computational Methods for Partial Differential Equations (SIAM, Philadelphia, PA) 53-73. BabuSka, I. and J. Osborn A980), Analysis of finite element methods for second order boundary value problems using mesh dependent norms, Numer. Math. 34, 41-62. BabuSka, I. and M. Rosenzweig A973), A finite element scheme for domains with corners, Numer. Math. 20, 1-21. Barker, M. A982), A note on C° Galerkin methods for two-point boundary value problems, Numer. Math. 38, 447-453. Bakker, M. A984), One-dimensional Galerkin methods and superconvergence at interior nodal points, SIAM J. Numer. Anal. 21, 101-110. Bales, L. A984), Semidiscrete and single step fully discrete approximations for second order hyperbolic equations with time-dependent coefficients, Math. Сотр. 43, 383-414. Baranger, J. A979), On the thickness of the boundary layer in elliptic-elliptic singular perturbation problems, in: P.W. Hemker and J.J.H. Miller, eds., Numerical Analysis of Singular Perturbation Problems (Academic Press, New York) 395-400. Barv, N.K. A964), A Treatise on Trigonometric Series 2 (Macmillan, New York). BERGH, J. and J. LOfstrom A976), Interpolation Spaces: An Introduction (Springer, New York). Besjes, J.G. A975), Singular perturbation problems for linear elliptic differential operators of arbitrary order, I: Degeneration to elliptic operators, J. Math. Anal. Appl. 49, 24-46. Blair, J. A970), Approximate solution of elliptic and parabolic boundary value problems, Thesis, University of California, Berkeley, CA. Bramble, J.H. A975), A survey of some finite element methods proposed for treating the Dirichlet problem, Adv. in Math. 16, 187-196. Bramble, J.H. A981), The Lagrange multiplier method for Dirichlet's problem, Math. Сотр. 37,1-12. Bramble, J.H. and S.R. Hilbert A970), Estimation of linear functionals on Sobolev spaces with application to Fourier transforms and spline interpolation, SIAM J. Numer. Anal. 7, 113-124. Bramble, J.H. and S.R. Hilbert A971), Bounds for a class of linear functionals with application to Hermite interpolation, Numer. Math. 16, 362-369. Bramble, J.H., J.A. Nitsche and A.H. Schatz A975), Maximum norm interior estimates for Ritz- Galerkin methods, Math. Сотр. 29, 677-688. Bramble, J.H. and J.E. Osborn A973), Rate of convergence estimates for non-selfadjoint eigenvalue approximation, Math. Сотр. 27, 525-549. Bramble, J.H. and A.H. Schatz A976), Estimates for spline projections, RAIRO Anal. Numer. 10, 5-37. Bramble, J.H. and A.H. Schatz A977), Higher order local accuracy by averaging in the finite element method, Math. Сотр. 31, 94-111. 513
514 L.B. Wahlbin Bramble, J.H., A.H. Schatz, V. Thomee and L.B. Wahlbin A977), Some convergence estimates for semidiscrete Galerkin type approximations for parabolic equations, SIAM J. Numer. Anal. 14, 218-240. Bramble, J.H. and V. Thomee A974), Interior maximum norm estimates for some simple finite element methods, RAIRO Anal. Numer. 8, 5-18. Brenner, P., V. Thomee and L.B. Wahlbin A975), Besov Spaces and Applications to Difference Methods for Initial Value Problems, Lecture Notes in Mathematics 434 (Springer, Berlin). Brooks, A.N. and T.J.R. Hughes A982), Streamline upwind Petrov-Galerkin formulations for convection dominated flows with particular emphasis on the incompressible Navier-Stokes equations, Comput. Meths. Appl. Mech. Engrg. 32, 199-259. Chavez, P. A983), Automatic procedures in evolutionary finite element calculations: Restoration of deteriorated meshes, data transfer between meshes and mesh refinement, Thesis, Cornell University, Ithaca, NY. Chen, CM. A979), Superconvergent points of Galerkin's method for two-point boundary value problems, J. Numer. Anal. Chinese Univ. 1, 73-79. Christie, I., D.F. Griffiths, A.R. Mitchell and О.С. Zienkiewicz A976), Finite element methods for second order differential equations with significant first derivatives, Internal. J. Numer. Meths. Engrg. 10, 1389-1396. Ciarlet, P.G. A978), The Finite Element Method for Elliptic Problems (North-Holland, Amsterdam). Crouzeix, M. and V. Thomee A987a), The stability in Lp and W\ of the L2 projection into finite element function spaces, Math. Сотр. 48, 521-532. Crouzeix, M. and V. Thomee A987b), On the discretization in time of semilinear parabolic equations with nonsmooth initial data, Math. Сотр. 49, 359-377. De Boor, С A979), On a max-bound for the least-squares spline approximant, in: Z. Ciesielski, ed., Approximation and Function Spaces (North-Holland, Amsterdam) 163-175. De Boor, C. and G.J. Fix A973), Spline approximation by quasiinterpolants, J. Approx. Theory 8, 19-45, Demko, S. A977), Inverses of band matrices and local convergence of spline projections, S1AM J. Numer. Anal. 14, 616-619. Dendy, J.E. A974), Two methods of Galerkin type achieving optimum L2 accuracy for first order hyperbolics, SIAM J. Numer. Anal. 11, 637-653. Descloux, J. A972), On finite element matrices, SIAM J. Numer. Anal. 9, 260-265. Descloux, J. A976), Interior regularity and local convergence of Galerkin finite element approximations for elliptic equations, in: J. Miller, ed., Topics in Numerical Analysis 2 (Academic Press, New York). Dobrowolski, M. A981), Numerical approximation of elliptic interface and corner problems, Rheinische Friedrich-Wilhelms-Universitat, Bonn. Douglas, J. and T. Dupont A974), Galerkin approximation for the two-point boundary problem using continuous piecewise polynomial spaces, Numer. Math. 22, 99-109. Douglas, J., T. Dupont and L.B. Wahlbin A975a), Optimal Lx error estimates for Galerkin approximations to solutions of two-point boundary value problems, Math. Сотр. 39, 475-483. Douglas, J., T. Dupont and L. Wahlbin A975b), The stability in L« of the L2-projection into finite element function spaces, Numer. Math. 23, 193-197. Douglas, J., T. Dupont and M.F. Wheeler A974a), An L" estimate and a superconvergence result for a Galerkin method for elliptic equations based on tensor products of piecewise polynomials, RAIRO Anal. Numer. 8, 61-66. Douglas. J., T. Dupont and M.F. Wheeler A974b), A Galerkin procedure for approximating the flux on the boundary for elliptic and parabolic boundary value problems, RAIRO Anal. Numer. 8,47-59. Douglas, J. and F. Milner A985), Interior and superconvergence estimates for mixed methods for second order elliptic problems, RAIRO Model. Math. Anal. Numer. 19 C), 397-428. Dupont, T. A973), Galerkin methods for first order hyperbolics: An example, SIAM J. Numer. Anal. 10, 890-899. Eckhaus, W. A972), Boundary layers in linear elliptic singular perturbation problems, SIAM Rev. 14, 225-270.
References 515 Eckhaus, W. A973), Matched Asymptotic Expansions and Singular Perturbations (North-Holland, Amsterdam). Eckhaus, W. A979), Asymptotic Analysis of Singular Perturbations (North-Holland, Amsterdam). Eckhaus, W. and E.M. de Jaeger A966), Asymptotic solutions of singular perturbation problems for linear differential equations of elliptic type, Arch. Rational Mech. Anal. 23, 26-86. Eriksson, K. A985a), Improved accuracy by adapted mesh refinements in the finite element method, Math. Сотр. 44, 321-344. Eriksson, K. A985b), Higher order local rate of convergence by mesh refinement in the finite element method, Math. Сотр. 45, 109-142. Eriksson, K. and C. Johnson A988), An adaptive finite element method for linear elliptic problems, Math. Сотр. 50, 361-384. Eriksson, K. and V. Thomee A984), Galerkin methods for singular boundary value problems in one space dimension, Math. Сотр. 42, 345-367. Fichera, G. A978), Numerical and Quantitative Analysis, Survey and Reference Works in Mathematics 3 (Pitman, San Francisco, CA). Frehse, J. and R. Rannacher A976), Eine L'-Fehlerabschatzung diskreter Grundlosungen in der Methode der finiten Elemente, Tagungsband "Finite Elemente", Bonner Mathematische Schriften (Universitat Bonn, Bonn). French, D. A987), The finite element method for a degenerate elliptic equation, SIAM J. Numer. Anal. 24, 788-815. Gallagher, R.H. A975), Survey and evaluation of the finite element method in linear fracture mechanics analysis, in: Computational Fracture Mechanics, Second National Congress on Pressure Vessels and Piping (ASME, New York) 637-653. Genis, A. A984), On finite element methods for the Euler-Poisson-Darboux equation, SIAM J. Numer. Anal. 21, 1080-1106. Grisvard, P. A976), Behavior of the solution of an elliptic boundary value problem in a polygonal or polyhedral domain, in: B. Hubbard, ed., Numerical Solution of Partial Differential Equations 3 (Academic Press, New York) 207-274. Grisvard, P. A985), Elliptic Problems in Non-Smooth Domains (Pitman, Boston, MA). Grisvard, P., W. Wendland and J.R. Whiteman, eds. A985), Singularities and Constructive Methods for Their Treatment, Lecture Notes in Mathematics 1121 (Springer, Berlin). Gusman, B. A981), Bounds of Galerkin projections on splines with highly nonuniform meshes, SIAM J. Numer. Anal. 18, 1109-1119. Hale, J., X. Lin and G. Raugel A988), Upper semicontinuity of attractors in dynamical systems, Math. Сотр. 50, 89-123. Haverkamp, R. A984), Eine Aussage zur LK-Stabilitat und zur genauen Konvergenzordnung der Hi-Projection, Numer. Math. 44, 393-405. Helfrich, Н.-Р. A974), Fehlerabschatzungen fur das Galerkin Verfahren zur Losung von Evolutions- gleichungen, Manuscripta Math. 13, 219-235. Hellen, Т.К. A980), Numerical methods in fracture mechanics, in: G.C. Chell, ed., Developments in Fracture Mechanics 1 (Applied Science Publishers, Barking, England) 145-181. Heywood, J.G. and R. Rannacher A988), Finite element approximation of the nonstationary Navier-Stokes problem, Part III, SIAM J. Numer. Anal. 25, 489-512. Hilbert, S. A973), A mollifier useful for approximations in Sobolev spaces and some applications to approximating solutions of differential equations, Math. Сотр. 27, 81-89. Huang, M. and V. Thomee A981), Some convergence estimates for semidiscrete type schemes for time- dependent nonselfadjoint parabolic equations, Math. Сотр. 37, 327-346. Huang, M. and V. Thomee A982), On the backward Euler method for parabolic equations with rough initial data, SIAM J. Numer. Anal. 19, 599-603. Hughes, T.J.R. and A.N. Brooks A979), A multidimensional upwind scheme with no crosswind diffusion, in: T.J.R. Hughes, ed., Finite Element Methods for Convection Dominated Flows, AMD 34 (ASME, New York) 19-35. Hughes, T.J.R., L.P. Franca and M. Mallet A986), A new finite element formulation for computational fluid dynamics: I. Symmetric forms of the compressible Euler and Navier-Stokes
516 L.B. Wahlbin equations and the second law of thermodynamics, Comput. Meths. Appl. Mech. Engrg, 54, 223- 234. Hughes, T.J.R., M. Mallet and A. Mizukami A986), A new finite element formulation for computational fluid dynamics: II. Beyond SUPG, Comput. Meths. Appl. Mech. Engrg. 54, 341-355. Hughes, T.J.R. and M. Mallet A986a), A new finite element formulation for computational fluid dynamics: III. The generalized streamline operator for multidimensional advective-diffusive systems, Comput. Meths. Appl. Mech. Engrg. 58, 305-328. Hughes, T.J.R. and M. Mallet A986b), A new finite element formulation for computational fluid dynamics: IV. A discontinuity-capturing operator for multidimensional advective-diffusive systems, Comput. Meths. Appl. Mech. Engrg. 58, 329-336. Jamet, P. A976), Estimations d'erreur pour des elements finis droits presque degeneres, RAIRO Anal. Numir. 10, 43-61. Jesperssen, D. A978), Ritz-Galerkin methods for singular boundary value problems, SI AM J. Numer. Anal. 15, 813-834. Johnson, C, S. Larsson, V. Thomee and L.B. Wahlbin A987), Error estimates for spatially discrete approximations of semilinear parabolic equations with nonsmooth initial data, Math. Сотр. 49, 331-357. Johnson, С and NAvert, U. A981), An analysis of some finite element methods for advection-diffusion problems, in: O. Axelsson, L.S. Frank and A. Van Der Sluis, eds., Analytical and Numerical Approaches to Asymptotic Problems in Analysis (North-Holland, Amsterdam) 99-116. Johnson, C, U. NAvert and J. PitkAranta A984), Finite element methods for linear hyperbolic problems, Comput. Meths. Appl. Mech. Engrg. 45, 285-312. Johnson, С and J. PitkAranta A986), An analysis of the discontinuous Galerkin method for a scalar hyperbolic equation, Math. Сотр. 46, 1-26. Johnson, С and J. Saranen A986), Streamline diffusion methods for the incompressible Euler and Navier-Stokes equations, Math. Сотр. 47, 1-18. Johnson, С, А.Н. Schatz and L.B. Wahlbin A987), Crosswind smear and pointwise errors in streamline diffusion finite element methods, Math. Сотр. 49, 25-38. Juncosa, M.L. and D.M. Young A957), On the Crank-Nicolson procedure for solving parabolic partial differential equations, Proc. Cambridge Philos. Soc. 53, 448-461. Kellogg, R.B. A971), Interpolation between subspaces of a Hilbert space, Tech. Note BN-719, University of Maryland, College Park, MD. Kellogg, R,B. A972), Higher order singularities for interface problems, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method (Academic Press, New York) 589-602. KoNDRAT'EV, V.A. A967), Boundary problems for elliptic equations in domains with conical or angular points, Trans. Moscow Math. Soc. 16, 227-313. Kreiss, H. and J. Oliger A973), Methods for the Approximate Solution of Time Dependent Problems, GARP (Global Atmospheric Research Programme) Publication Series 10 (World Meteorological Organization, Geneva). Kreiss, H.O., V. Thomee, and O. Widlund A970), Smoothing of initial data and rates of convergence for parabolic difference equations, Comm. Pure Appl. Math. 23, 241-259. Krizek, P. and P. NeittaanmAki A984), Superconvergence phenomenon in the finite element method arising from averaging gradients, Numer. Math. 45, 105-116. Krizek, P. and P. NeittaanmAki A987), On superconvergence techniques, Acta Appl. Math. 9,175-198. Laasonen, P. A967), On the discretization error of the Dirichlet problem in a plane region with corners, Ann. Acad. Sci. Fenn. Ser. A. I Math. 408, 1-16. Larsson, S. A985), On reaction-diffusion equations and their approximation by finite element methods, Thesis, Chalmers University of Technology, Goteborg. Leonard, B.P. A979), A survey of finite differences of opinion on numerical muddling of the incomprehensible defective confusion equation, in: T.J.R. Hughes, ed., Finite Element Methods for Convection Dominated Flows, AMD 34 (ASME, New York) 1-17. Lesaint, P. and P.A. Raviart A974), On the finite element method for solving the neutron transport equation, in: С De Boor, ed., Mathematical Aspects of Finite Elements in Partial Differential Equations (Academic Press, New York) 89-123.
References 517 Lesaint, P. and M. Zlamal A979), Superconvergence of the gradient of finite element solutions, RA1RO Anal Numer. 13, 139-168. Levine, N. A985), Superconvergent recovery of the gradient from piecewise linear finite-element approximations, IMA J. Numer. Anal. 5, 407-427. Lin Qun, Lu Tao and Shen Shumin A983), Asymptotic expansion for finite element approximation-, Institute of Mathematical Sciences, Chengdu Branch of Academia Sinica. Lin Qun and Xu Jinchao A985), Linear finite elements with high accuracy, J. Comput. Math. 3. Lions, J.L. A973), Perturbations Singulieres dans les Problemes aux Limites et en Contrdle Optimal, Lecture Notes in Mathematics 323 (Springer, New York). Lofstrom, J. A970), Besov spaces in the theory of approximations, Ann. Mat. Рига Appl. D) 85, 93- 184. Lorentz, G.G. A966), Approximation of Functions (Holt, Rinehart and Winston, New York). Louis, A. A979), Acceleration of convergence for finite element solutions of the Poisson equation, Numer. Math. 33, 43-53. Luskin, M. and R. Rannacher A982a), On the smoothing property of the Galerkin method for parabolic equations, SIAM J. Numer. Anal. 19, 93-113. Luskin, M. and R. Rannacher A982b), On the smoothing property of the Crank-Nicolson scheme, Applicable Anal. 14, 117-135. Merigot, M. A971), Regularite des derivees de la solution du probleme de Dirichlet dans un secteur plan, C.R. Acad. Sci. Paris, 273, 356-359. Merigot, M. A972), Regularite des derivees de la solution du probleme de Dirichlet dans un secteur plan, Matematiche 28, 1-36. Merigot, M. A974a), Etude du probleme Au = f dans un polygone plane: Inequalites a priori, Boll. Un. Mat. Ital. D) 10, 577-597. Merigot, M. A974b), Potentiels et regularite Lv pour les problemes elliptiques dans un secteur plan, C.R. Acad. Sci. Paris 278, 1487-1489. Nakao, M.T. A986), Superconvergence of the gradient of Galerkin approximations for elliptic problems, RAIRO Model. Math. Anal. Numer. 21, 679-695. Natterer, F. A975), Uber die punktweise Konvergenz finiter Elemente, Numer. Math. 25, 67-77. Natterer, F. A977), Uniform convergence of Galerkin's method for splines on highly nonuniform meshes, Math. Сотр. 31, 457-468. NAvert, U. A982), A finite element method for convection-diffusion problems, Thesis, Chalmers University of Technology and the University of Goteborg, Goteborg. Niijima, K. A980a), On a three point difference scheme for a singular perturbation problem without a first derivative term, I, Mem. Numer. Math. 7. Niijima, K. A980b), On a three point difference scheme for a singular perturbation problem without a first derivative term, II, Mem. Numer. Math. 1. Nitsche, J. A972a), Interior error estimates of projection methods, in: Proceedings Conference on Differential Equations and Their Applications (EQUADIFF 3), J.E. Purkyne University, Brno, 235-239. Nitsche, J.A. A972b), On Dirichlet problems using subspaces with nearly zero boundary conditions, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method (Academic Press, New York) 603-627. Nitsche, J. A975), Lx convergence for finite element approximation, in: Proceedings Second Conference on Finite Elements, Rennes, France. Nitsche, J. A976a), Zur lokalen Konvergenz von Projectionen auf finite Elemente, in: Approximation Theory, Lecture Notes in Mathematics 556 (Springer, Berlin) 329-346. Nitsche, J. A976b), Der Einfluss von Randsingularitaten beim Ritzschen Verfahren, Numer. Math. 25, 263-278. Nitsche, J. A977), L„ convergence of finite element approximations, in: Mathematical Aspects of Finite Element Methods, Lecture Notes in Mathematics 606 (Springer, New York) 261-274. Nitsche, J. A979), Interior error estimates for semidiscrete Galerkin approximations for parabolic equations, in: Proceedings Conference on Progress in the Theory and Practice of the Finite Element Method, Goteborg.
518 L.B. Wahlbin Nitsche, J. and A. Schatz A972), On local approximation properties of L2-projection on spline- subspaces, Applicable Anal. 2, 161-168. Nitsche, J.A. and A.H. Schatz A974), Interior estimates for Ritz-Galerkin methods, Math. Сотр. 28, 937-958. Noble, M.E. A954), Coefficient properties of Fourier series with a gap condition, Math. Ann. 128,55-62 (Correction, Math. Ann. 128, 256). Oganesyan, L.A. and L.A. Rukhovets A969), Study of the rate of convergence of variational difference schemes for second order elliptic equations in a two-dimensional field with a smooth boundary, U.S.S.R. Comput. Math, and Math. Phys. 9, 158-183. Oliger, J. A978), Methods for time dependent partial differential equations, in: Numerical Analysis: Proceedings of Symposia in Applied Mathematics 22 (AMS, Providence, RI) 87-108. Raithby, G.D. and K.E. Torrance A974), Upstream-weighted differencing schemes and their application to elliptic problems involving fluid flow, Comput. & Fluids 2, 191-206. Rannacher, R. and R. Scott A982), Some optimal error estimates for piecewise linear finite element approximations, Math. Сотр. 38, 437-445. Raugel, G. A978), Resolution numerique de problemes elliptiques dans des domaines avec coins, C. R. Acad. Sci. Paris Ser A-B 286 A8), A791-794. Rice, J.R. A969), On the convergence of nonlinear spline approximation, in: I.J. Schoenberg, ed., Approximations with Special Emphasis on Spline Functions (Academic Press, New York) 349-365. Richter, G.R. and R.S. Falk A984), An analysis of a finite element method for hyperbolic equations, in: R. Vichnevetsky and R. Stepleman, eds., Advances in Computer Methods for Partial Differential Equations, VI (IMACS, New Brunswick, NJ) 297-300. Riemann, B. A854), Uber die Darstellbarkeit einer Funktion durch eine trigonometrische Reihe, Collected Works, 227-271 (Leipzig, 2nd ed., 1892). RlSCH, U. A986), Ein hybrides upwind FEM Verfahren und dessen Anwendung auf swach gekoppelte elliptische Differentialgleichungssysteme mit dominanter Konvektion, Dissertation, Magdeburg. Rosser, J.B. A975), Calculation of potential in a sector, MRC Rept. 1535, Madison, WI. Rosser, J.B. A977), Harmonic functions on region with reentrant corners, MRC Rept. 1796, Madison, WI. Sammon, P.H. A982), Convergence estimates for semidiscrete parabolic equation approximations, S1AM. J. Numer. Anal. 19, 68-92. Sammon, P.H. A983), Fully discrete approximation methods for parabolic problems with nonsmooth initial data, SI AM J. Numer. Anal. 20, 437-470. Schatz, A.H. A980a), The finite element method on polygonal domains, in: Seminar on Numerical Analysis and Its Applications to Continuum Physics (Colecao AT AS, Rio de Janeiro) 57-64. Schatz, A.H. A980b), A weak discrete maximum principle and stability of the finite element method in Vе on plane polygonal domains, Math. Сотр. 34, 77-91. Schatz, A.H., V. Thomee and L.B. Wahlbin A980), Maximum norm stability and error estimates in parabolic finite element equations, Comm. Pure Appl. Math. 33, 265-304. Schatz, A.H. and L.B. Wahlbin A977), Interior maximum norm estimates for finite element methods, Math. Сотр. 31, 414-442. Schatz, A.H. and L.B. Wahlbin A978), Maximum norm estimates in the finite element method on plane polygonal domains. Part 1, Math. Сотр. 32, 73-109. Schatz, A.H. and L.B. Wahlbin A979), Maximum norm estimates in the finite element method on plane polygonal domains, Part 2, Refinements, Math. Сотр. 33, 465-492. Schatz, A.H. and L.B. Wahlbin A981), On a local asymptotic error estimate in finite elements and its use: numerical examples, in: R. Vichnevetsky and R.S. Stepleman, eds., Advances in Computer Methods for Partial Differential Equations, IV (IMACS, New Brunswick, NJ) 14-17. Schatz, A.H. and L.B. Wahlbin A982), On the quasi-optimality in L^ of the Й1 -projection into finite element spaces, Math. Сотр. 38, 1-21. Schatz, A.H. and L.B. Wahlbin A983), On the finite element for singularly perturbed reaction-diffusion problems in two and one dimensions, Math. Сотр. 40, 47-89. Schreiber, R. A980), Finite element methods of high order accuracy for singular two-point boundary value problems with non-smooth solutions, SIAM J. Numer. Anal. 17, 547-566.
References 519 Scott, R. A973), Finite element convergence for singular data, Numer. Math. 21, 317-327. Scott, R. A975), Interpolated boundary conditions in the finite element method, SI AM J. Numer. Anal. 12, 404-^27. Scott, R. A976), Optimal L" estimates for the finite element method on irregular meshes, Math. Сотр. 30, 681-697. Sendov, B. A969), Some questions of the theory of approximations of functions and sets in the Hausdorff metric, Russian Math. Surveys 24, 143-183. Smoller, J. A983), Shock Waves and Reaction-Diffusion Equations (Springer, New York). Stein, EM. A970), Singular Integrals and Differentiability Properties of Functions (Princeton University Press, Princeton, NJ). Strang, G. A973), Approximation in the finite element method, Numer. Math. 19, 81-98. Strang, G. and G.J. Fix A973), An Analysis of the Finite Element Method (Prentice-Hall, Englewood Cliffs, NJ). Thatcher, R.W. A976), The use of infinite grid refinements at singularities in the solution of Laplace's equation, Numer. Math. 25, 163-178. ТномЁЕ, V. A972), Spline approximation and difference schemes for the heat equation, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 711-746. ТномЁЕ, V. A977), Higher order local approximations to derivatives in the finite element method, Math. Сотр. 31, 652-660. ТномЁЕ, V. A979), Some interior estimates for semidiscrete Galerkin approximations for parabolic equations, Math. Сотр. 33, 37-62. ТномЁЕ, V. A980), Negative norm estimates and superconvergence in Galerkin methods for parabolic problems, Math. Сотр. 34, 93-113. ТномЁЕ, V. A984), Galerkin Finite Element Methods for Parabolic Problems, Lecture Notes in Mathematics 1054 (Springer, Berlin). ТномЁЕ, V. and L.B. Wahlbin A974), Convergence rates of parabolic difference schemes for non- smooth data, Math. Сотр. 28, 1-13. ТномЁЕ, V. and L.B. Wahlbin A975), On Galerkin methods in semilinear parabolic problems, SI AM J. Numer. Anal. 12, 378-389. ТномЁЕ, V. and L.B. Wahlbin A983), Maximum norm stability and error estimates in Galerkin methods for parabolic equations in one space variable, Numer. Math. 41, 345-371. Wahlbin, L.B. A974a), A dissipative Galerkin method for the numerical solution of first order hyperbolic equations, in: С De Boor, ed., Mathematical Aspects of Finite Elements in Partial Differential Equations (Academic Press, New York) 147-169. Wahlbin, L.B. A974b), A dissipative Galerkin method applied to some quasilinear hyperbolic equations, RAIRO Anal. Numer. 8, 109-117. Wahlbin, L.B. A978), Maximum norm error estimates in the finite element method with isoparametric quadratic elements and numerical integration, RAIRO Anal. Numer. 12, 173-202. Wahlbin, L.B. A980), A remark on parabolic smoothing and the finite element method, SI AM. J. Numer. Anal. 17, 33-38. Wahlbin, L.B. A981), A brief survey of parabolic smoothing and how it affects a numerical solution: finite differences and finite elements, in: Lectures on the Numerical Solution of Partial Differential Equations, University of Maryland Lecture Notes 20 (University of Maryland Press, College Park, MD). Wahlbin, L.B. A984), On the sharpness of certain local estimates for Й1 projections into finite element spaces: Influence of a reentrant corner, Math. Сотр. 42, 1-8. Wahlbin, L.B. A985), A comparison of the local behavior of spline L2 projections, Fourier series and Legendre series, in: P. Grisvard, W. Wendland and J.R. Whiteman, eds., Singularities and Constructive Methods for Their Treatment, Lecture Notes in Mathematics 1121 (Springer, New York). Wendland, W.L. A979), Elliptic Systems in the Plane (Pitman, San Francisco, CA). Wheeler, J.A. A973), Simulation of heat transfer from a warm pipeline buried in permafrost, in: Proceedings 74th National Meeting of the American Institute of Chemical Engineers.
520 L.B. Wahlbln Wheeler, M.F. A973), A priori L2 error estimates for Galerkin approximations to parabolic partial differential equations, SI AM. J. Numer. Anal. 10, 723-759. Zlamal, M. A977), Some superconvergence results in the finite element method, in: Mathematical Aspects of Finite Element Methods, Lecture Notes in Mathematics 606 (Springer, Berlin) 353-362. Zlamal, M. A978), Superconvergence and reduced integration in the finite element method, Math. Сотр. 32, 663-685.
Subject Index Asymptotic estimates, general —, 360 Bordered matrices in singular functions, 442 Boundary approximation influence, 408, 409, 420, 421 Bramble-Hilbert Lemma, 367, 424 В splines, 387 Corner problems, 421-445 Degenerate problems, 428 Delta function, approximate —, 368, 396 "Discrete harmonic", 375, 405, 407, 411, 414, 415 First Law of Applied Mathematics, 359 Green's function, finite element approximation of —, 418 — in convection-dominated problems, 486 smoothed —,411 Hausdorff measure, 361 Interpolation spaces, 377, 447 Inverse property, 366, 377, 397, 404 Isoparametric elements, 409, 420 K-operator, 386-389, 506-507 Local pointwise estimates, — in convection-dominated problems, 484 — in elliptic projections, in gradients, 414 interior, 410 up to boundaries, 415, 416 — in elliptic to elliptic singular perturbations, 462 — in L2 projections, 395-403 Locally smooth boundary, 406 Mesh refinements, 428^135 Mixed methods, 409 Negative norm, 377, 406 Numerical illustrations, — adaptive mesh in corner problem, 434,435 — calculation of stress intensity factors, 441 — crosswind spread in convection-dominated problems, 474 — pollution in the elliptic to elliptic singularly perturbed case, 466 Parabolic problems and smoothing effects, — linear problems, 491-496 effects of numerical integration, 493 effects of time discretizations, 493 — nonlinear problems, 496-500 long time asymptotic numerical behavior, 500 sharpness of results, 499 Plucked string and Fourier series 359 Pointwise stability, — of L2 projection, 370, 398 — of elliptic projection, 373, 411 Pollution — in corner problems on unrefined meshes, 421-428 — in problems with rough coefficients 446-449 — with numerical integration, 449^450 Quasi-interpolation operator, 367 521
522 L.B. Wahlbin Quasi-uniform mesh family, 366, 404, 421 Riemann localization principle, 360 Singular perturbation problems, 451-489 Sobolev's lemma, — on finite element space, 415, 416, 458 special modification of —, 486 Streamline diffusion method, 471 Stress intensity factors, — Direct Method, 438 — Energy Release Method, 439 — Equation Substitution, 438 — Extrapolation Method, 439 — ./-Integral Method of Rice, 439 — Trigonometric Fitting Method, 440-441 — Singular Function Methods, 442-445 solution by bordered matrices, 442 Superapproximation, 367, 396, 404, 478 Superconvergence, — in continuous splines, 382, 383, 384 — in Hermite cubics, 381, 384-386 interior — for derivatives in ^° splines, 383 interior — in 4° splines, 384 mesh point — in <ё° splines, 382 miscellaneous — results, 508-512 — of difference quotients, 388, 501-505 — via the K-operator, 382, 506, 507 Universal extension operator E, 406 Upwinding, 472, 474, 475 Weak approximation, 407
Mixed and Hybrid Methods J.E. Roberts INRIA B.P. 105 78153 he Chesnay Cedex, France J.-M. Thomas Laboratoire de Mathematiques Appliquees Universite de Pau et C.N.R.S.; URA 1204 avenue de I'Universite 64000 Pau, France HANDBOOK OF NUMERICAL ANALYSIS, VOL. II Finite Element Methods (Part 1) Edited by P.G. Ciarlet and J.L. Lions © 1991. Elsevier Science Publishers B.V. (North-Holland)
Contents Chapter I. Introduction 527 1. Preliminary remarks 527 2. Conforming and equilibrium variational formulations for a model boundary value problem 531 3. First examples of mixed and hybrid formulations 536 3.1. The dual mixed formulation 536 3.2. The primal mixed formulation 537 3.3. The dual hybrid formulation 539 3.4. The primal hybrid formulation 540 4. General orientation 541 Chapter II. Finite Element Type Interpolation of Scalar and Vectorial Functions 545 5. Lagrangian interpolation: An H1 approximation of scalar functions 545 6. Interpolation of the moments of a vectorial function: An H(div) approximation of vector valued functions 549 7. Examples of finite element type approximation for nontriangular geometric forms 558 Chapter III. Abstract Theory of Approximation of Solutions of Problems with Lagrangian 563 Multipliers 8. General remarks 563 9. Analysis and approximation of the solution of a variational problem with Lagrangian multipliers (theory of BabuSka) 564 10. Analysis and approximation of the solution of a variational problem with Lagrangian multipliers (theory of Brezzi) 567 11. Effects of numerical integration 573 Chapter IV. Mixed Finite Element Methods 577 12. Examples of primal mixed methods for a Dirichlet problem 577 13. Examples of dual mixed methods for a Dirichlet problem 580 14. Examples of mixed methods for a Neumann problem 585 15. Examples of mixed finite element methods for a Robin's boundary condition 588 16. Other examples of mixed finite element methods 590 17. Solution of the linear system 591 17.1. Conjugate gradient method 593 17.2. Penalty method 593 17.3. Augmented Lagrangian method 594 17.4. Alternating-direction method 597 17.5. Mixed-hybrid method 599 525
526 J.E. Roberts and J.-M. Thomas Chapter V. Hybrid Finite Element Methods 601 18. Examples of primal hybrid finite element methods 601 19. Primal hybrid methods and nonconforming methods 607 20. Examples of dual hybrid finite element methods 610 21. Hybridization of equilibrium methods 615 Chapter VI. Extensions and Variations 619 22. Other examples of mixed and hybrid methods 619 23. Extensions and variations of the theory 620 References 623 List of Some Special Symbols 635 Subject Index 637
Chapter I Introduction 1. Preliminary remarks The terms "mixed finite element method", "hybrid finite element method", and even "mixed-hybrid finite element method" come essentially from the vocabulary of structural mechanics as do the terms "conforming finite element method" and "equilibrium finite element method"; cf. in particular the historical account of hybrid and mixed elements given by Pian [1983]. Consider for simplicity the linear elasticity problem describing the displacement и of an elastic body Q. A finite element method based on the variational formulation associated with the principle of minimization of the potential energy form is called a displacement method as the procedure yields an approximation of the displacement field и. However, in structural mechanics one is often more interested in the stress tensor a than in the displacement. Methods have been devised for computing an approximation of a from the approximation of u. Alternatively the stress may be characterized as the minimum of the complementary energy form on the space of symmetric tensors satisfying a relation expressing the equilibrium between the internal and boundary forces acting on Q, and finite element methods based on the associated variational form are termed equilibrium methods. The drawback of these methods is that it is not usually easy to construct an approximation space of tensors satisfying the equilibrium relation and having the required amount of regularity, C° in this instance. Mixed methods and hybrid methods were devised to avoid this difficulty. With the aid of Lagrangian multipliers the above constrained minimization problem may be turned into a saddle point problem. The Hellinger-Reissner principle characterizes the pair (a, u) as the saddle point of a Lagrangian where a is allowed to vary over the space of symmetric tensors subject only to regularity constraints, the equilibrium relation now being incorporated in the Lagrangian. Finite element methods based on the corresponding variational principle are called mixed methods. Mixed methods were first introduced in the literature in Hermann [1967] for the plate problem. We shall say, more generally, that a finite element method is a mixed method if it involves the simultaneous approximation of two or more vector fields defined on the physical domain. These will in general be the principal unknown and an expression involving one or more of its derivatives. An alternative approach is taken in the development of hybrid methods. The idea is to retain the equilibrium requirement for functions in the space in which the stress 527
528 J.E. Roberts and J.-M. Thomas Chapter I is approximated but to relax the regularity requirement. Given a finite element mesh on the domain Q, a mesh-dependent Lagrangian is constructed for which the unique saddle point is {a, (p) where a varies over a space of tensors satisfying the equilibrium condition but defined only on the union of the interiors of the elements. The Lagrangian multiplier <p is a vector field defined on the union of the boundaries of the elements which turns out to be none other than the trace of the displacement u. Finite element methods based on the associated variational formulation are hybrid methods. It is equally possible, starting from the original displacement formulation, to define a Lagrangian, again mesh-dependent, by introducing a multiplier Я to relax the regularity requirement on the space in which the displacement и is sought. The space of multipliers is again a space of vector fields on the union of the boundaries of the elements, and и is looked for in a space of vector fields defined on the interiors of the elements. There is a unique saddle point (и, Я) for this Lagrangian and, moreover, the multiplier Я is the normal constraint a • v on the boundaries of the elements, v denoting a field of unit normal vectors on the union of the boundaries. Methods obtained from such a Lagrangian are also termed hybrid methods. Again we shall adopt a general terminology saying that the method is a hybrid method if it involves the simultaneous approximation of a vector field defined on the union of the elements of the discretization and another defined on the union of the boundaries of the elements. The first reference for hybrid methods is Jones [1964]. An apercu of these methods is given in Ciarlet [1978, Chapter 7]. Besides structural mechanics mixed and hybrid methods have been found useful in many fields. In fluid mechanics for the incompressible Stokes problem, for instance, the role of the displacement is played by the fluid pressure, that of the stress by the fluid velocity, that of the equilibrium condition by the incompressibility condition. The above ideas will be developed in detail in Sections 2 and 3 in the context of a simpler model problem. First, however, we need to describe the functional framework in which we shall work. Let Q be a bounded, open subset of R", with n = 2 or n = 3, having a boundary F, F=dQ, which is piecewise C1. For each natural number m denote by Hm(Q) the Sobolev space of order m of scalar valued functions on Q, defined recursively by H°(Q) = L2(Q), Hm(Q) = \veHm~i(Q):Va = (x1,,,,,<xn)eNnmth £ «, = m, A.1) dmv ) d*v = e L\Q)} for all m > 1. dx\l---dx*" J We put Ио,я= v2(x)dx) ,
Section 1 Introduction 529 |fU = ( I \\&v\\o*) forallm^l, A.2) \|a|=m / Ыт,п = (\Нгт^,а + М2т,пI12 for all m>l. The product space (Hm@))" = {?=teI^„:?1eHm@)for all i=l,...,n} will be equipped with the following seminorm and norm: / n \l/2 / n \l/2 We shall make use of the following result from functional analysis: Theorem 1.1. The mapping v->v\r defined a priori for functions v regular on Q can be extended to a continuous linear mapping called the trace map of H\Q) into Ь2(Г). In other words there exists a constant C, depending only on Q, such that we have Nlo.r^C|N1>0 forallvsE\Q). A.4) It can be shown that on the kernel Hq(Q) of the trace mapping the correspondence v-*\v\i,n defines a norm equivalent to the norm i;->|M|ljfi; i.e. the Friedrichs- Poincare inequality holds on НЦО). The image of the trace mapping, denoted Я1/2(Г), is a Hilbert space with norm Шц2,г= inf Nkfi. A.5) {veHi(S}):v\r = il'} Thus for each vector q={qi)i^iin€(H1(Q)f the n traces qt\r are defined and belong to Ь\Г), and in particular, the linear combination ?-v = S"=1^|rV;, where v = (v.)is;i<B denotes the unit exterior normal vector to Q, is in L2(T). Imposing less regularity on each component qf, we can still define the normal trace as an element of a space of distributions on Г, cf. Theorem 1.2 below. First, though, we need to introduce two more spaces. The dual space of Н1/2(Г) is denoted Н~1,2(Г) and is a Hilbert space with norm Ml-i/2,r= sup <.Ц,Ф)Г> A.6) {^бН1'2(Г):»||,/2.г=1) where <\->r is the duality pairing between Н1/2(Г) and Н~1/2(Г). The space #(div; Q) is defined by H(div;Q) = L = (qiI^ne(L2(Q)T:divq=t ^L2(Q)\ A.7) and is a Hilbert space with norm WqWm^n^iMln+W^qWloY'2. A.8)
530 J.E. Roberts and J.-M. Thomas Chapter I Theorem 1.2. The mapping q^q • v defined a priori from (H1(Q)f into Ь2(Г) can be extended to a continuous linear mapping from #(div;0) onto Н~1/2(Г). Further we have the following characterization of the norm on Я~1/2(Г): 1И-1/2,Г = inf lkllH(div;0)- A-9) (?eff(div;fl):rv = /i) A demonstration of the first part of this theorem can be found in Темам [1977, Theorem 1.2, p. 9]. The characterization of the norm was given by Thomas [1977, Chapter I]. To avoid, whenever possible, working in this space Я~1/2(Г) which contains all the functions of Ь2(Г), we define, with the aid of Theorem 1.2, the space je{uiv;Q) = {qeH{div;Q):q-veL2(r)} A.10) which is a Hilbert space with norm k!l^,div;«)={klll(div^,+ k-v||g,r}1/2. AЛ1) We shall make use of the following version of Green's formula: (v div q + grad v • q) dx = \vq-vda for all veH\Q) and qe3tf(div;Q). Remark 1.1. The reader not familiar with the notation Я1 /2(Г) or Я " 1/2(Г) may find it somewhat surprising. It is natural in that these spaces can also be defined as Sobolev spaces of order s, for s not necessarily an integer. The study of such spaces is not within the scope of this work but may be found in Lions and Magenes [1968, Chapter I] or in Adams [1975, Chapter 7]. Other examples of functional spaces will be used in the following, and corresponding trace theorems will be stated. However, before ending this section we would like to give necessary and sufficient conditions for a locally regular scalar function to belong to #J@) and for a locally regular vector function to belong to H(div; Q). More precisely, let 9~h be a decomposition of Q into compact sets T,Q = {JTs#-h T, such that the interiors of the sets Г of 2Th are pairwise disjoint and such that the boundary 9 T of an element T of $~h is piece wise C1. For the sake of simplicity we shall make the following abuses of notation; we shall write Hm(T) for Hm{t\ H-IL.T for ||-L./, H(div;T) for H(div;t), |NI*<div:r> for || • Ижш^г,, etc. Theorem 1.3. Let 2Th be such a decomposition of Q,Q={JTBg-h T. A function veL2(Q), whose restriction v\T may be identified with a function vTeH1{T)for each Tef),, belongs to H\Q) if and only if for each interface T' = TlnT2 with Тг^еЗГ^ the
Section 2 Introduction 531 traces of vTi and of vTl on T" coincide: »Г|1г = 0г21г for all Г = Г!ПГ2 wirA 71,r2e^i. A.13) Similarly a function q e (L2@))", whose restriction q | r may be identified with a function qT e Jf(di\; T) for each Te$~k, belongs to Ж (div; Q) if and only if for each interface T'=TlnT2 with ГъГ2е,^,, the normal trace ofqTl coincides with the negative of that ofqTl: (9TrvTi)\r + {qT2'VT2)\r = 0 for all T'= Т^глТг with TltT2e&i,, (U4) where vT is the unit exterior normal vector of T. Clearly we have vr, = — vT2 on T' = Txr\T2 so that if v is any vector orthogonal to T we have ?Ti'v = 9r2'v on T' = T1nT2. 2. Conforming and equilibrium variational formulations for a model boundary value problem Many physical phenomena can be modelled by a system of first-order partial differential equations on a domain Q <= W of the form " du Pi= E aij-f- in й> l^isSn, B.1) where the vector function /» = (p,-)i«i<n satisfies the equilibrium relation div/?+/=0 infi, B.2) and the boundary conditions are of the form (p-v-g)+t(u-u) = 0 on Г, B.3) where Г is the boundary of Q; cf. Duvaut and Lions [1972, pp. 16-17] for example. For a review of such boundary conditions, cf. Dautray and Lions [1985, pp. 230-240]. The coefficients a{J of the system, defined on Q and the function t defined on Г are known functions dependent on the physical problem. The function / defined on Q and the functions g and u on Г are data functions. The function t is allowed to be infinite, t = + oo, on a part Г0 of Г thus implying that и = й on Г0. The steady state heat equation with convection terms neglected is an example of such a model; и denotes the temperature, p is the heat flow, and p • v, v a unit vector normal to Г, is the flux across Г. Linear models for elasticity problems involve an analogous system relating the displacement field u=(uiI%l^n and the stress field o=(aij)liijin. Hooke's law, which gives the stress field in terms of the linearized displacement field ejj(u)=j((dui/dxJ) + (duj/dxi)), is the analogue of B.1). For these examples corresponding to fundamental problems of engineering, we
532 J.E. Roberts and J.-M. Thomas Chapter I see that it is at least as important to calculate an approximation of p, or even the moments ofp • vr on Г or ofp ■ v on a surface contained in Q with normal vector v, as to calculate an approximation of и or of the trace u\r or of the trace of и on a surface lying in 0. On the other hand, from a strictly theoretical point of view, an apparently obvious approach would seem to be to begin by simplifying the study of system B.1), B.2) by eliminating the unknown p, thus considering и as the solution of the second order partial differential equation -J,^(^>/ta0- ,Z4) The classical methods for the numerical approximation of problem B.1)—B.3) are based on the following approach: one determines a function uh satisfying B.4) in a more or less weak sense—using for instance standard finite difference or finite element methods. Once the function щ has been calculated, if it is necessary to obtain an approximation ofp, the simplest procedure is to consider the function ph whose ith component, l^f^n, is given by (ph)i = Y,"=1aij(duh/dxj). This naive treatment has the inconvenience of producing a function ph which satisfies only in a very weak sense the equilibrium condition and for which the two normal traces ph • vr across an interface T'czQ do not agree. Experience shows that it is better, a priori, to choose a method which solves directly for the desired quantities rather than to try to obtain them in an indirect fashion as in the example above. Our objective here is to propose several such methods. We shall consider in this section, by way of introduction, the case in which the coefficient functions au are constant in Q, and the matrix (ay) is symmetric and positive definite. Thus the relations B.1) are invertible: —=lAijPj, Ui<„, B.5) UX; j=j and the inverse matrix (Atj) is also symmetric and positive-definite. For the moment we shall restrict our attention to the two following model problems: Homogeneous Dirichlet Problem. Given / in L2{Q), find peH(div;Q) and ueH^Q) satisfying B.1) and B.2). Homogeneous Neumann Problem. Given / in L2(Q) with $afdx = 0, find pe #0(div; £2) and ueH1(Q)/U satisfying B.1) and B.2), where for the sake of simplicity we suppose that Q is connected. Just as Hl(Q) denotes the kernel of the trace mapping of H1(Q) into Ьг{Г), H0(div; Q) denotes the kernel of the normal trace mapping of H(div; Q) into L2{T). A specific representative of the equivalence class of и in the quotient space H\Q)/U, for и a solution of the homogeneous Neumann problem, may be chosen by requiring further that a relation such as \nu dx = 0 or such as \г и dx = 0 be satisfied. (Using the standard abuse of notation we write и for an element of H1(Q) as well as for its equivalence class in H1(Q)/U.)
Section 2 Introduction 533 The solution (p, u) in the case of the homogeneous Dirichlet or the homogeneous Neumann problem may be characterized as the unique solution of a minimization problem. For the homogeneous Dirichlet problem put Wf = {qeH(div;Q):divq+f=0}, V=Hh(Q), B.6D) and for the homogeneous Neumann problem Wf = {qeHo(div;Q):divq+f=0}, V=H\Q)/U. B.6N) Then in either case (p, u) is the unique element of Ws x V satisfying B.1) or equivalent^ B.5). On Wf x V define the functional ™4^Нл£)С?,'и*-£)Ь ,2-7) я For each pair (q, v) in Wf x V we have %,»)=* with the equality being realized, as \_A~\ is positive-definite, only in case (q, v) satisfies the relation qt = S"= x aij{dv/dxj) for all i, i = 1,..., n; i.e. if (q, v) satisfies B.1). Thus the solution {p,u) of B.1)-B.2) with the homogeneous Dirichlet or the homogeneous Neumann condition is the unique element of Wf x V satisfying /(/7,u) = 0 B.8) or in other words the unique element minimizing / on Wf x V. This presentation was inspired by that in Barlow [1986]. With this characterization of the solution (p, u) of B.1)—B.3), it is natural to try to construct an approximation in the following manner. Let Vh be a finite-dimensional subspace of V and Wh a finite-dimensional, affine subspace of H(div; Q) contained in Wf. We consider the following problem: find (p, uh) satisfying {ph,uh)eW(xVh, I(Ph,Uh)= inf I{qh,vh). {qh.vh)eWfhxVh Problem B.9) has a unique solution (ph, uh); the uniqueness is an immediate consequence of the characterizations given by B.15) and B.16) below. Furthermore, the calculation of the value I(ph, uh) > 0 furnishes, a posteriori, a control of the error committed in the approximation. Formulation B.9) thus seems to provide a natural coupling between the approximations ph of p and uh of u. In fact it does not at all. To see this one has only to note that, with the choice B.6D) as well as with the choice B.6N), we have I(9,v) = S*(l) + ff(v\ B.10)
534 J.E. Roberts and J.-M. Thomas Chapter I with ff(v)=± Z аи—-—йх--\/ьйх, B.11) я /М=Ц Z ^y^dx. B.12) Hence if (/», u) is the solution of B.8), then и is characterized as being the unique solution of the minimization problem: find и satisfying ueV, /» = inf/'(»); B.13) veV and p as being that of the minimization problem: find p satisfying peW, ЛМ= inf/,(*)• ^2-14) Since l(p,u) = 0, we have the relation /;(«)+/tW=0 B.15) known in mechanics as the complementary energy principle. The solution {ph,uh) of B.9) is characterized with the aid of the following two minimization problems: find uh satisfying uheVh, ff(uh)= inf /'(»*); B.16) v„eVh find ph satisfying P*eWfh, /*(#>») = inf /„(ft). B.17) Since the approximate solution ил e J^ is an element of V, we say that uh is obtained by a conforming approximation method. Similarly, since the approximate solution ph e Wfh is an element of Wf we say that ph is obtained by an equilibrium approximation method. If / is the energy defined by / = //(м)= —/*(/»), cf. B.15), we remark that the conforming approximation permits us to calculate an approximation ff(uh) of / which is of necessity excessive, whereas the equilibrium approximation permits us to calculate an approximation — /*(p*) of/ which is necessarily too small: -/*(РьН- f*(P) = ?= /'(«) ^ /Чин)- B.18) Remark 2.1. The possibility to thus obtain an a posteriori bound on the energy / is what motivated the development of equilibrium approximation methods, cf. Fraeijs deVeubeke [1965, 1973, 1975], Fraeijs de Veubeke and Hogge [1972]. Remark 2.2. Let g be a function in Ь2(Г) satisfying \r g ds = 0. If (p, u) is a solution
Section 2 Introduction 535 of the system />=gradw in Q, divp = 0 in Q, p-v = g on Г, for Q a connected open subset of W, then as above one can verify that the function u, defined to within a constant, minimizes on H 1(Q)/U the functional /,(») = i |gradi;|2dx- gvda, B.19) whereas the vector function p minimizes the functional /*(*)=* \q\ 2dx B.20) on the affine manifold of vector functions q with div q=0 on Q and q • v=g on Г. If u minimizes /9(t>) then we have div(grad u) = 0; and if p minimizes fjq), we have rot p=0 since then p has a scalar potential. These principles are known in fluid mechanics as Dirichlet's principle and Kelvin's principle, respectively; cf. Serrin [1959, Section 24]. In the standard terminology of fluid mechanics, the vector variables representing the flow rate are denoted и or v, while the scalar variable is the scalar potential of the velocity field or the pressure and is generally denoted (p, ф, or p. Remark 2.3. We shall see later on, cf. Remark 3.1, that the two minimization problems B.13) and B.14) (or similarly B.19) and B.20)) are dual problems in the sense of the mathematical theory of duality in convex analysis. Usually B.13) is considered to be the primal problem and B.14) its dual. In keeping with standard practice we shall say that a formulation is primal, respectively dual, when the elimination of certain variables yields the conforming formulation B.13), respectively the equilibrium formulation B.14). To conclude this presentation of conforming and equilibrium approximations we give the variational formulations associated with these minimization problems: Conforming variational formulation. Find a solution u of ueV, B.21) " Г ди dv , ytj "Эх, Эх, n a fv dx for all veV,
536 J.E. Roberts and J.-M. Thomas Chapter I which leads one to seek an approximation uh of и satisfying B.22) ti j"^ ^dx = |^ dx for all vhe Vh. Q Q Equilibrium variational formulation. Find a solution p of peWf, B.23) n f* E ^-ijPj9idx = 0 for all qe W°, ft which leads one to seek an approximation ph of p satisfying B.24) £ АиР1чЧы dx = 0 for all qhe W%, ft where W°, respectively W°, is Wf, respectively W{, for f=0 and hence is a sub- space of H(div; Q) or #0(div; Q) depending on whether the problem considered is the homogeneous Dirichlet problem or the homogeneous Neumann problem. 3. First examples of mixed and hybrid formulations The actual construction of a solution/>,, of problem B.24) (equilibrium formulation) poses several practical difficulties. If 3~h is a triangulation of Q, Q= \JTe9-hT, and Wl is a finite dimensional subspace of W° associated with 3~h, then for a test function qh to belong to W° it must satisfy the following two constraints: (i) The normal traces of qh must be "continuous across the interface" between any two abutting finite elements Tx and T2 of &~h, i.e. qh-vTi and — qh-vT2 must coincide on V = Ту пТ2 (cf. the second part of Theorem 1.3). (ii) Within each finite element T of 3~h the divergence of qh must vanish; i.e. div(^|r) = 0 for all Te$~h; thus, in particular JBr qh• vT da = 0. We do not intend to attack head on the practical problem of constructing a basis for the subspace Wl. To circumvent this difficulty we shall free ourselves of one of the constraints that should be satisfied by the test functions by using the technique of Lagrangian multipliers. 3.1. The dual mixed formulation The affine subspace defined by B.6D) in the case of the homogeneous Dirichlet problem and by B.6N) in the case of the homogeneous Neumann problem may be
Section 3 Introduction 537 characterized as being the set of functions qe ^satisfying the constraint div q + f=0 in Q, where W is the space defined by W=H(div;Q), W=H0(div;Q). C.1D) C.1N) Thus we are led to substitute for the minimization problem B.14) the saddle point problem of the Lagrangian if (•, •) defined on WxL2(Q) by C.2) ^* (»>») = /*(?) + (div q + f)v dx. Thus we seek (p, u)eWxL2(Q) satisfying ^*0».v)<&*(P>"K^*fo") for a11 (?>v)eWxL2{Q). The solution is characterized as being the solution of the system {p,u)eWxL2{Q), C.3) I AijpJqidx + и div q dx = 0 for all qe W, C.4) J n (div p)v dx = — \fv dx for all veL2 (Q). If (p, u) satisfies C.4) it is clear that p satisfies B.23). Denote, for the moment, by й the solution of B.21). Using Green's formula together with relation B.5), we see that I Аир^йх-- (gradu)-^dx й div q dx for all qeW. Hence the Lagrange multiplier associated with the constraint div q=f is none other than u = u. C.5) 3.2. The primal mixed formulation In virtue of C.5), the solution (p, u) actually belongs to W x V. Using Green's
538 J.E. Roberts and J.-M. Thomas formula we establish that the solution (p, u) satisfies Chapter I Z Aijpjqtdx- (grad u) • q dx = 0 for all qeW, — /rgradudx = — i/udx for all ve V. a a As W is dense in (L2@))", the pair (p, u) is the solution of the system (p,u)e(L2(Q))nxV, Z \AijPAdx- ij=l J (grad u)• q dx = 0 for all qe(L2(Q)f, p • grad v dx = — !/Ъ dx for all ue К C.6) This formulation, C.6), can be interpreted as being a characterization of the saddle point of the functional &f{-,-) defined on (L2(Q)f x V by Ji?% v) = Д (?) - U • grad и dx + \fv dx, C.7) where /* is the continuous extension of /* to the space (L2{Q))n. This functional can be found in the literature written in the form SCf(q,v)=-i £ lAijqjqidx + Z I Z ^y«j -Q-!<?,■dx + I fv dx -]fV{ C.7) The elimination of p from formulation C.6) gives us again the conforming variational formulation B.21) of the problem. This type of method is analyzed in Babuska, Oden and Lee [1977]. Such methods have proved useful in theoretical studies such as the development of new models for nonlinear elasticity, cf. Ciarlet and Destuynder [1979a, 1979b], Destuynder [1986]. Remark 3.1. We have on the one hand Aq), ilqeWf, sup gf(q,v) = V u . vev (. + 00, otherwise,
Section 3 Introduction 539 and on the other inf J2?%»)=-i £ »e^ i,j=i We deduce that dv dv , OX; OX; /t>dx. inf sup &'(q, v) = inf Л (q) = /„ (/») ,eff veV qeWf and that sup inf £ff(q,v) = -inf ff(v)=-ff{u). veV qeW veV Then using the complementary energy principle B.15), we obtain inf f*(q)= inf sup if J(?,t)) = sup inf sef{q, v)=- inf /'(»). C.8) In this form we recognize the minimization problems B.13) and B.14) as dual problems in the sense of optimization theory, cf. for example Ciarlet [1982, p. 222]. 3.3. The dual hybrid formulation To a decomposition 2Th of Q, Q = \JTesrh T, is associated the affine subspace Yf of (L2(Q))" defined by Y* = {qe(L2(Q))n:4Terh, div(q\T) + f\T=0, C.9) in the distributional sense}. In other words, Yf is the set of vector functions qe{L2(Q)f satisfying for each Te&~h £-gradt>rdx= |/bTdx for all vTeHo(T). For each Te&~h, the restriction q\T of the function qe Yf to T belongs to the space H(div; T). Further, the normal trace q\T'VT is defined, cf. Theorem 1.2, to be an element of H~ 1/2FГ), the dual space of Я1/2(ЭГ); we denote by < •, • >ar this duality. We ascertain that the affine subspace Wf may be characterized as being the set of functions qe Yf satisfying the constraint Z (я\т'гТ' у>бг=0 for all re К TsSrh This leads us to associate to the decomposition 3~h of Q the Lagrangian ^(v) defined on Yf x V by #*(».«>)=/*(*)- I <*lT-vr,i;>8r- C-10) TsSTh
540 J.E. Roberts and J.-M. Thomas Chapter I (The choice of 2%{q, v) as given in C.10) instead of as A(*)+ £ <9\t-vt,v)st is made simply to obtain C.13) below.) Thus we seek an element (p,u) e Yf x V satisfying ®*(Л^^(лн)<3»(?,и) for all (q,v)eYf x V. C.11) A pair (p, u) is such an element if and only if it is a solution of the system {P,u)eYfxV, I Aijpjqidx+ £ <?lr'Vr,">8T=0 forall?eY°, C.12) i,J=l Ге^ь £ </j|r,vr,u>ST=0, for all у е F, where Y° is Fr for/sO. If (p,u) is a solution of problem C.12), then p is the solution of B.22). It is clear that и is not uniquely defined as an element of V; only its traces on the interfaces are determined by the first equation of C.12). However, if u is the solution of B.20), we can choose и = й. C.13) Remark 3.2. In fact with the dual hybrid formulation the Lagrangian multiplier should not be considered as a function in V defined on all Q but as an element of the product space ПТе*гИН112(дТ) satisfying certain matching conditions at the interfaces. This will be the point of view taken for the analysis of the method given in Section 5. A variation of considerable practical interest consists of taking the multiplier to be an element of UTe3rhL2(QT) satisfying again certain matching conditions at the interfaces. 3.4. The primal hybrid formulation To a decomposition 2Th of Q we associate in a manner similar to the above a subspace X of L2(Q) defined by X = {veL2(Q):VTe^h, t^sH^T)}. C.14) The space I7 may be characterized as being the space of functions veX satisfying the constraint Y, <?'v|r, vT}ST =0 for all qe W. Thus we replace the minimization problem B.13) by the saddle point problem for the Lagrangian 3)s{',') defined onlxWby &(v,q) = ?*{v)- £ <«fv|r,i»r>er, C.15)
Section 4 Introduction 541 where ff is the extension of /f to X defined as follows: /»=iZ([la,^^dx)-Udx. Te^t,\J i,j=l UXJ UXi / J T ft Thus we seek a solution (p, u) of the system (p,u)eWxX, I C.16) Te^„ /bdx foralli>eX, C.17) £ <?*vT,ur>er=0 for all ^eW. The solution (/>, ы) of problem C.17) is such that и is a solution of problem B.21) and if/ is the solution of B.23) we may choose as indeed only the normal traces of p on the interfaces are determined by C.17). Remark 3.3. The hybrid formulations have been presented here as subordinate to a decomposition STh of Q which will, of course, coincide with the triangulation of Q by elements Ton which will be constructed approximations of functions. One can, however, generalize this type of hybrid formulation to one subordinate to an arbitrary decomposition of Q into subdomains. A particular case is that for which the decomposition is the simplest possible, a decomposition into a single "subdomain". In this case the primal hybrid formulation of the homogeneous Dirichlet problem becomes: find a saddle point (и,Х)еН1@)хН'1/2(Г) of the Lagrangian £?*(•, •) definedby&f(v,fi) = f-f{v)-<fi,vyrforall(v,!i)eH1(Q)xH-ll2(r).Itwasforthis type of formulation, adapted to a nonhomogeneous Dirichlet problem, that the first mathematical analysis of a finite element method with Lagrangian multipliers was given, cf. BabuSka [1973]. 4. General orientation The elementary examples of mixed formulations and of hybrid formulations presented in Section 3 may all be written in the following general form: find a pair (q>, I) satisfying (peW, XeM, а(ср,ф) + Ь(ф,Л) = Дф) Ь(<р,/1) = д(ц) for all фе W, for all fieM, D.1) where W and M are Hilbert spaces, a(v) is a symmetric bilinear form on WxW, b( •, ■) is a bilinear form on Wx M,f is a linear form on W, and g is a linear form on
542 J.E. Roberts and J.-M. Thomas Chapter I. M. Then given a finite-dimensional subspace Wh of W and a finite-dimensional subspace Mh of M we consider the problem: find a pair (q>h, kh) satisfying q>heWh, A„eMfcs a(<p*,^) + W*A)«/№*) for all ^e^, D.2) Ь(<Рл>/*а)=0(р*) for all ^еМ„. This finite-dimensional problem may then be posed in the obvious manner as a problem of solving a linear system of Nh equations in Nh unknowns, where iV„ = dim W„+dimM„. D.3) The matrix of this system is symmetric but cannot be positive-definite. Thus the first problem is to find sufficient conditions, bearing on the pair {Wh,Mh) of subspaces, for the matrix of the associated linear system to be invertible. In a more general way Chapter III treats the numerical analysis of this type of problem in an abstract setting. The a priori estimates of the error committed in approximating the solution (ф, A) of D.1) by the solution {q>h,Xh) of D.2) are obtained as a function of the error of approximation of (<p, A) in Wh x Mh. In the examples given in Section 3, the spaces И'and M appearing in the mixed or hybrid formulations are either L2(Q), Hl(Q), or closed subspaces of these spaces for the space of scalar functions and either (L2(Q))B, H(div; Q), or closed subspaces of these spaces for the space of vector functions. Chapter II contains the major results concerning interpolation by finite elements of scalar functions in H1(Q) and of vector functions in H{di\;Q). Chapters II and III are independent. Both are fundamental for the analysis of the examples developed in the subsequent chapters. Chapters IV and V treat essentially the approximation of the model problem governed by the first-order system of partial differential equations £ 8m , • Pi = L %^Г> U»^n> D-4) j=i axj div/» + / = 0, D.5) posed on a bounded, open domain Qcl" with и = 2 or n = 3 with boundary Г piece wise C1. The coefficients atJ are supposed to be bounded and measurable in Q and to satisfy the uniform ellipticity hypothesis n X ац(х)^>«р\^\2 a.e. onfi for all £еМ", D.6) for some ap>0. Unlike in Sections 2 and 3, we shall not suppose the symmetry of [a], i.e. that ац = ап. Also we shall not maintain throughout the following the notation particular to Sections 2 and 3. With the above hypotheses concerning the coefficients cy the matrix [c] may be inverted to obtain — = ZAijPj, 1«:;<и; D.7) OX; j=1
Section 4 Introduction 543 the coefficients Au are then bounded and measurable in Q and satisfy the uniform ellipticity hypothesis n I i4y(x)&u>«,|£|2 a.e. on О foralUeR", D.8) w=i for some ad>0. Chapter IV is concerned with mixed methods. Primal mixed methods are treated rather rapidly, the dual mixed methods in some more detail. Several algorithms for the solution of the linear system are also given here. In Chapter V hybrid methods are studied. Primal hybrid and dual hybrid methods as well as the method of hybridization of mixed methods are included. In Chapter VI some further examples of mixed methods and of hybrid methods are mentioned, and some extensions and variants of the theory in the literature are cited.
Chapter II Finite Element Type Interpolation of Scalar and Vectorial Functions 5. Lagrangian interpolation: An H1 approximation of scalar functions In the examples of mixed and hybrid formulations presented in Section 3, an essential role is played by two Hilbert spaces: the space H1(Q) of scalar functions veL2(Q) such that gradueCL^O))'1 and the space H(div;0) of vector functions qe(L2(Q))n such that div qeL2(Q). In this chapter we shall describe several choices of "finite element type" subspaces of Hl(Q) and of H(div; Q). For this general presentation we shall use the most elementary setting for finite element methods (some generalizations will be presented in Section 7): (i) Q will denote a bounded, open, polyhedral subset of W, n = 2 or 3; (ii) 3~h will be a triangulation of Q by n-simplexes T of diameter no greater than h (T is a triangle for n = 2, a tetrahedron for n = 3), with Q= \J T. E,1) TeSTh For the definition of a triangulation, cf. Ciarlet [1978, p. 38] for example. The boundary of an n-simplex T, i.e. the union of its n +1 (n — l)-dimensional faces, will be denoted by ST, and in a general manner we shall use the symbol 7" to represent an arbitrary (и — l)-dimensional face of T and we shall write T" < Г; Т will be an edge of T when n = 2, a triangular face when n = 3. We shall also use the following notation: mes T = the Euclidian measure of T in R" (geometric area if n = 2, geometric volume if n = 3), hT = the diameter of T, which here, with a triangulation by simplexes, ,<. _. is just the length of the longest edge, pT = the radius of the circle inscribed in T if n = 2, of the sphere inscribed in T if n = 3, and /i=max/iT. E.3) TzSTh Let Ж be a set of positive numbers. A family of triangulations {^~h: НеЖ) will 545
546 J.E. Roberts and J.-M. Thomas Chapter II said to be regular if inf й=0, and inf min pT/hT>0. E.4) This definition is equivalent to that given by Ciarlet [1978, p. 124]. We shall say that the family {&~h\ кеЖ] is uniformly regular if inf /j = 0, and inf {(minpr)/(max/jT)}>0. E.5) ЫЖ ЫЖ ТеУь TeS^h Thus a family of triangulations {2Th: кеЖ) is uniformly regular if and only if it is regular and it satisfies the inverse hypothesis: there exists a constant C, independent of h, such that for each he Ж min hT^Ch, E.6) Terh cf. Ciarlet [1978, p. 140]. Remark 5.1. The regularity hypothesis E.4) is quite natural for the theory of interpolation by finite elements. However, in the chapters that follow we shall be rather abusive of the inverse hypothesis E.6), using it often just to simplify the exposition of certain results. For each natural number k, we denote by Pk(T) the space of restrictions to Г of polynomial functions of degree к in n variables; we have .. _,_ (n + k)l J±(*+lXk + 2), ifn = 2, .... dim^(r)==-^T=U(fe-fiX^2)(fc+3), if„=3. EЛ) To an n simplex Г and a positive integer к is associated E%\ the principal lattice of order к of the triangle T, i.e. the set of points of T each of whose barycentric coordinates, relative to the vertices of T, is a multiple of 1/fc. We have card2#>=^±7^. E.8) n\k\ The Lagrangian interpolant JSf^i? on T of order k,k^l, of a function veC°(T) is by definition the unique function in Pk{T) such that £fv{a) = v{a) for all aelf. E.9) Given a triangulation STh of Q and a function veC°(Q), for a positive integer k, we shall denote by JOf^v the function on Q which coincides with the polynomial function if'rVlr) on T for eacn simplex Te^h. One says that &fh is the Lagrangian interpolant of order к on STh of the function v. It is essential to remark that if^'i; is, for each positive integer k, a continuous function on Q. Thus one can easily verify that the mapping jSfj» is surjective from C°(Q) onto the space Lj,fc) of simplicial Lagrangian interpolants defined by Lf> = {veC0(Q): 4Te2Th, v\TePk(T)}. E.10) For each k, k^l, the space Lifc) is a finite-dimensional subspace of H1(Q). In
Section 5 Finite element type interpolation 547 dimensions n-2 and n-3 one may define S£{jh for veH2{T)czC0(T) but not for иеЯ1(Т).1 With the aid of the following notation, given for each positive integer m, Hm{^h) = {veL2(Q):\/Te^h,v\TeHm(T)}, E.11) one may characterize Lj,*' as being the fixed point space of the operator jS?ift) on W = {veH1{Q)nH2\2ThY v = S£fv). E.12) Thus we have: Theorem 5.1. For each positive integer к and each triangulation 2Th ofQ, the mapping Sef defines a projection of H^Q^H1^) onto L?>. We also have: Theorem 5.2. Suppose we are given a regular family {3~h: he Ж} oftriangulations ofQ. For each positive integer k, there exists a constant C, independent of h, such that l»-^4.n + *~>-^?4lo.n<Cfc"|»L + i.Q, for each veHm+l{Q), l^m^k. We shall not give here the demonstrations of these results fundamental for the study of the convergence of finite element methods but refer the reader to Raviart and Thomas [1983, pp. 79-103], for example, for the details of the proofs. We would like now to make precise the idea of interpolation of the trace on the set d^~h = \JTsSrhbT of a scalar function v continuous on Q. For a triangulation &~h of Q and a positive integer k, the space Lf}h is defined by U% = <Ф = (ФтWe П Ь2(ЭГ): 3i>eL?> s.t. VTe^, i>|BT = ^Tl. E.14) (. Teffi, J The space L$)-h may also be characterized as being the space to which ф — (Фт)т€Ун belongs if and only if it satisfies the following properties: фтеРк(дТ) for each Те3Th, E.15a) Фтх —Фтг on T' whenever T" is a face common to the simplexes Tx and T2 of $~h, where Pk{bT) is defined to be the space of restrictions to the boundary ST of T of polynomials of degree not greater than к in n variables or equivalently the space of functions in n variables, polynomial of degree not greater than к on each face T of Г and continuous on dT. One may easily verify that dm,P,p7> jdimn(r)_dimiVn_i(n if fc>„. E-16) 'We have adopted the notational convention Hm(A) = Hm(A), A the interior of A.
548 J.E. Roberts and J.-M. Thomas Chapter II Let S'tln be the mapping which associates to an element ф = (|Аг)ге^,еПГб^С°(ЗГ) the element (if^ Фт)т£гк, where S^-фт, the feth-order Lagrangian interpolant of фт, is the function coinciding with Уг(Фт\г)on each face T" of T. As the elements of Lf are of class C° (cf. Ciarlet [1978, p. 95] for example), for each function veHl(Q)nH2{b^h\ we have &$Ыът) = &УШ)\ът for all ТеГ„, or in other words, if we put <A(,) = (^lar)rejrh, E.17) then we have ^>W=((^M|BrW,,. E.18) Theorem 5.3. For each positive integer к and each triangulation 3f~h of Q, the mapping v-+H?fph фм, with <//(„, given by E.17), is a surjection of H1(Q)r\H2'(S~h) onto bflh. It is a bijection of bf] onto L*h only in case k^n. Proof. In light of Theorem 5.1, it suffices to show that the mapping t>-*jS? *„ Фю is a surjection of LJ,fc) onto L^ and is also a bijection only if к ^ n. For к < n, the claim follows immediately from the facts that according to E.16) the space Pk(T) and РкфТ) have the same dimension and that for each ф, ф = {фт)те!Ги> belonging to L(sj-h, there exists a unique function v in L|f' whose restriction to the boundary of Г is equal to фт for each T in 3~h. For к > п, the dimension of Pk{T) is greater than that of Pk(8T), so a bijection from L{^ onto Lj^j, cannot exist. To show surjectivity in this case, we have only to remark that given ф = {фт)тезгиеЬмги, the function vebf\ for example, defined by its values at the lattice points а в Угел,^?' as follows: , ч \Фт{а\ if aedT for some Te&~h, via)= <л @, otherwise, defines an antecedent of i/'. П Before announcing an error bound for interpolation in L^, we would like to define a norm on the space ПГе5гьЯ1/2(8Г) which will be seen to be well-adapted to the analysis of dual hybrid methods. Thus, given a triangulation 3~h of й, we put INIIi.r = {N?.r + b-2INI§.7-}1/2 for all vgННП Te^h, for all феН1/2(дТ), ТеГь, E.19) for all ф=(фт)Те^е П Я^ЭГ). lll>Allli/2,ar = IIMIIl/2lMi,= j may now state: inf Hlvlllx ;H1(T):«|BT=*} Z lll<WII?,2,ar] ,T 1/2 Theorem 5.4. Suppose {2Гк; he Ж) is a regular family oftriangulations of Q. For
Section 6 Finite element type interpolation 549 each positive integer k, there exists a constant C, independent of h, such that if veHm+1(Q), l^m^fe, and ф = ф(„) is determined by E.17), then №-&&M\ii2fi*<chr\v\m+1<B. Proof. With the norms defined in E.19) we have so the demonstration is completed on applying Theorem 5.2. □ 6. Interpolation of the moments of a vectorial function: An #(div) approximation of vector valued functions We retain here the assumptions made in Section 5 concerning the domain Q с U", n = 2 or 3, and the triangulation STh, Q = []те^к T. Given a positive integer k, we shall associate to each и-simplex Te$~h a subspace Dk{T) of the product space (Pk(T))n and an interpolation operator Sf defined on a subspace Ж(div; T) of H(div; T) and having values in the space Dk(T). Denoting by Ж(йщ^'11) the space denned by Jf(div; fh) = {qe{L2{Q)T: \/ТеГ„, ?|re Jf (div; Г)}, F.1) we can associate to each vector function qe Ж (div; S~h) the function $Рде{Ь2@))" determined by D*)*)lr = Aw(*lr) ^г all TeSTh. F.2) The spaces Dk(T) and the operators S^ will be constructed in such a way that the following two hypotheses are satisfied div ?=0 => div((f?V) = 0 for all Те 9~h, and for all qe Ж (div; T), F'3^ and ^qeH(div; Q) for all ?e#(div; 0)nJf(div; «TJ. F.4) Property F.3) is a local property of the interpolation of vector functions in Jf(div; Г). Its analogue for the approximation of scalar functions in Hl{T) is that the interpolant of a function whose gradient vanishes also has a trivial gradient; i.e., the interpolant of a constant function is a constant. However we did not think it necessary to state such a natural requirement. Property F.4) is a continuity condition for the interpolant of a vector function having a certain amount of regularity on Q. Here of course the scalar analogue, that the interpolant jSP^d be in HJ(i2) whenever veH1(Q)nJf2(JJ, is satisfied as the space Lf* is a subspace of C°(Q). Remark 6.1. Almost reflexively, a mathematician's first idea is to try to reduce the study of interpolation of vector functions to that of scalar functions as given in the
550 J.E. Roberts and J.-M. Thomas Chapter II preceding section by choosing Dk(T) = (Pk(TV, jf(div,T)=(H2(T)r, £f={<eff>n (fcth-order Lagrangian interpolation on T carried out component by component). One may check that condition F.4) is satisfied; however, F.3) is not. To see this consider in two dimensions, on the reference triangle Г with vertices @,0), A,0) and @,1), the function q=(q1,q2) given by qi(xl,x2) = 2xi ~xxx2, q2(X1,X2) = 2X2 — X1X2. This function q is divergence-free. Yet its Lagrangian interpolant in (P^T)J is none other than the function x-»(^xx, \x2) in t whose divergence is the constant function x-»l. We shall give two families of examples of H(div) approximation; the first examples, the original examples due to Thomas [1977, Chapter III], cf. also Raviart and Thomas [1977] for the two-dimensional case and Nedelec [1980] for the three-dimensional case, we present in some detail; the second examples due to Brezzi, Douglas and Marini [1985] in the two-dimensional case, and to Brezzi, Douglas, Duran and Fortin [1987] in the three-dimensional case will be described more briefly. Yet another family of examples of #(div) approximation may be found in Nedelec [1986]. For each positive integer к we put ^ = (^-1)"©*^-! F-5) and define Dk(T) to be the space of restrictions to T of the functions of Dk. In other words, the vector function q=(q1,..., qn) belongs to Dk{T) if and only if there exists и+ 1 scalar polynomial functions qfePk_1(T), i = 0, l,...,n, such that q((x) = qf(x) + xtq$(x) for all x e T, i — 1,..., n. F.6) The expression F.6) for q may be made unique by further requiring that q% be homogeneous of degree fc —1. Thus we have dimD»(r) = (n + l) dim Pk_1(T)-dim Pk_2(T), F.7) which together with E.7) gives us or more specifically ,. jk(k + 2), if n = 2, dimZ)'(r) = W + l)(/c + 3), if„ = 3.
Section 6 Finite element type interpolation 551 For each nonnegative integer / we denote by J?i(T) the space of linear forms on {C°(T))n defined by q-rdx e i?((C°(T))n, R):re(P,)" к F.8) Ml{T) = i.e. an element olJijJ) assigns to the vector function q on Г one of its moments on T of order no greater than /. In keeping with classical notation we shall write Jfl(T) = {Q) for / a negative integer, in particular for /= — 1. Similarly for each nonnegative integer I and each face 7" of T we denote by Jf^T') the space of linear forms on (C°(T)f defined by JtiT) = \\q q-vTwda 6if((C°(r))",IR):weP,K F.9) i" where vT is the unit exterior normal to 57; constant on each face 7" of T. Thus an element of Ji0") assigns to the vector function q on Г one of the moments on 7" of order no greater than I of its exterior normal trace q-vT. Now we define, for к a positive integer, MfJ U л XT) \г<т к l )\uJ?k_2(T), F.10) where T'<T means 7" is an (n — l)-dimensional face of T. Without change of notation we shall consider henceforth that the forms in М'ф are defined on the space Jf (div; T), Jf(div; T) = {qeH(di\;T):q-vTeL2(dT)}. F.11) The dimension of the space Jt(j* is given by dim^r?) = (n + l)dimPk_1(R"-1) + ndimPJt_2(T), or more explicitly, by *-"-<"+ч(£щгОг <6Л2) Thus we see that dimDt(T) = dim^), fc=l,2,.... F.13) Theorem 6.1. Let T be ann simplex and к a positive integer. For each qe34?(di\; T) there exists one and only one function, which we shall denote S^q, in Dk(T) for which m(Sfq) = m(q) for all me Jt%\ F.14) The mapping <f x': ,?f (div; T)^Dk(T) thus defined further satisfies the commuting
552 J.E. Roberts and J.-M. Thomas Chapter II diagram property Jf (div; T) Dk(T) div div L2(T) л(Г" Pk-i(T) i.e. div(<??V) = Дг J (div q) for all q e jf (div; Г), where П^~1) denotes the orthogonal projection in L2(T) onto Ffc„i(T). F.15) Proof. Suppose for the moment that the operator iP from Jf(div; T) to Dfc(T) satisfying F.14) is well defined and let qeJf(di\; T). Then it is clear that the divergence of any element of Dk{T), and in particular of iP q, is a polynomial of degree not exceeding k — \. Thus using Green's formula with F.14) we have wdiv(<?(fV)dx = wdiv^dx for all wePt_!(T) or equivalently F.15). To show that, given qs Ж (div; Г), F.14) does indeed determine a unique element of Dk{T), in view of F.13) it suffices to show that ifpeDk(T) and m(p) = 0 for each m e J(P, thenp must be trivial. It is easy to check that the normal trace on any face T" of T of any element of Dk(T) is a polynomial of degree at most fc— 1. Thus, the normal trace of p on T' must be trivial. The divergence of p, a priori a polynomial of degree at most fc — 1 on T, can then be shown to vanish by using Green's formula. It follows that/» belongs to the subspace (Р^-^Г))" of Dk(T); to see this simply apply Euler's identity for homogeneous functions: x - grad /(*) = а/(дг) whenever / is homogeneous of degree a. Next we show that if v' is a vector normal to a face T' of Г that p • v' vanishes on T. As p • v' vanishes on T", we have that p • v' = hw, where b is the first degree polynomial whose zeroes form the (n— l)-dimensional hyperplane containing 7", and wePk-2- Now wv'e(Pk-2T and we have Ww dx = p'(wv')dx = 0. As fr is either always positive or always negative on T, w and hence p • v' vanish on Г. The theorem now follows on noting that for any n of the и +1 faces of I| the corresponding normal vectors form a basis for R". □ For each Те &~h we have now constructed an interpolation operator Sr from
Section 6 Finite element type interpolation 553 ^f(div; Г) into Dk(T)c(Pk{T))" satisfying F.15) and a fortiori F.3). To a triangula- tion STh of Q we associate the space Jf(div; 3~h) defined by F.1) together with F.11): jr(div;*i) = {qe(L2(Q))n:4Te$'h,q\TeH(div; T) and q\T-vTeL2(dT)}, F.16) and the equilibrium interpolation operator S^ of order к on Ж (div; ^) constructed from the operators iff' with the aid of F.2). To see that S^ satisfies F.4), we note that if q is sufficiently regular in the sense that q e H(di\;Q) n Jtidiv;^) then Theorem 1.3 implies 9\TrvTl+q\T2'vTl = 0 on Г for all Г = rxnT2 with Т1гТ2е&1,. ' Thus, for each T = Тхс\Тг with TuT2eFh,r=efq satisfies (r|ri-vri+r|r2-vr2)wds = 0 for all wePt-j. 7" The restrictions r|re.Dk(T), Te^h, are polynomial of degree at most к and their normal traces on 9 Г are piecewise polynomial of degree at most k — 1; indeed, on each face T of T, Te3~h, x-vr is constant. With this observation of central importance we see that r satisfies the continuity condition F.17) and hence r=4fc)?e#(div;£2); i.e. F.4) holds. We denote by Ef} the finite-dimensional space of equilibrium interpolants Ef = {qeH{&v;Q):4Te3rh,q\TeDk(T)}, F.18) which, as we have just seen, is the space of functions q e (L2(Q))n for which the restriction q\T may be identified with a function in Dk(T) for each Te3~h and for which F.17) holds. In analogy with E.12), Ef] may also be characterized as a space of fixed points as follows: £f> = {^6H(div;fi)nJf(div;^): <«(?) = «}. F.18') Thus we have: Theorem 6.2. For each positive integer k, and each triangulation 2Th ofQ, the mapping (f^ is a projection from H(div;f2)nJf(div;^,) onto E%\ The next theorem gives a bound for the error of interpolation by <f jf1: Theorem 6.3. Suppose {&~h: heЖ} is a regular family of triangulations of Q and к is a positive integer. There exists a constant C, independent ofh, such that II *-*?>* II o.0<Cfc'M,0 for all qe{Hl(Q))n, l^l^k, '
554 J.E. Roberts and J.-M. Thomas Chapter II and |div(»-<V)||o.D<Cfc/|div»|I,0 for all qe(H Щ)" with div q e H\Q), 0 ^ / *S k. F.20) Proof. Estimate F.20) is a classical corollary of property F.15) identifying div(<?£>0) as the image under the orthogonal L2 projection onto Lf} of div q, and we shall not give the details of its demonstration. The argument justifying F.19) is more technical: for each Те 9~h we must find a bound for the quantity \q-£VqU,T = Z II €,-№)« I 2 0,Г 1/2 however, this cannot be done component by component as the interpolation space is not a simple tensor product of n copies of the same space of polynomials. Thus it is necessary to work directly with the subspaces of vector functions of и variables with values in R". We employ the technique of reducing the question of the error bound on Г to that on the reference triangle Tas in Ciarlet and Raviart [1972]. For each TeJj, there exists an affine transformation F: f->T=F(T), bijective from Tonto T. We denote by DjF the linear tangent mapping and by J the Jacobian of the transformation. We have here |J|=(mes T)/(mes t). To each scalar function veHl(f), classically is associated the function veHl(T\ defined by v(x) — v(x) for each x = F(x) e T. F.21) Here, to each vector function q e H(div; t) we associate the function q, that we shall show belongs to H(div; T), denned by q{x) — — DF q(x) for each x = F(x) e T. Ml The transformation F.22) was constructed in such a way that q • grad v dx = q- grad v dx. т i Using Green's formula we deduce, cf. Thomas [1976, Lemma 5.1], that v div q dx = t div q dx, F.22) F.23) F.24) and we have, with v denoting of course the exterior unit normal of dT, vq-vda = vq • v da, F.25) for all sufficiently regular functions v and q. Thus q^>q is a bijective affine
Section 6 Finite element type interpolation 555 transformation from H(div; t) onto H(div; T) such that (mes T)div q=(mes f )div q F.26) and on each face Г = F(T") of T (mes T')q • v=(mes f ')£ • v. F.27) Furthermore, it is immediately verified that this transformation takes Dk(T) onto D*(T): qeDk(T) о qeDk(T). F.28) We conclude from F.26), F.27) and F.28) that the interpolant in Dk{T) of a function q, image of the function q defined on t, is none other than the image of the interpolant in Dk(T) of q: {ifqYF^DF^q). F.29) This property allows us to reduce the problem of obtaining an error bound for interpolation on Г to that of obtaining an error bound for interpolation on f, the reference n-simplex. As the subspace (Pk-i)" of Dk is invariant under the interpolation operator, we know that there exists a constant С depending only on Г and к such that \\q~£fq\\0,t^C\q\ltf for all qe(H'(f)f with 1 <J^fc With this bound and change of variable formulas we obtain for each T in 3Th a local bound for \q—S(rq\o,T and then the global bound F.19). □ Remark 6.2. The introduction of the spaces Dk is fundamental for our construction of an H(div;£2) approximation. We have the strict inclusions (f|,-i)"c^icft)"> and we have chosen to index by к the space Dk strictly between (Pk-i)" and (Pk)" though in the literature one also finds this same space indexed by k— 1. Our choice was made with consideration for the error bounds. As Lagrangian interpolation, J? Jf}, by functions locally in Pk leads to error bounds in Hl(£i) of order hk, so the interpolation if\ with ё as in equilibrium, by functions locally in Dk leads to an error in H(div; Q) of order hk. Remark 6.3. The transformation q-+q introduced during the demonstration of Theorem 6.3 in F.22) is none other than the transformation known in mechanics as the Piola transformation of a vector field defined on a manifold of R", cf. for example Marsden and Hughes [1983, pp. 116-118]. Next we present a few results concerning the approximation of the normal traces on the set S5"A = (Jr6^h 9Г of sufficiently regular vector functions q, for example qe(H1(Q)f. Given a triangulation STh of U and a positive integer k, we denote by £j$-h the space defined by
556 J.E. Roberts and J.-M. Thomas Chapter II <*> -),,-(„ \ _сП r2fflTva„.p<*i, Е&,=<Ц=(к)те*,е П L2(dTy.3qeErs.t.VTe3rh,q\BT'vT~iiT\, F.30) where £* is the space defined by F.18). The space Efj-h may also be characterized as the space of functions /л=(цт)Те^ъ satisfying the following two properties: liTeD$T) for each Tefh, F.31a) MTl+^r2==0 on each T, T = 7^r\T2 with TuT2efh, F.31b) where by Dk(dT) we mean the space Dk{dT) = {iieL2{bT):lqeDk{T),q-yT = ti}, F.32) or equivalently the space of functions polynomial of degree at most Ic-lon each face T" of T with no continuity constraint at the intersection of two faces. Thus we have dimfl'(sr)-'"+"(„-w-i)r F-331 Let itlh be the operator which associates to each element \i=(цт)тегъ sTlTeSrh L2(8T) the element (#ат^г)ге^, where Sfj is the orthogonal projection from L2FT) onto DkET), Te!Th. Associating to each vector function qeJ^(div; &~h)nH(div;Q) the element ^д)Нч^т)т^, F.34) we clearly have 41Ы = ((Л) • vr)r6*. e 4*A • F-35) Hence we deduce: Theorem 6.4. For each positive integer к and each triangulation ^~h ofQ, the mapping q-*&drh/*(»>> with Mw given by F.34), is a surjection of H(di-v; Q)nJ^{div;^~h) onto E(^h, It is a bijection of El® onto El^h only in case fc= 1. Here we shall give only one result concerning error bounds for interpolation in Efsrh. However, beforehand, in a manner analogous to E.19), we define several norms that we shall find useful. Given a triangulation 3~h of Q, we put llklllH(div;r, = {ll?llo,r + ^l|div?||lr}1/2 forallf6tf(div;r),re.rfc, IIMII-i/2.er= inf lllflllHcuvjr, for all/ieH-^Snre^,, {q£H(div,T):q-x>T=tt} -i/2.w»={ E lll/*rlll-i/2.Brj1/2 foTsHi ц = (цт)е П Я_1/2(ЭГ)- F.36)
Section 6 Finite element type interpolation 557 The following result will be needed for our analysis of primal hybrid methods: Theorem 6.5. Suppose {^h:heJf} is a regular family of triangulations of Q. For each positive integer k, there exists a constant C, independent of h, such that if qe(Hl(Q))", l^l^k, and ц = ц(д) is determined by F.34), then \\\q-^hn\\\-U2,^h^Chl\q\ltn. F.37) To obtain this result we employ the bound F.19) of order / and the bound F.20) of order (/—1). The second family of examples, is defined for integers к, к ^ 2. One chooses for interpolation space on the simplex T the space Г)?(Г)=(Р,_1(Т)Г, F.38) where we have kept the indexing convention of Remark 6.2 instead of that of Brezzi, Douglas and Marini [1985]. For qe Jf(div; Г) the interpolant g*lk)q is defined to be the unique element of Df(T) such that m(g pk)q) = m(g) for all m e Jff\ where the set of moments J?*ik) used to define the interpolant is the union J(T = { U ^»-1,(Г))и^гра2(Т)и^_1(П \T'<T / with M,(T') defined by F.9), Jtfad(T) = ](«-*«• grad w dx J e £{{C°{T))n, R): w e P, I and Jfj-(T) = <lq^ q-wux\€&{{C°(T))n,U): т we{Pl)n,wvT = 0on 5T, anddivw = 0>. Define the operator g*(k) by F.2) and the space E%ik) by analogy with F.18) or equivalently F.18'). It can be shown that F.15) still holds with gj* replaced by Sfk) and Я?"X) replaced by 77?~2), and thus so do F.3) and F.4) if gf is also replaced by g%(kK In fact the analogues of Theorems 6.1 and 6.2 obtained by replacing gf, gf, Dk{T), and £?> by gfk\ gt{k\ Dt(T) and E?k) respectively remain valid. One can also demonstrate the following analogue of Theorem 6.3: if {$~у;ЛеЖ} is a regular family of triangulations of Q, then 11*-*?№)*11о,о<СЛ'|*|,.0 for all qe(H\Q)f with l^l^k, F.39)
558 J.E, Roberts and J.-M. Thomas Chapter II and \\div(q--^mq)\\o,n<Chl\divqkn for all qe(H\Q)f with divqeHl(Q) for 0^k-1. F'40) Remark 6.4. The space £*(A), Efk) = {qeH(div;Q)nJt(diY;^h): q = $tk)q}, F.41) is clearly a proper subspace of Ehk). Hence fewer degrees of freedom are needed to determine the interpolants in Ef(k) than in Ehk). The interpolation errors in these two approximation spaces are of the same order when measured in the L2 norm || • ||0iO; however, interpolation in E%ik) is less precise by one order than that in Ef} when measured in the H(div;0) norm unless \divq\k-ltSi=0, i.e. unless div qePk_2. Moreover one may verify that Dt(T) = {qeDk(T):divqePk_2(T)}. Remark 6.5. In fact one of the most important distinctions between the approximations in E(k) and in E%(k) lies not in the interpolations themselves but in the fact that for the dual mixed method for the elliptic problem the Ekk) interpolation for the vector variable is used in conjunction with a locally Pk_1 interpolation for the scalar variable whereas when interpolation in Ef(k) is used for the vector variable the natural domain for approximating the scalar variable is the space of locally Pk_2 functions. Thus, for example, when a piecewise constant approximation is sufficient for the scalar variable but more precision than that provided by the EJ,1' interpolant is desired for the vector variable, one has the option of approximating the vector function in EfB) while retaining the locally P0 approximation of the scalar function instead of using the full PltEh2) approximation. The gain in accuracy thus obtained, however, is, a priori, only in the (L2(Q))" norm. 7. Examples of finite element type approximation for nontriangular geometric forms Using the families of examples of finite element type approximation developed in Sections 5 and 6, we can construct approximations of order k, for each positive integer k, of functions in H\Q) and of functions in #(div; £2) in the academic setting Q, a polyhedral domain triangulated by и-simplexes. To simplify vocabulary we fix и = 2. Thus for the preceding theory Q was supposed to be an open, polygonal domain triangulated by triangles. Here we shall indicate a few generalizations related to the geometry of the triangulation: rectangular finite elements, curved finite elements such as curvilinear triangles, convex quadrilaterals.... We begin with the case of rectangular finite elements. We shall describe families of examples of approximation of functions in H1(Q) and of functions in H(div; Q) when Q is an open set of R2 "triangulated" in the form Q = (J TeSrh T by rectangles T having
Section 7 Finite element type interpolation 559 sides parallel to the axes. These families will be the rectangular analogues of the triangular families denned in the two preceding sections. We first consider the approximation of Я1 functions. For any two nonnegative integers к and I, denote by PM the space of polynomials in two variables, of degree at most к in the first variable and of degree at most I in the second variable, and by Pk,lT) the space of restrictions to the rectangle Г of the polynomials in Pkl. To a rectangle T and a positive integer к is associated the set Г* of points of the grid on T obtained by subdividing each edge of Г into к equal parts. The Lagrangian interpolant jS?f]v on T of order к of a function v e C°(T) is the unique function in Pk,k(T) which coincides with v at each point of 1^. For 3Tk a triangulation of Q by rectangles, we define the interpolation operator £C^] from H1(Q)nHz(^'h) onto Lf\ where Ljf is defined by ьР^^еНЩ-.ЧТеГьМтеРиТ)}, G.1) by requiring that SC^v agree on T with if (£>(v \ T) for each Te$~h. The error estimate for Lagrangian interpolation given by Theorem 5.2 remains valid when the family of triangulations is a regular family of triangulations by rectangles; i.e. if {!Fh: h e Ж} is a family of triangulations STh of Q by rectangles Tof diameter hT ^ h which is regular in the sense that the ratio of the width to the length of each rectangle T,Te^h,he Ж, is bounded below by a positive constant independent of h, then there exists a constant С such that |o-^f,»li.o + fcl|o-^?)»llo.o<Cfc"|oL+i.fl for all v 6 Hm+1{Q) with 1 < m < fc. G-2) The demonstration for this case may be found in Raviart and Thomas [1983, pp. 79-103], for example. For the approximation of vector functions on a rectangle T we define the space Dk(T) to be the product space Pk,k-i{T) x Pk-ltk(T). The equilibrium interpolant S(fq of order к, к ^ 1, on Tof the vector function qe Ж(й\\; Т) is the unique function in Dk(T) such that J< ?fq)-vTwuo = i" and q-vTwua for all wePk-x(T) with T<T, G.3) J< »») T ^)-rdx = fi-dx foraIlr6Plt_2|lk_1(r)xPt_lilk_2(D. G-4) For ^ a triangulation of £2 by rectangles we define the interpolation operator Sf from H(div;£2)n^T(div;5rA) onto Ehk\ where Ehk) is defined by 4fc, = {?eH(div;G):Vre,rft, *|re2VT)}, G.5) by requiring that Sfq agree on T with S%\q\T) for each Te^,. Using Green's
560 J.E, Roberts and J.-M. Thomas Chapter II formula one can show that on each rectangle T div(<f(T*V) = Л?" ^(div q) for all qe Jf (div; Г), G.6) where Я?-1' is the orthogonal L2 projection of L\T) onto Рь_1Д_1(Г). The error estimate given by Theorem 6.3 remains valid in the rectangular case: if {J~h: h e Ж} is a regular family of triangulations of Q by rectangles then there is a constant С such that \\q-£fqio,n^Chl\q\UQ G.7) for all qe{H\Q)J with 1 < J^Jfc, and '|di%-4fcV)lio,o<Cft'|div?b for all ? e (Я ^fi)J with div ? e Я'(О), 0 < / ^ k. 2..,^,. „„„™ л^, G-8) For the demonstration in the case n = 2 treated here, see Thomas [1977, Chapter III]; for the generalization to the case n = 3, see Nedelec [1980]. The analogue for rectangular elements of the second family described in Section 6 is obtained by choosing for к ^ 2 for interpolation space on the rectangle T the space DnT)HPk~AT)J+(rotx\x2) + (TotxlXk2) G.9) and for interpolation operator S%(k) on Ж (div; T) the mapping associating to qeЖ(di^v■, T) the unique function in Df(T) such that (dif(k)q)-vTwd(T= \q-vTwdc J G.10) т т for all wePk-x{T) with T' < Г, and •W^.fHY^ l/».-Hv fnr oil fc№. _m\2 f)-rdx = U-rdx for all 1-е(Р^2(Г)J. G.11) Then to a triangulation 5^ of 13 by rectangles we associate the interpolation space Efk) and interpolation operator gtfk) from tf(div; 0)пЖ (div; jTJ onto £fk) defined by analogy with the preceding example. The analogue of G.6) holds in this case as do the error bounds F.39) and F.40). For the details of the construction and the demonstration of the error bounds see Brezzi, Douglas and Marini [1985, Section 5]. The extension to the three-dimensional case is given by Brezzi, Douglas, Duran and Fortin [1987, Section 3]. Remark 7.1. In the rectangular case the difference in size of the spaces Dk(T) and
Section 7 Finite element type interpolation 561 D*(T) is more striking than in the triangular case. For T a rectangle, dim Dk(T) = 2k2 +2k, dim D$(T)=k2+к+2, and indeed, if NE is the number of elements in &~h, f4NF if k —7 dim£r-dim£r.^+*-4NE={;NEE] ц-j- However, at the level of construction of the spaces E^ and £*№), we note that the degrees of freedom which assure that the continuity condition F.17) holds, i.e. those which assure the connections between the finite elements and thus contribute to the size of the algebraic system to be solved, are exactly the same for the two methods. The situation is not the same in three dimensions. If Dk(T) and D%(T) denote the local approximation spaces on a rectangular solid Г as defined by Nedelec [1980] and by Brezzi, Douglas, Duran and Fortin [1987] respectively, we have dimDk(T) = 3k2(k + l), dim Dt(T) = %k{k2 + 3k+ 8). Given a triangulation &~h of Q by rectangular solids, the degrees of freedom needed to guarantee the required continuity in the corresponding finite-dimensional subspaces of H(div; Q), £j,k) and £*№), are the moments of the normal component with respect to Pk-i,k-i(T') (i.e. k2 moments) for elements of Ef\ Pk-i(T') (i.e. |fc(fc+l) moments) for elements of Ejf№). Here the gain in calculation time with interpolation by <f *№), the operator associated with £*"I), instead of $t\ the operator associated with £f, is clear. If NF is the number of two-dimensional faces T" of elements Г of &~h with T <= Q, then the difference in calculation time can be estimated in terms of the number C^-3*)NF={6NF' ifk = 2' Ek ^N*-j18NF> iffc = 3 Nevertheless, concerning the precision of the interpolation in three dimensions as well as in two dimensions, we note as in the simplicial case (cf. Remark 6.5) that while the order of approximation is the same for S^] and $t(k) in (L2(Q))n, the accuracy in H(div; Q) of S*(k) is of one order less than that of if]'. Remark 7.2. Another family of examples of H(div) approximation on rectangles in two dimensions and in three dimensions may be found in Brezzi, Douglas, Fortin and Marini [1987]. For rectangular elements in three dimensions as well as for prisms, other families are given in Nedelec [1986]. We cannot develop here a theory of approximation as general as that constructed
562 J.E. Roberts and J.-M. Thomas Chapter II above for finite elements of such an elementary form as a triangle or a rectangle for the case of finite elements of other geometric forms. However, we shall describe an example where the triangulation S~h of О consists of convex quadrilaterals. Thus for Te$~h there exists a bijection Fe(P1AJ such that T = F(t), with f the reference square having vertices @,0), A,0), A,1) and @,1). For each point xet, the linear tangent mapping DF(x) and its Jacobian J(x) = det(DF(x)) are affine functions of the coordinates (xl,x2). For a scalar function veH2(T) we may define a function УЦ]ь in the space {weHHTy.woFePi^T)} G.12) by interpolating the values of v at the four vertices of the quadrilateral T. Similarly for a vector function qe Ж(div; Г) we may interpolate, on each of the four edges, the average value of the normal trace to define a function e^q in the space {retf(div; T):\J\DF-1(r^F)eP2,1(f)xPia(f)}- G.13) Even though the spaces defined in G.12) and G.13) are spaces of polynomial functions only if T is a parallelogram, we note that in any case the trace of S'^v on each edge T of Г is an affine function and the normal trace oiS^^q on V is constant. Furthermore, |J|div(<f(T1)?) is constant on T and we have IJIdiv^V) l mes T divqdx. G.14) Proceeding as in each of the preceding examples, by patching together the maps £(t\ Te$~h, we define an operator &{1) on Н1{0)глН2{2Гн) having as image the finite-dimensional subspace Li,1'; and using the maps S^, Te$~h, we define the operator «fj,1* on Н{А'\\;0)г\Ж{А\\;^) having as image a finite-dimensional sub- space E(h1} of H(div; Q). Error estimates for the interpolation operators ifj,1' and S'ih1) are given by Thomas [1977, Theorems III.4.2 and III.4.4], cf. Bernardi [1986, Chapter 6] and Jensen [1979], for analogous results on curved domains.
Chapter III Abstract Theory of Approximation of Solutions of Problems with Lagrangian Multipliers 8. General remarks In this chapter we develop an abstract theory for the approximation of solutions of problems with Lagrangian multipliers. The main purpose for the construction of such a theory is to unify the analysis of the mixed and hybrid methods presented in Chapter I, and hence to provide a framework for generalizations and extensions of the analysis to that of similar problems. Even though these methods were conceived of in the context of approximation of problems of minimization of a quadratic functional with affine constraints, the properties of constrained optimization are not at all fundamental to the analysis of the problems studied here. On the contrary, more than one familiar with conforming approximations of minimization problems seems to have been misguided by this approach. In the exposition of the abstract theory given in this chapter, we will make no restrictions which allow the class of problems studied here to be interpreted as being minimization problems on an affine manifold. The expression "with Lagrangian multiplier" will be used in the same formal sense as is the expression "variational problem". Let us now give the general framework for this study. Let W and M be Hilbert spaces with inner products ((■, • ))w and ((•, -))M respectively. The norms associated with these inner products are denoted || • ||w and || * ||M. Further, let a(•, •) and b{-,') be bilinear forms defined on WxW and on Wx M respectively. Given the linear forms /(•) defined on W and g( •) defined on M, we consider the following problem: find a pair (cp, A) satisfying (<p,X)eWxM, а(<р,ф)+Ь(ф,Л) = /(ф) for all \j/<=W, (8.1) b(cp,n)=g{n) for all цеМ. We introduce the product space X=WxM which is a Hilbert space with norm 1Н1х=(Ж1^ + 1ИмI/2 for all !> = №,/*)£*■ 563
564 J.E. Roberts and J.-M. Thomas Chapter III Clearly one may also formulate (8.1) in the following manner: find u=(<p,X) satisfying ueX, k(u,v) = l(v) forall^eX, (8.2) where k(•, •) is a bilinear form onlxl given by k(u, v) = a((p, ф) + Щ, X) + b(<p, fi) (8.3) for all u = (cp,A)eX and for all v=(i//,[i)eX, and where /(•) is a linear form defined on X by l(v)=f(ilf) + g(ii) for all в = for,/^eX. (8.4)' All questions of continuity aside, it is evident that the analysis of the variational problem (8.2) cannot be carried out with the aid of the Lax-Milgram theorem since the bilinear form k{% •) is not X-elliptic: for each v = @,ц)еХ, we have k(v,v) = 0. One may show, nevertheless, the following result: Theorem 8.1. Suppose that the bilinear form k( •, •) is continuous onXxX and that it satisfies and inf sup k{u, v)>0 (8.5a) {«eX;|iu||x=l) {vsX:Mx = l} sup k{u, v)>0 for all veX with v^O. (8.5b) {ибХ:||и||х = 1} Then for each continuous linear form i(-) on X, there exists a unique solution of problem (8.2). This theorem is a special case of the theorem due to BabuSka [1971] given in the following section. Remark 8.1. Hypothesis (8.5b) is superfluous when the bilinear form &(•,•) is symmetric because, in this case, (8.5a) implies (8.5b). 9. Analysis and approximation of the solution of a variational problem with Lagrangian multipliers (theory of BabuSka) We consider the two Hilbert spaces X and У equipped with the norms || • \\x and || • ||у, and let k(-, •) be a bilinear form on X x Y. We have Theorem 9.1. Suppose that &(%•) is a continuous, bilinear form on X xY which satisfies inf sup k(u, v)>0 (9.1a) {usX:iJU||x=l} {vsY:Mr=l}
SECTION .9 Abstract theory of approximation 565 and sup k(u, v)>0 for all ye У with v¥=0. (9.1 b) {u6X:||u||x=l} Then for each continuous linear form /(•) on Y, the problem: find и satisfying ueX, k(u,v) = l(v) forallveY, (9.2) has a unique solution. Proof. Define the continuous linear operator Ж from X to Y by ((Jfu, v))Y = k(u, v) for all ueX and for all veY. Hypothesis (9.1 a) implies that the operator Ж is injective and that its image is closed in У; hypothesis (9.1b) expresses the fact that the image of Jf is dense in У. □ Remark 9.1. Conditions (9.1a) and (9.1b) together give a necessary and sufficient condition for problem (9.2) to have a unique solution given a continuous, linear form /(•). In the case X= У, the ellipticity condition, inf k(v, v) > 0, {re*:||p||x = l} obviously implies conditions (9.1a) and (9.1b); thus it is a sufficient condition for problem (9.2) when Y=X to have a unique solution. In this sense Theorem 9.1 generalizes the Lax-Milgram theorem. In the next section we shall come back to and treat in detail the "particular case" where X=Wx M and k(% •) is of the form (8.3), i.e. the case that motivates our study. First, however, we give a general result concerning the approximation of the solution of problem (9.2). Given finite-dimensional subspaces Xh and Yh of X and У respectively, we seek the "approximate solution" uheXh, an approximation of the solution u of problem (9.2), defined by u„eXh, k{uh,vh) = l(v„) for all vheYh. (9.3) Applying Theorem 9.1 in the finite-dimensional case, we obtain: Theorem 9.2. Suppose that /c( •, •) is a bilinear form on Xh x Yh which satisfies inf sup k{uh,vh)>0 (9.4a) and sup k(uh,vh)>0 for all vheYwith vh=£0. (9.4b) {uneXi,:||iihlx= 1} Then for each linear form /(■) on Yh, problem (9.3) has a unique solution.
566 J.E. Roberts and J.-M. Thomas Chapter III Remark 9.2. The space Xk being finite-dimensional, condition (9.4a) is equivalent to sup k(uh, vh)>0 for all uheXh with uh #0. {Bherh:||»h||v=l} Furthermore, when the hypotheses of Theorem 9.2 are realized, one has necessarily that dimyA=dimZfc. (9.5) Conversely, when we know, a priori, that (9.5) is satisfied (for example when Xh - Yh), it suffices to check that either one or the other of conditions (9.4a) and (9.4b) holds. Remark 9.3. It is important to note that it is not sufficient that hypotheses (9.1a) and (9.1b) be satisfied in order that the corresponding conditions (9.4a) and (9.4b) be satisfied as well. Herein lies the charm of the analysis! Remark 9.4. Even when X=Y and the bilinear form fe(% •) is X-elliptic, it can be useful to consider examples for which (9.5) holds with Xh ф Yh. Such is, for instance, the case in a Petrov-Galerkin type approach to a diffusion-convection problem, cf. for example Mizukami and Hughes [1985]. We conclude this section with an estimation of the error committed in approximating the solution и of problem (9.2) by the solution uh of problem (9.3): Theorem 9.3. Suppose that k(',')is a continuous, bilinear form on X xY and let К be a number such that sup k(u,v)^K\\u\\x forallueX. (9.6) {veY:\\v\\y = 1} Suppose that the choice of subspaces Xh and Yh is such that dimFA = dimZh (9.7) and that there exists a number к > 0 such that sup k{uh,vh)^K\\uJx foralluheXh. (9.8) {vheYh:\\vh\\Y = l} Then for each linear form l(-)on Y, if и satisfies (9.2) and uh is the solution to (9.3), we have the error bound \\u-uh\\x^(l+(K/K)) inf \\u-wh\\x. (9.9) wheXh This result is due to Babuska [1971]; (cf. also BabuSka and Aziz [1972, Theorem 6.2.1, p. 186]); a simple demonstration in the case Xh = Yh can be found in Rabier and Thomas [1985, Exercise 3.13], for example. For a given problem of the form (9.2), the constants К and к which appear in (9.6) and (9.8), respectively, obviously depend on the choice of norm on X and on Y and on the choice of the subspaces Xh and
Section 10 Abstract theory of approximation 567 Yh—nothing prevents one from making the choice of the norms as a function of the choice of the finite-dimensional subspaces. We will see some such examples later on. For the moment we shall settle for a simple criterion for convergence when the index h, indicating the choice of finite-dimensional subspace Xk as well as a real parameter, tends toward 0. The set of parameters will be denoted Ж. Theorem 9.4. Suppose that the norms \\'\\x and \\'\\Y on the spaces X and Y, respectively, are chosen independently of the parameter hejf and that condition (9.6) is satisfied. Suppose that for each hejf, we have been able to choose the finite-dimensional subspaces Xh and Yh to be of the same dimension and such that condition (9.8) is realized with a constant к > 0 independent ofh. Suppose further that there exist a subspace Ж of X dense in X and a mapping rh of' 3C into Xh such that lim||w—rh(w)\\x = G forallwsSC. /i-0 Then, the variational approximation method converges in the sense that we have \im\\u-ujx=0, /i-O where и is the solution of'(9.2) and uh the solution of (9.3). The demonstration of this theorem is identical to that showing the convergence of the approximate solutions of an elliptic variational problem, cf. for example Raviart and Thomas [1983, Theorem 3.1-3.3]. 10. Analysis and approximation of the solution of a variational problem with Lagrangian multipliers (theory of Brezzi) Equipped with the general results of the preceding section, we proceed with the analysis of a problem of the form (8.1) which we shall call a problem with Lagrangian multipliers: find a pair (q>, X) satisfying {q>,X)eWxM, а((р,ф) + Ь(ф,Х) = Дф) for all ^eW, A0.1) b(cp, p.) = g(p) for all цеМ. In particular if a pair (<p, X) satisfies A0.1) with g(')=0, then the first argument q> is a solution of the variational problem: find <p such that cpeV, а{<р,ф)=Яф) for all ^eИ, A0.2) where V is the subspace of W defined by F = {^€VF:V/i6M, Ъ(ф,ц) = 0}. A0.3) The following theorem is due to Brezzi [1974a, 1974b]:
568 J.E. Roberts and J.-M. Thomas Chapter III Theorem 10.1. Suppose that the bilinear form «(*,•) is continuous on WxW ana V-elliptic, i.e. inf а(ф,ф)>0. A0.4a) №eV-.m.w = l} Suppose that the bilinear form b(\ •) is continuous on WxM and that it satisfies inf sup Ь(ф,ц)>0. A0.4b) Then for each pair of continuous linear forms f(-)onW and g{-)on M, problem A0.1) has a unique solution. Proof. The original demonstration of this theorem is given in Brezzi [1974b]; a complete demonstration may also be found in Girault and Raviart [1979, Chapter I, Theorem 4.1] for example. We shall indicate here a variant of the demonstration of this fundamental result. Let V1- denote the subspace of W orthogonal to V; denote by ф° and ф1 the orthogonal projections of i^eWin Fand Fx, respectively. With this notation, problem A0.1) may be written: find q>L, q>°, and A such that <?xeK\ cp°eV, XeM, а((Р1,ф1) + а((р°,фх) + Ь{ф1,Л) = ЯФ1) for all ф1еУ1, a(cpL^°) + a(cp0,ф°) = Яф°) for all ф°е¥, Ь(<р1,ц) = g(fx) for all и е М. Note that we have the equality sup Ь(ф,1л)= sup Ь(ф,р) for all цеМ, and that we have the characterization of К as a subset of W: феУ iff sup W,ii) = 0. [цеМ:\Ым = 1} Using hypothesis A0.4b) one may then establish with the aid of Theorem 9.1 that the linear operator J1* from M into V1 defined by #, St*n))w = Ь{ф, ц) for all фе W, fieM, is an isomorphism of M onto V1. Thus the linear operator J1 from VL into M defined by A0.5) </, ц))м = #, 0*n))w for all фе W, цеМ is an isomorphism of V1 onto M. Finally, using hypothesis A0.4a) one may show with the aid of the Lax-Milgram theorem that the linear operator s/ from V into V defined by is an automorphism. Thus the linear system A0.5), to which is associated a matrix of
Section 10 Abstract theory of approximation 569 operators of the form / X X ®*\ \x л/ О \Я 0 0 / has a unique solution. □ Remark 10.1. Condition A0.4b) is often called the inf-sup condition; some authors refer to it as the Brezzi condition, others as the BabuSka-Brezzi condition. The current trend seems to be to use the expression LBB condition, where LBB stands for Ladyzhenskaya [1949], Babuska [1971], Brezzi [1974]. Remark 10.2. A necessary and sufficient condition for problem A0.1) to have a unique solution (<p, A)e W x M given the linear form /(•) on Wand g{ •) on M is that the inf-sup condition be satisfied and that the operator л/e J£{V; V) associated with the bilinear form a(% •) be an isomorphism. This result, though presented differently, can be found in Girault and Raviart [1979, Chapter I, Theorem 4.1]. With the goal of approximating the solution of A0.1), we take now finite-dimensional subspaces of W and of M, WhcW, MhczM, A0.6) and define the discrete problem: find (<ph, Xh) satisfying {q>h,kh)eWhy.Mh, <4фМ + Щн,**)=Ш) for all ii>heWh, A0.7) H(ph,l*h) = g(Hh) for all fJ.heMh. As in the continuous case, we introduce the subspace Vh of Wh given by Vh = {фие Wh: V^eM„, Щ„, №)=0}. A0.8) One should be careful to note that in general Vh is not a subspace of the space V defined in A0.3). Transposing to the finite-dimensional case the results of Theorem 10.1, we obtain Theorem 10.2. Suppose that the bilinear form a(',-) is Vh-elliptic A0.9a) and the bilinear form b(\-) satisfies inf sup Ь(ф„,цн)>0. A0.9b) {д„еМ„: ЦдьНм = 1} {<l>heWh-Mh\\vlr= 1}
570 J.E. Roberts and J.-M. Thomas Chapter III Then for each pair of linear forms /(•) on Wh and g(-) on Mh, problem A0.7) has a unique solution. Remark 10.3. Condition A0.9b) is called the discrete inf-sup condition or again the discrete LBB condition. It says that the choice of the subspace Wh of Ж and that of the subspace Mh of M cannot be made independently one from the other, that there is a compatibility relation between the two subspaces. If we introduce the bases (W )j = l dim wh and {j$ );=!,... ,dira Mh of the subspaces Wh and Mh respectively, the discrete inf-sup condition may be interpreted as expressing the fact that the matrix [BA] whose ij entry is [BJy = Ь{ф%], /4/') has rank equal to the dimension of Mh: rg[Bft]=dimM,,. In this form, the condition has been called the rank condition (cf. BabOska, Oden and Lee [1978]). Clearly a necessary condition for this condition to hold is that dimMh^dim Wh. Remark 10.4. Obviously if the bilinear form a(-,-) is K-elliptic and Fh <= F (as will be the case for dual mixed methods), then a(% •) is Vh -elliptic, or better yet if a(% •) is Ж-elliptic, so one no longer needs to have VhcV (as will be the case for dual hybrid and primal mixed methods), then a( •, •) is Vh-elliptic. In other cases it is necessary to verify directly the Кл -ellipticity of a(% •), something which can be rather technical (this will be the situation for primal hybrid methods). To verify the discrete inf-sup condition one can use the following result (cf. Fortin [1977]), the proof of which is immediate: Theorem 10.3. Suppose there is a Hilbert space iV aW,an operator 0tb from Mh into W satisfying b(@hfih,p,h) ~^C sup Ь(ф,цк) for all p.heMh, A0.10) for some C>0, and a continuous linear operator nh from iV into Wh such that the following diagram commutes: ЦГ -Ж -M A0.11) where ,? is the inclusion ofW into W, 2ft and 38h are the mappings from W to M and from Wh to Mh, respectively; induced by fr(vI and 8?h is the orthogonal projection of 1 The symbols Й? and 3%b have been reserved for the restrictions of Ш to V1 and of 38h to Кj, respectively.
Section 10 Abstract theory of approximation 571 M onto Mh. Then the inf-sup condition of Theorem 10.1 implies the discrete inf-sup condition of Theorem 10.2. We give an estimate of the error made in approximating the solution of problem A0.1) by that of problem A0.7). This estimate was obtained by Brezzi [1974a, 1974b]. Theorem 10.4. Suppose that a( •, •) and b{ •, •) are continuous bilinear forms onWxW and on WxM respectively. Let A and В be numbers such that aforfHAMwMw for all <peW, фе\¥, A0.12) *#,/*)<В|М1ИМ|„ foraUil,eW,peM. A0.13) Suppose further that the subspaces Wh and Mh have been chosen such that there exist positive numbers a and Pfor which we have a(vh,vh)>o>\\vh\\U2 forallvheV„, A0.14) inf sup Ь[ф„,Ць)>Р. A0.15) {рнеМкЫЫ = 1} {^еНЫЦЫиг = 1} Then there exists a constant C, depending only on A, B, a, and /?, such that for each pair of linear forms f(-)onW and g{') on M, if (<p, A) satisfies A0.1) and (q>h, Xh) satisfies A0.6), we have \\<P-<Pk\\w + U-h\\M^C\ inf \\cp-^h\\w + inf ||A-Ai»||Mj. A0.16) Proof. Let V^; be the subspace of Wh orthogonal to V. We introduce, as in the proof of Theorem 10.1, the operator ®he£e{Vi, M) and its adjoint ЩеУ{М, Vi). The discrete inf-sup condition A0.15) signifies that the operators 38h and Щ are invertible or more precisely that we have ll^1||y(Mh,^)=ll^r1||^i,M),)<V^ Similarly the condition of J^-ellipticity A0.14) implies that the operator s$he ^{Vh, Vh) associated with the bilinear form a(v) is invertible and that we have It is now easy to check that the bilinear form denned with the aid of (9.5) onlxl, where X — WxM (and Y=X), satisfies (9.6) with a constant К depending only on A and В and for the choice Xh = Whx MH (and Yh = Xh) satisfies (9.8) with a constant к > 0 which depends only on A, a and /?. An application of Theorem 9.3 then leads us to the estimation A0.16). □ Remark 10.5. This error estimate is optimal in the product space X=WxM. This does not mean that the estimation of a single term, \\q> — q>h\\w for example, is necessarily optimal. Say for instance that g( •) = 0 and Vh is a subspace of V. Then we obviously have
572 J.E. Roberts and J.-M. Thomas Chapter III \\<p-<Ph\\w^C inf \\(p-*l/h\\w This sort of refinement of the theory has been developed by Falk and Osborn [1980]. We will point out such refinements in the examples treated hereafter. Remark 10.6. We can deduce from Theorem 10.4 a convergence theorem similar to Theorem 9.4 for a family of pairs of subspaces [Wh, Mh) indexed by he Ж. On the other hand we point out that the approximation result of Theorem 10.4 is still of interest if the chosen norms or even the spaces VFand M themselves change with the parameter h as will indeed be the case for hybrid methods. The error bound A0.16) holds with constant С independent of h as long as the constants A, B, a and /J may be taken independently of h. Remark 10.7. Theorem 10.3 gives sufficient conditions for concluding that the inf-sup condition for b( •, •) on W x M is inherited by b( •, ■) on Wh x Mh. However, given a family of pairs of subspaces (Wh,Mh) indexed by кеЖ, to obtain convergence results (cf. Remark 10.6) we need to know that the discrete inf-sup condition holds uniformly in the sense that there exists /J independent of h such that for each he Ж inf sup b(il/k,tih)^j]>0. This will be the case if the norms of Rh and nh of Theorem 10.3 are independent of h. Remark 10.8. With the methods used here to analyze numerically a problem with Lagrangian multipliers of the form A0.1) whose solution is approximated by the solution of a finite-dimensional problem of the form A0.7), it is easy to find sufficient conditions for analyzing a problem of the following form: find q> and Я satisfying cpeW, leM, а{<р,ф) + Ь(\1/,Х)=Дф) for all \j/eW, A0.17) b{cp,fi) + d{?.,fi) = g(ji) for all деМ. This generalization introduces a supplementary bilinear form d( *, •) on M x M which we suppose is continuous, and we let D be a constant such that d{X,ii)<D\\X\\M\\n\\u for all AeM, цеМ. A0.18) Given finite-dimensional subspaces Wh<=W and Mh <= M, the corresponding discrete problem associated with A0.17) is written as follows: find q>h and lh satisfying а(ц>М + Щк,К) = №н) for all ijjheWh, A0.19) b(<ph,nh) + d(kh,iih) = g(ixh) for all nheMh. If in addition to the hypotheses of Theorem 10.4 we suppose that
Section 11 Abstract theory of approximation 573 а(ф„,фн)>0 for all фне\Уи, d(nh,fih)^0 for all nheMh, and that one of the two forms a( •, •) and d( ■, •) is symmetric, then problem A0.19) has a unique solution (q>h, Ал). Moreover, if {<p, A) denotes the solution of A0.17), an error bound analogous to that given by A0.16) may be obtained with a constant С depending only on A, B, D, a and /?. Another generalization of Theorem 10.4 is obtained by assuming in addition to the hypotheses of Theorem 10.4 that there is a positive number <5, 0<E<1, such that a B2 d{Xh,fih)^(l~6) r-rllAftlUlMjU for all lheMh, fi„eMh. Ja2+A2 A. The conclusion is the same as in the above generalization except that here the constant С depends also on S. The demonstration of the first generalization relies on the fact that if Я is a Hilbert space with inner product ((v))h and <^eif(ff, Я) is positive in the sense that ((<Шх,х))н^0, for all xeH, then J-\-<% is invertible in &(H,H) and ||(./ + ^)-1|1г.(Н,н)<1. То show that the second generalization is valid one uses that if Я is a Hilbert space and <%e if (Я, Я) has norm in if (Я, Я) less than 1 - в for some в in @,1], then J+ <Ш is invertible in &(H,H) and \\(S + a/l'1)\\sW,a) «S I/O- Remark 10.9. An extension of the above results of Brezzi [1974b] to the case of the following generalized saddle point problem: (cp,X)eWxM, а((р,ф) + Ь1(ф,2.) = /{ф) for all фе\¥, Ь2((р,ц) = д(ц) for all /xeM, for which the bilinear forms bi(-,-) and b2(',') on WxM are distinct, can be deduced from the more extensive generalization given in Nicolaides [1982], see also Bernardi, Canuto and Maday [1988]. 11. Effects of numerical integration Under the hypotheses of Theorem 10.2, problem A0.7), the problem with Lagran- gian multiplier, is reduced, once the bases of Wh and of Mh have been chosen, to the resolution of an invertible linear system of order JV, with iV = dim W^ + dimM,,. Several algorithms adapted to the resolution of such linear systems will be presented in Section 17. For the moment we remark only that even the formation of the linear system can turn out to be practically impossible or simply too expensive in calculation time. Thus we are led to replace problem A0.7) by the following problem:
574 J.E. Roberts and J.-M. Thomas Chapter III find (ф*,Я?) satisfying {cpt,H)eWhxMh, акЫ>*и) + ЪЖ>*-№Ж) for all i/,keWk, A1.1) bh((p*,^h) = gh(tih) for all fiheMh; where aft(v) and bh(',-) are bilinear forms on Wh x Wh and WhxMh, respectively, and fh{-) and gh(-) are linear forms on Wh and Mh, respectively. When gh(-) = 0, if (<р*,Я?) is the solution of A1.1), then the first argument (p% is a solution of the variational problem: find a solution cpf such that tfeVt, ah{q>t,*k)=№h) forall^eK?; A1.2) where the subspace Vf is defined by Vf = {^heWh:yfih6Mh,bh^h,nh)=0}. A1.3) Remark 11.1. For all of the examples that we shall consider, the physical parameters of the modelled problem actually play a role only in the expression of the bilinear form a(v)- The form £>(-,■) is independent of these parameters; the same is true, a fortiori, for the subspace V defined by A0.3). Furthermore, the spaces Wh and Mh are constructed in such a way that the exact calculation of the term b{\j/h,iih) poses no practical difficulty since it always reduces to the calculation of an integral on an affine manifold of a polynomial function. In the following we shall point out (cf. Remark 11.2) where it can be of some interest to replace the bilinear form b( •, •) by another bh( ■, •). Defining Xh to be the product space Wh x Mh and putting KD^h) = ah((ph^h) + bh(\l/h,2.h) + bh(q>h,fih), A1.4) h(»h)=fi№i.) + 0k(Hk), for uh = (<ph,lh) and vh — (\jjh,[ih), we can write, as in Section 9, problem A1.1) in the form: find u* satisfying uteXh, h(uh*,vh) = lh(vh) for all M„eXft. Then in a slightly more general setting, this leads us to study the approximation of the solution и of problem (9.3) by the solution Hjf of the following problem: find u% satisfying u%eXh, kh(ut,vh) = lh{vh) for all vheYh, A1.5) where kh{ •, •) is a bilinear form on Xh x Yh and lh( •) is a linear form on Yh, with Xh с X and Yh cz Y. The analysis is based on the following result: Theorem 11.1. Suppose that fc(v) is a bilinear form, continuous on XxY and let К be a number such that (9.6) is satisfied. We suppose that the choice of subspaces Xh
Section 11 Abstract theory of approximation 575 and Yh is such that dim Yh = dimXh, and that there exists a number кк>0 such that sup kh{uh,vh)^Kh\\uh\\x foralluheXh. A1.6) {vheYh:\\vh\\Y=l) Then there exists a constant C, depending only on К and Kh, such that if и is a solution of (9.3) and u* is the solution of A1.5), we have *м ^J :_r / м „ , ..._ KWh>»h)-kdWh,Vhj \\u-ut\\x^C< M [\\u-wh\\x+sup l^Xh \ Vheyh || Vk || r Kvh)-h(Vh)} M1_. + sup_^l—(• AL7) This is, of course, in the case kh{ •, •) = k{ •, •) and lh( •) = /(•), the result (9.9) given in Theorem 9.3. It generalizes the results of Strang [1972] developed for the elliptic variational problem. The demonstration may be obtained by simply adapting the demonstration of Strang's result given by Ciarlet [1977, pp. 186-187] for example. We return now to the discrete problem A1.1), Theorem 10.4 may be generalized to the following result: Theorem 11.2. Suppose that a( •, •) and b(-,-)are continuous bilinear forms onWxW and onWxM respectively, and let A and В be numbers such that A0.12) andA0.13) are satisfied. Let Ah be a number such that ah((Ph^k)^Ah\\q>h\\w\^h\\w for all <рнеЩ,,фкеЩ,. A1.8) Suppose further that there exist numbers ал>0 and fih>0 such that ah(vh,vh)^xh\\vh\\2w, forallvhBVt, A1.9) and inf sup bh№h,nh)>ph. A1.10) {Mh6Mb:|MhllM = l} {*f6lTh:||«(ihlln'=l} Then there exists a constant С dependent only on A, B, Ah, och and fih such that if(q>, X) satisfies A0.1) and (<р*,Я*) is the solution of A1.1) then we have \\<p-<pt\\w+U-ti\\M <C< mf \\<p-ij/h\\w+ sup — Ui.sWhV nheWh \\nh\\w + 'nf U-fiJM+ sup W*\ r,heWh II 1h II W + sup mzm+ sup sj^ziMh AL11)
576 J.E. Roberts and J.-M. Thomas Chapter III We may obtain this result by applying Theorem 11.1. With the given hypotheses and the definition of the bilinear form /cA(v) given in A1.4), condition (9.6) is satisfied with К depending only on A and В and condition A1.6) is satisfied with кн>0 depending only on Ah,ah and fih. A detailed demonstration of this type of result was given by Thomas [1977, Chapter I, Section 3]. Remark 11.2. The replacement of the bilinear form b( •, •) by the bilinear form bh( -, •) entails the replacement of the subspace Vh defined by A0.8) by the subspace V% defined by A1.3). The ellipticity hypothesis A1.9) can certainly be satisfied without A0.14)'s being satisfied, even in the case where afc(*,•) = «(',') on WhxMk. Also, the discrete inf-sup condition A1.10) may hold without A0.15)'s holding. On the contrary, the converses are valid; A0.14) implies A1.9) and A0.15) implies A1.10). However, we stress that a careless choice of numerical integration technique can turn a well-posed discrete problem of the form A0.7) into an ill-posed discrete problem of the form A1.1). One should be aware that by replacing b(v) by bh(-,') one is actually changing the numerical method, and this must be done with some care.
Chapter IV Mixed Finite Element Methods 12. Examples of primal mixed methods for a Dirichlet problem A primal mixed variational formulation of the model problem, find a solution (p, u) of the system D.4)-D.5) with the homogeneous Dirichlet boundary condition u = 0 on Г, is obtained by characterizing the pair (p, u) as the solution of (p,u)e(L2(Q))nxHh№ a(p,q) + b(q,u) = 0 for all qe(L2(Q)f, A2.1) b(p,v)=- fvdx for all veHl(Q), where and b(q,v)=- * q-grad vdx. We are here in the context of Chapter III with W=(L\Q)T, ||f || „ = ||* ко, and M = Hh(Q), \\v\\m = \v\i.q. A2.2) A2.3) A2.4) A2.5) With the given hypotheses concerning the coefficients atJ, the bilinear form a(', •) is continuous on Wx Wand VF-elliptic with constant of ellipticity o. = ad, cf. D.8). The bilinear form b( •, •) is continuous on W x M and we clearly have |o|i.o = l => sup %,u)=l. A2.6) {*e»F: ||,||o,o=l} Thus Theorem 10.1 guarantees that, for each feL2(Q), problem A2.1) has a unique solution. 577
578 J.E. Roberts and J.-M. Thomas Chapter IV In order to construct a primal mixed approximation to this solution, we introduce a finite-dimensional subspace Wh of (L2(Q)f and a finite-dimensional subspace Mh of Hq(Q). From Theorems 10.2 and 10.4 we deduce: Theorem 12.1. Suppose that the subspaces Wh and Mh are compatible in the sense that the discrete inf-sup condition is satisfied: A = inf sup b(qh,vh)>0. A2.7) Then the problem of finding a pair (ph, uh) satisfying (ph,uh)eWhxMh, aiPn,9h) + b(qk,uh) = 0 for all qheWh, A2.8) b(j>k> Vh)=-\К Ax for al1 vh£Mh, Q has a unique solution. Furthermore, for each family of pairs of subspaces (Wh,Mh) satisfying A2.7) uniformly in the sense that /? = inf ph>0, there exists a constant C, independent ofh, such that \\p-Ph\\o.a + \u~uh\hQ^c\ inf \\p~qh\\o,n+ inf \u-vh\UB>. A2.9) [qhsWh vneMh J Given a triangulation &~h by и-simplexes Г of a domain Q, supposed to be polyhedral, and a positive integer k, we take finite-dimensional subspaces Wh = Wf] and Mh = Mlb) given as follows: W; = {9e(L2@))":VTe^,9|re(P,„1(T)r}, A2.10) and Mh = {veHl0{Q);4T€rh,v\TePk{T)}. A2.11) (With the notation E.10) introduced in Chapter II we have Mh = H}>{Q)r\Lf ) Such a choice of subspaces is proposed in Reddy and Oden [1975]; cf. also BabuSka, Oden and Lee [1977, Section 4]. Theorem 12.2. For each positive integer k, the subspaces Wh and Mh defined by A2.10) and A2.11) satisfy the discrete inf-sup condition A2.7). Moreover, if {3~h:he^} is a regular family of triangulations, there is a constant C, independent ofh, such that if the solution (p, u) o/A2.1) belongs to (Hk(Q)f x Hk+1(Q) we have the error estimate ll/»-/»*llo,o+l«-MJk|lr0<Ch*(||»|t,n+|MU + ii0). A2.12) In virtue of Theorem 12.1, the demonstration of Theorem 12.2 consists essentially of establishing the existence of a constant /?>0 such that for each function vhsMh,vh=^0, there exists a function qheWh,qh¥:0, such that b(qh,vh)>P\\qh\\0,a\vh\lta.
Section 12 Mixed finite element methods 579 With the choice of subspaces given in A2.10) and A2.11), we clearly have that for each vkeMh, grad vh belongs to Wh. The compatibility condition is thus satisfied with /?=1. The proof is now terminated with an application of classical results giving error bounds for interpolation in (L2(Q)f and in Hq(Q). (As Lhk)u belongs to Mh, we may use Theorem 5.2.) Remark 12.1. When the coefficients ay are sufficiently regular, for example when atjeCk, the regularity hypothesis ueHk+1(Q) implies the regularity pe(Hk(Q))n. Remark 12.2. Even with no regularity hypothesis for the solution (p, u) one can show that the primal mixed method applied in the context of Theorem 12.2 is convergent: \\mph=p in (L2(Q))", \im uh = u in НЦО); h-0 f.^0 cf. Remark 10.6. Remark 12.3. With Wh=Whk) as given in A2.10) and with Mk = Mhk'] as given in A2.11) but with k' not necessarily equal to k, the compatibility hypothesis is satisfied once k' < k. Error estimates optimal in W x M are obtained when к' = к. Remark 12.4. Let T be the reference w-simplex and for each Те ^~k, let FT denote an affine bijection of Г onto Г with Jacobian JT. The spaces Wh and Mh given by A2.10) and A2.11) may also be denned as follows: Wh = {qe{L\Q)T-.4Terh,JTT>FT\q\T°FT)e{Pk_x{T)y}, A2.10') Mh = {veHh(Qy.VTeFh,v\T°FTePk(T)}. A2.11') It is with these definitions that the theory is extended to the case of curved finite elements where T is the image of f under a bijection not required to be affine. However, we must note that for vheMh, grad vh need no longer belong to Wh. To show the compatibility relation in this case we introduce for vheMh, the function qheWh defined on each Те3~к by (qh\ т) о FT=A/ JT)(DF T)(DFT)r(grai v„ | г) о F T and apply the relations F.23). These primal mixed methods may be easily adapted to the case in which Th is a triangulation by rectangles having sides parallel to the axes for и=2, by rectangular solids having faces parallel to the coordinate planes for n = 3. For a regular family of such triangulations one may show: Theorem 12.3. (Stated here only for the case и = 2, Га rectangle.) For each positive integer k, with Wh = Whk) and Mh = Mhk) defined as follows: Wh^{qe{L2{Q)J^Te3Th,q\TePk^,k{T)xPk,k^(T)} A2.13)
580 J.E. Roberts and J.-M. Thomas Chapter IV and Mfc = {pe^S(fi):VTe^i,»|rePM(r)}, A2.14) the inf-sup condition, condition A2.7), is uniformly satisfied. Moreover, if the solution (p, u) of A2.1) belongs to (Hk(Q))K x Hk+l(Q) and (ph, uh) is the solution of A2.8) for Wh and Mh as in A2.13) and A2.14), there exists a constant C, independent ofh, such that \\p-Ph\\o.Q + \u-uh\ua<Chk{\p\kta + \u\k+i,a)- A2.15) 13. Examples of dual mixed methods for a Dirichlet problem A dual mixed variational formulation of the model problem, find a solution (p, u) of system D.4)-D.5) with the homogeneous Dirichlet boundary condition u = 0 on Г, is given by characterizing the pair (p, u) as the solution of {p,u)EH(div;Q)xL2{Q), a{p,q) + b{q,u) = 0 for all qeH(div;Q), A3.1) b(p,v)=- fvdx for all veL2(Q), where 4P,9)= I UyPjfcdx A3-2) a and b{q,v)= pdiv^dx. A3.3) We are again in the context of Chapter III with now W=H(diT,Q), \\q\\w = ||?||fl(<U¥i0„ A3.4) and M = L\Q), NIM = INo,0- A3-5) The subspace V of functions qe Wiov which b(q, v) = 0 for all ve M is none other than V={qeH(di\;Q):divq=0}. A3.6) With the given hypotheses concerning the coefficients aip the bilinear form a{ •, ■) is continuous on Wx Wand F-elliptic with constant of ellipticity ot = o.d, cf. D.8). The bilinear form b(% •) is continuous on Wx M and the inf-sup condition is satisfied: inf sup b{q,v)>0. A3.7) {veM:\\vh.S}= 1} {»e»M»|lH(.Hv;n)= 1}
Section 13 Mixed finite element methods 581 To establish A3.7) we shall use the auxiliary problem. For each veL2(Q) denote by cpv the unique solution in H},(Q), of — Acpv = v on Q. Then qv =grad q>v belongs to H(di\;Q) and b(qv,v)=\\v\\o,a- Since inequality A3.7) follows. Thus we can apply Theorem 10.1 to obtain the existence and uniqueness of the solution {p,u) of A3.1). In order to construct a dual mixed approximation of the solution {p,u) of A3.1), we introduce a finite-dimensional subspace Wh of H(div; Q) and a finite-dimensional subspace Mh of L2(Q). From Theorems 10.2 and 10.4 we deduce: Theorem 13.1. Suppose that the subspaces Wh and Mh are compatible in the sense that the discrete inf-sup condition is satisfied: Рн= inf sup b(qh,vh)>0. A3.8) Then the problem of finding a pair (ph,uh) satisfying a(ph,qh) + b(qh,uh) = 0 for all qheWh, A3.9) b(ph, vh) = - fvh dx for all vheMh, has a unique solution. Furthermore, for each family of pairs of subspaces (Wh, Mh) such that for each qheWh we have vhdi\qhdx = 0 for all vheMh => divqh=0 A3.10) and such that A3.8) is satisfied uniformly in the sense that /?=inf /?h > 0, there exists a constant C, independent of h, such that \\p-Ph\\H(div;(})+\\u-Uh\\o,n ^C< inf \\p-qh\\Hidiv,a)+ inf ||u-iy0.o>- A3.11) («heWi, vheMh ) Hypothesis A3.10) expresses the fact that the space Vh, Vh = {qheWh: VvheMh, b(qh, vh)=0}, A3.12) is a subspace of the space V defined by A3.6). This hypothesis assures the uniform ellipticity of a(% •) on Vh, where the norm on Vh is the #(div; Q) norm. Given a triangulation 3Th by n-simplexes Г of a domain Q, supposed to be polyhedral, and a positive integer к, we take finite-dimensional subspaces Wh = W™ and Mh = M*' given as follows:
582 J.E. Roberts and J.-M. Thomas Chapter IV Wh = {qeH(div;Qy.VTerh,q\TeDk(T)}, A3.13) and M* = {t>eL2(fi):Vre.rfc,i>|re/»4_1(:r)}, A3.14) where Dk(T) denotes the space of restrictions to Г of the functions in the space Dk introduced in F.5). (With the notation F.17) introduced in Chapter II we have Wh-Ef)) These examples were proposed and analyzed in Raviart and Thomas [1977] in the two-dimensional case and in Nedelec [1980] in the three-dimensional case. In the literature the space Wh x Mh or W{k) x Mf is referred to as the Raviart-Thomas space of index к (or sometimes к—I; cf. Remark 6.2) or, in the three-dimensional case, as the Raviart-Thomas-Nedelec space. Theorem 13.2. For each positive integer k, the subspaces Wh and Mh defined by A3.13) and A3.14) satisfy the discrete inf-sup condition A3.8). Moreover, if {&~ъ.НеЖ} is a regular family of triangulations, there is a constant С independent ofh such that if the solution (p,u) o/A3.1) is such that (p,u)e(Hk(Q)f xHk(Q) and divpsHk(Q) and if {ph,uh) is the solution о/A3.9) associated with A3.13) and A3.14), then we have the error estimate " В~~Рь\\щй\ч;П) + \\U~Uh\\o,Q *S Chk(\u\k,Q + \p\KQ + \divp\kM). A3.15) Proof. With Wh and Mh as given in A3.13) and A3.14) we clearly have that div qheMh for each qheWh and thus that Vha V. To establish the discrete inf-sup condition we apply Theorem 10.3. Take for subspace of W—H(di\;Q) the space W = (H1(Q))" and for continuous linear operator nh from if into Wh, the operator nh =gf\ given by F.2) together with F.14). Then F.15) guarantees that A0.11) is satisfied. Let R be any continuous linear map from L2(fi) into (Я1 (£*))" satisfying div(Rv) = v, A3.16) and let Rh from Mh into тГ = (Я1№))" be defined by Rhvh =Rvh for all vheMh. We note that with sufficient regularity the function 0„eH(div; Q) used to establish the inf-sup condition A3.7) belongs to (Я1^))" and Rv could be taken to be qv. To avoid having to invoke regularity results for Dirichlet problems in polyhedral domains which would necessitate certain restrictions on the geometry of Q, if suffices to extend the functions veL2(Q) by zero to functions veL2((9) where (9 is a ball containing Q, to solve the Dirichlet problem Л cpv = v in &, (pv[6e = 0, and to define Rv to be the restriction to Q of grad (pv. To verify A0.10) we note there is a constant Ce depending only on the ball G> containing Q, such that \\Rhvh II 1,я^О, IK ||o,fi. Thus
Section 13 Mixed finite element methods 583 b(R„vh,vh) \\vh\\in ^ 1 "Id ii — ?d ii =* T^H^IIo.c- II ^ЛII-г \\Khvh\\i,o <-© Since sup W,t)fc)<||div^||o,nl|fj,llo,o<l|fJlo,Q, we have that A0.10) holds with constant C = l/C№ and A3.8) now follows from Theorem 10.3. Next suppose that {3~h; heJf} is a regular family of triangulations. We have just seen that Rh may be defined with norm independent of h, and Theorem 6.3 guarantees that for the operator nh = Sf] there is a constant С independent of h such that II49b(div; o)< C{%| 1>я + ||div tf ||0>o}. Thus the discrete inf-sup condition is satisfied uniformly and we have the error bound A3.11). Then to estimate the error of approximation infghBWh\\ p — qh ||H(div; a), we take qh =£f{p) and use F.19) and F.20). □ Remark 13.1. When the coefficients atj are sufficiently regular, for example when aye Ck, the regularity hypothesis/>e(Hk(£2))" and div/j e H(k\Q) is enough to give the error bound A3.15). If we suppose only that pe(Hk(Q))n we may easily adapt the last part of the demonstration of Theorem 13.2 to obtain ll/'-/>jH(div;ft) + !|u-Mjo,fl^Cfct-1||/7||w3, or with a more careful study we obtain \\p-Pb\\o,n<Chk\\p\\Kn. A3.17) If in addition we suppose the regularity of the Dirichlet problem in Q, a hypothesis which is satisfied once the open polyhedral domain Q is convex, we have II"-"* lien ^С^ЦцЦ^д with fc*=max(fc,2). A3.18) These results are demonstrated in Falk and Osborn [1980, Section 3]. Remark 13.2. Even with no regularity hypothesis for the solution (p,u) one may show that the dual mixed method applied in the context of Theorem 13.2 is convergent: lim ph=p in#(div;ff), limuh=u in L2(Q). Ji-»0 /i->0 Remark 13.3. With Wh = Wf] as given in A3.13) and with Mh=M^') as given in A3.4) but with k' not necessarily equal to k, the compatibility hypothesis is satisfied once k' < k. Error estimates, optimal in W x M, are obtained when k' = k. If 1 ^ к' < к the subspace Vh is no longer a subspace of V; however, the method remains convergent.
584 J.E. Roberts and J.-M, Thomas Chapter IV Remark 13.4. Let t be the reference «-simplex and for each Te^~h, let FT denote an affine bijection of T onto Г with Jacobian JT. The spaces Wh and Mh given by A3.13) and A3.14) may also be defined as follows: Wh = {?eH(div; fi): VTe^,, \JT\T>Fj\q\T°FT)eDk(t)}, A3-13') Mh = {veL2(Q):\/Te3rh,v\ToFTePk^1(f)}. A3.14') It is with these definitions that the theory is extended to the case of curved finite elements where Tis the image of t under a C1 bijection not required to be affine. To show the compatibility relation in this case, we construct for vheMh, the function qheWh such that div q„=9hvh, where 0леЬ°°(Й) is defined on each Te3~h by yeT rrWi for each xeT. Remark 13.5. In the context of Theorem 13.2 it is obvious that for qhe Wh we have div qheMh. Moreover, the proof of Theorem 13.2 shows that for each vkeMh there exists a function qh = nkRhvhe Wh such that div qh=vh. Thus in this case we have the characterization Mh = {vheL2(Q):3qheWh with div qh=vh}. A3.19) Such is not the case for the generalizations described in Remarks 13.3 and 13.4. For the dual mixed method analyzed above, given a triangulation 3~h of Q by n-simplexes, the vector functions in W=H(div; Q) are approximated in the space Wh defined by A3.13) which is the space E^ defined by F.17) in Chapter II. An alternative method is obtained by instead approximating the vector functions by functions in the space E*(k) also defined in Chapter II. For each integer k,k^2, take Wh = Wf and Mh = Mf as follows: Wh = {qeH(div;Q):4Te2Th, q\TeD*k{T)}, A3.20) M„ = {veL2(Q): VTeP„, v\TePk„2(T)}, A3.21) where D*(T) = (Pt_1(T))"; cf. G.1). These approximation spaces were introduced and the corresponding methods analyzed in Brezzi, Douglas and Marini [1985] in dimension n = 2 and in Brezzi, Douglas, Duran and Fortin [1987] in dimension n = 3. In the literature the space Wh x Mh or W^ x Mj,*' is referred to as the Brezzi-Douglas-Marini space of index к (or sometimes k— 1, cf. Remark 6.2), or in the three-dimensional case the Brezzi-Douglas-Duran-Fortin space.
Section 14 Mixed finite element methods 585 As for the Raviart-Thomas(-Nedelec) spaces it is clear that ifpheWh, then divpheMh, and one may show that the discrete inf-sup condition A3.8) is satisfied. In fact we again have the characterization A3.19). For a regular Dirichlet problem in Q we have the error estimates II i» —#»* llo.o ^ СЛ* || /»||*.о, 11«-и*11о,о^СЛ*_1||и|1*-1,я, lldiv^-^lko^C/^MldivpL-.^. For each of the dual mixed methods presented above for the case where STh is a triangulation by n-simplexes, there is a variant adapted to the case where 9~h is a triangulation by rectangles with sides parallel to the axes when n = 2 or by rectangular solids with sides parallel to the coordinate planes when n = 3. The description of these methods and the corresponding analyses may be found in the references cited above for the associated method for a triangulation by simplexes. Here we shall simply state for the case n = 2 the analogue of Theorem 13.2 when 3~h is a triangulation of Q by rectangles with sides parallel to the axes. Theorem 13.3. For each positive integer k, the subspaces Wh = Wf] and Mh =MJ,t) defined by Wh = {qeH(div;Qy.4TeFh,q\TeDP(T)}, A3.22) and Mh = {v6L2(Q):^Te^h,v\TePk-Uk^(T)}, A3.23)' with DP =Pk,k-i х-Рк-1,ь satisfy the discrete inf-sup condition A3.8). Moreover, if {^b'.heJf} is a regular family of triangulations, there is a constant C, independent of h, such that if the solution (p, u) of A3.1) is such that (p, u)e(Hk(Q))" x Hk(Q) and div peHk(Q) and if (ph,uh) is the solution of A3.9) associated with A3.22) A3.23), we have the error estimate \\P-Ph II H(div; a) + II« - ЩII 0,0 «S Chk (|«|tifl +1 p\kiB + |di v p\KCi). Other dual mixed methods are presented and analyzed in Brezzi, Douglas, Fortin and Marini [1987] and in Nedelec [1986]; cf. Remark 7.2. 14. Examples of mixed methods for a Neumann problem We consider the model problem, given feL2(Q) with Jfi/dx = 0, find и (to within a constant) and p satisfying D.4) and D.5) with the homogeneous boundary condition p • v — 0 on Г. A primal mixed variational formulation of this problem is obtained by
586 J.E. Roberts and J.-M. Thomas Chapter IV characterizing the pair (p, u) as the solution of {p,u)e(L2(Q))"xH1{Q)/U, a(p,q) + b(q,u) = 0 for all qe{L2(Q))", A4.1) b(p, v)=-\ fvAx for all veHl{Q)IU, where the bilinear forms a(-, •) and b{% •) are given by A2.2) and A2.3). Recalling the notation for the abstract problem of Section 10, here we put W=(L2(Q))\ lk«ir = lkllo.o, A4.2) and M = H1@)/R, N|M = |i>|i.0. A4.3) With Wh and Mh denoting finite-dimensional subspaces of (L2(Q)f and H1(Q)/U, respectively, we obtain the analogue of Theorem 12.1, with no modification of the statement, and also the analogue of Theorem 12.2 if we specify the subspaces Wh and Mh as follows: Wh = {qe(L2(Q)r.\fTe^h,q\THPk-i(T)T} A4.4) and Mh = {veHx(Q)IU:4TeFh,v\TePk{T)}, A4.5) for some k^l. In the case of a triangulation by rectangles, with the appropriate modification of the subspaces Wh and Mh, the analogue of Theorem 12.3 is obtained. Thus for primal mixed methods there is no difficulty in adapting the theory for the Dirichlet problem to obtain that for the Neumann problem. The construction of examples may even be easier for the Neumann problem as the functions of Mh need no longer satisfy, a priori, the zero trace condition on Г. The situation for dual mixed methods is quite the opposite. A dual mixed variational formulation of the homogeneous Neumann problem is obtained by characterizing the pair (p, u) as the solution of (p,u)eH0(div;Q)xL2{Q)/U, a(p, q) + b(q, u) = 0 for all ?eH0(div; Q), A4.6) b(p,v)=- fvdx for all veL2{Q)/U, where the bilinear forms a(% ■) and b(-,-) are given by A3.2) and A3.3). For convenience, L2(Q)/U will be identified with the subspace of functions veL2(Q) such that \0vdx = 0. Here the spaces Wand M and their norms are defined by W=H0(d\v;Q), ||f||„ = |l«llH<div.e>, A4-7)
Section 14 Mixed finite element methods 587 and M = <veL2(Q): udx = 0 j I0.fi- A4.8) The inf-sup condition analogous to A3.7) is also established with the aid of the dual problem. For each veM denote by <pv the unique element of H^QyU satisfying Acpv = v in Q and dcpv/dv-0 on Г. The function qv = gmd q>v belongs to W and satisfies div qv = v in Q. With Wh and Mh denoting finite-dimensional subspaces of #0(div; Q) and M respectively, a demonstration similar to that given for Theorem 3.1 shows that if the discrete inf-sup condition is satisfied, there exists a unique solution of the following problem: find (ph, uh) satisfying (ph,uh)eWhxMh, a(Ph,4h)+b{qh,uh)=0 for all qheWh, A4.9) 4ph,vh)=- fvhdx for all vheMh. To show that the discrete inf-sup condition holds for a pair of subspaces Wh and Mh, however, is a more delicate matter here than for the Dirichlet problem. It suffices, of course, to be able to associate to each vheMh a qhe Wh with div qh = vh, but this is made more complicated here by the requirement that qhe WkcW have a vanishing normal trace on Г. Nonetheless, one may show that the discrete inf-sup condition is satisfied when, for a triangulation 3Th by n-simplexes, the subspaces Wh and Mh are defined as follows: Wh = {qeH0(div; Q): \/Te<Th, q\TsDk(T)} and M„=<veL2(Q): i>dx=0 and VTe^i.^T-ePi-nT) A4.10) A4.11) for some positive integer k. We give the demonstration for the case k = 1: Let vheMh; thus vh is constant on each n-simplex Г and has zero average over Q. Denote by Фк the space of functions <ph£Hl{2Th) with \nq>hdx=Q, with <ph\T, for each Те $~h, an affine function, and with <ph continuous across the interior faces at the barycenters of the faces T" common to two и-simplexes of 3~h. For each vheMh, introduce the nonconforming solution (ph(vh) of the dual Neumann problem: find cph satisfying %^*, I Tefh grad^)A-grad^dx = A4.12) vh\jihdx for all фьеФк.
588 J.E. Roberts and J.-M. Thomas Chapter IV Let qh = qh(vh) be the function of (L2(Q))n whose restriction to T, for each Те &~h, coincides with the function of D^T) having normal trace on dT equal to that of tph\T- One can verify that qh belongs to Wh and that divqh = vh. One may further establish that, for a uniformly regular family of triangulations {J~h:heJf}, there exists a constant C, independent of h, such that Шоя^С] £ l^lf.rj1'2, A4.13) a result which amounts to a generalization of the Friedrichs-Poincare inequalities to the nonconforming case. One thus obtains the existence of a (new) constant C, independent of h, such that for the preceding construction we have llfcMUdivifljsSCKIIo.n, A4.14) thereby showing that the discrete inf-sup condition is satisfied uniformly. Hence we may deduce that there is a constant C, independent of h, such that \\P~Ph Ия(<Цу;Я) + II U~~uh llo.fi To verify the discrete inf-sup condition we have avoided using the regularity of the Neumann problem on polyhedral domains. Thus the above argument may be extended to the case of a problem with a Dirichlet condition on part of the boundary and a Neumann condition on the remaining part. For this type of problem, an extension of the above result to the case k>l may be found in Thomas [1977, Chapter V, Section 4]. The uniform regularity hypothesis for the family of triangulations can be removed using the regularity of the solution of the Neumann problem for the Laplace equation in a polyhedral domain; cf. Thomas [1989]. 15. Examples of mixed finite element methods for a Robin's boundary condition In this section we discuss the adaptation of mixed formulations to the following model problem: for a given function fe L2(Q) and function g e L2{T), find a solution (p, u) of system D.4)-D.5) with the Robin's boundary condition p-v + tu=g onf, A5.1) where t is a continuous, strictly positive function on Г. (Thus t is bounded above and away from 0.) A primal mixed variational formulation of this problem is obtained by characterizing the pair (p, u) as the solution of {p,u)e(L2(Q))KxH\Q\ a(p, q) + b(q, и) = 0 for all q 6 (L2{Q))\ A5.2) b(p, v) + d(u, v)= — fvdx- gvda for all veH\Q);
Section 15 Mixed finite element methods 589 where the bilinear forms «(•,•) and b(-,-) are defined as for the Dirichlet and Neumann problems by A2.2) and A2.3) and where the bilinear form d( •, •) is defined оаНЩхНЩЪу d(u, v) = tuv da. A5.3) Problem A5.2) is the first example we have encountered of the generalization A0.17) developed in the abstract setting in Remark 10.8 with here W=(L2(Q))n and M = H1(Q). A5.4) A5.5) The numerical analysis of a primal mixed finite element method derived from formulation A5.2) is not difficult when to a triangulation 3~h of Q by n-simplexes T are associated the finite-dimensional subspaces »i = {?6(L2(fl))":Vre^;>«|re(Pik_1(r)r} and Mh = {veH\Q):4Te?Th,v\TePk{T)}, A5.6) A5.7) where к is a positive integer. We remark nonetheless that in this situation the inf-sup condition can never be satisfied as the functions constant belong to Mh. With the dual mixed variational formulation, the solution (p, u) of system D.4)- D.5) with boundary condition A5.1) is characterized as the solution of {P,u)eJe(div;Q)xL2(Q), a{p,q) + b(q,u) = b(p,v)=- \fvdx (l/t)gq • v da for all q e Jf (div; Q), for all veL2(Q), A5.8) where the bilinear form b{ •, •) is defined as for the Dirichlet and Neumann problems by A3.3) and where the bilinear form a(v) is now given by a{p,q)= S M=l AijpJqidx + (l/t)p-vq-vda. We thus obtain a problem of form A0.1) with now W=Jf(div;Q) A5.9) A5.10)
590 J.E. Roberts and J.-M, Thomas Chapter IV and М = Ь2(Й). A5.11) There exists a continuous linear operator R from L\Q) into Ж(div; £2) satisfying div(Ru)=v for each v e L2{Q). (One may use, for example, the operator R from L2(Q) into {H1(Q)f constructed in the proof of Theorem 13.2.) Hence the inf-sup condition is satisfied, the norms for the spaces Wand M being given by || • || w■ = || • || #(й!у;я) and || • ||M= II-llo.fi- For a triangulation £Th of Q by n-simplexes Г, take for finite-dimensional sub- spaces Wh and Mh W„ = {qeM?(div;Q):4Terh,q\TeDk(T)} A5.12) and M„ = {veL2(a):iTerk,v\TePk-AT)}, A5.13) where к is a positive integer. One can show with no new difficulties beyond those already treated for the problem with Dirichlet boundary condition that the problem, find a pair (ph,uh) satisfying (ph,uh)eWhxMh, ' (l/t)gqh-vda for al\ qheWh, A5.14) for all vheMh, admits a unique solution (ph,uh). If {Уь:1геЖ} is a regular family of triangula- tions, the discrete inf-sup condition is satisfied uniformly and if further the solution (p, u) is sufficiently regular, then we have ll/'-/»J|Jr(div;O)+ll«-«Jl0.O = O(fc*). A5.15) 16. Other examples of mixed finite element methods All of the examples presented above treat only elliptic problems governed by second-order partial differential equations of the form The generalization of these methods to problems governed by an equation of the form a{p^9h) + b(q biph,vh)=- *."*) = r fvhdx
Section 17 Mixed finite element methods 591 poses no difficulty for the primal mixed formulation. The case is quite different for the dual mixed formulation. A mathematical analysis of this problem can be found in Douglas and Roberts [1982, 1985]. A generalization of these methods to the quasi-linear problem where the coefficients in the governing equation depend on и is analyzed by Milner [1985]. Several methods adapted to treat numerically the case where the convection term is large in comparison with the diffusion term have been studied by Jaffre [1984], Jaffre and Roberts [1985], Joly [1982], and Thomas [1987]. Another generalization of mixed methods to problems governed by an equation of the form _ v ±f <0L\-f with coefficients au which can degenerate has been studied by Le Roux [1982]. Estimations of the error made by a mixed method approximation have been given in norms other than the L2 norm and the #(div) norm. Estimates for u — uh in the L°° norm have been given by Scholz [1977, 1983], and then by Douglas and Roberts [1985], Gastaldi and Nochetto [1987] and Kwon and Milner [1988]. Negative norm estimates in H~S(Q), for s a nonnegative integer, are given for u — uh,p—ph, and div(/>— ph) by Douglas and Roberts [1985]. These estimates have been exploited by Douglas and Milner [1985] to obtain superconvergence results. Mixed methods have been "generalized" to fourth-order problems such as those governed by an equation of the form A2u=f, where we write "generalized" with quotation marks to indicate that these methods were introduced, for the most part, before those presented for the second-order problem and without the use of the general abstract theory for problems with Lagrangian multipliers. We cite in particular the work of Balasundaram and Bhattacharyya [1984, 1986]; Bramble and Falk [1983]; Brezzi, Marini, Quarteroni and Raviart [1980]; Brezzi and Raviart [1978]; Ciarlet and Raviart [1974]; Hellan [1967]; Herrmann [1967, 1983]; Johnson [1973]; Mercier [1974]; Miyoshi [1973]; Monk [1987]; Quarteroni [1980a]; Scapolla [1980]; and Scholz [1978]. A precise analysis of the best known of these methods can be found in Falk and Osborn [1980]. An excellent exposition is also given in Ciarlet [1978, Chapter 7]. A generalization to problems governed by an equation of the form ( — A)mu=f for an integer m, m^2 is proposed by Bramble and Falk [1985]. 17. Solution of the linear system We conclude this chapter with some remarks concerning the solution of the algebraic system resulting from a mixed method or, in particular, from a dual mixed method. We shall restrict our attention to the case of the model problem C.1)—C.4) as in Sections 12-14. We assume that the continuous problem is written as a problem with Lagrangian multipliers in the form (8.1), and that finite-dimensional subspaces Wh and Mh of the spaces Wand M where the solution is sought have been
592 J.E. Roberts and J.-M. Thomas Chapter IV introduced in such a way that A0.9a) and A0.9b) of Theorem 10.2 are satisfied. We thus suppose that the finite-dimensional problem is of the form A0.7): find (фи, Xh) satisfying <pheWh, lheM„, <*#н,ф„) + Щ,„К) = {Ш foraD^eWi, A7.1) Ь{<рк,ци)={д, ци) for all fiheMh; and we further suppose that a(v) is symmetric. Once bases for Wh and Mh have been chosen, we may write the algebraic problem associated with A7.1) as 'А B*\(cph\ (fh В 0 Д1» ' * - '' A7-2) where A is a symmetric, dim Wh x dim Wh matrix such that (Афк, ф'ь) = а(фи, ф'н) for all (фн, ф'н) in WhxWh, and В is a dim Mh x dim Wh matrix such that {Вфь, nh) = КФьг У-ь) f°r all 0K> Мл) in ^H x Mh. The functions fh and gh are the orthogonal projections of / onto Wh and g onto Mh, respectively. Theorem 10.2 guarantees that the matrix L, r (A B*\ Нв о > ™ of order NxN, iV = dim H^xdimM,,, is invertible; however, it is not positive- definite. Hence direct solution of the system A7.2) is generally not feasible. On the other hand A is here positive-definite. So, theoretically it is always possible to eliminate <ph from the system, q>h^A-\fh-B*Xh), A7.4) to obtain BA~'B*lh=-gh + BA-lfh. A7.5) However, there is no reason, a priori, why A "l should be sparse. For the primal mixed method, the space W is (L2(Q)f. So, among the degrees of freedom determining an element of Wh there is none needed to enforce a continuity requirement. Thus A (and hence A'1) is block diagonal, and the structure of the linear system A7.5) is the same as that associated with the conforming problem B.21). For the dual mixed method the situation is different. Thus while we cannot give here an exhaustive treatment of the solution of linear systems of the form A7.2), we would like to mention briefly some techniques that have been used when the system A7.2) results from a dual mixed method. Remark 17.1. The algebraic system associated with the primal or dual hybrid methods discussed in the following chapter can, under restrictions similar to those indicated above, take the form A7.2). However, as with the primal mixed method, the form of A is such that the linear system that must be solved is the same as that for a more standard problem, again as for a conforming problem for the dual hybrid
Section 17 Mixed finite element methods 593 method, as for a nonconforming problem for the primal hybrid method. Thus no section analogous to this one will be given in Chapter V. 17.1. Conjugate gradient method Even for the dual mixed method where A~l is full, the system A7.5) can be solved using conjugate gradient iteration as A~l never actually has to be computed. Whenever А~1ф is needed one can solve the system А\)/' = ф using, for example, Gaussian elimination as A itself is sparse. However, conjugate gradient iteration without preconditioning is known to converge slowly, and in general there seems to be no obvious preconditioner, though in Brown [1982] and in Ewing and Wheeler [1983] several preconditioners are proposed and tested for the case of a triangula- tion by rectangles. Moreover, conjugate gradient iteration is known to be especially sensitive to round-off error. Recently Bramble and Pasciak [1988] have proposed a new preconditioning technique. It seems appropriate to mention that there has been much research concerning the use of the conjugate gradient method and its variants to solve matrix equations involving matrices that are not necessarily symmetric or positive-definite. Thus one may consider applying such a method to solve equation A7.2) even though L is not positive-definite. In Joly [1984] several variants of the conjugate gradient method are described and their performance in solving equations of the form A7.2) coming from the implementation of a dual mixed method are compared. As pointed out in Joly [1984], for such systems the role of the preconditioner is no longer simply to speed up the convergence but in fact to guarantee the convergence. 17.2. Penalty method This method consists of replacing A7.1) by a more regular problem: find ((ph,e, AA>e) satisfying (ph,teWh, AMeMh, я(<Рм. M + *#/A.) = (/ iM for all i/jheWh, A7.6) b(<Pk,s,Hh)-s(K,c, l*h) = (g, Vh) for all iiheMh. The corresponding algebraic problem then becomes (j 'X£)-e> and we can eliminate XKe, h.MW№l>K*-Qk\ A7.8) to obtain (A + A/е)Д* Д)рм =/* + (l/e)B*gh. A7.9) For our model problem the matrix (A + (l/s)B*B) is symmetric, positive-definite,
594 J.E. Roberts and J.-M. Thomas Chapter IV and sparse, hence A7.9) can be solved by known techniques. In Bercovier [1978, Theorem 3.1] such penalized problems are studied and it is shown that, under the more general hypotheses that A0.9a) A0.9b) of Theorem 10.2 and inf \\а(ф„,ф11)\+ sup Ь(фн,^)\>0 A7.10) hold, there is a unique solution (фй>£, Ам) of A7.6), and if (<ph, Xh) is the solution of A7.1) then there is a positive constant C, independent of e, such that \\(Ph-(PhJw+Uh~-hjM^Ce. A7.11) The choice of e may thus be made independently of h; though, as pointed out in Bercovier [1978], for each h there is an optimal £. This method is surely the simplest to implement, but it has one crucial flaw. The equation is not satisfied by the solution <pA>£. Thus the method, at least for our model problem, is mostly of historical interest. 17.3. Augmented Lagrangian method Under the hypothesis that a(-,-) is symmetric and WA-elliptic, ((pk, Xh) is a solution of A7.1) if and only if (cph,Xh) is a saddle point of the Lagrangian Г defined on Wh x Mh as follows: The classical iterative method of Uzawa to obtain the saddle point of Г is described by the algorithm: Initialize: Let l|,0) be an arbitrarily chosen element of Mh. Calculate q>h: Once X^] is known, <р^] is defined to be the element of Wh minimizing r{\j/h, Afc"'); i.e. q>^ is the solution of A<pf=fh-B*Xf. A7.13) Advance Xh: Let X^+l) be defined by хГ'^ХГ + р^ВсрГ-д,). A7.14) This method in general converges quite slowly. The idea of the augmented Lagrangian method is to "penalize" the Lagrangian Г to obtain a Lagrangian Гг for which Uzawa's method converges more rapidly. For r>0, put гж.Рь)=ПФн,ь)+ЫВФн-дн\\2- A7.15) It is easy to see that (q>h, Xh) is a saddle point of Гг if and only if it is a saddle point of Г. Thus this method is not really a penalty method as Г does not have to tend toward oo to obtain the actual solution, and the augmented Lagrangian method does not
Section 17 Mixed finite element methods 595 have the defect of the earlier described penalty method. Applying Uzawa's method to Гг one obtains the algorithm: Initialize: Let lh0) be an arbitrarily chosen element of Mh. Calculate <ph: Once l[n) is known, cphx) is defined to be the element of Wh minimizing Гг{фь, X{n)); i.e. (phn) is the solution of Arcpf=(A + rB*B)cp^=fh-B4^ + rB*gh. A7.16) Advance Xh: Let Al" + 1) be defined by ЯГ+1) = 4я, + р„(вфГ-^). A7.17) The augmented Lagrangian method originated with the work of Hestenes [1969] and Powell [1969] and is studied extensively by Fortin and Glowinski [1983] where the following two results are shown (Theorems 2.1 and 2.2) under the hypothesis that A is symmetric and positive-definite: (i) There is a positive constant a such that if a < pn =% 2r for each n, then the sequence q>hB) converges to the solution <ph. (ii) If Хй is the unique element of Im В satisfying A7.1) with q>h, then the sequence lhn) converges to Xh + A°, where A° is the componant of Aj,0) in Ker B*. Remark 17.2. The algorithm of Uzawa, and hence that of the augmented Lagrangian, can be interpreted in the following manner: replace problem A7.1) by the virtual time-dependent problem a{n(t), фн) + Ъ(фн, Xh(t)) = (f, ф„) for all фке W„, \ <Ш8) and look for the solution (q>h, AJ as the steady state solution of A7.18). Introducing an explicit time discretization with nth time step pn = t"+1 —t", one obtains a{q>t\ Фъ) + Ъ(фк, Я1">) = (/, фк) for all ф„е Wk, /Я,п+1)— А(п> \ b((Phn),Hh)-{ — -,цЛ={д,цн) for all цъвМк, which gives the algebraic system for Uzawa's method and {A + rB*B)q>V+B*W = fh+rB*gh, PnBcpP-(lh"+»-W) = pngh for the augmented Lagrangian method.
596 J.E. Roberts and J.-M. Thomas Chapter IV Remark 17.3. Uzawa's algorithm can be viewed as a gradient type algorithm for the minimization of the functional J# on Mh: =i(^-1^*M*,^)-(^*,M*)+iH*.A)+(ff*,^), in that eliminating <pJT> in A7.13) and A7.14), ср^^А'НА-ВПП we obtain the algorithm Initialize: Let kf] be an arbitrarily chosen element of Mh. Advance kh: Once Aj,n) is known, put A{* + " =W -р„(ВА~1 В*?Г -BA-i fh+9h). Thus the augmented Lagrangian algorithm can be viewed as a gradient type algorithm for the minimization of the dual functional J^r on Mh: ■/*>*)=- min Гг(ф„,ць) = UBA;1B*fih,fih)-(BA;1{fH+rB*gh),iih) + UA74fk + rB*gh),fh+rB*gh) + (9k,Hk). However, A'1, like A'1, is, a priori, full. Similarly, one may eliminate A£° to obtain an algorithm involving q>^\ but again the matrix A ~l appears. Uzawa's algorithm indicates no specific method for solving the equation determining фь"'. As A and AT are positive-definite and sparse, either a direct method such as Gaussian elimination or an iterative method may be used to determine q>f in A7.13) or A7.16). The Arrow-Hurwitz algorithm is an algorithm that may be interpreted as being a variant of the Uzawa algorithm in which the method for calculating cp^ is specified: Initialize: Let Aj,0' and <p},0) be arbitrarily chosen elements of Mh and Wh, respectively. Advance <ph: Once А(,и) and cpP are known, <pi" + 1) is defined by <piK + 1>=rin)-a>n(Aq>W+B*W -fh). A7.19) Advance Xh: Let lt+1) be defined by АГ1,=Д",+р„(Я^п,-0,,). (П.20) Thus a variant of the augmented Lagrangian method is obtained by replacing A by Ar in A7.19).
Section 17 Mixed finite element methods 597 Remark 17.4. If a>„ is taken to be equal to pn, then the algorithm of Arrow-Hurwitz is also associated with the virtual time-dependent problem a(<p„(t\ ^h)+(d<ph(t)/dt, фн) + Ь(фк, Я„(£)) = (/, ф„) for all ф„еЩ„ H<Ph(t), №)-(dA(t)/dt,nh) = (g,цк) for all fiheMh, of which the solution of A7.1) is the steady state solution. 17.4. Alternating-direction method These methods can be used both in two dimensions and in three dimensions when the triangulation 9~h of the domain Q is a triangulation by rectangles. Here again one introduces a virtual time-dependent problem to obtain the solution of the original problem as its steady state. The idea is, heuristically speaking, to reduce the solution of the two- (or three-)dimensional problem to the solution of one-dimensional problems by advancing half (or a third of) a time step in the Xj -direction, then half (or a third of) a time step in the x2 -direction (and in the three-dimensional case, a third of a time step in the x3-direction). For example, in the two-dimensional case, for the Raviart-Thomas elements the subspace Wh of #(div;£2) in which the vectors are approximated is locally Рк+i.k x Pk,k+i and we suppose that the basis {wj}j™1 has been chosen such that WM=span{w;}7=1 is locally Рк+икх{0}, and Wha =span{wj}pm+1 is locally {0} x?u+1. Thus the vector wheWh may be written uniquely as wh = whA +wA2 where wh le Wh l and wh2eWh2, and the matrices A and В are block diagonal: As in Remark 17.4 a fictitious time-dependent problem is introduced: a{(Ph{t), Фн) + Ъ(фк, A*(t))=(/. Фь) for all i/f„eWh, b((ph(t),[ih)-(dXh(t)/<it,nk) = (g,nh) for all /j.heM„. The two-step iterative procedure is obtained by introducing an implicit time discretization as follows: аШ+<Р{н%Фн) + Ь(ф„,М+ш2))) = (Г,Фк) for all ^eWM, /j(n + (l/2))_ ;(n) N ЬШ + Ф$,А**)-(- -,/О = (&/*») for all fiheMh, а{(Рнп:{+Ч>ьТ\Фн) + Ъ{ФьАГУ))Ч1,Фь) for all *heWhw2, /5(n+l)_ ;(я + A/2)) \ HyW+rfj^nJ-i- у .rtJ=te,AO for all цкеМк, which gives the algebraic systems
598 J.B. Roberts and J.-M. Thomas Chapter IV PnB1<PZ)-4n+(U2})-Pn(9H'-B2<pi%)~?r> and АгсрЬ",!1' +В*22.Г1} =/и,2~АМпл\ РпВ2<рП1)-^Г1)=Рп(дь--В1<р^)-ХГ^\ From the first pair of the above equations Д"+1/2> can be eliminated to obtain (Al +pnBXB1)q>t\)=fh,l +pnB*1gb-(A2+pnB*lB2)<pi%~mW, and from the second pair, eliminating Ai"+1), one obtains {A2+PnBiB2)q,tV)^h^+PnBbH-(Al^pKBiBl)<pt\)-m^+{im). Thus the algorithm may be given as follows: Initialize: Let Д0) be an arbitrarily chosen element of Mh. The initialization (p^l of (рнл is obtained as the solution of A2<p№=f„.2-BtXL0). A7.21) Advance a half time step in the Xj -direction: Once Дп) and <pfy are known, (Al+paBrB1)q>W=fh,1+pl,Btgh-{A2+pKB1B2)<p®i-BlWK A7.22) Let A^1/2» be defined by ЯГ<1/2)) =4В) +РЛВМУ +B2<P{& -9h). A7.23) Advance a half time step in the x2-direction: Once Ajf+A/2)) and <р1$ are known, {A2+pnB*2B2)q>l$l) =fk.2+P.Bigh-(Al +рпВ$В1)<р№-ВПГа,2)). A7.24) Let 4"+ 1> be defined by 4-+1,=Д"+A/2И + Р,,(Л1«рГ1,+Л2ФЙ2+1,-9*). A7-25) Correct: Finally to obtain (p%tl), solve Л1Фе1+1) = /м-*ТАГ1). A7.26) This procedure without the correction step was introduced by Brown [1982], and as given here by Douglas, Duran and Pietra [1986,1987]. It is an adaptation of the original alternating-direction procedure of Peaceman and Rachford [1955]. Also in Douglas, Duran and Pietra [1986, 1987] an alternating-direction scheme for the three-dimensional case is introduced. This procedure is based on the scheme of Douglas [1962]. As with the classical alternating-direction iterative schemes, only limited convergence results have been obtained; cf. the above references.
Section 17 Mixed finite element methods 599 For the Brezzi-Douglas-Marini elements there is no natural decomposition of Wh as for the Raviart-Thomas elements. Thus the above algorithm is not applicable. However, an alternating-direction procedure for these elements involving the introduction of two virtual time variables is given for two dimensions in Douglas and Pietra [1986] and for three dimensions in Douglas, Duran and Pietra [1986, 1987]. 17.5. Mixed-hybrid method As pointed out earlier, the reason that the algebraic system derived from the dual mixed method poses special problems is that due to the degrees of freedom imposing the inter-element continuity requirements on the elements фк of Wk, the matrix A ~l is likely to be full. A technique often used to overcome this problem is to replace the dual mixed formulation by a mixed-hybrid formulation, cf. Fraeijs de Veubeke [1965]. The idea is to eliminate the inter-element continuity requirements from the space Wh thereby obtaining a space ffih and to impose instead the desired continuity on the solution <phe Wh via Lagrangian multipliers. Problem A7.1) is thus replaced by a problem of the following form: find <ph, %h, and r\h satisfying 4>heWh, XheMh, r\heNh, а(Фь,Фк)+Ь(фк,Ц + с(ф1пг11,) = {/,фн) for all фиещ, Ъ(Фи,1лн)=(д,Иь) for all fiheM„, c(<ph,Th)=0 for all TheNh, where JVtcnr<Te#-hL2(T) is the space of Lagrangian multipliers, and the bilinear form c(-, ■) on Wh x Nh is such that the induced linear map С from Wh into Nh has as kernel the elements of Wh which belong in fact to Wh. The linear system thus obtained is A7.28) where A is now block diagonal as well as symmetric and positive-definite, and фи and Xh are equal as functions to <ph and Aft, respectively. Eliminating <ph from the system, Ф„ = Л-1(/„-Ё*Х1,-С*г1к), A7.29) we obtain A7.27) (Я Ё* В о \C 0 c*\ 0 o/ ФЛ Ц w [h\ =u. w Sa'b* EA-1c*\nh\_(EA-ifl СА'Ё* CA-'C*)\nJ~\ СЛ-1/, h h A7.30) and the space Nh and the form c(-, •) have been constructed in such a way that this system is symmetric and positive-definite. But now we further note that BA^1!?* is
600 J.E. Roberts and J.-M. Thomas Chapter IV also block diagonal. Thus we have %„=№-! 6*)-l{EA-lfk-gh-EA->-C>rih} A7.31) and {CA-1C*-CA~1E*(EA-1B*r1EA-1C*}r1h = {С?-1 -СА-1 Ё*{ЁА'1 В*)-1 EA~l}fh + CA-lB*(BA-1B*)~1gh, A7.32) This system is symmetric, positive-definite, and sparse. Thus we can obtain r\h by solving A7.32), and then Xh and <ph are given by A7.31) and A7.29), respectively. It is interesting to note that r\h often has a physical significance. For our model problem r\h represents the value of cph on the faces T" and in fact in certain cases can be used together with фк to obtain a new approximation that converges more rapidly than <ph or equivalently cph; cf. Arnold and Brezzi [1985] and Brezzi, Douglas and Marini [1985]. Mixed hybrid methods will be taken up again in Section 21.
Chapter V Hybrid Finite Element Methods 18. Examples of primal hybrid finite element methods We shall develop several examples of hybrid formulations for the model problem: find p and и satisfying the system of first-order partial differential equations D.4)-D.5) and the Robin's boundary condition (l5A):p-v + tu = g on Г, with t a continuous, strictly positive function on Г. We shall indicate how to adapt the method to treat a problem with Dirichlet or Neumann boundary conditions. To a triangulation $~h of the domain Д we have associated the spaces H1 (&~k) and Ж(div; ^~ft), cf. E.11) and F.16). The space Hl{$~^ is a subspace of L2(Q) isomorphic to the product space Пт-^Я1 (Г), and the space Ж (div; &~h) is a subspace of (L2(Q)f isomorphic to the product space Т1ТвЗГ„Ж(длу; Т). The hybrid formulations, both primal and dual, will make use of a Lagrangian multiplier belonging to the space L2(d&~h) which is defined to be the product space L2(^h)= П Ь2EГ). A8.1) TeSh For the primal hybrid formulation, the subspace of multipliers will be E®3rh)={n = {fiT)TeyheL2{b$-h): 3qeH{div,Q) s.t. УТеЗГ„,цт= q-vT on dT}, A8.2) where for this definition q-vT denotes the normal trace on 3Г of the restriction to T of the vector function q. Thus q • vT is a priori an element of H ~1/2 (S T), cf. Remark 18.1 below. Theorems 1.2 and 1.3 allow us to characterize E{d^h) as being the subspace £F^)={^^rUeb'(S^): Vr' = Tinr2 with TuT2e $~h,nTl+nTi=0 on T'}. A8.3) For the dual hybrid formulation the subspace of multipliers will be ЦдГ„) = {ф = (<Аг W„ g L2{d3Th): 3veHl(Q) s.t. VTerh,ilfT = v on ЭГ}, A8.4) where in the expression "фт = v on ЗГ", v denotes the trace on 6 Г of the restriction to T of the function v. This trace is an element of Я1/2(ЗГ). 601
602 J.E. Roberts and J.-M. Thomas Chapter V With Theorems 1.1 and 1.3 we may characterize U$2T^ as being the subspace Цд^{ф=(фт)ТеЛ,е П Я1/2(ЭТ): Te!Th ЧТ'^Т^слТг with ТиТге^,фТ1=фТг on Г'}. A8.5) Remark 18.1. In fact the natural space of Lagrangian multipliers for the primal hybrid formulation is ЕфГк) = {ц = (цт)Тваке П Н-^фТ): 3?eH(div; Q) s.t. VTeSTh,pT =q-vT in Я" 1/2(бГ)}- The choice A8.2) while more restrictive is less cumbersome and leads to no restriction for examples of finite element methods. Remark 18.2. The first examples of hybrid formulations given in Section 3 were described as having as Lagrangian multipliers functions defined on all Q. Such a description is not in general well adapted for the numerical analysis of hybrid finite element methods. For hybrid formulations the Lagrangian multipliers are defined only on Ъ9~ь. It is sometimes possible to extend the multipliers to functions defined on all Q while taking into account complementary information when necessary. Such a procedure is a postprocessing technique. A primal hybrid formulation of problem D.4)-D.5) with the Robin's boundary condition A5.1) is given as follows: find a pair (и, k) satisfying (м,Я)еЯ1Eг1,)х£(8^), a(u,v) + b(v,k)= ifvdx for all veH1^), A8.6) a Ь(ы,/г)-Ы(Л,^)=- £ A/O^rder for all fieE{b3Th), TeSTh j where here a(u,v)= Y £ a"a7S7dx' A8-7) T b{v,n)=- Y TeSTh ST v\TiiTda A8.8) (in this expression, v\T denotes the trace on 9Tof the restriction of v to T), and d(AA*)=-£ [ (yt)kTtiTdcr. A8.9)
Section 18 Hybrid finite element methods 603 It is easy to show that problem A8.6) has at most one solution. If the solution (p, u) of A5.2), which is also the solution of A5.8) and thus belongs to Jf(div; Q) x Hl{Q), satisfies the regularity hypothesis that/?-vTeL2Fr) for each Te^h, then the pair (u, (p'VT)Teyh) is a solution of A8.6). Thus we have Ar = /rvr onSr for each Te^». A8.10) Remark 18.3. The spaces Hx($~h) and L2(d£~h) are supplied with Hubert space structures in a natural way. With W—H1 C~h) and with M = E(dS~h) having the norm induced by that of L2(d£~h), we note that the bilinear form b(-, •) does not satisfy the inf-sup condition on W x M. Toward the end of constructing an approximation (uh, Xh) of the solution (и, X) of problem A8.6) we suppose that for each Tef), we have two finite-dimensional spaces PTczHl{T) and XTczL2(dT), and we put W^iveHH^Y-VTe^vWePT} A8.11) and Mh = {(i=(fiT)TEyheE(d^hy.VTe?rh,nTeXT}. A8.12) We then seek (uh, Xh) satisfying (uh, X„)s Wh x Mh, a(uh,vh) + b{vh,Xh) = \fvhdx for all vheWk, A8.13) Q b(uh,nh) + d{Xh,fih)=- £ (l/t)gnhTda for all [iheMh. дТпГ Remark 18.4. To treat the analogous problem with a Dirichlet boundary condition и — й on Г, where й is a given function in Н1/2(Г) we would keep the same choice of subspaces Wh and Mh and look for (uh, Xh) satisfying (uh,Xh)eWhxMh, a{uh,vH) + b(vh,Xh) = fvh dx for all vheWh, b(uh,fih)=- £ unhTd<r for all (iheMh. Tefh J By contrast, to treat a homogeneous Neumann problem such as described in A4.1) or A4.6) we would introduce the subspace M0h defined by M0h = {fj,heMh:VTe^,nhT=Q on дТслГ}
604 J.E. Roberts and J.-M. Thomas Chapter V and look for (uh,lh) satisfying (uh,kh)eWh xM0h, a{uh,vh) + b{vh,lh)-- fvhdx for all vheWh, %/,,j"*)=0 for all nheM0h. The bilinear form a(*,-) is not Я'(^-elliptic and will not be Wh -elliptic. However, we shall introduce a hypothesis to make it J^-elliptic, where Vh = {vheWh:VLiheMh,b(vh,nh)=0}. A8.14) Theorem 18.1. We suppose that the two subspaces Wh and Mh are compatible in the sense that the discrete inf-sup condition is satisfied: {nheMh:4vkeWh,b(vh,iik) = 0} = {0}. A8.15) We suppose further that for each interface T = 7\ nT2 with T1,T2e^~h, there exists a function ХгбЬ2(Г') with \т>Хт da #0 such that the function A=(AT)Ts3-h in E(d^~h) defined by ATl — Хт оп Т', ЛТг — — Xr on T', and Лт = 0 on all other faces for all Те STh, belongs to the subspace Mh. Similarly we suppose that for each boundary face V с Г of an element T0 e 9~h, there exists a function Хт e L2 (T') with Jr Xr dc # 0 such that the function Л = {Лт)Те3гь in E(d$~h) defined by ЛТо = Zr on Г andAT=0 on all other faces for all Te^~h, belongs to the subspace Mh. Then problem A8.13) has a unique solution. Proof. Each function vhe Vh satisfying a(vh, vh) = 0 is a function whose restriction to T, for each Те 5^, is constant. The first part of the additional hypothesis guarantees that such a function may be identified with a function constant on each component of Q. The second part forces such a function to vanish on Г = dQ. We conclude that the form a(-,-) is F,,-elliptic and, in light of Remark 10.8, that the theorem follows. □ It is important to note that the compatibility condition A8.15) that should be satisfied by the subspaces Wh and Mh given in the forms A8.11) and A8.12) is satisfied as soon as, for each Te$~h, the spaces PT and XT satisfy the local compatibility condition jiud<r = oi = {0}. A8.16) цеХт: VvePT, er By contrast, the additional hypotheses of Theorem 18.1 demand that the functions belonging to Vh have a minimum amount of continuity at the interfaces of the triangulation. In the example that follows this continuity hypothesis will be trivially satisfied. (We will in fact choose Xr to be a constant function.)
Section 18 Hybrid finite element methods 605 Theorem 18.2. Let Q be an open polygonal domain in R2 triangulated by triangles T. Let к be a positive integer, and put (k, if к is odd, к — \ \k+l, if к is even. For each triangle Te^h, define the subspace XTcL2(dT) by XT=Dk(dT). A8.17) (With the definition of the space E%h given by F.30), we have Mh = Ewh.) Suppose that the subspace PTaHl(T) is chosen such that the following property is satisfied: 4wePkt(dT),3vePTs.t.v = w on ЭГ. A8.18) Then problem A8.13) has a unique solution. Proof. The elements of the space Dk(dT) defined in F.32) are polynomials of degree t^k — l on each edge. It is clear that condition A8.16) is satisfied if each element of Pk+x(9 T) vanishing at the vertices of Г is the trace of a function vePT. Even though we have dimPkF"F) = dimZ)k(9r), we know (cf. Raviart and Thomas [1977b, Lemma 4]) that the subspace of Dk(dT) consisting of all those functions ^eDk(dT) satisfying \ът№ dcr = 0 for each w e Pk(dT) is trivial only if к is odd. It is of dimension one when к is even. □ Before giving an error estimate we need to make precise our choice of norms. For the sake of simplicity we suppose here that the triangulation is uniformly regular. To each function yeH1^) we associate the function 5veL2{b^~h), constant on each edge Ted$~h, defined by 1 ' (v\T-v\T)da, HT' = T1nT2;T1,T2e^k mes T j| (the choice of Tx being arbitrary) 5v = { ' A8.19) mls-T^oT' if r-Tnr. - T' Once the hypotheses of Theorem 18.1 are satisfied, we clearly have <5u = 0 if veHo(£2)®Vh. The space L2(S^,) is given the norm Ы1о.а* = { I !IMo,ar}1/2, A8.20) the space Hl(£Th) the norm Mi^ = ! I MlT + h-'WSvWlzA112, A8.21) and the space E(d^~h) the norm III* III-1/2,ал defined in F.36). One can show (by adapting, for example, the demonstration of Thomas [1977, Theorem V.4.3]) that
606 J.E. Roberts and J.-M, Thomas Chapter V the Friedrichs-Poincare inequality, Ио,й^С1>]1Л for albеЯ1^), A8-22) is satisfied with a constant С independent of h. (Note that the two norms on Я'КЬ-^Иьл and MINIk^=(£re^Mi,r+b~2No,:rI/2> are equivalent uniformly in h.) One may also verify that there exist constants с and Ck independent of h such that IIMII-i/^w^cfc^llAillo.B*, forall/jeLWj, A8.23) and ll^lk^Q/r^lll^jJl-!,^ foraIl^e£&h. A8.24) Theorem 18.3. We suppose the hypotheses of Theorem 18.2 except here we fix for each TsSTh XT = Dk(dT), РТ = РАП A8.25) Then there exists a constant С independent of h such that we have lv-HHli,rh + \U-h\W-i/2,wh^Chk{\u\k+liSi + \p\k,Q} A8.26) once the solution (p,u) of A5.2) and of A5.8) belongs to (Hk(Q)J xHk+1(Q). (X is expressed as a function of p in A8.10).) Proof. It is clear that the bilinear form a( •, •) satisfies a{u, »)^^MUkMi,rh for all u,veH\^h), Фь,Ч)>а-Р1Рь¥и1Гъ> for a11 vh£ vh, as vheVh implies Svh=0. We can also establish the existence of a constant /J,/?>0, such that sup у^~>Р\Ш\\-т,^ for all ^eM». vheWh lAJl.O Finally we show the existence of a constant B0 such that Kv,H)<B0lv]1.rh\m\\-ii2,vh for all veH\rh),,ieE0(d^h), where E0(d&~h) is the subspace of functions ^e£C^,) such that jTfida — 0 on each edge T cz Г. Now, for a Dirichlet boundary condition or for a Neumann boundary condition, using Theorem 10.4, we obtain the existence of a constant C, independent of h, such that [«-"Jl/,, +1Р--У1-1/2,8*-,, <C{ inf [u-cfc]lf^+ inf IP-ma||!_1/2,s^}, A8.27) vhEWh {ftheMh-Л- ги,еЕ0(дУи)} and then using F.37) deduce the inequality A8.26).
Section 19 Hybrid finite element methods 607 To handle the case of Robin's boundary condition we introduce the direct sum decomposition of M - E(d^~h) in the form M = M0 Ф М, where M0 is the subspace of all \i e M such that for each Те &~h with Tn Г Ф 0, цт = 0 on each edge T" contained in the boundary Г. Problem A8.6), respectively A8.13), may be put in the abstract form A0.1), respectively A0.7), posed in (Я1^) x M) x M0, with for bilinear forms (w,X),(v,fi) -+ a{u,v) + b{v,I) — b{u,fi) — d{X,ji) and {v,jjl),Ho -* b(v,Ho). We then deduce as above [Н —ЫА]1]Л +||л — Ял||о,агь +IIUo — A0a|||-1/2,8^-), ^Chk{\u\k+ua+\p\k,B}. D Remark 18.5. If the integer к is even, then k* = k + l, and the choice PT=Pk+1 (T) is not optimal. We can obtain the same results by taking PT to be the space generated by ~Pk{T) together with an appropriately chosen element of Pk+l(T). For more details see Raviart and Thomas [1977b]. The primal hybrid finite element method corresponding to the subspaces Mh and Wh associated with the spaces XT and PT of A8.25) by A8.11) and A8.12) is convergent for each positive integer k, in the following sense: we can show, in the absence of any hypothesis concerning the regularity of the solution u, that lim||M-Mjo,fi=0. h-0 Approximation methods based on a principle analogous to that which was the basis for the development of primal hybrid finite element methods have been studied by Babuska [1973], Bramble [1981], Falk [1976], and PitkAranta [1979, 1980a, 1981]. For a problem with a nonhomogeneous Dirichlet boundary condition и = й on Г, we dualize the boundary condition using a multiplier ДеЯ/2(Г). The discrete inf-sup condition is established by means of compactness arguments; thus the results, notably existence and uniqueness, are assured only for "h sufficiently small". 19. Primal hybrid methods and nonconforming methods We consider the model Dirichtlet problem — Au=f in Q, н = й on Г, where / is a given function in L2(Q), й a given function in Я1/2(Г), and Q a bounded, open domain Ш2 with polygonal boundary Г. For a triangulaton &~h of Q by triangles, the simplest primal hybrid finite element method for solving this problem
608 J.E. Roberts and J.-M. Thomas Chapter V is given as follows: find (uh, Xh) satisfying (u„,Xh)eWhxMh, a(uh,vh) + b(vh,kh)= fvhdx for all vheWh, A9.1) я b(uh,fih)=- X йцкТ6а for all fikeMh, Tefh J дТпГ with W^iveHW-.VTePbMrePiiT)}, М» = {А* = (/17.O.вЛб£(8^):УГб^,ухгбР1(аГ)}, a{u,v)= £ Ге.Гь grad и • grad v dx, b(v,/i)=- X nTv\Tda. Tearh J дТпГ Eliminating the Lagrangian multiplier, we obtain a characterization of uh as the solution of the variational problem щe V\, a(uh, vh)=\ fvh dx for all vheVh, A9.2) Q with Vi = {vheWh:4nheMh,b{vh,nh)=- £ йцкТйа), агпг and Vh = {vheWh:4iiheMh,b(vh,nh) = Q}. It is a simple matter to check that Vf is the set of functions in Wh that are affine on each triangle Те 3~h, are continuous at the midpoint of each edge T" common to two triangles Tx and T2 of 2Th and have for value at the midpoint of each boundary edge T" <= Г the average value of й on T'. The set of midpoints of the three sides of a triangle is Fj-unisolvent. Thus a function vhe Vh is uniquely determined by its values at the midpoints of the edges common to two triangles of STh. It is not difficult to give a basis of Vh associated with this choice of degrees of freedom for a function in Vh. Thus, problem A9.2) can be solved directly. Such a procedure is said to be a nonconforming finite element method since Vh is not a subspace of V = Ho(£2), the space of test functions for the primal variational formulation. If tb solution и belongs to HZ(Q) we can obtain, using the results of the previous
Section 19 Hybrid finite element methods 609 section, the error bound { I \u-uh\lr\ +1|и-«*Но.г<СА|и|2,0, A9.3) an estimate that is classically obtained in a direct manner, cf. Strang [1972] or Strang and Fix [1973, Chapter 4.2]. After having calculated the solution of A9.2), to obtain the solution of A9.1) we have only to determine Xh e Mh satisfying ФАЛ)= fah dx—а(щ,vh) for each vheWh, A9.4) i.e. the functions XhT, constant on each edge of T, are determined by vXbT&G~- чт dT fvdx — grad щ • grad v dx for each v e P t (T). Thus once the nonconforming approximation uh of и has been calculated, a postprocessing procedure furnishes directly an approximation of the flux of grad и across each edge T. If T" is the edge common to Tx and Г2е^"л, then the values XhTi and XhT2 obtained from A9.4) satisfy Лг,+Лг2 = 0 onr' = 7inr2. The flux across T' from Ti into T2 is minus that from T2 into Tx. More elaborate examples of nonconforming finite elements derived from primal hybrid finite element methods for second-order elliptic problems on an open set Q in R2 may be found in Raviart and Thomas [1977b], For those examples the continuity of the functions of Vh is imposed at the Gauss points of the edges T. The example treated above, where the functions vh are locally affine, is readily generalized to the three-dimensional case, where Q is an open set of ^3. However, the situation quickly becomes more difficult for higher orders of approximation. An example with /c = 2 is given in Fortin [1985]. In fact nonconforming finite element methods were first developed for elliptic problems more complicated than the model problem above, such as: A) A fourth-order elliptic problem where the space V is a subspace of (H2(Q)J and is thus made up of functions of class C1 on Q. Examples and analyses of such methods may be found in Ciarlet [1974; 1978, pp. 362-380], Ciavaldini and Nedelec [1974], Lascaux and Lesaint [1975] and Zhong-ci Shi [1984a]. B) A Stokes problem formulated as a variational problem on a subspace of the space V of divergence-free functions in (H1(Q))". Analyses of such methods are given in Crouzeix and Raviart [1973], Fortin [1981], Hecht [1981] and Темам [1977, pp. 172-181]. C) A problem of linear elasticity formulated as a variational problem on a sub- space of V—(H1(Q))n. The motivation in this case is to obtain models less rigid than those provided by conforming methods, cf. in particular Pian [1971,1972], Fraeijs
610 J.E. Roberts and J.-M, Thomas Chapter V de Veubeke [1974b]. The most popular nonconforming finite element for this type of problem is Wilson's brick (Wilson et al. [1973]). An analysis of the method may be found in Lesaint [1976], Lesaint and ZlAmal [1980], Zhong-ci Shi [1984b] or as a particular case of primal hybrid methods in Thomas [1977, Chapter VI].1 Other interpretations using hybrid methods have been proposed in Irons [1972], Pian and Tong [1986]. We conclude this section by pointing out that the problem of finding necessary and sufficient conditions for convergence of a nonconforming finite element method has been the subject of many discussions between theoreticians and engineers, cf. in particular the response of Irons and Loikkanen [1983] to Stummel [1979, 1980a, 1980b]. When a nonconforming method can be interpreted as a primal hybrid method, possibly with reduced numerical integration, the study of its convergence is based in particular on the verification of the discrete inf-sup condition. 20. Examples of dual hybrid finite element methods Let 2Th be a triangulation of the domain Q. Associated to this triangulation are the spaces JT(div; ^defined by F.16) and Д9^) introduced in A8.4) and A8.5). Given a function feL2{Q) denote by Qf(&~h) the affine manifold defined by e^) = {?6Jf(div;^):Vre^, div(9|r)+/|r = 0}, B0.1) or more precisely defined as the space of vector functions qBl/fi&xs-.STy) which satisfy j0^-gradt)dx = jfi/pdx, for each function veH1{Q) whose trace on b3~h is trivial. In particular, Q {^h) is the vector space of functions q e Ж(div; $~h) for which q\T is divergence free for each Te^h. A dual mixed-hybrid formulation of our model problem D.4)~D.5) with Robin's boundary condition A5.1) is given as follows: find a pair (p, q>) satisfying a(p, q) + b(q, q>) = 0 for all qe Q°(^h), B0.2) TeSTh J g\j/T do- for all ф е L(8^"fc), ЪТпГ with here u=i Q ИъФ)=- I Te!Th dT AuPrti dx, B0.3) \}iTq'\TdG, B0.4) 1 An analysis of the method using Wilson's brick is also given in Sections 32-34 of Ciarlet's article in this volume (pp. 214-228).
Section 20 Hybrid finite element methods 611 (in this last expression, q • \T denotes the normal trace on 3 Г of the restriction of q to Г), and d(<p,ij/)=- £ нртфтйст. B0.5) TeSTh J emr After noting that, for each TsSTh, the normal trace on the boundary of Tis a surjective mapping of {q e Ж(div; T): div q = 0} onto {fi e L2(ST): |er/i dc = 0}, it is easy to verify that problem B0.2) has at most one solution. One shows that if the solution (p, u) of A5.2) and of A5.8) belongs to (#(div; Q)n Jf (div; ^)) x H\Q) (this is the same regularity hypothesis as that required for the existence of a solution to the primal hybrid problem A8.6)), then the pair (рАи\ът)Тед-н) is a solution of B0.2). One thus has cpT = u on 8Г for all TeFh. B0.6) Remark 20.1. If we equip the spaces Ж(<М\;&~Н) and LE^,) with Hilbert space structures, then b{-,') considered as a bilinear form on WxM, with W=Q°($'h) a subspace of Jf(div;^,) and M = L(d&~h), does not satisfy the inf-sup condition. (This situation is analogous to the case of the primal hybrid formulation, cf. Remark 18.3.) To construct an approximation of the solution (p, q>) of problem B0.2), we take for each Te^~h two finite-dimensional subspaces, Qrc?f(div; T) and УгсЯ1/2FГ). To a given function fe L2(Q), we associate the function fh e L2(Q) whose restriction to T, for each Te3~h, is the orthogonal projection in L\T) of f\T into the finite-dimensional space {dix q: q e QT}. We then put Wl = {qeJP(div;^hy.\/Te<rh,q\TeQT}nQfb(<rh), B0.7) Wh = W°k = {qe jf (div; STh): VTe3rh,q\Te QT}nQ°(£Th), B0.8) and М^{ф = (фт)т^еЦд^):УТе^,фте¥т}. B0.9) We seek a pair (ph, cph) satisfying 0>A,<P*)eW;f xMh, a(ph,qh) + Hqh, <P„) = 0 for all qheWh, B0.10) НРи>Фь) + <1((Рн,Фи)=- £ дфьт^а for all t/^eM,,. TeSTh . ЪТпГ Remark 20.2. The analogous problem with a Neumann boundary condition p-v=g, where g is a given function in Ь2(Г) satisfying Jn/dx + \rgda = 0, is
612 J.E. Roberts and J.-M. Thomas Chapter V formulated as follows: find a pair (ph, q>h) satisfying {ph,(ph)eW{xMh, a(Ph, 9h) + %,> <Ph) = ° for a11 1he wk> Ь{Рн,Фн)=~1£ вФнт&а forall^eMj, дТпГ where W{, W„, and M„ are as defined in B0.7), B0.8), and B0.9). To treat the homogeneous Dirichlet problem we seek a solution (ph, cph) of the following problem: (pk,(ph)eW(xMoh, a(A. **) + **»*. P*)=0 for all ft eW;, Ь(Рь,Фи) = 0 for all^eMo,,, where Moh is the subspace of Mh defined by MOfc = {iKeMft:Vre5"h,i^r = 0on8Tnr}. B0.11) By construction of the function fh, the affine manifold W{ is not empty. Applying Theorem 10.2 with Remark 10.8 we obtain with no difficulty the following theorem: Theorem 20.1. Problem B0.10) has a unique solution if and only if the subspaces Wh and Mk are compatible in the sense that the discrete inf-sup condition is satisfied: {^6MOi:VfteWi,b(ft,^) = 0} = {0}s B0.12) where M0h is the subspace of Mh defined in B0.11). The global condition B0.12) is satisfied once, for each Te^~h, the choice of spaces QT and YT satisfy the local compatibility condition фе YT:VqeQT with div? = 0, ij/q-vTdo = ol = P0(dT). B0.13) J ) 8Г Next we develop an example where the above theory is applied. Theorem 20.2. Let Q be an open polygonal domain in Ш2 triangulated by triangles T. Let к be a positive integer, and put ^ (k, if к is odd, \k+ 1, if к is even. For each triangle T^STh define the subspace YTcH1,2(dT) by YT = Pk(QT). B0.14)
Section 20 Hybrid finite element methods 613 (With the definition of the space Ldj-h given by E.14), we have Mh = L8/h.) Suppose that the space QTc:jf(div, T) is such that the following property is satisfied: V/ueDk.(dT),3qeQT s.t. q-vT = /i on ST. B0.15) Then problem B0.10) has a unique solution. To establish this theorem it clearly suffices to show that the local compatibility condition B0.13) is satisfied. A demonstration can be found in Thomas [1976, Lemma 4.1]. We point out that the subspace of all феРк(дТ) which satisfy Jarфц do = 0 for each p,e DJdT) with §STp. da — 0 reduces to P0FT) only when к is odd. Otherwise, when к is even, it is a subspace of dimension 2. We shall give error estimates using the norm ||*|1о,я on (L2(£?)J for vector functions (with this method, on each Те &~h, di\(p —ph) = — (/—fh) and is considered to be known or estimated a priori) and the norm ||| • |||i/2,a^ given in E.19) for the traces on d&~h of functions in H1 {Q). We suppose we have a uniformly regular family of triangulations, where the hypothesis of uniformness is made simply for ease of exposition. We note the existence of constants с and Ck independent of h such that \\Ф\\о,ш^с^'2\\\ф\\\112^ь for all феН112ф^н), B0.16) IIIWHi/Wb <C^/2l!-Aftllo,e^ for all ^еЩ.. B0.17) Theorem 20.3. We suppose the hypotheses of Theorem 20.2 except here we fix for each TeSTh YT=Pk(dT), QT=Dk.(T). B0.18) Then there exists a constant С independent ofh such that we have \\P~PH\\o^+\\\9-9k\\\i/2,m^Chk{\p\k,n + \u\k+1,n} B0.19) once the solution (p,u) of A5.2) and of A5.8) belongs to (Hk(Q)J xHk+1(Q) (</> is expressed as a function of и in B0.6)). Proof. To apply the general theory developed in Chapter III we begin by making the change of variables p to p — S^p, and ph to Рь — ^р. Thus we have P-Wpe Q°{3Th), ph-Wpe Wh<=Q°(tTh), where £"hk)p is the equilibrium interpolant of order к described in Section 6. The continuity properties and WVellipticity with constants independent of h relative to the norm || • ||0>n are obvious. Further, we can establish the existence of constants В and p0 > 0, independent of h, such that b(q, tf'KBIIf Ц0.0НМН1/2.ВД. for all qeQ°{^h), феЦд^И),
614 J.E. Roberts and J.-M. Thomas Chapter V and sup b^^ >jS0DM1/2iMil for all ф„еМ„, qheWh II ?й II 0,0 where the seminorm [*]i/2,a,% is defined by 1/2 W 1/2,Wh =\ I **-ШйЬ \фтйа i/2, аг. вг Using the Friedrichs-Poincare inequality we can show that [•]i/2,arh is a norm on Мол equivalent (uniformly in h) to the norm |||-|IIi/2,b^- Thus there exists /?>0, independent of h, such that sup ^~Г^ >Р\Ш\\щ**. for a11 ^eM0h. 4heWh №110,12 The theorem then follows easily. □ Remark 20.3. When the integer к is even, we can obtain the same result while taking the space QT to be the space generated by Dk(T) together with an appropriately chosen element of Dk + 1(T). For details and other examples cf. Thomas [1976], The dual hybrid finite element method corresponding to the choice of subspaces Mh and Wh associated with B0.18) by B0.8) and B0.9) is convergent for each integer к ^ 1 in the following sense (with no regularity hypothesis for the solution, one can show that): \im\\p -pj0,n=0- k-»0 The principle of the dual hybrid finite element methods given above for a second-order elliptic problem corresponds to that developed by specialists in structural mechanics and termed assumed stress hybrid finite element model. We cite first of all the work of Pun [1964, 1971, 1972, 1983]; cf. also Pian and Tong [1969, 1986], Tong [1983], and the recent articles of Pian and his collaborators where can be found a treatment of nonaffine-equivalent finite elements. Other examples are given by Atluri [1971], Atluri, Tong and Murakawa [1983], Punch and Atluri [1984], Spilker and Munir [1980a, 1980b] and Wolf [1972a, 1972b] among many other publications on the subject. For a mathematical analysis of such methods we refer to the work of Brezzi [1975, 1977], of Kikuchi [1973] and of Quarteroni [1979]. Remark 20.4. The construction of the examples given above followed this strategy: given a space Mh, find a space Wh large enough for the compatibility condition to be satisfied. Under reasonable hypotheses we obtain asymptotic orders of optimal error (relative to the given Wh). The construction of examples proposed by engineers often follows another strategy: given two spaces Wh and M*, the choice of spaces
Section 21 Hybrid finite element methods 615 being consistent with the physics of the problem, we note the existence of "mechanisms" (... the homogeneous problem does not have the trivial solution as unique solution). One then tries to eliminate these mechanisms, that is to reduce M% to a space Mh small enough for the compatibility condition with Wh to be satisfied. This technique does not lead to satisfactory asymptotic orders of convergence. 21. Hybridization of equilibrium methods The dual hybrid finite element method just constructed yields—when the compatibility hypothesis B0.12) is satisfied—an approximation ph ofp satisfying div ph + fh = 0 in the interior of each finite element T; however, the reciprocity of the normal traces on the interfaces is not realized pointwise along the entire interface. In particular, the solution ph is not in H(div; Q). A variant introduced by Fraeijs de Veubeke [1965] yields an approximated solutionis jf(div; Q) satisfying divph +fh =0 in Q. The mathematical analysis can be found in Thomas [1977, Chapter VIII], cf. also Raviart and Thomas [1979]. These results are completed by a postprocessing technique developed in Arnold and Brezzi [1985]. We thus reconsider the approximation of the solution (p, (p) of the model problem with the dual hybrid formulation B0.2)-B0.5). On the one hand, for each Te^h, we take a finite-dimensional subspace QT с j*F(div; T) and as before, cf. B0.7), B0.8), define the spaces W{ = {чеЖ{Ащ <Thy. 4Tefh, q\TeQT}nQf*(^h) B1.1) and Wh = Wl = {9еЖ{йщ STh): ЧТеГк, q\TeQT}nQ°(^h). B1.2) On the other hand, for each Te2Th, we put YT = {феЬ2(дТ): 3qeQTs.t.q-vT = tj/} B1.3) and define M^I^OMr^eLW*): 4Terh,\liTeYT,VT = T^T2, TuT2e <Гк,фТ1=фТ2 on T'}. B1.4) We seek {ph,q>h) satisfying (cf. B0.10)) (ph,(ph)eWfhxMh, 4Pk,9k) + Hqk,<Pk) = 0 fw all qheWh, B1.5) Ь(Ри,Фи) + ^(рк,Фн)=- £ #*rdff for all фъеМк. TeSTh . ЪТпГ
616 J.E. Roberts and J.-M. Thomas Chapter V Remark 21.1. Such an approximation method is not a dual hybrid approximation in the sense of the preceding section because the spaces YT are not subspaces of #1/2(Э Г), and thus the space Mh of multipliers is not a subspace of the space ЦЭ5~Л) denned in A8.4) or A8.5). Whereas examples of finite-dimensional subspaces of L2(csT) and of H~ 1/2(9Г) are identical, the examples of finite-dimensional subspaces of L2(dT) and of Я1/2(ЭТ) are fundamentally different: the functions in Я1/2(дГ) which are locally continuous are globally continuous on 9T. It is clear that for each choice of spaces QT с Ж(div; T) with YT associated to QT by B1.3), problem B1.5) has a unique solution (ph, q>). Suppose that the coefficient £>0 is constant on each face T" contained in the boundary Г of О (this hypothesis is made solely for simplicity of exposition). Then the elimination of the multiplier (ph in B1.5) yields a characterization of ph as the solution of the variational problem E АиРцЧы d* + A/0(Л • v)(ft • v) dcr= (l/t)gqh • v do- B1.6) i,j=l J J J T Г Г for all qhe Vh, with {qeJt(di\;Q): div ^ + /=0 on £2} n{qe(L2(Q))"-.VT6^h,q\TeQT}, B1.7) V°h = {qe Jf(div; Q): div q = 0 on 0} n{qe(L2(Q)T:\/Te^h,q\TeQT}. B1.8) The approximation ph of p thus obtained is an equilibrium approximation (cf. Section 2); i.e. ph is a vector function in Ж (div; Q) that satisfies the equilibrium equation div/>„ + /,,=0 in fl. B1.9) Conversely, to an equilibrium formulation B1.6)—B1.8) can be associated a hybrid method defined by B1.1)—B1.5). Contrary to what one might suspect at first glance, it is simpler to solve numerically B1.5) than B1.6). The numerical solution of B1.5) can be carried out by first eliminating the principal variableph. (This elimination is performed at the level of the n-simplex Г.) Then to calculate q>h, one must solve a linear system of the same structure as the one which results from a nonconforming finite element method, cf., in particular, Freijs de Veubeke [1974a]. Remrk 21.2. When the given function / is trivial, or sufficiently simple that one can numerically reduce the problem to the case/=0, to solve B1.6) (equilibrium v h vk =
Section 21 Hybrid finite element methods 617 formulation) it suffices to know how to construct a basis for Vh. In dimension n = 2, using stream functions we can treat directly problem B1.6), cf. Haslinger and Hlavacek [1975, 1976a, 1976b], Hlavacek [1980], and Hlavacek and Krizek [1984]. We have not chosen this presentation because it is not easily generalized in dimension n = 3 to the case where/is arbitrary and the boundary condition is a Robin's condition. When 3~h is a triangulation of Q by и-simplexes T, if we take for spaces QT the spaces Dk(T), where к is a given integer, к> 1, the described method is to take for spaces YT the spaces Dk(dT). Thus here we have W{ = {qeje(div; STh): VTe 2Th, q\TeDk(T)}r,Q^{3Th), B1.10) and Wh = Wt = {*e Jf(div; STh): VTe Pk, q\Te(Pk-i(TW}nQ0(rk), B1.11) where for the characterization of Wh we have used the fact that each vector function in Dk that is divergence-free belongs in fact to (Pk_ x (Г))". The associated space Vh is then none other than Vh = {qeje{d\\; Q): 6i\q=0onQ and s.t. ^Te^h,q\Te(Pk.1r}, B1.12) i.e. with the notation of F.17) Vh = Ef n {qe (L2 {Q)f: div q = 0 on Q}. For simplicity of exposition, to give error estimates we suppose that the family of triangulations STh is uniformly regular. With QT=Dk(T) and YT =Dk(dT) for each Те 3~h, there is no difficulty in showing that there is a constant /?0 >0, independent of h, such that sup b^^->hh-l,2^h'\o^ for all ^eM,,, where the seminorm [•Jo.a^ is defined by 1 Фт~ mesdT фтAа 0.8Г 1/2 Thus we have: Theorem 21.1. Suppose that, for each Te3~h, QT=Dk(T), YT=Dk{dT). B1.13) Then there exists a constant C, independent ofh, such that if(ph, q>h) is the solution of B1.5), then \p-Ph\\o,rh+h ll2\\(p-(Ph\\o,s^h^Chk{\p\k^ + \u\k + u!1}, B1.14)
618 J.E. Roberts and J.-M. Thomas Chapter V provided, of course, that the solution {p,u) of A5.2) and of A5.8) belongs to (Hk(Q)f x Hk~1(Q). (<p is expressed as a function of и in B0.6).) Once the solution (ph, q>h) of problem B1.5), with the spaces QT and YT as given in B1.13), has been obtained, we can easily calculate the function uheLz(Q) whose restriction to each Te^~h is the polynomial of degree at most k— 1 defined by \uhdi\qdx=- £ Myp^dxH- \cphTq-vTda J u=i J J т г ar for each qsDk{T). Using that (pk, <ph) satisfies B1.5) we remark that this function uh satisfies \uhdi\qhdx=- X J ij=l иР>,]Яш&х + {^/t)(9-Ph'v)qh-vda for each qheje(di\; Q) such that qh\TsDk{T) for each Ге^. The pair (ph, uh) is the unique solution of problem A5.14) (dual mixed formulation) obtained with A5.12) and A5.13). Thus we know uh is an approximation of и of order к relative to the norm || • ||0,n- Now for each Te&t,, we have both the approximation cphTeDkET) of and the approximation uh\TePk^i(T) of u\T. These two approximations provide complementary information about u\T and it is reasonable to try to use all of this information to obtain a better approximation of u\T. We take an example in dimension n = 2 when к is odd. The relations J (и$-(рнт)Ф do- = 0 for all феОк{дТ), dT and {u%~uh)vdx = 0 for all vePk.3{T) (if fc^3) define a unique function uteL2{Q) whose restriction to each triangle Te$~h is a polynomial of degree at most k. Under the standard regularity hypotheses, one can show (Arnold and Brezzi [1985, Theorem 2.2]) the order of approximation is thus improved: il"-"?llo,«=0(^ + 1). For similar results see also Stenberg [1988c].
Chapter VI Extensions and Variations 22. Other examples of mixed and hybrid methods The mixed and hybrid finite element methods given in the preceding sections were developed almost exclusively for second-order elliptic problems. Formulations of such methods for fourth-order elliptic problems were noted. The mathematical framework constructed for the numerical analysis of these methods for approximating the solutions of partial differential equations apparently ignores the physical origin of the model. We have not even sought to exploit the supplementary properties that might result from the fact that in most cases we are concerned with the solutions of constrained minimization problems. This does not however imply that the elaboration of such approximation techniques can be successfully carried out without taking into account the physical properties of the underlying problem. There is no lack of examples in computational mechanics in the recent scientific literature. We give here only a brief overview. Stokes problem is a fundamental problem of fluid mechanics. The primary unknown и is the velocity vector for the fluid displacement. It can be characterized as being the solution of a minimization problem in a subspace of the space F={n=(i)i):»iefl1(fi), l«Si<n, s.t. div » = 0} The Lagrangian multiplier associated with the constraint that the velocity field be divergence-free is identified with a pressure. The approximation of the solution of this problem has been and will continue to be the object of many publications. The state of the art in this domain can be found in the book of Girault and Raviart [1986], and it does not seem opportune to duplicate here the bibliography of this work. The system of linear elasticity constitutes another privileged domain for the application of the theory of mixed and hybrid methods. The variational formulation of the particle displacement being taken as primal formulation, the dual formulation called the equilibrium formulation, is a variational formulation of the stress equation. In the case of no volume forces this equilibrium formulation is a minimization principle in a subspace of the space Б-{а = ((Ти):аие12{0), о1} = о}1, l^t, j^n, di\a= 0}, where diva- = 0 means Zj=15ffy/9xj- = 0 for i = 1,2,3. With some simplifying 619
620 J.E. Roberts and J.-M. Thomas Chapter VI restrictions concerning the data and geometry of the problem under consideration, it is possible to use Airy functions to reduce the study of the equilibrium formulation to that of the two-dimensional biharmonic problem. The degree of sophistication necessary for constructing equilibrium finite element models is more easily understood with this analogy. Examples can be found in Johnson and Mercier [1978,1979], Krizek [1982,1983], and Krizek and NeittaamAki [1986]. Analyses of mixed and hybrid formulations for the system of elasticity are proposed by Amara and Thomas [1978,1979], Arnold, Brezzi and Douglas [1985], Arnold, Douglas and Gupta [1984], PitkAranta and Stenberg [1983], and Stenberg [1986,1988a, 1988b]. The principal source of difficulty in the approximation of the stress formulation comes from the necessity of respecting, rigorously or not, the symmetry relations that should be satisfied by the stresses. The article of Arnold and Falk [1988] proposes anew formulation with which one obtains distinct approximations of ffy and ар. Other recent articles concerning the numerical treatment of problems of solid mechanics with mixed and hybrid methods are referenced in the bibliography. While some of these articles may appear to lack a certain degree of mathematical rigor, experience has shown that the examples described therein cannot, nevertheless, be neglected. A survey of such examples in fluid and solid mechanics can be found in the book of Hughes [1987, Chapters 4 and 5]. Another domain where quite a few models using mixed or hybrid finite elements have been constructed is the subdomain of fluid mechanics dealing with fluid flow in porous media and in particular petroleum reservoir engineering. These reservoir models are based on nonlinear equations too complex to be described here in a few lines. We refer the interested reader to the book of Chavent and Jaffre [1986], where in Chapter V some models directly inspired by the mixed methods developed above may be found. Other models are described and analyzed in Chavent, Cohen, Dupuy, Jaffre and Ribera [1984], Darlow, Ewing and Wheeler [1984], Douglas [1984], Douglas, Ewing and Wheeler [1983], Douglas and Roberts [1983], Eymard, Gallouet and Joly [1988], Jaffre and Roberts [1985], and Wheeler and Gonzales [1984]. We conclude this section by mentioning that mixed finite elements have also been used for the equations of electromagnetism: Bendali [1984a, 1984b], Bossavit [1988], Nedelec [1980]. 23. Extensions and variations of the theory The methods of Lagrangian multipliers that have been presented here concern the approximation of elliptic problems. These methods have been extended to the study of eigenvalue problems; cf. Canuto [1978], Kikuchi [1987], and Mercier, Osborn, Rappaz and Raviart [1981]. They have also been extended to the study of problems of evolution: parabolic problems as in Johnson and Тномёе [1981] and in Тномёе [1984, Chapter 13], see also Squeff [1987]; hyberbolic problems as in Canuto [1981], in Douglas and Gupta [1986], in Geveci [1988], in Quarteroni [1980b]
Section 23 Extensions and variations 621 and more recently in Dupont, Glowinski, Kinton and Wheeler [1989]. Certain mixed methods have been successful only because of the use of reduced numerical integration. Some examples can be found in Johnson and PitkAranta [1982], Malkus and Hughes [1978], Noor and Peters [1983], and Shimodaira [1985]. Other examples of reduced integration are proposed in Raviart and Thomas [1977b] to obtain the interpretation of certain primal hybrid methods as nonconforming methods. In the preceding studies the approximation of elliptic problems of constrained minimization has been approached exclusively with the technique of dualization of the constraints using Lagrangian multipliers. The technique of penalization of the constraints constitutes an alternative way of attacking these minimization problems. These techniques can fortunately be combined. The resulting technique is called the augmented Lagrangian technique for which we refer the reader to the book of Fortin and Glowinski [1983]. Even though the penalization technique was not initially retained in the mixed and hybrid formulations of the problem, it can nonetheless be useful for the resolution of the linear system resulting from the mixed hybrid formulation; cf. Bercovier [1978], Felippa [1986]; cf. also Section 17. Other finite element methods such as nodal methods can be interpreted as being mixed or hybrid methods; cf. Hennart [1985,1986a, 1986b], and Hennart, Jaffre and Roberts [1988]. The so-called Trefftz methods can also be understood as being variants of the dual mixed hybrid formulations; cf. Jirousek and Lan Guez [1986] and Zielinski and Zienkiewicz [1985]. As an extension of the classical abstract theory of Ritz-Galerkin, with the development of the theory of internal approximation of variational problems which has been seen to be fundamental for the analysis of "standard" finite element methods, the abstract theory of Babuska-Brezzi for the approximation of problems with Lagrangian multipliers has become popular because of its applications to the analysis of mixed and hybrid finite element methods. Other applications of the Babuska-Brezzi theory have been proposed more recently; cf. for example the analysis of spectral methods proposed in the articles of Bernardi, Canuto and Maday [1988], and of Bernardi, Maday and Metivet [1987] or the analysis of modal decomposition methods in the articles of Stolarski and Belytschko [1986a, 1986b]. Domain decomposition methods for the solution of partial differential equations are currently enjoying a resurgence in popularity for their use in constructing algorithms well adapted to use on multiprocessing machines. The use of mixed and hybrid formulations seem adequate for this type of method; cf. Glowinski and Wheeler [1988] for a first example in this direction.
References Adams, R.A. A975), Sobolev Spaces (Academic Press, London). Amara, M. and J.-M. Thomas A978), Approximation par elements finis equilibres du systeme de Pelasticite hneaire, C.R. Acad. Sci. Paris Sir. A 286, 1147-1150. Amara, M. and J.-M. Thomas A979), Equilibrium finite elements for the linear elastic problem, Numer. Math. 33, 367-383. Arnold, D.N. A981), Discretization by finite elements of a model parameter dependent problem, Numer. Math. 37, 405-421 Arnold, D.N. and F. Brezzi A985). Mixed and nonconforming finite element methods: Implementation, postprocessing and error estimates, RAIRO Model. Math. Anal. Numer. 19, 7-32. Arnold, D.N., F. Brezzi and J. Douglas Jr A985), Peers: A new mixed finite element for plane elasticity, Japan J. Appl. Math. 1, 347-367. Arnold, D.N., J. Douglas Jr and C.P. Gupta A984), A family of higher order mixed finite element methods for plane elasticity, Numer. Math. 45, 1-22. Arnold, D.N. and R. Falk A988), A new mixed formulation for elasticity, Numer. Math. 53, 13-30. Atluri, S.N. A971), A new assumed stress hybrid finite element model for solid continua, AIAA J. 9, 1647-1649. Atluri, S.N., P. Tong and H. Murakawa A983), Recent studies in hybrid and mixed finite element methods in mechanics, in: S.N. Atluri, R.H. Gallagher and O.C. Zienkiewicz, eds., Hybrid and Mixed Finite Element Methods (Wiley, Chichester) 51-71. BabuSka, I. A971), Error bounds for finite element method, Numer. Math. 16, 322-333. BabuSka, I. A973), The finite element method with Lagrange multipliers, Numer. Math. 20, 179-192. Babuska, I. and А.К. Aziz A972), Survey lectures on the mathematical foundations of the finite element method, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 3-359. Babuska, I. and A. Miller A984), The postprocessing approach in the finite element method, Part 1: Calculation of displacements, stresses and other higher derivatives of the displacements, Internal. J. Numer. Methods Engrg. 20, 1085-1109. BabuSka, I., J.T. Oden and J.K. Lee A977), Mixed-hybrid finite element approximations of second-order elliptic boundary-value problem, Part 1, Comput. Methods Appl. Mech. Engrg. 11, 175-206. BabuSka, I., J.T. Oden and J.K. Lee A978), Mixed-hybrid finite element approximations of second-order elliptic boundary-value problem, Part 2: Weak-hybrid methods, Comput. Methods Appl. Mech. Engrg. 14, 1-22. BabuSka, I, and J.E. Osborn A983), Generalized finite element methods: Their performance and their relation to mixed methods, SIAM J. Numer. Anal. 20, 510-536. BabuSka, I., J.E. Osborn, J. Pitkaranta A980), Analysis of mixed methods using mesh dependent norms, Math. Сотр. 35, 1039-1062. Balasundaram, S. and P.K. Bhattacharyya A984), A mixed finite element method for fourth order elliptic equations with variable coefficients, Comput. Math. Appl. 10, 245-256. Balasundaram, S. and P.K. Bhattacharyya A986), A mixed finite element method for fourth order partial differential equations, Z. Angew. Math. Mech. 66, 489-499. Barlow, J. A986), A different view of the assumed stress hybrid method, Internal. J. Numer. Methods Engrg. 22, 11-16. Bendali, A. A984a), Numerical analysis of the exterior boundary value problem for the time-harmonic 623
624 J.E. Roberts and J.-M. Thomas Maxwell equations by a boundary finite element method, Part 1: The continuous problem, Math. Сотр. 43, 29-46. Bendali, A. A984b), Numerical analysis of the exterior boundary value problem for the time-harmonic Maxwell equations by a boundary finite element method, Part 2: The discrete problem, Math. Сотр. 43, 47-68. Bercovier, M. A978), Pertubation of mixed variational problems. Application to mixed finite element methods, RAIRO Anal. Numer. 12, 211-236. Bernardi, С A986), Contributions a 1'Analyse Numerique de Problemes Non Lineaires, These d'Etat, Universite Pierre et Marie Curie, Paris. Bernardi, С, С Canuto and Y. Maday A988), Generalized inf-sup condition for Chebyshev approximation of the Stokes problem, SI AM J. Numer. Anal. 25, 1237-1271. Bernardi, C, Y. Maday and B. Metivet A987), Calcul de la pression dans la resolution spectrale du probleme de Stokes, Reck Aerospat. 1, 1-21 Bossavit, A. A988), Mixed finite elements and the complex of Whitney forms, in: J.R. Whiteman, ed., MAFELAP 87, The Mathematics of Finite Elements and Applications (Academic Press, New York) 137-144. Bramble, J.H. A981), The Lagrange multiplier method for Dirichlet's problem, Math. Сотр. 37, 1-11. Bramble, J.H. and R.S. Falk A983), Two mixed finite element methods for the simply supported plate Problem, RAIRO Anal. Numer. 17, 337-384. Bramble, J.H. and R.S. Falk A985), A mixed-Lagrange multiplier finite element method for the polyharmonic equation, RAIRO Model. Math. Anal. Numer. 19, 519-557, Bramble, J.H. and E. Pasciak A988), A preconditioning technique for indefinite systems resulting from mixed approximations of elliptic problems, Math. Сотр. 50, 1-17. Brezzi, F. A974a), Theoremes d'existence, d'unicite et d'approximation numerique pour des problemes de point-selle, C.R. Acad. Sci. Paris Sir. A 278, 839-842. Brezzi, F. A974b), On the existence, uniqueness and approximation of saddle-point problems arising from Lagrangian multipliers, RAIRO Anal. Numer. 8, 129-151. Brezzi, F. A974c), Sur une methode hybride pour l'approximation du probleme de la torsion d'une barre elastique, Istit. Lombardo (Rend. Sci.) A 108, 274-300. Brezzi, F. A975), Sur la methode des elements finis hybrides pour le probleme biharmonique, Numer. Math. 24, 103-131. Brezzi, F. A977), Hybrid method for fourth order elliptic equations, in: I. Galligani and E. Magenes, eds., Mathematical Aspects of Finite Element Methods, Lecture Notes in Mathematics 606 (Springer, Berlin) 35-^*6. Brezzi, F. A979), Non-standard finite elements for fourth order elliptic problems, in: R. Glowinski, E.Y. Rodin and O.C. Zienkiewicz, eds., Energy Methods in Finite Element Analysis (Wiley, Chichester) 193-211. Brezzi, F., J. Douglas Jr, R. Duran and M. Fortin A987), Mixed finite elements for second order elliptic problems in three variables, Numer. Math. 51, 237-250. Brezzi, F., J. Douglas Jr, M. Fortin and L.D. Marini A987), Efficient rectangular mixed finite elements in two and three variables, RAIRO Model. Math. Anal. Numer. 21, 581-604. Brezzi, F., J. Douglas Jr, and L.D. Marini A985), Two families of mixed finite elements for second order elliptic problems, Numer. Math. 47, 217-235. Brezzi, F. and М. Fortin A986), Numerical approximation of Mindlin Reissner plates, Math. Сотр. 47, 151-158. Brezzi, F. and L.D. Marini A975), On the numerical solution of plate bending problems by hybrid methods, RAIRO Anal. Numer. 9, 5-50. Brezzi, F., L.D. Marini, A. Quarteroni and Р.А. Raviart A980), On an equilibrium finite element method for plate bending problems, Calcolo 17, 271-291. Brezzi, F. and P.-A. Raviart A978), Mixed finite element methods for 4th order elliptic equations, in: J. Miller, ed., Topics in Numerical Analysis III (Academic Press, New York) 35-56. Brown, D.C. A982), Alternating-direction iterative schemes for mixed finite element methods for second order elliptic problems, Thesis, University of Chicago, Chicago, IL. Canuto, С A978), Eigenvalue approximations by mixed methods, RAIRO Anal. Numer. 12, 27-50.
References 625 Canuto, С. A981), A hybrid finite element method to compute the free vibration frequencies of a clamped plate, RAIRO Anal. Numer, 15, 101-118. Carey, G.F., S.S. Chow and M.K. Seager A985), Approximate boundary-flux calculations, Comput. Methods Appl. Mech. Engrg. 50, 107-120. Chavent, G., G. Cohen, M. Dupuy, J. Jaffre and I. Ribera A984), Simulation of two dimensional waterfiooding using mixed finite elements, Soc, Petroleum Engrs. J. 24, 382-390. Chavent, G. and J. Jaffre A986), Mathematical Models and Finite Elements for Reservoir Simulation, Studies in Mathematics and Its Applications 17 (North-Holland, Amsterdam). Cheung, Y.K. and Chen Wajni A988), Isoparametric hybrid hexahedral elements for threedimensional stress analysis, Internal. J. Numer. Methods Engrg. 26, 677-693. Ciarlet, P.G. A974), Conforming and nonconforming finite element methods for solving the plate problem, in: G.A. Watson, ed., Numerical Solution of Differential Equations, Lecture Notes in Mathematics 363 (Springer, Berlin) 156-176. Ciarlet, P.G. A978), The Finite Element Method for Elliptic Problems, Studies in Mathematics and Its Applications 4 (North-Holland, Amsterdam). Ciarlet, P.G. A982), Introduction а ГAnalyse Numerique Matricielle et a FOptimisation (Masson, Paris). Ciarlet, P.G. and P. Destuynder A979a), A justification of the two-dimensional linear plate model, J. Mecanique 18, 315-344. Ciarlet, P.G. and P. Destuynder A979b), A justification of a nonlinear model in plate theory, Comput. Methods Appl. Mech. Engrg. 17, 227-258. Ciarlet, P.G. and P.-A. Raviart A972), General Lagrange and Hermite interpolation in U" with applications to finite element methods, Comput. Methods Appl. Mech. Engrg. 1, 217-249. Ciarlet, P.G. and P.-A. Raviart A974), A mixed finite element method for the biharmonic equation, in C. de Boor, ed., Mathematical Aspects of Finite Elements in Partial Differential Equations (Academic Press, New York) 125-145. Ciavaldini, J.F. and J.-C. Nedelec A974), Sur l'element de Fraeijs de Veubeke et Sander, RAIRO Anal. Numer. 8, 29-45. Darlow, В., R.E. Ewing and M.F. Wheeler A984), Mixed finite element methods for miscible displacement problems in porous media, Soc. Petroleum Engrs. J. 24, 391-398. Dautray, R. and J.-L. Lions A984), Analyse Mathematique et Calcul Numerique pour les Sciences et les Techniques 1, Collection Commissariat a 1'Energie Atomique (Masson, Paris). Day, M.L. and T.Y. Yang A982), A mixed variational principle for finite element analysis, lnternat. J. Numer. Methods Engrg. 18, 1213-1230. Destuynder, P. A986), Une Theorie Asymptotique des Plaques Minces en Elasticite Lineaire, Recherches en Mathematiques Appliquees 2 (Masson, Paris). Destuynder, P. and J.-C. Nedelec A986), Approximation numerique du cisaillement transverse dans les plaques minces en flexion, Numer. Math. 48, 281-302. Douglas Jr, J. A962), Alternating-direction methods for three space variables, Numer. Math. 4,41-63. Douglas Jr, J. A984), Numerical methods for the flow of miscible fluids in porous media, in: R.W. Lewis, P. Bettess and E. Hinton, eds., Numerical Methods in Coupled Systems (Wiley, Chichester). Douglas Jr, J., R. Duran and P. Pietra A986), Alternating-direction iteration for mixed finite element methods, in: R. Glowinski and J.L. Lions, eds., Computing Methods in Applied Sciences and Engineering, VII (North-Holland, Amsterdam). Douglas Jr, J., R. Duran and P. Pietra A987), Formulation of alternating-direction iterative methods for mixed methods in three space, in: E.L. Ortiz, ed., Numerical Approximation of Partial Differential Equations, Mathematics Studies 133 (North-Holland, Amsterdam) 21-30. Douglas Jr, J., R.E. Ewing and M.F. Wheeler A983), The approximation of the pressure by a mixed method in the simulation of miscible displacement, RAIRO Anal. Numer. 17, 17-33. Douglas Jr, J. and C.P. Gupta A986), Superconvergence for a mixed finite element method for elastic wave propagation in a plane domain, Numer. Math. 49, 189-202. Douglas Jr, J. and A. Milner A985), Interior and super convergence estimates for mixed methods for second order elliptic problems, RAIRO Model. Math. Anal. Numer. 19, 397-428. Douglas Jr, J. and P. Pietra A986), A description of some alternating-direction iterative techniques for mixed finite element methods, in: W.E. Fitzgibbon, ed., Mathematical and Computational Methods in
626 J.E. Roberts and J.-M. Thomas Seismic Exploration and Reservoir Modeling (SIAM, Philadelphia, PA) 37-53. Douglas Jr, J. and J.E. Roberts A982), Mixed finite element methods for second order elliptic problems, Math. Appl. Comput. 1, 91-103. Douglas Jr, J. and J.E. Roberts A983), Numerical methods for a model for compressible miscible displacement in porous media, Math. Сотр. 41, 441-459. Douglas Jr, J. and J.E. Roberts A985), Global estimates for mixed methods for second order elliptic problems, Math. Сотр. 44, 39-52. Dupont, Т., R. Glowinski, W. Kinton and M.F, Wheeler A989), Mixed finite element methods for time dependent problems: Application to control, Research Rept. 54, Department of Mathematics, University of Houston, Houston, TX. Duvaut, G. and J.-L. Lions A972), Les Inequations en Mecanique et en Physique (Dunod, Paris). Ewing, R.E. and M.F. Wheeler A983), Computational aspects of mixed finite element methods, in: R.S. Stepleman, ed., Numerical Methods for Scientific Computing (North-Holland, Amsterdam) 163-172. Evmard, R., T. Gallouet and P. Joly A988), Hybrid finite element technics for oil recovery simulation, Publications du Laboratoire d'Analyse Nume'rique, Universite Pierre et Marie Curie, Paris. Falk, R.S. A976), A Ritz method based on complementary variational principle, RAIRO Anal. Numer. 10, 39-48. Falk, R.S. A978), Approximation of the biharmonic equation by a mixed finite element method, SIAM J. Numer. Anal. 15, 556-567. Falk, R.S. and J.E Osborn A980), Error estimates for mixed methods, RAIRO Anal. Numer. 14,249-277. Felippa, C.A. A986), Penalty-function iterative procedures for mixed finite element formulations, Internal. J. Numer. Methods Engrg. 22, 267-279. Fix, G.J. A976), Hybrid finite element methods, SIAM Rev. 18, 46CM84. Fix, G.J, M.D. Gunzburger and R.A. Nicolaides A979), On finite element methods of least squares type, Comput. Methods Appl. Mech. Engrg. 10, 175-198. Fix, G.J., M.D. Gunzburger and R.A. Nicolaides A981), On mixed finite element methods for first order elliptic systems, Numer. Math. 37, 29-48. Fortin, M. A977), Analysis of the convergence of mixed finite element methods, RAIRO Anal. Numer. 11, 341-354. Fortin, M. A985), A three-dimensional quadratic nonconforming element, Numer. Math. 46, 269- 279. Fortin, M. and R. Glowinski A983), Resolution de Problemes aux Limites par des Mithodes de Lagrangien Augmente (Dunod, Paris). Fortin, M. and R. GLOWINSKI A983), Augmented Lagrangian Methods: Application to the Numerical Solution of Boundary-Value Problems, Studies in Mathematics and Its Applications 15 (North-Holland, Amsterdam). Fraeus de Veubeke, B. A965), Displacement and equilibrium models in the finite element method, in: O.C. Zienkiewicz and G.S. Houster, eds., Stress Analysis (Wiley, New York) 145-197 (Chapter 9). Fraeus de Veubeke, B. A973), Diffusive equilibrium models, University of Calgary Lecture Notes, International Research Seminar on the Theory and Applications of the Finite Element Methods, Calgary, Alta. Fraeus de Veubeke, B. A974a), Finite element method in aerospace engineering problems in: R. Glowinski and J.-L. Lions, eds. Computing Methods in Applied Sciences and Engineering, Lecture Notes on Computer Science 16 (Springer, Berlin) 224-258. Fraeus de Veubeke, B. A974b), Variational principles and the patch test, Internal. J. Numer. Methods Engrg. 8, 783-801. Fraeus de Veubeke, B. A975), Stress function approach, in: World Congress on Finite Element Methods in Structural Mechanics 1, Bournemouth, England, J.1-J.51. Fraeus de Veubeke, B. and M.A. Hogge A972), Dual analysis for heat conduction problems by finite elements, Internal. J. Numer. Methods Engrg. 5, 65-82. Fraeus de Veubeke, B. and G. Sander A968), An equilibrium model for plate bending, Intemat. J. Solids Structures 4, 447-468. Gastaldi, L. and R. Nochetto A987), Optimal L™-estimates for nonconforming and mixed finite element methods of lowest order, Numer. Math. 50, 587-611.
References 627 Geveci, Т. A988), On the application of mixed finite element methods to the wave equations, RAIRO Model. Math. Anal. Numer. 22, 243-250. Girault, V. and P.-A. Raviart A979), Finite Element Approximation of the Navier-Stokes Equations, Lecture Notes in Mathematics 749 (Springer, Berlin). Girault, V. and P.-A. Raviart A986), Finite Element Methods for Navier-Stokes Equations: Theory and Algorithms, Lecture Notes in Computational Mathematics 5 (Springer, Berlin). Glowinski, R. and M.F. Wheeler A988), Domain decomposition and mixed finite element methods for elliptic problems, in: R. Glowinski, G. Golub, G. Meurant and J. Periaux, eds., First International Symposium on Domain Decomposition Methods for Partial Differential Equations (SIAM, Philadelphia, PA) 144-172. Haslinger, J. and I. Hlavacek A975), Curved element in a mixed finite element method, Apl. Mat. 20, 233-252. Haslinger, J. and I. Hlavacek A976a), Convergence of an equilibrium finite element method based on the dual variational formulation, Apl. Mat. 21, 43-65. Haslinger, J. and I. Hlavacek A976b), A mixed finite element method close to the equilibrium model, Numer. Math. 26, 85-97. Haslinger, J. and P. NeittaanmAki A984), On different finite element methods for approximating the gradient of the solution to the Helmholtz equation, Comput. Methods Appl. Mech. Engrg. 42, 131- 148. Hecht, F. A981), Construction d'une base d'un element fini Pj non conforme a divergence nulle dans R3. RAIRO Anal. Numer. 15, 119-150. Hellan K. A967), Analysis of elastic plates in flexure by a simplified finite element method, Acta Polytech. Scand. Civil Engrg. Ser. 46. Hennart, J.P. A985), Nodal schemes, mixed-hybrid finite elements and block-centered finite differences, Rapport de Recherche 386, INRIA Rocquencourt. Hennart, J.P. A986a), A general family of nodal schemes, SIAM J. Sci. Statist. Comput. 3, 264- 287. Hennart, J.P. A986b), A general finite element framework for nodal methods, in: J.R. Whiteman, ed., MAFELAP 1984, The Mathematics of Finite Elements and Applications (Academic Press, London) 309-316. Hennart, J.P., J. Jaffre and J.E. Roberts A988), A constructive method for deriving finite elements of nodal type, Numer. Math. 53, 701-738. Herrmann, L.R. A967), Finite element bending analysis for plates, J. Engrg. Mech. Div. ASCE 93 E), 13-26. Herrmann, L.R. A983), Mixed Finite elements for couple-stress analysis, in: S.N. Atluri, R.H. Gallagher and O.C. Zienkiewicz, eds., Hybrid and Mixed Finite Element Methods (Wiley, Chichester) 1-17. Hlavacek, I. A979), Convergence of an equilibrium finite element model for plane elastostatics, Apl. Mat. 24, 427-457. Hlavacek, I. A980), The density of solenoidal functions and the convergence of a dual finite element method, Apl. Mat. 25, 39-55. Hlavacek, I. and M. Krizek A984), Internal finite element approximations in the dual variational methods for second order elliptic problems with curved boundaries, Apl. Mat. 29, 52-69. Hughes, T.J.R. A987), The Finite Element Method, Linear Static and Dynamic Finite Element Analysis (Prentice-Hall, Englewood Cliffs, NJ). Irons, B.M. A972), An assumed-stress version of the Wilson 8-nodes isoparametric brick, Computer Rept. CNME/56/, University of Wales, Swansea. Irons, B.M. A984), Alternative ways for the formulation of hybrid stress elements, Internat. J. Numer. Methods Engrg. 20, 780-782. Irons, B.M. and M. Loikkanen A983), An engineers' defence of the patch test, Internat. J. Numer. Methods Engrg. 19, 1391-1401. Irons, B.M. and A. Razzaque A972), Experience with the patch test for convergence of finite elements, in: A.K. AZIZ, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 557-587.
628 J.E. Roberts and J.-M. Thomas Jaffre, J. A984), Elements finis mixtes et decentrage pour les equations de diffusions-convection, Calcolo 23, 171-197. Jaffre, J. and J.E. Roberts A985), Upstream weighting and mixed finite elements in the simulation of miscible displacements, RAIRO Model, Math. Anal. Numer. 19, 443-460. Jensen, C. A979), A mixed finite element method with curved elements, Research Rept. 79.07 R, Department of Computer Sciences, Chalmers University of Technology and University of Goteborg, Sweden. Jirousek, J. and Lan Guex A986), The hybrid-Trefftz finite element model and its application to plate bending, Internat. J. Numer. Methods Engrg. 23, 651-693. Johnson, С A973), On the convergence of the mixed finite element method for plate bending problems, Numer. Math. 21, 43-62. Johnson, С and B. Mercier A978), Some equilibrium finite element methods for two-dimensional elasticity problems, Numer. Math. 30, 103-116. Johnson, С and B. Mercier A979), Some equilibrium finite element methods for two-dimensional problems in continuum mechanics, in: R. Glowinski, E.Y. Rodin and O.C. Zienkiewicz, eds., Energy Methods in Finite Element Analysis (Wiley, Chichester) 213-224. Johnson, C. and J. Pitkaranta A982), Analysis of some finite element methods related to reduced integration, Math. Сотр. 38, 375-400. Johnson, C. and V. Тномёе A981), Error estimates for some mixed finite element methods for parabolic type problems, RAIRO Anal. Numir. 15, 41-78. Joly, P. A982), La methode des elements finis mixtes appliquee au probleme de diffusion-convection, These de 3eme cycle, Universite Pierre et Marie Curie, Paris. Joly, P. A984), Resolution de systemes lineaires non symetriques par des methodes de gradient conjugue, Publications du Laboratoire d'Analyse Numerique, Universite Pierre et Marie Curie, Paris. Joly, P. A986), Presentation de synthese des methodes de gradient conjugue, RAIRO Model. Math. Anal. Numer. 20, 639-665. Kikuchi, F. A973), Some considerations of the convergence of hybrid stress method, in: Y. Yamada and R.H. Gallagher, eds., Theory and Practice in Finite Element Structural Analysis (University of Tokyo Press, Tokyo) 25-42. Kikuchi, F. A983), On a mixed method related to the discrete Kirchhoff assumption, in: S.N. Atluri, R.H. Gallagher and O.C. Zienkiewicz, eds., Hybrid and Mixed Finite Element Methods (Wiley, Chichester) 137-154. Kikuchi, F. A987), Mixed and penalty formulations for finite element analysis of an eigenvalue problem in electromagnetism, Comput. Methods Appl. Meek Engrg. 64, 509-521. Krizek, M. A982), An equilibrium finite element method in three-dimensional elasticity, Apl. Mat. 27, 46-75. Krizek, M. A983), Conforming equilibrium finite element methods for some elliptic plane problems, RAIRO Anal. Numer. 17, 35-65. Krizek, M. and P. NeittaanmAki A986), Internal FE approximation of spaces of divergence-free functions in three-dimensional domains, Internat. J. Numer. Methods Fluids 6, 811-817. Kwon, Y. and F.A. Milner A988), Vю -error estimates for semilinear second-order elliptic equations, SI AM J. Numer. Anal. 25, 46-53. Ladyzhenskaya, O.-A. A969) The Mathematical Theory of Viscous Flow (Gordon and Breach, New York, 2nd ed.). Lascaux, P. and P. Lesaint A975), Some nonconforming finite elements for the plate bending problem, RAIRO Anal. Numer. 9, 9-53. Lee, S.W. and J.J. Rhiu A986), A new approach to the formulation of mixed finite element models for structural analysis, Internat. J. Numer. Methods Engrg. 21, 1629-1641. Lee, S.W., S.C. Wong and J.J. Rhiu A985), Study of a nine node mixed formulation finite element for thin plates and shells, Comput. Structures 21, 1325-1334. Le Roux, M.-N. A982), A mixed finite element method for a weighted elliptic problem, RAIRO Anal. Numer. 16, 243-273. Lesaint, P. A976), On the convergence of Wilson's nonconforming element for solving the elastic problem, Comput. Methods Appl. Mech. Engrg. 7, 1-6.
References 629 Lesaint, P. and M. Zlamal A980), Convergence of the nonconforming Wilson element for arbitrary quadrilateral meshes, Numer. Math. 36, 33-52. Li, Z.C. and G.P. Liang A981), On the simplified hybrid-combined method, Math. Сотр. 41, 13- 25. Lions, J.-L. and E. Magenes A968), Problemes aux Limites поп Homogenes et Applications 1 (Dunod, Paris). Malkus, D.S. and T.J.R. Hughes A978), Mixed finite element methods, reduced and selective integration techniques: A unification of concepts, Comput. Methods Appl. Mech. Engrg. 15, 63-81. Marini, L.D. A985), An inexpensive method for the evaluation of the solution of the lowest order Raviart-Thomas mixed method, SI AM J. Numer. Anal. 22, 493-496. Marsden, J.E. and T.J.R. Hughes A983), Mathematical Foundations of Elasticity (Prentice-Hall, Englewood Cliffs, NJ). Mercier, B. A974), Numerical solution of the biharmonic problem by mixed finite elements of class <ё°, Boll. Un. Mat. Hal. 10, 133-149. Mercier, В., J. Osborn, J. Rappaz and P.-A. Raviart A981), Eigenvalue approximation by mixed and hybrid methods, Math. Сотр. 36, 427-453. Mignot, A.L. and С Surry A981), A mixed finite element family in plane elasticity, Appl. Math. Modelling 5, 259-262. Milner, F. A985), Mixed finite element methods for quasilinear second order elliptic problems, Math. Сотр. 44, 303-320. Mirza, F.A. and M.D. Olson A980), The mixed finite element method in plane elasticity, Internal. J. Numer. Methods Engrg. 15, 273-289. Miyoshi, T. A973), A finite element method for the solution of fourth order partial differential equations, Kumamoto J. Sci. (Math.) 9, 87-116. Mizukami, A. A986), A mixed finite element method for boundary flux computation, Comput. Methods Appl. Mech. Engrg. 57, 239-243. Mizukami, A. and T.J.R. Hughes A985), A Petrov-Galerkin finite element method for convection- dominated flows: An accurate upwinding technique for satisfying the maximum principle, Comput. Methods Appl. Mech. Engrg. 50, 181-193. Monk, P. A987), A mixed finite element method for the biharmonic equation, SI AM J. Numer. Anal. 24, 737-749. Morley, L.S.D. A968), The triangular equilibrium element in the solution of the plate bending problems, Aero. Quart. 19, 255-269. Nedelec, J.-C. A980), Mixed finite elements in R3, Numer. Math. 35, 315-341. Nedelec, J.-C. A986), A new family of mixed finite element in R3, Numer. Math. 50, 57-81. Neittaanmaki, P. and J. Saranen A981), On finite element approximation of the gradient for solution of Poisson equation, Numer. Math. 37, 333-337. Nicolaides, R.A. A982), Existence, uniqueness and approximation for generalized saddle point problems, SI AM J. Numer. Anal. 19, 349-357. Noor, A.K. and J.M. Peters A983), Mixed models and reduced selective integration displacement models for vibration analysis of shells, in: S.N. Atluri, R.H. Gallagher and O.C. Zienkiewicz, eds., Hybrid and Mixed Finite Element Methods (Wiley, Chichester) 537-564. Oden, J.T. A973), Some contributions to the mathematical theory of mixed finite element approximations, in: Y. Yamada and R.H. Gallagher, eds., Theory and Practice in Finite Element Structural Analysis (University of Tokyo Press, Tokyo) 3-23. Oden, J.T. and J.K. Lee A975), Theory of mixed and hybrid finite element approximations in linear elasticity, in: Proceedings 1U ТАМ, IUM Symposium on Applications of Methods of Functional Analysis to Problems of Mechanics, Marseille. Oden, J.T. and J.N. Reddy A975), Some observations on properties of certain mixed finite element approximations, Internat. J. Numer. Methods Engrg. 9, 933-949. Oden, J.T. and J.N. Reddy A976a), On mixed finite element approximations, SI AM J. Numer. Anal. 13, 392-404. Oden, J.T. and J.N. Reddy A976b), An Introduction to the Mathematical Theory of Finite Elements (Wiley, New York).
630 J.E. Roberts and J.-M. Thomas Peaceman, D.W. and H.H. Rachford Jr A955), The numerical solution of parabolic and elliptic differential equations, J. SIAM 3, 28-41. Pian, T.H.H. A964), Derivation of element stiffness matrices by assumed stress distributions, AIAA J. 2, 1333-1336. Pian, T.H.H. A971), Formulations of finite element methods for solid continua, in: R.H. Gallagher, Y. Yamada and J.T. Oden, eds., Recent Advances in Matrix Methods of Structural Analysis and Design (University of Alabama Press, Tuscaloosa, AL), 49-83. Pian, T.H.H. A972), Finite element formulation by variational principles with relaxed continuity requirements, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 671-687. Pian, T.H.H. A982), On the equivalence between incompatible displacement element and hybrid stress element, Appl. Math. Mech. (English Ed.) 3 F), 773-776. Pian, T.H.H. A983), Reflections and remarks on hybrid and mixed finite element methods, in: S.N. Atluri, R.H. Gallagher and O.C. Zienkiewicz, eds., Hybrid and Mixed Finite Element Methods (Wiley, Chichester) 565-570. Pian, T.H.H. A985), Finite element based on consistently assumed stresses and displacements, Finite Elements Anal. Design 1, 131-140. Pian, T.H.H. and D.P. Chen A982), Alternative ways for formulation of hybrid stress elements, Internat. J. Numer. Methods Engrg. 18, 1676-1684. Pian, T.H.H., D.P. Chen and D. Kang A983), A new formulation of hybrid-mixed finite elements, Comput. Structures 16, 81-87. Pian, T.H.H., D. Kang and С Wang A987), Hybrid plate elements based on balanced stresses and displacements, in: T.J.R. Hughes and E. Hinton, eds., State-of-the-Art Texts on Finite Element Methods in Plate and Shell Structural Analysis (to appear). Pian, Т.Н.Н., M.-S. Li and D. Kang A986), Hybrid stress elements based on natural isoparametric coordinates, in: Proceedings Invitational China-American Workshop on Finite Element Methods, Chengde, People's Republic of China. Pian, T.H.H. and K. Sumihara A984), Rational approach for assumed stress finite elements, Internat. J. Numer. Methods Engrg. 22, 173-181. Pian, T.H.H. and P. Tong A969), Basis of finite element methods for solid continua, Internat. J. Numer. Methods Engrg. 1, 3-28. Pian, T.H.H. and P. Tong A986), Relations between incompatible displacement model and hybrid stress model, Internat. J, Numer. Methods Engrg. 22, 173-181. PiNSKY, P.M. and R.V. Jasti A989), A mixed finite element formulation for Reissner-Mindlin plates based on the use of bubble functions, Internat. J. Numer. Methods Engrg. 28, 1677-1702. Pitkaramta, J. A979), Boundary subspaces for the finite element method with Lagrange multipliers, Numer. Math. 33, 273-289. PitkAranta, J. A980a), Local stability conditions for the BabuSka method of Lagrange multipliers, Math. Сотр. 35, ШЗ-1129. PitkAranta, J. A980b), A conforming finite element method with Lagrange multipliers for the biharmonic problem, RAIRO Anal. Numer. 14, 309-324. PitkAranta, J. A981), The finite element method with Lagrange multipliers for domains with corners, Math. Сотр. 37, 13-30. PitkAranta, J. and R. Stenberg A983), Analysis of some mixed finite element methods for plane elasticity equations, Math. Сотр. 41, 399-423. Powell, M.J.D. A969), A method for nonlinear constraints in minimization problems, in: R. Fletcher, ed., Optimization (Academic Press, London) 283-298. Punch, E.F. and S.N. Atluri A984), Development and testing of stable, invariant, isoparametric curvilinear 2- and 3-D hybrid-stress elements, Comput. Methods Appl. Mech. Engrg. 47, 331- 356. QUARTERONI, A. A979), Error estimates for the assumed stresses hybrid methods in the approximation of fourth order elliptic equations, RAIRO Anal. Numer. 13, 355-367. Quarteroni, A. A980a), On mixed methods for fourth order problems, Comput. Methods Appl. Mech. Engrg. 24, 13-24.
References 631 Quarteroni, A. A980b), Mixed approximations of evolution problems, Comput. Methods Appl. Mech. Engrg. 24, 137-163. Rabier, P. and J.-M. Thomas A985), Introduction a VAnalyse Numerique des Equations aux Derivees Partielles, Exercices (Masson, Paris). Rannacher, R. A979), On nonconforming and mixed finite element methods for plate bending problems: The linear case, RA1RO Anal. Numer. 13, 369-387. Raviart, P.-A. A984), Mixed finite element methods, in: D.F. Griffiths, ed., The Mathematical Basis of Finite Element Methods, with Applications to Partial Differential Equations (Clarendon Press, Oxford), 123-156. Raviart, P.-A. and J.-M. Thomas A977a), A mixed finite element method for second order elliptic problems, in: I. Galligani and E. Magenes, eds., Mathematical Aspects of Finite Element Methods Lecture Notes in Mathematics 606 (Springer, Berlin) 292-315. Raviart, P.-A. and J.-M. Thomas A977b), Primal hybrid finite element methods for second order elliptic equations, Math. Сотр. 31, 391-413. Raviart, P.-A. and J.-M. Thomas A979), Dual finite element models for second order elliptic problems, in: R. Glowinski, E.Y. Rodin and O.C. Zienkiewicz, eds., Energy Methods in Finite Element Analysis (Wiley, Chichester) 175-191. Raviart, P.-A. and J.-M. Thomas A983), Introduction a TAnalyse Numerique des Equations aux Derivies Partielles (Masson, Paris). Reddy, J.N. and J.T. Oden A973), Convergence of mixed finite-element approximations of a class of linear boundary-value problems, J. Structural Mech. 2, 83-108. Reddy, J.N. and J.T. Oden A975), Mixed finite element approximations of linear boundary-value problems, Quart. Appl Math. 33, 255-280. Reissner, E. A984), On a certain mixed variational theorem and a proposed application, Internat. J. Numer. Methods Engrg. 20, 1366-1368. Rubinstein, R., E. Punch and S.N. Atluri A983), An analysis of and remedies for, kinematic modes in hybrid stress finite elements: selection of stable, invariant stress fields, Comput. Methods Appl. Mech. Engrg. 38, 63-92. Samuelsson, A. A979), The global constant strain condition and the patch test, in: R. Glowinski, E.Y. Rodin and O.C. Zienkiewicz, eds., Energy Methods in Finite Element Analysis (Wiley, Chichester) 47-58. Sander, G. and P. Beckers A977), The influence of the choice of connectors in finite element method, Internat. J. Numer. Methods Engrg. 11, 1491-1505. Sarigul, N. and R.H. Gallagher A989), Assumed stress function finite element method: two-dimensional elasticity, Internat. J. Numer. Methods Engrg. 28, 1577-1598. Scapolla, T. A980), A mixed finite element method for the biharmonic problem, RAIRO Anal. Numer. 14, 55-79. Scholz, R. A976), Approximation von Sattelpunkten mit finiten Elementen, Tagungsband, Math. Schriften 89, 53-66. Scholz, R. A977), L™-convergence of saddle-point approximations for second order problems, RAIRO Anal. Numer. 11, 209-216. Scholz, R. A978), A mixed method for fourth order problems using linear finite elements, RAIRO Anal. Numer. 12, 85-90. Scholz, R. A983), Optimal L„-estimates for a mixed finite element method for second order elliptic and parabolic problems, Calcolo 20, 355-377. Serrin, J. A959), Mathematical principles of classical fluid mechanics, in: S. FlOgge, ed., Handbuch der Physik VIH/1: Stromungs Mechanik I (C. Truesdell, coed.) (Springer, Berlin) 125-260. Shimodaira, H. A985), Equivalence between mixed models and displacement models using reduced integration, Internat. J. Numer. Methods Engrg. 21, 89-104. Spilker, R.L. A982), Invariant 8-node hybrid-stress elements for thin and moderately thick plates, Internat. J. Numer. Methods Engrg. 18, 1153-1178. Spilker, R. L. A983), Hybrid-stress reduced Mindlin isoparametric elements for analysis of thin plates, J. Structural Mech. 11, 49-66. Spilker, R. L. and T. Belytschko, A983), Bilinear Mindlin plate elements in: S.N. Atluri, R.H.
632 J.E. Roberts and J.-M. Thomas Gallagher and O.C. Zienkiewicz, eds., Hybrid and Mixed Finite Element Methods (Wiley, Chichester) 117-136. Spilker, R. L., S.M. Maskeri and E. Kania A981), Plane isoparametric hybrid-stress elements, Internat. J. Numer. Methods Engrg. 17, 1469-1496. Spilker, R. L. and N.I. Munir A980a), The hybrid-stress model for thin plates, Internat, J. Numer. Methods Engrg. 15, 1239-1260. Spilker, R. L. and N.I. Munir A980b), A hybrid-stress quadratic serendipity displacement Mindlin plate bending element, Comput. Structures 12, 11-21. Spilker, R. L. and N.I. Munir A980c), A serendipity cubic-displacement hybrid-stress element for thin and moderately thick plates, Internat. J. Numer. Methods Engrg. IS, 1261-1278. Squeff, M.C. A987), Superconvergence of mixed finite element methods for parabolic equations, RAIRO Model. Math. Anal. Numer. 21, 327-352. Stanley, G.M., K.C. Park and T.J.R. Hughes A986), Treatment of large rotations of thin shells using mixed finite element methods, in: First World Congress on Computational Mechanics, Austin, TX. Stein, E. and R. Ahmad A977), An equilibrium method for stress calculation using finite element displacement models, Comput. Methods Appl. Mech. Engrg. 10, 175-198. Stenberg, R. A986), On the construction of optimal mixed finite element methods for the linear elasticity problem, Numer. Math. 48, 447-462. Stenberg, R. A988a), A family of mixed finite elements for the elasticity problem, Numer. Math. 53, 513-538. Stenberg, R. A988b), Two low-order mixed methods for the elasticity problem, in: J.R. Whiteman, ed., MAFELAP 1987, The Mathematics of Finite Elements and Applications VI (Academic Press, London) 271-280. Stenberg, R. A988c), Postprocessing schemes for some mixed finite elements, Rapport de Recherche 800 INRIA, Rocquencourt. Stolarski, H. and T. Belytschko A986a), On the equivalence of mode decomposition and mixed finite elements based on the Hellinger-Reissner principle, Part I: Theory, Comput. Methods Appl. Mech, Engrg. 58 249-263. Stolarski, H. and T. Belytschko A986b), On the equivalence of mode decomposition and mixed finite elements based on the Hellinger-Reissner principle, Part II: Applications, Comput. Methods Appl. Mech. Engrg. 58, 265-284. Strang, G. A972), Variational crimes in the finite element method, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 689-710. Strang, G. and G.J. Fix A973), An Analysis of the Finite Element Method (Prentice-Hall, Englewood Cliffs, NJ). Stummel, F. A979), The generalized patch test, SIAM J. Numer. Anal. 16, 449-471. Stummel, F. A980a), The limitations of the patch test, Internat. J. Numer. Methods Engrg. 15, 177- 188. Stummel, F. A980b), Basic compactness properties of nonconforming and hybrid finite element spaces, RAIRO Anal. Numer. 14, 81-115. Taylor, R.L., P.J. Beresford and E.L. Wilson A976), A nonconforming element for stress analysis, Internat. J. Numer. Methods Engrg. 10, 1211-1219. Темам, R. A979), Navier-Stokes Equations, Studies in Mathematics and Its Applications 2 (North- Holland, Amsterdam). Thomas, J.-M. A976) Methode des elements finis hybrides duaux pour les problemes du second ordre, RAIRO Anal. Numer. 10, 51-79. Thomas, J.-M. A977), Sur l'analyse numerique des methodes d'elements finis hybrides et mixtes, These d'Etat, Universite Pierre et Marie Curie, Paris. Thomas, J.-M. A987), Mixed finite elements methods for convection-diffusion problems, in: E.L. Ortiz, ed., Numerical Approximation of Partial Differential Equations, Mathematics Studies 133 A989), (North- Holland, Amsterdam) 241-250. Thomas, J.-M. A989), De la convergence des methodes d'elements finis mixtes, Les Annales de I'Enit 3, 5-18.
References 633 ТНОМЁЕ, V. A984), Galerkin Finite Element Methods for Parabolic Problems, Lecture Notes in Mathematics 1054 (Springer, Berlin). Tian, Z.S. and T.H.H. Pian A985), Axisymetric solid elements by a rational hybrid stress method, Comput. Structures 20, 141-149. Tong, P. A969), An assumed stress hybrid finite element method for incompressible materials, Internal. J. Solids Structures 5, 455-461. Tong, P. A982), A family of hybrid plate elements, Internal. J Numer. Methods Engrg. 15, 1771-1812. Tong, P. and T.H.H. Pian, A969), A variational principle and the convergence on a finite element method based on assumed stress distribution, Internat. J. Solids Structures 5, 463-472. Wheeler, M.F and R. Gonzales A984), Mixed finite element methods for petroleum reservoir engineering problems, in: R. Glowinski and J.-L. Lions, eds., Computing Methods in Applied Sciences and Engineering VI (North-Holland, Amsterdam) 639-657. Wilson, E.L., R.L. Taylor, W.P. Doherty and J. Ghaboussi A973), Incompatible displacement models, in: S.J. Fenves et al., eds., Numerical and Computer Methods in Structural Mechanics (Academic Press, New York) 43-57. Wolf, J.P. A972a), Structural averaging of stresses in the hybrid stress model, AIAA J. 10, 843-845. Wolf, J.P. A972b), Generalized hybrid stress finite element models, AIAA J. 11, 386-388. Zhong-ci Shi A984a), On the convergence properties of the quadrilateral elements of Sander and Beckers, Math. Сотр. 42, 493-504. Zhong-CI Shi A984b), A convergence condition for the quadrilateral Wilson element, Numer. Math. 44, 349-361. Zhong-ci Shi A984c), An explicit analysis of the Stummel's patch test examples, Internat. J. Numer. Methods Engrg. 20, 1233-1246. Zielinski, A.P. and O.C. Zienkiewicz A985), Generalized finite element analysis with T-complete boundary solutions functions, Internat. J. Numer. Methods Engrg. 21, 509-528. Zienkiewicz, O.C, S. Qu, R.L. Taylor and S. Nakazawa, A986), The patch test for mixed formulations, Internat. J. Numer. Methods Engrg. 23, 1873-1883. Zienkiewicz, O.C, Li Xi-kui and S. Nakasawa A984), Iterative solution of mixed problems and the stress recovery procedures, Research Rept. 476, Institute for Numerical Methods in Engineering, University College of Swansea, Wales.
List of Some Special Symbols Spaces J?f\ 557 ЛГ\П 557 Dk, 550 •*№ 557 D°, 585 РАП 546 Dk(T), 549, 550, 559 Л(ЭГ), 547 D*(T), 557, 558, 560 Л.1, 559 D°(T), 585 Л.»(П 559 /МЭГ), 556 £<*>, 553, 559 £a№), 557, 558, 560 Ed<rh, 555 Norms and seminorms E(d^h), 601 E0{b£Th), 606 IbI^, 529 Яи(й), 528 N(m>D, 529 НЛП 530 \\v\\m,T, 530 (Я""@))", 529 |||»|||1рГ, 548 Hh(Q), 529 М1Л,605 Я1/2(Г),529 ИМИ 1.^,606 Я-1/2(Л,529 |*U,529 Hm(^„), 547 ц^, 529 H(div; О), 529 !кНнМ, 529 H(div; Г), 550 ll?ll*(div;r„ 530 H0(div;O),532 IMWh^t), 556 Jf(div;fl), 530 Ikll^dhfl,, 530 Jf(div; Г), 549, 551 IMIw, 529 Jf (div; !?h\ 549, 553 |||^|||1/2 sr, 548 Lf\ 546, 547, 559 щ^щ , 548 L ,547 [<A] 1/2,8^,614 L(S^), 601, 602 Мо,з^,617 L\bZTh), 601 ||ju||-1/2,r, 529, 530 -*|(П 551 11ИН-1,2лт, 556 ^(П551 НИИ-i/2,e^, 556 Л%\ЪЪ\ IH<w„,605 635
Subject Index Alternating-direction method, 597 Approximation of normal traces, 555 Arrow-Hurwitz method, 596, 597 Augmented Lagrangian method, 594 Augmented Lagrangian technique, 621 BabuSka, theory of —, 564 BabuSka-Brezzi condition, 569 (cf. also Inf-sup condition) Brezzi, theory of —, 567 Brezzi condition, 569 (cf. also Inf-sup condition) Brezzi-Douglas-Duran-Fortin space, 584 Brezzi-Douglas-Marini space, 584, 599 Commuting diagram property, 551-552 Compatibility hypothesis, 570, 578, 579, 581, 583, 604, 612 Complementary energy principle, 534, 539 Conforming approximation method, 534 Conforming finite element method, 527 Conforming formulation, 535 Conforming variational formulation, 531, 535 Conjugate gradient method, 593 Convex quadrilateral, 558, 562 Curved domain, 562 Curved finite elements, 558, 579, 582 Diffusion-convection problem, 566 Dirichlet boundary condition, 577, 580,601, 603 nonhomogeneous —, 603, 607 — on part of the boundary, 588 Dirichlet problem, 532, 533, 536, 577, 580, 612 nonhomogeneous —, 541 Dirichlet's principle, 535 Discrete inf-sup condition, 570, 571, 576, 578, 581, 582, 585, 587, 588, 604, 607, 610, 612 Discrete LBB condition, 570 (cf. also Discrete inf-sup condition) Displacement method, 527 Domain decomposition methods, 621 Dual formulation, 535 Dual hybrid finite element method, 614 Dual hybrid formulation, 539, 540 Dual hybrid method, 592, 593 Dual mixed approximation, 581 Dual mixed formulation, 536, 591 Dual mixed-hybrid formulation, 610, 621 Dual mixed method, 580, 583,584, 585, 592, 593 Dual mixed variational formulation, 580, 586, 589 Dual problems, 535, 539 Electromagnetism, 620 Equilibrium approximation method, 534 Equilibrium finite element method, 527 Equilibrium formulation, 535, 536, 616, 619 Equilibrium interpolant, 553, 559 Equilibrium interpolation operator, 553 Equilibrium method, 527, 615 Equilibrium relation, 531 Equilibrium variational formulation, 531, 536 Eigenvalue problem, 620 Elasticity problem (cf. Linear elasticity problem) Ellipticity condition, 565 Ellipticity hypothesis, 576 Fluid mechanics, 528, 535, 619 Fourth-order problem, 591, 609 Friedrichs-Poincare inequality, 529 generalization of —, 588, 606 Gauss points, 609 Heat equation, steady state —, 531 637
638 J.E. Roberts and J.-M. Thomas Hellinger-Reissner principle, 527 Homogeneous Dirichlet problem, 532 (cf. also Dirichlet problem) Homogeneous Neumann problem, 532 (cf. also Neumann problem) Hybrid finite element method, 527, 619 Hybrid formulation, 541, 601, 602 (cf. also Dual hybrid formulation, Mixed hybrid formulation) Hybrid method, 527, 528, 619, 621 Hybridization, 615 Hyperbolic problem, 620 Multiprocessing machines, 621 Neumann boundary condition, 533, 611 — on part of the boundary, 588 Neumann problem, 532, 533, 536, 585, 586, 603 Nodal method, 621 Nonconforming finite element method, 607,608, 610,611 Nonconforming problem, 593 Nonconforming solution, 587 Nonhomogeneous Dirichlet problem, 541 Normal trace mapping, 529 Inf-sup condition, 569, 571, 572, 580, 603, 611 (cf. also Discrete inf-sup condition, Uniform discrete inf-sup condition) Interpolation, — of normal traces (cf. Approximation of normal traces) — of traces, 547 — of vector functions, 549 (cf. also Equilibrium interpolant, Equilibrium interpolation operator Lagrangian —, 545, 555, 559 Inverse hypothesis, 546 Kelvin's principle, 535 Langrangian, 527, 528, 537, 539, 540, 541, 594 Lagrangian interpolant, 546, 559 Lagrangian interpolation (cf. Interpolation) Lagrangian multiplier, 527, 528, 536, 537, 540, 541, 563, 564, 567, 599, 602, 620, 621 Lax-Milgram theorem, 565 LBB condition, 569 (cf. also Inf-sup condition) Linear elasticity problem, 527, 531, 609, 619 Mechanisms, 615 Mixed finite element method, 527, 619 Mixed formulation, 541 Mixed-hybrid finite element method, 527 Mixed-hybrid formulation, 599 Mixed-hybrid method, 599 Mixed method, 527, 619, 621 (cf. also Dual mixed method, Primal mixed method) Modal decomposition method, 621 Parabolic problem, 620 Penalization technique, 621 Penalty method, 593 Petroleum reservoir engineering, 620 Petrov-Galerkin approach, 566 Piola transformation, 555 Porous media, 620 Postprocessing technique, 602, 615 Potential energy form, 527 Prima! formulation, 535 Primal hybrid finite element method, 601, 607 Primal hybrid formulation, 540, 541, 601, 602 Primal hybrid method, 592, 593, 607, 610, 621 Primal mixed approximation, 578 Primal mixed formulation, 537, 591 Primal mixed method, 577, 579, 586, 592 Primal mixed variational formulation, 577, 585, 588 Rank condition, 570 Raviart-Thomas-Nedelec space, 582, 585 Raviart-Thomas space, 582, 585 Rectangular finite elements, 558 Reduced numerical integration, 610, 620 Regular family of triangulations (cf. Triangulation) Ritz-Galerkin, theory of —, 621 Robin's boundary condition, 588, 601, 602, 607, 610 Saddle point problem, 527, 528, 537, 538, 540, 541, 594 generalized —, 573 Sobolev space, 530 Spectral method, 621 Stokes problem, 609, 619
Subject Index 639 incompressible —, 528 Stress hybrid finite element model, 614 Structural mechanics, 527, 614 Superconvergence, 591 Trace mapping, 529 Trefftz method, 621 Triangulation, 545 regular family of —s, 546 uniformly regular family of —s, 546 Uniform ellipticity hypothesis, 542, 543 Uniform discrete inf-sup condition, 572, 583 Uniformly regular family of triangulations (cf. Triangulation) Uzawa's algorithm, 595, 596 Uzawa's method, 594 Wilson's brick, 610
Eigenvalue Problems I. BabuSka* Institute for Physical Science and Technology and Department of Mathematics University of Maryland College Park, MD 20742, USA J. Osborn** Department of Mathematics University of Maryland College Park, MD 20742, USA * Partially supported by the Office of Naval Research under contract N00014-85-K-0169 and by the National Science Foundation under grant DMS-85-16191. **Partially supported by the National Science Foundation under grant DMS-84-10324. HANDBOOK OF NUMERICAL ANALYSIS, VOL, II Finite Element Methods (Part 1) Edited by P.G. Ciarlet and J.L. Lions © 1991. Elsevier Science Publishers B.V. (North-Holland)
Contents Chapter I. Introduction and Preliminaries 645 1. Examples of eigenvalue problems 645 1.1. One-dimensional problems 645 1.2. Higher-dimensional problems 653 2. Sobolev spaces 658 3. Variational formulation of eigenvalue problems 663 4. Properties of eigenvalue problems 670 5. A brief overview of the finite element method for eigenvalue approximation 674 5.1. A physical interpretation of the finite element eigenvalue problem E.10) 678 Chapter II. Abstract Spectral Approximation Results 683 6. Survey of spectral theory for compact operators 683 7. Fundamental results on spectral approximation 685 8. Spectral approximation of variationally formulated eigenvalue problems 692 9. An additional result for multiple eigenvalues 700 Chapter III. Applications 715 10. The Ritz method for second-order problems 715 10.1. Vibrations of a free L-shaped panel 715 10.2. Vibration of a membrane 732 10.3. Eigenvalue problems for general second-order elliptic operators 747 11. Approximation by mixed methods 751 11.1. An abstract result 751 11.2. A mixed method for the vibrating membrane 755 11.3. A mixed method for the vibrating plate 761 11.4. A mixed method for a problem in one dimension with rough coefficients 765 12. Methods based on one-parameter families of variational formulations 769 12.1. The least squares method 769 12.2. The penalty method 772 13. Concluding remarks 776 References 779 Subject Index 785 643
Chapter I Introduction and Preliminaries 1. Examples of eigenvalue problems In this section we present several model eigenvalue problems arising in physics and engineering. Specifically, we will discuss briefly some important physical interpretations of eigenvalues and eigenfunctions. Some of the model problems we discuss here will serve as illustrative examples in connection with the approximation methods considered in Chapter III. We will attempt to provide a clear understanding of the fundamental ideas, but will not present a detailed treatment. For a more complete discussion of the material in this section we refer to Courant and Hilbert [1953]. 1.1. One-dimensional problems 1.1.1. The longitudinal vibration of an elastic bar We are interested in studying the small, longitudinal vibrations of a longitudinally loaded, elastically supported, elastic bar with masses attached to its ends. The bar is shown in Fig. 1.1. ELASTIC SUPPORT ELASTIC SUPPORT гЛ/li A4I rVII END MASS LOAD ELASTIC SUPPORT L i Fig. 1.1. Elastic bar. We now derive the governing differential equation and boundary conditions for the problem. First we consider the static problem. Suppose f(x), 0<x<l, represents the external longitudinal bad, with positive/(x) denoting a force directed to the right, 645
646 I. BabuSka and J. Osborn Chapter I u(x), 0<x<l, denotes the displacement of the cross-section of the bar originally at x, with positive u{x) denoting the displacement to the right, so that the position of a point originally at x is x 4- u(x), e(x), 0 < x < I, denotes the strain in the x-direction, i.e., the relative change in the length of the fibers in the bar (s(x) will be positive if it describes extension), <r(x), 0 < x < I, denotes the normal stress in the cross-section at x, i.e., the force per unit area exerted by the portion of the bar to the right of x on the portion to the left of x (<t(x) will be positive if it describes tension), A(x), 0 < x < /, denotes the area of the cross-section at x, E(x), 0 < x < /, denotes the modulus of elasticity of the bar at x, F(x), 0<x<!, denotes the internal force acting on the cross-section at x, i.e., the force exerted by the portion of the bar to the right of x on the portion to the left, with positive F(x) denoting a force directed to the right, p(x), 0<x<l, denotes the load due to the (continuous) elastic support, which is assumed to be of the form p{x) = — c(x)u(x), where c(x) > 0 is the spring constant of the support (the negative sign indicates that the force is directed opposite to the displacement), and m(x), 0 < x < I, denotes the specific mass at x, i.e., the mass per unit volume at x. The strain s(x) and the displacement ц(х) are related by <*)=£(*). This relation is valid for small displacements, i.e., when |e(x) |«1. The relation between stress and strain is described by the constitutive law of the material. We are assuming the linear relation given by Hooke's Law: ст(х) = £(x)e(x). Thus, since F(x) = a(x)A(x), we have F(x) = A(x)E{x)e{x) = A(x)E(x) -Лх). dx Now the equilibrium condition for the bar is ~(x)+f(x) + p(x) = 0, which, with the use of the relations discussed above, can also be written as d / du \ — I А(хЩх)—(х))+с(х)и(х)=Дх), 0<х</. A.1) This is the governing differential equation. We consider the three most important types of boundary conditions:
Section 1 Introduction and preliminaries 647 Dirichlet type: u@) = au u(l)-a2. A.2a) Here the displacements of the end points of the bar are given. Neumann type: -F@)=-(AE^j@) = b1, F(l)=(AE^)(l) = b2. A.2b) Here the forces at the ends of the bar are given. The different signs at 0 and / are used to express the outer normal derivative at the ends of the bar. Newton type: AE^\@) + yM0) = c1, (AE^)(l) + y2u(l) = c2 A.2c). where yi,y2>0. Here y2 is the spring constant of a spring attached to the bar at x — I and — y2u{l) is the force exerted on the right end of the bar by the spring. We are thus specifying the sum of the internal force and the spring force on the right end of the bar. The condition at x = 0 has a similar interpretation. Equation A.1) together with one of A.2a)-(c) determine the displacement u(x) in the static case. We now turn to the dynamic case. We assume the external load depends on the time t and is represented by/(x, t) and suppose a;, bh cv in the boundary conditions depend on t: at = a;(t), b, = bt(t), ct = ct(t), i= 1,2. We further suppose the bar is subject to a damping force represented by R. If и = u(x, t) is the displacement at time t, then from Newton's second law we have —( A(x)E(x) ~ (x, t)) + c{x)u(x, t) —f(x,t)—m(x)A(x)-^j(x,t) — R, 0<x<l, t>0. We next give the boundary conditions in the dynamic case. The Dirichlet conditions are nearly the same as in the static case, while the Neumann and Newton conditions require modification because of the forces exerted on the ends of the bar by the attached masses. Dirichlet type: u{0,t) = al{t\ u{l,t) = a2{t), t^O. A.4a)
648 /. BabuSka and J. Osborn Chapter I Neumann type: -^£^j@,t)=~m1-^@,t) + b1(t), f>0, A.4b) AE—Ul,t)= -m2^(l,t) + b2(t), where m^ and m2 are the masses attached to the left and right ends of the bar, respectively. Newton type: -^£^J@,t) + y1w@,t)=-m1^@,t) + c1(t), %J t^O. A.4c) ^1£— l(/,t) + y2u(U)=-m2-^2-(l,l) + c2(t), We remark that we can impose boundary conditions of different types at the two ends. For example, we. could impose a Newton type condition at 0 and a Dirichlet type at I. Finally in this (dynamic) case we need to impose initial conditions. We specify the initial position and velocity: u(x,0) = xdx), g^(x,0) = x2(x), CKx^l. A.5) Consider now equations A.3), with /=R=0, and one of the conditions A.4a)-(c), with ax =a2 —Ьг = b2—cx —c2 =0. If we seek separated solutions of the form u(x,t) = v(x)w(t), in which the spatial variable x and the temporal variable t are separated, from A.3) we find that d / dv , — I A(x)m{x) — (x) j +c{x)v(x) w{t)= ~m{x)A(x)v(x)in2(t) .dV dt2 or d / do \ d2w ~(A(x)E(x)-(x))+c(xm -^ m{x)A{x)v(x) w(t) 0<x<l, £>0. A.6)
Section 1 Introduction and preliminaries 649 Imposing the boundary conditions A.4a)-(c) onu = vw we find v@)w(t) = 0, v(l)w(t) = 0, t^O; A.7a) dv\ d2w ( dv\ d2w :S@, -^w U- <„ -wt rriiViO) w(t) m2v(l) w(t) dv\ d2w J>0; A.7b) mMO) w(t) dv\ d2w tso. <L7c» m2v(l) w(t) It is immediate that both sides of equation A.6) equal a constant, which we denote by X. We are thus led to seek a number X and a function u(x) # 0 so that d ( dv \ —— Л(х)£(х) — (x) +сШх)=МЫШ4 0<x</. A.8) dx \ dx / From A.7a)-(c) we get boundary conditions for v. Dirichlet type: t<0) = p@ = 0. A.9a) Neumann type: АЕ-^-)@)=^Хт1ф), (AE-^-\(l) = Xm2v(l). A.9b) Newton type: AE^\@) + ylV@) = XmlV@), (AE^)(l) + y2v(l) = bn2v(I). A.9c) The problem of finding X and v(x) ф 0 satisfying A.8) and a boundary condition A.9) of Dirichlet, Neumann, or Newton type is called an eigenvalue problem. X is called an eigenvalue and v(x) a corresponding eigenfunction, or eigenvector, of the problem, and (A, u) is often called an eigenpair. If Я is present in one or both of the boundary conditions, the problem is referred to as a Steklov-type eigenvalue problem. Since in the general theory of eigenvalue problems it is necessary to consider complex eigenfunctions and eigenvalues, we will use complex notation here. We note, however, that for the specific class of problems treated in this section, this generality is not necessary. For the sake of definiteness, let us suppose we have a Newton type boundary
650 1. BabuSka and J. Osborn Chapter I condition at 0 and a Dirichlet type at I, and further assume that m1 = 0. Thus we are considering the initial boundary value problem б / ди\ д2и -■~-\AE-~) + cu=-mA—T, 0<х<1, г>0; A.3') ox\ ox J or AEj£\@,t) + yiu@,t)=0; A.4c') u(/,t) = 0, t^O; A.4a') du u(x,0) = xdx), ^(х,0) = х2(х), 0<x<l. A.5') The corresponding eigenvalue problem is d / du\ —\AE—\+cv = XmAv, 0 < x < I, ax V ax J du\ AE~ 1@) + у гф) = 0, v(l) = 0. A.10) It is known that problems of this type have a sequence of eigenvalues 0<Яг^Я2^---f+oo A.11) and corresponding eigenfunctions v1(x),v2{x),.... A.12) The eigenfunctions satisfy i m(x)A(x)vi{x)vj(x)dx = eij, A-13) where ди is the Kronecker symbol, i.e., they are orthonormal; in addition they are complete in L2, i.e., any function h(x)eL2 can be written as h(x)= £ cjvj(x), A.14) where i тЛ/uJydx A.15) and the convergence is|in the L2 norm. Regarding A.11)—A.15), see D.10)-D.14).
Section 1 Introduction and preliminaries 651 2(t) + XjW(t) = 0, t>0 A.16) Corresponding to each A • we solve At (cf. A.6)), obtaining w(t) = wj(t)=aj sin V^/(t+вД where a} and 0,- are arbitrary. Thus the separated solutions are given by ajVj(x) sin ^/Xjit + 6j), j = 1,2,... . A.17) It is immediate that u(x,t)= Z ayt;/x)sinN//l/f + ^.) A.18) y=i is a solution of A.3'), A.4c'), A.4a') for arbitrary ai and 0,, provided the series converges appropriately. It remains to satisfy the initial conditions A.5'). For this, a,- and 6j must satisfy Ф, 0) = Z fl7sin у/Щ»Ах) = Zi W, у 8и 8f-(x, 0) = Z ay v h cos \/AjeJvAx)=*2 (*)■ From the complete orthonormality of the Vj(x) we see that these two equations uniquely determine a-t and #,-. Thus A.18), with this choice for a,- and 6j, is the unique solution of A.3'), A.4c'), A.4a'), A.5'). The simple motions given in A.17) are called the eigenvibrations of A.3'), A.4c'), A.4a'). All the points x of they'th eigenvibrations vibrate with the same (circular) frequency (defined to be the number of vibrations per 2л seconds) and phase displacement vAyfl,- and the point x vibrates with amplitude proportional to Vj(x). Thus y/Tj is the frequency with which the/th eigen vibration vibrates and Vj(x) gives the basic shape of the eigen vibration. The amplitude factor a,- and 6j are determined by the initial position and velocity of the eigenvibration, whereas Aj and Vj(x) are determined by the physical process itself, as represented by A.3'), A.4c'), and A.4a'). We have seen that any motion of A.3'), A.4c'), A.4a') can be written as a sum or superposition of eigenvibrations. So far we have been dealing with free vibrations, i.e., we have assumed f(x, t) and R in A.3) are zero. Now we briefly consider the case when /V0 and R=0, i.e., the case of forced vibrations. If we write /(*.*)= Z /ЛЩ(х)т(х)А(х), y=i then we easily see that u(x, r) = EJL t aj(t)vj(x) is a solution if a;(f) + V,(t)=.Ш-
652 I. BabuSka and J. Osborn Chapter I If, now, fj(t) = sin yJXjit + 0j), then we see that аДг), and hence u(x, t), will be unbounded as t-*co. This phenomenon is called resonance and / is called a resonant load; the resonant frequencies are ■JXj, j = 1,2,... . The damping term R could be defined in various ways. For example, we could take R to be /i(8u/8l), for a constant /i, which would lead to a term of the form n{du/dt) in equation A.3). Eigenvalue problems similar to A.8) and A.9) or A.10) arise in a number of other situations. We now briefly mention some of them. 1.1.2. The transverse vibration of a string We are interested here in the small, transverse vibration of a homogeneous string that is stretched between two points a distance I apart. Gravity is assumed to be negligible and the particles of the string are assumed to move in a plane. We denote the density of the string by r and the tension by p. We restrict our attention to the case of free vibrations. If the particles of the string are identified with the numbers 0 s% x ^ I and if u(x, t) denotes the vertical displacement of the particle x at time t, then w satisfies -P-^T-^~r~^-' 0<x<l, t>0, d2u(x, t) _ d2u(x,t) ° A.19) u{0,t) = u(l,t)=Q, tjsO, We see that A.19) is a very special case of A.3) and A.4a). The associated eigenvalue problem is -CV(x) = A«(jc), 0<x<l, p@) = »(/) = 0, where C2 = p/r. It is easily seen that the eigenvalues and eigenfunctions of A.20) can be given explicitly; they are Xk = (k2C2n2)/l2 A.21) and vk{x) = y/2flsm(kKx/l), fc=l,2,... . A.22) The entire discussion of the elastic bar—i.e., the discussion of separation of variables, of eigenvalues and eigenfunctions, and of eigenvibrations—applies to this problem. We note that it is possible to find the eigenvalues and eigenfunctions explicitly only in very special situations, roughly, just in the case of eigenvalue problems for differential equations with constant coefficients in one dimension. In general, one must resort to approximation methods. The discussion of such methods is the main topic of this article.
Section 1 Introduction and preliminaries 653 1.1.3. Characterization of the optimal constant in the Poincare inequality The Poincare inequality states that there is a constant С such that [и(х)]2 dx < С [u'(x)Y dx A.23) о о for all functions u(x) having a square integrable first derivative and vanishing at 0 and /. Let us consider the problem of finding the minimal constant C. We are thus interested in u2 dx C= sup -f . A.24) {и:и@)=и(!) = О} (и'J dx Using the elementary methods of the calculus of variations we find that the function и achieving the supremum in A.24) satisfies С I uVdx = uvdx о for all v having square integrable first derivatives and vanishing at 0 and I. By integration by parts we then find -m" = A/C)u, 0<x</, h@)=m(Z) = 0. A.25) Thus 1/C is the lowest eigenvalue of the eigenvalue problem A.25), and the optimal и in A.24) (which achieves equality in A.21)) is an associated eigenfunction. 1.2. Higher-dimensional problems 1.2.1. The vibrating membrane Consider the small, transverse vibration of a thin membrane stretched over a bounded region Q in the plane and fixed along its edges Г — dQ. The vertical displacement u(x, v, t) of the point (x, y) in Q at time t satisfies S2u 92a 92u "ex2 ~ by1 ~ ~ Эг2 ■^=-0-^2 = -^2. (x,y)eQ, t>0, A.26) u(x, y, t) = 0, (x, у) е SO, t $: 0. As with the vibrating elastic bar or the vibrating string, if we seek separated solutions of the form u(x, y, t) = v(x, y)w(t), we are led to the eigenvalue problem of finding л and
654 I. BabuSka and J. Osborn Chapter I v{x,y)¥zO satisfying — Av — lv, (x,y)eQ A.27) v{x,y)=0, {x,y)edQ, and for each eigenpair (A, v) of A.27), to the differential equation d2w ^-2-(t) + Aw(f) = 0, C^O, A.28) for w(t) (cf. A.16)). It is known that A.27) has an infinite sequence of eigenvalues 0<A1<A2<---/l+oo and corresponding eigenfunctions Vi(x,y), v2(x,y),,.. , The eigenfunctions are complete and orthonormal in L2(Q). ajVjix, y) sin 4/Ij(r + 0Д j = 1,2,..., are called eigenvibrations. ^/Xj is the frequency and Vj(x, y) is the shape of the jth eigen vibration. All solutions of A.26) can be obtained as a superposition of eigenvibrations (cf. A.18)). We note that if, instead of fixing the membrane on Г, we allowed it to move freely in the vertical direction, then we should have the Neumann boundary condition ди/дп = 0, where S/Sn denotes the outer normal derivative, instead of the Dirichlet condition и = 0. The approximation of the eigenpairs of a membrane is discussed in Sections 10.2, 11.2 and 12. 1.2.2. The problem of heat conduction Consider the problem of heat conduction in a body occupying a region Q in three-dimensional space. We suppose the temperature distribution throughout Q is known at time zero, the temperature is held at zero on 6fi for all time, and that we want to determine the temperature u(x,y,z,t) at the point (x,y,z)e Q at time t>0. From the fundamental law of heat conduction we know that ~ te {P{x^Z)b~x) ~ Vy {P{X^%) ~ Ы {PiX>y>4 du = -r(x,y,z)—, (x,y,z)eQ, t>0, A.29) u(x,y,z,t) = 0 u(x,y,z)edQ, t^O, u(x, y, z, 0) =f(x, y, z), (x, y, z) e Q, where
Section 1 Introduction and preliminaries 655 f(x, y, z) = the temperature distribution at t = 0, p(x,j;,z)=the thermal conductivity of the material at (x,y,z), r(x,y,z)=density of the material times the specific heat of the material. If we seek separated solutions u(x,y,z,t) = v(x,y,z)w(t) of the differential equation and the boundary conditions in A.29) we are led to the eigenvalue problem ~ Ьх Ы - Ъу {% ~ bz [Pbz = ' (W) ' V 7 V 7 Ч J A.30) v(x,y,z) = 0, (x,y,z)edQ, and for each eigenpair (A,v) of A.30) we are led to the equation w'+lw=0, t>0, A.31) for w(t) (cf. A.16) and A.28)). Problem A.30) has eigenvalues о<я1<л2<---/'°° and eigenfunctions Vi,V2,... satisfying vivjrdxdydz = Sij. Corresponding to each A,-, from A.31) we find w(i)=Wj(t)=ap Xjt. Thus the separated solutions are given by ajVj(x,y,z)e~Xi', ./'=1,2,...., and the solution of A.29) is u(x,y,z,t)= Y, ( \fVjrdxdydzJVj(x,y,z)c~^' A.32) a (cf. A.18)). We note that from A.32) and the positivity of the eigenvalues, one can show that lim,_ x u(x, y, z, t) = 0 and that the rate at which the temperature и decays to zero is largely determined by Ai. 1.2.3. The vibration of an elastic solid The vibration of an elastic solid Q, the three-dimensional generalization of the elastic bar, is governed by the Navier-Lame equations
656 J. BabuSka and J. Osborn Chapter I dfl {X + p)—~ +цАи — дв oy 80 (Х + ц) — + ^Aw- oz -x+pw ~Y+PW d2w (x, y,z)eQ, t> 0, A.33) where u(x, y, z, t), v(x, y, z, t), and w(x, y, z, r) are the х-, у- and z-components of the displacement of the point (x, y, z) e Q at time t,8 = ди/дх + dv/dy + dw/dz, X, Y, and Z are the components of the external force per unit volume acting at (x, y, z), A > 0, and fi > 0 are the Lame elastic constants, and p is the density of the material. As in the case of the bar, boundary conditions of various types may be prescribed. For example, the Dirichlet boundary conditions prescribe the values of u, v and w on Г-dQ. Neumann conditions are more complicated. Let n be the unit outer normal to Г, let nx, ny and nz be the х-, у- and z-components of «, and let Э 8 6 8 ■^-=nx~+nv~+nz-- on ox ' By oz be the outer normal derivative. Then define 9м Гбм Эр 9w Х"=Я0"* + ^Нб^+бГ' + 6х^ Y„=kdny + fi— +Ц dw Z„=X6nz + Ц— +[i on du dv dw ~nx+~ny+^-nz dy dy dy 8m dv dw dzn*+dzn>+Tzn> A.34a) A.34b) A.34c) The Neumann conditions then consist in prescribing X„, Yn and Z„ on the boundary. One can also mix the boundary conditions in various ways, e.g., impose Dirichlet conditions on one part of the boundary and Neumann conditions on the remainder of the boundary or prescribe X„, Y„ and w on Г. The eigenvalue problem associated with A.33) is given by 60 — (A + fAx pAu — сори, дв -(X + p)~ pAv — copv, (x,y,z)eQ. dy 80 -(X + u)- uAw = copw, oz A.35) where we have denoted the eigenvalue parameter by cu (to avoid confusion with the
Section 1 Introduction and preliminaries 657 Lame constants fi and Я), and where u, v, w and в denote functions of x, у and z only, i.e., the separation of variables has been written as u(x,y,z,t) = u(x,y,z)T(t), etc. For boundary conditions we can consider any of those mentioned above. If we consider Dirichlet conditions (u = v = w=0 on Г) we refer to the clamped solid and if we consider Neumann conditions (Xn = Y„ = Z„ = 0 on Г) we refer to the free solid. The approximation of the eigenvalues of the free L-shaped panel (a two- dimensional analogue of the elastic solid) is treated in detail in Section 10.1. 1.2.4. The Steklov eigenvalue problem The Steklov eigenvalues of the differential operator — A + I are those numbers Я such that for some nonzero u, — Att + u=0 in Q, ди/дп = Хи on Г — dQ. Problems of this type, in which the eigenvalue parameter appears in the boundary condition, arise in a number of applications (cf. A.9b) and A.9c)). 1.2.5. The problem of stability of a nonlinear problem Consider the quasilinear parabolic problem 9" 9и „ , ч „ — -Au + u— =0, (x,y)eQ, t>0, at ox u(x,y,t) = (p(x,y), (x,y)edQ, t>0. Suppose й(х,у) is a stationary solution, i.e., suppose 90 — Ай + й— =0, (x,v)eQ, ox й(х, у) = <р(х, у), (x, у) £ oQ. Then we consider a nearby time-dependent solution u{x,y,t) = u(x,y) + w(x,y,t) and ask whether й is a stable stationary solution, i.e., whether lim u(x, y, t) = й(х, у) f-»0O or, equivalently, lim w(x, y, t) — 0. Г-»оо We easily see that w satisfies 9vv — +Lw + Nw = 0, (x,y)eQ, t>0 A.36) w = 0, (x,y)eoQ,
658 I. BabuSka and J. Osborn Chapter I where 3w Эй Lw= — Aw + u-—h — w ox ox and dw ox Conditions ensuring w->0 as t-юо can be given in terms of the eigenvalues of Lw = Aw in Q, w=0 on 80. In fact, if all the eigenvalues of this problem have positive real parts, then й is asymptotically stable in the L2 norm, i.e., there is a constant ё > 0 such that if IN%%0)k2(o><<5, then \\wC,',t)h2(Q)-*0 asf->co. If the term N in A.36) is neglected, then this result is similar to that mentioned at the end of the discussion of heat conduction. Note that L is a nonselfadjoint operator and its eigenvalues will, in general, be complex (cf. Section 3). For further details on this type of stability results see Prodi [1962]. 2. Sobolev spaces The natural setting for a discussion of eigenvalue problems and their approximation is the theory of linear operators on a Hilbert space. In this section we will sketch the definitions and basic properties of the function spaces we will make use of. These are mainly the Sobolev and Besov spaces. Let Q be a bounded open subset of Ш" and denote by x = (xj,..., x„) a point in U". For each integer m^O, the real (complex) Sobolev space Hm{Q) is defined by Hm = ЯИ(Й) = {м: б"и е L2(fi)V|a| <m}, B.1) where L2{&) denotes the usual space of real- (complex-) valued square integrable functions on Q equipped with the inner product («,t>) = («,u)z.2(n) = and norm 111 = 11«к2(П) = uv dx B.2) 1/2 |u|2dx B.3)
Section 2 Introduction and preliminaries 659 On Hm{Q) we have the inner product ((«.4=((mU= I |a««£^dx B.4) and norm ll«llff-(Q)=llMlL = l|wL,o=( Z Я 1/2 |6ам|2 dx . B.5) With this inner product, Hm(Q) is a Hilbert space. Here a = (a1,..., a„), with a; a nonnegative integer, |а| = Е,аь and дхи = дми/дх\г ••• 9x*n. We also have the semi-inner product (u,v)Bn,w=(u,v)m={u,v)mt0= £ d'u&vdx B.6) W=m . Q and seminorm Mh-4d, = I«L = I«L.o = ( £ ||8*u|2dxY/2 B.7) \|«|=m J / It is immediate that H°(Q) = L2(&) and ||«Ho,o = I«Io,q = IMIljW)- If Г=8й is Lipschitz continuous, then Cm(Q) is dense in Hm(Q). (Г is called Lipschitz continuous if it can be locally represented by a Lipschitz continuous function; see NeCas [1967] for further details.) Ho(Q) is defined as the closure in H1{Q) of Cq(Q), the space of infinitely dif- ferentiable functions on Q which vanish near Г. The Poincare inequality, which states that Mo.o*£C|m|i.0 WeHl(Q), B.8) shows that |-|1>й is a norm on Hj(fl). Щ@) is the closure in Hm{Q) of Cg{Q). If Г is Lipschitz continuous, then we can define the space Ь2{Г\ which consists of functions и defined on Г for which ||м||[2(Г) = (jYM2 dsI/2 < oo, where us denotes the surface area. L2(F) is a Hilbert space with inner product (u, v)L (n = §ruvds. It is also known that a function ueHy{Q) has a well-defined restriction to Г, denoted by tr м, in the sense of trace; и = tr и satisfies NlL2(r>^C||«lli.fl ЧиеНЧО) B-9) and Hl{Q) = {u e H1(Q):u = 0 on Г in the sense of trace}. Furthermore, a function и e С'@) is in Яо(Ц) if and only if «==0 for all хеГ. We note that if Г is Lipschitz continuous, then the normal vector и is defined almost
660 /. BabuSka and J. Osbom Chapter I everywhere on Г. The outer normal derivative ди/дп is defined for и е H2(Q) and Я^(Й) = |ыбЯ2(Й):м=^=0опГ We shall occasionally make use of the vector-valued Sobolev spaces Hm(Q) which are defined by Mm(Q) = {(u1(x),..., uk(x)): Uj(x) e Hm{Q), j= 1,..., k} B.10) and 11«11н-(Я) = К llm.fi + ••• + Kllm.fi- BЛ1) In the study of eigenvalue problems, central use will be made of Rellich's theorem (cf. Agmon [1965]), which states that every bounded sequence in Hm(Q) has a subsequence which converges in HJ(Q) if/ < m, provided Q is a bounded open set in W with a Lipschitz continuous boundary. So far we have defined the Sobolev space Hm{Q) only for m an integer. We will sometimes use Hm(Q), for m fractional, and also the Besov spaces, so we now turn to their definition, using the K-method. For и e Hm(Q) and 0 < f < oo set K(u,t)= inf {||i>L,e + r||w|L+i.D}. B-12) v + w—u Then for т<к<т-\-1 define GO MHH^\M^WKa = (kreK(u,t)f^J12 B.13) о and \\и\\ЙНт= sup {reK{u,t)}, B.14) 0<I<oo where 6 = k — m. The space Hk(Q) = {uвHm(Q): \\u\\HHQ)<00} B.15) is the Sobolev space with fractional order к and Hk(Q)={ueHm(Q): ||«||ff4n)<oo} B.16) is a Besov space, the one often denoted by £%,coin order to fix these ideas and to obtain a fact we will use in the sequel (cf. Sections 10.1 and 10.2), we now consider the function u = r' for(r,0)eS = {(r,0):O<r<l,O<0<0o^2rc}, where — 1 <a<0, (r, 9) being polar coordinates, and prove that и е Йа+ 1(S).
Section 2 Introduction and preliminaries 661 Theorem 2.1. For — 1 <a<0, we have u = r"eE1+a(S). Proof. Let cp(x), 0<x<oo, be a function having derivatives of all orders and satisfying fO for 0<x<i </>(x)= 1 for l<x<oo. For 0<<5<1, define d = [1 — <p(r/c5)]u, w = (p{r/8)u. Then we obviously have u = v + w. Now ll»ll|o(S)<0o| r2°+4r=^^<C<52*+2 and w HHS) - \w\2 + dw 8Xi + 8w 8x- dxj dx2 s во 1 lw|2 + 8w ~б7 + r" 6w 80 rdrdfl -w0 H2 + 6w Hr rdr <C i l r2a+1dx + <x2 \r2a-ldr + 5-2 d/2 3/2 r2a+1dr a/2 ^CE2a, with С independent of S. Hence K(ii,tKC[a«+1 + t<5"] and thus r(e + 1)K(MKC[5" + 1r(« + 1) + t1-(e + 1)a"]. If 0<r<l, let <5 = r to get
662 I. BabuSka and J. Osborn Chapter I and hence sup {ria+1)K(u,t)}^2C. 0<t<l If-t^ 1, we obviously have K(u, £)=NIho<g)*SC" and hence sup {r(a+1)K(M,t)}^C. l^K to Therefore iMlfiU+«>(S)= sup {rA+e)X(u, r)}^C"<oo 0<(<oo and hence ueH1+x(S), as was to be proved. In a similar way, one can also prove that feH1+x(S) for a>0, not an integer. Finally we note that г'фН1 +tt(S), but r'eH1 +*-£(S) for any e>0. Q For a complete discussion of the Sobolev and Besov spaces we refer to Adams [1975], NeCas [1967], and Butzer and Berens [1967]. Remark 2.1. The definition of the Sobolev spaces with fractional order m has a very simple interpretation. For и to be in Я1 +X{S) means that for any 0 < t < oo, и can be split into the sum of a smooth function and a nonsmooth function in a natural way. We have employed this natural splitting in the proof of Theorem 2.1. So far we have considered only one special family of Sobolev spaces or Sobolev- type spaces. Several other families are important in various situations. For example, if QcU2 with 0e8O, and if 0</?<l and m^l^l, we can define ЩЩ = {иеН'-Щ: (д"иУ + М-1еЬ2(£2) for f<|a|<m} B.17) and m ll«ll&y@)=Nl!<-@>+ I llCa^+|ai-'ll£2(fi), B.18) where г = (х\ + х\У!г. Spaces of this kind are called weighted Sobolev spaces. For more details we refer to KUFNER [1985]. Consider the function и — r\ with 0 <y < 1. One can show that ueHf\Q), where Q = {(r, в): 0<r< I}, for Д> 1 - у, т^2, and 1 = 2. In fact, since \dxu\^C{ayM, we have |8аи|И'+|а:|'^Сг7+/'~2, and we see that иeH'p'XQ) for m, I and ft as given. We will also have occasion to use countably normed spaces constructed from Sobolev spaces. For example, consider the space ®}(Q) = {ueHf2{Q): ||(8"u)r'"|-2 + ',||t2(DLCd'"la! for |a|>2, with С and d independent of a}. B.19)
Section 3 Introduction and preliminaries 663 It is easy to see that all functions иеЩ{0) are analytic in £2\{0}. The function ry considered above belongs to @)}(Q) for /?>1—y. We have here only considered weights with respect to the origin. More generally, one can consider weights with respect to the vertices of domains with piecewise smooth boundaries. An important reason for introducing these spaces is to characterize the solution (eigenfunctions) of a problem as precisely as possible by embedding it (them) in as small a space as possible. There are other classes of function spaces that are important in various contexts, but we will not go further in this direction. Remark 2.2. We have followed the usual custom of using the same notation for real and complex function spaces. It will be clear from the context which version we are using. See Remark 4.1. 3. Variational formulation of eigenvalue problems In Section 1 the eigenvalue problems were stated in classical form, i.e., we were seeking an eigenvalue к and a corresponding nonzero eigenfunction u(x) such that the eigenvalue equation and boundary condition were satisfied in the classical pointwise sense. These problems can alternately be given as variational formulations. Since finite element approximation methods are most naturally defined in terms of variational formulations we now briefly indicate how eigenvalue problems can be cast in variational form. We will do this by discussing second-order elliptic eigenvalue problems in two dimensions in some detail. We begin by describing this type of problem. Consider the problem: Seek a real or complex number к and a nonzero real- or complex-valued function u(x) satisfying (Lu)(x) = k(Mu)(x), xeQ, (Bu)(x) = 0, xeT = dQ, ( ' ' where Q is a bounded, open, connected set in U2 with Lipschitz continuous boundary Г, and 2 2 Lu(x)=— Y, Э/(ау(х)8(М)+ Е Ь,(х)Э,и + с(х)и, 8,- = 8/cbc;, C.2) where аи(х) = ая(х), Ь((х), and c{x) are given real or complex functions on Q, Mu(x) = d(x)u(x), C.3) where d(x) is a given real function which is bounded below by a positive constant on Q, and 2 (Bu)(x) = u(x) or (Бн)(х)=— £ aijtijdiU, C.4) where n(x)=(n1,n2) is the exterior unit normal to Г = 80 at x. L is assumed to be uniformly strongly elliptic in Q, i.e., there is a positive constant a0 such that
664 I. BabuSka and J. Osborn Chapter I Re £ аи(х)Ь^а0 £ £? i,j=i ; = i Vxe£2 and V(£b£2)e[ C.5) In addition, a^, bh с and d are assumed to be bounded and measurable. (A portion of the theory of eigenvalue problems can be developed under the more general hypothesis that d(x) is merely assumed to be a bounded, measurable, complex function, but we will not pursue this direction.) (X, u) is called an eigenpair of the second-order differential operator L (relative to the zeroth-order differential operator M). If Bu = u, the boundary condition Bu-0 is the Dirichlet condition, and if Bu~ — Y, aijiijdiU-- du = the conormal derivative of u, then Bu — 0 yields the Neumann condition. It is immediate that all of the examples discussed in Section 1—except the Steklov-type eigenvalue problems and the problem of the vibration of an elastic solid—are of the form C.1) or its one- or higher-dimensional analogues. In any case, our discussion of approximation methods will be in terms of an abstract framework that will cover all the examples. Let 2 2 L*v(x)=- £ SiK-Sj»)- ^ Э|(?;с)+ст and 5d 9v* = ~ E ауп<б>--£ F,n.-!J. C.6) C.7) i,j=l i = l L* is called the formal adjoint of L. It is an immediate consequence of the divergence theorem that 2 .__ I \ Гди Luvdx= ]T aijdiuQjV+ Y, bidtuv + cuv jdx + i,j=l i=l 9v yds = uL*vdx+ du _ , I dv , —-vdx— u—— ds ov J ov* а г г for all smooth functions и and v. Hence we have C.8) Luvdx= uL*v dx C.9)
Section 3 Introduction and preliminaries 665 if either и = »=0опГог ди/dv = bv/dv* = 0 on Г. If atj and с are real and b,=0, then L*=L and S/8v = 6/Sv*. In this case we say L, M, В or, more briefly, L is formally selfadjoint, and we have d[ -L\uvdx = du\ -L\vdx C.10) if either и = v = 0 on Г or Su/Sv = 8r/8v = 0 on Г. All of the examples treated in Section 1 are formally selfadjoint except the operator arising in the stability analysis of the nonlinear initial boundary value problem. Now we turn to the derivation of a variational formulation for C.1). Suppose (A, u(x)) satisfies C.1) in the classical sense, i.e., the differential equation and the boundary condition hold pointwise, and consider first the case of the Dirichlet boundary condition: u(x) = 0 for хеГ. Multiplying C.1) by v, integrating over Q, and using C.3) and C.8) we find that Luvdx Xb{u, v) = X\ duv dx = я я 2 2 _ _\ CQu £ aijdiUdjV+ Y, bfiitiv + cuv ) dx + —v i.j=l i = l / J ^V 2 2 \ £ ciijQtudjV+ Y, bfiiuv + cuv Idx i,j=l i = l J = a(u, v) for all v e С 1(Q) that vanish on Г. ds C.11) a{u, v) and b(u, v), as defined in C.11), are bilinear forms (sometimes referred to as sesquilinear forms in the complex case) in u and v. They are clearly defined for и^еС^П) and, in fact, a(u, v) is defined for u, veH1(Q) and b(u, v) for u,veH°(Q) = L2(Q). Furthermore, using the fact that au, bh с and d are bounded, it follows from Schwarz's inequality that a is bounded on H\Q) and b is bounded on H°(Q), i.e., that W«,»KC1||m||1i0||»||1i0 ^u,veH\Ql |Ь(и,»)|<С2||и||о.о||»||о.о V«,»6fl°@). We note one further property of the form a(u, v): 'К1|и||!.о УиеНЩ, if Rec(x)^ia0+ib2/flo for all xeQ, C.12) C.13) Re a{u, и) ^ where b=max |b;(x)|, i=l,2 eol«lf.o>C||ii||?.fl VueHj(f2), if Ь4(х) = 0, i = 1,2, Re c(x) > 0. C.14)
666 /. BabuSka and J. Osborn Chapter I a0 here is the ellipticity constant in C.5); the result follows directly from C.5). Since the eigenfunction и vanishes on r,ueHl{Q). Thus, using C.12), C.13), and the fact that {ve C\Q): v = 0 on Г} is dense in Hl0(Q), it follows from C.11) that the eigenpair (A, u) satisfies ueHh(Q), иФ% a(u,v) = Xb(u,v) VveHh{Q). C.15) is called a variational formulation of C.1). We have shown that if (A, u) is an eigenpair in the classical sense then it is an eigenpair in the variational sense. We now show that the converse is true, provided Г, atj, bt, с and d are sufficiently smooth. Suppose (A, u) satisfies C.15) and suppose in addition Q is a bounded open set with Lipschitz continuous boundary Г and that и е C2(Q). Then from the equation in C.15) and from C.8) we have Г я Luv dx = a(u, v)+ — v ds ov — a{u, v)~Xb{u, v) = A duv dx Vi;e Сх@) that vanish on Г. C.16) Since {veC\Q): v = Q on Г) is dense in L2{0) we see from C.16) that Lu(x) = XMu(x), xeQ. Also, since Г is Lipschitz continuous and ueC2(Q)nHo(Q) we know that u(x)=0 for all x e Г. Thus we see that (Я, и) is an eigenpair in the classical sense. We next present conditions that guarantee that и е C2(Q). From C.15) we see that и is a weak solution of the boundary value (source) problem Lu =/ in Q, и = 0 on Г, where /= A du. Using standard regularity results for elliptic equations we find that и e C\Q) provided Г, atj, bh с and d are sufficiently smooth. In the two-dimensional case we are discussing it is sufficient to assume (i) Г is of class C\ (ii) aij,b^C\Q), (iii) c,deC2(Q). In the general n-dimensional case it is sufficient to assume (i) Г is of class C\ (ii) ау,Ь,еС*-ЧО), (iii) c,deCk~2(Q), where fc=[in] + 3. For these results we refer to Agmon [1965, Theorems 3.9 and 9.8]. Eigenvalue problems on domains with corners arise in many applications but are not covered by the above results because of the requirement that Г be smooth.
Section 3 Introduction and preliminaries 667 Nevertheless, when Q has corners, analogous results in a generalized form involving weighted Sobolev spaces can be proved for problems with smooth coefficients (see Grisvard [1985] and Babusjca and Guo [1987]). Furthermore these results provide information on the behavior of и near the corners that is important in assessing the accuracy of eigenvalue approximations. This matter will be taken up in Section 10. We now briefly outline the extent to which the eigenpair (A, и) of C.15) satisfies C.1) in the classical sense in the case in which Q is a polygon and L— — Л and d{x) = 1. From regularity results for elliptic equations we conclude that MeC°°@\{vertices of Q}). Thus we see that Lu(x) = XMu(x) for all x e Q and u(x) — 0 for x e Г\{vertices of Q}. и fails, however, to be an eigenfunction in the classical sense in that и ф С2 at any vertex of Q with interior angle larger than к. Under the hypothesis sketched above, the classical and variational formulations of C.1) are equivalent. For the remainder of this article, we will take the point of view that our eigenvalue problems are given in variational form. Thus we will consider problems of the form C.15), or problems that are generalizations of the form C.15); see Section 8. Consider now the case of the Neumann boundary condition: 3m(x)/Sv = 0 for хеГ. Suppose (A, u) satisfies C.1) in the classical sense. Then, using C.8) we find Щи, v) = aiu, v) + бн — v as 8v = a(u,v) for alive C\Q), and thus, using the fact that Cl(Q) is dense in H1(Q), we see that (A, u) satisfies ueH\Q), u#0, a(u, v) = Xb(u, v) Vuefl1^)- C.17) C.17) is a variational form for C.1) with the Neumann condition. Now suppose (X, u) satisfies C.17) and assume ueC2(Q). From C.17) and C.8) we obtain Luv dx = a(u, v) + = A 0M _ , ,, , . — v ax = Щи, v) + ov 5« _ . — v as 6v du C.18) 5v vds VveC\Q). duv dx + n г Taking vsCl{Q) which vanish on Г we find that Lu(x) = M(x)u(x) Vxefi. Thus C.18) reduces to
668 /. BabuSka and J. Osborn Chapter I which implies that 8m/6v = Q on Г. Thus we have shown that (X, и) satisfies C.1) in the classical sense. As with the Dirichlet condition, the analysis is valid under appropriate smoothness assumptions on Г, aih Ъь с and d. We will not state these in detail. Note that the Neumann boundary condition is not explicitly stated in C.17). It is, however, implicitly contained in C.17). We refer to the Neumann condition as a natural boundary condition, in contrast to the Dirichlet condition which is referred to as an essential boundary condition, and which is explicitly contained in the variational formulation C.15). The fact that the Neumann condition is natural has important implications for the approximation of eigenvalues; see Remark 10.4. In summary, for C.1) we get one of the following forms: Problem 3.1. Dirichlet boundary condition. Seek 1,иф0 satisfying ueHk(Q), a(u,v) = Xb(u,v) VueHo(f3). Problem 3.2. Neumann boundary condition. Seek 1,иф§ satisfying ueH\Q), a{u,v) = Xb{u,v) ЧьеИ\П). We will sometimes refer to (X, u) as an eigenpair of the form a relative to the form b. Regarding the forms a and b we assume C.12)—C.14) hold. In a similar way, many other problems—including all of the examples discussed in Section 1—can be given variational formulations. This is done for a number of problems in Chapter III. We mention in particular the eigenvalue problems corresponding to the vibration of a free L-shaped panel (a two-dimensional analogue of the elastic solid). Finally we wish to make one further point regarding variational formulations of eigenvalue problems, namely, that a given eigenvalue problem can often be given a variety of different variational formulations and that some of these may lead to more effective finite element methods than others. We illustrate the possibility of various variational formulations by considering the simple model problem — (a(x)u'Y + cu~Xu, 0 < x < 1, C 19) M@) = u(l) = 0. 1 ; This has already been cast into the variational form: Seek Я,и#0 satisfying иеЯ£@,1), C.20) a(u,v) = Xb(u,v) VueHj@, 1),
Section 3 where Introduction and preliminaries 669 a(u, v)= (au' v'+cuv) dx о and b(u, v) — uv dx are bounded bilinear forms in HqxHI. An alternate formulation is Seek А, 0#weL2@, 1) satisfying ax{u, v) = Xb1{u, v) Vi>eH2@, 1)пЯ£@, 1), where l 0 and i C.21) Ьг(и, v) = uv dx are bounded bilinear forms on L2 x [Я2@, 1)пЯ^@, 1)]. Formulations C.20) and C.21) are equivalent in the sense that (X, u) is an eigenpair of one if and only if it is an eigenpair of the other. Another formulation is obtained as follows. If we let a = au', then C.19) can be written as a first-order system of equations, — cr' + cu = hi, u' — a/a = 0, m@) = mA) = 0. System C.22) can then be given the variational formulation, Seek X, (a, u)eL2@, l)xHj@, I) satisfying a2(a, и, ф, v) = Xb2(a, и, ф, v) У(ф, v)eL2@, 1)хЯо@, 1), where l a2(a, и, ф, v)= ) I av' + cuv + u'ij/ J dx C.22) C.23)
670 /. BabuSka and J. Osborn Chapter I and b2(a, и, ф, ,)-J uv dx. a2 and b2 are bounded bilinear forms on L2 xHj. Formulations C.19) and C.22) are equivalent in the sense that if (X, u) is an eigenpair of C.19) and a = au', then (X, (u, <?)) is an eigenpair of C.22), and if (A, (a, u)) is an eigenpair of C.22), then (А, и) is one of C.19) and a — au'. C.22) and C.23) are called mixed formulations of the eigenvalue problem C.19); see Section 11. We can also write C.19) in the form C.24) (cr, u)eL2((. ), 1) x Hh@ А{о;ф) + В(ф, н) = 0 i B(a, vj— cuv dx = , 1), (ff,M)#@,0), V^eL2@, 1), I A — Xuvux VveHo, J ^ 0 0 where A(tr,ilf) = l — dx a and B(g, i))= — or' dx. о In Chapter III we will consider further examples of variational formulations and show how they can be used to define a variety of finite element methods. 4. Properties of eigenvalue problems In this section we discuss the basic properties of eigenvalue problems. As in Section 3 this discussion will be in terms of second-order elliptic eigenvalue problems. We thus consider the problem C.1) in variational form: D.1) Seek A, 0 ф и е Н satisfying a(u, v) = Xb(u, v) V v 6 H, where H = Hh(Q) for Dirichlet boundary conditions and H = Hl(Q) for Neumann conditions. The forms a{ •, •) and b( •, •) are assumed to satisfy \a(u,v)\^C1Mi.aPh.a, Vu,veH, D.2) |b(u,i>)|<C2||ii||ol0No,o, Vu,veH, D.3)
Section 4 Introduction and preliminaries 671 and Re a{u, иJ»a||и||?,e VыeЯ, D.4) where a>0. Sufficient conditions for D.2)-D.4) to hold were given in Section 3; cf. C.12И3.14). For the study of D.1) it is useful to introduce the operator T: H°(Q)->H defined by TfeH, a(Tf,v) = b(f,v) VveH. D.5) T is the solution operator for the boundary value (source) problem Lu = df in Q, Bu = 0 on Г, D.6) i.e., u=Tf solves D.6). Thus T is the inverse of the differential operator L, considered on functions that satisfy the boundary conditions. It follows immediately from D.2)-D.4) and the Riesz representation theorem in the special case in which a(% ■) is an inner product on H or the Lax-Milgram theorem (Lax and Milgram [1954]) in the general case, that D.5) has a unique solution T/ for each feH°(Q) and that 11ГЛ|1.0^(С2/а)||/||о.п V/etf°(fi), D.7) i.e., T: H°{Q)->H is bounded. In Section 2 we noted that Я is compactly embedded in H°(Q), provided Г is Lipschitz continuous (Rellich's theorem). From this fact and D.7) we see that T: H°(Q)^H°(Q) is a compact operator. We can also view T as an operator on Я; we will, in fact, mainly consider T on H. Another application of Rellich's theorem shows that Т:Я-> H is compact. It follows immediately from D.1) and D.5) that (A, u) is an eigenpair of D.1) (or of L) if and only if Tu = (l/X)u, u^O, i.e., if and only if (/i = A_1, и) is an eigenpair of T. Through this correspondence, properties of the eigenvalue problem D.1) can be derived from the spectral theory for compact operators. A complete development of this theory can be found in Dunford and Schwartz [1958, 1963]. We now give a brief sketch of it; a slightly more complete treatment is given in Section 6. We present this theory under the assumption that the space Я is complex. This leads to the simplest general statement of the theory. In the special case in which T is selfadjoint, Я can be taken to be real or complex. We will specialize to the selfadjoint case later. Denote by p(T) the resolvent set of Г, i.e., the set p{T) — {z: zeC = the complex numbers, (z— T)~l exists as a bounded operator on H), and by (х(Г) the spectrum of T, i.e., the set a{T) = C\p(T). o(T) is countable with no nonzero limit points; nonzero numbers in a(T) are eigenvalues; and if zero is in a{T), it may or may not be an eigenvalue. Let 0#/ieff(T). The space N([i—T) of eigenvectors corresponding to /x is finite-dimensional; its dimension is called the (geometric) multiplicity of fi. Now suppose L is formally selfadjoint. Then it follows immediately from their
672 /. BabuSka and J. Osborn Chapter I definitions that a(u, v) and b(u, v) satisfy a(u, v) — a(v, u) Wu,veH, D.8a) b(u,v) = b(v,u) Vu,veH°(Q), D.8b) i.e., a and b are symmetric (or Hermitian) forms. Thus from D.2)-D.4) we see that a{u, v) is an inner product on H that is equivalent to ((и, v))liS2. In a similar way we see that b{u, v) is an inner product on H°(Q) that is equivalent to («, v)QiB (recall that d(x) is bounded above and is bounded below by a positive constant). It follows from D.8) that a(Tu, v) = a(u, Tv) Vu,veH, D.9a) b(Tu, v) = b(u, Tv) \/u,veH°(Q), D.9b) i.e., T, considered as an operator on H, is selfadjoint with respect to a(u, v), and, considered as an operator on H°(Q), is selfadjoint with respect to b(u, v). (We have previously noted in C.10) that b({l/d)Lu, v) = b(u, (l/d)Lv) if u = v = 0 on Г or if 3u/3v = 8»/5v = 0 on Г, provided L is formally selfadjoint.) From the fact that T is selfadjoint on H it follows that the eigenvalues of T are real and the eigenfunctions corresponding to distinct eigenvalues are orthogonal with respect to a(u, v). We noted above that T is compact on H and it follows from D.5) that T is positive definite. Thus Г will have a countably infinite sequence of eigenvalues 0i---^/x2^^i and associated eigenfunctions Mi, U2, ..., which satisfy а(щ, Uj) = Aib(ui, щ) = ёи. It is further known that the eigenfunctions are complete in L2(&), i.e., that M== E CJUJ VueL2(fi), D.10) where Cj = hb{U, U:) — Xj duujux, D.11) and convergence is in either the L2 norm or the norm \\u\\b = yfb(u,u). Equation D.10) converges in the H norm for ueH. Now the spectral properties of D.1) (or of L) can be inferred from these facts by recalling that the eigenvalues of D.1) (or L) are the reciprocals of those of Г and that D.1) and T have the same eigenfunctions. Thus, if L is formally selfadjoint,
Section 4 Introduction and preliminaries 673 then D.1) (or L) has eigenvalues 0<А1^Л2<-/Ч-оо D.12) and corresponding eigenfunctions и,, «2,.. ■ D.13) satisfying a(uh ц) = кМщ, и^ = 5и. D.14) In the sequence Аь A2,..., eigenvalues are repeated according to their (geometric) multiplicity. The properties of eigenvalues and eigenfunctions used in Section 1 in the discussion of separation of variables (cf. A.11)—A.15)) all follow from the properties we have sketched here. Although our discussion has been in terms of second-order elliptic problems, it is immediate that the results hold for any eigenvalue problem in variational form provided the bilinear forms are symmetric and satisfy D.2)-D.4). We will refer to this as the selfadjoint, positive-definite case. In Section 8 this, as well as a more general, class of variationally formulated eigenvalue problems is discussed. Remark 4.1. The eigenvalues of selfadjoint eigenvalue problems are real and the eigenfunctions may be taken to be real. Thus these problems may be formulated in terms of real function spaces. Nonselfadjoint eigenvalue problems, on the other hand, may have complex eigenvalues and complex eigenfunctions, and are formulated in terms of complex spaces. We end this section with a discussion of the regularity of the eigenfunctions of the second-order elliptic operator L. L is not assumed to be formally selfadjoint here. Theorem 4.1. Suppose for /c^2, (i) Г = д£2 is of class Ck, (ii) aij,bieCk-1{Q),and (iii) c,deCk~2(Q). Then all eigenfunctions of L (see C.2)) lie in Hk(Q) and K.IU^CAflluJo.n, ; = U,.... Proof. This result is a direct consequence of standard results on the regularity of solutions of elliptic boundary value problems. In particular, we refer to Agmon [1965, Theorem 9.8]. □ Theorem 4.2. Suppose (i) Г is of class C°°, and (ii) aij^cdeC^Q). Then i^e C°°(G) for j= 1,2,....
674 1. BabuSka and J. Osborn Chapter I Proof. This result follows directly from Theorem 4.1. Q Theorem 4.3. Suppose (i) Г = дО is analytic, and (ii) Oy, ft;, c, d are analytic on Q. Then Uj is analytic on Q for each). Proof. For a proof of this result see Morrey [1966, Section 5.7]. □ In practice most of the domains of interest have piecewise analytic boundaries. Let us mention a result for such domains. Theorem 4.4. Suppose (i) ficl2, (ii) Г is piecewise analytic, and (iii) fly, ft,-, c, d are analytic on Q. Then every eigenfunction of L is analytic in Q\\J(vertices), and moreover, belongs to the space @j{Q), for properly chosen jS. Proof. This theorem follows from the results in BabuSka and Guo [1988a]. □ Remark 4.2. Assume that Lu= — Ли, Q is a polygon, and the boundary conditions are of Dirichlet type. If Q is a convex polygon, then the eigenfunctions и е H2(Q), and if О is a nonconvex polygon, then ueH\Q)nHl(Q), where k-(n/a)+1, with a the maximal interior angle of the vertices of Q, For a comprehensive treatment of regularity results for problems on domains with corners, we refer to Grisvard [1985]. 5. A brief overview of the finite element method for eigenvalue approximation In this section we give a brief overview of the use of finite element methods for approximating eigenvalues and eigenfunctions of differential operators. We will restrict the discussion to a simple model problem in one dimension and its approximation by the simplest type of finite element method. Consider the selfadjoint eigenvalue problem (Lm)(x) = — (a(x)u')' + c(x)u = kd(x)u, M@) = u(l) = 0, where aeCl[Q, /], c,deC°[0, /], and 0<x<l E.1) 0<a0^a(x), O^c(x), 0<do^d(x) for O^x^l
Section 5 Introduction and preliminaries 675 (cf. C.1)—C.4)). As indicated in Section 3, this problem has the variational characterization иeHh@, 1), a(u, v) = Щи, v) VveHh@, /), E.2) where a(u, v) = a(x)u'v' dx о and b(u, v) = duv dx. E.1) (or E.2)) has a sequence of eigenvalues 0<Я1^А2<---/'+оо and corresponding eigenfunctions Ml5 U2, ■ ■ ■ satisfying lt d(x)utUj dx = Sij. о On [0, Q consider an arbitrary mesh Л = {0 = x0 < x1 < ■ ■ ■ < x„ = /}, where n = n(A) is a positive integer, and let Sh = {u: ueC[0, /], u@) = u@=0, и is linear on Ip j=l,...,n}, where hj=Xj—Xj-! and Ij=(xj-1, Xj) iorj= 1,..., n and h = h(A) = ma.Xjhj. Sh is an (n— l)-dimensional subspace of Яо@, /). The pairs (A, u) have been characterized in E.2) as eigenpairs of the bilinear form a(u, v) relative to the form b(u, v) over the space Яо@, /) x Яо@,1). We now consider eigenpairs of a(u, v) relative to b(u, v) over the space ShxSh, i.e., we consider the eigenvalue problem, Seek kh, 0#MheSA satisfying a(uh, v) = khb(uh, v) VueSh, and then view the eigenpairs of E.3) as approximations to those of E.2). (Xh, uh) is called a finite element (Galerkin) approximation to (X, u). A wide variety of finite element methods for eigenvalue problems will be introduced and analyzed in
676 /. BabuSka and J. Osborn Chapter I Chapter III. Here we will outline the general features of these methods by examining the method E.3) as it applies to E.1). Since Sh is finite-dimensional, E.3) is equivalent to a generalized matrix eigenvalue problem. In fact, if фг, ...,ф„-1 is a basis for Sh, then it is easily seen that (kh, uh — l?jZ\z$j) is an eigenpair of E.3) if and only if Az = X„Bz, 2^0, E.4) where z = (zr,..., z„_ i)T and A = (Aij) with Ац = a$j, ф;), B = (By) with By = Ь(^ф^. Problem E.3) (respectively, E.4)) has eigenvalues and corresponding eigenfunctions ulyh,...,un-lth (respectively, Zj,h = (zj,i,h, • • ■ > 2j,„-i,*)T,;'= 1,.. •, n-1), satisfying г о (respectively, 2.uhzlhBZj,i, = <5y). We further note that if we choose as basis functions the usual hat functions determined by then A and В are sparse; in fact, they are tridiagonal. We easily see that the three nonzero diagonals are given by 4u+i = —h; хй(+1 a(x)dx + hi1hi+i (xi+x — x)(x — x,-)c(x) dx, E.5a) A^/if2 a(x) dx + /i1+2! a(x) dx r r + ht2 I (х-хг_02ф) cLx + йГЛ | (xi+1-xJc(x) dx, E.5b) 14_1(= -hT-ihT1 \ a{x)dx + hTJlh'[1 (x-xi)(x-xi_1)c(x) dx; J J E.5c)
Section 5 Introduction and preliminaries 677 Bi,t+i=hi hi+i (x;+1 — x)(x — x1)d(x)dx, E.6a) Bu = hr2 (x-xi^1Jd(x)dx + h7+\ x,-i (xi+1—xJd(x)dx, E.6b) Bt-u^hTMT1 (Xi — x){x~Xi-i)d(x) dx. E.6c) Now we specialize E.1) to the vibrating string problem discussed in Section 1, i.e., we let a(x)=p = the tension of the string, фс) = 0, and d(x) = r = the density of the string. We also suppose the mesh is uniform, i.e., we let xi = iln~1; we then have h = hi = ln~1. It is easily seen from E.5)-E.6) that 2 -1 -1 2 -1 -1 2 -1 A=ph'1 E.7) and 4 1 B=irh 1 4 1 1 4 E.8) x 14/ If the integrals defining the By are approximated by the trapezoid quadrature rule, then instead of the matrix В we would obtain the matrix B = rhl and instead of E.4) we would have Az=IEz- E.9) E.10)
678 I. BabuSka and J. Osborn Chapter I We finally note that the eigenvalues and eigenvectors of E.4) and E.10) can, in this special case, be explicitly found. The eigenvalues of E.4) are given by AjVl = 6/i ~ 2A - cos(M//))B + cosOWOr V " \ ./=1,2,...,n-l, and those of E.10) by Xj,ft = 2/T2(l-cosOW0)P''~\ ; = 1,2,. ...n-1. E.12) The unnormalized eigenvectors of both problems are given by г;,Л = (г7,1,*» ■ • •'2Л"-1./") ' E.13) where Zj,k,h = sm(jnkh/l), j,k = l,2,...,n-l. E.14) The eigenvalues and eigenfunctions of E.1), in this case, are given by Xj=j2i?p/l2r, j = 1,2,3,... E.15) and Uj(x) = y/2/irsin(jnx/l), j=l,2,... . E.16) From E.11) and E.15) we see that and from E.12) and E.15) we see that From E.13), E.14), and E.16) we see that, neglecting the normalizing factors, the eigenvector Zj,h consists of the values of иДх) at x — xx, x2, ■ ■ ■, xB-i- Equation E.17) shows that the eigenvalue error Xjth — X} is 0(hz). Thus the small eigenvalues of E.3) (or of E.4)) approximate the eigenvalues of E.2), but the larger ones do not since Xhh — X} is small only if j2h is small. If, for example,;' ~ n1/2, then;'2/; is of order one and we would not expect kiM — Xj to be small. Thus only a small percentage of the eigenvalues of E.4) are of interest. This observation influences the selection of numerical methods for the extraction of the eigenvalues of E.4). We also note that E.17) and E.18) show that Х,-,*^Я;^ЯЛЙ for h small. It is known that Aj^Xj,h for all h; cf. (8.42). 5.1. A physical interpretation of the finite element eigenvalue problem E.10) We consider here the vibration of a weightless elastic string loaded with several point masses. Suppose we have a weightless elastic string of length / loaded with и—1 particles of mass m at distances In'%, 2ln~1,..., (n— \)ln~1 from one end and fixed
Section 5 Introduction and preliminaries 679 at both ends. Gravity is assumed to be negligible and the particles are assumed to move in a plane. We shall study the small free vibrations of this system of n-1 degrees of freedom. Let p denote the tension in the string and let /i = /n-1. If q^t) denotes the vertical displacement of the ith particle, the particles being numbered from the left (see Fig. 5.1), then the equation of motion for the ith particle is easily seen to be -mq"(t)=-p , i = l,2,...,n-1, E.19) where we assume qo = qn:=0. Fig. 5.1. Elastic string with point masses. If we seek separated solutions of the form <?i@=zi#)> qn-1(t)^z„-lq{t), or, in vector form, q(t) = zq(t), in which the (discrete) spatial variable./' and the temporal variable t are separated, we find that - mztq '(t) =-p — q(t) or fa-j-lZi + Zi+Jlh q"(t) = — for all i and t. mzi q(t) Both members of this equation must equal a constant, which we denote by A. We are thus led to seek (А, гФЩ such that p(-Zi-i+2zi-zi+ 1)/h = Mzh i = l,..., n-1, i.e., to seek eigenpairs (А, г) of the matrix
680 I. BabuSka and J. Osborn Chapter I 2 -1 -1 2 -1 -1 2 -1 ph'1 | ... S I5-20) -1 -1 2 relative to the matrix ml, and, for each eigenvalue Я, solutions to the differential equation ^"@ + ^@ = 0, t>0. E.21) The matrix E.20) is positive-definite. Thus it has и — 1 eigenvalues and corresponding eigenvectors zi,„, • • •, ги- 1,ь which satisfy Zi,„, • •., zB-1,* thus form an orthonormal basis (i.e., are complete) in (n — ^-dimensional space. Corresponding to Xjth, the solutions of E.21) are given by q(t) = qj(t) = aj sin J~t~h{t + 0Д where a;- and 9} are arbitrary. Thus the separated solutions are given by Zj,„o,- sin^/X^r-^), y=l,..., n —1. E.22) As with the vibrating string, it is easily seen that all solutions of E.19) can be written as the superposition of the separated solutions E.22). These simple motions are called the eigenvibrations. The components of the jth eigenvibration all vibrate with some circular frequency ^/Л," an(^ phase displacement -J^j^j, and the components are proportional to the components of Zj,h- Thus ,/X^ is the frequency and Zj,h the shape of the jth eigenvibration. A complete discussion of the vibration of a weightless elastic string loaded with several point masses can be found in Courant and Hilbert [1953] and Synge and Griffith [1959]. We now draw a parallel with the finite element problem E.10). It follows immediately from E.7), E.9), and E.20) that the eigenvalue problem that we obtained, i.e., the problem of finding the eigenpairs of the matrix in E.20) relative to ml, is identical to the eigenvalue problem E.10) provided m = rh = rln~1. We have thus arrived at the following physical interpretation of E.10): Consider the problem of a vibrating string with density r and tension p. Divide the total mass rl of the string into n— 1 particles of mass m = rln~1, which are placed at the points xly...,x„-i, and two particles of mass rlBn)~l, which are placed at зс0 and x„. Then the eigenvalue problem corresponding to this system is identical to the problem E.10)
Section 5 Introduction and preliminaries 681 arrived at by approximating E.2) by the finite element method E.3), and then approximating the matrix В by В via the trapezoid rule. Thus the finite element eigenvalue problem E.10) is the same as the eigenvalue problem that arises when the mass of the string is "lumped" as indicated above. The matrix A in E.7) is called the stiffness matrix and В in E.8) is called the mass matrix. Because of the physical analogy we have noted, В is called the lumped mass matrix and, in contrast, В is sometimes referred to as the consistent mass matrix.
Chapter II Abstract Spectral Approximation Results In this chapter we present the abstract spectral approximation results we will use in the sequel. 6. Survey of spectral theory for compact operators Since the differential operators we consider have compact inverses, our approximation results will be developed for the class of compact operators. We turn now to a survey of the spectral theory for compact operators. A complete development of this theory can be found in Dunford and Schwartz [1963, Section XI.9]. Let A: X-+X be a compact operator on a complex Banach space X with norm II" II x= 11*11- We denote by p(A) the resolvent set of A, i.e., the set p{A) = {z:zeC = the complex numbers, (z — A)'1 exists as a bounded operator on X], and by a{A) the spectrum of A, i.e., the set a(A) = C\p(A). For any zep(A), Rz(A)={z — A)~1 is the resolvent operator. c(A) is countable with no nonzero limit points; nonzero numbers in o{A) are eigenvalues; and if zero is in a(A), it may or may not be an eigenvalue. Let pea(A) be nonzero. There is a smallest integer a, called the ascent of p.—A, such that N((p- Af) = N((p~ Af+1), where N denotes the null space. N({p-AY) is finite-dimensional and m = dim N((p — A)°) is called the algebraic multiplicity of p. The vectors in Ы({р.~А)х) are called generalized eigenvectors of A corresponding to p.. The order of a generalized eigenvector и is the smallest integer j such that ueN{(p—A)J). The generalized eigenvectors of order 1, i.e., the vectors in N(p—A), are, of course, the eigenvectors of A corresponding to p. The geometric multiplicity of p. is equal to dim N(p—A), and is less than or equal to the algebraic multiplicity. The ascent of p. — A is one and the two multiplicities are equal if X is a Hilbert space and A is selfadjoint; in this case the eigenvalues are real. If pis an eigenvalue of A and/ is a corresponding eigenvector, we will often refer to (p,f) as an eigenpair of A. Throughout this section we will consider a compact operator T:X->X and a family of compact operators Th: X-*X, 0 < h s% 1, such that Th-^ T in norm as h\0. 683
684 I. BabuSka and J. Osborn Chapter II Let p be a nonzero eigenvalue of T with algebraic multiplicities m. Let Г be a circle in the complex plane centered at p which lies in p(T) and which encloses no other points of (x(T). The spectral projection associated with T and p is defined by E = E(p)=^ RAT) dz. £ is a projection onto the space of generalized eigenvectors associated with p. and T, i.e., R(E) = N((jx — T)a), where R denotes the range. For h sufficiently small, Гср(Тк) and the spectral projection Eh = Eh{p)=~^Rz{Th)te г exists, Eh converges to E in norm, and dim R(Eh{p}) = dim R(E{p)) = m. Eh is the spectral projection associated with Th and the eigenvalues of Th which lie in Г and is a projection onto the direct sum of the spaces of generalized eigenvectors corresponding to these eigenvalues, i.e., R(Eh)= £ N((p(h)-Th)^% Mh)ett(Th) д(й) inside Г where ад№) is the ascent of p(h) — Th. Thus, counting according to algebraic multiplicities, there are m eigenvalues of Th in Г; we denote these by /лt (ft),..., pm{h). Furthermore, if Г" is another circle centered at p. with an arbitrarily small radius, we see that Pi(h),..., pm(h) are all inside of Г" for ft sufficiently small, i.e., limfc-0 Hj{h) = p. for j = 1,..., m. R(E) and R(Eh) are invariant subspaces for Г and Th, respectively, and TE = ET and ThEh = EhTh, {Rz{Th}: zeT, h small} is bounded. If p is an eigenvalue of T with algebraic multiplicity m, then p. is an eigenvalue with algebraic multiplicity m of the adjoint operator T" on the dual space X'. The ascent of p— V will be a. E' will be the projection operator associated with T" and p; likewise E'h will be the projection operator associated with % and p%{h),..., pm{h). If / 6 X and /' e X', we denote the value of the linear functional /' at / by </, /'>• T here is the Banach adjoint. If X=H is a Hilbert space, we would naturally work with the Hilbert adjoint T*, which acts on H. Then p would be an eigenvalue of Г if and only if p. is an eigenvalue of T*. Given two closed subspaces M and N of X, we define <5(M,AT)= sup distfx, N) and £(M,N) = max(<S(M,N), 5{N,M))- xeM 1*11 = 1 S(M,N) is called the gap between M and N. The gap provides a natural way in which to formulate results on the approximation of generalized eigenvectors. We will need the following: Theorem 6.1. // dim M = dim N < oo, then S(N,M)^8(M,N)ll-3(M,N)y\
Section 7 Abstract spectral approximation 685 For a discussion of this result and the result that S(N,M) = S(M,N) if X=H is a Hilbert space and 8{M, N)<\, we refer to Kato [1958]. 7. Fundamental results on spectral approximation In this section we present estimates which show how the eigenvalues and generalized eigenvectors of T are approximated by those of Th. Estimates for this type of approximation were obtained by VaInikko [1964, 1967, 1970], Bramble and Osborn [1973], and Osborn [1975]; our presentation follows Osborn [1975]. We refer also to Chatelin [1973, 1975, 1981], Grigorieff [1975], Chatelin and Lemordant [1978], Stummel [1977], and to the excellent and comprehensive monograph of Chatelin [1983]. Let ц be a nonzero eigenvalue of Г with algebraic multiplicity m and assume the ascent of \i— T is a. Let /*i(ft),..., цт(Н) be the eigenvalues of Th that converge to ц. Theorem 7.1. There is a constant С independent ofh, such that S(R(E),R(Eh))^C\\(T-Th)\R{E)\\ for small h, where (T— Th)\R(E) denotes the restriction of T— Th to R{E). G.1) Proof. For feR(E) with ||/|| = 1 we have Ц/-ВД = ||(£-£J/K \\E-EJ. Thus, since Eh converges to E in norm, limfc^0<5(.R(£), R(Eh)) — 0. Using Theorem 6.1, with M = R(E) and N=R(Eh), we thus have S(R(Eh), R(E))^S(R(E), R(Eh))[l-5(R(E), R{Eh))Yx ^2d(R(E), R(Eh)) and hence S(R(E), R(Eh)H23{R(E), R(Eh)) for small h. Now for feR(E) we have \\f-EJ\\ = \\Ef-Ehf\\ = 2тп iRz{T)-Rt(Th)\f 6z G.2) 2ra Rz{Th){T-Th)Rz{T)f dz and hence, recalling that R{E) is invariant for T and thus for RZ(T), \\f-Ekf\\ < r^lengthtDsupllJUT,,)!! ||(T- ГА)|К№)|| sup \\RZ(T)\\ ||/||. G.3)
686 /. BabuSka and J. Osborn Chapter II As noted above, supzsr ||Лг(Тл)|| is bounded in h. Thus from G.2) and G.3) we have S(R(E),R(Eh)HQ\(T-Th)\m)\\, where C=-length(r) sup ||Rz(rfc)||sup \\RX(T)\\. □ 0<h Remark 7.1. The proof of Theorem 7.1 also shows that №-Eh)\RwHC\\{T-Tk)\RUS)l Although each of the eigenvalues nl(h),..., fim(h) is close to ц for small h, their arithmetic mean is generally a closer approximation to ц (cf. Bramble and Osborn [1973]). Thus we define -j m Our next theorem gives an estimate for fi—fi(h). Theorem 7.2. Let ф1,...,фтЬе any basis for R(E) and let ф\,...,ф'тЬе the dual basis in jR(jE'), as defined in the proof to follow. Then there is a constant C, independent of h, such that 1 m \V-№\^- I К(Т-Г4)^,^>| + С||(Г-ГЛ)|Я(Е)|| ||(Г-ПIя(£')!1- G-4) m j-1 Proof. For small h, the operator Eh\R{E): R(E)-+R(Eh) is one-to-one since \\E—Eh || ->0 and Ehf = 0, feR(E) implies ||/|| = ||£/-£*/KI|£-£*||||/||, and Eh\R(E) is onto since dim R(Eh) = dim R(E) = m. Thus (Eh\R(Ej)~l: R(Eh)-+R(E) is defined; we write £A-1 for (Eh\R(E))~г. For h sufficiently small and feR(E) with ||/|| = 1 we have l-||£*/|| = ||£/||-||£»/NI|£-£*||<i and hence ||£л/|| >i||/||. This implies ||£»* || is bounded in h. We note that £fc£^1 is the identity on R(Eh) and E^1Eh is the identity on R{E). Now we define %=E^ThEh\m):R{E)^R{E). Using the fact that R(Eh) is invariant for Th we see that <т(Тл)= {pi(h),..., y-m(h)} and that the algebraic (geometric, respectively) multiplicity of any /цДй) as an eigenvalue of th is equal to its algebraic (geometric, respectively) multiplicity as an eigenvalue of Th. Letting f=T\R(E), we likewise see that cr(T) ={//}. Thus
Section 7 Abstract spectral approximation 687 trace Т=тц and trace Th = mp.(h) and, since t and th act on the same space, we can write fi-Kh) = -trace(f-fh). G.5) m Let ф!,.,.,фт be a basis for Я(£) and let ф\,...,ф'т be the dual basis to ф1,...,фт. Then from G.5) we get »-№ = -trace(f- f„) = - £ <(f- f„)<£,, </»}>. G.6) Here each ф'$ is an element of R(E)', the dual space of R(E), but we can extend each ф] to all of X as follows. Since X = R(E)®N{E), any /eI can be written as f = g + h with #еЯ(£) and heN(E). Define <./,Ф'^ = <д,Ф)}. Clearly <£}, so extended, is bounded, i.e., ф]еХ'. Now </,(^-Г )*<£}> = <(/*- Г)"/, ^> vanishes for all /. This follows from the observation that it obviously vanishes for /eR(E)=N((n~ T)x) and it vanishes for feN(E) since N(E) is invariant for li—T. Thus we have shown that ф\,..., ф'т е Я(£'). Using the facts that ThEh=EhTh and Eh~1Eh is the identity on R{E), we have <(f - T„№j, ф)У = <7ty-£Г1 ТкЕкф„ #> = <Е;1Ек(Т~Тк)ф^ф'^ = <(Г- Th№j, ф + <(£л-*£„ -1){Т- Тк)ф}, #>. G.7) Let Lh = E~H~1Eh. Lhis the projection on R(E) along N(Eh). Hence LJ, is the projection on N(E„I=R(E'h) along ВД1 =JV(£'). Thus <.(Ек1Ек-Г)(Т-Тк)ф}, ф'^ = ((Ь„-1)(Т-Тк)ф^ (F-£i)^>. G.8) From G.8), the boundedness of Lh, and Remark 7.1 (applied to T" and {T'h}) we have К(Ек1Ек-1)(Т-Тк)ф^ф'^\ ^(snp\\Lh~I\\\\\(T~Th)\Rm\\\\(E'~E'h)\m)\ Ш\\Ф'Л <С||(Г-ГА)|Я(£)||(Г-Г;)|Д(Г)||. G.9) Finally, G.6), G.7), and G.9) yield the desired result. □ Remark 7.2. Our treatment of the term <(£^ 1Eh-I){T- Ть)ф}, ф'}у, which differs from that in Osborn [1975], was suggested by Descloux, Nassif and Rappaz [1978b]. Remark 7.3. A slight modification of the proof of Theorem 7.2 shows that for any l<£7<n», К{Т—Тк)ф^ 4пУ\ is bounded by C5h, where m <5* = E 1<(Т-ГЛ)^,^>|+||(Г-ГЙ)|Л№)||||(Г-П)|Я(£,||.
688 /. BabuSka and J. Osborn Chapter II Noting that ((Т—Ть)ф^ ф\) is a matrix representation of T—th, we see that \\t-tj^CS„. G.10) Since it is immediate that |p-flfc)| = -|trace(f-?4)|<||?-?J|, m from G.10) we get \ц-№\**Сд„, G.11) an estimate that is similar to, and of equal use in applications as, G.4). We also have = -|trace(f-1-fft-1)| m ^Wt-'WWfb-TWWT^W^CWt-n. Hence we see that /Г1-(£>#)-4/m ^Cd„. G.12) It is also know that |M-M#)r<C||f-fh|| G.13) for any 1 <j ^ m. Hence \ц-фТ^С8к. G.14) Equation G.14) is established directly in Theorem 7.3. We note, however, that the proof of Theorem 7.3 is closely related to one of the ways of proving G.13). Remark 7.4. It follows immediately from G.4) that \fi-m\^c\\{T-Th)\m)i However, the second term on the right-hand side of G.4) is of higher order than W(T-Th)\Rm\\, namely of order !l(T-rjR(£)|||!(r-n)|R(£)|!. We will also see that in a large variety of applications, £"= 1((Т — Ть)фр ф}> is of higher order than ||(Т-Г„)|Я(£)||.
Section 7 Abstract spectral approximation 689 In addition to estimating ii-fi(h) we may estimate ju—ц7-(Л) for each j. Theorem 7.3. Let a be the ascent of \i — T. Let ф1,...,фтЬе any basis for R(E) and let ф\,- ■ -,ф'т be the dual basis. Then there is a constant С such that \fi-Hj(h)\^C\ X Ш-ТЛФиФМ + МТ-ТЛюШГ-П)^ li,k=l 1/a G.15) Proof. For each h, ц}Щ is one of the eigenvalues of th. Let fhwk =^(/i)wft, j| wh || = 1. We can choose w'h e N(Qi —T'f) in such a way that < wh, w'h > = 1 and the norms || wjj are bounded in h. First, using the Hahn-Banach theorem, choose w'h e R(E)' such that <wft,wj,> = l and ||w|,|| = l; then extend w'h to all of X as in the proof of Theorem 7.2. w'h, so extended will be in R{E') and satisfy \\Wh || ^ ||£||. Now, noting that (T'~fi)"w'h =0, we have \fl-Hj(h)\' = |<0i-At#))"w»,wi>| = |<(a*-Ai#))'-(^-T)«)w4,wi>| ( «-1 X (М-ФУГЬ-Т)'-1 -k(nj(h)-T)Wh,wi k = 0 <*£ |^-A£j(/»)|» max K^W-TK^^HI/i-T'ir-1-'1!!^! №1 = 1 G.16) For any ф'еЩ') with ||#'|l = l, \{{^{Щ-Т)щ,ф'У\ = 1<№-7>ь0'>| = |<£ir1£fc(n-r)wfc,^>| = |<(T-TAK,f> + <(£fc-1£ik-/)(T-rlb)wlk,^>| ^|<(Г»-ГК,0'>| + C\\(T-Th)\m)\\ ||(Г-П)|КЮ||. G.17) There is obviously a constant С such that m |((T,-r)wt,f)KC 21 1<№-Л4>ь&>1 G.18) for all wheR(E) and ^eR(E') with ||wj| = ||<£'|| = 1. From G.16)-G.18) we get the desired result. □
690 /. BabuSka and J. Osborn Chapter II Theorem 7.1 shows how the generalized eigenvectors of Гаге approximated by those of Th. Our next result concerns the proximity of certain elements of R{Eh) to certain elements of R(E). It shows, for example, that eigenvectors of Th are close to eigenvectors of T. Theorem 7.4. Let ц(И) be an eigenvalue of Th such that 11тл-»0 M^) = M- Suppose for each h that wh is a unit vector satisfying (fi(h)~Th)kwh=0 for some positive integer k^a. Then, for any integer I with k^l^a, there is a vector uh such that (fx — T)luh = 0 and iuH-wh\\^C\\(T-Th)\m)f-k + »'". G.19) G.20) Proof. Since Ы((ц — ТI) is finite-dimensional, there is a closed subspace M of X such that Х = Щ(ц-ТУ)®М. For уеЩц-ТI), the equation (ji-T)lx = y is uniquely solvable in M. Thus (ju— T)l\M: M-*R((fi—T)') is one-to-one and onto. Hence (fi — T)i\M1:R((fi—T)')->-M exists and, by the closed graph theorem, is bounded. Thus there is a constant С such that li/IKCIK/j-ry/H for all feM. Set uh =Pwk, where P is the projection on N((pt—TI) along M. Then (pi— T)luh = 0 and wh — uh e M, and hence \\щ-и„\\<С\\{ц-ТI(У>н-ик)\\. By Theorem 7.1 there are vectors UheR(E) such that II ws — йл || ^ С ||(Г— Г*)|Я(е) ||. Hence there is a constant C" such that ||[(ju-T)'-(A*-n)']wfc!| £ (ц-Т„У(Т-Тн)(ц-T)'"-'-1-[(w*-*») + **] ^C"\\(T-Th)\m)l Since fcjg/, i-i G.21) ||(/*-T»Lll = 1С) {и-тУит-тнУ-'ч X .)(^-M/i))^w-ny-jw, j = !-l[+l W <С"|^-Мй)Г*+1- Combining G.20)-G.22) we get G.22)
Section 7 Abstract spectral approximation 691 \\Wh-uh\\^C\\(n-T)l(wh-uh)\\ <C||(/*-TLll =c\\L(ii-Ty-(»-Thyiwh+(n-ThyWh\\ ^C[C"\\(T-Th)\R(E)\\ + C"\fi-f4h)rk+1l The result now follows immediately from Theorem 7.3. □ Remark 7.5. If X = H is a Hilbert space, we let T* and T% denote the Hilbert adjoints of Г and Th, respectively. In Theorems 7.2 and 7.3 we would let ф1,..., фт be an orthonormal basis for R(E) and let ф*=Е*ф]. Then ф?,..., ф*еЩ<&- Г*)а) and trace m (f-fA)=X((f-fA)^,4>n j=i where (v)=(v)h denotes the inner product on Я, and with only minor modifications all the results of this section remain valid. We end this section by specializing the results in Theorems 7.1-7.4 to the case where X = H is a Hilbert space and T and Th are selfadjoint. If ц is a nonzero eigenvalue of T, then, as noted above, the ascent txof/i—Tis one and the algebraic and geometric multiplicities of \i are equal. Likewise the eigenvalues Hj(h) of Th which converge to ц have equal algebraic and geometric multiplicities, ц and Hj{h) are, of course, real. Thus, under the present hypotheses, Theorems 7.2 and 7.3 give the estimate \и-ф)\^с\ £ \((T-тм» ФП+\\(Т-Th)\R(E)A, ;'=1,..., m. Now consider Theorem 7.4 in the selfadjoint case. Suppose /|(/г) is an eigenvalue of Th converging to ц. If wh is a unit eigenvector of Th corresponding to //(/г), then it follows immediately from Theorem 7.1 and the definition of 3(R(E), R(Eh)) that there is an eigenvector uh of T corresponding to ц such that \\uh-Wh\\^C\\(T-Th)\R(E)\\. This is Theorem 7.4 in the case a = 1. We further note that one may assume || и J| = 1. From Theorem 7.1 we can also conclude that if и is a unit eigenvector of T corresponding to ц then there is a unit eigenvector wh e R(Eh) of Th such that \\u-WJ^C\\(T-Tb)\RiE)\\. Compare the discussion of the Ritz method near the end of Section 8. Remark 7.6. In the selfadjoint case one may assume the Hilbert space H is real (cf. Remark 4.1). Starting with a real space H we can in the usual way obtain a complex
692 I. BabuSka and J, Osborn Chapter II space by complexifying. Then the contour integrals 1 2тп Rz(T)dz and J-r R,(Th)dz, which are the fundamental tools in the analysis, can be introduced and the results derived. The results will be in the complex context but can immediately be translated to the real context. Remark 7.7. Results for noncompact operators Г which parallel those in this section were proved by Descloux, Nassif, and Rappaz [1978]. See Remark 13.4. 8. Spectral approximation of variationally formulated eigenvalue problems As explained in Section 3, eigenvalue problems can be given variational formulations. For the most part, we will consider eigenvalue problems formulated in this manner. In this section we will first sketch the functional analysis framework for variationally formulated eigenvalue problems and then discuss their approximation. Results of the type presented in this section, specifically Theorems 8.1 and 8.3, were proved by BabuSka and Aziz [1973, Chapter 12] and Fix [1973] for the case of an eigenvalue with multiplicity one; in the general case they were proved by Kolata [1978]. Our treatment is similar to Kolata's. Let Hx and H2 be complex Hilbert spaces with inner products and norms (v)i and || • ||! and (vJ and || • ||2, respectively. Let a(v) be a bilinear (or sesquilinear) form on Я) x й2 satisfying laM^CJulUNb Vh6H„ Vt>eH2, (8.1) inf sup \a{u,v)\ = oc>0, (8.2) MeHi veH2 Nli = l 1Из = 1 and sup \a(u, v)\>0 Vi>eH2 with v #0. (8.3) reHi The Riesz representation theorem and (8.1) imply that there is a bounded linear map A from Hj to H2 such that a(u, v) = (Ли, vJ for all и е Hu v e Я2. The adjoint A' is a bounded map from H2 to Hi satisfying a(u, v) = {u, A'vI for all и е Hlt v e H2. Equations (8.1), (8.2), and (8.3) imply that A is an isomorphism of Hx onto H2. In fact, in the presence of (8.1), equations (8.2) and (8.3) hold if and only if A is an isomorphism, cf. BabuSka [1971] and BabuSka and Aziz [1973, Chapter 5]. Using the fact that A is an isomorphism if and only if A' is an isomorphism we see that in the presence of (8.1), (8.2) and (8.3) hold if and only if inf sup \a(u,v)\ = u>0 (8.4) veH2 ueHi Il»ll2=l Ml = l
Section 8 Abstract spectral approximation 693 and sup|a(u,j;)!>0 VueH2 with иФ0. (8.5) veHi (8.2) and (8.3) (or (8.4) and (8.5)) are called the inf-sup conditions. Suppose i| • ]|i is a second norm on Hx which is compact with respect to || ■ || 1, i.e., every sequence in Ht which is bounded in || • || x has a subsequence with is Cauchy in || • ||i. Let b(u, v) be a bilinear form on Hx x H2 satisfying |b(H,»KC2||u||iN2 VmsHj, veH2. (8.6) We remark that in many applications the form b(u, v) is defined on Wl x W2, where Hi с Wt with a compact imbedding, H2 с W2 with a bounded imbedding, and satisfies \b(u,v)\^C2\\u\\Wi\\v\\W2 Vue^i, ueW2. (8.7) If II' II i = II' IIhv tnen it is immediate that || • ||i is compact with respect to || • || j and that (8.6) holds. It is shown in Babuska [1971] and Babuska and Aziz [1973, Chapter 5] that (8.1)—(8.3) imply there are unique bounded operators T.Hx-^H^ and T*: Н2-*Н2 satisfying a(Tu, v) = b(u, v) УмеЯь Vi;eH2, a(u, T*v) = b(u,v) Уме Hi, Vz;e#2. [6 ' Furthermore l|fi*lli<(C2/a)||M||i VueHi. (8.9) If Uj is a bounded sequence in Я15 then, since || • Hi is compact with respect to || • ||b we know there is a subsequence ujt that is Cauchy in ||*||i. It then follows immediately from (8.9), applied to uh —uJk, that Ти1г is Cauchy, and hence convergent, in Нг. Thus Т.Н^-уНх is compact. It is immediate that a(Tu,v) = a(u, Ttv). The operator T* is related to T*, the usual adjoint of T on Н1г by the transformation Г* ^А'Т^А. Т* and Г„ are compact. A complex number A is called an eigenvalue of the form a relative to the form b if there is a nonzero vector и е Яь called an associated eigenvector, satisfying ф, v) = Щи, v) \/v e H2. (8.10) It is easily seen from (8.8) that (A, u) satisfies (8.10) if and only if XTu=u, i.e., if and only if (Л-1,м) is an eigenpair of the compact operator T. (8.10) is referred to as a variationally posed eigenvalue problem (cf. C.15)). The notions of ascent, generalized eigenvector, and algebraic and geometric multiplicities are defined in terms of T. The generalized eigenvectors of T corresponding to X can, however, be characterized in terms of the forms a(v) and fr(v). «J is a generalized eigenvector of order j > 1 if and only if a(uJ, v) = Щи', v)+Ха(ил~1, v) for all veH2, where uJ~1 is
694 /. BabuSka and J. Osborn Chapter II a generalized eigenvector of order /— 1. Since T% =A' lT*A', it is immediate that сг(Т;)=<7(:г*) and that NW1 -T^j) = A'~l {ЫЦХ'1 -T*)j)}. From this we see that the generalized eigenvectors of Г* have a similar characterization to those of Г, namely, a{u,vj)=Xb{u,vJ) + Xa{u,vj~l) for all ueHx. In particular, (X~i,v) is an eigenpair of Г* if and only if a(u, v) = Xb(u, v) for all и e Hu i.e., (A, v) is an adjoint eigenpair of (8.10). In order to construct approximations to the eigenvalues and eigenvectors of (8.10) we select finite dimensional subspaces Sljh с Ht and S2th <= H2, indexed by a parameter ht that satisfy inf sup \a(u,v)\^P^P(h)>Q (8.11) «65l.li VSS2,h IMl = l IM|2 = 1 and sup \a(u, v)\ > 0 for each v e S2th with v #0. (8.12) We also assume \/ueHu lim/?(fc)_1 inf ||u-z||1=0. (8.13) We note that if dimS1>A =dimS2,A, then (8.12) follows from (8.11). We assume dim SliA = dim S2,h for the remainder of this article. Slth and S2,h are referred to as test and trial spaces, respectively, and, if they consist of piecewise polynomial functions they are called finite element (approximation) spaces. Equation (8.11) is referred to as the discrete inf-sup condition. We then consider eigenpairs of the form a relative to the form b, but now restricted to S1<h x 52,fc, i.e., pairs (Aft, uh), where Xk is a number and 0#мл е Slpft, satisfying a{uh,v) = Xkb(uh,v) 4veS2,h, (8.14) and use Xk and щ as approximations to A and u, respectively. Equation (8.14) is called a variational approximation method or Galerkin method for (8.10) in general and, if Slth and S2,k consist of piecewise polynomial functions, it is called a finite element method. Since N = dim Slifc =dim S2>* < °°,(8-14) is equivalent to a matrix eigenvalue problem. In fact, ^ф1}...,фк and фх,..., \j/K are bases for Slh and S2_h, respectively, then {Xh, ик = И1-^1г]ф^) is an eigenpair of (8.14) if and only if Az = XhBz, (8.15) where z = (zb..., zN)T, А = (Аи), Аи=а(ф^,ф1), В = (Ви), ВУ = Ь(^,^). (Xh, uk) is an eigenpair of (8.14) if and only if (Xk 1, uh) is an eigenpair of the compact operator Th:H1-+Slrh defined by a(Thu,v)=b(u,v) У«еЯ1; veS2jk. (8.16)
Section 8 Abstract spectral approximation 695 The operator Th can be written as Ph T, where Ph is the projection of H% onto Slth denned by a{Phu,v)=a(u,v) VueH1; veS2ih. (8.17) Using the centrafresult in BabuSka [1971] and Babuska and Aziz [1973, Chapter 6], it follows from (8.1H8.3), (8.11), and (8.17) that '-''"■'«О+ЯбЫ"'"" Thus from (8.13) we see that Ph-+I pointwise. Since Г is compact, Th = PhT->T in norm on Hj. Let Я be an eigenvalue of (8.10) with algebraic multiplicity m, by which we mean that Я-1 is an eigenvalue of T with algebraic multiplicity m. Let a = ascent of Я-1 -T. Since rft-»T in norm, m eigenvalues Xi(h),..., lm(h) of (8.14) will converge to Я. The Xj{h) are counted according to the algebraic multiplicities of the fij(h)=Xj{h)~-1 as eigenvalues of Th. Let M = M(X) = {u: и a generalized eigenvector of (8.10) corresponding to Я, \\u\\t = 1}, (8.18) M* =M*(A) = {i>: v a generalized adjoint eigenvector of (8.10) corresponding to Я, \\v\\2 = 1}, (8.19) Mh =М/,(Я) = {и: и in the direct sum of the generalized eigenspace of (8.14) corresponding to the eigenvalues ЯДЙ) that converge to Я, ||и\\ j = 1}, (8.20) and define £A=£ft(A)= sup inf Hii-xld, (8.21) e?=e?W= sup inf ||»-ij||2. (8.22) ceM*(A) 4eS2,h Let М(Я) = К(£) and Mh(X) = R(Eh). We now state and prove four results which correspond to Theorems 7.1-7.4. Let a denote the ascent of Я — T. Theorem 8.1. There is a constant С such that дШ^М^^Ст'1^ (8.23) Theorem 8.2. There is a constant С such that 1 IUh)-1 ^Cp(hrxehst. (8.24) Theorem 8.3. There is a constant С such that
696 I. BabuSka and J. Osborn Chapter II Theorem 8.4. Let k(h) be an eigenvalue 0/(8.14) such that \ixah^o)J{h) = X, Suppose for each h that wh is a unit vector satisfying {X(h)~l — Tfwh =0 for some positive integer k^a. Then, for any integer I with k^l^oc, there is a vector uh such that {Г1-ТIщ = 0апй \\uh-wh\\i^Cmhr1^)il"k+1)lc' (8.26) Proofs. The eigenvalues and generalized eigenvectors of (8.10) and (8.14) have been characterized in terms of the compact operators Tand Th and we know that Th-+T in norm. Thus we can apply the results in Section 7, with X = Hi and T and Th as defined in (8.8) and (8.16), to estimate the eigenvalue and eigenvector errors. Note that M = R{E)n (unit sphere in Ях), where E is the spectral projection associated with T and A-1, and MA (A) = J?(£A)n (unit sphere in HJ where Eh is the spectral projection associated with Th and &]1(h), j = 1,..., m. Consider first the proofs of Theorems 8.1 and 8.4. These results will follow immediately from Theorems 8.1 and 8.4, respectively, if we show that \\{T-Th)\R(E)\\^Cm-Hh. (8.27) From Babuskla [1971] and Babuska and Aziz [1973, Chapter 6] and (8.1)-(8.3), (8.6), (8.8), (8.11) and (8.16) we have ||(Г-Тй)и||1<A+^) inf \\Tu-i\\„ \ P(h)J xeS^H Since M = R(E) is invariant for T, for и е R(E) we obtain inf IITYz-zlU^yrilHull!. XsSi,h (8.27) follows from these two estimates. Now consider the proofs of Theorems 8.2 and 8.3. The left-hand side of (8.24) is bounded by C< £ |((T- Тк)ф}ф^ I + ||(Г- Th)\m) || ||(Г* - Г?)|ад., I U,j=l We now show that this quantity can be bounded by Cp(h)~l Ehe*. For ue Ht with ||m||i = 1 and for v* e R(E*) with \\v* ||i = 1 we have ((T-Th)u,v*h =a{(T-Th)u, A'-'v*) = a{{T~Th)u,A'-4*-n) ^С^КГ-ВДиИ'-1»*-»,^, VneS2,h. We have here used the definition of the operator A, (8.8), and (8.16). Recalling the
Section 8 Abstract spectral approximation 697 A''1 maps R(E*) = N((FJ-T*)X) onto ^((Г1-Т*У) = М*{Х), we get ((Г- Th)u, »*)i <d ||(T- П)«!11 M'" lie?. From this it is immediate that ||(r*-Tfc*Klli= sup ЫТ*-П)^)Л^С1\\Т-Тк\\\\Аг1Ы (8.28) ueHi »u||i = l \fv*eR(E*) with ||u* ffx == 1, ((Г- Т„)фь ФГ\ =((Г- Г„Ж, £*^-)i <СХ ||£*|| ]|(Т-Г4)|ад|| И'"' «е^адГЧе?. (8.29) Now, using (8.27)-(8.29) we get £ |((Г- Гй)^, фГ)г| + ||(T- TA)|R(£)|| ||(Г* -T?)|R(£.,II ^C№~xWt- Thus Theorem 8.2 follows from Theorem 7.2 and Theorem 8.3 from Theorem 7.3. □ Remark 8.1. The proof we have given for (8.24), together with G.12), shows that \X-X{h)\*:CP(h)zhEt, (8.30) where 1 m Ш=- E W- (8.31) We end this section by specializing the results to the Ritz method for selfadjoint, positive-definite problems and then presenting a lower bound for the eigenvalue error. Suppose Ht =H2 =H, || • ||H = || ■ || is a real Hilbert space. Let a(v) be a symmetric bilinear form on Я satisfying (8.1) and a(u,u)^u\\u\\2 Уме Я, (8.32) with a a positive constant. Note that (8.1) and (8.32) imply that a(u, u}112 and ||м|| are equivalent norms; a(u, мI/2 is often called the energy norm of u. Let b be a symmetric bilinear form on ^satisfying (8.7), with W1 = W2 = W=>H compactly, and satisfying b{u,u)>0 УиеЯ, м#0. (8.33) Equation (8.32) implies (8.2) and (8.3) are satisfied. We note that (8.1) and (8.32) show that a{■,•) is equivalent to the given inner product (•,•) = (y)g on Я. We will now take a(•, •) to be the inner product on Я and take || • L = ^/a('->')t0 be the norm. We see that T* = Tm = T. Thus Г is selfadjoint; it is also positive-definite. It is, of course, compact. Let Slih = S2,h = Sh с Я be a family of finite-dimensional spaces satisfying (8.13). In this case the variational approximation method (8.14) is called the Ritz method. Inequalities (8.11), with j?(/i) = a, and (8.12) follow from (8.32).
698 I. BabuSka and J. Osborn Chapter II Under these hypotheses, the problem (8.10) has a countable sequence of eigenvalues 0<Я!^Я2^Я3< •-• f+oo and corresponding eigenvectors U\, U2, И3, . . . , which can be assumed to satisfy a{Ui,Uj) = kjb{ubUj)-5ij (8.34) (cf. Section 4). In the sequence {Xj}, the Xj are repeated according to geometric multiplicity. Furthermore, the k-} can be characterized as various extrema of the Rayleigh quotient R(u)=a(u, u)jb{u, u). We state these characterizations now. Minimum principle: li —minR(u) = R(u1), Xk = min i?(u)=K(ufc), fc = 2,3,.... (8'35) UEtf a(u,«,-) = 0 i= 1 k— 1 Minimum-maximum principle: Xk = min max.R(u)= max R(u), k=l,2,.... (8.36) VkczH ueVk ueVkssp(ui uk) Maximum-minimum principle: Xk = max min R(u) zit...,Zk-ieH ueH = min R(u), к = 1,2,... я(и,и,) = 0 i = l,...,lt-l Likewise (8.14) (with SUk =S2,h=Sh) has a finite sequence of eigenvalues 0<Xuh^X2ih^ ■■■ <Ял,ь, JV = dimS,„ and corresponding eigenvectors Mi,л» • ■ • > ujv,»i> which can be taken to satisfy Ф;,ь ujih)-=Xlhb(uirh, ulh) = dtj. (8.38) For the Ям we also have extremal characterizations.
Section 8 Abstract spectral approximation 699 Minimum principle: Alih = mmR(u)=R(ulth), AM = min R(u) = R(uKh), k = 2,...,N. ueSh o(u,ui,h) = 0 J=l „..,*-! (8.39) (8.41) Minimum-maximum principle: kKh= min тахЯ(ы)= max R(u), k = l,2,..., N. (8.40) Pk.hsSh ueVk.h «efkAssp(ai,h,...,Uk.F,) dim Vk,h = k Maximum-minimum principle: Ak,h — max min R(u) z i .h, • • • ,Zk - i ,h£H ueSh fl(«,z(.h) = 0 i=l 4-1 = min R(u), k=l,...,N. ueSh a(u,ui.h) = 0 i= 1 Л —1 It follows directly from the minimum and the minimum-maximum principles that 4<4,„, k=l,2,...,N = dimS„. (8.42) For a comprehensive treatment of such extremal characterizations of eigenvalues and their applications we refer to Courant and Hilbert [1953], Weinstein and Stenger [1972], and Weinberger [1974]. If Ak has geometric multiplicity q, i.e., if 2.k = lk+1 = ••• = At+e_1, then AM,..., 4+1-1,* 14> and combining (8.40) with Theorems 8.2 and 8.3 we see that Ak<hh^*k + Ceh(lk)> j=k,...,k+q-l. (8.43) (Recall that the ascent a of A* * — Г is one.) Regarding the approximation of eigenvectors, from Theorems 8.1 and 8.4 we see that if wh =ujth,j=k,..., k + q — l, then there is a unit eigenvector u = uh of (8.10) corresponding to Ak such that Ци-Will^Ce», (8.44) and if и is a unit eigenvector of (8.10) corresponding to At, then there is a unit vector wh in sp(wM,..., uk+q-1>h) such that llM-wJI^Ce». (8.45) If Ak is simple, i.e., its geometric multiplicity is one, we have Ци*-Им111<Сбл. (8.46) To be more precise, if ux, u2,. ■ . satisfy (8.34), then ulth, u2,k,..., uNth can be chosen so that (8.38) and (8.46) hold. Regarding these applications of Theorems 8.1-8.4, see the discussion of the selfadjoint case at the end of Section 7. We now state some refinements of (8.43)-(8.46) due to Chatelin. Consider Ab which we suppose has multiplicity of q. Let Е = Е(Як х) be the orthogonal projection
700 I. BabuSka and J. Osborn Chapter II of Я onto span{Mk,..., м4+,_ j} and let Eh = Eh{kk~l) be the orthogonal projection of H onto span{HM,..., uk+q-lih}. (These are the spectral projections introduced in Section 6.) Chatelin [1975, 1983] showed that \\и-Ени\\1=г^) inf ||н-х||в, Vm e span{ufc,..., %+,_:}, (8.47a) XeSh \\ujth-Eujih\\^r^M\\Euhh-zla, j=k,...,lc + q-l, (8.47b) (Ял„ -At)/A» = tf > inf ||Eu;,4 -z||a2, j = /c,..., k + q-1, (8.47c) xeSh where rj," -> 1 as /i->0, for I = a, b, с Regarding rj,", Babuska and Osborn [1988] have shown that |/f-l|*£di/2№. /=a,b,c, where ф)= sup inf ||Ти-xL- b(u,u)=l xeSh Note that (8.47c) provides a lower, as well as an upper, bound for the eigenvalue error. For example, if kk is simple, we have Cj £ft ^ Як>л — Як < C2 sh, where Ct and C2 are positive constants, showing that the eigenvalue error is of the same order as e2. If Як is a multiple eigenvalue and k(h) is any approximate eigenvalue with Я(/г)Ц, then one has Щ-1>С inf inf ||и-x II?. (8.48) ueAf(Ak) xeSh Kolata [1978] also proved this type of lower bound for the eigenvalue error. Pierce and Varga [1972] established improved estimates for the eigenvector error in the norm || * ||i=,/b(v). BabuSka and Osborn [1989] have also proved such results, as well as refinements for the case of multiple eigenvalues. 9. An additional result for multiple eigenvalues In the previous section we presented error estimates for variationally formulated problems and at the end of the section we specialized these results to the Ritz method for selfadjoint, positive-definite problems. Then we stated some refinements of these results due to Chatelin. In this section we present a result of Babuska and Osborn [1987] for multiple eigenvalues. As at the end of Section 8, we assume that a( •, •) is a symmetric bilinear form on H satisfying (8.1) and (8.32), that &(•,•) is a symmetric bilinear form on Wsatisfying
Section 9 Abstract spectral approximation 701 (87), with W1 = W2 = W^H compactly, and (8.33). We take a{- •) and ||«||e = yja(-,') to be the inner product and norm on H and set || • ||b = v/b(v). Then, as stated in Section 8, the eigenvalue problem (8.10), i.e., the problem ueH, ифО ,9_ a(u, v) = Щи, v) Vf e Я has a countable sequence of eigenvalues 0<Я!^А2< ••■ /Чао and corresponding eigenvectors "i>,..•, which can be chosen to satisfy а{щ, щ) = Щщ, uj) = 8Ц. (9.2) Furthermore, any ueH can be written as GO u= Yj aiui with а^=а(и,м^), (9.3) with convergence in || • ||a (cf. D.10) and D.11)). We assume ShcH is a family of finite-dimensional spaces satisfying (8.13) with f}{h) = u,. The eigenvalue problem (8.14) with S1:h=S2,h = Sh, i.e., the problem uheSh, иьФ0 (94) a{uh,v) = Xhb{uh,v) \/veSh has a finite sequence of eigenvalues 0</lliA<A2,fc^ ••• sSVb iV = dimSb, and corresponding eigenvectors U1,A> • • • > MW,ft> which can be chosen to satisfy а(Щ,к, ujih) = Xlhb(uUh, ujih) = 5u. (9.5) The Xk and Я4,л satisfy the extremal principles stated in Section 8. Our analysis makes use of the following lemma that expresses a fundamental property of eigenvalue and eigenvector approximation. Lemma 9.1. Suppose (A, u) is an eigenpair of(9.1) with \\u\\b = 1, suppose w is any vector in H with \\w\\b = 1, and let X=a(w, w). Then A*-A= ||w-M||a2-A||w-M||fc2. (9.6) (Note that we have assumed и and w are normalized with respect to || • ||6 here, whereas in (9.2) and (9.5) we assumed ut and щл are normalized with respect to |j • ||a.)
702 I. BabuSka and J. Osborn Chapter II Proof. By an easy calculation, ||w--u||;J-A||w-k||j| = \\w\\2a-2a(w, u)+\\u\\2a-M\w\\2b + 2Щм>, и)-к\\и\\1 Then, since IM»=ll«llb = l, IN|a2=X, ||и||2=А, and a(w, и) = Щуп, и), we get the desired result. □ For i= 1,2,... suppose kk. is an eigenvalue of (9.1) with multiplicity q{, i.e., suppose Лк, - 1 < Akt = Ak. + i = ••• =Ak. + q.-i <Ak. + q. =Ajt( + 1. Here fcx = 1, k2 is the lowest index of the second distinct eigenvalue, k3 is the lowest index of the third distinct eigenvalue, etc. Let eh(i,j)= inf inf Ци-xL, j=l,...,qh (9.7) Ф, "*,.*)= ■■■ = a(u,uk. +J--2,*)=0, where M(Xk.) is defined in (8.18). The restrictions a{u, ик.чк)=--- = а(и, ukj+j-2,h) — Q are considered vacuous if _/= 1. Note that eh(i,j)^Eh(Ai), where £Л(Я,) is defined in (8.21). We now estimate the eigenvalue and eigenvector errors for the Galerkin (Ritz) method (9.4) in terms of the approximability quantities eh(i,j). Theorem 9.1. There are constants С and h0 such that V0</2SC/zo, j=l,...,qh 2 = 1,2,..., and such that the eigenvectors ult u2, ■.. of (9.1) can be chosen so that llUfc,'+j- 1,й —"/Cj+j- 1 \\a^C£h{hj)> mQ) V0</isSfco, j=U---,9t, i=l,2,..., and so that (9.2) holds. Remark. 9.1. Equations (9.8) and (9.9) should be compared and contrasted with estimates (8.47a)-(c) of Chatelin [1975, 1983] and (8.48) of Kolata [1978]. Proof of Theorem 9.1. Overview of the proof. The complete details of the proof, which proceeds by induction, are given below. Here we provide an overview. In
Section 9 Abstract spectral approximation 703 Step A we give the proof for i= 1. The proof is very simple in this case and rests entirely on the minimum principle (8.39) and Lemma 9.1. The central part of the proof is given in Step B. There we prove the theorem for i=2, proving first the eigenvalue estimate (9.8) and then the eigenvector estimate (9.9). In particular, in Steps B.l and B.2, estimates (9.8) and (9.9), respectively, are proved for; = 1. We further note that the argument used in Step В proves the main inductive step in our proof, yielding the result for i = г +1 on the assumption that it is true for i^i. To be somewhat more specific, the argument in Step B.l proves (9.8) directly for any i > 2 (and j = 1) and that in Step B.2 proves (9.9) for i=i +1 (and; = 1) under the assumption that ||u/,*—M/L-+0 as h-+0 for l^ki+l — 1 (cf. (9.30)). Details of the proof. Throughout the proof we use the fact that eh(i,j) can also be expressed as <*(U)= inf , mf ||u-zL (9.7) a(u,Uki,h)- ••■=a(u,"bi + j-2,Ji) = 0 a(z,Uki,h) = — =a(x,"kj + j-2,h> = 0 Step A. Here we prove the theorem for i= 1. Step А.1. Suppose Akl (fcj = 1) is an eigenvalue of (9.1) with multiplicity qu i.e., suppose Я1 = А2 = "- = А41<Ав1 + 1. In this step we estimate Aljh —Ab the error between Л-! and the approximate eigenvalue among Xlih,...,Xquh that is closest to Al9 i.e., we prove (9.8) for i=/= 1. Note that £„A,1)= inf inf|ju-x||0 ueMUO xeSh is the error in the approximation by elements of Sk of the most easily approximated eigenvector associated with kv From the definition of eft(l, 1) we see that there is a uheM(l1) and an sheSh such that \\uh-sh\\a = Bh(l,l). (9.10) Let у/Шн, Щ) y/b(sh, Sh) By the minimum principle (8.39) we have Now apply Lemma 9.1 with (Я, м) = (А1;5А), w = sh, and l=a{sh,sh). We obtain <l|s,-aAL2<C||s,-u„L2. (9.12) Equations (9Л0)—(9Л2) yield the desired result. Step A.2. In this step we prove (9.9) for i—j = 1. Let u%,u2,... be eigenvectors of
704 /. BabuSka and J. Osborn Chapter II (9.1) satisfying (9.2). Write CO (cf. (9.3)). From (9.5) and (9.11)-(9.13) we have At ^«i+1/j = 4i + 1 L дач l да2и- ./ = «1 + 1 A, Еда21- 'я, Hence = |a(u1,*,ulih)-A1b(uliA,w1,ft) — (Ai,* — Ai)Altj, <CfiJ(l,l). 1/2 , 4-1/2 <c|i-^- e„(U). *«i + l/ Redefining Mj to be (9.13) (9.14) we easily see that ||н1||я= 1, so that (9.2) still holds, and from (9.14) we obtain ll«i.»-«iL^Ce*(l,lX (9-15) as desired. Note that ux may depend on h. Step A.3. Suppose qx >2. From (9.7') we see that ейA,2)= inf inf Цм-zl!.. ueM(Ai) XeSh (9.16) Choose инеМ{1г) with a(uh,ulh) = 0 and sheSh with a(sh,uijh)-0 so that l|M*-s*IL = e*(l,2), (9.17) and let
Section 9 Abstract spectral approximation 705 Since a(sh, wliA) = 0, from the minimum principle (8.39), Lemma 9.1, and (9.17), we have Я2,А-Я2^||5,-йА||а2<Сел2A,2). (9.18) This is (9.8) for i = 1 and j=2. Step A.4. In Step A.2 we redefined ut. Now redefine u2,. ■., uq> so that и1г...,и9 are a-orthogonal. Write 00 Now, proceeding as in Step A.2 and using (9.18), we have '«i + i/ j=«i + i I Ц?)J^ £(«<f>)^l-^ = \a(u2:k,u2th)-l2b(u2,h,U2,h) <Ce2(l,2). Thus M2,A- Ёа</4 7=1 <Ce„A,2). (9.19) But by (9.15), fl(i2| = aKb«i) = a(,*,"i-«i,fc) <ll«2,*ll«ll"l-Ml,*L <Ce*(l,l) ^Ce„(l,2). (9.20) Combining (9.19) and (9.20) we get u2,h- Efl</4 J' = 2 < + !|аA2)м1||а <Ce*(l,2). Redefining u2 to be I afu, J = 2 . «S% J' = 2
706 /. BabuSka and J. Osborn Chapter II we see that ||u2L = l and a(u1,u2) = 0, so that (9.2) holds and \\u2J,-u3\\a*ZCBh(l,2), (9-21) which is (9.9) for i—l,j = 2. Step A.5. Continuing in the above manner we obtain the proof of (9.8) and (9.9) for i = l and j'=l,,. .,qx. Step B. Here we prove Theorem 9.1 for i=2. Step B.l. Suppose Xkl {k2 — qi +1) is an eigenvalue of (9.1) of multiplicity q2. In this step we estimate Л*2,л — Хкг, the error between Xk2 and the approximate eigenvalue among Xk2th,... ,Xk2+q2~ljA that is closest to Xkl. Note that £„B,1)= inf inf||«-*L. (9.22) usM(Xk2) xeSh Introduce next the operators T,Th:H^H defined in (8.8) and (8.16), respectively, i.e., the operators defined by TfeH, a(Tf,v) = b(f,v) VveH and TJeSh, a(Thf,v) = b(f,v) \fveSh. It follows from (8.1), (8.7), (8.32), and (8.33) that Г and Th are defined and compact on H. Furthermore \\(T-Th)f\\a^Cmf\\Tf-xh- (9.23) We now suppose the space H and the bilinear forms a and b have been complexified in the usual manner. Let Г be a circle in the complex plane centered at №2 = Ak2\ with sufficiently small radius. Then for h sufficiently small, Гср(Тк) and Int(r)no<rfc) = {^2ih,...,^2+e2_1,fc}, where nk2+i = Xk2\i. Also, as we have seen in Section 6, Е(цк2), the spectral projections associated with T and /xk2, and EhiHk2\ the spectral projection associated with Th and ^*2+l-ifo f = 0,.. .,q2 — 1, respectively, can be written as E{fik2)-- 1 2ni (z-T)-ldz (9.24) and Elliik2) = ^\(z-Thr1dz. (9.25) i Let и6R{E(/j.k2)). Then vh = Ен{цк2)ие R(Eh(nkJ), and from the formulas (9.24) and (9.25) we obtain
Section 9 Abstract spectral approximation \\u-vJB=\\(E(nk2)-Eh(fik2Ma 1 — |B-ГкГ1(Г-Г,к)B-Г)-1«<к 707 — 2m (z-Thr\T-Th) -ft2 •dz <^- [2ti rad(D] sup||(z- г,)1|—|—||(Г- 7>||e Z7t z<=r rad(i) =(^2+92-1,,-^2+rad(r))-1|lG,-7;)H||e <С||(Г-Г>Ив. Equations (9.23) and (9.26) yield ||«-«Je<Cinf||r«-Z||e xeSh (9.26) = Cinf||^2u-Z||a<Cinf||u-z||a. (9.27) This is an eigenvector estimate; it shows that starting from any ueR(E((ik2j) we can construct a vh = vh(u)€R(Eh(fik2)) that is close to u. We now use (9.27) to prove the desired eigenvalue estimate. By the minimum principle (8.39) we have K2,h-K2= inf Ф,»)-Лк2. veSh 1Иъ = 1 a(v.ui:h) = 0, i=l fc2-l (9.28) Since vheR(Eh(nkJ), we know that a(vh, uih)=0, i = l,...,k2 — 1. Thus, from (9.28) we find *k2,h-*k2<a "All* I!"A lib. ~^k2- Combining this with Lemma 9.1 and (9.27) we obtain К2,ь~К2^ Ьн\\ъ Hull, Кк2 uh\\b ^C||i;A-«L2^Cinf||u-xL2 X*Sh for и e R(E(jikl)) with ||u||a = 1. Hence, using (9.22), K2,h~k2^C inf in ИбМ(Ак2) xeS which is (9.8) for г=2, j = 1. h2,H~h2^C inf inf||u-~x||2 = C£2B,l). иеМ(Ак2) xeSh (9.29)
708 /. BabuSka and J. Osborn Chapter II Comment on inequality (9.29). A careful examination of the proof of (9.29) shows that С depends only on цк1 ~~/ik2 and цкг —ft3, but is independent of h, and that (9.29) is valid for h^h0, where h0 is such that h^h0 implies r<zp(Th), Int(r)ncr(TA) = {/it2fA)..., nk2+t2-i,h}, and цк2 -nk2+q2-i,h is small, say цкг -[ikl+q2-lih ^irad(F). Note that if we were considering a family of problems depending on a parameter т, we could bound C — C(x) above, independent of т, provided й,(т)-^;(т) and fikrt(z)—ркз(г) were bounded away from 0, and we could bound h0{x) away from 0 if Г(-г)ср(Тк(т)), М(Г(т))г>о-(Г,,(т)) = {ixk2,M..., nkl+qi-Uh{x)}, and ^(^-^^^-^(тХ^гааСДт)), uniformly in т. Step B.2. Suppose, as in Step B.l, that Xki has multiplicity q2. We have shown in Step A.5 that we can choose the eigenvectors ux, u2, ■ ■ ■ of (9.1) so that (9.2) holds and so that \\uj,h~Uj\\a^CE„(lj), j=l,...,q1^k2-l. (9.30) Write CO 42,h=I42W (9-31) j=i From (9.31) we have 2, \ = 1Ф*2,*, Щ2,к)-^кгЬ(ик21н, uk2th)\ = (^k2,b ~^k2)^k1,h> I [«Г']2 A-7: which, together with (9.29), yields ^Ce2B,l). (9.32) д1 [«na (^^k J+e2 [a)]2 (!-^) Note that the first term inside the absolute value is negative and the second is positive. In addition Ci<|l-li2/A,KC2 yj^k2,k2+l,...,k2+q2-l, with C1,C2 positive numbers. Hence from (9.32) we obtain "J] laf»V^Dl£2hB,l) + D2 £ [af>]2 (9.33) J'=l j = k2+q2 and £ [a^>]2^D3e|B>l) + D4kS1[^)]2- (9-34) Write
Section 9 Abstract spectral approximation Чн~Щ= Y,hiup l'=l,...,fc2-1=tfi- Then, by (9.30), 00 I bfj= \\uUh-Ui\\2a^Ce2h(l, 0, »=1,..., k2-1. Next we wish to find constants ax,..., ai2-1 so that a( u„ £ а;и;>л j = af2\ i = 1,..., k2 -1. Using (9.35), these equations can be written as / /12-1/ =0 \\ (C2-1 i=l,..., fc2-l. 709 (9.35) (9.36) (9.37) (9.38) Since (8.13) implies e2B, l)->0 as h~>0, from (9.36) we see that the bhi are small for h^h0, with /i0 sufficiently small, and hence the system (9.38) is uniquely solvable, and, moreover, there is a constant L, depending only on fc2, such that /кг-1 \l/2 СИ<L ki-1 I (a?*J Lj=i 1/2 (9.39) Now, from (9.30) we obtain Wf2)\ = \a(uk2th, uj)\ = \a(uk2th, Uj-uM)\ <IIm*2,*ILK—M/,*L = 11«у-«л*И« <C£A(ij'), j=l,..., /c2-l. Letting ft22(*)= I e*2(lJ). we see that "t2-i №2) \2 j=l 1/2 ^cPkM and thus, from (9.39) 'кг-1 \l/2 ki-1 \1 ;=i / <LCpkM<Cpt2(h). (9.40) (9.41) (9.42)
710 Now let I. BabuSka and J. Osborn (£2-1 Then \J/eSh. Furthermore, from (9.35) and (9.37) we get 0, i^k2 — 1 «^H^-Y'A, i>K Chapter II (9.43) (9.44) 7=1 From (9.42) and (9.43), IWL-i| = IWL-IK.»IU ^\\Ф-Щ2,н\\а /кг-1 \l/2 ^Срк2(Л). (9.45) Using (9.29), (9.44), and (9.45), and the fact that pk2(h)->0 as h-+0, we get CslhB, 1) Ak2,h ^U\ M*2.*> JJ^T ) -ЛкМ Uk2th, m \ф\\, (9.46) = C I Л ^-Y «.*«)(» "x .<=к2 + «2 where С >0 and is independent of ft. Combining (9.36), (9.39), (9.40), and (9.46) we obtain ,№г)\2 <c «sc ^c *2-l e2B, 1)+ S |aH X |a,||bw &2 — 1 00 £,2B, i)+ x w I Mt2)! IM 1=1 £=&2+Я2 fc2-i / со Ч1/2/ °° £2B,i)+ Z ki I i<ri2 I im2 1=1 \! = *2+42 ( = 1B+92 1/2"
Section 9 Abstract spectral approximation 711 ===C ^C ^C ^C £2B, D+Yk-lf I \42)\2I/2 max eft(l, i) A2-1 V/2/ °° V/2~ ел2B, 1) + £аA,/с2-1)У)с^Т E Kl2 £ 1«Г2,12 \l=l / \J = IC2 + «2 / , 1=1 'кг-l \ 1/2/ да \1/2~ >=i / \;=*2+в2 /*2-l \1/2/ °° г,2B,1) + £кAД2-1) I [a?»]2 J] |ei (kill 2 1/2' (9.47) is a quadratic inequality in 00 \ 1/2 1 [Л2 . ;=*2+«2 / whose solution yields *2-i X [a?»]2<C82(l, k2 -1) £ [a?2)]2 + Ce2B,1). ! = k2 + 42 i=l Combining (9.33) and (9.48) we get *2-l „<bh2 *2~1 X (af2iJ<DlE2B, l)+D2Cfi»(l,fc2-l) I (af2)J + £>2Ce2B, 1), (9.47) (9.48) and thus, since елA Д2 — 1) is small for h small, &2-1 I (а?г,J<Я5г2B, 1). (9.49) Next, combining (9.34) and (9.49), we get £ (flf2)LiWB,1). i = *2 + «2 Finally, from (9.31), (9.49), and (9.50) we have *2+«2~l ,№2), j = k2 fa ~ 1 со I (а^У+ £ (<4*2)J О Lj'=l J = it2 + «2 <Ce»B,l). 1/2 Redefining мк2 to be *2+«2- I J = *2 1 \ / «F4/ // *2+«2-l I «ГЧ ;=*2 (9.50)
712 1. BabuSka and J. Osborn Chapter II we see that ||мЛ2||а = 1, so that (9.2) holds, and \\uk2,h-uk2\\a^CBhB, 1). (9.51) Thisis(9.9)for! = 2,j=l. Comment on estimate (9.51). In the proof of (9.51) we used (9.30), which was proved in Step A. A careful examination of the proof of (9.51) shows that we did not use the full strength of (9.30), but only the weaker fact that \\uM — u} ||e->0 as /j-»0 for j ^ k2 -1. (Cf. "Overview of the proof".) Step B.3. Suppose q2^2. In Step B.l we estimated Xk^h — Xkr In this step we estimate A*2 + lift-4,+j. We proceed by modifying problems (9.1) and (9.4) by restricting them to the spaces Hk2'h = {ueH:a(u, мк2,л) = 0}, Sth ={ueSh:a{u,uk2,h)~0}, respectively, i.e., we consider the problems (9.1)*2''' and (9.4)fc2,h obtained by replacing H and S„ by Hk2-h and Skh2-" in (9.1) and (9.4), respectively. (9Af2-h has the same eigenpairs {XJth, ujth) as does (9.4), except that the pair (Ai2it, uk2th) has been eliminated, {9.\)k2'h has eigenpairs {XJ'h, uJ'h), which in general depend on h. Nevertheless, Aj^ = 42 + b l = 0,...,q2-2, (9.52) i.e., At2, the eigenvalue under consideration, is an eigenvalue of multiplicity q2 — 1 for problem (9.1)k2'h. Its eigenspace is Mk2-\Xk2)= {иеМAк2У.a(u, uM2)=0}. We can now apply the argument used in Step B.l to problems (9.1f2,h and (9A)k2'h and, using (9.7'), we obtain (cf. (9.29)) Ak2 + i,*-At2 + i^Cei,2(h) forfc<V (9.53) Since uklth depends on h, the problems (9. if2*'' and {9Af2-h depend on h. It follows from the "Comment on inequality (9.29)" with x = h that we can apply the argument in Step B.l obtaining С and h0 that are independent of h. To see this, note that fik2{h) = nk2, by (9.52), цкз(К)^цкз, by the minimum principle, and nkl{h)^fikl, since цк1 - fiki(h) s$p.ki - fiklih, by the minimum-maximum principle, and fikith -*[iki (cf. (9.51)), and hence that fikl(h)-nkJh) and цкг{к) - цкз(к) are bounded away from 0. Then note that r{h)=r<=ip{Tk{h)) = p{Th)u{nk2,H}, Int(r(/i))nff(Tfc(ft)) = Int(r)n(ff(rfc)-{/it2ih})={/it2 + lit,..., fik2+qi-lih}, and Нф) ~ fik2+qz -! ,л (h) = цк2 - цк1 + q2 _ x <h «c |rad(r(/j)) = irad(F) for all h. Step B.4. Suppose <j2>2 as in Step B.3. Here we show that Ufc2 -f i СЗ.П be chosen
Section 9 Abstract spectral approximation so that \\uk2 + uh-uk2 + 1\\a^CehB, 2). We know that \Сф, j), j=l,..., qu llU-h-U^\CehB, 1), j = q1 + l = k2 713 (9.54) (cf. (9.15), (9.21), and (9.51)). Assume that uk2+u..., ui2+?2_! have been redefined so that (9.2) holds. Write En (k2 + l). U; J=l If we apply the argument used in Step B.2 to uk2 + ith, i.e., if we let k2 be replaced by k2 + 1 and use (9.53) instead of (9.29), we obtain кг+ Ч2- 1 ukl + i,h- £ af\ j = k2 < Csh{2, 2). But, by (9.54), akk22 + i)\ = \a(uk2 + Uh, uk2)\ = \a(uk2 + UH, uk2-uk2<h) <ll«*2-«*2l*||e<Ce»B, l)<Ce»B, 2) and hence fc2 + «2-l «*2 + i,*- £ af2)uj J' = *2+1 <Ce2i2(ft). Redefining ик2 +1 to be '*2+«-l fc2+«2-l j = k2+l we see that [|и*2 + 1||в = 1, а(щ2 + 1> u,)=0, j=l,..., k2, so that (9.2) holds, and \\Uk2 + i,h-uk2 + i\\a^Ce2a(h), which is (9.9) for i=j = 2. Step B.5. Continuing in this manner we prove (9.8) and (9.9) for i = 2 andy'= 1,..., q2. Step C. Repeating the argument of Step В we get (9.8) and (9.9) for г = 3, 4, This completes the proof. □ Remark 9.2. BabuSka and Aziz [1973], Fix [1973], Chatelin [1975], and Kolata [1978] proved the estimate Aki+j-i,h — л^+j-i ^Ceh(Ai), j — 1,..., qt, (9.55) where ей(А;) is defined in (8.21). Estimate (9.55) is weaker than (9.8). For ;' = 1,..., qi—l, (9.8) shows higher rates of convergence for certain problems; see the discussion of multiple eigenvalues in Section 10.2. Birkhoff, de Boor, Swartz, and Wendroff [1966] estimated Xkl+j-ljh—/iki+j-i in terms of the sum of the squares
714 /. BabuSka and J. Osborn Chapter II of the a-norm distances between Sh and the unit eigenvectors associated with all the eigenvalues Xi not exceeding Xkj. Remark 9.3. In BabuSka and Osborn [1989], simplified proofs of (9.8) and (9.9), as well as refinements of these estimates, are given. These proofs are based, in part, on the estimate (8.47a) of Chatelin. BabuSka and Osborn [1989] also contains improved eigenvector estimates in the norm ||«||ь. Cf. the comments at the end of Section 8.
Chapter III Applications In this chapter we apply the abstract results developed in Chapter II to several representative problems. 10. The Ritz method for second-order problems 10.1. Vibrations of a free L-shaped panel We consider the problems of the plane strain vibration of an L-shaped panel Q with free boundary. The specific shape of the panel is shown in Fig. 10.1. ft . V У A, @,0) As — . X •+■ H Fig. 10.1. The L-shaped panel Q. The equations governing the vibration of an elastic solid were discussed in Section 1 (see A.33)-A.35)). Corresponding to the L-shaped panel we have the eigenvalue problem n ^в * -(A + fi)- цАи = <ори, дв -(А + ц)- nAv = a>pv, ду (x,y)eQ, A0.1) 715
716 I BabuSka and J. Osborn Chapter III where в = (ди/дх) + (pv/ду). We obtain A0.1) from A.35) by assuming that u(x, y, z) and v(x, y, z) are independent of z and that w(x, y, z) = 0. The boundary conditions describing the traction-free boundary are Xn=Yn = Q, (x,y)eT = dQ, where Equations A0.2), with X„ and Y„ given in A0.3) are the Neumann conditions discussed in connection with the elastic solid specialized to the L-shaped panel. We now consider the specific case in which v= — =0.3, E=— = 1 (i.e., in which A = y§ and fi — ^). v is called Poisson's ratio and E is called Young's modulus of elasticity. G = \i is called the modulus of rigidity. Note that 0 =$ v < j for any material. We now discuss the basic steps in the finite element approximation of the eigenvalues and eigenfunctions of the problem A0.1), A0.2), or, more generally, of any eigenvalue problem. These steps are as follows (see Sections 10.1.1-10.1.3 and Section 10.1.1'): A) Derivation of a variational formulation (8.10) for A0.1), A0.2) and verification of conditions (8.1)-(8.3), and (8.6) ((8.1), (8.6), (8.32), and (8.33) in the selfadjoint, positive-definite case). B) Discretization of (8.10) and assessment of the accuracy of the approximate eigenvalues and eigenfunctions. The discretization proceeds by the selection of the trial space Slih and test space S2,h, verification of (8.11)—(8.13), consideration of the finite-dimensional eigenvalue problem (8.14), and explicit construction of the matrix eigenvalue problem (8.15). The accuracy of the approximation is assessed by means of the application of the results of Section 8. C) Solution of the matrix eigenvalue problem (8.15). The accuracy of the approximation method (8.14) depends in a crucial way on the trial and test spaces Slih and S2,h, and their rational selection is strongly influenced by the available information on the eigenfunctions, typically information regarding their regularity. Thus, also of importance is: (Г) Analysis of the regularity of the eigenfunctions. Remark 10.1. The approximation methods we will discuss in this section are referred to as Ritz methods, by which we mean that the eigenvalue problems under consideration are selfadjoint and positive-definite and that the test and trial space are equal (S1,ft = S,2,ft = S'(l); see the discussion in Section 8. A0.2) A0.3)
Section 10 Applications 717 10.1.1. Variational formulation We begin by casting our problem in the variational form и e H, a{u,v) = a>b{u,v) V v e H, A0.4) where H is an appropriately chosen Hilbert space and a and b are appropriately selected bilinear forms. This process was explained in Section 3. We typically proceed as follows. Multiplying the first equation in A0.1) by ф, the second by ф, adding the resulting equations together, and integrating over Q, we obtain -{Л + ц)—-цАи дх Ф+ ду ф > dx dy = Q) р(иф + vф) dx йу. A0.5) Now, integration by parts shows that Ф + дв -{X + ii)—-fiAu ex дв -{k + H)—-liAv dy ф > dx dy (X + H)e^-dxdy- ox (Х + 1л)вфпх ds + jxVu • Чф dx dy du ц—ф ds + on {k+nN^-dxdy- \(X + nWnyds + dv /zVu-Vi^ dx dy— | ц-г-ф ds i j/, л Jbu Ъ>\(Ъф дф [/8m dv дф дф\ ди дф dv дф ду дх/ дх ду ду 8х_ dx dy - (Хпф+У„ф)&. A0.6)
718 /. BabuSka and J. Osborn Chapter III Combining A0.5) and A0.6) we see that if (со, (и, v)) satisfies A0.1) and A0.2), then /ди дю\/дф дф \дх ду/\дх ду + fi ди Su\/9</> дф\ ди дф dv дф ду дх)\ду дх/ дх ду ду дх dx йу = со р(иф + ьф) dx йу A0.7) for all smooth (ф, ф), and, conversely, if A0.7) holds for all smooth (ф, ф), then A0.1) and A0.2) hold, provided и and v are smooth (u,veH2(Q)). From A0.7) we see how to choose H, a, and b in A0.4). Let H = H\Q)xH\Q\ ||(м, г)||| = and on H define the bilinear form a(u, v; ф, ф) Nlb+INb /n . ч, 9и dv A + ад^ + ё7 дф дф_ Ьх ду A0.8) + И ди dv ду дх дф дф_ ду дх (8м 9^ dv дф 'дх ду ду дх dx dy. A0.9) It is immediate that (8.1) is satisfied and that a is symmetric. Let us remark that a(u, v; u, v) has the physical meaning of the (double) strain energy and that Ja{u, v; u, v) is referred to as the energy norm of (u, v). Recall from Section 8 that b is to be defined on a space W^>H. Let W=L2(Q)xL2{Q), and define (u,v)\\2w=\\u\\ln+\\v\\ln A0.10) b(u,v; ф,ф) = р{иф + ьф) dx Ay. A0.11) It is immediate that b is symmetric and satisfies (8.7) and (8.33) and that Я с W, compactly. It remains to consider (8.32). Note that since a and b are symmetric, H1(Q) and L2(Q) may be taken to be real. We begin by expressing a(u, v; ф, ф) in terms of the Poisson ratio v and the modulus of rigidity G: a(u, v; ф, ф) 1G ~ l~2v , ди дф dv дф\ дх dx ду ду J
Section 10 Applications 719 (Ъиоф ЪоЪф\ 1/( /Эй 8»V8^ S^lj j „л,„ч +v(^ ^+ъу i)+iA -2v\ry+ы){ъу+£))dx dy- {Ш2> From A0.12) we have a(u, v; u, v) 2G l-2v A-v) ди\2 /81Л2" дх) +{бУ/ +*-K£+£)*u* 2G l-2v (l-2v) 8uV /aeV" „ 6m 3z; ox cy ,du 8u\2] +i(l-2v)(- + -J [dxdy. A0.13) Recalling that 0O<i, we see from A0.13) that a(u, v; u,v)"^0 Уи,и (as was to be expected from the physical interpretation), and that a(u, v; u,v) = 0 if and only if V = ^ci,c2,c3 = C3 — C2X A0.14) for some cb c2, c3. These displacements, which are characterized as having no strain energy, are the "rigid body motions," i.e., translations and rotations. Thus (8.32) does not hold with H defined by A0.8), but the above considerations suggest that it might hold if Я is replaced by a smaller space that did not include the rigid body motions. In fact, if we define A0.15) H=<(u, v)eH: p{uuCUC2<C3 + vvCuC^Ci) dx dy=0 Усъс2,сЛ, then it can be shown (see NeCas and HlavACek [1970] and Knops and Payne [1971]) that a(u,v; u,v)^a\\{u,v)\\2H = «(IMI?,o+INI?,o) V(K,»)efi, where a is a positive constant. This is (8.32). We thus restrict a(u, v; ф, ф) to H and b(u, v; ф, ф) to W=U u,v)eW: j f>(uuCuC2yC3 + vvCuC2iC3) dx dy = 0 Ус1,с2,с3 A0.16) A0.17) For the eigenvalue problem A0.1), A0.2) we therefore have the variational formu-
720 I. BabuSka and J. Osborn Chapter III lation 0^{u,v)eH, HO 18) а{и^;ф,ф) = а>Ь(и,ю;ф,ф) Щ,ф)еН. Thus, with a, b, H, H, W, and ^chosen as in A0,8)-A0.11), A0.15)-A0.17), we see that a and b are symmetric and conditions (8.1), (8.7), (8.32), and (8.33) are satisfied. Equation A0.18) is a selfadjoint, positive-definite problem of the type studied at the end of Section 8. As stated in Section 8, A0.18) has a countable sequence of eigenvalues 0<£01^ftJ^---f+CO and corresponding eigenfunctions («i, vx),{u2, v2),..., which can be chosen to satisfy а(щ, vt; Uj, Vj) = cojb(ui, vt; up v^oifa. When implementing our approximation method it is simpler to consider the eigenvalue problem on the space H instead of on H, i.e., to consider A0.18) with Й replaced by H. Then ш0=0 will be a triple eigenvalue with eigenfunctions (u, v) given by A,0), @,1), and (—y, x). These eigenpairs and their approximations are then ignored. If the rigid body motions are included in the space Slffl and S2,h, then ш0 = 0 is also a triple approximate eigenvalue with the rigid body motions again the corresponding approximate eigenfunctions. If this is not the case, then dealing with H and H does not lead to the same approximate eigenvalues and eigenfunctions. It is easy to analyze the case in which the rigid body motions are not in SljS and S2,h, but we will not do so. Alternately, the validity of A0.16) or (8.32) can be ensured by considering a(u, v; ф, ф) = а(и, v; ф, v) + b(u, v; ф, i//) instead of a(u, v; ф, ф). Then the triple eigenvalue 1 would be the lowest eigenvalue. Usually the first alternative is used. 10.1.Г. Regularity of the eigenfunctions We have seen in Section 8 that the accuracy of the approximate eigenvalues and eigenfunctions depends on the degree to which the exact eigenfunctions and adjoint eigenfunctions can be approximated by elements in the spaces Slih and S2,h> respectively (see (8.23)-(8.26)). In the selfadjoint, positive-definite case this reduces to the degree to which the eigenfunctions can be approximated by Sh (see (8.44)- (8.46)). Since the approximability of the eigenfunctions depends on their regularity, it is essential to determine the basic regularity properties of the eigenfunctions. The eigenfunctions («,-, v{) of A0.18) have the following properties: - The functions щ and vt are analytic in Q\[jAj, where At are the vertices of Q. This follows from the general theory of elliptic equations (see Morrey [1966, Section 6.6]).
Section 10 Applications 721 - The functions щ and vt are singular at the vertices of Aj, and the character of the singularity is known. The strength of the singularity at Aj depends on the interior angle at Aj. For the domain we are considering, the strongest singularity is at the vertex A0. The leading terms of щ and vt at a vertex have the form M* = C1r'F1(e), v* = C2r°F2(e), A0.19) where (r, 0) are the polar coordinates with origin at the vertex, a depends on the interior angle and on X and /i, and Ft@) and F2@) are analytic functions of 6. The value of a is characterized as the root of a nonlinear equation and, in general, can be real or complex. For our example of the L-shaped domain, сг = ст0=0.544481... for the vertex A0. For a more complete discussion of the singularities of solutions of elliptic equations in polygonal domains we refer to Kondratev [1968], Merigot [1974], and Grisvard [1985]. Using their results, any eigenfunction can be written as (u, v) = (u1, d1)+(h2, v2), where u2, v2eHk(Q), where к is an integer which is greater than or equal to 3, and (u1, v1) is a linear combination of functions of the type on the right-hand side of A0.19) with сг^ст0 and with coordinates centered in the vertices of Q. Application of the method used in the proof of Theorem 2.1 shows that, for our domain, u* and v* and thus щ and vt are contained in H°°+1{£2), with <x0 = 0.544481... . This statement of the regularity or smoothness of the eigen- functions is the strongest that can be made in terms of Sobolev spaces (without weights). - It is also possible to show that u,ve^j(Q), for any р><т0(Р = (т0 + е); see Theorem 4.4. Of course, the space &}{Q) is much smaller than Йао+1(С2) and hence we can make a more effective choice for Sh if we use Щ{0) instead of Hao+i(Q). 10.1.2. Discretization of (8.10) and assessment of the accuracy of the approximate eigenvalues and eigenfunctions The discretization of (8.10) is accomplished by selecting the trial and test spaces SUh and S2,h satisfying (8.11)—(8.13), considering the finite-dimensional eigenvalue problem (8.14), and deriving the matrix eigenvalue problem (8.15) from which the approximate eigenvalues are obtained. The selection of Slft and S2)* is the most important part of this process. It is influenced by three considerations. (a) The spaces Slth and S2ih have to satisfy (8.11) and (8.12). Note, however, that if the problem under consideration is selfadjoint and positive-definite, from (8.32) we see that (8.11) and (8.12) hold for Suh = S2,„ = Sh, for any Sh. Our problem A0.18) is selfadjoint and positive-definite and we will take S1,h = S2th = Sh. (b) Slh should accurately approximate the eigenfunctions of A0.18) and S2th should accurately approximate the adjoint eigenfunctions. Usually we also require that the rigid body motion functions are included in S1A and S2,ft. If this is not the case, then we have to assume that the rigid body motion functions are very well approximated. If they are not well approximated, although there will be no change in the asymptotic rate of convergence, the accuracy will deteriorate, especially with long domains (such as long beams), for which the rigid body motions for some parts of the domain could be relatively large. (c) The matrices A and В in (8.15) should be reasonably sparse, since sparsity is strongly related to computational complexity. Sparsity is achieved by choosing
722 /. BabuSka and J. Osborn Chapter HI finite element spaces for Slih and S2,h- These spaces then have bases consisting of functions with local supports, and, as a consequence, A and В will be sparse. We note that the sparseness of A and В is not required for the validity of the results of Section 8 and, in fact, in certain applications one does use non finite element type trial and test spaces, spaces consisting of global polynomials or trigonometric polynomials, e.g. We now describe some typical choices for Sh for the L-shaped panel. 10.1.2.1. The h-version on a uniform mesh. Let Q be covered by a mesh of uniform squares /y of size h as shown in Fig. 10.2. Then for p= 1,2,... let S>{oefi'(fl), u\Iti= £ A^xYWy}, where 3>(p) = {{m,n): 0^m,n, and m + n^p or (m,n) = (l,p) or (p, 1)}. A0.20) A0.21) > Fig. 10.2. A uniform mesh on Q. Spaces of this type are said to be composed of elements of type Q'p (the cases p = 1,2,3 are discussed in Ciarlet [1978]). Basis functions for these spaces can be constructed in various ways; for example, by means of Lagrange elements (see Ciarlet [1978]) or by use of hierarchical elements. Regarding the approximation properties of the family {S£}0<a, it can be shown (see Ciarlet [1978]) that if k^ 1 is an integer, then *Ч11и-х111.п^С(р)й* IMIw, eSl for any и е Hk(Q) and any h > 0, where /i = min(fc, p+ 1) A0.22) A0.23) and C(p) depends on p, к and Q, but is independent of и and h. Equation A0.22) is optimal in the sense that /i" on the right-hand side cannot be replaced by a higher
Section 10 Applications 723 power of h when the mesh is uniform. If к is not an integer, then we have mu-xh^Ctpih^Mew, A0.24) with \i given by A0.23). Note that we have not said anything about the dependence of C(p) on p. The proof in Ciarlet [1978] suggests that C(p) grows with p, and thus could lead to the conclusion that it is improper to use p>k — 1. However, this conclusion is not justified because, in fact, С(р)^Ср~(к~1} (see BABUSKAand Suri [1987b]). We will now derive A0.24)-A0.23) from A0.22)-A0.23) using the method outlined in Section 2 (cf. B.12)—B.16) and Theorem 2.1). Suppose m < к < m +1. Since INfi*@)= sup {r'K(u,t)}, 0<I< oo where 6 = k — m, we see that K{u,t)*bte\\u\\a*(py Let e>0. Then for any t>0 there exist vteHm(Q) and wteHm+1(Q) such that и = v, + w, and II v,1| m,„ +11| w, || m+! ,Q < K(u, t) + t'e < t"( || и I! ms2) + e). Therefore, using A0.22)-A0.23) we can choose Xi,r>X2,<eS£ so that ll»r-Zi.«lli.o<C(p)fc-1||»(IL.0 where ц1 = тт(т, р+1), and l|w,-Z2.,lli,n<C(p)A«-1||wt||11I+1,0 < C(p)ft" -1 te"!(|| u||e*@) + e), where /x2 = min(m+l, p+1). Letting Zt = Xi,r + Z2,u we tnus have ll«-Z,lli.0<C(p)(ft'"-1te + ft«-1t,-1)(||«||e140) + E) for any f > 0. If p^m, select t = /i in A0.25) to obtain inf ||«-zlli,o<C(p)fce+'"-1(ll«llffW) + e) = С(р)йк-1(||ц||йк(й) + е) = С(р)/г"-1(||"Ь^,+в), where // = min(fc, p+ 1). If p<m, let t=l in A0.25) to get inf ||м-х||1>о<С(р)Л'"-1A|и|1й'Ч0) + е) = C(p)h"(\\u\\BH[i) + £) = C(p)h"-1(\\u\\m(ii) + e), A0.25)
724 I. BabuSka and J. Osborn Chapter III with n=min(k,p +1). Letting s— !|u||g*(n) in these estimates yields A0.24)-A0.23). Now define SUh = S2,h = Sh = SpkxSl A0.26) We remark that the rigid body motions belong to Sh(d. A0.14)). Since A0.18) is selfadjoint and positive-definite, it satisfies (8.32), and hence (8.11), with f}(h) = a, and (8.12) hold. A0.22) and A0.24) show that Sh accurately approximates the exact eigenfunctions. Thus A0.22) and A0.24), together with a density argument, show that (8.13) is satisfied. If an appropriate basis is chosen for Sh, the matrices A and В in (9.15) can be calculated and they will be sparse. Thus the issues raised above in (a), (b) and (c) have been addressed. Now consider the problem (8.14) with this choice for S1>(! and SZth and denote its eigenvalues and eigenfunctions by 0<o>i.*<-<<»w.* and ("i,/,.»i.k)»---»("\,),.%,*)» where N = dimSA. To assess the accuracy of these approximate eigenpairs, the results of Section 8 will be applied. All of the hypotheses for these results have now been shown to be satisfied for our problem and approximation procedure. Theorem 10.1. Let Slh and S2 A be selected as in A0.26). Suppose a>k is an eigenvalue of A0.18) with multiplicity q, i.e., suppose (ok~1<ook = (Dk+1 = --- = a>k+q_1<a)k+q. Then \(о1И-сок\^С(р)^-088962--, j=k,...,k + q-l. A0.27) If(wk,zh) = {ujth,Vjih),j=k,...,k + q — l, then there is a unit eigenfunction (u,v) = (uh,vh) o/A0.18) such that \\u-wh\\un+\\v-zh\\ua^C(p)h0-54"81-, A0.28a) and if(u, v) is a unit eigenfunction of A0.18) corresponding to cok, then there is a unit vector K,zfc)esp{(nw, »w),..., (uk+q-i,h, vk+q„lih)} such that ll«-w^.0 + ll»-zJi,0<OP)&0-544*81---. A0.28b) If(ok is simple, the eigenfunction estimates reduce to ll«M-«*lli.o + ll"*.*-"*lli.O<C(p)fc0-344*81-. A0.28c) Proof. We saw in Section 10.1. Г that ut and Vj are in H"°+1, with <r0=0.544481....
Section 10 Applications 725 Thus from A0.22)-A0.24) we have £ft= sup inf ||(«j,0^-(Zi,Z2)IIh,(O)xh1(O) j = k,...,k+q-l X = (Xl,X2)sSh <C(p)h"° Equations A0.27) and A0.28) follow from this estimate and (8.44)-(8.46). □ To show the effectiveness of estimates A0.27)—A0.28) we would have to know the exact eigenfunctions and eigenvalues. Because these are not available we consider instead the quantity Q{p,h)= inf a{u*-il,v*-x2;u*~il,v*-x2), X = (XuXl)eSh A0.29) where u* and v* are given in A0.19). Q(p, h) can be computed numerically. Figure 10.3 shows the graph of \\e\\E,R = №p,h)Mu*,v*;u*,v*)y>2 40 . 30 IT <u о = cr _ СГ 2 ш о: о in.* c- < tz. r^UJ Id g z ■A) Ih 10 9 8 I i pr 1 X p-7, о. ^ *-., 0.5445 к ) ч С \ "TV.. i ^<± •■л """Д 1 4 MESH SIZE h 10 Fig. 10.3. The relative approximation error measured in the energy norm. The Ji-version. as a function of h, for various values of p. ||e||ER is the relative error in the energy norm measure of the degree to which (u*, v *) can be approximated by functions in Sh. The graph, which is plotted in log-log scale, is a straight line and thus IIE.R- ■ Ch\ where a is the slope of the line. We see that the slope is very close to the theoretically predicted a=0.544481.... Increasing p decreases the constant С but does not affect the slope a. From an analysis of Fig. 10.3 we can draw several conclusions: - To achieve an accuracy of 5% (respectively, 3%) with elements of degree p= 1
726 I. BabuSka and J. Osborn Chapter III we would require ЛГ to be about 25,000 (respectively, N to be about 170,000) and with elements of degree 2 we would require N to be about 19,000 (respectively, N to be about 124,000). This shows that a uniform or quasiuniform mesh is completely unacceptable for our problem. - Because the rate of convergence for eigenvalues is twice that for eigenfunctions, we see that the eigenvalues are much cheaper to compute than the eigenfunctions. Roughly speaking, we see that for eigenvalue calculations the required number of unknowns would be about N = 160 (respectively, about N — 400) for p — 1 and about iV=140 (respectively, iV = 350) for p=l - While A0.22) qualitatively characterizes the error behavior, it does not give all the desired quantitative information because С and ||u||t>fi are not known. More precise quantitative information can be gained only by a posteriori analysis. We will not, however, be able to pursue this direction. For a survey of results on a posteriori assessment of the quality of finite element computations, we refer to Noor and BabuSka [1987]. A posteriori error analysis is used also in connection with adaptive approaches, in which the goal is to let the computer construct the mesh required to achieve the desired accuracy. 10.1.2.2. The p-version. In the h-version of the finite element method accuracy is achieved by letting h\0, while p is held fixed. In the p-version of the finite element method, one, in contrast, fixes h and lets pfao. Let S% again be defined by A0.20)-A0.21). Regarding the (p-version) approximation properties of the family {8ь}р=1Л,..., it can be shown that if u = ut +u2, where uxeHk(Q), with k^2, and u2 =-КгпР{в), with a>0, then inn\u~X\\i,a^C(h)[Kp-2°+p-*-1)\\u1\\k,nl. (Ю.30) We remark that in A0.30) it is essential that the origin of Q lies on an element vertex; for in this case, the estimate for uz is of twice the order as would be obtained if we based our estimate on the assumption that M2e#ff+1 and used the h -version with a uniform mesh. For a proof of A0.30), see BabuSka and Suri [1987a]. Define SUp = S2,p = Sp = SphxSl A0.31) Then (8.11), with p\h) = a, and (8.12) are satisfied. A0.30) shows that Sp accurately approximates the exact eigenfunctions and thus that (8.13) is satisfied. We see that the issues raised in (a), (b) and (c) have been addressed. In connection with (c), however, we observe that the matrices A and В are less sparse than with the h-version. Note that the parameter p, which approaches oo, is here playing the role of the parameter h in Section 8, which approached 0. Now consider the problem (8.14) with this choice for S1>p and S2,p and denote the eigenvalues and eigenfunctions by and
Section 10 Applications 727 where N = dim Sp. As with the h- version, the accuracy of the approximate eigenpairs may be assessed with the results of Section 8. Theorem 10.2. Let St and S2tP be chosen as in A0.31). Suppose cok is an eigenvalue of A0.18) with multiplicity q. Then -cok\^C{h)p -2.177924... j=k,...,k + q-l, and ,+ I»» A0.32) A0.33) *k,p — "k II1 ,S2 ~т II vk,p — Dk li 1,0 ^ C(n)P Note that we have given the eigenfunction estimate the simplified form it has when a>k is simple; it would have to be modified if cok has multiplicity greater than 1. See the statement of Theorem 10.1. Proof. Suppose oik has multiplicity q and let w be either component of one of the eigenfunctions corresponding to cok. We have seen that w can be written in the form w = w1 +w2, where w2sHk(Q), with fc>3, and w2 is a sum of terms of the type A0.19) with o->CTo and with coordinate centers at the vertices of Q. Because a0 = 0.544481... in A0.19), from A0.30) we have ер^СДОр-1-088962"-. A0.32) and A0.33) follow from this estimate and (8.44)-(8.46). □ To illustrate the performance of the p-version we consider, as with the Ji-version, the relative error in the energy norm measure of the degree to which (u*, v*) can be approximated by Sp (cf. A0.29)). Figure 10.4 presents the graph of ||e||E>R as a function of p, for various values of h. Again the log-log scale is used. We see that 2 3 4 56789 10 DEGREE p OF ELEMENTS Fig. 10.4. The relative approximation error measured in the energy norm. The p-version.
728 /. BabuSka and J. Osborn Chapter III the slope is close to the theoretically predicted -1.088962.... This is valid only for p > 3, but recall that all our results are of an asymptotic nature. To assess the relative effectiveness of the h- and p-versions, to understand, in particular, their dependence on the choice of Sk and Sp is not easy. Here we content ourselves with a brief assessment in terms of the number of degrees of freedom: N = 2 dim S% = dim Sk = dim Sp, In Fig. 10.5, the relative error in the energy norm measure of the accuracy is plotted as a function of N. Since N^h~2 and N~p2, the rates of convergence shown in Fig. 10.5 are half those shown in Figs. 10.3 and 10.4. We see that with respect to degrees of freedom, the p-version with h=j performs better than the h-version with p = 1, 2, 3, or 4. 50 80 100 300 500 8001000 NUMBER OF DEGREES OF FREEDOM Fig. 10.5. The relative error in the energy norm in dependence on N. 3000 5000 8000 10.1.2.3. The h-p version. In this version of the finite element method accuracy is achieved by simultaneously decreasing the mesh size h and increasing the polynomial degree p. We here distinguish various cases. The major ones are: (i) uniform mesh and uniform p distribution (i.e., the polynomial degree p is the same on each mesh subdomain, i.e., element); (ii) refined (nonuniform) mesh and uniform p distribution; (iii) refined mesh and selective increase of degree p. We will now elaborate on cases (i) and (ii). Case (i) obviously combines the h- and p-versions discussed above. In this case one has: Theorem 10.3. Let SU(Kp) - S2,(h,P) = S(h>p) = Sphx5p and let сокЛКр) and (иклк<р), vkiM) be the associated eigenvalues and eigenfunctions. Suppose cok is an eigenvalue of A0.18) with multiplicity q. Then ;cir lo}uh,P)-o)k\^b<i[an hao, ^min(ffo,p-To)' „2ffo j = k,..., k + q-l, A0.34)
Section 10 and Applications 729 I! Щ,(ь,Р) ~ w* II i ,i? + II Чль,р) - vk 11 l ,o < C< min h. Lmin(<ro,P — ffo) -2G0 A0.35) w/геге ff0=0.544481... and С is independent of h and p. Proof. The basic approximation results for this type of approximation were proved by BabuSka and Suri [1987b]. A0.34) and A0.35) follow directly from these results and (8.44)-(8.46). □ In case (ii) we will consider only geometric meshes with ratio factor 0.15; see Fig. 10.6. This ratio is close to optimal. The space S(hiP) = §* x S£ is now more rB irk n=2 Fig. 10.6. The strongly refined mesh with n = 2 layers. complicated. 5% is defined by S£= {ueH1(Q): u\Ifj is the image of a polynomial in a square S = {(£, rj): |£|,|т/| ^ 1} or a triangle T={(f,i/):0<ij^^0<{<l}, for all subdomains 1и in the mesh}. For a more detailed description of §£ see BabuSka and Giro [1988b, 1989] and Szabo [1986]. For a thorough discussion of the h-p version in the one-dimensional setting and the optimality of the meshes and degree distributions. We refer to Gui and Babuska [1986]. For the two-dimensional setting see Gui [1988].
730 /. BabuSka and J. Osborn Chapter HI 400 800 1000 2000 4000 8000 NUMBER OF DEGREES OF FREEDOM Fig. 10.7. The relative error in the energy norm in dependence on N for various meshes. Figure 10.7 shows the performance of the p-version on meshes with various numbers of layers n, as well as the performance of the p- and the Jz-versions for uniform meshes. We typically see a reverse S curve for the accuracy of the p-version on a geometric mesh. The first part of the curve is convex and then it is concave, the slope approaching N ~"°. The h-p version appears as the envelope of the p-version on geometric meshes with various numbers of layers. This envelope shows the optimal relation between the number of layers and the polynomial degree. In BabuSka and Guo [1988b], it is shown that if us3$\{Q), with 0</?< 1, then a geometric mesh and a proper selection of the degree p leads to the exponential rate IE.R ;gCe" Using this approximation result and the results of Section 8, we obtain: Theorem 10.4. Suppose the components ut and vt of the eigenfunctions belong to Щ
Section 10 Applications 731 (in our case /? = 0.544481 ... + £). Then with a proper choice of geometric mesh and the degree p we have ](ok,p-cok\^Cs-2^ A0.36) and Kp-Mi,fl+KP-»Ji,^Ce-^, A0.37) where a. depends on the ratio of the mesh, the relation ofp and the number of layers, and the domain, but is independent of N. Proof. A0.36) and A0.37) follow directly from the results of Section 8 and the above estimate for ||e|]E,R. D Figure 10.7 clearly shows the effect of the proper selection of meshes and element degrees on the effectiveness of the finite element method. It also shows that the optimal choice depends on the required accuracy. The design of the mesh and selection of the degree of the elements is a delicate task. Various approaches to deal with this problem are in the research phase. One promising approach is to apply the principles of artificial intelligence (expert systems). For further details we refer to BabuSka and Rank [1987]. Figure 10.7 shows only the dependence of the accuracy on the number of degrees of freedom N. It is also essential to judge the complexity of the method with respect to such factors as number of operations, computer architecture, user's interaction, etc. For a detailed study of computer time, accuracy, and performance for various numbers of degrees of freedom, we refer to BabuSka and Scapolla [1987]. We can see directly from Fig. 10.7 that the proper mesh design leading to an accuracy of 5% has two layers (the ratio of the sizes of elements is of order 50) and p = 3. For an accuracy of 3%, the optimal p is 3 or 4 and the number of layers is 2 or 3 (which leads to size ratios from 50 to 300). The number of degrees of freedom is 200-300 (compared with 25,000-170,000 for a uniform mesh and p=l). 10.1.3. Solution of the matrix eigenvalue problem We have seen that the approximation procedure developed in Section 8 leads from the eigenvalue problem (8.10) or A0.18) to the generalized matrix eigenvalue problem (8.15), and that the matrices A and В in (8.15) are sparse if the bases for the trial and test spaces are properly chosen. We have proved convergence for each fixed eigenvalue, but convergence does not occur for a fixed percentage of the available eigenvalues. Nevertheless, from the error estimates in Section 8 we know that the low eigenvalues of (8.15) give reasonable approximations to the corresponding exact eigenvalues. The size of the matrix problem will thus be much larger than the number of eigenvalues we are attempting to calculate. The matrix eigenvalue solver, a crucial component of the complete computational procedure, should therefore be designed to effectively find the low eigenvalues of large, sparse, generalized matrix problems. An appropriate version of the Lanczos algorithm is suitable for this class of matrix problems and is often used in practice. We refer tr
732 1. BabuSka and J. Osborn Chapter III the monographs by Parlett [1980] and Cullum and Willoughby [1985]. Because the extraction of the eigenvalues is very expensive, various "tricks" are used in engineering practice to reduce the sizes of the matrices under consideration. We will not go further in this direction. Remark 10.2. It should be emphasized that, generally, the goal of the computation is to find, in addition to the eigenpairs, certain functionals of the eigenfunctions (u, v)—for example, the stress intensity factors, which are combinations of the derivatives of и and v. We will not pursue this direction since it lies beyond the scope of this article. We refer, e.g., to Babuska and Miller [1984] and Szabo and BabuSka [1986]. Remark 10.3. The complete computational resolution of an eigenvalue problem is influenced by a wide range of factors. Some of the most important of these— the smoothness of the eigenfunctions and the approximation properties of the trial and test space, for example—have been discussed in detail. Others—the accuracy of the matrix eigenvalue solver and the relation between the accuracy of the matrix solver and the error (okth~-cok, for example—have not been mentioned or have only been mentioned briefly. While these latter factors are important, we will not be able to pursue them. We also note that the important function of a posteriori analysis of computed data has not been discussed. Likewise we have not discussed any adaptive approaches. For some ideas on the assessment of the quality of the finite element computations we refer to the survey paper of Noor and Babuska [1987]. 10.2. Vibration of a membrane We consider here the eigenvalue problem associated with the small, transverse vibration of a membrane stretched over a bounded region Q in the plane and fixed along the edge Г = 90, i.e., the eigenvalue problem -Au = Xu inQ, u = 0 on Г A0.38) (cf. Section 1.2, in particular, A.27)). We turn now to a discussion of the basic steps A), (Г), B) and C) (cf. Section 10.1 above) in the finite element approximation of the eigenpairs of A0.38). The discussion can be brief since these steps are similar for the two problems A0.1)-A0.2) and A0.38), in fact for any eigenvalue problem. 10.2.1. Variational formulation Problem A0.38) is a special case of problem C.1) and the variational formulation C.15) of C.1) was derived in Section 3. Thus we see that the variational formulation of A0.38) is given by ueHi(Q), м#0 (Ю.39) f/cu 9t) du bv\ , , „ Г
Section 10 Applications 733 Let 5m dv bu dv\ . , a(u,v)= [ ^p £-+^- 5- jdxd.y = Vu-Vv dx dy be defined for u,veH = H}>{Q), and let b(u, v)= uv dx a be defined for u,ve W=L2(Q). Then A0.39) has the form of (8.10), and a and b are symmetric forms, (8.1), (8.7), (8.32), and (8.33) are satisfied, and HcW, compactly. All of this can be easily seen for the concrete problem we are considering; it also follows from the more general discussion in Section 3. Problem A0.39) is a selfadjoint, positive-definite problem. It has eigenvalues 0<Я1<Я2^---/'+оо and corresponding eigenfunctions uuu2,..., which can be chosen to satisfy Vut • Vvj dx dy = Aj utUjdx dv = <5;j. 10.2.1'. Regularity of the eigenfunctions From Theorems 4.1-4A we obtain the following regularity results for the eigenfunctions щ of A0.39) (or A0.38)). - For k^2, if Г=8й is of class C^then щеНЩ. - If Г is of class C00, then щ е C00^). - If Г is analytic, then щ is analytic in Q. - If Q is a curved polygon with analytic sides and with vertices A0, Au..., then щ is analytic in Q\\jAj. щ is singular at the vertices; the strengths of the singularities depend on the interior vertex angles. Moreover, ue3S}(Q) for properly chosen p. 10.2.2. Discretization of (8.10) and assessment of the accuracy of the approximate eigenpairs Suppose Q is a polygon. By a triangulation or mesh on Q we will mean a finite family т = {ri}f£(T1) satisfying: - each Tt is a closed triangle, - u=\J№Tt, - for any T; and 7}ет, Tir\Tj = 0 or a common vertex or a common side. For 0<a<7r, a triangulation г is said to be ос-regular if the minimal angle of every triangle Tez is greater than or equal to a. For any т, let
734 I. BabuSka and J. Osborn Chapter III h = h{x) = max diam Tt i = 1,..., M(i) and h{x)= min diam T;. i = l,...,M(t) т is said to be g-quasiuniform if h(T)/h(r)^q. We will often view triangulations т = zh as parameterized by h = h(x) and consider families у = {т} = {тй} of triangulations that are a-regular. An example of a irregular, l-quasiuniform triangulation of the domain Q~{(x, у): -1<х<1, — \<y<\} is shown in Fig. 10.8. It is called a uniform triangulation. Fig. 10.8. A uniform triangulation. For т a triangulation of Q and p= 1, 2,... let Sp(t) = {«: ueH1(Q), u\T = & polynomial of degree p for each Тег} and let SS(T) = S'(T)nHj(fl). Regarding the approximation properties of Sp(t) and Sob), if fc ^ 1 (integer or non- integer) and p > 1 and if у — {т} is a family of a-regular triangulations of Q, then inf ||«-zlli.a^C(fc(T)"-1/Pi)ll«llt.o *eS'<T> A0.40a) for any и е Я*(£2) and for any т е y,
Section 10 Applications 735 and inf ||«-zlli.0<a*W-1/Pi)ll«llw *eSS« A0.40b) for any иеHk(Q)nHl(Q) and for any xey, where li=raw(k,p + l). A0.41) The constant С in A0.40) is independent of p, т and u, but depends on Q, к and a. For a complete proof of these estimates we refer to Babuska and Suri [1987b]. Now define Si .<».,) = S2,lh,p) = S(M) = 5S(t). A0.42) Since A0.39) is selfadjoint and positive-definite and satisfies (8.32), we see that (8.11) and (8.12) are satisfied. Equation A0.40b) shows that S(ktP) accurately approximates the exact eigenfunctions and thus that (8.13) is satisfied. If a suitable basis is chosen for 5(fc>p), then the matrices A and В in (8.15) will be sparse. The issues raised in Section 10.1.2(a)-(c) above have now been addressed for this choice for SM. Note that in using the notation S{h:P) we are identifying h = h(?) with т. An alternate, and more explicit, notation would be S(tiP). Now consider problem (8.14) with S(AiP) defined as in A0.42) and denote the approximate eigenvalues and eigenfunctions by and where JV = dim S(AiP). To assess the accuracy of these approximate eigenpairs, we apply the results of Section 8, obtaining: Theorem 10.5. Let S(hrP) be selected as in A0.42) and suppose X} is an eigenvalue of A0.39) with multiplicity q. Then \AlAh,p)-Aj\^C(h2>-2/p2k-2), l=j,...J+q-l, A0.43) and Wuj^^-Ujh^Cih"-1^-1), A0.44) where к ^ 1 is such that the eigenfunctions corresponding to ki are in H\Q) and fi—min(k,p+l). Note that we have given the eigenfunction estimate the simple form it has when kj is simple; it would have to be modified in the general case. See the statement of Theorem 10.1. Proof. Suppose A,- has multiplicity q. Then A0.39) has eigenfunctions u,-,..., uj+g- x associated with Ay, by assumption, these eigenfunctions are in Hk(Q). Thus, by A0.40)-A0.41), we have
736 I. BabuSka and J. Osborn Chapter HI ea= max inf ||w,-x||lifl l=j,...,j + q- 1 JfeS(h,p) ^C^-Vp**1) max \\щ\\к,п /=j,...,j+e-i «C^-Vp*-1). A0.45) A0.43) and A0.44) follow directly from this estimate and (8.44)-(8.46). П Remark 10.4. If our membrane is free instead of fixed along its edge, then we would have considered the Neumann boundary condition 9u/3n=0. In this situation the eigenvalue problem would have the variational foundation иеН, и Ф 0, A0.46) a(u, v) = Xb(u, v), where a and b are as above, but Я = Ь:мбЯ1@), и dx Ay = 0 >, INlH=llMlli,fi- We would choose S№jP) = ^:u6S"(t), и dx dy = 0 for the trial and test space. Then all of the hypotheses in Section 8 are satisfied, approximation results similar to A0.40) can be proved, and for the approximate eigenpairs, the error estimates A0.43) and A0.44) follow. We note in particular that the Neumann boundary condition is only implicitly stated in A0.46), i.e., is natural, and thus that the boundary condition need not be imposed on the trial and test functions. This fact makes implementation easier, especially for domains with curved boundaries. See the discussion of natural and essential boundary conditions in Section 3. 10.2.3. Solution of the matrix eigenvalue problem The comments made in Section 10.1.3 apply here as well. Example 10.1. Consider the membrane eigenvalue problem A0.38) on the domain Q shown in Fig. 10.1. The eigenfunctions will be analytic on П, except possibly at A0. Because of the various symmetries in Q, the leading term in the singularities in the eigenfunctions at A0 are as follows: (a) for the first (simple) eigenfunction, u = Cr2/3sinf0+---,
Section 10 Applications 737 (b) for the second (simple) eigenfunction, K = Cr4/3sinf0+-, (c) for the third (simple) eigenfunction, и is analytic on Q, in fact, u = (fI n л ~1 sin nx sin ny. We see that the higher eigenfunctions can be smoother than the lower ones. The regularity of the eigenfunctions for problems with piecewise analytic data has been discussed in Babuska, Guo and Osborn [1989]. Babuska, Guo and Osborn [1989] also contains the results of detailed numerical computations for finite element approximation of the eigenvalue problem mentioned above; we will now report some of these numerical results. Strongly refined meshes similar to those introduced in Section 10.1 (see Fig. 10.6) were used. The sequence of meshes with и layers is shown in Fig. 10.9; the associated finite element spaces are denoted by S(n>p). n=0 n = l 7 \ 0.15 0.15 DETAIL n=2 O.^ 0.I52 DETAIL n=3 0A5° O.I52 0.15" 0.15' Fig. 10.9. Meshes for the finite element spaces.
738 /. BabuSka and J. Osborn Chapter III О cc or UJ ш > i 2 345678 10 DEGREE p Fig. 10.10. The relative error tt (n . In Fig. 10.10 we show the relative error s^n.p) in the best approximation of the first eigenfunction ux, when using n layers and polynomials of degree p, i.e., The theoretically predicted slope of f is also indicated. It is interesting to observe that for low accuracy the performance of undistorted elements (n = 0) leads to better results than refined meshes. Table 10.1 shows the value of Г — (^'•(r».p)~"^i)A £!.(".P) i=l, 2, 3, (cf. (8.47c)), where by А£>(в,р) we denote the approximate eigenvalues and £;,(«,?)= inf ||Mi-xlli,ft. xeS(„,p)
Section 10 Applications 739 Table 10.1. The values of C. .. p 1 2 3 i. J 6 7 8 1 2 3 i. ; 6 7 8 1 2 3 я 4 л3 ^ 6 7 8 0 — 0.912 0.967 0.962 0.977 0.984 0.992 0.995 — 0.867 0.787 0.780 0.871 0.948 0.966 0.975 — — 1.150 0.831 0.831 0.905 0.905 0.942 1 1.322 0.988 0.947 0.945 0.991 0.995 0.998 0.999 1.795 0.919 0.892 0.886 0.932 0.932 0.963 0.974 2.692 1.126 1.121 0.902 0.899 0.904 0.928 0.946 к 2 1.349 0.978 0.932 0.929 0.965 0.973 0.997 0.998 1.777 0.921 0.893 0.886 0.931 0.929 0.962 0.975 2.740 1.128 1.124 0.902 0.899 0.905 0.928 0.946 3 2.346 0.978 0.932 0.927 0.954 0.962 0.992 0.996 1.777 0.978 0.932 0.927 0.934 0.962 0.991 0.995 2.740 1.128 1.124 0.901 0.898 0.904 0.928 0.946 Figure 10.11 shows the relative error in k1A„iP) in dependence on the number of degrees of freedom N. We see that to get relatively high accuracy for the eigenvalues is not difficult. In our case, with TV = 2 we already have an accuracy of 5%. Note that an unrefined mesh (iV=0) is more effective than a refined mesh (N>0) for low accuracies (see Figs. 10.10 and 10.11). Thus, while it is worthwhile, in general, to refine the mesh at appropriate places, in some cases requiring only low accuracy, there is no advantage in refinement. Multiple eigenvalues. The results proved in this subsection and in Section 10.1 cover the case of multiple eigenvalues. Recall that the estimates for \XjAh_p) — i,-| and \\uuh,p)—M;!li,fl are in terms of £Л= max inf Wu^-xWi.o, 1=1,...,q *eSi,(h,p] where q is the multiplicity of X}and utl,...,uiq are the corresponding eigenfunctions. We now make some comments on multiple eigenvalues and then make an application of the refined error estimates for multiple eigenvalues proved in Section 9.
740 BabuSka and J. Osborn Chapter HI о се се ы > ш се 0.01 0.001 4 10 20 40 100 200 400 1000 NUMBER OF DEGREES OF FREEDOM N Fig. 10.11. Relative error in A l.(n.P)- The eigenvalues and eigenfunctions of the membrane problem on a square, i.e., the problem -Au = Xu inQ, m = 0 onf, A0.47) where Q={(x,y):\x\,\y\<n}, are easily seen to be given by and ukl = sin kx sin ly, k, I — 1, 2,.. . . Hence we see that there are multiple eigenvalues. Problem A0.47) is typical of problems with symmetries (A0.47) is symmetric with respect to x and y), and we thus see that multiple eigenvalues are common in applications. For i= 1, 2,..., let kki be an eigenvalue of A0.47) of multiplicity qt, i.e., suppose "■ki-l <^ki =^*i + l ='" — kki+qi-i <Ki + qi =лк( + 1. Note that we are here using the notation introduced in Section 10, whereby
Section 10 Applications 741 ki = 1, k2 is the lowest index of the second distinct eigenvalue, etc. Suppose now that qt > 1, i.e., that lk. is multiple. Let {Sh} be any family of finite-dimensional sub- spaces of Hq{Q) satisfying (9.14). Recall from Section 7 that the qt approximate eigenvalues 4„h? ■ "*,+«,■-!,* converge to Xki. While these approximate eigenvalues may be equal, i.e., we may have one distinct eigenvalue with multiplicity qt, consideration of the situation in which we choose Sh to be S1(t), where т is the triangulation shown in Fig. 10.8, shows that they may not be equal, since some of the symmetries present in A0.47) are not present in the discrete problem. Nevertheless, Theorem 10.5 provides estimates for each of the errors \Xki+j_ XM,..., &ki+J-1 \,j = 1,. ■ •, qt- As we have seen the estimates are \^ki+j-l,h ^-ki + j-ll ^Csl = C sup inf ||«-zlli,n _ueMUki) X^SHt) , j=\,...,qh A0.48) which suggest that the error in Xhk.+j_x depends on the degree to which Sl{x) can approximate all of the eigenfunctions corresponding to Xk.. Recall that in Section 9 (Theorem 9.1) we proved refined estimates, namely, IVy-M-Vj-il^uW. ./ = 1,■••.«£, (Ю.49) where Eu{h)= inf inf Ци-xlli.o, Ф, "*„*)= •••=fl(«,Mti+j-2,*) = 0. Now for the specific problem A0.47), all eigenfunctions have the same smoothness properties and Sl(r) with % given in Fig 10.8, will approximate them all with the same asymptotic accuracy and A0.48) and A0.49) would each lead to the same estimate in terms of h. The multiplicative constants in the estimates could, however, be different. We further note that there are eigenvalue problems for which the different eigenfunctions corresponding to a multiple eigenvalue have strikingly different approximability properties. For such problems A0.49) would provide a striking improvement over A0.48). As an example of such a problem, consider m (x) u{ — n) = u(n), 1 , —и <f> (-я) = ~1 ' —и' JP — k<p'(x)u, xel = (—n,n), (я), A0.50)
742 I. BabuSka and J. Osborn Chapter III where <p(x) = 7t"°!|x|1+'zsgnx, 0<a<l. It is easy to check that the eigenvalues and eigenfunctions are as shown in Table 10.2. We see that Aj =A2, A3 = A4, etc. Table 10.2. Eigenvalues and eigenfunctions of the eigenvalue problem A0.50). %i Щ 0 0.0 1 1 1.0 cos <p(x) 2 1.0 sin <p(x) 3 4.0 cos 2cp(x) 4 4.0 sin 2ф) We cast this problem into the variational form A0.1) by choosing f (f(«'J Y/2 f H = <u{x): \\u\\ =1 —r^x <oo,u(—7t) = uGc), <p'udx = 0^, а(н, v) -- u'v'— dx and b(u, v) = \ uvw' dx. With these choices (8.10) is equivalent to A0.50), with the understanding that the eigenpair (A0) u0) = @,1) of A0.50) is not present in (8.10). Note that ||u|| = ||u||e. Let ||и||ь=(Ь(ы, u)I/2. The assumptions made in Section 9 are clearly satisfied. Our approximation is defined by (9.4) with = {ueH:u linear on (—к+jh, —n+(j+l)h),j=0,l,. ..,n —1}, where h = 2%/n and и is an even integer. Now with this choice for {Sh} it is easily seen that inf ||cos<p(x)-xll2^C/i2 A0.51) and inf |jsin cp(x)-x\\2a-Ch1+'' A0.52) XsSh Hence from Theorem 10.1 we would expect Aiifc and А2,й, the two approximate eigenvalues that converge to the double eigenvalue Ax =A2, to have different convergence rates. From Table 10.3 we can find the errors in liih, i= 1,2, 3,4, for а = 0.4. These errors
Section 10 Applications 743 n Table 10.3. Numerical solution of the eigenvalue problem A0.50) for a=0.4. 16 32 64 128 256 1.0716754 1.1115481 5.0394692 5.2414639 1.0175850 1.0352431 4.2691915 4.3385100 1.0043740 1.0113741 4.0666055 4.0936974 1.0010921 1.0038431 4.0166006 4.0272875 1.0002729 1.0013431 4.0041468 4.0083380 1.0000682 1.0004811 4.0010365 4.0026645 0.2704 0 0.3423 0 0.1075 +1 0.1191 +1 0.1329 0 0.1881 0 0.5259 0 0.5869 0 0.6618 -1 0.1067 0 0.2589 0 0.3067 0 0.3305 -1 0.6202 -1 0.1289 0 0.1653 0 0.1651 -1 0.3665 -1 0.6440 -1 0.9135 -1 0.8255 -2 0.2193 -1 0.3217 -1 0.5162 -1 0.5637791 0 -0.4151973 -13 0.5558919 0 0.5022638 -13 0.5641633 0 -0.8916589 -12 0.5636643 0 -0.2689727 -12 0.5641879 0 0.1323421 -10 0.5641561 0 -0.7375504 -10 0.5641895 0 0.8670648 - 9 0.5641875 0 0.1415775 - 8 0.5641895 0 0.3251219 - 7 0.5641895 0 -0.9705611 - 8 0.5641896 0 0.7269570 - 6 0.5641896 0 -0.2601000 - 6 -0.1124891 -16 -0.5636998 0 0.1317809 -12 0.5516234 0 0.1596754 -12 0.5641519 0 0.1124328 -13 0.5637697 0 0.6411454 -11 0.5641830 - 0 0.1970954 -10 0.5641613 0 0.7729760 - 9 0.5641883 0 0.3641341 -10 0.5641858 0 0.4535626 - 7 0.5641893 0 0.4409247 - 7 0.5641890 0 0.8070959 - 5 0.5641895 0 0.6435344 - 6 0.5641895 0 1.5562955 1.1943249 2.0041570 1.2575063 2.6003887 1.4067517 3.5190001 1.6437659 4.9215830 2.0.107071 7.0542522 2.5706705 tr о (Г се ш 16 32 64 128 256 NUMBER OF INTERVALS n Fig. 10.12. The error in the eigenvalues Яьк, Хгл and A3ift, Я4,л for я = 0.4 in dependence on the number of intervals n.
744 /. BabuSka and J. Osborn Chapter III are plotted in Fig. 10.12 in log-log scale. We clearly see the different rates of convergence, specifically seeing the rates hz and h1+* = h1A for the errors in AijA, for i=l, 3 and j = 2, 4, respectively, as suggested by A0.51) and A0.52). It should be noted that the estimates presented in Theorem 10.1 are of an asymptotic nature in that they provide information only for small h (or large n), i.e., for h (or n) in the asymptotic range. From Fig. 10.12 we see that for a = 0.4 we are in the asymptotic range quite quickly, say for nM6. Consider ulth and и2,и, the approximate eigenfunctions corresponding to llyh and X2,h, respectively, normalized by |[-|[b = 1. The results of Section 9 suggest that и1>й should be close to Ccos<p(x) and u1M close to Csin<p(x) (cf. A0.51) and A0.52)), where С is such that Csin<p(x) and Ccoscp(x) are normalized by |Н|Ь = 1, i.e., С=it"'2. To illustrate this point we have computed Cf and Cf, i= 1,2,3,4, so that K(i) = II«j,* - Q cos (p(x) - С2 sin (p(x)\\a, i = 1,2, \\uUh - Cf cos 2cp(x) - Cf sin 2cp(x)\\a, i = 3,4, is minimal. We would expect that C^.C^C^.C^aO A0.53) and Cf = Cf = Cf = Cf =■ С = 0.564189583 .... A0.54) Table 10.3 shows some of the results for a=0.4. We see clearly the results predicted in A0.53) and A0.54). Table 10.3 also shows that K{l)<KB) and K{3)<KD), as we would expect. The last column in Table 10.3 and Figure 10.8 show that the ratios ^■i+l.ft ~^i+l •_ 1 -3 Uh~M increase as /i->0. This shows that in the whole /i-range we considered, the approximate eigenvalues converging to a multiple eigenvalue are well separated. Consider next the case when a = 0.01. Table 10.4 presents the same results for a = 0.01 as Table 10.3 does for a = 0.4. Figure 10.13 shows the graph of log— —, 1 = 1,3, as a function of the number of intervals n in a semilogarithmic scale. The computed values are indicated by • and x. The graphs are formed by interpolation (solid lines) and extrapolation (dotted lines). We note three related phenomena that did not occur with a=0.4. For small n the approximate eigenfunction associated with A1>A is u1:h^n~ll2sin q>(x), in contrast to ulth~n~ll2cos (p(x) when a = 0.4. We remark that rc~1/2cos q>(x) is more easily approximated by Sh than is 7t/2sin cp(x) for all 0 < a < 1. This anomaly is present for n < 64 but for n ^ 128 we get results which are in agreement with the (asymptotic) results in Section 9. For A3,j, and А4,л we have to take и ^256 to get results which agree with the asymptotic theory. For a = 0.01 we see that KB)<XA) for small n (n^64) and KB)>XA) for large
Section 10 Applications 745 Table 10.4. Numerical solution of the eigenvalue problem D.1) for a=0.01. къ K(i) C? c«> 16 32 64 128 256 1 1.0520268 2 1.0529172 3 4.8576239 4 4.8717141 1 1.0128661 2 1.0130098 3 4.2088367 4 4.2106542 1 1.0032139 2 1.0032360 3 4.0515675 4 4.0518629 1 1.0008063 2 1.0008077 3 4.0128623 4 4.0129086 1 1.0002018 2 1.0002031 3 4.0032196 4 4.0032230 1 1.0000504 2 1.0000515 3 4.0008054 4 4.0008079 0.2338 0 0.2268 0 0.9593 0 0.9615 0 0.1223 0 0.1052 0 0.4650 0 0.4577 0 0.7274 -1 0.3568 -1 0.2384 0 0.2205 0 0.5369 -1 0.3398 -1 0.1343 0 0.9792 -1 0.4196 -1 0.4775 -1 0.9166 -1 0.9745 -2 0.4372 -1 0.4614 -1 0.5011 -1 0.7741 -1 0.8181940 -11 0.5645965 0 -0.9346720 -13 0.5604533 0 0.8717399 -10 0.5647369 0 0.2507177 -10 0.5642694 0 -0.9345818 -9 0.5647430 0 0.3745461 -9 0.5644172 0 -0.1311961 - 5 0.5647430 0 0.2743681 - 7 0.5644235 0 0.5647430 0 0.7414162 - 6 0.2379072 - 6 0.5644235 0 0.5647429 0 -0.1553659 - 4 0.5644234 0 0.1165012 - 5 0.5634386 0 -0.2916448 -11 0.5597529 0 0.1167277 -11 0.5635957 0 -0.8480131 - 9 0.5636658 0 -0.3101833 -10 0.5636031 0 0.1273043 - 7 0.5638178 0 -0.4115544 - 9 0.5636032 0 0.2462939 - 7 0.5638240 0 0.3196172 - 8 0.3356056 - 5 0.5636032 0 0.5638239 0 0.1197135 - 5 0.1061527 - 4 0.5636031 0 -0.2123278 - 4 0.5638238 0 1.0171143 1.0164293 1.0111689 1.0087030 1.0068764 1.0057284 1.0017363 1.0035997 1.0064420 1.0010560 1.0218254 1.0031040 4.10 16 32 64 128 256 NUMBER OF INTERVALS n Fig. 10.13. The graphs of log (Я2.*—^VWi,*—^i) ancl log (Я4,А-Я4)/(Аз,л—Х3) for a=0.01 in dependence on the number of intervals n.
746 I. BabuSka and J. Osborn Chapter III n and KD)<KC) for small n (ns£l28) and КD)>ЩЗ) for large n. Recall that KB)>K(l) and KD)>KC) for all n when <x=0.4. Finally we note that when a=0.01 the ratio —: y—, ' = 1,3, first decreases as n increases, then for some n the two eigenvalue errors become equal, and then the ratio increases again. This is in contrast to the case for a = 0.4, in which the ratio increased over the whole range of n values. We further note that the value n for which the eigenvalue errors are equal—n~70 for i — \ and n~ 160 for f = 2 (see Fig. 10.9)—marks a transition in each of these situations from u1>hs; n~ll2sin(p(x) to ui<hozn~l,2cos(p(x) and uXh^n~il2sm2(p(x) to uXh^%~ll2cos2cp{x), from KB)<K(l)'and KD)<KC) to KB)>K{1) and KD)>KC), and from (^•i+i,* —^i+i)/(^i,* — Л-;), i- 1,3, decreasing to increasing. We have thus seen that for a = 0.4 the numerical results are in concert with the (asymptotic) results in Section 9 for the whole range of n considered, while for a=0.01 they are in disagreement for small n, but are in agreement for large n. We now make an observation that further illuminates these two phases of error behavior—the pre-asymptotic and the asymptotic. Toward this end we note that if (A1; иД with Hi*! ||,,= 1, and (Я1гА, «liA), with ||uliA ||ь = 1, are first eigenpairs of (9.1) and (9.4), respectively, then 0^Я1,й-Я=||и1,л-м||^-А1||м1,й-и1||^ = inf [Hz-MillJ-^llz-MiL2]. A0-55) JteSh IXllb = l If At is a multiple eigenvalue, then the ux in A0.56) can be any corresponding eigenvector with ||u1||Jl = l. (Note that we are here assuming ux and u1Jt have ||-||b-length equal 1, whereas in (9.2) and (9.5) they are assumed to have ||'|| „-length equal to 1.) The first inequality in A0.55) follows from the minimum principle (8.35) and has already been stated in (8.42). The first equality in A0.56) follows immediately from Lemma 9.1 with (Я, u)^^^ ut), w = uuh and Х=а(м1>л, ulth) = klih. If jeSh with ||xllb= 1, then from the minimum principle (8.35), KH-h^a{bl)-Xv A0.56) Again from Lemma 9.1, this time with^, м) = (Я1,м1), w = % and X=a(x, %), we have «o(Z, Z)-A1 = |lz-«iL2-A1|lz-«ill»- (Ю.57) The second equality in A0.55) follows from A0.56) and A0.57). It is clear from the above discussion that uy can be any eigenvector corresponding to kx. From A0.55) we have Кн-^\\х-иА1-кА1~иАъ VZ6Sft with ||jc||b=l. (Ю.58) If x is ||i0-close to uj, to be more precise, if x is taken to be the а-projection of ut onto Sh (cf. (8.17)), then the second term at the right-hand side of A0.58) is negligible
Section 10 Applications 747 with respect to the first term. This follows from the compactness assumption made in Section 9. On the other hand, if \\ui — x\\a *s not small, X1<h — Xt may still be small because of cancellation between the two terms on the right-hand side of A0.58). Regarding the case a=0.01, this explains why for h large (the pre-asymptotic phase), we can have «liA~7t_1/2sin q>(x) and K{l)>KB), and yet have AM, the approximate eigenvalue associated with u1<h, closer to Xx than is Х2Л, the approximate eigenvalue associated with u2th — it-1/2cos cp(x), while for h small (the asymptotic phase), we have Uithc^n~ll2cos cp(x), K(l)<K{2) and Xl<h closer to Xx than is X2M, showing that the eigenvalue error, Xlyh — Xh is governed by inf,,eSJx —"ill2- The analysis of example A0.50) we have presented is taken from Babuska and Osborn [1987]. 10.3. Eigenvalue problems for general second-order elliptic operators We consider here the approximation of the eigenpairs of general second-order elliptic operators. This problem is, in large part, similar to those discussed in Sections 10.1 and 10.2; we will thus be brief, discussing in detail only those issues that have a treatment in this case that differs from that for the L-shaped panel or the membrane, or those issues that did not arise with those problems. Consider the eigenvalue problem (Lu)(x) = A(Mu)(x), xefi, A0.59) (Bu)(x) = 0, xeT=dQ, where Q is a polygonal domain in U2, L is given in C.2), M in C.3), and В in C.4), L is assumed to be uniformly strongly elliptic (cf. C.5)), atj bh с and d to be bounded and measurable, and d to be bounded below by a positive constant (cf. Section 2). In Section 2 we saw that A0.59) has the variational form (8.10) (cf. C.15) and C.17)), with H1=H2 = Ho(Q) in the case of Dirichlet boundary conditions and H1=H2=H1(Q) in the case of Neumann conditions, and a and b given in C.11). Equations (8.1)—(8.3) hold, (8.2) and (8.3) being a consequence of C.5), provided Rec(x)>Uo + $b2/a0, A0.60) where b= max \bi(x)\ ;=1,2 (cf. 3.14). We remark that A0.60) can be easily achieved. It does not hold for the given operator L, L can be modified, by adding an appropriate multiple of d(x) to c{x), so that it does hold. This change shifts the eigenvalues and leaves the eigenfunctions unchanged. We also see that (8.7) is satisfied with Wx = W2 = L2@). Thus A0.59) has the form of the problem analyzed in Section 8. We remark that in this subsection, since we are not imposing any selfadjointness assumptions, the spaces H and S1AhiP) = S2,(*,p) must be taken to be complex and the eigenvalue parameter X must be considered complex. As we have seen, the selection of the trial and test spaces S1AKp) and S2i(h<p> is
748 I. BabuSka and J. Osborn Chapter III guided by the regularity properties of the exact eigenfunctions and adjoint eigenfunctions. In general, determining this regularity and then using it to choose effective trial and test spaces is a delicate task. The regularity can depend on the coefficients in the differential equation, e.g., on where they have discontinuities and where they are smooth, and on the domain, as we have seen with the L-shaped panel. We will not go further in this direction, but will instead assume the eigenfunctions belong to Hkl(Q) and the adjoint eigenfunctions to Hk2{Q), and select trial and test spaces so as to reflect this assumption. Remark 10.5. For eigenvalue problems with rough coefficients, which arise in the analysis of vibrations in structures with rapidly changing material properties (such as composite materials) it is known that the eigenfunctions do not lie in any high- order Sobolev space. Nevertheless, for one-dimensional problems, their regularity can be understood and, based on this understanding, one can select trial and test spaces that lead to very accurate and robust approximations. These trial and test spaces are not of the usual polynomial type, but instead closely reflect the coefficients. For details see BabuSka and Osborn [1983, 1985] and BabuSka, Caloz and Osborn [1990]. Cf. also Section 11.3. Remark 10.6. The mathematical study of the use of regularity information for the optimal selection of trial and test functions belongs to the area of complexity and information-based approaches. See, e.g., Wozniakowski [1985]. Based on the information that the eigenfunctions lie in Hki{Q) and the adjoint eigenfunctions in Hk2(Q), with kuk2>l, it is appropriate to discretize (8.10) by choosing |Sg(r) for Dirichlet conditions, S(M, = Si.<*.j» = Sun,,) = |sp(t) f0f Neumann conditions, as in Section 10.2, where теу and у = {т} = {тл} is a family of a-regular triangulations of Q. Equation (8.11), with P(h)=±a0, and (8.12) follow from C.5). Equation (8.13) follows from A0.40). Equation (8.14) (or (8.15)) can now be considered and from it we get eigenpairs (A№>p), щКр)) which serve as approximations to the eigenpairs (A, u) of A0.59) (or (8.10)). The errors in the approximate eigenpairs can be estimated with the results of Section 8. Let A be an eigenvalue of A0.59) (or (8.10)) with algebraic multiplicity m (by which we mean that A-1 is an eigenvalue with algebraic multiplicity m of the compact operator T introduced in (8.8)). Recall that M(A) = the unit ball (with respect to Я1^)) in the space of generalized eigenfunctions associated with A, and M*(A) = the unit ball in the space of generalized adjoint eigenvectors associated with A.
Section 10 Applications 749 From A0.40b) in the case of the Dirichlet problem and A0.40a) in the case of the Neumann problem we have %.,p)(A) = sup inf \\u-x\\i,a ^(A"'-1/^-1) sup \\u\\kuQ, ueM(k) £(*,p)W= SUP inf \\v-r\\\i,n veM*(X) qeSh.p where fi1 = min(p +1, kx), and up inf \\v ^а^-1/?) sup iiuiitJ>fl, геМ»И where /x2 = min(p + l, fc2). Let A^^p),..., Am,(A>p) be the eigenvalues of (8.14) that converges to A, let MKp(A) = {u: и in the direct sum of the generalized eigenspaces of (8.14) corresponding to the eigenvalues Л1>(Л,р),..., ЛтЛН,р), ||u||i,D = 1}, and let a = ascent of (A ~* — T). Applying Theorem 8.2 we have 1 m N _1 ^ C£(A]P)(A)£^p,(A) ^С(Л"+«-2/р*,+*2-2) sup \\u\\kuM sup Nt2,n. A0.61) иеМ(Я) veM*(X) In light of Remark 8.1 we also have 1 m $C(A"+«-2/pt'+fa-2) sup ||u||kl,m sup ||p||tl>e. A0.62) ueM(A) ceM*(A) From Theorem 8.3 we obtain |A — Aj>(/,jP)| <СГ'+«-2/Рк,+*!) sup HulL^ sup INk2,QI/a. A0.63) ы<=М(Л) i;eM*(A) Regarding eigenfunction estimates, we apply Theorems 8.1 and 8.2. From Theorem 8.1 we have S(M(X),M{Kp)(X))^C(h'"-1/pk'-1) sup !NUb«. A0.64) ueM(A) Let A(/IjP) be an eigenvalue of A0.1) (or 8.10)) such that limA_0iP^00A(A, p) = A and let w(htP) be a unit vector satisfying
750 /. BabuSka and J. Osborn Chapter HI (А(й, p) l — Г)'1 w(h>p) = 0 for some positive integer lt ^ a. Then, from Theorem 8.4, for any integer l2 with lY^l2^&, there is a vector u{Kp} such that (X~l — T)l2uM = 0 and II M(fc,p) — w(/i,p) lli.fi ^СР"/?*1-1) sup ||и||4ь0)('а~', + 1)/". A0.65) ueM(A) Remark 10.7. In this section we have considered triangular meshes. One could also consider quadrilateral meshes, which are a generalization of the type of mesh employed in Section 10.1, or curvilinear meshes. Since these generalizations properly belong to approximation theory we will not pursue them. We refer the reader to Ciarlet [1978], Babuska and Guo [1988b], and Szabo [1986]. Remark 10.8. We have mentioned here only estimates based on the information that ueHk(Q). If we know, e.g., that ue&j(Q), then we can say more, provided a proper mesh is selected. Remark 10.9. The approximate eigenvalues XjAhiP) here, as in any finite element method, are defined by the eigenvalue problem (8.14), which involves integrals over the domain Q. In practice these integrals often must be evaluated (approximated) by quadrature formulas. For estimates of the eigenvalue error due to this quadrature error we refer to Fix [1972] and Banerjee and Osborn [1990]. In Banerjee and Osborn [1990] it is shown that to preserve optimal order for eigenvalue error more quadrature points than for source problems are required. We note that the use of a finite element method in conjunction with a quadrature method often leads to a finite difference method for eigenvalue approximation. For example, if we approximate the eigenvalues of — Au = Xu in Q, и = 0 on Г with the finite element method corresponding to p = 1 and a uniform triangulation (cf. Fig. 10.8) and evaluate the resulting integrals with an appropriate quadrature formula, we obtain the standard five-point difference eigenvalue approximation for the Laplacian (cf. Section 5). This observation is due to Courant [1927, 1943]. For further results on finite difference methods we refer to Polya [1952], Hersch [1955, 1963], Weinberger [1956, 1958, 1974], Hubbard [1961, 1962], Kuttler [1970], and Kreiss [1972]. Remark 10.10. Since the eigenvalue АЛ(/1;Р) are defined by a Ritz method, they are upper bounds for the exact eigenvalues Xf. А7-<АЛ(А>Р) (cf. (8.42)). If we could derive a lower bound X,-,(a,p), then one would have bracketed Ay. Much attention has been directed to the derivation of lower bounds. Weinstein [1935, 1937, 1953, 1963] developed the method of intermediate problems. Many
Section 11 Applications 751 authors have contributed to the development of this and other related variational methods. We mention D.H. Weinstein [1934], Aronszajn and Weinstein [1942], Aronszajn [1948,1949-50], Weinberger [1952,1956,1959,1960], Bazley [1959], Bazley and Fox [1961,1963]. In addition we mention the monographs by Collatz [1948], Weinstein and Stenger [1972], and Weinberger [1974]. Remark 10.11. Most books and monographs that treat finite element methods contain a section or chapter on eigenvalue problems. For a survey of books and monographs on finite element methods we refer to Noor [1985]. Of the more mathematically oriented of these, we mention Strang and Fix [1973], Oden and Reddy [1976], and Oden and Carey [1982]. 11. Approximation by mixed methods In Section 3 we saw, in terms of an example, how eigenvalue problems can be given mixed formulations. Mixed formulations can be discretized and thereby lead to approximation methods referred to as mixed finite element methods. In this section we discuss three such methods. We begin by presenting an abstract result designed for the analysis of mixed methods. Remark 11.1. Mixed methods for source problems have received considerable attention. We mention Hermann [1967], Glowinski [1973], Johnson [1973], Oden [1973], Brezzi [1974], Ciarlet and Raviart [1974], Mercier [1974], Scholz [1976], Raviart and Thomas [1977], Brezzi and Raviart [1978], Falk [1978], BabuSka, Osborn and PitkAranta [1980], and Falk and Osborn [1980]. 11.1. An abstract result Let V, W, H and G be four real Hilbert spaces with inner products and norms (•, -)v, II • \\v, (", -V, II • Hit, (•, • )h, II • IIh and(-, • )G, \\ • ||G, respectively. We assume VcH and Wc G. Let A(a, ф) and В(ф, и) be bilinear forms on H x H and VxW, respectively, that satisfy Мг^КСИкЫМ!* Ч<т,феН A1.1а) and \В(ф,и)\^С2\\ф\\у\\иУ WeV, ueW. A1.1b) We assume A(a, ф) is symmetric and satisfies A(ff,ff)>0 У0#стбЯ, A1.2a) and assume sup|BftM)|>0 VO^ueW. A1.2b)
752 I. BabuSka and J. Osborn Chapter III We then consider the following eigenvalue problem: {tr,u)eVxW, (а,и)#@,0), А(а,ф) + В(ф,и) = 0 ЩеУ, A1.3) B{o, v)=— X(u, v)G Vt; e W. A discretization of A1.3) is obtained by selecting finite-dimensional spaces VhczV and Wh <r W and considering the approximate eigenvalue problem (oh, uh) eVhx Wh, (<тк, и„)#@,0), А{о„,ф) + В{ф,иь)~0 V^4 A1.4) B{ah, v)=~Xh{uh, vH Vi>e W„. We then view (Aft, {ch, uh)) as an approximation to A, (a, u)). Given bases for Vh and Wh, A1.4) becomes a matrix eigenvalue problem. Remark 11.2. If we let a({a, и), (ф, v)) = A{a, ф) + В{ф, и) + B(a, v), b({o,u)),{il/,v)= -{u,v)G, and H=VxW, then A1.3) can be written as (o,u)eH, (а,и)Ф{0,Щ A1.5) a{(a, и), (ф, v)) = ЯЬ((G, и), (ф, v)) Щ, v) e Я, which has the form of (8.10). Also A1.4) has the form of (8.14) with SUh=S2tll = Vh x Wh. Problems A1.3) and A1.5) do not, however, satisfy all of the hypotheses of the results in Section 8. We thus need an alternative analysis. This will be provided by Theorem 11.1, which is based on the results of Section 7. Note that even though the methods considered in this and the next section are not covered by the results of Section 8, it is still useful to discuss them, to the extent possible, in terms of the basic steps A), (Г), B'), and C) introduced in Section 10. In order to estimate the error in the approximate eigenpairs (kh,{oh,uh)) we consider the associated source and approximate source problems: Given g e G, find (a, u) e V x W satisfying А(а,ф) + В(ф,и) = 0 УфеУ, A1.6) B(c,v)=-(g,v)G VveW. Given g e G, find (ah, uh) e Vh x Wh satisfying А(а,,,ф) + В(ф,ик)=0 ЩеУн, A1.7) B(ok,v)=-(g,v)G 4veWh.
Section 11 Applications 753 We assume A1.6) and A1.7) are uniquely solvable for each geG. We then introduce the corresponding component solution operators: S-.G-+V, Sg = o, A1.8a) Sh:G->V, Shg = ah, A1.8b) T.G^G, Tg = u, A1.8c) Th:G^G, Thg = u„, A1.8d) where (a, u) and {ah, uh) are defined by A1.6) and A1.7), respectively. (Note that the T introduced here is different from that introduced in (8.8).) The eigenpairs (Я, (a, u)) of A1.3) can be characterized in terms of the operator T. Before establishing this we note that k = A(a, a)/(u, u)G, which shows that /l>0. This follows from A1.3) and the observation that both components и and a of an eigenvector are nonzero. Now, if (А, (с, и)) is an eigenpair of A1.3), then кТи = и,иФ0, and if XTu — u,иФО, then there is a ae V(a = S(Xu)) such that (Я, (a, u)) is an eigenpair of A1.3). Thus X is an eigenvalue of A1.3) if and only if X~l is an eigenvalue of T. The correspondence between eigenvectors is given by u<->(o-, u). In a similar way the approximate eigenvalues defined by A1.4) can be characterized in terms of the eigenvalues of Th. kh is an eigenvalue of A1.4) if and only if Aj^1 is an eigenvalue of Tk; the correspondence between the eigenpairs is given by uh++(ah,uh). We assume lir-rj|cc->0 as/i-^O, A1.9) where, for an operator A: D{A){cX)-+ Y, we let \\A\\xy= sup \\Aw\y\M\x. (In particular, we assume Tis a bounded operator on G.) Since dim R(Th)<co for each h, the Th are compact and A1.9) thus implies Tis compact. We also note that Tis selfadjoint on G. This is seen as follows. Let v=Tf in the second equation in A1.6) to obtain B(Sg,W=-(g,Tf)G. Again consider A1.6), but with g replaced by/, and let ф — Sg in the first equation to get A(Sf,Sg)+B(Sg,Tf) = 0. From these two equations we have @,Tf)G = A(Sf,Sg) Vf,geG. A1.10) Using A1.10) and the symmetry of A we get (Tg,f)e = (/". t9)g = A{Sg, Sf) = AiSf, Sg) = (g, Tf)G, showing T is selfadjoint. In a similar way we see Th is selfadjoint.
754 I. BabuSka and J. Osborn Chapter III We now apply Theorems 7.3 and 7.4 to the operator T and family of operators {Th} on the space G. By virtue of the correspondence between the eigenpairs of T and Th and those of A1.3) and A1.4) we will thereby obtain estimates for the errors in (Afc, (ah, uh)). The hypotheses have all been shown to be satisfied; cf. Remarks 7.5 and 7.6. Let A be an eigenvalue of multiplicity m. Since ||Г— Th\\GG-*0 we know that m eigenvalues Aj~j,..., A„j of Th converge to A-1. Since T and Th are self- adjoint the relevant ascents are one and all eigenvalues have equal geometric and algebraic multiplicities. Let A?(A_1) be the eigenspace of T corresponding to A-1. Recall that M—M(A~1) = R(E), the range of the spectral projection E associated with T and X~l. We have denoted this space by M to distinguish it from the set M of normalized eigenvectors introduced in Section 8. Theorem 11.1. Under the assumptions made above, there is a constant С such that \l-kb\^C{\\(S~Sh)\M\\2GH + \\{S-Sh)\uhv\\(T-Th)\u\\GW + \\(T-Th)\u\\2GG}, 1=1,..., m. A1.11) Proof. Let «i,..., um be an orthonormal basis for М(Х'г). From Theorem 7.3 with a = 1 we have lA^-AjVlsScj £ \{{T--Th)uhUj)G\ + \\(T~Th)\u\\lG\, A1.12) 1=1, ...,m. For gJ'eG we estimate ({T—Th)g,f)G. Adding the two equations in A1.6) and recalling the definitions of Tg and Sg in A1.8) we find {g, v)G = -A{Sg,ф)-Щ, Tg)-B(Sg, v) Щ,ь)вУх W. Setting v=(T- Th)f and ф=(8-5н)/ yields (g,(T-Tk)f)G = -A(Sg,(S-S„)f)-B((S-Sk)f, Tg)-B{Sg, (T-Tk)f). A1.13) Next note that substraction of the equations A1.7) from A1.6) (with g replaced by/) gives A((S-Sh)f,W + B(xl,,{T-Th)f) + B({S~Sh)f,v) = 0 W,v)eV„xWh. ( L ' Now, combining A1.13) and A1.14) and using A1.1) we have \(g,(T-Th)f)G\ = \A({S-Sh)f,5д~ф) + B((S-Sh)f, Tg-v) + B(Sg~ф,(Г-Th)f)\ ^c^(s-sb)fM\Sg-nii + C2\\{S~Sh)f\\v\\Tg-v\\w+C2\\Sg^\\v\\(T-Th)f\\w.
Section 11 Applications 755 Setting \j/~Shg and v=Thg gives \((TTh)g,f)G\ ^CJiS-S^fhWiS-S^gh + C2\\{S-Sh)f\\v\\{T-Th)g\\w + C2\\{S-Sh)g\\v\\{T-Th)f\\w. A1.15) Letting д = щ and /=«,- in A1.15) yields \((T~Th)uhUj)G\ ^CMS-SMlB + TCiUS-SbMevMT-TMew. A1.16) A1.11) follows immediately from A1.12) and A1.16). □ Theorem 11.2. Under the assumptions made above, there is a constant С such that \\u-uh\\c^C\\(T-Th)\M\\GG-' (П-17) Proof. This result is an immediate consequence of Theorems 7.1 and 7.4. Note that we have given this estimate the simplified form it has when к is simple, and it would have to be modified in the general case. Cf. the statement of Theorem 10.1 and (8.44H8.46). □ Theorems 11.1 and 11.2 were proved by Osborn [1979] and by Mercier, Osborn, Rappaz and Raviart [1981]. 11.2. A mixed method for the vibrating membrane We consider, as in Section 10.2, the vibrating membrane problem -Au = Xu infl, u = 0 опГ=9Д A1.18) where Q is a convex polygon in R2, but we will here give it a mixed variational formulation. Otherwise we will proceed in a parallel way, discussing in turn the steps A), (Г), B) and C) introduced in Section 10.1. We will clearly see how the variational formulation influences the entire approximation method. Before proceeding with the variational formulation, we introduce an additional function space. Let H(div, Q)= \G=(а1, a2): ox,a2 е H°(Q) and there exists 2 = div a e H°(Q) such that a • Vcj> dx dy = — tf II H(div, o) = [ffl + o\ + (div ffJ] dx dy. гфдхйу,^феС?@)
756 I. BdbuSka and J. Osborn Chapter III 11.2.1. Variational formulation Suppose (X, u) is an eigenpair of A1.18), by which we will mean 0#иеИ£(Й), VwVudxdy-X j Q uvdxdy Vi;eHo(fi), A1.19) i.e., we will assume A1.18) to have the variational formulation considered in Section 10.2. We now derive a mixed variational formulation for A1.18). Introduce the auxiliary variable From A1.19) we see that ereH(div,Q) and div a = — Xu. From A1.21) we get v div <r dx d у = — X UV&X&y VlNH°@) and from A1.20) and the definition of H(div, Q) we have a-i/j dxdy = IVicij/ dxdy = — и div \j/ dx d v V^ £ (div, Q). Combining A1.22) and A1.23) we obtain (a,u)e H(div, Q) x H° {Q\ {а, и) Ф @,0), a • ф dx dy + и div ф dx dy=0 V^ e H(div, Q), \v div a dx dy — — X j uv dx dy VveB°(Q). a n Now suppose (A, (a, uj) satisfies A1.24). Let й be the solution to Лй = Хи in О, й = 0 on Г, and let A1.20) A1.21) A1.22) A1.23) A1.24) A1.25)
Section 11 Applications 757 Then, by the argument used above, (a, u)eH(div,Q)xH°(Q), c-ij/dxdy+ \udivil/dxdy = 0 \/i//eH{div,Q), A1.26) n fi iV div a dx dy = — к \uv dx dy \/veH°(Q). n a Subtraction of the equations in A1.26) from those in A1.24) yields (a-a,u-u)e#(div, Q) x H°{Q), (a — ff)-i//dxdy+ (u-u)divij/dxdy=0 Vi/reH(div, Q), A1.27) v div((T - a) dx dy = 0 Vve H°(Q). In A1.27), if in the second equation we take v arbitrary in H°{Q) we get div(cr—a) = 0, and if we take ф = a—6 in the first equation we obtain 0 = \n(a—d) • (a — S) dx dy, which implies ег = <т. A1.28) Then the first equation in A1.27) implies (u-u)div^dxd.y = 0 V^eH(div,£2). A1.29) Let w satisfy Aw = u — u and let i/' = Vw in A1.29). Since div \j/ = u — й, this choice leads to и = й. A1.30) Equations A1.25), A1.28) and A1.30) show that (Я, u) is'an eigenpair of A1.18) (or A1.19)), and that cr = Vu. In summary, if (Я, и) is an eigenpair of A1.18) and a=Vu, then (Л, (u, a)) satisfies A1.24), and if (Я, (сг, и)) satisfies A1.24), then (Я, м) is an eigenpair of A1.18) and a - Vu. A1.24) is the desired mixed formulation. It is immediate that A1.24) has the form of A1.3) with V= Я(^у, Q), W=G = H°{Q), H = H°(Q), A(a, ф)- o-tydx dy, В(ф, u)= \u div ф dx dy.
758 I. BabuSka and J. Osborn Chapter III Furthermore, A is symmetric and A1.1) and A1.2) hold. The symmetry of A and A1.1) and A1.2a) are trivial. To prove A1.2b), let w solve Jw=m and set ^=Vw. Then div \ji=u and we have sup il/eH(diw,Q) и div ф dx dy > и div \jt dx dy u2dxdy>Q> for О^иеH°{Q), which proves A1.2b). From the fact that A1.18) has a sequence of positive eigenvalues and from the correspondence between the eigenpairs of A1.18) and A1.24) we see that A1.24) has a sequence of eigenvalues 0<A1<A2<---/'oo and corresponding eigenfunctions {a i, ux), (G2, u2\ .,., with Gj = VUj and with the (Aj-.Mj) being the eigenpairs of A1.18). 11.2.Г, Regularity of the eigenfunctions If {a, u) is an eigenfunction of A1.24), then и is an eigenfunction of A1.18) and a = Vu. Thus the regularity of (a, u) can be inferred from the regularity of the eigenfunction of A1.18), which was discussed in Section 11.2. 11.2.2. Discretization o/A1.24) and assessment of the accuracy of the approximate eigenvalues and eigenfunctions We will use a discretization of the general form of A1.4). It thus remains to select the subspaces Vh с H(div, Q) and Wh с H°(Q). This will be done with an eye toward ensuring A1.9) holds and the terms on the right-hand side of A1.11) in Theorem 11.1 are small. A mixed method approximation of the associated source problem (cf. A1.6) and (П.7)) has been proposed and analyzed in Raviart and Thomas [1977]. We will take their choice of trial and test functions. The source problem has also been analyzed by Falk and Osborn [1980]. Let T be the unit triangle in the (£, ??)-plane whose vertices are йх = A,0), a2 = @,1) and я3 =@,0). Then with pJsO an even integer and f associate the space Q-»+1*> of all functions ф=(ф1} $2) of the form Ф1 = ро\Д,П) + а0^1 + а1ё/п + - + ар12^2 + 1г11'12, Ф2 = ро1р(& г,) + Р0 г? +1 + Р, ^р + ■ ■' + h,i £Р/У/2 + \ where polp(£, ц) denotes an arbitrary polynomial of degree p and where Pl2 ; = o = 0,
Section 11 Applications 759 and with p^ 1 an odd integer and Г associate the space <2(p+1) of all ф of the form ^ = polp^^) + i50?7p+1 + i81^+-+Ap+i,/2^+1,/V''+1)/2, where (P+D/2 (P+D/2 I (-1L= I (-i)'A=o. i = 0 i = 0 We remark that for $eQlp+1\ фг and ^2 are polynomials of degree p+l. With a general triangle T in the (x, )>)-plane, we associate the space Q(p+1) defined by Q(/+" = ty: ^(x, y) = A/JT)BT$(F; x(x, y% fe 6(p+"}, where Fr(£,»/) = Br(«J, n) + fer is the linear transformation mapping Г onto T and JT=det(BT). Let у = {т} = {тл} be a family of a-regular triangularizations of fi. Then for p^O an integer let F„ = {^e#(div,0):^jre е?+1,УТбт4} A1.31a) and Wh = {ueH°(Q): u\T polynomial of degree p VTeTA}. A1.31b) Now we consider A1.4) with this choice for V„ and Wh. A1.4) will have eigenvalues and corresponding eigenfunctions (СГ1,Л> Mi,),), . . . , (<X|V,A> Mjv,)i), where JV=dim(Kft x Wh). It remains to derive error estimates by applying Theorems 11.1 and 11.2. Theorem 11.3. Let Vh and Wh be selected as in A1.31). Suppose the eigenfunctions of A1.18) belong to HP+2(Q). Then \\,H-*k\^C(p)h2»+2 A1.32) and \\uk,h-uk\\o,o^C(p)h"+i A1.33) Proof. We begin by showing that all of the hypotheses of Theorems 11.1 and 11.2 are satisfied. We have already noted that A is symmetric and that A1.1) and A1.2) are satisfied for the problem A1.24). The source problem A1.6) is uniquely solvable for each g e G = H°(Q). In fact the unique solution is {a, u\ where — Au = g, ueHo(Q)
760 /. BabuSka and J. Osborn Chapter III and (cf. the discussion in Section 11.2.1). To see that A1.7) is uniquely solvable for each g e G it is sufficient to show that g = 0 implies <xh and uh are zero. Now g = 0 implies, using the second equation in A1.7), that B(ah,v) = Q,Vve Wh. Setting ty = ah in the first equation and using this fact shows that A(ah,ah)=0 which, together with A1.2a), shows that ah=0. Then, using the first equation in A1.6) again we get В(ф, uh) — 0, Vi/r e Vh. For our specific problem this is Jfi uh div фбхйу — 0, 4\j/eVh. It is shown in Raviart and Thomas [1977, Theorem 4] that corresponding to any uhe Wh there is a \//eVh such that divi/ffc = wft. Using this ф we thus have !я|мл|2 dxd>> = 0 which implies uh =0. It remains to check A1.9). Falk and Osborn [1980, Section 3(d)] have shown that \\Ta-Ta\\ <IC*2™«» for P>1 ^Ch\\g\\0tQ forp^O, A1.34) which proves A1.9). We now apply Theorems 11.1 and 11.2. From Raviart and Thomas [1977, Theorem 5] we have l|(S-S»)»||„o@) = ||(S-S^||H(diy,o)^Ch''+1(IITff||JF+2,0 + ||e||J(+1,0) and 1!(Т-Г^||О1Й<С^+1(||Гд||р+21Й + ||0||р+1,п). If g e М{кь x), then Tg = k^x g and g is an eigenfunction of A1.18) corresponding to Xk and by our hypotheses, !Ы1р+2,п<°°- Thus \\{S — Sh)\jf\\Ho^)Ho(a)^:Chp , A1.35a) ]|(S~S,)¥|!HoOT,H(div,fi)^C/t"+1, A1.35b) II(T-TJ|mIIho(«),ho№)^C^+1. A1.35c) A1.32) follows immediately from Theorem 11.1 and estimates A1.35). A1.33) follows immediately from Theorem 11.2 and A1.35c). □ Remark 11.3. Theorems 11.1 and 11.2 estimate the errors in mixed method approximation of eigenpairs in terms of error estimates for the corresponding source problems. For our problem, these were mainly provided by the results of Raviart and Thomas [1977]. Note, however, the estimate A1.34)—the estimate that ensures the approximate eigenvalues converge—is not proved in Raviart and Thomas [1977].
Section 11 Applications 761 11.2.3. Solution of matrix eigenvalue problem The matrix problem A1.4) with Vh and Wh given in A1.31) is large and sparse, but is not positive-definite. 11.3. A mixed method for the vibrating plate The eigenvalue problem A2u = Xu in Q, м = 6и/6и = 0 in О A1.36) arises in connection with the small, transverse vibration of a clamped plate. A commonly used variational formulation of A1.36) is ueH2(Q), и#0, A1.37) AuAvdxdy = X uvdxdy УиеЯб(й). G A finite element method based on A1.37) would require trial and test space that were subspaces of Hl{Q), and this would require C1 -elements, i.e., piecewise polynomials that are C1 across interelement boundaries. In order to avoid this requirement we will use a different variational formulation for A1.37), one that permits the use of C°-elements. We do, however, use A1.37) to show that A1.36) has a sequence of eigenvalues 0<Я: <Я2< ••• foo and corresponding eigenfunctions Uj, U2,.■ ■ , which can be chosen so that lAUjAiijdx dy=Xj щUj dx dy = <5y. 11.3.1. Variational formulation Introduce the auxiliary variable <x= — Au. Then A1.36) can be written as a second- order system: a + Au=0 in Q, — Aa~Xu in Q, и=8и/8и=0 on Г. Multiplying the first equation by ф, the second by v, integrating over Q, and
762 /. BabuSka and J. Osborn integrating by parts leads to 0 = lenj/ dx dy + \Аиф dx dy Chapter III I Fu/8n)i o\(idxdy-\Vu-V\li+ Fu/8»# ds а г = [афдхйу- \Vu-V^dxdy Vi^eH1^) and X \uv dx dy = — n Aavdx dy Va • Vv dx dy — (da/dn)v ds Va-Vvdxdy VveHh(Q). Thus we arrive at the variational formulation (a, u)eHl{Q)x Hh(Q), (а, и) Ф @,0) | aij/dxdy- VtrV(^dxdj/=0 ЧфеНЩ, Vcr-Vudxdy= —X uvdxdy VveHo(Q). A1.38) We derived A1.38) formally from A1.36). One can, however, easily make the argument rigorous with the aid of a well-known regularity result: If w is the solution to A2w = f in Q, w = 6w/6n = 0 on Г, where Q is a convex polygon and/e H°(Q), then w e H3(Q) and ||w||3>0 <C||/||0fO, cf. Grisvard [1985] and Kellogg and Osborn [1975]. We assume Q is a convex polygon in the remainder of this subsection. Using this result we can show that if (A, u) is an eigenpair of A1.36) and о = — Au, then (X, (a, u)) is an eigenpair of A1.38), and if (Я, (a, u)) is an eigenpair of A1.38), then (Я, w) is an eigenpair of A1.36) and
Section 11 Applications 763 a= -Ли. Problem A1.38) has the form of A1.3) with V=HX{Q), W=H10(Q), H=G=H°(Q), А(сТ,ф)=: !г,н)=И аф dx dy, В(ф, и) = \Уф • Vu dx dy. It is easily seen that A is symmetric and that A1.1) and A1.2) are satisfied. A1.38) has eigenvalues and corresponding eigenfunctions ((т1,и1),(а2,и2),..., with (Xj = — Ли у 11.3.1'. Regularity of the eigenfunctions If (cr, и) is an eigenfunction of A1.38) then, as we have seen above, и is an eigenfunction of A1.36) and c= — Ли, and hence the regularity of (a, u) can be inferred from the regularity properties of A1.36). For results on this later regularity question we refer to Grisvard [1985] and Kellogg and Osborn [1975]. 11.3.2. Discretization of A1.38) and assessment of the accuracy of the approximate eigenpairs As in Section 11.2, our discretization will be via A1.4). For our specific problem, a mixed method for the associated source problem has been studied by Glowinski [1973], Ciarlet and Raviart [1974], Mercier [1974], and Falk and Osborn [1980]. We will use the same trial and test spaces employed in those papers. Let ? = {т} = {тл} be a family of a-regular, g-quasiuniform triangulations of Q. Then for p = 2,3,..., let Vh = S"(xh) A1.39a) and Wh=S№h)nHh(Q). A1.39b) We then consider A1.4) with these choices. We will have approximate eigenvalues, and eigenfunctions ^1.* ^ '" ^^N,h and where N=dim(Vh xWh). Theorem 11.4. Let Vh and Wh be as in A1.39) with p^2 and suppose the eigenfunctions
764 I. BabuSka and J. Osborn Chapter III o/A1.36) belong to Hp+l{Q). Then \k,H-h\^C(p)h2p~2 (И-40) and ll«w-Mo,fl<C(p)ft'. A1-41) Proof. The symmetry of Л and the validity of A1.1) and A1.2) for problem A1.38) have already been noted. The source problem A1.6) is uniquely solvable for each g e G=H°(Q). The unique solution is {a, u), where A2u = g, ueHl{Q) and a= —Ли (cf. the derivation of A1.38)). The unique solvability of A1.7) is easily checked. Falk and Osborn [1980, Section 3(a)] have shown that \\Tg-Thg\\0^Ch2\\Tg\\XQ. This, together with the regularity result mentioned above, gives 11(Т-ВД|0>0<СЛ2Ы|0(О, which proves A1.9). Thus, all of the hypotheses for Theorems 11.1 and 11.2 have been verified for the problem under consideration. Using the results in Falk and Osborn [1980, Section 3(a)], we have \\(S-ShHUji^Ch'-1\\Tg\\p+liO, \\(S~Sh)g\\i,o^Ch"-2\\Tg\\p+US2, \\(T-Th)g\\0tS2^CW\\Tg\\p+ua, \\(T-Th)g\\i,o^Ch"\\Tg\\^UQ, from which we obtain IKS - SJS I! „o„(fi) ^CH>-\ A1 -42a) \\{S-Sh)\s\\mQhHHQ)^Ch"-2, A1.42b) №-ThM\ma>.*>m<Ch>, A1.42c) ИСТ- Т„)\л ||№„,(fi) ^ Ch". A1.42d) A1.40) follows immediately from Theorem 11.1 and A1.42), and A1.41) follows from Theorem 11.2 and A1.42c). □ Remark 11.4. The estimates obtained in this subsection were first obtained by
Section 11 Applications 765 Canuto [1978]. We note, however, that the estimation techniques used here will yield an improvement over the estimates of Canuto in the case when the eigenfunctions have low regularity. Our method of proof does not yield any estimates for p = l. For this case, see Ishihara [1978]. 11.3.3. Solution of matrix eigenvalue problem See Section 11.2.3. For further results in eigenvalue approximation by mixed methods, and also by hybrid methods, we refer to Mercier, Osborn, Rappaz, and Raviart [1981], Mercier and Rappaz [1978], and Ishihara [1977]. Remark 11.5. We have seen in this section and in Section 10 that there are various methods available for the approximate calculation of the eigenvalues of a specific problem. For example, we have analyzed two methods for the membrane problem. Furthermore, this discussion, together with that in Section 3, shows that there are many more possibilities. Clearly the rational choice of a method for any particular concrete problem is important. The effective choice of a method is complex, depending on many aspects of the underlying problem. 11.4. A mixed method for a problem in one dimension with rough coefficients Consider the problem - (а(х)и'У = Щх)и, 0 < x < 1, Н@) = МA) = 0. ( ] This is a special case of the problem A.8), A.9a) discussed in Section 1. We will be especially interested here in the case in which the coefficients a(x) and b{x) are rough functions. Such problems arise in the analysis of the vibrations of structures with rapidly varying material properties, of composite materials, for example. In Section 3, we gave A1.43) the mixed formulation C.26) (or C.27)). In this subsection we analyze a mixed method based on C.27). Hence we consider the problem (<x,u)eL2@,l)xfli@,l), (а,и) тЧО,0), i (аф/а)йх~ u>dx = 0 УфеЬ2@,1), A1.44) i i av' dx — — X buv dx Vt> e Hi @,1), о о where a(x) is of bounded variation and b(x) is measurable and 0<ao^a(x)^a1, 0^bo^:b(x)^bl.
766 /. BabuSka and J, Osborn Chapter III A1.44) is of the form A1.3) with F=H = L2@,1), W^Hh@,l), G = L2@,l) with (и, v)G = buv dx, о А{а,ф)= I—die, B(ff,»)=- a i <n>' dx. о (Я, (<т, м)) is an eigenpair of A1.44) if and only if (Я, u) is an eigenpair of A1.43) and a = au'. We discretize A1.44) by letting t = {ri}fl(T1) be a mesh on [0,1], defining Vh = {a: a\Tj — constant, i = 1,..., М(т)} A1.45a) and W» = {v: v e Hl0(Q, 1), t;|T. =linear polynomial, i= 1,..., M(t)}, A1.45b) with /j = /i(t)= max diam Ti5 i=i,... ,лад and considering A1.4). The eigenpairs (lh, (ah, uh)) of A1.4) are then considered as approximations to the eigenpairs (Я, (а, и)) of A1.44). Although this approximation method satisfies the hypotheses of Theorem 11.1, a direct application of that result does not yield the best possible estimate. We will employ an analysis that is parallel to, but different than, that used in the proof of Theorem 11.1. The analysis begins by introducing the operators T,Th: G-*G and S,Sh: G->V that are defined in A1.8). Я is an eigenvalue of A1.44) if and only if Я-1 is an eigenvalue of T; the correspondence between the eigenfunctions is given by (a, u)*-*u. Likewise Xh is an eigenvalue of A1.4) if and only if klx is an eigenvalue of Tk, with the correspondence between eigenfunctions given by {ah,uh)<*-uh. \\T—Th\\GG-*0, as will be shown later, so we may apply Theorem 7.2 to T and Th on the space G. Let Я-1 be an eigenvalue of T. The eigenvalues of a problem of the type A1.43) are simple and hence Я is a simple eigenvalue of T. Thus one eigenvalue Ял of A1.4) converges to Я. By Theorem 7.2 we have \X-lh\<C{\{T-Th)u,u)G\+\\{T-Th)u\\i}, (П-46) where и is any eigenfunction of T corresponding to Я with |ju||c = l. We now proceed to analyze ((T— Th)u, u)G. From A1.6) we have ((T-T,Km)g= \bu(T-T„)uux
Section 11 Applications = -B(Su,(T-Th)u) = A(Su, (S- Sh)u) + B((S - Sh)u, Tu) -B(Su, (T- Th)u), and from A1.6) and A1.7) we get 0=A((S-Sh)u, 0+B((S- Sh)u, n) + Щ, (T- ВД, VrieWk, ZeVh. Combining A1.47) and A1.48) we get ((Г- Th)u, uH = A((S - Sh)u, Su + a + B((S-Sh)u,Tu + i1) + B(Z-Su,(T-ThH Vr,eWh, £eVh, which, letting rj= — Thu and £ = Shu, yields (G- Th)u, u)G=A((S-Sh)u, (S+SM = 2A((S - Sh)u, Su) - A((S - Sh)u, (S - SM ■ Now, again using A1.6) and A1.7) we get a(E - Sh)u, Su) = - B((S - Sh )u, Tu) = -B((S-SMTu-ShTu), where Ih Tu is the WA-interpolant of Tu, and hence, using l ShulTu-ShTuJ dx = 0 767 A1.47) A1.48) A1.49) and A1.6) we have A{(S - S„)u, Su) = - B(Su, Tu - Ih Tu) = bu[Tu - Z„ Тм] dx A1.50) Finally, combining A1.49) and A1.50) we get l l ((T-Th)u,uh=2 bu[Tu — Eh Tu] dx - \(S-Sh)u\: dx 2X 1 bu(u — Ehu)dx- \(S-SM: dx. A1.51) Now, using A1.50) and A1.46) we get
768 I. BabuSka and J. Osborn Chapter III |A-A„|^C i f bu(u~Zhu) dx о + ao1\\(S-Sh)u\\L2 + \\{T~Th)u\\l h (И-52) It remains to estimate the three terms on the right-hand side of A1.52). Recall that Zhu is the WVinterpolant of u. By a result of Prosdorf and Schmidt [1981] we know that \\u-Ihu\\L^Ch2Vl{u'\ A1.53) where Vq(u') denotes the variation of u'. Recall that u is an eigenfunction of A1.43) with || u ||G = 1. Since a{x) is of bounded variation, u' will be of bounded variation; in fact WKC, A1.54) where C = C(a0,ai,b0,b1, Vl(a\X) depends on a0,a1,bQ,bi, Vl(a) and X. Also llak-^C. A1.55) Using Holders inequality, together with A1.53)—A1.55), we get l bu(u — Ihu)dx <\\bu\\L4ti-Ihuh^Ch2V{a), A1.56) where C = C(a0,a1,b0,b1, Vl{a),X). Next we consider \\{S — Sh)u\\L2 and ||(T— Th)u\\L2. It is easily seen that the results in Falk and Osborn [1980] imply \\(S-Sh)u\\L2,\\(T-Th)u\\L2^C(a0,ai,b0,buX)h. A1.57) Note that A1.57) shows that ||r-rfc||GG-^0. Finally, combining A1.52), A1.56), and A1.57) we have: Theorem 11.5. Suppose X is an eigenvalue o/A1.43) (or o/A1.44)) and let Xh be the approximate eigenvalue defined by A1.4) with Vh and Wh defined by A1.45). Then |Л-Л»|<С(ао,а1,ЬоА,К&(а),Л)Л2. A1.58) The striking feature of estimate A1.58) is that the constant С depends on the bounds a0,ai,b0 and bx and on V\{a), but is otherwise independent of a{x) and b(x). This shows that the approximation method is effective for problems with rough coefficients (cf. discussion of alternate variational formulations at the end of Section 3). In fact, the rate of convergence indicated by A1.58) is the same as that for the usual Ritz method for problems with smooth coefficients. Estimate A1.58) was proved by Banerjee [1988]. The use of mixed methods for eigenvalue approximation for problems with rough coefficients was first suggested by Nemat-Nasser [1972, 1974]. Rate of convergence estimates for several such mixed methods were derived by BabuSka and Osborn [1978].
Section 12 Applications 769 Remark 11.6. It is of interest to note that the variable oh can be eliminated from A1.4) in the present context (i.e., with the choices for V, W, H, G, A, B, Vh and Wh we have made in this subsection) leading to the problem uh e Wh, A1.59) l atu'hv' dx=Xk buhvdx Vr e Wh, i=i J J г* о where aT is a step function with Thus A1.59) differs from the usual Ritz method only in that the coefficient enters the calculation through its harmonic averages over the subint°rvals of the mesh instead of through its averages. 12. Methods based on one-parameter families of variational formulations In our treatment of the membrane problem in Section 10.2, the trial and test functions satisfied the essential boundary condition и = 0 (cf. A0.42)). In fact, if one bases the approximation method on the usual variational formulation A0.39), one must impose the boundary condition on the trial and test functions. To avoid this, methods have been developed that use test and trial functions that are not required to satisfy essential boundary conditions. (See the discussion of essential and natural boundary conditions in Section 3.) In this section we discuss two such methods. They are both based on a one-parameter family of variational formulations. We will be rather brief and will not explicitly discuss each of the steps A), (Г), B), and C) of finite element approximation outlined in Section 10. 12.1. The least squares method Consider, as in Sections 10.2 and 11.2, the membrane problem -Au^ku infl, м = 0 опГ = 8й, A2.1) where Q is a bounded, open set with boundary Г, which, for the sake of simplicity, we assume to be of class C°°. Note that we are not assuming Q to be a polygon. Problem A2.1) has eigenvalues and eigenfunctions uuu2,
770 I. BabuSka and J. Osborn Chapter III We begin by introducing the least squares method for the corresponding source problem, ~Aw = f infl, w=0 опГ, A2.2) which is usually given the variational formulation (see Remark 12.1 for the reason for using complex functions here), w e Hh(Q), A2.3) VwVvdxdy= \fvdxdy Vt>etf£@). J n я We now give A2.2) a different variational formulation, w solves A2.2) if and only if weH2(Q), j AwAvdxdy + p \wvdxdy-- fAvdxdy, A2.4) г 4veH2(Q), V0<A<1, where /? = рл>1 is a parameter that approaches oo as /i->0. To pass from A2.2) to A2.4) is immediate. To go from A2.4) to A2.2) we proceed as follows. First take veH2 (Q) to satisfy Av = Aw + f in Q, v = 0 on Г. This choice for v in A2.4) yields — A w = / in Q. The equation in A2.4) then becomes vwdx = 0 VveH2{Q), which implies w = 0 on Г. In A2.4) the boundary conditions w=0 is not explicitly imposed. This is the major advantage of the formulation A2.4) over A2.3) for our purposes. We note that w can also be characterized by an extremal property: the solution w of A2.4) is the unique minimizer of the functional -Av-f\2dxdy + p \\v\2ds over veH2{Q). In order to discretize A2.4) we suppose we have a family у-{г} = {ть} of triangulations of Q', where Q' is some fixed rectangle containing Q. Then let S„ = S''2(t,,) = {не#2@'):и|г = ро1упогша1 of degree рУГетй}
Section 12 Applications 771 and let Sh consist of the restrictions of functions in Sp'2{ih) to Q. The family Sh satisfies the following approximation result: If p ^ 5, then inf tvWv-xhn^Ch'Mka for2<t<p+l. xeSh A2.5) J = 0 See Ciarlet [1978] for a proof of A2.5). Then we define an approximate solution wh to w by letting wh be the unique solution to wft e Sh, AwhAvdxdy+p whvds ~lfA' A2.6) dx dy Vve Sh. wh is called the least squares approximation to w since it can be alternately characterized as the unique minimizer of -Av-f\2dxdy+p \v\2 ds over veSh. Bramble and Schatz [1970] proposed and analyzed this method for p = ph=h~3. They also showed p = h~3 to be the optimal choice for p. Now we return to the eigenvalue problem A2.1). Proceeding in a similar way we see that A2.1) has the variational formulation ueH2(Q), AuAvdxdy + p Problem A2.7) is then discretized by Xh complex, 0 ф щ е Sh A2.7) uvds=— X \uAvdxdy VveH2{Q). AuhAvdxdy+p A2.8) uvds=— kh \uhAvdxdy VveSh. Problem A2.8) has eigenpairs (AJ>fc, uJih), j= 1,..., N, where N=dim Sh. lfforfeH°(Q) we define Tf=w and Thf—wh, where w and wh are defined by A2.2) (or A2.4)) and A2.6), respectively, then we easily see that (Л, м) is an eigenpair of A2.1) if and only if (p=X~1,u) is an eigenpair of Г and (Xk,uh)is an eigenpair of A2.8) if and only if (/Хн = Яй_1,Мл) is an eigenpair of Th. We will estimate the error in (jih, uh), and thus in (Xh, uh), by applying the results in Section 7. T and Th are clearly compact on H°(Q). We will show \\T—Tk\\-*0 in the next paragraph. In order to apply Theorem 7.3 on H°(Q) we need estimates for ((T— Th)u, u), ||(T— TA)u||0,fl and ||(Г— T£)u||0,fl, where и is an eigenfunction of A2.1) corresponding to the eigenvalue 1 (or p) we are approximating. These estimates are all contained
772 /. BabuSka and J, Osbom Chapter III in Bramble and Schatz [1970] (and also in Baker [1973]) for the choice р = й~3. In their Corollary 4.1 take y-\, X — t~2, g = 0, 1= —s and r~p + l to get A2.9) |((Г-Гл)ф,^)о,„|^№ + '||^|,-2.«|1'/'||5.й for(Ks«Sp-3, 2<t<p + l. Taking s=0 and t = 2 in A2.9) shows that |] Г-Г», || ~>0. Now take s=p-3 and t = p +1 to obtain |((Г-Т^,^)о,0|<СЛ2'-2||^||р-1.0||^||р_3>0, A2.10) take s=0 and t = p+1 to obtain |((Г-Т»)^^)о.0|<СЛ'+1|1Ф11р-1,0||^11о.0, and hence ||((Г-Г»)ф)||о.0<СЛ'+1 №llp-i.o, A2.11) and take s = p — 3 and t = 2 to obtain \((T-Th)<t>,M0,Q\ = \(<f>,(T-Tmo,n\^Ch*-1№\0,S} H-Allp-3.fi, and hence ||((Г-Г?)^)||0,п$СЛ'-1||^||р-з.о. A2.12) Theorem 12.1. Suppose the approximate eigenpairs {Xith,Ujth) are defined by A2.8) with p = h~3 and suppose the eigenfunctions o/A2.1) belong to Яр~1@). Then \lk,h~kk\^Ch^-2 A2.13) and ll«w-«t|lo,o<Cfc' + 1. A2.14) Proof. Let Xk be any eigenvalue of A2.1) and suppose its geometric multiplicity is q, i.e., the geometric multiplicity of цк = Xll is q. Since T is selfadjoint, the ascent is one and the algebraic multiplicity of цк is also q. q of the kith will converge to Xk. Let AM be one of them. Theorem 7.3 can now be applied and A2.13) follows directly from G.15) and A2.10)—A2.12) since all of the eigenfunctions of A2.1) corresponding to Xk belong to HP~1(Q). Estimate A2.14) follows from Theorems 7.1 and 7.4 and A2.11). □ Remark 12.1. Even though A2.1) is selfadjoint, A2.8) is a nonselfadjoint (finite- dimensional) problem. Thus one needs the general (not necessarily selfadjoint) theory in Section 7 to analyze the least squares method. The nonselfadjointness of A2.8) is the reason we have used complex function spaces in this analysis. 12.2. The penalty method We will once more consider the membrane eigenvalue problem A0.38) and assume the boundary Г of Q is of class C°° (cf. also A2.1)). In Section 10 we gave this
Section 12 Applications 773 problem the variational formulation ueHh@), a(u,v) = Ab(u,v) VveHliQ), A2.15) where a(u,v) = IVu-Vvdxdy A2.16a) ft and b(u, v) = \uv dx dy. A2.16b) Let us replace the boundary condition и=0 on Г in A0.38) by и + ф \ди/дп)=О, i.e., let us consider the problem -Au=Au in Q, A217} , , du и + ф'1 — =0 onF, on where ф = фь^\ is a parameter that approaches +oo as n->0. It is easily seen that A2.17) has the variational form ueH^Q), аф{и,ь) = Щи,ю) \fveH\Q), A2.18) where аф(и,ю)= Vu-Vudxdy + ф \uvds. A2.19) о й Note that in A2.18), in contrast to A2.15), we have not imposed any constraint on either и or v. This is the case since и + ф ~* фи/дп) = О is a natural boundary condition (cf. Section 3). We now estimate the error between the eigenvalues and eigenvectors of A2.15) and A2.18). Toward this end consider the corresponding source problems: -Au0=f inQ, м=0 on Г A2.20) and -Au=f infl, и + ф'1~ = 0 on Г. A2.21) We view A2.21) as an approximation to A2.20). Denoting by u0 (respectively u^) the solution of A2.20) (respectively A2.21)), we are interested in estimating иф — и0. It is shown in BabuSka and Aziz [1973, Section 7.2] that щ=и0~ф-^ + С, A2.22)
774 I. BabuSka and J. Osborn Chapter III where £ is the solution of the problem -Л£+£=0 infi, {=Эи0/8п inQ A2.23) and С is the solution to СеЯх@), аД,ю)=ф~1а(£,и) 4veHl{Q). A2.24) From A2.16), A2.19) and A2.24) we have 11С111чо)<МС,0=^-М5,0=^-111511я'(П,НС11нчо) and hence К\\нчп)^Г1и\\нчпу A2-25) From A2.23) and A2.25) we obtain \\щ~и0\\ИЧП)^2ф-1 Ш\ннпу A2-26) From A2.23) and regularity results for elliptic boundary value problems we get Ш1яч«)^С||Л1яо(й>. A2.27) Combining A2.26) and A2.27) yields II"* -«о Ннчя> *S Ct/ГJ ll/H но(Я)- A2.28) If we now introduce the operators T and T^ on H°{Q) by Tf=UQ and Тф/ = иф, then A2.28) implies that \\(Т~Тф)Л\но(П) ^Сф-1 ||/||0.о. A2.29) It is immediate that (А, и) is an eigenpair of A2.15) if and only if (ц = A, u) is an eigenpair of T; likewise (A^, u^) is an eigenpair of A2.18) if and only if щ = (А^*, щ) is an eigenpair of Тф. It thus follows immediately from Theorems 7.1-7.4 and A2.29) that lAj-^jKCiT1 A2.30) and !|И;-м^11я'№)<С|/'"> A2.31) where (lj, uj) and (A^j, w^) denote the eigenpairs of A2.15) and A2.18), respectively. Note that A2.30) and A2.31) are estimates of the same order for both the eigenvalue and eigenvector errors. This is in contrast to approximations we have analyzed previously in this article. An analysis of a one-dimensional model problem shows that, for the type of approximations we are considering, the eigenvalue and eigenvector error is, indeed, of the same order. Next we consider the problem A2.18) and approximate it by a finite element method, letting the resulting eigenpairs be (A^Jifc, u^jj,). Since и and v in A2.18) are taken in H1 (Q), we need not impose any boundary condition on the trial and
Section 12 Applications 775 test space Sh. If one now analyzes the error in the finite element approximation of A2.18), selects ф so that the error in passing from A2.15) to A2.18) is of the same magnitude of that incurred in the finite element approximation of A2.18), and then combines the error estimates A2.30) and A2.31) with those for the finite element approximations of A2.18), one obtains estimates for the difference between (A.j,ttj) and (^j,h,u^jih). We stress that the (A^-ifc, u$tjth) are calculated from a matrix eigenvalue problem corresponding to trial and test spaces that are not required to satisfy the essential boundary condition for the membrane problem A2.1). The approximation method we have outlined is referred to as the penalty method. We refer the reader to BabuSka and Aziz [1973, Section 7.2] for a detailed analysis of the penalty method for the source problem. Estimates for the errors in eigenvalue approximation can be easily derived from the corresponding source problem estimates by means of Theorems 7.1-7.4. Because this application of these error estimates to the eigenvalue problem is similar to those discussed above and raises no new issues, we will not give a formal statement of the results. Remark 12.2. If О is a polygon, then the choice ф = оо corresponds to satisfying the boundary condition on SO, i.e., constraining Sp(x) to be Sg(t), and the resulting method is identical with that discussed in Section 10.2. If Г is not polygonal, then ф = qo will lead to the constraint Sp(z) — Sg(r), where £g(t) consists of those и e 5р(т) which are zero on every triangle which intersects Г. The finite element solution then solves the problem on Q instead of Q, where Q consists of the union of all triangles which do not intersect Г. Sometimes the mesh is constructed so that Q\Q is as small as possible by interpolating Г by straight lines. Remark 12.3. In practical computation (codes) the penalty method (or some equivalent method) is often also used when Q is a polygon by taking ф to be very large (say ф = 108). This is just a way of imposing the essential boundary conditions in the code. Remark 12.4. The least squares method and penalty method are seldom used as a way to treat essential boundary conditions on a curved boundary because of the difficulty in the computation of a#(u, v), which requires area integrations over triangles which intersect the boundary. The usual approach is to use curvilinear elements, which allow exact satisfaction of the boundary condition in a similar way as when the domain is polygonal (cf. Remark 12.3). Let us end this section by noting some similarities and differences in the least squares and penalty methods. - Both methods circumvent essential boundary conditions by reformulating the original problem in terms of a one-parameter family of variational formulations. In both methods, the optimal value of the parameter depends on the mesh, i.e., on h. - With the least squares method, the optimal value of the parameter (p = h~3) is independent of the solution. This is related to the fact that the alternate variational
776 1. BabuSka and J, Osborn Chapter HI formulation characterizes the solution exactly for any value of the parameter. In the case of the penalty method, the optimal value of the parameter depends on the mesh and the smoothness of the solution or the eigenfunction. This is related to the fact that the exact solution does not exactly satisfy the one-parameter family of formulations for any value of the parameter ф ф + oo. - The least squares method employs C1-elements (i.e., subspaces of H2(Q)), whereas the penalty method employs C°-elements (i.e., subspaces of H1{Q)). As we have previously noted, C°-elements are easier to construct than С ^elements. 13. Concluding remarks Remark 13.1. We have illustrated the application of the general theory that was presented in Chapter II by considering several important model problems. It should be clear from the analysis of these model problems how to treat a wide variety of problems. We have seen, however, that the application of the general theory to a concrete problem may require subtle analysis. Remark 13.2. In Sections 10, 11 and 12 we have illustrated the main approach to finite element approximation of eigenvalue problems. We have seen that there are many available methods and that their basic theoretical properties can be established as an application of the results in Chapter II. Nevertheless, the implementation of these methods raises many other important questions; although we cannot address these questions in detail, we now mention some of them. A) Which method is most effective for a specific problem? What is the goal of the computation? We remark that sometimes high accuracy is achieved for eigenvalue approximation, but that only low accuracy is obtained for the approximation of other important quantities such as the stresses, moments, or shear forces. B) What types of meshes or adaptive mesh procedures are desirable? How should the quality of the computed results be assessed a posteriori? For a survey of results in this direction, see Noor and Babuska [1987]. C) Which matrix eigenvalue solvers should be used? What is the influence of the computer architecture? These questions are, of course, not restricted to eigenvalue computation. They also arise with finite element computation of source problems. Remark 13.3. The Ritz method, which was discussed in Section 10, is most easily analyzed with the results of Section 8, specifically with (8.44)-(8.46). Note that because of (8.32), (8.11) is satisfied with p = a and thus the major requirement on Sh is that it have good approximation properties. Remark 13.4. We have seen in Remark 11.2 that mixed methods for eigenvalue approximation have the form of (8.10) and (8.14). Thus, if a method satisfies the hypotheses of Section 8, specifically (8.1), (8.2), (8.3), (8.6), (8.11) and (8.13), then
Section 13 Applications 777 the method can also be analyzed with the results of Section 8. Most mixed methods, however, fail to satisfy at least one of these hypotheses, and we thus cannot rely on the results of Section 8. We now comment on two of the examples discussed in Section 11 in regard to which results in Chapter II their analysis is based on. A) Consider first the mixed method discussed in Section 11.2 for the membrane problem. It is easily seen that the variational formulation A1.24) satisfies (8.1), (8.2), and (8.3), but that it does not satisfy (8.6). In Section 8, assumption (8.6) is used to show that the operator Г defined by (8.8) is compact. For our example, for (f,g)eH(div,Q)xH°(Q), T(f,g)={c,u), where и solves ueHo(Q), — Au = g in Q and a = Vm, and, by noting in particular the dependence of a on g, we see that T: H(div, Q) x #°@)->#(div, Q) x H°(Q) is not compact. Since Г is not compact, Th> as defined by (8.16), cannot converge to T in norm. Because of these facts, the results of Section 7 do not apply (to this T). The analysis that we used for this problem (cf. Theorem 11.1) is based on Section 7 and circumvents this difficulty by using a different operator, namely T: Я°@)->Я°(Й) defined by Tg — u (cf. 11.8(c)). As mentioned in Remark 7.7, results for noncompact operators which parallel those in Section 7 have been proved by Descloux, Nassif and Rappaz [1978a, 1978b], and one can, if fact, use them to derive the estimates we obtained in Section 11.2, specifically A1.32) and A1.33). We will not present the details of this analysis but will comment briefly on the applicability of the results of Descloux, Nassif, and Rappaz [1978] to our problem. For their results, T is not required to be compact and Th is assumed to converge to Г in the sense that inf ||(*,«)-М11в-0 A3.1) for each (a, u) e H(div, Q) x H°(Q) and «r»-T|U= sup \\(Th-T)(f,g)\\H(div^HoiQ)-+0 as/j-+0. A3.2) if.g)eVh x Wh ll(/.9)lHldiv,n)X«0ini With Vh and Wh defined as in A1.31), A3.1) follows from the approximation result in Raviart and Thomas [1977]. We now verify A3.2), which is central point in any application of the results of Descloux, Nassif and Rappaz [1978]. For (/, g)eVhxWh, let {a, u) = T(J, g) and (<rA, uh)=Th(f, g), where Г and Th are defined by (8.8) and (8.16), respectively, for the problem discussed in Section 11.2. We know that ueH1{Q), —Au—g and
778 1. BabuSka and J, Osborn Chapter III a — Vu, and hence div o — g. Also, if g e Wh it is easily seen that div uh = —g. Thus H(div,») x Я°(П) = \\(Vh, W),) —(ff, M)||H(div>0)xHO(J3) =(l|div oh -div <x\\in + \\ffh-a\\io{S2)+ \\ии~и\Цп)ш ={\\с»~-°\\2вчп) + \\ин-и\\1п)'12 for (f,g)e Vh x Wk. A3.3) From the results in Falk and Osborn [1980] we have II fffc - ff II над, <Ch || н || 2>n <Cft || ff || ол A3.4a) and II"/, - «llo.o^Ch2 ||ы||2,„^СЙ2 ||ff||0iO. A3.4b) Combining A3.3) and A3.4) we get ||(ГЬ-ГХЛ0I1 ^СЩд\\ол <Ch\\{f,g)\\ for (f,g)e VhxWh. A3.5) A3.2) follows directly from A3.5). B) Consider next the method discussed in Section 11.3 for the vibrating plate problem. The variational formulation A1.38) for the problem does not satisfy (8.2) and (8.11). Note that the method was analyzed by means of Theorem 11.1 which is based on Theorem 7.3. Remark 13.5. The fact that many mixed approximation methods fail to satisfy the usual hypotheses (cf. BabuSka [1971], BabuSka and Aziz [1979] and Brezzi [1974]) for variational approximation methods is an issue for the approximation of source problems as well as eigenvalue problems. The abstract results in Falk and Osborn [1980] have as their main application the analysis of mixed methods which fail to satisfy the usual hypotheses for variational approximation methods. In this connection see also BabuSka, Osborn and PitkXranta [1980], where problem A1.38) is reformulated in terms of alternate spaces with alternate (mesh-dependent) norms so as to satisfy the usual hypotheses. Acknowledgement The authors would like to thank Professor Christopher Beattiz for his careful reading of the manuscript.
References Adams, R.A. A975), Sobolev Spaces (Academic Press, New York). Agmon, S. A965), Lectures on Elliptic Boundary Value Problems, Van Nostrand Mathematical Studies 2 (Van Nostrand, Princeton, NJ). Aronszajn, N. A948), The Rayleigh-Ritz and A. Weinstein methods for approximation of eigenvalues, I: Operators in a Hilbert space; II: Differential operators, Proc. Nat. Acad. Sci. U.S.A. 34, 474-480, 594-601. Aronszajn, N. A949-50), The Rayleigh-Ritz and the Weinstein methods for approximation of eigenvalues, Tech. Reps. 1-4, Oklahoma A&M College, Stillwater, OK. Aronszajn, N. and A. Weinstein A942), On the unified theory of eigenvalues of plates and membranes, Amer. J. Math. 64, 623-645. BabuSka, I. A971), Error bounds for finite element method, Numer. Math. 16, 322-333. BabuSka, I. and A. Aziz A973), Survey lectures on the mathematical foundations of the finite element method, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Application to Partial Differential Equations (Academic Press, New York) 5-359. BabuSka, I, G. Laloz and J. Osborn A990), Finite element method for second order partial differential equations with rough coefficients. BabuSka, I. and B.Q. Guo A988a), Regularity of the solution of elliptic problems with piecewise analytic data, Part 1: Boundary value problems for linear elliptic equations of the second order, SI AM J. Math. Anal. 19, 172-203. BabuSka, I. and B.Q. Guo A988b), The h-p version of finite element method with curved boundary, SI AM J. Math. Anal. 25, 837-861. BabuSka, I. and B.Q. Guo A989), The h-p version of finite element method for problems with non- homogeneous essential boundary condition, Comput. Methods Appl. Math. Engrg. 74, 1-28. BabuSka, I., B.Q. Guo and J. Osborn A989), Regularity and numerical solution of eigenvalue problems with piecewise analytic data. BabuSka, I. and A. Miller A984), The post-processing approach in the finite element method, Part 2: The calculation of stress intensity factors, Internat. J. Numer. Math. Engrg. 20, 1111-1129. BabuSka, I. and J. Osborn A978), Numerical treatment of eigenvalue problems for differential equations with discontinuous coefficients, Math. Сотр. 32, 991-1023. BabuSka, I. and J. Osborn A983), Generalized finite element methods and their relation to mixed methods, SIAM J. Numer. Anal. 20, 510-536. BabuSka, I. and J. Osborn A985), Finite element methods for the solution of problems with rough data, in: P. Grisvard, W. Wendland and J.R. Whitman, eds., Singularities and Constructive Methods for Their Treatment, Lecture Notes in Mathematics 1121 (Springer, Berlin) 1-18. BabuSka, I. and J. Osborn A987), Estimates for the errors in eigenvalue and eigenvector approximation by Galerkin methods, with particular attention to the case of multiple eigenvalues, SIAM J. Numer. Anal. 24, 1249-1276. BabuSka, I. and J. Osborn A989), Finite element-Galerkin approximation of the eigenvalues and eigenvectors of selfadjoint problems, Math. Сотр. 52, 275-297. BabuSka, I., J. Osborn and J. PitkAranta A980), Analysis of mixed methods using mesh dependent norms, Math. Сотр. 35, 1039-1062. BabuSka, I. and E. Rank A987), An expert-system-like feedback approach in the h-p version of finite element method, Finite Elements Anal. Design 3, 127-147. 779
780 /. BabuSka and J. Osborn Babuska, I. and T. Scapolla A987), The computational aspects of the h, p and h-p versions of finite element method, in: R. Vichnevetsky and R. Stepleman, eds., Advances in Computer Methods for Partial Differential Equations VI (MACS, New Brunswick, NJ), 233-240. BabuSka, I. and M. Suri A987a), The optimal convergence rate of the p-version of finite element method, SIAM J. Numer. Anal. 24, 750-776, BabuSka, I. and M. Suri A987b), The h-p version of the finite element method with quasiuniform meshes, RAIRO Model. Math. Anal. Numer. 21, 119-238. Baker, G. A973), Simplified proofs of error estimates for the least squares method for Dirichlet's problem, Math. Сотр. 27, 229-235. Banerjee, U. A987), Approximation of eigenvalues of differential equations with non-smooth coefficients, RAIRO Model Math. Anal. Numer. 22, 29-51. Banerjee, U. and J. Osborn A990), Estimates of the effect of numerical integration in finite element eigenvalue approximation. Bazley, N.W. A959), Lower bounds for eigenvalues with applications to the helium atom, Proc. Nat. Acad. Sci. U.S.A. 45, 850-853. Bazley, N.W. A961), Lower bounds for eigenvalues, /. Math. Mech. 10, 289-308. Bazley, N.W. and D.W. Fox A961a), Truncation in the method of intermediate problems for lower bounds to eigenvalues, Л Res. Nat. Bur. Standards 65B, 105-111, Bazley, N.W. and D.W. Fox A961b), Lower bounds for eigenvalues of Schrodinger's equation, Phys. Rev. 124, 483-492. Bazley, N.W. and D.W. Fox A963), Lower bounds for energy levels of molecular systems, J. Math. Phys. 4, 1147-1153. Birkhoff, G., С de Boor, B.J. Swartz and В. Wendroff A966), Rayleigh-Ritz approximation by piecewise cubic polynomials, SIAM J. Numer. Anal. 3, 188-203. Bramble, J.H. and J.E. Osborn A973), Rate of convergence estimates for nonselfadjoint eigenvalue approximations, Math. Сотр. 27, 525-549. Bramble, J.H. and A.H. Schatz A970), Rayleigh-Ritz-Galerkin methods for Dirichlet's problem using subspaces without boundary conditions, Comm. Pure Appl. Math. 23, 653-675. Brezzi, F. A974), On the existence uniqueness and approximation of saddle-point problems arising from Lagrangian multipliers, RAIRO R2 8, 129-151. Brezzi, F. and P. Raviart A977), Mixed finite element methods for 4th order elliptic equations, in: J. Miller, ed., Topics in Numerical Analysis III (Academic Press, New York) 33-56. BUTZER, P.L. and H. Berens A967), Semi-Groups of Operators and Approximation, Lecture Notes in Mathematics 145 (Springer, Berlin). Canuto, C. A978), Eigenvalue approximations by mixed methods, RAIRO Anal. Numer. 12, 27-50. Chatelin, F. A973), Convergence of approximate methods to compute eigenelements of linear operators, SIAM J. Numer. Anal. 10, 939-948. Chatelin, F. A975), La methode de Galerkin, Ordre de convergence des elements propres, C.R. Hebd. Seances Acad. Ser. A 278, 1213-1215. Chatelin, F. A981), The spectral approximation of linear operators with applications to the computation of eigenelements of differential and integral operators, SIAM Rev. 23, 495-522. Chatelin, F. A983), Spectral Approximations of Linear Operators (Academic Press, New York). Chatelin, F. and J. Lemordant A978), Error bounds in the approximation of eigenvalues of differential and integral operators, J. Math. Anal. Appl. 62, 257-271. Ciarlet, P. A978), The Finite Element Method for Elliptic Problems (North-Holland, Amsterdam). Ciarlet, P. and P. Raviart A974), A mixed finite element method for the biharmonic equation, in: С de BOOR, ed., Symposium on Mathematical Aspects of Finite Elements in Partial Differential Equations (Academic Press, New York) 125-143. Collatz, L. A948), Eigenwertprobleme und Ihre Numerische Behandlung (Chelsea, New York). CoURANT, R. A927), tfber direkte Methoden in der Variationsrechnung und fiber verwandte Fragen, Math. Ann. 97, 711-736. Courant, R. A943), Variational methods for the solution of problems of equilibrium and vibrations, Bull. Amer. Math. Soc. 49, 1-23. Courant, R. and D. Hilbert A953), Methods of Mathematical Physics 1 (Wiley-Interscience, New York).
References 781 Cullum, J. and R.A. Willoughby A985), Lanczos Algorithms for Large Symmetric Eigenvalue Computations, I: Theory; II: Programs (Birkhauser, Boston, MA). Descloux, J., N. Nassif and J. Rappaz A978a), On spectral approximation, Part 1: The problem of convergence, RAIRO Anal. Numer. 12, 97-112. Descloux, J., N. Nassif and J. Rappaz A978b), On spectral approximation, Part 2: Error estimates for the Galerkin method, RAIRO Anal. Numer. 12, 113-119. Dunford, N. and J.T. Schwartz A958), Linear Operators, I: General Theory (Wiley-Interscience, New York). Dunford, N. and J.T. Schwartz A963), Linear Operators, II: Spectral Theory, Selfadjoint Operators in Hilbert Spaces (Wiley-Interscience, New York). Falk, R. A978), Approximation of the biharmonic equation by a mixed finite element method, SI AM J. Numer. Anal. 15, 556-567. Falk, R. and J. Osborn A980), Error estimates for mixed methods, RAIRO Anal. Numer. 14, 249-277. Fix, G.J. A972), Effects of quadrature errors in finite element approximation of steady state, eigenvalue, and parabolic problems, in: A.K. Aziz, ed., The Mathematical Foundations of the Finite Element Method with Applications to Partial Differential Equations (Academic Press, New York) 525-556. Fix, G.J. A973), Eigenvalue approximation by the finite element method, Adv. in Math. 10, 300-316. Glowinski, R. A973), Approximations externes par elements finis de Lagrange d'ordre un et deux, du probleme de Dirichlet pour l'operateur biharmonique, Methodes iteratives de resolutions des problemes approches, in: J.J.H. Miller, ed., Topics in Numerical Analysis (Academic Press, New York) 123-171. Grigorieff, R.D. A975a), Diskrete Approximation von Eigenwertproblemen, I: Qualitative Konver- genz, Numer. Math. 24, 355-374. Grigorieff, R.D. A975b), Diskrete Approximation von Eigenwertproblemen, II: Konvergenzordnung, Numer. Math. 24, 415-433. Grigorieff, R.D. A975c), Diskrete Approximation von Eigenwertproblemen, III: Asymptotische Entwicklung, Numer. Math. 25, 79-97. Grisvard, P. A985), Elliptic Problems in Nonsmooth Domain (Pitman, Boston, MA). Gui, W. A988), Hierarchical elements, local mappings and the h-p version of the finite element method, Part I, Part II, J. Сотр. Math. 6, 54-68, 142-156. Gui, W. and I. Babuska A986), The h, p, h-p versions of finite element methods in one dimension, Numer. Math. 49, 577-683. Guo, B.Q. and I. Babuska A986), The h-p version of finite element method, Part 1: The basic approximation results, Сотр. Mech. 1,21-41; Part 2: General results and application, Сотр. Mech. 1, 203-220. Herrmann, L. A967), Finite element bending analysis for plates, J. Engrg. Mech. Div. ASCE 93, 49- 83. Hersch, J. A955), Equations differentielles et fonctions des cellules, C.R. Acad. Sci. Paris 240,1602-1605. Hersch, J. A963), Lower bounds for all eigenvalues by cell functions: A refined form of H.F. Weinberger's method, Arch. Rational Mech. Anal. 12, 361-366. Hubbard, В.Е. A961), Bounds for eigenvalues of the free and fixed membrane by finite difference methods, Pacific J. Math. 11, 559-590. Hubbard, B.E. A962), Eigenvalues of the non-homogeneous rectangular membrane by finite difference methods, Arch. Rational Mech. Anal. 9, 121-133. Ishihara, K. A977), Convergence of the finite element method applied to the eigenvalue problem Ju + lu=0, Publ. Inst. Math. Sci. Kyoto Univ. 13, 48-60. Ishihara, K. A978a), The buckling of plates by the mixed finite element-method, Mem. Numer. Math. 5, 73-82. Ishihara, K. A978b), A mixed finite element method for the biharmonic eigenvalue problem of plate bending, Publ. Res. Inst. Math. Sci. Kyoto Univ. 14, 399-414. Johnson, С A973), On the convergence of a mixed finite element method for plate bending problems, Numer. Math. 21, 43-62. Kato, T. A958), Perturbation theory for nullity, deficiency and other quantities of linear operators, J. Anal. Math. 6, 261-322.
782 I. BabuSka and J. Osborn Kato, T. A966), Perturbation Theory for Linear Operators, Lecture Notes in Mathematics 132 (Springer, Berlin). Kellogg, R.B. and J.E. Osborn A975), A regularity result for the Stokes problem in aconvex polygon, J, Funct. Anal. 21, 397-431. Knops, R.J. and L.E. Payne A971), Uniqueness Theorems in Linear Elasticity (Springer, Berlin). Kolata, W. A978), Approximation of variationally posed eigenvalue problems, Numer. Math. 29, 159-171. Kondratev, V.A. A968), Boundary problems for elliptic equations with conical or angular points, Trans. Moscow Math. Soc. 16 A967) (translated by American Mathematical Society, Providence, RI). Kreiss, H.O. A972), Difference approximation for boundary and eigenvalue problems for ordinary differential equations, Math. Сотр. 26, 605-624. KuFNER, A. A985), Weighted Sobolev Spaces (Wiley, New York). Kuttler, J.R. A970a), Finite difference approximations for eigenvalues of uniformly elliptic operators, SI AM J. Numer. Anal. 7, 206-232. Kuttler, J.R. A970b), Upper and lower bounds for eigenvalues by finite differences, Pacific J. Math. 35, 429-440. Lax, P.D. and A.N. Milgram A954), Parabolic equations, in: Annals of Mathematics Studies 33 (Princeton University Press, Princeton, NJ) 167-190. Mercier, B. A974), Numerical solution of the biharmonic problems by mixed finite elements of class C°, Boll. Un. Mat. Hal. 10, 133-149. Mercies, В., J. Osborn, J. Rappaz and D. Raviart A981), Eigenvalue approximation by mixed and hybrid methods, Math. Сотр. 36, 427-453. Mercier, B. and J. Rappaz A978), Eigenvalue approximation via non-conforming and hybrid finite element methods, Rapport du Centre de Mathematiques Appliquees, Ecole Polytechnique, Palaiseau, France. Merigot, M. A974), Regularity des functions propres du Laplacien dans un cone, C.R. Acad. Sci. Paris Ser. A 279, 503-505. MoRREY, C.B. A966), Multiple Integrals in Calculus of Variations (Springer, Berlin). Necas, J. A967), Les Methodes Directes en Theorie des Equations Elliptiques (Masson, Paris). Necas, J. and I. HlavACEk A970), On inequalities of Korn's type, II: Applications to linear elasticity, Arch. Rational Mech. Anal. 36, 312-334. Nemat-Nasser, S. A972a), General variational methods for elastic waves in composites, J. Elasticity 2, 73-90. Nemat-Nasser, S. A972b), Harmonic waves in layered composites, J. Appl. Mech. 39, 850-852. Nemat-Nasser, S. A974), General Variational Principles in Non-linear and Linear Elasticity with Applications, Mechanics Today 1 (Pergamon Press, New York) 214-261. Noor, A.K. A985), Books and monographs on finite element technology, Finite Elements Anal. Design 1, 101-111. Noor, A.K. and I. Babuska A987), Quality assessment and control of finite element solutions, Finite Elements Anal Design 3, 1-26. Oden, J. A973), Some contributions to the mathematical theory of mixed finite element approximations, in: Theory and Practice in Finite Element Structural Analysis (University of Tokyo Press, Tokyo) 3-23. Oden, J.T. and G.T. Carey A982), Finite Elements Mathematical Aspects IV (Prentice-Hall, Englewood Cliffs, NJ). Oden, J.T. and J.N. Reddy A976), An Introduction to the Mathematical Theory of Finite Elements (Wiley, New York). Osborn, J.E. A975), Spectral approximation for compact operators, Math. Сотр. 26, 712-725. Osborn, J.E. A979), Eigenvalue approximations by mixed methods, in: R. Vichnevetsky and R. Stepleman, eds., Advances in Computer Methods for Partial Differential Equations III (IMACS, New Brunswick, NJ) 158-161. Osborn, J.E. A981), The numerical solution of differential equations with rough coefficients, in: R. Vichnevetsky and R. Stepleman, eds., Advances in Computer Methods for Partial Differential Equations IV (IMACS, New Brunswick, NJ) 9-13.
References 783 Parlett, B.N. A980), The Symmetric Eigenvalue Problem (Prentice-Hall, Englewood Cliffs, NJ). Pierce, J.G. and R.S. Varga A972), Higher order convergence results for the Rayleigh-Ritz method applied to eigenvalue problems, 2: Improved bounds for eigenfunctions, Numer. Math. 19, 155- 169. Polya, G. A952), Sur une interpretation de la methode des differences finies qui peut fournir des bornes superieures ou inferieures, C.R. Acad. Sci. Paris 235, 995-997. Prodi, G. A962), Theoremi di tipo locale per il sistema de Navier-Stokes e stabilita della soluzioni stationarie, Rend. Sem. Mat. Univ. Padova 32, 374-397. Prosdorf, S. and G. Schmidt A981), A finite element collocation method for singular integral equations, Math. Nachr. 100, 33-60. Raviart, Р.А. and J.M. Thomas A977), A Mixed Finite Element Method for Second Order Elliptic Problems, Lecture Notes in Mathematics 606 (Springer, Berlin) 292-315. Scholz, R. A976), Approximation von Sattelpunkten mit finiten Elementen, Tagungsband, Bonn. Math. Schr. 89, 53-66. Strang, G. and G.J. Fix A973), An Analysis of the Finite Element Method (Prentice-Hall, Englewooi Cliffs, NJ). Stummel, F. A977), Approximation methods for eigenvalue problems in elliptic differential equations, in: E. Bohl, L. Collatz and K.P. Hadeler, eds. Numerik und Anwendungen von Eigenwertaufgaben und Verzweigungsproblemen (Birkhauser, Basel) 133-165. Synge, J.L. and B.A. Griffith A959), Principles of Mechanics (McGraw-Hill, New York, 34th ed.). Szabo, B.A. A985), PROBE: Theoretical Manual, NOETIC Tech., St. Louis, MO. Szabo, B.A. A986), Mesh design for the p-version of the finite element method, Comput. Methods Appl. Math. Engrg. 55, 181-197. Szabo, B.A. and I. BabuSka A986), Computation of the amplitude of stress, singular terms for cracks and reentrant corners, in: T.A. Cruse, ed., Fracture Mechanics, 19th Symposium ASTM, STP 1969 (American Society Test, and Mat., Philadelphia, PA) 101-126. VaInikko, G.M. A964), Asymptotic error bounds for projection methods in the eigenvalue problem, U.S.S.R. Comput. Math, and Math. Phys. 4, 9-36. VaInikko, G.M. A967), Rapidity of convergence of approximation methods in the eigenvalue problem, U.S.S.R. Comput. Math, and Math. Phys. 7, 18-32. VaInikko, G.M. A970), On the rate of convergence of certain approximation methods of Galerkin type in an eigenvalue problem, Amer. Math. Soc. Transl. 36, 249-259. Weinberger, H.F. A952a), Error estimation in the Weinstein method for eigenvalues, Proc. Amer. Math. Soc. 3, 643-646. Weinberger, H.F. A952b), An optimum problem in the Weinstein method for eigenvalues, Pacific J. Math. 2, 413-418. Weinberger, H.F. A956), Upper and lower bounds for eigenvalues by finite difference methods, Comm. Pure Appl. Math. 9, 613-623; also in: Proceedings Conference on Partial Differential Equations, Berkeley, CA A955). Weinberger, H.F. A958), Lower bounds for higher eigenvalues by finite difference methods, Pacific J. Math. 8, 339-368. Weinberger, H.F. A959), A theory of lower bounds for eigenvalues, Tech Note BN-183, Institute for Fluid Dynamics and Applied Mathematics, University of Maryland, College Park, MD. Weinberger, H.F. A960), Error bounds in the Rayleigh Ritz approximation of eigenvectors, J. Res. Nat. Bur. Standards 64B, 217-225. Weinberger, H.F. A974), Variational Methods for Eigenvalue Approximation, Regional Conference Series in Applied Mathematics 15 (SIAM, Philadelphia, PA). Weinstein, A. A935), Sur la stabilite des plaques encastrees, C.R. Acad. Sci. Paris 200, 107-109. Weinstein, A. A937), Etude des spectres des equations aux derivees partielles de la theorie des plaques elastiques, Mem. Sci. Math. 88. Weinstein, A. A953), Variational methods for the approximation and exact computation of eigenvalues, Nat. Bur. Standards Appl. Math. Ser. 29, 83-89. Weinstein, A. A963), On the Sturm-Liouville theory and the eigenvalues of intermediate problems, Numer. Math. 5, 238-245.
784 /. BabuSka and J. Osborn Weinstein, A. and W. Stenger A972), Methods for Intermediate Problems for Eigenvalues (Academic Press, New York). Weinstein, D.H. A934), Modified Ritz method, Proc. Nat. Acad. Sci. U.S.A. 20, 529-532. Wozniakowski, H.A. A985), A survey of information-based complexity, J. Complexity 1, 11-44.
Subject Index Adjoint eigenpair, 694 Algebraic multiplicity, 683, 693 Amplitude, 651 A posteriori error analysis, 726 Area, 646 Ascent, 683, 693 Asymptotic phase, 746 Besov space, 658, 660, 662 Boundary condition, — of Dirichlet type, 647,649,656,664,665,668 — of Neumann type, 647, 648, 649, 656, 664, 667, 668 — of Newton type, 647, 648, 649 essential —, 668 natural —, 668 Clamped plate, 761 Clamped solid, 657 Compact operator, 671, 683 spectral theory for —s, 683 Conormal derivative, 664 Constitutive law, 646 Countably normed spaces, 662 Damping force, 647, 652 Dirichlet boundary condition, 647, 649, 656, 664, 665, 668 Displacement, 646 phase —, 651 Domain with corners, 666, 674 Dynamic case, 647 Eigenfunction, 649, 655 regularity of —s, 673, 716, 733, 758, 763 Eigenpair, 649, 655, 683, 694 adjoint —, 694 Eigenspace, 695 Eigenvalue, 649, 693 algebraic multiplicity of an —, 683, 693 — approximation, 674 geometric multiplicity of an —, 683, 693 multiple —, 739 refined — error estimates, 699 Steklov —, 657 Eigenvalue problem, 649, 655 error estimates for —s for multiple eigenvalues, 700 generalized matrix —, 676 nonselfadjoint —, 673 properties of—s, 670 second-order elliptic —, 663, 747 selfadjoint —, 673, 674 Steklov-type —, 649, 657, 664 variational formulation of —s, 663, 692 Eigenvector, 649, 683, 693 generalized —, 683, 693 order of generalized —, 683, 693 Eigen vibration, 651 initial position of —, 651 initial velocity of —, 651 Elastic bar, 645 Elastic solid, 655, 664, 668 Elastic support, 646 Elasticity, modulus of —, 646, 716 Ellipticity constant, 666 Energy norm, 697, 718 relative error in —, 725, 727 Equilibrium condition, 646 Finite element approximation, 675 Finite element method, 674, 694 Finite element spaces, 694 Formal adjoint, 664 Formally selfadjoint operator, 665, 671 Free solid, 657 Frequency, 651 resonant —, 652 785
786 I. BabuSka and J. Osborn Galerkin method, 694 — approximation, 675 Gap, 684 Geometric multiplicity, 671, 683, 693 Generalized eigenvector, 683, 693 Generalized matrix eigenvalue problem, 676 Heat conduction, 654 fundamental law of —, 654 Hilbert space, 659 Hooke's law, 646 h-p version of the finite element method, 728 ft-version of the finite element method, 722 Inf-sup condition, 693 discrete —, 694 Initial position of an eigenvibration, 651 Initial velocity of an eigenvibration, 651 Internal force, 646 K-method, 660 Lame constants, 656, 657 Lanczos algorithm, 731 Lax-Milgram theorem, 671 Least squares method, 769, 770 Lipschitz continuous boundary, 659, 660 Load, external longitudinal —, 645 — due to continuous elastic support, 646 resonant —, 652 Longitudinal vibration, 645 L-shaped panel, 715, 722 vibration of—, 668, 715 Mass, 646 Mass matrix, 681 consistent —, 681 lumped —, 681 Matrix eigenvalue problem, 694, 731 generalized —, 676 Maximum-minimum principle, 698, 699 Membrane, 653, 732 Mesh, 733 curvilinear —, 750 quadrilateral —, 750 triangular —, 733 Minimum-maximum principle, 698, 699 Minimum principle, 698, 699 Mixed formulations, 670 Mixed method, 751, 755 — for problems with rough coefficients, 765 Modulus of elasticity, 646 Young's —, 716 Modululs of rigidity, 716 Navier-Lame equations, 655 Neumann boundary condition, 647, 648, 649, 656, 664, 667 Newton boundary condition, 647, 648, 649 Noncompact operator, 692 Nonselfadjoint operator, 658 One-parameter family of variational transformations, 769 Order of a generalized eigenvector, 683, 693 Outer normal derivative, 654, 656, 660 Penalty method, 772, 773 Phase displacement, 651 Poincare inequality, 653, 659 Poisson's ratio, 716 Pre-asymptotic phase, 746 p-version of the finite element method, 726 Rayleigh quotient, 698 Regularity of eigenfunctions, 673, 716, 720, 733, 758, 763 Rellich's theorem, 660, 671 Resolvent operator, 683 Resolvent set, 671, 683 Resonance, 652 Resonant frequency, 652 Resonant load, 652 Rigid body motion, 719, 720 Ritz method, 697, 700, 716 — for second-order problems, 715 Rough coefficients, 748 Second-order elliptic eigenvalue problem, 663, 747 Selfadjoint, 691 formally —, 664, 671 — operator, 671 Separated solution, 648, 651, 655 Sobolev space, 658, 660, 662 — with fractional order, 660, 662 vector-valued —, 660
Subject Index 787 weighted —, 662, 667 Source problem, 666, 671, 751 Space ®l(Q\ 662, 721, 750 Spectral approximation, 692 Spectral projection, 684 Spectral theory, 683 Spectrum, 671, 683 Stability, 657 Stable stationary solution, 657 Static problem, 645 Steklov eigenvalue problem, 649, 657, 664 Stiffness matrix, 681 Strain, 646 — energy, 718, 719 Stress, 646 — intensity factor, 732 String, 652, 677 homogeneous —, 652 transverse vibration of a —, 652 weightless elastic —, 678, 680 Support, elastic, 646 Thermal conductivity, 655 Trace, 659 Transverse vibration, 652, 653, 732, 761 Trapezoid (quadrature) rule, 677 Trial space, 694 Triangulation, 733 a-regular —, 733 <j-quasiuniform —, 734 uniform —, 734 Variational approximation method, 694, 697 Variational formulation, 663, 666, 692, 756, 761 one-parameter family of —s, 769 — of a weightless elastic string, 678, 680 Vibration, longitudinal — of an elastic bar, 645 transverse — of a clamped plate, 761 transverse — of a membrane, 653, 732 transverse — of a string, 652 — of an elastic solid, 655, 664 — of an L-shaped panel, 668, 715 Temperature distribution, 655 Test space, 694 Young's modulus of elasticity, 716
Evolution Problems Hiroshi Fujita Department of Mathematics, School of Science and Technology, Meiji University, Higashimata, Tamaku, Kawasaki-shi 214, Japan Takashi Suzuki Department of Mathematics, Faculty of Science, Tokyo Metropolitan University, Fukakusa, Setagayaku, Tokyo 158, Japan HANDBOOK OF NUMERICAL ANALYSIS, VOL. II Finite Element Methods (Part 1) Edited by P.G. Ciarlet and J.L. Lions © 1991. Elsevier Science Publishers B.V. (North-Holland)
Contents Chapter I. Elliptic Boundary Value Problems and Finite Element Methods: A Review and Remarks 793 1. Review of elliptic boundary value problems 793 2. Review of finite element methods for elliptic problems 797 3. Inverse assumptions and their consequences 800 4. Some L" estimates 802 5. Topics on spectra and fractional powers of approximate operators 804 5.1. Spectra of approximate operators 804 5.2. Negative powers of approximate operators 806 5.3. A discrete elliptic Sobolev inequality 807 Chapter II. The Semigroup Theory and the Finite Element Method 809 6. The evolutionary problem and the semigroup of operators 809 7. Semidiscrete approximation for parabolic equations 815 8. Fully discrete approximation for parabolic equations 821 8.1. Backward difference approximation 821 8.2. Forward difference approximation 823 9. Approximation for inhomogeneous equations 826 10. Remarks on approximation with higher accuracy 828 Chapter III. Evolution Equations and Error Analysis by Real Methods 831 11. Generation theory on evolution equations 831 11.1. Generation theory of Tanabe-Sobolevskii 833 11.2. Generation theory of Fujie-Tanabe 834 11.3. Generation theory of Kato-Sobolevskii 834 11.4. Generation theory of Kato-Tanabe 837 12. Semidiscrete approximation of temporally inhomogeneous parabolic equations 838 13. Fully discrete approximation of temporally inhomogeneous parabolic equations 845 14. Error analysis by energy methods 856 14.1. A priori estimates 856 14.2. Error analysis of semidiscrete approximations 860 14.3. Error analysis of fully discrete approximations 865 15. Discretization of hyperbolic equations 869 15.1. Error estimate by Helfrich's method 871 15.2. Error estimate by the energy method 876 15.3. Comments 879 Chapter IV. Refinements and Generalizations 881 16. L°° estimates 881 17. Rational approximation of semigroups 887 18. Multi-step methods 897 791
792 H. Fujita and T. Suzuki Chapter I Chapter V. Commentary 911 19. Variations of the finite element method 911 19.1. Lumping of mass 911 19.2. Upwind finite element method 912 20. Application to nonlinear problems 913 20.1. Nonstationary Navier-Stokes equation 913 20.2. Parabolic equations of blow-up type 916 20.3. Stefan problem 916 References 919 Subject Index 925
Chapter I Elliptic Boundary Value Problems and Finite Element Methods: A Review and Remarks The purpose of this article is to present a mathematical study of the finite element method applied to evolution problems such as parabolic or hyperbolic initial value problems. Our analysis is basically operator-theoretical, namely, from the viewpoint of evolution equations in Banach spaces. Therefore, our arguments on approximation, as well as those on the original problem in the continuous version, heavily depend on the study of elliptic boundary value problems and their finite element approximations. Those elliptic problems have been described in detail in the first article of this volume. However it seems to be convenient for later quotation to review here some fundamental facts together with some advanced results concerning them which are directly connected with our study of evolution problems. 1. Review of elliptic boundary value problems For the sake of simplicity, we suppose that Q с Ш2 is a polygon or a bounded domain with smooth boundary dQ, and consider the Poisson equation -Au = f in Й A.1) with the Dirichlet boundary condition ы = 0 on6& A.2) By a well-known argument, this boundary value problem is reduced to a weak form, that is, the variational problem in V=Hl{Q). Find ue Fsuch that a(u, ») = (/-,»), veV. A.3) Here, (,) and a(,) denote the L2 inner product and the Dirichlet form respectively. Namely, if, 9) = f(x)g(x)dx, f,geL2(Q), 793
794 H. Fujita and T. Suzuki Chapter I a{u, v) = Уи{х)-Щх)йх, uyeH^Q). Actually in this article, unless otherwise stated we assume that functions are complex-valued and that Hilbert spaces are complex spaces. We note that и е Яо(О) implies u\dQ=trace of и to 3C=0. Then, supposing that и is smooth, we have by Green's formula that a(u,v)= Vu*Vi?dx= — Auvdx = (f, v), n a since v e Ho{Q). Hence the identity A.1) follows formally from the arbitrariness of v. Henceforth, various generic constants are denoted indifferently by C. If it depends on some parameters, say a, /?,..., we shall denote it by Ca>p, Then by means of Poincare's inequality Ml^C0 || Vi>b, veV, A.4) where a(,) may be regarded as an inner product in V—Hl(Q). Let V* be the adjoint space of V. Then, regarding/e L2{Q) as an element in V* through v*(f, v}v = (/. v) for v e V, we can verify the unique solvability of A.3) from Riesz's representation theorem. In the case that Q has a smooth boundary or is a convex polygon, the solution ueV belongs to H2{Q) when/e X = L2(Q) and becomes a strong solution of A.1) with A.2). (See, Agmon [1965], Lions and Magenes [1968], or Grisvard [1985], for instance.) We proceed to the case of V=Hi(Q) in A.1). Supposing that и is smooth, we obtain by Green's formula that a(u, v) = Vw•Vvdx= dQ (du/dn)-vdS~ Au-vdx — (f,v). Here Э/8и is the differentiation along the outer unit normal vector n = (n1,n2) on dQ and dS denotes the surface element of 8£2. Taking veHl(Q), we have §bn(du/dn)-v dS = 0 so that the identity of A.3) follows. Therefore, (du/5n)-vdS=0 holds for every veH1{Q), hence 8м/8п = 0 on 8Q. In other words, the Neumann problem which is composed of -Au=f in Q, A.1') ди/дп = 0 on 80 A.2') is reduced to the variational problem A.3) for V = H'{Q).
Section 1 Finite element methods for elliptic problems 795 Unfortunately, since Poincare's inequality A.4) does not hold for V=H1(Q), a(,) is not an inner product in H\Q). Actually, the Neumann problem A.Г) with A.2') is not uniquely solvable. For instance, any constant function и=сеШ satisfies A.Г) with A.2') for /=0. To generalize the consideration above, let atj(x)=aJt(x), bj(x) and c(x) be real smooth functions on Q and suppose that the uniform ellipticity U=l holds with a constant Si > 0. We put ^ = ^(X,D)=~ £ _ev(x)_+£tyx)—+фс). Then, the boundary value problem for stfu=feX in О with u = 0 on 3fi is reduced to the variational problem A.3) if we take V=Ho(Q) and a(u, v) = 2 p, p. 2 ^ Л i,J=l Эх, Эх,- }ш1 дХ: A.5) A.6) A.7) Similarly, if <т(£) is a smooth function on 8fl and if we write 8 2 3 8v^ ,v=i s*; to denote the outer conormal differentiation associated with si, then the boundary value problem A.6) with 8u/3vrf + Gu = 0 on 80 is reduced to the variational problem A.3) with V=H1(Q) and a(u, v) = I V / ч 8 8 V и i s 8 - / ч -Ь ^ L flyW s~ " ■£— v + 2v M*) ^~ u-p + c(x)Mu>dx + Lj=i 0;icj 0xi j=\ 0xJ J A.70 guv dS, Si u,veV. The sesquilinear form a{,) which appears above is bounded in the sense that |o(u,f)|<C||m||v||p|Iv, u,veV. A.8',
796 Я. Fujita and T. Suzuki Chapter 1 Here we note that from the trace theorem (e.g. Lions and Magenes [1968], Mizohata [1973]) the following inequality follows for each e>0 with a constant CE>0. Therefore, by the uniform eUipticity A.5), it holds furthermore that for each <5, 0<3<дъ there exists a AeIR such that Kea{u,u)>6\\u\\b-M\u\\jt, ueV. A.9) If bj-sO, cSsO and V=HUQ), or if bj=0, c>0, a>0 and V=H1(Q), then we may take A=0 in A.9) In this case the variational problem A.3) is uniquely solvable by Lax-Milgram's theorem (see Yosida [1964]). Recall that V* denotes the adjoint space of V. Then, the boundedness A.8) of a{,) implies the well-definedness of a bounded operator A: V-* V* through the identity a(u, v) = v*<Au, v>Y, u,veV. On the other hand, identifying X* with X through Riesz's representation theorem, we get a triple of Hilbert spaces Fclc V* with continuous and dense inclusions. Setting D(A)= {ue V; AueX}, we write the restriction of A: V-* V* to D(A) again by A, which is regarded as an operator in X. Then, A.3) is simply expressed as an abstract equation in X, that is Au=f. A.10) Again, when Q has a smooth boundary or is a convex polygon, it is known that D(A) = H2{Q)nHl(Q) for the boundary condition A.7) and D(A)^H2(Q)n{dv/dv^ + GV = 0 on 80} for the boundary condition A.7') (see Agmon [1965], Lions and Magenes [1968], Grisvard [1985]). Therefore, in those cases we obtain the strong solution ueH2(Q) of A.6) with A.7) (or A.7')), provided that /1 = 0 in A.9). The operator A in X which arises as above from the sesquilinear form a(,) on Fis called an m-sectorial operator in view of the following property (see Kato [1966]). For simplicity, let us suppose 2 = 0 in A.9) and specify the constant С in A.8) as Cj. Taking в in 0<в<^% so that cos 0 = E/Cb we set: Ze = {zeC:Os$|argz|<0}. Then, we have Ze z> a(A) — "the spectrum of A", namely, C\Ze с р(А) = the resolvent set of A. Furthermore, for each 6V в<в1 <jit, the estimate ll(z-A)"lKd5^?)"R' "^{zeCi^lanszK*}
Section 2 Finite element methods for elliptic problems 797 holds, which implies that — A generates a holomorphic semigroup {e~tA}t>0 ш X as is discussed in detail in the next chapter. Here, we recall the following notation, commonly used in the semigroup theory: A densely defined closed operator A in a Banach space X is said to be of type (в, М), 0<#<7tand JW>1, if C\Zecp(A), ЩЛ-АУЦ^М, А<0, ||(A-y4)-1KM,/|A|, %>\arg2.\^e + s>e, with a constant M£>0. In this terminology, an m-sectorial operator A is of type @,M) for some в,0<в<$п. 2. Review of finite element methods for elliptic problems As in the preceding section, let V с X с V* be a triple of Hilbert spaces and let a{,) be a sesquilinear form on V. Assume that a(,) is bounded: |ф,»)|^С||и||к11»|1к. u,veV, B.1) and is "strongly coercive", that is, A.9) holds for X - 0: Rea{u,v)^S\\u\\$, ueV, B.2) where 3>0 is constant. Then for any given feX the variational problem, Find ueV such that is uniquely solvable. The Ritz-Galerkin method is the most classical approximating method for B.3) and consists of the following procedure. Namely, we prepare a family of finite- dimensional subspaces {Vh}h>0 of V with the property that Vh-+Vas Ц0 in some sense. Then, the variational problem, Find the uke Vh such that 4щ,х)=(/,х\ xzv„, B.4) is regarded as an approximation of B.3). The unique solvability of B.4) follows from the same reasoning as for B.3). Let Xh be the space Vh with the topology induced by the inner product in X and let Ph: X->Xh be the orthogonal projection. Then, the variational problem B.4) is equivalent to the equation Ahuh = PJ B.5) in Xh, which is of finite dimension. Let us refer to the stability of the approximate solution uh and to the estimate of
798 H. Fujita and T. Suzuki Chapter I the error eh — uh — u. To this end, we introduce the so-called Ritz operator Rh: V-* Vh through the relation a(Rhu,x) = a(u,x), ueV, z<=Vh. B.6) The well-definedness of Rh again follows from the Lax-Milgram theorem in the same way as for the unique solvability of B.4). IfueVis the solution of A.2), then the approximate solution uh of B.4) is nothing but Rhu. Therefore, we have RhA~1 = AblPh. Furthermore, in the case of symmetric a(,), namely, if a* —a, where a*(u, v) = a(v, u), u,veV, the operator Rh: V-* Vh is the orthogonal projection with respect to the inner product a(,). Thus, in this case, we have a(Rhu,Rhu)^a{u,u) and a(Rhu~u,Rhu-u)^a(x-u,x-u), x^^h- Again by the boundedness B.1) and the strong coerciveness B.2) of a(,) we get the stability of approximate solutions \\Rhu\\v^C\\u\\v B.7) and the error estimate \\Rhv-u\\r^Ciaf\\x-u\\y. B.8) xeVh Actually, in the general case that а* Ф a, these relations also hold true. In fact, from B.2) and B.3) we obtain 51| Rhu\\v ^ Re a(Rhu, Rhu) — Re a(u, Rhu) ^C\\Rhu\\v'\\u\\v and d\\Rhu — u\\v^Rc a(Rhu — u, Rhu — u) = Rea{Rhu — u, x~u) ^C\\Rhu-u\\y-\\x-u\\Y, xeVh, Hence B.7) and B.8) follow. Now, we proceed to the finite element approximation, and set X = L2(Q), V=Hl(Q) or H1(Q). Let feX and a(,) be the sesquilinear form associated with an elliptic boundary value problem in the preceding section. The finite element method is a kind of Ritz-Galerkin approximation for the variational problem B.3). In the present article, we mainly exemplify the method through the simplest finite elements. Namely, in the case that Q is a polygon, we divide it into small triangles with size parameter h>0. Let xh be the triangulation. Then each Texh is called an element. p{T) and o-(T) mean the radii of the inscribed and the outscribed circles of T,
Section 2 Finite element methods for elliptic problems 799 respectively. The family of triangulations {тА}л>0 is said to be regular if there exists a constant v > 0 such that p(DMT)>v, Texh, h>0. Vh denotes the totality of continuous piecewise functions in V which are linear on each Texh. Obviously, VhcV B.9) and Vh is of finite dimension. B.10) For each continuous function ue V, there exists a unique element in Vh whose values at the vertices ofTezh are equal to those of u. That function is called the interpolant of м and is denoted by nhu. The following inequality is due to Bramble and Hilbert [1970]: \\nhu-u\\L2m + h\\V(nhu-u)\\L4T)^Chz\\u\\H2iT) B.11) ueVnH2(Q), Texh. When Q has a curved boundary, we must modify nhu near the boundary. Such a modification has been given by Zlamal [1973], which assures us of the existence of V„ and nh subject to B.9)-B.11). From the general theory developed above the stability B.7) of the approximate solution uh = Rhu and the estimate B.8) of the error eh = uh — u follows. In particular, we have \\Rhu-u\\Hi^Ch\\u\\H2, ueH2{Q)nV. B.12) When Q has a smooth boundary or is a convex polygon, the elliptic estimate М"!Я1я><С||/||у, feX, B.13) holds true. In this case, the following L2 error estimate can be obtained: \\Rku-u\\L2^Ch2\\ul\H2 u<=H2(Q)nV. B.14) In fact, putting eh = Rhu — u, we have \\eh\\l2 = a{eh,A*~1eh) = a(Rhu~u,A*~1eh) = a(Rhu-u,A*-1eh-nhA*-1eh)^C\\Rhu-u\\v\\(nh-l)A*-1eh\\v ^c/i>||H2|M*-4ll*^c*2ll«MKb. The argument here is sometimes referred to as Nitsche's trick (Ciarlet [1978]). Similarly we can show that \\Rhu-u\\L2^Ch\\u\\Hl, ueV. B.15) As we have seen in the preceding section, the relation RhA ~ * = Лл~ 1Ph holds so that B.14) implies \\А^Рк-А~'ЫСк\ B.16)
800 H. Fujita and T. Suzuki Chapter I We should note that this inequality is weaker than B.14) in the case of V=H1(Q) where D(A) involves the boundary condition: D(A) = H2(Q)n{du/dv^ + au=0 on 60}. Henceforth, the family of triangulations {тй}А>0 is always supposed to be regular. 3. Inverse assumptions and their consequences We say that the family {xh}h>0 of triangulations satisfies the inverse assumption when there exists a constant v > 0 such that p(T)^vh Texh, h>0 (see Ciarlet [1978]). Under this assumption, the following inequality holds for WL-m^C/r^llxW), Texh, xsVh, h>0. C.1) We note that its converse always holds true. Furthermore, we have WxWvm^Cb'^zWiXT), Tsxh, XzVh, h>0, C.2) and.hencellVzb^Cfc-Mlzlli.- From these inequalities we can deduce the following propositions: Proposition 3.1. There exists a constant /?>0 such that WAJ^fih-2. C.3) Proof. In the case that a(,) is strongly coercive and symmetric, we have for xeVh that \\All2x\\2x = (Ahx,X) = a(x,xl so that C-'hWv^WAk'hWx^CWxWv, xeVh, h>0, where C>0 is independent of h>0. Therefore for u,veVh we get from C.2) that \(Ahu,v)\ = \(Al'2u,A},i2v)\ ^C\\u\\v-\\v\\^Ch'2\\u\\x\\v\\x, hence C.3) follows. In the general case, we break a(,) as a(u, v) = a°(u, v) + (Bu, v), u,veV, where a°{,) is strongly coercive and symmetric and В: V-* X is bounded. Then, we can write A„ = Ai + Bh,
Section 3 Finite element methods for elliptic problems 801 where Al is associated with a°\VhxVh and Bh = PhB\Vh. The inequality C.3) follows from \\A°h\\<Ch-\ WBah^cwxWv^ch-'Wxh, ievh. и Proposition 3.2. When Q has a smooth boundary or is a convex polygon, the L2 orthogonal projection Ph\X-+Xh satisfies the estimate \\Phv\\v<C\\v\\v, veV. C.4) Proof. Since ||PJx-»x^l> we have only to show that т/\-ОДЬ2<С|Ия1, veV. To this end let Rh: V-* Vh be the Ritz operator associated with a(u,v) = (Vm • V v + uv) dx, u,v e V, Since Ph\vh-identity, we have for % = RhveVh that \\V(Ph-m\L2=\\4Ph-l)(v-x)\\L^\\VPH(v-x)hi+\\V(v-x)h> ^Ch-^Pbiv-tiWLi + WvWv + WxW* <СЛ-1||1?-х1Ь+||»||я. + ||х||Я1. As we have seen in the preceding section, the inequalities Ыя'^Ияь \\X-v\\v^Ch\\v\\Bl hold. Hence C.4) has been established. □ The following lemma is due to Descloux. For the proof, see Тномёе [1984, p. 64], for instance. Lemma 3.1. When {тк}н>0 satisfies the inverse assumption, there exists a constant С > 0 such that for each T0 e xh and a subdomain Q0c:Q with TonU0 = 0, the function veL2(Q) with suppu<=T0 satisfies the inequality lin^no^expt-Cdist^o.Toyr1)!!^. C.5) Lemma 3.1 implies the Lq boundedness of the L2 orthogonal projection Ph: X —>Xh: Proposition 3.3. For Kq^oo, the estimate \\Phh^L^C C.6) holds true.
802 Я. Fujita and T. Suzuki Chapter I Proof. First, we note the identity Ll(Phu,v)L„ = L1(u,Phv)L„ for ueL2(Q) and veL^iQ). If C.6) is shown for q = oo, the equality above implies that C.6) holds also for q = l, and hence for 1 ;%g^oo by Riesz-Thorin's interpolation theorem. Suppose that v is real-valued and let the maximum of Phv be attained on T0, T0exh, For each Texh, we set \v on T, [0 otherwise. Then, v is decomposed as v = YijeZhvT, so that IIJ\»IIl-= 11Р*»11ь-(Го)<1 И^ргНх-чго)- г By virtue of Descloux's lemma, we have ll^*l'rllL«(ro)<C'I||f»frll^(To) «SС/г- 1ехр(-С dist(T0, Г)/г~*)||i>r\\L4T) ^Cexp(-Cdist(T0, T)h~x)\\vT\\L.. Now, starting from JR0 = T0 we define subsets RjcQ, j=0,1,2,..., recursively as follows. Namely, Rk is the union of (closed) triangles in xh which are not in but adjacent to (J, <kRt. Then, the points of Rk have a distance to T0 which is of order к/г. Furthermore, the number of triangles of Rk is bounded by the number of triangles of (JulR„ which is bounded by Ck2. In this way we have \\Phv\\L^ch Y e-a)lNL- \ к TbRu J <c(Yk2e-CkX\vh~ = C\\v\\L,. For other inequalities related to the inverse assumption we refer to Ciarlet [1978]. □ 4. Some L°° estimates We have shown that the Ritz operator Rh: V-> Vk associated with the finite element method described in Section 2 satisfies ||R*K||Hi^C||u||a„ ueV, D.1) and, under the elliptic estimate B.13), also \\Rhu-u\\HJ^Chk-j\\u\\Hk, veHk(Q)nV
Section 4 Finite element methods for elliptic problems 803 for /=0, 1 and к = 1,2. Unfortunately, the L2 stability \\Rhu\\L^C\\u\\Lz cannot be expected. In fact, if this were true, we should have from Rh A ~l = AZ 1Ph that \\AblPhA\\x^x4tC and hence the following type of estimate by taking the adjoint operator: \\AA;'Ph\\x^C, which is impossible because of Vhc£D(A). The following inequality has been obtained by Nitsche [1975] and Scott [1976] under the inverse assumption for the case V = Hl,{Q) and a(u,v) = JflVu • Wv dx. ||**i«||L.<C|logA|||K||b., ueV. D.2) Since Rh\vh— identity, D.2) implies the error estimate \\RHU-u\\L.= \\{Rh-l){u-x)\\L- <C{|logA| + l}||«-Z|k., XeVh. The inequality \\nhu-u\\L.^Ch2\\u\\W2,~, ueVnW^iQ), is known to hold so that we have \\Rhu-u\\L^Ch2\\ogh\\\u\\w,~. D.3) In obtaining this result Nitsche [1975] adopts the weighted norms while Scott [1976] makes use of the discrete Green function. We shall refer to these methods later in Section 16. Concerning the optimality of D.3), we note the following. First, in the case that Qc Ш2 is a ball: G = {x: \x\ <R}, and / is radial, the Dirichlet problem A.1) with A.2) is reduced to a two-point boundary value problem on [0,/?] with singular coefficients. Then, it is known that A2|log h\ is the optimal rate of convergence in L°° norm for the finite element approximation by piecewise linear trial functions for this reduced one-dimensional problem (Jesperson [1978], Fried [1980]). Next, due to Haverkamp [1984], the rate A2|log h\ is optimal generally in the two-dimensional problem. Namely, there exist a domain QcR2,a triangulation {tJ, and a constant c>0 such that for each h with ftjO there exists a ueCo(fi) such that \\Rhu-u\\L^ch2\logh\\\u\\w^. D.4) We should note that the boundedness of the Ritz operator implies its convergence estimate, but the converse does not necessarily hold as is seen at the beginning of this section with respect to the L2 norm. Thus, the boundedness is a problem on its own. A related topic is the discrete maximum principle due to Ciarlet and Raviart [1973]. See also Ciarlet [1978] for relevant works. The discrete maximum principle will be also dealt with in Chapter V of the present article.
804 H. Fujita and T. Suzuki Chapter I Recently Suzuki and Fujita [1986] have obtained by means of Stampacchia's method the inequality \\Rhu\\L.^Cp\\u\\wl.P, ueVnW^(Q) D.5) for p > 2, which may be regarded as a discrete Sobolev inequality. Actually, under the inverse assumption Rannacher and Scott [1982] have shown the estimate \\Rhu\\m,p^C\\u\\wl.P, ueVnW1-p(Q) D.6) for 2<p^ oo in the case that a(u, v)=\Vu-Vvdx with V = Hh(Q) a and Q has a smooth boundary or is a convex polygon. In that case, D.5) is implied by D.6), because of Sobolev's inequality \\v\\L^Cp\\v\\w,,P, p>2. Incidentally, D.5) holds for general a{,) without any elliptic estimates or inverse assumptions. 5. Topics on spectra and fractional powers of approximate operators Let A be the elliptic operator considered in Section 1 and Ah be its finite element approximation described in Section 2. We shall show some properties of the spectra and the fractional powers of Ah which are uniform in h. 5.1. Spectra of approximate operators The following theorem can be proven by the method of Agmon [1965]: Theorem 5.1. The spectrum a(Ah) of Ak is contained in a parabolic region in the complex plane independent of h>0. Further, for each 6, 0<9<jiz, there exist constants C>0 and M> 1 independent ofh such that Ah—C is of type (в, М) in Xh, where Xh is the set Vh endowed with the L2 inner product. Proof. As in Section 3, we split a{,) as a{u, v) = a°(u, v) + (Bu, o), u,ve V, where a0 is strongly coercive and symmetric and В: V-*X is bounded. Let A° be the self-adjoint operator in Xh associated with a°\VhXVh. Then ||(Л-,ф-Ч|М11<-1_. E.1) 1m л Therefore,
Section 5 Finite element methods for elliptic problems 805 \\A°h(X-A°hrh\\xJl+^)w\x^2^\\\v\\x, vzXh. E.2) By means of Heinz's inequality, we have \\А^\Х-А0к)-Цх^Хъ^^—г E.3) On the other hand, the relation Ah = A°h+Bh holds with Bh = PhB\Vh so that Bh: Vh-*Xh is uniformly bounded. Let us introduce positive constants Сх and C2 such that \\Bh\\v^Xh<Cu \\A0h-ll2\\v„^<C2. As we have seen in the proof of Proposition 3.1, these constants can be taken uniformly in h. Then, we have for ve Vh that WiX-Aly'BbVWy^CzWAni-Atr'BbVlx r- Ul1/2 \X\112 v |Im X\ |Im X\ From the identity (k-Al)-\k-Ah)=\-(X-A0hTlBh, the inequality М-|Я|1/2/|1тЯ|<^ implies Xep{Ah) and ll(A-A»)-1z||K<2||(A-^)-1Z||l„ XeVk. E.4) We also have {X-Alr^iX-A^'-iX-AtY'B^X-A,)-1 Therefore, \\^-^rh\\x^\U-Alr4x^h{\\ih+\\Bh{X-Ahy^\\x} 1 «£ |1тЛ| 1 mx+c№-Ah)-ix\\v} у^Ых + ^Ш-А^хЫ 1 |!^{к11х + 2С1С2||(ЛТ2(А-ЛА0)-1х11к} _2_ "|1тЯ| ^lizrrJxh,
806 H. Fujita and Т. Suzuki Chapter I when |Л|1/2 1 ]1тЯ|^2М' from which the assertion follows. □ 5.2. Negative powers of approximate operators We suppose that the sesquilinear form a(,} is strongly coercive. Then, the approximate operator Ah in Xh defined in Section 2 is uniformly of type (в, М) for 0<#<^7Г. Consequently, the negative powers Д-", a^O, are defined through Dunford integrals: Al' = t= \k-'(k-A)-lAk, 2tc/-1 J г Г being the positively oriented boundary of Gei~{zeC; 0x^|argz| ^ к) with 6<01<jn (for instance, see Tanabe [1979]). The following theorem is due to Ushijima [1979]. Theorem 5.2. For each a, 0<a<l, the inequality ||Л-РГу(-1<С1к2« E.5) holds true. Proof. The estimate \\{X-Ahr'Ph-{l-A)-'\\^Ch2, AeGe„ will be shown in Section 7. On the other hand, we have \\(X-Ahy'Ph-{X-A)-'\\ щ hence WiX-A.r'Py.-iX-Ay'w^ch^—^. for 0</?<l. We divide Г into Г1 = {ЯеГ: |А|^й} and Г2 = {АеГ: |А|>Л-2}. Then in dealing with AfPh-A-' = ^=j | + А Г: = 1 + 11, Л-^Л-ЛГ^-^-ЛГ1} dA
Section 5 Finite element methods for elliptic problems 807 we can estimate ,.,Ц,-—«,--ад-«-.*-*-<^ 0 with /?>a, and шике p-"-1+^ dp ft2" = Cxh-2<-°+42tl = Cxh 2a Л-2 with /? < a. Hence E.5) follows. □ Remark 5.1. The estimate UPb-WW^Ch2; 0<«<1, holds by the interpolation theory so that \\ArPh-PhA-'\\^CJi2\ 0<cc<l, holds also true. 5.3. A discrete elliptic Sobolev inequality Let us suppose that the elliptic estimate М-уНя^СЦЛи feX, holds true. Then, interpolation theory yields \\A-*f\\H^C\\f\\L2, feX, for 0^a< 1. By virtue of Sobolev's inequalities II/IIl-<Cp||/|U,.„ feWl-'(Q)nV with p > 2 and \\f\\m.P^C(a,p,Q) WfWtp., feVnH2x(Q), with a > \ and p close to 2, we arrive at H/IL.^C.M1'2^!!,.,, feDiA1'2*') where e > 0. Suzuki and Fujita [1986] have shown that a similar inequality holds for Ah uniformly in h. Namely: Theorem 5.3. Under the inverse assumption, for each e>0, there exists a constant Ce>0 such that ||ZIKCJA1/2+£jdlLJ, ZeVh, E_6) holds true.
808 H. Fujita and T. Suzuki Chapter I Proof. We recall the relation RhA~l=A^Ph in Section 2 and the estimate \\Rhu\\L^Cp\\u\\m,„ ueVnW^(Q), for p>2 of Section 4. Then, we have \\A^Phf\\L^Cp\\A^fUK„ feX, or \\l\\L^Cp\\A-lAhi\\w^, xeVh, Talcing a>-|, we can bound the right-hand side from above by Therefore, the desired estimate E.5) is reduced to for 0</?<^. In the adjoint form, the inequality is given as UlP„A-'h~x^Cf. E.7) In order to prove E.7), let us suppose first that the sesquilinear form a(,) is symmetric: a* = a. Then, the relations D(A1,2)= Fand C-MI/IIk^II^/IIx^CII/II,,, fev, hold. Similarly, the relation C-'Mv^Ul'hWx^CWxWy, xeV„, holds uniformly in h as we have seen in Section 3. Therefore, E.7) holds for fi = \ by virtue of Proposition 3.2. Inequality E.7) is obvious for /5=0. Therefore, it holds for the full range 0</i^i by the interpolation theory. As for the general case, we set a° = j(a + a*). Then the m-sectorial operator A° in Xh associated with a0\VhxVh is self-adjoint so that IKAjyp^r'KC holds for 0</?s$i. On the other hand, the inequalities WAliAtr'w^c,, IKAJVUjT'KC, are obtained for 0 < j? < \ according to Kato [ 1961 a]. Here, Cp > 0 is independent of h > 0 and the proof has been completed. □
Chapter II The Semigroup Theory and the Finite Element Method 6. The evolutionary problem and the semigroup of operators The purpose of the present chapter is to study the finite element method for initial value problems along the line of the semigroup theory, namely, the exponential functions of operators. Here in this section we describe the way in which the semigroup is applied as an abstract method in dealing with initial value problems for parabolic and hyperbolic equations. As in the preceding chapter, let us suppose again that Q is a bounded domain in U2 with boundary dQ smooth or a convex polygon. The most typical initial value problem for a parabolic equation is composed of the heat equation du/dt = Au, £>0, xeQ, F.1) the initial condition «l( = o = «oW, *еЦ F.2) and the boundary condition. To fix the idea, let the boundary condition be the homogeneous Dirichlet condition u = 0, t>0, xeSfi. F.3) It is well known that the initial value problem (to be exact, the initial boundary value problem) is uniquely solvable if the initial function u0 is not very singular. Furthermore, even if u0 is not continuous, for instance, if u0 e L2{Q), then the solution u(t, x) is quite smooth for t > 0. This fact is sometimes referred to as the smoothing property of the heat equation, and is reflected by the semigroup of operators arising from the initial value problems for parabolic equations that we now start to describe. We regard X = L2(Q) as a (complex) Hilbert space and denote — A associated with the boundary condition F.3) by A, as in the preceding chapter. Then, D(A) = H2(£2)nHh(Q) F.4) and the operator A : D(A)-+X is self-adjoint in X. By means of A, the initial value problem F.1)-F.3) is formulated as an abstract Cauchy problem for the unknown 809
810 H. Fujita and T. Suzuki Chapter II function u: [0, oo)-»X Namely, du/dt + Au = 0, t>0, F.5) with m@) = mo, F.6) If we introduce e~'A, the semigroup (of operators) generated by —A, then the solution of F.5) and F.6) is simply given by u{t) = t~,Au0. F.7) A general theory concerning generation of e~tA for given A has been established by Hille and Yosida (see, for instance, Yosida [1964, Chapter 9] and Kato [1966, Chapter 9]). However, since the present A admits of a complete orthonormal system of eigenfunctions, we can define e~M through the eigenfunction expansion. Namely, let {<?„}"= о be the complete orthonormal system of eigenfunctions of A associated with the eigenvalues {Я„}"=0; A<Pn = h<P», {<Pn, CPm) ~ («Pi., 9т)^(П) ~ <W Then any function v in X can be expanded as CO 00 v = E (v> Фп)<Р„ = £ P„<P„, F.8) и=0 л=0 where p„ = (v,<p„),n = 0,1,2,.... Now, the bounded operator e"'A in X may be defined by 00 e"M»= X ke-Un<f>n, t>0 F.9) for v represented as in F.8). Actually, from F.9) we have CO CO CO ^1 IW2=IN2- n = 0 This implies that e~tA is a contraction; ||e"MKl. F.10) Since e~tAcp„ = e,~an(pn from F.9), it is easy to see that e~tA-e~sAv=e~(t+s)Av for any veX. On the other hand, e~tA\t=0 ~ I (the identity) is obvious. Thus the family of operators {e~tA}t>o has the following semigroup property: U(t)-U(s)=U(t + s), t^O, s^O, U{0) = I. In view of F.10), we can say that {e~'A}!>0 is a contraction semigroup. Moreover, it
Section 6 Semigroup theory 811 is easy to see that e~tAv is continuous in t, particularly, z~,Av^>v as t-*0 + . In other words, {e~M}1>0 forms a strongly continuous semigroup, which is referred to as a (C0) semigroup. The condition for v in F.8) to belong to D{A) is £ |/U,!2< + «>, F.П) л = 0 which is equivalent to the convergence of 00 11 = 0 in X. By virtue of 0<Яе-'я=AД)(гЯе-'я)< 1/t, F.12) it is easy to see from F.11) that e~'Av belongs to D(A) for any veX and £>0. Actually we have 00 ||Ле-'^||2=||£ /?„Япе-'>в||2 п = 0 со = Е 1/У2|Апе-'я»|2 1 и = 0 1 hence it follows that \\Ae''A\\^~, t>0, F.13) which implies a certain smoothness in x of e Mt>. Finally, for t>0 we have d -,a d —e Mt;=— dt dt и=0 и=0 Namely, if we set u(t) = e~'Av, veX, then we have &v/<\t= -Av. In view of this we see that F.7) holds true, i.e., e'tAu0 solves the abstract Cauchy problem F.5) and F.6). From F.13) it follows also that \\A2e-tA\\ = \\Ae-ul2-Ae-,AI2\\^\\Ae-lAI2\\2^4/t2. Thus e~'A carries any veX into D(A2). Similarly, we can show that for any ne№ = {1,2,3,...} and t>0, A"e~'A is bounded and is subject to \\A"e-tA\\^MJtn, t>0, F.14) for some constant M„. Consequently, for any veX, u{t) = e~'Av is differentiable any
812 Я. Fujita and T. Suzuki Chapter II times and we have (d/dt)"e~'Av = {-A)"e~tAv, From what was stated above, we notice the smoothing property of e~tA for t>0. Also we note that the argument above can be applied with the same result to the case where the boundary condition F.3) is replaced by the Neumann condition 5ц/Эп=0, F.15) We proceed to the general case where A is the L2 realization of jtf = s/(x,D) —Х^х&кьлх>ъ+с{х) (блб) introduced in Section 1. As for the boundary condition, we adopt either the Dirichlet condition F.3) or the Robin condition -^-u + ffu=0 F.17) in A.7). Then the operator A : D{A)-*X is defined through the sesquilinear form a{,). By the argument in Section 1, A is an m-sectorial operator: ff 2 Q ~Q~ 2 g "J a(u,v)= < £ au{x)— u~v+ £ bj(x)~uv + c{x)uv>dx i, 7=1 °Xj OX; /Г-j OX; J Q 9Xj- ЙХ; •=! SXj- auvds, F.18) + Bfi where u,veV = H1{Q). Actually, О{А) = {иеН2(£2):ди/ду^ + аи = 0 on 60} F.19) as indicated in Section 1. When we consider the case of the Dirichlet boundary condition и=0 on 80, the last term in F.18) disappears, for we take V = #o@). Then D{A) = H2{Q)r>H10(Q) = {u e H2(Q): и = 0 on 60}. F.20) By considering u(t)=e~au(t) instead of u, we replace A by A + XI for some suitable XeU. Therefore, we may assume that Re a{u, u) Ss д \\ u\\l, ueV, F.21) holds true. Hence follows that || e-M || ^ 1, namely, e~'A is a contraction semigroup. Thus, for any w0eX = L2(O), u(t) = e~,Au0> t^O, solves du/dt + Au = 0, t>0, ф) = и0. F.22)
Section 6 Semigroup theory 813 In other words, u = e~'Au0 gives the unique solution of the initial value problem for the equation ди/dt + s/(x, D)u = 0 F.23) with the corresponding boundary condition and the initial condition u@, x) = u0(x). Furthermore, e~'A is a holomorphic semigroup in the sense that \\A'e~tA\\^MJf, t>0, F.24) holds for each a > 0 and with M„ > 0 depending on a. As to the details of this fact, we refer to Yosida [ 1964, Chapter 9] or Kato [ 1966, Chapter 9]. However, in view of an analogous argument in dealing with the finite element approximation in the Section 7, we here indicate an outline of the proof. First of all, with a constant c = co>0, a(,) satisfies A.8). Namely, \a(u,v)\^c0\\u\\y\\v\\v. F.25) Then, defining a sector Z0 с С by Zo = {zeC:O<|argz|<0o} for в0 with 0<в0< jtt and cos в0 = д/с0, we see that the numerical range v(A) of A is included in Z0. That is, v(A)= {{Au, u) = a(u, и): ueD(A),\\u\\ = l}cl0. F.26) From F.26), it follows that C\E(, is included in p{A). Furthermore, if we fix в1 with 0<в0<в1<{п and assume that 01<|argz|<7t, then we have «(г-ЛГЧК^-Тя — Д F.27) sm(e1-e0) \z\ by the following consideration. In fact, if we put d(z) = dist(z,5Z0)=inf{|z-C|:Ce£o}, then d(z)^|z|sin@i-0o)- On the other hand, |z-(AM,u)/|[u||2|^dist(z,v(^))^dist(z,Z0)==d(z) yields \((z-A)u,u)\^d(z)\[u\\2, ueD(A), F.28) hence ||(z — A)u\\ ^d(z)\\u\\ and F.27) follow. A useful consequence of F.27) is the
814 H, Fujita and T. Suzuki Chapter II following integral representation (Dunford integral) of e~'A: 1 2m (z-A)-le~,xdz, F.29) where Г is the boundary of with the sense that it starts from + ooe1*1, passes through the origin and goes to + ooe"'. Obviously, we may deform Г to some extent in C\I0 without affecting the integral representation F.26). Note that F.29) is a kind of Cauchy integral representation. In fact, it follows that „ 1 2ra z"(z-i4)~1e-'zdz. F.30) By virtue of F.27), we can deduce F.24) as 2n dr sin@! — 60) r = 1 1 j Xa~lQ-lcosei&X. ж t" Before concluding this section, we note that, in terms of e tA, the solution of the inhomogeneous equation du/dt + Au=f(t), O^t^T, F.31) with m@) = u0 is formally given by Г u(t) = e-lAu0+ \s-('-s)Af(s)ds, F.32) where /: [0, T]-* X is a given continuous function. For instance, u = u(t) in F.32) is shown to be the strong solution of F.31) in the case that / is Holder continuous. Remark 6.1. The hyperbolic equation can be dealt with by means of a semigroup, which is strongly continuous but not holomorphic. This will be described in Section 15 when we discuss approximation for hyperbolic problems.
Section 7 Semigroup theory 815 7. Semidiscrete approximation for parabolic equations In this section we consider the finite element approximation applied to the initial value problem for the equation du/dt = ^{x,D)u, t>0, xeQ, G.1) where л/ is the elliptic differential operator given by F.16). All assumptions made in Section 6 concerning Q are kept here. The boundary condition is either the Robin condition F.17) or the Dirichlet condition w = 0, r>0, xeQ. G.2) To fix the idea, we describe our results for the case of G.2) for the time being. The initial value и0 = ц0(х) is taken from L2(Q). Thus in terms of the operator A introduced before, the problem to be approximated is written as F.22) which we reproduce here: du/dt + Au = 0, t>0, u@) = mo. G.3) We denote this initial value problem in X=L2(Q) by (IVP). Since the solution и of (IVP) is given by u = e~tAu0, we may say that our aim is to approximate e~'A. The approximation which is discussed here is the so-called semidiscrete finite element approximation. Namely, we discretize the space variable xeQ according to the procedure of the finite element method, while the time variable is dealt with as a continuous variable. Actually, we triangulate the domain Q, which is now assumed to be a convex polygon for the sake of simplicity, and introduce the finite-dimensional space Vh in accordance with Chapter I. All notations made in Chapter I remain unchanged. Thus the family of triangulations [zh}h > 0, h being the largest diameter of triangles in xh, is assumed to be regular. We introduce the inverse assumption when necessary. Unless otherwise stated, Vh is composed of elementwise linear functions vanishing on the boundary, for we are considering the case of the Dirichlet boundary condition. The semidiscrete approximate solution uh is the function uh: [0, oo)->Vh determined by the following conditions: d —(uh(t), vh) + ah(u„(t), vh) = 0, vhe Vh, G.4) at for t>0 and {uh@),vh) = (uo,vh) vheVh. G.5) Here (,) is the L2 inner product and ah(,) is the restriction of д(,) onto VhxVh. Recalling that the operator Ah :Vh->Vh is defined by MVh, wh) = (Ahvh, wh)LHn), we can rewrite G.4) as duh/dt + Ahuh = 0, £>0. G.6)
816 Я. Fujita and Т. Suzuki Chapter II On the other hand, by means of the L2-orthogonal projection Ph:X~*Vh, the condition G.5) is equivalent to uh@) = Pkuo. G.7) We denote the initial value problem G.6) and G.7) for uh by (IVP)A. The solution of (IVP)fc is given by uh~e~tAhPhu0, t^O, G.8) although for practical computation one sets «*(*)= I ^(Оф!Л t>Q, G.9) j'=i with the standard basis function cp[}),j= 1,2,..., N, and determine £,){t) through the ordinary differential equation Mh{d/dt)l + Khl = Q, G.10) where <J =(^1,^*,..., ihN)T is an iV-vector-valued unknown function, and where Mh and Kh are NxN matrices defined, respectively, by Mh=(Mu(h)), Му(А) = (<р«,ф«)Ь,- G.11) and K„ = {Kv(h)), K,j{h) = ak{q>f,(piJ)). G.12) Since our analysis here is an abstract one, we mostly use the operator-theoretical representation of G.8) for the semidiscrete approximate solution uh. Thus we have to study operator-theoretical properties of Ah in the subspace VholX = L2(Q), which are uniform in h. In this direction, we have already shown Theorem 5.1 in the preceding chapter. However, we note that the crucial fact can be easily seen from a consideration of the numerical range as in the case of operator A. Actually, we firstly note that e~tAh is a contraction semigroup and that the numerical range v(Ah) = {{Ahvh, vh): vhe Vh, || v„ \\Li = 1} of Ah is included in the sector E0 of the preceding section, for (Ahvh, vh) = ah(vh, vh) = a(u, u). Therefore, by the same argument which has led to F.27), we have И^-^ГЧК-т-т^Ц-тп GЛЗ) sin^i-flo) \z\ for zeC subject to 0o<0i^|argz|^Tt. Moreover, we have Q~,Ah = hx jV-^rV^dz, G.14) r
Section 7 Semigroup theory 817 where the path Г is the same as in F.29). Consequently, we can write 1 2ra {z-Ahr'Pkc-'*uz. A corollary of G.14) is the estimate WAlt-^l^MJf, t>0, a>0, G.15) with Ma independent of h. The main result of this section is the following theorem which asserts that the error || uh(t)—u(t) || t2 in the semidiscrete approximation is of the optimal order 0(h2) for t > 0 even if the initial function u0 is not smooth, which may be regarded as a consequence of the smoothing property of parabolic equations. Theorem 7.1. Let и and uh be the solutions of (IVP) and (IVP)b respectively. Then for any u0eL2(Q), \\uh(t)-u(t)\\H^C(h/t)\\u0\\L2, t>0, G.16) and \\uh(t)-u{t)\\L^C{h2/t)\\u0\\L,, f>0. G.17) Remark 7.1. The proof below is adapted from Fujita and Mizutani [1976], while these estimates have been obtained for the case of self-adjoint A by Helfrich [1974] without resort to "complex analysis". See Section 12 concerning Helfrich's method. Before going into the proof of the theorem, we have to make a little more refined consideration of the numerical range of the operators A and Ah, or more exactly, the numerical range of the sesquilinear forms a(,) and ah{,). We recall that the acute angle в0 was defined by cos 60 = d/c0 with the constants S and c0 in F.21) and F.25), respectively and that 0^ is another acute angle with 0o<^i <£я. Now we choose a positive number d' such that cos #! <Eyco<<S/co=cos0o and define the angle в' by 0o<0'<0i, cos0' = E'/co. Also we put y0 = 6—S'. Then from F.21) and F.25) we have Rea(v,v)-y0\\v\\2Hi ^'IMIi.^--co||»||J.^cos0'|a(u,p)|, veV. G.18) Co Therefore, if (is a point in the numerical range of a(,), namely, if С=Ф, v)/1| v ||2 for some veV, then we have ReC-yoM»)>cos0'|C|, G.19)
818 H. Fujita and Т. Suzuki Chapter II where i4v)= IMIh'/NI2- From this, we see that £ = £(«) is contained in the shifted sector r' = {z: |arg(z — y0n(v))\^6'}. Now we have: Lemma 7.1. There exists a positive constant bx such that for any veV and for any z in Zi — {z: 0!^|argz| ^n}, the inequality \z\\\v\\b+Mh<8i\z\\v\\b-<tv,i>)\ G-20) holds true. Proof. We note that the distance dist(z, I') between z and the shifted sector above satisfies dist(z, Z')^\z\ *т{в х-в')+ ф) sin 9'. Consequently, we have \z\\v\\h-a(v,v)\ = IN2^-C(»)I^Nbdist(z,r) >\\v\\h{\z\sm{e1-e')+B\\v\\%i/\\v\\h)ainff} with ei =(min{sin@1 —в'), sin^'})-1. This proves the lemma. □ Proof of Theorem 7.1. Writing /=u0 for the time being, we put w = (z — A)~ 1fand wh={z — Ah)~1Pkf for zeF, respectively. Then the error eh(t) = uh(t)—u(t) is represented by e*(t) = i -w}dz. G.21) On the other hand, w and wh satisfy (zw-Aw, q>)L2 = (/, cp)L2, (peV, and (zwh ~Ahwh, cph)L2 =(/, (ph)Li, q>h e Vh, respectively. Namely, z(w, <p)L2 - a(w, (p)Lz = {f, <p)L2, G.22) zK, (Рикг-Фк, <?h)is- =(/. Vh)*- G.23) Since Vh с V, we have from G.22) and G.23) z{w-wh, cph)L2 +a(w-wk, q>h)L2 = 0, <pk e Vh. G.24)
Section 7 Semigroup theory 819 Henceforth, we write rh = wh - w. Then by virtue of G.20) and G.24), we have И Ы\Ь + ||r»|||. <*! |z||rj|£2 -a(r„, r»)| = (?! |z(rA, w-^Ь -a(rh, w-Xa)| for any X/i e КА. Furthermore, by F.25) we have Ы\ЫЬ + \Ы2ю <<51{(Z|1/2||rJ|L2-H1/2||w-xA!l^+Co||rh||H.-||w-zA^||H.}. G.25) On the other hand, we recall inf \}w-Xhh^Ch\\w\\H,, inf \\w-Xh\\H^Ch\\w\\B2. XheVh In order to estimate ||w||Hi, we use G.20) again, obtaining \A \H\h + W\h ^ i((z-x)w, w)L2| ^ n/iii2- \\w\b, G.26) hence it follows that Wllwiib^H/bllwlb, \W\l^CU\\L2/\z\. G.27) Substituting G.27) into G.26), we obtain \\w\\2H1 sgC||/||b/|z| and end up with MlH^CH/llWIzp. G.28) Estimation of ||w||H2 is made as follows: \\w\\ai^C\\Aw\\L, = СР(г-Л)-1/Их.» = C||((/l-z) + z)(z-^)-1/!|I.a ^cwfWv+WMz-ArVW» ^C\\f\\L*. Here we have used F.27). From what we have obtained above, we get inf llw-fcH^CAH/lb/N1/2, XheVh inf ||w-ZJ|H.<a||/||L3. Substituting these into G.25), we have This yields \z\\\rh\\h + \\rh\\h^Ch2\\f\\h. G.29)
820 Я. Fujita and T. Suzuki Chapter II Hence we have [\rh\\Hi <CA||/||L2, and in view of G.21), we have Н(Мв*<-2к !e-t2l!K-W||Hl|dz| -^•2|e-'S"'-C/i||/k2dr = Cft||/||LJ/t = Cfc||ttollWt. This proves G.16). To obtain G.17) we need which is deduced from G.24) by Nitsche's trick as follows; namely, for g e L2{Q) we put v^{z-A*)~lg, vh={z-At)-*Phg, and note j|rft||L2<sup—— = sup- leL* llSMll2 g*l? US'1112 and, moreover, \z(rh,v)L2-a(rh,v)\ = \z{rh,v~vh)Ll-a{rh,v-vh)\ ^\z\\\rh\\v\\v-Vh\\L2+C0\\rh\\Hi-\\v-Vh\\Hi ^C(h\z\ilz\\f\\LI'h\\v\\Hi+h\\f\b'h\\v\\m) < Ch2{\z\1'2 \\f\\L> ■ \z\ - W \\g\\L2 +1|/||t2 1ЫЫ, because the estimates obtained above for A and Ah hold also true for their adjoints. Thus G.30) has been verified. Substituting it into IM*^(VteIK-wb|dz|, r we immediately obtain G.17), which establishes the theorem. □ Remark 7.2. Similar results hold true for the case of the Robin boundary condition, particularly, for the Neumann boundary condition. Remark 7.3. If we are concerned only with the convergence, we can relax the assumption on Q considerably.
Section 8 Semigroup theory 821 Remark 7.4. The inequality G.30) is represented as Ш-ЛкГ1Рк-(Х-А)-Ц^Ск2, XeGtl. G.31) 8. Fully discrete approximation for parabolic equations In this section we consider approximation methods for (IVP) G.3), where the space variable x e Q is discretized just as in the preceding section and the time variable t is discretized by a uniform mesh, namely, as t = m, т>0, и=0,1,2,— For the sake of simplicity, we approximate du/df in G.3) by the simplest backward or forward difference quotient. Thus the approximation which we are going to discuss might be called the difference finite element approximation. 8.1. Backward difference approximation Firstly, we deal with the following approximation with backward difference for the time variable: uUt + z)-u№ , л (8.1) \-Ahul(t) = 0, t = m, n = 0,l,2,..., т i4@) = P„u0, where Ah and Ph mean the same as before. Thus the approximate solution u\ is a function from the discrete time t = m into Vh. Obviously, it is expressed as ul(t) = (I + rAhynPhu0, t = m, (8.2) although for actual computation one discretizes the time variable in G.9) and G.10). The approximate problem (8.1) is denoted by (IVP)?,,,. Our main concern here is to study the rate of convergence of u\ to u. However, before doing so, we mention the stability of (IVP)?,,,. Theorem 8.1. The fully discrete approximation (I VP)?,h with backward difference in t is absolutely stable. In fact IWWIlL^KIb t = m. (8.3) This theorem is obvious since {I + xAh)~l is a contraction because of Re(Ahvh, vh)Li =Re a{vh, vh)^0. As to the rate of convergence, which reflects the smoothing property of parabolic equations, we claim: Theorem 8.2. Let e\(t) = u\{t) — u(t) be the error for (IVP)?,,,. Then we have \\ei(t)\\L^C{h2+т)\\и0\\ьгЦ, t = m, « = 1,2,... (8.4)
H. Fujita and T. Suzuki Chapter II 822 Proof. Putting sa) = uk(t)-i4t) = e~,AhPhu0-e~tAu0, e<2) =(I + zAh)-"Phu0 -e-lA*Phu0 = K\Phu0. We notice e\ =eA) +eB). According to Theorem 7.1, ||eA>KC/i2||u0|!/t, t>Q, (8.5) while we can estimate \\КЦ\ = \\(I + zAhy — e~Mh||, as follows, to obtain \\Ki\\^Cr/t, f = nt>0. In fact, we can write (8.6) Kl = —{{I + sAh)-"e-n(z-s)Ah)us = n sA2h(I + sAh) -n— 1 ~-n(r-s)Ah ds = n sA3h'2(I + sAhy{n + 1)- All2e-"ix-s)Ah ds and use the inequalities \\A*h(I + sAhyk\\^C,(ksy°, fc>a>0, s>0, (8.7) which is an analogue of G.15) and can be proved by means of the Dunford integral (for details, see Fujita and Mizutani [1976]). Namely, we substitute ||s^2(/ + sA»r(" + 1)KC((n + l)sr3/2 ^C(nsy312, \\Alh12 e""(t"sMh|KC(n(t-sr1/2 into the integral representation of K\ above and obtain WKU^n s-C{ns)-3i2-l-C(n(x-s))-ll2ds = -|s'1/2(T-S)-1/2ds=C" n J nx о = Cz/t,
Section 8 Semigroup theory 823 which proves (8.6). Hence, in view of \\Ph || = 1, we get \\eB) || < Ст||и01|/£, which yields (8.4) when combined with (8.5). □ 8.2. Forward difference approximation We proceed to the forward difference approximation and define the approximate solution ul:t = nx>-*Vh by ul(t + x)-ul(t) -+Ahul{t)=0, t = nx, n = 0,1,2,..., (8.8) T uj,@)~PhUo. (8.9) The discrete initial value problem (8.9)-(8.I0) is denoted by (IVP)^. Formally, the solution of (IVP)f%fc is given by uW)=(I + rAhfPhu0, t=m, «=0,1,2,.... (8.10) Like in the usual difference approximation, the stability consideration is more crucial with the forward difference scheme. In this connection we claim: Theorem 8.3. Suppose that the condition tMJ<2cos0o (8.11) is satisfied as т,«->0, where 60 is the acute angle defined by cos в0 = 5fc0. Then (IVP)j?\h is stable. Precisely we have ||ul(t)||y<2||u0||y, t = nx, «=0,1,2,.... (8.12) Under the inverse assertion, in particular, (IVP)f h is stable ifx/h1 is sufficiently small. Proof. Putting Szh = I — xAh, we consider the numerical range of Srh. Let <ph e Vh with \\<ph\\L2 = l. Then (Slq>hr<ph)L2 = l-xt with (, = {Ah<ph,(p„)L2=a((ph,(ph) is seen to satisfy \(Sl<ph, <pft)| ^ 1 in view of Kl = \<AHq>k, (ph)Lz\^x\\Ah|| <2 cos 60 and |arg(rC)| = |arg CI ^80. Thus according to a theorem on the numerical range of iterated operators (see, for instance Kato [1966]), the numerical range of {Slf stays in the unit disk, which implies ||№)"И2, n = 0,l,2,.... This gives (8.12), since ul{t) = {S\fPhu0. Finally, (8.11) is obvious under the inverse assumption for small x/h1, since \\Ah || < C/h2 holds true then. □ As to the rate of convergence of (IVP)ftfc, we have:
824 Я. Fujita and T. Suzuki Theorem 8.4. Assume in (IVP)£ft that the condition sup t\\Ah\\<2 cos90 t,ft->0 Chapter II (8.13). is satisfied (which is a little more stringent than (8.11)). Then for the error e(t) = ul(t)--u(t)^(I-xAh)nPhu0-e-tAUo, t=m, n = l,2,,.. we have \\e(t)\\L^C(h2+x)\\u0\b/t. (8.14) Proof. We again split the error as e(t) = ew + sB) with £<1» = е_м,,Рй«о-е"мИо and eB> =(I-rAhTPhu0 ~z-tAhPhu0 =KlPhu0. (8.15) Since ||£A)||L2^C/22||M0||i.2/t according to Theorem 7.1, it suffices to show \\Kl\\^Cx/t, t = nz, n=l,2,.... (8.16) We can derive the integral representation of Кjj as -K\ = — ((I~sAhfe-"(t~s)Ah)ds sAi(I-sAh)n-1e-ntx-s)Ahds n sAll2(I-sAhfl-A1hi2e'"{x~s)A,'ds. In order to estimate the integral above, we make use of G.15) and its analogue \\At{I-sAhY\\^C<x(ns)-; a>0, 0<s^t, (8.17) to be proved below. We then have \\Kl\\^n s-C'((n-l)sr3l2-C(n(T-s)yl/2ds = Cnll2(n-iy3i2 s/2(T-s)-1/2ds <си_1 = а/т,
Section 8 Semigroup theory 825 which yields the required estimate of ||eB)||L2. Finally we give the proof of (8.17). We choose positive constants к and pi such that t||AJ|A + k)^^<2cos01, (8.18) which is possible because of (8.13). Then we introduce a positively oriented contour Г which is composed of the following two portions (as sets): rw = {re±m:0^r^R}, ГB) = {Кеш: -в^в^в,}, where R=fi/s. We put Fn(A) = (nl)x(l —X)" and represent Fn{sAh) by the following Dunford integral: "v h' 2ra F„(^)B-^)-1dz=^j(/<14/B)), where IU) means the contribution to the integral form Ги\ ./'=1,2. For z = re±l9lefA) we have \l-sz\2 = l+s2r2-2srcos6u 0<sr^/i. Since /i<2 cos 6X, there exists a positive constant у which depends only on вх and /л such that |1— sz|^l— ysr, O^sr^n. Thus we have UA)li^C dr (nsr)*(l-ysr)" — r <c {nsyft'yasy—=C r f ^""«d^Ci1». о о On the other hand, for z = ReieerB\ we have 1 1 (*-Л)" И l-MJI/W 1 1 1 1 Л l-MJ/Л Я 1-1/A +к) _1 1+к ~Я к ' |l-sz|<|l-/teifll| = (l+A«2-2/icoseiI/2s^1<l. (8.19)
826 H. Fujita and T. Suzuki Chapter II By virtue of these estimates, we have »2 ||/B)||<C UnsRYSI^—^-Rde ^С(п(хТ51^С?\ since rfd\-+§ as n-»oo by 0<<51<1, Combining this result with (8.19), we get to \\F„(sAh)\\ s£ Ся which implies (8.17). П 9. Approximation for mhomogeneous equations Let us consider the semidiscrete approximation applied to the inhomogeneous equation du/dt + Au=f(t), O^t^T, (9.1) with the initial condition u@) = u0eL2(Q) = X. A standard smoothness assumption for /: [0, T\-*X is the Holder continuity. Namely, we assume that rn II/(*)-/(*) II l' „ , ,0~ Ule= sup ——j <+oo (9.2) I,ss@,T] 1Г SI for some в, O<0<1. Then a unique solution иеС1+в([0, Т]->Х) exists and is expressed by u(t) = e-'Au0+ e-('~s,V(s)ds (9.3) о as mentioned in Section 6. Now we define our approximate solution uh: [0, T}-*Vh by duh/dt+Ahuh = PHf(t), O^t^T, (94) u„@) = PAuo. Then uh can be expressed by t г u„(t) = e-tAhu0+ e-(,-s)AhPJ(s)ds. (9.5) о Our aim is to estimate ||мА(г) —u(t)||. To this end, let us introduce the error operator Eh{t) = Q-tAhPh-t-lA (9.6) acting on X. First of all, we note
Section 9 НВДК2, Semigroup theory 827 (9.7) since e tA and e ,Ah axe both contractions under our assumptions. Furthermore, according to Theorem 7.1, the estimate \\Eh(t)\\^Ch2/t, t>0, holds true. We split the error eh(t) = uh(t)—u(t) as follows: eh(t) = Eh(t)u0 + E„(t-s)f(s)ds = еA)@ + £B)(£) + £C)(^ where ea\t) = Eh(t)u0, t £<2)(t)-k(t-s)(/(s)-/@)dS, (9.8) £<3>@- £„(t-s)/(f)dt. Then obviously we have \\^(t)\\^ch2\\uj/t, r о ^ch2me (t-sy^t-sfds 0 !,2г/ *в Incidentally, here and hereafter we may write simply || • || for || • ||L2 when no confusion arises. In order to estimate ||eC,(r)||, we compute eC)(f)= \e-(t's)Af{t)ds~ \e-(,-s)AhPhf{t)ds = (A^e-tA*Ph-A-^-tA)f{t)-{A^Ph-A-l)f{t)
H. Fujita and T. Suzuki Chapter II 828 and note that \\А^РИ-А^ЫСР (9.9) according to Section 2. Thus it is only necessary to estimate the operator norm of AZle~tAhPh-A- 1 2тп (9.10) z-1e-'z[(z-^)-1JP(l-(z-/l)-1]dl Here Г is a positively oriented contour which is obtained by deforming the original one in the following manner: Г = Га)иГт with ГA) = |-е,в:01<|0|<я|, rB) = jre±iei:-Ur< + oo Let Iu\./=1,2, stand for the contributions to the integral in (9.10) from ru\j = 1,2, respectively. Then we have In t by virtue of G.25). Furthermore, we have ||/B)K^--2 i-e-*'cos9'-Oi2dr In J r = Ch2 _e-pcoSe,dp = C/j2 P Thus we have !|eC>(t)KC/22||/(t)ll- Summing the estimates of Це0'^)!!,/=1,2,3, we have: Theorem 9.1. As to the error eh(t) — uh(i) — u(t) committed by the semidiscrete finite element approximation for the inhomogeneous initial value problem (9.4), we have (9.11) iMt)ii<c/i2(^+m/+ii/(£)ii 10. Remarks on approximation with higher accuracy As in the case of elliptic boundary value problems (e.g., Ciarlet [1978]), one may use a more sophisticated finite element space Vh for V and try to obtain approximate
Section 10 Semigroup theory 829 solutions with higher accuracy. In order to have a look into this direction, we again consider the same (IVP) du/df-Mu = 0, г>0, м@) = мо A0.1) as above. We study its semidiscrete approximation (IVP),, duh/dt + Akuh=0, uh@) = Phuo A0.2) of the same form as in Section 7, but assume that the finite-dimensional subspace Vh has the following properties: there exists an integer s^2 such that if veH%Q)nV, then for some vh in Vh we have Wv-vJ^Of-^M.-!, (Ю.З) 11»-р»11я^СА,~11И„ (Ю.4) where С is independent of v and ft. If s > 2, we suppose that the conditions A0.3) and A0.4) with s replaced by 2 hold true as well. Let us confirm that all assumptions on A and a{,) remain unchanged. Then we have: Theorem 10.1. As to the error eh(t) = u„(t)-u(t)=<i-,A*Phu0-e-uu0, we have H(t)~u(t)\\L2^c(^jJ\\u0\\L2, t>0, A0.5) for any u0bL2(Q). Concerning the details of the proof we refer to Fujita and Mizutani [1978]. Here we just mention the outline. The first step is to show, for w = {z—A)'1/ and z with вг ^ |arg z\ ^ jc, the inequality \МЛ\А112\Ы1-1 + ---+\А*,2Ы\о <C(||/||s_2 + |Z|1/2||/||s_3+- + |z|(s-2)/2||/||0), A0.6) where || • ||^ stands for || • ||я> J=0,1,..., s. As the second step, we recall that G.20) still holds true, i.e., \2\Ы1 + Ы\\^Ь^Ы\1-Ф^)\ A0.7) for any veV. Making use of A0.3), A0.4), A0.6) and A0.7), we can show that if feHs-2(Q), then we have for r„ = (z- A)~xj'-(z-Ah)~lPJ that IMo<CftWL~24^l1/2!l/L-3 + ---4#-2)/2imio)- (Ю.8) By means of A0.8) we can estimate the integral below which expresses the error
830 ФУ- ( Thus = uh(t) e„« = -u{t): ltd) г we obtain \\uh{t)- ""{(г-, -u(t)\\^Chsrh H. Fujita and T. Suzuki Chapter II {{2-А)^-{2-Ан)-1Рн}и0йг. ]"oL-.2 + r1/2||u0L_3 + - + r(s-^2|luo!lo) for u0eHs~2(Q). In order to relax the assumption u0eHs~2(Q) and obtain A0.5), we follow the idea due to Helfrich (see Section 12 and Fujita and Mizutani [1978]). As an example of the fully discrete approximation with higher accuracy, we consider the Crank-Nicolson scheme which defines the approximate solution u\{t), t = nz, n~0,1,2,..., by ^+T)-t4@+KMUt + T)+Hx(t)=Q A09) T with uxh@) = Phuo. Consequently, u\ is expressed as u№) = {l-bAhT{I + hAhTnPhu0. A condition for the stability is sup т|ИЛ< + оо, t,ft-0 which is always met if the inverse assumption is satisfied and xjh2 stays bounded. Moreover, let us assume that Vh is subject to A0.3) and A0.4) with s = 4. Then we have: Theorem 10.2. Under the assumptions stated above, the error el = ul(t) — u(t) committed by the fully discrete Crank-Nicolson approximation is established as 1|еИ011у<С^~_||ио||у, £ = пт>0. A0.10) As for the details of the proof, which is similar to that of Theorem 8.4 and is reduced to estimation of the corresponding Dunford integral, we refer to Fujita and Mizutani [1978]. Remark 10.1. Various works have been done to analyze approximations with higher accuracy. In this connection see Raviart and Thomas [1983] and its bibliography. See also Section 17.
Chapter III Evolution Equations and Error Analysis by Real Methods In the case of temporally inhomogeneous parabolic equations, the complex method developed in the preceding chapter does not work so well by itself. Here, we adopt first the method of Helfrich and then that of energy, in order to extend the error estimates obtained in the preceding chapter to the temporally inhomogeneous case. Also, we mention the discretization of hyperbolic equations. 11. Generation theory on evolution equations In the present chapter, we study temporally inhomogeneous parabolic equations; that is, QcU2 is as before, and л/ — $4(t, x, D) denotes a second-order elliptic differential operator with time-dependent real smooth coefficients: 2 А л 2 д s*=s/{t,x,D)=- t ^-аиA,х)-—+£Ь#,х)— + сA,х). A1.1) j,i=i cxi Gxj j=i oxj Uniform ellipticity £ atjiUxK&ZdM2, Z=(ZltZ2)eR2, A1.2) is assumed, dl>0 being a constant. We consider the parabolic equation Qu/dt + s/(t,x,D)u=0, 0<t^T, xeQ, A1.3) with the boundary condition either и|ея = 0, 0<t^T, (Ц.4) or 8ц/буд, + ам|8й=0, 0<t^T, A1.4') and with the initial condition u\t=0 = u0(x), xeQ. (П.5) 831
832 Я. Fujita and T. Suzuki Chapter III In A1.4'), a = a{t, £) is a smooth function on [0, T] x dQ, and 5/6^ denotes the differentiation along the outer conormal vector vy. 8/64,= E n;ay(t, х)(8/ЭхД A1.6) i,J=l where n — in^, n2) is the outer unit normal on dQ, Assuming u0eX = L2{Q), we can reduce the equation A1.3) with A1.4) (or A1.4')) and A1.5) to the evolution equation du/dt + A{t)u = 0, 0<£<Г, A1.7) with ы@) = ы0 A1.8) in X = LZ(Q). Namely, let V — Hq{Q) or Я'(£2) according to the boundary condition A1.4) or A1.4') and put at{u,v)= £ au{t,x)^~u-^~vdx+ £ bj(t,x)—u-vdx ■ J=l \; о n + c(r, х)м • и dx + tr(t, x)u • с dS A1.9) 12 Эй for u,ve V, where dS denotes the arc element of dQ. An m-sectorial operator A{t) in X can be defined through the relation a,(u,v)=(A(t)u,v), ueD(A{t))cV, veV A1.10) as before. As is described in Chapter I, the relation D{A(t)) = Hl0{Q)^H2{Q) A1.11) holds for the case V=Hq(Q) and D(A{t)) = {v0eH2(Qy.dv/dv^ + ov\dSi=0} A1.11') for the case К=Я1(Й). The generation theory of evolution operators {U(t,s)}T>t>s>0 in X due to Sobolevskii, Kato, Tanabe and others assures us of the unique solvability of A1.7). Namely, a unique solution u = u(t) e Cl((s, T]^X)nC°({s, T-]^D{A))nC°{[s, T]-+X) of du/dt + A{t)u = 0, s<t^T, A1.7) with ф) = и0еХ A1.8')
Section 11 Evolution equations 833 exists and is given by u(t)=U(t,s)u0. A1.12) It is worthwhile to give a short summary of these theories here. In fact, our error analysis on the finite element approximation of A1.7) is based on a certain stability of approximate solutions as well as on a certain smoothness of original ones, both of which can be established by re-examining these theories in our contexts. First we note that the crucial assumptions of the generation theory are that A) each —A(i) generates a holomorphic semigroup with certain estimates uniform in 1e [0, T], B) A(t) is smooth in t e [0, Г] in some sense or other. Then, the evolution operator {U(t, s)} is constructed by Levi's method. Actually, condition A) enables us to give a first approximation of {U(t, s)}, while condition B) makes an iteration scheme converge. As for condition A), we can verify it in the same way as in the preceding chapter. In fact, we have the boundedness of the sesquilinear form at(,): \a,(u,v)\^C\\u\\\\vl u,veV, A1.13) as well as its coerciveness Re at(u,v)^d\\u\\$-A\\u\\Jt, ueV, A1.14) with constants S and AeIR by A1.2). As in the preceding chapter, we may suppose A=0 in A1.14'), just by taking v — t~x,u(t) instead of u = u(t), namely Re а,(и,и)^5\\и\\$, ueV. A1.14') Then, the relation Ge = {ze C: rc>|argz\ >в} cp(A(t)) A1.15) holds for some в, 0<в<^к, as well as the estimate ||(Л-ЛЮГ1КС,/|А|, AeG9+E, A1.16) for each £ > 0. These relations are uniform in r e [0, Г]. Therefore, each — A(t) generates a holomorphic semigroup {e~sA{,)}s>0 uniformly bounded in t e [0, Г]. Actually, we have ||e"^wKl, 00<oo, O^t^T. A1.17) The requirement of condition B), on the other hand, depends on the generation theory. 11.1. Generation theory ofTanabe-Sobolevskii (Tanabe [1960], Sobolevskii [1961a]) In the case of V= Hq(£2), the relation A1.11) holds. Hence D(A(t)) is independent of t. Furthermore, from an integration by parts and the elliptic estimate by Agmon, Douglis and Nirenberg [1959], the inequality \\A{t)A{sr'-\\\^C\t-s\*, t,se[_0,Tl A1.18)
834 H. Fujita and T. Suzuki Chapter III follows with an a in 0<as$ 1. In this case, the evolution operators {U{t, s)}T>t>s>0 are constructed by L/(t,s) = e-(,~sM(s)+ e"('-rM(r)i?(r,s)dr, A1.19) s where R = R(t, s) is the unique solution of the integral equation t R{t,s)- \Rl{t,r)R(r,s)dr = R1(t,s) A1.20) s of Volterra type with K1(£,s)= -{A(t)-A(s))e-(t-siAls). A1.21) Furthermore, for the evolution operators constructed in this way, the estimates \\A{t)U(t,s)\\,\\U(t,s)A(s)\\^C(t-sy1, 0^s<t<T, A1.22) \\U(t,s)U\A(t)U(t,s)A(sr1\\^C, O^s^t^T, A1.23) \\A(t)lU{t,s)~U{r,s)']A(sy1\\^Ce(t-r)e{r~s)'e, 0^s<r<t^T, A1.24) can be derived, where 0 < 9 < a. 11.2. Generation theory of Fujie-Tanabe (Fujie and Tanabe [1973]) The m-sectorial operator A{t) in X defined through the equality A1.10) may be regarded as that in V*, which is denoted by A{f). Then, the domain of A{t) is independent of t, that is, D(A(t))—V. The relation A1.15) and the estimate A1.16) hold for A(t) in X= V*. The coefficients a^, bj, с and с are smooth so that the inequality к(«,»)-а,(«,»)|<С|Г-5|в||и||к||»|1к, u,veV, A1.25) holds with 0<ocsSl, which implies A1.18) for A{t) in X = V*. Therefore, by the preceding theory, A{t) generates an evolution operator {U(t, s)}T>t>s>0 in X = V*. It is shown that in the case of a > \, U{t,s)=U(t,s)\x A1.26) is a bounded operator in X and becomes the desired evolution operator generated by — A(t). Furthermore, this {U(t, s)} satisfies A1.22) and the first inequality of A1.23) in X. 11.3. Generation theory of Kato-Sobolevskii (Kato [1961b], Sobolevskii [1961b]) From the theory of fractional powers of m-sectorial operators due to Kato [1961a], the domain of A(t)" is independent of t for O^p<j. Furthermore, it is shown that
Section 11 Evolution equations 835 A1.25) implies M(C)M(s)-"-lKC|t-sr, t,se[0,rj. A1.27) From these facts, the evolution operators {U(t, s)}T>t>s>0 can be constructed in another way, provided that a + p> 1, namely, under the assumption a>i again. For simplicity, we consider the case p = l/m according to Kato [1961b], where m is an integer. By taking an appropriate approximation of A(t), say, the modified Yosida approximation: Ax(t)=A{t){l + XA(t)'')~m, where 1|0, we can reduce the theory to the case where each A(t) is bounded in X. In fact, we can show that A1.27) implies \\Ax(tyAx(s)-»-l\\^C\t-s\°, Me [0,7], A1.27) C>0 being independent of i>0. The existence of the evolution operator {Vk(t, s)}T>t>s>0 for the approximate operator Ax(t) is obvious from the bounded- ness of Ak{t). All we have to do is to derive some a priori estimates on Ax, and make them to pass to the limit: U(t,s) = s-limUx(t,s). A|0 Then, {U(t,s)} become the desired evolution operators. For instance, the estimate М(*УЕ/(М)|!<С,(*-яГ', 0<s<t<T, A1.28) follows from M^t/t/^sJIKC^t-s)"', A1.28') where 0^f3<a + p. We shall give an outline of the proof of A1.28'). Henceforth, we drop the suffix X for simplicity of writing. First, setting D{t,s) = A{tfA{sYp-h A1.29) we have ||D(t,s)KC|t-sf, A1.30) which is nothing but A1.27'). Furthermore, Sobolevskii's identity m A{t)-A(s)= Y A{tI-ppD(t,s)A(sI"' A1.31) p=i holds by p = l/m. Therefore, we have r " Э U(t,s)-e-(,",MW = [e-(«-'MW[/(r 5)]dr or = E p=l J e-<«-'M««U(t)' ~>">D(t, r)A(r)pp U(r, s) dr. A1.32)
836 H. Fujita and T. Suzuki Chapter III We introduce a few notations. For operator-valued functions K, = Kt(t, s), I = 1,2, on D = {(t,s): 7>f^s>0}, we define another K — K1*K2 by (K1*K2)(t,s) = K,(t,r)K2{r,s)ur. A1.33) Furthermore, for a > 0 and M > 0 we say that К е Q(a, M) if the inequality HK&sJKMtt-s)"-1 (И-34) holds. Then, K, e 6(a„ M,), 1= 1,2, implies that K^K^ s (?(% + a2, B(ax, a2)M1M2), where B(a, b) denotes the beta function: B(a,b) = (l-xf-'x^'dx. A1.35) Now, we put W{t,s) = U{t,s)~Q~«~s)m, Yq(t,s) = A(tr»W{t,s). Then, the equality A1.32) reads p=i where — 4f^l'"l'(, + №A-(|-sWt) Hq,p(t,s) = A(t) m ДМ), p=l with yPi_1(t,s) = 4@w'e-(,-,MW. For a technical reason, we take and transform A1.38) into a system of integral equations for Zq, q = 1,..., m: p=l A1.36) A1.37) A1.38) A1.39) A1.40) A1.41) A1.42) A1.43) where ^«.0 — 2, Д),р*^Р,0- A1.44)
Section 11 Evolution equations 837 Consequently, Z, q = 1,..., m, can be generated by an iteration scheme as Z9=ZZtJ A1.45) with « i = 0 Zq,i+i=YtH9tP*Zp,l, /=0,1,.... A1.46) p=l From the definition A1.39), we have H^eQia-qp+pp^,) A1.47) with a constant M, >0, because \\A(r)Ke-sA(r) || <cKs"K, 0<s<oo A1.48) holds for к>0 by A1.16). Furthermore, we can show that Zq<0eQ(l+a-qp,M0) A1.49) for some M0 > 0. Then, we get Z4iieQ(l+(i+l)a-«p,M() A1.50) with Mi+1/Mi=mM0M1B(a + p — l,(i+l)a) by induction. Thus Z,e 6A+а-9ЛС) A1.51) follows from A1.45) with a constant C>0. We can deduce from A1.51) an estimate on Yq with not necessarily integral q, provided that a — qp + p>0. In fact, A1.42) makes sense for nonintegral q > 0, while then Z,f0 is again given by A1.44) and Zq is to be defined by A1.43) (in which p takes integral values as before). Since Zq in A1.43) has been estimated by A1.51), A1.42) gives an estimate of Yq, because Yq,0eQ(l-qp,C) A1.52) can be shown. In this way, we obtain an estimate Yq e Q(l —qp, Cj, provided that a — qp + p>0. Writing qp = P, we thus arrive at the estimate A1.28') in view of A1.48). 11.4. Generation theory of Kato-Tanabe (Kato and Tanabe [1962]) Let us define another sesquilinear form a,(,) on К by 2 Л Л r\ Л 2 Л ft ft *(B,r)eu?i J a^W* Д J 5ibjit'x)^rdx о о , 9 + J —c(t,x)wvdx + -a(U&u-vdS. (П.53) at 8Я
838 H. Fujita and T. Suzuki Then, we have \a,{u, v)\^C\\u\\v\\v\\r, u,veV. \at{u,v)-as(u,v)\^C\t-s\"\\u\\r\\v\\r, u,v e V, lim sup '~*s u.veV j—{at-as\u,v)-as(u,v) = 0 Chapter III A1.54) A1.55) A1.56) for some a in 0 < a. ^ 1. From these relations, we can show that A(t) 1 is strongly C1 in t and that the inequalities i««-'-£*»-' ^C\t-s\', f,se[0, Г], dt (Я-Л(г))-1 <C,/|A|, AeGe+£, A1.57) A1.58) hold, where e>0. (See Kato and Tanabe [1962] or Suzuki [1982].) In this case, the evolution operators {U(t, s)}T9t>s^0 may be given by means of V(t, s) = e - ('~S)A{<) + e ~('"rM<() R(r, s) dr, s where R = R(t, s) is the unique solution of the integral equation t R{t,s)- R1{t,r)R(r,s)dr = Rl{t,s), where Ki(M) = 27U/-1 -(l-s)A_ at (A-^(t))_1dA, A1.59) A1.60) A1.61) Г being the positively oriented boundary of Ze+e for some e>0. Furthermore, the estimates A1.22) and the first inequality of A1.23) are also derived by this scheme. This theory of generation is particularly remarkable in the sense that any assumptions on the domains of A(t) are not made. Consequently, it is no wonder that a little stronger assumption on the smoothness in t of A(t) is imposed. In each generation theory stated so far, the construction yields estimates on the smoothness of evolution operators at the same time. For further details on these theories, see also Tanabe [1979]. 12. Semidiscrete approximation of temporally inhomogeneous parabolic equations As we have seen in the preceding section, the parabolic equation A1.3) with A1.4) (or A1.4')) and A1.5) is reduced to the evolution equation A1.7) with A1.8). In the same
Section 12 Evolution equations 839 way as in the preceding chapter, this equation is discretized with respect to the space variables x = (x1, x2). Namely, let us triangulate Q into small elements with the size parameter h > 0 and let Vh с V be the space of piecewise linear trial functions. Xh means Vh equipped with the L2 topology. The m-sectorial operator in Xh associated with a,\VhxVh is denoted by Ah(t). Finally, Ph: X-+Xh is the orthogonal projection. Then, the semidiscrete finite element approximation of A1.7) with A1.8) is given by duh/dt + Ah(t)uh=0, O^t^T, A2.1) with uh@)~Phuo A2.2) in Xh. According to the generation theory described in the preceding section, — Ah(t) generates the evolution operator {Uh(t, s)}T>!^s?s0. Furthermore, the inequalities \\Ah(t)Uh(t,s)\\,\\Uh(t,s)AM\^C(t-sr1, 0^s<t^T, A2.3) \\Uh(t,s)HC, O^s^t^T, A2.4) hold uniformly in h. In the present section, putting eh(t) = uh(t)-u(t\ A2.5) we shall show the estimate \\eh(t)\\x^C(h2/t)\\u0\\x, 0<t^T, A2.6) and extend the similar result in Section 7. To this end, we employ the method of Helfrich [1974,1975]. First, we introduced the error operator Eh=Eh(t,s) by Eh(t,s)=Uh(t,s)Ph-U(t,s). A2.7) Obviously, eH(t) = Eh(t,0)u0. A2.8) Thus we have only to derive \\Eh(t,s)\\^Ch2/(t-s), 0^s<t^T. A2.9) From the equality -- [Uh(t, r)PhEh(r, s)] = Uh(t, r)[Ah(r)Ph -P„A(r)]U{r, s) follows the identity P„Ek(t,s)-- Uh(t, r)\_Ah{r)Ph-PhA(r)-]U{r, s) dr. A2.10) Now, we introduce the Ritz operator Rh(t): V^>Vh through the relation a,(Rhmx) = at(v,X), veV, XeVh. A2.11)
840 H. Fujita and T. Suzuki Chapter HI Then, Ah(t)Rh(t)v = P„A(t)v, as is shown in Section 2. Therefore, the equality Eh(t, s) = (l ~Ph)Eh(t, s) + PhEh(t, s) = El{t, s) + E2h{t, s) + E3h(t, s) A2.12) follows with Ei(t, s) = (l - Uh(t, s)Ph)(Rh(t)~l)U(t, 5), A2.13) Ei(t,s) = U h{t, r)Ah(r)lR„(r)-Rh(t)]U(t, s) dr, A2.14) E3h(t,s) = Uh(t, r)Ah(r)Ph(Rh(r)~l)[U(r, s)- U(t, s)] dr. A2.15) It suffices to show \\E'h(t, s)\\^Ch2/{t-s) for 1= 1,2 and 3. A) Estimation ofEj;(t,s). In Section 2, we have shown the estimates \\(R„(t)-l)v\\y<Ch\\v\\Hi, veVnH2(Q), (Ш6) ||(ВД-1И*<СЛ21М|я1, t;eFnH2(f2). A2.17) Therefore, from the elliptic estimate we obtain \\El(t, s)|| ^A + ||[/fc(£,s)|| • ||P„||)- ||(ВД- \)A(t)-11 • M(t)I/(t,s)|| <Cfc2/(t-s). A2.18) B) Estimation ofE2(t,s). We shall show the inequality \\{Rh(t)-Rh(sM\x^Ch2\t-s\"\\v\\H2, veVnH2(Q), A2.19) according to Suzuki [1979b, 1982], from which we get \\E2h(t,s)\\ ^ | Uh(t, r)Ah(r)\\ • \\(Rh(r)-Rh(t))A(t)-' i| ■ \\A(t)U(t, s)\\ dr <C \(t~Syl+eh2 dr(t~s)~l = Ch2(t-s)~l+e ^Ch2/{t-s). A2.20) To this end, we introduce the adjoint sesquilinear form a,*(,) by af{u, v) = at(v, u), u,v e V, and denote by Rh(t) the Ritz operator associated with af(,): a?(&„{t)v,x) = a?{v,x}, veV, XzVh. A2.21)
Section 12 Evolution equations 841 The inequalities \\(Rh(t)-l)v\\v^Ch\\v\\H2 veVnH2(Q), A2.22) \\(Rh(t)-l)v\\x^Ch2\\v\\H2 veVnH2(Q) A2.23) hold as in A2.16) and A2.17), respectively. Now, setting z = (Rh{t)-Rh{s))veVh, A2.24) we obtain || z || | = at(z, A(t)*' lz) = at(z, R„(t)A(t)* ~xz) = at((l~Rh(s))v,Rk(t)A(t)*-1z) = (at-aMl-Rk(s))v, ШАA)*-^) = (a,-as)((l -R*(s))v, (Rh(t)- 1)А{1)*~^) + (at-as)((l-Rb(s))v,A(t)*-1z) =(a, - as)((l - ЫФ, (R„(s) - l)A(t)*~ h) + as((l-Ru(s))v,(A(s)*-1-A(t)*-1)z) = (a,-as)((l -ВДК (R„(s)- l)A(t)*-lz) + as((l-Rh(s))v,(l-RH(s))(A(s)*-l-A(t)*-1)z) ^Qt-srUl-RbisMvMAM-lMtF^zWy + C\\(l-Rk(s)v\\y\\(l-Rh(s))(A(s)*'1-A(t)*-l)z\\v ^CU-s^M^WAitr-hWHi + Ch1\\v\\m-\\{A{s)*-1-A(t)*-i)z\\Hi. Now, from the elliptic estimate follow МЮ'-^Ин^СИ*, UAisr^-Aitr-'M^Qt-sl'tizh, and the desired inequality A2.19) has been established. C) Estimation of El{t, s), Case 1. In the case of V=Hl(Q), a duality argument due to Helfrich is effective (Fujita and Suzuki [1979]). Namely, in this case the estimate A1.24) holds so that for u0eD(A{s))=D we have t ||£ft3(M)u0K \\Uh(t,r)Ah(r)\\' IK^W-IMW1!! s •\\A{t)[y{t,s)-U(r,s)-]A{sTx\\ • \\A(s)u0\\ ds t ^Ce (t-ryih^t-rfir-syOdrWAisM о = Ch2\\A(s)u0\\.
842 H. Fujita and T. Suzuki Chapter HI Similarly, from the second estimate of A1.23) we can show that }\Elh(t,s)u0\\^Ch2\\A(s)u0\\ for /=1,2. Hence \\Eh(t,s)A(s)-1\\^Ch2. A2.25) Now, by virtue of the semigroup property of evolution operators, which means Vh(t, г)Щг, s) = U„(t, s), U(t, r)U(r, s) = U(t, s), where 0 ^ s ^ r ^ t ^ Г, the identity E„(t, s)= Uh(t, s0)PhEb{s0,s) + Eb(t, s0)U(s0, s) A2.26) holds with s0-j{t + s). The second term of the right-hand side of A2.26) is estimated as \\Ek(t,s0)U(s0,s)\\ ^\\Eh(t,s0)A(s0r1\\-\\A(s0)U(s0,S)\\ ^Ch2/(t-s) A2.27) by A2.25). On the other hand, by A2.11) we have \\Uh(t,s)PhEh(s0,s)\\ = || Uh(t, so)A„(s0)Rk(s0)A{sQr1Eh(s0, s) || < || Uh(t, s0)Ah(So)Ph(Rh(s0)~l)A(s0)-1Eh{s0, s)\\ +1| Uh(t,s0)Ah(s0)PhA(s0)-1Eh(s0, s)\\ <C(t-s)-1{||(K*(So)-lM(s0)-1||-||£*(so,s)ll + M(so)-1£A(s0>s)||}. A2.28) Since HE^ScsJIKC by A2.4), the desired estimate A2.9) is reduced to Мф-'ЕЛМЖСЙ2. A2.29) We set L?(t,s)=U(T-s, T-t)*, C„(t,s)=U„(T-s, T-t)*. Then, {U(t,5)}r>r>sso and {Uh(t, s)}T>t>s>0 are nothing but the evolution operators generated by A(t) = A(T-t)* and Ah(t) = Ah{T-t)* in X and Xh respectively. Furthermore, the relation Eh(t,s)sV„(t,s)P„-U(t,s) = Eh(T-s, T-t)* holds. Therefore, in the same way as in A2.25), the inequality \\Eh(t,s)A{sr1\\^Ch2 A2.30) is obtained so that
Section 12 Evolution equations || A(t)" *£»(*, s)|| = ||E„(t, s)*A(s)* -x || ^ Ch2. 843 D) Estimation of Ei(t,s), Case 2. In the case of V=H\Q), D(A(t)) varies as t changes, and we cannot adopt the estimate or \\A(t)[U(t,s)-U(r,sJA(sr1\\^Ce(t~rf(r~sre, 0^s<r<t<T. The following argument is due to Suzuki [1979b, 1982]. Namely, by a telescoping, we have 1 = 1 A2.31) with Fl(t,s)-- (r-s)Uh(t, r)Ah(ryPh(Rh(r)- l)[L/(r, s)-t/(t,s)] dr, A2.32) F,2(f, 5)= (t-r)t/»(t, г)Л(г)-(Л»(г)-Ял(8))[С/(г, s)- I/(t, s)] dr, A2.33) Fl(t,s) = (t-r)[Uk(t,r)Ah(r)-U„(t,s)Ah(s)yPh(Rh(s)-l)[U(r,s)~U(t,s)] dr, A2.34) Ft(t, s) = - tf»(t, sM*(s) • Л(ВД -1) (r-s)[[/(r,s)-l/(t,s)]dr A2.35) FA5(t, s) = t/„(t, s)A4(s) • P„(Rh(s) - l)(t - s) [U(r,s)-U(t,sJ]dr. A2.36) We have only to derive \\Flh(t, s)\\ «SCh2 for /=1,2,..., 5. By virtue of the construction of the evolution operators [Uh{t, s)} and {U(t, s)} of Kato and Tanabe [1962] described in the preceding section and by the elliptic estimate (Agmon, Douglis and Nirenberg [1959]), we can show that \\U(t,s)-U(r,s)h^Hi ^Cxiit-rMr-sy'-^it-rftr-s)-1}, OsSs<r<f<T, A2.37)
844 H. Fujita and T. Suzuki Chapter III \\Uh(t,r)Ah(r)-Uh(t,s)Ah{$)\\ <C«{(t-r)-1-'(r-5)' + (t-r)-1(»,-s),c}, 0^s<r<t^T, A2.38) [t/(t,s)-l/(r,s)]dr ^C A2.39) for O^^^l and 0^к<а (see Suzuki [1982]). From these inequalities, we can derive the following estimates: \Fl(t,S)\\^Ur~s)\\Uh(t,r)Ah(r)\\ S '\\(Rh(r)~mH2^-\\U(r,s)-U(t,s)\\L^Hidr t <CKh2 (r-sKf-r) •{(t-r/(r-s)-^1 + (f-r)K(r-s)-1}dr ^Ch2, 0<p<l. F2(t,s)\\^\(t-r)\\Uh(t,r)Ah{r)\\ \\{Rh(r)~Rh(S)\\H^Li-\\U(r,s)-U(t,S)\\L^Hidr ^CKh2 (r~s){(t-r)"(r-s)-^1+(£-r)K(r--s)-}dr ^Ch2, 0<p<l. A2.40) A2.41) :\F3h(t,s)\\^ (t-r)\\Uh(t,r)Ah(r)-Uh(t,s)Ah(s)\\ • \\Rh(s)~1 II^l2' II U(t,s)-V{r, s)\\L^„>dr ^CKh2 f(r_r){(t-r)-1-''(r-s)' + (t-r)-1(r-s),c} £ • {(t-ry{r~sy1-> + (t-rT(r-s)-1} dr A2.42)
Section 13 Evolution equations \\Ft(t, 5)|| ^ || Uh(t, S)AH(S)\\ • ||**(S)- 1 ||H^ \U(t,3)-U(r,s)\\L^H2-(r-s)dr {(t--r)'i(r~srll-i + (t-r)K(r-sr1}(r-s)dr ^Ch2, 0</?<l. \F5h(t, s)\\ ^ || Uh(t, s)Ah(s)\\ ■ \\Rh(s)-11|„^ [U(r,s)-U(t,s)]dr L*~H* ^Ch2. Summing up these estimates, we obtain ||£*(t, s)|| ^Ch2/(t—s). 845 A2.43) A2.44) 13. Fully discrete approximation of temporally inhomogeneous parabolic equations We obtain fully discrete approximations by discretizing the semidiscrete equation duh/dt + Ah(t)uh = 0, O^t^T A3.1) with uh@) = Phuo, A3.2) furthermore with respect to the time variable t. In the present section, we adopt the backward difference method with the mesh length т>0, T=Nt, that is, A3.3) (ulh(t + т) - ul(t))/t + Ah(t + x)u#r + т) = О t = nz, n = 0,l,...,N in Xh with ul@)=Phuo. A3.4) Under the assumption A1.14'), the scheme is uniquely solvable and el(t) = uzh(t)-uh(t), t = m, A3.5) denotes the error. According to Suzuki [1979b, 1982], we shall derive lkS(t)llx<c(T/t)Kb. A3-6)
846 H. Fujita and T, Suzuki Chapter III and extend a similar estimate in Section 8. Combining A3.6) with A2.6), we obtain \\u№-u(t)\\x<C{(h2 + T)/t)\\u0\\x, t = m. A3.7) Henceforth, we set tn = nx and Ul(tn,tj) = i n>j, A3.8) (l. n=j. Then we have ul(t) = Ul(t,0)Phuo, t = tn. A3.9) On the other hand, we have uh(t) = Uh(t,0)Phuo. A3.10) Henceforth, we drop the suffix h for simplicity of writing. Then, by virtue of A3.1) and A3.3), we get for t = t„ that e\t + x)-e\t) t+t = I iA{r)u{r)-A{t + x)u\t + x)']ur X \_A(r)u{r) - A(t + x)u(t + t)] dr - xA{t + x)e\t + x). Hence ex(t + x) = (l+xA{t + x)y1e\t) + {l+xA{t + x)) 1 [A(r)u(r)-A(t + x)u{t + xJ]dr. t On the other hand, we have er@) = 0 so that e\tn) = E\tn)Pu0 = -£ \ (l+xAiQr'il+xA^-,))-1- A + хАAк))'" ie=l J Ifc-i • lA(tk)U{tk, 0)-A(r)U(r, 0)] At Pu0, A3.11) E\t) being the error operator, E\t) = U% 0) - U(t, 0)P. From the identity A3.11), we see that the error estimate A3.6) can be reduced to the inequalities on the stability of
Section 13 Evolution equations 847 the approximated solution and on the smoothness of the original one, that is, those on t/T(t„,tj+1) and on A(t)U(t, s)—A(r)U(r, s), respectively. Actually, we can show the following: Lemma 13.1. For each 0^j?<f, the inequality Wu^t^tj^n^c^-tj)-', n-j>p, holds true. Lemma 13.2. For each Д, 0</?<^, the equality A(t)U(t,s)-A(r)U(r,s) = A (t)[e - <' ~sM(,) - e - <'- s)^(,)] + A(t) % (t, r, s) holds for 0^s<r<t^T with Zfi(t, r, s) subject to the estimate \\Zfi(t,r,s)\\^C^t-r)(r-s)'-\ A3.12) A3.13) A3.14) For the moment, let us admit these lemmas. Then, the operator Ez in the right- hand side of A3.11) splits as E\t„)= £ (\+TA{tn))-l-{\+xA{tkT'A(tk) t=i . tk-i , re~tkA(tk)_e-rA{tk)-\ Jj. + 1 (И-тДа^-О + T^))-U(tt)%(tk,r,0)dr = {\+xA{tn)ylA{t„) |-g -1„A(l„) _ g - тA(t„ft fr + £ C/I(t.,tt-iMfe)[e-^(tk)-e-M"fc,]dr + £ [7ta„,t,_1Mfc)%fe,r,0)dr. k = l A3.15) Let Г be the positively oriented boundary of Z9l with 0j >#. Noting that relation A1.15) and estimate A1.16) hold for A(t)=Ah (t) uniformly in h, we obtain for к > — 1
H. Fujita and T. Suzuki Chapter HI that И(г)к[е-,л<г,-е~5Л(г)] 1 2ra т A,ce-,i[e-('"»)A-l](A-i4(r))-1 dr <C LKe-s"cosei(r-s)/z — о = CK(t-s)s~H-\ 0<s«St<oo. A3.16) Then, supposing n > 2 we can estimate the first term of the right-hand side of A3.15) as ||A+тЛ(ОГМ(У II e ~t„A{t„) _s~rA(_t„) || ^r sSCt (^-г^-Чг^Ст-^-!)-^^ On the other hand, by Lemma 13.2, the third term of the right-hand side of A3.15) is estimated as k = l £ J \\иХ1п,1к^)АAкП-\\г^к,г,0)\\^ 'k =S C/i I (n-fc + irWflkty-1 = С„т X (ii-fc+l)-^'-1. *=i We here recall the elementary inequality ^B(fl,b)= (l-xJ'-V-1^ о A3.17)
Section 13 Evolution equations 849 for 0<a^b and a^l. In fact, f(x)=(l — x)" 1xb 1 is monotonically increasing in [0,1) when b^l^a, while f(x) is convex in @,1) when a,b^ 1. Then, we have tk I f W{tn,tk-x)A(tJ\\-\\Z^tk,rM\^^CT. In this way, the desired estimate ||Е*(г, 0)|| ^C/n, t=t„, is reduced to an inequality concerning the second term of the right-hand side of A3.15), that is, "l [ u\t„, tt.x)A{h)\?-tkMtk)-z-rMtk4 & <c. In fact, taking ft in 0</?<|, we have A3.18) "I k=i U\t„, г,_1)Д£к)[е-'^)-е-^)] dr n-l = ^(n-k+l)U\t„,tk^)A(tkI+l! k~l »k 4(f J-^[e-"-4<(k)-e-M<,k)] dr + Y(k-l)U\t„,tk-1)A(tkI-p tk-i n-l lt=2 Гк ^(ft)^Ce_'^('k,-e-^(rk)]dr. A3.19) The first term of the right-hand side of A3.19) is estimated by Lemma 13.1 and A3.16) as £ (n-k + l)\\W(tn,tk^)A(hI+4- Mfor'Ce-^^-e-'^Hdr (tt-r)"-1*- S*C, X (n-fe+l)-^-1-"-!2^-1 *=1
850 Я. Fujita and T. Suzuki Chapter III = C„ £ (n-k+iyW'^C. *=1 Similarly, the second term of the right-hand side of A3.19) is estimated as "Zik-VWit^b-Mihf-'W ■ f ||^)'[e-^^-e-'^|| dr k = 2 J tk-i ^^№-1№л-!к-1Г1+'' «к я- I ^C„ X №-l)(»-i+ir1+'it+''t2-((t-l)t)-'-1 k = 2 n-1 k = 2 In this way, A3.18) has been established. In the remainder of the present section, we shall give outlines of the proof of Lemmas 13.1 and 13.2. Stability of the approximated solution {Proof of Lemma 13.1) First, we note that the inequality A3.12) is reduced to WAWV^t^Cfa-tj)-', (K/?<f, A3.20) by considering its adjoint form. Now, we recall that the generation theorem of Kato [1961b] described in Section 11 has yielded the continuous version of A3.20), that is, H(j)^(M)KC,(j-s)-». A3.20') All we have to do is to trace his computations in the contexts of the discrete version. In fact, we note that the inequality A1.27) holds for A(i) = Ah{t) uniformly in h by the theory of Kato [1961b]. Sobolevskii's identity A1.31) holds for A(t) = Ah(t) so that we obtain U\tK,tj)-(l + TA{tj)rO-» = "X [A +TA(te))-b-k-»V\tk+u 1})-{\ + хА{гп)Г*-*Щ1к,t,)] k = J ^(l + xA(tn))-^k\{l+TA{t„))-(l+TA(tk + i))W(h,i,tj) = t"YJ{l+TA(tn)r^-%A(t„)-A(tk+1)-]U\tk + i,tj)
Section 13 Evolution equations 851 = 1*A+^а"<""*,л(*.I"и' P=l k = j •D(tn,tk+1)A{tk+lY"U\tk+utj). A3.21) Let us introduce the following notations: For operator-valued functions Kt = K,{t„,tj),l=\,2, on Dz = {{tn,tj): N>n>j>0}, another function K = Xx *'K2 is defined by n-2 Furthermore, we set W\tn, tj)=U\tn, tj)-(l+xA(tm))-^-J\ A3.22) Yl(ra,tj) = A{tarW4t.,tj). A3.23) Then, the equality A3.21) reads: m Y^ZH^SYl+Yio, A3.24) p=i where H'iP(tn,tJ.) = A(t„I-p"+ni+T^„))"(',"y+1,i>(tn,ti), A3.25) m г^Ея;,,^;,-! A3.26) with Yl^(t„, ts) = A(t.r(l+zA{tn))-^\ A3.27) Note the elementary inequality A3.17). Then, the desired inequality A3.20) can be derived in a similar way to that of Kato [1961b] (see Suzuki [1982], for details). Smoothness of the original solution (Proof of Lemma 13.2) We restate the construction of the evolution operator {V(t, s)}T>t>s>0 by Kato and Tanabe [1962] described in Section 11, that is, U{t,s) = e-i'-s)A{t) + W(t,s) A3.28) with W(t,s) = e'i!-r)AA)R(r,s)dr, A3.29) where R = R(t, s) is the unique solution of
852 H. Fujita and T. Suzuki Chapter III R{t, s) = Ri{t,s) + Rj (t, r)R(r, s) ds for 2тп 3f From these relations, we have A(t)V(t,s)-A(r)U(r,s) = A@[e"(!"sM(r)-e"(r_sM(I)] + ^(t)^A(tI ~'e - lr's)m - AirI ~'e ~fr -*,il(r)] + ^O'C^tI -'W(t, s)- А(гУ -fW(r, s)] + A(t)'(l-i4(f)"'i4(r)')i4(rI-/lWr(r,s), so that 4 Zp{t,r,s)= £ Z'piU^s), 1=1 where Zj(t,r,s) = A@1-^rfr-lMW-A(rI-/,e-lr_lM(r), Z|(t,r,s) = (l-A(t)"'i4(r)'M(',I"'e-(r-*)i*<,), Z|(f, r, s) = 4(fI -'W(*, s)-^I ""Щг, s), Z£(t, r, ^(l-^)"'^)')^I -'W(r, s). By the Dunford integral, we have 1 Z}(t,r,s)= 2m ^-»e-i'-s)\{X-A{t))-l-{k~A(s))-l]dX, while \\{X-A(t)yl -{X- A(s))-l\\^Ct\t-s\l\k\, XeZe+e follows from A1.58). Therefore, we obtain 00 dfj, ||Zj(r,r,s)KC = Cf(t-r)(r-s)>-\ Bt>e. A3.30) A3.31) A3.32) A3.33) A3.34) A3.35) A3.36) A3.37) A3.38) A3.39)
Section 13 Evolution equations Next, Hi-^r^yiKC^-si, o«s/M follows from A1.27), and consequently \\ZJ(t, r, 5)|| < || l-AW-'AW'W • IM(rI -/»e~fr-*«"|| «^(f-rXr-s)'-1. The estimate WMI<C has been shown in Kato and Tanabe [1962] so that 853 \\A(t)l->W(t,s) A(tI~l!e-<-t-r)A(,)R(r,s)dr A3.40) A3.41) A3.42) (t-r)'-ldr = C,(t-s)>. Therefore, ||Zj(t,r,s)KC,(*-r)(/-s)'. Thus, the proof of Lemma 13.2 has been reduced to ||Zf (i, r, s)|| = M(tI -'Wit, s)-A(ry ~>W(r, s)\\ ^Ct{t-r)(T-s)>-\ 0<p<l We now recall A3.29) and obtain A(tY -' W(t, s) - AirI ->W(r, s) = £ Z\{t, r, s), 1=5 where Z|(r,r,s) = Л(гI -^е-(г-г)Л<,)[К(г, я)-Л(г, 5)] dz, A3.43) A3.44) A3.45) A3.46) A3.47) Zf(t,r,s) = [.4(fI -*е-<'-*^«- Л(гI -*e-<r-*MW] •[i?(z,5)-K(r,5)]d2, A3.
854 H. Fujita and T. Suzuki Chapter III t + [Л(tI-^-('-г)Л<r)-Л(rI-'Je-(r-г)Л(r,] dz-K(r,s). A3.49) s The estimate || R{t,s)-R(r,s)\\^Cy{t~-rY(r-s)-\ 0^s<r<t^T, A3.50) is shown for 0<y<l by Kato and Tanabe [1962]. Therefore, we get ||Z|(t,r,s)||< j^@1^e-('-^(f»||-||^(z,s)-i?(t,s)||dz Г г ^Cp<y \{t-Zy-\t-z)\z-s)-i&z t ^C„,y \(t-zf + y+i(z~ryl) + 1+ydz ^C^f-rKr-s)'-1 A3.51) by taking у in 1—/?<у<1. Next, from A3.16) and A3.39) we obtain |i4(tI-'e-('~,'M<,)-/i(rI-'e-(,"J,M(,,|| ^1#I-»е1('-!)«~#I-')е-(г-г,*)| + И(гI-^[е-(,-гL<'-)~е-(г""г)Л(г)]|| **Ct(t-r)(r-z)>-2, so that \\Z$(t,r,s)\\*b ЩгI-11^''^^ - А{гу-^-(г-2)Мг)\\ S '\\R(r,s)-R(z,s)\\dz r ^ CPi1 (t - r)(r - z)" " 2(r - z)\z - s) -y dz «C^t-rKr-s)"-1 A3.52)
Section 13 Evolution equations 855 holds when у is taken in 1 — /?<)>< 1. Finally, we have Z}(t, r, s) = [Л(()-'е-(,-*М(')]1 :j-*(t, s) +[4)-'е-(,-г)Л(,,-л(г)-^-(г-г,«]^;-к(и) = IZj(t,r,s), A3.53) ( = 8 where Zf (i, r,5)=(l -e-('-rMW)U(t)"'/l(t, s)-A(r)-»R{r, s)], A3.54) Zj(i, r, s)= -e-(,_,MW[yl(r)-'-X(t)-']ii(r,sX A3.55) Zj°(f, r, s)= -(^(t)-/,e-('-,MW-^(t)-'e-(r-,MW)/J(r, s), A3.56) ZJ1(t,'',s)=-(^(f)"/!e-(r-sMW-^(r)-/ie-(r-sM(r))/?(r,s). A3.57) The inequality M(t)-'-,4(ir'KC,|r-s|, 0<i?<i A3.58) follows from A1.27), and we get IIZlfcr.sJKC^t-rHi-s)'-1. A3.59) Furthermore, the inequalities ||Z^(t,r, s)||< C^-rXr-s)', A3.60) IIZ^fer.sJKC^t - r)(r-sf-1 A3.61; follow from A3.16) and A3.39), respectively. Therefore, the proof of Lemma 13.2 is now reduced to llAitr'Rbs^Airr'Rlr.sn^Cfit-rHr-s)»-1 0<s<r<t<T, A3.62) for0<j8<i By virtue of the integral equation A3.30), this inequality is furthermore reduced to \\A(trfRi(t,s)~A(rrilR1(r,sn^Cp(t-r)(r-sy-1 0^s<r<t^T, A3.63) Rx =Ri(t, s) being the right-hand side of A3.31). The estimate A3.63) is established by means of the spectral decomposition in the case that the A(t) are self-adjoint. In the general case, we take the "real part" of A{t) and compare it with A(t) as in Section 5. Then, A3.63) follows (see Suzuki [1982], for details).
856 H. Fujita and T. Suzuki Chapter III 14. Error analysis by energy methods In 1979, the second author obtained the estimate \№)-u(t)\\x^C((h2 + T)/t)\\u0\\x A4.1) by the methods described above in Sections 12 and 13 (Suzuki [1979a]). This method does not employ the Hilbert space structure of the problem and does work in the framework of Banach spaces as well. On the other hand, error analysis on the fully discrete approximation of temporally inhomogeneous parabolic equations has been done independently by Sammon, Luskin, Rannacher, Thomee, Huang, Biocchi, Brezzi and others (Baiocchi and Brezzi [1983], Huang and Thomee [1981, 1982], Luskin and Rannacher [1982], Sammon [1982,1983], e.g.). The methods of these authors are characterized by systematic use of the Hilbert space structure of the problem. In fact, by energy methods, Luskin and Rannacher [1982a] and Huang and Thomee [1982] succeeded in deriving the optimal rate of convergence, that is A4.1), for the backward difference finite element approximation. In the present section, we introduce the methods of Luskin and Rannacher [1982a] and Huang and Thomee [1982] for the semidiscrete approximation and the fully discrete one, respectively. 14.1. A priori estimates First, we shall show several a priori estimates on the evolution equation du/dt + A{t)u=f{t), 0<t^T, A4.2) in a Hilbert space X with the initial condition u@) = mo. A4.3) Here A(t) is an m-sectoria! operator associated with a sesquilinear form at(,} on V, where VczXcV* denotes the triple of Hilbert spaces described in Section 1. As before, we suppose that a,(,) satisfies |at(M,»KC||«||K-|l»lli'. u,veV, A4.4) and Re a((u,u) ^ *S || и || j>, ueV, A4.5) without loss of generality, where С and д are positive constants. Furthermore, we assume that at(u, v), u,ve V, is sufficiently smooth and put a,(u,v) = (d/dt)at(u,v), at(u, v)—(dz/5t2)at{u, v), etc. Then, A4.2) is equivalent to its weak form, nam. iv {u1,v) + at{u,v) = (f(t),v), vsV, A4.6)
Section 14 Evolution equations 857 (,) being the inner product in X. Therefore, we also have (u„, v) + at(u„ v)=- at(u, v) + (ft(t), v), veV. A4.7) Proposition 14.1. The solution u = u(t) o/A4.1) with A4.2) satisfies the inequality i u(t)\\2x+ \\\u(s)\\rds^C\\u0\\2x + C ll/№ ds. A4.8) о Proof. Putting v = u(t) in A4.6), we have i(d/df)iwt)iii+ыт\1 ^ ii/wiif*- \\u(t)\\v by A4.5). Therefore, from Schwarz's inequality, (d/df)iwt)||i + ||ii(OllNc||/(t)ll^ follows. Hence A4.8) holds. П We recall the adjoint a? and the real part a? of au that is, af(u, v) = at(v,u), u,veV, a° = j(a, + af). Henceforth we suppose that the identity at(u, v) = a?(u, v) + (Bu, v), u,v e V, A4.9) holds with a bounded operator B = B(t): V-*X. This assumption is satisfied when a,(,) is associated with a second-order elliptic operator with real coefficients as we have seen in Chapter I. Proposition 14.2. The solution u = u(t) o/A4.2) with A4.3) satisfies the inequality i u(t)\\2v+ \\\u,(s)\\xds^C\\u0\\2y + C о о ll/WIII ds. A4.10) Proof. Putting v = u, in A4.6), we have IM£+«>, "Л=(/, ",Kil!/ll£+iklli On the other hand, by means of A4.9) we obtain at(u, M,) = i( d/dt)at(u, u)—%at(u, v)+^{Bu, ut), so that ||и, III +(d/dt)a,(u,u)^C\\f\\2x + C\\u\\2y. A4.11) The inequality A4.10) follows from A4.5), A4.7) and A4.11). □
858 Я. Fujita and T. Suzuki Chapter. Ill Proposition 14.3. The solution u~u(t) of A4.2) with A4.3) satisfies the inequality ыт2х+ \\u,(s)\\2v <C||K,@)||i + C||M0||i + C {||/(s)||S. + ||/f(s)||Mds. A4.12) Proof. Putting v = ut in A4.7), we have №mut\\l+d\\uAb^C\\u\\v-\\ut\\v + \\fAv-'\\u,\\v ^Ikll^ + CllullKCII/.ll^, so that 1к(ш + lk(s)ll^ds<||af@)||i+C u(s)\\vds + C\ ||/r(s)||^,ds. A4.13) The inequality A4.12) follows from A4.8) and A4.13). П Proposition 14.4. The solution u = u(t) o/A4.2) with A4.3) satisfies the inequality A4.14) s2\\ut(s)\\2ydS^C\\u0\\2x + C {||/(s)ll£. + ll/,(s)llMds. Proof. Putting v — t2ut{t) in A4.7), we have Ud/dt){t2\\ut\\2x) + St2\\ut\\v ^-t2at{u,ut) + t\\ut\\jt+t2(ft,ut) <i&2\\u, \\v + C\\u\\2Y + t\\u, III + \5t2 \\ut \\2V + C\\ft Ц2, so that (d/dt)(t2l|ur|||) + f2||ut||^C||u||^ + j||M,||| + C||/(||^. Therefore, we get s21| ut(s) || jUs< С {\\u(s)\\v+s\\ut{s)\\x + \\Us)\\v.}ds. A4.15) Next, putting v = tut(t) in A4.6), we have t II "t II x + {d/dt)(ta, (u, u)) = a,(u, u) + at(u, u) + ta,{u„ u) + t(f, u,) ^st2\\ut\\2v+CE\\u\\v + CE\\f\\2v*
Section 14 Evolution equations 859 for each e>0. Therefore, t |5||ut(s)|||ds^£fS2||MI(s)^ds+CE|{||WE)^ + ||/E)||^.}dS A4.16) 0 0 0 holds. Taking e>0 small enough and combining A4.15) with A4.16), we have s2\\uM\yds^C {Hs)U + \\f(s)\\2v. + \\fM\h}ds. A4.17) The inequalities A4.8) and A4.17) imply A4.14). □ Proposition 14.5. The solution u — u(t) of A4.2) and A4.3) satisfies the inequality t t\\ut(t)\\v+\\\un(s)l\2xds 0 ( л <С||и,@)||| + С||ио1|| + С {||/,(s)||i + ||/(s)llM<k. A4.18) о Proof. Putting v = u„(t) in A4.7), we have \\uu\\x+at(ut,u„)=-at(u,uu)+{ft,u„). Here, we note %d/dt)at{ut, ut)-at(ut, utt) + $a,(u„ u,) + %{Bu,, utt) <a,(u„u„) + C||uj£+i|K||! as well as -at{u, u„)= -(d/dt)dr(u, ut) + at{u, tt)+at(u„ ut) ^-№t)at{u,ut) + C\\ut\\2v + C\\u\\2v and (/„"„)^4|i/r||i+i||u„||i Summing these relations, we obtain IK III + (d/dt)a,(u„ ut) + (d/dt)at(u, щ) ^C\\u,fv + C\\uU+C\\ft\\l A4.19) Multiplying with t on both sides, we get Фи II* +(d/dt)t{at(u„ ut) + at(u, u,)} ^C(t + l){\\ut\\2v + \\u\\2v + \\ft\\x},
860 H. Fujita and T. Suzuki Chapter III which implies t t ta,(u„ut) + MMs)||i<C ) {lk(s)|^ + Ms)||^ + ||/,(s)||i}ds. A4.20) о о The desired inequality A4.18) follows from A4.8), A4.12) and A4.20). □ 14.2. Error analysis of semidiscrete approximations Let du/dt + A(t)u=0, 0<t^T, A4.21) in X = L2(Q) be the evolution equation described in Section 11 with the initial condition m@) = u0. A4.22) We want to make an error analysis by the energy method concerning the semidiscrete finite element approximation duJdt + Ah(t)uh=0, Q^t^T, A4.23) in Xh with uh@) = Phuo, A4.24) described in Section 12. We recall that Rh = Rh(t): V-*Vh denotes the Ritz operator associated with a,(,): at(Rh{t)v,x) = at{v,l\ veV, ieVh. A4.25) In the present subsection, we only consider the case V=Hq(Q), for simplicity. First, we note the following: Lemma 14.1. Any V-valued C1 function v = v(t) satisfies mv-R^Uj^CW-JiMv + lMBb-,} A4.26) for k= 1,2 andj = 0,1. Proof. From relation A4.25), we have at(Qt(v - Rhv), x)=- Ф ~ RhV, x\ xeVh, so that \\Qt(v-R„v)\\$ ^a,(d,(v~Rhv),d,(v~Rhv)) = at{dt(v-Rhv),d,v-x)-at{v-Rhv,x) ^C\\dt(v~Rhv)\\v-\\dtv-x\\v + \\v-Rhv\\r-\\x\\v, xeVh.
Section 14 Evolution equations 861 We recall \\(Rh(t)-l)v\\HJ^Chk-J\\v\\m, k = l,2, 7=0,1, A4.27) and take x = Rh(t)cltv{t). Then, we get A4.26) forj = 1 and к = 1,2. The remaining case 7 = 0 is obtained by Nitsche's trick and the elliptic estimate of A(t). □ For u0 6 V, щ s= ыА@ е ZA denotes the solution of duh/dt + A„(t)uh=0, O^t^T, A4.28) with uh@) = Rk№0. A4.29) The following lemma holds for the error eh — uh — u: Lemma 14.2. We have \\eh(s)\\2xds^Ch2\\uJ2v. A4.30) Proof. From A4.21) and A4.28), we get ф,ё„,Х) + а,(ёк,х) = 0, xeVh. A4.31) Making use of A4.31) with х = -Кл(с)^а@> we have Ш1Щ\ёА2х+Ф»ёъ) = (д,ёк, A -Rh)eh) + a,(eh, A-Rh)eh) = -(д,ё„, A ~R„)u)-a,(eh,(\ -RM ^CEh2{\\u\\h + \\dteh\\2x}+s\\eh\\2v for each e>0 by A4.27). From the coerciveness A4.5) of at, we have (d/dt)||e»||l + \\h \\2v^Ch2{\\u\\h + №„\\2Х}, taking e>0 small enough. Hence \\eh(s)\\2rdx о r ^Ch2 j 0 {\№\\Ь + \\д,Ш\\2х}&+Ш0)\\ ^Ch2 {Msnh + ll"r(s)|ii + №Ms)\\2} ds+Ch2 \\u0 \\2V. A4.32)
862 И. Fujita and T. Suzuki Chapter III By the elliptic estimate, we have \\u(s)\\H^C\\A(s)u(s)\\x = C\\ut(s)\\x, while the estimate in Proposition 14.2 is valid for и = щ uniformly in h. Therefore, the right-hand side of A4.32) is further estimated from above by Ch2{\\u0\\2r + \\Rh@)u0\\2r}^Ch2\\u0\\2r- □ Remark 14.1. Under the inverse assumption, the L2 orthogonal projection Ph: X-*Xh satisfies ||PJ|K_K ^C (Proposition 3.2). Then, the estimate t \\eh{s)\\2Y&s^Ch2\\u0\\2v A4-30') о follows in a similar manner to the above calculation, where eh — uh — u. However, such an estimate can be verified without assuming the inverse assumption. That is, Lemma 14.3. In the case ofu0 e Уг\Нк~*(£2\к=1,2, j=0,1, the inequality IMsJlUdx^-^Klli*-, A4.33) о holds true. Proof. We fix t > 0 and consider the dual problems (v, dsw) - as(v, w) = (v, eh(s)\ v e V, A4.34) with w(t) = Q A4.35) in X and (X,dswh)-as(x,wh) = (x,eh(s)), XeVh, A4.36) with w„(t) = 0 A4.37) in Xh, respectively. These problems have unique solutions w = w(t) e C°([0, 0-»Щ(г)*))пСо([0, £]^Х)пСх([0, t)-+X), wh=wh(t)e C1 ([0, t]^X„\ respectively. We note the appearance of inhomogeneous terms in the present case, while we repeat the argument in the proof of Lemma 14.2 with reversing the time
Section 14 Evolution equations 863 variable t. Then, we obtain t 1«.4(8)-»<»)Л?(Ь«а! пыжа». Ь( о о Combining this inequality with the a priori estimate in Proposition 14.2, we get t {||8,w,(s)-erw(s)|||+/r2K(s)-MS)||£}dS 0 ( <|{2(iiatwA(S)iii + nerwE)iii)+r2i|WA(s)-WE)|i^dS 0 <C ||e»(s)||Jds. A4.38) 0 Now, we note that (Э,е»,х) + в,(е*.*) = 0, XeVh, A4.39) (z.S,(w»-w))-e,(z,Wj-w) = 0, xeF„. A4.40) Taking v = eh(s) in A4.34), we get He*(s)llx =(Ф), 6sw(s))-as(e„(s), w(s)) = {(e», 8s(w-wA))-as(eft, w- wft)} +(e„, Зяи>л)-ая(ел, wj. Since eft=(uft—Rhu)+(Rhu — u), we furthermore have Ik* (s) 111 = {{Rhu-u, ds(w-wh))~as(Rhu-u, w- wh)} + (d/dsXeh, wh)-(dseh, wh)-as(eh, wh) by A4.40). Finally, from A4.39) lki,(s)||! =(Rhu~u, ds(w-wh))-as(Rhu-u, w-wh) + (d/ds)(eh, wj. Therefore, by eh@) = wh(t)=0 Ikft(s)llld5 = {(Rhu-u,ds(w-wh))-as(Rhu-u,w-wh)}ds ^£ {||8sw-8sWJ|i+/r2||w-w,j£}d5
864 Я. Fujita and T. Suzuki Chapter HI t + Ce \{\\Rhu-u\\x+h2\\Rhu-u\\2v}ds, A4.41) о e > 0 being a constant. Taking e > 0 small enough, we arrive at t t J \\eh{s)\\2sds^C J {\\Rhu~u\\2x + h2\\Rhu~u\\2v}ds A4.42) V J 0 0 by A4.38) and A4.41). Now, the desired estimate A4.33) follows with the aid of A4.27). □ As is described in Section 12, the following theorem implies \\eh(t)\\x^C(h2/t)\\u0\\x A4.43) by Helfrich's duality argument. Recall that V=Hl(Q): Theorem 14.1. Ifu0e Vr\H2{Q), the estimate \\ek(t)\\x^Ch2\\u0\\H> A4.44) holds true. Proof. Putting Z = Z(t) = Rh(t)eh(t) = Rh(t)u(t)-uh(t)eVh, we have «,,0 + я««.й = (9г(Л»ии)-(Э,и,к,й + а1(Ллм,«-А,(м*,й = (dt(Rhu),0-@,u,® = -(8,(M-RfcMU)- Multiplying both sides by t, we get Ud/dt)(tU\\2x) + taM,0^m\\x~-t(dt(u~Rhu),Q so that №(t)\\i^C \H(s)\\xds + C 0 Hence t\\eh(t)\\2x^t\\(u-Rhu)(t)\\x s2\\ds(u-Rhu){s)\\xds. + C\ {||(u-KftU)(s)||i + 52||9s(U-i?AM)||l}ds+Cj \\eh(s)\\2xds. о о
Section 14 Evolution equations 865 By virtue of Lemma 14.1, we have t||e*(()lli<Crt*{max \\u(s)\\2H> O^ssSI + fs{ || u(s)h + ||ut(s)||%г} ds + С f || eh(s)||J ds 1. A4.45) о о On the other hand, |||efc(s)||!d^aA*||«o|& A4.46) о holds by A4.33). Furthermore, the inequalities ||иE)||,р<С||ио11я1, 0<s<T, A4.47) s{\\u(s)\\h + \Ms)\\h} ds^C||u0||^ A4.48) follow from Propositions 14.3 and 14.5, respectively. Summing up these inequalities A4.45)-A4.48), we obtain A4.44). □ 14.3. Error analysis of fully discrete approximations We adopt the backward difference finite element method and obtain the fully discrete approximation (ul(t + T)-ul{t))/T + Ak(t + fyl{t + T) = 0 t=t„, O^n^N, A4.49) in Xh with ul@) = Phuo A4.50) as in the preceding section. We want to estimate the error el(t) = ul(t) — uh(t) by the energy method. Henceforth, we drop the suffix h. Furthermore, we put Acu<(t)=(u<(tj)-u*(tj-i))/T, t = tj, j>1, A4.51) for simplicity of writing. First, we show the following: Lemma 14.4. The inequality и-1 *21к'@11КСт2||и0||1 + Ст£ \\e%)\\h, t = t„, A4.52) holds true.
866 Я. Fujita and T. Suzuki Chapter III Proof. From the relations (M„i;) + at(«,i;) = 0, v e V, A4.53) (Atu\v) + a,(u\v) = 0, veV, t = t}, A4.54) we have DеЧ*Д»)+а,,(е'(*А ")=(?,,»)» veV> A4-55> where tj yj^A,u(tj)-u,(tj)=- (s-tj-Ms)ds. A4.56) tj-i Therefore, putting we get (A,e(tj), v) + at.{e(tj), u) = (yJ- + e(£_/_1), v), veV. Now, we take v = e(t,-) to obtain 1 - (e(tj) - e[tj _!), e(^)) + а, .(ё(гД ё(^)) = (ft + e(tj _ t), %tj)). In other words, Н\Щ)\\х - \№j-i)\\l + ?2 иЛтЯ + Щ^Ы e{tj)) so that ll^)ll!-Htj-i)lll + 2T<5 ||ё(^)||? ^2T(y/ + e(t;_1),e(tJ.)) ^2t{C(||^||^ + ||e@-i)||^) + «5||^;)|]^}. Namely, \Щ)П- №j-i)lll <2CT(||yjl|J. + lle(O-i)ll^), which implies t2l|e'(t.)lll=lle(t-)lll ^2Ct( £ t)\\у;\\2у. + "^ \Щ)\\у\ (i4-57)
Section 14 Evolution equations 867 Here, by definition A4.56), we get ч Therefore, ч t2jbj\\h<r f s|Ms)||^ds tj-l follows, because of We now recall A4.7) and get ||м»И^<С||и,||к + С||и||к. A4.58) Therefore, the inequality In т Z^ll^ll^^t2fs2||M„(s)||^ds 3=1 J 0 t ^Ct {\№П+*2 Ы*Ш} ds о <Ct2||u0||1 A4.59) hold by Propositions 14.1 and 14.4. The desired inequality A4.52) follows from A4.57) and A4.59). □ For the operator T(t) = A(t)~l, the relation \\f\\v~\\T(t)f\\v, feVh, A4.60) holds uniformly in h. Furthermore, from T (t) = — A{t) ~* A' (t)A(t) ~x, the inequality \\T'(t)f\\y^C\\f\\v. A4.61) follows. In terms of T(t), A4.55) can be written as Setting Г, = T(tj)yj, we get MTjVitj) + e%) = rj +(А,Т{ф%-1). Therefore, by taking the inner product with T(tj)ez(tj), we have
868 Я. Fujita and T. Suzuki Chapter III Ha/TKiiT^H^iii-ii^-xK^-jii^+TiM.m^^))!!2} < ||rj||v,|| Т(ф%)||v,||{Л,Щ))е%-MxW Пф%)||x A4.62) as in the proof of Lemma 14.4, where ц > 0 is a constant related to the equivalence A4.60). By the mean value theorem, the right-hand side is furthermore estimated from above for ue(tj^1, t}) as lirjK." IIЩ)еЩ)\\у + ||T'(U)eitj-My || Щ)е%)\\х ^С{||Г;||Г,- \\е%Пу. + W,-i)\\v' \\T{tjy(tj)\\x} ^i^lle'WII^ + N^-OllM + cilir^K^lll + ll^llM so that ШПФЪШ-ЦЩ-.У^-гШ} Нщ{\\еШу*-МЬ-1Ш*}+Ъ^тЬ)\\у> ^Ст{||Г(г7И0)Ш+ 11^11^}- A4-63) From this, we obtain №,№я)\\1+т t \\е%Ш^Ст t ||Г,#. + Ст t \\Т(ф%)\\1 j=i j=i j=i By virtue of a discrete version of Gronwall's inequality, we furthermore get \\T(t.y(tn)\\x + i t \\e%)\\h^Cx t \\rj\\v.. A4.64) j=i j=i Now, in the same way as in the proof of Lemma 14.4, Schwarz's inequality gives h 1|Г;^. = |ТО^#.Ц J (S~tj^J\\T(tj)utt(snv,ds. tj-i On the other hand, T(t)u, + и — 0 implies T(t)u„ = — щ — T'(t)ut, so that for s £ [tj_x, tj2, s» e (tj-1, t,-): lir@)«„(s)||K.<||rE)H„(s)||F, + T||r(s.)H„(s)||F. sj |кE)||к. + \\T'(s)ut(s)\\v* + ||T'(s.)m„(s)||f* <C(||u,(S)||K, + ||ur(s)||r+T||ut((S)||F.). However,
SECTION 15 Evolution equations 869 is obvious. Therefore, noting A4.58), we obtain \\T(tj)utt(s)\\v^C(\\u(s)\\r +T|k(s)||„). A4.65) Consequently, the inequality X t Wrjfy^C t f E-0-lJ{IWs)|l?+T2||«((s)||^ d5 <Ct 2l(||u(s)||^+s2|iMt(s)||^)dS 0 <Ст2||РЫо||! A4.66) has been established by Propositions 14.1 and 14.4. In this way, combining A4.52) with A4.64) and A4.66), we have proven the following: Theorem 14.2. The estimate \\e\t)\\x^C(x/t)\\u0\\x, t = t„, A4.67) holds true. Remark 14.2. As for error analysis on the schemes of higher accuracy, we refer to Baiocchi and Brezzi [1983], Luskin and Rannacher [1982b], and Sammon [1982, 1983]. See also Section 17. 15. Discretization of hyperbolic equations Let us consider the hyperbolic equation 62u/6t2+j/(x,D)u = 0, O^t^T, xeQ, A5.1) with M|efi = 0, 0<t<T, A5.2) or 8M/c4v + <n4fl = 0, O^t^T, A5.2') with «4 = o =«1, (8и/бЧIг = о=«о> xeQ. A5.3) Then, it is reduced to the evolution equation of second order in X=L2(Q), d2u/dt2+ylu = 0, Oscrsjr, A5.4)
870 H. Fujita and T. Suzuki Chapter III with u@) = m15 u'@) = uo, A5.5) Therefore, its finite element approximation can be constructed in a natural manner. In this section, we develop error analysis on the semidiscrete approximation in an abstract framework, for the case that A is self-adjoint. To this end we recall the triple of Hilbert spaces Fc XcV* given in Section 1 and suppose that a = a(,) is a sesquilinear form on V, satisfying a* = a, \a(u,v)\^C\\u\\v'\\v\\v, A5.6) а{и,и)>д\\и\\Ь, A5.7) with positive constants С and 5. Let A be the self-adjoint operator in X associated with a(,). We consider the evolution equation A5.4) with A5.5) of hyperbolic type, where ut e V and u0 e X. We can transform the equation to a system of first order. Namely, let x} «'{л о Then, equation A5.4) with A5.5) is reduced to dl//dt + J3ft/ = 0, O^t^T, A5.8) in Ж with l/m-I/.. ./.Q, V.-($. (.5.9, The relation D(A112)- V and the equivalence ll/llv~M1/2/lli, feX, A5.10) follow from a* =a, and we replace the norm ]|* ||^ in Ж, where № = II«oII* + IKIIk fort/=() by HI'lllo, where !!1Щ1о = 11«о11х + И1/2«1|1х- A5.П) Then, j/ becomes skew-adjoint and hence generates a (C0) semigroup {T(t)} in Ж (Yosida [1964, Chapter 9], Hille and Phillips [1957], e.g.). In fact, by an easy calculation we can verify that Ш is associated with a symmetric sesquilinear form & = &(,) on-T = |
Section 15 Evolution equations 871 such that Щи, W) = i{-a(u0,w1) + a(H1,w0)} /и A /wA A5.12) ■ for L/= 4 W=l ' sf. Therefore, in the case of the function U(t) = T(t)U0 is C1 in Ж on [0, T], contained in D(A), and actually satisfies A5.8) with A5.9). Furthermore we have ||r(f)KC, 0^f<J. A5.13) Now, let Vhc V, h>0, be a family of finite-dimensional subspaces having the property inf||Z-»||K<CAMi>||, veD(A). A5.14) xeVh Xh denotes the space Vh equipped with the topology induced from X. Furthermore, Ph:X-*Xh is the orthogonal projection, and Rh:V-+Vh is the Ritz projection associated with a(,): a{Rh v> X) = Ф, X), veV, xeVh. It is shown in Section 2 that the estimate \\AJ'2(Rhv~v)\\x^Chk'j\\Ak'2v\\x A5.15) holds for k = 1,2 and j=0,1. Let Ah be the m-sectorial operator in Xh associated with a\Vh x v„- The semidiscrete approximation of A5.4) with A5.5) is now given as d2uh/dt2 + Ahuh = 0, O^t^T, A5.16) in Xh with uh@) = RhUl, u'h@) = Phuo. A5.17) We can derive certain estimates on the error eh{t) = uh(t) — u(t) by the real methods developped above. Here, we note that the equivalence \\x\\r~Ull2Xh, X£Vh, A5.10') holds uniformly in ft. 15.1. Error estimate by Helfrich's method In the same way as in A5.4) and A5.5), equation A5.16) with A5.17) is reduced to
872 H. Fujita and T. Suzuki Chapter III dUk/dt+jrfhUh=0, Osjf^r, шХк = \^Л A5.18) with l/»@)=l/o», A5Л9) where The operator — sfk also generates a (C0) semigroup (Tk(t)} in Ж"Л) and the solution l/fc = l/„(t) of A5.18) with A5.19) is given by Uh{t)=Th(t)U0h. A5.20) The inequality TO)KC A5.21) holds. Henceforth, we put yD(Al) Of __ Шад = РЧ11х + Р' + 1/211х for l/0= Wear,, A5.22) where Z=0,i,l- This notation is compatible to |||*(llo f°r & = &o introduced in A5.11). Then the inequalities |||r(t)t/olll,<C|||[/0|||, A5.23) hold true. We can trace the computations in Section 12 to obtain certain error estimates, just noting that (i) T(t) and Th(t) are merely (C0) semigroups and (ii) s/ and sfh are independent of t. In fact, note that is the orthogonal projection and let eh{t)=Th(t)P„-T(t) be the error operator. Then, first we have Ill(i -^,K(t)i/0|||0 <inf|IMt)t/0-zlllo xeSCy,
Section 15 Evolution equations 873 ^ inf \\\T(t)U0 -X\\\o<Ch\\All2u'(t)\\x + \\Mt)\\x] хеХн <Cfc|||l/o|||i/2. Next, the identity -0/8r)[rft(f-r)^£fc(r)] = Тк{1-ф*ъ»ь-9к*<тг) implies &ъФУ- Th(t-r)№„<?h-<?hrfmr)dr A5.24) — T^t-^^lsf-'-sf^^s/T^dr = -[Th(t-rH>h№-1-^;10>hWT(r)Trz'o t = E ei(t), ; = i where eA3(t) = T„(t ~ rHM\j/ -l-s/^»h-\s/2 T{r) dr. Here, we have --'--4 *."K "o'"')(Ro" A о -r'+^'P* so that A5.25) '-1-л/Г1^И*^^СЛ
874 Я. Fujita and T. Suzuki Chapter III holds. Consequently, we obtain ll|eil(t)t/olllo<CA|||j/l/olllo = CA|||l/ollli/2, ll|eft2@^olllo^C/I||K[/o!llo = CA|||[/o|||1/2, ll|e2(t)Uolllo<Crt|||j/2Uol||o = C(h|||t;ollli, which implies the following: Theorem 15.1. The "energy estimate" ||u»(t)-«(t)||K+||Mi(f)-H'Wb<CA{t|l|C/ollli + ll|t/ollli/2} A5.26) holds true. The "L2 estimate" is obtained in the following way. We set The (C0) semigroups {T(t)} and {Th(t)} extend to those on X-x and 3C-lh, respectively, satisfying IITWII,.^.,, \\Th(t)\\x_lh^h^C. A5.27) Under the inverse assumption, A5.10) implies the equivalence )\xM~U-xl2X\\x~UZll2x\\x, xeVh. In fact, we have \\v\\v,~\\A-h\\v, veV, HxIk-MjT^lk, xeV„, while Ы1г*~Ык for X^Vh, follows from ||Ph||^F^C: Hzlk=supp{|tei;)|:oeK,ll«llK<l} ^\\x\\v. = sixp{\(x,v)\:veV,\\v\\v^l} = sup{\(x,Phv)\:veV,\}v\\v^l} ^llzlksup{||Pfcu||K:»6KII»llK<l} < llxllИ! IIP» Нк-к- Furthermore, 11^щ^4щ1Ь-.х = И1/2Л-1/2Р»Ь-.х<С holds, and hence ||PJV,^.^C. A5.28) On the other hand, from a theorem by Ushijima described in Section 5, we have for
Section 15 Evolution equations 875 veX and хеКл that WX-vWy^CWA-^ix-mx ^C{\\{A-^-A^!2Ph)X\\x+\\A^lh-A~ll2v\\x} ^Ch\\X\\x + C\\A^l2x-All2v\\x. Taking x = All2RhA~ll2v, we obtain inf \\x-v\\yt^Ch\\v\\x. A5.29) teVb Since Ph\rh=identity, the inequalities A5.28) and A5.29) yield \\(Ph-l)v\\v^Ch\\V\\x. A5.30) Therefore, first «"ft JO satisfies ll^o»ILr-,-.*-,<C. Furthermore, from j/_1-j/1T1^o» = L_ it follows that Н-яГ1-^-1^*!!*-,- because ||(Л-1рА-л-1)и0|Ь= 0 /i^P» ♦*-,<СЛ, = ||(Л»-1М" ^СЙЦЛ^ИоЦ Now, for WHW^-rW Г'> 411* к^Сй||и0||к* A5.31) A5.32) in #"_!, we repeat the computation given above for eh(t) in 9C0. By virtue of A5.27), A5.31) and A5.32), we shall get ll|eo*(^olll-i<Cfc{t|||t/0|||1/2 + |||l/0|||0}. A5.33) However, the identity h(t) = ZoH(t) + Th{i)@>h-0>Oh) A5.34) holds with Rb-Ри 0 0 0
876 Я. Fujita and T. Suzuki Chapter III Since \\(Rh-phMx=\\Pk(RH-m\x^№k-m\x^chMr, the estimate W^H-^okh-^o^Ch A5.35) holds. By means of \\Tk(t)\\Xo->Xo^C, we have thus proven the following: Theorem 15.2. The "L2 estimate" lk(t)-«(t)b+IM/2(«iW-«'W)b<Cfc{t|||C/o|||i/2 + ll|l/olllo} A5-36) holds true. 15.2. Error estimate by the energy method We recall the weak forms A5.4) with A5.5) and A5.16) with A5.17): (u", v) + a(u, v) = 0, veV, t/@) = ub и'@) = ио, A5.37) (и*л) + ФьХ) = 0, xeVh, uh@) = Rhuu ui,@) = FftMo. A5.38) The energy equalities \\umx+\\A1>1u(t)\\x=\\uo\\2x+\\All2uA2x, A5.39) \\u'bm\x+\\A1hl2uh(t)\\2x=\\PhuQ\\x+\\Aii2Rhu1\\2x A5.40) follow at once from these equations. First, we shall derive the energy estimate of the error. To this end, we put fh(tJ=\\u'h(t)-PHu'(t)\\x + a(uh(t)-Rhu(t),uh(t)-Rhu(t)). Then, we have fh@) = 0 and fh-f'h = {ul-Phu",u'h-Phu') + a(uh-Rhu,u'h-Rhu') = {(u'b,u'h-Phu') + a(uh,u'h-Rhu')} - {(u", u'h -Pku') + a{u, u'h - Rhu')} = a(uh, (Ph-Rh)u')-ф, (Ph-Rh)u') = a{uh~Rku,Ph(l-Rh)u'} <fh\\P„(l-Rh)u!\\v. Supposing the inverse assumption Mv^Ch-'Mx, XtVh, A5.41) we can estimate the right-hand side from above furthermore by Ch~lfh ||A — Rh)u'\\x.
Section 15 Evolution equations 877 Hence f'^Ch-'Ul-R^u'h^ChWAu'h^ChWlUolWu so that Mxatiiii/oiiii. Therefore, we obtain \\uh(t)-u{t)\\y+\\u'h(t)-u{t)\\x </*(*) + ||(Я*-1М011к+ \\u'h(t)-Phu'(t)\\x ^fh(t) + Ch{\\Au(mx+\\u'(t)\\v} <Ch{t|||Uollli + ll|Uoil|1/2}, A5.42) which is the energy estimate given in Theorem 15.1. The L2 estimate is shown in the following way: Setting gh(tJ = \\All2(u'h-PbU')\\x+\\uh-Rhu\\x, A5.43) we have gh@)=0 and ghg'h={Abm(ul-Phu"),A;m{u'b-PbU'))+{uh-Rhu,u'h-Rhu') = {ulA^(u'h-Phu')) + a{uh,A;Y{u'h-Rhu'))} - {(«", A;1 (u'h - Phu>)) + a(u, A;! (Hi - Rhu'))} = a(uh, Alx (Ph - Rh)u') - a(u, A;l (Ph - Rh)u') = a(uh - Rhu, A^ x {Ph - Rh)u') = (uh-Rhu,Ph(l-Rh)u') ^gh\\PH(l-RH)u'\\^gh\)(l-Rb)u'\\x. A5.44) Therefore, g'h^Ch\\u'\\v^Ch\\\U0\\\m, so that gh(t)^Cth\\\U0\\\ll2. A5.45) We note uh-u={uh-Rhu)+{\-Rh)u A5.46) with \\(l-Rh)u\\x^Ch\\u\\v^Ch\\\U0\\\o- Furthermore, A-42(u'h-u')=(A^>2Ph-A-v2)(u'-u'h) + A^<2(u'h-Phu') A5.47)
878 H. Fujita and T. Suzuki Chapter III holds with \\(A^'2Ph-A~v2)(u'-u'h)\\x ^Ch\\u'~u'h\\x ^Ch{\\u'\\x+Wk\\x}<Ch{\\\Uo\\\o + \\\Uok\\\o} ^САЦЦ/olllo A5.48) so that the L2 estimate ll«»W-«(f)ltx+P/2D(t)-«'W)b<Cfc{t|||l7ollli/2 + lllt/olllo} A5-49) follows. Furthermore, we see that the argument given above implies: Theorem 15.3. The estimate \\uh(t)^u(t)\\x+\\A-1'2(u^)-uXt))\\x^Ch2{t\\\U0\\\1 + \\\U0\\\^ A5.50) holds true. Proof. In fact, we have g'^Wil-R^u'h^Ch^Au'W^Ch^WUolW, by A5.44) and \\(l-R„)ux^Ch2\\Au\\x^Ch2\\\U0\\\ll2 in A5.46). Furthermore, from Theorem 15.1, we get \\(A^'2Ph-A-^)(u'-u'k)\\x^Ch\\u'-u'h\\x <Cft2{t|||l/o||ll+|||l/ollll/2} in A5.47). These inequalities give A5.50). □ 15.3. Comments As to the discretization for the time variable t of the semidiscrete approximation A5.16), there is a scheme called "Newmark's jS's", that is, Dziul(t) + Ahu\{t) + px2AhDtiul(t)=0, j = t„ = nx, n = 0, l,...,N A5.51) with ul(Q) = Rhuu иЦ-1) = Я»и1-тР»и0. A5.52) Here, т > 0 is the size parameter, /? 5= 0 is a fixed constant, and DIfu(£) = T-2{u(£ + T)-2u(r) + u(t-T)}, t = t„. Taking 0<y<v/4/(l-4/J) in case 0<j8sSi, Fujii [1974] proved the following:
Section 15 Evolution equations 879 Theorem 15.4. Suppose the inverse assumption A5.41) and the inequality т/И^ку A5.53) in case 0</?<£, where k>0 is an absolute constant. Then, the "energy estimate" HW-ttrnv+iiDfum-i/imx^cih+TMUoiih, t=te<T, A5.54) holds. See also Ushijima [1975, 1979], for the proof. Theorems 15.1 and 15.2 are also shown there under the inverse assumption A5.41), by a systematic and sophisticated use of Helfrich's method. For related works, we refer to Baker and Bramble [1979] and Geveci [1984].
Chapter IV Refinements and Generalizations 16. L °° estimates Motivated by the L°° error analysis on the finite element approximation of elliptic equations by Nitsche and Scott, a similar study for parabolic equations has been done by Nitsche himself, Thomee and others (Nitsche and Wheeler [1981-82], Schatz, Thomee and Wahlbin [1980], Thomee and Wahlbin [1983]). In the present section, we shall present the method of Thomee [1984] on semidiscrete approximations. To fix the idea, we consider again the parabolic equation Ъи/Ы=Ли, 0<t<co, xeQ, A6.1) with the boundary condition w|8fi = 0, 0<£<oo, A6.2) and with the initial condition u|(=0 = u0(x), xeQ, A6.3) where the bounded domain flcR2 has a smooth boundary 30 or is a convex polygon. Taking X = L2(Q), V=Hl(Q) and a(u,!;)=|fiVwVvdx, u,veV, we can reduce equation A6.1) with A6.2) and A6.3) to the evolution equation du/dt + Au = 0, 0<t<co, A6.4) in X with ы@) = мо, A6.5) where A is the self-adjoint operator associated with a(,). Let VhcV be the finite-dimensional space introduced in Section 1 and let Xh be the space Vh with the inner product induced from X. Then, as we have seen in Section 7, the semidiscrete approximation of A6.4) with A6.5) is given by the evolution equation duh/dt + Ahuh-Q, 0<t<co, A6.6) in Xh with uh@) = Phuo, A6.7) 881
882 Я. Fujita and T. Suzuki Chapter IV where Ah is the self-adjoint operator in Xk associated with a\VhxVh and Ph: X->Xh is the orthogonal projection. Let {e~u}t>0 and {<z~tAh}t>0 be the semigroups generated by A and Ah in X and Xk, respectively. Then, Eh(t) = e-'AhPh~c-'A A6.8) denotes the error operator. In this section, we want to give an upper bound of Ц£*@мо1к°°- То this end, we must mention the L00 stability of approximate solutions. Namely, Theorem 16.1. The estimate l|e-rAhxllL^qiog/i|||Z|iL», %eVh, A6.9) holds true. In order to prove Theorem 16.1, we introduce the discrete delta function <5£e Vh and the discrete fundamental solution rt(t)€Vh for each xeQ through №,*) = *(*), X*Vhl m)=e-tA*6i, respectively. For xeVh, we have (rj(t),z) = (e-^S,Z)=(«)e-'^z)=(e-Mhz)W, so that He-'^k-^suplirjKOk'-llzk-- xefi Therefore, A6.9) is reduced to \\rxh{t)\\L^C\\ogh\, t>0, xeQ. A6.10) According to Nitsche, we introduce the modified distance function (o(y)^coxh(y) = (\y-x\2 + h2)^2. Then, we have с к lib m'2dx^C rdr r2 + h2' = C|logfc|, and hence lirSCOIIb' < «a» IU- - lleorSCOII^^CIlog Л11'2 HorjfWII^. Consequently, we have only to show that wcomn^Qioghf'2. (i6.il) Henceforth, we suppose the inverse assumption. Then, Descloux's lemma holds with ||PfcHioo^oD^C, as is described in Section 3. Furthermore, we simply write || • ||
Section 16 Refinements and generalizations 883 instead of ||*|Il2- The following lemma is related to the superapproximation property of Nitsche and Schatz [1974]: Lemma 16.1. The estimate \\V(cD2x-Ph(co2x))HCh{l\x\\ + \\a>Vx\\}, X^K, A6.12) holds uniformly in xeQ. Proof. Let ф = пк{оJх) be the interpolant of w2x- Then, for each Texh we have \\о>Ч-Ф\\*(т) + ЧЧ(е>2Х-Ф)\\*т)<Ск2 £ W((o2x)\\b\Ty |a|<2 By virtue of Leibniz's rule and Dxx=®, |«l = 2, we find |«|«2 so that \\a>2x-<t>\\+h\mco2x-<i>n^Ch2{\\x\\ + \\coVx\\}. On the other hand, for \]/ = Ph{a>2x)eVh the inequality \тф-ш\^с^1\\ф-ф\\=с^'\\рк{ф-(о2х)ЫС^'\\ф-(а2х\\ holds. The desired estimate A6.12) is an immediate consequence of these two inequalities. □ The following lemma shows that the modified distance function a compensates the singularity of the discrete delta function 5% in a certain sense. Lemma 16.2. The inequality IfflKIKC A6.13) holds true. Proof. We fix xeQ and set <у=<у£ and 5=61. Let QJ={yeQ:2j-1h<\y-x\<2Jh}, ;>1, J20 = {ye Q:\y- x\<h}. Obviously, aO)< Ch2{ yeQj, so that \\соё\\^С £ h2j\\3\\L2(Qj). j»o To estimate ||5||x.2(nj), we take q>eC§{Qj). Supposing хеТетА, we have \F, q>)\ = |E, P„<p)\ = \{Phcp){x)\ < Cft-11| Phcp \\L4n by the inverse assumption. Furthermore, from Descloux's lemma the right-hand
884 H. Fujita and T. Suzuki Chapter IV side is furthermore estimated above by Ch-1 sxv{-C2i)\\(p\\LHnjy Therefore, mL4iij)^ch-i^{-cv) holds and A6.13) has been established. □ The following lemma is a kind of discrete Sobolev imbedding: Lemma 16.3. The inequality (Ixll^Cllog/il^llVxL ieVh, A6.14) holds true. Proof. We combine Sobolev's inequality HxllL,«y/2||Vzil, %eVh, for p > 2 and the inequality hh^Ch-^Wxhp, xeVh, which is derived from the inverse assumption. Putting p = \\ogh\, we obtain A6.14). D Now, we can proceed to prove Theorem 16.1. Proof of Theorem 16.1. We have to show A6.11). In fact, we have i(d/dt) || шГ ||2 + || coVr 12=(8, Г, ш2Г) + (У Г, V(aJT)) - 2(УГ, аГУш) while (д,Г,ф) + (ЧГ,Чф) = 0 for фе Vh. Therefore, i(d/dt) || шГ || 2+ || coVr ||2 = 1 + 11 + 111, where I = (9, Г,со2Г- ф), II = (УГ, Щсо2Г - ф)\ III = - 2(Vr, соГУсо). Taking ф = Рк(со2Г)е Vh, we first have I = 0. By means of the inverse assumption and Lemma 16.1, we next get ||IIK||Vr|HIV(a>2r-4>)|| ^aririiftfliru + iitovrii} = C{||n2 + ||r|||]a>Vr||},
Section 16 Refinements and generalizations Finally, liniKCiirHwvrii because of ||Vo>||L<»<C. Summing these inequalities, we obtain (d/dr)||Q>r||2 + ||fflVr||4C||r||2, so that t t \\wr(t)\\2+(\\a>w\\2ds^\\a>e*h\\2+c{\\r\\2ds. о о Consequently, A6.11) has been furthermore reduced to ||r||2dS<C|logfc| A6 by virtue of Lemma 16.2. As in Section 14, we put ГА = /1Л~1. Then Г satisfies Thdtr+r=o, t>o, no)=si, and i(d/dt)(Tfcr,r)+||r||2=o holds. Therefore, we have t i(T,r,r)+f||r||2ds=i(Tfc«)(x). о Now, we set GJ=VJ. Th is positive-definite and the inequality A6.15) has been reduced to GJ(x)<C|logfc|. A6 The function G£e Vh is nothing but the discrete Green function: (VGJ, V*) = tfTk8S, VX) = EL x) = X(x), %e Vh. In particular, G£(x)=]|VG£||2. Therefore, from Lemma 16.3 we obtain ^(xJ^qiogAI^HVGjfH = C(\logh\-GUx)I12, which implies A6.16). Thus, the proof has been completed. □
886 Я. Fujita and T. Suzuki Chapter IV By means of the real methods described in Chapter III, we can show, for instance, the following error estimates from Theorem 16.1 and Nitsche-Scott's inequality \\Rhv-v\\L«,^Chz\logh\ И и-*.*, which is described in Section 4. We shall make use of Helfrich's method to prove the estimate presented in the following: Theorem 16.2. The inequality \\uh(t)-u(t)\\L*>^Ch2\\ogh\2J sup ||и(£) 11,^,00+ ||M(||r2,=cdsl A6.17) ]ossr«r holds true for the solutions u = u(t) and uh = uh(t) of A6.4) with A6.5) and of A6.6) with A6.7), respectively. Proof. As we have shown in Sections 12 and 15, the error operator Eh(t) is expressed as Eh(t) = {l-e-tA*Pk)(R„-l)e-tA t Г + U-^-r)A"AhPh(Rh-l)(t'rA-e-tA)dr. о Integrating by parts, we have t - \t-{t'r)AhAhPh{Rh~\){e-rA-Q-,A)ur 0 t = \c-{,-r)AhPh{Rh-l){&~rA~eA)JTV0+ e-(t-r)AhPk{Rh-\)AQ-rAdr о and write E„(t) = i Elh{t) with El(t)=(Rk-l)e-tA, E2h{t)=-e-'-A»Ph{Rh-l), Elit)- e~«-r)A»Ph(Rh-l)Ae-rA dr.
Section 17 Refinements and generalizations 887 From Nitsche-Scott's inequality it follows that \\El(()u0h^Ch2\logh\\\u(tnw^. On the other hand, Theorem 16.1 yields \\E^t)u0\\L^Ch2\logh\2\\u0(t)\\W2^, |£A3W"olli^C/!2|log/l| K(t)||*ri.«dr. Thus, A6.17) has been established. □ We can deduce other error estimates even for nonsmooth initial data u0, taking account of the smoothing properties of {e~'A} and {e~'Ah}. One simple way to establish these inequalities is to use the discrete elliptic Sobolev inequality given in Section 5. However, we do not go into details here. Another way based on the energy method can be seen in Тномёе [1984]. 17. Rational approximation of semigroups In Chapters II and III, we studied the fully discrete finite element approximation for parabolic equations. There, we considered the forward, backward and Crank- Nicolson schemes. However, many other methods exist as for the time discretization. In the present section, we present a general way to deal with them. To begin with, let {е~м}1>0 be a uniformly bounded (C0) semigroup in a Banach space X and put r{z) = A + z)~i, zeC. Then, r is a rational function with the properties Кг) = е-г + 0(|г|"+1), |z||0, A7.1) for p = 1 and |r(z)|<l forRez^O. A7.2) These properties A7.1) and A7.2) are sometimes referred to as being of order p and (Dahlquist's) Л-acceptability, respectively. The operator rn{xA) is regarded as an approximation of e~'A for t = nz, т>0 being the time mesh parameter. Actually, from the identity (d/ds)(rn{sA)e-n(T-s)A) = nrn-1(sA){r'(sA) + r(sA)}Ae-n(z-s)A = nrn+1(sA)sA2e-niz-s)A it follows that
888 H. Fujita and T. Suzuki Chapter IV Г lrn(iA)~-c'-niA2A'2 = n rn+1(sA)se-"{x-s)Ads. о On the other hand, the uniform boundedness of {е~ы}(>0> ||е-'л|КС, 0<t<oo, implies the stability ||г"(ъ4ЖС by a standard argument in the semigroup theory (Yosida [1964, Chapter 9], Kato [1966, Chapter 9], e.g.). Therefore, we obtain \\[r\tA)-t-mA^A-p-l\\^Ctxp, t = nz, A7.3) for p= 1. Hersh and Kato [1979] and Brenner and Thomee [1979] have shown the error estimate A7.3) for a general rational function r = r(z) with the properties of being of order p and Л-acceptability. In the present section, we shall give an error estimate when {<z~'A}t>0 is holomorphic, taking account of its smoothing effects and following the method of Baker, Bramble and Thomee [1977]. Before doing that, we generalize the notion of Л-acceptability and introduce the following: Definition 17.1. For 0^в<к, let Ze = {zeC:|argz|^0}. Then, a rational function r = r(z) is said to have the properties (i)fl if IKz)|<l, zeZe, and |r(oo)|<l, (ii)e if|Kz)|<l,z6Ze\{0}, (iii)e if |r(z)|<l, 0<|z|<<5, zeZ, for some S. The property |r(z)|<l, zeSe is called Ae-acceptability. Then, ^-acceptability means ^4„/2-acceptability in this terminology. Remark 17.1. Each rational function of order p ($sl) has property (iii)e for 0<в<^п. In fact, for (ре[ — в, д~] we have (8/8p)|r(pei',)|2 = 2 Re Kpe^rXpe'V- Since r@) = e-*|z„o = l, r'@) = (d/dz)e-z|2=0=-l, we get (9/ep)|r(^)|2]p = 0=-2Ree^ = — 2 cos <p «£-2cos0<O.
Section 17 Refinements and generalizations 889 Remark17.2. Let r = r(z) be the Pade approximation of e ~2 with degrees n and m of the numerator and the dominator, respectively. Then r is of order p = n + m. Furthermore, it has property (i)„, (ii)e or (iii)e for some в, 0 < в < jn for n <m, n = m or n>m, respectively. In fact, in this case we have r{z) = R„m(z) = P„m(z)/Q„m(z) with " (n + m-j)\n\ , . Г0 (n + m)!j!(n-j)! &»(*) = Z (и + m—j)!m! j.fo (n + m)\jl(m-jy. The relation |i?nmB)-e-2KC|z|"+m+1, |z[|0, is well-known (Hitotumatu [1963], e.g.). Put (n + m-j)\nl n . (n + m)!j!(n-j)! Then, in the case of n < m we have a»mO")^0, anm(i)^am„(j), Q^j^n, so that, when т^^^тс, IReP^pe^) Z fl-.mO')(-p)J'cOS,/(p J = 0 < Z anm(j)PJCOSJ'<p J = 0 m < Z amn(j)pj cosjcp j=o ^ReQ^pe^). Similarly, under the assumption тф ^ \к we have llmJWpe**)!- Z a«>nO')(-p)-'sin(+» j=o < Z a„m(j)pJ sin jcp j=o m < Z am„(j)pJ sin j(p j=o m Z flnmO*)P''sin(+^) = |Ime„m(pe±i<!')|.
890 Л. Fujita and T. Suzuki Chapter IV Furthermore, m RcQnm(pe±iv)= E am,(J)picosjq>^0, iimcp^^iz. Therefore, \r{pe±iv)\^l,0^(p^^i/m. Now in the case of n < m, we have r(oo) = 0. Hence r = Rnm satisfies (i)e for в = jn/m. If n = m, then |r(oo)| = 1 < 4- oo so that |r(z)| ^ 1, геГл/2т, by Phragmen-Lindelof's theorem. Therefore, |r(z)| <1,геГв\{0}, for в<%п/т by the maximum principle, which means that r = R„m satisfies (ii)9. Finally, by virtue of Remark 17.1, r = Rnm always satisfies (iii)e for some в in 0<6<in. Remark 17.3. Incidentally, the Pade approximation r = Rnm of e~z is /^-acceptable if and only if m — 2 ^ n < m. This was conjectured by Ehle and has been proven by Wanner, Hairer and Norsett [1978]. In the case of n = m and n — m—\, R„m obeys a recursive formula based on the continued fraction expansion (Mori [1974]). Namely, from the expansion e = 1-г 1+г 2-2 Ъ + z — which is expressed by z \ z z z z z z z e =T-1 + 2-3 + 2-5+ —H2 —2/—H—* we get , ( 2z z2 z2 z2 e2=l + 2-z+6+10 + --- + 2B/-l)+- Accordingly, a rational approximation H2k + 1{z) = G2k+i(z)/F2k+1(z) of ez is introduced inductively with F1 = l, F3 = 2-z, Gx = \, G3 = 2 + z, F2j+1^2Bj-l)F2j-1+z2F2J-3, G2j-+1 = 2Bi-l)G2j-1+z2G2j-3. Similarly, H2k(z) = G2k{z)/F2k(z) is defined by F0 = l, F2 = l-z, Go = 0, G2=l, 2 _b ,2/-l,, F2J- = |2B/-1)+—^zjF^-.+^z^.^, h-1) + 27bZb-2+l^G—
Section 17 Refinements and generalizations 891 Then, the relations H2k+i(—z) = Rkk(z) and H2k(—z) = Rk-ik(z) can be verified. Now, let X be a Banach space and — A be the generator of a holomorphic semigroup {e~M} of type Fo,Mo),0<0o<%n, M0^l, that is, <т(А)с1во\{0}, WHk-Aj'^l^Mo, A>0, ||(Я-А)-1||<А#./|Я|, £>0, ЯеС\Я+а. Furthermore, suppose that A is bounded for simplicity. Then, Theorem 17.1. The error estimate \\r"(xA)~e'mA\\^C(r/t)p, 0<t = m<T, A7.4) holds under the assumption that for some 6>в0 one of the following holds: (a) r(z) has property (i)e, (b) r(z) has property (ii)e and т\\А || ^iWj < oo, (c) r(z) has the property (iii)e and т||Л||<Aft <<5. Here, £/ie constant C>0 ш A7.4) depends only on 90, 9 and M6l, 90<в1<6, for the case (a) and on 60, в, M$l, 0o<0i <#> and Mx for the cases (b) and (c). Remark 17.4. Baker, Bramble and Thomee [1977] have shown the theorem for the case that A is self-adjoint under slightly weaker assumptions on r{z). Theorem 17.1(a) has been obtained by Le Roux [1979]. The other cases can also be derived by her method. We have to prepare some propositions for the proof of Theorem 17.1. Henceforth fi > 0 and С > О stand for small and large absolute constants, respectively. Proposition 17.1. If a rational function r = r(z) is of order p (^ 1) and Ae-acceptable, 0<d<27t, then there exist constants с > О and ft>0 such that \r"{z)-Q-nz\^Cn\z\p+1Q-^ez, zeZe, |z|^ff. Proof. First, we note that n r"(z)-e-"z= £ r->-1(z)(e-z-r(z))e-(""J'+1)z. On the other hand, the inequality |r(z)-e-*|^C|2|"+1, М^го, holds for some a0 > 0 so that |r(z)Ke-Rez + C(Rez)"+1, zele, |z|«r0.
892 H, Fujita and T. Suzuki Chapter IV Set f{t) = e -' + Ctp +1. Then, / =/(t) is real-valued, /@) = 1 and/'@) < 0. Therefore, f(t)^e~p,,0^t^a1, holds for some /J>0 and ot >0. Namely, |r(z)|<e-'R", IzKminK.ffo), ze£e. Therefore, if \z\^a = min(ai,<j0) and ze Es, we have n |r"(z) —e~BZ|< У g_0'-l)?Rez^42|p+le-(n-J+l)cosflRez j=l = Cn\z\p+1e~l>nRez. П Lemma 17.1. Under the assumption of Theorem 17.1, the estimate \\1гп(тА)-е~,А^А'!'\\^Стр, f = nt, (П.5) holds, where the constant С >0 depends on в0, в and MBl, в0 < вх < в, for the cases (a) and (b), and on в0,в,Ме1,в0<в1<в, and M2 for the case (c). Proof. We fix a constant M>0 such that rjj Л || <M,M1 <MandMi <M<<5for the cases (a), (b) and (c), respectively. We may suppose that cr < M in Proposition 17.1. We take a path of integration Г0 which is divided into three parts: Г0 = Г1иГ2иГ3, where the sets Г^,Г2 and Г3 are given by Г1={Я=ре±1в1:0</7^сгт-1}, Г2 = {Я = ре±!в,:сгт~1^р<Мт-1}, Г3 = {Мт-1е^: \ф\^вх) with в0<в1<в. Then, we have lrn(rA)-e'tA']A-1'-- 1 / + 2тп А Гг Гз = 1 + 0 + 111. By virtue of Proposition 17.1, we have !|I|KC \n(zp)l'+1e-ll',tpcoseip-p-?- o + (/•"(тЯ)-е-а)Я-р(А-Л)-МЯ (mp)p +1 e " ^"^>cos e' (nip)" " — тр 0 From the assumption it follows that
Section 17 Refinements and generalizations 893 |II||<C {|г-(тА)Ц-|е-'Ч}|АГ'||(Я->4)-1 || |<1Я| Гг «sc P Finally, КА-ЛРК 1 IAI-MII M-x\\A\ holds on Г, so that пике {ИтА)| + |е-(Я|} IАГ* || (А-ЛГ1 |l IdAI <C Шг-1)-" МйФ -С х' {Mt } М-х\\А\\-СмХ ■ In the cases (a) and (b), the constants С in the estimates of ||I|| and ||II|| are independent of M, while CM goes to zero as M-*oo. Thus, the proof has been completed. □ Proposition 17.2. If r — r(z) satisfies (iii)e, then for each S', 0<5'<8, there exists /? > 0 such that HzJI^e-'1'1, |z|<<5', zeZe. When r~r(z) satisfies (ii)e, 3' can be arbitrarily large. Proof. As is shown in Remark 17.1, the inequality C/8p)|r(^)|2|p=o<-2cos0, \ф\^в, A7.6) holds so that for each /?0,0</?0 <cos#, there exists a <x0 >0 such that |r(pe*)|<exp(-0op), 0^p^ao, \ф\^в. Next, we set тах{|г(ре!*)|:ре[ао,П4>е[-0,0])} = 1-е<1, and take j9t >0 so that exp( — P^S') — 1-е. Then, we have \r(p^)\^expi-piP), <J0^p^S', \ф\^в. If we set p = min(fi0,P1), the desired inequality holds. □
894 H. Fujita and T. Suzuki Chapter IV Lemma 17.2. In the cases (b) and (c) of Theorem 17.1, the estimate М«г"(ъ4)||<С.(птГ" A7.7, holds for <x>0. Here the constant Co>0 depends on Mx, Proof. We take a path of integration as in the proof of Lemma 17.1, and obtain ){тХУгп{тХ){Х-АУ1&к 1 (mAfrn(xA) = ~-. 2tci + Г1+Г2 Гз = 1 + 11. From the assumption, we can take d' of Proposition 17.2 in M<S'. Then, we have ||IКС (итЯ)ае-^со5в1 — =С«, IIIIII^C M {mMyt~m\t . йф^Ся. П M-Mi By means of Helfrich's duality argument, we can now give the Proof of Theorem 17.1(b)-(c). Dividing NeN into n = J + m with l,meN and 0^/ —m^l, we have rn{zA) - e -tA = (r\zA) ~Q-hA)A~ prm(xA) + e~ltAA ~p{rm{xA) - e " mtA). The first term of the right-hand side is estimated as || [r\iA)-t-lxAlA~T || • I Aprm(xA) || < Стр{тг)-р <C/np = C{%ltf by Lemmas 17.1 and 17.2. The second term is estimated similarly by the adjoint form of these lemmas. □ Unfortunately, r(oo)=0 does not hold in general even in the case (i)9. Therefore, we cannot take & = 00 in Proposition 17.2. Consequently, we have to give other considerations to prove Theorem 17.1(a). Lemma 17.3. Let ф be a meromorphic function. Suppose that there exist functions fi,f2: R+-+IR + and a constant R>0 such that for 0Ь в0 <в1г Г dr Г dr Ш/-)у <oo, \f2{r)—<co, о о
Section 17 Refinements and generalizations \ф(г)ЫМг\), \фB)-ф(со)\^М\2\), \z\^R, |argz| = 0b Then, the inequality 895 | #)|KCS i !fdr)y + {f2(r)y +Moo)| holds true. Proof. Putting h(z) = <p(z)- 1+z cp(co), A7.8) we have (f>(A) = h{A) + <p(cc)A(l + A)-1. Here holds. Therefore, we have \\<р(А)ШКА)\\ + С\<р(со)\. To estimate || h(A) ||, let Г be the path of integration given in Chapter II, that is, the positively oriented boundary of Zei,90 <0t <6. Then, the identity h{A) = 2rci /j(z)B-^)_1dz holds. We have |fc(z)| = 1+Z <ЛA*1) + МИ<х>I if \z\^R and |argz| = 0b while IMz)l = 1 <p(z)-(p(oo)+ —-(p(co) I +z </2(|г|)+Лм°°I holds for \z\^R and |argz| = 0i. Therefore, we get R oo ||А(ЛЖС^ !{Mr) + r\cp(oo)\}y + f|/2(r)+^r)|ly = C Л(г)у+ Г/2(г)у+(л + |||ф(оо)|^
896 И. Fujita and T. Suzuki Chapter IV Hence the desired estimate A7.8) has been established. □ Now, we can give the following proof. Proof of Theorem 17.1(a): Take 6t in в0<91 <в and cr>0 as in Proposition 17.1. Then, the function ф(г)=е~пг — r"(z) satisfies, for some /?>0, j0(z)|^Cn|z|p+1e-M2|cosf\ zeSei, \z\«r. A7.9) On the other hand, making /? > 0 smaller if necessary, we may suppose that sup{|r(z)|:zelei,|zj^ff}=e-'f<l from the assumption. Furthermore, |r(z)-r(oo)|s£C/|z|, |z|><x, holds because r — r{z) is rational so that /•"(z)- -r"(oo)| = |r(z)-r(oo)| ^Ce-"*/|z|, B-l E Лгу j = 0 ze£e , '1-;M |z|><7. Therefore, l</»(z) —Ф(с»)| ^e-"'x|coe*1 + Ce_"^/|z|, zeStl, \A>°- A7.10) Finally, |ф(оо)|^е-пр. A7.11) By virtue of these three inequalities A7.9)—A7.11), we obtain ШхА)\\ = \\г"{тА)-е-тЛ || <C/np from Lemma 17.3. □ The above abstract theorem can be applied to the semidiscrete finite element approximation of parabolic equations, that is, duh/dt+Ahuh = 0, 0<t<T, A7.12) in Xh with uh@) = Phuo, A7.13) Ah, Xh and Ph being as before. We recall that the spectrum of Ah lies in a parabolic region in the complex plane which is uniform in h. Furthermore, for each 60 in 0<60<%к, there exist constants Mo>0 and A>0 independent of h such that — Ah + к is of type (в0, M0)- Therefore, taking vh — e ~ Xtuh instead of щ, we can make the exponent 0>O as small as we like. On the other hand, under the inverse assumption holds the inequality M»K/№
Section 18 Refinements and generalizations 897 with an absolute constant /?>0. Therefore, we have obtained: Theorem 17.2. Let r = r{z) be a rational function of order p(>l) such that for some в>0 one of the following is satisfied (a) (i)e, (b) (ii)e and г/h2 ^M1<co, (c) (iii)„ and x/h2^Mt<ph. Then, the estimate \\f(zAh)-e-,A*^a?/tY, t = m holds with a constant C>0 independent ofh. We can also apply Theorem 17.1 for the semidiscrete finite element approximation with higher accuracy. In that case, we will obtain a more natural result from the viewpoint of the correspondence of the rate of convergence with respect to the time discretization and the space discretization. The theorems on the backward, forward and Crank-Nicolson schemes in Sections 8 and 10 may be obtained as special cases of that general result. 18. Multi-step methods The time discretization considered in the preceding section determines, at the wth time mesh, u„ = w(f„) from the (n- l)th value u„_ t =u(t„-l), starting from the given initial data u0 = u(t0), where t„ = nx. In this sense, it may be called the single-step method. On the contrary, a multi-step method of order q (^2) is the way to determine un from the values at the preceding g-steps: u„_x, м„_2,..., u„-e after determining ul,...,uq-1 from нр by other suitable methods. In the present section, we adopt this kind of schemes to discretize in time the evolution equation du/dt + Au = 0, (Kf^T, A8.1) in a Banach space X with и@) = ио. A8.2) Here — A is the generator of a holomorphic semigroup {e~'A} of type (в, М) with O<0<^r7i and M^ 1. For simplicity, A is supposed to be bounded. In the scheme which we consider here, one firstly determines uly...,uq_1 from u0 by a single-step approximation in use of a rational function, and then computes un,n^-q through the relation £(а; + тМК+.=0, n=0,l,..., A8.3) ;=o where а^Ь^Ш.
898 H. Fujita and T. Suzuki Chapter IV Without loss of generality, we may take aq = 1 in A8.3). Setting P(Q = Е*=0вгСг апс* S@ = Z?=obiC'. we call the scheme A8.3) the multi-step method (P,S). Then: Definition 18.1. A multi-step method (P,S) is said to be of order p (> 1) if I a;=0, X w,= Z К i = 0 > = 0 i = 0 f Va^jt^'K 2<j<p. Understanding 0° = 1 and 0-0 = 0, we can write these relations simply as i = 0 t = 0 The meaning of the above equalities is seen as follows. Regard A as the differential operator d/dx, and make the Taylor expansion around т = 0 of the finite difference operation LtI>]= £ [а;у(х + г'т) + тЬг/(х + гт)]. i = 0 Then, the relations are obtained by making the coefficients up to pth powers of т to be zero. The scheme described by A8.3) implies the backward, forward, Crank- Nicolson and modified Crank-Nicolson (which is of order 1) in the case of q= 1. Putting w((, z) — P(Q + zS(Q, we have the following definition: Definition 18.2. For 0<в<%к, a multi-step method {P,S) is said to have the property: (III)e if each of the roots {£,-: l<j<^} of P(£) = 0 is simple and lies in the closed unit disk |£|^1, and moreover, if for any Cj with |C,| = 1 the inequality Rcy\lj\>sme holds for Xj = CjS(Cj)/P'{Cj), (II)e if it has the property (III)e and each of the roots {(,-(z): 1 </ < q} of w(C, z) = 0 is simple and lies in the open unit disk |£| < 1 for ze £e\{0}, (I) if it has the property (II)e, each root of S(Q = 0 is simple and lies in the open unit disk |C|<1, and bq >0. We note the following: Remark 18.1. If {P, S) has the property (III)„, then there exists a constant к>0 such that the condition on {C/z): 1 </<q} stated in (II)e holds for 0< \z\ < к and zeSe. In fact, £.(z) is continuous in z and we may suppose that £Д0) = Cj by reordering the numbers if necessary. We have only to show that | £Дг) | < 1,0 < | г | < к, z e Ze, holds for some к > 0, assuming | Cj I = 1 • Actually, Cj is simple and g(p)=|СДре'*) |2, | ф | ^ в, is differentiable in p at p = 0. Therefore, we have 3'@) = 2Re((CJ.@)C}@)ei^).
Section 18 Refinements and generalizations 899 On the other hand, we find £K0)=-S(y/F(Q from P(tj(z)) + zS{£j(z)) = 0 so that g'@) = -2Ке^е{ф. Therefore we have sup{0'(O): \ф\^0} <0, and the assertion follows. Let the rational function r=r(z) which is used to construct the approximate solution of A8.1) with A8.2) for the values of щ = u(ti), 1 < i^q-1, be of order p-1, and let the multi-step method (P, S) for the values u„ = u(t„), n~^q,be of order p. In this case, we have: Theorem 18.1. The error estimate ||и.-е-ми0КС(тЛП|<о11, 0<£ = пт<Т, n^q, A8.4) holds, if for some в>в0 one of the following is satisfied: (a) (P, S) and r have the properties (I)e and (ii)e, respectively (b) (P, S) and r have the properties (III)e and (iii)e, respectively, together with the condition T\\A№M1<wm@,K,l/\bt\). Here the constant S>0is introduced so that r has no poles in the part \z\<5 ofz e Ee. Further, к is the positive constant specified in Remark 18.1. In the case (b), the constant C>0 in A8.4) depends on M1. Furthermore, min(£, к, 1/| bq |) can be replaced by min(£, 1/| bq |) and min(£, к), provided that (P, S) has the property (II)e and that bq > 0, respectively. Le Roux [1979] has treated the case (a). The other case can be also proved by her method. Before proceeding to the proof of Theorem 18.1, we introduce a few notations. We set at + biZ $i(z)=—-r~ aq+bgz for l^i^q. Then, the relation A8.3) reads: £<5;(тЛ)цл+; = О, n = 0,l,2, .... i = 0 Therefore, let us consider first the functions u„ = u„(z), zeC, satisfying t ^(z)u„+j(z) = 0, «=0,1,2,.... A8.5) t = 0 Then, for these functions we have:
900 H. Fujita and T. Suzuki Chapter IV Lemma 18.1. If the {(j(z): 1 ^j^q] are distinct, then u„ = un(z) (n^q) satisfies )u„(z)]^CY Uz))« j=i where С > 0 depends only on u0,...,uq-y, sup{iCj(z)\:Kj^q} and ini{\^{z)~Uz)\;i^j). Proof. From A8.5) follows 9-1 u„+q(z)=- £ di(z)u„ + i(z), A8.6) i = 0 while the relation P{Q + zS(Q~(aq + bqz) t ({-tj(z)) j=i implies c« = -2>,-(*K'+ fl &-№)> A8-7) i=0 J=l where at = <Si{z), i=Q,. ..,q — l is determined by For the moment, we write dt = <5;(z), Cj — C/z) and u„ = «„(z), for simplicity. If M; = Qj holds for 0 < i < g — 1 with some j, then we have u„ = (", и = g, g +1,,.. . In fact, we can show inductively that «»+,=- ei^».+i=- ("i ^ Ъ=сгл. i' = o \г=о / Therefore, in the case where «<= £ аД 0^j<?-l, A8-8) j=i for some а,е С, l^j^q, we obtain « j=i In particular, this yields KKmaxflajl.-l^e} £ |f,|". The linear transformation S:(a1(...,a,)T>-»(u0,...,tt,_i)T
Section 18 Refinements and generalizations 901 is expressed by the matrix /' - '\ Ci - f, W_1 - c; 7 whose determinant is that of Vandermonde. Since the {£/. Kj^g} are distinct, such a {u0,..., uq-!} as in A8.8) can represent an arbitrary element in C. Thus the proof has been completed. □ Remark 18.2. In the case that {[j{z): 1 <j ^ q) has the maximum of the multiplicities m = m{z) > 2, then the inequality \un{z)W{l+np*-' tUz)\" j=l can be proved. Let us turn to the scheme under consideration. The approximation operator is denoted by Tln(A): u„ = Tl(A)u0. Defining a rational function s„ = s„ (z), n — 0,1,2..., inductively as s„(z) = r"(z), 0^n^q-l, * A8.9) £ й(г&+«(*) = 0, и = 0,1,2,..., j=o we obtain ГВИ) = 5„(М). Therefore, £ <5;B)(е-<"+£,г-5п+1.B)) = ^(г) A8.10) i = 0 holds, where Fj(z)= t ,5,.(z)e-(J-+'->z,j>0. A8.11) >=o The following lemma represents the error operator Tl(A)—t~tA through the operator Fj(tA) just introduced. Let us define the rational functions yj(z),jeZ,
902 Я, Fujita and T, Suzuki Chapter IV inductively as f0, j<0, A8.12) X yj.k(zMt-t(z) = 0, j>0. « I fc = 0 Lemma 18.2. The identity e-'^A-TUq(A) = t y^j(zA)Fj(zA)~4Y iy^k^A)dj.k(z)(e-^A-rhA)) j=0 j=l t = 0 holds for n = 0,1,... . Proof. From the definition A8.11) of Fj} we have £ yn.j(z)Fj(z)= t t ШУп-№-и+* j=0 j=0< = 0 = t S ^(z)y„_,(z){e-(J-+i»z-Sj.+/(z)} j = 0 i = 0 = *ZBj{z)(e-*-s}(z)), j'=o where j Bj(z}= £ 7»-i(z)^-t(z), O^j^q, k=0 B4+j(Z) = S ^-j-tW^-Jz). Note that yy=0,;<0, by the definition. Here, Bj(z) = 0 for <?<;<n + g-1 from the definition A8.12) of y,(z). On the other hand, we have q B„+q(z)= X 7-k{z)Sg-k{z) = Sq(z)=l, so that e-<-+«*-s„+,(z) " 9-1 J" = E7n-j(z)F,-(z)-i; E y„-)t(z)^_k(z)(e^2~sJ.(z)). Q J = 0 j = 0 fc = 0
Section 18 Refinements and generalizations 903 By virtue of the identity in Lemma 18.2, the error estimate \\cxp{-tn+eA) -TUM)W is reduced to those on yy(z) and Fj(z). First, we have for yj=yj{z) that: Proposition 18.1. Let (P, S) have the property (III)e and к>0 be the constant given in Remark 18.1. Then, for each к'<тпт(к, l/|b,|), there exist constants C>0 and /?>0 such that |ye(zKe-*"*', |z|<k\ zeZ,. Proof. By the argument in Remark 18.1, the roots {Cj(z): 1 <y'^g} of w(£, z) = P@ + zS(C) = 0 satisfy the inequalities |^(z)|<e-^!, |z|<k', *e2T.. for some /?>0. Therefore, we have |ye(z)l<Ce-"w, |z|^k', zeXe, by Lemma 18.1. Here, the constant С >0 depends on^o,...,?,-!, which, however, are polynomials ofd0,...,dq-1. These <50,..., Sq~ t are bounded on |z| ^ jc' < l/|b,|, hence C>0 also is bounded there. Recall that aq= 1. □ As for Fj = Fj(z) we have: Proposition 18.2. If(P,S) is of order pC*l), then for к'<1/|Ь,| гйе inequality |F,-(z)|^C|z|"+1e-JR", Rez^O, |z|<k', fcoMs true. Proof. Putting v(t) = e~'z, we have Fj(z)= t SiW-^^ + b.zy'li cvU+i)- t hv'U + i)}. i = 0 U = 0 t = 0 J Since (P, S) is of order p, the right-hand side is equal to (а, + М[Ев< J°'+p~f)V+1)(t)dt j
904 In fact, we have H, Fujita and T. Suzuki Chapter IV A) = (j + i-ty „<*+ Pi vip+1\t)dt 'Щ¥^гнЩ^*>-Ч) (p-1)! t=j + i + -+U+i~t)v'(t)+v(t) = ~IT,v{l\j) + vU+i), 1 = 0'• B\= \U+/~Z 1 v^ + 1\t)dt (P-1)! 1 (p-1)! U+ (p-2)! U + -+(j + i-t)v"(t) + v'(t) Jt=j > il which yield = t {а,А)-ЫЪ} i = 0 = t {atvfj + O-btvfU + ty-i tat )vU) ( = 0 \i = 0 J p vw(i) ( ч q I i = 0 = ЁКФ'+O-Mj'+O} in view of Definition 18.1.
Section 18 Refinements and generalizations 905 Now, into the identity J + i j bj'^v-ц we substitute z;(p+1)(r) = ( —z)p+1e Гг, which gives the desired inequality. □ Remark 18.3. In the case of bq >0, we can take к' < к and к' = oo in Propositions 18.1 and 18.2, respectively. Now, we can give: Proof of Theorem 18.1(b). We fix к' in Mx <к' <min(J,K, l/\bq\). The rational function r — r(z) satisfies Hz)-e-J'z|^C|z|', Uj<«-1, \z\^o, zE£e, for some ct>0. Then, in the second term of the right-hand side of the equality in Lemma 18.2 for O^k^j^q— 1 and j^k, we have for | z | ^ ct0 = min(<r, к') and z e Se. On the other hand, for a fixed M in M i < M < к', we have for <r0^|z|^M and ze.£e. We take a path of integration Г0 which consists of three parts (as sets): Г0=Г1иГ2иГ3, where Г1={Я = ре±Ш1:0^р<стот}, Г2 = {A = pe±ie': стот ^p^Mz'1}, Г3 = {Мт-1е^:|(Р|^01} with 60<6i<6. Then, we have 7„_lk(T^_t(T/l)(e-^-fJM)) 1 2ra + J + I Ь„-1к(тЯ^_к(тА)(е-^-г'(тА))(Л-у1)-1<и r, r2 r3 =1+11+111,
906 Я. Fujita and T. Suzuki of which each term admits of the following estimates: 00 г МЫС e-№s~ic)wT „\p dp --С(п-кГр^Сп-р, n = q,q+l,... IHKC | е-К"-ъг*1=с > ^Cn~p, Chapter IV 1ШКС e-№-(cmd(/,^Cn-p Thus the second term in Lemma 18.2 is estimated as j=l k=0 Next, to estimate the first term of the right-hand side of the identity in Lemma 18.2, we note that the inequality Z yn-j(z)Fj(z) <Спе-*,|1|Ыр+1 follows from Propositions 18.1 and 18.2 if \z\ ^ M and z e Se. As to the representation j = 0 1 + 2ra Г1+Г2 Гз = IV + V, Ху„-,-(т1)^-(т1)(Я-Л)-Ч1 we obtain IIV К С ие-'"'(тр)'+1 —=Си-', о J М-Л^ Therefore, the first term in Lemma 18.2 is also estimated as «SOT". Thus the inequality A8.4) has been established. □ £ y„-j(TA)Fj(xA)
Section 18 Refinements and generalizations 907 When (P,S) has the property (II)e, we can replace тт(к, l/\bq\) by l/\bq\ in Proposition 18.1, and accordingly, min(£, l/|b,|) by min(£, l/|ft«|) in our theorem. Similarly, min(£, к, l/\bq|) is replaced by min(£,к) in the case of bq >0 by Remark 18.3. Proof of Theorem 18.1(a). If (P,S) has the property (I)„, then the inequality K/z)|^e-"W, |z|<K'f zeZe> holds for arbitrarily large к' > 0 with some /? > 0. Therefore, the conclusion of Proposition 18.1 holds for arbitrarily large к'. On the other hand, by making /?>0 smaller if necessary, we have IC,(z)|<e-'<l, |z|>k', ze£e, A8.13) so that \yn(z)\<Ce-", \z\>k', zelt, A8.14) is obtained by the argument in the proof of Proposition 18.1. The first term of the right-hand side of the identity in Lemma 18.2 is estimated as follows. We fix M > т || Л || and represent ty„-j(TA)Fj(TA)=±( j +j\fiye-J{rX)Fj(TX№-Ar1M. Г1+Г2 Гз = 1 + 11. As in the proof for the case (b), we have || 11|< Си"", where the constant С > 0 is independent of M. On the other hand, in the case of ;>1, we get 1 2ra уп-№)¥}(тк)(к-А)-1йк «SC t~(n~MMp+lt~iMmseiA(p, of which the right-hand side goes to zero as M^oo. To estimate the term fory'=0, we have to make the same consideration as in the preceding section. Namely, first we note lim F0{z) = b0/bq \z\~>oo,ze2e and obtain \yK(z)F0(z)-yH(co)(b0/b9)\ <\yn(z)-yH(co)\\Fa(z)\ + \F0(z)-F0(co)\\ye(ao)\.
908 Я. Fujita and T. Suzuki Chapter IV Here, we can show the inequality \уя(г)-У»(со)\^Се->Ч\г\, \z\>R, zeSe, A8-15) for R >0, /?>0 being taken smaller if necessary. To this end, we have only to derive |C;(zr-0(°o)"|<Ce-'"/|z|, \z\>R, zeS9, for the roots {£j(z): 1 </<#} of w(£,z) = P(Q + zS(Q = 0, by means of the proof of Lemma 18.1. In fact, A8.13) implies KlW-UcoT| = IO(zW;MI-1"! CJ_t(z)Cj(oo) U = o where £,-(z) —£y(oo) = 0(l/|z|),|z|-KX>,zeZe> because {P,S) has the property (I)e. Thus A8.15) has been proven. On the other hand, |F0(z)]=$C is obvious and also \F0(z)-F0(co)\ = a0-b0/bq A , „ -HAT+I:<5i(z)e ^C/|z| + Ce"№osei. Therefore, we obtain iy„(z)F0(z)-y„(OT)(fe0/bfl)|<Ce^" -+e-Wcos9' . Thus, by virtue of Lemma 17.3 the first term of the right-hand side of the identity in Lemma 18.2 is estimated as t Vn-j(tA)Fj(TA) kc{B-' + e-'" + |y„(«)F0(«>)l} j = o II Let us proceed to the second term in Lemma 18.2. As is shown in the proof for the case (b), the inequality \yn.k(zK^k(z)(e-j2-^(z))\^C\z\'>e-^-'k^ A8.16) holds for |z|^cr0 and zeZe with e-snppPJl=Cn-p On the other hand, we have \yn-k(z)ej^(z)(c-^-r\z))-y^k(co)ej.k(^)(-rj(^))\ < 1У--*(г) —y„-*(oo)| l^-t^W^)! -Цу„_*(оо)| |^j_fc(^) —^-_*(г)| |г^(г)| + |?„-lt(QO^-t(oo)||r'(z)-r'(oo)|. A8.17)
Section 18 Refinements and generalizations 909 Here, |у„_л(г)-уя_ь(ооЖСе~|2| for |z|>a0 and zeZe as we have seen before. Furthermore, у and Sj-k satisfy \Sj-k(z)-5j-k(co)\^C/\z\, |r(z)-r(c»)KC/|z|, \z\>a0, respectively, because they are rational functions. Also, we recall that ly.MKCe-'», \8„-k(z)\^C. Summing these relations, we can give a bound from above of the right-hand side of A8.17), namely, Ce~fn/\z\. Therefore, the second term in Lemma 18.2 is estimated as "t t уя-кЫЪ-к(тА)(е-"л-г>(гА)) j=l k = 0 by Lemma 17.3. Thus, the proof has been completed. D
Chapter V Commentary Concluding our article, we give some commentary. First we describe some variants of the finite element method, and then show applications to nonlinear problems. 19. Variations of the finite element method 19.1. Lumping of mass In Chapter II, we constructed a semidiscrete finite element approximation of the heat equation du/dt~Au = f m(Q,T)xQ A9.1) with и|8п = 0, и|г=0 = и0(х), A9.2) where Q c: U2 is a convex polygon. Namely, taking account of its weak form (d/dt){u,v)+a(u, ») = (/, v), veV=Hh@) A9.3) with M|f=o = «o A9-4) in X = L2(Q), we discretize it as (d/dt)(uh,x)+a(uh,X) = (f,xl XeVh, A9.3), with (Uh\t=o,X) = (uo,X), X£Vh, A9.4)s in Vh, the approximate space ^constituted by piecewise linear trial functions, where a(u,v)= \VuVvdx. A9.5) a As a variant, there is a method for semidiscretization called lumping. Although there are several methods of lumping, we state here a procedure based on 911
912 Я. Fujita and T. Suzuki Chapter V the barycenter. Let the barycenter of the triangular element Те xh whose nodes are Ръ P2 and P3 be GT, and let the midpoints of РгР2, P2P3 and РгРх be P'3, P\, and P'2, respectively. We divide Г into three subdomains by three segments GTP'3, GrP\, and GTP'2 and assign each subdomain to the corresponding node. For example, the quadrilateral PiP'3GTP'2 is assigned to the node Px. After making this assignment for all the nodes, we combine the subdomains assigned to a particular node into a region. This region is called the barycentric region corresponding to the node. Then, for each inner nodeje/ in the triangulation тй we define a piecewise constant function 7pj whose value is 1 in the barycentric region corresponding to the jth inner node and is 0 otherwise. From these Tpj, a finite-dimensional space Vh is generated. There is a natural isomorphism kn The lumped mass finite element approximation for A9.3) is described as {d/dt){LHuh,Lhx) + a(uh,x)=(Lhf,Lhx), ieVh, A9.5),, in Vh with (w*Uo,x) = ("o,;d l£Vh. A9.6)A Writing uh(t) = 'LjllIVj(t)<pj, we can reduce A9.5),, to (d/dt)MhUh + AhUh = Fh with suitable matrices Mh and Ah and a vector Fh. Here, Жй is diagonal, which is useful in real computations. Error analysis for A9.5)л has been done by Ushijima [1979]. 19.2. Upwind finite element method It is known that for the diffusion equation (du/dt)-dAu + (b-V)u = Q in @, T)xQ A9.7) with w|en = 0, t4 = o = "oM A9-8) involving the drift terms b ■ V, we obtain a fine approximate solution by combining finite element with "upwind difference", especially in the case that ||b||Lco is large relatively to d>0. Among several manners of upwind difference, we describe a method employing the lumping for the elliptic equation -dAu + {b-V)u=f in О A9.9) with 4.0 = 0 A9.10) according to Tabata [1979].
Section 20 Commentary 913 For simplicity, let fie R2 be a convex polygon. Subject to its triangulation xh, we take the spaces of piece wise linear and piecewise constant trial functions Vh and Vh, respectively as above. Then, the upwind finite element approximation of A9.9) is given as da(uh,X)+(Bhuh,LhX) = (Lhf, Lhx), xeVh, A9.9), in Vh, where Lh:Vh-+Vh is the isomorphic mapping described before. Here, the upwind approximate operator Bh:Vh-+ Vh is defined as follows: First, we say that an element Texh is upwind to its vertex je T if the vector b(j) starting from/does not cross T\{j}. (Here, b = b(x) is the coefficient in A9.9).) For each inner node je/ we select an element T= TjBih which contains/ as a vertex and is upwind to/ Then, for vhe Vh we set where {(pf.jel) denotes the basis of Vh given before. These methods are also applicable for parabolic equations and their error analysis has been made. See Tabata [1979] and Ikeda [1983]. 20. Application to nonlinear problems 20.1. Nonstationary Navier-Stokes equation The nonstationary Navier-Stokes equation du/dt-Au + (u-V)u + Vp=f in@, T)xQ B0.1) with V-i/=0, B0.2) гя = 0, u|t=0 = «oW B0.3) can be treated from the semigroup-theoretical viewpoint of Fujita and Kato [1964]. Namely, let QeU2 be a convex polygon and set X = {veL2(QJ:divv = 0], vn = 0 on Sfi, V={veHUOJ: divv=0}. A sesquilinear form a{,) on Vx V is defined as a(w, v) = (V w, Vv), w, v e V, where (,) denotes the natural inner product in L2(QJ. A self-adjoint operator A in X is associated with a - a(,) and the relation D(A) = H\QJ V holds. For each у > \, the nonlinear operator F: D{Ay)^-X is defined through (Fw, v)=((w • V)w, v), veX,
914 H. Fujita and T. Suzuki Chapter V by virtue of Sobolev's imbedding. Explicitly, Fw = P(wV)w, where P:H= L\QJ-*X is the orthogonal projection. Then the system B0.1)-B0.3) is reduced to the evolution equation du/dt + Au + Fu = Pf B0.4) with ц@) = ыо B0.5) in X. The self-adjoint operator —A, called the Stokes operator, generates a semigroup {е~ы}. Hence B0.4) is furthermore reduced to the integral equation u(t)-- e~('"sMF(u(s))ds + c~u's)APf(s)ds. B0.6) Iteration works for this Volterra equation because the estimates \\F(W)\\x^C\\AyW\\x\\A^w\\x, weD(Ay), \\F(w)~F(v)\\x^C{\\AvWUAll2(">--v)\\x+\\All2v\\x\\A\w-v)\\x}, w, veD(Ay) follow from Sobolev's imbedding. Thus, for instance if/=0 and u0eX, there exists a unique solution и = и(г)еС([0, оо)-^Х)<=С1(@, oo)-*AT)nC°(@, <x>)-+V) of B0.6). Consequently, a semidiscrete approximation of B0.1) will be obtained by dis- cretizing the stationary Navier-Stokes equation -Au + (u-V)u + Vp = f in £2 B0.7) with V-u=0, B0.8) и|м, = 0, B0.9) or simply stationary Stokes equation -Au + Vp = f in О B0.10) with V-m = 0, B0.11) и|м = 0. B0.12) Put H = L2(£2J and M = {qeL2(Q): |„q(x)dx = 0}. Then the relation V={veH:(v,Vq) = 0for any qeM] holds. Noticing this, Bercovier and Pironneau [1979] introduced the following way of approximation for the stationary Stokes problem. Namely, let xh, h > 0, be the triangulation described in Section 19.2. We break each element T=AP1P2P3<BTh into two four parts by taking middle points P\, P'2 and P'3 of P2P3, P3P1 and P^P2, respectively, and construct AP\P'2P3, AP2P3Pi, AP'3P\P2 and APiP'2P'3. Thus
Section 20 Commentary 915 we obtain a new triangulation zh. The approximate space Hh of H consists of the functions that are continuous, linear on each Tezh, and 0 at the boundary 90. Moreover, M» denotes the set of functions that are continuous, linear on each Гет», and 0 integral over Q. Then we put Vh = {vkeHh:(vh,Vqh)=0 for any qheMk}. We note that Vh<£V. A self-adjoint operator Ah in Vh is associated with the sesquilinear form a\VhxVh. It is taken as an approximation of the Stokes operator A. Such a way of approximation is sometimes referred to as the mixed finite element method. If the condition each Tezk has at least one vertex in Q B0.13) is satisfied, then what is called Brezzi's condition (Brezzi [1974], Kikuchi [1973]), there exists a /J>0 such that sup %^l >Р\№Лх, <ZAeM„ as/tjO B0.14) Z*Vh \\X\\vh holds. See Bercovier and Pironneau [1979] or Glowinski and Pironneau [1979]. From this follows the unique existence, stability, and error estimate about the approximate solution uhe Vh of Ahu„-QJ B0.10)» for B0.10), where Qh'H-+Vh denotes the orthogonal projection. On the other hand, approximation Fh: Vh-+ Vh of the nonlinear term F in B0.4) is done in the following way. Namely, noting (Fw,v)=((wV)w,v) = b(w,w,v), weD(Ay), veV for %;w,tO=i{((w-V)w,t;)-((u-V)y,w)}, ueD(Ay), v,weV, we impose (Fhw,v„)=b(w; w, vh) for any v„eVh. Under these situations, a semidiscrete approximate equation duh/dt + Ahu„ + Fhuh = QhPf B0.4)» with u„@) = Qhu0 B0.5)* in Vh for B0.4) arises, which is reduced to i i uh(t)=e-'AhQhu0- |e-««-^Ffc(tt»(s))ds+ | e-('-»A*QhPf(s)ds. B0.6)»
916 H. Fujita and T. Suzuki Chapter V Its error eh(t)=uh(t)—u(t) has been analyzed by Heywood and Rannacher [1982] and Okamoto [1982b]. For instance, the following estimates are derived by Okamoto [1982b], where b is a positive constant. Theorem 20.1. Ifu0eV, then \\uh(t)-u(t)\\x^C(\\u0\\v)e-St(h2/t112), 0<t<oo. holds. Theorem 20.2. Ifu0eD(A), then \\uh(t)~u(t)\\x + h\\uh(t)-u{t)\\v^C(\\Au0\\x)e-s,h2, 0<t<co, holds. Other mixed finite element methods for Stokes or Navier-Stokes equations are described by Girault and Raviart [1986]. 20.2. Parabolic equations of blow-up type Asymptotic behavior of the semilinear parabolic equation ди/Ы-Ли = /{и) in@,T)xQ B0.15) with w|en = 0, и|г = о = "оМ B0.16) has been studied theoretically. Some equations allow only local solutions in time. Such a phenomenon is called blow-up of the solution. See, for instance, Fujita [1966, 1969]. Using the finite difference method, Nakagawa [1976] gave numerical computations and their numerical analysis for such equations. Later Nakagawa and Ushijima [1977] studied similar problems for the finite element method. Recently, Weissler [1984] has discovered what is called one-point blow-up for this type of equations. Motivated by this, Chen [1986] reexamined the finite difference method for such equations and studied blow-up of approximate solutions. 20.3. Stefan problem The Stefan problem invol' es the phenomenon of phase transition in heat equations. The most simple case is that of one space dimension and one phase. Then, the problem is to find the temperature и = u(t, x) and the free boundary s = s(t) through 8ц/6г = <7(82и/бх2), 0<x<s(t), 0<t<T, B0.17) with w|* = o=0(O5 и|»=*) = 0, и|,=о =/(*)> 0<x<b = s@), B0.18) ds/dt = - ux(t, s(t)), 0 < f < T. B0.19)
Section 20 Commentary 917 Here, 0 < x < s(t) represents the region of water. Ice is supposed to be located in s(t)<x< oo. For this problem, Mori [1976] proposed a scheme employing finite difference and finite element for the time and space variables respectively, and showed stability and convergence to the genuine solution of its approximate solutions. Mori [1978] studied similar problems for the two-phase case.
References Agmon, S. A965), Lectures on Elliptic Boundary Value Problems (Van Nostrand, Princeton, NJ). Agmon, S., A. Douglis and L. Nirenberg A959), Estimates near the boundary for solutions of elliptic partial differential equations satisfying general boundary conditions, I, Comm. Pure Appl. Math. 12, 623-727. Baba, K. and M. Tabata A981), On a conservative upwind finite element scheme for convective diffusion equations, RAIRO Numer. Anal. 15, 3-26. Ваюссш, С and F. Brezzi A983), Optimal error estimates for linear parabolic problems under minimal regularity assumptions, Preprint. Baker, G.A. and J.H. Bramble A979), Semidiscrete and single step fully discrete approximations for second order hyperbolic equations, RAIRO Numer. Anal. 13, 75-100. Baker, G.A., J.H. Bramble and V. Thomee A977), Single step Galerkin approximations for parabolic problems, Math. Сотр. 31, 818-847. Baker, G.A. and V.A. Dougalis A980), On the L°°-convergence of Galerkin approximations for second-order hyperbolic equations, Math. Сотр. 34, 401-424. Bales, L.A. A984), Semidiscrete and single step fully discrete approximations for second order hyperbolic equations with time-dependent coefficients, Math. Сотр. 43, 383-414. Bercovier, H. and O. Pironneau A979), Error estimate for finite element method solution of the Stokes problem in the primitive variables, Math. Сотр. 34, 401-424. Bramble, J.H. and S.R. Hilbert A970), Estimation of linear functionals on Sobolev spaces with applications to Fourier transforms and spline interpolation, SI AM J. Numer. Anal. 7, 112-124. Bramble, J.H. and Р.Н. Sammon A980), Efficient higher order single step methods for parabolic problems, Part I, Math. Сотр. 35, 655-677. Bramble, J.H., А.Н. Schatz, V. Thomee and L.B. Wahlbin A977), Some convergence estimates for semidiscrete Galerkin type approximations for parabolic equations, SIAM J. Numer. Anal. 14, 218-241. Bramble, J.H. and V. ThomeeA972), Semi-discrete least square methods for a parabolic boundary value problem, Math. Сотр. 26, 633-648. Bramble, J.H. and V. Thomee A974), Discrete time Galerkin methods for a parabolic boundary value problem, Ann. Math. Pure Appl. 101, 115-152. Brenner, P. and V. Thomee A979), On rational approximations of semigroups, SIAM J. Numer. Anal. 16, 683-694. Brenner, P. and V. Thomee A980), On rational approximations of groups of operators, SIAM J. Numer. Anal. 17, 119-125. Brezzi, F. A974), On the existence, uniqueness and approximation of saddle point problems arising from Lagrangean multipliers, RAIRO Numer. Anal. 8, 129-151. Chen, Y.G. A986), Asymptotic behaviours of blowing-up solutions for finite difference analogue for u, = uxx + u1+", J. Fac. Sci. Univ. Tokyo, Sec. IA 33, 541-574. Ciarlet, P.G. A978), The Finite Element Method for Elliptic Problems (North-Holland, Amsterdam). Ciarlet, P.G. and P.A. Raviart A973), Maximal principles and uniform convergence for the finite element method, Comput. Methods Appl. Mech. Engrg. 2, 17-31. Crouzeix, M. A980), Une methode multipas implicite-explicite pour l'approximation des equations devolution parabolique, Numer. Math. 35, 257-276. Dougalis, V.A. A979), Multistep-Galerkin methods for hyperbolic equations, Math. Сотр. 33,563-584. 919
920 H. Fujita and T. Suzuki Douglas Jr, J. and T. Dupont A970), Galerkin methods for parabolic equations, SI AM J. Numer, Anal. 7, 575-626. Douglas Jr, J., T. Dupont and L. Wahlbin A975), The stability in L« of the L2 -projection into finite element function spaces, Numer. Math. 23, 193-197. Dupont, T. A973), L2-estimates for Galerkin methods for second order hyperbolic equations, SI AM J. Numer. Anal. 10, 880-897. Fix, G. and N. Nassif A972), On finite element approximation in time dependent problems, Numer. Math. 19, 127-135. Fried, I. A980), On the optimality of the pointwise accuracy of the finite element solution, Internat. J. Numer. Methods Engrg. IS, 451-456. Fujie, Y. and H. Tanabe A973), On some parabolic equations in Hilbert space, Osaka J. Math. 19, 127-135. Fujii, H. A974), A note on finite element approximation for evolution equations, Кдкуйгоки RIMS Kyoto Univ. 202, 96-117. Fujita, H. A966), On the blowing up of solutions to the Cauchy problem for u, = Au + u1 *", J. Fac. Set Univ. Tokyo Sec. 1A 13, 109-124. Fujita, H, A969), On the nonlinear equations Au+e"=0 and ct>/or = Av+e", Bull. Amer. Math. Soc. 75, 132-135. Fujita, H. and T. Kato A964), On the Navier-Stokes initial value problem, I, Arch. Rational Mech. Anal. 16, 269-315. Fujita, H. and A. Mizutani A976), On the finite element method for parabolic equations, I: Approximation of holomorphic semi-groups, J. Math. Soc. Japan 28, 749-771. Fujita, H, and A. Mizutani A978), Remarks on the finite element methods for parabolic equations with higher accuracy, in: Functional Analysis and Numerical Analysis (Japan-France Seminar in Tokyo and Kyoto 1976) (Japan Society for the Promotion of Science, Tokyo). Fujita, H. and T. Suzuki A979), On the finite element approximation for evolution equations of parabolic type, in: Computing Methods in Applied Sciences and Engineering 1977, Lecture Notes in Mathematics 704 (Springer, Berlin). Gekeler, E. A978), A priori error estimates of Galerkin backward differentiation methods in time-mhomogeneous parabolic problems, Numer. Math. 30, 369-383. Geveci, T. A984), On the convergence of Galerkin approximation schemes for second-order hyperbolic equations in energy and negative norm, Math. Сотр. 42, 393-416. Girault, V. and P.A. Raviart A986), Finite Element Methods for Navier-Stokes Equations (Springer, Heidelberg). Glowinski, R. and O. Pironneau A979), On a mixed finite element approximation of the Stokes problem, I: Convergence of the approximate solutions, Numer. Math. 33, 397-424. Grisvard, P. A985), Elliptic Problems in Non-smooth Domains (Pitman, London). Haverkamp, R. A984), Eine Aussage zur LK-Stabilitat und zur genauen Konvergenzordnung der Hj-Projektion, Numer. Math. 44, 393-405. Helfrich, H.P. A974), Fehlerabschatzungen fur das Galerkinverfahren zur Losung von Evolutions- gleichungen, Manuscripta Math. 13, 219-235. Helfrich, H.P. A975), Lokale Konvergenz des Galerkinverfahrens bei Gleichungen vom parabolischen Тур in Hilbertraumen, Thesis. Hersh, R. and T. Kato A979), High-accuracy stable difference schemes for well-posed initial-value problems, SI AM J. Numer. Anal. 16, 670-682. HeywoOD, J.G. and R. Rannacher A982), Finite element approximation of the nonstationary Navier-Stokes problem, I: Regularity of solutions and second-order error estimates for spatial discretizations, SI AM J. Numer. Anal. 19, 275-311. Hille, E, and R.S. Phillips A957), Functional Analysis and Semi-groups (American Mathematical Society, Providence, RI). Hitotumatu, S. A963), Approximation Formulae (Takeuchi Shoten, Tokyo) (in Japanese). Huang, MY. and V. Тномёе A981), Some convergence estimates for semidiscrete type schemes for time dependent nonselfadjoint parabolic equations, Math. Сотр. 37, 327-346.
References 921 Huang, MY. and V. Thomee A982), On the backward Euler method for parabolic equations with rough initial data, SI AM J. Numer. Anal. 19, 599-603. Ikeda, T. A983), Maximum Principle in Finite Element Models for Convection-Diffusion Phenomena (Kinokuniya/North-Holland, Tokyo/Amsterdam). Jesperson, D. A978), Ritz-Galerkin methods for singular boundary value problems, SI AM J. Numer. Anal. 15, 813-834. Kato, T. A961a), Fractional powers of dissipative operators, J. Math. Soc. Japan 13, 246-274. Kato, T. A961b), Abstract evolution equations of parabolic type in Banach and Hilbert spaces, Nagoya Math. J. 5, 93-125. Kato, T. A966), Perturbation Theory for Linear Operators (Springer, Berlin). Kato, T. and H. Tanabe A962), On the abstract evolution equation, Osaka Math. J. 14, 107-133. Kikuchi, F. A973), Some considerations of convergence of hybrid stress method, in: У. Yamada and R.H. Gallagher, eds., Theory and Practice in Finite Element Structural Analysis (University of Tokyo Press, Tokyo). Ladyzhenskaya, O.A. A963), The Mathematical Theory of Viscous Incompressible Flow (Gordon and Breach, New York). Le ROUX, M.N. A979), Semidiscretization in time for parabolic problems, Math. Сотр. 33, 919-931. Lions, J.L. and E. Magenes A968), Problemes aux Limites Non Homogines et Applications 1 (Gauthier- Villars, Paris). Luskin, M. and R. Rannacher A982a), On the smoothing property of Galerkin method for parabolic equations, SIAM J. Numer. Anal. 19, 93-113. Luskin, M. and R. Rannacher A982b), On the smoothing property of the Crank-Nicolson scheme, Applicable Anal. 14, 117-135. Mizohata, S. A973), The Theory of Partial Differential Equations (Cambridge University Press, Cambridge). Mori, M. A974), Approximation of exponential function of a matrix by continued fraction expansion, Publ. RIMS Kyoto Univ. 10, 257-269. Mori, M. A976), Stability and convergence of a finite element method for solving the Stefan problem, Publ. RIMS Kyoto Univ. 12, 539-563. Mori, M. A978), Stability of a finite element method for solving the Stefan problem in one space dimension, in: H. Fujita, ed., Functional Analysis and Numerical Analysis (Japan Society for the Promotion of Science, Tokyo), Mori, M. A986), The Finite Element Method and Its Applications (Macmillan, New York). Nakagawa, T. A976), Blowing up of a finite difference solution to u,=uxx + u2, Appl. Math. Optimization 2, 337-350. Nakagawa, T. and T. Ushijima A977), Finite element analysis of the semi-linear heat equation of blow-up type, in: J.J.H. Miller, ed., Topics in Numerical Analysis Ш (Academic Press, London). Nitsche, J.A. A975), L°° -convergence of finite element approximation, in: Proceedings Second Conference on Finite Elements, Rennes. Nitsche, J.A. A976), t)ber L^-Abschatzungen von Projektionen auf Finite Elemente, Bonn. Math. Schr. 89, 13-30. Nitsche, J.A. A979), Lm-convergence of finite element Galerkin approximations for parabolic problems, RAIRQ Numer. Anal. 13, 31-54. Nitsche, J.A. and A.H. Schatz A974), Interior estimates for Ritz-Galerkin methods, Math. Сотр. 28, 937-958. Nitsche, J.A. and M.F. Wheeler A981-82), Lc-boundedness of the finite element Galerkin operation for parabolic problems, Numer. Funct. Anal. Optim. 4, 325-353. Okamoto, H. A982a), On the semi-discrete finite element approximation for the nonstationary Stokes equation, J. Fac. Sci. Univ. Tokyo Sec. IA 29, 241-260. ОкамотЬ, Н. A982b), On the semi-discrete finite element approximation for the nonstationary Navier-Stokes equation, J. Fac. Sci. Univ. Tokyo Sec. IA 29, 613-652. Rannacher, R. A984), Finite element solution of diffusion problems with irregular data, Numer. Math. 43, 309-327.
922 H. Fujita and T. Suzuki Rannacher, R. and R. Scott A982), Some optimal error estimates for piecewise linear finite element approximation, Math. Сотр. 38, 437-445. Raviart, P.A. A978), Multistep methods and parabolic equations, in: Functional Analysis and Numerical Analysis (Japan-France Seminar in Tokyo and Kyoto 1976) (Japan Society for the Promotion of Science, Tokyo). Raviart, P.A. and J.M. Thomas A983), Introduction a t'Analyse Numerique des Equations aux Dirivees Partielles (Masson, Paris). Sammon, P.H. A982), Convergence estimates for semidiscrete parabolic equation approximations, SI AM J. Numer. Anal. 19, 68-92. Sammon, P,H. A983), Fully discrete approximation methods for parabolic problems with non-smooth initial data, SIAM J. Numer. Anal. 20, 437-470. Schatz, A.H. A980), A weak maximum principle and stability of the finite element method in L„ on plane polygonal domains, I, Math. Сотр. 30, 681-697. Schatz, A.H., V. Тномёе and L.B. Wahlbin A980), Maximal norm stability and error estimates in parabolic finite element equations, Comm. Pure Appl. Math. 33, 265-304. Schatz, A.H. and L.B. Wahlbin A979), Maximum norm estimates in the finite element method on plane polygonal domains, Part 2: Refinement, Math. Сотр. 33, 465-492, Schatz, A.H. and L.B. Wahlbin A982), On the quasi-optimality in !„ of the $l-projection into finite element spaces, Math. Сотр. 38, 1-22. Scott, R. A976), Optimal Lm estimates for the finite element method on irregular meshes, Math. Сотр. 30, 681-697. Sobolevskii, P.E. A961a), Parabolic type equations in Banach spaces, Trudy Moscov. Math. 10, 297-350 (in Russian). Sobolevskii, P.E. A961b), On equations of parabolic type in Banach spaces with unbounded time-dependent generators whose fractional powers are of constant domains, Dokl. Acad. Nauk SSSR 13, 59-62 (in Russian). Suzuki, T. A979a), On the rate of convergence of the difference finite element approximation of parabolic equations, Proc. Japan Acad. Ser. A 54, 326-331. Suzuki, T. A979b), On some approximation theorems for evolution equations of parabolic type: An operator theoretical approach to the finite element method, Kokyuroku RIMS Kyoto Univ. 357. 165-188 (in Japanese). Suzuki, T. A982), Full-discrete finite element approximation of evolution equation u, + A(t)u=Q of parabolic type, J. Fac. Sci. Univ. Tokyo, Sec. IA 29, 195-240. Suzuki, T. and H. Рилтл A986), A remark on the L™ bounds of the Ritz operator associated with a finite element approximation, Numer. Math. 49, 529-544. Tabata, M. A979), Lre -analysis of the finite element method, in: H. Fujii, F. Kikuchi, T. Nakagawa and T. USHLTIMA, eds., Numerical Analysis of Evolution Equations (Kinokuniya, Tokyo). Tanabe, H. A960), On the equations of evolution in a Banach space, Osaka Math. J. 12, 363-376. Tanabe, H. A979), Equations of Evolution (Pitman, London). Темам, R. A979), Navier-Stokes Equation (North-Holland, Amsterdam). Тномёе, V. A984), Galerkin Finite Element Methods for Parabolic Problems (Springer, Berlin). Тномёе, V. and L.B. Wahlbin A983), Maximum-norm stability and error estimates in Galerkin methods for parabolic equations in one space variable, Numer. Math. 41, 345-371. Ushijima, T. A975), Approximation theory for semi-groups of linear operators and its application to approximation of wave equations, Japan J. Math. 1, 185-224. Ushijima, T. A979a), Error estimates for the lumped mass approximation of the heat equation, Mem. Numer. Math. 6, 65-82. Ushijima, T. A979b), On the finite element approximation of semi-groups of linear operators, in: H. Fujii, F. Kikuchi, T. Nakagawa and T. Ushijma, eds., Numerical Analysis of Evolution Equations (Kinokuniya, Tokyo). Wanner, G., E. Hairer and S.P. Norsett A978), Order stars and stability theorems, BIT 18,475-489. WEISSLER, F.B. A984), Single point blow up for a semilinear initial value problem, J. Differential Equations 55, 204-224.
References 923 Wheeler, M.F. A973), A priori L2 estimates for Galerkin approximations to parabolic partial differential equations, SIAM J. Numer. Anal. 10, 723-759. Yosida, K. A964), Functional Analysis (Springer, Berlin). ZlAmal, M. A973), Curved elements in the finite element method, I, SIAM J. Numer. Anal. 10,229-240. Zlamal, M. A974), Finite element methods for parabolic equations, Math. Сотр. 28, 393-404. Zlamal, M. A975), Finite element multistep discretizations of parabolic boundary value problems, Math. Сотр. 29, 350-359.
Subject Index ^-acceptability, 887 Dahlquist's —, 887 /lo-acceptability, 888 Abstract Cauchy problem, 809 Adjoint sesquilinear form, 840 Adjoint space, 794 Approximation, backward difference —, 821 Crank-Nicolson —, 830 difference finite element —, 821 forward difference —, 823 fully discrete —, 845, 865 lumped mass finite element —, 912 modified Yosida —, 834 Pade —, 889 rational — 890 Ritz-Galerkin — 798 semidiscrete —, 838, 860 semidiscrete finite element —,815 upwind finite element —, 913 A priori estimate, 856 Backward difference approximation, 821 Backward difference finite element method, 865 Barycenter, 912 Barycentric region, 912 Blow-up, 916 one-point —, 916 Boundary condition, Dirichlet —, 793, 812, 815 homogeneous Dirichlet —, 809 Boundary method, curved —, 799 Boundary value problem, elliptic —, 793 Brezzi condition, 915 Brezzi-Kikuchi condition, 915 Cauchy integral representation, 814 Cauchy problem, abstract —, 809 Coerciveness, 833 strong —, 797 Continued fraction expansion, 890 Continuity, Holder —, 826 Contraction semigroup, 810, 812 Crank-Nicholson approximation, 830 Curved boundary method, 799 Dahlquist's ^-acceptability, 887 Descloux lemma, 801, 882 Difference approximation, backward —, 821 forward —, 823 Difference finite element approximation, 821 Diffusion equation, 912 Dirichlet boundary condition, 793, 812, 815 homogeneous —, 809 Discrete delta function, 882 Discrete elliptic Sobolev inequality, 807 Discrete fundamental solution, 882 Discrete Gronwall inequality, 868 Discrete maximum principle, 803 Discrete Sobolev imbedding, 884 Discrete Sobolev inequality, 804 Drift term, 912 Dual problem, 862 Duality argument, Helfrich's — 864, 894 Dunford integral, 806, 814 Elementary inequality, 848 Elliptic boundary value problem, 809 Elliptic differential operator, 815 second-order —, 831 Elliptic estimate, 799 Elliptic Sobolev inequality, discrete —, 807 Energy estimate, 874, 876 Energy equality, 876 925
926 H. Fujita and T. Suzuki Energy method, 831, 860 error estimate by the —, 876 Error analysis, — of fully discrete approximation, 865 — of semidiscrete approximation, 860 Error estimate, 797, 798 — by the energy method, 876 — by Helfrich's method, 871 Error operator, 846, 882, 886 Evolution equation, 831, 856, 914 second-order —, 869 Evolution operator, semigroup property of —s, 842 Exponential function, 809 Finite difference operator, 898 Finite element approximation, difference —, 821 lumped mass —,912 semidiscrete —,815 upwind —,913 Finite element method, 798 backward difference —, 865 mixed —,915 upwind —,912 Forward difference approximation, 823 Fractional power, 804 Kato theorem on —, 808 Fujie-Tanabe generation theory, 834 Fully discrete approximation, error analysis of —, 865 — for parabolic equations, 821 — of temporally inhomogeneous parabolic equations, 845 Fundamental solution, 882 Generation theorem of Kato, 850 Generation theory, 831 — of Fujie-Tanabe, 834 — of Hille-Yosida, 810 — of Kato-Sobolevskii, 834 — of Kato-Tanabe, 837 — of Tanabe-Sobolevskii, 833 Gronwall inequality, discrete —, 868 Heat equation, 809,911 smoothing property of the —, 809 Heinz inequality, 805 Helfrich's duality argument, 864, 894 Helfrich's method, 831, 871 error estimate by —, 871 Hilbert-Bramble inequality, 799 Hille-Yosida generation theory, 810 Holder continuity, 826 Holomorphic semigroup, 813, 833 — of type (S0, M0), 891 Homogeneous Dirichlet boundary condition, 809 Hyperbolic equation, 869 Inhomogeneous equation, 826 Inhomogeneous parabolic equation, temporally —, 831 Initial value problem, 809 — for parabolic equations, 809 Integral equation, 914 Volterra-type —, 834 Integra! representation, Cauchy —, 814 Dunford —, 806, 814 Interpolant, 799 Interpolation theorem, Riesz-Thorin —, 802 Inverse assumption, 800, 876 Kato-Sobolevskii generation theory, 834 Kato-Tanabe generation theory, 837 Kato theorem, — on fractional power, 808 — on numerical range, 821 L2 estimate, 874, 876 1° error analysis, — for parabolic equations, 881 L™ stability, 882 Lax-Milgram theorem, 796, 798 Leibniz rule, 883 Levi method, 833 Lumped mass finite element approximation, 912 Lumping, 911 Maximum principle, 890 discrete —, 803 Mixed finite element method, 915 Modified Yosida approximation, 834 m-sectorial operator, 796, 797, 812, 832 Multi-step method. 897 Navier-Stokes equation, nonstationary —, 913 stationary —,914
Subject index 927 Negative powers of approximate operators, 806 Neumann condition, 812, 820 Neumann problem, 794 Newmark jS's, 878 Nitsche-Scott inequality, 886 Nitsche's trick, 799, 820 Nonstationary Navier-Stokes equation, 913 Numerical range, 813, 816 Kato theorem on —, 823 One-point blow-up, 916 Operator-valued function, 836 Outer conormal differentiation, 795 Pade approximation, 889 Parabolic equation, fully discrete approximation for —s, 821 initial value problem for —s, 809 L°° error analysis for —s, 881 semilinear —,916 temporally mhomogeneous —, 831 Phase transition, 916 Phragmen-Lindelof theorem, 890 Poincare inequality, 794 Poisson equation, 793 Radius, — of inscribed circle, 798 — of outscribed circle, 798 Rational approximation, 890 Rational function, 888 Regular triangulation, 799 Resolvent, 796 Riesz representation theorem, 794, 796 Riesz-Thorin interpolation theorem, 802 Ritz-Galerkin approximation, 798 Ritz-Galerkin method, 797 Ritz operator, 798, 802, 839, 860 Ritz projection, 871 Robin condition, 812, 815, 820 Second-order evolution equation, 869 Second-order elliptic differential operator, 831 Self-adjoint operator, 804 Semidiscrete approximate equation, 915 Semidiscrete approximation, error analysis of —, 860 — of temporally inhomogeneous parabolic equations, 838 Semidiscrete finite element approximation, 815 Semidiscretization, 911 Semigroup, 809 contraction —, 810, 812 holomorphic —, 813, 833 holomorphic — of type (e0,M0), 891 — property of evolution operators, 842 — theory, 809 Semilinear parabolic equation, 916 Sesquilinear form, 797 adjoint —, 840 Single-step method, 897 Skew-adjoint, 870 Smoothing property, 812 Sobolev imbedding, 914 discrete —, 884 Sobolev inequality, 884 discrete elliptic —, 807 discrete —, 804 Sobolevskii identity, 835 Spectrum, 796, 804, 896 Stability of approximation solution, 797, 798 Stampacchia's method, 804 Stationary Navier-Stokes equation, 914 Stationary Stokes equation, 914 Stefan problem, 916 Stokes equation, stationary —, 914 Stokes operator, 914, 915 Strongly coercive, 797 Tanabe-Sobolevskii generation theory, 833 Temporally inhomogeneous parabolic equation, 831 fully discrete approximation of—s, 845 semidiscrete approximation of —s, 838 Trace, 794 Trace theorem, 796 Trial function, 911 Triangulation, 798 regular —, 799 Uniform ellipticity, 795 Upwind — difference, 912 — finite element approximation, 913 — finite element method, 912
928 H. Fujita and T. Suzuki Ushijama's theorem, 874 Weak form, 856 Variational problem, 793, 797 Yosida approximation, Volterra equation, 914 modified —, 834 Volterra-type integral equation, 834