Text
                    V I' H E AND A V V L I R \) M AT II EMATIC S
A Scries of Monographs and Textbooks
INVERTI ILIT
'NDSINGUL RITY
F R UN ED
LINE'R E 'T RS
i
Robin Harte


Invertibility and Singularity for Bounded Linear Operators
PURE AND APPLIED MATHEMATICS A Program of Monographs, Textbooks, and Lecture Notes EXECUTIVE EDITORS Earl J. Taft Rutgers University New Brunswick, New Jersey Zuhair Nashed University of Delaware Newark, Delaware CHAIRMEN OF THE EDITORIAL BOARD S. Kobayashi University of California, Berkeley Berkeley, California Edwin Hewitt University of Washington Seattle, Washington EDITORIAL BOARD M. S. Baouendi Purdue University Jack K. Hale Brown University Marvin Marcus University of California, Santa Barbara W. S. Massey Yale University Leopoldo Nachbin Centro Brasileiro de Pesquisas Fisicas and University of Rochester Anil Nerode Cornell University Donald Passman University of Wisconsin-Madison Fred S. Roberts Rutgers University Gian-Carlo Rota Massachusetts Institute of Technology David Russell University of Wisconsin-Madison Jane Cronin Scanlon Rutgers University Walter Schempp Universitat Siegen Mark Teply University of Wisconsin-Milwaukee
MONOGRAPHS AND TEXTBOOKS IN PURE AND APPLIED MATHEMATICS 1. K. Yano, Integral Formulas in Riemannian Geometry (1970) (out of print) 2. S. Kobayashi, Hyperbolic Manifolds and Holomorphic Mappings (1970) (out of print) 3. V. S. Vladimirov, Equations of Mathematical Physics (A. Jeffrey, editor; A. Littlewood, translator) (1970) (out of print) 4. B. N. Pshenichnyi, Necessary Conditions for an Extremum (L. Neustadt, translation editor; K. Makowski, translator) (1971) 5. L. Narici, E. Beckenstein, and G. Bachman, Functional Analysis and Valuation Theory (1971) 6. D. S. Passman, Infinite Group Rings (1971) 7. L. Dornhoff Group Representation Theory (in two parts). Part A: Ordinary Representation Theory. Part B: Modular Representation Theory (1971,1972) 8. W. Boothby and G. L. Weiss (edsj, Symmetric Spaces: Short Courses Presented at Washington University (1972) 9. Y. Matsushima, Differentiable Manifolds (E. T. Kobayashi, translator) (1972) 10. L. E. Ward, Jr., Topology: An Outline for a First Course (1972) (out of prin t) 11. A. Babakhanian, Cohomological Methods in Group Theory (1972) 12. R. Gilmer, Multiplicative Ideal Theory (1972) 13. /. Yeh, Stochastic Processes and the Wiener Integral (1973) (out of print) 14. /. Barros-Neto, Introduction to the Theory of Distributions (1973) (out of print) 15. R. Larsen, Functional Analysis: An Introduction (1973) (out of print) 16. K. Yano and S. Ishihara, Tangent and Cotangent Bundles: Differential Geometry (1973) (out of print) 17. C. Procesi, Rings with Polynomial Identities (1973) 18. R. Hermann, Geometry, Physics, and Systems (1973) 19. N. R. Wallach, Harmonic Analysis on Homogeneous Spaces (1973) (out of print) 20. /. Dieudonne, Introduction to the Theory of Formal Groups (1973) 21. /. Vaisman, Cohomology and Differential Forms (1973) 22. B. -Y. Chen, Geometry of Submanifolds (1973) 23. M. Marcus, Finite Dimensional Multilinear Algebra (in two parts) (1973, 1975) 24. R. Larsen, Banach Algebras: An Introduction (1973) 25. R. O. Kujala and A. L. Vitter (edsj, Value Distribution Theory: Part A; Part B: Deficit and Bezout Estimates by Wilhelm Stoll (1973) 26. K. B. Stolarsky, Algebraic Numbers and Diophantine Approximation (1974) 27. A. R. Magid, The Separable Galois Theory of Commutative Rings (1974) 28. B. R. McDonald, Finite Rings with Identity (1974) 29. /. Satake, Linear Algebra (S. Koh, T. A. Akiba, and S. lhara, translators) (1975)
30. /. S. Golan, Localization of Noncommutative Rings (1975) 31. G. Klambauer, Mathematical Analysis (1975) 32. M. K. Agoston, Algebraic Topology: A First Course (1976) 33. K. R. Goodearl, Ring Theory: Nonsingular Rings and Modules (1976) 34. L. E. Mansfield, Linear Algebra with Geometric Applications: Selected Topics (1976) 35. N. J. Pullman, Matrix Theory and Its Applications (1976) 36. B. R. McDonald, Geometric Algebra Over Local Rings (1976) 37. C. W. Groetsch, Generalized Inverses of Linear Operators: Representation and Approximation (1977) 38. /. E. KuczkowskiandJ. L. Gersting, Abstract Algebra: A First Look (1977) 39. C. O. Christenson and W. L. Voxman, Aspects of Topology (1977) 40. M. Nagata, Field Theory (1977) 41. R. L. Long, Algebraic Number Theory (1977) 42. W. F. Pfeffer, Integrals and Measures (1977) 43. R. L. Wheeden and A. Zygmund, Measure and Integral: An Introduction to Real Analysis (1977) 44. /. /-/. Curtiss, Introduction to Functions of a Complex Variable (1978) 45. K. Hrbacek and T. Jech, Introduction to Set Theory (1978) 46. W. S. Massey, Homology and Cohomology Theory (1978) 47. M. Marcus, Introduction to Modern Algebra (1978) 48. E. C. Young, Vector and Tensor Analysis (1978) 49. S. B. Nadler, Jr., Hyperspaces of Sets (1978) 50. S. K. Segal, Topics in Group Rings (1 978) 51. A. C. M. van Rooij, Non-Archimedean Functional Analysis (1978) 54. L. Corwin and R. Szczarba, Calculus in Vector Spaces (1979) 53. C. Sadosky, Interpolation of Operators and Singular Integrals: An Introduction to Harmonic Analysis (1979) 54. /. Cronin, Differential Equations: Introduction and Quantitative Theory (1980) 55. C. W. Groetsch, Elements of Applicable Functional Analysis (1980) 56. /. Vaisman, Foundations of Three-Dimensional Euclidean Geometry (1980) 57. H. I. Freedman, Deterministic Mathematical Models in Population Ecology (1980) 58. S. B. Chae, Lebesgue Integration (1980) 59. C. S. Rees, S. M. Shah, and C. V. Stanojevic, Theory and Applications of Fourier Analysis (1981) 60. L. Nachbin, Introduction to Functional Analysis: Banach Spaces and Differential Calculus (R. M. Aron, translator) (1981) 61. G. Orzech and M. Orzech, Plane Algebraic Curves: An Introduction Via Valuations (1981) 62. R. Johnsonbaugh and W. E. Pfaffenberger, Foundations of Mathematical Analysis (1981) 63. W. L. Voxman and R. H. Goetschel, Advanced Calculus: An Introduction to Modern Analysis (1981) 64. L. J. Corwin and R. H. Szcarba, Multivariate Calculus (1982) 65. V. I. Istratescu, Introduction to Linear Operator Theory (1981) 66. R. D. Jarvinen, Finite and Infinite Dimensional Linear Spaces: A Comparative Study in Algebraic and Analytic Settings (1981)
67. /. K. Beem and P. E. Ehrlich, Global Lorentzian Geometry (1981) 68. D. L. Armacost, The Structure of Locally Compact Abelian Groups (1981) 69. /. W. Brewer and M. K. Smith, eds., Emmy Noether: A Tribute to Her Life and Work (1981) 70. K. H. Kim, Boolean Matrix Theory and Applications (1982) 71. T. W. Wieting, The Mathematical Theory of Chromatic Plane Ornaments (1982) 72. D. B. Gauld, Differential Topology: An Introduction (1982) 73. R. L. Faber, Foundations of Euclidean and Non-Euclidean Geometry (1983) 74. M. Carmeli, Statistical Theory and Random Matrices (1983) 75. /. H. Carruth, J. A. Hildebrant, and R. J. Koch, The Theory of Topological Semigroups (1983) 76. R. L. Faber, Differential Geometry and Relativity Theory: An Introduction (1983) 77. S. Barnett, Polynomials and Linear Control Systems (1983) 78. G. Karpilovsky, Commutative Group Algebras (1983) 79. F. Van Oystaeyen and A. Verschoren, Relative Invariants of Rings: The Commutative Theory (1983) 80. /. Vaisman, A First Course in Differential Geometry (1984) 81. G. W. Swan, Applications of Optimal Control Theory in Biomedicine (1984) 82. T. Petrie and J. D. Randall, Transformation Groups on Manifolds (1984) 83. K. Goebel and S. Reich, Uniform Convexity, Hyperbolic Geometry, and Nonexpansive Mappings (1984) 84. T. Albu and C. Nastasescu, Relative Finiteness in Module Theory (1984) 85. K. Hrbacek and T. Jech, Introduction to Set Theory, Second Edition, Revised and Expanded (1984) 86. F. Van Oystaeyen and A. Verschoren, Relative Invariants of Rings: The Noncommutative Theory (1984) 87. B. R. McDonald, Linear Algebra Over Commutative Rings (1984) 88. M. Namba, Geometry of Projective Algebraic Curves (1984) 89. G. F. Webb, Theory of Nonlinear Age-Dependent Population Dynamics (1985) 90. M. R. Bremner, R. V. Moody, and J. Patera, Tables of Dominant Weight Multiplicities for Representations of Simple Lie Algebras (1985) 91. A. E. Fekete, Real Linear Algebra (1985) 92. S. B. Chae, Holomorphy and Calculus in Normed Spaces (1985) 93. A. J. Jerri, Introduction to Integral Equations with Applications (1985) 94. G. Karpilovsky, Projective Representations of Finite Groups (1985) 95. L. Narici and E. Beckenstein, Topological Vector Spaces (1985) 96. /. Weeks, The Shape of Space: How to Visualize Surfaces and Three- Dimensional Manifolds (1985) 97. P. R. Gribik and K. O. Kortanek, Extremal Methods of Operations Research (1985) 98. J.-A. Chao and W. A. Woyczynski, eds., Probability Theory and Harmonic Analysis (1986) 99. G. D. Crown, M. H. Fenrick, and R. J. Valenza, Abstract Algebra (1986) 100. /. H. Carruth, J. A. Hildebrant, and R. J. Koch, The Theory of Topological Semigroups, Volume 2 (1986)
101. R. S. Doran and V. A. Belfi, Characterizations of C*-Algebras: The Gelfand-Naimark Theorems (1986) 102. M. W. Jeter, Mathematical Programming: An Introduction to Optimization (1986) 103. M. Altman, A Unified Theory of Nonlinear Operator and Evolution Equations with Applications: A New Approach to Nonlinear Partial Differential Equations (1986) 104. A. Verschoren, Relative Invariants of Sheaves (1987) 105. R. A. UsmanU Applied Linear Algebra (1987) 106. P. BlassandJ. Lang, Zariski Surfaces and Differential Equations in Characteristic p > 0 (1987) 107. /. A. Reneke, R. E. Fennell, and R. B. Minton. Structured Hereditary Systems (1987) 108. H. Busemann and B. B. Phadke, Spaces with Distinguished Geodesies (1987) 109. R. Harte, Invertibility and Singularity for Bounded Linear Operators (1988). 110. G. S. Ladde, V. Lakshmikantham, and B. G. Zhang, Oscillation Theory of Differential Equations with Deviating Arguments (1987) 111. L. Dudkin, I. Rabinovich, and I. Vakhutinsky, Iterative Aggregation Theory: Mathematical Methods of Coordinating Detailed and Aggregate Problems in Large Control Systems (1987) 112. T. Okubo, Differential Geometry (1987) 113. D. L. Stand and M. L. Stand, Real Analysis with Point-Set Topology (1987) 114. T. C. Gard, Introduction to Stochastic Differential Equations (1988) 115. S. S. Abhyankar, Enumerative Combinatorics of Young Tableaux (1988) 116. H. Strade and R. Farnsteiner, Modular Lie Algebras and Their Representations (1988) Other Volumes in Preparation
Invertibility and Singularity for Bounded Linear Operators ROBIN HARTE University College Cork, Ireland MARCEL DEKKER, INC. New York and Basel
Library of Congress Cataloging-in-Publication Data Harte, Robin Invertibility and singularity for bounded linear operators. (Monographs and textbooks in pure and applied mathematics ; 109) Bibliography: p. Includes index. 1. Linear operators. 2. Singularities (Mathematics) I. Title. II. Series: Monographs and textbooks in pure and applied mathematics ; v. 109 QA329.2.H38 1987 515.7'246 87-9148 ISBN 0-8247-7754-9 COPYRIGHT © 1988 by MARCEL DEKKER, INC. ALL RIGHTS RESERVED Neither this book nor any part may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, microfilming, and recording, or by any information storage and retrieval system, without permission in writing from the publisher. MARCEL DEKKER, INC. 270 Madison Avenue, New York, New York 10016 Current printing (last digit): 10 987654321 PRINTED IN THE UNITED STATES OF AMERICA
for EMMA with love and squalor
Preface Suppose T : X —> Y is a bounded linear operator between normed spaces: then T is said to be invertible if there is another bounded linear operator S : Y -► X for which ST = I :X —► X and TS = I: Y —► Y (0.1) To be invertible an operator must obviously be nonsingular in various ways: for example, it must be one-one in the sense that x, x' e X and Tx' = Tx => x' = x (0.2) and it must be onto in the sense that y e Y => y = Tx for some xeX . (0.3) It will be familiar that the conditions (0.2) and (0.3) are together sufficient for (0.1) to hold for some (unique) linear 5, which may or may not be bounded: to ensure that S is also bounded we may expect to strengthen one or other of the conditions (0.2) and (0.3). This book is about the various kinds of nonsingularity that arise in this way, and their relationships with one another and with invertibility, together with "relative" analogs in which one operator is compared with another, "essential" analogs which hold "to within finite dimensions," and "algebraic" analogs, aimed at ensuring that for example an inverse S should belong to some specified subspace of operators from Y to X. We have a story to tell which involves perhaps three deep theorems and two major constructions: we attempt to bring the reader in totally elementary fashion from the definition of a normed space to the construction of the Taylor spectrum of a system of commuting operators. v
VI Preface We have tried to write an "introduction to operator theory," accessible to students meeting the definitions for the first time; in all honesty the joke will probably be better appreciated by those who have already read, or written, a more orthodox course of functional analysis. In a sense we are perpetrating two confidence tricks in this account: for as long as possible we are discussing Banach spaces without making the completeness assumption, and spectral theory without defining the spectrum. To explain what we mean, recall that if the normed spaces X and Y are complete then the conditions (0.2) and (0.3) are together sufficient for the condition (0.1) - but only because of the first of our deep theorems, the "open mapping theorem." The proof of the open mapping theorem consists in showing, for complete spaces, that an onto mapping is "open"; the only way of doing this passes through the auxiliary concept of an "almost open" mapping. We are thus in the presence of two variations of the condition (0.3), of which we can be quite unaware on complete spaces: our experiment in this book is to introduce them, and explore their properties, in an incomplete environment. It turns out that we are able to describe most of the consequences of the open mapping theorem before we assume completeness: of course the description involves three different kinds of "ontoness" instead of one. In a similar way the operator T : X —► Y can sometimes represent either a system of operators on the space X or a system of operators on the space Y, and provides us with a vehicle for discussing about half of the "multiparameter spectral theory" of such a system. The other half of course depends on the third of our deep theorems, Liouville's theorem from complex analysis. The second of our deep theorems is the Hahn-Banach theorem, which accompanies the important dual space construction, and almost reduces normed spaces to coordinate geometry. Duality enables us to interchange conditions of the form (0.2) and (0.3): our other major construction is a process of "enlargement," which enables us to use for example the condition (0.2) in its simplest form as a substitute for its more subtle variants, and similarly with conditions of the form (0.3). In detail, Chapter 1 and Chapter 2 introduce normed spaces and bounded operators respectively, including the enlargement process; we note in particular a group of three deceptively simple results in Section 1.5 which we have collectively called "the Riesz lemma." Chapter 3 is perhaps the core of the book: we introduce here two variants of condition (0.2), three variants of condition (0.3), and several kinds of variant of condition (0.1). Both the "regular" operators of Section 3.8 and the "essentially invertible" operators of Section 3.9 offer premonitions of the Fredholm theory to follow; the "algebraically invertible" operators of Section 3.10 are a preview of Banach algebra theory. All this is developed without the help of completeness, which comes with the open mapping theorem in Chapter 4, or of duality, which comes with the Hahn-Banach theorem in Chapter 5. Chapter 6 begins with what amounts to classical linear algebra, the theory of
Preface vn finite dimensional spaces and operators, together with their opposite, "Fred- holm operators." We offer a very simple algebraic framework for Fredholm theory, and also the "essential" analogs of the enlargement process: these enable us to work with conditions (0.2) and (0.3) in place of their own essential analogs. The normed algebras of Chapter 7 deserve a book of their own: we meet them as the logical generalization of operators T : X —► Y for which Y = X, and also of spaces of continuous functions; in a sense spectral theory consists of the attempt to use continuous functions to represent operators. The Hilbert spaces of Chapter 8 may well be the only normed spaces that matter: for us their significance is the way in which invertibility and nonsingularity coalesce for operators on Hilbert spaces. At the same time a Hilbert space is "self-dual," giving operators the chance to interact with their own adjoints; we see this most clearly in the corresponding normed algebras which, in total defiance of standard terminology, we have insisted on calling "Hilbert algebras." Spectral theory proper comes in Chapter 9, with the extension to operators of Liouville's theorem from complex analysis; in a sense our account has now converged to a more standard treatment. In Chapter 10 however we turn to the "relative" nonsingular- ities, and normed space versions of "exactness": this is in preparation for the "multi-parameter" spectral theory of Chapter 11. There can be no doubt that the Koszul complex construction of Section 11.9, discovered by Joseph Taylor, is the right "spectrum" for a system of commuting operators on a Banach space; we offer the conceptually simple "left" and "right" spectrum of Section 11.1 as a sort of halfway stage, but find that we can do enough with them to achieve some of the simpler applications of Taylor's theory, although of course not a functional calculus. We end the book with a pastiche of the functional calculus for commutative algebras, leaving the reader on the brink of Taylor's construction. We have found and corrected hundreds of silly mistakes in this writing: we know that there remain as many uncorrected, and offer them as a series of unspoken exercises for the reader. More substantially, we have certainly not said the last word at several places in the book: for example we invite the reader to see if he can formulate a working concept of "almost regular," fusing Definition 3.7.1 and Definition 3.8.1. It would have been nice to have been able to do enough Fredholm theory in incomplete spaces using only the essential enlargement functor P of Definition 6.7.4 to achieve the analog of Theorem 6.11.3 for almost Fredholm operators, without having to use the auxiliary Px. We would expect to use the argument of Theorem 10.3.9 to prove that the Euler number is a continuous function of a chain, at least if it terminates with zeros; we also believe that Theorem 10.5.6 could be improved to show that almost invertible chains between complete spaces are invertible, and that the "(left,right) bounded below" chains of Definition 10.7.1 form open sets. All this and more is offered to the reader to see if he or she can succeed where we have failed. The reader will also probably
viii Preface find our obsession with incomplete spaces tedious: there would have been a case for working entirely within complete spaces from Chapter 5 onward. Thus as an exercise we invite the reader to see how much more neatly the Fredholm theory of Chapter 6 can be written out if completeness is built in from the start. It is impossible adequately to acknowledge all the help and advice we have had both during and before this writing: but Edward Bach of Trinity College looked hard at an early version of the first half of the book, and led us to a clearer picture of the relationship between enlargement and completeness, while Raul Curto in Iowa prevented us from making many silly mistakes toward the end. Others among those whose unpublished ideas appear in these pages are Ernst Albrecht, Manuel Gonzalez, Graeme Jameson, Martin Mathieu, Gerard Murphy, Miceal O'Searcoid, Roger Smyth, Timothy Starr, Trevor West, Tony Wickstead, Tommy Wilansky, and Wie- slaw Zelazko. I am more grateful than I can say for the bemused tolerance extended to me over the years by the mathematical Faculty in Cork, for a year's hospitality in Iowa, and for the extraordinary patience of the editorial staff at Marcel Dekker, Inc. My wife and daughter, Chris and Emma, have had to live through the destruction of three typewriters, at all hours of the day and night: forgive me. The title of the book is a tribute to the smooth operators of West Cork, who occupy that subtle territory between the invertible and the singular. Robin Harte
Contents PREFACE 1. NORMED LINEAR SPACES 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 1.10 1.11 Norms or length functions Metric and topology Translation invariance Subspaces and quotients The Riesz lemmas Cartesian products Isometry and equivalence Sequence and function spaces Enlargements Normed linear algebras Partially ordered spaces 2. BOUNDED LINEAR OPERATORS 2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 2.10 2.11 Continuity of linear operators The normed space of bounded operators Subspaces and quotients Cartesian products Projections Sequence and function spaces Enlargements Shift operators Composition operators Normed linear algebras Partially ordered spaces 1 3 4 5 7 10 12 14 17 19 21 25 25 26 28 31 33 39 41 43 46 49 51 ix
X Contents INVERTIBILITY AND SINGULARITY 3.1 3.2 3.3 3.4 3.5 3.6 3.7 3.8 3.9 3.10 3.11 3.12 Invertibility and isomorphism Monomorphisms and epimorphisms Boundedness below Openness Boundary mappings Left and right invertibility Almost invertible operators Regular operators Essential invertibility Algebraic invertibility Subspaces, quotients, and products Sequence and function spaces 53 53 56 60 65 70 73 76 80 86 91 98 102 BANACH SPACES AND COMPLETENESS 4.1 Cauchy sequences 4.2 Completeness 4.3 Spaces of functions and operators 4.4 Extension by continuity 4.5 Completions 4.6 The open mapping theorem 4.7 Almost open and onto mappings 4.8 Complemented subspaces 4.9 Uniform boundedness 107 107 110 113 115 123 127 130 132 134 LINEAR FUNCTIONALS AND DUALITY 5.1 5.2 5.3 5.4 5.5 5.6 5.7 5.8 5.9 5.10 5.11 5.12 The dual space and the dual operator Poles and polars The Hahn-Banach theorem Duality theory The separation theorem Composition operators Enlargements Sequence and function spaces The second dual An uncomplemented subspace Extreme points Differential calculus 137 137 139 141 143 145 150 154 156 160 163 165 166 6. FINITE DIMENSIONAL SPACES AND COMPACTNESS 173 6.1 Linear dependence and independence 173 6.2 Finite dimensional spaces 178
Contents 6.3 Operators of finite rank 183 6.4 Fredholm operators 187 6.5 Weyl operators and the index 192 6.6 Compactness and total boundedness 197 6.7 Essential enlargement 202 6.8 Compact operators 206 6.9 Semi-Fredholm operators 208 6.10 Almost Fredholm operators 214 6.11 Completeness 220 6.12 Duality theory 222 6.13 Composition operators 229 7. OPERATOR ALGEBRA AND COMMUTIVITY 237 7.1 Commutants and double commutants 237 7.2 Maximal ideals and the radical 240 7.3 Regularity 245 7.4 Quasinilpotent elements 251 7.5 Polar and quasipolar elements 256 7.6 Homomorphisms and Fredholm theory 261 7.7 Browder operators 265 7.8 Ascent and descent 271 7.9 Semi-Browder operators 276 7.10 Connectedness and homotopy 280 7.11 Generalized exponentials 290 7.12 Continuous functions 296 7.13 • Linear functional and states 304 8. INNER PRODUCTS AND ORTHOGONALITY 309 8.1 Inner products 309 8.2 Orthogonality 313 8.3 The nearest point theorem 317 8.4 Completeness 320 8.5 Duality 323 8.6 Positive operators 325 8.7 Regularity 329 8.8 Hilbert algebra 332 8.9 Enlargements 334 9. LIOUVILLE'S THEOREM AND SPECTRAL THEORY 337 9.1 Liouville's theorem 337 9.2 The spectrum 341 9.3 The spectral boundary 345
xii Contents 9.4 Subalgebras and quotients 348 9.5 The spectral radius 352 9.6 Gelfand's theorem 356 9.7 The functional calculus 362 9.8 Essential spectra 369 9.9 Hilbert algebra 378 9.10 States and representations 383 10. COMPARISON OF OPERATORS AND EXACTNESS 391 10.1 Majorization and factorization 391 10.2 Mixed interpolation 394 10.3 Exactness 401 10.4 Composition operators and duality 414 10.5 Enlargement and completion 419 10.6 Essential exactness 425 10.7 Algebraic exactness 435 10.8 Hilbert spaces 441 10.9 Skew exactness 448 11. MULTIPARAMETER SPECTRAL THEORY 457 11.1 Left and right spectra 457 11.2 Polynomials 461 11.3 The spectral mapping theorem 466 11.4 Many variables 470 11.5 The Silov boundary 474 11.6 Composition operators 481 11.7 Tensor products 493 11.8 Quasicommuting systems 509 11.9 The Taylor spectrum 517 11.10 Algebraic and essential spectra 526 11.11 Functional calculus 532 NOTES, COMMENTS, AND EXERCISES 541 REFERENCES 559 NOTATION 581 INDEX 585
Normed Linear Spaces A normed linear space is both a linear space and a metric space. Because the distance function for the metric structure is introduced through the medium of vector "length" there is a certain compatibility between the two structures. 1.1 NORMS OR LENGTH FUNCTIONS Let K be either the real field R or the complex field C—much of what we have to say applies equally in both cases—and recall that a linear space or vector space X over K is a system consisting of a set X together with a zero element 0 G X and mappings of addition x, y —► x + y from X x X to X, subtraction x —► — x from X to X and scalar multiplication t,x —► tx from K x X to X, subject to the usual axioms: for each x,y,z G X and each s,te K, and (x + y) +z = x+ (y + z) y + x = x + y x + 0 = x x+(-x)=0 (st)x = s(tx) (s + t)x = (sx) + (tx) s(x + y) = [sx) + (sy) lx = x (1.1.0.1) (1.1.0.2) It is then familiar that if x + y = x we must have y = 0, and that if x + y = 0 we must have y = — x\ also, *0 = 0 = Ox (-i)x = -(tx) = t(-x) (-t)(-x)=tx (1.1.0.3) As a matter of notation we should make an early apology for certain per- l
2 1. Normed Linear Spaces sistent bad habits: for example, the same letter X is being used for the set of vectors X as well as for the whole system consisting of the set and its additional structure, while the same letter 0 is being used for the zero elements of different vector spaces X and Y as well as for the number 0 in the field K. We will (without apology) write x — y = x + (—y), and sx + ty = (sx) + (ty) for a linear combination of x and y. If H and K are subsets of X and C/CKwe shall write (1.1.0.4) sH + tK = {sx + ty: xe H,y e K} K + x = K + {x} = {y + x:y e K} and UK = {sx: seU,xeK} Ux = {sx: seU} (1.1.0.5) The reader is invited to amuse himself by discovering how much of (1.1.0.1) and (1.1.0.2) remain valid with subsets H and K in place of vectors x and y; they should also distinguish clearly between H - K = {x - y: x e H,y e K} and (1.1.0.6) H\K = {xeH:x£K} A norm on a vector space is essentially a definition of length: 1.1.1 DEFINITION A norm on the linear space X is a mapping x —► ||x|| : X —► [0, oo] such that, for each x, y G X and each iGK, ||x||<oo (1.1.1.1) ||x|| =0=^x = 0 (1.1.1.2) 11**11 = 1*111*11 (1-1-1-3) ll* + y||<IWI + l|y|| (1.1.1.4) Here [0,oo] is obtained by adjoining an element oo to the positive half-line [0, oo[ C R and making the obvious extensions of addition, multiplication, and inequality: in particular we define Ooo = 0 = ooO. If in Definition 1.1.1 we relax condition (1.1.1.2) we get what we call a semi- norm; if instead we relax (1.1.1.1) we get a quasinorm; if we relax both (1.1.1.1) and (1.1.1.2) we get what we shall call a quasi-seminorm. The remaining conditions (1.1.1.3) and (1.1.1.4) are together equivalent to ll« + *y||< WN + I*ll|y|| (1.1.1.5) for each x, y and each s,t. The condition (1.1.1.4) is perhaps the most
1.2 Metric and Topology 3 important component of Definition 1.1.1: we shall sometimes refer to it as the triangle inequality. 1.2 METRIC AND TOPOLOGY If X is a normed linear space, in the sense of a linear space X with a norm ||-|| (we are falling into more bad habits of speech), then the distance between two points is defined as the norm or length of their vector difference: for each x, y G X, dist(x,y) = ||y - x|| (1.2.0.1) The usual metric space conditions are satisfied: if x, y, z G X then 0<dist(x,y) <oo (1.2.0.2) dist(x, y) = 0 <=> y = x (1.2.0.3) dist(y, x) = dist(x, y) (1.2.0.4) dist(x, z) < dist(x, y) + dist(y, z) (1.2.0.5) For example, we verify the "triangle inequality" (1.2.0.5) by observing dist(x,*) = ||* - x|| = ||(y - x) + (z - y)\\ < \\y - x|| + ||* - y|| using (1.1.0.4). For the record, the topology associated with the metric dist(-,-) is described by declaring that a set U C X is a neighborhood of a point x G X, written U G Nbd(x) (1.2.0.6) if and only if there is 6 > 0 for which Disc(x ; 6) = {y G X: dist(x, y) < 6} C U (1.2.0.7) The interior and the closure of a set K C X are the sets int(if) of points x G X for which K G Nbd(x), and c\(K) of points x G X for which U G Nbd(x) <= KC\U ^ 0; then a set K C X is called open iff K = int{K), and is called closed iff if = cl(if). It is easily checked that the family of open sets satisfies the usual conditions for a "topology": the empty set and the whole space, and all finite intersections and all unions of open sets, are open. The disc Disc(x ; S) of (1.2.0.7) is an example of a closed set, called the closed disc of center x and radius 6] we can also check that int(Disc(x ; 6)) = {y G X: dist(x, y) < 6} (1.2.0.8) As a matter of choice, we shall work with closed discs rather than open discs.
4 1. Normed Linear Spaces As in any topological space, a sequence (xn) in X is said to converge to an element y G X iff for every neighborhood U G Nbd(x) there is a natural number N G N for which n > N =>• xn G U. In any metric space the topology can be completely described by convergent sequences: for example, if K C X then its closure c\(K) is the set of those points t/Gl for which (xn) converges to y for some sequence (xn) in X with all its terms xneK. If x G X and if 0 ^ X C X we shall write dist(x,if) = inf {||y - x\\:y G X} (1.2.0.9) Here inf denotes the usual "greatest lower bound" of a set of real numbers. From this, it is clear c\{K) = {xeX: dist(x, K) = 0} (1.2.0.10) If X is a normed linear space and if H is a topological space then a mapping x : H —► X is said to be continuous at a point £ G H iff for each neighborhood U G Nbdxx(£) there is a neighborhood V G Nbdn(£) for which x(V) C £/. For example, the norm ||-|| is a continuous mapping from the topological space X into the scalar field K: this follows from the inequality |||y||-||x|||<||y-x|| (1.2.0.11) which is just the triangle inequality (1.1.1.4) turned inside out. The reader is invited to verify that, more generally, the mappings x —► dist(x,iC) : X —► K are continuous for each nonempty K C X. 1.3 TRANSLATION INVARIANCE The most immediate impact of the compatibility between metric and linear structure is "translation invariance": for each x,y,z G X, dist(x + z, y + z) = dist(x, y) (1.3.0.1) "Geometrically" this means that all the mappings x —► x + z move sets of points around X in a rigid manner. The neighborhoods of each point x G X can be described in terms of the neighborhoods of the origin: Nbd(x) = x + Nbd(0) = {x + U: U G Nbd(0)} (1.3.0.2) The process of taking either the interior or the closure is translation invariant: mt{K + x) = mt(K) + x and c\{K + x) = c\{K) + x (1.3.0.3)
1.4 Sub spaces and Quotients 5 It follows at once that all the translates of an open set are open, and of a closed set are closed. Continuity and convergence are also translation invariant: the sequence (xn) converges to the element y iff the sequence (xn — y) converges to 0, while the mapping x : ft —► X is continuous at the point t G ft iff the mapping y = x — x{i) : ft —► X is continuous there. A more comprehensive summary of the compatibility between linear and topological structures is that the defining maps for the linear structure are continuous: 1.3.1 THEOREM If X is a normed space then the mappings x —> x + y and x —> tx and t —► tx are continuous at each point of X or of K. Further, the mappings x,y —► x + y and t,x —► tx are jointly continuous at each point of X x X or K x X. Proof: These are simple exercises using the properties of the norm. If x G X and U G Nbd(x + y) then U = V + y for some V G Nbd(x): thus the mapping x —► x + y is continuous at x G X. If instead Disc(£x ; e) C £/ G Nbd(*x) we may take V = Disc(x;£) with \t\6 < e to find V G Nbd(x) with tV C £/: thus the mapping x —► £x is continuous at x G X. Alternatively, we may take V = {s G K: \s -1\ < 6} with ||x||£ < e to find V G Nbdx(*) with Vx C U: thus the mapping t —► £x is continuous at £ G K. For the joint continuity we are working with the cartesian product topology of Y x X, with Y = X or Y = K: here W G Nbd(y,x) means that V x U C IV for some V G Nbd(y) and U G Nbd(x). Thus if Disc(x+y;e:) C W G Nbd(x+y) we may take C/ = Disc(x ; 6) and V = Disc(y ; 6) with 2£ < e to find £/ G Nbd(x), V G Nbd(y), and U + V C W, which shows that addition is continuous at (x,y) G XxX. If instead Disc(£x;e) C W G Nbd(£x), we may take U = {s G K: \s - t\ < 6} and V + Disc(x ; 6) with (|*| + ||x|| + 6)6 < e to get U G Nbd|<(0> V = Nbd(x), and UV C W, which shows that scalar multiplication is continuous at (t, x) G K x X. Probably the simplest choice of 6 in this last calculation is 6 = Min(l,e/(l + \t\ + ||x||)). ■ 1.4 SUBSPACES AND QUOTIENTS If Y is a linear subspace of a normed space X, in the sense that Y ^ 0 and 57 + f7 C 7 for each «,i G K, then it is not hard to use Theorem 1.3.1 to show that the closure of Y is also a subspace. The interior of Y is almost always empty: 1.4.1 THEOREM If Y C X is a substance of the normed space X then cl(y) is also a subspace, and there is implication Y ^ X =^ int(y) = 0 (1.4.1.1)
6 1. Normed Linear Spaces Proof: For the first part we must show that if x,y G cl(Y) and s,t G K, then for each e > 0 there is z €Y for which \\sx + ty — z\\ < e: and we can see this by taking z = sx1 + ty1 with x',t/' G y satisfying ||x' — x|| < 6 and \W -y\\<6 with <5 > 0 and (|s| + \t\)S < e. For (1.4.1.1) we show that int(y) ^ 0 =^ o e int(y) (1.4.1.2) and o g int(y) =^ y = x (1.4.1.3) Indeed if x G int(y) then also —x G int(y), which means that there is 6 > 0 for which Disc(x ; S) C y and Disc(-x ;^)C7 Thus, if y G Disc(0 ; ^) is arbitrary then y = |(x + y) + |(—x + y) G y + y C y, which means that 0 G int(y). Further, if this holds, then 0 ^ x G X implies x1 + (£/||x||)x G Y, since ||x'|| = 6, and hence also x=(||x||/^'er. ■ Approximately two thirds of Theorem 1.4.1. extends to the absolutely convex subsets of X, in the sense of those Y C X for which \s\ + \t\ < i =^ sY + *y c y (1.4.1.4) The argument above shows that if y is absolutely convex then so are cl(y) and int(y). In particular (1.4.1.2) holds if Y C X is absolutely convex. If y is a subspace of X, then the norm on X is still a norm when restricted to the subset Y C X: thus Y becomes a normed space in its own right. The quotient space X/Y = {x + Y:xeX} is another linear space, with linear combination defined by setting s{x + y) + t{y + Y) = {sx + ty) + Y for each x,y G X and 5,<6K. If we now define ||x + y|| =dist(x,y) = inf{||x-y||:t/Gy} (1.4.1.5) then we obtain a well-defined mapping ||-|| : X/Y —► [0,oo]. We claim that this is a seminorm: 1.4.2 THEOREM If Y is a subspace of the normed space X then the mapping x + y —► dist(x,y) is a seminorm on X/Y, and is a norm if and only if y = cl(y).
1.5 The Riesz Lemmas 7 Proof: The condition (1.1.1.1) is trivial, and so is (1.1.1.3) for t = 0. If* ^ 0 then dist(*x,y) = infzey \\tx - z\\ = |*|infzey \\x- z/t\\ = |*| dist(x,y), giving (1.1.1.3). If x,y G X then dist(x + y,Y) = infz€y ||x + y — z\\ = infz/jZ„€y ||x—z'+y—z"\\ < infz/ey ||x—^/||+infz„€y \\y—z"\\ = dist(x,y) + dist(t/,y), giving (1.1.1.4). Finally (1.2.0.9) says ||x + y || = o <=> x e ci(y) (1.4.2.1) so that (1.1.1.2) holds if, and only if, Y = c\{Y). m Evidently there is a strong case for replacing a subspace Y C X by its closure cl(y) before forming the quotient X/Y. If ||«|| : X —► [0,oo] is a quasi-seminorm on a linear space X then the subsets y = {x e X: \\x\\ < 00} and Z = {x G X: x = 0} are linear subspaces of X, and the mapping y+Z —* dist(y, Z) = mfzeZ \\y— z\\ will be a norm on the quotient Y/Z in the full sense of Definition 1.1.1. Conversely, if Z C Y C X are subspaces, then every norm on the quotient Y/Z can be lifted to a seminorm on the subspace Y, and then extended to a quasi-seminorm on X, taking ||x|| = ||x + Z\\ if x G Y and ||x|| = oo if x G X \ Y. Later we shall see that many specific normed spaces can be obtained by starting with a quasi-seminorm on some more primitive space. 1.5 THE RIESZ LEMMAS It is time we did some work. We have grouped together three results which superficially resemble one another: the second of these is usually called the "Riesz lemma." 1.5.1 THEOREM If Y C X is a subspace of a normed space X and x0 G X then there is a sequence (yn) in Y for which \\yn — x0|| —► dist(x0,y) as n —► oo and (1.5.1.1) hnW < \\xo\\ + dist(x0,y) < 2||x0|| Proof: By definition of dist(x0,y) there is (yfn) in Y for which ||*o ~ y'n\\ ^ dist(x0,y) + J!^ for each neN (1.5.1.2) n
8 1. Normed Linear Spaces which implies \\y'n\\ < ll*oll + Wv'n ~ xo\\ < U + M ll*oll + dist(x0,y) (1.5.1.3) If we now take yn = ——- y'n for each neN (1.5.1.4) n + 1 then Ill/nil < ll*oll + "XT dist(x0,y) < ||x0|| + dist(x0,y) (1.5.1.5) and Iko-ynll ^ H*0-2/nll + ll2/n-2/nll 1 ii / n (1.5.1.6) -||x°"yJI + ^TT)l|yJI_"° asn-+°° ■ If the subspace Y is closed and if e > 0 is arbitrary, then for each i0Glwe can find yeG7 for which ll*0-y«ll<(i + e)dist(x0,y) and (1.5.1.7) ||yc||<||x0|| + dist(x0,y)<2||x0|| Whether or not Y is closed, (1.5.1.7) holds with Ve = *o (1.5.1.8) if x0 G y, while if x0 £ cl(y) so that dist(x0,y) > 0, we can replace (1.5.1.2) by Iko-I/nll < dist(x0,y) + -dist(x0,y) (1.5.1.9) n and then put 3+ ||x0||/dist(x0,y) ye=yN with " oll/ V 0? } <e (1.5.1.10) where yN is derived from yfN by the formula (1.5.1.4). 1.5.2 THEOREM If Y C X is a subspace of a normed space X and if x0 G X then for each real number t G ]0, l[ there is xt G X for which cl(y) + Kx0 = cl(y) + Kxt and dist(xt,y) > t||xt|| (1.5.2.1)
1.5 The Riesz Lemmas 9 If x0 £ cl(y) we can also arrange ||xt|| = 1. Proof: If x0 G cl(y) then (1.5.2.1) holds with xt = 0. By definition of dist(-,y) there is yt G Y with Now take llxo-2/tll < -dist(x0,y) xt = xo~ Vt (1.5.2.2) (1.5.2.3) It is clear that the first part of (1.5.2.1) is satisfied, while for arbitrary y G Y we have llz*-y|| = llzo- {y + vM > <list(x0,y) >t\\x0-yt\\ = *||xt|| This gives (1.5.2.1), whether or not x0 G cl(y). If x0 £ cl(y) then certainly x0 — yt ^ 0: thus if we want ||xt|| = 1 we may take instead (*o ~ Vt) \\xo-yt\\ (1.5.2.4) 1.5.3 THEOREM If Y C X is a subspace of a normed space X and if x0 G X is not in cl(y) then there is M > 0 such that ||2/ + sx0\\ < \\y\\ + |«s|||x0|| < M\\y + sx0| for each t/67 and each 6 G K. Proof: If x0 G X is not in cl(y) write (1.5.3.1) and note that ||y + 6X0|| = \s\ giving *(*o) xo~y dist(x0,y) > |6|dist(x0,y) if 6 ^0 (1.5.3.2) k|||x0|| < S(x0)\\y + sx0\\ for each y G Y,s G K (1.5.3.3) But now also ||y|| < \\y + «x0ll + \\sxo\\ < (i + *(*o))lly + sxo\\ (1.5.3.4) giving (1.5.3.1) with M = 1 + 26(x0). m
10 1. Normed Linear Spaces Theorem 1.5.3 is rather a crude result: in (1.5.3.1) we must have M > 3 since S(x0) > 1. Ewe take advantage of Theorem 1.5.2 to replace x0 with an "equivalent" xt then we will ensure that 6(x0) < l/t and hence Mt < 1+2/1. We shall see later that there are circumstances in which (1.5.2.1) can be obtained with t = 1: this happens if the subspace Y is finite dimensional, or, alternatively, if the norm on X has a certain "quadratic" property. It seems to be an interesting problem to try and obtain (1.5.1.1) with ||yn|| < ||x0|| instead of ||yn|| < 2||x0||. We do get this in one particular case: x0 G cl(y) ^ ||yn - x0 II—+0 and WVnW < \\xo\\ +dist(x0,y) = ||x0 In particular, if the closure of Y is X then also the closure of Discy(0 ; 1) in X is Discx(0; 1). 1.6 CARTESIAN PRODUCTS If Y and Z are normed linear spaces then the cartesian product Y x Z = {{y>z): V € Y,z € Z} is a linear space, where linear combination is defined by setting *(</, z) + t{yf, z') = [sy + ty\ sz + tz') for each y,yf G Y, z,zf G Z and s,t G K. There are several obvious ways to define a norm on Y x Z: for example put, for each yG7 and z G Z, ll(y.*)lli = ll»ll + H (1-6.0.1) ll(y.*)IL=n««(ll»ll. 11*11) (1-6-0.2) ll(y,z)ll2 = (lly|l2 + NI2)1/2 (1-6.0.3) We claim that each of these is indeed a norm: 1.6.1 THEOREM If Y and Z are normed spaces then the mappings ||-|| are norms for each p = l,2,oo, and give the same cartesian product topology for Y x Z. Proof: For the first part there are twelve assertions to be checked, only one of which is not immediately clear. This is the condition (1.1.1.4) for (1.5.3.5)
1.6 Cartesian Products 11 the mapping ||«||2: we argue lis/ + y'll2 + h + z'f < (||y|| + ||y'||)2 + (||«|| + ||*'||)2 = NI2 + l|y'll2 + W2 + lk'll2 + 2(||y||||y'|| + H||*'||) <l|y||2 + l|y'll2 + IN2 + l|z'll2 + 2(||y||2 + INI2)1/2(||yT + ll*T)1/2 where the last inequality follows from the equality (llyll2 + IN2)(lly'll2 + Ik'll2) - (llylllly'll + Nllk'll)2 (1611) = (l|y||lk'll-WI|y'll)2>o For the second part we note that the norm IHIoq certainly induces the usual cartesian product topology, as described in the proof of Theorem 1.3.1: if 6 > 0 is arbitrary then DisCoo((t/,2) ;«) = Disc(y ; «) x Disc(* ; 6) (1.6.1.2) We claim that the norms ||-1| x and ||-||2 induce the same topology as the norm H"!^: for if y G Y and z G Z are arbitrary then ||(y,*)lloo < ll(y.*)lla < ll(y.*)lli < 2||(y,*)IL (i-6.i.3) and hence if 6 > 0 then Discoo((0,0) ;S) C Disc^OjO) ; 2d) C Disc2((0,0) ; 26) C Discoo((0,0) ; 26) (1.6.1.4) This means that Nbdp(y,^) is the same for each p = l,2,oo, provided (y,z) = (0,0), and hence by (1.3.0.2) for arbitrary {y,z). m If Y and Z are normed spaces then, unless otherwise stated, we shall assume that Y x Z is normed in such a way as to have the cartesian product topology. Occasionally we shall be more specific: if p G {l,2,oo} we shall write Y 0p Z to denote the space Y x Z with the norm ||-||p from the appropriate (l.6.0.q). More generally, if X is a normed linear space, a pair of supplemented subspaces of X consists of closed subspaces Y C X and Z C X which satisfy Y + Z = X and Y n Z = {0} (1.6.1.5) For example if X = Xx x X2 is a cartesian product of normed spaces Xx and X2 then the subspaces Y = Xx x {0} and Z = {0} x X2 satisfy (1.6.1.5).
12 1. Normed Linear Spaces Conversely, if subspaces Y and Z satisfy (1.6.1.5) then there is a one-one correspondence j/ + ^M:7xZ^I (1.6.1.6) in the sense of a mapping which is one-one and onto, and "linear" in the sense that it respects linear combination on either side. Whether or not it also respects topology turns out to be rather a delicate matter, to which we shall return later: here we shall make a further definition, and say that the subspaces Y C. X and Z C. X are complemented subspaces iff they are supplemented in the sense of (1.6.1.5), while the topology of X is the same as that induced on it by the mapping (1.6.1.6) and the cartesian topology of the product Y x Z. We shall sometimes write X=Y@Z (1.6.1.7) to indicate that this is so. If Y C X is a closed subspace then we shall say that it is supplemented in X if there is a closed subspace Z C X for which (1.6.1.5) holds, and that it is complemented in X if there is a closed subspace Z C X for which (1.6.1.7) holds. Just occasionally we shall be prepared to speak of a nonclosed subspace Y C X as supplemented if there is a closed subspace Z C X satisfying (1.6.1.5). 1.7 ISOMETRY AND EQUIVALENCE Normed spaces X and Y are called isomorphic, written X~Y (1.7.0.1) iff there is a one-one correspondence x <—► y : X <—► Y (1.7.0.2) between them which preserves linear algebra and topology: x <—> y and xf <—> yf =>- sx + tyf <—> sy + ty and (1.7.0.3) Nbdx(x) <—>Nbdy(y) X and Y will be called isometric, written X^Y (1.7.0.4) if in addition length is preserved: ' — y => \\x\\x = llvlly (1-7-0.5)
1.7 Isometry and Equivalence 13 Thus if, for example, Y and X are "supplemented" subspaces of X in the sense (1.6.1.5) then X and Y x Z are isomorphic, or if Y and Z are normed spaces then all three products Y ©p Z are isomorphic. If normed spaces X and Y are isomorphic, or in particular isometric, then there is a point of view that they are "the same space." We will be conscious however of a certain schizophrenia: sometimes isomorphism and sometimes isometry will be relevant. Indeed, certain key theorems in the sequel come in two strengths—a basic "topological" version, and a more detailed "metric" version. Actually the "topological" version will usually be the important result, although sometimes the only way of reaching the topological version will be through the "metric" version. The difference between isomorphism and isometry appears in an acute form when we compare the topologies of two different norms on the same vector space, and in particular ask under what conditions they will be the same. If ||-|| and ||-||; are norms on a vector space X we shall describe ||-||; as weaker, or coarser, than ||-||, if it generates a "weaker" or "coarser" topology, in the sense that Nbd'(x) C Nbd(x) for each x e X (1.7.0.6) Synonymously we shall describe ||-|| as stronger, or finer, than ||-||. Equivalent conditions are that all the ||-||'-open sets are ||-||-open, or that the || ^-closure of an arbitrary subset is contained in the ||«||'-closure. In terms of sequences, (1.7.0.6) is equivalent to the implication, for arbitrary (xn) in X and arbitrary t/Gl, that \\xn ~ y\\ —* o asn —* °° => \\xn ~ y\\f —* o2LS n —* °° (1.7.0.7) It turns out that there is a nice condition expressing (1.7.0.6) in terms of the norms ||«|| and ||«||;: 1.7.1 THEOREM If ||-|| and ||-||' are the norms on a linear space X then necessary and sufficient for ||«||; to be weaker than ||«|| is that there is k > 0 for which ||x||' < Jfc||x|| for each x G X (1.7.1.1) Necessary and sufficient for ||«|| and ||«||; to generate the same topology on X is that there are k > 0 and k' > 0 for which fc'||x|| < ||x||' < Jfc||x|| for each xeX (1.7.1.2) Proof: The condition (1.7.1.1.) is obviously sufficient for (1.7.0.6), since if x e X and U e Nbd'(x) there is 6 > 0 for which Disc'(x ; 6) C U, from
14 1. Normed Linear Spaces which (1.7.1.1) gives Disc(x ; S/k) C Disc'(x ; S) C U (1.7.1.3) putting U G Nbd(x). Conversely, if we assume (1.7.0.6) then Disc'(0 ; 1) G Nbd(0) (1.7.1.4) so that there is 6 > 0 for which Disc(0 ; 6) C Disc'(0 ; 1). Now if 0 ^ x G X then £x/||x|| G Disc(0 ;£), so that ||(^x/||a:||)||' < 1, giving W-H £x SH Thus (1.7.1.1) holds with A; = l/S. Interchanging the roles of ||-|| and ||-||; then gives (1.7.1.2). ■ The reader will recall that we used the easy half of this argument for the second part of Theorem 1.6.1, to show that three norms on the product Y x Z give the same topology. He will also see how Theorem 1.5.3 imposes the condition (1.7.1.2) for two norms on a subspace Y + Kx0 C X provided *0*ci(y). 1.8 SEQUENCE AND FUNCTION SPACES The most trivial examples of a normed space are the scalar field K, normed with the absolute value |-|, and the zero space 0 = {0} with only one element. Each of the norms ||-||p of (1.6.0.1)-(1.6.0.3) makes K2 = K x K into a normed space: for example the absolute value |«| on C is just the norm ||«||2 on R2. By induction we obtain norms ||«|| on Kn for each n G N. More generally, suppose that H is a nonempty set and that X is a normed linear space: then the set XQ = Map(H,X) of all mappings from H to X is another linear space, where we define (rx + sy)(t) = rx{i) + sy{i) for each t G H (1.8.0.1) for each x,y G Xn and each r,6 G K. For each p = l,2,oo we define a quasinorm ||«|| on Xn: if x G Xn we define ||x||00 = sup{||x(t)||:tGn} (1.8.0.2) where this is the usual "least upper bound" of the set of real numbers {||x(£)||:£ G H} provided that set is bounded, and is oo otherwise. Ep=l
1.8 Sequence and Function Spaces 15 or p = 2 put Nip = £ ll*(')llp)1/p = sup{( £ ll^Wir) ": finite«' £ n} ten ^ ^ ten' ' > (1.8.0.3) The conditions (l.l.l.2)-(l.1.1.4) are almost all easily verified: if p = 2 we must follow the argument of Theorem 1.6.1 for the condition (1.1.1.4). If we now put /p(H, X) = {xe Xn: \\x\\p < oo} (1.8.0.4) then we get a normed space. When X = K we shall write /p(H, K) = /p(H): we might notice the inclusion {<£©x:<t>e /p(n),x e x} c /p(n,x) (1.8.0.5) where for each <j> G K and each iGlwe define 0 © x G Xn by setting [<t> © x)(*) = <£(*)s for each t G H (1.8.0.6) 1.8.1 THEOREM If H is a nonempty set and X is a normed linear space then Nloo ^ Ilxll2 < llxlll for each ^ ^ X° (1.8.1.1) and /i(n,x) c /2(n,x) c un,*) (1.8.1.2) If and only if H is finite there is equality throughout (1.8.1.2) (if X ^ 0). Proof: The inequalities (1.8.1.1) were essentially observed in Theorem 1.6.1, and (1.8.1.2) follows at once. If H has cardinal number #H = m then also ||z||i < m||z||oo for each x e XQ (1.8.1.3) giving equality throughout (1.8.1.2). If H is infinite then it contains a copy of N: thus it is enough to see that neither equality holds if fi = N and X ^ 0. In fact if 0 ^ x G X then the constant function x = 1 © x is in /^(NjX) and not in /2(N,X), while if y{n) = n~1x for each n G N then y is in /2(N,X) and not in /1(N,X). ■ Theorem 1.8.1 extends to the situation in which {Xt)teQ is a family of normed spaces associated with the points of H, and Xn is replaced by the cartesian product Ilten ^t: we ^° n°t 6° m*° details. If H is not just a set but a topological space then we shall write C(Cl,X) for the set of those
16 1. Normed Linear Spaces mappings x G Xn which are continuous at each point iGf], and c^x) = c/ootn.x) = c(n,x) n/oo(n,x) (i.s.i.4) for the bounded continuous functions on H. It is rather easy to see that C(H,X) and hence also C^Q, X) is a linear subspace of Xn: 1.8.2 THEOREM If H is a topological space and X is a normed space then C^tfl, X) is a closed subspace of /^(f^X). Proof: For each iGfi the set {x G ^(HjX^x is continuous at t} (1.8.2.1) is (by translation invariance as in Section 1.3) a linear subspace of/^(n, X): we claim that it is also closed in the IHIoq topology. To see this suppose y G /oo(n,X) and ||y — xn||00 ->0asn-^oo, with xn in the set (1.8.2.1) for each n€N: then for arbitrary e > 0 there are N G N and U G Nbdn(£) for which n > N ==> IK - ylloo ^ Ie a114 5 e U => llXtf(5) - xtfWII < |* It follows «€P=>||yM-y(t)|| < IlyM -**MII + IKrM -**WII + ll**W -y(OII < * The claim is substantiated, and now C^H, X) is the intersection of all the closed subspaces (1.8.2.1). ■ Theorem 1.8.2 records a familiar fact: the uniform limit of continuous functions is continuous. If H is again a nonempty set and X is a linear space then we can look at the "finitely additive set functions" from H to X, meaning the set M(H,X) of those mappings \i : P{ti) —► X defined on the set P(H) of all subsets of H, for which m(0) = o and (1.8.2.2) H{K UH) + n{K HH)= n{K) + n{H) for each K,HC{1
1.9 Enlargements 17 If X is a normed space then we obtain a quasinorm on M(H, X) by setting II^Hj = / d\fi\ = sup< Y^ ||/x(iir)||:/C is a finite partition of H > (1.8.2.3) By a finite partition we mean a finite, pairwise disjoint, set of subsets whose union is H. If we now put Af^n.x) = {neM{n,X):\\n\\1 <oo} (1.8.2.4) then we get another kind of normed space. A refinement of the idea is to restrict \i to certain "measurable subsets" of H. 1.9 ENLARGEMENTS If H is a nonempty set, then a bounded structure, or homology, on H will be a set B of its subsets for which {t} e BQ for each t G H (1.9.0.1) and K e B whenever K C K'U K" and K', K" e B (1.9.0.2) We shall say that the homology B is nontrivial provided H £ B (1.9.0.3) For example, the set of all finite subsets of H is a homology. If Q has a bounded structure and X is a normed linear space then the finite or terminating functions from H to X form the subspace of l^Q^X), c00(n,x) = {xe i^in^xy.it e n:*(t) ^ 0} e BQ} (1.9.0.4) The null functions are given by c0(H,X) = jxe/TO(n,X):i*en:||x(*)|| > - [ e Bn for each n G l\li (1.9.0.5) and the convergent functions are given by c^n^X) ={xe l^n^Xy.x- y e c0(H,X) for some yeX} (1.9.0.6)
18 1. Normed Linear Spaces 1.9.1 THEOREM If H has a bounded structure and X is a normed space then, in the topology of the norm H-H^, c0(n,x) = cic00(n,x) (1.9.1.1) For arbitrary x G l^Q^X), dist(x,c0(H,X)) = inf sup||x(*)|| (1.9.1.2) KeBci t£K If BQ is nontrivial and x = 1 © y is constant then dist(x,c0(fi,X)) = ||y|| (1.9.1.3) Proof: To see that c0(H,X) is closed we follow the srgument of Theorem 1.8.2. Thus if y G l^iX) and \\y - xj^ -+ 0 with xn G c0(H,X) then for each e > 0 there is N G N and if G Sq for which n > N => \\y - xJn < §e and t£K^ \\xN{t)\\ < \e (1.9.1.4) so that \\y(t)\\ < e if t g K. To see that c00(n,X) is a dense subset of c0(H,X) suppose that x G c0(H,X) is arbitrary and find K G Bn for which H^MII <s'&t$.K\ now define xx : H —► X by setting for each t G H, 1° Evidently xK G c00(n,X) and Since e is arbitrary, this proves (1.9.1.1). Toward (1.9.1.2) we have for each inf sup ||x(t) || = inf ||x-xx|| > dist(x,c0(H,X)) (1.9.1.7) K t(£K K and if y G c0(H,X) is also arbitrary, infsup||x(*)|| =infsup||s(*)-y(*)|| < ||x-y||oo (1.9.1.8) K t(£K K t(£K Taking the inf over y G c0(H,X) gives (1.9.1.2). Finally it is clear, toward (1.9.1.3), that dist(x,c0(n,X)) < Hx^ = ||y|| if x = 1 © y is constantly equal to y; conversely, provided Bn is nontrivial, if z G c00(n, X) is arbitrary then there will always be t G H for which \\x(t) — z(i)\\ = \\y — 0|| = ||y||. Since c00(n,X) is dense in c0(H,X) we have proved (1.9.1.3). ■ \iteK tit&K (1.9.1.5) x — x klloo <e (1.9.1.6)
1.10 Normed Linear Algebras 19 When H = N with the bounded structure of finite subsets, then the expression in (1.9.1.2) is familiar: if x G l^X) = /^(NjX) then dist(x,c0(X)) = limsup||xj| (1.9.1.9) n—nxi The space cx(X) is also recognizable: if as in (1.9.0.6) i-t/G co(^0 *nen the vector y is unique: y = lim xn (1.9.1.10) n—►oo The "enlargements" of a normed space X are built out of the quotients of the spaces 1^(0, X) by the subspaces c0(H,X) for nontrivial H: 1.9.2 DEFINITION If H has a nontrivial bounded structure and X is a normed space then Qn(x)=ioo(n,x)/c0(n,x) (1.9.2.1) If in particular H = N with the homology of finite subsets then Q(x) = UX)Ao(x) (1-9-2-2) We are entitled to call Qn(X) an "enlargement" because it contains a copy of the space X, made up of the cosets x + c0(H,X) of constant functions x = l©y. For each y G X this coset consists of exactly the sequences x G c1(H,X) which "converge to y." 1.10 NORMED LINEAR ALGEBRAS Many of the normed spaces of Section 1.8 have an additional element of structure, in that the underlying vector space X = A is also a ring, and in fact a linear algebra: there is a multiplication a, b —> a • 6 = ab : Ax A —> A (1.10.0.1) for which, if a, 6, c G A, a • (b - c) = (a- b) • c K J K J (1.10.0.2) a • (6 + c) = (a • 6) + (a • c) (a + 6) • c = (a • c) + (6 • c) and if also t G K then (*a) • b = t(a • 6) = a • (*6) (1.10.0.3)
20 1. Normed Linear Spaces We will always assume that a linear algebra has an identity, in the sense of an element 1 G A for which 1. a = a = a • 1 for each a G A (1.10.0.4) 1.10.1 DEFINITION A normed linear algebra A is a normed linear space and a linear algebra for which lla* b\\ < Nlll&ll for each a, 6 € A (1.10.1.1) and, unless 1=0, ||1|| = 1 (1.10.1.2) We have not quite ruled out the possibility that 1 = 0: it occurs if and only if A = 0. For example if H is a nonempty set then /^H, K) is a normed linear algebra if we put, for each a,b, (a • b)(t) = a(t)b{t) for each t G H (1.10.1.3) More generally, if A is a normed linear algebra then so is /^(n, A). If B C A is a linear subspace for which leB and B-BCB (1.10.1.4) then B is a subalgebra of A, and becomes a normed algebra in its own right. If J C A is a linear subspace for which A-JCJ and J-AC A (1.10.1.5) then J is two-sided ideal of A, and the quotient A/ J is a linear algebra. Here the quotient seminorm of (1.4.1.5) satisfies the compatibility condition (l. 10.1.1): thus if also J is closed in the norm topology then A/J becomes a normed algebra. For example if H is a topological space then C^Q, A) is a subalgebra of /^(n, A); if n has a bounded structure then c0(fi,A) is a two-sided ideal of /^(n, A). If A is a linear algebra and a G A then we define an for each n G N by induction, setting a1 =a and an+1 =a-an for each nGN (1.10.1.6) Then if k G N and c0, cl9 ..., ck G K and / = c0 + c^ H + c^* is a polynomial we put /(a) = c0l + cxa + • • • + ^a* (1.10.1.7)
1.11 Partially Ordered Spaces 21 In this way we can treat a polynomial as a mapping / : A —> A. We can even interpret rational functions: if a G A we write a~l for the inverse of a, in the sense that a-a"1 = 1 = a"1-a and a"1 G A (1.10.1.8) It is a familiar exercise that (1.10.18) determines a~l uniquely, if it exists at all: thus we get a mapping z~l : A~l —> A defined on a certain subset A'1 C A. One final comment: if A is a normed linear algebra, with multiplication a, 6 —► a • 6, then the mapping a, 6 —>b-a = a-'b:AxA —► A (1.10.1.9) will also satisfy the compatibility condition (1.10.1.1) as well as the algebraic conditions (1.10.0.2) and (1.10.0.3). What we obtain in this way is called the algebra formed by reversal of products: this process sometimes enables us to prove two theorems at once. 1.11 PARTIALLY ORDERED SPACES Many of the normed spaces of Section 1.8 have another element of structure: the underlying set also carries a partial order. By a partial order on a nonempty set H we shall understand a "relation" < —which the reader may or may not care to identify with its "graph," graph(^) = {(x,t/) G n2:x < y} C H x H (1.11.0.1) which satisfies three conditions: for each x,y,z, x<x (1.11.0.2) x<y <x => y = x (1.11.0.3) x<y <z => x<z (1.11.0.4) Variants are possible: both the "reflexivity" (1.11.0.2) and the "symmetry" (1.11.0.3) are dispensable. The crucial condition is the "transitivity" (1.11.0.4). If H is a partially ordered set then so is every subset of H, by the same relation <. We call a partially ordered set totally ordered if there is implication, for each x,y G H, x<y or y < x (1.11.0.5) A weaker requirement is that H be directed, in the sense that if x,y £ H then there is z G H for which x < z and y < z (1.11.0.6)
22 1. Normed Linear Spaces For example the "homologies" of Section 1.9 are directed if we interpret UK < H" to mean UK C #." If H is a partially ordered set and K C H and i6fi then we say that x is an upper bound for K provided y<x for each y G K (1.11.0.7) Thus, in a directed set every finite subset has an upper bound. 1.11.1 DEFINITION The partially ordered set H satisfies Zorn's condition if every totally ordered subset of H has an upper bound in H (1.11.1.1) If x E H is an upper bound for H then we call it a maximum element: it is clear from (1.11.0.3) that a maxium element is uniquely determined. A weaker condition is to describe x G H as maximal if there is implication x^yGH=^y = x (1.11.1.2) We are now in a position to state 1.11.2 ZORN'S LEMMA If the partially ordered set H satisfies Zorn's condition then H has maximal elements (1.11.2.1) We offer no proof of Zorn's lemma: in the presence of "the usual axioms of set theory" it is logically equivalent to "the axiom of choice." The reader may care to use Zorn's lemma to prove the following sharpened version: 1.11.3 THEOREM If H satisfies Zorn's condition then every totally ordered subset of H has a maximal upper bound. (1.11.3.1) Proof: If K C H is totally ordered, observe that KA = {xetlix is an upper bound of K} (1.11.3.2) also satisfies Zorn's condition. ■
1.11 Partially Ordered Spaces 23 The reader should note that the condition (1.11.3.1) implies Zorn's condition. In applications we usually specialize (1.11.3.1) to single elements K = {x}: thus if H satifies Zorn's condition then each element x G H satisfies x <y for some maximal element y G H. EJisa linear space then we are interested in partial orders which are compatible with the linear structure: if x,y,z G X we ask that x <y => x + z<y + z (1.11.3.3) and x<y=>tx<ty ifO<*GR (1.11.3.4) If we now write X+ = {xeX:0<x} (1.11.3.5) then we find that, for each x,y G X, x<y <=> y - x G X+ (1.11.3.6) and also that X+ is a cone, in the sense that it has the following three properties: X+ + X+CX+ (1.11.3.7) tX+ C X+ if 0 < t e R (1.11.3.8) (X+) H (-X+) = {0} (1.11.3.9) Conversely, if X+ is a cone in X then the relation < defined by (1.11.3.6) satisfies the conditions (1.11.3.3) and (1.11.3.4), as well as (1.11.0.2)- (1.11.0.4). 1.11.4 DEFINITION A partially ordered normed space X is a normed space X with a cone X+ for which X+ = cl(X+) (1.11.4.1) If X is a partially ordered normed space we shall usually write x < y ^=> y-xeX+ (1.11.4.2) For example, if H is a nonempty set and X is one of the spaces /p(H, K) we may take X+ = {x e X: 0 < x(t) e R for each t G H} (1.11.4.3)
2 Bounded Linear Operators The compatibility between the algebraic and the topological structures of a normed space is reflected in the theory of linear operators between them: it is particularly easy to decide whether or not a linear operator is continuous. 2.1 CONTINUITY OF LINEAR OPERATORS A mapping T : X —► Y between linear spaces is called linear if it respects linear combination: for each x,y G X and each 5,iGK T(sx + ty) = sTx + tTy (2.1.0.1) We recall from Section 1.2 that T is continuous at a point x G X iff for each e > 0 there is 6 > 0 for which yeX and \\x - y\\ < 6 ==> \\Ty - Tx\\ < e (2.1.0.2) If this holds for each x in a set K C X then T is said to be continuous on K: note that the number 6 > 0 satisfying (2.1.0.2) will in general depend both on e > 0 and on x G K. If for each £ > 0 there exists 6 > 0 for which (2.1.0.2) holds for each x G if, then T is said to be uniformly continuous on K. The remarkable fact about a linear operator T : X —► Y between normed spaces is that if T is continuous at any single point x G X, then it is uniformly continuous on the whole of X. To see this we begin by observing, with the aid of translation invariance, that if x G X and T : X —> Y is linear, then T continuous at x <=> T continuous at 0 (2.1.0.3) To go further, we make a definition: 25
26 2. Bounded Linear Operators 2.1.1 DEFINITION A linear mapping T : X -> Y between normed spaces is called bounded if there is k > 0 for which ||Tx|| < ifc||x|| for each xeX (2.1.1.1) The greatest lower bound of the set of k > 0 for which (2.1.1.1) holds is called the operator bound of T: \\T\\ = inf {A; > 0: llr-H < Jk||-|| on X} (2.1.1.2) The literal-minded reader should note that a bounded linear operator T : X —► Y need not belong to the space /^(X, Y) of (1.8.0.4): indeed he may like to check that the only linear mapping in ^{X, Y) is the constant mapping 0. For linear operators, boundedness is necessary and sufficient for continuity: 2.1.2 THEOREM If T : X -► Y is linear between normed spaces then the following are equivalent: T is continuous at 0 (2.1.2.1) T is bounded (2.1.2.2) T is uniformly continuous on X (2.1.2.3) Proof: It is trivial that (2.1.2.3) implies (2.1.2.1) and rather easy to see that (2.1.2.2) implies (2.1.2.3): for if k > 0 satisfies (2.1.1.1) and e > 0 then (2.1.0.2) holds for all x G X with 6 = e/k. We claim that (2.1.2.1) implies (2.1.2.2): for suppose that T is continuous at 0 and that 6 > 0 is such that ||jTx|| < e = 1 whenever ||x|| < 6; then if 0 ^ x G X the vector £x/||x|| has norm 6 and hence ||rx|| = (||x||/*)||r(*x/||x||)||<||x||/« (2.1.2.4) But this is (2.1.1.1) with k = 1/6. Since (2.1.1.1) always holds for x = 0, the proof is complete. ■ The reader should remember this argument from Theorem 1.7.1. 2.2 THE NORMED SPACE OF BOUNDED OPERATORS If X and Y are normed spaces over the field K we shall write L(X,Y) = LK(X,Y) (2.2.0.1) for the set of all linear mappings from X to Y. If S and T are in 1,{X,Y) and 6, t G K, then the mapping sT+tS : x —> sTx + tSx from X to Y is also
2.2 The Normed Space of Bounded Operators 27 linear, and the process of forming the linear combination sT + tS satisfies the rules (1.1.0.1) and (1.1.0.2): thus L{X, Y) is another vector space over K. If, in particular, S and T are continuous then Theorem 1.3.1 shows that all the linear combinations sT + tS are also continuous: thus the continuous linear operators form a linear subspace of the space of linear operators. We shall prefer to derive this "topological" statement from a more detailed "metric" statement: we begin by finding several expressions for the operator bound. 2.2.1 THEOREM If X ^ 0 and if T : X -> Y is bounded and linear then llTsll ||r|| =supI1n-nli = sup ||Tx|| = sup ||Tx|| (2.2.1.1) x^O ||*|| ||x|| = l ||x||<l Proof: If 0 ^ x G X then ||Tx|| < ifc||x|| if and only if ||rx||/||x|| < k, which establishes the first equality, and the second follows if we divide x by ||x||. The last expression is certainly no smaller than its predecessor. Conversely, whenever 0 < ||x|| < 1 there is xf = x/||x|| for which ||x'|| = 1 and ||2V|| > \\Tx\\, giving the final equality. ■ In the degenerate case X = 0, two out of three expressions in (2.2.1.1) are meaningless, bu the last is meaningful, and confirms ||T|| = 0. The literal-minded reader will now see that a linear operator T G L{X, Y) is "bounded" if and only if its restriction to the set H = Discx(0 ; l) is in the space l^n^Y) of (1.8.0.4): indeed the operator bound ||-|| then coincides with the norm H-^ of (1.8.0.2). The operator bound is a quasinorm on the space L{X, Y): 2.2.2 THEOREM If X and Y are normed linear spaces and S,T G L{X,Y) are bounded then so are sT + tS for each s,t G K, with \\sT + tS\\ < \s\\\T\\ + |*|||S|| (2.2.2.1) If Z is another normed space and U G L{Y, Z) is also bounded, then so is UT, with ||C/r||<||C/||||r|| (2.2.2.2) Proof: If x G X is arbitrary, then ||(*T + tS)x\\ < \s\\\Tx\\ + \t\\\Sx\\ < (H||m + |t|||5||)||x|| so that (2.1.1.1) holds for sT + tS with A; = |s|||r|| + |*|||S||, giving (2.2.2.1),
28 2. Bounded Linear Operators and confirming that sT + tS is bounded. Also ||prx||<||P||||rx||<||P||||r||||x|| so that (2.1.1.1) holds for UT with k = ||l/||||r||, giving (2.2.2.2) and confirming that UT is bounded. ■ The "topological" fact that the composition of two continuous mappings is continuous tells us that if U and T are both bounded then so is UT: the second part of Theorem 2.2.2 is just a more detailed metric version of this. We shall write ||r|| = oo if T e L{X,Y) is not bounded (2.2.2.3) and BL(X,y) = BLK(X,y) = {Te L{X,Y): \\T\\ < oo} (2.2.2.4) for the normed space of bounded operators from X to Y. In the special case Y = X, Theorem 2.2.2 tells us that BL(X,Y) = BL(X,X) is a normed linear algebra, in the sense of Definition 1.10.1. In particular, if T G BL(X,X) then we define the powers Tn of T as in (1.10.1.6), and more generally f(T) for a polynomial / = c0 + cxz + h ckzk : K -> K, as in (1.10.1.7). 2.3 SUBSPACES AND QUOTIENTS If X and Y are normed linear spaces then the mapping 0 : X —► Y defined by setting 0(x) = 0 e Y for each xeY (2.3.0.1) is always linear, and bounded with bound ||0|| =0. If Y = X then the mapping / = Ix : X —► X defined by setting I{x) =xeX for each xeX (2.3.0.2) is also linear, and bounded with bound ||/|| < 1. Unless X = 0 in fact ||/|| = 1, as we see from the second expression in (2.2.1.1): indeed the "sup" can here be replaced by "max". More generally, if X1 is the space obtained from X by giving it a new norm ||-||;, then the mapping /' : X —► X1 defined by (2.3.0.3) will be continuous if and only if ||-||; is weaker than ||-|| in the sense of (1.7.0.6): thus Theorem 1.7.1 is actually a special case of Theorem 2.1.2. If Y C X is a subspace, normed with the norm of X, then the mapping
2.3 Subspaces and Quotients 29 J = JY :Y —► X defined by setting J{y) =yeX for each y eY (2.3.0.3) is known as the natural injection of Y in X; evidently it is linear and bounded, with \\J\\ < 1 and \\J\\ = 1 unless Y = {0}. When the subspace Y is closed, so that the quotient X/Y has a norm, then the mapping K = KY : X -+ X/Y defined by setting K{x) = x + Y G X/Y for each xeX (2.3.0.4) is again linear and bounded, with bound < 1: we shall call it the quotient map induced by Y. If for example Y = X, so that X/Y = 0, then K = 0 has bound 0. If Y = c\(Y) ^ X then we claim that ||iir|| = 1, although we must appeal to the Riesz lemma: in fact (1.5.2.1) tells us that ||jK"|| > t whenever 0 < t < 1. If subspaces and quotients give rise to bounded linear mappings, then too, bounded linear mappings give rise to subspaces and quotients. If T G BL(X,y), then T~l(0) = {x G X: Tx = 0 G Y} (2.3.0.5) is a linear subspace of X, called the null space of T. We claim that it is closed: for if \\xn - x\\ -+ 0 as n -+ oo with xn G T_1(0) then Tx = 0 by continuity. We shall sometimes refer to the natural injection of T_1(0) in X as the kernel of T: ker(T) = J : r_1(0) —> X (2.3.0.6) The range of T is the linear subspace T{X) = {Tx: x e X} C Y (2.3.0.7) of the "codomain" Y of X: it will turn out to be quite difficult to know whether or not T(X) is closed in Y. To be rather pompous, we will take the quotient map induced by the closure of the range of T, and call it the cokernel of T: coker(T) = K :Y —► *7 cl{TX) (2.3.0.8) The kernel and cokernel mappings can be used to decide whether or not the product of two operators is zero: 2.3.1 THEOREM If X, Y and Z are normed spaces and T G BL(X, Y), S G BL(y, Z), then the following are equivalent: ST = 0eBL(X,Z) (2.3.1.1)
30 2. Bounded Linear Operators cl(rX) C 5_1(0) C Y (2.3.1.2) T = (kerS)U for some U e BL(X,5_1(0)) (2.3.1.3) S = y(cokerT) for some V G BL{Y/cl{TX),Z) (2.3.1.4) Proof: The equivalence of (2.3.1.1) and (2.3.1.2) is clear, recalling that 5_1(0) is closed in Y, and it is also clear that if either (2.3.1.3) or (2.3.1.4) holds then so does (2.3.1.1). If we assume (2.3.1.1) then we can obtain U : X —*■ 5_1(0) to satisfy (2.3.1.3) by the simple expedient of writing Ux = Tx for each x € X. There is also a well-defined mapping V : Y/ cl(TX) —> Z defined by setting V{y + cl(TX)) = Sy for each y EY, which is evidently linear, and satisfies VcokerT = 5. To see that V is bounded we go back to the Riesz lemma again: (1.5.2.1) tells us that ||V|| < ||S*||/£ whenever 0 < t < 1, and hence that ||V|| < ||5||. ■ As special cases of Theorem 2.3.1 we can take either T = J : X —> Y for a subspace XCYoiS = K:Y^Z = Y/W for a subspace W C Y: we learn that X C 5"1 (0) iff S can be factored through V : Y/X -> Z, and that TX C W iff T can be factored through U : X -> W. This suggests certain isomorphisms between spaces of bounded operators: 2.3.2 THEOREM If Z C X and V7 C y are closed subspaces there is isometry BL{X,W) = {76 BL(X,y):TX C^}C BL(X,y) (2.3.2.1) and BL(X/Z,y) ^{TGBL^yjiZCr-^O)} CBL(X,y) (2.3.2.2) Proof: The correspondences indicated by the notation (1.7.0.4) are those derived from (2.3.2.3) and (2.3.1.4), and are certainly one-one and onto, and linear; we must see that they preserve operator bounds. This is trivial for (2.3.2.1), since it is rather hard to distinguish between an operator T : X -> y for which TX C W and the induced mapping T : X -> W; for (2.3.2.2) however we need the Riesz lemma, as in the proof of (2.3.1.4). ■ The reader may at this point be tempted to guess that the spaces BL(X, Y/W) and BL(Z, Y) are isomorphic to certain quotients of the space BL(X, y). He is invited to read on. For the moment, we will see how near, and yet so far, the kernel and the cokernel can get to the operator T: 2.3.3 THEOREM ETG BL(X,Y) there is a bounded linear operator core(T) : X/T'^O) —► cl{TX) (2.3.3.1)
2.4 Cartesian Products 31 for which T = ker(coker(T)) o core(T) o coker(ker(T)) (2.3.3.2) Proof: This can be built up from Theorem 2.3.1. Since Tker(T) = 0 there must be V : X/r_1(0) -► Y for which T = V o coker(ker(T)), and then, since coker(r)T = 0 it follows that coker(r)V = 0, giving V = ker(coker(T))C/ for some U : X/T'1^) -► cl(TX): thus take core(T) = U. Alternatively, simply define core(T)(x + T_1(0)) = Tx e cl(Tx) for each xeX (2.3.3.3) and confirm that core(T) is well defined, linear, and bounded, and satisfies (2.3.2.2). ■ Theorem 2.3.3 is sometimes referred to as the canonical factorization of T: Y/cl(TX) coker(T)^^^ ker(coker (T)) ^V^ ^^coker(ker(T)) core(T) cl(TX) *+ X/T"1(0) 2.4 CARTESIAN PRODUCTS If X = Y x Z is a cartesian product of normed spaces, normed so as to have the cartesian product topology, then the mappings 7rx : X —► Y tt2 : X —► Z ttJ : Y —► X <k'2 : Z —► X defined by setting, for each y EY and z G Z, ^i(y>«) = y and 7r2(y^) =z (2.4.0.1) and *i(y) = M) and *£(*) = (M (2.4.0.2) are linear and continuous. If, in particular, X = Y ©p Z with pG {l, 2,00} as in (1.6.0.l)-(l.6.0.3), then each of these operators is of bound < 1, with equality unless Y = 0 or Z = 0. If W is another normed space we shall define col{S,T){w) = {Sw,Tw) eY xZ for each w G W (2.4.0.3)
32 2. Bounded Linear Operators if S G BL{W, Y) and T G BL(W, Z), and define row(5, T) (y, z) =Sy + TweW for each {y,z) eY x Z (2.4.0.4) if S G BL(y,V7) and T G BL(Z,V7): mentally we are viewing the product Y x Z as a space of "column vectors." 2.4.1 THEOREM If X = Y xZ and W are normed spaces then there is isomorphism BL{W,X) = {col(5,r):5 G BL{W,Y),T G BL(W,Z)} - BL(VP,y) x BL(VP, Z) (2.4.1.1) and BL(X,W) = {row(5,r):5GBL(y,V^),rGBL(Z,^)} ~ BL(Y,W) x BL{Z,W) (2.4.1.2) Proof: If 5 G BL(VP,y) and T G BL(VP,Z), then certainly col(5,T) = 7r[ • S + 71*2 • T is linear and bounded; conversely, if U G BL(W, X), then U = col(5,r) with S = 7rxU G BL(VP,y) and T = tt2U e BL{W,Z). If instead 5 G BL(y,V7) and r G BL(Z,W), then row(5,T) = Sirx + Ttt2 is linear and bounded; conversely, if V G BL(X,W), then V = row(5,T) with 5 = Vtt; G BL(y, W) and T = Vir'2 G BL(Z, W). This establishes the equality in each of (2.4.1.1) and (2.4.1.2), and the linear isomorphisms at the end. To see that these linear isomorphisms preserve topology we shall be more specific, and show that there is isometry BL{W, y ©^ Z) = BL{W, Y) ©^ BL{W, Z) (2.4.1.3) and BL(y ©! Z,W) s BL[Y,W) ©^ BL(Z,W) (2.4.1.4) Indeed, if S G BL(W,Y) and T G BL{W,Z) are arbitrary, then it is elementary that sup max(||Sw||,||rw||) = max( sup \\Sw\\, sup ||Tti;||) (2.4.1.5) IMI<i IMI<i|l IMI<i which is (2.4.1.3), while if instead S G BL{Y,W) and T G BL(Z,V7) then for each y €Y,z G Z \\Sy + r*|| < ||S||||y|| + ||r||||s|| < max(||5||, ||r||)(||y|| + \\z\\) (2.4.1.6) which says that || row(5, T)|| < max(||5||, ||T||). Conversely, by taking z = 0 and varying y G y it is clear that ||5|| < || row(5,T)||; similarly ||r|| < ||row(5,r)||. ■
2.5 Projections 33 The reader is invited to ponder which norms are induced on the products BL{W, Y) x BL{W, Z) and BL(y, W) x BL(Z, W) by their isomorphism with BL{W,Y ©! Z) and BL{Y ©^ Z,W), respectively. If X = XY x X2 and Y = Yx xY2 are both cartesian products then two applications of Theorem 2.4.1 show that each T G BL(X,Y) can be represented as an "operator matrix": (2.4.1.7) where for each i and j we have Tty G BL(Xy,yt). In the special case in which T12 = 0 and T21 = 0 we shall sometimes write r„ rM r» ^22. xl .X2. T11x1 + .r12x2 ^21Xl + ^22X2 r = r11er22 = diag(r11,r22) (2.4.1.8) If X = Y x Z is a cartesian product then Xj = 7x {0} and X2 = {0} x Z are subspaces of X, both closed, hence giving rise to quotient spaces X/X1 and X/X2: 2.4.2 THEOREM IfX = yxZisa product of normed spaces, then X/{Y x {0}) c± Z and X/({0} x Z) ~ y (2.4.2.1) Proof: We claim, more specifically, that (ye»We.{0})S7 (2.4.2.3) with correspondence (y, 2) + y x {0} <-► 2: in fact if y, 2 are arbitrary then inf maxdly-y'lUI*- 0||) = ||'ll ■ (2.4.2.4) y'EY Theorem 2.4.2 shows that the "topological" part of Theorem 2.4.1 is contained in Theorem 2.3.2. 2.5 PROJECTIONS If y and Z are a pair of supplemented closed subspaces of a normed space X, in the sense (1.6.1.5) that Y + Z = X and Y [\Z = {0}, then every vector xGl has a unique representation in the form x = y + z with y EY and z G Z. The mapping 7Ty : x —> y from X to X which implements this is called the projection of X upon Y in the direction of Z: formally if x G X then 7rf (x) G y and x - 7rf (x) G Z (2.5.0.1) The conditions (1.6.1.5) ensure that (2.5.0.1) indeed defines a mapping,
34 2. Bounded Linear Operators which is evidently linear, and also idempotent in the sense that, if E = tt§ (2.5.0.2) then E2 = E (2.5.0.3) If (2.5.0.2) holds then the subspaces Y and Z each have two expressions in terms of E: Y = {E{x):x eX} = {xeX: E{x) = x} (2.5.0.4) and Z = {xeX: E{x) = 0} = {x - E{x):x e X} (2.5.0.5) Conversely, if E : X —* X is linear and idempotent in the sense (2.5.0.3) and the subspaces Y and Z are defined by (2.5.0.4) and (2.5.0.5), then Y and Z satisfy the conditions (1.6.1.5), and (2.5.0.2) holds. Further, if we only assume half each of the conditions (2.5.0.4) and (2.5.0.5), then the other halves each follow. Also if (2.5.0.3) holds, then [I-E)2 = I-E (2.5.0.6) and if (2.5.0.2) holds then J - E = tt| (2.5.0.7) If linear subspaces satisfy the condition (1.6.1.5) and if the mapping 7Ty defined by (2.5.0.1) is continuous, then both Y and Z must be closed, since by (2.5.0.4) and (2.5.0.5) we have Y ={I-E)~1(0) and Z = £_1(0), where E = 7Ty. If, conversely, the subspaces Y and Z are both closed, then it becomes a surprisingly delicate matter whether or not E = 7Ty is bounded. We begin by observing that the boundedness of 7Ty is equivalent to the condition that the subspaces Y and Z are complemented, in the sense of (1.6.1.7): 2.5.1 THEOREM If closed subspaces Y and Z of the normed space X are supplemented, in the sense that Y + Z = X and Y f]Z = {0}, then the following are equivalent: YxZ~X (2.5.1.1) X/Z-Y and X/Y c* Z (2.5.1.2) 7Ty is bounded (2.5.1.3) 7r£ is bounded (2.5.1.4)
2.5 Projections 35 Proof: It is to be understood that in (2.5.1.1) and (2.5.1.2) the isomorphisms are those induced by the natural mappings. If closed subspaces Y C X and Z C X are supplemented in the sense of (1.6.1.5) then the mappings row(JY, Jz) : (y, 2) —► y + z from Y x Z to X (2.5.1.5) and co\(Kz, KY) : y + z —► (y + z + Z, y + z + Y) (2.5.1.6) = (y + Z, z + y) from X to (X/Z) x {X/Y) are linear, of bound < 1, as well as being one-one and onto. Thus, the topology of the space X is weaker than that induced by the product Y x Z, and stronger than that induced by the product {X/Z) X (X/Y). Thus the condition (2.5.1.2) implies the condition (2.5.1.1); conversely if (2.5.1.1) holds, then Theorem 2.4.2 shows that (2.5.1.2) does. If (2.5.1.1) holds then the mapping E = 7Ty is bounded, since it is the composition JY o<KloT<m(JY,Jz)-1 :y + z—► [y,z) —>y—>y (2 5 17) from X to X where the mapping 7r1 is given by (2.4.0.1). Conversely, if E = 7Ty is bounded then (2.5.1.2) holds, since for each y €Y and z G Z we have ||y|| < l|£||||y + *ll and ||*|| < ||J-£||||y + 2|| < (l + ||£||)||y + z\\ (2.5.1.8) and hence ||y|| < \\E\\dist{y + z,Z) and (2.5.1.9) ||*|| < ||/-E||dist(y + *,y) We have proved (2.5.1.1) => (2.5.1.3) => (2.5.1.2); this with a similar argument for (2.5.1.4) finishes the proof. ■ The reader should never forget that the projection E = 7Ty of (2.5.0.1) depends on the subspace Z just as much as on the subspace Y. If E is bounded and Y ^ {0} then \\E\\ > 1 (2.5.1.10) and it may be quite difficult to arrange Z in such a way that ||i5|| = 1. If E = E2 and F = F2 are projections on the same space X then the operator EF may or not be a projection:
36 2. Bounded Linear Operators 2.5.2 THEOREM If E = E2 and F = F2 are projections on X for which EF = FE (2.5.2.1) then EF and E + F — EF are projections, with {EF){X) = (EX)n(FX) and (2.5.2.2) (EF)-10 = E~10 + F-10 and [E + F- EF){X) = {EX) + {FX) and (2.5.2.3) {E + F - EF)-1!) = E^O + F~x0 Proof: If EF = FE then {EF)2 = EFEF = E2F2 = EF (2.5.2.4) so that EF is also a projection; also I - {E + F - EF) = {I - E){I - F) = {I - F){I - E) (2.5.2.5) is similarly a projection, and hence E-\-F—EF is. Towards (2.5.2.2) observe EF{X) C E{X) and EF{X) = FE{X) C F{X) (2.5.2.6) while, conversely, if x G EX f] FX then x = £"x and x = Fx and hence x = F-Fx G EF{X). This gives the first equality: for the second we have E-x0 C E"1^-^) = (FE)"^ = {EF)-1!) and (2.5.2.7) F_10 C F-^E^O) = {EF)-1!) while conversely if x G {EF)-1!) then x = {x-Fx)-\-Fx with x-Fx G F_10 and Fx G £'~10. This gives the second equality in (2.5.2.2), and now both parts of (2.5.2.3) follow by application of (2.5.2.2) to I - E and I- F. m There is a connection between the "algebraic" and the "spatial" relation of an operator to a projection:
2.5 Projections 37 2.5.3 THEOREM If T G BL[X,Y) and E = E2 e BL(X,X) and F = F2 eBL{Y,Y), then T{EX) CFY <=>TE = FTE (2.5.3.1) and T(E~l0) C i^O <=> FT = FTE (2.5.3.2) Hence [T{EX) C Fy and T{E~l0) C i^O) <=>TE = FT (2.5.3.3) Proo/: For (2.5.3.1) argue T(£X) Cfy = (J- F)"^ <=»(/- F)2\E = 0 (2.5.3.4) and for (2.5.3.2) F_10 2 r(E_10) = T(J - E)X <=> FT{I -E)=0 (2.5.3.5) Forward implication in (2.5.3.3) now follows immediately: conversely, observe TE = FT=^TE = FTE = FT. m (2.5.3.6) It is not necessary for projections E and F to commute for EF to be a projection: 2.5.4 THEOREM If E = E2 and F = F2 are projections on X for which {I - F){I - E) = 0 (2.5.4.1) then EF = (EF)2 is a projection on X for which E(EF) = (EF)E = EF (2.5.4.2) and F{X) =E-1{0) + {EF){X) and (2.5.4.3) E^^niEF^X) ={0}
38 2. Bounded Linear Operators Proof: To see that EF is a projection note, using (2.5.4.1), {EF)2 = E{FE)E = E{F + E- EF)F = EF2 + E2F - E2F2 = EF (2.5.4.4) For (2.5.4.2) note E{EF) = E2F = EF and (2.5.4.5) {EF)E = E{F + E - I) = EF + E2 - E = EF Towards (2.5.4.3) the second equality follows from (2.5.4.2), using Theorem 2.5.2; inclusion E'1^) C F(X) is just a restatement of (2.5.4.1), while inclusion {EF)X C F{X) follows from (J - F)EF = EF-{F + E-I)F = 0 (2.5.4.6) Conversely, inclusion F(X) C £'~1(0) + (EF)(X) follows from the implication x e F{X) => x = Fx => x = (x - Ex) + Ex = {I- E)x + [EF)x e E_1(0) + [EF)X -i,^ . ,„^v _ (2-5-4-7) The reader should note carefully that we do no£ claim F=[I-E)+EF (2.5.4.8) if (2.5.4.1) holds, then (2.5.4.8) is equivalent to (2.5.2.1). The reader should also write out the three other theorems to be obtained by interchanging E and I — E, or F and I — F, or both. With no restriction at all on the projections E and F we have VU = I - [E - F)2 = UV (2.5.4.9) where U = FE + [I-F){I-E) and (2.5.4.10) V = EF+{I-E){I-F) Also U{EX) C FX and U(E~l0) C F"1^) (2.5.4.11) and V{FX) C EX and ^(F"^) C E"1^) (2.5.4.12)
2.6 Sequence and Function Spaces 39 (2.5.4.9) does not quite tell us that U and V are mutually inverse. The reader is invited to see what happens if, for example, E and F have the same range, or instead the same null space. 2.6 SEQUENCE AND FUNCTION SPACES Suppose that H is a nonempty set that T : X —> Y is a linear mapping: then the mapping Tn : Xn -» Yn defined by setting {Tnx)(t) = T[x[t)) for each t G H for each x e Xn (2.6.0.1) is again linear. If S : X —> Y and U : Y —> Z are also linear then {sT + tS)n = sTn + tSn for each s,teK (2.6.0.2) and {UT)Q = UQTn (2.6.0.3) For bounded linear mappings between normed spaces it is relevant to restrict Tn to the normed spaces /p(H,X) of (1.8.0.4). 2.6.1 THEOREM If T G BL(X,y) is a bounded linear mapping of normed spaces and H is a nonempty set, then for each p G {l,2,oo} there is inclusion T%(n,x)cip(n,Y) (2.6.1.1) and inequality \\TnX\\p < \\T\\\\x\\p for each x G /p(H,X) (2.6.1.2) If H is a topological space then also rncoo(n,x)cc„(0!y) (2.6.1.3) Proof: We begin by establishing (2.6.1.2) for arbitrary x G Xn, and the quasinorms ||-||p (p = 1,2,00). For p = 1 and p = 2 let H' be an arbitrary finite subset of H: then Yl llr(*W)llP < ll^ir Yl llxWHP for each xeXn (2.6.1.4) ten' ten' Raising to the power l/p, and taking suprema over finite H' C H gives the extended version of (2.6.1.2) for p = 1 and p = 2; taking p = 1 and H' = {t} in (2.6.1.4) and then the supremum over all t G H, gives the extended version of (2.6.1.2) for p = 00. Now both (2.6.1.1) and (2.6.1.2)
40 2. Bounded Linear Operators follow. For (2.6.1.3) we combine (2.6.1.1) with the inclusion rnc(n, x) c c(n, y) (2.6.1.5) which holds because the composition of continuous functions is continuous. ■ Strictly speaking, we should write something like / (H,T) for the restriction of the mapping T^: we shall however continue to write Tn. We can be quite explicit about its bound: 2.6.2 THEOREM If T G BL[X,Y) and H is a nonempty set, then for each p G {1,2,00} ||/p(n,r)|| = ||rn||p = ||r|| (2.6.2.1) If H is a topological space, then also 11^(0,^11 = 1^11 (2.6.2.2) Proof: It is clear from (2.6.1.2) that the restriction /p(n,T) of Tn to Zp(n,X) is bounded, with ||/p(n,r)||<||r|| (2.6.2.3) KteH and iGl consider the mapping 6t 0 x : H —> X defined by setting [ x if 6 = £ &©*)(«) = {n .r ^ (2.6.2.4) for each s G H: 6t is the "Kronecker delta," and the notation is (1.8.0.6). Evidently ||£ 0 x||p = ||x|| for each p G {1,2,00} (2.6.2.5) and Tn {St Gx) = 6teTxeYn (2.6.2.6) We can how reverse the inequality (2.6.2.3): for a fixed iGfi and arbitrary x G X, ||Tx|| = ||*t0rx||p = ||rn(*t©*)||p < ||rn||||*t©*||p = ||rn||||x|| This argument is liable to break down for (2.6.2.2), since it is most unlikely that the mappings St © x : H —* X will be continuous. Instead, however, we have the constant mappings 1 0 x : H —* X which send each
2.7 Enlargements 41 t G H into the same vector x: thus ||Tx|| = ||1 © Tx\\„ = \\T*(1 0 x)!^ < ||rn||||l © x\\„ = ||Tn||||x|| ■ If H is again a set and T G BL(X, Y), then there is also an operator Tn: M(H, X) —> M(H, Y) between the spaces of "finitely additive set functions" of (1.8.2.2), defined by setting (r°M)(*) = W0) (2627) for each K C H for each \i G M(fi, X) The reader may like to verify that, in the notation of (1.8.2.3) and (1.8.2.4) r^M^n,*) cM^n.y) and (2.6.2.8) l|rnM|li<||r|||Hli ifMGM^ax) Also the bound of the operator Tn on M1(H,X) is equal to ||T||. 2.7 ENLARGEMENTS If H has a bounded structure, or bornology, and T G BL(X, Y), then the operator Tn respects the spaces c00(n,X), c0(H,X) and c1(H,X) of Section 1.9. 2.7.1 THEOREM If ft has a bounded structure and T G BL(X, y), then rnc+(n,X) C c+(H,y) for each c, = Cqo,^^! (2.7.1.1) There is equality ||c.(n,T)|| = ||r|| (2.7.1.2) in each case. Proof: If x G Xn and t G 0, then x(i) = 0 ==> (Tnx)(*) = T{x{t)) = 0 (2.7.1.3) it follows that if x G c00(n,X) then Tnx G c00(n,y). If fc > 0, then also {t G H: ||x(t)|| < A:} C {t G H: ||r(x(t))|| < ||r||*} (2.7.1.4) This implies that if x G c0(H,X) then Tx G c0(0,y). Finally, if x G c^H,*), with z G X for which x- ze c0(n,X), then Tx - Tz = T{x - z) G c0(n,y), so that Tx is in c^f^y). This completes the proof of
42 2. Bounded Linear Operators (2.7.1.1). Inequality ||c+(n,r)|| < ||T|| follows from (2.6.1.2) with p = oo, and is reversed by observing that the functions 6t © x of (2.6.2.4) are all in c00(n,X). ■ Since the operator Tn : 1^(0,,X) —> l^Cl^Y) has the property that Tnc0(fi,X) C c0(n,y), there is by Section 2.3 an operator induced from /«„(*), X)/c0(n,X) to/^n^/co^y). We shall call it an "enlargement" of the operator T: 2.7.2 DEFINITION If H has a nontrivial bornology and T G BL(X, y), then the operator Qn(r) : Qn(X) —> Qn(y) (2.7.2.1) defined by setting Qn(T)(x + c0(n,x)) = Tnx + c0(n,Y) (2722) for each x G /^(njX) is called an enlargement of T. When H = N we write QN(T)=Q(T) (2.7.2.3) Explicitly, if x = (x1,x2,x3)...) G ^(X) Q(r)((x1,x2,x3,...) + c0(x)) = (rx1,rx2,rx3,...) + c0(y) (2.7.2.4) It is clear that Qq(T) is bounded and linear, with bound < ||T||. If H is nontrivial then there is equality: 2.7.3 THEOREM If H has a nontrivial bornology and T G BL(X,y), then ||Qn(T)|| = ||r|| (2.7.3.1) If also S G BL(X,y) and U G BL{Y,Z), then Qn(sT + tS) = sQn(T) + *Qn (S) for each s, * G K (2.7.3.2) and Qn(^) = Qn(U)Qn(T) (2.7.3.3)
2.8 Shift Operators 43 Proof: We know already that ||Qn(T)|| < ||T||. Conversely, if x e X is arbitrary write qn(x) = {ze i^xy.z-1 o x e c0(n,x)} (2.7.3.4) for the set of sequences z which "converge to x": then by (1.9.1.3) ||rx|| = ||qn(rx)|| = ||Qn(r)qn(x)|| < ||Qn(r)||||qn(*)ll = IIQn(r)IIMI This proves (2.7.3.1), and the rest is clear. ■ One observation will be useful later: if Y = X and T = I : X —> X, then Qn(T) is the identity on the space Qn(X): QM = I (2.7.3.5) 2.8 SHIFT OPERATORS Suppose, in contrast to Section 2.6, that X is a linear space and that <f>: A —> H is a mapping of sets: then the mapping X^ : Xn —> XA defined by setting {X*x){t) = x{<t>{i)) for each * G A, for each x e Xn (2.8.0.1) is well defined and linear. More generally, if <f> : A0 —> H is defined only on a subset A0 C A, we shall define X^ : Xn —> XA by setting, for each tGA and x e Xn, . (xUii)) if * e A0 {X+x){t) = \ m" ° (2.8.0.2) [ 0 if tg A0 If X is a normed space then we again restrict to the spaces lp(Q,X): 2.8.1 THEOREM If X is a normed space and <j> : A0 C A -> H is a mapping of sets then there is inclusion X^00(n,X)Ci00(A,X) (2.8.1.1) and inequality ll***lloo < Iklloo for each x e /oo(n.^0 (2.8.1.2) If in particular <£ is one-one, then the analogous results hold for p = 1 and p = 2. If <f> is an everywhere-defined continuous mapping of topological spaces, then also X+C^X) C CTO(A,X) (2.8.1.3)
44 2. Bounded Linear Operators Proof: If x G Xn and t G A0 then ||(X**)(*)|| = ll*(*M)ll < sup ||x(*)|| = \\x\U (2.8.1.4) sen and then (2.8.1.2) follows if we take the supremum over t G A0, since the t G A \ A0 are powerless to affect the result. We have in fact proved an extended version of (2.8.1.2), from which (2.8.1.1) follows. If <f> is one-one and p = 1 or p = 2, then for arbitrary finite A' C A0 and x G Xn, £||(x**)(*)||p = £N^))||p tEA' t€A' (2.8.1.5) = E ll*Mllp<£ll*MP *e<t>(A') flGA Raising to the power 1/p and taking the supremum over all finite A' C A0 gives the (extended) analogue of (2.8.1.2), and hence also the analogue of (2.8.1.1). For (2.8.1.3) we observe X*C(H, X) C C(A, X) (2.8.1.6) by the composition of continuous mappings. ■ If A and H have bounded structures then we shall call a mapping <f> : A —* H cofinal if for each K G Bn there is H G SA for which <f>{A \ H) C H \ K (2.8.1.7) If for example H = A = N the condition is that <f>(n) —► oo as n —► oo (2.8.1.8) 2.8.2 THEOREM EXisa normed space and <j> : A -» H is cofinal, then X+c+{n,X) C c,(A,X) for each c, = Coq,^,^ (2.8.2.1) Proof: If x G c00(H, X), then {t G H: x(*) = 0} = H \ K is the complement of a "bounded" subset if C H, and by (2.8.1.7) there is a "bounded" subset iZ" C A for which 5 e A \ H => <£(s) G H \ X => x(<£(s)) = 0 so that {5 G A: {X*x){s) ^ 0} C H is "bounded"
2.8 Shift Operators 45 This proves the first of the inclusions of (2.8.2.1), and the same argument, with {t G H: \\x(t)\\ < 1/n} in place of {t G Vl:x(t) = 0}, gives the second. For the third inclusion suppose x G c1(H, X), so that there is y G X with x — 1 © y G c0(n,X), and X+x.-X+{l © y) = X*{x -y)e c0(A,X) by the previous result, giving X^x G c1(A,X). ■ The most celebrated example of (2.8.0.1) is the simple backward shift (x1,x2,x3,...) —► (x2,x3,x4,...) (2.8.2.2) on the space lp = Zp(N, K): the mapping <j> is given by <j>{n) = n + 1. For the forward shift (x1,x2,x3,...) — (0,x1,x2,...) (2.8.2.3) on lp we need (2.8.0.2), defining <t>(n) = n - 1 if n G N + 1 = {2,3,4,....} The superficially similar operators given by <f>{n) = n ± 1 for each n G Z = {... — 1,0,1,2,...} turn out to behave quite differently. If H = N, there is inclusion, for arbitrary normed spaces X, c00(X) C l^X) C l2(X) C c0(X) C ^(X) (2.8.2.4) and we might observe that, if ^(X), /2(X) and c0(X) are each given their appropriate norms, then each of the natural injections j : /i(X) —► c0(X) J : /2(X) —, c0(X) J : lx{X) — /2(X) (2.8.2.5) is bounded, with bound < 1; this is clear from the inequalities (1.8.1.1). In each case we also have \\J\\ = 1, as is clear by looking at the vectors 6t © x of (2.6.2.5). Finally, since in each case the closure of the subspace c00(X) is the whole space, the same is true of the closure of the range of J. In the reverse direction we have a linear mapping J^ : XN —> XN defined by setting (Ei*)n = Si + *2 + --- + Zn (2.8.2.6) for each n G N, for each x G X The reader may like to verify the inclusion Ei(<iP0)CU*) (2-8.2.7) and also that the induced mapping J^ : /X(X) —> /^(X) is bounded, with II Ex || = i.
46 2. Bounded Linear Operators If H C R is an interval and a G H then there is a well-defined linear mapping /a:C(n,K)—>C(n,K) (2.8.2.8) defined by setting t (Sax)(t)= J x(s)ds (2.8.2.9) s=a the reader may interpret this either as a limit of Riemann sums or as an antiderivative. When the interval H is bounded (closed or not) then there is also inclusion /^(n^CC^K) (2.8.2.10) and the resulting linear mapping is again bounded. In the sequel we will see that such a mapping can be induced on the spaces C(H,X) for certain kinds of normed spaces X (those which are "complete"). Perhaps the most famous linear mapping of analysis is differentiation: it is disturbing to report that it is not bounded. If, for example, Y = C[0, l] = C^fO, l] and X is the subspace of Y consisting of those x G Y for which the derivative x1 exists and belongs to Y, then the mapping D : x —+ xf is linear from X to Y. We claim that, relative to the usual norm of Y, D is not bounded. Indeed if zn G X is denned by setting zn(t) = tn for each t G [0, l] then ||;gn||00 = l and H-D^Hoo = ||nsn~1||00 = n for each n G N (2.8.2.11) Thus (2.1.1.1) cannot hold for any k > 0. It is perhaps fortunate for the relevance of the theory of bounded linear operators that much of the theory of the operator D can be carried out in terms of the operators Ja, which are bounded. 2.9 COMPOSITION OPERATORS If T G BL(X, Y) and if W is another normed space then we shall write LT = BL{W,T):U —► TU from BL(W, X) to BL(W, Y) (2.9.0.1) and RT = BL(r,W) : V —► VT from BL{Y,W) to BL{X,W) (2.9.0.2) for the left and right compositions, or multiplications, associated with T. Evidently LT is obtained by further restricting the operator Tw : Xw -» Yw of (2.6.0.1), while RT is a special case of the operator W* of (2.8.0.1). As we might expect, LT and RT are also bounded linear operators:
2.9 Composition Operators 47 2.9.1 THEOREM If T G BL(X,y) and W is a normed space then LT and RT are bounded and linear, with ||LT|| < ||r|| and \\RT\\ < \\T\\. If also S G BL(X,y) and U G BL{Y,Z) then ■^aT+ts = sLj,-\-tL§ and RsT+ts = ^Rt^^^s ^or eacn s,£ G K (2.9.1.1) and ut = ^u t and. Rut = Rj>Ru ^2.9.1.2] Proo/: All this is clear from Theorem 2.2.2. ■ We shall see later—with some difficulty, in fact—that provided W ^ 0 there is equality ||LT|| = ||r|| = \\RT\\ for each T G BL(X,y) (2.9.1.3) The reader may like to verify the first equality here if W = X and if W = K, and to verify the second equality if W = Y. When W = K the space BL(W, X) is just X: 2.9.2 THEOREM If X is a normed space then there is isometric isomorphism x <—► Lx : X ^ BL(K,X) (2.9.2.1) where Lx{t)=txeX foreachJGK (2.9.2.2) Proof: This is clear. ■ If X and Y are normed spaces, and x G X, we shall also write £x : r —► Tx from BL(X, y) to Y (2.9.2.3) and call it an evaluation map. When W = K the left multiplications LT of (2.9.0.1) are indistinguishable from the operators T; by contrast the right multiplications RT = BL(T, K) of (2.9.0.2) will play a crucial role in the theory to follow. When y = K the elements of the space BL(X,Y) = BL(X, K) are known as bounded linear functionals. Finally, if X and Y are normed spaces and if y G y is a vector and / G BL(X, K) is a linear functional then the mapping fQy.x —► f{x)y from X to Y (2.9.2.4)
48 2. Bounded Linear Operators as in (1.8.0.5), is bounded and linear: the reader may like to verify that ||/0y|| = ||y||||/|| for each yG7and/G BL(X,K) (2.9.2.5) For later reference, we shall make a distinction between this "rank one operator" and a linear functional on a space of operators, writing / ® y : S —► /(St/), from BL(Y,X) to K (2.9.2.6) If X, Y and Z are normed spaces then we can look at mappings from X into the operators from Y to Z, and from Y to the operators from X to Z. 2.9.3 THEOREM If X, Y and Z are normed spaces then there is isometric isomorphism $ <—► # : BL[X,BL{Y,Z)) = BL{Y,BL{X,Z)) (2.9.3.1) given by the formula $(x)(y) = #(y)(x) for each xe X and each yG7 (2.9.3.2) Proof: In the notation (2.9.2.3) we have ¥(y) = £y o $ for each yeY (2.9.3.3) so that the correspondence (2.9.3.2) gives a linear mapping $ —> \£ of bound < 1. Also $(x)=£zo# for each xeX (2.9.3.4) giving the isometry. ■ In the sequel (Section 11.7) we shall see that we can also write BL(X,BL(y,Z)) = BL{W,Z) (2.9.3.5) with a normed space W = X ® Y depending only on X and Y. 2.9.4 THEOREM If H is a nonempty set and X and Y are normed spaces then there is isometric isomorphism Q^+TzBLiXJ^Y)) s/^n.BLpT.y)) (2.9.4.1) given by the formula $(*)(*) = Tt(x) for each x G X and each * G H (2.9.4.2)
2.10 Normed Linear Algebras 49 There are also bounded linear mappings T —> T.{x.) : /^(n.BL^y)) — ^(11, y) for each x G l^X) (2.9.4.3) and x —> r.(x.): /oo(n,x) —> /oo(n,y) for each r g /^(n.BLpr.y)) (2.9.4.4) given by the formula (T.[x.))(t) = Tt(xt) for each * G H (2.9.4.5) Proof: The argument for the first part is the same as for Theorem 2.9.3, and the second part can be left to the reader. ■ 2.10 NORMED LINEAR ALGEBRAS If A is a normed linear algebra in the sense of Definition 1.10.1, then each element a G A gives rise to two multiplication operators on the normed space A, La:b —► a • b from A to A (2.10.0.1) and Ra:b —> 6 • a from A to A (2.10.0.2) 2.10.1 THEOREM If A is a normed linear algebra then for each a G A we have \\La\\ = \\a\\ = \\Ra\\ (2.10.1.1) If also 6 G A then Lsa+tb = sL + tLb and ^*a+t& = sRa + *^6 for each 5^K (2.10.1.2) and La6 = LaLb and £a6 = £6£a (2.10.1.3) Proof: We already know that ||La|| < ||a|| and ||#a|| < ||a|| (2.10.1.4) This is simply a restatement of (1.10.1.1). To reverse the inequalities, evaluate at the identity 1: provided 1^0 |M| = ||L0(1)|| < ||L0||||1|| = ||LJ| and ||a|| = ||J2a(l)|| < ||fi8|| (2.10.1.5)
50 2. Bounded Linear Operators This argument does break down if 1 = 0, but if that happens, then A = 0 and ||a|| = 0. We have proved (2.10.1.1), and everything else is clear. ■ Theorem 2.10.1 says, in particular, that there is isometric isomorphism a <—► La : A ^ {La: a e A} C BL(A, A) (2.10.1.6) Here we intend the symbolism to record the preservation of multiplication, as well as everything else. In a sense, therefore, we have no need of Definition 1.10.1: we need only consider subalgebras of the algebras BL(X, X). If A and B are normed linear algebras then it is relevant to look at those operators T G BL(A, B) which are also homomorphisms, in the sense that T(l) = 1 and T{ab) = T{a)T{b) for each a, 6 G A (2.10.1.7) An example occurs in (2.10.1.6). More generally, T may satisfy T(l) = 1 and T(ab) + T(ba) = T(a)T(b) + T(b)T(a) (2.10.1.8) for each a, b G A We sometimes call this the Jordan property. The reader should be warned that the product of two homomorphisms is a homomorphism, but not in general the sum: the homomorphisms from one normed algebra to another do not even form a linear space. We shall write HBL(A,£) ={Te BL(A,B) : T is a homomorphism} (2.10.1.9) If T is a homomorphism of normed algebras, then its range is a subalgebra in the sense of (1.10.1.4), while its null space is an ideal in the sense of (1.10.1.5): 2.10.2 THEOREM If A and B are normed algebras and T G HBL(A, B) is a homomorphism, then T(A) is a subalgebra of B and T_1(0) is a closed two-sided ideal. If A is a subalgebra of B and if J is a closed two-sided ideal of A, then the injection J : A —+ B and the quotient K:A—+A/J are homomorphisms. Proof: This is left to the reader. ■ If a G A, then the mappings La and Ra are usually not homomorphisms. Their ranges and null spaces are however "one-sided ideals" of A: specifically L~ 1(0) and La(A) are right ideals of A, while R'1^) and
2.11 Partially Ordered Spaces 51 Ra{A) are left ideals. As usual both La1(0) and Ra 1(0) are closed, while La(A) and Ra{A) need not be. 2.11 PARTIALLY ORDERED SPACES If X and Y are partially ordered linear spaces, with cones X+ and Y+ as in (1.11.3.4), then a mapping T : X —* Y will be called positive if it has the property r(x+) c y+ (2.H.0.1) and will be called increasing, or monotonic, if for each x,y € X y - x e X+ =^ Ty - Tx eY+ (2.11.0.2) Evidently if T : X -> Y is linear then T positive <=> T monotonic (2.11.0.3) It is clear also that sums and products, and positive scalar multiples, of positive linear mappings are positive. Thus, the positive linear mappings nearly form a cone in the space of all linear mappings. The only condition that is not clear is (1.11.3.8). We can say that if T : X —> Y is linear then T positive and - T positive <=> T(X+) = {0} (2.11.0.4) For this to ensure that T = 0 we have to know that the cone X+ is generating, in the sense that the linear subspace of X generated by X+ is the whole of X. This can be simplified: if X is a real vector space the condition is X+ - X+ = X (2.11.0.5) while if X is a complex vector space the condition is X+ - X+ + iX+ - iX+ = X (2.11.0.6) 2.11.1 THEOREM If X and Y are partially ordered normed spaces and X has a generating cone X+, then BL(X, Y) is a partially ordered normed space with cone BL+(X,y) = {TeBL{X,Y):T{X+) C Y+} (2.11.1.1) Proof: All that remains to be checked is that BL+(X, Y) is closed: we leave this to the reader. ■
52 2. Bounded Linear Operators E X and Y have generating cones, so that BL(X, Y) is a partially- ordered normed space, it ip not clear that also BL(X, Y) should have a generating cone.
3 Invertibility and Singularity A bounded linear operator T : X -» Y between normed spaces is called "invertible" if it has a bounded linear inverse, in the sense of S G BL(Y, X) for which ST = I on X and TS = I on Y. If T is to be invertible then it will have to be "nonsingular" in various ways; conversely, if T is nonsingular in enough of these ways then it is invertible. 3.1 INVERTIBILITY AND ISOMORPHISM If normed linear spaces X ~ Y are isomorphic in the sense (1.7.0.1), then we can write the one-one correspondence X <-► Y in the form x <—► Tx : X <—► Y (3.1.0.1) thus defining a mapping T : X —* Y which will be one-one and onto, and linear in the sense of (2.1.0.1). To preserve topology it must also satisfy Nbdy {Tx) = T(Nbdx(x)) for each x e X (3.1.0.2) In particular the mapping T will have to be continuous. By symmetry we can equally well write the correspondence X <-► Y in the form y <—> Sy : Y <—► X (3.1.0.3) thus the mapping T : X —► Y must have an inverse S : Y —► X which is both linear and continuous. Such mappings T are called invertible. 3.1.1 DEFINITION T G BL(X,Y) is said to be invertible if there is SeBL{Y,X) for which ST = I eBL{X,X) and TS = IeBL{Y,Y) (3.1.1.1) 53
54 3. Invertibility and Singularity We shall write BL *(X, Y) for the set of invertible mappings in BL(X, Y), and aBL(X,y) = BL(X,y)\BL_1(X,y) (3.1.1.2) We can summarize the connection between invertible mappings and isomorphism: X ~ Y <^> BL_1(X,y) ^ 0 (3.1.1.3) The identity / : X —► X is always invertible, and provides its own inverse. The zero 0 : X —► Y is almost never invertible, unless in fact X = Y = 0, in which case 1 = 0; more generally if an idempotent E = E2 : X —► X is invertible, then E = I. The inverse of an invertible mapping is unique, and the product of invertible mappings is invertible: 3.1.2 THEOREM If T e BL(X,y) is invertible, then its inverse T'1 is unique. If also S G BL(y, Z) is invertible then so is ST, with (ST)'1 =T~1S-1 (3.1.2.1) Proof: If U and V are inverses to T G BL(X, Y), then U = UI = U(TV) = (UT)V =IV = V (3.1.2.2) If S and T are invertible, with inverses 5_1 and T_1, then [T^S'^iST) = r-1(5"15)T = T~lT = I and (3.1.2.3) {ST){T~1S-1)=SS-1 = 1 m The formula (3.1.2.1) is the famous "reversal of product" for inverses. In general the sum of two invertible operators need not be invertible: 3.1.3 THEOREM If T e BL(X,y) and S e BL(y,X), then i-STeBL-1 (x,x)^^i-ts eBL-1 (y,y) (3.1.3.1) If T e BL_1(X,y) is invertible and U e BL(X,y) then J - T~lU e BL_1(X,X) =^ T - U e BL_1(X,y) (3.1.3.2) and / - ut~1 e BL-1 (y, y)=^t-u e bl_1 (x, y ) (3.1.3.3)
3.1 Invertibility and Isomorphism 55 Proof: Suppose that J — ST is invertible with inverse U, so that U{I - ST) = I = {I - ST)U (3.1.3.4) Then we claim (J + TUS){I-TS) = I={I- TS){I + TUS) (3.1.3.5) For example (J + TUS){I -TS) = I + T(U-I- UST)S = 1 + 0 (3.1.3.6) This proves forward implication in (3.1.3.1), and the reverse is obtained by interchanging S and T. For (3.1.3.2) use (3.1.2.1) with the observation that T-U = T{I-T~1U) The proof of (3.1.3.3) is similar; alternatively it follows from (3.1.3.2) and (3.1.3.1). ■ The significance of the second part of Theorem 3.1.3 will be that if we add a "sufficiently negligible" operator to an invertible operator we get another. In (3.1.1.2) we characterize topological isomorphism X ~Y: for isom- etry X = Y as in (1.7.0.4) we require T G BL_1(X,y) with ||Tx|| = ||x|| for each xeX (3.1.3.7) Necessary and sufficient is T G BL_1(X,y) with ||r|| < 1 and {{T^W < 1 (3.1.3.8) We can extend the concept of isomorphism between spaces to one of isomorphism between operators: this if T G L{X, Y) and Tf G L{Xf,Yf) are linear we shall write T ~T' (3.1.3.9) to mean that there are invertible operators U G BL_1(X, X1) and V G BL"1 {Y,Y') for which VT = T'U (3.1.3.10) and write T = Tf (3.1.3.11) if, in addition, the operators U and V satisfy (3.1.3.7). Evidently if
56 3. Invertibility and Singularity (3.1.3.10) holds then T bounded <^ T' bounded (3.1.3.12) while if (3.1.3.11) holds then also ||r'|| = ||r|| (3.1.3.13) We conclude with the remark that the formation of inverses is itself a continuous process: 3.1.4 THEOREM If X and Y are normed spaces, then the mapping T —-> T~l : BL_1(X,y) —► BL{Y,X) (3.1.4.1) is continuous at each point S G BL_1(X, Y). Proof: If S and T are both in BL_1(X,y), then T~l -S~l =T-1{S-T)S~1 = (T-1 - 5_1)(5 - r)5_1 + S'^S - r)5_1 (3.1.4.2) and hence (i - \\s - rmi^-^Diir-1 - s_1|| < lls^Hlls - r||||-s,-1|| (3.1.4.3) This forces T"1 -> S"1 as T -> 5: specifically, as soon as lis-rims"1!! < | (3.1.4.4) we have ||r-i_5-i|| <2||5-1||2||r-5||—>0 as ||r-5|| —>0 ■ (3.1.4.5) 3.2 MONOMORPHISMS AND EPIMORPHISMS We recall the various subspaces associated with T G BL(X, Y) in Section 2.3: 3.2.1 DEFINITION T G BL(X, Y) and is said to be onto if T{X) is said to be one-one if = {0} (3.2.1.1) = Y (3.2.1.2)
3.2 Monomorphisms and Epimorphisms 57 More generally T is said to be dense if cl(TX) = Y (3.2.1.3) We shall write 7rleftBL(X,y) = {Te BL(X,Y):T is not one-one} (3.2.1.4) and 7rrightBL(X,y) = {Te BL(X,y):T is not dense} (3.2.1.5) ETG BL(X, Y) is invertible in the sense of Definition 3.1.1 then it will certainly be both one-one and onto; conversely, in the presence of certain background conditions, if T is one-one and onto then it is also invertible. To see this, however, requires a deep theorem: much of what follows will be devoted to just that. We begin with a certain "monomorphic" property of one-one operators, and find that the corresponding "epimorphisms" are the dense operators rather than the onto operators. These notions use the left and right multiplications LT and RT of Section 2.9: 3.2.2 THEOREM If T G BL(X,y), then T one-one «<=> LT = BL(W, T) one-one for all normed spaces W (3.2.2.1) and T dense <=> RT = BL(T,W) one-one for all normed spaces W (3.2.2.2) Proof: If T G BL(X,y) is one-one, then for arbitrary W and U G BL{W,X) TV = 0 ==> TUw = 0 for each w e W ==> Uw = 0 for each w e W (3.2.2.3) which is that we mean by U = 0. Conversely, if T is not one-one then the null space T_1(0) is not {0}, and hence the natural injection ker(T) = J : T_1(0) —► X is not the zero operator. Thus U = ker(T) =^TU = 0^U (3.2.2.4) Towards (3.2.2.2), if T is onto and V : Y -► W is linear, then VT = 0 => V = 0. If we only assume that T is dense, but insist that V is bounded, then VT = 0 ==> VTx = 0 for each xeX =^Vy= lim VTxn = 0 for each y eY (3*2-2*5)
58 3. Invertibility and Singularity Conversely, if T is not dense then the quotient Y/ cl(TX) is not 0, and the quotient mapping coker(T) = K : Y —► Y/ c\(TX) is not the zero operator. Thus, V = coker(T) ==> VT = 0 ^ V m (3.2.2.6) There is an alternative derivation for the first part of Theorem 3.2.2: we can take the space W to be the scalar field K. Indeed, if T is not one-one there will be x G T_1(0) for which x ^ 0, and now we can take U = Lx : t -► tx as in (2.9.2.2): 0 ^ x e T_1(0) =*► TLX =0^LX (3.2.2.7) Very much later in Section 5.6 we shall see that we can also use W = K for the second part of Theorem 3.2.2, but only with the help of another deep theorem. The reader is invited to speculate under what conditions on T the operators LT will all be dense, or the operators RT. The conditions under which a product ST is either one-one, or dense, are interesting: 3.2.3 THEOREM If T e BL(X,Y) and S e BL{Y,Z) then there is implication S one-one and T one-one => ST one-one => T one-one (3.2.3.1) and S dense and T dense =*► ST dense =*► S dense (3.2.3.2) Proof: If S and T are both one-one and if STx = 0 then Tx = 0 and hence x = 0, which is the first implication of (3.2.3.1). Also if ST is one-one and Tx = 0, then also STx = 5(0) = 0, giving x = 0. The same argument, using the operators RT and Rs, gives (3.2.3.2): alternatively the reader may like to supply the direct argument. ■ We remark that also S onto and T onto =*► ST onto =*► S onto (3.2.3.3) ETG BL(X, Y) and if W is a closed subspace of X then whether or not T is one-one on X can be tested by looking at the spaces W and X/W. 3.2.4 THEOREM If T e BL(X, Y) and if^CXandZCF are closed subspaces then T one-one <=> r_1(0) n V7 = {0} and T_1(0) C W (3.2.4.1)
3.2 Monomorphisms and Epimorphisras 59 and T dense <=► c\{Z + cl(TX)) = Y and Z C cl{TX) (3.2.4.2) Proo/: The right-hand side of (3.2.4.1) is equivalent to r_1(0) = {0}. If T is dense then the conditions on the right-hand side of (3.2.4.2) certainly hold. Conversely, if they are assumed, then for each y £ Y there are (xn) in X and (zn) in Z for which ||y — Txn — zn\\ < l/n, and (xfn) in X for which \\zn - Tx'J < l/n, giving \\y - T{xn + x(J|| < 2/n. ■ One way round, one-oneness and denseness can be tested by enlargement. 3.2.5 THEOREM If T e BL{X,Y), then Q(r) one-one =^ T one-one (3.2.5.1) and Q(r) dense =J> T dense (3.2.5.2) Proof: If T is not one-one then there is x G X with i^0 = Tx, and hence q(x)^0 = Q(r)q(x) (3.2.5.3) where q(x) = qN{x) is given by (2.7.3.4). This proves (3.2.5.1). If Q(T) is dense then for each y G Y and each e > 0 there is xe = (x*) in /00(X) for which limsup \\y - Txen\\ < e (3.2.5.4) n—*oo Thus, for each e > 0 there is ze = xeN for which ||y - Tze\\ < 2e. ■ The analogues of (3.1.3.1) hold: 3.2.6 THEOREM If T e BL{X,Y) and S e BL{Y,X), then I- ST one-one <=> I-TS one-one (3.2.6.1) and J - ST dense <=> I-TS dense (3.2.6.2) Proof: If J — ST is one-one and y G Y, then (/ - TS)y = 0=^{I- ST)Sy = S{I - TS)y = 0 =j> Sy = 0 =}> y = T(5y) = 0
60 3. Invertibility and Singularity This proves (3.2.6.1). Towards (3.2.6.2) suppose that I - ST is onto, so that if x G X then x = (I — ST)x' for some x' G X: then it follows yeY=^y = {I-TS){y + T{Sy)') where Sy = {I - ST){Sy)' (3.2.6.3) Thus we have shown J - ST onto <=► I-TS onto (3.2.6.4) To prove (3.2.6.2) suppose that ||x - {I - ST)x'€\\ < e and argue \\y-{I-TS){y + T{Sy)'t\\ = \\T{Sy-{I-ST){Sy)'e)\\<\\T\\e ■ The gulf between invertibility, and being one-one and dense, can be seen in the canonical factorization (2.3.3.2): evidently the mapping core(T) : X/T~10 —► c\[TX) is always both one-one and dense, and, in the sense of (3.1.3.10) T one-one and dense <=>► T = core(T) (3.2.6.6) Indeed we have T one-one <*=> ker(T) invertible (3.2.6.7) and T dense <*=> coker(r) invertible (3.2.6.8) In a sense, the kernel and the cokernel mappings perform a preliminary, crude analysis of the invertibility or otherwise of T G BL(X, Y), and if we clear them away, we are left with the mapping core(T), which in a sense performs more delicate experiments: 3.2.7 DEFINITION T G BL(X, Y) will be called proper iff core(T) is invertible (3.2.7.1) Evidently T invertible <$=> T one-one, dense and proper (3.2.7.2) 3.3 BOUNDEDNESS BELOW In general, it turns out not to be sufficient for T G BL(X, Y) to be invertible, that it be one-one and dense. To bridge the gap we can either strengthen one-oneness, or strengthen denseness. Here we strengthen one-oneness:
3.3 Boundedness Below 61 3.3.1 DEFINITION T e BL(X,Y) is said to be bounded below if there is k > 0 for which ||x|| < Jfc||rx|| for each xeX (3.3.1.1) and will be called closed if T is bounded below and TX = c\(TX) (3.3.1.2) We shall write rleft BL(X, Y) = {Te BL(X, Y): T is not bounded below} (3.3.1.3) and rleft BL(X,y) = {Te BL{X,Y):T is not closed} (3.3.1.4) For example if ||Tx|| = ||x|| as in (3.1.3.6) then T is bounded below. In general, boundedness below lies somewhere between invertibility and one-oneness, and has the same kind of semigroup property as one-oneness. 3.3.2 THEOREM If T e BL(X,y) and S e BL{Y,Z) then there is implication 5, T bounded below ==> ST bounded below ==> T bounded below (3.3.2.1) and 5, T closed =^ ST closed =^ T closed (3.3.2.2) Also T invertible =*► T closed =*► T bounded below =*► T one-one (3.3.2.3) Proof: If ||x|| < Jfc||rx|| and ||y|| < k'\\Sy\\ for each x e X and each yeY, then ||x|| < Jfc||rx|| < A;'A;||5rx|| for each x e X giving the first part of (3.3.2.1). If instead ||x|| < A;"||5rx|| then ||x|| < A;"||5rx|| < A;"||5||||rx|| = Jfe||rx|| with k = k"\\S\\ giving the second part of (3.3.2.1). Toward (3.3.2.2) we claim that if T e BL(X,y) is closed and K C X then K = c\{K) =}> T{K) = c\{TK) (3.3.2.4)
62 3. Invertibility and Singularity for if ||2/ — Txn\\ —► 0 with y £ Y and xn G K, then, in particular, y G cl(rX) = TX, so that there is x G X for which y = Tx: now ||z - xj < k\\Tx - TxJ = k\\y - TxJ — 0 so that x e K and hence y = Tx e T(K). Applying (3.3.2.4) with S in place of T and TX in place of K gives the first part of (3.3.2.2). If ST is closed and \\y - Txn\\ -► 0 then \\Sy - STxn\\ -► 0, so that Sy = STx for some iGl: now \\y - Tx\\ < k'\\Sy - STx\\ = 0 (3.3.2.5) For (3.3.2.3) the first part follows from the second part of (3.3.2.2) with S = T_1, the second part is obvious, and the third part follows from the argument of (3.3.2.5). ■ The set of bounded below operators has an interesting topological property: 3.3.3 THEOREM If X and Y are normed spaces then {J G BL(X,Y):T is bounded below} is an open set (3.3.3.1) Proof: If T G BL(X,Y) is bounded below, with k > 0 satisfying (3.3.1.1), and if T' e BL(X,F) satisfies A;||r'-r|| < 1, then (1.2.0.11) gives, for each xeX, \\T'x\\ > \\Tx\\ - \\{T' - T)x\\ > i(l - JfeHT' - r||)||x|| ■ (3.3.3.2) Boundedness below can be tested by left composition operators: 3.3.4 THEOREM If T e BL{X,Y), then T bounded below <$=>► LT bounded below for all normed spaces W (3.3.4.1) Proof: If T G BL(X,F) is bounded below, with k > 0 satisfying (3.3.1.1), then for arbitrary W and U G BL(W,X) we have ||0ti;|| < Jfc||r0ti;|| for each w €iW, hence \\U\\= sup \\Uw\\ < sup ife||r^ti;||=ife||ri7|| IMI<i IIHI<i Thus, each LT = BL(W,T) also satifies (3.3.1.1). Conversely, if LT = BL(K,T) is bounded below, where W = K is the scalar field then there is
3.3 Boundedness Below 63 k > 0 for which, for each iGl, ||x|| = ||£J| < k\\LT(Lx)\\ = k\\LTx\\ = k\\Tx\\ Here we are using the same operators Lx of (2.9.2.2) as in (3.2.2.7). ■ Boundedness below can be tested by enlargement: 3.3.5 THEOREM If T G BL(X,y), then T bounded below «$=>• Qq{T) one-one for all bornological spaces 0 (3.3.5.1) In particular, taking 0 = N, Q(r) one-one =>• T bounded below =>• Q(T) bounded below (3.3.5.2) Proof: Suppose T is bounded below, with k > 0 satisfying (3.3.1.1), and suppose that x G l^Q^X): we claim Tnx e c0(n, y) =j> x e c0(n,x) (3.3.5.3) This is because, for each e > 0, {t e 0: ||x(t)|| > e}C{te n:ife||rx(t)|| > e} bounded in 0 Conversely, if T is not bounded below, then (3.3.1.1) must fail for each k = n G N, so that there are vectors x'n G X for which HTxJJI < rc-1!!^!!' and now if xn = x'n/\\xfn\\ then ||xn|| = 1 and ||T£n|| < l/n, giving x = ixn) € /oo(X) with x £ c0{X) and TNx G c0{Y) (3.3.5.4) This finishes the proof of (3.3.5.1), and gives the first part of (3.3.5.2). If T is bounded below, with k > 0 satisfying (3.3.1.1), then for arbitrary dist(z,c0(X)) = limsup||a;n|| < limsupfc||ra;n|| = fcdist(TNz,c0(F)) n—»-oo n—^oo which is the second part of (3.2.5.2). ■ Boundedness below and closedness can be tested on a subspace and its quotient:
64 3. Invertibility and Singularity 3.3.6 THEOREM If T e BL(X,Y) and W C X is a subspace then T is bounded below iff there are k' > 0 and k" > 0 for which HI < k'\\Tw\\ for each w e W and dist(x, W) < k"\\Tx\\ for each x e X Also T is closed iff in addition (3.3.6.1) (3.3.6.2) TW = c\{TW) and c\{TX) CTX + TW (3.3.6.3) Proof: If T is bounded below, with k > 0 satisfying (3.3.1.1) then (3.3.6.1) and (3.3.6.2) hold with k' = k" = k. Conversely, if x G X is arbitrary then there is (wn) in W with ||x — tun|| —► dist(x, W), and now using (3.3.6.1) lire < llx — tu_|| + l|wj| " -" n" " n" (3.3.6.4) < \\x-wn\\+k\\T{wn-x)\\+k'\\Tx\\ foreachnGN Letting n —► oo and using (3.3.6.2) gives ||x|| < (1 + A;,||r||)dist(x,^) + A;,||rx|| < (A;'+A;"+A;V||r||)||rx|| (3.3.6.5) If also (3.3.6.3) holds and ||y-rxn|| -»• 0 then y = Tx + Tw = T(x + w). ■ We conclude by noting that boundedness below, and closedness, do their duty as strengthened versions of one-oneness: 3.3.7 THEOREM If T e BL(X, Y) then T is invertible iff T is bounded below and onto (3.3.7.1) and also iff Also T is closed and dense (3.3.7.2) T closed <=> T one-one and proper (3.3.7.3) Proof: If T is invertible then both (3.3.7.1) and (3.3.7.2) are satisfied, and indeed are obviously equivalent to one another. Conversely, if (3.3.7.1) holds, with k > 0 satisfying (3.3.1.1), then, in particular, T is one-one and onto, hence has a linear inverse S : Y —► X. We claim that S is bounded: for if y G Y is arbitrary then by (3.3.1.1) \\Sy\\ < k\\TSy\\ = k\\y\\ (3.3.7.4)
3.4 Openness 65 Towards (3.3.7.3) we observe, using the canonical factorization (2.3.3.2), T one-one =► T = J coie(T) with J closed (3.3.7.5) also T and core(T) have the same range TX. Thus, if T is closed then it is one-one, giving (3.3.7.5), which combined with (3.3.2.2) makes core(T) closed, therefore by (3.3.7.2) invertible, which is what (Definition 3.2.5) we mean by calling T proper. Conversely, if T is one-one and proper then again (3.3.7.5) holds, and this time (3.3.7.2) combines with (3.3.7.5) to make T closed. ■ 3.4 OPENNESS It would be satisfying to find that the strengthened version of "dense" was "onto." At this stage the situation is much more complicated: 3.4.1 DEFINITION T e BL(X,Y) is said to be open if there is k > 0 for which y e {Tx: \\x\\ < k\\y\\} for each y eY (3.4.1.1) and is said to be almost open if there is k > 0 for which y e cl {Tx: \\x\\ < k\\y\\} for each y eY (3.4.1.2) We shall write rrightBL(X,y) = {Te BL{X,Y):T is not open} (3.4.1.3) and rright BL(X, Y) = {T e BL(X, Y): T is not almost open} (3.4.1.4) By translation invariance, T G BL(X, Y) is open if, and only if for each KCX K = int(ff) =}> T{K) = int(rfT). (3.4.1.5) Openess lies somewhere between invertibility and ontoness, while almost openess lies somewhere between openness and denseness: 3.4.2 THEOREM If T e BL(X,F) and S e BL{Y,Z) then there is implication 5, T open => ST open =^ S open (3.4.2.1) and S,T almost open ==> ST almost open ==> S almost open (3.4.2.2)
66 3. Invertibility and Singularity Also T invertible =► T open =► T onto =► T dense (3.4.2.3) and T open =► T almost open =► T dense (3.4.2.4) Proo/: If y G {Tx: ||x|| < k\\y\\} and 0 G {Sy: \\y\\ < k'\\z\\} for each y eY and z €: Z then 0 G {STx: ||x|| < *||y|| < kk'\\z\\} for each zeZ giving the first part if (3.4.2.1). If instead z G {STx: \\x\\ < k"\\z\\} then « e {firiri Hirll = ||r*|| < lirilH^H < *##||r||||*||> giving the second part of (3.4.2.1). The proof of each part of (3.4.2.2) is almost the same, word for word, putting in the closure operator. The first part of (3.4.2.3) follows from the second part of (3.4.2.1), with S = T"1, and the rest is clear. ■ The almost open operators form an open subset of BL(X, Y): 3.4.3 THEOREM If X and Y are normed spaces, then {T G BL(X,Y):T is almost open} is an open set (3.4.3.1) Proof: If T G BL(X, Y) is almost open, with k > 0 satisfying (3.4.1.2), and if &||T'—T|| < 1, we claim that T' is almost open, with k! = k/(l—S) in place of A; whenever A;||r'-r|| < 6 < 1. Indeed if e > 0 withe+A;||r'-r|| = 8 < 1, then for each y G Y there is xx G X for which ||y — Txx|| < e||y|| with Ikill <*l|y||, so that also Wy-T'x^ < (e+*||r-r'||)||y|| = %||, and then x2 G Xfor which \\y-T'x1-Tx2\\ < eHy-T'xJ with ||x2|| < JfcHy-T'xJ, so also ||y — T'xx — Tfx2\\ < £2||y||- Inductively there is a sequence (xn) in X for which ||y-r'(*1+*2+ • •• + *„)II <<nyII with||xn||<W',-1||y|| (3.4.3.2) which gives ||*1+*2 + ... + *n||<(l + «5+.-. + 5n-1)<^M ■ (3.4.3.3) One way around almost openness can be tested by composition operators:
3.4 Openness 67 3.4.4 THEOREM If T G BL{X,Y), then T almost open ==> RT bounded below for all normed spaces W (3.4.4.1) Proof: If T G BL(X,Y) is almost open, with k > 0 satisfying (3.4.1.2), then for arbitrary W and V G BL(Y,W) ||Vr||=sup{A;||Vry||:y€cl{rx:||x||<l}} = A; sup \\VTx\\ = k\\VT\\ IMI<i which says that (3.3.1.1) holds for RT = BL(T, W). ■ Much later, in Section 5.6, using a deep theorem, we will find that if RT = BL(T,W) is bounded below with W = K, then T has to be almost open. We can also test for almost openness using enlargements. 3.4.5 THEOREM If T G BL{X,Y), then T almost open ^=> Q^(^) almost open for all bornological spaces 0 (3.4.5.1) In particular, taking 0 = N, Q(r) almost open =^ T almost open => Q(T) open (3.4.5.2) Proof: Suppose T G BL(X,Y) is almost open, with k > 0 satisfying (3.4.1.2): then for arbitrary y G ^(OjY) and arbitrary <f> : 0 —► ]0, oo[ there is x G Xn for which \\x[t)\\ < k\\y{t)\\ and \\y{t) - Tx{t)\\ < <f>{t) for each t G 0 (3.4.5.3) In particular IHIco < *ll»lloo < «> (3-4-5-4) ensuring x G l^QjX). By taking <£(•) = e for arbitrary e > 0 we obtain the forward implication in (3.4.5.1); by taking, if 0 = N with the finite homology, 4>{n) = 1/n for each neN (3.4.5.5) we also obtain the second implication of (3.4.5.2). For the first implication of (3.4.5.2), which will also reverse the implication (3.4.5.1), take 0 = N and suppose that Q(T) is almost open, with k > 0 satisfying the analogue of (3.4.1.2): then for each y G Y and each e > 0 there is xe = (x£) in l^iX) for which limsup ||x* || < &||y|| and limsup ||y - Txen\\ < e. (3.4.5.6) n—KDo n—*oo
68 3. Invertibility and Singularity Thus if k < kf, and e>Owe can find ze = xeN for which \\ze\\ < k'\\y\\ and \\y - Tze\\ < 2e ■ (3.4.5.7) In section 5.7, using duality, we can improve (3.4.5.2) to the analogue of (3.3.5.2). Openness and almost openness can be tested on a subspace and its quotient. 3.4.6 THEOREM If T G BL(X,Y) and Z C Y is a subspace, then T is open iff there are k' > 0 and k" > 0 for which y G {Tx: \\x\\ < k"dist{y,Z)} + Z for each y eY (3.4.6.1) and z G {Tx: \\x\\ < k'\\z\\} for each z G Z (3.4.6.2) Also T is almost open iff there are k' > 0 and k" > 0 for which y e cl {{Tx: \\x\\ < k" dist{Y, Z)} + Z) for each y eY (3.4.6.3) and z e cl {Tx: \\x\\ < k'\\z\\} for each z e Z (3.4.6.4) Proof: If T is open, with k > 0 satisfying (3.4.1.1), then (3.4.6.1) and (3.4.6.2) hold whenever k" > k! = k. Conversely, if y G Y is arbitrary, then by (3.4.6.1) there are x G X and z G Z for which y = Tx + z with ||x|| < k"dist(y,Z) < k"\\y\\, and by (3.4.6.2) x' e X with Tx' = z and ||x'||<A;'||2||. Thus y = T(x + *') with ||x + x'|| < ||x|| + k'\\y - Tx\\ <(l + *W|)*ly|| + *'||y|| This proves the first part. If instead T is almost open, with k > 0 satisfying (3.4.1.2), then (3.4.6.3) and (3.4.6.4) hold with k" = k' = k. Conversely, if y G Y is arbitrary, then by (3.4.6.3) there is (xn) in X for which dist(y - Txn,Z) < 1/n and ||xn|| < k"dist{y,Z) < k"\\y\\ (3.4.6.6) and by the Riesz lemma result (1.5.1.1) there is (zn) in Z for which \\y-Txn-zn\\<dist(y-Txn,Z) + ± and \\zj < 2||y - TxJ (3.4.6.7)
3.4 Openness 69 Finally by (3.4.6.4) there is (x'J in X for which |K - 2VJ| < \ and ||x'J| < k'\\zn\\ (3.4.6.8) n Putting these together gives ||y-rxB-2VB||<3/n—►(> and (3.4.6.9) \\xn + x'n\\<{k" + 2k' + 2k'k"\\T\\)\\y\\. ■ We conclude by noting that openness does its duty as a strengthened version of denseness: 3.4.7 THEOREM If T e BL(X, Y) then T is invertible iff T is one-one and open (3.4.7.1) Also T open <$=>• T dense and proper (3.4.7.2) Proof: If T is invertible then (3.4.7.1) holds. Conversely, if (3.4.7.1) holds, with k > 0 satisfying (3.4.1.1), then in particular T is one-one and onto, hence has a linear inverse S : Y —> X. We claim that S is bounded, for if y G Y there is x G X with T{Sy-x) =y-Tx = 0 and ||x|| < *||y|| (3.4.7.3) Since T is one-one this gives ||5y|| = N|<*||y|| (3.4.7.4) Towards (3.4.7.2) we claim T dense =^ T = coie(T)K with K open (3.4.7.5) This uses the canonical factorization (2.3.3.2), and the Riesz lemma (1.5.2.1) which says that the quotient mapping K = coker(ker(T)) : X —► X/r_10 satisfies (3.4.1.1) with k = l/t whenever 0 < t < 1. Now if T is open, then it is also dense, giving (3.4.7.5), and (3.4.7.5) together with (3.4.2.1) says that core(T) is open, therefore by (3.4.7.1) invertible. This gives forward implication (3.4.7.2). Conversely, if T is dense and proper then core(T) is open and now (3.4.7.5) together with (3.4.2.1) tells us that T is open. ■
70 3. Invertibility and Singularity The precise relationship between almost openness and invertibility seems more elusive. As part of the argument for (3.4.7.2) we observed that the quotient K : X —> X/Y associated with a closed subspace Y —► X is always an open mapping. Another example is a nonzero linear functional (bounded or not). Indeed, if 0 ^ / G L{X, K) then there be x1 G X for which f{x1) = 1 (take xx = x0/f(x0) if f(x0) ^ 0): now if t G K then t = /(toj and HtxJ < Wx^t] (3.4.7.6) which is (3.4.1.1) with k = {{x^. 3.5 BOUNDARY MAPPINGS Theorem 3.4.3 tells us that the almost open mappings form an open subset of BL(X,y). Thus if T G BL(X,Y) is in the topological boundary of the set of almost open mappings, then T is not itself almost open. It turns out that T is not bounded below either. The auxiliary results needed for this are of interest in their own right: 3.5.1 THEOREM If T e BL(X, Y) and Tn e BL(X, Y) for each neN, then T bounded below, Tn dense, and ||T - Tn|| —► 0 =^ T dense (3.5.1.1) and T bounded below and dense ==> T almost open (3.5.1.2) Proof: Towards (3.5.1.1) suppose y G Y is arbitrary: then there is (xn) in X for which ||y-TBa:J|<||r-rJ|—0 asn^oo (3.5.1.3) which gives, using (3.3.1.1), ||y-rxn||<||y-rnxn|| + ||r-rn||||xn||<||y-rnxn|| + fc||r-rn||||rxj| < \\y - Tnxn\\ + k\\T - Tj\\Txn - y\\ + k\\T - Tn\\\\y\\ so that (i-*||r-rj|)||y-r*B|| < ||y-rB«j|+*||r-rj|||y|| < (i+fc||y||)||r-rj| This forces ||y — Txn|| —► 0, and since y G Y is arbitrary, makes T dense. Towards (3.5.1.2) we can apply the Riesz lemma result (1.5.1.1) to see that Y C cl(Z) ==> y G c\{z G Z: \\z\\ < \\y\\} for each y G Y (3.5.1.4)
3.5 Boundary Mappings 71 whenever Z is a linear subspace of Y. The natural injection J : Z —► Y is almost open whenever it is dense. Alternatively we can prove (3.5.1.4) directly by finding (zn) in Z for which ||y — zn\\ —► 0 and then taking if|M<W . . (llvll/ll*»ll)*» if|l*»ll>llvll The reader should verify that ||y — 2JJI —► 0. Now we obtain (3.5.1.2) by taking Z = TX in (3.5.1.4): if k > 0 satisfies (3.3.1.1) then Y C cl(rX) =>ye c\{Tx: \\x\\ < k\\Tx\\ < k\\y\\} for each y G Y ■ (3.5.1.6) The reader may, alternatively, like to verify that the sequence (xn) of (3.5.1.3) satisfies \\xfn\\ < kf\\y\\ for some k' > 0. The boundary result follows easily: 3.5.2 THEOREM If T e BL(X,Y) is in the boundary of the almost open operators, then T is not bounded below. Proof: Suppose that T G BL(X, Y) is in the closure of the almost open operators, so that there is (Tn) in BL(X,Y) for which Tn is almost open and ||T - Tn\\ —► 0 (3.5.2.1) Then by (3.4.2.4) Tn is dense. By (3.5.1.1) T would be dense if it were bounded below, and hence (3.5.1.2) almost open: but this would exclude T from the boundary. ■ The product of two of these "boundary operators" is a boundary operator: 3.5.3 THEOREM If T e BL(X,Y) and 5 G BL(Y,Z) are boundary operators, then so is ST. Proof: By (3.4.2.2) there is implication, S,T G closure (almost open) ==> ST G closure (almost open) =>• S G closure (almost open) (3.5.3.1) and, hence, if S and T are both in the boundary of the almost open operators then ST is in the closure of the almost open operators. If ST is actually almost open, then by (3.4.2.2) S is almost open, hence by (3.4.3.1) not in the boundary. ■
72 3. Invertibility and Singularity We cannot expect that if ST is a boundary operator then so is 5, or that if ST is a boundary operator then so is T: for if either S = 0 or T = 0 then ST = 0, and if Y = X ^ 0 then .0 G cl{e:J:e: >:0} is a boundary operator. In the notation of (3.2.1.5), (3.3.1.5), and (3.4.1.4), Theorem 3.3.3 and Theorem 3.4.3 tell us that rleft BL(X, Y) and rright BL(X, F) are closed subsets of BL(X, Y), and Theorem 3.5.1 gives inclusion fright BL(X, Y) C fleft BL(X, Y) U int 7rright BL(X, Y) C fleft BL(X, y) U int tright BL(X, F) Then Theorem 3.5.2 says that af right BL(X? y) g f left gj^ y} and Theorem 3.5.3 says that afright BL(X?y) . afright BL(X?y) g <9fright BL(X,y) Theorem 3.5.1 has an application to enlargements. 3.5.4 THEOREM If T e BL{X,Y) then Q(T) one-one and dense ==> T bounded below and dense =^ Q(r) invertible (3.5.4.1) Proof: If Q(T) is one-one and dense then by (3.3.5.2) and (3.4.5.8) T is bounded below and dense, hence by (3.5.1.2) bounded below and almost open, which by (3.3.5.2) and (3.4.5.2) makes Q(T) bounded below and open, hence invertible. ■ Theorem 3.5.1 has an application to proper mappings. We begin with a definition. 3.5.5 DEFINITION T e BL(X,Y) is called relatively open iff the induced mapping x —► Tx : X —> TX is open (3.5.5.1) and is called relatively almost open iff the induced mapping x —>TX:X —> TX is almost open (3.5.5.2) This induced mapping is liable to be bypassed in the canonical factorization (2.3.3.2). To catch it we should look at core0(r) : x + T_1(0) —> Tx from X/r_1(0) to TX (3.5.5.3) (3.5.3.2) (3.5.3.3) (3.5.3.4)
3.6 Left and Right Invertibility 73 This "precore" of T is always one-one and onto, and we have core(r) = Jcore0(r) (3.5.5.4) with J : TX —► cl(rX) bounded below and dense (3.5.5.5) Using the arguments of (3.3.7.3) and (3.4.7.2) it is clear that T relatively open «*=>• core0(T) open ^=> core0(T) bounded below (3.5.5.6) and T relatively almost open <*=>> core0(T) almost open (3.5.5.7) 3.5.6 THEOREM If T 6 BL(X,Y), then T relatively open <$=>► core(T) bounded below (3.5.6.1) T relatively almost open <=> core(T) almost open (3.5.6.2) T proper <=► core(T) open <=> core(T) closed (3.5.6.3) Proof: By (3.3.2.1) and the factorization (3.5.5.4), core(T) is bounded below if and only if core0(T) is bounded below. This proves (3.5.6.1). By Theorem 3.5.1 the mapping J of (3.5.5.5) is almost open: now by (3.4.2.2) and the factorization (3.5.5.4), core(T) is almost open if and only if core0(T) is almost open. The last part (3.5.6.3) is only (3.3.7.2) and (3.4.7.1) applied tocore(r). ■ 3.6 LEFT AND RIGHT INVERTIBILITY A left or a right inverse satisfies only half the conditions for an inverse: 3.6.1 DEFINITION T 6 BL{X,Y) is said to be left invertible if there is T' GBL(y,X) for which r'r = jgbl(x,x) (3.6.1.1) and is said to be right invertible if there is T" G BL(Y,X) for which TT" = IGBL(Y,Y) (3.6.1.2) We shall write aleft BL(X, Y) = {Te BL(X, Y):T is not left invertible} (3.6.1.3)
74 3. Invertibility and Singularity and aright BL(X, Y) = {T e BL(X, Y): T is not right invertible} (3.6.1.4) We emphasize that it is part of the definition of a left or a right inverse that it is bounded, and everywhere defined on Y. 3.6.2 THEOREM If T e BL(X, Y) and S e BL(F, Z) then 5,T left invertible => ST left invertible =>• T left invertible (3.6.2.1) and S, T right invertible => ST right invertible => S right invertible (3.6.2.2) Also T invertible => T left invertible => T closed (3.6.2.3) and T invertible =>• T right invertible => T open (3.6.2.4) Proof: (3.6.2.1) and (3.6.2.2) are simple algebra, and the first parts of (3.6.2.3) and (3.6.2.4) are trivial, while the second parts follow from (3.3.2.2) and (3.4.2.1). ■ We might recall that if T is both left and right invertible then it is actually invertible: we did this in (3.1.2.2). We can prove more: 3.6.3 THEOREM If T e BL{X,Y) then T left invertible and dense =^ T invertible (3.6.3.1) and T right invertible and one-one ==> T invertible (3.6.3.2) Proof: If T' G BL(F, X) satisfies T'T = I and if T is dense then (3.2.2.2) gives (/ - TT')T = 0=>I = TT' (3.6.3.3) If instead T" e BL(Y,X) satisfies TT" = I and T is one-one then (3.2.2.1) gives T{I - T"T) =0=^1 = T"T ■ (3.6.3.4) Of course, (3.4.7.1) is already an improvement on (3.6.3.2).
3.6 Left and Right Invertibility 75 3.6.4 THEOREM If T e BL(X, Y) and S e BL(F, X) then J - ST left invertible <=> I-TS left invertible (3.6.4.1) and J - ST right invertible <=> I-TS right invertible (3.6.4.2) Proof: As in Theorem 3.1.3, if U e BL(X,Y) is a left inverse for J - ST then J + TUS is a left inverse for J — TS\ the rest of the argument is left to the reader. ■ If T is either left or right invertible then so are all the left composition operators LT = BL(W,T), and also the right composition operators RT, but the other way round: indeed by (2.9.1.2) T'T = I=> LT,LT = I: BL{W,X) and (3.6.4.3) RTRT, = I: BL{Y,W) —> BL{Y,W) and TT" = I=> LTLT„ = I: BL{W, Y) —^ BL(W, Y) and (3.6.4.4) RT,,RT = I: BL{X,W) —> BL{X,W) If we choose the space W carefully then we get something much more dramatic. 3.6.5 THEOREM If T e BL(X, Y) then, with LT = BL(F, T) and RT = BL(r,X), RT onto ==> T left invertible ==> RT open (3.6.5.1) and LT onto ==> T right invertible ==> LT open (3.6.5.2) Proof: If RT = BL(T,X) is onto then there must be T' e BL(F,X) for which / = RT{T') = T'T (3.6.5.3) which is (3.6.1.1). By (3.6.4.3) RT is right invertible, hence by (3.6.2.4) open. If instead LT = BL(Y,T) is onto then there must be T" G BL(F,X)
76 3. Invertibility and Singularity for which I = LT{T") = TT" (3.6.5.4) which is (3.6.1.2). By (3.6.4.4) LT is right invertible, hence by (3.6.2.4) open. ■ If T is either left or right invertible then so are all its enlargements: for if VI is a bornological space then by (2.7.3.3) rT = I=> Qn(r')Qn(r) = I : QnPO — QQ(X) (3.6.5.5) and TT" = I =► Qn(T)Qn(n = I : Qn(F) — Qn(F) (3.6.5.6) 3.7 ALMOST INVERTIBLE OPERATORS The almost invertible operators bear the same relation to the invertible operators as the almost open operators do to the open operators. 3.7.1 DEFINITION T e BL(X,Y) is said to be almost left invertible if there is [T'n) in BL{Y,X) for which \\I-T^T\\ —>0asn—> oo with supn||r^|| < oo (3.7.1.1) and is said to be almost right invertible if there is (T^) in BL(Y,X) for which III-rZ^H —> 0 as n —» oo with supn||r^|| < oo (3.7.1.2) If T is almost left and almost right invertible it is called almost invertible. We shall write <7left BL(X,y) = {Te BL(X,y):T is not almost left invertible} (3.7.1.3) and aright BL(X,y) = {Te BL(X,Y):T is not almost right invertible} (3.7.1.4) It is not hard to see that almost left and right invertibility obey the same product rules as left and right invertibility, and is transmitted to left and right compositions, and to enlargements. We begin however with the analogue of Theorem 3.6.5.
3.7 Almost Invertible Operators 77 3.7.2 THEOREM If T e BL(X, Y) then, with LT = BL(F, T) and RT = BL(r,X), RT dense ==> T almost left invertible ==> RT almost open (3.7.2.1) and LT dense => T almost right invertible ==> LT almost open (3.7.2.2) Proof: If RT = BL(r,X) is dense then there must be T£ e BL(Y,X) for which ||J-^r|| = ||J-i2r(^)||<l (3.7.2.3) Now with U = I-T'QT and T'n = (/ + U + • • • + Un)T'0 for each n e N (3.7.2.4) we have \\I-T'nT\\ = HCr+Ml < \\U\\»+1 ^0 and ||l£|| < ^Jj^ (3.7.2.5) giving (3.7.1.1). Further, if this holds, then for arbitrary V € BL(X, X) we have, with k= ||^||/(1-||^||), \\v-RT{vrn)\\ = \\v{i-rnT)\\ < \\u\\n+1\\v\\^o and (3.7.2.6) \\vrj < k\\v\\ which says that RT is almost open. The argument for (3.7.2.2) is the same. ■ The reader will observe that the definition of the almost left invertible operators can be simplified by dropping the requirement that supn \\T^\\ < oo in (3.7.1.1), with a similar simplification of the definition of almost right invertibility. In the sequel, however, he will see that our version is the one to work with. Using Theorem 3.7.2, we can systematically derive the properties of the almost left and right invertible operators from those of the almost open operators.
78 3. Invertibility and Singularity 3.7.3 THEOREM If T e BL(X, Y) and S e BL(F, Z) then S, T almost left invertible ==> ST almost left invertible (3.7.3.1) T almost left invertible and 5, T almost right invertible ==> ST almost right invertible (3.7.3.2) =>» T almost right invertible v * * * ' Also T left invertible => T almost left invertible ==> T bounded below (3.7.3.3) and T right invertible ==> T almost right invertible ==> T almost open (3.7.3.4) Proof: For (3.7.3.1) and (3.7.3.2) apply (3.4.2.2) for RT and for LT. Alternatively, it is not hard to argue directly. The first part of each of (3.7.3.2) and (3.7.3.3) is trivial. Towards the second part of (3.7.3.3), suppose that ||J-2^r||—0 withsupJftH* Then if k < k' we may take U = T'N in such a way that \\I-UT\\<6 with k = (1 - 6)k' (3.7.3.5) Now for arbitrary x G X \\x\\ < ||(/-C/r)x|| + ||C/rx|| < «||x|| + A;||rx|| =► ||x|| < k'\\Tx\\ (3.7.3.6) Finally, for the second part of (3.7.3.4), suppose that ||/_rz*||_o with supn\\T':\\<k Then for arbitrary y G Y we have, with xn = T„y, l|y-r*j|<||i-rz3|y||->o and (3.7.3.7) w<raii»ii<%ii ■ The almost left and right invertible operators form open sets:
3.7 Almost Invertible Operators 79 3.7.4 THEOREM If X and Y are normed spaces then {T e BL(X,Y)):T is almost left invertible} is an open set (3.7.4.1) and {T e BL(X,Y):T is almost right invertible} is an open set (3.7.4.2) Proof: By Theorem 2.9.1 the mappings T —> RT and T —► LT are both continuous, and by Theorem 3.4.3 the almost open operators between two normed spaces form an open set. Thus the sets in (3.7.4.1) and (3.7.4.2) are continuous counterimages of open sets. ■ Temporarily, we shall describe T G BL(X, Y) as a left topological zero divisor iff LT = BL(Y,T) : BL(Y,X) —► BL(Y, Y) is not bounded below (3.7.4.3) and as a right topological zero divisor iff RT = BL(r,X) : BL(Y,X) —► BL(X,X) is not bounded below (3.7.4.4) Evidently both these form closed sets of T G BL(X, Y). 3.7.5 THEOREM If X and Y are normed spaces then d{T e BL(X,Y):T almost left invertible} C {J e BL(X,Y):T topological right zero divisor} and d{T e BL(X, Y): T almost right invertible} C{T e BL(X,Y):T topological left zero divisor} Proof: If T G BL(X, Y) is in the closure of the almost left invertible operators then RT : BL(Y, X) —► BL(X, X) is in the closure of the almost open operators, and if T is not almost left invertible then RT is not almost open. It follows that if T is in the boundary of the almost left invertible operators, then RT is in the boundary of the almost open operators. By Theorem 3.5.2 it follows that RT is not bounded below, which is what we mean (3.7.4.4) by calling T a topological right zero divisor. This proves (3.7.5.1), and the argument for (3.7.5.2) is the same. ■
80 3. Invertibility and Singularity- It is easy to see that if T G BL(X,Y) is almost left invertible then so are all the left compositions LT = BL(W, T) and all the enlargements Qn(T), while all the right compositions RT = BL(T,W) are almost right invertible. The reader can write this out for himself, together with the corresponding result for almost right invertible operators. It is clear also that the almost invertible operators form an open set, the intersection of the open sets of almost left and almost right invertible operators. We might observe a certain "uniqueness of almost inverse" : 3.7.6 THEOREM If T e BL{X,Y) and if {T'n) and (T%) in BL{Y,X) satisfy (3.7.1.1) and (3.7.1.2), then 11^-^11—^0 (3.7.6.1) Proof: Observe K - r^|| < K(/- m?)II + \\{T'nT - jjrj'H <(supnK||)||/-rr:i| + Kr-/||suPnK|| —o . 3.8 REGULAR OPERATORS For a common generalization of left and right invertibility, assume the existence of a "generalized inverse": 3.8.1 DEFINITION T e BL(X,Y) is called regular or relatively Fred- holm, if there is V G BL(F,X) for which T = TT'T (3.8.1.1) We shall write rBT(X,y) = {Te BL{X,Y):Te T • BL(y,X) . T} (3.8.1.2) If (3.8.1.1) holds we shall call T' a generalized inverse, or pseudo- inverse, for T. For example if T' is either a left inverse or a right inverse for T, then it is a generalized inverse. If Y = X and T = T2 is a projection, as in (2.5.0.2), then (3.8.1.1) holds with T' = T, or alternatively with T = I. In a sense, regular operators are the common generalization of invertibles and idempotents.
3.8 Regular Operators 81 3.8.2 THEOREM If X and Y are normed spaces, then T e BL(X, Y) is regular iff T is proper and T_1(0) is complemented and cl(TX) is complemented (3.8.2.1) Proof: Suppose T is regular, with T = TT'T: then T'T = (rT)2 = P and TT' = [TT')2 = Q (3.8.2.2) are projections on X and Y, respectively, with T-^O) = P_1(0) and Q{Y) = T{X) = c\{TX) (3.8.2.3) Thus, T~l (0) and z\{TX) are both complemented, and also T(X) = c\(TX) is closed. Further, the mapping TA : P{X) —► Q{Y) (3.8.2.4) induced by T is invertible, with inverse (rA)_1 = T'A : Q{Y) —► P{X) (3.8.2.5) induced by T', and finally there is isomorphism TA - core(r) (3.8.2.6) induced by the isomorphism (2.4.2.1) X/T-^O) = X/P-^O) c* P{X) (3.8.2.7) It follows that core(T) is invertible, so that T is proper. Conversely, if T G BL(X,Y) satisfies (3.8.2.1), then we may choose projections P = P2 on X and Q = Q2 on Y satisfying (3.8.2.3), and use (3.8.2.6) to see that the mapping TA of (3.8.2.4) is invertible. Now, if we define T'A : Q{Y) -► P{X) by (3.8.2.5), then (3.8.1.1) will be satisfied if T' : Y —► X is defined by setting T'{y) = T'AQ{y) e X for each y eY ■ (3.8.2.8) If T is regular, and T" is a generalized inverse of T, then there is nothing to tell us that T' is regular, or that if T' is regular then T is a generalized inverse of T". However, this can be arranged: if (3.8.1.1) holds, then T" = T'TT' => T = TT"T and T" = T"TT" (3.8.2.9) If T G BL(X, Y) is regular then so are all the composition operators LT = BL(W,T) and RT = BL(T,W), and all the enlargements Qn(T). Indeed,
82 3. Invertibility and Singularity if T' satisfies (3.8.1.1) then, by (2.9.1.2), Jbrp = L/fpL/fpiL/rp and Rrp = RrpRrpiRrp ^3.8.2.10] while, by (2.7.3.3), Qn(T) = Qn(T)Qn(T')Qn(T) (3.8.2.11) In general, the product of regular operators need not be regular, and it is not easy to know what one can add to a regular operator without losing regularity: 3.8.3 THEOREM If T e BL(X, Y) and T' e BL(Y, X), then T - TT'T regular => T regular (3.8.3.1) and J - T'T regular ^=> I - TT' regular (3.8.3.2) If also S e BL{Y,Z) and S' e BL(Z,Y), and if V and S' are generalized inverses for T and 5, respectively, then ST regular <<=> S'STT' regular (3.8.3.3) Proof: If U e BL(y, X) is a generalized inverse for T - TT'T then T = T{T' + (/ - T'T)U{I - TT'))T (3.8.3.4) giving (3.8.3.1). If instead U is a generalized inverse for J — T'T then I-TT' = {I- TT') {I + TUT') {I - TT') (3.8.3.5) giving forward implication in (3.8.3.2). The argument for backward implication is similar. Finally, if ST is regular, with generalized inverse UeBL{Z,X), then S'STT' = S'S{TUS)TT' = {S'STT')TUS{S'STT') (3.8.3.6) giving forward implication in (3.8.3.3). Conversely, if S'STT' is regular, with generalized inverse V G BL(Y, Y), then ST = S{S'STT')T = S{S'STT'VS'STT')T = ST{T'VS')ST m (3.8.3.7)
3.8 Regular Operators 83 It is sufficient for S'STT' to be regular that [S'S){TT') = {TT'){S'S) (3.8.3.8) since by (2.5.2.2) S'STT' is then a projection. Sufficient for (3.8.3.8) is that either S'S = I or TT' = I. Thus, S left invertible, T regular => ST regular (3.8.3.9) and S regular, T right invertible => ST regular (3.8.3.10) Conversely, looking at the canonical factorization (2.3.2.2), every regular operator T can be written as the product of a left and a right invertible operator: T = UV with U left invertible and V right invertible (3.8.3.11) We can also observe T left invertible <=> T regular and one-one (3.8.3.12) and T right invertible <=> T regular and dense (3.8.3.13) The first part of Theorem 3.8.3 begins to tell us how we can sometimes add to a regular operator without disturbing regularity: 3.8.4 THEOREM If T e BL(X,Y) is regular, and T' e BL{Y,X) is a generalized inverse for T, and if K G BL(X, Y) satisfies I - T'K is invertible and (J - TT')K{I - T'K)-1 {I - T'T) is regular (3.8.4.1) then T - K is regular (3.8.4.2) Proof: Writing K' = {I - T'K)'1 we have (/ - TT')KK'{I - T'T) = {{T{I - T'K) + [K - T))K'{I - T'T) = T{I - T'T) + {K- T)K'{I - T'T) = 0+(K-T)K'(I-T'K) (3.8.4.3) + {K-T){K'T'){K-T) = K-T+{K- T)K'T'{K - T)
84 3. Invertibility and Singularity It follows from (3.8.4.1) that the last expression is regular, and hence by (3.8.3.1) that (3.8.4.2) holds. ■ It is clear that (3.8.4.2) will also hold if (3.8.4.1) is replaced by J - KT' is invertible and (3.8.4.4) (J - TT'){I - KT')'1^ - T'T) is regular. The details are left to the reader. Regular operators with invertible generalized inverses are special: 3.8.5 DEFINITION T G BL(X,Y) is called decomposably regular, or relatively Weyl, if there is T' G BL(Y,W) for which T = TT'T and T' G BL"1 [Y, X) is invertible (3.8.5.1) For example, idempotents are decomposably regular, if T = T2 G BL(X,X) then (3.8.5.1) holds with T' = I. More generally, {SP:Se BL"1 [X,Y),P = P2 e BL(X,X)} (3.8.5.2) = {QS-.se BL^p^y),*? = q2 e BL(y,y)} is the set of decomposably regular operators from X to Y. In particular, unless it is actually invertible, a decomposably regular operator must be a "boundary operator" in the sense of Section 3.5: if T G BL(X,Y) then T decomposably regular => T e cl BL"1 (X, Y) (3.8.5.3) Indeed if S G BL(X, Y) is invertible and P G BL(X, X) is a projection then sp-s(p + i(/-p)j and (3.8.5.4) s(p + -{i-pyj GBL-^y) Decomposably regular operators can be characterized "spatially": 3.8.6 THEOREM T G BL(X,y) is decomposably regular if and only if T is regular and T"1 (0) ~ Yj cl(TX) (3.8.6.1)
3.8 Regular Operators 85 Proof: If T is regular, and P = P2 e BL(X,X) and Q = Q2 e BL(Y,Y) satisfy (3.8.2.4), then by (2.4.2.1) the second part of (3.8.6.1) is equivalent to ^(O)^"1^) (3.8.6.2) If this holds, and SfV : Q x(0) —► P x(0) is invertible, then we can define T' : Y -+ X by setting T'{y) = T'AQ{y) + S'V{I - Q){y) e X for each y eY (3.8.6.3) where T'A = (rA)_1 is given by (3.8.2.5): evidently T' satisfies (3.8.5.1). Conversely, if T' satisfies (3.8.5.1), and P and Q are given by (3.8.2.2), then T'{QY) C PX and T'{Q-l0) C P~l0 (3.8.6.4) so that the restriction of T" to Q_10 supplies an isomorphism (3.8.6.2). ■ If T G BL(X, Y) is regular then we have a matrix representation, as in (2.4.1.7), r = diag(rA,o) "rA o" 0 0 p(xy .p~1q. -^ Q(Y)' (3.8.6.5) in which T ^ core(T) is invertible. If, in particular, T is decomposably regular then there is also an invertible operator 5 = diag(rA,5v) Evidently "rA o " 0 5V : p(xy P-!Q -^ Q(Y)' T = SP = QS (3.8.6.6) (3.8.6.7) as in (3.8.5.2). The decomposably regular operators are in a sense "orthogonal" to the left and right invertible operators in 'BL'(X, Y): the reader can verify that T decomposably regular and left invertible ^=> T invertible (3.8.6.8) and T decomposably regular and right invertible ^=> T invertible (3.8.6.9) We conclude with a sort of converse to (3.8.5.3):
86 3. Invertibility and Singularity 3.8.7 THEOREM If T G BL{X,Y) is regular, with T = TT'T, and if there is U G BL(Y,X) for which C/GBL_1(y,X) and /+ {U - T)T' G BL_1(X,X) (3.8.7.1) then T is decomposably regular. Proof: Since (J + {U - T)T')T = {T- TT'T) + UT'T = UT'T we have T = SP (3.8.7.2) with P = T'T and S = {I + {U - r)r')_1C/ (3.8.7.3) Evidently P is idempotent and S is invertible. ■ 3.9 ESSENTIAL INVERTIBILITY An operator T G BL(X,Y) which is not invertible may still be left and right invertible "modulo" certain subspaces of operators. 3.9.1 DEFINITION If X and Y are normed spaces and A C BL(X,X) and B C BL(Y, Y) are linear subspaces, then we shall say that T G BL(X,y) is left invertible modulo A if there is T' G BL(F,X) for which I-T'T e A (3.9.1.1) and that T is right invertible modulo B if there is T" G BL(Y,X) for which j _ TTn e B (3.9.1.2) We shall write alff BL(X,y) ={Te BL{X,Y):T is not left invertible mod A} (3.9.1.3) and <rfj>ht BL(X,y) = {Te BL(X,Y):T is not right invertible mod B} (3.9.1.4) For example, if A = {0} then (3.9.1.1) reduces to left invertibility (3.6.1.1), while if A = BL(X,X), then (3.9.1.1) holds for all T G BL(X,y):
3.9 Essential Invertibility 87 3.9.2 THEOREM If A C A! C BL(X,X) and B C B' C BL(y,y) and rGBL(X,y), then T left invertible ==> T left invertible mod A' ==> T left invertible mod A (3.9.2.1) and T right invertible => right invertible mod Bf (3.9.2.2) =>► T right invertible mod B Proof: If T e BL(y, X), then I - T'T = 0 =► J - T'T e A =}> / - T'T e A' (3.9.2.3) giving (3.9.2.1), and similarly (3.9.2.2). ■ If the product of two operators is essentially invertible then so is one factor: 3.9.3 THEOREM If X, Y and Z are normed spaces and A C BL(X, X), B C BL(y,y), D C BL(Z,Z) are linear subspaces, then, if T e BL(X,y) andSGBL(y,Z): ST left invertible mod A => T left invertible mod A (3.9.3.1) and ST right invertible mod D => S right invertible mod D (3.9.3.2) If A, B and D satisfy BL(y, X) • B • BL(X, y) C A and BL(y, Z) • B • BL(Z, Y) C D (3.9.3.3) then 5 left invertible mod B.T left invertible mod A (3.9.3.4) =^ ST left invertible mod A and S right invertible mod jD, T right invertible mod 5 (3.9.3.5) ==> ST right invertible mod D
88 3. Invertibility and Singularity Proof: If I-UST e A, then (3.9.1.1) holds with T' = US, giving (3.9.3.1). Conversely, if J - T'T e A and J - S'S G B, then I-T'S'ST = I + T'{I-S'S)T+{I-T'T) e I + T'BT + A C A (3.9.3.6) where the inclusion at the end uses (3.9.2.3). This proves (3.9.3.4), and similarly (3.9.3.5). ■ If, for example, X = Y = Z and A = B = D is a two-sided ideal of BL(X,X) then (3.9.3.3) holds. For the analogue of Theorem 3.6.5 we ask that A and B are closed ideals, and write RT/A : T' —► T'T + A e BL(X, X)/A for each T' e BL(F, X) (3.9.3.7) and LT/B : T' —► TT' + B e BL(F, Y)/B for each T' e BL(F, X) (3.9.3.8) 3.9.4 THEOREM If A is a closed left ideal of BL(X, X) and B is a closed right ideal of BL{Y,Y), and if T e BL{X,Y), then RT/A onto =^ T left invertible mod A => RT/A open (3.9.4.1) and LT/B onto => T right invertible mod B => LT/B open (3.9.4.2) Proof: If RT/A is onto, then / G BL(X, X) = RT BL(y, X) + A (3.9.4.3) giving T' G BL(y,X) to satisfy (3.9.1.1). Conversely, if (3.9.1.1) holds, then provided A is a left ideal of BL(X,X), U e BL(X, X) =}> U - UT'T eA=>U + A= {RT/A) [UT') (3.9.4.4) so that RT/A is onto. Further, if U' e BL(X,X), then U'-UeA=>U- U'T'T eA=>U + A = {RT/A) [U'T') (3.9.4.5) and finally, if A is closed and 0 < t < 1 then by the Riesz lemma (1.5.2.1) we can choose U' e U + A with \\U'\\ < (l/t) dist(Z7, A), giving U + A= {RT/A){U'T') with WU'T'W < (1/011^1111^ +A|| (3.9.4.6)
3.9 Essential Invertibility 89 This means RT/A is open, and finishes the proof of (3.9.4.1). The argument for (3.9.4.2) is the same. ■ If, instead of a left ideal, A is a right ideal of BL(X, X), then T left invertible mod A => T not a left zero divisor mod A (3.9.4.7) in the sense that if U G BL(X, X), then TV = 0 =>» U e A (3.9.4.8) We can prove a lot more: 3.9.5 THEOREM If A C BL(X,X) is a closed two-sided ideal, and if T G BL(X,Y) is almost left invertible mod A, in the sense that there is [T'n) inBL(y,X) for which dist(/-r^r,A) —► 0 and supn||r£|| < oo (3.9.5.1) then T is not a topological left zero divisor mod A in the sense that there is k > 0 for which dist(*7, A) < k\\TU\\ for each U e BL(X,X) (3.9.5.2) Proof: If (3.9.5.1) holds, then there is (U^) in BL(X,X) and k' > 0 for which ||/-r;r-ET;||—.0 and U'neA and supnK|| = *' < oo (3.9.5.3) Then, if U e BL(X,X) and U" e A are arbitrary, we have U-U" = {I-T^T-U^iU-U^ + U^U-U^-T^TU^ + T^TU (3.9.5.4) and hence \\U - (U" + U'n(U - U") + rnTU")\\ < 11/-i^r-tr;i|||tr-tr"||+ 11^1111^11 Since A is both a left and a right ideal, U" + U'n{U - U") + T'nTU" eA + A{U - U") + {T'nT)A C A (3.9.5.6) so that dist(CT,A) < \\I-T'nT-U%\\U-U'% + \\T'n\\\\TU\\<6\\U-U''\\ + k'\\TU\\ (3.9.5.7)
90 3. Invertibility and Singularity for arbitrary S > 0, by choice of n: finally allowing \\U — U"\\ —► dist(C/, A) gives (3.9.5.2) with k = k'/(l - S). m The reader should state and prove the corresponding result for almost right invertibility modulo B C BL(Y,Y). If A C A! C BL(X,X), then combining Theorem 3.9.5 and Theorem 3.9.2 gives, using notation based on (3.7.1.3) and (3.3.1.3), TlftBL(X,Y)Cfl$BL(X,Y) (3.9.5.8) C a)f, BL(X, Y) C o~)f BL(X, y) In particular, if it were possible to reverse the implication of Theorem 3.9.5 for a particular ideal A C BL(X, X), then this would also hold for all larger ideals A'DA If T £ BL(X,y) is regular in the sense of Definition 3.8.1, and the ideal A C BL(X,X) satisfies a regularity condition, then we can actually reverse the implication (3.9.4.8): 3.9.6 THEOREM If A C BL(X, X) is a right ideal and satisfies BL(X, Y) • A C ^X, Y) (3.9.6.1) then there is implication for T G BL(X, Y), T left invertible mod A ^=> T regular with ^(O) C A (3.9.6.2) If B C BL(Y, Y) is a left ideal and satisfies B • BL(X, Y) C rBT(X, Y) (3.9.6.3) then there is implication, for T £ BL(X, Y), T right invertible mod B <*=> T regular with JKy^O) C B (3.9.6.4) Proof: If T = rrT is regular, then I-T'Te L?1 (0) and I-TT'G R^fi) (3.9.6.5) giving backward implication in each of (3.9.6.2) and (3.9.6.4). Conversely, if T is left invertible mod A then L? (°) £ A bv (3.9.4.8), and if T' £ BL(Y,X) satisfies (3.9.1.1), then T - TT'T = T{I - T'T) £ BL(X, Y) • A (3.9.6.6)
3.10 Algebraic Invertibility 91 is regular by (3.9.6.1), so that (3.9.3.1) makes T regular. This finishes the proof of (3.9.6.2), and the argument for (3.9.6.4) is the same. ■ If, in particular, A = {0}, then (3.9.6.2) reduces to (3.8.3.12), while if B = {0}, then (3.9.6.4) reduces to (3.8.3.13). If N C BL{X,Y) is a linear subspace for which BL(y, Y)-N + N- BL(X, X) C N (3.9.6.7) then, by an abuse of language, we shall sometimes call N a two-sided ideal of BL(X, y); if in addition N satisfies a regularity condition, iVCrBL1(X,y) (3.9.6.8) then by the argument of (3.9.6.6) it follows rBL1(X, y) + N C rBL1(X, Y) (3.9.6.9) Indeed if T = TT'T e rBL1(X, Y) and K C N then T+K- [T + K)T'{T + K) = K-KT'T-TT'K-KT'K e N C ^^y) (3.9.6.10) so that (3.8.3.1) makes T + K regular. We can also improve on Theorem 3.8.4: if T = TT'T e "W{X,Y) is regular and K e BL(X,y), then it is sufficient, for T — K G 'BL'(X, Y) to be regular, that there is a two- sided ideal A C BL(X, X), satisfying the regularity condition (3.9.6.1) such that J — T'K is both left and right invertible mod A and has an "essential inverse" K' e BL(X,X) for which (/ - TT')KK'{I - T'T) e rBT(X, Y) (3.9.6.11) Indeed, if these conditions are satisfied then the argument of Theorem 3.8.4 shows that there must be U G BL(Y,X) for which T - K - [T - K)U{T - K) e BL(X, Y) • A C rBL1(X, Y) (3.9.6.12) which makes T - K regular by (3.8.3.1). 3.10 ALGEBRAIC INVERTIBILITY For the first time we meet a condition that is actually stronger than invertibility: 3.10.1 DEFINITION If M e BL(y,X) is a subspace, then T e BL[X,Y) is said to be left M-invertible if there is T' G BL(y,X) for which T'T = I and T'e M (3.10.1.1)
92 3. Invertibility and Singularity and is said to be right M-invertible if there is T" G BL(Y,X) for which TT" = I and T" e M (3.10.1.2) We shall write a£ft BL(X, Y) = {T G BL(X, Y): T is not left M-invertible} (3.10.1.3) and a^ght BL(X,y) ={Te BL{X,Y):T is not right M-invertible} (3.10.1.4) For example, if M = BL(Y, X), then left M-invertibility reduces to left invertibility (3.6.1.1): 3.10.2 THEOREM IfMCM'C BL(F,X) and T G BL(X,Y), then T left M-invertible =>» T left M'-invertible => T left invertible (3.10.2.1) and T right M-invertible ==> T right M'-invertible ==> T right invertible (3.10.2.2) Proof: If T' G BL(F, X) then J' G M =}> T' G M' =>T' G BL(y, X) ■ (3.10.2.3) 3.10.3 THEOREM If M C BL(y,X), N C BL(Z,y) and MiV C L C BL(Z,X), and T G BL(X,y), 5 left iV-invertible, T left M-invertible =► ST left L-invertible (3.10.3.1) and 5 right iV-invertible, T right M-invertible => ST right L-invertible (3.10.3.2) Proof: If 5;5 = I and T'T = I and T G M and 5' G iV then T'S'ST = I and T'S'eMN<ZL (3.10.3.3) If 55" = J and TT" = I and T" G M and 5" G JV then STT"S" = I and T"S" eMN CL ■ (3.10.3.4)
3.10 Algebraic Invertibility 93 To argue that ST left L-invertible =► T left M-invertible (3.10.3.5) and that ST right L-invertible =► S right iV-invertible (3.10.3.6) we must know that LS CM and TLQN (3.10.3.7) Thus, for example, if X = Y = Z and M = N = L is a subalgebra A C BL(X, X) and we restrict to T G A and S G A, then the conditions for Theorem 3.10.3, as well as (3.10.3.7) will hold. In this situation the restrictions of the composition operators LT : A —► A and RT : A —► A (3.10.3.8) coincide with the operators of (2.10.0.1) and (2.10.0.2). If A is a normed linear algebra then the analogues of Theorem 3.6.5 and Theorem 3.7.2 hold: 3.10.4 THEOREM If A is a normed linear algebra and a £ A, then Ra onto ==> a left invertible ==> Ra open (3.10.4.1) and La onto => a right invertible => La open (3.10.4.2) Also Ra dense ==> a almost left invertible ==> Ra almost open (3.10.4.3) and La dense ==> a almost right invertible ==> La almost open (3.10.4.4) Proof: If Ra is onto, then 1 G A = Ra{A), so that there is af G A for which 1 = a'a (3.10.4.5) Conversely, if this holds then for arbitrary 6 G A b = [ba')a with ||6a'|| < ||6||||a'|| (3.10.4.6)
94 3. Invertibility and Singularity so that Ra is open. This proves (3.10.4.1), and the argument for (3.10.4.2) is the same. For (3.10.4.3) we follow the argument for (3.7.2.1): if Ra is dense then there is af0 G A for which ||l — a'Qa\\ < 1, and now with u = 1 — af0a and a!n = (1 + u + \- un)a! for each nGN (3.10.4.7) we have ||l-a'a|l = lkn+1||<||u| n+l and lla'J|< i-W (3.10.4.8) so that a is "almost left invertible." Further, if this is so, then for arbitrary 6 e A we have, with k = ||ao||/(l - ||u||), in+li 0 with||6a'n||<A;|| \\b-Ra(ba'n)\\ = \\b(l-a'na)\\<\\u\ (3.10.4.9) so that Ra is almost open. The argument for (3.10.4.4) is the same. ■ Theorem 3.10.4 can be extended to more general "algebraic invertibility" provided there is some compatibility between T G BL(X, Y) and M C BL(y,X). We need, in fact, T to belong to some subspace N C BL(X, Y) for which there are subalgebras A C BL(X, X) and B C BL(Y, Y) for which A M N B is a subalgebra of BL(X,X) BL(y,X) BL(x,y) BL(y,y)J X =-([?]•[?]) (3.10.4.10) Then for the analogue of Theorem 3.10.4 we work with the composition operators [Rt)m : M —► A and (LT) M M B (3.10.4.11) The condition for all this to be possible is simply stated, and reminiscent of (3.8.1.1): MTM CM If A is a normed algebra and cGA then a left invertible ==> a not a left zero divisor in the sense that La : A —► A is one-one: u £ A and au = 0 ==> u = 0 (3.10.4.12) (3.10.4.13) (3.10.4.14)
3.10 Algebraic Invertibility 95 We can prove more: 3.10.5 THEOREM If A is a normed linear algebra and cGA, then a almost left invertible ==> a not a topological left zero divisor (3.10.5.1) and a almost right invertible ==> a not a topological right zero divisor (3.10.5.2) Proof: We follow the argument for (3.7.3.3): if ||1 - a'na\\ —► 0 and supn||a'n|| = k' < 0 (3.10.5.3) then for arbitrary u £ A ||u|| < (1 - a'na)u\\ + \\a'J\\au\\ < S\\u\\ + k'\\au\\ (3.10.5.4) as soon as ||1 — a'na\\ < 6, giving if 0 < 6 < 1, k' \\u\\ < k\\au\\ with k = - (3.10.5.5) l — o This proves (3.10.3.1), and the argument for (3.10.5.2) is the same. ■ Of course we are calling a G A a topological left zero divisor if La is not bounded below. We may now adapt all the notation of (3.2.1.4), (3.2.1.5), ... (3.7.1.4). 3.10.6 THEOREM If A is a normed linear algebra, then aleft {A) aright {A) aleft {A) aright {A) (3.10.6.1) are closed subsets of A, each of whose complements are closed under multiplication, and there is inclusion 7rleft(A) C fleft(A) C <7left(A) C aleft(A) D rleit{A) D fleft(A) (3.10.6.2) and 7rright(A) C fright(A) C aright(A) C aright(A) D rright(A) D fright(A) (3.10.6.3) Also 7rleft(A) Uaright(A) = 7rright(A) Ualeft(A) = rleft(A) U aright(A) = A \ A"1 (3'10-6-4)
96 3. Invertibility and Singularity Finally daleft(A) C fright(A) and daright(A) C fleft(A) (3.10.6.5) and, hence d(aleft(A) U aright(A)) C fright(A) n fleft(A) (3.10.6.6) Proof: Each of the sets in (3.10.6.1) is the counterimage, under one of the continuous mappings a —► La or a —► i2a, of the complement of the set of almost open, or bounded below, operators on A, each of which is open (3.4.3.1), (3.3.3.1), and closed under multiplication (3.4.2.2), (3.3.2.1). In each of (3.10.6.2) and (3.10.6.3) the first inclusion is (3.3.2.3), the second is Theorem 3.10.5, the third is (3.4.2.4), the fourth is (3.6.2.3), and the fifth is (3.3.2.3). If a is right invertible and not a left zero divisor then 1 = aa" and a(l — a"a) = 0, giving part of (3.10.6.4). If instead a is almost right invertible and closed then La is dense and closed, hence invertible by (3.3.7.2), which means that a is right invertible and not a left zero divisor. For (3.10.6.5) apply Theorem 3.5.2 to Ra and to La. Finally, for (3.10.6.6) argue dK C H <md dH C K => d{K UH)C (dK) n {dH) ■ (3.10.6.7) If A is a subalgebra of a larger normed algebra B and a e A then we can compare A invertibility and B invertibility for La and Ra: 3.10.7 THEOREM If A is a subalgebra of the normed algebra B and a G A then u{B) nAC W(A) for each w G {aleft,aright,aleft,aright} (3.10.7.1) and u{B) n A D u{A) for each w G {7rleft, 7rright, fleft, fright} (3.10.7.2) There is inclusion d(aleft(A) U aright(A)) C (d(aleft(E)) U aright(5)) n A (3.10.7.3) Proof: The first two inclusions of (3.10.7.1) are Theorem 3.10.2, and the second two follow from the analogue of Theorem 3.10.2 for almost invertibility. The first two inclusions of (3.10.7.2) are given by (3.2.4.1) and the second two by Theorem 3.3.6. Combining (3.10.7.1), (3.10.7.2), and (3.10.6.5) gives d(aleft(A) U aright(A)) C rleft(E) n fright(5) n A (3.10.7.4)
3.10 Algebraic Invertibility 97 Now for (3.10.7.3) argue that if K and H are subsets of A then dKCHQK=>dK CdH ■ (3.10.7.5) Detailed proofs of (3.10.7.5) and (3.10.6.7) will be given in Theorem 7.10.3 below. In particular, from (3.10.7.1) and (3.10.7.2) we get implication A C B and aleU{A) = fleft(A) =► aleft(E) n A = fleft(E) n A (3.10.7.6) and A C B and aright(A) = fright(A) =► aright(E) n A = fright(E) n A (3.10.7.7) As part of the argument for (3.10.6.4) we learn that a G A is invertible if and only if La £ BL(A, A) is invertible. a e A-1 <=> La e BL_1(A, A) <=> Ra e BL_1(A, A) (3.10.7.8) Indeed, if af = a-1 is an inverse for a in A, then La, and Ra, are inverses for La and Ra in BL(A, A). Conversely, if either La or i?a is invertible then either a is right invertible and not a left zero divisor, or a is left invertible and not a right zero divisor. If A is a normed linear algebra and 0 is a bornological space then the enlargement Qq{A) of (1.9.2.1) is a normed algebra, and the mapping qn of (2.7.3.4) is a homomorphism in the sense of (2.10.17). For each a G A there is equality Qn(£.) = ^qn(a) and QQ(*B) = iJqo(a) (3.10.7.9) When 0 = N then the topological zero divisors of A are represented by zero divisors in Qn(A): 3.10.8 THEOREM If A is a normed linear algebra and a e A, then q(o) e fleftQ(A) =^ae fleft(A) =► q(o) € ^leftQ(A) (3.10.8.1) and q(o) e frightQ(A) => a G fright(A) =► q(o) € 7rrightQ(A) (3.10.8.2) Also q(o) € aleftQ(A) ^aG aleft(A) =► q(o) € aleftQ(A) (3.10.8.3)
98 3. Invertibility and Singularity and q(o) e <7rightQ(A) ^cG <7right(A) =► q(o) € arightQ(A) (3.10.8.4) Hence q(a) almost invertible ==> a almost invertible =>• q(a) invertible (3.10.8.5) Proof: For (3.10.8.1) and (3.10.8.2) apply (3.3.5.2) with, respectively, T = La and T = Ra\ for (3.10.8.3) and (3.10.8.4) apply (3.4.5.2) with, respectively, T = Ra and T = La. The implication (3.10.8.5) follows at once from (3.10.8.3) and (3.10.8.4). ■ 3.11 SUBSPACES, QUOTIENTS, AND PRODUCTS Suppose X and Y are normed spaces, with T G BL(X, Y), and suppose that closed subspaces W C X and Z CY satisfy T{W) C Z (3.11.0.1) Then T induces operators Tw,z :W —> z and T/w,z : xlw —> Ylz (3.11.0.2) We shall not always be so careful about how we write them down. The invertibility and singularity of T, Twz and Ttw^z are constrained by one another: 3.11.1 THEOREM If T e BL(X, Y) and T{W) CZ CY, then -^W z one~°ne smd T/yy % one-one m m (3.11.1.1) ==> T one-one ==> Tw z one-one Tw z bdd below and T/w zbdd below ' (3.11.1.2) =► T bdd below =► Twz bdd below Twz closed and T,w^z closed ==> T closed ==> Tw z closed (3.11.1.3) T/w,z onto and Twz onto => T onto => T/WtZ onto (3.11.1.4) T/w,z dense and Twz dense => T dense => T/WtZ dense (3.11.1.5) T/WiZ open and Twz open ==> T open ==> T;^ open (3.11.1.6) T/ix/ ^ almost open and TVy ^ almost open », , ' m (3.11.1.7) =*• T almost open =*• Tiw z almost open
3.11 Subspaces, Quotients, and Products 99 Proof: (3.11.1.1) is (3.2.4.1), (3.11.1.2) and (3.11.1.3) are Theorem 3.3.6, (3.11.1.4), (3.11.1.6) and (3.11.1.7) are Theorem 3.4.6 and (3.11.1.5) is (3.2.4.2). ■ We have also some "mixed" results: 3.11.2 THEOREM If T G BL(X, Y) and T(W) CZCY, then Tw z onto and T one-one ==> T/W^z one-one (3.11.2.1) T\v,z dense and T bounded below ==> T;^ bounded below (3.11.2.2) Twz onto and T closed => T/W}Z closed (3.11.2.3) T/w,z one-one and T onto ==> Twz onto (3.11.2.4) T/w,z one-one and T open ==> Twz open (3.11.2.5) TiWjz hounded below and T dense ==> Tw z dense (3.11.2.6) T/w z bounded below and T almost open ' ' (3.11.2.7) => Tw z almost open Proof: If Tw z is onto and T is one-one and if Tx G Z, then Tx = Tw with w G W, giving x = w G W and making T/WfZ one-one. If Twz is dense and T is bounded below, with k > 0 satisfying (3.3.1.1), then for each x G X inf llx -w\\<k inf \\T(x - w)\\ = k inf \\Tx - z\\ (3.11.2.8) making T/W^z bounded below. If in particular T is also closed then \\y-Txn-zj -^ o =► ||y-rxn-ru>n|| — o =* y = Tx 2 Q) with \\T{x - xn - wn)\\ —» 0 =► ||z - xn - Wn\\ — 0 making Ttw^z closed. If T/W^z is one-one and T is onto then for each z G Z there is x G X, for which Tx = z € Z, forcing x G W, making Tw z onto. If T is open, with A; > 0 satisfying (3.4.1.1), then we can insist on ll^ll < ^II2I|j making Twz open. If T is dense and TiWjZ is bounded below, with dist(x,W) < kdist(Tx,Z), then for each z G Z there is (xn) in X for which ||z — Txn\\ —► 0, giving dist(xn,W) < fcdist(rxn,Z) < k\\Txn - z\\ —► 0 (3.11.2.10) and hence there is also a sequence (wn) in W for which ||xn — tun|| < dist(xn,W) + l/n-+0. It follows ||z-rti;n|| < ||*-rxn|| + ||r||||xn-ti;n|| -+
100 3. Invertibility and Singularity 0, giving Twz dense. If in particular T is almost open, so that we can arrange ||xn|| < &'||z||, then we can use the Riesz lemma argument (1.5.1.1) to ensure ||u;n|| < 2||xn|| < 2fc'||2||, making Twz almost open. ■ The invertibility of T is also constrained by that of Twz and T,w^z: 3.11.3 THEOREM If T e BL(X, Y) and T{W) QZ CY, then Twz invertible and Ttwz invertible ==> T invertible (3.11.3.1) T invertible and Twz invertible ==> T/^z invertible (3.11.3.2) T invertible and T/WfZ invertible ==> Twz invertible (3.11.3.3) Proof: We have proved this many times over: for example (3.11.3.1) follows from (3.11.1.2) and (3.11.1.4), with (3.3.7.1), or alternatively from (3.11.1.1) and (3.11.1.6), with (3.4.7.1). For (3.11.3.2) we can use (3.11.1.4) and (3.11.2.2), with (3.3.7.1); for (3.11.3.3) we can use (3.11.1.1) and (3.11.2.5), with (3.4.7.1). ■ If W = Wx and Z = Zx are complemented subspaces of X and Y, then the left and right invertibility of T, Twz and T/WfZ are mutually constrained. If W2 and Z2 are closed complements for W = W1 and Z = Z2, respectively, then (2.4.2.1) X/W1 ~ W2 and Y/Z1 ~ Z2 (3.11.3.4) Recalling that T(W1) C Zu without the assumption that T(W2) C Z2, gives an "upper triangular" representation for T : X —> Y: o r99 3.11.4 THEOREM If T e BL{X,Y) satisfies T{W1 <ZZ1<ZY, for complemented Wx C X and Zx CY, then in the notation (3.11.3.5) ^i 15 Too (almost) left invertible 11 22 V * (3.11.4.1) ==> T (almost) left invertible ==> Tn (almost) left inv. ^in^92 (almost) right inv. 11 22 V ' (3.11.4.2) =>» T (almost) right inv. =>» T22 (almost) right inv. T left invertible, Tn dense =>» T22 left invertible (3.11.4.3) T right invertible, T22 one-one ==> Tn right invertible (3.11.4.4) w1 Zi (3.11.3.5)
3.11 Subspaces, Quotients, and Products 101 T almost left invertible, Tn almost open T22 almost left invertible T almost right invertible, T22 bounded below ==> Tn almost right invertible Proof: If SnTn = In and S22r22 = J22 then If ^11 ^11^12^22 0 S22 ^ii ^ii Tn L12 ^22 hi '22 ^21 ^22 T„ 0 r12" ^22. "Ai 0 o " A2. (3.11.4.5) (3.11.4.6) (3.11.4.7) (3.11.4.8) then S^T^ = I11. This proves both implications of (3.11.4.1) for left invertibility, and the corresponding implications for almost left invertibility are obtained if we replace each 5X- • by a sequence (Sft). We leave the details to the reader. The arguments for (3.11.4.2) are exactly similar, and can also be left to the reader. Towards (3.11.4.3) suppose (3.11.4.8) holds and note that ^21-^11 — 0 an(* ^21-^12 "+" ^22-^5 22-* 22 z22 (3.11.4.9) If also Tn is dense, then (3.11.4.9) gives S21 = 0 and hence S22T22 = ^22? which proves (3.11.4.3). If we assume that T is almost left invertible, then 11^ ul and \\S^T12 + 52n2r22 - J22|| —> 0 (3.11.4.10) If also Tn is almost open then (3.4.4.1) HSJJ -► 0, giving 11^22^22-^22II -+ 0, which proves (3.11.4.5). The arguments for (3.11.4.4) and (3.11.4.6) can now safely be left to the reader. ■ If the subspaces W = Wx and Z = Zl for which T(W1) C Zx have complements W2 Q X and Z2 CY for which also T(W2) C Z2 then we can say even more: 3.11.5 THEOREM If T e BL{X,Y) satisfies T{WX) C Z1 C Y and T(W2) C Z2 C y, where Wx and W2 are a complemented pair in X and Zj and Z2 are a complemented pair in Y, then, in the notation of (3.11.3.5), T nonsingular <$=> Tn nonsingular and T22 nonsingular (3.11.5.1)
102 3. Invertibility and Singularity for each "nonsingularity" among one-one-ness, boundedness below, closed- ness, denseness, ontoness, almost openness, openness, left and right invertibility, almost left and right invertibility. Proof: Much of this can be obtained by applying Theorem 3.11.1 separately for WUZX and for W2,Z2; Theorem 3.11.2 and Theorem 3.11.4 between them account for anything left over. ■ 3.12 SEQUENCE AND FUNCTION SPACES We begin by seeing how much "nonsingularity" is inherited from T G BL(X,y) by operators /p(0,r) : /p(0,X) -► /p(0,y): 3.12.1 THEOREM If 0 is a nonempty set and T G BL(X,Y) then for each p G {1,2, oo} there is implication T nonsingular <$=>► /p(0,T) nonsingular (3.12.1.1) where "nonsingular" means: one-one; bounded below; closed; almost open; open. Proof: Begin with the case p = oo: if T is one-one and Tnx = 0 with x G ^(OjX), then T(x(t)) = 0 for each t G 0, giving x(t) = 0 for each t G 0, so that x = 0. Conversely, if ^(H,!1) is one-one and x = 1 0 x0 is the constant function xQ then Tx0 = 0 =>» Tn{l 0 x0) = 0 =>» 1 0 x0 = 0 =>» x0 = 0 (3.12.1.2) If T G BL(X,y) is bounded below, with k > 0 satisfying (3.3.1.1), then if xel^X), IHL = 8uPt||x(t)|| < 8uPt*||r(x(0)|| = tnr^iL (3.12.1.3) which makes ^(OjT) bounded below. The converse may be left to the reader. If T is closed and ||Tnxn — t/H^ —► 0 then for each t G 0 we have ||r(a:„(*)) - w(*)|| < ||rQ*„ - »||«x, —► 0 asn^oo (3.12.1.4) giving x^t) G X for each t satisfying y(t) = T(xOQ(t)): also ll*oolloo < ll*oo - *tf lloo + Halloo < k\\V - T^XnWn + Hx^ll^ < OO (3.12.1.5) so that Xco G ^(OjX). This proves that /^(ftjT) is closed. The converse needs a moment's thought: if x G XN and ||Txn - y|| -► 0, then \\Tn(l ©
3.12 Sequence and Function Spaces 103 xn) — 1 ® y||oo ~~* ° an(* nence there is z G 1^(0,,X) for which l©y = Tn(z) (3.12.1.6) If we choose t G 0 and 2^ = 2(£) G X it follows that y = T{zOQ). If T G BL(X, Y) is open, with k > 0 satisfying (3.4.1.1), and if y G /^(n, Y) is arbitrary, then there is x : 0 —► X for which for each t G Q,T{x{t)) = y(t) and ||x(*)|| < *||y(0H (3.12.1.7) Necessarily x G 1^(0,,X), since H^H^ < &||y||oo < °°» an(* indeed this is the analogue for ^(OjT) of the condition (3.4.1.1). The converse is clear, arguing as in (3.12.1.6). The reader can easily make the modifications needed to do the same thing for almost openness. The corresponding results for p = 1 and p = 2 can also be left to the reader. We work with "delta functions" St © x instead of constants 1 © x throughout. ■ The analogue of Theorem 3.12.1 is valid for the sequence spaces associated with a bornological space 0: 3.12.2 THEOREM If 0 is a bornological space and T G BL(X, Y) and if c+ G {c0o>co>ci}> then there is implication T nonsingular <$=> c+(n,T) nonsingular (3.12.2.1) where "nonsingular" means one-one, bounded below, closed, open, almost open. Proof: By (3.11.1.l)-(3.11.1.3) the one-one, bounded below, and closed components of (3.12.2.1) extend from l^Cl^T) to each c+(0,r). If y G c+(0,T) and T is open, with k > 0 satisfying (3.4.1.1), then there is x G /oo(0,r)with TnX = y and ||x(*)|| < &||y(*)|| for each t G 0 (3.12.2.2) If y G c00(0, Y) then necessarily x G c00(Q,X), and if y G c0(0, Y), then necessarily x G c0(0, X). If more generally y G cx (0, Y), with y — 1 © y^ G c0(0, Y), then there are z G c0(0, Y) and x^ G X for which 7^ = 2/oo with HxJI < *||yj| and (3.12.2.3) J"* = y - 1 © Voo with 11*11^ < k\\y - 1 © yJI^
104 3. Invertibility and Singularity It follows Tn(z + 1 0 xj = y with ||* + 1 0 xJL < 3*11^11^ (3.12.2.4) This gives (3.12.2.1) for openness, and the argument for almost openness is the same. ■ If 0 is a topological space and T 6 BL(X, Y) then T nonsingular <*=> £7^(0, T) nonsingular (3.12.2.5) for each nonsingularity among one-one-ness, boundedness below, and closedness. For openness and almost openness the argument of Theorem 3.12.2 only gives us the analogue for the larger subspaces of l^Q^X) and /^(n, Y) consisting of the functions which are continuous at a particular point t G 0. For certain special topological spaces, the invertibility of a G A = £7^(0, K) is interesting: 3.12.3 DEFINITION The topological space 0 is called normal if, for arbitrary K0,K1 C 0, c\(K0) C int(^) =► c\(K0) C int(if1/2) C c\(K1/2) , x (3.12.3.1) C mtyKi) for some K1/2 Q 0 For example, if 0 is a metric space then (3.12.3.1) is easily checked. If cl(ff0) C in^ffj C 0, then repeated application of (3.12.3.2) throws up a family {Kx)x G 0 of subsets of 0, indexed by a dense subset AC [0, l] C R, for which for each A,/xGA:A</x=^ c\(Kx) C int^) (3.12.3.2) We shall call {Kx)XeA a relief map of 0. The Urysohn function of a relief [Kx) is the mapping ifA : 0 —► [0, l] defined by setting KA(t) = inf {A e A:t e Kx} for each t € (UA6AiTA) \ ((UA6AXA) (3.12.3.3) with {0 if * € nA6AiCA Urysohn's lemma says that the Urysohn function of a relief is continuous:
3.12 Sequence and Function Spaces 105 3.12.4 THEOREM If 0 is a normal space and {Kx)XeA is a relief map of 0, then the Urysohn function KA : 0 —> [0, l] is continuous. Proof: For each A G A the set {t G 0: KA{t) <X} = UM<AffM is open (3.12.4.1) and {t G 0: KA{t) >X} = nA<MffM is closed ■ (3.12.4.2) If 0 is a normal topological space, and cl(ff0) C intf-K^) C 0, then we can construct a relief (Kx) and then a Urysohn function for the relief KA = u, giving u G C(0, [0,1]) for which u(t) = 0 if t G K0 and u(t) = 1 if tg Kx (3.12.4.3) We shall call the topological space 0 separated if {t} = cl {t} for each t G 0 (3.12.4.4) 3.12.5 THEOREM If 0 is a normal separated space and A = 0^(0, K) then A\A~X =rle{t{A) =fright(A) = (aG A: inf |a(t)| = o} (3.12.5.1) and 7rleft(A) = 7rright(A) = {a G Aiinto^O £ 0} (3.12.5.2) Proof: If infn |a(-)| > 0, then 6 = l/a : t —► l/a(t) is continuous on 0, bounded with H&Hoo = l/infn |a(-)| < °°> an<^ since 6 • a = 1 = a • 6, it is clear that a G A-1. Conversely, if infn |a(-)| = 0 then for each e > 0 there is t G 0 for which \a(t)\ < ^e, and then U G Nbd(i) for which 5 G £7 => |a(s)| < e. Now by (3.12.4.3) with K0 = {t} and KX = U there is u G A for which Hu^ < 1; u(t) = 1; u(s) = 0 if s G 0 \ £/. Evidently ||u|| = 1 and \\a • u\\ = \\u • a\\ < e, so that a G rleft(A) and a G fright(A). Towards (3.12.5.2), if the interior of the set a_10 is empty, then for arbitrary u G A there is implication a - u = 0 => u(t) = 0 for each £ G 0, extending by continuity from t G 0 \ a_10, excluding a from 7rleft(A) = 7rright(A). Conversely, if t G inta_10, then by (3.12.4.3) there is u G A for which u(i) = 1 and u{s) = 0 if 5 ^ a_10. Evidently a-u = u-a = Q^u. ■
4 Banach Spaces and Completeness A normed space which as a metric space is complete is called a Banach space. From our point of view the most dramatic effect of this is that "open," "almost open," and "onto" mappings are the same thing. 4.1 CAUCHY SEQUENCES Suppose that 0 is a bornological space, in the sense of (1.9.0.1) and (1.9.0.2), and consider the product OxO. If if C 0 x 0 is declared to be "bounded" whenever H C K x K' with bounded K, K' C 0 (4.1.0.1) then OxO becomes another "bornological space." For the present discussion we need a different definition: we shall say that H C 0 x 0 is "bounded" provided H C (K x 0) U (0 x K') with bounded K,K' C 0 (4.1.0.2) If also X is a linear space and x : 0 —► X is a mapping we shall define xv : 0 x 0 —► X by setting xw(s,t) = x(s) - x(t) for each s,t e 0 (4.1.0.3) When X is a normed space and x G /^(0,-X") then xv G /^(O x 0,X) with Halloo <2|Wlc„ (4.1.0.4) and the mapping x —► xv is also linear. If 0 is a bornological space and 107
108 4. Banach Spaces and Completeness 0 x 0 is given the bounded structure (4.1.0.2) then xe c+(0,X) =>• xv e c+(n x 0,X) for each c* e {coojCojCi} (4.1.0.5) 4.1.1 DEFINITION If X is a normed space and 0 is a bornological space then x e /oo(n,X) is called Cauchy iff xw e /^(n x 0,X) is null: c(n,x) = {xe zoo(n,x):xv e c0(n x n,x)} (4.1.1.1) Necessary and sufficient is that there should be (Ke)e>0 in Bn for which if e > 0 and s,t e 0 \ if£ then ||x(s) - x(*)|| < e (4.1.1.2) When 0 = N with the finite homology, the condition is that there should be {Ne)e>0 in N for which if e > 0 then m>n>Ne=3> \\xm - xn\\ < e (4.1.1.3) 4.1.2 THEOREM If 0 is a bornological space and X is a normed space then Cl(0,X) C c{n,X) = cl(c(0,X)) C l^i^X) (4.1.2.1) If <t>: A —> 0 is cofinal then X*c(0,X) Cc(A,I) (4.1.2.2) Also if x G l^QyX) then, provided A is nontrivial, xec(Q,X) andxo^GCifA,!) =J>xGc1(0,X) (4.1.2.3) Proo/: By (4.1.0.4) and (1.9.1.1), c(0,X) is a continuous counterimage of a closed subspace, therefore closed. If x G c1(0,X) then there is z G X and (ife) in Bn for which s,t e 0 \ Ke =>» ||x(s) - x(t)|| < ||z(s) - *|| + ||* - x(*)|| < 2e (4.1.2.4) so that x e c(Q,X). The inclusion (4.1.2.2) follows from (2.8.2.1) with c+ = c0, noting that the induced mapping <j> x <j> : A x A —>• 0 x 0 is also cofinal. If, for (4.1.2.3), x is Cauchy and x o <j> is convergent then for each e > 0 there are bounded Xe C H, Fe C A with x o <f>[A \ He) C Disc(*,e) and t,t' e 0 \ Ke => ||x(*) - x(*')|| < £• With <£(A \ H'e) C 0 \ ff£ it follows (A \ [He U H'e)) cn\Ke and x(<£(A \ {He U #£)) C Disc(*, e) and hence iGO\iC£ =>» ||x(*)-*|| <e ■ (4.1.2.5)
4.1 Cauchy Sequences 109 Theorem 4.1.2 says that subsequences of Cauchy sequences are Cauchy, and that if a Cauchy sequence has a convergent subsequence then it must itself be Cauchy. Bounded linear operators map Cauchy sequences into Cauchy sequences: 4.1.3 THEOREM If T G BL(X,Y) is bounded and 0 is a bornological space then Tnc{n,X) Cc(0,y) (4.1.3.1) If T is bounded below, then iGUOJ) andrnxGc(0,y) =^xGc(0,X) (4.1.3.2) Also T bounded below <=> c(0,T) bounded below (4.1.3.3) Proof: If x G ^(OjX) is arbitrary then {Tnxy = {Tnxn)xw e /^(n x n,y) (4.1.3.4) where xv is defined by (4.1.0.3), and now (4.1.3.1) follows from (2.7.1.1) together with Definition 4.1.1. Alternatively it is rather easy to verify (4.1.3.1) directly. Similarly (4.1.3.2) follows from (3.3.5.3). Forward implication in (4.1.3.3) follows from (3.3.6.1) and (3.12.1.1). To reverse it just note that all the constants are in c(Q,X). Alternatively, use (3.12.2.1) and Definition 4.1.1. ■ It is clear that the analogue of (4.1.3.3) holds with either "one-one" or "closed" in place of "bounded below." If T is either open, or almost open, and y G c(0, Y), then Theorem 3.12.2 says that we can approximate to y with Tnxnz where z G c0(O x 0,X). It is not clear that we can do it with z = xw and x G c(0,X). In the special case O = N we do have something: 4.1.4 THEOREM If T G BL(X,F) then T open => c(T) onto (4.1.4.1) and T almost open =^ c(T) dense (4.1.4.2)
110 4. Banach Spaces and Completeness Proof: Suppose T G BL(X, Y) is open, with k > 0 satisfying (3.4.1.1), and suppose (em) is a sequence of positive real numbers for which oo E £m < °° (4-1-4.3) m=l Then if y G c(0, Y) is arbitrary we may define <j> : N —► N in such a way that, for each m,n,nf in N, <j>(m) <n<n' =>• ||yn, - yn|| < em and <£(m) < <f>{m + l) and <f>{l) = 1 (4.1.4.4) Now choose x = (xn) in X with Tx = Y in such a way that, for each m,n inN *M < n < <£(m + l) =► ||xn-x^(m)|| < *||yn-y^(m)|| and M < k\\yi\\ (4.1.4.5) We claim 4>{m) <n< <t>{m + l),<t>{rri) < ri < <j>(m' + l),n < n' =>» ||xn, - xn|| < 2fcem + em, —> 0 as m —► oo (4.1.4.6) since ll*n' - *nll < ll*n' - X4>{m')\\ + ll^(m') " X<f>{m)\\ + ll^(m) " *nll < *l|yn' -^(m')ll +*H^(m') -^(m)ll +*l|y^(m) -Vnll This proves (4.1.4.1), and the argument for (4.1.4.2) is similar. ■ 4.2 COMPLETENESS A normed space is said to be complete if all its Cauchy sequences converge: 4.2.1 DEFINITION The normed space X is said to be complete iff c{X) =c1{X) (4.2.1.1) More generally, a subset K C X will be called complete iff {x e c{X):{xn:n e N} C K} C K + c0{X) (4.2.1.2) The reader should check that, if K C X, then if complete =► if closed (4.2.1.3)
4.2 Completeness 111 and X complete, K closed ==> K complete (4.2.1.4) If X is complete then (4.2.1.1) extends to arbitrary bornological 0: 4.2.2 THEOREM If X is a Banach space and 0 is bornological, then c(n,X) = Cl(n,X) (4.2.2.1) Proof: If 0 is itself a bounded set then (4.2.2.1) is trivial. If not then for each x G c(0,X) there is [Kn)neU in Bn for which, for each n G N, t,t'en\Kn=^ \\x{t') - x{t)\\ < 1/n and Kn C fTn+1 (4.2.2.2) Now choose <£ : N —► 0 so that for each neN, <j>{n) eVl\Kn (4.2.2.3) The argument now resembles Theorem 4.1.2: xo<j>ec{x) = c1{x) =^xec1{n,x) ■ (4.2.2.4) Theorem 4.2.2 has some "practical" consequences. Many "limit" constructions can only be justified by showing that some mapping is "Cauchy." For example, if X is a Banach space and x : J —► X is continuous on an interval JCR then J3 f x{t) dt = lim{£(*y-*,-_!)*(*;) : * — [a,P}\ if [a,/?] C J (4.2.2.5) can be interpreted by treating the set of "partitions" t = (t0>*i>* • • >*n) of [a,^] as a bornological space 0, and using the uniform continuity of x on [a,/3] to prove that the induced mapping from 0 to X is "Cauchy." There are indeed two different ways of defining the "bounded subsets" of 0: as the subsets of the complements of sets of the form {t G 0: max(£ • — tj_i) < 8} for some S > 0, or the subsets of the complements of sets of the form {t e 0: {*y} D E} for some finite E C [a,/?]. Completeness in normed spaces can be tested by means of series convergence. We recall the mapping £i : XN -+ XN of (2.8.2.6): 4.2.3 THEOREM If X is a normed space, then £*!(*) Cc(X) (4.2.3.1)
112 4. Banach Spaces and Completeness Necessary and sufficient for X to be complete is that Y/1h(X)Cc1(X) (4.2.3.2) Proof: If xel^X), then m < n ■ y=m+i 11, x3 - 11, WxjW—>0asm—¥<x> (4.2.3.3) y=m+i giving (4.2.3.1). The reader should check that if the partial sum of the ||xy|| does not converge to 0 then the series JZy llxyll cannot possibly be finite. From (4.2.3.1) it is clear that the condition (4.2.3.2) is necessary for completeness. Conversely, if X is not complete there will be x G c(X) \ Ci(X), and then {Nk)keti for which, for each k G N, m > n > Nk =► \\xm - xn\\ < 2~k and Nk < Nk+1 (4.2.3.4) Define </> : N -► N by setting </>(/:) = Nk for each A; G N: then by (4.1.2.2) and (4.1.2.3) x' = X<f> = (xHn)) e c(X) \ Cl(X) (4.2.3.5) Finally define y G XN by setting y1 = x\ and yn+1 = x'n+1 - x^ for each n eN (4.2.3.6) Evidently yel^X) and x' = ^y^cl{X) ■ (4.2.3.7) If X is complete then the sum of an absolutely convergent series is "rearrangement independent": if x G li{X), then oo oo y^ £^n) = ^inGl if </> : N —► N is one-one and onto (4.2.3.8) n=l n=l One way to see this is to observe that the associated mapping x : 0 —>• X is Cauchy, therefore convergent, where 0 is the set of all finite subsets of N in the natural boundedness, and x(K) = ^2neK xn for each finite K C N. We shall sometimes encounter a generalization of the idea of a complete subset of a normed space X. Recall that K C X is called convex if 0 < t < 1 =J> tK + (1 - *)ff C K (4.2.3.9)
4.3 Spaces of Functions and Operators 113 and write A= j*G[0,l]N:fSy = l] (4.2.3.10) L i=i } Then we shall say that K C X is completely convex iff there is implication oo t e A and x e KN =► 3 ^ *yxy G if (4.2.3.11) y=i Evidently, if K C X, if convex and complete =>» if completely convex =>» if convex (4.2.3.12) 4.3 SPACES OF FUNCTIONS AND OPERATORS Certain standard spaces are complete: for example the scalar field K in the norm |-|, and more generally Kn for each nGN: 4.3.1 THEOREM If Y and Z are Banach spaces then so is their product Y x Z. Proof: We recall that Y x Z is supposed to be normed in such a way as to have the usual cartesian product topology. If (xn) = [yn,zn) is Cauchy in Y x Z, then [by (4.1.3.1)] (yn) is Cauchy in Y and (zn) is Cauchy in Z. By assumption therefore there are y^ G Y and z^ G Z for which ||yn — y^W —► 0 and \\zn — z^W —► 0, so that also (xn) converges to (y«»*oo)- ■ Theorem 4.2.1, and induction, says that Kn is complete for each nGN, provided we know that K is complete, and indeed C ~ R2 is complete because R is. We may assume that the completeness of R is known: a proof can however be extracted from Theorem 4.5.3 below. Closed subspaces and quotients of complete spaces are complete: 4.3.2 THEOREM If Y C X is a closed subspace of a Banach space X, then Y and X/Y are complete. Proof: The completeness of Y is (4.2.1.2): if (yn) in Y is Cauchy then it is Cauchy, therefore convergent, in X\ if (yn) in Y is convergent in X then it must be convergent in Y. The completeness of X/Y is more substantial: we use (4.2.3.2). If z e (X/y)N is in lx{X/Y) then by the Riesz lemma (1.5.2.1) there is (xn) in X with xnezn and ||xn||<||*n|| + 2-n for each n € N (4.3.2.1)
114 4. Banach Spaces and Completeness By the completeness of X it follows £x x G cx{X) =^Y,,ze ciWy) (4'3-2-2) By (4.2.3.2) it follows that X/Y is complete. ■ If X is complete then so are the sequence spaces Z (0,X): 4.3.3 THEOREM If X is a Banach space and 0 is a nonempty set then Zp(0,X) is complete for each p G {1,2, oo}. If 0 is a topological space then (7^(0, X) is complete. Proof: If (xn) is Cauchy in /^(fyX) then by (4.1.3.1) {xn{t)) is Cauchy in X for each t G 0, hence converges to some element x^t) G X. We claim that x^ : 0 —► X is bounded: for each £ G 0 11*^(011 =lim||*n(*)ll <sup||xn(0|| ^supllsJL (4.3.3.1) n n n Thus we have an element x^ G 1^(0,,X) for which xn(t) —► £^(2) as n —>• oo for each £ G 0: we have to show that \\xn — x^W^ —► 0. To do this we must go back to the Cauchy condition on (xn): there is {Ne)e>0 in N for which m > n> N. => \\xm — x„ II „ < e, so that m>n>Ne=3> \\xm{t) - xn{i)\\ < e for each t G 0 (4.3.3.2) Allowing m —> oo in (4.3.3.2) says that ||xn(i) — rc00(^)|| < e if t G 0 and n> Ne, hence n>JV,=H|*B-*JL<e (4-3.3.3) The argument for Zj (0, X) is very similar: if (xn) is Cauchy in Zx (0, X) then again there is x^ : 0 —► X for which \\xn(t) — x^t)]] —► 0 for each £ G 0. This time the Cauchy condition says that there is {Ne)e>0 for which m>n> Ne => \\xm — xJli < e, so that m>n>Ne=^^2 HxmM - xnWII < e for arbitrary finite J C N (4.3.3.4) and hence, allowing m —> oo, » > We =► ||xn - zj^ = sup J£ ||x„(t) - Xoo{t)\\ finite J C N J < e (4.3.3.5)
4.4 Extension by Continuity 115 In particular it now follows ll'oolli < ll*oo - *«lli + ll*«lli < e+ Wxm\\i < <*> if m>Ne (4.3.3.6) so that x^ G /1(0,X). The argument for l2{ft,X) can at this stage be left to the reader. If 0 is a topological space then, since £7^(0, X) is a closed subspace of /^(n, X) by Theorem 1.8.2, the completeness of £7^(0, X) is just (4.2.1.4) again. ■ By the same argument it is clear that, if 0 is a bornological space and X is complete, then c0(O,X), c1(0,X) and c(0,X) are all complete, being closed subspaces of l^ (0, X), and hence by Theorem 4.3.1 the enlargements Qn(X) are also complete. We shall see below that Qn(X) is still complete even when X is not. If 0 is a nonempty set and X is complete then the reader may like to check that the measure space M1(0,X) of (1.8.2.4) is also complete. 4.3.4 THEOREM If X and Y are normed spaces and Y is complete then so is BL(X,y). Proof: If (rn) is Cauchy in BL(X,Y) then by (4.1.3.1) (Tnx) is Cauchy in Y for each iGl, therefore converges to some element T^x G Y. The mapping x —► T^x is evidently linear, and we claim also bounded: for each xeX WT^xW = iim||rBx|| < suP||rnx|| < (sup||rn||)||z|| (4.3.4.1) n n n Thus we have an element T^ G BL(X,Y) for which Tnx -► T^x. To see that \\Tn — ^ 11 -^Owe must go back to the Cauchy condition for (Tn): if m > n > Ne => \\Tm - Tn\\ < e then m>n>Ne=^ ||rmx-Tnx|| < e||x||,=> WT^x-T^W < e||x|| (4.3.4.2) for each x G X, allowing m —► oo, giving ||Tn — T^H —► 0. ■ Alternatively we can regard BL(X, Y) as isometric to a closed subspace of /^(^y), where 0 = Discx(0 ; l). 4.4 EXTENSION BY CONTINUITY Bounded linear operators can be extended from dense subspaces to complete spaces:
116 4. Banach Spaces and Completeness 4.4.1 THEOREM Suppose X and Y are normed spaces, and that Y is complete: then each T0 G BL(X0,Y) for which cl(X0) = X has a unique extension T G BL(X,Y): BL(X0, Y) = BL(X, Y) (4.4.1.1) Proof: Suppose x G X is arbitrary: then there is (xn) in X for which ||xn-x||—>0 and x G X0 (4.4.1.2) By (4.1.2.1) the sequence (xn) is Cauchy, therefore by (4.1.3.1) the sequence (T0xn) is Cauchy in Y, therefore by Definition 4.2.1 converges: write Tx = limroxn if (4.4.1.2) holds (4.4.1.3) n The mapping x —> Tx is well denned in two senses: the limit exists by what we have just said, and does not depend on the particular choice of sequence (xn). x = limxn = limx'n =* \\T0x'n - T0xJ — 0 (4.4.1.4) n n Now that it is well denned, it is easy to see that T is linear, and also bounded, since ||T*|| = lim||rosn|| < ||T0||limK|| = ||ro||||x|| (4.4.1.5) n n Thus T0 G BL(X0,y) has at least one extension T G BL(X,Y). The uniqueness follows at once from (3.2.2.2): any two continuous mappings T : X —► Y and T' : X —► Y which agree on the dense subset X0 C X must be equal, whether or not Y is complete. ■ We have written equality "=" in (4.4.1.1) to emphasize that the induced isomorphism is derived from the restriction mapping: formally, if J : X0 —► X is the natural injection, so that the composition operator Rj : BL(X,y) —► BL(X0,y) is defined, we have proved that Rj is invert- ible. When a bounded linear operator has been "extended by continuity," then certain kinds of nonsingularity are transmitted: 4.4.2 THEOREM Suppose X and Y are normed spaces, and that T G BL(X,y) satisfies T(XQ) C Y0, where X = c\{X0) and Y = cl(Y0): then T0 bounded below <=> T bounded below (4.4.2.1)
4.4 Extension by Continuity 117 and T0 almost open «*=>• T almost open (4.4.2.2) Proof: If T0 is bounded below, with k > 0 satisfying (3.3.1.1), and if x G X is arbitrary, then x = limxn =► ||x|| = lim||xn|| < lim*||roxn|| = *||Tx|| (4.4.2.3) n n n This is implication one way in (4.4.2.1), and the reverse is (3.3.6.1). If T0 is almost open, with k > 0 satisfying (3.4.1.2), then for arbitrary y G Y there is by (3.5.1.4) (yn) in Y0 for which ||yj| < ||y|| and \\yn-y\\ -+ 0, and (xnm) in X0 for each nGN such that ||xnm|| < fc||yn|| and ||T0xnm — yn\\ < 1/m. It follows ||xnB|| < *||y|| and ||rxnn - y\\ < \\Txn - yn\\ + \\yn - y\\ —. 0 (4.4.2.4) Conversely, if T is almost open and y67 then (3.5.1.4) y € cl {Tx: ||x|| < *||y||} C cl {Tx0: ||x0|| < ||x|| < *||y||} ■ (4.4.2.5) The analogue of (4.4.2.2) holds with "dense" in place of "almost open," but obviously not with "open" or "onto." It also is not clear that we can replace "bounded below" by "one-one" in (4.4.2.1). If X is complete and T G BL(X, Y) is either bounded below or open, then the range T(X) is also complete: 4.4.3 THEOREM If T e BL{X,Y) and X is complete, then T bounded below =^ T(X) complete (4.4.3.1) and T open =^Y = T(X) complete (4.4.3.2) Proof: Suppose y = (yn) G c(Y) is Cauchy, with yn G T(X) for each n: then there is (xn) in X for which yn = Txn, and necessarily (4.1.3.2) (xn) is Cauchy: \\xm ~ xn\\ < khm ~ Vn\\ —* ° as m,n —v oo (4.4.3.3) Since X is complete it follows that (xn) is convergent, with limit x^ say: now ll»» - Tx^W < k\\Txn - TXoo\\ < \\T\\\\xn - xj\ — 0 (4.4.3.4)
118 4. Banach Spaces and Completeness For (4.4.3.2) suppose y = (yn) G c(Y) is Cauchy: then by (4.1.4.2) there is x = (xn) G c(X) for which TNx = y. Since X is complete it follows that x G cx(X) is convergent, and hence (2.7.1.1) y G ^(Y). ■ Alternatively, for (4.4.3.2), we can use Theorem 4.2.3. When X is complete then certain kinds of nonsingularity coalesce for T G BL(X, Y): \AA THEOREM If T G BL(X, Y) and X is complete, then T bounded below => T closed (4.4.4.1) and T almost open => T open (4.4.4.2) Proof: The implication (4.4.4.1) is almost immediate, using (4.4.3.1) and (4.2.1.3). For (4.4.4.2) we follow the argument for Theorem 3.4.3, taking T' = T. If y G Y is arbitrary and if x = (xn) G XN is constructed as in (3.4.3.2), taking T' = T and 6 = e, then by (3.4.3.3) the sequence x = (xn) is in /i(X). From (3.4.3.2) again it follows V = T"'£ix€T"c1(X)Cc1(Y) (4.4.4.3) using also (4.2.3.2). ■ Notice that in Theorem 4.4.4 we do not assume that the space Y is complete; if however T G BL(X,Y) is almost open, then by (4.4.4.2) and (4.4.3.2) it follows that the space Y is complete. 4.4.5 THEOREM If T G BL(X, Y) and X is complete, then T almost left invertible =^ T left invertible (4.4.5.1) and T almost right invertible ==> T right invertible (4.4.5.2) Proof: If X is complete then (Theorem 4.3.4) so is BL(Y, X), and hence (4.4.4.2) RT almost open ==> RT open (4.4.5.3) and LT almost open => LT open (4.4.5.4)
4.4 Extension by Continuity 119 where RT = BL(T,X) and LT = BL(Y,T) as in Theorem 3.6.5 and Theorem 3.7.2. But now (4.4.5.3), (3.7.2.1), and (3.6.5.1) give (4.4.5.1), while (4.4.5.4), (3.7.2.2), and (3.6.5.2) give (4.4.5.2). ■ If X is complete then it is clear from Theorem 4.4.4, Theorem 3.3.3, and Theorem 3.4.3 that the closed and the open operators form open subsets of BL(X,y). In the notation (3.3.1.3), (3.3.1.4), (3.4.1.3), and (3.4.1.4), rleft BL(X, Y) = rleft BL(X, Y) is closed (4.4.5.5) and rright BL(X, Y) = rright BL(X, Y) is closed (4.4.5.6) Similarly, using Theorem 4.4.5 and Theorem 3.7.4, the left and right invertible operators, and the invertible operators, form open sets: aleft BL(X, Y) = aleft BL(X, Y) is closed (4.4.5.7) aright BL(X, Y) = aright BL(X, Y) is closed (4.4.5.8) and BL_1(X,y) = BL(X,y) \ (aleftBL(X,y) UarightBL(X,y)) is open (4.4.5.9) If X is complete and A and B are closed ideals of BL(X, X) and BL(y,y) and T e BL(X,y) then, applying (4.4.4.2) to RT/A and LT/B from (3.9.3.7), (3.9.3.8) gives T almost left invertible mod A => T left invertible mod A (4.4.5.10) and T almost right invertible mod B ==> T right invertible mod B (4.4.5.11) If instead X = A is a normed algebra, and is complete, and a G A, then applying (4.4.4.2) to Ra and La gives (Theorem 3.10.4) a almost left invertible => a left invertible (4.4.5.12) and a almost right invertible => a right invertible (4.4.5.13) Also A-1 is open in A (4.4.5.14)
120 4. Banach Spaces and Completeness In particular oo ||a|| < 1 =► 3(1 - a)-1 =l + ^cnGA (4.4.5.15) n=l By Theorem 3.1.4 the mapping a —► a-1 : A-1 —► A is continuous. We shall see later that in a sense it is also "differentiate." We conclude with the observation that for certain incomplete normed algebras the invertible elements A-1 can still form an open subset of A. An example would be the algebra A = c00 of all terminating numerical sequences: more generally if there is a sequence (An) of complete subalgebras of A for which oo A = |J An and 1 G An C An+1 n=1 (4.4.5.16) and {a~l:ae A~l n An} C An for each neN then it will follow from (4.4.5.14) that A-1 is open in A. To make the argument, begin with an observation about the range of an operator on an incomplete normed space: 4.4.6 THEOREM If X is a normed space and T € BL(X, X), then 2/GX and (Y,Try) € cx(X) =► y € (J-T)(X) (4.4.6.1) and the following are equivalent: (Tnx) e c0(X) for each xeX (4.4.6.2) (/-r)-1(o) = {o> and (I-T)(X) C^yeX: (l>r2/) €Cl(X)} (4.4.6.3) Proo/: If ?/ G X and ?/ + ^ Tr?/ —^xGX asn —>oo (4.4.6.4) r=l
4.4 Extension by Continuity 121 then necessarily Tny —> 0 as n —► oo, giving y = lim(y - rn+1</) = (J - T) lim(y + Ty + • • • + rn</) (4.4.6.5) = (/-r)(x)e(/-r)(x) which proves (4.4.6.1). If (4.4.6.2) holds and x G X is arbitrary then (J - T)(x) = 0 => x = Tx = T2x = • • • = Tnx —► 0 as n —► oo (4.4.6.6) so that (I - r)_1(0) = {0}, while for each neN y={I- T){x) =>x = y + Tx = y + T{y + Tx) = --- = {y + Ty + --- + Tny) + Tn+1x (4.4.6.7) giving y + Ty + • • • + Tny = x- rn+1x —► x as n —^ oo (4.4.6.8) Thus, both parts of (4.4.6.3) hold. Conversely, if the second part of (4.4.6.3) holds, then for arbitrary x £ X there is x' G X for which (7 + r + --- + rn)(/-r)x—►*' asn—^ oo (4.4.6.9) giving Tn+1x —> x - x' e {I - T)-1 (0) asn-^oo noting, for the inclusion at the end, that (J - T)rn+1x —► 0 asn —► oo If also the first part of the condition (4.4.6.3) holds then x (4.4.6.10) gives (4.4.6.2). ■ If the condition (4.4.6.2) is strengthened to ||rn||—►() asn—>oo (4.4.6.12) then the operator I — T is almost invertible in the sense of Definition 3.7.1, and hence by (3.7.3.3) I— T is also bounded below. To find out when A-1 is open in A we apply Theorem 4.4.6 with T = Lx or T = Rx for elements xe A: (4.4.6.10) (4.4.6.11) = x1 and now
122 4. Banach Spaces and Completeness 4.4.7 THEOREM If A is a normed algebra then the following are equivalent: A-1 is open in A (4.4.7.1) there is 8 G ]0, l] for which {1 - x: \\x\\ < 8} C A"1 (4.4.7.2) there is 8 G ]0,l] for which ||x|| < 8 =>» (^zr ) € cx(A) (4.4.7.3) v=o ' aleft {A) = aleft {A) and aright {A) = aright {A) (4.4.7.4) Proof: If (4.4.7.1) holds then 1 G A-1 lies in the interior of A-1, giving (4.4.7.2). Conversely, if (4.4.7.2) holds then {a(l - x): \\x\\ <8}C A'1 for each a G A'1 (4.4.7.5) giving (4.4.7.1). If (4.4.7.2) holds then also ||x|| < 8 =>» ||2£(1)|| —► 0 as n —> oo and 1 G (/ - LX)(A) (4.4.7.6) So that also ||x|| < 8 => \\L"(a)\\ —> 0 as n —► oo for each aeA (4.4.7.7) The condition (4.4.6.2) of Theorem 4.4.6 is therefore satisfied, and hence also the condition (4.4.6.3). Since (4.4.7.6) 1 is in the range of I — Lx this gives (4.4.7.3). Conversely, if (4.4.7.3) holds then (4.4.6.1) with y = 1 and T = Lx gives ||x|| < 8 =► 1 G (1 - x)A =► 1 - x £ aleft(A) (4.4.7.8) while (4.4.6.1) with y = 1 and T = Rx gives ||x|| < 8 =► 1 G A(l - x) =► 1 - x £ aright(A) (4.4.7.9) Together (4.4.7.8) and (4.4.7.9) give (4.4.7.2). Finally if (4.4.7.2) holds and if a G A is almost left invertible then there is (an) in A for which || 1 — ana\\ —► 0 as n —> oo, so that for sufficiently large nGN ana = 1 - x with ||x|| < 8 =J> ana G A-1 =J> a £ aleft(A) (4.4.7.10) which is the first part of (4.4.7.4). The argument for the second part of (4.4.7.4) is the same; conversely, if both parts of (4.4.7.4) hold then A-1 = A \ (aleft(A) U aright(A)) is open in A (4.4.7.11) giving (4.4.7.1). ■
4.5 Completions 123 We might remark that by the same argument aleft(A) closed in A <=> aleft(A) = <rleft(A) (4.4.7.12) and aright(A) closed in A <=> aright(A) = aright(A) (4.4.7.13) Hence if A-1 is open then each of the sets of left invertible and of right invertible elements are separately open in A. 4.5 COMPLETIONS It is familiar that every metric space can be regarded as a dense subset of some complete space. For a normed space X we can find this in the enlargement Q(X). We begin by comparing Cauchy sequences in l^X) with Cauchy sequences in Q{X): 4.5.1 THEOREM If X is a normed space and K : /^(X) -► Q{X) is the quotient, then ifNo(lM(I))Cc1(Q(I)) (4.5.1.1) Proof: Suppose x G XN is arbitrary and write (xA)n = {K"x)n = xn + c0{X) for each neM (4.5.1.2) It is clear from (2.6.1.1), (2.7.1.1) and (4.1.3.1) that if x is bounded, or convergent, or Cauchy, then so is xA. We claim that if x is Cauchy then xA is convergent: if {Nm)meN is such that for each n,nf,m in N n' > n > Nm => \\xn, - xj.,, < ^ and Nm < Nm+1 (4.5.1.3) then define x^ G XN by setting (*oo)m = (*Njm &>r each m € N (4.5.1.4) Since n > Nm =* ||(xjm - (x0O)m|| < ||xB -*WJI~ < ± (4.5.1.5) m m we have limsup||(xn)m - (XoJ Jloo —► 0 as n —► oo (4.5.1.6)
124 4. Banach Spaces and Completeness In particular *oo € U*) (4-5.1.7) and now (4.5.1.7) reads IK-*IIq(x)—>0 where 2 = xoo + c0(X) ■ (4.5.1.8) Theorem 4.5.1 and Theorem 4.1.4 tell us that enlargements are complete. 4.5.2 THEOREM If X is a normed spaced then Q(X) is complete. Proof: Suppose y G c(Q(X)) is a Cauchy sequence, and recall, as we remarked following the proof of (3.4.7.2), that the natural quotient K : /^(X) —y Q{X) is open: thus by (4.1.4.1) we have y = xA for some x G c^X)), and hence by (4.5.1.1) y G c^QpT)). ■ If we are prepared now to regard a normed space X as a subset of its enlargement Q(X), then the closure of X in Q(X) will be its "completion." As is familiar, this closure will be precisely the quotient c(X)/c0(X) C Q(X), while the dense subspace which we are identifying with X is really the quotient c1(X)/c0(X): 4.5.3 THEOREM If X is a normed space, then q(X) = c1{X)/c0{X) is dense in c{X)/c0{X) (4.5.3.1) and c(X)/c0(X) is complete (4.5.3.2) If X- = c{X)/c0{X) then also Q(X) ^ Q(X~) is complete (4.5.3.3) Proof: Towards (4.5.3.1) suppose x G c(X) is arbitrary, so that there is {Ne)e>0 in N for which n' > n > Ne =^ \\xn, - xj < e (4.5.3.4) and now define z G X by setting z = xNe (4.5.3.5) Evidently the coset q(z) G q(X) is within distance e of the coset x-\-c0(X) G c(X)/c0(X).
4.5 Completions 125 Towards (4.5.3.2) suppose that x = (xn) is a sequence in c(X) whose coset sequence xA = (xn + c0(X)) is Cauchy in c(X)/c0(X), so that there is {Ne)e>0 in N for which n' > n > Ne =>• limsup \\xn,m - xnm\\ < e (4.5.3.6) m—*<x> Then by (4.5.3.1) there is y = (yn) in X such that limsup ||xnm — yn\\ < — for each n G N (4.5.3.7) m—*oo 71 We claim now that yec(X) and \\xn - y + c0(X)||Q(x) —► () as n—► oo (4.5.3.8) For if m G N is arbitrary then llVn- " y»ll < l|y»- - *»'mll + K'm - *»mll + ll*»m " »»ll (4-5-3.9) and hence \\yn' ~ 2/nll < —, + limsup ||xn,m - xnm|| + - —> 0 as n' > n —► oo ft m—oo » (4.5.3.10) This shows that the sequence y = (yn) is Cauchy, and then (4.5.3.7) shows that the coset y + c0(X) is the limit of the Cauchy sequence x. Finally, for (4.5.3.3), observe that the mapping q : X —> X~ is bounded below and dense, so that by (3.5.4.1) Q(q) : Q(^0 —► Q(X~) is invertible; also Theorem 4.3.3 and Theorem 4.3.2 tell us that Q(X~) is complete. ■ The space X~ = c(X)/c0(X) is the familiar way of constructing the real field R from the rationals Q. We have given the proof of (4.5.3.2) for sentimental reasons, since we already know the result from Theorem 4.5.2; at the same time the reader may perhaps consider the deduction of (4.5.3.3) from (4.5.3.2) to be easier than the proof of Theorem 4.5.2. As so often in mathematics, it is if anything counterproductive to know exactly how to construct the completion X~: 4.5.4 DEFINITION A completion of the normed space X is a complete normed space X~ together with J G BL(X, X~) for which J is isometric and dense (4.5.4.1) A completion of the bounded linear operator T G BL(X, Y) is T~ G
126 4. Banach Spaces and Completeness BL(X~,y~) for which T~JX = JYT (4.5.4.2) where X~ and y~ are completions of X and Y and Jx G BL(X, X~) and JY G BL(y,y~) are isometric and dense. Completions are essentially unique: 4.5.5 THEOREM If X~ and X~' are completions of X, then X~ ^ X~', and if T~ and r~' are completions of T, then T~ S r~'. Proof: This is Theorem 4.4.1: each of the mappings J : X —► X~ and J' : X —► X~' has unique extensions J : X~' —► X~ and J' : X~ —► X~;, whose products J J' : X~ —► X~ and J'J : X~' —>• X~; are extensions of the identity I: X -+ X. Thus, J J' = / : X~ —► X~ and J'J = I: X~' —> X~' (4.5.5.1) This shows that X~ and X~' are isomorphic. For the second part observe that the operator T~ of (4.5.4.2) is determined uniquely by X~ and Y~, or at least by Jx and Jy. ■ We have of course given a "metric" definition of completion. More generally for the "topological" definition, relax (4.5.4.1) to J is bounded below and dense (4.5.5.2) Some of the nonsingularity of T G BL(X, Y) can be expressed in terms of its completion T~ G BL(X~,y~): 4.5.6 THEOREM If T~ G BL(X~,y~) is a completion of T G BL(X,y),then T bounded below <$=> T~ bounded below <$=> T~ closed (4.5.6.1) and T almost open <4=^ T~ almost open <4=^ T~ open (4.5.6.2) Proof: For (4.5.6.1) the first equivalence is (4.4.2.1) and the second (4.4.4.1), while for (4.5.6.2) the first equivalence is (4.4.2.2) and the second (4.4.4.2). ■
4.6 The Open Mapping Theorem 127 We also have one way implications. T almost left invertible =► T~ left invertible (4.5.6.3) and T almost right invertible ==> T~ right invertible (4.5.6.4) The reader is invited to ponder what is preventing us from reversing them. In spite of (4.5.6.3) and (4.5.6.4), almost invertibility in a normed algebra is exactly the same as invertibility in the completion: 4.5.7 THEOREM If A is a normed algebra, then aleft(A) =Analeft(A~) (4.5.7.1) and aright(A) = A n aright(A~) (4.5.7.2) Proof: If a e A is arbitrary, apply (4.5.6.2) with T = Ra for (4.5.7.1) and withT = La for (4.5.7.2). ■ 4.6 THE OPEN MAPPING THEOREM We have already, in Theorem 4.4.4, proved half of the "open mapping theorem"; the other half rests on Bairt 's theorem: 4.6.1 THEOREM If X is a Banach space and if (Kn) in X satisfies oo |J int(cl(tfj) = 0 (4.6.1.1) n=l then \jKn?X (4.6.1.2) n=l Proof: If (4.6.1.1) holds we shall construct a Cauchy sequence x = (xn) in X which has no limits in \J<^>_1Kn, so that if X is complete then (4.6.1.2) holds. In fact, if X is complete, then we claim that (4.6.1.1) implies oo int |J Kn = 0 (4.6.1.3) n=l . Indeed if (4.6.1.1) holds and (4.6.1.3) does not then there is xx G
128 4. Banach Spaces and Completeness intU^Lji^ and £± for which oo Disc(x1 jeJC [j Kn\ cl^) and 0 < ex < \ (4.6.1.4) n=l For if not then we would have in^c^if x)) D int U^L1Kn ^ 0, contradicting (4.6.1.1). Next choose x2 G X and e2 for which Disc(x2 ; e2) — intDisc(x1 ; e^) \ cl{K2) and 0 < e2 < ^ (4.6.1.5) which is possible because intDisc(x1 ; e^) g cl(if2)- Inductively there are sequences (xn) in X and (en) > 0 for which, for each nGN 0 < en < 2~n and Disc(xn+1 ; en+1) C int Disc(xn ; sn) \ cl(tfn+1) (4.6.1.6) The sequence (xn) is Cauchy in X because m>n=> \\xm -xj< ||xn+1 - xj| + • ■ ■ + ||*m - xm_1 \\ < 2_n_1 + • • • + 2_m < 2_n —► 0(n —► oo) (4.6.1.7) but if the sequence (xn) converges to x^ G X then *oo € H Disc(*n : £n) C (int( Q ifB)) \ Q «•» ■ (4-6.1.8) n=l ^ Si=l ' ' ^ n=l Baire's theorem is true for any nonempty complete metric space, and can be used in unlikely places: for example if X = C[0,l] then the subset Y of those functions x G X for which there exists t G ]0, l[ for which the derivative x'{t) exists can be put in the form \J™=1Kn where [Kn) satisfies (4.6.1.1). Thus we deduce that there exist continuous functions which are nowhere differentiable. When a topological space X has the property expressed by Theorem 4.6.1 then subsets of the form \J™=1Kn for which (4.6.1.1) can only make up very small subsets of X. In a sense, therefore, a Banach space is "large," and also it is very much the exception rather than the rule for a continuous function to be differentiable. We may now proceed with the other half of the "open mapping theorem." 4.6.2 THEOREM If Y is complete and T G BL(X, Y), then T onto =*► T almost open (4.6.2.1)
4.6 The Open Mapping Theorem 129 If X and Y are both complete, then T onto =► T open (4.6.2.2) Proof: If T is onto, so that TX = Y, then oo Y={J{Tx:\\x\\<n} (4.6.2.3) n=l so that if Y is complete then by (4.6.1.1) there exists n G N for which int cl T Disc(0 ; 1) = - int cl {Tx : ||x|| < n} ^ 0 (4.6.2.4) n Since TDisc(0 ; 1) is absolutely convex it follows from (1.4.1.2) that 0 e int cl T Disc(0 ; 1) (4.6.2.5) But this means that there is k > 0 for which (3.4.1.2) holds. This proves (4.6.2.1), which together with (4.4.4.2) gives (4.6.2.2). ■ Theorem 4.6.2 gives an immediate characterization of invertibility: 4.6.3 THEOREM If X and Y are Banach spaces and T e BL(X, Y) then T one-one and onto ==> T invertible (4.6.3.1) Proof: This is (4.6.2.2) and (3.4.7.1). ■ Theorem 4.6.3 is satisfying in the sense that if T G BL(X, Y) is algebraically capable of being inverted then its inverse is automatically bounded. We might observe that (4.6.2.2) can be deduced from (4.6.3.1): for if T is onto then T = coTe(T)K with K = coker(ker(r)) open (4.6.3.2) and if core(T) is invertible then T is open. Theorem 4.6.3 simplifies the characterization of the "proper" mappings of Definition 3.2.7: 4.6.4 THEOREM If X and Y are Banach spaces and T e BL{X,Y), then T(X) closed =^ T proper (4.6.4.1)
130 4. Banach Spaces and Completeness Proof: If X and Y are complete then so are X/r_1(0) and cl(rX), and if TX is closed then core(T) is one-one and onto, therefore invertible by (4.6.3.1). ■ For the record, we note that if X is complete and T G BL(X, Y) then T bounded below and dense => T invertible (4.6.4.2) This is just (4.4.4.1) and (3.3.7.2). Once again it now follows that Y must also be complete. 4.7 ALMOST OPEN AND ONTO MAPPINGS It has been conspicuous that, for general normed spaces X and Y and T G BL(X, y), we have been unable to combine the two chains of implication (3.4.2.3) and (3.4.2.4) in a single line. To see why we are unable to do so, we make an auxiliary definition: 4.7.1 DEFINITION If X and Y are normed spaces then Y is strictly weaker than X if there exists T G BL(X,y) for which T is one-one and onto but not invertible (4.7.1.1) If y is obtained from X by imposing a new norm ||-||' then the condition is that (1.7.1.1) holds but not (1.7.1.2). In the same spirit, we shall say that X is a dense proper subspace of Y if there is T G BL(X, Y) for which T is bounded below and dense but not invertible (4.7.1.2) 4.7.2 THEOREM If X is a dense proper subspace of Y and T = J : X —> Y is the natural injection then T is almost open but not onto (4.7.2.1) Proof: Since J is isometric it is bounded below, therefore almost open by (3.5.1.2). ■ For example if X is any incomplete normed space we may take Y = X~ to be a completion of X. More specifically, take Y = C[0, l] with X QY the subspace of continuously different iable functions, or of polynomials. The reader might like to use Theorem 4.7.2 to see that the implication (4.4.4.2) serves to characterize complete spaces X.
4.7 Almost Open and Onto Mappings 131 4.7.3 THEOREM If X is complete and Y is strictly weaker than X, and T = I: X -► Y is the identity, then T is onto but not almost open (4.7.3.1) Proof: If J were almost open then it would be open by (4.4.4.2), and therefore invertible by (3.4.7.1). ■ For example if X = C[0, l] in its usual norm H-H^ then we may obtain Y by imposing the norm ||-1| j_ defined by setting l Nil = / lxMI * for each x € x (4.7.3.2) It is easily verified that J : X —► Y is bounded, and also that Y is not complete: alternatively if we define zn(t) =tn{0<t<l) for each neN (4.7.3.3) then ||*n||i = —^— and ||*n||oo = 1 for each nGN (4.7.3.4) n + 1 which shows that J : X —► Y is not bounded below. There is still a possibility to be considered; perhaps "almost open" and "onto" together imply "open": 4.7.4 THEOREM If Y is complete and strictly weaker than X and T = I-.X-+Y then T is almost open and onto, but not open (4.7.4.1) and T is one-one and almost open, but not bounded below (4.7.4.2) Proof: Trivially J is onto, and therefore almost open by (4.6.2.1). If it were either bounded below or open then it would also be invertible, by either (3.3.7.1) or (3.4.7.1). ■ Regrettably we are not, at this stage, able to follow up with a concrete example; to prove that the situation of Theorem 4.7.4 can occur we must wait until (5.5.6.2) and duality theory. It will then be clear (4.7.4.2) that a most plausible "dual" to the implication (3.5.1.2) fails.
132 4. Banach Spaces and Completeness 4.8 COMPLEMENTED SUBSPACES In a Banach space, supplemented subspaces are complemented: 4.8.1 THEOREM If X is a Banach space and if E = E2 e L(X, X) is a linear idempotent for which E{X) = c\{EX) and E'1^) = cl^"1^)) (4.8.1.1) then EeBL{X,X) (4.8.1.2) Proof: Write Y = E(X) and Z = E_1(0): then the mapping row( JY,JZ) :{y>z) —>y + zeX for each (y,z) e Y x Z (4.8.1.3) is bounded and linear, one-one and onto. If X is complete and Y and Z are closed then (Theorem 4.3.2) Y and Z are both complete, hence also (Theorem 4.3.1) Y x Z: it therefore follows from (4.6.3.1) that co\{E,I-E) =TOVt{JY,Jz)-1 :X—>Y x Z (4.8.1.4) is bounded, and hence also E = JY o^ocoliEJ-E) :X—>Y x Z—>Y —>X m (4.8.1.5) A related result says that certain complemented subspaces must be closed: 4.8.2 THEOREM If X and Y are complete and T e BL(X,Y), and if there is a closed subspace Z C Y for which T{X) + Z = Y and T{X) n Z = {0} (4.8.2.1) then T{X) = cl(rX) (4.8.2.2) Proof: Define 5 : X/T^O x Z -+Yby setting 5(x + T~x0,z) =Tx + zeY for each x e X <ind z e Z (4.8.2.3) Then S is bounded and linear, one-one and onto, between complete spaces, therefore by (4.6.3.1) invertible. Now E = TS'1 :Y —> Y (4.8.2.4)
4.8 Complemented Subspaces 133 is the projection of Y upon T(X) in the direction of Z, is bounded, and has null space T{X) =E~1{0) closed ■ (4.8.2.5) Theorem 4.8.1 can be used to show that, if X is complete and / : X —► K is linear and has closed null space /_1(0) then it must be continuous: for if f{x0) = 1 and E = / © x0 then E = E2 : X —► X is linear with E{X) = Kx0 and ^(O) = /^(O) (4.8.2.6) which form a pair of supplemented closed subspaces. By Theorem 4.8.1 E and hence also / are bounded. In fact however the result survives for incomplete X: 4.8.3 THEOREM If / G L(X, K) has closed null space f~l (0) then / G BL(X,K). Proof: We show that if / is not bounded then clf-^O) =X (4.8.3.1) Indeed, if x £ X is arbitrary and e > 0, then there is x' G X for which ||*'||<e and |/(*')|>|/(*)| (4.8.3.2) Now t = y^- =}> ||*x'|| < e and x - tx' € /_1(0) ■ (4.8.3.3) We conclude with what is known as the closed graph theorem: 4.8.4 THEOREM If X and Y are complete and T e L(X,Y) is such that graph(r) = {(x,Tx):xe X} is closed inlx7 (4.8.4.1) then T e BL(X,Y) is bounded. Proof: Since graph(T) is a linear subspace of X x Y, which is complete by Theorem 4.3.2, the condition (4.8.4.1) ensures by Theorem 4.3.2 that graph(T) is a Banach space in its own right. Evidently the mapping Sx : {x,Tx) —> xeX for each {x,Tx) e graph(T) (4.8.4.2) obtained by restricting the projection 71^ : X x Y —► X is bounded and
134 4. Banach Spaces and Completeness linear, one-one and onto, therefore invertible by Theorem 4.6.3: but now T = tt2 • S'1 : X —> Y is bounded ■ (4.8.4.3) The condition that graph(T) C X x Y is closed says that if (xn) G XN then \\*n ~ *JI — 0 ^d l|rxn - 2/ooH — 0 =► Voo = TXoo (4.8.4.4) Thus, if T e BL(X,Y) then graph(T) is closed (whether or not X and Y are complete). If T G BL(X,y) is one-one and onto then the linear inverse T-1 : Y —> X always has closed graph. Thus we can derive the open mapping theorem from the closed graph theorem. One more example: if Y and Z are a pair of closed supplementary subspaces of a normed space X then graph(E) is closed in X, where E is the projection of X on Y in the direction of Z. 4.9 UNIFORM BOUNDEDNESS If X and Y are Banach spaces, then the boundedness of a subset of BL(X,Y) can be tested "pointwise": 4.9.1 THEOREM Suppose that X is a Banach space, that Y is a normed space and that H is a nonempty set, then if (Tt)ten is a family in BL(X, Y) for which then also sup ||rtx|| < oo for each x G X (4.9.1.1) ten sup||rt|| < oo (4.9.1.2) ten Proof: if (4.9.1.1) holds then oo X={jKn (4.9.1.3) n=l where, for each nGN, Kn = I x e X:sup ||rtx|| <n\ (4.9.1.4) I ten ) and hence if X is complete then by Baire's Theorom 4.6.1 there must be n G N for which 0^int(tfn)=intcl(tfn) (4.9.1.5)
4.9 Uniform Boundedness 135 noting that each of the sets Kn is already closed. Since also each Kn is absolutely convex, (1.4.1.2) again says Oe'mtKn (4.9.1.6) which means that there is S > 0 for which ||x|| < 6 =>» sup ||rtx|| < n (4.9.1.7) ten But this means sup||rt||<5<oo ■ (4.9.1.8) ten o For an alternative derivation of Theorem 4.9.1, observe that as in (2.9.4.3) the family {Tt)ten induces a mapping TA : x -+ T.(x) from X to /00(n,y~), where Y~ is a completion of Y and the boundedness is given by (4.9.1.1): now the reader should verify that graph(rA) is closed in X x l^Q^Y^) (4.9.1.9) One kind of application of the uniform boundedness principle is to show that certain kinds of linear mapping, if they are defined at all, must be bounded. For example if x G K and we write Lx:y—►x-?/eKN for each y e KN (4.9.1.10) where (x • y)n = xnyn for each nGN (4.9.1.11) then the reader should use Theorem 4.9.1 to show that £x('i) Ql1=^xelOQ=^Lxe BLil^lJ (4.9.1.12) Another application is known as the Banach-Steinhaus theorem: 4.9.2 THEOREM If X and Y are Banach spaces and H is a bornological space, and if {Tt)ten in BL(X, Y) satisfies T.x e c(H, Y) for each x e X (4.9.2.1) then there is T^ e BL(X,Y) for which T.x - T^x e c0(H, Y) for each xeX (4.9.2.2)
136 4. Banach Spaces and Completeness Proof: We recall from (4.1.2.1) that c^yjG/oo^y) (4.9.2.3) so that by (4.9.2.1) the condition (4.9.1.1) is satisfied. Since Y is complete we have by (4.2.2.1) c(n,y) = c1{n,Y) (4.9.2.4) Thus, for each x G X there is T^x G Y for which (4.9.2.2) holds. Evidently the mapping T^ : x —> T^x is linear: what remains is to show that it is bounded. If X is complete then by Theorem 4.9.1 we argue, for each x G X, WT^xW = lim ||3>|| < sup ||3>|| < (sup ||Tt||) ||x|| (4.9.2.5) * ten ^ten ' using the condition (4.9.1.2), so that IIT^H <sup||Tt|| < oc ■ (4.9.2.6) ten The reader should note that we do not prove that r.-r^ecofn.BLpr.y)) (4.9.2.7)
5 Linear Functionals and Duality We have seen how the behavior of a linear operator is to some extent determined by its interaction with other bounded operators through the medium of all the composition operators LT = BL(W,T) and RT = BL(T,W) associated with normed spaces W. We are ready to improve the precision with which this can be done, and to do this using only the scalar field W = K. We can do this only with the aid of a deep theorem, the Hahn-Banach theorem. 5.1 THE DUAL SPACE AND THE DUAL OPERATOR If X is a real or a complex normed space, with scalar field K = R or K = C, then the dual space is the space X1" = BL(X, K) = BLK(X, K) (5.1.0.1) of bounded linear functionals on X. More traditional notation is to write X* or X1. Like all bounded linear operators, the elements of the dual space carry norms: ||/||= sup |/(x)| for each / G X1" (5.1.0.2) ll*ll<i If T G BL(X, Y) then for each g G Yt the product gT : X -> K is bounded and linear, therefore belongs to the space X^: thus we have a well-defined mapping T^ : Y^ —> X^ given by the formula T*(g)=g-T for each g G Ff (5.1.0.3) Of course this is RT = BL(T,W) with W = K. We shall refer to Tf as the dual of T: it is more usually called the adjoint or conjugate, written T* orT'. 137
138 5. Linear Punctionals and Duality 5.1.1 THEOREM If T G BL(X, Y), then T* G BL{Y^X^), with ||rt||<||r|| (5.1.1.1) If also S G BL(X, Y), then for each s,t G K {sT + *S)f = sT1" + tS* (5.1.1.2) If also 17GBL(Y,Z), then (l/r)1" = r^1" (5.1.1.3) Proof: This is obtained by substituting W = K in Theorem 2.9.1. ■ Evidently the mapping T —> J1* is linear and bounded, with bound < 1. We shall see that it is actually one-one, and isometric, though not necessarily onto: this will use the Hahn-Banach theorem. If K = C, so that X is a complex normed space, then the alert reader is entitled to a certain anxiety: since every complex space is automatically also a real space, there are two definitions of X*. Fortunately they are isomorphic, and indeed isometrically isomorphic. 5.1.2 THEOREM If X is a complex normed space then there is isometric isomorphism * —> / : BLR(X, R) = BLC(X,C) (5.1.2.1) given by the formulas g(x) = Re f(x) for each x G X (5.1.2.2) and f(x) = g(x) - ig{ix) for each x G X (5.1.2.3) Proof: If / : X —> C is complex linear, then the formula (5.1.2.2) defines a mapping g : X —> R which is certainly real linear, and bounded if / is bounded, with \\g\\ < \\f\\. We claim that if g is derived from / by means of (5.1.2.2) then (5.1.2.3) also holds: for we certainly write f{x) = g(x) + ih(x) for each x G X, and then observe that h : x —> Im/(x) is another well- defined, real linear mapping from X to R; but now g{ix) + ih{ix) = f{ix) = if{x) = ig{x) — h{x) for each x G X (5.1.2.4) and by equating real and imaginary parts we find h(x) = — g(ix). Conversely, suppose g : X —> R is real linear, and consider the mapping / : X —> C defined by (5.1.2.3). It is clear that / is real linear, and
5.2 Poles and Polars 139 evidently also f(ix) = i/(x), so that / is complex linear. Finally g also satisfies (5.1.2.2). We have thus a linear isomorphism between LR(X, R) and LC(X, C). We claim that it preserves boundedness, and indeed respects the value of the norm. We have already seen that if / is bounded then so is g, with ||<7|| < ||/||; conversely, if g is bounded then so is /, with ||/|| < 2\\g\\, as the reader can check. To see more precisely that ||/|| < ||^|| write, for each x G X, /(*) = |/(x)|e" (5.1.2.5) where iGR depends on x, without of course being uniquely determined: we make a choice. Now \f(x)\=e-itf(x) = f(e-itx) (5.1.2.6) using complex linearity, and then—this is the subtle step— f(e-itx)=g(e-itx) (5.1.2.7) since by (5.1.2.6) it is already real. Now for arbitrary x G X we have, using (1.1.1.3) for complex scalars, |/(x)| = ff(e-"*x) = |*(e-«x)| < ||ff||||e-"*x|| = IWIINI ■ There is a genuinely larger space associated with a complex normed space X, namely the space BLR(X, C) of real linear maps from X to C, and then the dual has a complementary subspace BL£(X, C) in this space, where we write BLtpf,Y) = {T g BLR{X,Y):T(sx) = sTx for each x G X and 5 G C} in which 5 denotes the complex conjugate of a complex number s. The mappings of (5.1.2.8) are called conjugate linear, or sesquilinear. 5.2 POLES AND POLARS There is a correspondence between subspaces of a normed space and sub- spaces of its dual. If Y is a subspace of X we shall write Y° = {fe Xf: Y C f-1!)} (5.2.0.1) and call it the annihilator of Y. If Z is a subspace of X^ we shall write Z0 = f]{r10:f€Z} (5.2.0.2)
140 5. Linear Functionals and Duality and call it the back annihilator of Z. Before recording some simple properties of these constructions we find it convenient to extend to more general subsets, in perhaps not the obvious way: 5.2.1 DEFINITION If K C X is nonempty then its polar is the subset K° = \f e Xf: sup |/(x)| < 11 C X* (5.2.1.1) I xGK ) If H C X^ is nonempty then its pole is the subset H0 = \ x e X: sup |/(x)| <l|CX (5.2.1.2) It is clear that the pole, or polar, of a linear subspace is just its back annihilator, or annihilator. Thus, we are entitled to use the same notation. The pole or polar of a nonempty set is always absolutely convex in the sense (1.4.1.4), and further K0 = (cyxk(K))° = (cl(cvxK(X)))° (5.2.1.3) where cvxK{K) = f){K':KC X'abscvx} fv^ A, , 1 (5.2.1.4) S=i y=i } is the absolutely convex hull of the set K. Thus we are entitled to restrict our discussion of poles and polars to absolutely convex sets, or to closed absolutely convex sets. In particular the annihilator, or back annihilator, of a subspace is always a closed subspace. We record the most elementary facts about poles and polars: 5.2.2 THEOREM If K, K' are nonempty subsets of X and H, H' of Xf, then K C [K°)0 and H C (H0)° (5.2.2.1) and KCK'=^> (X')° £ K° and H C H' => (H')0 C H0 (5.2.2.2) Hence also ((K°)0)° = K° and ((H0)\ = H0 (5.2.2.3)
5.3 The Hahn-Banach Theorem 141 Finally {0}° = Xf, X° = {0} and {0}0 = X (5.2.2.4) Proof: This is very straightforward, and left to the reader. Theorem 5.2.2 remains valid if the dual space X* is replaced by the space of operators BL(X,W) for a more general normed space W. The reader is entitled to a feeling of relief at (5.2.2.3), and should contemplate the deafening silence at the end of (5.2.2.4). 5.3 THE HAHN-BANACH THEOREM Every normed space X has at least one bounded linear functional—the zero mapping 0 : X —> K. As yet we have no evidence that there need be any more. Such evidence, and much more, will be furnished by the Hahn-Banach theorem: we begin with a preliminary version. 5.3.1 THEOREM If X is a real normed space and if gx : Xx —► R is bounded and real linear on a real subspace Xx C X, then for each x2 C X there is a bounded real linear extension g2 : X2 —> R of gx to the subspace X2 = Xx + Rx2, for which \\g2\\ < \\9l\\. Proof: If x2 G Xx (we have not excluded this possibility), then we can (and must) take g2 = gx> If x2 £ Xx then Xx fl Rx2 = {0} and so each point x G X2 can be written in the form x = y + sx2 with y G Xx and 5 G R. The most general linear extension ht of gx to X2 can therefore be written as ht(x) = gi(y) + ts if x = y + sx2 with y G Xx and 5 G R (5.3.1.1) for some iG R: the problem is to show that there exists t G R for which \\ht\\ < \\gi\\. It is evidently necessary that -lltfilllly + *2ll <9i{y)+t< \\9i\\\\y + *2II for each y G Xx (5.3.1.2) This is obtained from looking at (5.3.1.1) with 5 = 1. Conversely, if (5.3.1.2) holds then \ht{y + sx2)\ < ||<7i||||2/ + <sx2|| for each 5 > 0, and for each 5 < 0: thus (5.3.1.2) is necessary and sufficient, as is the condition -II01IIII2/ + Z2II -9\{y) <* < Il0illl|y + *2ll+0i(y) for each t/GXx (5.3.1.3) Since the middle expression is independent of y G Xx this is the same as -Ikilllly + Z2II - 0i(y) < * < ll^illll2 + «2II + 9i(z) for each y>z e xi (5.3.1.4)
142 5. Linear Punctionals and Duality On the face of it this is a step backwards, but actually we have moved forward: necessary and sufficient that there should exist t G R for which (5.3.1.3) holds is that -II01IIII2/ + S2II ~9i{y) < l|0illl|2 + x2|| +0i(*) for each t/,2 G Xx (5.3.1.5) Obviously (5.3.1.5) is necessary for (5.3.1.4). Conversely, if (5.3.1.5) holds then the left-hand side of (5.3.1.5) is bounded above by each representative of the right-hand side, therefore has a least upper bound, which in turn is a lower bound for the right-hand side. Thus the sup of the left-hand side is less than or equal to the inf of the right-hand side of (5.3.1.5), leaving room for t G R in the middle of (5.3.1.4). It remains to establish (5.3.1.4): but this reduces to the defining properties of the norm and of the operator bound; for each y,z G Xx 9iiz)-9i{y) =9i{*-y) < ll0illll*-y|| = Ik II IK* + *2) - iv + *2)ll (5.3.1.6) <lkilllk + *2ll + lkilll|y + *2ll This is just a rearrangement of (5.3.1.5). ■ Theorem 5.3.1 might be referred to as the "one-step real" Hahn-Banach theorem: ordinary mathematical induction converts it to a "finite-step real" Hahn-Banach theorem, which can then be made into a complex theorem using Theorem 5.1.2. To get the general Hahn-Banach theorem from Theorem 5.3.1 needs Zorn's lemma Theorem 1.11.3: 5.3.2 THEOREM If X is a real or complex normed space and f0 G [Xoy is a bounded linear functional on a subspace X0 C X, then there is an extension / G X* of f0 to X with ||/|| < ||/0||. Proof: We consider the real and the complex cases separately. If X is a real normed space and k > 0 consider the partially ordered set Vk = {/o € (Xoy-.X0 C X and ||/0|| < k} in which the partial order is extension of mappings: fx ^ /2 ^^ xx C X2 and f2(x) = /x(x) for each x G Xx (5.3.2.2) We must verify that £lk satisfies Zorn's condition (1.11.1.2): if {f\)\£\ is a totally ordered family in flk then XA = UAGAXA is a linear subspace of X-the reader should carefully check that it is closed under addition-and we may define fA : XA —> R as the common extension of all the fx to XA. (5.3.2.1)
5.4 Duality Theory 143 By (1.11.3.1) it now follows that each f0 in Qk has a maximal extension f = fi in £lk: all that remains is to see that if fx G nk is maximal then Xx = X (5.3.2.3) But this is Theorem 5.3.1: if Xx ^ X then there is /2 G £lk for which We have thus established the real case of Theorem 5.3.2: of course if f0 G {X0)^ 1S prescribed we choose £lk with k = ||/0||. For the complex case we combine the real case with Theorem 5.1.2: indeed if f0 G {Xoy for a complex linear subspace X0 of a complex normed space X then we define g0 : X —► R by means of (5.1.2.2), with f0 in place of /, use what has just been proved to determine an extension g : X —► R of g0 with the same norm as g, and finally define / by means of (5.1.2.3). ■ The reader is invited to derive a "one-step complex" Hahn-Banach theorem from Theorem 5.3.1 and Theorem 5.1.2 without Zorn's lemma: he is warned to "watch his steps." One other remark is that the entire argument is valid for seminomas: we have at no stage used the condition (1.1.1.2). 5.4 DUALITY THEORY With the aid of Theorem 5.3.2, the dual space can really get to work on a real or complex normed space. Essentially we are entitled to treat the bounded linear functional on X as a "system of coordinates" for X: while perhaps we do not have a "system," at least we have "enough." We have for example "enough" bounded functionals to distinguish between elements of the space, to calculate their norms, and to determine closed subspaces: 5.4.1 THEOREM The dual X* separates points of a normed space X, in the sense that if y ^ x in X then there is / G X^ for which f(y) ^ /(x). Further ||x|| = sup |/(x)| for each x G X (5.4.1.1) ll/ll<i and \\T*\\ = \\T\\ for each T G BL{X,Y) (5.4.1.2) Also (Y°) = Y for each closed subspace Y C X (5.4.1.3)
144 5. Linear Functionals and Duality Proof: The idea of the proof is to first find f0 on a suitable subspace X0 and then extend, using Theorem 5.3.2. Thus, if y ^ x take X0 = K(y — x) and define f0 by setting fo{t(y — x)) =t for each iGK: evidently f0 is well defined and linear, and bounded with ||/0|| = l/\\y — x\\. By Theorem 5.3.2 there is an extension / G X* of f0: evidently f(y — x) = f0(y — x) ^ 0, so that f(y) ^ f{x). Towards (5.4.1.1) it is clear that the right-hand side is always less than or equal to the left, and equality is trivial if x = 0. If i^O define f0 : X0 —► K by taking X0 = Kx, and f0(tx) = t\\x\\ for each t G K; then again f0 is well defined and linear, with this time ||/0|| = 1. Theorem 5.3.2 gives an extension / G X^ with ||/|| = ||/0|| = 1, and of course /(x) = f0(x) = ||x||. This gives equality in (5.4.1.1), and hence also in (5.4.1.2): for if x G X then ||rx||= sup |<7(rx)|<||x|| sup ||^r|| = ||x||||rt|| IWI<i llffll<i Finally, for (5.4.1.3), suppose that Y is a closed subspace of X and that x0 G X \Y: since Y is closed this implies dist(x0, Y) > 0. If we define X0 = Y+Kx0 then vectors in X0 can be written uniquely in the form y+sx0 with y G Y and 5 G K, giving a well-defined linear mapping f0 : y + sx0 —> 5 from X0 to K. We claim that /0, which is of course linear, is bounded: for if 0 ^ s G K then ll y ll ||t/ + 5x0|| = |5| - + x0 > |s|dist(x0,Y) II 5 II which makes ||/0|| < l/dist(x0, Y). Now by Theorem 5.3.2 again there is / G X^ extending f0: evidently feY° and /(x0)^0 (5.4.1.4) which proves that x0 is not in (Y°)0. ■ The first part of Theorem 5.4.1 breaks the "deafening silence" at the end of (5.2.2.4): it is now clear that (Xt)0 = {0} C X (5.4.1.5) This is in a sense the "topological" content of the "metric" (5.4.1.1). With the aid of the uniform boundedness principle, we can use the dual space to decide whether or not a subset is bounded: 5.4.2 THEOREM A subset K C X is bounded if and only if f{K) is bounded in K for each / G X^.
5.5 The Separation Theorem 145 Proof: If ||x|| < M for each iGif, then obviously |/(x)| < M||x|| for each x G K: thus, if / G X^ then f{K) is bounded in K. Conversely, recall (2.9.1.5) the operators Rx : / — /(*) : *t _^ K (5.4.2.1) If each set /(if) is bounded in K then {Rx)x^k 1S a family in BL(Xt,K) satisfying the condition (4.9.1.1) of Theorem 4.9.1, while of course X is complete by Theorem 4.2.4. Theorem 4.9.1 says that the condition (4.9.1.2) is satisfied, which gives with (5.4.1.1) inequality ||x|| = HtfJI <M for each xeK ■ (5.4.2.2) 5.5 THE SEPARATION THEOREM So far our "system of coordinates" X^ has succeeded in distinguishing between points of X, evaluating their norms, as well as in delineating closed subspaces and characterizing bounded subsets. We claim that X^ can also delineate closed absolutely convex subsets, although we have to work a little harder: 5.5.1 THEOREM If 0 ^ K C X is closed and absolutely convex then (K°)o = K. Proof: Of course K C (K°)0 by (5.2.2.1); if x0 G X \ K we must find /Glf for which |/(x0)|>sup|/| = sup|/(x)| (5.5.1.1) K xGK We begin by replacing the set K by a larger set: if 0 < 8 < dist(x0,if) then x0 £ K' = K + Disc(0 ;6) = {y + z:y€ K, \\z\\ < 6} and then define a mapping q : X —> [0, oo] by setting q{x) = inf {k > 0: x G kK'} for each x G X (5.5.1.2) Evidently q satisfies the conditions (1.1.1.3) and (1.1.1.4) of Definition 1.1.1, and is also finite since q(x) < ||x||/£ for each x G X. Thus q is a seminorm on X, sometimes called the Minkowski functional of the absolutely convex set K'. We might think of the set K' swelling up under scalar multiplication until it swallows the vector x. For us the important property of q is the implication, for each x G X, q(x) < 1 =^ x G K' =J> q(x) < 1 (5.5.1.3)
146 5. Linear Punctionals and Duality Now with X0 = Kx0 define f0 : X0 —► K by setting f0{tx0) = t for each iGK. Evidently f0 is linear, and satisfies |/o(^o)l — tf(*xo) f°r eacn * £ K. This holds for t = 1 by (5.5.1.3), and hence for all t G K. The Hahn-Banach theorem 5.3.2 holds with the seminorm q instead of the norm ||-||, which means that there is a linear extension / : X —► K of f0 which satisfies |/(x)| < q{x) < ^ for each x G X 6 Thus / G X*. Noting that (3.4.7.6) a linear functional is always open, we have finally f{x0) = 1 > sup? > sup |/| = sup \f{y + z)\ ><5' + sup|/| K' k' yGK,\\z\\<6 K with 8' > 0, giving (5.5.1.1). ■ The reader might like to note a passing resemblance between the Minkowski functional q and the "Urysohn function" KA of (3.12.3.3). If T G BL(X, Y) then it is elementary that r(X)° = (Tt)-1(0) (5.5.1.4) The null space of the dual operator is the annihilator of the range. This would be true with any normed space W in place of K. By the Hahn- Banach theorem it follows (5.4.1.3) that the closure of the range is the back annihilator of the null space of the dual: clTX= (r^-1^ (5.5.1.5) 5.5.2 THEOREM If T G BL(X, Y) then there is implication T dense <^ Tf one-one (5.5.2.1) and T almost open ^^ Tf bounded below (5.5.2.2) Proof: The implication (5.5.2.1) is part of the equality (5.5.1.5). If T is almost open then the dual J1*, like all the compositions RT = BL(T,W), is bounded below, by Theorem 3.4.4: indeed if k > 0 satisfies (3.4.1.2) then for each g G Y^ \\gT\\= sup |jrx| = Bup{|j(»)|:yecl{rx:||i||<l}}>||j||/* 11*1161
5.5 The Separation Theorem 147 Conversely, we need the separation Theorem 5.5.1: if \\g\\ < fc||<7T|| for each geY* then {Tx: \\x\\ < k}° C {y G Y: \\y\\ < l}° and hence Disc(0;l) C (Disc(0 ; l)°)0 C ((T(Disc(0 ; A;))°)0 = clTDisc(0 ; k) using the separation theorem for the last equality. ■ In the notation of (3.2.1.4), (3.2.1.5), (3.3.1.3), and (3.4.1.4) we have t e 7rrightBL(x,y) <^ r1" g 7rleftBL(yt,xt) (5.5.2.3) and T e pight BL(x,y) ^^ r1" g fleft BL(yt,xt) (5.5.2.4) When the spaces X and Y are complete then we can also say that T^ is bounded below iff T is onto, or equivalently iff T is open. In a sense Theorem 5.5.2 is useful, because it is easier to decide whether or not a mapping is one-one than to decide whether or not a mapping is dense, or onto. The dual result would appear to be not so useful: it is also not so exact. 5.5.3 THEOREM If T G BL(X,y) then there is implication T' dense => T one-one (5.5.3.1) and T* almost open => T bounded below => T^ open (5.5.3.2) Proof: If rt(yt) is a dense subspace of X* and x G T_10 then for arbitrary/ G X* /(x)=lim<7nrx = 0 n giving (5.4.1.4) x = 0, so that T is one-one: this proves (5.5.3.1). If T* is almost open, with k > 0 satisfying the analogue of (3.4.1.2) then (5.4.1.1) gives, ||x|| = sup \f{x)\<suv{\gTx\\\\g\\<k}<k\\Tx\\ for each x G X ll/ll<i which is (3.3.1.1): this is the first implication of (5.5.3.2). Conversely, if t is bounded below, with k > 0 satisfying (3.3.1.1), then if / G X1" is
148 5. Linear Punctionals and Duality- arbitrary there is a well-defined mapping g0 : TX —> K defined by setting g0(Tx) = f{x) for each x G X Evidently g0 is linear, and bounded with \\g0\\ < k\\f\\. By Theorem 5.3.2 there is an extension g G Y* of g0 with the same bound: thus f = gT and ||$|| < *||/|| which means that T satisfies the condition (3.4.1.1). ■ Since X* and Y* are both complete (Theorem 4.3.4) we already know by Theorem 4.4.3 the implication T^ almost open =>• open; it is interesting that we get the same information independently. In the notation of (3.2.1.4), (3.2.1.5), (3.3.1.3) and (3.4.1.4) we have shown that T G 7rleftBL(X,Y) => Tf G 7rrightBL(Yt,Xt) (5.5.3.3) and T G rleftBL(X,Y) <^ T G rright BL(Yt,Xt) (5.5.3.4) By the argument for the second implication of (5.5.3.2) we can identify the range of J1*: T*Y* = {fe X*:3k > 0 such |/(x)| < *||T(x)|| for each x e X} (5.5.3.5) We shall see below that the implication (5.5.3.3) cannot in general be reversed. 5.5.4 THEOREM If X and Y are complete and T G BL(X,Y) then there is implication T invertible <=> Tf invertible (5.5.4.1) Proof: If T is invertible then (5.1.1.3) shows that T* is invertible, with (Tt)-i = (T-i)t (5.5.4.2) Conversely, if T^ is invertible then it is bounded below and almost open, so that T is almost open and bounded below, therefore open and bounded below (using Theorem 4.4.3), therefore invertible. ■ If X and Y are complete then T G BL(X, Y) is proper if and only if Tt is:
5.5 The Separation Theorem 149 5.5.5 THEOREM If T G BL(X, Y) then core(rf) = (coreT)1" (5.5.5.1) If X and Y are complete then T proper <=> T'proper Proof: If Z C X is a closed subspace then it is (2.3.2.2) that [XjZ^ = ^°Clt while by the Hahn-Banach Theorem 5.3.2 we have Zf = X^/Z0 (5.5.5.4) It follows that the canonical factorization (2.3.3.2) is obtained for the dual J1* by taking duals in the canonical factorization of T; in particular (5.5.5.1) holds, and then (5.5.4.1) gives (5.5.5.2). ■ The dual of Theorem 3.5.2 holds: 5.5.6 THEOREM If T G BL(X, Y) is in the topological boundary of the bounded below operators then T is not almost open. Proof: If T is in the boundary of the bounded below operators then there is (TJ in BL(X, Y) for which \\T - Tn\\ -> 0 with each Tn bounded below, while by Theorem 3.3.3 T is not itself bounded below. By (5.1.1.1) \\T* - Tn\\ —► 0 while by (5.5.3.2) each T^ is almost open and T^ is not itself almost open. Theorem 3.5.2 now says that T^ is not bounded below, which by (the easy part of) (5.5.2.2) says that T is not almost open. ■ In the notation of (3.3.1.3) and (3.4.1.4) dfleft BL(X, Y) C fright BL(X, Y) (5.5.6.1) While the dual of Theorem 3.5.2 holds, the dual of Theorem 3.5.1 fails, as we noted in (4.7.4.2). To see that that situation actually can arise suppose Z is a Banach space and that g : Z —> K is a discontinuous linear functional. If z0 G Z satisfies g(z0) = 1 then by the Hahn-Banach Theorem 5.3.2 there is / G Z^ for which f(z0) = 1, and we now define a mapping U : /_1(0) -> X by setting U(z) = z - g{z)z0 for each z G /_1(0). The space Y of Theorem 4.7.4 is now obtained by renorming X as follows: II^MIIy = \\z\\z for each z € /_1(°) (5.5.6.2) (5.5.5.2) (5.5.5.3)
150 5. Linear Punctionals and Duality 5.6 COMPOSITION OPERATORS Theorem 5.5.2 and Theorem 5.5.3 show that the dual operator T* tells us as much about an operator T as all the operators RT = BL(T,W) put together. With the help of the Hahn-Banach theorem 5.3.2, and the "rank one" operators of (2.9.2.4), it turns out that ifW^O then the composition RT = BL(T,W) can do nearly as well. It is appropriate to review the simplest properties of the rank one operators / © y G BL(X,Y) associated with y G Y and / G X* in (2.9.2.4): 5.6.1 THEOREM If X and Y are normed spaces and y G Y and /elf then 11/0 2/11 = 112/1111/11 (5.6.1.1) In particular /©y = 0 iff either y = 0G7or/ = 0Glt. If U G BL(Y,Z) and V GBL(Z,X) then U{fOy) = f(D{Uy) and {f<Dy)V = [fV)®y = [V^f)ey (5.6.1.2) Proof: If x G X is arbitrary then ||(/ © y){x)\\ = \\f{x)y\\ = \\y\\\f{x)\ giving (5.6.1.1). Also U(f © y)(x) = U{f{x)y) = f{x)Uy and (/ © y)Vz = {fVz)y, giving (5.6.1.2). ■ Suppose now that W ^ 0 is a fixed normed space, and write LT = BL {W, T) RT = BL (T, W) Rx = BL(x,W) : BL(X,W) —► W (5.6.1.3) for each T G BL(X,Y) and each x G X, as in (2.9.2.3): then 5.6.2 THEOREM If W ^ 0 and X is a normed space then BL{X,W) separates points of X, with ||x|| = \\RX\\ for each x G X (5.6.2.1) If TGBL(X,Y) then also ||r|| = ||Lr|| = ||iEr|| (5.6.2.2) Proof: Towards (5.6.2.1) it is clear that Rx is bounded, with \\RX\\ < ||x||. Conversely, using (5.4.1.1) ||x|| = sup |/(x)| = sup |/(x)|HI= sup ||(/©w)(x)||<||Bs|| ll/ll<i ll/llll>«ll<i ll/©HI<i
5.6 Composition Operators 151 Towards (5.6.2.2), Theorem 2.9.1 gives ||Lr|| < ||T|| and \\RT\\ < \\T\\. Conversely, ||Lr||= sup ||TI7||> sup \\h®(Tx)\\ = sup ||r*||||fc|| \\U\\<1 \\hQx\\<l \\x\\\\h\\<l = sup ||Tx|| = ||n| IWI<1 and ||i2r||= sup \\VT\\> sup \\gTOw\\= sup \\w\\\\gT\\ \\v\\<i \\gQw\\<i IMIIWI<i = sup \\gT\\ = ||Tt|| = ||T|| l|y||<i using (5.4.1.2) at the end. ■ The reader will recall from the remarks following (2.9.1.3) that for W = X and W = Y we already knew one or other of the equalities (5.6.2.2). Extending Theorem 5.5.2 and Theorem 5.5.3, we have: 5.6.3 THEOREM If W ^ 0 and T G BL(X, Y) then there is implication LT dense =>- T dense <$=^ RT one-one (5.6.3.1) LT almost open => T almost open <=> RT bounded below (5.6.3.2) RT dense ==> T one-one <=> LT one-one (5.6.3.3) RT almost open => T bounded below <$=^ LT bounded below (5.6.3.4) Proof: Towards (5.6.3.1) suppose LT is dense. Then for arbitrary y € Y and h G W^ there is [Un) in BL{W,X) for which TUn ^ hey. If h ^ 0 then there is w G W with h(w) = 1, and now Txn —* y where xn = Unw, showing that T is dense. Towards the second implication it is Theorem 3.2.1 that if T is dense then RT is one-one, while if T is not dense then by (5.5.2.1) there is g G Y^ for which g ^ 0 = gT, and now if w ^ 0 in W then V = g © w gives V ^ 0 = VT, which means that RT is not one-one. Towards (5.6.3.2) suppose LT is almost open. Then there is k > 0 for which, for arbitrary t/G7 and h G W^ there is (Un) in BL(W,X) for which TUn -> h(Dy, with in addition ||Un\\ < k\\h®y\\ = A;||t/||||/i||. If, in particular, ||ty|| = 1 and h{w) = \\h\\ = 1, we have Txn —► y and ||xn|| < k\\y\\ where xn = Unw, showing that T is almost open. Towards the second implication it is Theorem 3.5.4 that if T is almost open then RT is bounded below. Conversely, if RT is bounded below there is k > 0 for which || V|| < fc|| VT|| for each V G BL(Y,W), in particular for V = g © w with w G W and
152 5. Linear Functionals and Duality g G Yf, so that ||t£;||||^|| < ib||t£;||||^T|| and hence ||^|| < k\\gT\\. Thus, T is bounded below and hence (5.5.2.2) T is almost open. Towards (5.6.3.3) suppose RT is dense: then if Tx = 0 we have, for arbitrary / G X^ and w G W a sequence (Vn) in BL(Y,W) for which f ®w = limn VnT and hence f(x)w = (/ © w)(x) = limn t^Tx = 0: thus f{x) = 0 for each / G Xt, giving (Theorem 5.4.1) x = 0. Towards the second implication, if T is one-one then by Theorem 3.2.1 LT is one-one. Conversely, if T is not one-one there is x G X with i^0 = Tx, and now if O^/iG^tandl7 = /i0xwe have TU = 0^U. Towards (5.6.3.4) suppose RT is almost open: then there is k > 0 for which if / G X^ and w G W are arbitrary there is (Vn) in BL(Y,W) for which / © w = limnVnr with ||Vn|| < fc||/©w|| = fcHI 11/11- Taking h G W^ and w G W with \\h\\ = \\w\\ = h(w) = 1 gives gnT -> / with ||<7nll — ^11/IN where <7n = hVn: since / is arbitrary it follows that T^ is almost open and hence (5.5.3.2) that T is bounded below. Towards the second implication, if T is bounded below then LT is bounded below by Theorem 3.3.4. Conversely, if T is not bounded below then there is (xn) in X for which ||xn|| = 1 and ||Txn|| -> 0, and now if h G W1" with \\h\\ = 1 then Un = h © xn gives ||l/n|| = 1 and \\TUn\\ -> 0. ■ If X, y, Z, and W are normed spaces and if T G BL(X, Y) and 5 G BL(jy, Z) are bounded operators then we may combine the left and right multiplications with the row and column operators of (2.4.0.3) and (2.4.0.4), writing row(B5,Lr) : [V,U) -+VS + TU (BL(Z,Y) x BL{W,X) —► BL{W,Y)) and col(L5,*r) : W — (SW,WT) (BL(Y,W) —► BL(y,Z) x BL(X,W)) 5.6.4 THEOREM If T G BL(X,Y) and S G BL(W,Z), then there is implication i(w{Rs,LT) dense => co\(Ls,RT) one-one =>- 5 one-one or T dense (5.6.4.1) and icw(Rs,LT) almost open => co\(Ls,RT) bdd below => 5 bdd below or T almost open
5.6 Composition Operators 153 Proof: If iow(Rs,LT) is dense, then for arbitrary R G BL(W, Y) there are sequences [Un) and (Vn) in BL{W,X) and BL(Z, Y) for which \\R-VnS- ri/n|| -> 0, so that for arbitrary W G BL(Y, W) we have || {WVn) {SW) + {WT) [UnW) - WRW\\ —► 0 as n —► oo (5.6.4.3) Taking in particular R = h® y with h G W* and t/ G Y gives || (WVn) {SW) + (WT) (17nW) - {hW) 0 (Wy) || —♦ 0 as n —► oo (5.6.4.4) Thus, if SW = 0 = WT, then the left-hand side of (5.6.4.4j^vanishes for arbitrary y G Y and arbitrary /i G W", which means that W = 0: for if f^O then by definition there is y G Y for which Wt/ 7^ 0, and by Theorem 5.4.1 there is h G W^ for which hW ^ 0. This proves the first implication of (5.6.4.1). Towards the second we claim that if co\{Ls,RT) is one-one then there is implication, for arbitrary w G W, g eY and R G BL(W, Y), gT = 0 = Sw => gRw = 0 (5.6.4.5) for apply co\{Ls,RT) to the operator W = {gRw)g © w. From (5.6.4.5) it then follows that either S is one-one or T is dense: for if Sw = 0 ^ w and gT = 0 ^ g,v/e can manufacture # = /i©t/ for which <7#iu = <7(t/)/*(w) 7^ 0. This finishes the proof of (5.6.4.1), and the arguments for (5.6.4.2) follow the same pattern. If iovt{Rs,LT) is almost open, then there is k > 0 such that for arbitrary R G BL(W, Y) we have \\R - VnS - TUn\\ -> 0 with in addition \\Un\\ < k\\R\\ and ||Vn|| < &||-R||, so that for arbitrary W G BL(Y,W) (5.6.4.3) holds with the same qualification. This time take R = hnOyn with yn G Y and hn G W"l" chosen so that l|y»ll = l=IIU and ||WVn|| > ±\\W\\ ^ __ (5.6.4.6) \\hnW\\ > $\\W\\ t/n exists by definition of ||W|| and hn by definition of ||Wt||, using (5.4.1.2). But now ||W||2 < 4||WW|| < 4\ffivnsw\\ + 4||i?rtfnw|| <4*||«||||w||(||5W|| + ||wr|| This proves the first implication of (5.6.4.2). Towards the second take W = (gRw)g © w to see that if co\{Ls,RT) is bounded below then there is
154 5. Linear Punctionals and Duality k > 0 for which, for arbitrary w G W, g G Yf and R G BL(W, Y), kfitn|<*||fi||(|W|||5ti;|| + |^r||||ti;||) (5.6.4.7) This is turn forces either S to be bounded below or T to be almost open: for, if not, then there are (wn) in W and (gn) in 7^ with ||wn|| = 1 = ||<7n|| and \\SwJ + ||^nT|| -+ 0, and then [hn) in W* and (t/J in Y with fc»W = ! = QniVn) and IKII = 1 > H^nll But nOW Rn = K® Vn Sives <7nii>n = 1 and ||5ti;n||||i2n||||^n|| + IKIHKUII^II - 0, contradicting (5.6.4.7). ■ 5.7 ENLARGEMENTS Using the Hahn-Banach theorem, we can replace each of (3.4.5.2) and (3.2.5.2) by the full analog of (3.3.5.2): 5.7.1 THEOREM If T G BL(X, Y) then there is implication Q(r) dense => T almost open => Q(T) open (5.7.1.1) Proof: If T G BL(X, Y) is not almost open then by the separation theorem (5.5.2.2) the dual operator T^ is not bounded below, and hence there is g = (gn) in Y* with ||*n|| = l and ||<7nr||-*0 (5.7.1.2) and therefore also y = (yn) in Y for which \\yj = l and \gn(yn)\ > ± (5.7.1.3) Now we claim x e U*) =► dist(y - Tx,c0(Y)) > \ (5.7.1.4) Indeed if x G ^{X), then for each n G N we have llv» - rxB|| = ||<7n||||yn - TxB|| > \gn(yn - Txn)\ > \9n(yn)\ - \\9nT\\\\xj > i - lixiLH^rii — \ (5-7-L5) It follows that if y € l^Y) is given by (5.7.1.3) then distfo.rp^X)) + c0(Y)) > i (5.7.1.6) This gives the first implication of (5.7.1.1), and the second is part of (3.4.5.2). ■
5.7 Enlargements 155 An alternative derivation of the first implication of (5.7.1.1) involves certain linear functionals on the numerical space l^ = /^(N, K): 5.7.2 THEOREM If 0 G (Z^)1" satisfies CoC^-^O) (5.7.2.1) then for each normed space X there is a bounded linear mapping <t>x : Q(Xt) ->Q(X)t given by the formula <f>*x(f+c0(Xi))(x+c0(X)) = *(/.(*.)) for each x G lOQ(X),fe l^X*) (5.7.2.2) where, as in (2.9.4.5), /.(x.) G l^ is defined by setting (/.(*.))» = /nW ^r each neM (5.7.2.3) If T G BL(X, Y) is a bounded linear mapping then there is equality Q(r)t o ft = ^x o Q(rt) (5.7.2.4) Proof: If 0 G (Jqo)* is arbitrary then <f>(f.(x.)) G K is well defined for each x G ^(-X^) and / G l^X^) and induces a bounded linear mapping 4>X{f) ■ * —■ *(/.(*.)) from /„,(*) to K (5.7.2.5) for each / G /00(X"''). If in particular 0 satisfies the condition (5.7.2.1) then <£((/. (x.)) =0 if either x G c0(X) or / G c0(Xf) (5.7.2.6) Thus the mapping <££■ is well defined by (5.7.2.2). If also T G BL(X, Y) then (5.7.2.4) follows from the associativity <f>(g.(T"x).) = <f>((g o TN).(x.)) for each x G /«,(*)>0 € U^f) ■ (5.7.2.7) If / G /00(X"'') is not in c0(Xt), so that limsupn ||/n|| > 0, then there is x G Ioq{X) for which limsupn |/n(xn)| > 0, which means that /.(*.) £c0 (5.7.2.8) Now by (5.4.1.3) there is <j> G (/00)t for which (5.7.2.1) holds, satisfying <£(/.(*.)) ^0 (5.7.2.9) It now follows easily that if Q(T)t is one-one then so is Q(Tt):
156 5. Linear Punctionals and Duality 5.7.3 THEOREM If X and Y are normed spaces and T G BL(X,Y) then Q(r)f one-one => Q(Tf) one-one (5.7.3.1) Proof: Suppose Q(Tf) : Q(Yf) -> Q(Xf) is not one-one: then there is 0£'oo(yt) for which $ o TN G c0(Xf) and (? 0 c0{Y^) (5.7.3.2) and hence also y G /^ (y) for which 0.(y.)gco (5.7.3.3) As in (5.7.2.9) there is now <j> G (/00)t satisfying (5.7.2.1) for which *((*(y.))^0 (5.7.3.4) But now Q(r)t(^(^ + c0(yt)) = OGQ(x)t and (5.7.3.5) O^0y(^ + co(^t))eQ(y)t Thus Q(r)t is not one-one. ■ An alternative derivation of the first implication of (5.7.1.1) now follows: by (5.5.2.1) and (5.7.3.1) we have Q(r) dense => Q(T)f one-one => Q(Tf) one-one (5.7.3.6) and then by (3.3.5.2) and (5.5.2.2) we have Q(Tf) one-one => Tf bounded below => T almost open (5.7.3.7) 5.8 SEQUENCE AND FUNCTION SPACES If X = K is the scalar field then obviously the dual space is given by K: this is just (2.9.2.1). More generally for each ?iGN the dual of Kn is isomorphic to Kn: this is (2.4.1.2). Generalizing still further, we can show that the duals of the sequence spaces c0, lx and l2 are given by /1? l^, l2 respectively. For roughly the same work, we can show much more. Suppose that H is a nonempty set, and W a normed space: then if x : n -► W and y : Q -► W^ are arbitrary we shall define VA(X) = Y,yt(xt) e K if Y,yt(xt) exists (5.8.0.1) ten ten
5.8 Sequence and Function Spaces 157 For example if either x or y is "terminating" in the sense of (1.9.0.4) then t/A (x) is certainly well defined. 5.8.1 THEOREM The mapping y -> t/A : Y -> X1" is well defined and an isometric isomorphism if {X,Y) is (c0(Q,W), /1(Q,Wt)), or (/1(Q,W), /oo(n,wt)),or(/2(n,w),/2(n,wt)). Proof: If either x or t/ are "terminating" then |y* (x)| < ]T \\*t\\\\Vt\\ < mindlxlLHyJl!, Hxll^ML, ||x||2||y||2) (5.8.1.1) ten Thus in all three cases the mapping y —► t/A : Y —► X^ is well defined and linear, with bound < 1. We must show that it is in each case isometric, and onto. For isometry llvll < llvl (s-8-1-2) we apply the mapping t/A to a shrewdly chosen family of elements x G X, usually depending on y. For example if X = /1(Q,ty) take x = 6tQq (5.8.1.3) as in (1.8.0.6), with t G H and w eW: evidently Hxll! = \\w\\ and t/A(x) = t/» (5.8.1.4) If e > 0 is arbitrary and (GHwe can arrange yt{w) = \\yt\\ and \\w\\ <l + e (5.8.1.5) It now follows that for each t WvtW = ytH = yA(*) < Ils/IIMI = Il3/Alllhll < llvAll(i + *) giving (5.8.1.2) since e > 0 is arbitrary. If instead X = c0(Q, W), or alternatively /^(n, W), take x = ^2 sgnyt{zt)6t © zt (5.8.1.6) tGK with a finite subset K C ft and a mapping z : Q —► W with zt = 0 if t g K chosen in such a way that w = zt satisfies (5.8.1.5) for each t G K. In (5.8.1.6) we are writing sgn(0) = 0 and sgn(re^) = eid if r > 0 and 6 G R (5.8.1.7) so that sgn(re^) = e~id.
158 5. Linear Punctionals and Duality Evidently x G c00(Q,W) C c0(Q,W) and satisfies Iklloo < IklL < 1 + s and yA(x) = £ Vt(*t) = E Wl C5-8-1-8) Allowing K -> n gives 112/111 < ||2/A||(1 + e) and hence again (5.8.1.2). If instead X = l2{n,W) take x=Y,yt(zt)St®zt (5.8.1.9) with K and 2 as before, giving / \ 1/2 INl2<(EW2) (1 + &) and yAW = EW2 (5-8-1-10) Once again (5.8.1.2) follows. Finally we must see that in each case the mapping y —► t/A is onto. Thus if / G X* is arbitrary we are looking for a sequence y = A/ G 7 for which t/A = /. Since it is necessary that t/A(x) = f(x) for each x = 6t © W from (5.8.1.3) we must take [Af)t(w) = f(St © w) for each t G H and each w eW (5.8.1.11) Thus, to finish the proof we must verify, in each case, that AfeY and (A/)A=/ for each / G X1" (5.8.1.12) The first inclusion follows from the proof of the inequality (5.8.1.2); towards the second equality it is clear that / and (A/)A agree on the subspace c00(Q,W) C X in each case, and hence we have only to confirm that in each case c00(n,W) is a dense subspace, in the appropriate norm, of the space X. m Theorem 5.8.1 tells us something about the dual of the space ^(H, W) for a nonempty set Q and a normed space W: there is isometric embedding y — yA : 1,(0^) —♦ l^W)* (5.8.1.13) given by the formula (5.8.0.1). Indeed it is only at the last ditch that we fail to identify the dual of ^(r^W) here: the terminating sequences c00(Q, W) are not in general a dense subspace of l^ (Q, W), and so / G l^ (H, W) t need not be uniquely determined by the associated sequence determine the dual of l^(fi, W) we begin by extending (5.8.1.13) to additive
5.8 Sequence and Function Spaces 159 set functions. Write /xA(x) = I x dfj, = lim ^2 n(K)(x(tK)) for each /x G Mx{n9W^) (5.8.1.14) meaning that if e > 0 there is a partition Ke of Q for which Uxdp- £ m(#)(*(**)) KeK if K€ C /C and lKGX for each KgK. < e (5.8.1.15) 5.8.2 THEOREM There is isometric isomorphism fi -► /xA : Mx (H, K) = /oo(n,K)t. Proof: When W and hence also W^ is the scalar field K then the "Riemann sums" of (5.8.1.12) becomes a little simpler: ■ 0 under refinement of partition K (5.8.2.1) By the completeness of the field K the limit exists whenever Wx]]^ and \\fi\\i are both finite: |MA(x)| < IklLIMIi (5-8.2.2) If K is a finite partition of H and we take x= ^2sgRfi{K)6K (5.8.2.3) writing now 6K for the characteristic function of the set K, then ||x|L < 1 and M*(x) = £ \„(K)\ (5.8.2.4) Thus IHi < llMll (5.8.2.5) Finally, suppose / is a bounded linear functional on the space ^(H, K) and define (Af)(K) = f(6K) for each K C Q (5.8.2.6)
160 5. Linear Functionals and Duality Then evidently \i = A/ satisfies the conditions (1.8.2.2), and we have (*/)*(*) = f{x) if x = £ *(**)«* (5.8.2.7) The proof is completed by showing that such "simple functions" form a dense subspace of 1^(0). In the real case K = R suppose x G ^(H) and e > 0, and take K€ = {Hj:j = l,2,...,n} withify = {ten:cj+1 <x{t) <Cj}{j = l,2,...,n) (5.8.2.8) with b — a . . . c=a+ ——(j = 1,2,...,n) where a < — |lxlloo5& > H^Hoo and n > b — a (5.8.2.9) This will give KeK < e whenever K£ C K (5.8.2.10) The reader is invited to make up some suitable sets H ■ in the complex case K = C. ■ 5.9 THE SECOND DUAL When we call X^ the "dual" of X there is a suggestion that X is the dual of X^. It is not as simple as that. Recall the evaluation maps Rx:f —► f{x) from X to K of (2.9.2.3), with W = K. (5.9.0.1) 5.9.1 THEOREM If X is a normed space then, under the evaluation map x —► Rx, there is isometric isomorphism X = XA, where XA is a subspace ofXtt = (xt)t. Proof: This is just (5.4.1.1) ■ In general, the subspace XA C X^ need not be the whole space X^, even if X is complete. For example if X = c0, then X^ = l^, with a
5.9 The Second Dual 161 correspondence zv <-> z given by the formula zv(yA) = yA(z) for each y G lx and each z G l^ (5.9.1.1) Then it is clear that if z G l^ there is implication zv G (c0)A = {Rx: x G c0} «=> zGc0 (5.9.1.2) Theorem 5.9.1 offers an alternative candidate for a "completion" X~ ofX: X~ =clXA CXft (5.9.1.3) It is much easier, using Theorem 4.3.4, to see that clXA is complete than to see that the space c(X)/c0(X) of Theorem 4.5.3 is. On the other hand, to show that the embedding J : x —> R from X to X~ is isometric uses the Hahn-Banach Theorem 5.3.2. If X and Y are normed spaces and T G BL(X, Y), then T^ = (Jt)t : X^ —► yl"l" is, through the medium of Theorem 5.9.1, just an extension of r. The situation of (5.9.1.1), in which XA is not even a dense subspace of X, shows that the "dual" of the bipolar result (5.4.1.3) is liable to fail: indeed (XA)0 = {0} C Xf and hence ((XA)0)° = Xft (5.9.1.4) We can also show that the implication (5.6.2.1) of Theorem 5.6.2 cannot in general be reversed: for if X = lx, Y = c0 and T = J : lx —► c0 is the natural injection then, through the medium of the isomorphisms of Theorem 5.8.1, T^ : Y^ —*■ X^ is represented by the natural injection J' : lx —*■ c0. Since J1 is defined on the same set as J, and takes the same values at each point, the distinction between J and J' is, to say the least, elusive; but of course J has dense range, while J1 has not. Thus, T is one-one while T^ is not dense. Sometimes the space X does furnish the dual of its dual X^: 5.9.2 DEFINITION A normed space X is called reflexive if there is equality {Rx:xeX} = X^ (5.9.2.1) For example the space K, and Kn for each n G N, are reflexive, as is the space l2. Curiously enough the condition X = Xft (5.9.2.2) is not sufficient for (5.9.2.1), although the known examples are difficult to
162 5. Linear Functionals and Duality- understand. If X is reflexive then so are its closed subspaces Z C X and their quotients X/Z: 5.9.3 THEOREM If X is a Banach space and Z C X is a closed sub- space, then X reflexive <£=> Z reflexive and X/Z reflexive (5.9.3.1) Proof: Two applications of (5.6.4.3) and (5.6.4.4) give isomorphism {X/Z)" S X^/Z00 and Z^ S Z00 (5.9.3.2) in which the correspondences are those of (2.3.2.2). If we write Y = X W = Z°° and T(x) = Rx e Xft for each x G X (5.9.3.3) then T(Z) C W and hence Theorem 3.11.1 applies. Further all three mappings T, Tz, and T/Z are isometric, in particular one-one. Thus, (3.11.1.4) gives implication X/Z reflexive, Z reflexive => X reflexive => X/Z reflexive (5.9.3.4) while (3.11.2.4) gives implication X reflexive =>■ Z reflexive ■ (5.9.3.5) A space X is reflexive if and only if its dual is: 5.9.4 THEOREM If X is a Banach space then X reflexive <£=*> X* reflexive (5.9.4.1) whether or not X is reflexive, (Xf)A is a complemented subspace of (Xf)ft (5.9.4.2) Proof: Towards (5.9.4.1) we have X reflexive =>- X^ reflexive =>- X" reflexive =>- X reflexive (5.9.4.3) the first implication is obvious after a moment's reflection, the second follows from the first, while the third is (5.9.3.5). Towards (5.9.4.2) write Rx for the evaluation map form X to X", and observe (J2x)t • Rxt = Ixt (5.9.4.4)
5.10 An Uncomplemented Subspace 163 is the identity on X\ and hence Rx^ • (RxV : X^ ~~*" X^ 1S idempotent, with range (Xf)A. ■ 5.10 AN UNCOMPLEMENTED SUBSPACE We have not yet had evidence that there is such a thing as an uncomplemented subspace. In fact there is one under our nose: c0 is an uncomplemented subspace of l^. We begin with an auxiliary result: 5.10.1 THEOREM If fn G (J^)1" for each n G N then there is implication oo oo «o £ D fn » => f) f-^O) g c0 (5.10.1.1) n=l n=l Proof: We shall construct a family (xa)A€r in l^ and prove that if c0 C n^=i/rT1(°) then for at least one A G R the function xA is in n^/"1^), but not in c0. We begin with an enumeration of the rationals, in the sense of a mapping q : N —► R which is one-one, and onto the subset of rationals Q C R, and next associate with each point A a sequence (An) of distinct rationals converging to A: now we set Kx = {neN:qne {Am:m G N}} for each A G R (5.10.1.2) and finally take xA to be the characteristic function of the set Kx: xx{n) = lifneKx and xx{n) = 0 if n G N \ Kx (5.10.1.3) Observe that each Kx is infinite, while the intersection of any pair of distinct Kx is finite. Thus for each A,/x G R dist(xA,c0) = limsupxA(ra) = 1 and A^/i =>■ xAxM G c00 C c0 n (5.10.1.4) Suppose now g G [1^)^ and-write Q(9)={\eR:g(xx)?0} = U~=1Qn(g) where Qn{g) = {A G R: \g{xx)\ > 1/n} (5.10.1.5) We claim c0 G <7_1(0) =>- cardinal Qn{g) < ||<7|| for each nGN =>- Q(^) countable (5.10.1.6) Indeed if H is an arbitrary (finite) subset of Qn{g) and ZH=Y, sgn<7(*A)xA = Y, (|0(xa)I/<7(*a))*A (5.10.1.7)
164 5. Linear Functionals and Duality and also z'H G l^ is defined by setting 4W = (^> «i*e*-e*J-m (51018) [ 0 else so that z'H (n) = 0 unless n is in one and only one of the sets {Kx)XeH, then we have z'wW = $hM unless n G K\ n if „ with A ^ a ^ A M (5.10.1.9) =>*2r-**e0 '(0) and now Ikirlloo ^ ! and 9{zh) = 9(zh) = \9(zh)\ > ~ cardinal (H) (5.10.1.10) giving (5.10.1.6). From (5.10.1.6) the implication (5.10.1.1) follows: for if c0 C n^f-1^) then U™=1Q(fn) is countable and hence oo 3AGR\ [J Q(/J (5.10.1.11) n=l ■^oo f-l( But for such A we have xx G H^L1/n 1(0), by (5.10.1.5), and xx & c0 by (5.10.1.4). ■ Theorem 5.10.1 means that c0 cannot be a complemented subspace of/„: 5.10.2 THEOREM c0 is not complemented in Z^, and hence is not a dual space. Proof: Suppose there were E = E2 G BL^,^) for which ^(Z^) = c0, then we would have oo co=fl/n1(0) (5.10.2.1) n=l where for each nGN fn{x) = {{I - E)x)n for each x e l^ (5.10.2.2) But this contradicts (5.10.1.1). Thus, c0 is not complemented, and hence by (5.9.4.2) cannot be the dual of any Banach space X. ■
5.11 Extreme Points 165 5.11 EXTREME POINTS The "extreme points" of a convex set are like the vertices of a triangle: 5.11.1 DEFINITION If K = cvx(K) is a nonempty convex subset of a real linear space X then a face of K is a nonempty convex subset H C K with a convex complement in K: H G Face(X) <=> 0 ^ H = cvx{H) C X and X \ H = cvx{K \ H) (5.11.1.1) An extreme point of K is a face consisting of a single point: Extreme(X) = {x e X: {x} G Face(X)} (5.11.1.2) Here we are writing cvx{K) = lY^tjXjine N,*G [0,l]n,^^- = l,xeKn\ (5.11.1.3) S=i 3=1 ' for the set of "convex combinations" of elements of K. Evidently H G Face(K),#' G Face(if) =>► H' G Face(K) (5.11.1.4) and {#y:i G J} C Face(K), f] H) ^ 0 => f] Hj G Face(K) (5.11.1.5) If a convex combination of elements of K is contained in a face, then so must some of the vectors: if H G Face(if) and t G [0, l]n with 2y=i *y = * then n x G iT\ 5^*yxy Gff,^0=^ xt- G if (5.11.1.6) Linear functional cut faces off convex sets: 5.11.2 THEOREM If K C X is a nonempty compact convex subset of a normed space X and if / G X*, then k = sup {Re f{x):x e K} =^ Kf = {x e K: Re /(x) = A;} G Face(X) (5.11.2.1)
166 5. Linear Functionals and Duality Proof: By compactness Kj is nonempty, closed, and hence compact, and it is easy to see that K* is also convex. If 0 < t < 1 and x0,x1 G K satisfy (1 — t)x0 + tx1 G Kj, then (1 - t) Re f(x0) + t Re /(xj = Re /((l - *)x0 + txx) = k and since Re/(xy) < k this forces Re/(x0) = Re/(x1) = k and hence x0,x1eKf. m The Krein-Milman theorem guarantees that compact convex sets have lots of extreme points: 5.11.3 THEOREM If K C X is a nonempty compact convex subset of a normed space X, then K = cl(cvx(Extreme(i:C))) (5.11.3.1) Proof: We begin, using Zorn's lemma and the Hahn-Banach theorem, by showing Extreme^) ^ 0 (5.11.3.2) Certainly the set of closed faces of K, partially ordered by downward inclusion, satisfies Zorn's condition (1.11.1.2), using (5.11.1.5) and compactness. There is, therefore, H G Face(if) which is minimal with respect to set inclusion. If such a minimal face H is not an extreme point, containing distinct points x0, x1? then by the Hahn-Banach Theorem 5.3.2 (for real space) there will be (Theorem 5.4.1) /GXf for which Re/(x0) ^ Re/^). But since both x0, xx cannot be in the set Hf of (5.1.1.2.1) this would contradict the minimality of H. We have proved (5.11.3.2): for (5.11.3.1) we will need the separation theorem. Indeed if there exists x0 G K \ cl(cvx(Extreme(K))), then by Theorem 5.5.1 there is / G X1" and k1 G R for which Re/(x0) > A;'Re/(x) for each x G cl(cvx(Extreme(K))) (5.11.3.3) Evidently the set Kf of (5.11.2.1) is disjoint from Extreme(if), but by (5.11.3.2) there are extreme points in Kf, and by (5.11.1.4) these must lie in Extreme(if). This contradiction means that x0 cannot exist, and proves (5.11.3.1). ■ 5.12 DIFFERENTIAL CALCULUS The differential calculus for mappings from the real numbers into a normed space is partly elementary, partly relies on the Hahn-Banach theorem, and
5.12 Differential Calculus 167 partly needs the space X to be complete. The derivative is just the limit of the obvious difference quotient: 5.12.1 DEFINITION If X is a normed space and x : J -> X is denned on an interval JCR and 5 G J is an interior point then the derivative of x at 5 is the limit (*®) =«'W = IimfWzfW (5.12.L1) V * )t=. K ' ^ t-s K If x'(s) exists for each 5 G J, we shall say that x is differentiable on J. More generally, if 5 is an end-point of J, we may speak of a "left-hand" or "right-hand" derivative of x at s. The elementary rules for differentiating conbinations of functions need neither completeness nor the Hahn-Banach theorem: 5.12.2 THEOREM If x : J -> X is differentiable at 5 G J, then x is continuous at 5. If also y : J —► X and T : J ^ BL(X,Y) are differentiable at 5, and ^ : J' —> J is differentiable at 5' G J' with <£(s') = s, then 3(ax + 0y)'{s) = ax'(s) + /?t/(s) for each a,/? G K (5.12.2.1) 3 (^^)t_3 = r'M*M + T(sW(s) (5.12.2.2) 3(x o 4>y(s') = <f>'{s')x'{<t>{s')) = ^'(« VM (5.12.2.3) Proof: Examine, respectively, the difference quotients ax(t) + 0y(t) - ax(s) - 0y(s) = ^x(t) - x(s) + ^yffl-yfc) g Z 5 Z 5 Z 5 T(tHt)-T(s)x{s) = T{t)-T[s) + T{s) «(«)-«(«) £ — 5 £ — 5 t — S remembering here that x(t) is a continuous function of t at £ = 5, and *' - 5' t'- s' <j>{t') - <j>{s') where, as is customary when X = R, we interpret (5.12.2.6) x{*m~-x$)]) = AHs')] ifm = Hs']and' * ' " (5.12.2.7)
168 5. Linear Functionals and Duality For example the derivative of a constant is 0: if a, b G X then — (ta + b) ) = a for each s G R (5.12.2.8) dt J t=a This gives an interesting special case of (5.12.2.2): if T G BL(X, Y) then 4(T(«(*)))=r (£:(*)) (5.12.2.9) Two other cases are of interest: if T(t) is scalar multiplication by <j>{t) then -^(t)x(t)) = <f>'(t)x(t) + 4>(t)x'(t) (5.12.2.10) If instead T : J -> BL(X, Y) and S : J ->■ BL(F, Z) are differentiate, then 4-AS{t)T{t)) = S'{t)T(t) + S(t)T'(t) (5.12.2.11) at Finally, if x : J —► A and y : J —> A are different iable for a normed algebra A then -^(s(t)y(O) = *'(')</(') + *(')</'(') (5.12.2.12) If T : J —> BL(X, Y) is different iable at 5 G if, and also invertible near 5 G J and if we know that r(£)_1 is a different iable function of £ at t = s, then it is clear from (5.12.2.11) what the derivative of T^)-1 must be. We can of course prove more: 5.12.3 THEOREM If T : J -> BL(X,Y) is different iable at 5 G J, and T(t) is invertible for each t G J sufficiently close to 5, then T_1 is different iable at 5, with (^§-)t_3 = -rW^wrw-1 (5.12.3.1) Proof: By the first part of Theorem 5.12.2 together with Theorem 3.1.4, the inverse r(£)_1 is a continuous function of t. Now examine the difference quotient ^ = .rMr(,rI . (5.12.3.2)
5.12 Differential Calculus 169 When X and Y are complete, so that BL_1(X, Y) is an open set, then the invertibility of T(t) for t sufficiently close to 5 G J follows automatically from the invertibility of T(s). If BL-1 (X, Y) is not an open set then we can sustain a version of Theorem 5.12.3 in which the differentiability of r(£)_1 is analogous to the "left" or "right"-hand differentiablility above. As we observed in (5.12.2.8), the derivative of a constant is 0. For the converse we need the Hahn-Banach theorem: 5.12.4 THEOREM If x : J -> X is differentiate on an interval J G R and [a,P] C J then \\x{p)-x{a)\\<(p-a) sup ||x'(*)|| < (/? - <*)sup ||*'(-)|| (5.12.4.1) Ot<t<0 J In particular there is implication x'(.) = 0 on J =>■ x(-) constant onJ (5.12.4.2) Proof: If / G X^ = BLR(X, R) is an arbitrary (real) linear functional on X then by (5.12.2.9) the numerical function / o x is differentiable on J, with derivative (/ o x)' = / o x', and by the numerical mean value theorem there is inequality |/(*(/?)) - /(x(a))| < (/? - a) sup |/(x'(t))| < (/? - o)||/|| sup ||x'(.)|| ot<t<(3 [a,/?] (5.12.4.3) This together with (5.4.1.1) gives (5.12.4.1), and hence (5.12.4.2). ■ If the normed space X is complete, then every continuous mapping x : J —► X can be represented as a derivative: 5.12.5 THEOREM If x : J —► X is a continuous mapping from an interval JCR into a Banach space X, then for each 5 G J and each a G X there is a unique mapping y : J —► X for which t/(5) = a and t/(*) = x(t) for each * G J (5.12.5.1) Proof: The uniqueness of y : J —► X satisfying (5.12.5.1) follows from (5.12.4.2). The existence of y is through the medium of the integral (4.2.2.5), interpreted as a limit of Riemann sums: for each t G J y(t) =a+ [ x{t') dt' = a + lim I £(tj. - t'^Jxpj) : t' —+ [s,t] I (5.12.5.2)
170 5. Linear Functionals and Duality where t1 = (*o,*i,... 9tfn) is a partition of [s,t] and t'j G W_i,*y] for each j. The continuity of x ensures that the Riemann sums are "Cauchy" on the space of partitions of the subinterval [5, t] C J, and Theorem 4.2.2 then ensures that the limit exists in X. To see that the derivative of y is x we examine the difference quotient — f x(t') dt'-x(s)\\ = — [(x(t')-x(s)) dt'\\ <sup||x(.)-x(5)|| ~SJ II 11*-s J II [«,*] 3 3 (5.12.5.3) and the right-hand side tends to 0 as t — s tends to 0, while of course the equality and the inequality follow from the corresponding assertions for Riemann sums. ■ If x : J —► X is continuous then generally I x(t) dt\\ < f \\x(t)\\ dt<{/3- a) sup ||x(.)|| if [a,/?] C J \\J II J [a,/?] (5.12.5.4) and P / x(t) dt e (p-a)c\cvx{x{t):a<t < (3} (5.12.5.5) When x = y' is continuous and is the derivative of y then also, whether or not X is complete, P / y'(i) dt = y{$) - y{a) for each [a,/?] C J (5.12.5.6) We conclude by a discussion of power series. We shall write z : R —► R for the function defined by setting and occasionally write z(t) = t for each iGR dx . x'(t) = —(t) for each t G J dz (5.12.5.7) (5.12.5.8) when x : J —► X is differentiate.
5.12 Differential Calculus 171 5.12.6 THEOREM If X is complete and k > 0, and if x = (xn) is a sequence in X for which oo E KH*B < °° (5-12-6-1) n=0 then the function oo ^2 zTlxn = ] " M[ —► oo (5.12.6.2) n=0 is bounded and continuous, with derivative , oo oo — E znxn = £>+ l)znxn+1 e C^G - k,k[,X) (5.12.6.3) n=0 n=0 Proof: If (5.12.6.1) holds then the sequence n (£*%)n€N € 0(^(1 ~k,k[,X)) (5.12.6.4) r=0 is Cauchy in the space C^Q — k,k[,X), which is complete by Theorem 4.3.3. This gives the first part of the theorem. Towards the second, the linear operator J0 : CTO(] - k,k\,X) —> CU] - k,k[,X) (5.12.6.5) defined by setting 3 (l0y) (s) = / y(0 * for each«e ] - M[,y e £«>(] - *, *[,x) 0 (5.12.6.6) is well-defined, linear, and bounded. It follows that, if — k < s < k, t oo oo 1 °° 1 / E <xdt = E ^Tsn+1^ = E -•"*-! (5-12-6-7) q n=0 n=0 n=l and hence, by (5.12.5.3), E *"*» = Tz E ^2"x»-i (5.12.6.8) n=0 n=l To deduce (5.12.6.3) we go back to (5.12.6.1), and claim that if (5.12.6.1)
172 5. Linear Functionals and Duality holds then oo 0 < k' < k => J2 nWxn\\{k')n < °° (5.12.6.9) n=0 since eventually n{k')n < kn. We have therefore proved the set membership claim at the end of (5.12.6.3). This means that (5.12.6.8) holds with ((n + l)xn+1) in place of (xn), which gives the equality at the beginning of (5.12.6.3). ■
6 Finite Dimensional Spaces and Compactness If bounded linear operators between normed spaces are to be thought of as a generalization of finite dimensional linear algebra, the reader is entitled to wonder why "boundedness" and "continuity" seem to be missing from matrix theory. 6.1 LINEAR DEPENDENCE AND INDEPENDENCE Suppose X is a linear space over the field K. Then a family x = {xt)ten in X is said to be linearly independent if there is implication, for arbitrary finite tfCH and arbitrary 5 = {st)ten, in K, ^2 stxt = ° e x => st = ° for each t e n' (6.1.0.1) ten' Otherwise it is called linearly dependent. The family {xt)ten is said to be linearly generating, or spanning, if for each y G X there is finite Q' G H and ist)ten' m K for which y=Yl stxt (6.1.0.2) ten' If x = [xt) is both linearly independent and linearly generating then it is called a Hamel basis for X. If y = (t/JieA 1S an extension of x = {xt)ten, in the sense that Q C A, and xt = yt for each t G H, then there is implication y linearly independent =£► x linearly independent (6.1.0.3) and x linearly generating =$■ y linearly generating (6.1.0.4) 173
174 6. Finite Dimensional Spaces and Compactness Also, if x = {xt)ten is linearly dependent, and Q has more than one element, then for at least one point t G H, xte ^2 Kxt' (6.1.0.5) tjtt'en When Q is a subset of X and xt—t for each iGfl, then the family (xt)ten reduces to the set Q. Partially ordered by inclusion, it is easy to check that the linearly independent subsets of X satisfy Zorn's condition (1.11.1.2), and also that a linearly independent subset of X is maximal if and only if it is linearly generating. Thus every linearly independent subset of X can be extended to a Hamel basis. If Q = {1,2,..., 7i} is finite then, making an amalgam of the notations (2.4.0.4) and (2.9.2.2), we shall write, for each x G Xn, row(Lx)(s) = s1x1 + 52x2 H h snxn G X for each 5 G Kn (6.1.0.6) Now evidently x linearly independent <£=$> row(Lx) one-one (6.1.0.7) and x linearly generating <£=$> row(Lx) onto (6.1.0.8) If there is n G N for which X has a basis x G Xn then X is said to be finite dimensional. It is familiar that the number n is uniquely determined: 6.1.1 THEOREM If x G Xn is linearly independent and y G Xm is linearly generating then n<m (6.1.1.1) and there is a basis z G Xp for which {x:j = l,2,...,n} C{2.:j = l,2,...,p} (6.1.1.2) C {Xj:j = 1,2,... ,n} U {y-.j = 1,2,... ,m} Proof: The argument is known as the Steinitz replacement process. We begin with a deletion process: if y G Xm is a sequence for which m > 2 and y is linearly dependent (6.1.1.3) then we will write 6{y) G Xm_1 for the sequence obtained by deleting the first term y • that is a linear combination of its predecessors. If yx =0 we
6.1 Linear Dependence and Independence 175 will delete y1. Formally, *(v) = (Vi,"-,*y,-,Vm) (6-1.1.4) where j = 1 if yx = 0 and j = min I k: yk G ^ Kt/- I ify1^0 (6.1.1.5) Evidently m > 2 and y linearly generating =>- 6{y) linearly generating (6.1.1.6) By repeating the operation 6 we either reach a single element 0 or a basis for X, assuming we start with a generating sequence. Thus in a sense (6.1.1.2) holds if n = 0. If x G Xn is linearly independent and y G Xm is linearly generating, then the sequence (xn,yi)...)3/m)eXm+1 (6.1.1.7) is both linearly generating and linearly dependent. ^n,y) = {xn,y\,...,ylm)eXm (6.1.1.8) is still linearly generating, and of course the deletion process cannot affect the first term xn. Similarly '(*»-i>*(*»>v)) = (*»-i> *».»!> •••.»«) e*m (6.1.1.9) is linearly generating, and neither of its first two terms can be affected by deletion. Eventually we reach a sequence (x1,x2,...,xn,...,y^)GXm (6.1.1.10) which is linearly generating, and may or may not be linearly independent; the notation should not disguise the possibility that there are no t/'s in the sequence (6.1.1.10). The inequality (6.1.1.1) is clear from (6.1.1.10). Of course the proof of (6.1.1.1) consists in the fact that if x G Xn is linearly independent then there can be no obstacle to the process of passing from (6.1.1.7) to (6.1.1.8). If the sequence (6.1.1.10) is linearly independent then we may take it to be z G Xp for (6.1.1.2); if not then we apply 6 repeatedly until we reach a linearly independent z G Xp. ■ If x G Xn and y G Xm are both bases for X then (6.1.1.1) tells us n < m < n => m = n (6.1.1.11)
176 6. Finite Dimensional Spaces and Compactness We are hence justified in calling it the dimension of X. If X = 0 we shall count it as finite dimensional, with dimension 0. If / G L(X, K)n is a linearly independent sequence of linear functionals on X then we can find a sort of "dual" sequence in X: 6.1.2 THEOREM If / = (/l5/2,. • • ,/J is linearly independent in L(X,K) then there is x G Xn for which /t(x) = 6- for each i,j G {1,2,... ,n} (6.1.2.1) and n n ifge L(X, K) satisfies f) /r1 (0) C ^(O) then g G J^ Kfj t6-1-2-2) 3=1 3=1 Proof: We use induction on n. The trick is to prove both at once. We begin by showing that (6.1.2.2) follows from (6.1.2.1): for if E = E1+E2 + --- + En with Ej = fj © xj {j = 1,2,... ,n) (6.1.2.3) using the notation (2.9.2.4), then n E = E2 and jE?_1(0) = f) /"^O) (6.1.2.4) 3 = 1 Note that EiEj = S^Ej for each ij G {1,2,... ,n}. Now if T : X -> y is arbitrary then n n T_1(0) D f| /ri(o) = (/- £)(X) ^=> T = T£ = £ fj ©Txy (6.1.2.5) Applying this to T = g G L(X,K) then gives (6.1.2.2). To see that (6.1.2.1) holds if n = 1 suppose / = fx ^ 0, so that there is x[ G X for which fi{x[) ^ 0, and then take xx = x^/f^x^). If finally (6.1.2.1) and (6.1.2.2) hold for n = k, we must show that (6.1.2.1) holds with n = k + 1. Thus suppose that / = (/i,/25- • • ifk>fk+i) is linearly independent in L(X, K), and apply (6.1.2.l)n=fc to find (x^x^, ..., x'k) G X* for which fi(x'j) = *»y for each *>i ^ (M, • • • >*} (6.1.2.6) The linear independence of /, together with (6.1.2.2)n=fc, implies that /fc+i(0) does no^ C°ntain the intersection of the remaining /T1(0): thus
6.1 Linear Dependence and Independence 177 there is xjfc+1 G X for which 4+1 (4+i) ^ 0 = /y(4+1) for each j G {1,2,..., A;} (6.1.2.7) Now, for (6.1.2.1), define [x1,x2,...,xk,xk+1) by taking xj = x'j + six'k+iU = 1,2,...,A;) and **+i = **+i4+i (6.1.2.8) where the scalars Sj,sk+i are chosen in such a way that 4+iK) + ^4+i(4+i)=0 (j = l,2,...,fc) and (6.1.2.9) 5fc+i/fc+i(xfc+i) =1 ■ For a sort of dual to Theorem 6.1.2, say that the subspace Y C L(X, K) separates points of X iff, for each x, x' G X, t/(x') = t/(x) for each y eY => x' = x (6.1.2.10) For example, if Y = X^ then (6.1.2.10) is part of Theorem 5.4.1, and if yci^, then necessary and sufficient for (6.1.2.10) is Y0 = {0} (6.1.2.11) 6.1.3 THEOREM If Y G L(X, K) is a subspace which separates points of X and if x G Xn is linearly independent then there is y G Yn for which Viixj) = Sij for each *>y G {l,2,...,n} (6.1.3.1) Proof: If 7i = 1 suppose xx ^ 0 and find ^67 for which 21(x1) 7^ 0 and put yx = z1/z1(x1). If Theorem 6.1.3 holds for n = k, and if y G Xfc+1 is linearly independent, then there is z G Yk for which *;(*;) = *»i for each ^J e {1,2,...,/;} (6.1.3.2) Then by linear independence we have k and then, since Y separates points of X, an element zk+1 G Y for which k zk+i ixk+i ~ Yl ZJ (xk+i)xj) = 1 (6.1.3.4)
178 6. Finite Dimensional Spaces and Compactness Now put y*+i = **+i and y3-= z3-- z3ixk+i)zk+i (6 13 5) for each j G {1,2,... ,k} m By Theorem 5.4.1, (6.1.2.1) is a consequence of (6.1.3.1). Of course (6.1.2.1) was proved without the use of the Hahn-Banach Theorem 5.3.1. 6.2 FINITE DIMENSIONAL SPACES If X is a finite dimensional linear space then it can be normed, and all possible norms give the same topology. We begin by applying one of our "Riesz lemmas": 6.2.1 THEOREM If norms ||-|| and ||-||' on a linear space X both make ycXa closed subspace, and if the topology of ||-|| is stronger on Y than that of ||• ||', then for each x1 G X the same is true for the subspace Y-\-Kx1. Proof: By Theorem 1.7.1 we are assuming that there is k > 0 for which ||t/||' < k\\y\\ for each y eY (6.2.1.1) Then if 5 G K we have lit/ + sXl\\' < \\y\\' + MUsJ' < fc||y|| + H(||sJ7II*iII)II*iII <max(A;,||x1||7||x1||)(||t/|| + |5|||x1||) and, provided xx £ Y = cl(Y) = cl'(Y) we have by (1.5.3.1) M > 0 for which ||t/|| + H||x1||<M||t/ + 5x1|| Thus, if y G Y and 5 G K \\y + Mill' < k'\\y + sxj with *' = Mmax(fc, ||*i||7ll*ill) (6.2.1.2) Of course if xx G Y then (6.2.1.2) reduces to (6.2.1.1). We must see that the larger subspace remains closed: thus suppose Y is ||»||-closed and that l|y» + sn*i - A\ —♦ 0 with x G X,yn G y,and sn G K (6.2.1.3) Then by (1.5.3.1), provided xx & Y, \\yn-ym\\ + K-sm\\\xi\\ <M\\yn+snxi-ym-smxiII —^° as m > n —> oo (6.2.1.4)
6.2 Finite Dimensional Spaces 179 Thus (t/n) and (sn) are Cauchy in Y and K, respectively. Since the scalar field K is complete, it follows that (sn) converges, to s G K say: but this forces (t/n) also to converge, with limit y G X. However, since Y is closed in X we must have t/G7, and hence x = y - sx1 G Y + Kxx (6.2.1.5) Thus, if Y is closed in the ||-||-topology then so is Y + Kx1? and similarly for the ||• ||'-topology. ■ The reader might like to verify that also Y complete => Y + Kx1 complete (6.2.1.6) The uniqueness of norm topology for finite dimensional spaces is now clear: 6.2.2 THEOREM If X is finite dimensional, then there exists a norm ||*|| : X —*> [0, oo]. All norms define the same topology on X, and make it complete. Proof: If X = 0 then the result is clear, and rather trivial. If X is of dimension ?iGl\l, with a basis e = (e1,e2,... ,en), then we obtain a norm on X by setting ||51e1 + 52e2 + --- + 5nen|| = maxflsj, |s2|,- • •, \sn\) for each 5 G Kn (6.2.2.1) The reader can readily supply alternatives. In this norm the completeness of X follows from Theorem 4.3.1. To prove that the topology is unique we proceed by induction. If n = 1 then all possible norms are positive scalar multiples of one another, making X isomorphic to K, and of course complete. If the result is known for n = k, and if X is of dimension k + 1, then we can write X = Y + Kxx with yciof dimension k (6.2.2.2) If ||• || and ||-||' are norms on X then by the inductive hypothesis they induce the same topology on Y, which will necessarily be closed in X since (4.2.1.3) it is complete. Now Theorem 6.2.1 (twice) says that ||-|| and ||-||' give the same topology on X. m If X is a finite dimensional normed space, then all its linear subspaces are closed, and every linear mapping T : X —*Y into a normed space Y is bounded:
180 6. Finite Dimensional Spaces and Compactness 6.2.3 THEOREM If T : X -> Y is a linear mapping between normed spaces then X finite dimensional => T bounded (6.2.3.1) X finite dimensional and T one-one => T closed (6.2.3.2) Y finite dimensional and T dense => T open (6.2.3.3) Proof: If ||• || is a norm on X and T : X —► Y is linear, into a normed space Y, then we obtain another norm ||»||' on X by setting ||x||' = ||x|| + ||Tx|| for each x G X (6.2.3.4) and evidently the induced operator T : X' —> Y is bounded, with \\T\\' < 1, where X' is the space obtained by imposing the norm ||-||' on the space X. If X is finite dimensional, then by Theorem 6.2.2 the new norm ||»||' gives the same topology as the norm ||»||. This proves (6.2.3.1). Towards (6.2.3.2) consider the operator S : TX -► X/r_1(0) defined by S(Tx) = x + T~1{0) e X/T^O for each x e X (6.2.3.5) which is well defined and linear without restriction on T. If T is one-one then S goes into X, and if X is finite dimensional then also TX C Y is finite dimensional, so that by (6.2.3.1) the operator S is bounded, and hence the operator x —-► Tx : X —-► T{X) (6.2.3.6) is left invertible, and hence closed in the sense of (3.3.1.2). An application of (3.3.2.2) gives (6.2.3.2). If instead T is onto, then S goes from Y to X/T~10, and if Y is finite dimensional then S is bounded, making the operator x + r_10 —> Tx : X/r_10 —> Y (6.2.3.7) right invertible, hence open; now (6.2.3.3) follows from (3.4.2.1). Of course if Y is finite dimensional and T is dense then T is onto. ■ The implication (6.2.3.3) is slightly clearer when T is assumed to be bounded, and hence T-1(0) is closed in X. When T-1(0) is not closed then X/r_1(0) is only a seminormed space. The continuity of the operator S is still valid, since the argument for (6.2.3.1) extends to the case in which the space Y carries only a seminoma. In the notation of (3.2.1.4), (3.2.1.5), (3.3.1.4), and (3.4.1.3), we have shown X finite dimensional => rleft BL(X, Y) = 7rleft BL(X, Y) (6.2.3.8)
6.2 Finite Dimensional Spaces 181 and Y finite dimensional => rright BL(X, Y) = 7rright BL(X, Y) (6.2.3.9) It should now be clear why these ideas of "boundedness" and "continuity" are missing from matrix theory. They happen automatically, and all the "spatial" nonsingularities reduce either to one-oneness or ontoness. One slightly unexpected consequence is that the set of bases for an ra-dimensional space X is open in Xn: 6.2.4 THEOREM If X is a normed linear space and n G N then {x G Xn:x is linearly independent} is open in Xn (6.2.4.1) and {x G Xn: x is linearly generating} is open in Xn (6.2.4.2) Proof: Recall the topological isomorphism x <—► row(Lx) : Xn ~ BL(Kn, X) = L(Kn, X) (6.2.4.3) given by (6.1.0.6): then by (6.1.0.7) and (6.2.3.2) x linearly independent <$=^ row(Lx) bounded below (6.2.4.4) while by (6.1.0.8) and (6.2.3.3) x linearly generating <$=^ row(Lx) almost open (6.2.4.5) Now (6.2.4.1) and (6.2.4.2) follow from (3.3.3.1) and (3.4.3.1), respectively. ■ Finite dimensional spaces are isomorphic if and only if they have equal dimensions: 6.2.5 THEOREM If X and Y are finite dimensional spaces then dim(X) < dim(Y) (6.2.5.1) if and only if there are T G BL(X, Y) and S G BL(Y,X) for which ST = I GBL(X,X) (6.2.5.2) There is implication X - Y <^ dim(X) = dim(Y) (6.2.5.3)
182 6. Finite Dimensional Spaces and Compactness Proof: If x G Xn and y €Ym are bases, with n < m, define S and T by setting (n \ n I m \ n E «y*y = E W and 5 E W = E *y*y (6-2-5-4) y=i y y=i ^y=i y y=i for each 5 G Km: then S and T are both bounded, by (6.2.3.1), and satisfy (6.2.5.2), while n = m=^TS = Ie BL(Y, Y) (6.2.5.5) This proves that (6.2.5.1) => (6.2.5.2). Conversely, if (6.2.5.2) holds and x G Xn is linearly independent then also Tx G Yn is linearly independent, giving (6.2.5.1) by (6.1.1.1) from Theorem 6.1.1. ■ Linear operators between finite dimensional spaces are always regular: 6.2.6 THEOREM If X and Y are finite dimensional and T G BL(X, Y) then T is regular, and dim(X) = dim(y) <$=^ T decomposably regular (6.2.6.1) Proof: If W C X is a subspace and w G Xp is a basis for W then by (6.1.1.2) there is a basis x G Xn for which w^ = x^{j = 1,2,... ,p): now the mapping n p E : ^2 sjxj —> E 5i™i G X for each 5 G Kn (6.2.6.2) j=i 3=1 is a projection with range W, and of course bounded by (6.2.3.1). Thus, for arbitrary T G BL(X, Y) both the range and the null space are complemented, while the core is invertible (Theorem 6.2.3). This gives the first part. For (6.2.6.1) observe dim(X) = dimr_10 + dimX/r_10 and (6.2.6.3) dim(Y) = dimTX+ dimY/TX and dimX/T^O = dim TY (6.2.6.4) Now (6.2.6.1) follows from (3.8.6.1) and (6.2.5.3). ■
6.3 Operators of Finite Rank 183 The first part of Theorem 6.2.6 gives an improvement of (6.2.3.8) and (6.2.3.9): aleft BL(X, Y) = 7rleft BL(X, Y) (6.2.6.5) and aright BL(X, Y) = 7rright BL(X, Y) (6.2.6.6) 6.3 OPERATORS OF FINITE RANK Perhaps, if we are trying to generalize matrix theory, we should look at "finite dimensional" operators: 6.3.1 DEFINITION A linear operator from X to Y is said to be of finite rank if T(X) is finite dimensional (6.3.1.1) If X and Y are normed spaces write KL0(X,Y) = {Te BL(X,Y):T{X) is finite dimensional} (6.3.1.2) for the bounded finite rank operators. For example the "rank one" operators / © y of (2.9.2.4) are in KL0(X,Y): 6.3.2 THEOREM If X and Y are normed spaces and T G KL0(X, Y) then there are n G N, y G Yn and / G (X*)n for which n Proof: If T = 0 take n = 1, yx = 0 G Y and fx = 0 G Xf. If T ^ 0 take t/ G Yn to be a basis for T(X), so that for arbitrary z G T(X) we can write n * = £*yO%y (6.3.2.2) Evidently the coefficients g®{z) are uniquely determined by z [this is (6.1.0.3) and (6.1.0.4)], and the resulting mappings g^ : T(X) —*■ K necessarily linear. By (6.2.3.1) each g? is also bounded: now if we write, for each iG{l,2,...,7i},
184 6. Finite Dimensional Spaces and Compactness fj (x) = g°{Tx) for each x G X (6.3.2.3) then /y G X1" for each j G {1,2,... n}, and (6.3.2.1) is satisfied. ■ The bounded finite rank operators form a "two-sided ideal" of BL(X, Y) in the sense of (3.9.6.7): 6.3.3 THEOREM If X, Y, and Z are normed spaces then KL0(X,Y) +KL(X,Y) C KL0(X,Y) BL(Y,Z)KL0(X,Y) C KL0(X,Z) and KL0(X,Y)BL(Z,X) C KL0(Z,Y) Proof: If T and T' are both of finite rank then (r + r')(x) cr(i)+r'(i) is necessarily of finite dimension, giving (6.3.3.1). If T = ^?= KL0(X,Y)then n ST = Yl fj © Sy3- G KL0(X, Z) for each S G BL(Y, Z) 3=1 and n TS = Y,fjS® Vj e KLo(^> Y) f°r each 5 e BL(Z, X) giving (6.3.3.2) and (6.3.3.3). ■ The finite rank operators in BL(X, Y) are regular in the sense of Definition 3.8.1, with finite rank generalized inverses: 6.3.4 THEOREM If T G KL0(X, Y) then T is regular, with a regular generalized inverse, and there is inclusion rBL1(X, Y) + KL0(X, Y) C rBL1(X, Y) (6.3.4.1) Proo/: If T = 0 then we can take T' = 0 to satisfy (3.8.1.1). If T ^ 0 take y G Yn to be a basis for T(X) and form again the coefficient functional g*j : T(X) —► K as in (6.3.2.2), which can now be regarded as bounded linear functionals on T(X). By the Hahn-Banach Theorem 5.3.1 there is g G (Y*)n for which each g^ is an extension of #°, and then / G (X*)n (6.3.3.1) (6.3.3.2) (6.3.3.3) (6.3.3.4) i fj © Vj € (6.3.3.5) (6.3.3.6)
6.3 Operators of Finite Rank 185 given by /y = g?T = g3-T(j = l,2,...,n) for which (6.3.2.1) holds. We claim that / G (Xf)n is linearly independent (6.3.4.2) for otherwise the range of T would be of smaller dimension than the space generated by y G Yn. Now by Theorem 6.1.2 there is x G Xn for which (6.1.2.1) holds, and we take n T' = Y, 9j © xj € BL(Y, X) (6.3.4.3) y=i to satisfy (3.8.1.1). This proves the first part of Theorem 6.3.4, so that KL0(X, Y) C rBL1(X, Y) (6.3.4.4) now (6.3.4.1) follows from (3.9.6.9) and Theorem 6.3.3. ■ In particular, finite dimensional subspaces of normed spaces are complemented: 6.3.5 THEOREM If Y and Z are closed subspaces of a normed space X then Y finite dimensional, Z complemented => Y + Z complemented (6.3.5.1) and Y finite codimensional, Z complemented => Y n Z complemented (6.3.5.2) Proof: Towards (6.3.5.1) suppose first that Z = {0}: then if T = J : Y —► X is the natural injection we have T G KL0(Y,X), regular by Theorem 6.3.4, so that, in particular Y = T(X) is complemented. Now for (6.3.5.1) suppose P = P2 G BL(X,X) with Z = P(X), and put Y' = {I-P)Y. Evidently Y'CY' + Z = Y + Z (6.3.5.3) By what we have just proved, the finite dimensional space Y' is complemented in Y' + Z. Now, if Q = Q2 :Y + Z->Y + Zisa continuous projection with range Y' then P + Q{I - P) : X —+ X (6.3.5.4) is a continuous projection on X whose range is Y + Z.
186 6. Finite Dimensional Spaces and Compactness Towards (6.3.5.2), suppose first that Z = X: then if T = K : X -> X/Y is the quotient we have T G KL0(X, X/Y), regular by Theorem 6.3.4, so that in particular Y = T_1(0) is complemented. Now for (6.3.5.2), suppose P = P2 G BL(X,X) with Z = P{X), and observe that Y n Z has finite codimension in Z. By what we have just proved there is a continuous projection Q = Q2 : Z —► Z with range Y n Z: but now QP :X —>X (6.3.5.5) is a continuous projection on X whose range isYdZ. m If T = ]£y=i /y © t/y G BL(X, Y) is of finite rank, with generalized inverse T' = £*=1 </y © Xj G BL(Y, X) as in Theorem 6.3.4, then n Q = ^2gjeyjeBL(Y,Y) (6.3.5.6) and n P = ^ /y © xy G BL(X,X) (6.3.5.7) y=i are continuous projections with the same range and null space as T, respectively. We conclude by showing that a finite-rank operator is bounded if and only if its null space is closed: 6.3.6 THEOREM If X and Y are normed spaces and T G L(X, Y) is of finite rank, then T G KL0(X, Y) <£=> T"1 (0) closed (6.3.6.1) Proof: Forward implication is clear. If T ^ 0 is of finite rank, and t/G7n is a basis for T(X) C Y, then we recover the linear functional g® on T{X) of (6.3.2.2), and the linear functional /y on X of (6.3.2.3): now we claim T-1 (0) closed => Fy bounded for each j" G {1,2,..., n} (6.3.6.2) Indeed, for each j G {1,2,..., n} write Wj = r_1(0) + ^ Kxt- (6.3.6.3) where x G Xn is chosen so that Txy = t/y for each j G {1,2,..., n} (6.3.6.4)
6.4 Fredholm Operators 187 Then by Theorem 6.2.1 each space Wj is closed if T 1(0) is closed. We claim W,. = /^(O) for each j G {1,2,... ,n} (6.3.6.5) Certainly, T_1(0) C ff1^), and also x{ G fj~1{0) if i i1 j, giving inclusion. Conversely, if x G X is in /~* (0) then This proves (6.3.6.5). It now follows from Theorem 4.8.3 that each /• is bounded, and hence also (6.3.2.1) T = Y%=i fj ® 2/y- ■ 6.4 FREDHOLM OPERATORS The "Fredholm operators" are in a sense at the opposite extreme from the operators of finite rank: 6.4.1 DEFINITION If X and Y are normed spaces then T G BL{X,Y) is essentially one-one iff T_1(0) is finite dimensional (6.4.1.1) is essentially dense iff Y/c\(TX) is finite dimensional (6.4.1.2) and is Fredholm, or spatially Fredholm, iff T is essentially one-one, essentially dense, and proper (6.4.1.3) We shall write tt^ BL(X, Y)={T€ BL(X, Y): T is not essentially one-one} (6.4.1.4) and 7r^ht BL(X, Y) = {Te BL(X, Y):T is not essentially dense} (6.4.1.5) and <7essBL(X,Y) ={Te BL(X,Y):T is not Fredholm} (6.4.1.6) The reader will recall that an alternative description for the regular operators of Definition 3.8.1 was "relatively Fredholm":
188 6. Finite Dimensional Spaces and Compactness 6.4.2 THEOREM Necessary and sufficient for T G BL(X, Y) to be Fred- holm is that it be regular, not a left zero divisor modulo KL0(X, X) and not a right zero divisor modulo KL0 (Y, Y). Proof: We begin with an "essential" analogue of Theorem 3.2.2. We claim T essentially one-one <£=*> L^}{0) C KL0(W,X) for all W and LT = BL{W,T) (6.4.2.1) and T essentially dense <£=> R^(0) C KL0(Y,W) for all W and RT = BL(T,W) (6.4.2.2) Certainly if r_1(0) is finite dimensional and U G BL(W,X) then TU = 0 => U{W) C r_1(0) finite dimensional => 17 G KL0(W,X) (6.4.2.3) while if instead Y/cl(TX) is finite dimensional and V G BL(Y, W) then VT = 0 => ^-1(0) D cl(TX) w " v ; (6.4.2.4) => dim^(y) = dimW/V^O < dimW/cl(TX) < oo Conversely, if T_1(0) is not finite dimensional take W = T_1(0) and U = J : T_1(0) —► X the natural injection, to find U G L^x(0) which is not of finite rank, while if cl(TX) is not of finite codimension take W = Y j cl(TX) and V = K : Y -> Y/c\(TX) the quotient map, to find V G R^{0) which is not of finite rank. This proves (6.4.2.1) and (6.4.2.2). Now, if T is Fredholm, then (6.4.2.1) with W = X says that T is not a left zero divisor modKL0(X,X), and (6.4.2.2) with W = Y says that T is not a right zero divisor modKL0(Y,Y). At the same time, by Theorem 6.3.5, both T_1(0) and cl(TX) are complemented, so that if T is also proper then the condition (3.8.2.1) holds, making T regular. Conversely, if T is regular, with T = TT'T, then r_1(0) and cl(TX) are both complemented, with projections P = T'T and Q = TT' satisfying (3.8.2.3). If at the same time T is not a left zero divisor then J — P G KL0(X,X), giving (6.4.1.1), and if T is not a right zero divisor then I — Q € KL0(Y, Y), giving (6.4.1.2); also if T is regular then it must be proper. ■ Atkinson's theorem says that Fredholm operators are "essentially in- vertible":
6.4 Fredholm Operators 189 6.4.3 THEOREM If X and Y are normed spaces then T G BL(X, Y) is Fredholm iff T is left invertible modulo KL0(X,X) (6.4.3.1) and T is right invertible modulo KL0(Y, Y) (6.4.3.2) Proof: If T is Fredholm then by Theorem 6.4.2 it is regular, with a generalized inverse T' G BL(Y,X) satisfying (3.8.1.1), and since T is essentially one-one we have I-T'T G Z^1 (°) ^ KLo(*>*) (6.4.3.3) giving (6.4.3.1), and since T is essentially dense we have I + TT' G ^(O) ^ KL0(Y,Y) (6.4.3.4) giving (6.4.3.2). Conversely, if either (6.4.3.1) or (6.4.3.2) hold then T - TT'T G KL0(X, Y) C rBLI(X, Y) (6.4.3.5) using Theorem 6.3.4, so that T is regular by (3.8.3.1). At the same time (6.4.3.1) and the implication (3.9.4.8) say that T is not a left zero divisor: TU = 0 => U = (I - T'T)U G KL0(X,X) (6.4.3.6) while (6.4.3.2) says that T is not a right zero divisor: VT = 0=^V =V{I- TT') G KL0(Y,Y) ■ (6.4.3.7) The product of Fredholm operators is Fredholm: 6.4.4 THEOREM If X, Y and Z are normed spaces and T G BL(X, Y), S eBL{Y,Z) then 5, T Fredholm => ST Fredholm =^ S,T regular (6.4.4.1) and ST Fredholm => (S Fredholm <£=> T Fredholm) (6.4.4.2) If T is regular with generalized inverse T' G BL(Y, X), then T Fredholm => T' Fredholm (6.4.4.3) If also XGBL(X,Y), then T Fredholm, J + T'K Fredholm => T + K Fredholm (6.4.4.4)
190 6. Finite Dimensional Spaces and Compactness and T Fredholm, J + KT' Fredholm => T + K Fredholm (6.4.4.5) In particular, T Fredholm, K finite rank =» T + K Fredholm (6.4.4.6) Proof: The first part of (6.4.4.1) follows from Theorem 3.9.3 together with Atkinson's theorem, since the ideals A = KL0{X,X) B = KL0{Y,Y) and D = KL0{Z,Z) (6.4.4.7) satisfy the condition (3.9.3.3), enabling us to apply (3.9.3.4) and (3.9.3.5). For the second part of (6.4.4.1) recall the argument of (6.4.3.5): if U is a generalized inverse for the Fredholm operator ST, then T - T{US)T e KL0(X, Y) C rBL1(X, Y) and (6.4.4.8) S - S(TU)S e KL0{Y,Z) C rBL1(Y,Z) Now apply (3.8.3.1). For (6.4.4.2) we use Atkinson's theorem again . If U is a generalized inverse for ST, then J - UST e KL0(X,X) and J - STU e KL0{Z,Z) (6.4.4.9) so that T Fredholm <=> I - TUS e KL0(Y, Y) <£=> S Fredholm (6.4.4.10) For (6.4.4.3) we recall from (6.4.3.3) and (6.4.3.4) that J - T'T and j _ rprp/ are Q£ flnjte rank? and use Atkinson's theorem to deduce that T' is Fredholm. Towards (6.4.4.4) and (6.4.4.5), we begin with (6.4.4.6). If K is of finite rank then I-T'{T + K) = {I- T'T) - T'K e KL0(X, X) (6.4.4.11) and I - [T + K)T' = {I- TT') - KT1 e KL0(Y,Y) (6.4.4.12) so that T + K is Fredholm by Atkinson's theorem. Now for (6.4.4.4) we argue T'{T + K) = (I + T'K) -(I- T'T) e {I + T'K) + KL0(X, X) (6.4.4.13)
6.4 Fredholm Operators 191 using (6.4.3.3). Thus, by (6.4.4.6) the product T'{T + K) is Fredholm, and then (6.4.4.2) and (6.4.4.3) together make T + K Fredholm. The argument for (6.4.4.5) is the same. ■ If we specialize the perturbation slightly, we deduce slightly more: 6.4.5 THEOREM If T G BL(X, Y) is Fredholm, with generalized inverse T' G BL(y,X), and if K G BL(X, Y) satisfies J + T'K G BL_1(X,X) invertible (6.4.5.1) then &m{T + K)-l{ti) < dimT-^O) (6.4.5.2) and dim Y/cl(T + K)X < dimY/cl(TX) (6.4.5.3) Proof: Towards (6.4.5.1), suppose J + T'K one-one (6.4.5.4) Then by (6.4.4.13) we have (T + X)_1(0) n (/ - T'T)-1^) C (/ + T'K)-1^) = {0} (6.4.5.5) so that (T + if)-:l(0) is contained in some complementary subspace for the finite dimensional space (J — T'T)-1^), giving dim(T + K)-\0) < dimX/(I - TT)"1^) (6.4.5.6) = dim^T)-1^) = dimr-^O) This proves (6.4.5.2). Towards (6.4.5.3) recall from (3.1.3.1) that if (6.4.5.1) holds then also i" + KT' G BL_1 {Y,Y) is invertible. If we merely assume J + KT' onto (6.4.5.7) then by the analogue of (6.4.4.13) we have (T + K)X + {I - TT')X D {I + KT')X = X (6.4.5.8) giving dimY/{T+K)X < dim(/-rr')X = dimY/TT'X = dimY/TX (6.4.5.9) Since of course both TX and (T + K)X are closed, we have proved (6.4.5.3). ■
192 6. Finite Dimensional Spaces and Compactness 6.5 WEYL OPERATORS AND THE INDEX The difference between the two finite dimensions which characterize Fred- holm operators turns out to play a significant role: 6.5.1 DEFINITION If T G BL(X,Y) is a Fredholm operator between normed spaces, then index(T) = dimr_1(0) - dimY/cl(TX) (6.5.1.1) We shall call T G BL(X, Y) a Weyl operator iff T is Fredholm and index(T) = 0 (6.5.1.2) We shall write ue3SBL(X,Y) ={Te BL(X,Y) : T is not Weyl} (6.5.1.3) We offer two characterizations of Weyl operators: 6.5.2 THEOREM If T G BL(X, Y) then the following are equivalent: T is a Weyl operator (6.5.2.1) T is Fredholm and decomposably regular (6.5.2.2) T G BL_1(X,Y) +KL0(X,Y) (6.5.2.3) Proof: If T G BL(X,Y) is Fredholm then by (6.2.5.3) the condition (6.5.1.2) is equivalent to isomorphism T"1 (0) ~ Y/ c\(TX) (6.5.2.4) which for regular T is the decomposable regularity condition (3.8.6.1). Thus the conditions (6.5.2.1) and (6.5.2.2) are equivalent. If (6.5.2.2) holds then T has an invertible generalized inverse T' G BL(Y, X), which also satisfies (6.4.3.3). Thus, T = (T')"1 - (T')"1^ - T'T) G BL_1(X, Y) + KL0(X, Y) (6.5.2.5) giving (6.5.2.3). Conversely, if (6.5.2.3) holds then we can write T = S(I + K) with S G BL_1(X, Y) and K G KL0(X,X) (6.5.2.6) and to deduce (6.5.2.1) we will show I + Ke BL(X,X) is Weyl (6.5.2.7)
6.5 Weyl Operators and the Index 193 To see this we must decompose the space X into four subspaces, taking X0 = K'1 (0) n c\{KX) = K'1 (0) n KX (6.5.2.8) and then choosing closed subspaces Xx, X2 and X3 in such a way that X0 + Xx = K{X) and X0nX1 = {0} (6.5.2.9) and X0 + X2 = K~10 and X0 n X2 = {0} (6.5.2.10) while (KX+K~10) + X3 = X and (XI + X^O) nX3 = {0} (6.5.2.11) The existence of X1? X2 and X3, chosen in such a way that the induced projections are bounded, follows from (6.3.5.1); in fact Theorem 6.3.6 shows that the induced projections must always be bounded. Since each of X0, Xx and X3 are finite dimensional it is clear that also X2 = X0 + Xx + X3 is finite dimensional If x E X is arbitrary we claim {I + K)xeX'2^>xeX'2 This is clear from the matrix representation (6.5.2.12) (6.5.2:i3) I + K- ^01 0 K( 03 0 I + Kn 0 K13 0 0/0 0 0 0/ rx°i x2 Lx3J — rx°i *2 Lx3J (6.5.2.14) In particular there is inclusion (/ + K)X2 C X2, and hence / + K induces a linear operator—necessarily bounded— U:X2 X' (6.5.2.15) To verify (6.5.2.7) we show dim^+X)"^ = dimtf^O = dimX'2/UX'2 = dimX/{I+K)X (6.5.2.16) The first equality follows from (6.5.2.13), which implies (/ + K)-^ = U^O C X2 (6.5.2.17)
194 6. Finite Dimensional Spaces and Compactness The second equality in (6.5.2.16) is (6.2.6.1), and finally, for the third, we have x = (i + k)x e w = {i + K)x2 e (i + k)x2 e w with X2 = UK'2 0 W ■ (6.5.2.18) The product of Weyl operators is Weyl: 6.5.3 THEOREM If X, Y and Z are normed spaces and T G BL(X, Y), SGBL(Y,Z),then 5,T Weyl => ST Weyl => (5 Weyl <£=> T Weyl ) (6.5.3.1) If T is regular with generalized inverse T' G BL(Y,X), then T Weyl => T' Weyl (6.5.3.2) If also XGBL(X,y), then T Weyl, / + T'X Weyl => T + X Weyl (6.5.3.3) T Weyl, / + XT' Weyl => T + K Weyl (6.5.3.4) and in particular T Weyl, X finite rank => T + X Weyl (6.5.3.5) Proof: Most of this follows from the characterization (6.5.2.3). If T = U + K and 5 = V + if with invertible U and V and finite rank K and H then ST = (^17) + [VK + HU + HK) G BL_1 (X, Z) + KL0(X, Z) (6.5.3.6) giving the first implication of (6.5.3.1), while if instead T = U + K and ST = V + H then 5 = (VU-^ + iH-SK^-1 GBL_1(y,Z) + KL0(y,Z) (6.5.3.7) which proves half of the second implication. The other half is similar. For (6.5.3.2) suppose T = U + K with invertible U and finite rank K; then T + TT'T => T' - 17"1 G KL0(Y, X) (6.5.3.8) The implication (6.5.3.5) is immediate from (6.5.2.3), and now the argument for (6.5.3.3) and (6.5.3.4) is similar to that for (6.4.4.4) and (6.4.4.5). ■
6.5 Weyl Operators and the Index 195 The most surprising thing about the index is that it is a homomor- phism: 6.5.4 THEOREM If X, Y, and Z are normed spaces and T G BL(X, Y), S e BL{Y,Z) are Fredholm, then index(ST) = index(S) + index(T) (6.5.4.1) Proof: To see this we decompose the space Y into four subspaces, taking Y0 = (TX) n 5_10 = cl(TX) n 5_10 (6.5.4.2) and then choosing closed subspaces Y1? Y2 an(^ ^3 so that Y0 + Y1= TX and Y0nY1= {0} (6.5.4.3) and Y0 + Y2 = S~10 and Y0 n Y2 = {0} (6.5.4.4) while (TX + 5-10)+Y3 = Y and (TX + 5_10) n Y3 = {0} (6.5.4.5) The existence of Y1? Y2, and Y3, chosen in such a way that the induced projections are continuous, follows from Theorem 6.3.5. We now claim dim(5r)_10 = dimr_10 + dim Y0 (6.5.4.6) since the operator TA = core(T) : X/T~10 —► TX induces isomorphism {ST^O/T^O -Tin S-x0 = Y0. Also dim Z/STX = dim Z/SY + dim Y3 (6.5.4.7) since the operator SA = core(S) : Y/S~10 —► SY induces isomorphism Y/{TX-\- 5_10) ^ SY/STX. At the same time it is clear from (6.5.4.3) and (6.5.4.4) that dimY0 + dimY2 = dim5_10 (6.5.4.8) and dim Y2 + dim Y3 = dim Y/TX (6.5.4.9) If we now combine (6.5.4.J) for j = 6,7,8,9 we get (6.5.4.1). ■
196 6. Finite Dimensional Spaces and Compactness The proof of Theorem 6.5.4 incorporates an alternative proof for the first part of (6.4.4.1). If S and T are both Fredholm, then (6.5.4.6) and (6.5.4.7) tell us that (5r)_1(0) is finite dimensional and that c\(ST)X D ST(X) is of finite codimension. It remains to see that ST is also proper. In fact, we can show that it is regular. If we write 'T0' 2\ 0 _ 0 :X—► 'Y0~ Y1 Y2 Y3_ and [0 S, 0 S3 Y1 then we can find S[ : Y1 —► Z and T[ : Yx —> X for which S'1S1=Tiri = I:Y1-+Y1 giving ST + S^ = S^T^iS^S^ = ST{T[S[)ST (6.5.4.10) (6.5.4.11) (6.5.4.12) With the help of the index theorem we can add more detail to the perturbation theory of Fredholm operators, and rederive that for Weyl operators: 6.5.5 THEOREM If X and Y are normed spaces and T G BL(X, Y) is Fredholm, with generalized inverse T' G BL(Y,X), then index(T') = -index(T) (6.5.5.1) If also XGBL(X,Y), then I+T'K Fredholm => index(r+iT) = index(T)+index[I+T'K) (6.5.5.2) and I+KT' Fredholm => index(T+iT) = index(r)+index(/+iTr/) (6.5.5.3) In particular K finite rank => index(T + K) = index(T) (6.5.5.4) Proof: For (6.5.5.1) we use (6.5.4.1) and (3.8.1.1) to get index(T) = index(T) + index(T') + index(T) (6.5.5.5) For (6.5.5.4) we argue that if K is of finite rank then T\T + K) = (I + T'K) -{I- T'T) is Weyl (6.5.5.6)
6.6 Compactness and Total Boundedness 197 so that T'(T + K) has index zero, giving index(T + K) = -index(T') = index(T) (6.5.5.7) From (6.5.5.4) we get (6.5.5.2) by arguing index(T + K) = index(J + T'K) - index(T') (6.5.5.8) The argument for (6.5.5.3) is the same. ■ We conclude with an extension of the characterization (6.5.2.3) to more general Fredholm operators: 6.5.6 THEOREM If T G BL(X,Y) is Fredholm then there are S G BL(X, Y) and S' G BL(Y,X) for which T-Se KL0(X,Y) is finite rank (6.5.6.1) with S'S = 1 if index(T) < 0 (6.5.6.2) and SS' = 1 if index(T) > 0 (6.5.6.3) Proof: Since the Fredholm operator T is regular we can find projections P = T'T and Q = TT' satisfying (3.8.2.3), where T' G BL(Y,X) satisfies (3.8.1.1), and by (6.2.5.2) we can find Sv : P_10 -► Q_10 and S'v . q-i0 _► p-i0 for which either S'vSv = J'v or SvS'v = Jv, according as which finite dimensional subspace has the greater dimension. For (6.5.6.2) or (6.5.6.3) we now take S = TP + SV{I-P) and S' = T'Q + S'w {I - Q) ■ (6.5.6.4) When T is Weyl then both (6.5.6.2) and (6.5.6.3) hold, so that S is invertible as in (6.5.2.3), and indeed coincides with the operator S of (3.8.6.6). 6.6 COMPACTNESS AND TOTAL BOUNDEDNESS We recall that a subset K of a topological space Q is compact iff every open cover has a finite sub cover: equivalent ly Ut G Nbd(*) for each t G K => K C |J Ut for some finite X'cn teK1 (6.6.0.1)
198 6. Finite Dimensional Spaces and Compactness If Q = X is a normed space and (6.6.0.1) holds for discs of equal radius then K C X is said to be totally bounded: if e > 0 then K C Disc(if'; e) = (J Disc(x ; e) for some finite K' C X (6.6.0.2) Compactness and total boundedness can be tested with sequences: 6.6.1 DEFINITION if C X is said to be sequentially compact iff every sequence (xn) in K has a subsequence (x^) which converges in K (6.6.1.1) and is said to be sequentially precompact iff every sequence (xn) in K has a Cauchy subsequence (x^) (6.6.1.2) These three notions are related, together with completeness: 6.6.2 THEOREM If X is a normed space and K C X then if totally bounded <£=£> if sequentially precompact and K complete and totally bounded <£=£> K sequentially compact (6.6.2.2) Proof: If K C X is not totally bounded then there must be £ > 0 for which there does not exist finite K' C.K such that K C Disc(if ;£). Then inductively there is a sequence (xn) in K for which n xn+1 G if \ (J Disc(xy ; <5) for each n G N (6.6.2.3) and evidently (xn) has no Cauchy subsequences. Conversely, if K is totally bounded and (xn) is an arbitrary sequence in K, and e1 > 0 is arbitrary, then there must be a subsequence (xjj of (xn) for which {x\:n € N} C Disc(t/1 ; 6-l) for some yx G X. Repeating the argument with (xJJ in place of (xn), inductively we get subsequences (xJJ1) of x for each m G N such that {x^: n G N} C Disc(ym ; em) and (^+1) is a subsequence of (x^1) (6.6.2.4) (6.6.2.1)
6.6 Compactness and Total Boundedness 199 If we do this with em —► 0 then the sequence (xJJ defined by setting x'n = x* for each n G N (6.6.2.5) is a Cauchy subsequence of (xn). This proves (6.6.2.1), and makes the (standard) argument for (6.6.2.2) very easy: trivially, K sequentially precompact and complete => compact => precompact (6.6.2.6) while if (xn) is a Cauchy sequence in a sequentially compact subset K, and therefore has a convergent subsequence, then by (4.1.2.3) the sequence (xn) must itself converge. ■ For the record, 6.6.3 THEOREM If X and Y are normed spaces, with K C X and HCY then K,H totally bounded => K x H totally bounded (6.6.3.1) and K, H sequentially compact => K x H sequentially compact (6.6.3.2) Also if <t>: K —> Y is continuous, then K totally bounded => <£(if) totally bounded (6.6.3.3) and K sequentially compact => <j>{K) sequentially compact (6.6.3.4) If Y = X and H C K then K totally bounded => if totally bounded (6.6.3.5) and K sequentially compact => K closed and cl(if) sequentially compact (6.6.3.6) Proof: This is left to the reader. ■ For subsets of normed spaces, compactness and sequential compactness are the same thing.
200 6. Finite Dimensional Spaces and Compactness 6.6.4 THEOREM If K C X for a normed space X, then K sequentially compact => K compact (6.6.4.1) Proof: We claim K compact => K Bolzano-Weierstrass => K seq. compact (6.6.4.2) and K seq. compact => K Lebesgue compact => K compact (6.6.4.3) Here the "Bolzano-Weierstrass property" says that any infinite subset H C K has accumulation points in X: acc(JJ) = 0 => H finite (6.6.4.4) "Lebesgue compactness" mean total boundedness together with the "Lebesgue covering property": if Ux G Nbd(x) for each x G K there must be 6 > 0 and a mapping <f>: K —> K for which * Disc(x; g) C 170(x) for each xeK (6.6.4.5) To verify the first implication of (6.6.5.2), note that if H C K has no accumulation points then there is {Ux)xeK for which H d Ux C {x} at each point. Clearly any subcover of K will have to use all the [Ux)xGH. For the second implication, observe that if (xn) is a sequence in K with the Bolzano-Weierstrass property then either ace {xn: nGN}^0or there is y G X for which {n G N: xn = y} is infinite. The second implication of (6.6.5.3) is very easy, and can be left to the reader. For the first we will have to argue by contradiction. Indeed, if K does not have the Lebesgue covering property then there is for each neighborhood family {Ux)xGK a sequence (xn) in K for which none of the Disc(xn; l/n) lie in any of the U . If in spite of this K is sequentially compact then there will be z = x^ = limn x^ for some subsequence (^)(x(n)) of (xn). But now Uz must eventually contain A topological space Q is called locally compact if for each t G H and each U G Nbd(i) there is compact V G Nbd(*) with V CU (6.6.4.6) If Q = X is a normed space then by translation invariance X locally compact <£=£> DiscjJC(0 ; 1) sequentially compact (6.6.4.7)
6.6 Compactness and Total Boundedness 201 6.6.5 THEOREM If X is a normed space, then X locally compact <£=£> X finite dimensional (6.6.5.1) Proof: If X is finite dimensional then X ~ Kn for some n G N (6.6.5.2) and if we give Kn the H-jj^ norm then (6.6.3.2) shows that Disc^O ; 1) C Kn is sequentially compact. Conversely, we use the Riesz lemma (Theorem 1.5.2), and part of Theorem 6.2.1 , to show that if X is infinite dimensional then there is a sequence (xn) in X for which / » \ 1 for each n e N, ||xn|| = 1 and dist i -=1, Y^ Kiy > - (6.6.5.3) V i=i J 2 Using (6.6.2.1), it is clear that Discx(0 ; 1) is not totally bounded. ■ The reader will observe that again we have proved more than we have stated. He is invited to reflect on which, if any, normed spaces are actually compact. We conclude here with the famous Tychonoff product theorem, which says that arbitrary products of compact spaces are compact. 6.6.6 THEOREM If (Hy)yGj is a family of compact topological spaces, then JJ Qy is compact (6.6.6.1) J'€J Proof: Suppose J is a set of subsets of Yljej ^y w^h the finite intersection property, so that Q7' ^ 0 for each finite 7' C J: then with the help of Zorn's lemma 1.11.2 we can find a family Q of subsets of ILgj fy> maximal with respect to having the finite intersection property, for which Q D 7. If we suppose that each member set of J is closed in the topology of Yljej Hy, we should not assume that this is true of the sets in Q. For each j G J the family {GfG € 9} = {*3(G):G € 9} (6.6.6.2) of subsets of Qy has the finite intersection property, as does the related family {cl(Gy):G e 9} = {clK-(G)): G € 9} (6.6.6.3) By the compactness of H • it follows that the intersection of each family (6.6.6.3) is nonempty. Thus (with another concealed application of Zorn's
202 6. Finite Dimensional Spaces and Compactness lemma) we can find a point x G Ilyej fy f°r which xj G D {*y (G): G G 5} for each j'eJ (6.6.6.4) The compactness of the product will be established if we can show that this point x lies in the intersection of the sets in J'. Certainly for each j G J and each neighborhood U- of x ■ and each G G Q we have ^yn7ri(G)^0 (6.6.6.5) It follows that for each j G J for each Ge Q, 7rr1(Uj)nG^0 (6.6.6.6) and hence, by the maximality of §, ^Jl{Uj) G Q for each j eJ (6.6.6.7) This is true if each Uj is the projection of a basic neighborhood U of p in nyeJny. Thus, UnG^0 for each G G 7 (6.6.6.8) Since each G G J is closed, we have proved x G nGeyCr ^ &- ■ 6.7 ESSENTIAL ENLARGEMENT If H is a nonempty set and X is a normed space, then the "compactness" or otherwise of a mapping x : Q —► X is classified by its range x(Q) = {xt:*GH}CX: 6.7.1 DEFINITION If Q is a nonempty set and X is a normed space, then the totally bounded mappings from Q to X form the set m(Q,X) = {x G xn:{xt:£ G H} is totally bounded in X} (6.7.1.1) and the relatively compact mappings from Q to X form the set m^HjX) = {x G Xn:cl{xt:£ G H} is compact in X) (6.7.1.2) If x G /^(njX) is bounded then the measure of noncompactness of x is given by p(x) = inf {A; > 0: {xt : t G H} C Disc(X'; Jfe) for some finite K' C X} (6.7.1.3)
6.7 Essential Enlargement 203 The totally bounded and the relatively compact mappings are linear subspaces of ^(H, X), and the measure of noncompactness is a seminoma: 6.7.2 THEOREM If Q is a nonempty set and X is a normed space, then m(Q,X) and m1(Q,X) are linear subspaces of 1^(0,X) for which m^n.X) Cm(Q,X) =clm1(Q,X) Cl^faX) (6.7.2.1) with dist(x,m1(Q,X)) = dist(x,m(Q,X)) = p(x) for each x G l^V^X) (6.7.2.2) Proof: The inclusions (6.7.2.1) are clear, and to see that linear combinations of relatively compact mappings are relatively compact, use (6.6.3.2) and (6.6.3.4), while for totally bounded mappings use (6.6.3.1) and (6.6.3.3). To see that m(Q,X) is closed in l^fi^X) suppose that ||xn — x^ —► 0, with xn G m(Q,X) and x G Z^,X): then for each S > 0 and e > 0 there is N G N and finite K' C K for which n > N => ||xn - xH^ < 8 and x^(Q) C Disc(X'; e) (6.7.2.3) But now x(Q) C Disc(if' ; 6 + s) (6.7.2.4) and since 6 and e were both arbitrary it follows x G m(Q,X). To see that the closure of m1(Q,X) is m(Q,X) it will be sufficient to prove (6.7.2.2). Toward this suppose 6 > dist(x,m(Q,X)) is arbitrary. There then is y G m(Q,X) for which ||x H^ < 6, and for each e > 0 there is finite K' C X for which t/(Q) C Disc(if; e), giving x(Q) C Disc{K',6 + s). By choice of 6 and e it follows p(x) < dist(x,m(Q,X)) (6.7.2.5) Conversely, if £ > p(x) is arbitrary then there is finite K' C X for which x(Q) C Disc (if'; 6), and hence for each ?iGN there is yn G K' for which ||xn — t/n|| < £: now we have y G m1(Q,X) and ||x — t/H^ < S. By choice of £ it follows dist(x,m1 (Q,X)) < p(x) ■ (6.7.2.6) The reader may like to see that the argument of Theorem 5.8.2 identifies the dual space of m(Q,X): under the correspondence (5.8.1.12) m(n,X)f ^ M^Q,^) (6.7.2.7) When n = N we write m(X) for m(Q,X), and m^X) for m1(Q,X).
204 6. Finite Dimensional Spaces and Compactness 6.7.3 THEOREM If X is a normed space, then m(X) = {iG /^(X): every subsequence of x has a Cauchy subsequence} (6.7.3.1) and mAX) ={iG I (X): every subsequence of x (6.7.3.2) has a convergent subsequence} Hence also c{X) C m(X) and cx(X) C mx(X) (6.7.3.3) If T G BL(X, Y) is a bounded linear operator between normed spaces, then TNm(X) C m{Y) and rV^X) C m^Y) (6.7.3.4) Proo/: Equality (6.7.3.1) is given by (6.6.2.1) for K = {xn:n G N}, and equality (6.7.3.2) by (6.6.2.2) for K = cl{xn:n G N}. The first inclusion of (6.7.3.4) follows from (6.6.3.3), and the second from (6.6.3.4). ■ More generally, Theorem 6.7.3 is valid for a nonempty set Q with the homology of finite subsets. When Q = N we can use m{X) in place of c0(X) in the definition of an enlargement. 6.7.4 DEFINITION If X is a normed space, then the essential enlargement of X is P(X)=UX)/m(X) (6-7.4.1) and the semi-essential enlargement of X is P1(X)=l00(X)/m1(X) (6.7.4.2) If T G BL(X, Y) is a bounded operator, then the essential and semi-essential enlargements of T are the operators P(T) : P(X) —+ P(Y) and PX(T) : PX(X) —+ P^Y) (6.7.4.3) induced by T: for each iG/^ (X) P(T)(x + m(X)) =TNx + m(Y) (6.7.4.4)
6.7 Essential Enlargement 205 Unless it coincides with ~P(X), the space PX(X) carries a seminorm which is not a norm. It would have been nice to express the norm ||P(T)|| in terms of the quotient norm ||r||e3S = dist(r,KL(x,y)) (6.7.4.5) 6.7.5 THEOREM IfreBL(X,Y),then l|P(T)ll<P1i;»<2||P(T)|| (6.7.5.1) where ||r||,ess = p{rx:||x||<l} (6.7.5.2) Proof: It is clear that ||P(r)|| = sup p(TNx) < sup j>(TK) (6.7.5.3) p(x)<l pffl<l and also, by taking K = {x G X : \\x\\ < 1}, that \\T\\'ess< sup j>(TK) (6.7.5.4) P(K)<1 We claim that \T\\'e33= sup p(«0<2||P(T)|| (6.7.5.5) pW<i Half of the first equality in (6.7.5.5) is given by (6.7.5.4): conversely if j>(K) < 1 and e > 0 and 6 > 0 then there are finite K'e C X and HS CY for which K C Disc(i^; 1 + e) and {Tx: \\x\\ < 1} C Disc(if<5; 6) (6.7.5.6) It follows that for each x G K'€ there is inclusion T Disc(x; 1+e) C Disc(if5 + Tx;^(l + e)) and hence T(K) CTDisc{K'e;l + s)cmsc{H6 + TK'c;6{l + e)) (6.7.5.7) The finiteness of the set HS + TK'e and the choice of 8 and e finishes the proof of equality in (6.7.5.5). For the second inequality suppose that 0 < S < p(K) < 00 and find x G l^X) for which {xn: n G N} C K and p(x) > \d (6.7.5.8)
206 6. Finite Dimensional Spaces and Compactness Simply choose xx G K arbitrarily and then arrange n xn+1 e K \ (J Disc(xy; 6) for each n G N ■ (6.7.5.9) 3 = 1 6.8 COMPACT OPERATORS A more comprehensive definition of "finite dimensional" operators exploits compactness or total boundedness. 6.8.1 DEFINITION A linear operator T e L{X, Y) is said to be compact iff cl {Tx: \\x\\ < 1} is compact in Y (6.8.1.1) and is said to be totally bounded iff {Tx: \\x\\ < 1} is totally bounded (6.8.1.2) We write KL^Y) = {Te L{X,Y):T is compact} (6.8.1.3) and KL(X,y) = {Te L(X,Y):T is totally bounded} (6.8.1.4) The compact and the totally bounded operators form "two-sided ideals" of BL(X,y) in the sense of (3.8.3.12): 6.8.2 THEOREM If X and Y are normed spaces, then KL0(X,y) C KL^X^Y) C KL{X,Y) = clKL(X,y) C BL{X,Y) (6.8.2.1) There is inclusion KL(X,y) + KL(X,y) C KL(X,y) and (6.8.2.2) KL^y) + KL1(X,y) C KL^X^Y) and if Z is another normed space BL(y,Z)KL(X,y) C KL{X,Z) and (6.8.2.3) BL(y,Z)KL1(X,y) C KL^XtZ)
6.8 Compact Operators 207 and KL(X,Y)BL(Z,X) C KL(Z,Y) and (6.8.2.4) KL^YJBL^X) CKL^Y) Proof: Much of this can be derived from Theorem 6.7.2, if we observe that the totally bounded and the compact operators are those whose restrictions to Q = DiscjJC(0 ; 1) are in m(Q,Y) and m^tl^Y) respectively. It is clear that if T G L(X,Y) is totally bounded then it is also bounded, and that if T is compact then it is totally bounded. If T G KL0(X, Y) is of finite rank, then by Theorem 6.6.5 the subspace T(X) is locally compact, and also complete, and hence T(X) n Discy (0 ; k) has compact closure for each k > 0, which makes T compact. To see that KL(X, Y) is closed in BL(X, Y) we repeat the argument which proves that m(Q,X) is closed in l^Q^X) from Theorem 6.7.2. This proves (6.8.2.1). Towards (6.8.2.2), if S and T are in KL(X,Y) and s,tGK then {{sT + tS)x: \\x\\ < 1} C 5 {Tx: \\x\\ < 1} + t {Sx: \\x\\ < 1} (6.8.2.5) is totally bounded, and if in particular S and T are both compact then c\{[sT + tS)x\\\x\\ < l}C5cl{Tx:||x|| < l} + *cl{Sx: ||x|| < 1} (6.8.2.6) is compact. Also if U G BL(Z,X) and V G BL(Y,Z) and T G KL(X,Y) then {TUz: \\z\\ < 1} C {Tx: \\x\\ < \\U\\} = \\U\\ {Tx: \\x\\ < 1} (6.8.2.7) is a subset of a totally bounded set, and of a compact set if T G KLX (X, Y), while {VTx: \\x\\ < 1} = V {Tx: \\x\\ < 1} (6.8.2.8) is the continuous image of a totally bounded or of a "relatively compact" subset. ■ Compactness and total boundedness of operators can be tested with sequences: 6.8.3 THEOREM If X and Y are normed spaces and T G BL(X,Y), then T G KL(X, Y) <£=> T^l^X) C m(Y) <£=> P(T) = 0 (6.8.3.1)
208 6. Finite Dimensional Spaces and Compactness and T G KL^X, Y) <£=> T^l^X) C m^Y) <£=> P^T) = 0 (6.8.3.2) Proof: The first implication of (6.8.3.1) is obtained by applying (6.6.2.1) with K = {Tx: \\x\\ < l}, while the second comes from the definition of P(T). The first implication of (6.8.3.2) comes from (6.6.2.1) with K = cl{Tx: ||x|| < 1}, and the second from the definition of P^T). ■ Finite dimensionality can be tested with the essential enlargements: 6.8.4 THEOREM If X is a normed space, then / G KL(X, X) <£=> X finite dimensional <£=> I e KLX(X, X) (6.8.4.1) and PX(X) = 0 <£=> X finite dimensional <£=> P(X) = 0 (6.8.4.2) Proof: If X is finite dimensional, then by Theorem 6.6.5 the subset Disc(0; 1) is compact, giving / G KL1(X,X). If X is not finite dimensional, then again, by Theorem 6.6.5 the subset Disc(0; 1) is not totally bounded, giving / £ KL(X,X). This proves (6.8.4.1), and hence also (6.8.4.2). ■ We conclude with the observation that the only regular compact operators are of finite rank: 6.8.5 THEOREM If X and Y are normed spaces, then KL(X, Y) n BT(X, Y) C KL0(X, Y) (6.8.5.1) Proof: If T G 'BL'(X, Y) is regular, with generalized inverse T" G BL(Y,X),then TT'y = Iy for each y G T{X) (6.8.5.2) and hence if T G KL(X,Y) is totally bounded then the restriction of the identity / to the range T{X) is totally bounded. By (6.8.4.1) T{X) is finite dimensional. ■ 6.9 SEMI-FREDHOLM OPERATORS A "semi-Fredholm" operator has about two thirds of the Fredholm property:
6.9 Semi-Eredholm Operators 209 6.9.1 DEFINITION If X and Y are normed spaces, then T G BL(X, Y) is upper semi-Fredholm iff T is essentially one-one and proper (6.9.1.1) and is lower semi-Fredholm iff T is essentially dense and proper (6.9.1.2) We shall write cr+sBL(X,y) = {T e BL(X,Y):T is not upper semi-Fredholm} (6.9.1.3) and cr"sBL(X,y) ={Te BL(X,Y):T is not lower semi-Fredholm} (6.9.1.4) Evidently T Fredholm <£=£> T upper semi-Fredholm and essentially dense (6.9.1.5) and T Fredholm <£=$> T lower semi-Fredholm and essentially one-one (6.9.1.6) We can use the semi-essential enlargement functor Px of (6.7.4.2) to test for upper and lower semi-Fredholmness: 6.9.2 THEOREM If X and Y are normed spaces and T G BL(X,Y), then Px(r) one-one <£=> T upper semi-Fredholm (6.9.2.1) Proof: Forward implication is divided into three parts. We claim that PX(T) one-one => T essentially one-one (6.9.2.2) PX(T) one-one => T relatively open (6.9.2.3) PX(T) one-one and T relatively open => T proper (6.9.2.4) The argument for (6.9.2.2) is the same as for Theorem 6.6.4: if T_1(0) is infinite dimensional then by the Riesz lemma, Theorem 1.5.2, and part of Theorem 6.2.1, there is a sequence x = (xn) in X satisfying the conditions
210 6. Finite Dimensional Spaces and Compactness (6.6.4.3), so that xel^X) and x^m{X)Dm1{X) and (6.9.2.5) Tx = 0 G c0{Y) C m^Y) C m{Y) which means that neither PX(T) nor P(T) is one-one. For (6.9.2.3) suppose that T is not relatively open: then there is x = (xn) in X for which ||xn|| = 1 and dist(xn,T_1(0)) > \ and ||Txn|| —> 0 (6.9.2.6) We claim 16/^(1) and Tx G c0{Y) C m^Y) and xgm^X) (6.9.2.7) Indeed the first two observations are immediate. If, however, x G m1(X) then there is a subsequence x' of x converging to an element x^ G X: but now T(x'oo)=0 and dist^r^O)) > i (6.9.2.8) a contradiction. Toward (6.9.2.4), suppose that T is relatively open, so that there is k > 0 for which y e T(X) =>y€ {Tx: \\x\\ < k\\y\\} (6.9.2.9) Then if y G cl(TX) there is x' = (x'J in X for which ||t/n-rx'J| -► 0, and by (6.9.2.9) there is x = (xn) in X for which Txn = Tx'n with ||xn|| < A:||rx^||. Since the sequence Tx' is bounded, it follows that x G /^(X), and then also the sequence Tx = Tx' G c^Y) C 77^ (Y). If, therefore, P^T) is also supposed to be one-one, then we must have x G m1(X) and hence there must be a subsequence x" of x converging to an element x'^ G X. Now we have t/ = lirnTx'^ = Tx'4 G T(X) (6.9.2.10) n Since t/ G cl(TX) was arbitrary it follows that T has closed range, which together with being relatively open makes it proper by (3.5.6.1). For backward implication in (6.9.2.1) suppose that T is upper semi- Fredholm: then since T_1(0) is finite dimensional there is by (6.3.5.1) P = P2 G BL(X,X) with P_1(0) = r_1(0), and since T is relatively open there is k > 0 for which, if x G X, \\Px\\ < ||P||dist(x,p-1(0)) = ||P||dist(x,T-1(0)) <A;||P||||rx|| = A;||P||||rPx|| (6.9.2.11)
6.9 Semi-Fredholm Operators 211 Now if x G ^(-X^) is such that Tx G tu1(Y) then for each subsequence x' of x there is a subsequence x" of x' for which Tx" has a limit t/^ G Y, and since T{X) is closed there must be x'^ G X for which y£ = Tx'^ = TPx^. From (6.9.2.11) it follows that Px'^ -► Px'^. Since P"1^) = (7-P)(X) is finite dimensional there must be a subsequence x'" of x" for which (I-P)x'" converges to an element x'£ = {I - P)x'£ in X. But now x'" = Px'" + (/ — P)x!n converges to Px^ + x'^ G X. Since x'" is a subsequence of x', which was an arbitrary subsequence of x, we have shown that x G m1(X), and hence that PX(T) is one-one. ■ Lower semi-Fredholmness can very nearly be tested with Px: 6.9.3 THEOREM If X and Y are normed spaces and T G BL(X, Y), then PX(T) onto <£=£> T essentially dense and relatively almost open (6.9.3.1) and Px(r) one-one onto => T Fredholm => Px(r) invertible (6.9.3.2) Proo/: Towards (6.9.3.1) suppose T is not essentially dense, so that Y/cl(TX) is infinite dimensional. Then by the Riesz lemma (Theorem 1.5.2) and part of Theorem 6.2.1 there is y = (t/n) in Y for which, for each n G N, n \\yn\\ < 2 and dist(t/n+1,cl(TX) + £ Kt/y) > 1 (6.9.3.3) 3 = 1 We claim y G UY) and y £ T^X) + m(Y) D T^l^Y) = m^Y) (6.9.3.4) Certainly y is bounded, while if there were x = (xn) in l^X) for which t/ — Tx had a Cauchy subsequence then there would be n and m > ra for which ||r*n-r*m-yn+ym||<i (6.9.3.5) contradicting (6.9.3.3). Thus if T is not essentially dense then neither PX(T) nor P(T) is onto. If instead T is not relatively almost open, then there is g = (gn) in Y* for which \\gj = 1 and distfon, (T)"1^)) > I and ||^T|| —♦ 0 (6.9.3.6)
212 6. Finite Dimensional Spaces and Compactness Evidently g^rn^) =m{Y^) (6.9.3.7) for if g' = (g^) is a subsequence of g for which g'n —*■ g^ G^ then dist(^, (T)"1^)) > I and g^T = 0 (6.9.3.8) a contradiction. By the completeness of Y\ m^Y^) = m(Y^) and since </ £ m(yt) 0 < P(^) = inf {6 > 0:{gn:ne N} C Bisc(H ;6) for some finite H C Ff} (6.9.3.9) If now 0 < £ < P(<7), then there is a subsequence g' = [g!)n of g for which, if ne N, n 0n+i£ |J Disc(^- ; *) (6.9.3.10) 3 = 1 and then there is {y'n m)n^m in F for which » # ™ =► IKmll = 1 and K(y;>m) - ^«J| > |* (6.9.3.11) Now claim that if x'n m G X for each n ^ m then sup K.JI < oo =► P {j£ - rx'nim: n?m}>\6 (6.9.3.12) Indeed if e > 0 and {t/^m - Tx'nrn\ n ^ m) C Disc(if ; e) for some finite ifC7 (6.9.3.13) then 9'n(y'n,m)-9'm(y'n,m)£ {J I>isc((g'n - g'm)(y") ; s) y"€K + (g'n-g'm)(Tx'ntm) ifn^m. (6.9.3.14) and using the last part of (6.9.3.7) there is {N£,)£l>0 in N for which, if e'>0, n>m>Ne,^g'n(y'nim)-g'm(y'nim)e \J D\sc(K,e + e') (6.9.3.15) y"GK From (6.9.3.13) and (6.9.3.11) it follows \6 < e' for each e' > 0 => ±<5 < e (6.9.3.16)
6.9 Semi-Fredholm Operators 213 giving (6.9.3.12). If we now arrange the pairs (n,m) with n ^ m in a sequence and write y = (y„) = (y'n,m) (6.9.3.17) then (6.9.3.12) can be rewritten * € foot*) => IIS/ - Tx||g > \6 (6.9.3.18) so that in particular the coset y + m1(Y) is not in the range of the operator Pi(T). We have now proved forward implication in (6.9.3.1). Conversely, if T is essentially dense and relatively almost open then by (6.3.5.2) there is Q = Q2 e BL(Y, Y) with Q{Y) = cl(rX), and then k > 0 for which y = Qy => y e el{rx: ||x|| < %||} (6.9.3.19) Now if y G ^(Y) is arbitrary we claim there is x = (xn) in X for which iG^fl) and y-Txem^Y) (6.9.3.20) Certainly for each n 6 N we can find xn 6 X for which ||Qyn -TxJ| < I||yn|| and ||xj| < *||yn|| (6.9.3.21) Thus xG^fl) and Tx-Qyec0{Y) (6.9.3.22) It follows y-Tx = y-Qy + Qy-Txe ^(Q"1 (0)) + c0(Y) = ^i(Q_1(0)) + c0{Y) C mi(Y) (6.9.3.23) using the fact that Q_1(0) is a finite dimensional subspace of Y. Toward the implication (6.9.3.2) it is now clear from (6.9.3.1) and (6.9.2.1) that PX(T) one-one and onto implies T upper semi-Fredholm and essentially dense, therefore Fredholm by (6.9.1.5). Conversely, if T is Fred- holm, with generalized inverse T', which necessairly satisfies (6.4.3.1) and (6.4.3.2), then P1(T')P1(T)-P1(J)=0 and P^rjP^f) - P^J) = 0 (6.9.3.24) so that PX(T') is a two-sided inverse for PX(T). ■ Of course both implications in (6.9.3.2) are two-way, since the last condition implies the first. We should not, however, forget that PX(T) is
214 6. Finite Dimensional Spaces and Compactness in general defined on seminormed spaces rather than on normed spaces. An immediate consequence of Theorems 6.9.2 and 6.9.3 is the extension of Atkinson's Theorem 6.4.6 and the perturbation Theorem 6.4.3 from finite rank to compact operators: 6.9.4 THEOREM If X and Y are normed spaces and T G BL(X,Y) satisfies T left invertible modulo KL^X.X) (6.9.4.1) and T right invertible modulo KLX{Y,Y) (6.9.4.2) then T is Fredholm. Also if K G BL(X, Y), then T Fredholm, K compact => T + K Fredholm (6.9.4.3) Proof: If (6.9.4.1) and (6.9.4.2) hold then there is V G BL(Y, X) for which / - T'T and / - TT' are both compact, giving (6.9.3.24), so that T is Fredholm by (6.9.3.3). Also if T is Fredholm and K is compact then P^T) is invertible and P^jK") = 0, so that VX{T + K) = P^T) is invertible, and hence by (6.9.3.3) T + K is Fredholm. ■ 6.10 ALMOST FREDHOLM OPERATORS The "almost semi-Fredholm operators" are slightly more general than the semi-Fredholm operators: 6.10.1 DEFINITION If X and Y are normed spaces, then T G BL(X, Y) is almost upper semi-Fredholm iff T is relatively open and essentially one-one (6.10.1.1) is almost lower semi-Fredholm iff T is relatively almost open and essentially dense (6.10.1.2) and is almost Fredholm iff T is almost upper and almost lower semi-Fredholm (6.10.1.3) If T is almost Fredholm we shall define index(T) as in (6.5.1.1). We shall write 3+,BL(X,Y) = {T€BL(X,Y): T is not almost upper semi-Fredholm} (6.10.1.4)
6.10 Almost Fredholm Operators 215 &-sBL(X,Y)={TeBL(X,Y): T is not almost lower semi-Fredholm} (6.10.1.5) and <ressBL(X,y) ={Te BL{X,Y):T is not almost Fredholm} (6.10.1.6) Evidently upper semi-Fredholm => almost upper semi-Fredholm => essentially one-one (6.10.1.7) and lower semi-Fredholm =>• almost lower semi-Fredholm => essentially dense (6.10.1.8) The almost Fredholm operators form an open set, and the index is continuous: 6.10.2 THEOREM If X and Y are normed spaces and T G BL(X,Y), then there is ST > 0 for which, if T' G BL(X,Y) satisfies ||r' - T|| < ST, then T almost upper semi-Fredholm => Tf almost upper semi-Fredholm (6.10.2.1) and T almost lower semi-Fredholm => Tf almost lower semi-Fredholm (6.10.2.2) and T almost Fredholm => T' almost Fredholm, with index (T') = index(r) (6.10.2.3) Proof: Suppose T is almost upper semi-Fredholm. Then since T_1(0) is finite dimensional, there is by (6.3.5.1) P = P2 e BL(X,X) for which P_1(0) = r_1(0), and since T is relatively open there is k > 0 for which, if x e X, \\Px\\ < ||P||dist(x,p-1(0)) = ||P||dist(x,r-1(0)) <A:||P||||rx|| = A:||P||||rPx|| (6.10.2.4)
216 6. Finite Dimensional Spaces and Compactness If now T' e BL(X,y) satisfies Jfc||P||||r'-r|| <1 (6.10.2.5) then, for arbitrary x G X, k\\P\\\\T'x\\ > k\\P\\\\Tx\\ - k\\P\\(T' - T)x\\ > \\Px\\ - \\Tf - T\\\\x\\ (6.10.2.6) so that in particular x = Px =* fc||P||||T'x|| > (1 - *||P||||r' - r||)||x|| (6.10.2.7) Thus the restriction of T' to P(X) C X is bounded below, and in particular one-one: (r')_1(o) n p(x) = {o} (6.10.2.8) It follows that dimtr')"1^) < dimX/P(X) = dimP-^O) = diml^O) < oo (6.10.2.9) and hence that V is essentially one-one. By (6.3.5.1) both (T')"1^) and the sum (r')_1(0) + P(X) are complemented: thus we obtain P' = (P')2 G BL(X,X) with (P')-!(0) = (r')_1(0) and P'(X) = P{X) +W (6.10.2.10) where W is a closed subspace complementary to T_1(0) + P(X). We claim that the induced mapping T : X/P/_1(0) —»• Y is bounded below, for by (6.10.2.7) the condition (3.3.6.2) of Theorem 3.3.6 is satisfied with (r')A in place of T and W = (P(X) + P/-1(0))/P/"1(0), while the restriction of (T')A to W is one-one since already (T')A is one-one. Since W is finite dimensional (6.2.3.2) says that the condition (3.3.6.1) of Theorem 3.3.6 also holds, and now Theorem 3.3.6 gives the result. This proves the implication (6.10.2.1). If instead T is almost lower semi-Fredholm, then since cl(TX) is of finite codimension there is by (6.3.5.2) Q = Q2 e BL(y,y) for which Q(Y) = cl(TX), and since T is relatively almost open there is k > 0 for which, if y G Y, y = Qy => y e el {Tx: \\x\\ < k\\y\\} (6.10.2.11) Now if T' e BL(y,y) satisfies k\\T' - T\\ < 1, then by the argument of
6.10 Almost Fredholm Operators 217 Theorem 3.4.3 we have y = Qy => y e cl {T'x: \\x\\ < k'\\y\\} with *' = (1_fe||*,_r|[) (6.10.2.12) so that in particular Q{Y) C cl(r'X) (6.10.2.13) It follows that dimY/cl(r'X) < dimQ-^O) = dimY/Q(Y) = dimYcl(rX) < oo (6.10.2.14) and hence that T' is essentially dense. By (6.3.5.2), c\(T'X) and c\(T'X) D Q_1(0) are complemented, giving us Q' = (Q')2 G BL(Y, Y) with Q'{Y) = cl(r'X) and Q_1(0) = (Q')"1!0) + z' (6.10.2.15) where Z1 is a closed complement for cl(T'X) fl Q_1(0). We claim that the mapping T' is relatively almost open; for by (6.10.2.13) the condition (3.4.6.4) is satisfied with {T')A : X -> Q'{Y) in place of T : X -> Y and Z = Q{Y), while the induced mapping (r')A~ : X -> 0;(y)/0(y) is dense since already (T')A is dense. Since Q'(Y)/Q(Y) is finite dimensional (6.2.3.3) says that the condition (3.4.6.3) of Theorem 3.4.6 also holds, and hence by Theorem 3.4.6 the mapping (T')A is almost open, giving the result. This proves the implication (6.10.2.2). If T is almost Fredholm and if V G BL(X, Y) satisfies both (6.10.2.5) and the condition following (6.10.2.12), then all four projections P, Q, P' and Q' are present, and we can write X=P{X)®W®{P')-10 and Y = Q{Y)eZe{Qf)~10 (6.10.2.16) We claim that the operator (r')A : P{X) © W —> T'(PX) © T'{W) = Q'tQ^O) (6.10.2.17) induced by T' is one-one, so that index(r') = dim(r')_10 - (dimQ"1^) - dimr'(^)) = dim(r')_10 + dimW - dimQ"1^) = dimr"1(0) -dimQ-^O) = index(r) ■
218 6. Finite Dimensional Spaces and Compactness On the face of it we have again proved more than we have stated. The index is not only continuous, but actually locally constant. Since, however, normed spaces are connected, and locally connected, this is the only way that a mapping into the integers could have been continuous. To test for almost Fredholmness it is more appropriate to use the enlargement functor P instead of the semi-enlargement P x: 6.10.3 THEOREM If X and Y are normed spaces and T G BL(X, Y), then T almost upper semi-Fredholm => P(T) bounded below (6.10.3.1) and T almost lower semi-Fredhom => P(T) open (6.10.3.2) and hence T almost Fredholm => P(r) invertible (6.10.3.3) Proof: If T is almost upper semi-Fredholm, then P(T) is certainly one-one: for if P = P2 G BL(X,X) with P_10 = r_10 and k > 0 satisfies (6.10.2.4), and if x G l^X) with TNx G m(Y), then for each subsequence x' of x there is a subsequence x" of x' for which TNx" G c(Y) is Cauchy, so that by (4.1.3.2) Px" G c(X) is also Cauchy. At the same time, the sequence (/ — P)x" is bounded and in the finite dimensional space (I — P)X = T_10, which is locally compact by Theorem 6.6.4. Therefore, there is a subsequence x,n of x" for which (/ — P)x,n G m^X) C m(X), and hence x'" = Px'" + (J - P)x'" G m{X) (6.10.3.4) To see that the operator P(T) is bounded below we show that for each dist(x, m(X)) = dist(Px, m{X)) < 2k dist(rx, m{Y)) (6.10.3.5) For if 6 > dist(Tx, m(Y)) is arbitrary, then by (6.7.2.2) there is a finite set H C Y for which {Txn:n G N} C Disc(H ; <5), hence a sequence y = (yn) in/^(Y) with {yn: n G N} C H and ||y - Tx^ < 6 (6.10.3.6) and finally a finite subset K C {xn:7i G 6A^} and a sequence x' G ^oo(^) for which {x'n:ne N} C K and |lTx' - y]]^ < 6 (6.10.3.7)
6.10 Almost Fredholm Operators 219 Thus {Txn: n G N} C Disc(r(K) ; 26) (6.10.3.8) and hence {Pxn: n G N} C Disc(P(K) ; 2k6) (6.10.3.9) By (6.7.2.2) again we have dist(x,m(X)) = dist(Px,m(X)) < 2kS (6.10.3.10) which by the choice of 6 gives (6.10.3.5). If instead T is almost lower semi-Fredholm then there is Q = Q2 G BL(y,y) with Q{Y) = d{TX), and k > 0 for which (6.10.2.12) holds. Now if y G Joo(y) is arbitrary we claim that there is x = (xn) in X for which xel^X) and y-Txem{Y) and H*^ < kM^ (6.10.3.11) Certainly for each n G N we can find xn G X for which ||Txn - Qyn|| < - and ||xn|| < k\\yj (6.10.3.12) n which ensures that x = (xn) is bounded and with norm dominated by Hyll^. At the same time the sequence Qy — Tx is in c0(Y) C m^Y), while the sequence y — Qy has its terms in the finite dimensional subspace Q_1(0), therefore is in m1(Y): thus ^ y-Tx = y-Qy+Qy-Tx G m1{Y) + c0{Y) C m^Y) C m{Y) (6.10.3.13) We have proved (6.10.3.1) and (6.10.3.2), and hence also (6.10.3.3). ■ As we have already proved in the argument for (6.9.2.1), P(T) one-one =>■ T essentially one-one (6.10.3.14) we claim that also P(r) dense =^ T essentially dense (6.10.3.15) For if T is not essentially dense, so that y/cl(TX) is infinite dimensional, and if y = (yn) is a sequence satisfying the conditions (6.9.2.18), then we claim p(y - TNx) > \ for each x G /^(X) (6.10.3.16) where p is the measure of noncompactness (6.7.1.3).
220 6. Finite Dimensional Spaces and Compactness Theorem 6.9.3 and the continuity of the index enable us to improve (6.4.4.6.) and (6.5.5.4): 6.10.4 THEOREM If X and Y are normed spaces and T G BL(X, Y), KeBL{X,Y) then T Fredholm, K compact => T + K Fredholm with index(r + K) = index(r) (6.10.4.1) Proof: By (6.9.3.2) T + tK is Fredholm for each t e K (6.10.4.2) and by (6.10.2.3) index(T + tK) is a continuous function of t ■ (6.10.4.3) 6.11 COMPLETENESS If X and Y are Banach spaces, then the theory of compact and of Fredholm operators is considerably simplified: 6.11.1 THEOREM If X and Y are Banach spaces and T G BL(X, Y), then T totally bounded => T compact (6.11.1.1) T almost upper semi-Fredholm => T upper semi-Fredholm (6.11.1.2) T almost lower semi-Fredholm => T lower semi-Fredholm (6.11.1.3) and hence T almost Fredholm => T Fredholm (6.11.1.4) Proof: Implication (6.11.1.1) follows from Theorem 6.6.2, applied to K = cl{Tx: ||x|| < 1}. Implications (6.11.1.2) and (6.11.1.3) follow from Theorem 4.4.3 and Theorem 4.4.4 applied to the operator core(T) : T/X~l (0) —► ci(rx). ■ If X and Y are complete, then Theorem 6.8.2 tells us that the compact operators KL1(X,Y) = KL(X, Y) form a closed ideal of BL(X, Y), and Theorem 6.10.2 tells us that the Fredholm operators form an open subset of BL(X, Y). Alternatively, this can now be deduced from either (6.4.6.4) or (6.4.6.5) of Theorem 6.4.6. For if K is so small that ||3r'|| ||JFC|| < 1 then by the observation (4.4.5.15) both / + TK and / + KT1 will be invertible
6.11 Completeness 221 and hence Fredholm. At the same time, since I+T'K and I+KT' also have index zero, (6.5.3.2) and (6.5.3.3) from Theorem 6.5.3 offer an alternative proof that the index is continuous. When X and Y are complete then both "essential enlargements" coincide: 6.11.2 THEOREM If X and Y are complete and T G BL(X, Y), then m{X) = mx(X) and m{Y) = m^Y) (6.11.2.1) and P(r) =Pi(r) (6.11.2.2) Proof: Equality (6.11.2.1) follows by the same argument as (6.11.1.1), and then gives (6.11.2.2). ■ If X and Y are complete, then Theorem 6.9.2 and Theorem 6.10.3 coalesce: 6.11.3 THEOREM If X and Y are Banach spaces and T G BL(X, Y), then P(T) one-one => T upper semi-Fredholm => P(r) bounded below (6.11.3.1) and P(r) one-one dense => T Fredholm => P(r) invertible (6.11.3.2) Proof: For (6.11.3.1) use (6.9.2.1) and (6.10.3.1), and for (6.11.3.2) use (6.10.3.15) together with (6.9.2.1). ■ Theorem 6.11.3 displays the upper semi-Fredholm operators, and the Fredholm operators, as continuous counterimages of open subsets of BL(P(X),P(Y)), and therefore as open subsets of BL(X,Y). Alternatively, since by (4.4.5.9) the invertible operators form an open set, (6.4.4.4) tells us that the Fredholm operators form an open set. We can also see that the topological boundary of the Fredholm operators is disjoint from the upper semi-Fredholm operators. 6.11.4 THEOREM If X and Y are complete and T G BL(X, Y) is upper semi-Fredholm, then T G closure {Fredholm operators} => T Fredholm (6.11.4.1)
222 6. Finite Dimensional Spaces and Compactness Proof: If T is upper semi-Fredholm and \\T - Tn\\ -> 0 with Fredholm (rj, then by (6.11.3.1) and (6.11.3.2), P{T) is bounded below and ||P(T) — P(rn)|| -► 0 with dense P(TJ, so that by Theorem 3.5.1 P(r) is almost open, and therefore (6.11.3.2) T is Fredholm. ■ When the spaces X and Y are complete there is a rather simpler characterization of Fredholm operators: 6.11.5 THEOREM If X and Y are complete, then T G BL(X,Y) is Fredholm iff T_1(0) and Y/T{X) are finite dimensional (6.11.5.1) Proof: Whether or not the spaces X and Y are complete, the condition (6.11.5.1) is necessary for T to be Fredholm, since Fredholm operators have closed range, and is sufficient for T to be both essentially one-one and essentially dense. We claim that if X and Y are complete, then (6.11.5.1) implies that T is proper. For Theorem 4.8.2 says that T(X) is closed, and now Theorem 4.5.3 says that core(T) is invertible, so that T is proper. ■ Theorem 6.11.5 means that some of the arguments in our development can be shortened if we confine ourselves to complete spaces. For example, if we use the argument of Theorem 6.5.4 to prove that the product of Fredholm operators is Fredholm, it is unnecessary to make the observation (6.5.4.12), while if we want to prove that the Fredholm operators form an open set we need only establish (6.10.2.9) and (6.10.2.14) from the argument for Theorem 6.10.2. 6.12 DUALITY THEORY The dual of a compact or a Fredholm operator has the same property: 6.12.1 THEOREM If X and Y are normed spaces and T G BL(X,y), then t e KL0(x,y) <^> rf g KL0(yt,xt) (6.12.1.1) T Fredholm ^=> Tf Fredholm (6.12.1.2) t e KL(x,y) <^> rf e KL(yt,xt) (6.12.1.3) Proof: If T* G BL(yt,Xt) is not of finite rank then there is a linearly independent sequence / = (fn) in T^(Y^) C X*, and if m G N is arbitrary there is by (6.1.2.1) a sequence (x1,x2,... ,xm) is X for which QiiTxj) = /t-(xy) = «.. for each ij G {1,2,...,m} (6.12.1.4)
6.12 Duality Theory 223 where g = (gn) in Y^ is chosen so that gnT = fn for each n. Since it is clear from (6.12.1.4) that (Tx1,Tx2,... ,Txm) is linearly independent, it follows from Theorem 6.1.1 that T(X) cannot be of dimension < m. Since m is arbitrary it follows that T cannot be of finite rank. This proves forward implication in (6.12.1.1). Conversely, if T* is of finite rank then so is the second dual T^, by what we have just proved, and hence also T. Towards (6.12.1.2) we claim T essentially dense <£=> T' essentially one-one (6.12.1.5) and T* essentially dense =>• T essentially one-one (6.12.1.6) For by (5.6.0.1) we have T1" essentially one-one <!=> T(X)° finite dimensional (6.12.1.7) and by (2.3.2.2) the dual space of y/cl(rX) is (TX)°: thus we may apply (6.12.1.1) to the identity I : y/cl(rX) -+ y/cl(rX). This proves (6.12.1.5), and hence also (6.12.1.6): T* essentially dense => T'' essentially one-one => T essentially one-one (6.12.1.8) If we recall (5.6.4.2) that T is proper if and only if T* is, then we have shown T lower semi-Fredholm <£=> T^ upper semi-Fredholm (6.12.1.9) and forward implication in T* lower semi-Fredholm ^=^ T upper semi-Fredholm (6.12.1.10) Conversely, if T is upper semi-Fredholm, and P = P2 G BL(X,X) has P_1(0) = r_1(0), then I- P is of finite rank and there is k > 0 for which ||Px|| < ||rx|| for each x G X (6.12.1.11) By (6.12.1.1) the dual operator P*, also a projection, has finite dimensional null space since it — P* is of finite rank, and we claim Pf(Xf) CT^yt) (6.12.1.12) Indeed, if / = fP e X1" we may define g° : TX -+ K by setting g°{Tx) = f(x) for each x G X*, using (6.12.1.11) to see that g° is well defined,
224 6. Finite Dimensional Spaces and Compactness and then bounded, and then extending it to g G Y1" by the Hahn-Banach Theorem 5.3.2. Toward (6.12.1.3) we claim that, for arbitrary T G BL(X, Y), p {T*g: \\g\\ < l} < 4p {Tx: \\x\\ < 1} (6.12.1.13) where p is given by (6.7.1.3). Indeed if 6 > p{Tx: ||x|| < 1} is arbitrary then there is n G N and (x1?x2,... ,xn) G Xn for which {Tx: ||x|| < 1} C Uy=1 Disc(Txy ; 26), and then since the mapping W : g —ygTx from Y1" to Kn (6.12.1.14) is of finite rank, there is for each 6 > 0 a finite set H C Y* for which {Wg: \\g\\ < 1} C Disc(W(#) ; e) (6.12.1.15) We claim {T*g: \\g\\ < l} C Disc^jff) ; 4(5 + e) (6.12.1.16) for if g G Y1" with ||^|| < 1 there is h e H for which max,- I^Tx^ - /iTx^l < ||W<7 — W/i||,andnow if ||x|| < 1 we can choose j for which ||Tx —Txy|| < 26, so that |<?rx - hTx\ < \gTx - gTxj\ + \gTxj - hTx^ + \hTXj - hTx\ < \\g\\ \\Tx - Txyll + e + ||/i|| ||rxy - Tx|| < 4(5 + e The choice of 6 and e now gives (6.12.1.13). In particular it follows from (6.12.1.13) that T totally bounded => Tf totally bounded (6.12.1.17) and hence also Tf totally bounded =^ Tft totally bounded =^ T totally bounded ■ (6.12.1.18) The implication (6.12.1.6) cannot in general be reversed. For example, if T = J : lx —► c0, as discussed following (5.9.1.3), then T is one-one but is not essentially dense. Using Theorem 5.6.1 and Theorem 5.6.2, the reader may like to show T almost lower semi-Fredholm <£=> T almost upper semi-Fredholm (6.12.1.19)
6.12 Duality Theory 225 and T almost upper semi-Fredholm <£=> T* almost lower semi-Fredholm (6.12.1.20) We can also supplement (6.12.1.2) by adding T Fredholm => index(rf) = -index(r) (6.12.1.21) In particular T Weyl <^> Tf Weyl (6.12.1.22) Theorem 6.12.1 and Theorem 6.11.3 combine to give the "perturbation theory" for Fredholm and semi-Fredholm operators: 6.12.2 THEOREM If X, y and Z are Banach spaces and T G BL(X, Y), S<EBL(y,Z),then 5, T upper semi-Fred =>• ST upper semi-Fred =>• T upper semi-Fred (6.12.2.1) and 5, T lower semi-Fred => ST lower semi-Fred => S lower semi-Fred (6.12.2.2) and if also K G BL(X,Y), then T upper semi-Fred, K compact => T + K upper semi-Fred (6.12.2.3) and T lower semi-Fred, K compact => T + K lower semi-Fred (6.12.2.4) Proof: (6.12.2.1) follows from (6.11.3.1) with either (3.2.3.1) or (3.3.2.1) applied to P(r) andP(5), while (6.12.2.2) follows from (6.12.2.1) applied to r1" and 5f, together with (6.12.1.9). For (6.12.2.3) use (6.11.3.1), and finally (6.12.2.4) follows from (6.12.2.3) applied to T* and K^ using (6.12.1.9) and (6.12.1.3). ■ For a converse to (6.12.2.3) and (6.12.2.4), we begin with an auxiliary result: 6.12.3 THEOREM Suppose X and Y are normed spaces and T G BL(X,y); then if T is not relatively open there are (xn) in X and (fn)
226 6. Finite Dimensional Spaces and Compactness in X^ for which oo fi{*j) = Sij for each iJ and L Hrxnllll/nll < oo (6.12.3.1) n=l while if T is not relatively almost open there are (yn) in Y and (gn) inY^ with oo SiiVj) = S{j for each ij and ]£ 11^^1111^11 < oo (6.12.3.2) n=l Proof: Suppose (en) is an arbitrary sequence of positive numbers. Then is T is not relatively open we shall find (xn) and (fn) for which fiixj) = sij and ||rxy||||/y|| < Gj for each ij G N (6.12.3.3) while if T is not relatively almost open we shall find (gn) and (yn) for which fc(y;)=fy and II^IHI^-II^^ for each j'jGN (6.12.3.4) Thus, if X)yLi ej < oo we will satisfy (6.12.3.1) and (6.12.3.2). Towards (6.12.3.2) there is certainly xx G X for which HxJI = 1 and IITxjH < el9 since core(T) is not bounded below, and then by the Hahn- Banach Theorem 5.3.2 there is fx G X1" with H/J = 1 = /JxJ. If n G N and sequences (x1,x2,... ,xn) in X and (/!,/25- • • ,fn) in X* satisfy (6.12.3.3) for each i,j G {1,2,... ,?i}, we claim that there are xn+1 G X and /n+1 G X1" for which (6.12.3.3) holds for ij G {1,2,... ,71,71 + l}. The subspace n Wn=P[fi1W (6.12.3.5) 3 = 1 is of finite codimension in X, and by Theorem 3.11.1 T cannot be bounded below on Wn. Thus, if e°+1 > 0 is arbitrary we can find xn+1 G Wn with K+ill = 1 and ||r*n+1|| < e°+1, and then /°+1 € X* with ||/°+1|| = 1 = fn+iixn+i)- Now put OO /n+1 = /n+1 " £ /n+1 K)/y (6.12.3.6)
6.12 Duality Theory 227 Evidently fj{xn+\) =° = /n+i(x>) =° for eachjG {l,2,...,m} and (6.12.3.7) ll/n+lll<l + Ell/ill 3 = 1 To extend (6.12.3.5) therefore it is only necessary to do all this with (l + Ell/,llW°+i<^+i (6-12-3.8) Toward (6.12.3.4) observe that the operator core(Tt) is not bounded below, so that there is gx G Y^ for which ||^x || = 1 and ||^1T,|| < ^e1, and then yx e Y with ||j/iII < 2 and 9i{Vi) = 1- If sequences (y1?y2,... ,yj and (g1,g2, • • • ,<7n) are consistent with (6.12.3.4) then the restriction of T* to the annihilator of 5Z?=i ^Vj cannot be bounded below, so that if e^+i is arbitrary there will be <7n+1 in this annihilator for which ||<7n+1|| = 1 and \\9n+iT\\ <e°+1, and then y°+1 6 F with ||y°+1|| < 2 and<7n+1(y°+1) = 1. Now put n Vn+i = yj+i - E^(Ci)^ (6.12.3.9) 3 = 1 Evidently 9j{yn+i) = ° = dn+iiVn) for each J G {1,2,... ,n} and / » \ (6.12.3.10) ll»n+lll<2|l + X)Nl] For consistency with (6.12.3.4) we need only do this with 2 1 + L INI K+1 < eB+1 ■ (6.12.3.11)
228 6. Finite Dimensional Spaces and Compactness 6.12.4 THEOREM If X and Y are Banach spaces, then T G BL(X, Y) is upper semi-Fredholm iff K compact =>• T — K essentially one-one (6.12.4.1) and T G BL(X, Y) is lower semi-Fredholm iff K compact =>• T — K essentially dense (6.12.4.2) Proof: If T is upper semi-Fredholm then (6.12.4.1) is part of (6.12.2.3), and if T is lower semi-Fredholm then (6.12.4.2) is part of (6.12.2.4). If T is not upper semi-Fredholm then either it is not essentially one-one, in which case (6.12.4.1) fails with K = 0, or else it is not proper, therefore not relatively open, in which case (6.12.4.1) fails with oo K=Y,fn®T*n (6.12.4.3) n=l where (xn) and (/n) satisfy (6.12.3.1): for by the completeness of BL(X, Y) and the closedness of KL(X,Y) = KL1(X,Y) the operator K is well defined, and xn e {T - K)_1(0) for each neN (6.12.4.4) If instead T is not lower semi-Fredholm then either it is not essentially dense, in which case (6.12.4.2) fails with K = 0, or else it is not proper, therefore not relatively almost open, in which case (6.12.4.2) fails with oo ff=X>„T©yB (6.12.4.5) n=l where (gn) and (yn) satisfy (6.12.3.2). For again, K is well defined and compact, with cl((r - K)X) C g-l(Q) for each neN ■ (6.12.4.6) Theorem 6.12.4 enables us to round off Theorem 6.11.3: P(r) dense =^ T lower semi-Fredholm =^ P(r) open (6.12.4.7) Indeed, the second implication is (6.10.3.2), while the first follows from the second part of Theorem 6.12.4 if we apply (6.10.3.15) to T - K for all compact K eBL{X,Y).
6.13 Composition Operators 229 6.13 COMPOSITION OPERATORS We have seen in Theorem 6.12.1 that the dual of a finite rank or compact operator has the same property. This does not extend to more general compositions RT = BL(r,PY) and LT = BL(W,T): 6.13.1 THEOREM If 0 ^ T G BL(X, Y) and W is an infinite dimensional normed space, then neither LT = BL(W,T) nor RT = BL(T,W) is totally bounded. Proof: If W is infinite dimensional then by the Riesz lemma 1.5.2 there is (wn) in W for which, for each m,n G N, |K|| = land^m^ \\wn - wm\\ > \ (6.13.1.1) and if T ^ 0 there is / G X* for which ||/r|| = 1: now ||/ 0 wn\\ = ll/H and n + m => \\RT(f 0 wn) - RT(f 0 "m)ll > J||/|| (6.13.1.2) Since W is also infinite dimensional there is [hn) in W for which, for each m,n G N, ||fcB|| = 1 and n ± m =► ||fcn - fcj| > ± (6.13.1.3) and since T ^ 0 there is x G X for which ||Tx|| = 1: now \\hn 0 x|| = ||x|| andn^m^ ||LT(/in 0 x) - LT{hm 0 x)|| > ±||x|| ■ (6.13.1.4) In spite of Theorem 6.13.1, total boundedness can still be transmitted to composition operators. If T : X —> Y and T'X —> Y are linear mappings between the same two spaces we shall write T' A T : (I" - r)_10 —> Y (6.13.1.5) for the common restriction of T and T' to the subspace {x G X: T'x = Tx} on which they agree. We shall also recall || • ||gSS from (6.7.5.2): 6.13.2 THEOREM If X, Y, Z, W and E are normed spaces and if T G BL(X,Y), 5 G BL(PY,Z), U : E -+ BL(Y,Z) and Y : £ -+ BL(X,PY) are bounded linear operators, then ||(Br • V) A (Ls • V)||;„ < 4\\U\\\\T\\'ess + 2||V||||5||iM (6.13.2.1) In particular T,S totally bounded => (RT • U) A [Ls • Y) totally bounded (6.13.2.2)
230 6. Finite Dimensional Spaces and Compactness Proof: Write F={RT-U-LS. V)-1^) = {eeE: U{e)T = SV{e)} (6.13.2.3) and $ = (RT • U) A {Ls • V) : F —-► BL(X, Z) (6.13.2.4) Choose arbitrarily 6 > ||T||gSS and e > \\S\\fe33: we then claim that Unless <4||tf||5 + 2||V||£ (6.13.2.5) Indeed, since ||T||gSS < 6, there is a finite subset K6 C Discx(0;l) for which T Disc(0 ; 1) C Disc{T{K6) ; 26) (6.13.2.6) and for each x' G K6 there is a finite subset He(X') C Disc^O ; 1) = jPDisc^^ ; 1) for which {SV(e)x': e G DiscF(0 ; 1)} C \\V\\SDiscw(0 ; 1) C Discz{SVHe{x')x'; 2\\V\\e) (6.13.2.7) where once again the factor of 2 comes from the effort to take the finite subset of W to be of the form VHe(xf). Consider now the finite set He)6= (J ff,(x')CDiscF(0;l) (6.13.2.8) x'EK6 If e G Disc^(0 ; 1) and x G Discx(0 ; 1) are arbitrary, then there are by (6.13.2.6) x' G K6 for which ||rx-rx;|| < 26, and by (6.13.2.7) e' G He{x') for which ||5VP(c)x/-5VP(c/)x/|| < 2\\V\\e. Taking account of the definition (6.13.2.3) of F we have ||(«(e) - «(c'))x|| < \\U(e)(Tx - Tx')\\ + \\S(V(e) - V(e'))x'\\ + \\U(e')(Tx-Tx')\\ <2||J7||« + 2||Vr||e + 2||J7||« Since x G Discx(0 ; 1) is arbitrary it follows ||$(e) - $(e')|| < 4\\U\\6 + 2\\V\\e (6.13.2.9) which by the choice of 6 and e gives (6.13.2.1). ■ For example (6.12.1.13) follows from (6.13.2.1) if we take W = Z = K; E = Y^; S = I; U = I; V = T1" (6.13.2.10)
6.13 Composition Operators 231 so that F = E = yf and RT • U = Ls . V = Tf and ||S||'ess = 0 (6.13.2.11) 6.13.3 THEOREM If X, Y, Z and W are normed spaces and T G BL(X,Y), S e BL{W,Z), then ii^^Tiress<4ii5iiiir'iress+2iiriiii5iress (6.13.3.1) In particular t e KL(x,y), s e kl{w,z) => lsrt e kl{bl{y,w),bl{x,z)) (6.13.3.2) If Y = X then also l|£rA£r||Us<6|miUs (6.13.3.3) In particular T e KL(X,X) =>LTARTe KL{comm{T),BL{X,X)) (6.13.3.4) Proof: For (6.13.3.1) apply (6.13.2.1) with E = BL(Y, W) [7 = L5 V = RT (6.13.3.5) so that F = E = BL(Y, W) and £5 • *7 = LT • V = LSRT (6.13.3.6) For (6.13.3.3) take W = Z = Y = X E = BL{X,X) S = T U = V = I (6.13.3.7) so that F = comm(r) ={56 BL(X,X): ST = TS} and (6.13.3.8) RT • U = RT and Ls • V = Lr ■ We can slightly generalize Theorem 6.13.1 to give a converse to (6.13.3.2):
232 6. Finite Dimensional Spaces and Compactness 6.13.4 THEOREM If X, Y, Z and W are normed spaces and T G BL(X, Y), S G BL(W,Z), then there is implication LSRT totally bounded => (S totally bounded or T = 0) and (S = 0 or T totally bounded) (6.13.4.1) Proof: If S ^ 0 and T is not totally bounded, so that by (6.12.1.3) T* is not totally bounded, and then by (6.8.3.1) there is (gn) in Y* and w G W for which ye^^) and fonr)gm(Xt) and Sw^ 0 G Z (6.13.4.2) Since now (gn0w)eloo{BL(Y,W)) and {S(gnOw)T) $m(BL(X,Z)) (6.13.4.3) it is clear that LSRT is not totally bounded. If instead S is not totally bounded and T ^ 0, then again by (6.8.3.1) there is (wn) in W and # G Y* for which KJe'ooW and (Su;n)gm(Z) and ^^06^ (6.13.4.4) This time (0©™JeUBL(Y,WO) and (% © wjr) £ m(BL(X,Z)) ■ (6.13.4.5) Theorem 6.13.3, and in particular (6.13.3.2), says something about the "essential invertibility" of the operators co\(Ls,RT) and row(JR5,LT) of (5.6.3.6) and (5.6.3.5): we are using Definition 3.9.1 relative to the ideals of totally bounded operators. 6.13.5 THEOREM If X, Y, Z and W are normed spaces and T G BL(X,Y), S G BL{W,Z), then S essentially left and T essentially right invertible (6.13.5.1) implies co\(Ls,RT) essentially left and tow(Rs,Lt) essentially right invertible (6.13.5.2) Proof: If (6.13.5.1) holds, then there are S' G BL{Z,W) and V G BL(Y,X) for which TT'-I=KeKL{Y,Y) and S'S-I = H G KL{W,W) (6.13.5.3)
6.13 Composition Operators 233 and hence, using (6.13.3.2) = lhrk € KL(BL(y,Py),BL(y,Py)) (6.13.5.4) and yii>jnli>jnt — JtC>jn)[ljQljQt — ■^JTj = \-*JS S1 — l)\ TJ"'T,, — I) = rhlk e KL(BL(Z,X),BL(Z,X)) (6.13.5.5) Noting that each left and each right multiplication commute with one another, this can be rewritten in the form row((L5/L5 — I)RT,,RS,) co\(Ls,RT) + I = LHRK totally bounded (6.13.5.6) and row(JR5, LT) co\{Rs,{LTLT, — I),LT) + I = Rh^k totally bounded ■ (6.13.5.7) It is left to the reader to state and prove the analogue of Theorem 6.13.5 for "almost essential invertibility." In the other direction, we have the essential analogue of Theorem 5.6.4: 6.13.6 THEOREM If X, Y, Z and W are normed spaces and T G BL(X,Y), S e BL{W,Z) then tow(Rs,Lt) essentially dense or co\(Ls,RT) essentially one-one (6.13.6.1) implies (S essentially one-one or T dense) and (S one-one or T essentially dense) (6.13.6.2) and iow(Rq,LT) almost lower semi-Fredholm ^ S T) (6.13.6.3) or co\(Ls,RT) almost upper semi-Fredholm implies (S almost upper semi-Fredholm or T almost open) and (S bounded below or T almost lower semi-Fredholm) (6.13.6.4)
234 6. Finite Dimensional Spaces and Compactness Proof: If (wn) in S 1(0) and (gm) in (T*) 1(0) are linearly independent, then (gm ®wn) in col(L5,jRr)-1(0) is linearly independent (6.13.6.5) and {dm ® wn) m (row^S'^r)*)-1^) *s linearly independent (6.13.6.6) where gm <g> wn : R —► ((7mJRiun) is the linear functional introduced in (2.9.2.6) immediately following our first encounter with the finite rank operator gm 0 wn in (2.9.2.4). This already proves (6.13.6.1). If we can find either an infinite linearly independent sequence (wn) and a single nonzero (gn) =5l5ora single nonzero (wn) = w1 and an infinite linearly independent (<7m), then the infinite sequence (gm Qwn) will mean that co\(Ls,RT) is not essentially one-one, while the infinite sequence (gm <g> wn) will mean that iow(Rs,LT) is not essentially dense. If we only assume the failure of the condition (6.13.6.4) then we must work harder, and construct sequences {gn 0 wn) and (gn ® wn) with the help of Theorem 6.12.3 for which \\Sn © "nil = \\9n ® Wn\\ = 1 for each "> (6.13.6.7) and \\9n ®Wn-9m® Wm\\ > 1 and K ®wn-9m® Wm\\ >1 if n + m (6.13.6.8) and ||col(L5,jRr)(0n 0 wn)\\ + ||[gn ® iyn)row(JR5,Lr)|| —»• 0 as n —► oo (6.13.6.9) There are of course two cases. If S is not almost upper semi-Fredholm then we can find (wn) and (hn) for which, as in (6.12.3.1), oo IK|| = 1 and hn(wm) = 6nm and ^\\Swn n=l and ||ffj| = l and ||ffnr||—>0 incorporating the assumption that T is not almost open. Both (6.13.6.7) and (6.13.6.8) are clear. Since \\hn\\ > 1, it follows that the sequence {Swn) is absolutely summable and therefore converges to 0, giving (6.13.6.9). If instead T is not lower semi-Fredholm and S not bounded below, then find ll^nll < °° (6.13.6.10)
6.13 Composition Operators 235 (wn), (gn) and (y) for which, as in (6.12.3.2), oo lkll = l and 9n(ym) = 6nm and £ ||ff„r||||yB|| < oo n=1 (6.13.6.11) and |K|| = 1 and ||5u;B||—»0 The reader can again verify (6.13.6.7), (6.13.6.8), and (6.13.6.9). ■
7 Operator Algebra and Commutivity When Y = X the space of operators BL(X, Y) is a normed algebra in the sense of Definition 1.10.1. Much of the theory of BL(X, X) can be extended at no extra cost to more general normed algebras A: 7.1 COMMUTANTS AND DOUBLE COMMUTANTS If A is a normed linear algebra, or more generally a ring, then elements a, 6 G A are said to commute iff ba = ab (7.1.0.1) 7.1.1 DEFINITION If K is a subset of the ring A then its commutant is the set comm(K) = commA{K) = {a G A:ba = ab for each 6 G K} (7.1.1.1) and its double commutant is the set comm2(jFC) = comm^(jFC) = comm(comm(JK')) (7.1.1.2) If a G A is an element we shall write comm(c) = comm({a}) (7.1.1.3) Commutants and double commutants have properties analagous to the poles and polars of Theorem 5.2.2: 237
238 7. Operator Algebra and Commutivity 7.1.2 THEOREM If A is a normed linear algebra and K C A, then comm.{K) is a closed subalgebra of A. If also K' C A there is implication K Ccomm2{K) (7.1.2.1) and Ka'=> comm(K') C comm(K) (7.1.2.2) Hence also comm(comm2(K)) = comm(K) (7.1.2.3) Proof: This is left to the reader. ■ Notice that if a G A, then commA(a) = {La - flj"1^) (7.1.2.4) is the null space of the linear operator La — Ra : A —> A of (2.10.0.1) and (2.10.0.2). A subset K C A is called commutative if (7.1.0.1) holds for every pair a, 6 G if. Necessary and sufficient is that K C comm(K) (7.1.2.5) or equivalently that comm2(K) C comm(K) (7.1.2.6) If K = A, then its commutant comm(A) is called the center of A. For example, the algebras ^(ft) and (7^(17) are always commutative, while BL(X, X) is about as noncommutative as it is possible to be: 7.1.3 THEOREM If fi is a topological space and A = C^fi), then commA(A) = A (7.1.3.1) If X is a normed space and A = BL(X,X), then KL0(X,X) C K C A => commA(K) = K/ (7.1.3.2) Proof: The implication (7.1.3.1) is clear. Towards (7.1.3.2) suppose T G BL(X,X) commutes with KL0(X,X), so that for arbitrary x G X and / G X* we have / 0 (Tx) = T o (/ 0 x) = (/ 0 x)T = (/ o T) 0 x (7.1.3.3) It follows /(y)Tx = /(Ty)x for each y67 (7.1.3.4)
7.1 Commutants and Double Commutants 239 If, in particular, X ^ 0, then by Theorem 5.4.1 there are / G X* and y G X for which f(y) = 1, and now (7.1.3.4) gives T = kl with k = f{Ty) (7.1.3.5) This proves (7.1.3.2) if X ^ 0, and the reader can check that it remains valid when X = 0. ■ We recall the subgroup A-1 of invertible elements of A introduced following (1.10.1.8). By the arguments of Theorem 3.1.2 and Theorem 3.1.3 we have implication, for a, 6 G A, {a, 6} C A-1 <^=> {6a, ab} C A-1 (7.1.3.6) and 1 - ba G A-1 <^=> 1 - a6 G A-1 (7.1.3.7) and also ab G A"1 =^(aG A-1 <^=> 6 G A-1) (7.1.3.8) We recall also the Definition (1.10.1.7) of /(a) G A when a £ A and / : K —► K is a polynomial. Alternatively this can be presented "inductively," by setting z(a)=a and t(a) = t for each t G K (7.1.3.9) and requiring that if g and h are polynomials then (g + h){a) = g{a) + h(a) and (a • h){a) = g(a)h(a) (7.1.3.10) Thus to prove a proposition about f(a) it is sufficient to verify it for the constants t and the coordinate z, and to prove that it is transmitted to sums and products. For example, we claim that f(a) G A is always in the "double commutant" comm^(o): 7.1.4 THEOREM If A is a normed linear algebra and a G A, then there is implication a G A"1 => a"1 G comm^(a) (7.1.4.1) If / : K —► K is a polynomial, then also f{a) Gcomm^(a) (7.1.4.2) and there is inclusion /commA(a) C commA(a) C commA /(a) (7.1.4.3)
240 7. Operator Algebra and Commutivity and commA f(a) C commA(a) C commA /(a) (7.1.4.4) Proof: Suppose a G A-1 is invertible; then if 6 G commA(c) we have a~lb = a~l{ba)a~l = a~l{ab)a~l = 6a"1 (7.1.4.5) This proves (7.1.4.1). Also, (7.1.4.2) is clear if the polynomial / is either the coordinate z or one of the constants t G K, and if (7.1.4.2) holds when f = g and when f = h then it must also hold when f = g + h and when / = g • h. By the remarks following (7.1.3.10) this proves (7.1.4.2) for all polynomials /. Towards (7.1.4.3) observe that if a, 6 G A are arbitrary, then 6 G commA(a) <£=> a G commA 6 (7.1.4.6) The second inclusion of (7.1.4.3) now follows: by (7.1.4.2) and (7.1.4.6) 6 G commA(a) => /(a) G commA(6) => 6 G commA /(a) (7.1.4.7) Now also the first inclusion follows: using (7.1.4.7) 6 G commA (a) => a G commA (6) => a G commA /(6) => /(6) G commA(a) (7.1.4.8) Towards (7.1.4.4), the first inclusion follows from the second inclusion of (7.1.4.3) together with (7.1.2.2), while the second follows from (7.1.4.2), (7.1.2.6), and (7.1.2.2). ■ We conclude by noting, via Zorn's lemma, that every commutative subset of a ring A is contained in some maximal abelian subset, which will necessarily be a subring, and a closed subalgebra if A is a normed algebra. The reader may like to verify that if K C A is a nonempty commutative subset then commA(jFC) = I) {B D K: B is a maximal abelian subalgebra of A} (7.1.4.9) and comm^jK") = [j {B D K: B is a maximal abelian subalgebra of A} (7.1.4.10)
7.1 Commutants and Double Commutants 241 7.2 MAXIMAL IDEALS AND THE RADICAL We recall the "two-sided ideals" J C A of (1.10.1.5): more generally a left ideal J C A is a linear subspace J for which A-JCJ (7.2.0.1) while a right ideal J C A satisfies instead J -AC J (7.2.0.2) A proper ideal J is of course one for which J ^ A (7.2.0.3) For example, if a G A, the set Aa = #A(A) is a left ideal, proper iff a e aleft(A), while the set L'1^) = {b e A:ab = 0} is a left ideal, proper iff a G 7rleft(A). A "maximal ideal" will mean a maximal proper ideal: 7.2.1 DEFINITION A maximal left ideal of the ring A is a proper left ideal J C A for which there is implication, for arbitrary left ideals Jf C A, J C J' => J = J' or J' = A (7.2.1.1) Maximal right, and maximal two-sided ideals are defined in the same way. We write MLI{A) = {J C A: J is a maximal left ideal} (7.2.1.2) and MRI(A) = {J C A: J is a maximal right ideal} (7.2.1.3) An easy application of Zorn's lemma shows that there are lots of maximal left and maximal right ideals: 7.2.2 THEOREM If A is a ring and J C A is a proper left, right, or two- sided ideal, then there is a maximal left, right, or two-sided ideal MCA for which J C M. Proof: We recall our convention that a ring A always contains an identity 1. Then it is clear that an ideal J C A is proper if, and only if, 1 ^ J. Now we claim that, relative to set inclusion, the proper left ideals of A satisfy Zorn's condition (1.11.1.2) since the union UAeAJA of any totally ordered family («/A)AeA of proper left ideals is a proper left ideal. Thus Theorem 7.2.2 for left ideals follows from Theorem 1.11.3. The argument for right and for two-sided ideals is the same. ■
242 7. Operator Algebra and Commutivity If A-1 is open (in particular, if A is complete), then maximal ideals are always closed: for if M C A is a maximal left or maximal right ideal then there is 6 > 0 for which, if x G A is arbitrary, ||x|| < 6 => 1 + x G A"1 => 1 + x £ M (7.2.2.1) giving 1 £ cl(Af) DM=>MC cl(Af) ^ A => M = cl(Af) (7.2.2.2) We can use maximal ideals to test for left or right invertibility in A. 7.2.3 THEOREM If A is a ring, then <rleft(A) = (J MLI(A) and aright(A) = (J MRI(A) (7.2.3.1) Also .{aGA:l-AaC A"1} = {a G A: (1 - Aa) n aleft(A) =0} = f| MLI(A) (7.2.3.2) and {aGAil-aACA-^laG A: (1 - aA) n aright(A) = 0} = |°| MRI(A) (7.2.3.3) Hence also P| MLI(A) = P| MRI(A) is a two-sided ideal (7.2.3.4) Proof: If a G A is in aleft(A), then Aa is a proper left ideal and so by Theorem 7.2.2 there is M G MLI(A) for which a £ Aa C M. Conversely, if a G M G MLI(A) then l^MD Aa, and hence, a G aleft(A). This proves the first part of (7.2.3.1), and the argument for the second part is identical. Towards (7.2.3.2) suppose a g PlMLI(A), so that a g M for some maximal left ideal MCA; then M + Aa is a left ideal of A for which M g M + Aa. By (7.2.1.2) it follows that M + Aa = A, and hence there is 6 G A for which 1-6-aGM Caleft(A) (7.2.3.5) Thus, 1 — Aa is not disjoint from aleit(A). Conversely, if this is so, with 6 G A satisfying (7.2.3.5), then certainly a £ M, since otherwise 1 = (1 - 6a) + ba G M (7.2.3.6) Thus, a ^ flMLI(A). This proves the second equality in (7.2.3.2). Towards the first, it is clear that the first set in (7.2.3.2) is a subset of the second. We
7.2 Maximal Ideals and the Radical 243 have to reverse the inclusion. If, therefore, 1 — Aa is disjoint from aleft(A), then for each 6 £ A there is b' £ A for which 6'(1 -ba) = l (7.2.3.7) so that b' = 1 - ca with c = -b'b (7.2.3.8) and hence, by (7.2.3.7) there is also c' G A for which c'bf = c'(l - ca) = 1 (7.2.3.9) As is very familiar (Theorem 3.1.2), it follows that c' = 1 — ba = (6')"1 making 6' a two-sided inverse for 1 — ba G A-1. This finishes the proof of (7.2.3.2), and (7.2.3.3) is exactly the same. Now (7.2.3.2) and (7.2.3.3) together with (7.1.3.7) give the equality (7.2.3.4), and then force the common set to be both a left and right ideal. ■ We shall write n MLI(A) = n MRI(A) = Radical(A) (7.2.3.10) and refer to the elements a £ Radical(A) as the radical elements of A. From Theorem 7.2.3 we can extract four apparently different characterizations of a radical element. 7.2.4 THEOREM If H is a topological space, then Radical CTO(n) = {0} (7.2.4.1) If X is a normed linear space, then Radical BL(X, X) = {0} (7.2.4.2) Proof: If 0 ^ a G C^H), then there must be t G H for which a(t) ^ 0, and now 6 = l/a{t) GKC CTO(n) =J> (1 - 6a) {t) = 0 (7.2.4.3) which means by (3.12.5.1) that 1 — ba is not invertible, proving (7.2.4.1). If 0 ^ T e BL(X,X) then there must be x e X for which Tx ^ 0, and then by Theorem 5.4.1 there is / G X1" for which f{Tx) = 1. Now in the notation (2.9.2.4) 5 = /0x=^(J- ST){x) =0^x (7.2.4.4)
244 7. Operator Algebra and Commutivity which means by (3.2.7.2) that I-ST is not invertible, proving (7.2.4.2). ■ If a G Radical(A) then it is clear from (7.2.3.2) that A_1+aC A-1 (7.2.4.5) Conversely, the reader may like to verify that (7.2.4.5) implies a G Radical (A) provided A~l + A~l = A (7.2.4.6) Through the medium of the radical, we have a process of enlarging two-sided ideals to their "inessential hulls": 7.2.5 DEFINITION If A is a ring and J is a two-sided ideal of A then Hull(J) = {aeA:a + Je Radical(A/J)} (7.2.5.1) Evidently, Hull (J) is a two-sided ideal of A, with J C Hull(J) (7.2.5.2) and J C J' => Hull(J) C Hull(J') (7.2.5.3) For example Radical(A) = Hull({0» (7.2.5.4) From (7.2.3.2) and (7.2.3.3) it is clear that each of the following conditions is equivalent to having a G Hull (J): for each 6 G A there is c G A for which 1 - c(l - 6a) G J (7.2.5.5) for each 6 G A there is c G A for which 1 - (1 - ab)c G J (7.2.5.6) It turns out that J and Hull (J) give rise to exactly the same "essentially invertible" elements of A: 7.2.6 THEOREM If A is a ring and J C A is a two-sided ideal then, if ae A, a+Je ale{t{A/J) <^> a + Hull(J) G aleft(A/Hull(J)) (7.2.6.1) and a + Je aright(A/J) <^> a + Hull(J) G aright(A/Hull(J)) (7.2.6.2)
7.3 Regularity 245 Hence also a = Je {A/J) <^> a + Hull(J) G (A/ Hull(J))"1 (7.2.6.3) and Hull(Hull(J)) = Hull(J) (7.2.6.4) Proof: If a + J has a left inverse in A/ J then it is cbvious that a + Hull(J) has a left inverse in A/ Hull (J). Conversely, if a + Hull (J) has a left inverse in Aj Hull (J) then there is 6 G A for which 1-bae Hull(J) (7.2.6.5) and then applying (7.2.5.5) with 1 — 6a in place of a and 1 in place of 6 gives c G A for which 1 - c(6a) = 1 - c(l - (1 - 6a)) G J (7.2.6.6) But this means that cb + J is a left inverse for a + J in A/J. This proves (7.2.6.1), and similarly (7.2.6.2), and hence also (7.2.6.3). Finally, combining (7.2.6.3) with (7.2.3.2), there is implication a + Je Radical(A/J) <^=> a + Hull(J) G Radical(A/ Hull(J)) (7.2.6.7) giving (7.2.6.4). ■ If A = BL(X,X) and J = KL0(X, X) is the ideal of finite rank operators, then the inessential hull of J includes the compact operators: 7.2.7 THEOREM If X is a normed space, then KL^X,*) C HullKL0(X,X) (7.2.7.1) Proof: If T G KL^XjX) is compact and S G BL(X, X) is arbitrary, then ST G KLX(X,X) is also compact, and by Theorem 6.9.3 it follows that / - ST is Fredholm . By Atkinson's Theorem 6.4.3 it follows that I - ST has an invertible coset relative to the ideal of finite rank operators. ■ 7.3 REGULARITY The "regularity" of Definition 3.8.1. has an analogue in the style of the "algebraic invertibility" of Definition 3.10.1, in which the generalized inverse T' G BL(y, X), which satisfies (3.8.1.1), is also supposed to lie in M C BL(y,X), as in (3.10.1.1). We give the analogue here for an element of a ring.
246 7. Operator Algebra and Commutivity 7.3.1 DEFINITION If A is a ring, then a G A is said to be regular, or relatively Fredholm, iff a G aAa (7.3.1.1) and is said to be decomposably regular, or relatively Weyl, iff aeaA~la (7.3.1.2) We shall write r£= {ae A: a is regular} (7.3.1.3) Theorems 3.8.3 and 3.8.4 are valid for the regular elements of a ring. 7.3.2 THEOREM If A is a ring and a, a! G A, then a - aa'a G 'A1 <^> a e^ (7.3.2.1) and 1 - a'a G ^ -£=> 1 - ad G ^ (7.3.2.2) If also 6, 6' G A with a = aa'a and 6 = 66'6, then a& G ^ <^> a'a66' G 'A1 (7.3.2.3) If a = aa'a and if d G A satisfies 1 - a'd G A"1 and (1 - aaf)d{l - a;d)_1(l - a'a) G 'A1 (7.3.2.4) then a-de'A! (7.3.2.5) Proof: This is exactly the same as the proofs of Theorems 3.8.3 and 3.8.4. ■ If J C A is a two-sided ideal then, as for (3.9.6.9), JCrA,=>rA,+ JCrA1 (7.3.2.6) We can prove more. 7.3.3 THEOREM If A is a ring and if J C A is a two-sided ideal satisfying JC'A1 (7.3.3.1)
7.3 Regularity 247 then for each a G A there is implication a + Je {A/J)^ =^aerA! (7.3.3.2) Also A_1 + Hull(J) C'A1 (7.3.3.3) Proof: If a + J is regular in Aj J, then there is a' G A for which a - aa'a G J (7.3.3.4) so that (7.3.3.1) and (7.3.2.1) give a G "A1, proving (7.3.3.2). Towards (7.3.3.3) we claim that if a G A then a + Hull(J) £ aleft(A/ Hull(J)) =>aerA! (7.3.3.5) and a + Hull(J) £ aright(A/ Hull(J)) =^aerA! (7.3.3.6) For example, if a+Hull (J) has a left inverse in A/ Hull (J), then by (7.2.6.1) a + J has a left inverse in Aj J, and hence, by (7.3.3.2), a E^. Since every element of A-1 + Hull(J) has a two-sided inverse in A/Hull(J) it is clear that (7.3.3.3) follows from (7.3.3.5). ■ Theorem 7.3.3 and Theorem 7.2.7 confirm what we already knew in (6.9.3.3) the sum of an invertible operator and a compact operator is regular. The decomposably regular elements can be decomposed into products of invertible and idempotent elements: 7.3.4 THEOREM If A is a ring and A = {ae A: a2 = a} (7.3.4.1) is the set of idempotents of A, then {a G A: a is decomposably regular} = A~lA = AA~l (7.3.4.2) A left or right invertible decomposably regular element is invertible: [A-1 A) \ aleft(A) = {A-1 A) \ aright(A) = A"1 (7.3.4.3) If A is a Banach algebra, there is equality rA1 n cl(A_1) = A'1 A (7.3.4.4)
248 7. Operator Algebra and Commutivity Proof: Suppose a = aa'a with invertible a'\ then p = a'a G A and a = (a')~1p G A-1 A. Conversely, if a = cp with c G A-1 and p G A, then c~la = c~1ac~1a, and hence a = ac~1aaA~1a. This proves the first equlaity in (7.3.4.2), and similarly the second. Towards (7.3.4.3) suppose a = cp has a left inverse a', with invertible c and idempotent p. Then also p = p2 has a left inverse, forcing p = 1 and a = c G A-1. This proves about a quarter of (7.3.4.3). We leave the rest to the reader. Towards (7.3.4.4) if A is a normed algebra and a = cp G A~lA then \\a — an\\ —► 0 with an = c(p + (l/n)(l — p)) G A-1. Conversely, with no restriction on A, we can follow the argument for Theorem 3.8.7. If a = aa'a G 'A1 and if 6 G A satisfies 6 G A-1 and 1 + (6 - a)af G A-1 (7.3.4.5) then p = a'a and c = (l + (6 - a)a')~lb (7.3.4.6) gives a = cp (7.3.4.7) Evidently p is idempotent and c in invertible. If A is complete and a G cl(A_1) we can satisfy (7.3.4.5) by taking be A'1 and ||6 - a\\ \\a'\\ < 1 ■ (7.3.4.8) It is clear from the proof that (7.3.4.4) extends to those normed algebras A for which A-1 is open, as in Theorem 4.4.7. For a rather more special kind of regular element, only available in BL(X, Y) when Y = X, we make the following: 7.3.5 DEFINITION If A is a ring, then a G A is called simply polar iff a G aBa with B = commA(a) (7.3.5.1) For example, invertible elements and idempotents are simply polar, as are commuting products of invertibles and idempotents. The simply polar elements are decomposably regular. If a = aa'a with a'a = aa' = p, then a" = a' + (1 - p) => a = aa"a and (a")"1 = a + (1 - p) (7.3.5.2) When A = BL(X,X) for a normed space X then we can very nearly characterize the simply polar elements spatially:
7.3 Regularity 249 7.3.6 THEOREM KTG BL(X, X) for a normed space X is simply polar, then T is regular and T{X) = T2{X) and T'1^) = T~2{0) (7.3.6.1) Conversely, if (7.3.6.1) holds, then T is simply polar if either X is complete, or T is Fredholm, T is of finite rank. Proof: If T is simply polar then it is certainly regular, and if T' G comm(T) satisfies (3.8.1.1) then T{X) = TT'T{X) = T2T'{X) C T2{X) C T{X) (7.3.6.2) and r-^o) = (rr'r)-1^) = (r'r2)-1^) = r-^r'-^o) 2 r-*(o) d r-^o) (7.3.6.3) Conversely, if (7.3.6.2) and (7.3.6.3) hold, then T{X) + r_1(0) = X and T{X) n r_1(0) = {0} (7.3.6.4) and hence there is a linear projection P : X —> X for which P{X) = T{X) and P_1(0) = r_1(0) (7.3.6.5) Explicitly if x e X then Px = Ty where Tx = T2y and y G X (7.3.6.6) Further, by the argument for Theorem 3.8.2., it is clear that if T is proper and P is continuous then there is T' G BL(X,X) for which T'T = TT' = P (7.3.6.7) which makes the operator T simply polar. Now if the space X is complete then the projection P is continuous by Theorem 4.8.1, if T is Fredholm then I — P is continuous by Theorem 6.3.6, and if T is finite rank then P is continuous by Theorem 6.3.6. ■ To see that the projection P may fail to be continuous look at X = c00 and consider the operator T = : X2 —- X2 (7.3.6.1 I 0 0 I V
250 7. Operator Algebra and Commutivity where for each iGl (Wx)n = {l/n)xn for each n G N (7.3.6.9) Evidently W is one-one and onto, but not bounded below: the linear inverse W is unbounded. The operator T is regular in the sense of Definition 3.8.1, with (3.8.1.1) satisfied by the bounded operator T' = 0 -I' -I 0 Also, in the ring of all linear operators on X2 the conditions of Definition 7.3.5 are satisfied by the unbounded operator rpff _ W o o W (7.3.6.10) so that the conditions (7.3.6.2) and (7.3.6.3) both hold. The projection P of (7.3.6.6) is, however, not bounded. Specifically, P rprpll rpllrp i w' 0 0 (7.3.6.11) The product of regular operators need not be regular. For example, let X = lp with p e {l,2,oo} or X = c0, and recall the forward and backward shifts of (2.8.2.2) and (2.8.2.3). Define U and V on X by setting for each xeX and Evidently U(x1,x2,x3, ) — (0,x1,£2, ) V(X1,X25X3' • • V = lx2'X3'X45 • • V (7.3.6.12) (7.3.6.13) (7.3.6.14) VU = I ^ UV Thus, both U and V are regular and neither of them are invert ible. If also W{xl9x29x39...) = (xl5 \x2, \xz,...) (7.3.6.15) Then W is one-one and dense but not invertible, and is also compact but not of finite rank, and therefore by Theorem 6.8.5 not regular. Now if A = BL(X2,X2) then 0 0 W 0 eA=>a2=0erA!a.nda^rA! (7.3.6.16)
7.4 Quasinilpotent Elements 251 since a generalized inverse for a in A would give rise to a generalized inverse for W in BL(X,X). If instead t G K is sufficiently large then 0 0 U 0 tl W W tl e^ and a2^ (7.3.6.17) Indeed, 0 V 0 0 is a generalized inverse for "0 0" u o. , and 'tl W~ W tl_ IS invertible if |£| > \\W\\, which means that a is regular, while 9 a2 = "0 0" .u o. " 0 W W 0 . 'tl W W tl. — 0 0 UWU 0 tl W W tl (7.3.6.18) and since U is also Fredholm on X the operator UWU is again compact and not of finite rank, hence again not regular. 7.4 QUASINILPOTENT ELEMENTS A quasinilpotent is nearly as noninvertible as it can be: 7.4.1 DEFINITION If A is a normed algebra, then a G A is said to be quasinilpotent if \\nn\\lln 0 and is said to be nilpotent if (7.4.1.1) (7.4.1.2) 0e{an:ne N} Quasinilpotents are topological zero divisors: 7.4.2 THEOREM If A ^ {0} and a e A, then a quasinilpotent => a a left and a right topological zero divisor (7.4.2.1) and a nilpotent => a a left and a right zero divisor (7.4.2.2) Proof: If 0 ^ a G A is nilpotent then there is m G N for which am+1 = 0 ^ am (7.4.2.3) which means that Lau = Rau = 0^u with u = am (7.4.2.4)
252 7. Operator Algebra and Commutivity If a = 0 then we may take u = 1. This proves (7.4.2.2), and also gives the idea for the proof of (7.4.2.1). If 0 ^ a is quasinilpotent, then for each sufficiently small e > 0 there is m G N for which ||am||l/m > e > ||am+l||l/(m+l) (7.4.2.5) which means that ||Lat*|| = \\Rau\\ < e||u|| with u = am ■ (7.4.2.6) Sums and products of commuting quasinilpotents are quasinilpotent: 7.4.3 THEOREM IfcGA and if 6 G A is quasinilpotent and commutes with a, then ab is quasinilpotent (7.4.3.1) a quasinilpotent => a + b quasinilpotent (7.4.3.2) 1 — ab is almost invertible (7.4.3.3) a almost invertible => a — b almost invertible (7.4.3.4) Proof: Towards (7.4.3.1) suppose that 6 is actually nilpotent; then {ab)m = ambm =0 if bm = 0 (7.4.3.5) so that ab is also nilpotent. More generally ||(a6)m||1/m < ||a||||6m||1/m < e||a|| if ||6m||1/m < e (7.4.3.6) This gives (7.4.3.1). If a and 6 are both nilpotent then so is a + 6: 2m am = bm = 0 =► (a + bfm = Yi ( ) ay62m_y = bmY,(2m\a?bm-i+am ^ (2m)ay-162m-1=0 More generally, if 0 < e < min(||a||, ||6||) and if n > N => \\an\\ < en and ||6n|| < en (7.4.3.8)
7.4 Quasinilpotent Elements 253 then „ > 2JV =H|(a + 6)"|| < £ (") 11^11116^11 (N 2N-1 n \ , v £+ £ +£ (") 11^1111^11(7.4.3.9) y=o y=jv+i j-2NJ ^J ' < (e + e) max I , This proves (7.4.3.2). Towards (7.4.3.3) suppose a G A is nilpotent; then am+1 = 0 => (1 - a)(l + a + --- + am) = l = (l+a + --- + am)(l - a) (7.4.3.10) Thus a nilpotent => 1 - a G A"1 (7.4.3.11) More generally limsup||an||1/n < 1=> ||l-(l+a + --- + an)(l-a)|| —>0 (7.4.3.12) n In particular a quasinilpotent => 1 — a almost invertible (7.4.3.13) Now (7.4.3.3) follows from (7.4.3.13) together with (7.4.3.1). Towards (7.4.3.4), if a is actually invertible then a + 6 = a(l + a-16) is almost invertible by (7.4.3.3). More generally, suppose that a is almost left invertible, with || 1 - a'na\\ —► 0 and sup ||a'J| < oo (7.4.3.14) n Then for each m,n G N / m \ m !-<( ! + DO'*')(«"*) = l-<a+^(a'J^(l-a'na)^+(a'J^16^1 (7.4.3.15) Now if k = supn ||aJJ| then oo 1 + ]T A;r||6r|| = k' < oo (7.4.3.16) r=l
254 7. Operator Algebra and Commutivity Then if 6 > 0 and e > 0 are arbitrary there is N G N for which n > N => ||1 - a'na\\ < 6 and ||6n|| < en (7.4.3.17) Now mn,> N =► ||1 - a'n f 1 + X^K)r6r ) (a - 6)|| < A;'<5 = jfcm+1em+1 (7.4.3.18) Since <5 and e are entirely arbitrary it is clear that a — b is almost left invertible. This proves the analogue of (7.4.3.4) for almost left invertibility. The argument for almost right invertibility is identical, and hence (7.4.3.4) follows. ■ When the algebra A is complete, then almost invertibility implies invertibility, and hence, if the algebra is also commutative then the quasinil- potents all lie in the radical: 7.4.4 THEOREM If A is a normed algebra, then the quasinilpotents lie in the closure of the almost invertible elements, and hence, if 0 ^ A is complete, then a quasinilpotent => a G d(A-1) (7.4.4.1) If also A is commutative, then a quasinilpotent => a G Radical(A) (7.4.4.2) Proof: If a G A is quasinilpotent and O^iGK, then t — a = t(l — (l/t)a) is almost invertible by (7.4.3.3), and by allowing t to be arbitrarily close to 0 we find a quasinilpotent => a G cl {6 G A: b almost invertible} (7.4.4.3) By (4.4.5.12) and (4.4.5.13) it follows that the almost invertible elements of A are in A-1 when A is complete, giving (7.4.4.1). Finally, if A is also commutative, then (7.4.3.1), (7.4.3.13), and (7.2.3.2) give (7.4.4.2). ■ We might remark that (7.4.3.12) gives an improvement on (4.4.5.15) for complete algebras A. The reader may like to verify that the following condition is necessary and sufficient for a G A to be quasinilpotent: ||(*a)n||—>0 for each teK (7.4.4.4)
7.4 Quasinilpotent Elements 255 It then follows that if a G A is quasinilpotent, and m £Y\ and 61,62,..., 6m,c1,...,cmGA are arbitrary, then also \\bm • • • &£6?ancnc?c£ • • • Cll —► ° ™ n —> °° (7.4.4.5) At the other extreme from quasinilpotents and radical elements, we shall call an element a G A conservative iff there is k > 0 for which IKH > A^HI* for each n G N (7.4.4.6) We conclude here with an introduction to Lomonosov's lemma: 7.4.5 THEOREM If X is a normed space and K G BL(X, X) is compact, quasinilpotent and nonzero, then it has a nontrivial hyperinvariant closed subspace: there is a closed subspace W C X with {0} ^ W ^ X for which, ifTGBL(X,X), TK = KT => T{W) C Py (7.4.5.1) Proof: We shall show that if if is compact with ||jK"|| = 1 and has no nontrivial hyperinvariant closed subspaces then it cannot be quasinilpotent. Choosing x0 G X for which ||ifx0|| > 1, so that also ||x0|| > 1, write U0 = Disc(x0; 1) and V0 = cl K{U0) (7.4.5.2) Evidently U0 and V0 are both closed, neither of them contain 0 G X, and V0 is compact in the full sense of (6.6.0.1). Suppose now 0 ^ x G X: then Wx = cl {Tx: T G comm(K)} = X (7.4.5.3) since evidently Wx ^ {0} satisfies (7.4.5.1). In particular x0 G Wz if 0 ^ x G X, which means I\{0}CU {r-^int^o)):T G comm(K)} (7.4.5.4) The right-hand side of (7.4.5.4) is therefore an open cover for the compact set V0. By (6.6.0.1) there is a finite set of operators H C comm(jFC) for which VoCUlT-^intiUjy.TeH} (7.4.5.5) Inductively we may pick a sequence (Tn) in BL(X, X) for which, for each neN, TneH and TnK • • • T2KTxKx^ G U0 (7.4.5.6) Indeed there is I\ G # for which T^Xq G C/0, giving KT-^Kxq G V0, and
256 7. Operator Algebra and Commutivity then T2eH for which ^(KT^Xq) G U0, and so on. If we write m= inf llxll and M = max||r|| (7.4.5.7) then, remembering that H C comm(jFC), we have m < \\TnK... T^T.KxoW = \\KnTn ... T2Txx0\\ < ||Kn|||Mn|||x0|| for each n G N (7.4.5.8) which means that ||#n||1/n > ^(iM)1^ —^ m as n —+ oo (7.4.5.9) Since m > 0, K cannot be quasinilpotent. ■ 7.5 POLAR AND QUASIPOLAR ELEMENTS Begin by recalling the idempotents p = p2 G A of a normed linear algebra as in (7.3.4.1): if p G A, then pAp = {pap: a G A} = {a G A:a = ap = pa} (7.5.0.1) is a normed linear algebra in its own right, and also a closed subspace of A, but (unless p = 1) not a subalgebra in the sense of (1.10.1.4), since the identity of pAp is p and not 1. The commutant of an idempotent is the direct sum of two algebras: commA(p) = pAp + (l — p)A(l — p) and (7.5.0.2) pAp fl (1 - p)A(l - p) = {0} When a G A commutes with p G A we shall say that it is reduced by p. For example, if P = P2 G BL(X,X) then Theorem 2.5.3 tells us that comm(P) = {T G BL(X,X):T{PX) C PX and r(P_10) C P^O} (7.5.0.3) If p = p2 E A then we shall say that a G A is invertible relative to p iff pap G (pAp)-1 (7.5.0.4) with similar conventions for left and right invertibility, and for almost invertibility. When a is reduced by p then invertibility in A can be tested in pAp and (l — p)A(l — p):
7.5 Polar and Quasipolar Elements 257 7.5.1 THEOREM If a G A commutes with p G A, then a e A"1 <t=> pap G [pAp)~l and (l - p)a(l - p) G ((1 - p)A(l - p))"1 (7.5.1.1) Proof: Suppose ba = 1 with 6 G A: then {pbp)(pap) = pbap = p (7.5.1.2) so that pap has a left inverse in pAp, and similarly (l — p)a(l — p) in (1 — p)A(l — p). Conversely, if b'ap = p and 6"a(l - p) = (l - p) (7.5.1.3) then 1 = p + (1 - p) = (b'p + 6"(1 - p))a (7.5.1.4) so that a has a left inverse in A. This proves the analogue of (7.5.1.1) for left inverse. The argument for right inverse in identical, and together they give the argument for two-sided inverses. ■ When a commutes with p then the condition (7.5.0.4) is equivalent to p G (Aa) n (aA) (7.5.1.5) A "polar" element of a ring or algebra is an element a G A for which one can find an idempotent p = p2 G commA (a) such that a is invertible relative to p and nilpotent relative to 1 — p: 7.5.2 DEFINITION If A is a normed algebra then a G A is called almost quasipolar if there is p G A for which p = p2 and ap = pa (7.5.2.1) with p G cl(Aa) n cl(aA) (7.5.2.2) and ||an(l-p)||1/n—>0 asn—> oo (7.5.2.3) If in addition p G (Aa) n (aA) (7.5.2.4) then a is said to be quasipolar. If also 0e{an(l-p):ne N} (7.5.2.5) then a is said to be polar.
258 7. Operator Algebra and Commutivity For example T G BL(X,X) is quasipolar iff there is commuting P = P2 G BL(X,X) for which T has an invert ible restriction to P(X) and a quasinilpotent restriction to P_1(0); also <f> G C^Q) is quasipolar iff the set <£_1(0) is open as well as closed, with inf^tw0 \<f>(t)\ > 0. In general, in- vertible, idempotent, and quasinilpotent elements are all quasipolar, while an almost invertible element is almost quasipolar. The simply polar elements of Definition 7.3.5 are just the polar elements for which (7.5.2.5) can be improved to a = ap (7.5.2.6) More generally, the reader can verify that a polar <£=> an simply polar for some n G N (7.5.2.7) If a G A is quasipolar then the idempotent p of Definition 7.5.2 is unique, and lies in the double commutant of a: 7.5.3 THEOREM If A is a normed algebra and a G A is quasipolar, then the idempotent p = a of Definition 7.5.2 is unique and lies in comm^(a), and there is 6 G A with ab = ba = p and b = bp = pb (7.5.3.1) If (7.5.3.1) holds then b = ax is unique and lies in comm^(c) Proof: We begin by verifying (7.5.3.1). From (7.5.2.4) there are u,v G A with p = ua = av, giving p = {pup) (pap) = [pap)(pvp), and hence, pup = pvp = b say. Now if also q = q2 satisfies the conditions (7.5.2.1), (7.5.2.4), and (7.5.2.3), with ac = ca = q and cq = qc = c, then we have p - pq = pn(l -q)= bnan{l - q) —>0 asn —>oo (7.5.3.2) and q-pq = (l- p)qn = (l - p)ancn —> 0 as n —> oo (7.5.3.3) Thus, p — pq = q is unique. Also if w G commA(a) then wp — pwp = (1 — p)wp = (1 — p)wpn = (1 — p)wanbn (7.5.3.4) = (1 — p)anwbn —> 0 as n —► oo giving wp = pwp and similarly pwp = pw. The uniqueness of the element b = ax satisfying (7.5.3.1) is just the uniqueness of inverse (3.1.2.2) for
7.5 Polar and Quasipolar Elements 259 {pap) x G pAp. Finally, if w G commA(o) then, using the fact that pw = wp, wb = wpb = bawp = bwap = bwp = bwp = bpw = bw ■ (7.5.3.5) If a G A is almost quasipolar we shall call p = a the support of a, and if a G A is polar then the element b = ax is known as the Drazin inverse of a. The perturbation theory for polar and quasipolar elements is suggested by the perturbation theory for quasinilpotents: 7.5.4 THEOREM If a G A is quasipolar and b G A commutes with a, then b quasipolar =>► ab quasipolar (7.5.4.1) b quasinilpotent =>► a + b almost quasipolar (7.5.4.2) b and 1 + axb almost invertible => a + b almost invertible (7.5.4.3) If, in particular, a is polar and b commutes with a, then b polar => ab polar (7.5.4.4) b nilpotent =>► a + b polar (7.5.4.5) b and 1 + axb invertible =>• a + b invertible (7.5.4.6) Proof: If a and b are quasipolar and commute then by the doubly commuting components of Theorem 7.5.3 the set I a,6,a,6,ax,6x > is commutative (7.5.4.7) In particular a and b commute, and hence ab G A is idempotent (7.5.4.8) To prove (7.5.4.1) we claim that ab is the support for ab. The condition (7.5.2.1) follows from (7.5.4.7) and (7.5.4.8), then condition (7.5.2.4) follows from (6X ax)ab = ab = ab{bx ax) (7.5.4.9) and finally the condition (7.5.2.3) comes from the observation that ab{l - ab) = [aa)(6(1 - b)) + [bb) (a(l - a)) + (a(l - a)) (6(1 - 6)) (7.5.4.10) is by (7.4.3.1) the sum of three commuting quasinilpotents, and therefore quasinilpotent by (7.4.3.2). If, in particular, a, 6 are polar then a6(l — ab) is
260 7. Operator Algebra and Commutivity nilpotent by (7.4.3.5) and (7.4.37), giving (7.5.4.4). To prove (7.5.4.2) note that (a + b)a is almost invertible in aAa by (7.4.3.4), and that (a + 6)(l —d) is quasinilpotent by (7.4.3.2). If, in particular, a is polar and b is nilpotent then (a-\-b)a is invertible in aAa by (7.4.3.11) and (a+b)(l — a) is nilpotent by (7.4.3.7), giving (7.5.4.5). To prove (7.5.4.3) we have (1 + axb)a = ax(a + b)a (7.5.4.11) so that (a + b)a is almost invertible in aAa by (3.7.3.1) or (3.7.3.2), while (a+ 6)(1 — a) is almost invertible in (1 — a)A(l — a) by (7.4.3.4), and hence a + b is almost invertible in A by the analogue of (7.5.1.1). If in particular a, b and 1 + axb are invertible then (7.1.3.6), (7.4.3.11), and (7.5.1.1) give (7.5.4.6). ■ In general we cannot expect implication: ab = ba polar => a almost quasipolar (7.5.4.12) since the left-hand side holds with b = 0 for arbitrary a G A. When A is complete then the almost quasipolar elements are quasipolar, which offers some simplification in Theorem 7.5.4: 7.5.5 THEOREM If A is a Banach algebra and a G A, then a almost quasipolar => a quasipolar (7.5.5.1) If a G A is quasipolar and b G A commutes with a then b quasinilpotent =>► a + b quasipolar (7.5.5.2) and a quasipolar b and 1 + axb invertible => a + b invertible (7.5.5.3) Proof: By (4.4.5.12) and (4.4.5.13) the almost invertible elements of the algebra pAp are invertible in pAp whenever p G A is an idempotent in A, so that the conditions (7.5.2.2) and (7.5.2.4) are equivalent when ap = pa. This gives (7.5.5.1), which together with (7.5.4.2) gives (7.5.5.2). Finally, (7.5.5.3) is clear from (7.5.4.3) together with (4.4.5.12) and (4.4.5.13). ■ The second part of the condition on the left-hand side of (7.5.4.3) is satisfied if II«X|HHI<1 (7-5.5.4)
7.6 Homomorphisms and Fredholm Theory 261 in particular by taking 0 ^ b G K we can improve (7.4.4.3) to a quasipolar => a G cl {b G A: b almost invertible} (7.5.5.5) so that if A is complete then a quasipolar => a G cl(A_1) (7.5.5.6) When a G A is quasipolar then its Drazin inverse is regular, since a is itself a generalized inverse for a: ax =ax.a-ax =^ ax G T (7.5.5.7) As we saw in (7.3.6.16) however it is possible for a G A to be polar but not regular. 7.6 HOMOMORPHISMS AND FREDHOLM THEORY If T G HBL(A,B) is a homomorphism in the sense of (2.10.1.7), so that T(l) = 1 and T(ab) = T(a)T(b), then there is inclusion T(A_1) C B'1 (7.6.0.1) By analogy with Atkinson's Theorem 6.4.3, we may regard the elements of T-1(B~1) as another kind of "Fredholm" element: 7.6.1 DEFINITION If T G HBL(A,5) is a homomorphism of normed algebras, then a G A will be called T-Fredholm iff T{a) G B~l (7.6.1.1) and almost T-Fredholm iff T(a) is almost invertible in B. Iff aeA'1 +T~1{0) = {c + d:ceA-\T{d) = 0} (7.6.1.2) then we shall call a T- Weyl . If T G BL(A, B) is a homomorphism then r_1(0) is a closed two-sided ideal of A (7.6.1.3) proper iff B ^ {0}. Conversely, if J is a closed two-sided ideal then the quotient JA : A —> A/J (7.6.1.4) is a homomorphism. Evidently the "Weyl" elements are unchanged if the homomorphism T is replaced by the quotient JA with J = T_1(0), although
262 7. Operator Algebra and Commutivity the T-Fredholm elements may be more general than the JA-Fredholm elements: this is familiar when, for example, the homomorphism T is one-one, or isometric. If homomorphism T is induced by an ideal J which satisfies the regularity condition (7.3.3.1) then much of the "spatial" Fredholm theory carries over: 7.6.2 THEOREM If J C ^ is a regular two-sided ideal of a normed algebra A, then a G A is JA-Fredholm iff aG^ and L'1^) C J and R'1^) Q J (7.6.2.1) Whether or not the ideal J is regular, if instead 1 + JCA-1! (7.6.2.2) then a G A is JA-Weyl iff JA{a) G {A/J)'1 and a G A'1 A (7.6.2.3) Proof: The conditions (7.6.2.1) are as in the characterization Theorem 6.4.2, and to prove the first part of Theorem 7.6.2 we have only to repeat the argument for Atkinson's Theorem 6.4.3, using (7.3.3.2). Towards (7.6.2.3) suppose a = cp is Fredholm with invertible c G A-1 and idempotent pGi: then p = p2 is Fredholm, so 1-peJ (7.6.2.4) and hence a = cp G c(l + J) C A~l + J. Conversely, if (7.6.2.2) holds, then A-l + J<ZA~lA ■ (7.6.2.5) Theorem 7.6.2 characterizes the Fredholm and the Weyl elements when A = BL{X,X) and J = KL0(X,X) (7.6.2.6) As we have said, we proved (7.6.2.1) in Atkinson's Theorem 6.4.3, while the condition (7.6.2.2) was part of Theorem 6.5.2. The same characterizations are valid when A = BL{X,X) and J = KL1{X,X) (7.6.2.7) using the fact (7.2.7.1) that the compact operators lie in the essential hull of the finite rank operators. Indeed, Theorem 7.3.3 says that (7.6.2.1) is valid whenever J C Hull(J0) and J0 C ^ (7.6.2.8)
7.6 Homomorphisms and Fredholm Theory 263 which applies if J = KL1(X,X), while the condition (7.6.2.2) follows from Theorem 6.10.4, which says that everything in 1 + J has index zero, and Theorem 6.5.6, which tells us that Fredholm operators of index zero are Weyl, and therefore decomposably regular as in Theorem 6.5.2. If T = Jv :A-+A/J with J C 'A1, then we can weaken the conditions for (7.3.2.5). If a = aa'a G *A! and if d G A satisfies either T{l-afd)e{A/J)~1 and (7.6.2.9) T((l - aa')d){T{l - a'd))-lT{l - a'a) G [A/Jf T{l-daf) e{A/J)-1 and (7.6.2.10) T(l - a'a){T{l - da!))-lT(d(l - a'a)) G (A/Jp then (7.3.2.5) says that T(a - d) G (-A/J)r~1 and now (7.3.3.2) says that a — d G 'A1. The analogue of Theorem 6.4.4 extends to T-Fredholm elements: 7.6.3 THEOREM If T G HBL(A, B) is a homomorphism of normed algebras and a,b G A then a, b T-Fredholm ^=> ba, ab T-Fredholm (7.6.3.1) and ab T-Fredholm =>► (a T-Fredholm <=>> b T-Fredholm) (7.6.3.2) and 1 - ab T-Fredholm => 1 - ba T-Fredholm (7.6.3.3) If in particular a = aa'a G *A! then a T-Fredholm => a' T-Fredholm (7.6.3.4) and a T-Fredholm,l + a'b T-Fredholm =>► a + b T-Fredholm (7.6.3.5)
264 7. Operator Algebra and Commutivity Proof: The first three implications follow from (7.1.3.6), (7.1.3.8), and (7.1.3.7) applied to the elements T(a), T{b) in B. For (7.6.3.4) observe that T{a) = T{a)T{a!)T{a) G B~l =>► T{a') G B~l (7.6.3.6) Finally, for (7.6.3.5) argue T{a')T{a + b) = T(l + a'b) G B~l =^{T{a!)eB-1 =^T{a + b)eB~l) m (7.6.3.7) The analogue of Theorem 6.5.3 extends to T-Weyl elements: 7.6.4 THEOREM If T G HBL(A, B) is a homomorphism of normed algebras and a, b G A, then a, b T-Weyl =>► ab T-Weyl =>► (a T-Weyl <^> b T-Weyl) (7.6.4.1) If, in particular, a = aa'a G 'A1, then a T-Weyl =>► a' T-Weyl (7.6.4.2) and a T-Weyl, 1 + a'b T-Weyl =>► a + b T-Weyl (7.6.4.3) and a T-Weyl, 1 + bo! T-Weyl =^ a + 6 T-Weyl (7.6.4.4) Proof: If a = c + tt and 6 = d + v with invertible c,d and u,v in T_1(0) then a6 = cd + (ci; + ud + uv) G A"1 + T_1(0) (7.6.4.5) giving the first implication of (7.6.4.1). If instead a = c + u and ab = d + v, then b = c~ld + c_1(t; - ub) G A'1 + T_1(0) (7.6.4.6) giving half of the second implication of (7.6.4.1), and the other half is similar. If a = aa'a = c + u with c G A~l and T(u) = 0 then a! = c~l + c~\d - da'c - ca'd - da'd)c~l G A'1 + T"1^) (7.6.4.7) giving (7.6.4.2). Finally, for (7.6.4.3) we have a'{a + b) = {l + a'b)-{l-a'a) G (A"1 -hT"1^)) -hT"1^) = A^+T'1^) (7.6.4.8)
7.7 Browder Operators 265 so that a + b is T-Weyl by (7.6.4.2) and the second part of (7.6.4.1). The argument for (7.6.4.4) is the same. ■ There can be no analogue of the reverse implication in (7.6.3.1) for Weyl operators. For example if X = lp and A = BL(X2,X2) with J0 = KL0(X2,X2) and Jx = KL^X2,*2), and if U and V are the shifts of (7.3.6.12) and (7.3.6.13), then a = U + I 0 0 V -I (a+l)(a-l) e A~l + J0 and (7.6.4.9) a + 1 £ A'1 + JX a-lg A'1 + Jx One way to see this is to compute the index: since index(a + 1) = index(J7 + 21) + index(Vr) =0 + 1 = 1 (7.6.4.10) and index(a - 1) = index(J7) + index(F - 21) = -1 + 0 = -1 (7.6.4.11) Neither a + 1 nor a — 1 can be Weyl operators, while Theorem 6.5.4 gives index((a + l)(a - 1)) = index(a + 1) + index(a - 1) = 0 (7.6.4.12) Alternatively, (7.3.6.14) says that U has a left inverse and V has a right inverse, while neither of them is actually invertible. It follows that the same is true of a—1 and a+1, respectively. Thus, by (3.8.6.8) and (3.8.6.9) neither of them can be a Weyl operator. On the other hand, if {U + 2I)U [U + 2J)(J - UV){V - 21) 1 0 V{V-2I) \ V(U + 2I)~l 0 1 {V - 2I)~1{I - UV){U + 2/)"1 (V - 2I)~lU\ c = (7.6.4.13) then the reader should verify that c'c = 1 = cc' and (a + l) (a - 1) - c G J0 (7.6.4.14) 7.7 BROWDER OPERATORS The Browder operators are even more special than the Weyl operators:
266 7. Operator Algebra and Commutivity 7.7.1 DEFINITION ' If X is a normed space then T G BL(X,X) is said to be Browder, or spatially Browder, iff T is Fredholm and polar (7.7.1.1) By Theorem 7.3.6, a Fredholm operator T is polar if and only if there is m G N for which r"m(0) = r"2m(0) and Tm{X) = T2m{X) (7.7.1.2) or equivalently r"m(0) = T'171-1^) and Tm{X) = Tm+1{X) (7.7.1.3) Using the Riesz lemma, Theorem 1.5.2, T = I + K is Browder if K is compact. 7.7.2 THEOREM If X is a normed space and K G BL(X, X), then K G KLX (X, X) => I + K Browder (7.7.2.1) Proof: If the first part of (7.7.1.3) fails for T = / + K then we claim that K cannot be totally bounded: for if {0} £ T-^O) £ r~2(0) £ • • • (7.7.2.2) then by Theorem 1.5.2 there is a sequence x = (xn) in X for which, for each n G N, ||xn|| = 1 and xn G r-n_1(0) and dist(xn,r_n(0)) > \ (7.7.2.3) Since T = I + K, it follows that m?n=>\\Kxm-Kxn\\>± (7.7.2.4) which by (6.8.3.1) means that K cannot be totally bounded, and hence cannot be compact. If instead the second part of (7.7.1.3) fails, and if in addition Tm(X) is closed for each m €N, then again T = I + K means K is not totally bounded. For if X ^ T{X) ^ T2 (X) ^ • • • (7.7.2.5) then by Theorem 1.5.2 there is a sequence x = (xn) in X for which, for
7.7 Browder Operators 267 each n G N, ||xj| = 1 and xn G Tn(X) and dist(xn,T"+1(X)) > \ (7.7.2.6) With T = I + K we again get (7.7.2.4). It is now impossible for K to be compact, since on the one hand the compact operators are totally bounded, while on the other hand, if K is compact then by (6.9.3.3) T = I + K is Fredholm and hence Tm(X) is closed for each m EN. m To derive (7.7.2.1) for an operator K G KL0(X, X) of finite rank we can alternatively return to the proof of (6.5.2.7), with the decomposition (6.5.2.12) of the space X and the operator U : Xf2 —*■ Xf2 induced by I + K on the finite dimensional space X'2. It is clear at once that the operator U must be polar, and the reader can verify that, for each m EN, {I + K)-m{0) = (J + K)"m_1(0) <^=> U~m{0) = U-™-1^) (7.7.2.7) and [I + K)m{X) = (7 + K)m+1(X) ^ Um{Xf2) = Um+1{Xf2) (7.7.2.8) To see the relationship between Weyl and Browder operators, and the perturbation theory for Browder operators, it is convenient to look again at the Fredholm theory associated with a homomorphism of normed algebras: 7.7.3 DEFINITION If T G HBL(A,5) is a homomorphism of normed algebras then a G A is called T-Browder if oGA'^r^O) = {c + d: c e A'1 ,d e T'1 {0),cd = dc} (7.7.3.1) Here we are writing H<±.K = {x + v: x G H, y G K, xy = yx} (7.7.3.2) for a sort of "commuting sum" of the subsets H and K in a ring A. Evidently if a G A there is implication a invertible => a T-Browder => a T-Weyl => a T-Fredholm (7.7.3.3) Conversely, a T-Fredholm element which is polar must also be T-Browder: 7.7.4 THEOREM If T : A -> B is a homomorphism of normed algebras and a G A, then a T-Fredholm and polar => a T-Browder (7.7.4.1)
268 7. Operator Algebra and Commutivity Conversely, provided the homomorphism T satisfies the "Riesz condition," l + r_1(0) C {a G A: a polar} (7.7.4.2) there is implication a T-Browder => a T-Fredholm and polar (7.7.4.3) Proof: If a G A is polar then also T(a) G5 is polar, with support T(a) G B, and if, in particular, T(a) G B~l then by the uniqueness component of Theorem 7.5.3 it follows T(p) = T(l), and hence, 1 - p G r_1(0). Now if ax is the Drazin inverse of a and we write c = ad+(l-d) c' = axd+(l-d) d=(a-l)(l-d) (7.7.4.4) we have a = c + d cd = dc c'c=l = cc' T(d) = 0 (7.7.4.5) which means that a is T-Browder. Conversely, if a is T-Browder, with c,d satisfying (7.7.4.5), then by the Riesz condition (7.7.4.2) the element 1 + c~ld is polar, and since cd = dc it also commutes with 1 + c_1d, so that a = c(l + c~1d) is polar by (7.5.4.4). ■ It is clear that the Riesz condition (7.7.4.2) is necessary for the implication (7.7.4.3). The Riesz condition plays a vital role in the perturbation theory of T-Browder elements. We begin with an auxiliary result: 7.7.5 THEOREM If T G BL(A, B) is a homomorphism of normed algebras and a, 6 G A commute with one another, then ab T-Fredholm polar => a T-Browder, b T-Browder (7.7.5.1) a T-Fredholm polar, T(b) = 0 => a + b T-Browder (7.7.5.2) a, 1 + ax b T-Fredholm polar => a + b T-Browder (7.7.5.3) Proof: For (7.7.5.1) we repeat the argument of (7.7.4.1). If ab = ba is T-Fredholm and polar, with support (ab)' = p and Drazin inverse (ab)x, then again 1 — p G T_1(0), and is we take c = ab+{l-p) c' =(ab)xbp+{l-p) d={a-l){l-p) (7.7.5.4) then the conditions (7.7.4.5) are satisfied, so that a is T-Browder. The argument for b is the same: the reader is invited to write it down. For
7.7 Browder Operators 269 (7.7.5.2) we can write a + b= c + d with c = aa+(l-a) and d= (a-l)(l-a) + b (7.7.5.5) and observe that c is invertible as in (7.7.4.4), that T(d) = 0, and finally that since a is in the double commutant of a we have cd = dc. For (7.7.5.3) we have a(l + axb)a = (a + b)a (7.7.5.6) and hence also a(l + axb)p = (a + 6)p where p = d(l + ax6)', giving a+6 = a(l + axb)p+{a+b)(l-p) = a(l+axb) + (-a(l+axb) + {a+b))(l-p) (7.7.5.7) which is T-Browder by (7.7.5.2). ■ When T_1(0) has the Riesz property, Theorem 7.7.5 translates into the perturbation theory for T-Browder elements: 7.7.6 THEOREM If the homomorphism T G BL{A,B) has the Riesz property (7.7.4.2) and if a, b G A commute with one another, then ab T-Browder «=* a, b T-Browder (7.7.6.1) a T-Browder, T(b) =0=>a + b T-Browder (7.7.6.2) a, 1 + ax b T-Browder => a + b T-Browder (7.7.6.3) a T-Browder, 6,1 + axb invertible => a + b invertible (7.7.6.4) Proof: If a and 6 are T-Browder, then by (7.7.4.3) they are T-Fredholm polar, so that ab is T-Fredholm polar by (7.5.4.3) and (7.6.3.1), hence T- Browder by (7.7.4.1). Conversely, if ab is T-Fredholm polar by (7.7.4.3), so that a and b are T-Browder by (7.7.5.1). This proves (7.7.6.1). Each of (7.7.6.2) and (7.7.6.3) follows from (7.7.4.3) together with (7.7.5.2) or (7.7.5.3). Finally (7.7.6.4) is just (7.5.4.6) with again (7.7.4.3). ■ Forward implication in (7.7.6.1) is the more significant in view of the failure of the analogous result for general polar elements (7.5.4.8), and also of the analogous result for Weyl operators (7.6.4.9). By Theorem 7.7.2 each of the ideals KL0(X,X) and KL^XjX) has the Riesz property (7.7.2.3), and hence the spatially Browder operators of Definition 7.7.1 are the JA-Browder operators of Definition 7.7.3 for the homomorphism JA : BL(X,X) -► BL(X,X)/J with J = KL0(X,X) and also with J = KLX (X, X). It follows that the spatially Browder operators satisfy Theorem 7.7.6.
270 7. Operator Algebra and Commutivity If the algebra A is complete, then Theorem 7.7.4 and Theorem 7.7.5 have analogues in which "polar" is replaced throughout by "quasipolar". The reader is invited to check the details. Whether or not the normed space X is complete however, it follows from (6.8.5.1) that if T G BL(X, X) then T Fredholm and quasipolar => T polar (7.7.6.5) Indeed if P is the support projection of T then I — P is compact and therefore of finite rank, and hence also T(I — P) is of finite rank. Now a quasinilpotent operator of finite rank is necessarily nilpotent. To see that there actually exist Weyl operators which are not Browder, recall the operator (a + l)(a - 1) G A = BL(X2,X2) of (7.6.4.9): we saw there that this is a Weyl operator and that neither a + 1 nor a — 1 is Weyl. It follows that neither a+ 1 nor a — 1 can be Browder operators, and hence by (7.7.6.1) that (a + l)(a — 1) cannot be a Browder operator. For another example claim a = Here again U and V are the shifts of (7.3.6.12) and (7.3.6.13). Indeed if o uv-n o o J (7.7.6.7) then a = c + d c'c = l = ccf deKL0{X2,X2) (7.7.6.8) so that a is a Weyl operator, while if x = 6n+1 = (0,0,... ,0,1,0,...) G X = lp, then Vnx ^ 0 = Vn+1x, and hence (7.7.6.9) Thus, the first part of (7.7.1.3) fails with T = a, which therefore cannot be a Browder operator. The condition that a and b commute cannot be dropped from (7.7.6.1), for if T e BL(X,X) is Weyl and not Browder, then by (7.6.2.3) T = SP = QS with invertible S and Fredholm idempotents P, Q (7.7.6.10) so that S and P are Browder but SP is not Browder (7.7.6.11) =^ a Weyl but not Browder (7.7.6.6) U I-UV O V V 0" I-UV U d = u _0 0 vm 71 0 X ± 0 .0. — u .0 0 vm rt-r i
7.8 Ascent and Descent 271 and also S~1T and TS'1 are Browder but T is not Browder (7.7.6.12) 7.8 ASCENT AND DESCENT If T : X —► X is linear then the range and the null space of T are just the leading terms of two monotonic sequences of subspaces. {o} c r_1o c r~2o c • • • c r~no c r~n_1o c • • • (7.8.0.1) and X D T{X) 2T2{X)D'"2 Tn{X) D Tn+1{X) D • • • (7.8.0.2) 7.8.1 DEFINITION If T : X -> X is a linear operator then the hyper- kernel of T is oo r-°°(o) = P| r-n(o) (7.8.1.1) n=l and the hyperrange of T is the subspace oo r°°(X) = P| Tn{X) (7.8.1.2) n=l We shall say that T has ascent < k iff r-°°(0) =T~k{0) (7.8.1.3) and that T has descent < k iff r°°(X) =T/C(X) (7.8.1.4) If k exists for which (7.8.1.3) holds, then T is said to be "of finite ascent," in which case "the ascent of T" will be the smallest such value of A;: similarly for "the descent of T." 7.8.2 THEOREM If T : X -► X is linear and k G N and k < kf G N, then ascent(r) < k ^ r~*(o) = r~*#(o) <^ r~*(o) = r-°°(o) (7.8.2.1) and descent(r) < k ^ Tk{X) = Tk'[X) ^ Tk{X) = T°°{X) (7.8.2.2)
272 7. Operator Algebra and Commutivity If also 7i G N is arbitrary, then Tk{X) n r_1(0) = {0} => ascent(r) < k => Tk{X) n r"n(0) = {0} (7.8.2.3) and T'k0+T{X) =X=> descent(r) < k => T-k{0)+Tn{X) = X (7.8.2.4) If T has finite ascent and finite descent then ascent(r) = descent(r) (7.8.2.5) Proof: For (7.8.2.1) and (7.8.2.2) we need only observe r-*(o) = r-*-1^) =► r-*-1^) = r~*-2(o) (7.8.2.6) and Tk{X) = Tk+1{X) =► Tk+1{X) = Tk+2{X) (7.8.2.7) For (7.8.2.3) and (7.8.2.4), suppose T : X -> Y and S : Y -> Z are linear. We then claim T{x) n 5_1(o) = {0} <^> r-^o) = (5r)_1(o) (7.8.2.8) recalling the isomorphism between (5r)_1(0)/r_1(0) and T(X) n 5_1(0) used in the proof of (6.5.4.6) as part of the index theorem, while T{X) + S_1(0) = Y <^> S{Y) = {ST){X) (7.8.2.9) recalling the ismorphism between S{Y)/(ST)(X) and Y/{T(X) + 5_1(0)) used for (6.5.4.7). Taking (T,Tk) in place of (S,T) in (7.8.2.8) and (7.8.2.9) gives the first implications in (7.8.2.3) and (7.8.2.4), while for the second implications we argue (suj-^o) = (su2)-\o) = s-^o) =► (su^y^o) = s~\o) (7.8.2.10) and (V.S^X) = (V2S)(X) = S(X) =* (^F25)(X) = S(X) (7.8.2.11) Towards (7.8.2.5) we show that descent(T) < ascent(T) by proving the implication [VTU){W) = {VT){X) and [VT)'1^) = r_1(0) => {TU){W) = T{X) (7.8.2.12)
7.8 Ascent and Descent 273 and that ascent(T) < descent(T) by proving the implication (VTU)-lQ = (TU)-^ and [TU){W) = T{X) => (VT)"1*) = T"1^) (7.8.2.13) The details are left to the reader. ■ When X is a normed space and T G BL(X, X) then the subspaces T~n(0) are all closed, but there is nothing to suggest that either r~°°(0) or T°°(0) should be closed. They both have the "hyperinvariant" property (7.4.5.1): 7.8.3 THEOREM If X is a normed space and T G BL(X, X), then r-i(r-°°(o)) C r-°°(0) (7.8.3.1) and T essentially one-one => T{T°°{X)) = T°°{X) (7.8.3.2) If S e BL(X,X) commutes with T then S{T-°°{0)) C r-°°(0) and S{T°°{X)) C r°°(X) (7.8.3.3) If S G BL(X, X) is invertible and commutes with T, then [T - 5)-1(0) C r°°(X) and r-°°(0) C (T - S){X) (7.8.3.4) Proof: For (7.8.3.1) observe that if n G N is arbitrary and x G X then Tx g r-n(o) => x g r-n"1(o) c r-°°(o) (7.8.3.5) For (7.8.3.2) suppose y G T°°(X). Then if y = Tx' the cosets T~1y = x1 + r_1(0) are the same and evidently, for each n G N, (x' + r_1(0)) n Tn{X) ^ 0 (7.8.3.6) There are therefore sequences z = (zn) in X for which, for each n G N, ||*J|=1 zneT~1(0) x' + zn€Tn(X) (7.8.3.7) If T is essentially one-one, so that T_1(0) is finite dimensional, then by Theorem 6.6.4 the sequence (zn) has convergent subsequences zf = (zfn). Now, if we take x = x' + limn z'n then oo x 6 fl (x' + r_1(0)) n T"(X) (7.8.3.8) n=l
274 7. Operator Algebra and Commutivity so that y = Tx and xeT°°{X) (7.8.3.9) The inclusions (7.8.3.3) are clear. For (7.8.3.4) note that if S is invertible and commutes with T then for arbitrary n G N x e [T - 5)_1(o) => x = 5_1rx = s~nTnx = Tns~nx g Tn{x) (7.8.3.10) giving the first part of (7.8.3.4), while if x G X then rn+1x = 0=)> x = (5-r)5"1(/+5-1r+-+5-nr)iG (r-s)(x) ■ (7.8.3.11) The first part of (7.8.3.4) holds if instead S commutes with T and T — S is onto and X/Tn(X) is finite dimensional for each n G N (7.8.3.12) For by (3.11.1.4), the induced mappings (T - S)A : X/Tn{X) -> X/rn(X) are also onto, and by finite dimensionality also one-one, giving {T - 5)-1(0) C Tn{X) = {T- S){Tn{X)) for each neN (7.8.3.13) The second part of (7.8.3.4) holds if instead S commutes with T and T — S is one-one and T~n(0) is finite dimensional for each n G N (7.8.3.14) For by (3.11.1.1) the induced mappings (T - S)A : T~n(0) -> r_n(0) are also one-one, and by finite dimensionality also onto, giving {T - 5)-1r-n(0) = r"n(0) C (T - S){X) for each n G N (7.8.3.15) The hyperrange T°°(X) is used in the "punctured neighborhood theorem": 7.8.4 THEOREM If T G BL(X,X) is Fredholm, with generalized inverse T' G BL(X,X), and if S and S' in BL(X,X) are both invertible and commute with T, and also satisfy / + T'S invertible and / + T'S' invertible (7.8.4.1) then dim(r - 5;)_1(0) = dim(r - 5)_1(0) < dimr-1^) (7.8.4.2)
7.8 Ascent and Descent 275 and dimX/(r - S')X = d\mX/{T - S)X < dimX/TX (7.8.4.3) Proof: If I + T'S is invertible then Theorem 6.4.5 gives the inequalities at the end of (7.8.4.2) and (7.8.4.3). Writing *7A : r°°(X) -► r°°(X) for the operator induced by U G comm(T), we claim dimfr-S)-1^) = dim(r-5)A-1(0) = index(r-S) = index(r) (7.8.4.4) The first equality comes from the first inclusion of (7.8.3.4), the second equality comes from the fact that, by (7.8.3.2), (T — 5)A is onto, and the third equality is Theorem 6.5.5. Since the right-hand side of (7.8.4.4) is independent of S, equality follows in (7.8.4.2), and hence also in (7.8.4.3). ■ Theorem 7.8.4 derives its name from the consequence that, under certain circumstances, every operator in a certain "punctured neighborhood" of a Fredholm operator must be invertible: 7.8.5 THEOREM If X is a Banach space and A C BL(X,X) is a sub- space, and if T eAC comm(r) (7.8.5.1) then there is implication T Fredholm, Ted{A\ BL_1(X,X)) => T e iso{A \ BL_1(X,X)) (7.8.5.2) Proof: We are writing iso(K) = K\ acc(K) (7.8.5.3) for the isolated points of K C H, where the accumulation points form the set acc(K) = {t e fi: U e Nbd(*) =>UnK\{t}^0} (7.8.5.4) We claim T Fredholm, T G cl(BL_1(X,X) n comm(r)) => T £ acc(comm(r) \ BL_1(X,X)) Indeed, if T is Fredholm, then by Theorem 7.8.4 there is 6 > 0 for which, whenever S and Sf are in the set W{T,S) = Disc(0;<5) nBL_1(X,X) ncomm(r) (7.8.5.6)
276 7. Operator Algebra and Commutivity there is equality dim(r - 5'/)"1(0) = dim(r - 5)_1(0) and (7.8.5.7) dimX/(r - S')X = dimX/{T - S)X If, in addition, T is in the closure of BL_1(X,X) n comm(T) then for arbitrary 6' > 0 there is S' G W(T,Sf) for which T - Sf is invertible, so that dim(r - 5")_1(0) = dimX/(r - S')X = 0 (7.8.5.8) Taking in particular 8' = 6 it follows from (7.8.5.7) that for arbitrary SeW{T,6) dim(r - S)-1 (0) = dimX/(r - S)X = 0 (7.8.5.9) Since X is complete it follows from Theorem 4.6.3 that T — S is invertible. This proves (7.8.5.5), and hence (7.8.5.2) with A = comm(T), and hence (7.8.5.2) whenever A satisfies (7.8.5.1). ■ 7.9 SEMI-BROWDER OPERATORS A semi-Browder operator will satisfy about two-thirds of the conditions for a Browder operator: 7.9.1 DEFINITION If X is a normed space then T e BL(X, X) will be called upper semi-Browder if T is upper semi-Fredholm of finite ascent (7.9.1.1) and will be called lower semi-Browder if T is lower semi-Fredholm of finite descent (7.9.1.2) As for semi-Fredholm operators, we can only obtain the perturbation theory for semi-Browder operators in complete spaces. 7.9.2 THEOREM If X is a Banach space and S, T e BL(X, X) commute then S,T upper semi-Browder ^=> ST upper semi-Browder (7.9.2.1) S,T lower semi-Browder <£=> ST lower semi-Browder (7.9.2.2) T upper semi-Browder, S compact => T + S upper semi-Browder (7.9.2.3)
7.9 Semi-Browder Operators 277 and T lower semi-Browder, S compact (7.9.2.4) T + S lower semi-Browder Proof: If S and T are upper semi-Browder then ST is upper semi-Fredholm by (6.12.2.1), and also of finite ascent: for if r_n(0) = r-n_1(0) and S-n(0) = S-n_1(0) then, remembering that ST + TS, (ST)-n(0) = S-nT-n(0) = S-nT-n-1(0) K J K J w w (7.9.2.5) = r"n"15,"n(o) = r_n"15'"n"1(o) = (ST)-n~1(o) Conversely, if ST = TS is upper semi-Fredholm then S and T are upper semi-Fredholm by (6.12.2.1), and for some n G N, (ST)-°°(0) = (ST)-n(0) is finite dimensional (7.9.2.6) giving for arbitrary n G N r~m(o) c T~ms~m(o) v (7.9.2.7) = (ST)-m(0) C (ST)-°°{0) finite dimensional Thus T, and similarly 5, are also of finite ascent, finishing the proof of (7.9.2.1). If instead S and T are lower semi-Browder then by (6.12.2.2) ST is lower semi-Fredholm, and also of finite descent. For if Tn(X) = Tn+1(X) andSn(X) =5n+1(X) then {ST)n{X) = SnTn{X) = SnTn+1{X) = Tn+1Sn{X) = Tn+1Sn+1{X) = [ST)n+1{X) (7.9.2.8) Conversely, if ST = TS is lower semi-Fredholm then S and T are lower semi-Fredholm by (6.12.2.2), and for some n G N (ST)°°(X) = {ST)n{X) is closed of finite codimension (7.9.2.9) giving for arbitrary m €N Tm{X) D TmSm{X) = [ST)m{X) D {ST)°°{X) of finite codimension (7.9.2.10) Thus T, and similary S, are of finite descent, finishing the proof of (7.9.2.2). Towards (7.9.2.3) it follows from (6.12.2.3) that if T is upper semi- Browder and S is compact then T + S is upper semi-Fredholm, and hence by (6.12.2.1) also (T + S)n for each n e N, so that in particular (T + S)~n(0) is finite dimensional for each n e N (7.9.2.11)
278 7. Operator Algebra and Commutivity If, in particular, T is bounded below and S commutes with T then by (7.8.3.15) r_1(r + S)-n{0) = {T + S)-n{0) C T{X) for each n G N (7.9.2.12) If k > 0 is such that ||x|| < A;||Tx|| for each iGl then we claim dist(x, (r + S)~n(0)) < ikdist(rx, (r + S)~n(0)) for each n G N (7.9.2.13) Indeed, if x G X and yn G (T + S)-n(0) are arbitrary then by (7.9.2.12) there is zn G X for which yn = Tzn, and also by (7.9.2.12) it follows that zne {T + S)-n{0): now dist(s,(r + S)-n(0)) < \\x-zj < k\\T(x-zn)\\ =k\\Tx-yn\\ (7.9.2.14) Since yn was arbitrary,. (7.9.2.13) follows. We now claim that T + S must be of finite ascent. For if not then there is a sequence (xn) in X for which, for each n G N, ||xj| = 1 xn G (T + 5) —x(0) dist(xn, (T + 5)-»(0)) > J (7.9.2.15) It follows that if n and m> n + 1 are arbitrary, then *||Sxm - SxJ = k\\Sxm - (T + 5)xn + TxJ > kdist{Txn,{T + S)~n{0)) > \ (7.9.2.16) which contradicts the compactness of the operator S. This contradiction proves (7.9.2.3) in the special case when T is bounded below. If more generally T is upper semi-Browder, with T~p(0) = r~p_1(0) finite dimensional, then the induced mapping TA : X/T~p(0) -+ X/T~p(0) is bounded below and hence, by what we have just proved, the induced mapping (r + S)A : X/T~p{0) -+ X/T~p(0) is upper semi-Browder, in particular, of finite ascent. It follows that there is m G N for which n > m implies (r + S)~n{0) C(T + S)-nT~p{0) = {T + S)-mT-p{0) finite dimensional (7.9.2.17) which forces T + S to be of finite ascent, and finishes the proof of (7.9.2.3). Towards (7.9.2.4) it follows from (6.12.2.4) that if T is lower semi- Browder and S is compact then T + S is lower semi-Fredholm, so that using (6.12.2.2) (T + S)nX is closed of finite codimension for each neN (7.9.2.18)
7.9 Semi-Browder Operators 279 If, in particular, T is open and commutes with S then by (7.8.3.13) r_1(0) C (T + S)n{X) = T{{T + S)n{X)) for each n G N (7.9.2.19) If k > 0 is such that dist(x,r_1(0)) < A;||Tx|| for each x G X then we claim dist(x, (r + s)n(x)) < A;dist(rx, (r + s)n(x)) for each n G N (7.9.2.20) Indeed, if x G X and yn G (T + S)n(X) are arbitrary, then by (7.9.2.19) there is zn G (T + S)nX for which yn = Tzn: now using again (7.9.2.19) dist(x, (T + 5)n(X)) = dist(x - zn), (T + S)n(X)) < dist(x - z^T-'iO)) < k\\Tx - yn\\ (7.9.2.21) Since yn was arbitrary, (7.9.2.20) follows. We now claim that T + S must be of finite descent. For if not there is a sequence (xn) in X for which, for each n G N, KH = i xne(T + s)n(X) distfo.cr + sr+^ui (7.9.2.22) It follows that if 7i and m > n + 1 are arbitrary, then *||Sxm - SxJ = k\\Sxm - (T + 5)xn + TxJ| > k dist(rxn, (r + 5')n+1 (X)) > \ (7.9.2.23) which contradicts the compactness of S. This contradiction proves (7.9.2.4) in the special case when T is open. If, more generally, T is lower semi- Browder, with TP(X) = TP+1(X) closed and of finite codimension, then the induced mapping TA : TP(X) —► TP(X) is open, and hence, by what we have just proved, the induced mapping (T + S)A : TP(X) -+ TP(X) is lower semi-Browder, in particular of finite descent. It follows that there is m G N for which n > m implies (r + S)n{X) D {T + S)nTp{X) = {T + S)mTp{X) of finite codimension (7.9.2.24) which forces T + S to be of finite descent, and finishes the proof of (7.9.2.4). ■ For Fredholm operators of finite ascent or descent the perturbation theory of Theorem 7.9.2 extends to incomplete spaces, since now we can use (6.9.3.3) instead of Theorem 6.12.2. A semi-Browder operator which is Weyl must be a Browder operator:
280 7. Operator Algebra and Commutivity 7.9.3 THEOREM If X is a normed space and T G BL(X, X) then T Weyl and upper semi-Browder =>• T Browder (7.9.3.1) and T Weyl and lower semi-Browder =>► T Browder (7.9.3.2) Also T Fredholm of finite ascent =>► index(r) < 0 (7.9.3.3) and T Fredholm of finite descent =>► index(r) > 0 (7.9.3.4) Proof: We use the Index Theorem 6.5.4. If T is Weyl then for arbitrary n e N dimr"n(0) - d\mX/Tn{X) = index(rn) = n index(r) = 0 (7.9.3.5) so that if either dimT~n(0) or dimX/Tn(X) is a constant then so is the other. This proves both (7.9.3.1) and (7.9.3.2). If T is Fredholm of finite ascent p, then n index(r) = index(rn) = dimT"p(0) - dimX/rnX whenever n > p (7.9.3.6) so that either T has finite descent q, in which case n index(r) = dimr_p(0) - dimX/r9(X) independent of n > max(p, q) (7.9.3.7) which forces index(T) = 0, or T does not have finite ascent, in which case n index(r) = dimr_p(0) - dimX/Tn(X) —> -oo as n —► oo (7.9.3.8) which forces index(T) < 0. This proves (7.9.3.3), and the argument for (7.9.3.4) is exactly similar. ■ 7.10 CONNECTEDNESS AND HOMOTOPY A subset K of a topological space H is said to be connected if there are no disconnections of K in H, where a disconnection of K in H is a pair (G1, G2) of open subsets of H for which KCGjUGj KHG1nG2 = 0 KnG1^0^KnG2 (7.10.0.1) For example, the connected subsets of R are the intervals: J connected in R <^=> (a < b =^ [a,b] C J) for each a,b e J (7.10.0.2)
7.10 Connectedness and Homotopy 281 If K and H are subsets of f2, then K connected, K C H C cl(jRT) => jff connected (7.10.0.3) and if, jff connected, Kfl^^0=^KUff connected (7.10.0.4) If K C n and jff C A, then if connected in fi, if connected in A => K x H connected infix A (7.10.0.5) If <f> : H —► A is a mapping then ^ continuous on if, if connected in H =>• <£(if) connected in H (7.10.0.6) Both (7.10.0.4) and (7.10.0.5) have extensions to arbitrary families [K-) -€J: in particular if t G K C H then CompK(i) = (J {# C K:t E H connected in H} C K is connected in Q (7.10.0.7) We call the set (7.10.0.7) the connected component of t in if. Evidently if 5 and t are points of K then either CompK(£) = CompK(s) or CompK(£)nCompK(s) =0 (7.10.0.8) Thus, the set of connected sets {CompK(£):£ G K} forms a "partition" of the set K. If if C H and if t G H \ if then the complement of the component of t in the complement of if will be known as the "connected hull" of if with respect to t: 7.10.1 DEFINITION If if C fi and t G fi \ if then the connected hull of if with respect to t is the set rjtK for which n\tltK = CompQ\K{t) (7.10.1.1) A hole in if with respect to t is a component of H \ if other than H \ rjtK. Thus the connected hull of if is obtained by "filling in the holes." 7.10.2 THEOREM If if and H are subsets of fi \ {t} then Tjt{TjtK) = VtK (7.10.2.1)
282 7. Operator Algebra and Commutivity and H C K => rjtH C 77tK (7.10.2.2) If K is closed and H is locally connected then rjtK = c\(rjtK) (7.10.2.3) Proo/: (7.10.2.1) and (7.10.2.2) are clear from (7.10.1.1). If fi is locally connected, in the sense that t e n, U e Nbd(r) => * G V C Z7 for some connected 7 G Nbd(*) (7.10.2.4) and if s G fi \ t^K is arbitrary, then since fi \ if is a neighborhood of 5 there is a connected neighborhood V of 5 in H for which V C fi \ jK". Since V and fi \ rjtK are connected and not disjoint then also V U (fi \ rjtK) is connected. By the definition of r\tK it follows that V C fi \ rjtK. This means that fi \ rjtK is open, giving (7.10.2.3). ■ There is an interaction between the connected hull and the topological boundary: 7.10.3 THEOREM If H, K and L are subsets of U then H C K and dK C jff U L => aiT C (Sjff) U L (7.10.3.1) and if if and K are closed, then dK CH and dH C K => a(K uF)c (ax) n (an) ca(KnF)cxn£ (7.10.3.2) If fi is locally connected and H and if are closed subsets of fi \{t}, then dKCHCK=^HCKCTjtH<^ drjtK C H C K (7.10.3.3) and drjtK CdK CK CTjtK = TjtdK (7.10.3.4) Proof: If H C if and <9K C H U L, then aK C (HuL)n(fi\int(K)) C (jffUL)n(n\int(jff)) C (dH)uL (7.10.3.5) giving (7.10.3.1). If dK C H and K is closed then d{KuH)CdKudHCHCKuH (7.10.3.6)
7.10 Connectedness and Homotopy 283 and hence by (7.10.3.1) dK C dH. Interchanging the roles of K and H now gives the first inclusion of (7.10.3.2). At the same time (7.10.3.1) also gives dKCHnKCK=>dKC d{H n K) C K (7.10.3.7) and interchanging the roles of K and H gives the second inclusion of (7.10.3.2). Towards the first part of (7.10.3.3) suppose that L C fi is connected: then ^nL = 0^(KaorKDLorKnL^0) (7.10.3.8) since otherwise (int(jK"),fi \ cl(K)) would be a disconnection of L in H. If in particular L = H \ 77t.ff with if C if C Q \ {£} then the left-hand side of (7.10.3.8) holds, while neither the second nor the third of the alternatives on the right-hand side is possible. This gives the first implication of (7.10.3.3). Towards the second implication recall that by local connectedness each component of H \ K is open, giving {rjtK) \ K C int(i7ttf) (7.10.3.9) if K = cl(jRT) C fi \ {t}, so that d{rjtK) CK CrjtK (7.10.3.10) By (7.10.2.1) and (7.10.2.2) it follows that if H C K C rjtH then rjtH = rjtK, and applying (7.10.3.10) with H in place of K gives drjtK = drjtH C H. This gives the second implication in (7.10.3.3). Finally (7.10.3.10) is already part of (7.10.3.4), and gives the first inclusion by application of (7.10.3.1) with [H,K,L) replaced by (K,rjtK,0): the rest then follows at once. ■ As a supplement to (7.10.3.4) observe that if fi = X is a normed space and if 5 G dK \ drjtK then there are sequences (sn) in X and [Hn) of connected components of X \ K for which snedHn and ||5-5J|—>0 (7.10.3.11) Indeed, if we do not already have 5 G dH for some fixed component H of X\K, then there will be [s'n) in X \ K for which \\s'n - s\\ -> 0, and for sufficiently large n we must have sfn G Hn with connected Hn C X\K\ now if we look at the line segment {ts + (1 — t)sfn:0 < t < 1} we will find at least one point sn G dHn. The first inclusion of (7.10.3.3) is not in general reversible: for example
284 7. Operator Algebra and Commutivity take H = C and H = {seC:2 < \s\ <3} K = {s G C: 1 < \s\ < 3} and \t\ > 3 (7.10.3.12) If X and Y are normed spaces then, in the notations (3.3.1.3), (3.3.1.4), (3.7.1.3), and (3.7.1.4), it follows from (7.10.3.2) that d{rleft BL(X, Y) U f right BL(X, Y)) C fleft BL(X, Y) n fright BL(X, y) (7.10.3.13) and <9(<jleft BL(X, Y) U <rright BL(X, Y)) C aleft BL(X, y) n <Jright BL(X, y) (7.10.3.14) For by (3.5.3.3) and (5.5.6.1) the left-hand side of (7.10.3.2) holds with K = <rleftBL(X,y) and H = frightBL(X,y), while by (3.7.5.1) and (3.7.5.2) the same is true with K = <jleft BL{X,Y) and H = <jri*ht BL{X,Y). By (7.10.3.1) it now follows that the left-hand side of (7.10.3.13) is contained in the right-hand side of (7.10.3.12). The analogues of (7.10.3.12) and (7.10.3.13) also hold when the space of operators BL(X,y) is replaced by a normed algebra A. Since normed spaces are easily seen to be locally connected, we can also derive results about connected hulls: 7.10.4 THEOREM If A is a closed subalgebra of a normed algebra B, then <9(<7left(A) U <jright(A)) C rleit{B) n fright(5) n A (7.10.4.1) and <jleft(A) U <jright(A)) C T)i{Tleit{B) n fright(£) n A (7.10.4.2) where we are taking the connected hull relative to the identity 1 G A C B. Proof: For (7.10.4.1) we combine (3.10.7.1), (3.10.7.2) and (3.10.6.5), and then (7.10.2.4) is the first implication of (7.10.3.3). ■ The "punctured neighborhood" Theorem 7.8.5 suggests applications of a modified version of (7.10.3.3), holding "to within isolated points." If H and K are subsets of H then H CK => acc(H) C acc(K) (7.10.4.3)
7.10 Connectedness and Homotopy 285 and we have a decomposition of the boundary of K: dK = <9(acc(K)) U iso(K) with <9(acc(K)) n iso(jRT) = 0 (7.10.4.4) Thus also acc(dK) = <9(acc(K)) and iso(di<:) = iso(jRT) (7.10.4.5) 7.10.5 THEOREM If fi is locally connected <md H C K C Q \ {t} are closed subsets, then t]tK = 77t(acc(K)) U iso(K) (7.10.5.1) and dK C H U iso{K) =>KCrjtHU iso{K) => drjtK CHU iso{K) (7.10.5.2) Proo/: Toward (7.10.51) it is clear that K C 77t(acc(K)) U iso(K) C rjtK (7.10.5.3) We claim that also, whenever L is the closure of a hole in K, L C 77t(a77tL) C 77t(acc(K)) (7.10.5.4) To prove (7.10.5.4) we show dL n iso(K) C n \ a?7tL (7.10.5.5) If s G iso(K), then by the local connectedness of H there is a connected neighborhood U G Nbd(s) for which U D K \ {s} = 0, and if also 5 G dL then LnU\{s} ^ 0. By (7.10.0.4) it follows that U \ {s} C int(L), and hence that 5 £■ d77t£. This establishes (7.10.5.5), so that also drjtL C acc(K) (7.10.5.6) which with (7.10.4.3) gives (7.10.5.4). This completes the proof of (7.10.5.1). Towards (7.10.5.2) suppose that H C K <md dK C H U iso{K): then by (7.10.4.4) <9(acc(K)) C H (7.10.5.7) and hence by (7.10.3.2) acc(K) C rjtH (7.10.5.8)
286 7. Operator Algebra and Commutivity giving the first implication of (7.10.5.2). If instead K C rjtH U iso(K) then also (7.10.5.8) holds, giving with (7.10.5.1) rjtH C rjtK C rjtH U iso(K) (7.10.5.9) and hence by (7.10.3.1) d{t]tK) C d{rjtH) U iso(jRT) CffU iso(jRT) ■ (7.10.5.10) For example, if X is a Banach space and A G BL(X, X) is a linear subspace, and if T G A C BL(X,X) as in (7.8.5.1) then (7.8.5.2) gives implication TedK=>TeHU iso{K) (7.10.5.11) where K = {S G A: S is not invertible} and (7.10.5.12) H = {S e A: S is not Fredholm} Since we may not have inclusion dK C H U iso(K) it does not immediately follow that T e K => T e rjtH U iso(K) (7.10.5.13) If H and A are topological spaces, then continuous functions f,g G C(H, A) are said to be homotopic if there is a mapping H : [0,1] x H —► A for which fT(O,.) = /(0 H(l,.)=^(«) and (7.10.5.14) if G C([0,1] x n, A) is continuous When (7.10.5.4) holds we shall write f~g and ge[f]„ (7.10.5.15) and often speak of (^t)o<t<i = (-^(^0)o<t<i BS a homotopy between h0 = f and hx= g. If / G C(H, A) is homotopic to a constant we shall say that / is contractible. If the identity I: Q —> Q is homotopic to a constant we shall call the space H contractible. For example, normed spaces, convex subsets of normed spaces, and bounded linear mappings are all contractible. If / : H —► A and g : A —► M are continuous, then / contractible or g contractible => g o f contractible (7.10.5.16)
7.10 Connectedness and Homotopy 287 In particular, every continuous mapping in or out of a contractible space is contractible. A space fi is said to be arcwise connected if every pair of mappings into fi from the one-point space {0} are homotopic. Here the homotopy (^t)o<i<i is referred to as an arc. Evidently if H is a topolgical space H contractible => fi arcwise connected => fi connected (7.10.5.17) In partial converse, if fi is "locally arcwise connected," in the sense that every neighborhood of every point contains an arcwise connected neighborhood of that point, then K C fi open and connected => K arcwise connected (7.10.5.18) A topological space fi is called simply connected if every pair of continuous mappings from the circle S C C to H are homotopic: evidently necessary and sufficient is fi arcwise connected with the property that mapping from S to H is contractible. It is significant, and perhaps not immediately obvious, that the circle S is not simply connected. Towards a proof of this, introduce mappings ex : R —► S and lg:S\ {-1} —► R (7.10.5.19) where ex{t) = e27rit if t e R and lg{e27rit) = t if - \>k < t < \>k (7.10.5.20) If 4> : fi —► S is an arbitrary mapping, then a lift of <f> will be a mapping <f>v : H —► R for which </> = ex o <f>w. 7.10.6 THEOREM If fi is a topological space and <f> e C(S, fi), then <f> contractible <£=> <f> has a continuous extension 4>A : D —> Q (7.10.6.1) If instead <f> G C(H,S), then <f> contractible ^=^ <j> has a continuous lift <j>w : fi —> R (7.10.6.2) Proof: Reverse implication in each case is clear from (7.10.5.16). Conversely, for (7.10.6.1), suppose that {ht)0<t<1 is a homotopy in C(S,H): then we claim 3/ig e c(D, n) => 3/ij e c(d, n) (7.10.6.3)
288 7. Operator Algebra and Commutivity Specifically, define for each iGR and each r G [0,1], h${rc2xit) = h*{2re2Kit) if 0 < r < \ and (7.10.6.4) h^re2*") = h2r^{e2irii) if \ < r < 1 Intuitively, we have /^ : D -> A -> fi with A = (D x {0}) U (S x [0,1]), where the embedding of D in A is achieved by pasting the interior of the disc across the top of the open cylinder down the sides and across the bottom. This gives the forward implication in (7.10.6.1) if we take h0 to be constant and hx = <f>. For the forward implication in (7.10.6.2), suppose that {ht)0<t<1 is a homotopy in C(n,S): then claim 3/tg e C(fi, R) <=>> 3h\ e C(fi, R) (7.10.6.5) By the compactness of [0,1] there is a partition (t-)^=1 with 0 = t0 < t-L < - - - < tn = 1 for which supwGn \htj-(u) — /^.^(w)! < 2 for each j = 1,2,..., n: if we now define g (u) = 9tAy for each ueVt,j = l,2,...,n (7.10.6.6) then ^y(H) C S\{—1} for eachj and h^w) = h0(u)g1(uj).. .gn{u) for each cj; thus we can lift hx by taking n MM = MM + X^MW)) for each w G n (7.10.6.7) Thus for forward implication in (7.10.6.2) take h0 to be constant and hx = Theorem 7.10.6 enables us to define a winding number for continuous functions on the circle: winding number [4>) = ^(1) - ^(0) (7.10.6.8) where <£* = V>v is a continuous lift for ip = <f> o ex : R —► S (7.10.6.9) Explicitly c2«Mt) = <£(e27rt"<) for each t G R (7.10.6.10)
7.10 Connectedness and Homotopy 289 The winding number is well defined, and an integer, since if H is connected then any two lifts for a continuous function </> : H —► S must differ by a constant. The winding number picks out the contractible continuous functions on the circle: 7.10.7 THEOREM If <fr : S -> S is continuous then the following are equivalent: <f> is contractible (7.10.7.1) <j> has a continuous extension <f>A : D —► S (7.10.7.2) 4> has a continuous lift <f>v : S —► R (7.10.7.3) winding number (cf>) = 0 (7.10.7.4) The circle S is not contractible. Proof: The equivalence of the first three conditions is Theorem 7.10.6. If {ht)0<t<1 is a homotopy in C(S,S) then we claim winding number (h0) = winding number (/ix) (7.10.7.5) This is because the mapping t —► winding number (ht) is continuous, and maps the connected space [0,1] into the integers Z. Since of course the winding number of a constant is 0 we have proved that (7.10.7.1) implies (7.10.7.4). Conversely, if (7.10.7.4) holds then so does (7.10.7.3): for we may define <f>y by setting P(e2irit) =<f>*{t) if 0 < t < 1 (7.10.7.6) This completes the proof that the four conditions are equivalent. Since for each n G Z it is clear that winding number (zn) = n (7.10.7.7) the identity / = z : S —> S fails the condition (7.10.7.4), so that by (7.10.7.1) the circle is not contractible. ■ Intuitively, the winding number of <f> : S —► S counts the number of times <f>(u) G S winds round the point 0 G C when the point wG S winds once round 0 G C. If also ip : S —► S is continuous then winding number [<f> • ip) = winding number (<f>) + winding number (tp) (7.10.7.8)
290 7. Operator Algebra and Commutivity thus acting as a group homomorphism from C(S,S) to Z, and also winding number (ip o cj>) = winding number (<f>) • winding number (tp) (7.10.7.9) 7.11 GENERALIZED EXPONENTIALS If A is a normed algebra, then the invertible elements A~* form a subgroup of the multiplicative semigroup A, and of course also a topological space. The connected component of 1 in A-1 turns out to be a subgroup of A~x: 7.11.1 DEFINITION If A is a normed algebra we shall write A'1 = CompA_! (1) C A'1 (7.11.1.1) In the notation of Definition 7.10.1 we are writing A\A^1=nl{A\A-1) (7.11.1.2) The subset Aq1 is both a subgroup, and a relatively closed subset, of A-1: 7.11.2 THEOREM If A is a normed algebra, then Aq1 is a subgroup of A-1 £ A, and there is inclusion a~xA^xa C Aq1 for each a G A'1 (7.11.2.1) and equality A'1 fl c\{Aq x) = Aq1 (7.11.2.2) If A-1 is an open subset of A then also Aq1 and A'^Aq1 are open subsets of A (7.11.2.3) and d{Ao x) C fleft(A) n fright(A) (7.11.2.4) Proof: By definition, A^1 is a connected subset of A-1 containing 1, and there is implication, using (7.10.0.4), for subsets K C A, 1EKCA-1 and K connected => K C Aq1 (7.11.2.5) By the continuity of multiplication and inversion, together with (7.10.0.5) and (7.10.0.6), the left-hand side of (7.11.2.5) is satisfied if K = (Aq x)(Aq x) and if K = (Aq1)"1, so that by the right-hand side of (7.11.2.5) Aq1 is a group. For the same reason each of the sets a~1A0~1a with a G A-1 satisfies
7.11 Generalized Exponentials 291 the left-hand side of (7.11.2.5), so that we obtain (7.11.2.1). By (7.10.0.3) the left-hand side of (7.11.2.5) is also satisfied if K = A~x n c^Aq1), so that we obtain (7.11.2.2). Finally if A~* is open in A, so that there is 6 > 0 for which {1 - x: \\x\\ <6}C A'1 (7.11.2.6) then the same argument (7.11.2.5) says that {l-x:\\x\\<6}CA^ (7.11.2.7) It follows at once that if a G A-1 is arbitrary then aeAQ1=^ {a(l - x): \\x\\ < 6} C Aq1 (7.11.2.8) and cGA_1\ A'1 => {a(l - x): \\x\\ < 6} C A'1 \ Aq1 (7.11.2.9) giving (7.11.2.3). Also Theorem 7.10.3 and Theorem 7.10.4, together with Theorem 4.4.7, give 0(Ao x) = drj1{a]eit{A)uaTlsht{A)) C f]eit{A) nfright(A) ■ (7.11.2.10) The condition (7.11.2.1) is what it means for Aq1 to be a normal subgroup of A-1. If it holds then the sets of left and right cosets {aA-^.aeA-1} = {A^aiaeA'1} = A~1/Aq1 (7.11.2.11) coincide, and carry a natural group structure. The argument for (7.11.2.4) can be made more explicit. We claim, if A~* is open in A, c1(Aq *) \ (fleft(A) n fright(A)) C Aq1 (7.11.2.12) Indeed, suppose a G c^A^"1), with (an) in A~l satisfying \\a — an\\ —► 0 as n —► oo. Fixing 6 > 0 for which (7.11.2.6) holds, it is possible that there is m G N for which \\*m\\\\*-*m\\<* (7.11.2.13) in which case by (7.11.2.7) a^a =l-xmeAo1=^a = am{a^a) G A"1 (7.11.2.14) If on the other hand there does not exist m G N for which (7.11.2.13) holds then, for each nGN, maxfllcOlJa-^ll) < (1 + l/tf)||o-on|||K1|| (7.11.2.15)
292 7. Operator Algebra and Commutivity and, since \\a - an\\ -> 0, this means that a G r]eit(A) n fright(A). If A is actually complete then we can express the elements of Aq1 in terms of exponentials: 7.11.3 DEFINITION If A is a Banach algebra and a G A, then oo n exp(a) = ea = 1 + Y" ^- (7.11.3.1) n=l and Exp(A) = {efllefl2---efln:n€ N,{al5a2,... ,an} C A} (7.11.3.2) More generally, we can talk about exponentials of certain elements of an incomplete algebra. If afc+1 = 0 then ea = exp,(a) = l+£^- (7.11.3.3) while if instead a2 = ta for some iGK then ea = 1 + (c* - l)a (7.11.3.4) In general, however we may as well work in the completion A~ of an incomplete algebra A, and possibly inquire afterward whether or not an exponential happens to lie in the original. When A is complete then the sets Exp [A) and Aq1 coincide: 7.11.4 THEOREM If A is a Banach algebra and a, 6 G A, then ab = ba => exp(a + 6) = exp(a) exp(6) (7.11.4.1) and if x G A, then ||x|| <l=>l-x = ea with a = - ^ — (7.11.4.2) n=l There is equality Aq1 = Exp(A) (7.11.4.3)
7.11 Generalized Exponentials 293 Proof: We offer two proofs of (7.11.4.1). If a and 6 commute in A then for each n G N exPn(a) exPnW - exPn(a + h) = Y, { V?: r^ G {0,1,..., n} , n + 1 < r + 5 < 2n 1 (7.11.4.4) and hence ||expn(a)expn(6) - expn(a + 6)|| < expn(||a||) expn(||6||) - expn(||a|| + H&H) —+ 0 asn-^oo (7.11.4.5) Alternatively, still assuming ab = 6a, (5.12.6.3) and (5.12.2.12) give — (exp(*a + 6)) = aexp(*a + 6) (7.11.4.6) at and — (exp(-ia) exp(*a + 6)) = 0 (7.11.4.7) at for all t G R. By (5.12.4.2) it follows that exp(—ta) exp(ta + 6) is independent of t, giving again (7.11.4.1). Towards (7.11.4.2) define, if ||x|| < 1 and -Kt\\x\\<l9 f{t) = exp(-</(*)) and g(t) = £ — (7.11.4.8) By (5.12.6.3) both g and / are differentiate, with g'(t) = x(l - tx)-1 and f'(t) = -g'(t)f(t) (7.11.4.9) so that ^((l-*x)-7W)=0 (7.11.4.10) By (5.12.4.2) it follows that /(l) = /(0)(l-x), which is (7.11.4.2). Towards (7.11.4.3) it is clear that if a G A then ea G A'1 with (ea)_1 = e~a (7.11.4.11) and then, by applying the argument of (7.11.2.5) with K = {eta: 0 < t < l}, that ea G Aq1 for each aeA (7.11.4.12)
294 7. Operator Algebra and Commutivity This proves that the set Exp [A) is a subset of Aq1. With the aid of (7.11.4.2) we claim that also Exp(A) and A'1 \ Exp(A) are open in A (7.11.4.13) Indeed if a G A-1 then a G Exp(A) => {a(l - x): \\x\\ < 1} C Exp(A) (7.11.4.14) and cGA_1\ Exp(A) => {a(l - x): ||x|| < 1} C A"1 \ Exp(A) (7.11.4.15) Now if equality (7.11.4.3) were to fail, then (Exp(A),A_1 \Exp(A)) would give a disconnection of A^"1, which would contradict the fact that Aq1 is connected. ■ Homomorphisms of normed algebra map generalized exponentials into generalized exponentials, and sometimes onto: 7.11.5 THEOREM If A and B are normed algebras and T G HBL(A, B) is a homomorphism then T{Aq j) C Bq-1 (7.11.5.1) If A and B are complete, and T is onto, then T{AZx) = B0_1 (7.11.5.2) and r-1^1) C A'1 + r_1(0) (7.11.5.3) If in addition A-1 is connected then (7.11.5.3) holds with equality. Proof: For (7.11.5.1) apply (7.11.2.5) with Bq1 in place of Aq1, taking K to be the set T{Aq1). If B is complete and 6 G i?^1 is arbitrary, then by (7.11.4.3) there is ra G N and dl9 d2, • • •, dm in 5 for which 6 = edl ed2 • • • edm G Exp(E) (7.11.5.4) If T is onto then there are c1,c2,..., cm in A for which dy = T(cy) for each y = 1,2,..., m (7.11.5.5) By the continuity of T we have a = eCl eC2 • • • eCm => b = T{a) with a G Exp(A) C Aq x (7.11.5.6)
7.11 Generalized Exponentials 295 This proves (7.11.5.2), and hence also T"1^1) = Aq1 + T-^O) (7.11.5.7) from which (7.11.5.3) follows at once. Finally if A~* is connected then A~l = Aq1, so that (7.11.5.7) becomes (7.11.5.3) with equality. ■ If T G BBL(A,B) is an onto homomorphism then, in the notation of (7.6.1.4), T = JA where J = T_1(0) is obtained by quotienting out an ideal of A: now (7.11.5.3) says that the elements of A which have cosets which are generalized exponentials must all be "Weyl" elements relative to the homomorphism. Conversely, an element 6 G B~x will be a generalized exponential if and only if there is implication r_1(6) cr^r^o) whenever T G HBL(A,5) is a homomorphism onto (7.11.5.8) For consider the algebra A = {/ S C([0,l],B):/(0) C Kl} (7.11.5.9) together with the homomorphism T : A —► B defined by setting T(f) = /(l) for each / G A (7.11.5.10) If 6 G B'1 can be put in the form 6 = T{f) then {f{t):t G [0,1]} is a continuous arc in B~x connecting 6 to an element of the form tl with t G K, which in turn is in the same component as 1 G -B-1, forcing 6 G Bq1. The connected component of the identity in the algebra A = C^ft) can be described in terms of homotopy: 7.11.6 THEOREM If Q is a separated normal space, then Coo^o"1 = {<t>^ Coo(n)"1:sgn(^) is contractible} (7.11.6.1) where sgn(^) G C(S,S) is defined by setting sgn(^)(w) = </>(u)/\<l>{u)\ for each wGH^G (^(n)"1 (7.11.6.2) There is abelian group isomorphism cM(n, q-'/c^n, c)^1 = [cun, s) (7.11.6.3)
296 7. Operator Algebra and Commutivity Proof: If a and 6 are in C^ (H) * then we claim ab'1 e C^Q)-1 ^=> sgn(a) ~ sgn(6) (7.11.6.4) and sgn(a)o-1 G CM(n)o J (7.11.6.5) Indeed, if a and 6 are in the same connected component of C^Q)-1, which is an open subset of the normed space (^(H), which is locally arcwise connected, then by (7.10.5.18) there is a homotopy {ht)0<t<1 in C^Q)_1 with h0 = a and h± = b, which induces a homotopy sgn(ht) between sgn(a) and sgn(6), and gives forward implication in (7.11.6.1). Also the family (a/|a|*)0<i<1 constitutes a homotopy between a and sgn(a), interpreted for the moment as an invertible mapping from ft to C. This proves (7.11.6.5), and gives backward implication in (7.11.6.1). Finally, to prove (7.11.6.3) when the scalar field K is complex, we have only to confirm that every homotopy equivalence class of mappings from Q to S corresponds to some coset in C00(n)~1/C00(n)^"1: but since C(f2,S) can be interpreted as a subset of (^(n)_1 this is clear. ■ The condition that C'00(r2)_1 is connected takes the form [CUn,S) = {[l]J. (7.11.6.6) which is in a sense dual to the condition that ft be simply connected: [CUS,n) = {[lU (7.11.6.7) The condition that the space H is connected can also be expressed in terms of the exponential function. If the scalar field K is complex then n connected 4=> exp_1(l) = 2ttZ1 C C^fi) (7.11.6.8) More generally, if a G C'00(n) is arbitrary then exp(a) = 1 if and only if for each connected component K of Q there is n G Z for which a = 27rin on K. 7.12 CONTINUOUS FUNCTIONS If A = 0^(0.) = (^(n, K) and if H is a separated normal space then there is a duality between closed subsets of f2 and closed ideals of A. For if CO and JCiwe shall write K° = {aeC^Ciy.KC a_1(0)} C A (7.12.0.1) and J0 = n{c-1(0):cGJ} CO (7.12.0.2)
7.12 Continuous Functions 297 The analogue of Theorem 5.2.2 is immediate: KCK' CQ=> (K')° C K° and K C {K°)0 (7.12.0.3) and JCJ'U^ (J')0 C J0 and J0 C (J0)° (7.12.0.4) Using Urysohn's lemma (Theorem 3.12.4), we can get sharper results analogous to the consequences of the Hahn-Banach theorem: 7.12.1 THEOREM If Cl is a separated normal space and K C ft is a closed subset, then K = (K°)0 (7.12.1.1) and there is isometric isomorphism a + K «—» 0|K : C^nyK0 = C^tf) (7.12.1.2) In particular Coo(20 = {a|jr:o€Coo(n)} (7.12.1.3) Proof: If a; G H \ jfiT, then by Urysohn's lemma (Theorem 3.12.4) there is u G C(n,[0,1]) for which u(u) = 1 and u = 0 on K, which means that u G K° is not annihilated by u: thus, u £ [K°)0. This reverses the second inclusion of (7.12.0.3) and proves (7.12.1.1). Towards (7.12.1.2) it is clear that the restrictiion mapping is well defined on the quotient C00(n)/JK'0, linear and multiplicative, and bounded with bound < 1. To see that it is isometric, suppose that e > 0 is arbitrary, and put He = | u G n: |a(w) | > e + sup |a(.) 11 (7.12.1.4) Then by Urysohn's lemma there is ue G C(f2, [0,1]) for which ue = 0 on K and ue = 1 on He. Now be = uea satisfies b£ G K° and \\a - b£\\ < e (7.12.1.5) and it follows that the quotient norm in (7.12.1.2) is < \\a>\K\\ + s. Since e is arbitrary, the isometry (7.12.1.2) follows. It remains to be proved that the mapping from a + K° to a* is onto C^K), which we do by proving (7.12.1.3). We consider the real and the complex cases separately, and suppose first that 6 G C00(iiC) satisfies sup6(.) = 1 and inf &(•) = -1 (7.12.1.6) K K
298 7. Operator Algebra and Commutivity (7.12.1.7) iw2x»i (7.12.1.10) Putting Ko={u€K:b(u)<-±} K$ = {u€K:b{U)>±} there is by Urysohn's lemma w0GC(n,[-|,|j) for which u0 = -| on Kq and u0 = | on Xj" (7.12.1.8) Evidently ||u0|| < | and 116!|| < | where bx = b - u0 on K (7.12.1.9) Inductively we now define sequences (un) in (^(H), (6n) in C00(JK'), and (jfiT"), (if+) of closed subsets of H, taking for each n K~ = {weK:bn(w) < -(!)(§)"} 1C = {«S If: 6» >(|)(|)"} using Urysohn's lemma to find un with «.6C(n,-(i)(i)*,(i)(i)") «n = ± (J) (|r on K± (7.12.1.11) and finally *>n+i = K ~ un = b ~ K> + *i + * * * + tO on ^ (7.12.1.12) Since ||un|| < (|)(|)n and C'00(r2) is complete (Theorem 4.3.3), we can now define oo «=E«»eCj(l) (7.12.1.13) n=0 Since ||6n|| < (|)(|)n -> 0, it is clear that a = b on K, with ||a|| < 1 = ||6||. This proves that 6 = a* whenever 6 G (^(H) = C^HjR) satisfies (7.12.1.6). More generally, if c G C'00(r2, R) is arbitrary we can write c= (az + 0)ob with/? + a = supc(-) and 0 - a = inf c(-) (7.12.1.14) K K where 6 satisfies (7.12.1.6). Now, if a G C^Q) is given by (7.12.1.13) then the composed mapping (az+/3) oa is an extension of c to H. This completes the proof of (7.12.1.3) for real scalars. Finally, for complex scalars, we need only extend real and imaginary parts separately. ■
7.12 Continuous Functions 299 The extension theorem incorporated in Theorem 7.12.1 is known as Tietze's theorem: it is interesting that Urysohn's lemma is an immediate consequence. Indeed, if c\{K0) C in^l^) C ft then the mapping 6 : K —► [0,1] obtained by setting K = c\{K0) U (n \ (KJ) b = 1 on cl(lf0) 6 = 0 on n \ int^) (7.12.1.15) is continuous, and if u = a : ft —► [0, l] is a continuous extension of 6 then u satisfies (3.12.4.3). In the special case of a metric space there is a direct derivation of Tietze's theorem which bypasses Urysohn's lemma: if be C^HjR) satisfies inf 6(.) = 1 and sup6(«) = 2 (7.12.1.16) K K we obtain a continuous extension a of 6 to ft if we define a{u) = b{u) if u G K and , x . _ dist(u,uf)b(uf) ._ ^ % ,., a(w) = inf —J.\,/ V\ if a; G n \ X v y w'gk dist(a;,i;r) x (7.12.1.17) The continuity of a must be established separately for points interior to K, points outside K, and points of the topological boundary of K. We might remark that we could have used Tietze's theorem in the proof of (7.10.6.1) instead of the explicit construction we actually used. On the other hand, Theorem 7.10.7 serves to warn us that we need not expect an analogue of Tietze's theorem in which the unit interval is replaced by, for example, the circle. Theorem 7.12.1 tells us something about certain special ideals of C^Q): it is clear from (7.12.0.3) and (7.12.0.4) that if K C Q then J = K° => J = (J0)° (7.12.1.18) and, if in particular, ft is normal and separated, then J = K°=* CM(n)/J = CM) (7.12.1.19) If, in particular, the space f2 is compact then this holds for all closed ideals: 7.12.2 THEOREM If n is compact Hausdorff and J C C^Cl) is an ideal, then d(^) = (J0)° (7.12.2.1)
300 7. Operator Algebra and Commutivity Proof: We begin by proving, for arbitrary a G (^(H), J0 C int a"1 (0) => a G J (7.12.2.2) Indeed, for each u G H \ J0 there is 6W G J for which &w(u;) = 1, and then a neighborhood Uu = {u/efh |&w(k/)| > ^} of a; in Cl. By compactness there is a finite subset fl'CO\ inta_1(0) for which n\a_1(0) cn\inta_1(0) C |J Uu (7.12.2.3) wen' If we now put &=£U>W (7.12.2.4) then 6 G J and |6(-)| > 1/4 on n \ inta_1(0) (7.12.2.5) It follows that the reciprocal 1/6 is bounded and continuous on the closed set H \ a_1(0), and therefore by Tietze's Theorem 7.12.1 has a continuous extension to H. If c G 0^(0.) is such an extension then we have a = abce acJ C J (7.12.2.6) We now claim that if a G (^(H), then JQ C a"1 (0) => a G cl( J) (7.12.2.7) Indeed, for each e > 0 there is by Urysohn's lemma ue G C^Q) for which, if u G H, 0 < u£(cj) < 1; |a(w)| < |c => ue(u) = 0; |a(w)| > e => u£(cj) = 1 (7.12.2.8) If we now take be = ueA then ||a-6j| = ||a(l-ue)|| < e and J0 C a_1(0) C int&J^O) (7.12.2.9) By (7.12.2.2) be G J for each e > 0, giving (7.12.2.7). ■ As a special case of (7.12.2.2) it follows J0 = 0 => J = C^Q) (7.12.2.10) We note that in this case the function 6 of (7.12.2.4) has a continuous reciprocal on the whole of H, so that we do not need to use Tietze's theorem
7.12 Continuous Functions 301 to extend it. It is also clear exactly what are the maximal ideals of C^fi): in the notation of Definition 7.2.1 n compact => MLIC^n) = MRIC^H) = Ut}° :teCl\ (7.12.2.11) Urysohn's lemma, and in particular (7.12.1.1), tells us that the algebra A = 0^(0.) separates points of the space H, in the sense that if a;,a/ G H, then u/ /. v =» a{u') ^ a(w) for some a G A (7.12.2.12) The converse to this is called the Stone- Weierstrass theorem. In the complex case we need to assume that the algebra A is self-adjoint, in the sense that A = A*={a:aeA} (7.12.2.13) of course every subset of C^HjR) is automatically self-adjoint. Before stating the Stone-Weierstrass theorem, we note Dini's theorem: 7.12.3 THEOREM If Q is a compact Hausdorff space and if (un) and v in (7^(0, R) satisfy for each t G H,0 < un{i) < un+1{t) —> v{t) as n —> oo (7.12.3.1) then ||v-un||—>0 as n—> oo (7.12.3.2) Proof: IfoOandiGfi there are Uf G Nbd(*) and Nf G N for which n>Nf=> v(t) - e < un{t) < v(t) (7.12.3.3) and 6 G U* => max(|t;(«) - v{t)\9 \uNte{s) - uN.{T)\) < e By compactness there is a finite subset (]£C(] for which Q = Now if n > maxten N* and if t G H is arbitrary then there is which t G U*,, and we have < Ht) - v(t')\ + +(v(t') - uN, («')) + |uw. (0 - aw. (t)| t' t' t' <36T ■ (7.12.3.4) *' G H£ for (7.12.3.5)
302 7. Operator Algebra and Commutivity Dini's theorem is a partial converse to Theorem 1.8.2, where C^ft) is proved to be closed in 1^(0,). We need it to see that \z\ is a uniform limit of polynomials on [0,1]. Indeed, if (pn) is defined inductively by setting p0 = 0 and pn+1 =Pn + \{z- p2n) for each n = 0,1,2,... (7.12.3.6) then we have, by induction on ra, 0 < pn{t) < pn+1(*) <y/t if 0 < t < 1 (7.12.3.7) so that by (7.12.3.2) 0< sup Vi-pn{t)—>0 asn=^oo (7.12.3.8) 0<t<l The Stone-Weierstrass theorem is established separately for real and for complex scalars. The theorem for real subalgebras proceeds though the medium of the corresponding theorem for sublattices of the partially ordered space C^H, R): 7.12.4 THEOREM If Q is a compact Hausdorff space of more than one point and if A C C^Q) is a self-adjoint subalgebra which separates points of H, then cl(A) = CUO) (7.12.4.1) Proof: We begin with the real case, and observe that if A separates points of H then it has the following "two-point transitive" property: if a,/? G R and s ^ t m Cl there is a G A with a(s) = a and a(t) = f3 (7.12.4.2) Indeed if 6 G A satisfies 6(5) ^ b(t) we can take v; b{t) - 6(5) H b{s) - b(t) K J Next, we claim that the closure of a subalgebra is a sublattice: {a, 6} C cl(A) => {max(a,6),min(a,6)} C cl(A) (7.12.4.4) Indeed, in view of the identity max(a,6) = |(a + 6+|a-6|) min(a,6) = \{a + b- \a-b\) (7.12.4.5) we need only establish a e A => \a\ e cl(A) (7.12.4.6)
7.12 Continuous Functions 303 But this is clear from (7.12.3.8), since with the polynomials (pn) of (7.12.3.6) we have a G A => \a\ = lim \\a\\pn (^^j G c\(A) (7.12.4.7) The real case of the Stone-Weierstrass theorem will now follow if we can show that (7.12.4.1) holds when A is a sublattice with the two-point transitive property. If this is the case then for arbitrary a G C^Q), e > 0 there are b3 t G A for each pair s, t of distinct points of Q for which b3 t(s) = a(s) and b3jt(t) = a(t). Now ult = {*' € n: &,jt(t') < a{t') +e}e Nbd(s) (7.12.4.8) for each s G H and each t G f2\{s}. By compactness there are finite subsets Hf of H for which {U*ts G H*} s an open cover for ft. Now put &f = min bea t for each t G n (7.12.4.9) Evidently &£ G A and satisfies 6f (t) = a(t) and 64(-) < a(-) + e. Also Ul = {t' G n: bt(t') > a{t') -e}e Nbd(t) (7.12.4.10) for each £ G f2. By compactness there is a finite subset Cle for which {£//:£ G H£} is an open cover for Q. Now put 6* =max6f (7.12.4.11) <Gne * v ; Evidently be G A and satisfies a(.) - e < &(•) < a(-) +e on n (7.12.4.12) This means that ||a — 6e|| < e, and since & is arbitrary, that a G cl(A). Since a G (7^(0) was arbitrary, we have proved that A C C^Q) is dense provided it is a linear sublattice, containing the constants, which has the two-point transitive property. All this applies to c\(A) if A C C^ft) is a subalgebra which separates points of H. Thus, we conclude that c\(A) is dense in, and hence equal to, C00(r2) = C^H, R). Finally, if K = C is the complex field, suppose that A C C^Q) is a subalgebra which separates points of CI: then we claim Re(A) = {Re(a): a G A} C (^(H, R) separates points of n (7.12.4.13) and also that C^Ct, R) fl A = {a G A: a(H) CR}C (^(H, R) is a subalgebra (7.12.4.14)
304 7. Operator Algebra and Commutivity Indeed (7.12.4.14) is immediately obvious, while if 5 ^ t in Cl there is a G A for which a(s) ^ a(t), and now either Re(a(s)) ^ Re(a(*)) or Re(2a(s)) ^ Re(2a(*)) (7.12.4.15) giving (7.12.4.13). If in addition the subalgebra A has the self-adjoint property (7.12.2.12), then also Re(A) = C^H, R) fl A (7.12.4.16) For inclusion one way is trivial; if A is self-adjoint then aeA=> Re(a) = \{a + a*) G (^(H, R) fl A (7.12.4.17) From (7.12.4.13), (7.12.4.14), and (7.12.4.16) and the real case of (7.12.4.1) it follows that C^n.R) =clRe(A) =clC00(f2,R)nA (7.12.4.18) which at once gives the complex case of (7.12.4.1). ■ 7.13 LINEAR FUNCTIONALS AND STATES If A is a real or a complex normed algebra, then its dual space is a Banach space, but not in any obvious way another algebra. It does have "two-sided A-module structure": if a G A and / G A^, we can define (/ . a) (6) = f{a • 6) and (a • /) (6) = f(b • a) for each 6 G A (7.13.0.1) The resulting products satisfy conditions like (1.1.0.2) from the definition of a linear space over the field K. We also get an analogue of the normed algebra condition (1.10.0.1): For each a £ A and each f E A* ||a./||<||a||||/|| and ||/ • a\\ < \\a\\\\f\\ (7.13.0.2) Certain subspaces MCA will have the status of left or right submodules, in the sense that A-MCM or M-ACM (7.13.0.3) This is the case if for example M = J° in the sense of (5.2.0.1), where J C A is a right or a left ideal of the algebra A. If a G A, then we recall the bounded operators La : A -► A and Ra : A -► A of (2.10.0.1) and (2.10.0.2), and how (Theorem 3.10.4) they can be used to decide whether or not the element a is almost invertible; then by Theorem 5.5.2 the same information can be expressed in terms of the dual operators (LaY : A* —> A* and (iJjt^t^At:
7.13 Linear Functionals and States 305 7.13.1 THEOREM If A is a normed algebra and a G A then there is implication {Ra)^ one-one => a almost left invertible => [RaV bounded below (7.13.1.1) and (La)t one-one => a almost right invertible => {La)^ bounded below (7.13.1.2) Also a not a topological left zero divisor 4=> [La)^ almost open (7.13.1.3) and a not a topological right zero divisor 4=> (Ra)^ almost open (7.13.1.4) Proof: The implication (7.13.1.1) is just the combination of (3.10.4.3) with (5.5.2.1) and (5.5.2.2), while (7.13.1.2) is (3.10.4.4) with (5.5.2.1) and (5.5.2.2). For (7.13.1.3) we note that a is a topological left zero divisor if and only if La is not bounded below, and then apply (5.5.3.2). For (7.13.1.4) we repeat the argument with Ra m It might be thought that a more natural "dual" for a normed algebra would be the subset of multiplicative linear functionals on A, the bounded homomorphisms from A to K: 7.13.2 DEFINITION A state on the normed algebra A is a linear functional f £ A* for which 11/11 < 1 = /(l) (7.13.2.1) and a character is a state for which f[a . 6) = f{a)f{b) for each a, 6 G A (7.13.2.2) Of course, unless 1 = 0 and A = 0, the norm of a state must actually equal 1, and the characters are just the bounded homomorphisms from A to K in the sense of (2.10.1.7). The null space of a character is both a closed two-sided ideal of A, and a "maximal proper subspace" of A, and therefore certainly both a maximal left and a maximal right ideal in the sense of Definition 7.2.1. We can improve the first part of Theorem 7.13.1, and determine almost invertibility using states:
306 7. Operator Algebra and Commutivity 7.13.3 THEOREM If A is a normed algebra and a G A, then there is equality [J 0 (/ • »rHO) = *""(A) = {a € A: jrf ||/ • Ra\\ = o} /GState(A) 6GA ^ V ' } (7.13.3.1) and U fl (»• Z)"1 (0) = -rigLt(A) = {« € A: b£ ||/ . L.|| = o} (7.13.3.2) Proof: We begin by proving the first equality of (7.13.3.1). If a G A is in the left-hand side then there is a state / G State(A) for which A-a C /_1 (0), and since the null space of / is closed we have l£/_1(0) Dcl(A.a) (7.13.3.3) so that a is not almost left invertible. Alternatively, this part of the argument is contained in (7.13.1.1). Conversely, if a is almost left invertible, so that 1 §? c\(A • a), then by the Hahn-Banach Theorem (5.4.1.3) there is / G A for which /(l) ^ 0 and A-cC /_1(0). If we inspect the proof of (5.4.1.3) it is clear that we can arrange more, getting '« = 1 and ii'i"***^*1 (7-13-3-4) deriving inequality at the end from (3.10.4.3): ||a - 1|| < 1 => 1 € cl(A .a)<=>a<£ d]e{t{A) (7.13.3.5) This gives the first part of (7.13.3.1). If we combine this with (7.13.1.1) we now have |J (f.A)0Da^(A) /GState(A) 2{aeA:MJ\f.Ra\\=o} 2f6AA)IKI=0} (7-13-3-6) This forces equality throughout (7.13.3.6), and finishes the proof of (7.13.3.1). The argument for (7.13.3.2) is identical. ■
7.13 Linear Functionals and States 307 We conclude with some characterizations of the characters of Definition 7.13.2: 7.13.4 THEOREM If A is a normed algebra and / G A1" satisfies /(l) = 1 then the following are equivalent for each a G A, /(a) = 0 => /(a2) = 0 (7.13.4.1) for each a G A, /(a2) = /(a)2 (7.13.4.2) for each a G A, /(a) = 0 => {f{ab):b G A} = {0} (7.13.4.3) for each a,6 G A, f{ab) = /(a)/(6) (7.13.4.4) Proo/: If /(l) = 1 then for each a G A, /(a - /(a)l) = 0 (7.13.4.5) Combining this with (7.13.4.1) gives 0=/((a-/(a))2) = /(a2)-/(a)2 which is (7.13.4.2). If this is assumed then for each a, 6 G A we have f{ab + 6a) = 2/(a)/(6) and hence implication /(a) = 0 => f{ab + 6a) = 0 => f{{ab + 6a)2) = 0 (7.13.4.6) Since (a6 — 6a)2 = 2a(6a6) + 2(6a6)a — 2(a6 + 6a)2 we have also implication /(a) = 0 => f{{ab - 6a)2 = 0 => f(ab - 6a) = 0 (7.13.4.7) Adding (7.13.4.6) and (7.13.4.7) gives (7.13.4.3). If this is assumed then, using (7.13.4.5) again, 0 = /((a - f(a))b) = f(ab - f{a)b) = f(ab) - f(a)f(b) which is (7.13.4.4). If this is assumed then (7.13.4.1) is obvious. ■ When the algebra is complete then every multiplicative linear functional is a character: 7.13.5 THEOREM If A is a Banach algebra then HL(A, K) = HBL(A, K) C State(A) (7.13.5.1)
308 7. Operator Algebra and Commutivity Proof: Suppose that / : A —► K is multiplicative and linear, with /(l) = 1. We claim that there cannot exist x £ A for which ||*|| < 1 = f{x) (7.13.5.2) Indeed, if (7.13.5.2) holds, then OO y = Y,xU => y = x(x + y) (7.13.5.3) n=l and now /(y) = f{x) + fix)fiy) = 1 + /(y) (7.13.5.4) Since (7.13.5.4) is impossible, (7.13.5.2) cannot happen, which means that / must be a state. ■
8 Inner Products and Orthogonality There is a point of view which says that the only normed spaces of any significance are those in which the norm is derived from an "inner product." 8.1 INNER PRODUCTS An inner product on a linear space X over the field K is a certain kind of mapping of pairs of vectors into scalars: 8.1.1 DEFINITION An inner product on the linear space X is a mapping (;) : X x X —► K for which, if x, y, z G X and s, t G K, (sx + ty\z) = s (x \z)+t (y ; z) (8.1.1.1) (y ; x) = (x ; y)~ the complex conjugate of (x ; y) (8.1.1.2) (x ; x) > 0 (8.1.1.3) (x;x) =0=>x = 0 (8.1.1.4) Note that if K is the real field R then the complex conjugate of (x ; y) is just (x; y). From (8.1.1.1) and (8.1.1.2) it is clear that (z ; sx + ty) = s(z;x) + i(z ; y) (8.1.1.5) Thus, an inner product is linear in the first variable and "conjugate linear" in the second variable, as defined in (5.1.2.8). If (;) is an inner product on X we shall write y*{x) = (x ; y) for each x,y G X (8.1.1.6) 309
310 8. Inner Products and Orthogonality Evidently the mapping y* is a linear functional on X, and the mapping y —► y* : X —► LK{X, K) is conjugate linear in the sense of (5.1.2.8). An inner product induces a norm: if x G X we shall write ||s|| = <x;s>1/2 = V^> (8.1.1.7) Of course this definition is made possible by (8.1.1.3). We claim that ||-|| really is a norm: 8.1.2 THEOREM If (;) is an inner product on the linear space X then ||-|| is a norm, with the additional quadratic property that ||x + y||2 + ||x-y||2 = 2W2 + 2||y||2 (8.1.2.1) for each x,y G X. Conversely if ||-|| is a norm on the linear space X which satisfies the quadratic property then there is an inner product (;), given by the formula 4Re(x;y) = ||x + y||2 - ||z-</||2 for each x,y G X (8.1.2.2) from which the norm ||*|| can be derived by the formula (8.1.1.7). Proof: If (;) is an inner product and ||-|| is given by (8.1.1.7), then the conditions (1.1.1.1), (1.1.1.2), and (1.1.1.3) of Definition 1.1.1 are evident. We must prove the triangle inequality (1.1.1.4). Explicitly, we must prove that if x, y G X, then ||x + y||2<W2 + 2||x||||y|| + ||y||2 (8.1.2.3) Since Ik + y||2 = {x + y;x + y) = ||x||2 + (x ; y) + (y ; x) + \\y\\2 = ||x||2 + 2Re(x;y) + ||y||2 it is evidently sufficient to prove the Schwarz inequality |<*;y>|<NI||y|| (8.1.2.4) Necessary and sufficient is that (8.1.2.4) should hold when \\y\\ = 1: in this case the intuitive idea of the argument is conveyed by the following diagram inR2:
8.1 Inner Products 311 ToTT^I The diagram of course is not necessary for the proof: if ||y|| = 1 then ||x||2 - | (x ; y> |2 = \\x -(x;y) y\\2 > 0 (8.1.2.5) We have, therefore, shown that an inner product space does indeed carry a norm, and it is now an easy exercise to verify that the norm ||-|| must satisfy the quadratic property (8.1.2.1), and that also the (real part of the) inner product can be expressed in terms of the norm by means of the formula (8.1.2.2). If, in particular, X is a complex space, and if we know the inner product to be linear in its first variable, then the argument of Theorem 5.1.2 shows that (x ; y) = Re (x ; y) - i Re (ix ; y) (8.1.2.6) Thus in the complex case (8.1.2.2) implies 4 (x ; y) = \\x + y\\2 - \\x - y\\2 + i\\x + iy\\2 - i\\x - iy\\2 (8.1.2.7) Suppose finally that ||-|| is a norm on the space X which satisfies, in addition to the conditions of Definition 1.1.1, the quadratic property (8.1.2.1), and then consider the mapping Re(;) : X X X —► R given by the formula (8.1.2.2). It is clear that the conditions (8.1.1.3) and (8.1.1.4) of Definition 8.1.1 are satisfied (with Re(;) in place of (;)), and also that the relationship (8.1.1.7) holds. Towards (8.1.1.1), it is also clear that Re (tx ; ty) = \t\2 Re (x ; y) for each t G K We claim that also, for each x,y,z G X, (8.1.2.8) Re (x + y ; z) = Re (x ; z) + Re (y ; z) (8.1.2.9)
312 8. Inner Products and Orthogonality for we have 4Re(x;y)+4Re(y;z} = \\x + z\\2-\\x-zf + \\y + z\\2-\\y-z\\2 = (||x + 2||2 + ||y + 2||2)-(||x-2||2 + ||y-^||2) = l(\\x + y + 2z\\2 + \\x-y\\2) -!(||z + y-2*||2 + ||z-y||2) using (8.1.2.1) = L\\x + y + 2yf-±\\x + y-2z\\2 = 2||I(x + y)+2||2-2||I(x + y)-2||2 = 8Re(±(x + y);*) In particular, taking y = 0, 4Re(x;z)+0 = 8Re(±x;z) (8.1.2.10) and then putting this (with x + y in place of x) back in the previous expression gives (8.1.2.10). Together (8.1.2.10) and (8.1.2.11) give (8.1.1.1) for arbitrary rational s and t. The proof of (8.1.1.1) for real s and t now follows by the continuity of the mappings x-+\\x + y\\* -\\x-y\\2 :X -+R for each y G X. Explicitly, if s G R and \s — sn\ —► 0 with rational (sn) then 4Re(6x;y)-4Re(6nx;y) = ||6X+y||2-||6x-y||2-||6nx+y||2 + ||6nx-y||2 and Ulac + y||2 - ||snx + y\\2\ = \\\sx + y|| - ||«nx + y||| x(||«x + y|| + ||«nx + y||) < |||5x + y|| - ||snx + y||| x (2||«x + 2/|| + Hlac + 2/|| - ||«nx + y|||) <|6-6n|||x||(2||6x + y|| + |6-6n|||x||) using (1.2.0.11) —► 0 as n —► oo ■ The fact that a quadratic norm can be derived from an inner product is of almost no practical use to us. The fact that the inner product from which such a norm can be derived can be expressed in terms of the norm is much
8.2 Orthogonality 313 more useful. The formulas (8.1.2.2) and (8.1.2.8) are sometimes termed "polarization." The quadratic property (8.1.2.1) has a nice interpretation in R2: it says that the sum of the squares on the diagonals of a parallellogram is equal to the sum of the squares of the sides. We met an early example of an inner product space in (1.6.0.3): if Y = Z = K, then ||(y,*)||2 = (|y|2 + H2)1/2 = (yj/+^)1/2 = «V,*) 5 (v,*)> where <(y, z); (y', *')> = yy' + ^' if y, *,v\^eK (8.1.2.11) evidently satisfying the conditions of Definition 8.1.1. More generally the norm ||-||2 introduced in (1.9.0.3) also comes from an inner product, defined by setting (x ; y) = linJ ]T x(t)y(t)~: finite O0Cf] (8.1.2.12) [ten0 J The reader will remember that we gave a foretaste of the proof of the Schwarz inequality (8.1.2.4) in Theorem 1.6.1. 8.2 ORTHOGONALITY If X is an inner product space, then vectors x, y in X for which <s;y> = 0 (8.2.0.1) will be called orthogonal or perpendicular. We shall also write x JL y (8.2.0.2) If K C X is a nonempty subset we shall write KL = {y eX:{x\y)=0 for each y G K} (8.2.0.3) and refer to K1- as the orthogonal complement of K. The orthogonal complement has properties similar to those of the annihilator K° of a sub- space K: 8.2.1 THEOREM If X is an inner product space and i,t/Gl, then y ±x<=>x±.y (8.2.1.1) and x JL x =► x = 0=^ x±.y (8.2.1.2)
314 8. Inner Products and Orthogonality If K and H are nonempty subsets of X, then K1- = [K')L (8.2.1.3) is a closed subspace of X, where K' is the closed subspace generated by K, and there is implication K<ZH=>HL <ZKL (8.2.1.4) K C KLL (8.2.1.5) and Kfl^CjO} (8.2.1.6) Finally {0}± = X and X± = {0} (8.2.1.7) Proof: The first two implications follow from the Definition (8.2.0.2), and the arguments for the rest are the same as the arguments of Theorem 5.2.2. ■ We could also have included in Theorem 8.2.1 the observation that K1- = KLLL (8.2.1.8) analogous to (5.2.2.3). Also, if H and K are nonempty subsets, or sub- spaces, then [K U H)L = {K + H)L =KLf\HL (8.2.1.9) One consequence of orthogonality is "Pythagoras' theorem": xJLy^||x + y||2 = ||x||2 + |M|2 (8.2.1.10) The reader will remember this from (8.1.2.6). Generalizing (8.2.0.2), we shall refer to a system [xAjeJ of vectors in X as orthogonal iff ,-94i=>(z,.;xy) = 0 (8.2.1.11) and as orthonormal, iff in addition i = j=>(xi;xi) = l (8.2.1.12) giving therefore a pairwise orthogonal system of unit vectors. An orthonor- mal system of vectors is always linearly independent:
8.2 Orthogonality 315 8.2.2 THEOREM If X is an inner product space and {xj)jej is a system of vectors in X, then ixj)jeJ orthonormal =$■ [xj)jeJ linearly independent (8.2.2.1) If (sn)nGN is a linearly independent sequence in X, then there is an or- thonormal sequence (yn)nGN which is linearly equivalent, in the sense n n ]T Kyy = ]T Kxj for each n e N (8.2.2.2) y=i y=i Proof: To say that [xj)jeJ is linearly independent is to say (6.1.0.1) that (xy)y£Jo *s ^nearly independent for each finite subset J0 C J. If (xy)jGJ is orthonormal and J0 C J is finite and (*y)yej0 is arbitrary in K, then ]T *jxj = ° => *fc = X] (*jxj ; xk) = \Y1 lixi ' xfc / = <° '» xk) = ° jGJo j'GJo \jGJo / for each k £ J0, which proves (8.2.2.1). Toward (8.2.2.2), we construct two sequences [zn) and (yn), starting with * = 2i = ii?ii (8-2-2-3) llxill noting that x1 ^ 0 since (xn) is linearly independent, and then z2 = x2- (x2 ; yx) yx and y2 = y-^- ll22ll noting that z2 ^ 0, since otherwise (x1,x2) would be linearly dependent. Inductively, if (t/^yj,... , yn) is orthonormal and linearly equivalent to (x1,x2,...,xn), define n *n+1 = xn+1 - £ (xn+1 ; y ■) y ■ and yn+1 = -^±1- (8.2.2.4) y = 1 IFn+lll Evidently zn+1 7^ 0, since otherwise xn+1 would be a linear combination of (x1,x2,...,xn), and then Hy^J = 1 and (yn+i 5 ty) = ° for each i = 1,2,..., n Finally, the two systems (x1?x2,... ,xn,xn+1) and (yi,y2, • • • 52/n>2/n+i) are linearly equivalent. ■
316 8. Inner Products and Orthogonality The construction of (yn) is called the Gram-Schmidt process. As well as "orthonormalizing" a linearly independent sequence (xn), it also serves to detect linear dependence: if (x1?x2,... ,xn) is linearly independent and (x1,x2,..., xn, £n+1) is linearly dependent, then (y1? y2? • • • ? 2/n) W1^- be or~ thonormal and linearly equivalent to (x1? x2,..., xn) and zn+1 = 0. If the Gram-Schmidt process is just a generalization of the argument (8.1.2.6) used in the proof of the Schwarz inequality, then it should come as no surprise that there is a corresponding generalization of the Schwarz inequality, known as (the finite) BesseVs inequality. 8.2.3 THEOREM If (ey)yGJ is a finite orthonormal system in an inner product space X, then for arbitrary x £ X there is inequality £l(*;<v>2<NI (8.2.3.1) with implication Ml2 = E I <* s e;> I2 <=►x e E Kei (8-2-3-2) and membership (8.2.3.3) Proof: By induction on the cardinal number of the set J the argument for the Schwarz inequality and the Gram-Schmidt process gives Ml8 = E !<*!«/> I' + x-^(x;ey)ey (8.2.3.4) To see it directly just expand the second term on the right-hand side, using the orthonormality of the sequence (e,-)yGj. This gives the inequality (8.2.3.1), and also implication one way in (8.2.3.2). If there is equality in (8.2.3.1), then jeJ jeJ The inclusion (8.2.3.3) is also clear, if we just write down the inner product of the right-hand side with each ek. Finally, to finish the proof of (8.2.3.2),
8.3 The Nearest Point Theorem 317 suppose that x = YljeJ *jej ^ ^jeJ ^ej' an(* °^serve tk = (x ; ek) for each k £ J and hence ||s||2 = <s;s> = £|(s;ey>|2 . jeJ 8.3 THE NEAREST POINT THEOREM Certain subsets of an inner product space have "nearest points" to every other point of the space: 8.3.1 THEOREM If K C X is a nonempty complete convex subset of an inner product space X, then for each x £ X there is a unique point y = EKx £ X for which yeK and ||x - y\\ = dist(x, K) (8.3.1.1) The mapping EK is idempotent and continuous. Proof: This follows by a combination of the convexity of the set K and the quadratic property of the norm. By definition of dist(x, K) there must be a sequence (yn) in X for which yneK and ||x -yn\\ —► dist(x,K) (8.3.1.2) Rather unexpectedly, we claim (yn) is Cauchy in X (8.3.1.3) For each m and n in N we have o<l|y„-ym||2 = ll(yn-s)-(ym-s)ll2 = 2||yn - x||2 + 2||ym - x||2 - ||yB + ym - 2x||2 = 2||yB - x||2 + 2||ym - x||2 - 4|| \{yn + yj - x||2 < 2||yB - x\\2 + 2\\ym - x\\2 - 4 dist(x,iT)2 since by convexity \{yn + ym) € K, and now this last expression tends to 0 as m,n —> oo. Thus, (yn) is Cauchy in X, and lies in the complete subset K, and therefore has a limit y € K; evidently ||i — y\\ = lim||a; — yn\\ = dist(i,iif)
318 8. Inner Products and Orthogonality This proves the existence of y G X satisfying (8.3.1.1). If also yf satisfies (8.3.1.1), then 0 < \\y' - y\\2 = 2\\y' - x\\2 + 2\\y - x\\2 - 4||±(y' + y) - x\\2 < 0 so that y' = y. This proves the uniqueness of y G X satisfying (8.3.1.1), and entitles us to speak of the mapping EK : x —► y. To see that EK is continuous we follow the same kind of argument. If y = EKx and yf = EKx\ then 0 < \\yf - y\\2 = 2||y' - s'||2 + 2\\y - s'||2 - 4|| \{y + y') - x'f < 2dist(x/,JfiT)2 + 2\\y - x'||2 - 4dist(x/,JfiT)2 <2(||y-x|| + ||x-x'||)2-2dist(x',ii:)2 < 2| dist(x,K)2 - dist(x',K)2\ + 2\\x - x'\\ dist(x, K) + ||x - x'\\2 —> 0 as x1 —► x remembering that dist(x, K) is a continuous function of x. ■ When the complete subset K is also a linear subspace then we find that the mapping EK is also linear, and bounded: 8.3.2 THEOREM If K is a complete subspace of an inner product space X then EK is linear, with H-Ej^ll < 1. There is equality K + KL = X and K n KL = {0} (8.3.2.1) and KLL = K (8.3.2.2) Proof: If x G X is arbitrary and y = ^x, then we claim x-yeK1- (8.3.2.3) For if y' G if and t EK are arbitrary, then y — tyf e K, so that ||x-y||2 = dist(x,y)2<||x-y + V||2 giving ||*y'||2 + 2 Re(* (x - y ; y')) > 0 (8.3.2.4) If, in particular, t = k(y';x-y) with A; G R
8.3 The Nearest Point Theorem 319 we get I (x - y; y')2 (\\y'\\2k2 + 2fc) > o (8.3.2.5) Taking -6 < k < 0 for sufficiently small 6 > 0 forces ||y'||2fc2 + 2k < 0, which means that the only way to salvage (8.3.2.5) is to have \(x-y,y')\2 = 0 (8.3.2.6) Since y' £ K is arbitrary, this gives (8.3.2.3). If we combine (8.3.2.3) with (8.2.1.6) we get (8.3.2.1). Thus (8.3.1.1) can be rewritten in the form yeK and x-yeKL (8.3.2.7) This shows that EK = n^ coincides with the linear mapping of (2.5.1.10), "the projection of X upon K in the direction of K-1" To see its bound we recall the Pythagoras theorem (8.2.1.10): if y = EKx, then \\EKx\\2 = ||y||2 < ||y||2 + ||* - y||2 = ||z||2 (8.3.2.8) This has proved everything except (8.3.2.2), which is now easy. If x G KJ-L is arbitrary and y = EKx, then x and y are both orthogonal to x — y, and hence x — y JL x — y, giving x = y G K. ■ Equation (8.3.2.1) tells us that K C X is supplemented, in the sense of (1.6.1.5). Of course, the continuity of EK already tells us the stronger fact that K is complemented. We usually refer to EK as "the orthogonal projection on X upon K." Since the only time we ever want to know that EK is continuous is in the linear case, the slightly tedious argument at the end of Theorem 8.3.1 can be ignored. Unless K = {0} we recall from (2.5.1.10) that the norm of a projection on K is never less than 1. Thus K±{0}^\\EK\\ = l (8.3.2.9) We are now in a position to also substantiate some of the remarks following the proof of Theorem 1.5.3: if K C X is a complete subspace of an inner product space, then the "Riesz lemma" result (1.5.2.1) can be sharpened to hold with t = 1. We conclude here with a sort of converse to (8.2.1.3): 8.3.3 THEOREM If K, if, and G are subspaces of an inner product space X, then there is implication HCK + G^HnK-LcG (8.3.3.1)
320 8. Inner Products and Orthogonality If, in particular, K is complete, then also KCH^HCK+iHHK-L) (8.3.3.2) Proof: For (8.3.3.1) we argue H fl K1- C (K + G) fl K1- = G fl if-1 C G For (8.3.3.2) we use (8.3.2.1) to write, for each y G H, y = x + z with x E K and 2 G KL Now if if C H we have y-i = 2G(if + F)nxi = J?nifi giving yex+{Hf\KL)<ZK + {Hf\KL) ■ 8.4 COMPLETENESS An inner product space which is complete is called a Hilbert space. Since (4.2.1.4) closed subsets of complete spaces are complete, Theorems 8.3.1 and 8.3.2 became slightly simpler: 8.4.1 THEOREM If K C X is a nonempty closed convex subset of a Hilbert space, then there is a mapping EK : X —► X which is continuous and satisfies EK{x) e K and dist(x,K) = ||x - ^(x)ll f°r eacn x e X (8.4.1.1) If, in particular, if is a closed subspace of X, then K is complemented and EK = tt^ e BL(X, X) (8.4.1.2) Proof: This is just a summary of Theorems 8.3.1 and 8.3.2. ■ When X is a Hilbert space, then Bessel's inequality extends to arbitrary orthonormal systems, and leads to an alternative construction of the orthogonal projections EK:
8.4 Completeness 321 8.4.2 THEOREM If {ej)jej is an orthonormal system in a Hilbert space X, then for arbitrary iGX there is inequality ]TKs;ey>|2<||s||2 (8.4.2.1) with implication ll*U2 = £ I <* 5 «y> I2 <=> * € cl I £ Key I (8.4.2.2) yej V€J y The generalized sequence [Yljej (x i ey) ey: finite J0 C J) converges, with *"£<*; <V> <V € {ey:y € J}"1 = [ £ Key (8-4-2-3) j€J \jeJ J Proof: Recognizing the left-hand side of (8.4.2.1) as the supremum over finite subsets J0 C J makes (8.4.2.1) an immediate consequence of (8.2.3.1). This in turn ensures that the numerical sequence E-Gj | (x; e;) |2) (finite J0 C J) is Cauchy in the sense of Definition 4.1.1. It next follows easily that the vector-valued sequence {Yl3ej0 (x 5 ey) ey) is Cauchy in X. If X is complete, then Theorem 4.2.2 ensures that this Cauchy generalized sequence converges in X. From (8.2.3.4) it follows IWI2 = £ I <* 5 <V> I2 + »*-£<*; «y> «ylla (8-4-2.4) From (8.4.2.4) we get implication one way in (8.4.2.2), exactly as in Theorem 8.2.3, and also membership (8.4.2.3). Finally if x is in the closed subspace generated by the (e-) then the generalized sequence (x — YljeJo (x *' ey) ey) converges to 0 as J0 —> J, so that (8.4.2.4) gives equality in (8.4.2.1). ■ Everything we can say about orthonormal "systems" applies to orthonormal sets: we just take the index set J to be a subset of the space X. Under the partial order of inclusion, orthonormal subsets of an inner product space X clearly satisfy Zorn's condition (1.11.1.2). Thus, by Zorn's lemma (1.11.2) orthonormal subsets of X are contained in maximal orthonormal sets (8.4.2.5) Theorem 8.4.2 gives us the nicest possible characterization of maximal sets:
322 8. Inner Products and Orthogonality 8.4.3 THEOREM Necessary and sufficient for the orthonormal set J C X in the Hilbert space X to be maximal is that X = cl^Ke (8.4.3.1) eeJ Proof: If (8.4.3.1) fails then we can find x G X for which ||x|| = 1 and xeJL (8.4.3.2) For take x = x1/\\x1\\ where xx = XQ—YleeJ (xo > e) e w^n xo & c^YljeJ ^e* Evidently the set J U {x} is orthonormal and properly contains J. Conversely, if this is so, then equality must fail in (8.4.2.1), and hence by (8.4.2.2) x is not in the closed subspace generated by J. ■ Theorem 8.4.3 leads us to call maximal orthonormal subsets of Hilbert space orthonormal bases. To obtain an alternative expression for the orthogonal projection EK on a closed space if C X we have only to find an orthonormal basis {^j)j^.j for K. Then we have ek{x) = YL (x ' ej) ei for each x e X (8.4.3.3) Of course many different bases for K will give the same final answer. The complemented subspace property means that different kinds of nonsingularity coincide in a Hilbert space: 8.4.4 THEOREM If T G BL(X, Y) is a bounded linear mapping between Hilbert spaces, then T bounded below =► T left invertible (8.4.4.1) and T almost open =► T right invertible (8.4.4.2) More generally T proper => T regular (8.4.4.3) Proof: The last part (8.4.4.3) follows from Theorem 3.8.2 and Theorem 8.4.1. Now this together with (3.3.7.3), (4.4.4.1), and (3.8.3.12) gives (8.4.4.1), and together with (3.4.7.2), (4.4.4.2), and (3.8.3.13) gives (8.4.4.2). ■
8.5 Duality 323 In the notation of (3.3.1.3), (3.4.1.4), (3.6.1.3), and (3.6.1.4) this is rleft BL(X, Y) = aleft BL(X, Y) (8.4.4.4) and f rieht BL(X, Y) = aright BL(X, Y) (8.4.4.5) 8.5 DUALITY A Hilbert space furnishes its own dual: 8.5.1 THEOREM If X is a Hilbert space there is isometric conjugate- linear isomorphism y <—* y* : X = X^ (8.5.1.1) given by the formula y* (x) = (x ; y) Proof: The formula is (8.1.1.6), which associates with each vector y £ X the (possibly discontinuous) linear functional y* : X —► K; now the Schwarz inequality (8.1.2.4) shows that y* is always continuous, with ||y*|| < ||y|| for each y eY (8.5.1.2) and then equality is obtained in (8.5.1.2) by applying y* to the vector x = y: l|y||2 = v*(v)<l|y||||yl (8.5.1.3) The mapping y —► y* : X —► XMs therefore conjugate linear and isometric, but possibly not onto. If, however, 0 ^ f £ X^, we can find y0 £ X with /(y0) = l and y0 e /^(O)1- (8.5.1.4) This uses Theorem 8.3.2. We now claim If x £ X is arbitrary we can write x = {x - f{x)y0) + f(x)y0 (8.5.1.6) and find y*(x) = {x- f(x)y0 ; y0/lkl|2>+/(*) <% 5 Vo/IIVoll2> = 0+/(x) (8.5.1.7) since y0 is orthogonal to x — /(«)y0 ^ /_1(^)- ■
324 8. Inner Products and Orthogonality Under the correspondence y —► y* the orthogonal complement of a subspace Y C X is mapped onto the annihilator Y° of the same subspace. y* (x) = 0 for each x eY <=» y JL x for each iG7 (8.5.1.8) The resemblance between Theorems 8.2.1 and 5.2.2 is therefore no accident. The Hahn-Banach Theorem 5.3.1 is obvious for a Hilbert space: if f0 for a subspace Y C X, then by Theorem 8.5.1 we have / = y* for some vector y G cl(Y), where for the moment we interpret y* as the restriction to Y of the mapping x —► (x ; y). Clearly, however, the same vector y defines a linear functional y* on the whole of X. If T G BL(X, Y) is a bounded linear mapping between Hilbert spaces then Theorem 8.5.1 shows that the dual mapping T^ : Y* —► X* can be transferred to the spaces Y and X: 8.5.2 DEFINITION If X and Y are Hilbert spaces, then the adjoint of T G BL(X, Y) is the mapping T* : Y -+ X given by (r*y)* = y* • r = rf(y*) for each y G Y (8.5.2.1) Equivalent ly, (x ; T*y) = (Tx ; y) for each xeX,yeY (8.5.2.2) Note that the inner products on each side of this may be defined in different spaces. The adjoint has the same sort of properties as the dual; abstractly the purely algebraic properties are those of an "involution." 8.5.3 THEOREM If X and Y are Hilbert spaces, then the mapping T —► T* : BL(X, Y) —► BL(Y, X) is conjugate linear and isometric. There is equality (r*)*=r (8.5.3.1) and {sT + tS)* = sT* + tS* (8.5.3.2) for each s,t G K and each S,T G BL(X, Y). If also U G BL(Y,Z), then {UTY =T*U* (8.5.3.3) Also ||r|| = ||T|| (8.5.3.4)
8.6 Positive Operators 325 and ||r*r|| = ilHlimi = llr||2 (8.5.3.5) Proof: Out of all this only (8.5.3.5) need detain the reader more than a moment: we shall give a proof of (8.5.3.5) in Theorem 8.6.3 below. ■ The identity 7": X —► X is its own adjoint: I* = 1 (8.5.3.6) The invertibility or singularity of T G BL(X, Y) is determined by that of its adjoint. From (5.5.1.4) and (8.4.2.1), or directly, it is clear that r_1(0) = [T*Y)L C X (8.5.3.7) and cl(rX) = [T*)-l{0)L C Y (8.5.3.8) This gives about a third of the next result. 8.5.4 THEOREM If T G BL(X, Y) between Hilbert spaces, then T dense 4=> T* one-one (8.5.4.1) and T almost open 4=> T* bounded below (8.5.4.2) Proof: This is Theorem 5.5.2 and Definition 8.5.2. ■ In the notation of (3.2.1.4), (3.2.1.5), (3.3.1.3), and (3.4.1.4) this is 7rrightBL(X,Y)* =7rleftBL(Y,X) (8.5.4.3) and fright BL(X, Y)* = fleft BL(Y,X) (8.5.4.4) 8.6 POSITIVE OPERATORS As well as its obvious similarity to the dual operator T^, the adjoint T* of T G BL(X, Y) has some of the flavor of a generalized inverse for T. We begin by specializing to the situation in which Y = X: here the adjoint is a bit like complex conjugation.
326 8. Inner Products and Orthogonality 8.6.1 DEFINITION If X is a Hilbert space we shall say that T G BL(X, X) is hermitian, or self-adjoint, iff T* = T (8.6.1.1) and that T G BL(X, Y) is positive iff in addition (Tx ; x) > 0 for each x G X (8.6.1.2) For example, the identity / : X —► X is positive, and real scalar multiples of i" are self-adjoint. When the field K is the complex field C then (8.6.1.1) is a consequence of (8.6.1.2): 8.6.2 THEOREM If X is a complex Hilbert space and T G BL(X,X), then T = 0 4=> (Tx ; x) = 0 for each x G X (8.6.2.1) and T = T* 4=> (Tx ; x) G R for each x G X (8.6.2.2) Proof: Whether K is real or complex, the following polarization identity holds for arbitrary T G BL(X,X): 4Re (Tx ; y) = (T(x + y) ; x + y) - (T{x -y);x-y) for each x,y G X (8.6.2.3) The reader need only expand the right-hand side. In the complex case it follows, as in (8.1.2.7), that 4 (Tx ; y) = (T{x + y) ; x + y) - (T{x -y)]x-y) + * (T(x + iy) ; x + *y) - * (T{x - iy) ; x - *y) (8.6.2.4) This makes (8.6.2.1) immediately obvious, and (8.6.2.2) not very hard to see. ■ Theorem 8.6.2 fails in R2: the mapping T : (s,t) —> (—t,s) of rotation through ^7r satisfies the right-hand side of (8.6.2.1) but not the left-hand side of (8.6.2.2). If T G BL(X, Y) then the operators T*T : X -> X and TT* : Y -> Y are both "positive:" 8.6.3 THEOREM If T G BL(X,Y), then T*T G BL(X,X) is positive, and if also S G BL(X, X) is positive, then for each x £ X ||r*||2<||(rr + s)z||H (8.6.3.1)
8.6 Positive Operators 327 and ||5x||2 < \\S\\(Sx;x) (8.6.3.2) and ||(||5||-5)x||2<||5||(||5||||x||2-(5x;x)) (8.6.3.3) Proof: For (8.6.3.1) we observe ||rx||2 = (Tx ; Tx) = (T*Tx ; x) < (T*Tx ; x) + (Sx ; x) = ({T*T + S)x ; x) < \\(T*T + 5)x||||x|| (8.6.3.4) Taking the supremum over ||x|| < 1 gives ||r||2 < ||r*r + 5|| (8.6.3.5) Specializing to S = 0 proves the formula (8.5.3.5) from Theorem 8.5.3. For (8.6.3.2) we argue, using (8.5.3.5), ||r*rx||2 < ||r*||2||rx||2 = ||rf (Tx; Tx) = \\t*t\\ (t*tx ; x) (8.6.3.6) From (8.6.3.2) it is easy to see (8.6.3.3): the reader need only expand the left-hand side. ■ Theorem 8.6.3 will enable us to express the invertibility of T in terms ofrTandrr*: 8.6.4 THEOREM If T G BL(X, Y) and if S e BL(X, X) is positive, then T bounded below => T*T + S invertible (8.6.4.1) and X ^ {0} =► T*T - \\T\\2I <£ BL_1(X, X) (8.6.4.2) Proof: If T is bounded below, then (8.6.3.1) says that T*T + S is also, which means by (8.4.4.1) that it is left invertible and hence, being self- adjoint, invertible. For (8.6.4.2) find a sequence (xn) of unit vectors in X for which ||Txn|| —> \\T\\ and substitute them into the right-hand side of (8.6.3.3) with S = T*T - \\T\\2I. m Theorem 8.6.4 has implications for self-adjoint operators: if S = S*, then S one-one 4=> S dense 4=> S one-one and dense (8.6.4.3)
328 8. Inner Products and Orthogonality and S bounded below 4=> S almost open 4=> S invertible (8.6.4.4) 8.6.5 THEOREM If T e BL(X,Y) is bounded linear between Hilbert spaces, then T one-one 4=> T*T one-one and dense (8.6.5.1) and T dense 4=> TT* one-one and dense (8.6.5.2) Also T bounded below 4=> T*T invertible (8.6.5.3) and T almost open 4=> TT* invertible (8.6.5.4) Proof: From (8.6.3.1) it follows that, if S e BL(X,X) is positive, (r*r + s,)-1(o) cr_1(o) c (r*r)_1(o) (8.6.5.5) Specializing to S = 0 gives equality, and hence proves (8.6.5.1). Taking adjoints in (8.6.5.1) gives (8.6.5.2). Taking S = 0 in (8.6.4.1) gives (8.6.5.3), and taking adjoints in (8.6.5.3) gives (8.6.5.4). ■ We have a little more to say about self-adjoint operators: 8.6.6 THEOREM If S e BL(X, X) for a Hilbert space X, then S = S* 4=> ||S||J - S positive (8.6.6.1) If K = C is the complex field, then 5 = 5* =► {S + il, S - il} C BL"x(X, X) (8.6.6.2) Proof: If ||5||/ — 5 is positive then, in particular, it is self-adjoint, which forces S* = S. Conversely, if 5* = 5, then ||5||/—S is also self-adjoint, and then also positive because {Sx;x) < ||5||||x||2. For (8.6.6.2) we observe, using S* = 5, (5 + iI)(S -U) = S*S + I={S- iI){S + il) (8.6.6.3) and apply (8.6.4.1). ■
8.7 Regularity 329 8.7 REGULARITY When an operator between Hilbert spaces is regular in the sense of Definition 3.8.1, then we can arrange that the induced projections P and Q are orthogonal: 8.7.1 THEOREM If X and Y are Hilbert spaces and T G BL(X,Y), then T e rBL1(X, Y) 4=> T* e W:{Y, X) 4=> T*T e rBL1(X, X) (8.7.1.1) If T e rBL1(XJ Y) is regular there is unique S e BL(Y, X) for which T = TST S = STS [ST)* = ST {TS)*=TS (8.7.1.2) Proof: The first part of (8.7.1.1) is immediate: T = TST => T* = T*S*T* (8.7.1.3) To reverse the implication (8.7.1.3), substitute T* for T. To see that the last part of (8.7.1.1) implies the first, we claim rT = {T*T)U{T*T) =^T = T{UT*)T (8.7.1.4) To see this, argue that the first part of (8.7.1.4) implies {T{UT*)T-T)*{T{UT*)T-T) = T*T-T*T-T*T + T*T = 0 (8.7.1.5) which with (8.5.3.5) gives the right-hand side of (8.7.1.4). Conversely, to reverse this implication, assume (8.7.1.2), which gives {T*T){SS*){T*T) = T*{TS){TS)*)T = T*{TS)2T = T*{TST) = T*T (8.7.1.6) This finishes the proof of (8.7.1.1), provided we can prove (8.7.1.2). To satisfy (8.7.1.2), find projections P = P2 e BL(X,X) and Q = Q2 e BL(Y, Y), using Theorem 8.3.2, with P* = p P-1(0) = r_1(0) Q* = Q Q{Y) = c\{TX) (8.7.1.7) and then define S = V as in (3.8.2.8) from the proof of Theorem 3.8.2. The uniqueness of S follows from the uniqueness of P and Q: to see that P and Q are both unique suppose that orthogonal projections E and F on X have the same range and argue F = FE = F*E* = (EF)* = E* = E m (8.7.1.8)
330 8. Inner Products and Orthogonality If T £ BL(X,Y) is regular, then T*T is decomposably regular: more generally, if S = S* e BL(X,X), then 5-^0) = (5*)"1(0) S X/c\{SX) (8.7.1.9) The analogue of Theorem 8.7.1 holds with "Fredholm" in place of "regular." We prove more: 8.7.2 THEOREM If X and Y are Hilbert spaces and T G BL(X,Y), then T upper semi-Fredholm 4=> T* lower semi-Fredholm (8.7.2.1) and T upper semi-Fredholm 4=> r*T Fredholm 4=> r*T Weyl (8.7.2.2) If T e BL(X,y) is Fredholm, then index(r*) = -index(r) (8.7.2.3) Proof: The implication (8.7.2.1) follows from (8.7.1.1) and (8.5.3.7), while the first implication of (8.7.2.2) follows from (8.7.1.1) and (8.6.5.1). Then the second implication of (8.7.2.2) follows from (8.7.1.9). Finally, (8.7.2.3) follows from (8.5.3.7) and (8.5.3.8). ■ Part of the argument for (8.7.2.1) tells us that T essentially one-one 4=> T* essentially dense (8.7.2.4) When we compare Definition 6.4.1 and Definition 3.9.1, and recall Theorem 6.4.2, we have equivalence T essentially one-one 4=> T not a left zero divisor mod KL0(X, X) (8.7.2.5) and T essentially dense 4=> T not a right zero divisor mod KL0 (Y, Y) (8.7.2.6) When X and Y are Hilbert spaces, and we replace the finite rank operators KL0(X,X) and KL0(Y,Y) by the compact operators KL(X,X) and KL(y, y), then we get left and right Fredholm operators:
8.7 Regularity 331 8.7.3 THEOREM If X and Y are Hilbert spaces and T G BL(X,Y), then T not a left zero divisor mod KL(X,X) 4=> T left Fredholm (8.7.3.1) and T not a right zero divisor mod KL(Y, Y) 4=> T right Fredholm (8.7.3.2) Proo/: If T G BL(X, Y) is left Fredholm, with I-T'Te KL(X, X), then it is upper semi-Fredholm, and by Theorem 6.11.3 we have P(T) one-one. This gives implication, for arbitrary U G BL(X,X), TU G KL(X, Y) => P(r)P(l7) = P(ri7) = 0 ^P([/)=0^I7G KL(X, X) (8.7.3.3) This proves implication one way in (8.7.3.1); the corresponding implication of (8.7.3.2) folllows in the same way. For the converse we recall Theorem 6.12.4: if T G BL(X, Y) is not upper semi-Fredholm, then there is K G KL(X, X) for which (T — jfiT)-1(0) is infinite dimensional. Now, with U = U2 a projection whose range is (T — jfiT)-1(0) we have U & KL(X, X) and TU = KU G KL(X, Y) (8.7.3.4) This reverses the implication of (8.7.3.1), and similar argument does the same for (8.7.3.2). ■ When X and Y are Hilbert spaces we can simplify somewhat the proof of Theorem 6.12.4. If T{X) ^ cl(rX), then neither J"1^)1- nor T(X) can be finite dimensional (Theorem 6.2.3). If we find orthonormal sequences (en) in T-^O) and (/J in T(X), then oo *=E-<0ren£KL(X,Y) n=1 (8.7.3.5) and {em:men}<Z{T-K)-1{0) and K=JT±KTQfn€KL(X,Y) and {fm:m e N} C {T-K){X)L (8.7.3.6)
332 8. Inner Products and Orthogonality 8.8 HILBERT ALGEBRA If A is a linear algebra over K, then an involution on A is a mapping * : A —> A such that, for each a, 6 £ A and each s,t £ K, (*a+*&)* = sa*+f&* (8.8.0.1) {a*)*=a (8.8.0.2) (a&)* = b*a* (8.8.0.3) 1* = 1 (8.8.0.4) 8.8.1 DEFINITION A Hilbert algebra A is a Banach algebra with an involution * for which ||a||2 = ||a*a|| for each a G A (8.8.1.1) The condition (8.8.1.1) is also called "the B* condition," and more usual terminology is to refer to our "Hilbert algebra" as either a "£*- algebra" or a "C*-algebra." Indeed, it is more usual to apply the term "Hilbert algebra" to the rather special situation of a Banach algebra which is also a Hilbert space. Of course the most important example of Definition 8.8.1 is the algebra of operators on a Hilbert space: 8.8.2 THEOREM If X is a Hilbert space, then BL(X,X) is a Hilbert algebra (8.8.2.1) If H is a topological space, then (7^(0) is a Hilbert algebra (8.8.2.2) Proof: If * is denned on BL(X,X) by (8.5.2.1) then (8.8.2.1) is Theorem 8.5.3. If * is defined on C'00(r2) by setting /*(*) = /(*)" for each t G H, for each / G C^Q) (8.8.2.3) then (8.8.2.2) is clear, as may be checked by the reader. ■ It is easily checked that if (8.8.1.1) holds then also ||a*|| = ||a|| for each a £ A (8.8.2.4)
8.8 Hilbert Algebra 333 8.8.3 DEFINITION If A is a Hilbert algebra, then a G A is called normal iff a*a = aa* (8.8.3.1) If, in particular, a* =a (8.8.3.2) then a is called hermitian, or self-adjoint. If instead a*a = 1 = aa* (8.8.3.3) then a is called unitary. We shall define h^(a) = \{a + a*) for each ae A (8.8.3.4) Evidently /i+ : A —► A is bounded and real linear on A, and idempotent. The set h+{A) of self-adjoint elements of A is a real subspace, and if A is commutative a subalgebra: if a = a* and 6 = 6* then (a& = (a&)* 4=> a6 = 6a) (8.8.3.5) In the complex case we can say more: 8.8.4 THEOREM If A is a complex Hilbert algebra, then A = ht(A) +ih+{A) (8.8.4.1) and K{A) Ccl(A_1) (8.8.4.2) Proof: To verify (8.8.4.1) we have only to observe, if a G A is arbitrary, a = 6 + ic with 6 = \{a + a*) and c = -\i{a - a*) (8.8.4.3) Towards (8.8.4.2), we claim that if a = a* G A, then 0^eR=>c + tte A-1 (8.8.4.4) To see this suppose r > ||a||2/2£, so that (r + t)2>\\a\\2+r2 = \\a-ir\\2
334 8. Inner Products and Orthogonality using the B* condition (8.8.1.1), giving a + it = i{r + t)(l- i{r + i)~l{a - *>)) G A~l using the geometric series argument of (4.4.5.15). ■ When A = BL(X,X) and t = 1 or t = -1 then (8.8.4.4) reduces to (8.6.6.2) and (8.8.4.2) combines with (7.3.4.4) from Theorem 7.3.4 to give implication a = a* erA!=^ae A'1 A (8.8.4.5) We saw a special case of this in (8.7.1.9). We conclude here with a characterization of normal operators: 8.8.5 THEOREM If A is a Hilbert algebra and a G A is normal, then ||a*6|| = ||a6|| for each be A (8.8.5.1) If X is a Hilbert space, then necessary and sufficient for T G BL(X, X) to be normal is that ||r*x|| = ||rx|| for each x G X (8.8.5.2) Proof: If a G A is normal and 6 G A is arbitrary, then ||a*6||2 = ||(a*6)*(a*6)|| = ||&*aa*6|| = \\b*a*ab\\ = ||(a&)*a&|| = ||a&||2 which is (8.8.5.1). If T G BL(X,X) is normal and iGl, then ||r*x||2 = (TT*x ; x) = (T*Tx ; x) = ||rx||2 giving (8.8.5.2). For the converse, use (8.6.2.1). ■ Condition (8.8.5.2) implies that (8.7.1.9) holds when S is normal: thus also T normal regular => T decomposably regular (8.8.5.3) and T normal Fredholm =► T Weyl (8.8.5.4) 8.9 ENLARGEMENTS The enlargements of a Hilbert space should really be Hilbert spaces; for the enlargement Q based on the natural numbers N this is almost never so: X ^ 0 =► Q(X) not a Hilbert space (8.9.0.1)
8.9 Enlargements 335 Indeed, if 0 ^ e G X define x and y in l^X) by setting x = (e,0,e,0,...) and y = (0,e,0,e,...) (8.9.0.2) Then, in violation of (8.1.2.1), II* + V\\2 + II* " yf = 2||e||2 * 4||e||2 = 2||x||2 + 2||y||2 (8.9.0.3) A modification in the construction of Q does give rise to Hilbert spaces: the idea is to replace the bounded structure on N used in Definition 1.9.2 by one which is "maximal" in a Zorn's lemma sense. Rather than do that, we can achieve the same effect in a different way: 8.9.1 DEFINITION A Banach limit A : l^ -> K is a linear functional for which and A(x*) = A(x)~ A(x) < limsupxn for each iG^ if x = x* e l^ (8.9.1.1) (8.9.1.2) It is not hard to verify that these conditions imply liminf xn < A(x) < limsupxn (8.9.1.3) n n xn < yn{n £N)^ A(x)n < A(y)n(n € N) (8.9.1.4) y» = Vi(^N)^A(i) = A(y) (8.9.1.5) Thus a "Banach limit" is a possible extension of the limit of a convergent sequence to all bounded sequences. That such things exist at all follows from the proof of the Hahn-Banach Theorem 5.3.2: the argument for Theorem 5.3.1 extends to the situation in which limsup replaces a multiple of the norm. If X is a Hilbert space and if A is a Banach limit on l^ in the sense of Definition 8.9.1, then we can define a Hilbert space enlargement QA{X) by imposing the following "semi-inner product" on the normed space l^X): (x;y)A = A(x. ; y.) for each x,y el^ (8.9.1.6) Exactly as in (5.7.2.2), we are evaluating the functional A at the sequence (x. ; y.) el ao for which ((*.; y.»n = <*»; y»> f°r each * e N (8.9.1.7)
336 8. Inner Products and Orthogonality 1 II The vector space l^X) thus inherits a seminorm x —► (x ; x)A7 : evidently (x ; x)A = 0 -<=> x e c0{X) (8.9.1.8) It follows that the quotient space Q{X) = loo(X)/c0(X) is actually normed by A, although not necessarily complete. To construct QA(X) we now need only complete Q{X) with respect to this norm, as in Theorem 4.5.3. If X and Y are Hiibert spaces and T £ BL(X, Y) is a bounded operator then it is clear how we define QA(T) : QA(X) —► QA{Y). In fact, we can regard QA(T) as the unique extension of Q(T) as defined in (2.7.2.3) from Q{X) to QA(X). It is clear that all the conclusions of Theorem 2.7.3 hold with QA in place of Qn. In addition, QaCO = QA(r) (8.9.1.9) For the record, (3.3.5.2) from Theorem 3.3.5 and (5.7.1.1) from Theorem 5.7.1 hold with QA in place of Q: 8.9.2 THEOREM If T e BL(X, Y) is a bounded linear operator between Hiibert spaces and if A : l^ —► K is a Banach limit, then there is implication QA(T) one-one => T bounded below => QA(T) bounded below (8.9.2.1) and QA(r) dense => T almost open => QA(T) open (8.9.2.2) Hence also T invertible <£=> QA(T) invertible (8.9.2.3) Proof: For (8.9.1.2) repeat the proof of (3.3.5.2), and for (8.9.2.2) apply (8.9.2.1) with T* in place of T. m In practice while it is satisfying that the "enlargement" of a Hiibert space should be another Hiibert space, we will find the original nonquadrat- ic version from Definition 1.9.2 with ft = N does everything we need. The same comments apply to the "essential enlargements" T*{X) and PX(X) of Definition 6.7.4. The reader is invited to see whether he can invent a modified version of P(X) which is always a Hiibert space, and for which he can prove the analogue of Theorem 6.11.3. If he succeeds, he is then invited to see whether his version has been able to tell him anything he has not already learned from P(X).
9 Liouville's Theorem and Spectral Theory The "spectrum" of a linear operator generalizes the set of eigenvalues of a matrix, and the range of a numerical function. 9.1 LIOUVILLE'S THEOREM The differential calculus for mappings from the complex field into a normed space is superficially the same as the real calculus of Definition 5.12.1: 9.1.1 DEFINITION If X is a complex normed space and / : Q -► X is defined on a neighborhood ft C C of the complex number s, then the derivative of / at 5 is the limit If f'{s) exists for each sGflwe shall say that / is differentiable on H. If / is differentiable on Q and if the mapping /' : Q —► X is continuous on H, then / is said to be holomorphic on H. If z : C —> C denotes the complex coordinate we shall also write If mappings x : C —> R and y : C —► R are defined by the identity z = x + iy (9.1.1.3) 337
338 9. Liouville's Theorem and Spectral Theory then we shall define the partial derivatives of / at 5 by setting f£M = lim|Mz/W:yW = y{s),t _> s| (9.1.1.4) and g(S) = lim |MzM:xW = ,(,),, _ SJ (9.1.1.5) Thus if <7(u) = /(u + i/3) and /i(u) = /(a + zu) we have |£(a + »7?) = </'(a) and |^(a + »7?) = &'(/?) (9.1.1.6) where g' and /i' are defined as in (5.12.1.1). It is at once clear that if the complex derivative (df/ dz) (s) exists then so do both the real derivatives {df /dx)(s) and (df /dy)(s): they are also related by the Cauchy-Riemann equation The converse holds, but not for single points: 9.1.2 THEOREM If n G C is open and if / : n -> X maps n into a Banach space X and has continuous partial derivatives df/dx and df/dy for which -L(s)=i-L(s) foreachsGH (9.1.2.1) oy ox then / is holomorphic on H. Proof: Suppose s0 G H and suppose K = {t G C: \t — s0\ < R}: then we claim I—ol<*=>/W = ^/^7 (9-1.2.2) dK and hence ~&5) = 2^ / {I-**)* for each 5 with |5"5°' < * t9-1-2-3) ax
9.1 Liouville's Theorem 339 Here we interpret J gdz= I g(s0 + Reie)i Reie dK -7T do (9.1.2.4) whenever g is continuous from dK to X, using (4.2.2.5) and the completeness of X for the integral. To prove (9.1.2.2) we use a special case of what is known as Green's theorem. For certain compact subsets K' of f2 we have equality / gdx + hdy= / ( -^ J dx A dy (9.1.2.5) dK' whenever dg/dy and dh/dx are continuous on K'. For example, (9.1.2.5) is rather clear if K' is a rectangle {s + it: a_ < s < a+,/?_ <t< /?+}, if we interpret the right-hand side as either of two repeated integrals, and the left-hand side as the sum of four integrals which correspond to following an anticlockwise path round the boundary. To establish (9.1.2.2) we need (9.1.2.5) for sets of the form K'5 = {t e C: \t - s0\ < R and \t - s\ > 6} (9.1.2.6) Here the boundary dKf5 is made up of the circle dK of (9.1.2.4), taken "positively" or anticlockwise, together with another circle r5, of center 5 and radius 6, this time described "negatively" or clockwise. If we apply (9.1.2.5) with Kf = Kf6 and ig = h = i[z — s)-1/ then we find, for sufficiently small 6 > 0, /^/^+/((i-£)<->-'/)^»<-») dk r6 K' To finish the proof of (9.1.2.2) the reader should verify that and that v fdz -27rif(s)\ as S (9.1.2.8) (9.1.2.9)
340 9. Liouville's Theorem and Spectral Theory The right-hand side of (9.1.2.2) is an instance of what is known as "Cauchy's integral formula": the argument of Theorem 9.1.2 also extends this to more general compact subsets K C Q. One consequence is the maximum modulus principle and Liouville's theorem: 9.1.3 THEOREM If n C C is open and if / : n -> X is continuously differentiate on f2, then, whenever K C ft is a compact disc, sup ||/(5)|| < sup ||/(,)|| (9.1.3.1) seK sedK If, in particular, / is continusouly differentiable on C, then / bounded => f constant (9.1.3.2) and ||/(5)|| —► 0 as \s\ —> 00 =► / = 0 (9.1.3.3) Proof: The inequality (hence equality) (9.1.3.1) follows immediately from Cauchy's integral formula (9.1.2.2). For (9.1.3.2) we differentiate the Cau- chy formula, as in (9.1.2.3), and argue that if this holds for circles dK of arbitrarily large radius then (df/dz)(s) = 0 for each s G C. Finally (9.1.3.3) follows from (9.1.3.2), because first the left-hand side implies that / is bounded, therefore by (9.1.3.2) constant, and then the left-hand side forces that constant to be zero. ■ The most famous consequence of Liouville's theorem is the "fundamental theorem of algebra": 9.1.4 THEOREM If p : C —► C is a nonconstant polynomial, then p has at least one zero in C; hence, if c G Cn there is s G Cn for which zn + c^-1 +... + Cn = (z-s1)(z-s2)---{z- sn) (9.1.4.1) Proof: If p is a nonvanishing polynomial on C, then l/p is continuously differentiate on C, and we observe that p nonconstant = p(s) 0 as \s\ —► 00 (9.1.4.2) By (9.1.3.3) it follows that if p is not constant, then l/p = 0, which is impossible. Next (9.1.4.1) follows by induction on n: if p is the left-hand side of (9.1.4.1) and p(s1) = 0, then p=(z-s1)(zn-1+c'1zn-2 + ... + c'n_1) m (9.1.4.3)
9.2 The Spectrum 341 An alternative derviation of Theorem 9.1.4 is achieved by examining the "winding number" of the mappings </>: S —> S induced by a nonvanish- ing polynomial p : C —► C, in the sense of (7.10.6.8). We remark that we have chosen to repeat for vector-valued functions a simplified version of the very standard theory of "functions of a complex variable." The reader is invited to see how, using the Hahn-Banach theorem, he can extend results valid for numerical functions <f> : Q —> C, for an alternative derivation of Theorems 9.1.2 and 9.1.3 in normed spaces. 9.2 THE SPECTRUM The "spectrum" of a linear operator, or more generally a Banach algebra element, is closely analogous to the range of a continuous function. While the theory really works properly for complete algebras, we can as usual say quite a lot in incomplete algebras. We will however look only at the complex case. 9.2.1 DEFINITION If A is a complex normed algebra, then the spectrum of an element a G A is the set °a(°) = {seC:a-s <£ A'1} (9.2.1.1) for which a — s is not invertible in A. The left and the right spectrum are the sets ^A* M = {s e C:l ^ A{a - s)} = {s e C:a - s e cle{t{A)} (9.2.1.2) and <^ight(a) = {s G C: 1 g (a - s)A} = {s G C:a - 6 G aright(A)} (9.2.1.3) The almost spectrum of a is given by ^W=^'(«)^fW (9.2.1.4) where ^AftW = {* € C: 1 g cl(A(a - s))} = {s G C: a - 6 G aleft(A)} (9.2.1.5) and £PJght(a) = {5 G C: 1 g cl((a - *)A)} = {s G C:a - 6 G aright(^)} (9.2.1.6) There are some obvious inclusions among these sets:
342 9. Liouville's Theorem and Spectral Theory 9.2.2 THEOREM If A is a complex normed algebra and a G A, then *T («) C *!f M ^ht(a) C a^"(a) (9.2.2.1) and *a(«) C aA(a) = o^(a) U o*#\a) (9.2.2.2) If A is complete, then the spectrum and the almost spectrum are the same: <^ft(a) = a]fft(a) and 5A*ht{a) = <£lght(a) (9.2.2.3) Proof: The inclusions (9.2.2.1) are contained in (3.10.6.2) from Theorem 3.10.6. This gives the first inclusion of (9.2.2.2), and the second equality is (3.1.2.2). Finally, (9.2.2.3) follows from (4.4.5.12) and (4.4.5.13). ■ The almost spectrum is closed and bounded, and nonempty unless A = 0: 9.2.3 THEOREM If A is a complex normed algebra and a £ A, then 5A(a) = c\dA(a) C {s e C: \s\ < \\a\\} (9.2.3.1) and cA(a) =0=^1 = O=>A = {O} (9.2.3.2) Proof: By (3.10.6.1) both <7leftU) and crl&ht(A) are closed subsets of A, and hence <7|fft(a) C C and aTAe (a) C C are the counterimages of closed sets by the continuous mapping a — z : C —► A. This proves the first part of (9.2.3.1); for the second suppose that \s\ > \\a\\ and observe that a fd\n II / cl\ , II II , / <i\ || (9.2.3.3) To establish (9.2.3.2) we begin by assuming that A is complete, so that the almost spectrum and the spectrum coincide, and note that the resolvent function (a - z)-1 : C \ oA{a) —► A (9.2.3.4) is holomorphic: ±{a-z)-l = {a-z)-2 (9.2.3.5) Evidently also IKa-s)"1!!—>0 as \s\—► oo (9.2.3.6)
9.2 The Spectrum 343 Thus, we may use (9.1.3.3) to argue aA(a) = 0 => (a - s)_1 = 0 =► 1 = (a - s){a - s)_1 = 0 (9.2.3.7) To prove (9.2.3.2) we pass from A to its completion A, and observe, using (4.5.6.3) and (4.5.6.4) for example, that ^(a)CaA(a)CaA(a) ■ (9.2.3.8) Of course Theorem 4.5.7 tells us ax(a) = aA(o) (9.2.3.9) The spectrum and the almost spectrum of elements of a normed algebra behave very properly when under attack by polynomials: 9.2.4 THEOREM If a e A for a normed algebra A and if p : C -> C is a polynomial, then there is equality *aP(<0 =P°a(°) (9.2.4.1) and *aP(«0 =P*a(«0 (9.2.4.2) Proo/: The left-hand side of (9.2.4.1) and of (9.2.4.2) is defined using (1.10.1.7), while the right-hand side is of the form {p(t):t G K} for K = (jj{a) C C. If p is not constant and t £ C then by (9.1.4.1) we can write, with 5 e Cn and 0 ^ c0 G C: p - t = c0(2 - sJiz -82)---{z- sn) (9.2.4.3) and hence also (1.10.1.7) p(a) - t = c0(a - 5x)(a - s2) • • • (a - sn) (9.2.4.4) By (7.1.3.6) and induction on n it follows that t e oAp{a) *=> {6l552,... ,5n} n cA(a) ^0<=>te pcA{a) (9.2.4.5) The analogue for oA is exactly similar, using Theorem 3.10.6. If the polynomial p is constant then (unless A = 0) (9.2.4.1) and (9.2.4.2) follow from (9.2.3.2); if A = 0, then of course both sides are always empty. ■
344 9. Liouville's Theorem and Spectral Theory Theorem 9.2.4 holds separately for left and for right spectra, although we have not yet proved that these are always nonempty. There is also an extension to rational functions: 9.2.5 THEOREM If a G A for a normed algebra A and if p and q are polynomials, with q ^ 0 not constantly zero, then q-\0) n aA{a) =0^aA (?) (a) = ^aA{a) (9.2.5.1) and q-1(O)naA(a)=0^aA(^ya) = P-aA{a) (9.2.5.2) Proof: Here of course -(a) = qM^pU) = p{a)q{a)-1 (9.2.5.3) Q We rely on (9.2.4.1) to see that it is well defined and on Theorem 7.1.4 to see that the order of the factors is immaterial. For each iGCwe now write, assuming that p/q is not constant, I t= Llll = ±c (* _ Sl){z - S)... (* _ 8) (9.2.5.4) q q q and hence -(a) " * = <l{a)~lc0(a - sx){a - s2) • • • (a - sn) (9.2.5.5) Now the argument of Theorem 9.2.4 applies. ■ The reader is invited to reflect upon why we cannot replace oA by aA in the left-hand side of (9.2.5.2). We conclude with a "semicontinuity" property for the almost spectrum: 9.2.6 THEOREM If (an) and a in the complex normed algebra A, and (sn) and 5 in C, satisfy IK "all —>° and K"5I—>° (9.2.6.1) then there is implication sn e cA(an) for each n G N =► 5 G cA{a) (9.2.6.2)
9.3 The Spectral Boundary 345 Proof: We prove this separately for left and right spectra. If 5 is not in c^|ft(a), so that a — s £ cleit(A), then since an — sn converges to a — s we eventually find an - sn & cle{t{A). m The "upper semicontinuity" of a mapping u> : A —► P (f2) from A into the subsets of a topological space Q is defined to mean that if a G A and w(a) C int(V) then there is U G Nbd(a) for which u(U) C V. 9.3 THE SPECTRAL BOUNDARY We begin by looking at the "fine structure" of the spectrum: 9.3.1 DEFINITION If a G A for a normed algebra A we define the left and right approximate eigenvalues of a to be the sets fleft (a) = j« e C: ^nf x \\ax\\ = o| = {s € C:a - , e f'^A)} (9.3.1.1) and ^"W = {seC: ^fajf x ||xa|| = o| = {, G C:a - * G f^*(A)} (9.3.1.2) The left and the n'flrfa eigenvalues of A form the sets *!?*(«) = {* S C:L7i.(0) ^ {0}} = {S € C:a - 3 <= 7rleft(^)} (9.3.1.3) and ^8ht(a) = {s G C:12-i,(0) ^ {0}} = {s G C:a- 6 G ^ht(A)} (9.3.1.4) The exponential spectrum of a forms the set ^aM = {^G C:a-6g Aq1} (9.3.1.5) Here Aq1 is the connected component of 1 in A-1 as in Definition 7.11.1. Theorem 9.2.2 extends to include the sets of Definition 9.3.1: 9.3.2 THEOREM If a G A for a normed algebra A, then tJFM <= ***(«) C a™ (a) C ^(a) (9.3.2.1) and TA8ht(«) C ?right(a) C 5*"(a) C «#**(«) (9.3.2.2)
346 9. Liouville's Theorem and Spectral Theory and n^(a) U <^ht(a) = «™*{a) U a™ (a) = cA{a) C eA{a) (9.3.2.3) Proof: For (9.3.2.1) and (9.3.2.2) we need only refer to (3.10.6.2) and (3.10.6.3) from Theorem 3.10.6, and for the first two equalities in (9.3.2.3) we refer to (3.10.6.4). The last inclusion of (9.3.2.3) simply reflects the inclusion Aq1 C A~l. ■ The approximate eigenvalues, and if A is complete the exponential spectrum, form closed subsets: 9.3.3 THEOREM If a e A for a normed algebra A, then f||ft(a) and f^g (a) are closed and bounded in C, and satisfy ^(a)Crf(a)nff(«) (9.3.3.1) which is nonempty if A ^ 0. If A is complete, then also eA{a) is closed and bounded, with deA(a) CdaA{a) (9.3.3.2) Proof: It is clear from (3.10.6.1) that f J|ft(a) and f^ght(a) are both closed, being continuous counterimages of closed subsets of A. It is also obvious from Theorem 9.2.3 and Theorem 9.3.2 that they are both bounded. From (3.10.6.5) it follows that dclXft{a) C TTJlsht{a) C 6™ht{a) (9.3.3.3) and ^fWCff(a)C5f(a) (9.3.3.4) Now (9.3.3.1) follows from (3.10.6.7). Since (9.2.3.2) says that cA(a) is nonempty if A ^ 0, and of course bounded, and since C is connected, (7.10.3.8) implies that daA{a) is nonempty if A ^ 0. Towards (9.3.3.2), it is clear from (7.11.2.3) that eA{a) is closed in C if A-1 is open in A, and since {s G C: \s\ > \\a\\} is disjoint from oA{a) and connected in C there is inclusion eA{a) C {s e C: \s\ < \\a\\} (9.3.3.5) Finally, (7.11.2.4) gives inclusion 3«a(«) C f!T(«) n ^ght(a) C eA(«) (9.3.3.6) so that (7.10.3.1) gives (9.3.3.2). ■
9.3 The Spectral Boundary 347 The exponential spectrum is contained in the connected hull of the spectrum: whether or not A is complete there is inclusion £a(«) C n°A{°) (9-3-3.7) Here we are using a modified version of Definition 7.10.1: if K C C is bounded write nK = riooK = 'ntK if \t\ > sup \s\ (9.3.3.8) seK If a G A is arbitrary, then (3.10.6.4) from Theorem 3.10.4 gives the first two equalities of (9.3.2.3) and oA{°) = *r («) U ^ght(a) = <^ht(a) U ^(a) (9.3.3.9) and ^)=-fWu5f(«) (9.3-3.10) where r^ft (a) consists of those 5 G C for which the operator La — si fails to be "closed" in the sense of Definition 3.3.1. If in particular A = BL(X, X) for a normed space X, then in addition aA{T) = TlXn{T) U r*ght(r) for each T G A (9.3.3.11) noting that rAlg (T) consists of 5 G C for which T — si is not open and using (3.3.7.1). The spectral mapping theorem for polynomials extends to the approximate eigenvalues, and almost to the eigenvalues: 9.3.4 THEOREM If a G A for a normed algebra A and if p is a polynomial, then there is equality vp{a) = pLj{a) for each « G {a^ft,a^ght,f^ft,f7ht} (9.3.4.1) There is also inclusion P^ia) C 7r^eftp(a) and p^lght(a) C 7r^lghtp(a) (9.3.4.2) with equality for nonconstant polynomials, and inclusion eAP{°) ^P^W (9.3.4.3)
348 9. Liouville's Theorem and Spectral Theory Proof: If 6 and c are in A there is implication {6, c} C A \ w(A) 4=> {6c, cb}CA\ w(A) (9.3.4.4) for each of the w of (9.3.4.1). This together with the factorization of (9.2.4.3) proves (9.3.4.1) for nonconstant p. If p is constant then (unless A = 0) equality (9.3.4.1) follows from the fact that a;(a) ^ 0, and of course if A = 0 then both sides are always empty. The same argument proves (9.3.4.2), since (9.3.4.4) holds with w = 7rleft and with w = 7rrisht. Finally, (9.3.4.3) follows from the one way implication {6, cjCA"1^ {6c, c6} C Aq1 m (9.3.4.5) Theorem 9.3.4 also extends to rational functions p/q for which the zeros of the denominator q are disjoint from the spectrum 0A(a), or for (9.3.4.3) the exponential spectrum eA(a). We need only adapt the proof of Theorem 9.2.5. 9.4 SUBALGEBRAS AND QUOTIENTS If T : A —► B is a homomorphism of normed algebras, then (7.6.0.1) tells us that the image T(a) of an invertible element a £ A must be invertible in E. 9.4.1 THEOREM If T : A —► E is a homomorphism of normed algebras and a £ A, then "bW^xW (9-4.1.1) If T is continuous, then also a^{t{Ta)CalXn{a) and af'(Ta) C ^ght(a) (9.4.1.2) and eB{Ta) C eA(o) (9.4.1.3) If instead T is one-one, then, in the other direction, ir^c) C n%{t{Ta) and ^"(To) C 7r£ght(a) (9.4.1.4) If T is bounded below and continuous, then f!?ftM C ?£ft(ra) and f*ght(a) C f£ght(Ta) (9.4.1.5)
9.4 Subalgebras and Quotients 349 Finally, if A and B are complete and T is continuous and onto, then *B(r<0= p| eA(a + d) (9.4.1.6) Td=0 Proof: For (9.4.1.1) apply (7.6.0.1) to elements a - s with s G C. When T is continuous observe T{A\cleit{A)) CB\cleit{B) and T{A\cT'lsht{A)) C5\aright(B) (9.4.1.7) giving (9.4.1.2). Also, (7.11.5.1) says that T maps Aq1 into Bq1, giving (9.4.1.3). For (9.4.1.4) we claim, if T is one-one, that T-^BXn^iB)) CA\7rleit{A) and (9.4.1.8) T~1{B \ 7rright(J5)) C A \ 7rright(A) If, for example, a G A and Ta is not in 7rleft (B) then, for each x G A, ax = 0 =► r(a)r(x) = T(ax) = 0 =► T(x) = 0 =► x = 0 (9.4.1.9) Similarly, for (9.4.1.5), if ||-|| < ik||r(-)|| on A and ||-|| < h\\{Ta) • || on B then \\x\\ <ik||rx|| </iA;||(ra)(rx)|| = /iA;||r(ax)|| < /iA;||r||||ax|| for each x G A (9.4.1.10) Finally, (9.4.1.3) gives inclusion one way in (9.4.1.6). Conversely, (7.11.5.2) says that T maps Aq1 onto Bq1 so that, if 5 §? eB{Ta), there is a' G Aq1 for which Ta — s = Taf, but now rf = a'-a + 5=l>rfG r_1(0) and 5 £ eA(a + d) ■ (9.4.1.11) Of course, (9.4.1.1) holds separately for left and right spectra. The contrast between the inclusions of (9.4.1.1) and (9.4.1.2) and those of (9.4.1.5), together with (9.3.3.1), gives rise to a certain "spectral permanence": 9.4.2 THEOREM If 1 G A C B for a normed algebra B and a G A, then ^A(a)Cafl(a)CaA(a) (9.4.2.1) If A is complete and J is a closed two-sided ideal of A, then °a/A" + J)^r\cAa + d)^ *I°a/j{* + J) (9.4.2.2)
350 9. Liouville's Theorem and Spectral Theory Proof: For (9.4.2.1) we argue, using (9.3.3.1), (9.4.1.2), and (9.4.1.5), **a(«) £ rlf{a) C f*f (a) C aB(a) C aA{a) (9.4.2.3) The first inclusion of (9.4.2.2) is just (9.4.1.1), while for the second we argue, using in succession (9.3.2.3), (9.4.1.6), and either (9.3.3.7) or (9.3.3.2) with (7.10.3.3), D °A{a + d) C f| eA{a + d) = eA/J{a + J) C WA/J{a+ J) m (9.4.2.4) deJ dej Example (7.10.3.12) cautions us against concluding from (9.4.2.2) that d( f| "a(* + «*)) C aA/J(a + J) (9.4.2.5) To see that this is so we will have to work a little harder. If T : A —► B is a homomorphism of normed algebras then we shall refer to oB{Ta) as the T-Fredholm spectrum of a, with similar terminology for the various subsets of the spectrum introduced in Definition 9.3.1. Clearly, anything that can be said about the spectrum of a can also be said about the spectrum of Ta, which is the Fredholm spectrum of a. The "Weyl spectrum" of a is defined in terms of the "Weyl elements" of A of Definition 7.6.1: 9.4.3 DEFINITION If T : A -> B is a homomorphism of normed algebras then the T- Weyl spectrum of a G A is the set uT(a) = {seC:a-s<£A-1+T-1(0)}= f]cA{a + d) (9.4.3.1) Td=Q and the almost T- Weyl spectrum is the set uT(a) = P| cA(a + d) (9.4.3.2) Td=0 We can make analogous definitions for left and right spectra, eigenvalues and approximate eigenvalues. The almost Weyl spectrum is closed and bounded, and satisfies half the spectral mapping theorem: 9.4.4 THEOREM If T : A —► B is a homomorphism of normed algebras and a G A, then uT(a) is closed in C and satisfies cB{Ta) C wT(a) C aA{a) and oB{Ta) C CT(a) C cA{a) (9.4.4.1)
9.4 Subalgebras and Quotients 351 with uT(a) C uT(a) (9.4.4.2) There is equality in (9.4.4.2) if A is complete. If p is a polynomial 0JAp(a) C puT(a) and uTp(a) C puT(a) (9.4.4.3) Proof: It is clear that uT(a) is the intersection of closed subsets of C and hence closed. The first inclusion of (9.4.4.1) follows from A'1 C A'1 + r_1(0) C r-1^"1) (9.4.4.4) and the second from the analogue of (9.4.4.4) for "almost invertible" elements of A and B. Since every invertible element is almost invertible the inclusion (9.4.4.2) follows, together with the converse for complete A. Finally, (7.6.4.1) says that the product of Weyl elements is Weyl, so that the first inclusion of (9.4.4.3) is the same as the proof of (9.3.4.3), and then the second inclusion is similar. ■ The element r 11 + T n 1 n EA = BL{X2,X2) U + I 0 0 V -I of (7.6.4.9) provides an example of the failure of equality in (9.4.4.3), taking T : A —> B = A/J to be the Calkin homomorphism and the polynomial p = [z + l)(z — 1). The same example can also be used to show that equality can fail in (9.3.4.3): we need to know that the invertible group A-1 is actually connected when X is a Hilbert space, and then observe that A'1 = Aq1 =► uT{a) = eB{Ta) (9.4.4.5) We are ready to finish the job begun in Theorem 9.4.2: 9.4.5 THEOREM If T : A —► B is a homomorphism of Banach algebras for which T{A) is closed in B, then for each a G A duT{a) C cB{Ta) C uT{a) (9.4.5.1) Proof: If J = T_1(0) we can factorize B^A = B^-A/J^-A (9.4.5.2) and observe that U is bounded below and V is onto. By Theorem 9.4.2
352 9. Liouville's Theorem and Spectral Theory (9.4.5.1) holds with U in place of T, and the weaker oB{Ta) C uT(a) C rjaB{Ta) (9.4.5.3) holds separately for U and for V. It follows that also (9.4.5.3) holds for T = UV: u>T(a) C rjaA(a + J) C rjrj(7B(Ta) = rjaB{Ta) From (9.4.5.3) and Theorem 7.10.3 it follows that drjuT{a) C aB(Ta) C uT(a) (9.4.5.4) Thus suppose that 5 G duT(a): if there is a hole if in uT(a) for which 5 G dH, then, taking arbitrarily t £ H, apply the mapping / = (2 — £)_1 to an element of the form a + d with d G T_1(0) and argue f{s) G /(&#) = drjfojT(a) = drjujTf(a + d) C cBTf{a + d) = fcBT{a) (9.4.5.5) Here the first equality is because the mapping / = (2 — £)-1 sends the hole H onto the unbounded component of the image of ujT(a), the second equality is because the one-way spectral mapping theorem (9.4.4.3) extends to rational functions, the third inclusion is (9.4.5.4) applied to f{a + d), and the final equality is (9.2.5.1) applied to Ta G B. Acting on (9.4.5.5) with the polynomial z — t now gives inclusion 5 G oB[Ta). To finish the proof we must recall (7.10.3.11), which says that if 5 G dK\dr]K, then we can write 5 = limn sn with sn G dHn for a sequence [Hn) of holes in K. Applying this with K = uT(a) gives, by what we have just proved, 5 = lim sn G cl cB [To) = oB [Ta) m 9.5 THE SPECTRAL RADIUS The spectral radius measures how far away from 0 traces of the spectrum can be found: 9.5.1 DEFINITION If A is a normed algebra and a £ A has nonempty bounded spectrum then the spectral radius of a is the number |a|a=sup{|«|:«GaA(a)} (9.5.1.1) We can extend the idea by declaring \a\a = 00 if the spectrum of a is unbounded, and \a\a = —00 if the spectrum is empty: of course, Theorem 9.2.3 says that this cannot happen if A ^ 0. We can also extend Definition 9.5.1 to |a|w for each of the subsets uA C aA of Definitions 9.2.1
9.5 The Spectral Radius 353 and 9.3.1, and to the Weyl spectrum of Definition 9.4.3. The crucial property of the spectral radius is its relation to the norm: 9.5.2 THEOREM If a e A for a Banach algebra A ^ 0, then limsup||an||1/n < \a\a < inf ||an||1/n (9.5.2.1) n->oo neti Proof: From (9.2.3.1) and (9.2.4.1) there is inclusion, for each n G N, {sn:secA{a)} =cA{an) C {t G C: \t\ < \\an\\} (9.5.2.2) giving the second inequality of (9.5.2.1). To establish the first we claim \a\a <1=> \\an\\ —> 0 => limsup ||an||1/n < 1 (9.5.2.3) n—KX> To establish this we show that, if \a\a < 1, 2m = i [z - a)"1 dz = 2m lim (l - an)_1 (9.5.2.4) J n—► oo 1*1 = 1 where the notation indicates that the integral is taken round the circle S = {\z\ = 1} = {s £ C: |s| = 1} once in a counterclockwise direction. To establish (9.5.2.4), recall that, with un = ex]>(2m/n), we have zn-l = (z- wB) (* - «2) -(*- «») (9-5-2-5) and hence (9.5.2.6) One other observation is that lim n(un - 1) = — exp(27r*t)t=0 = 2m (9.5.2.7) n—►oo dt We are ready to prove the second equality of (9.5.2.4), assuming only that the spectrum oA{a) is disjoint from the circle S: by taking "Riemann sums" it follows that i (z- a)"1 dz = lim XK"1 - a)"1 K - <~') J n—►oo * ■* n = nU™ E(X - w»y°)-1(w» - !) = '^"K - 1)(1 - «n)_1 n—►oo r -* n—►oo J = l
354 9. Liouville's Theorem and Spectral Theory The second equality of (9.5.2.4) now follows from (9.5.2.7). For the first equality of (9.5.2.4) we need to know that the spectrum oA{a) is confined to {\z\ < 1} = {s G C: |s| < 1} the interior of S: then Green's theorem (9.1.2.5) gives, as in the proof of Theorem 9.1.2, R > 1 => <b [z — a) 1 dz = <b [z — a) l dz —> 2ni as R oo 1*1=* For the passage to the limit at the end argue that if R > \\a\\ then <f> [z — a)~x dz — 2ni = f {{z — a)~l — z~x) dz\\ (9.5.2.8) \z\=R \z\=R I \z\=R z [z — a) adz < Ml (9.5.2.9) R-\\a\\ With the proof of (9.5.2.4) we have established (9.5.2.3), and if we apply (9.5.2.3) with (l/k)a in place of a, taking k = la^ + e > la^ and allowing e —► 0, then we will obtain the first inequality of (9.5.2.1). ■ Theorem 9.5.2 gives an immediate characterization of the quasinil- potent elements of Definition 7.4.1, at least when the algebra is complete. 9.5.3 THEOREM If A ^ 0 is a Banach algebra and a G A, then a quasinilpotent 4=> oA{a) = {0} (9.5.3.1) In particular, the radical elements of A are quasinilpotent: a e Radical(A) => ||an||1/n —► 0 as n —► oo (9.5.3.2) Proof: The first equivalence follows at once from (9.5.2.1). For (9.5.3.2) combine (7.2.3.10) with (7.2.3.2) and (7.2.3.3) from Theorem 7.2.3 to see that Radical(A) = {aeA:l-AaC A'1} = {aeA:l-aAC A'1} (9.5.3.3) Now compare this with the implication aA(a) = {0} 4=> 1-CaCA -l (9.5.3.4) and apply (9.5.3.1).
9.5 The Spectral Radius 355 Theorem 9.5.2 of course guarantees that the sequence (||an||1/n) always converges; it very nearly but not quite says that it is monotonically decreasing. The argument used to prove Theorem 7.4.3 also shows that if ab = ba, then l« + ft|a<l«L + H<r and M„ < l«U*l<r (9-5-3.6) If A is commutative, then (7.4.4.2) and (9.5.3.1) can be combined to reverse the implication in (9.5.3.2). Theorem 9.5.2 has an unexpected significance: the two quantities related in (9.5.2.1) have different invariance properties. Thus, the limit limn ||an||1/n obviously depends on the norm ||*||, but is completely independent of the algebra A: conversely, the supremum \a\a depends on the algebra A but is independent of the norm ||-|j. For equality (9.5.2.1) it is necessary that the norm satisfies the multiplicative condition (1.10.1.1) and that the algebra is complete: both these conditions will be preserved if we only look at topologically equivalent algebra norms on A. It is now not surprising that there is another expression for the spectral radius: 9.5.4 THEOREM If a e A for a Banach algebra A, then \a\a = inf {p{a):p an equivalent norm on A} (9.5.4.1) Proof: The left-hand side is by (9.5.2.1) less than or equal to the right- hand side. For the reverse inequality first specialize to the algebra BL(X, X) of bounded operators on a Banach space. If T G BL(X, X) and e > 0, then there is n G N for which ||T»+i||i/(»+i) < i^ + £ = k (9.5.4.2) Now put ||x||' = fcn||x|| + A^Hrxll + • • • + ||rnx|| for each x G X (9.5.4.3) and \\S\\'= sup \\Sx\\' for each S£BL(X,X) Nl;<i Evidently fcn||x|| < llxH' < {kn + kn-x\\T\\ + • • • + ||rn||)||x|| for each x e X (9.5.4.5) (9.5.3.5) (9.5.4.4)
356 9. Liouville's Theorem and Spectral Theory Thus by (1.7.1.2) ||-||' is an equivalent norm on X, and hence also ||-||' is an equivalent norm on BL(X,X). Taking S = T in particular gives in,<||T||'<*=|T|„ + e (9.5.4.6) This proves Theorem 9.5.4 when A = BL(X,X), and hence in general, if we apply what we have just proved to X = A and T = La £ BL{A, A), m 9.6 GELFAND'S THEOREM A homomorphism T : A —> B will be said to have the Gelfand property if there is implication, for arbitrary a £ A, Ta e B'1 =^aeA-1 (9.6.0.1) In the terminology of Definition 7.6.1, all the T-Fredholm elements of A must be invertible. A simple example is the quotient map associated with the radical: Theorem 7.2.6 says, in particular, that if J = Radical(A) a + Je {A/J)'1 =^aeA-1 (9.6.0.2) The Gelfand theorem provides each commutative normed algebra A with a Gelfand homomorphism T : A —> B = C^Q) for a topological space H. We begin with an auxiliary result, which tells us that the only normed division algebra is the complex field. This is known as the Gelfand-Mazur lemma: 9.6.1 THEOREM If A ^ 0 is a normed division algebra, in the sense that A = A'1 U {0} (9.6.1.1) then A reduces to the complex field: A = CI (9.6.1.2) The same is true if more generally there is k > 0 for which ||a||||6|| < fc||a&|| for each a,6 G A (9.6.1.3) Proof: From (9.6.1.1) there is implication 5 e cA{a) => a = s e CI (9.6.1.4) and (9.2.3.2) guarantees that oA{a) is not empty. If we assume only (9.6.1.3) we have implication 5 e rlX{t{a) => a = s e CI (9.6.1.5) and (9.3.3.1) guarantees that fj|ft(a) is nonempty. ■
9.5 The Spectral Radius 357 The topological space H which furnishes the Gelfand homomorphism is manufactured from the "characters" of Definition 7.13.2: 9.6.2 DEFINITION If A is a complex normed algebra we shall write A* = {/ G Af: Il/H < 1 = /(l) and f(ab) = f{a)f(b) for each a, 6 G A} (9.6.2.1) furnished with the topology of pointwise convergence on A, namely, U C A* is a neighborhood of / G A* iff there is e > 0 and finite K C A for which ig G Ax:max\g(a) - f(a)\ < e\ C U (9.6.2.2) The reader should note that unless A is finite dimensional this is not the same as the topology induced by the norm of A*. To see that Q = A* is big enough to produce a Gelfand homomorphism we combine Theorem 7.2.3 with the Gelfand-Mazur lemma: 9.6.3 THEOREM If A is a complex Banach algebra and n = A* is its character space, then the homomorphism T : A —► C^Q) defined by setting (Ta)(f) = f{a) for each / G H, for each aeA (9.6.3.1) is well defined, of bound < 1, with the Gelfand property. aeA-x^=^Tae ^(H)"1 (9.6.3.2) Proof: To see that T is well defined we need to know that Ta is continuous on fh but this is clear from (9.6.2.2). The boundedness and the homomorphic properties of T are clear from the corresponding properties (9.6.2.1) of the characters, and then forward implication in (9.6.3.2) is given by (7.6.0.1). Conversely, we claim that if a G A then a <£ A"1 => 0 G {/(a): / G Ax} (9.6.3.3) Indeed, if a §? A-1 then by (7.2.3.1) there is a maximal ideal J C A for which a £ J. By (7.2.2.2) J = cl(J) is closed, and now A/J is a normed division algebra. If we define / : A —► C by setting °A/j{b + J) = {/{*>)} ^ each 6 G A (9.6.3.4) then it is clear that / G A* is a character for which f(a) =0. ■ An equivalent definition of the character / induced by the maximal
358 9. Liouville's Theorem and Spectral Theory ideal J in (9.6.3.4) is to say that 6-/(6)16 J for each be A (9.6.3.5) In (9.6.3.3) we have apparently proved more than we have stated: we do not need to know that the continuous function Ta : Cl —> C has a bounded reciprocal l/(Ta) : Q —> C, merely that it never vanishes. In fact, the character space f2 = A* is compact, so that every continuous function is bounded: 9.6.4 THEOREM If A is a complex normed algebra, then A* is compact. Proof: If we impose the topology described in (9.6.2.2) on the whole of the unit disc {/ G A*: ||/|| < l}, then the character space A* is a closed subset. To see that the unit disc is compact in this "weak*topology" we see how it can be embedded in the product space n{N^Ni}=n{sec:H^Hi} (9-6-4-1) By Theorem 6.6.4 each of the factor spaces is compact, and by Tychenoff's Theorem 6.6.6, so is their product. ■ Theorem 9.6.4 explains why we have abandoned the norm topology of in our discussion of A*. The "weak* topology" which we are using is exactly the topology induced by the embedding of A* in the product C , and is the weakest topology for which all the functions Ta are continuous. There is a superficial analogy between the dual space X* of a normed space X and the character set A* of a normed algebra A. In general, the Gelfand homomorphism need be neither one-one nor isometric, in contrast to the embedding of a normed space in its second dual: 9.6.5 THEOREM Necessary and sufficient for the Gelfand homomorphism on A to be one-one is that A is semisimple, in the sense that Radical(A) = {0} (9.6.5.1) Necessary and sufficient for the Gelfand homomorphism on A to be isometric is that A is conservative, in the sense that ||a2|| = ||a||2 for each a e A (9.6.5.2) If A is conservative, then necessary and sufficient for the Gelfand homomorphism to be onto is that its range is self-adjoint in C00(A*).
9.5 The Spectral Radius 359 Proof: If T : A -> C^Cl) is the Gelfand homomorphism then (9.6.0.1) says *a(«) = {/(«): / € A*} = (Ta)(n) (9.6.5.3) The spectrum of a is the range of its Gelfand transform Ta. Thus, using (7.4.4.2) since A is commutative, Ta = 0 4=> a quasinilpotent 4=> a G Radical(A) (9.6.5.4) This proves the first part. Towards the second, it follows from (9.6.5.3) that |o|a =sup{|/(o)|:/ € A*} = ||Ta|L (9.6.5.5) The spectral radius of a is the standard norm of the function. If, therefore, T is isometric, so that ||a|| = \a\a, then (9.6.5.2) follows from the spectral mapping theorem (9.2.4.1). Conversely, if (9.6.5.2) is assumed, then ||an|| = ||a||n whenever n = 2m is a power of two and hence ||a|| = ||an||1/n if n G 2N and aG A (9.6.5.6) By (9.5.2.1) this implies ||a|| = \a\a, proving the second part. The last part now follows at once from the complex case of the Stone-Weierstrass Theorem 7.12.4. Note that T{A) "separates points" of Cl = A* by definition, and is necessarily closed in C^Cl) if T is isometric. ■ The "commutative Gelfand-Naimark theorem" says that the last part of Theorem 9.6.5 applies to the "Hilbert algebras" of Definition 8.8.1: 9.6.6 THEOREM If A is a commutative Hilbert algebra, then the Gelfand homomorphism is an isometric *-isomorphism. If, in particular, A is the closed algebra generated by a and a*, then Ax =cA{a) C C (9.6.6.1) Proof: Begin with the observation, using (8.8.4.4), that if a G A then a = a* => cA{a) C R (9.6.6.2) For if 5 and t ^ 0 are in R then a + 5 is also self-adjoint and therefore a + 5 + it is invertible. It follows at once that if / G A* is a character then a = a* => f(a) G R (9.6.6.3) and hence, writing a = b + ic and a* = 6 — ic as in (8.8.4.3), /(a*) = /(a)" for each a G A and / G Ax (9.6.6.4)
360 9. Liouville's Theorem and Spectral Theory We have proved that the Gelfand homomorphism is also a *-homomor- phism. It follows that the range of the Gelfand homomorphism is a self- adjoint subalgebra of C00(A*). It remains only to verify the "conservative" condition (9.6.5.2). But, since A is commutative, every element a G A is "normal" in the sense of (8.8.3.1), and now (8.8.5.1) with the B* condition (8.8.1.1) gives ||a2|| = ||a*a|| = ||a||2 (9.6.6.5) Toward (9.6.6.1), it is clear that each element of A* is mapped into the spectrum of a under evaluation at a. Conversely, if A is generated by a and a* then each / G A* is determined on A by the number /(a). ■ Whether or not a Hilbert algebra A is commutative, if a G A is normal in the sense of (8.8.3.1) then the closed subalgebra B generated by a and a* will be a commutative Hilbert algebra, hence subject to Theorem 9.6.6. In the case A = BL(X, X) this is sometimes called "the spectral theorem" for the normal operator a: linear functional on C^B*), or equivalently measures on B*, can be brought to bear on a through the medium of the Gelfand homomorphism. It is less obvious that the double commutant comm^(c) of a normal element a, as defined in (7.1.1.2), is also "self-adjoint" and therefore a commutative Hilbert algebra. This result is known as Fuglede's theorem: 9.6.7 THEOREM If a G A for a Hilbert algebra A there is implication a*a = aa* =$■ a* G comm^(a) (9.6.7.1) Proof: The argument, due to Rosenblum, is a cunning use of Liouville's theorem: if a and 6 are in A there is implication ab = ba =$■ ex-p(isa)b • exp(—isa) = b for each 5 G C (9.6.7.2) while if a is normal then (7.11.4.1) exp(zsa* + isa) = exp(isa*) exp(zsa) for each 5 G C (9.6.7.3) Since sa* +sa is self-adjoint in the sense of (8.8.3.2), the exponential on the left-hand side of (9.6.7.3) is unitary in the sense of (8.8.3.3). Thus, there is k > 0 for which ||et5a* m b m t-isa* || < fc||6|| for ea(.h 3 e c (9.6.7.4) Since the function txza* -b-e~lza* is evidently holomorphic on C, Liouville's
9.6 Gelfand's Theorem 361 theorem (9.1.3.2) says that it is constant. In particular, i{a*b - ba*) = Y^b' e~iZa*)z=o = ° " (9.6.7.5) We conclude with a characterization of the characters in the dual space of a Banach algebra: 9.6.8 THEOREM If A ^ 0 is a complex Banach algebra, then Ax = {/ e A^: /(l) = 1 and 0 £ /{A'1)} (9.6.8.1) Proof: It is clear, since A ^ {0}, that the left-hand side of (9.6.8.1) is included in the right. Conversely, we claim that if / is in the right-hand side of (9.6.8.1), then the implication (7.13.4.1) holds: /(a) = 0 => /(a2) = 0. To see this, suppose /(a) =0 and n £ N and write f((z-a)n) = (z-Sl)(z-s2)---(z-sn) (9.6.8.2) using the fundamental theorem of algebra (9.1.4.1). At the same time f((z - a)n) =zn- nf{a)zn-1 + \n{n - l)f{a2)zn~2 + • • • + (-l)n/(an) (9.6.8.3) Thus, if /(a) = 0 then also Y^=i sj = 0> while ^j<A: sjsk = \n{n — l)/(a2): it follows 0 = E sn = E 5; +2 E sJsk = E 5y + n(n ~ W) (9-6-8-4) V J J J i<k i and hence n{n-l)\f{a2)\ < El5j|2 <ttmax|sy|2 (9.6.8.5) 3 Suppose now that the functional / lies in the right-hand side of (9.6.8.1), so that /(l) = 1 and 0 £ /(A-1). By (9.6.8.2) b=(a- s1)(a - s2) • • • (a - sn) => f(b) = 0 => 6 £ A-1 (9.6.8.6) It follows, using (9.2.3.1) at the end, {61,62,...,6n} CcA{a) =4>max|6-|2 < ||a||2 (9.6.8.7) 3
362 9. Liouville's Theorem and Spectral Theory Thus, for each n G N (n-l)|/(a2)|<||a||2 (9.6.8.8) forcing /(a2) = 0. This proves the implication (7.13.4.1), and hence by Theorem 7.13.4 that / is a character on A. m An equivalent condition to membership of the right-hand side of (9.6.8.1) is f{a) G cA(a) for each a G A (9.6.8.9) Combined with Theorem 9.6.5, Theorem 9.6.8 gives its own extension: if A is arbitrary and if B is commutative and semisimple, then necessary and sufficient for T G BL(A,B) to be a homomorphism is that r(l) = 1 and r(A_1) C B~l (9.6.8.10) 9.7 THE FUNCTIONAL CALCULUS If we extend the "Cauchy integral formula" of (9.1.2.2) from complex numbers to Banach algebra elements a G A, then it is plausible that we will be entitled to call the answer "/(a)". We begin with the observation that, if T C C is a "piecewise smooth" image in C of the circle S, then we can define a mapping Wr:s —>—. f(z- s)-1 dz from C \ T to Z (9.7.0.1) 27TI J r which then divides the complement of T in C into subsets VTIT1(n)nGZ; some of these may be empty, or possibly not connected: but no connected subset of C \ T can intersect more than one of them, and always C\riTCWf1(0) (9.7.0.2) In particular, if T is "simple," in the sense that the piecewise smooth mapping from S onto T can be made one-one, then C \ T = Wf1 (1) U Wf\0) (9.7.0.3) We shall sometimes speak of Wf1^) as the inside, and V^IT1(0) as the outside, of the curve T, and then extend this terminology to the situation in which T is the union of finitely many such curves. Of course the interior of the set T, as defined in (1.2.0.8), is the empty set. For brevity we shall extend the term "simple" to apply to finite unions of curves for which (9.7.0.3) holds. With this preamble, we offer the "Cauchy integral formula" as our definition of the functional calculus:
9.7 The Functional Calculus 363 9.7.1 DEFINITION If a G A for a complex Banach algebra A and if / : 0 C C is holomorphic on an open set 0 C C for which oA{a) C O, then we write /W = i / f{z){z-a)-1dz=±Jf[z){z-a)-1dz (9.7.1.1) <rA(a) r where T C C\cA(a) is piecewise smooth with oA{o) C VTIT1(l). In words, we integrate once anticlockwise around the spectrum of a. We must at once check that f(a) is well defined: 9.7.2 THEOREM If a G A for a Banach algebra A and if /, g and h are holomorphic on a neighborhood of oA{a)> then /(a) G A is well defined, and satisfies /(a) G comm^(a) (9.7.2.1) /(a) = l if / = 1 (9.7.2.2) /(a) = a if / = z (9.7.2.3) /(a) = g{a) + &(a) if / = g + /i (9.7.2.4) f(a)=g(a)h(a) if / = g • /i (9.7.2.5) Proof: The element /(a) is supposed to exist and be independent of the particular curve T winding +1 round the spectrum of a. The existence, as a limit of Riemann sums, follows from Theorem 5.12.2; the independence of particular T follows from Cauchy's theorem and Green's theorem (9.1.2.5). To see that f(a) is in the double commutant of a, apply (5.12.2.9) to the operators Lb and Rb associated with each 6 G commA(c): (27r*)6/(a) = / f{z)b{z - a)'1 dz <7A(a.) f{z){z -a)-1bdz = (2iri)f(a)b -f o-A(a) We essentially proved (9.7.2.2) as part of Theorem 9.5.2. If R > \a\a, then (27rz)l(a) = j> (z — a)~1dz= j> [z — a)-1 dz—> 2m as J?—> oo <rA(a) \z\=R
364 9. Liouville's Theorem and Spectral Theory This now Combines with Cauchy's theorem (9.1.2.5) to give (9.7.1.3): (2iri)(z(a) — a) = <f> z[z — a)-1 dz — a = <f> [z — d)[z — a)-1 dz = 0 <rA{a) <Ja{o>) For (9.7.2.4) we need only look at Riemann sums to see that the integral of a sum is the sum of the integrals. For (9.7.2.5) we need to work just a little bit harder. Taking advantage of the independence of f(a) from the particular curve T round the spectrum of a, if g and h are each differentiable in a neighborhood of oA{a) we can arrange that Tg and Tk are finite unions of piecewise smooth curves in C \ oA{a) for which 'aWC^'W and WF/MUIVCW^I) (9.7.2.6) Thus, also rkC ^(0) (9.7.2.7) In particular, T and Tk are disjoint sets. We may now compute (27ri)2g(a)h{a) = / g{z)(z — a)-1 dz / h(w)(w — a)-1 dw zevg werh = / ( / g(z)h(w)(w - z)-1 zevg wevh X ((z - a)-1 — (w — a)-1) da J dz zevg wevh + /" /iW^-aj-M f f{z){z-w)-1dz\dw wevh zevg = / g{z)(z- a)~1(27ri)h(z) dz zeTg + / h(w)(w-a)~1 -Odw werh = 2ni / g(z)h(z)(z - a)"1 dz + 0 = (27rz)2(^ • /i)(a) We used (9.1.2.2) with h in place of / for the first of these integrals, and Cauchy's theorem (9.1.2.5) for the second. ■
9.7 The Functional Calculus 365 Theorem 9.7.2 in particular shows that if / is a polynomial then f(a) agrees with the formula of (1.10.1.7): we might remark that this can be done by the argument for (9.7.2.3), without going to the trouble needed for (9.7.2.5). It is also clear from Theorem 9.7.2 that if / is a rational function (well defined on oA{a) of course) then f(a) agrees with the formula of (9.2.5.3). We need only verify this when / = l/q is the reciprocal of a polynomial, and in this case (9.7.2.5) says that *(«)(£)(«)=!= (£)(«)«(«) (9-7.2.8) There is a certain continuity about the mapping / —► /(a): our statement is complicated by the fact that the space of all relevant functions / is not a normed space. 9.7.3 THEOREM If a e A for a Banach algebra A and if K C C is a compact set for which oA{a) C int(jK'), then there is k > 0 for which ||/(a)||<fcsup|/| (9.7.3.1) K whenever / : O —► C is holomorphic on an open set Q ~D K. In particular, if / = c0 + Yl^Li cnzU ls defined on a disc {\z\ < R} with R > \a\a, then oo /(«) = co + J2 cn<*n (9.7.3.2) n=l Proof: If we fix a piecewise smooth curve T, lying in the interior of K and winding +1 round oA{a)> we can argue ||/(a)|| < i- length(r) max \\{z - a)'1]] sup ||/|| (9.7.3.3) Zn r k giving (9.7.3.1). It follows that the mapping / —► f(a) is bounded and linear on the subspace of C00(jfiT) consisting of those continuous functions which are differentiable at each point of int(jfir). In particular, if / is a power series then f(a) is the limit of pn(a) for the partial sums pn of /. ■ Necessary and sufficient for the power series / = c0 + Yl^Li cnzTl to be defined on the disc {\z\ < R} is the familiar condition, limsup|cn|1/n < -^ (9.7.3.4) The spectral mapping Theorem 9.2.4 extends to holomorphic functions:
366 9. Liouville's Theorem and Spectral Theory 9.7.4 THEOREM If a G A for a Banach algebra A and if / : n -> C is holomorphic on an open neighborhood of oA{a)^ then *a/(«)=/*a(«) (9-7-4.1) Proof: If 5 G H is arbitrary we can write / - f(s) = (Z-S)g:n—+C (9.7.4.2) with holomorphic g : Q —► C. It is clear how to define g(t) if t ^ 5, and we take (7(5) = f'(s). Alternatively, the reader can write out a Cauchy integral for g(t). It follows that /(a) - f(s) = (a - s)<7(a) = <7(a)(a - s) (9.7.4.3) giving inclusion one way in (9.7.4.1). MeftW^r/(«) and fa^\a)<Za^f{a) (9.7.4.4) Conversely, if £ G C \ /crA(a), then (/ — i)~l = h is holomorphic on some open neighborhood of oA{a). Since (/ — i)h = 1 on a third neighborhood of aA (a) it follows that Ma)(/(a) -0 = 1= (/M " 0Ma) (9.7.4.5) which excludes £ from the spectrum of /(a). ■ It is satisfying to know that the functional calculus for f(a) is consistent with the calculus for a: 9.7.5 THEOREM If a e A for a Banach algebra A and if / : Qf -> C and <7 : f2g —► C are functions holomorphic on open sets containing o~A(a), and o~Af(a), respectively, then there is equality (ff °/)(<*) =ff (/(a)) (9.7.5.1) Proof: By the spectral mapping theorem 9.7.4 there is inclusion ^(^/-'(n,) (9.7.5.2) and now g o / is holomorphic on the open set «* = r1^) nn;5 aA{a) (9.7.5.3) We can now choose piecewise smooth curves T* and T , winding +1 round cA{a) and 0-A/(a), respectively, in such a way that T* and its "inside" lies
9.7 The Functional Calculus 367 in Qf, while Tg and its inside lies in VIg. Also, by continuity there will be an open set n'h in f2 *, containing T* and its inside, for which f(n'h) lies "inside" Tg. Now for each point t £ Tg the function ht = (t — /)_1 is differentiate on nj^. By (9.7.2.5) ht(a) = {t- /(a))-1 for each t £ Tg (9.7.5.4) Now (2*i)g(f(a)) = J g{w)(w - /(a))"1 dw = J ^H^7 J K(z)(z-°)~ldz\ dw = j{z - a)"1 l±-Jg(w)(w - /(*)) dwdz = J{z-a)-1[g-f){z)dz m The functional calculus also commutes with homomorphisms T € HBL(A,B): if a £ A and if / : Qf —► C is differentiate on a neighborhood of aA (a) D aB (Ta), then f(Ta) = T(f(a)) (9.7.5.5) One rather familiar application of the functional calculus is a characterization of the "quasipolar" elements of Definition 7.5.2: 9.7.6 THEOREM If a e A for a complex Banach algebra A, then a quasipolar -<=>• 0 £ accaA(a) (9.7.6.1) Proof: If a £ A is quasipolar, then (7.5.5.3) from Theorem 7.5.5 ensures that a — s € A-1 for sufficiently small complex numbers 5^0. Conversely, if 0 is not an accumulation point of the spectrum we can find an open neighborhood f2 of oA (a) which contains a disc K of center 0 for which K and fi \ K are open in C and cA{a) n K C {0} (9.7.6.2)
368 9. Liouville's Theorem and Spectral Theory If we now define / : VI —► C by setting f{s) = \ 9.7.6.3 [0 ifsGK then / is holomorphic on f2 and satisfies the identity f(z)2 = f(z). Using the functional calculus formula (9.7.1.1) gives an element a = f(a) = f{a)2 GA (9.7.6.4) which commutes with a, as required by (7.5.2,1) from Definition 7.5.2. If we define g : f2 —> C by setting f - ]fsen\K g(s) =\s (9.7.6.5) I 0 if 5 £ K then g is also holomorphic on U and satisfies f(z) = zg[z). Again the functional calculus formula (9.7.1.1) gives a = ag(a) = g(a)a € a A D Aa (9.7.6.6) as required by (7.5.2.4). Finally, the spectral mapping theorem 9.7.4 says aA(a(l - /(a))) = {*(1 -s):sG cA{a)} = {0} (9.7.6.7) which combines with (9.5.3.1) from Theorem 9.5.3 to give (7.5.2.3). ■ It is the idempotent 1 — a whose functional representation is most familiar: 1 - a = 1 - f{a) = — l{z- a)'1 dz (9.7.6.8) 27T2 J 0 integrating counterclockwise once along the boundary dK of a disc K of center 0 which contains no other point of <JA{a). More generally, if K C <JA{a) is, relative to <7A(a), both open and closed, then P = XK{°) = ^lf{z-°)~ldz (9.7.6.9) K is a projection commuting with a for which (unless K = 0) K = cA(ap) (9.7.6.10) We are now able to extend (7.11.6.8) to more general A:
9.8 Essential Spectra 369 9.7.7 THEOREM If a e A for a Banach algebra A, then oo exp(a) = 1 <=> a = ^ 2ninXn(a) (9.7.7.1) n=—oo for a finite family {xn{a))neZ °f pairwise commuting projections with n ± m => XnWxm(a) = 0 and £x»(<0 = 1 (9.7.7.2) n Proo/: If p = p2 £ A, then exp(27T2» = 1 + (e2™ - l)p = 1 (9.7.7.3) and hence, if a is in the right-hand side of (9.7.7.1), then exp(a) is a finite product of l's. Conversely, if a is in the left-hand side of (9.7.7.1) then, using Theorem 9.7.4, aA(a)C27rlZ (9.7.7.4) In particular, <JA{a) has no accumulation points: thus, using (9.7.6.8), oo b= Yi 2«nx„(o)=>aA(o-6) = {0} (9.7.7.5) n=—oo We now have ||(a-6)n||1/n —>0 and exp(a-6) = l (9.7.7.6) This gives oo (a-6)(l+d) = ^(a-6)n/n! = exp(a-6)-l = 0 (9.7.7.7) n=l with n=2 Since evidently crA(d) = {0} it follows that 1 + d (= A-1 and hence that a-6 = 0. ■ 9.8 ESSENTIAL SPECTRA If the "Weyl spectrum" relative to a homomorphism is derived from the "Weyl elements," then the "Browder elements" give rise to a "Browder spectrum":
370 9. Liouville's Theorem and Spectral Theory 9.8.1 DEFINITION If T : A -> B is a homomorphism of normed algebras, then the T-Browder spectrum of a € A is the set c4omm(a) = {s G C:a - 5 £ A"1^"1^)} = H ^(a + d):T(d) = 0,ad = da} (9.8.1.1) We can obviously make analogous definitions for left and right spectra, almost spectra, and (approximate) eigenvalues. The Browder spectrum is closed and bounded and, if the homomorphism satisfies the "Riesz" condition (7.7.4.2), satisfies the spectral mapping theorem: 9.8.2 THEOREM If T : A -> B is a homomorphism of Banach algebras and a£ A, then <rBiTa) C wr(a) C c4omm(a) C aB(Ta) U accaA(a) C aA{a) (9.8.2.1) and if / : f2 —► C is holomorphic on a neighborhood f2 of cA(a), then , .comm t(„\ r~ t, .comm/j (9.8.2.2) C aBr(/(a)) U accaA/(a) C f(aB(Ta) U accaA(a)) If the homomorphism T : A —► B has the Riesz property, then cB{Ta) U accaA(a) (9.8.2.3) (a) = c4omm/(a) (9.8.2.4) Proof: The inclusions (9.8.2.1) are obtained by applying (7.7.3.3), and (7.7.4.1) from Theorem 7.7.4, to elements a — s with s € C The inclusions (9.8.2.2) are mostly clear: the middle one comes from (7.7.5.1) from Theorem 7.7.5. If T has the Riesz property (7.7.4.2), then (7.7.4.3) from Theorem 7.7.4 gives (9.8.2.3), which combines with (9.8.2.2) to give (9.8.2.4). ■ When A = BL(X,X) for a Banach space X and $ : BL(X,X) -> BL(X, X)l KL(X, X) is the Calkin homomorphism then, changing notation, we shall write <7essCn = ^BL(X>X)/Kt(x>x)(r + KL(X.X)) (9.8.2.5) "ess(r) = ««(T) (9.8.2.6) » and
9.8 Essential Spectra 371 and ^essmm(T) = t4°mmCn (9.8.2.7) respectively, for the Fredholm, Weyl, and Browder spectra of T. By Theorem 7.7.2 the homomorphism $ has the Riesz property (7.7.4.2), so that Theorem 9.8.2 applies: 9.8.3 THEOREM If T € BL(X, X) for a complex Banach space X, then °V) = ^essCH U 7rleft(r) U 7rrieht(r) (9.8.3.1) and o(T) = wess(r) u (7rleft(r) n 7rright(r)) (9.8.3.2) and »%T(T) = <ress(r) U acea(T) (9.8.3.3) Proof: If 5 € C is not in the right-hand side of (9.8.3.1), then T — si is Fredholm, and hence in particular (6.4.4.1) regular, as well as being both one-one and dense, therefore (3.8.3.12), (3.8.3.13) invertible. If instead 5 £ C is not in the right-hand side of (9.8.3.2), then T - si is by (6.5.1.2) Fredholm of index zero, so that if it is either one-one or dense then it must be both one-one and dense. By (9.8.3.1) it follows that T — si is invertible. This proves the first two equalities, and the third is (9.8.2.3). ■ We shall write ^00(r) = isoa(r)\Wess(r) (9.8.3.4) for the Riesz points of the spectrum of T. Evidently 5 £ 7r00 (T) means that 5 is isolated in cr(T), and an eigenvalue for which the eigenspace (T — 5/)_1(0) is finite dimensional, while the range (T — sI)(X) is closed and has equal finite codimension. Using the "punctured neighborhood" Theorem 7.8.5, if we remove the connected hull of the essential spectrum from the spectrum, only Riesz points will remain: 9.8.4 THEOREM If T £ BL(X, X) for a complex Banach space X, then da(T)\ae33(T) Cisoa(T) (9.8.4.1) and hence <r(T) C r,aess(T) U xM(T) (9.8.4.2)
372 9. Liouville's Theorem and Spectral Theory In particular, T £ KL(X,X) => a{T) C {0} U 7r00(r) (9.8.4.3) Proof: For (9.8.4.1) apply Theorem 7.8.5 to the operators T - si, taking for the subspace A C BL(X,X) the set {T -tl:te C}. From (9.8.4.1) and (7.10.5.2) from Theorem 7.10.5 it follows <r(T)CVce33(T)Uisoc(T) (9.8.4.4) and of course ^ess(r) U isoa(T) = r,v„(T) U (isoa(T) \ aeas(T)) (9.8.4.5) By (9.8.3.3) we have isoa(r) \ aess(r) = isoa(T) \ Wec°3mm(r) = ^oo(r) (9.8.4.6) This proves (9.8.4.2), and then (9.8.4.3) follows at once, noting that if T is compact then its essential spectrum {0} is the same as the connected hull n{o} = {o}. m It is not necessary to use the punctured neighborhood Theorem 7.8.5 to derive (9.8.4.3): 9.8.5 THEOREM If T € BL(X, X) for a normed space X and if 0 ^ e3 G (T - sl)-1^) for each sGK (9.8.5.1) then {e8)8eK is linearly independent in X. Hence, T G KL(X, X) => ace 7rleft (T) C {0} (9.8.5.2) Proof: The linear independence of eigenvectors corresponding to distinct eigenvalues reduces to the linear independence (6.1.0.1) of finite families of vectors, which can be proved by induction. Thus, if (ey)j€j are nonzero eigenvectors corresponding to complex numbers {sj)jeJ with either J = {1} or J = {1,2,..., m}, then the analogue of (9.8.5.1) is evident when m = 1, and follows for m = n + 1 from the case m = n: for if (e1? e2,..., en) is linearly independent and if en+1 = t-^e-^ + t2e2 H + £nen, then n 0 = (J - 6n+1/)en+1 = J^«J.(6n+1 - «,.)cy => *x = *2 = • • • = tn = 0 forcing en+1 = 0, a contradiction.
9.8 Essential Spectra 373 If, in contradiction to (9.8.5.2), there is an infinite sequence (sy)jGN of eigenvalues with inf • \sA = k > 0, then there will be corresponding eigenvectors (e •), necessarily linearly independent. We write n E^^T-Sjir^O) (9.8.5.3) 3 = 1 and note that this is an increasing sequence of closed subspaces, none of which equals its successor. By the Riesz lemma (Theorem 1.5.2) there is a sequence [xn) in X for which, for each n £ N, " *n € En; \\xn\\ = 1; dist(sn+1,£n) > \ (9.8.5.4) If we notice also that, for each n £ N, (T-snI)En+1CEn then it is easy to see that n?m=>\\Tzn-Txm\\>$k (9.8.5.6) Specifically, we observe that if n > m+ 1 then Txn — Txm — snxn € En_1. This gives \\Txn - Txm\\ > dist^x^E^) > i|«„| > \k. m We can offer an alternative argument which shows that, relative to the finite rank ideal, a "Browder operator" must be quasipolar: T e BL_1(X,X)€KL0(X,X) => 0 0 acca(r) (9.8.5.7) Indeed, if T = S + K with finite rank K and invertible S there will be 6 > 0 for which S — si is invertible whenever \s\ < 6, and hence if x £ X (J - sl)x = 0 => x = (5/ - 5)_1ii:x = ^(5/ - 5)_1x G ^(X) if SK = KS Since -K^X) is assumed to be finite dimensional, the linear independence of eigenvectors (9.8.5.1) ensures that for only finitely many 5 G C with \s\ < 6 can the operator S — si not be one-one. Since it is also a Weyl operator it will be invertible whenever it is one-one. We are also able now to finish Lomonosov's theorem, which we started in Theorem 7.4.5: (9.8.5.5)
374 9. Liouville's Theorem and Spectral Theory 9.8.6 THEOREM If X is a Banach space and if K € KL(X,X) \ CI is a nonscalar compact operator, then it has a nontrivial hyperinvariant subspace. Proof: If 5 £ 7rleft(iJT) is an eigenvalue of K, then W = (K-sI)-1^) (9.8.6.1) is (unless K is a scalar) a nontrivial hypervariant subspace for K: TK = KT^ T({K - sl)-1^)) C(K- s/)"1^) (9.8.6.2) If 7rleit(K) = 0, then (9.8.4.3) says that c{K) = {0}, Theorem 9.5.3 says that K is quasinilpotent, and now Theorem 7.4.5 says that K has a non- trivial hypervariant subspace. ■ We conclude by showing that, on Hilbert space, it is possible with a single compact operator to strip the spectrum of a bounded operator down to its essentials. The proof divides into two parts, one of which works on Banach space: 9.8.7 THEOREM If T e BL(X,X) for a Banach space I and O 0 then there is K € KL(X,X) with \\K\\ < e for which Cm(^^)^essm (9.8.7.1) Proof: Let {sn:n £ N} be a dense subset of C \ u;ess(T) and put e:0 = e: then by Theorem 6.5.2 there is Kx £ KL0(X,X) with H^! || < \e0 andT + Kx- sj GBL_1(X,X) (9.8.7.2) Note that if K° is given by Theorem 6.5.2 then all nonzero multiples of K° differ from T by an invertible operator. By the openness of BL_1(X,X) there is ex > 0 for which, for arbitrary S (E BL(X, X), \\S\\ <e1 =>► (J + Kx - sj) + S GBL-\X,X) and e± < \e0 (9.8.7.3) By Theorem 6.5.2 again there is K2 E KL0(X, X) with \\K2\\<^e1 and T + Kx + K2 - s2I G BL_1(X,X) (9.8.7.4) By (9.8.7.3)) we also have T + Kx + K2 - sj e BL_1(X, X) (9.8.7.5)
9.8 Essential Spectra 375 Inductively we obtain sequences [en) > 0 and [Kn) in KL0(X,X) for which ||5||<en=^lr + f^iry-«jbI:* = ll2l...lnl C BL_1(X,X) and en < \en_l (9.8.7.6) and ll*n+ill<Kand <r+^AJ.-«jbI:* = l,2,...,n+ll CBL_1(X,X) (9.8.7.7) Put oo K=Y,Kn (9-8.7.8) n=l Evidently the series converges, giving K £ KL(X,X) with \\K\\ < e\ also {T + K- snI: n £ N} C BL_1(X,X) (9.8.7.9) Since {sn: n £ N} is dense in C \ cjess(T) it follows that a(T + K) \ ue33(T) = c(T + K) \ ue33(T + K) C da(r + K) (9.8.7.10) and now Theorem 9.8.4 gives (9.8.7.1). ■ The second part of the proof assumes the condition (9.8.7.1): 9.8.8 THEOREM If T £ BL(X,X) for a Hilbert space X then there is K£ KL(X,X) for which c(T + K)=ue33(T) (9.8.8.1) Proof: We begin by finding K £ KL(X, X) for which a{T + K)C c^s°smm(r) (9.8.8.2) Since no compact subset of C \ wl33mm(T) can contain more than finitely many points of c{T) we can write a{T) \ u-ec°smm(r) C K: B£N} (9.8.8.3)
376 9. Liouville's Theorem and Spectral Theory Now put oo I-P=Y,Qn withQ0 = 0 n=1 (9.8.8.4) and Qn = Qn = Ql With Qn(X) = (T- SnI)-°°(0) the orthogonal projection whose range is the closure of the sum of the hyperkernels. Since each sn is at worst in n00(T) each of these is finite dimensional. Now find (s'n) for which, for each n € N, *'» G du~™(T) and \,'n - sn\ = dist(5„,W™(r)) (9.8.8.5) Finally put oo K=J2«-Sn)(Qn-Qn-l) (9-8.8.6) n=l Since the projections Qn — Qn_i are mutually orthogonal the series converges to a compact operator of bound < supn \s'n — sn\: we must verify (9.8.8.2). Begin by noticing a(T + K) \ uttr(T) C a(T + K) \ ue33(T) = o(t + k)\ cjess(r + k)c 7rleft(r + K)n 7rright(r + k) (9.8.8.7) Also, if tG C \ c^s°smm(r) then (T + K- tl)*-1^) = {{T + K- t^X^ C P{X) C (ir)-1(0) (9.8.8-8) Thus aright(r+ii:)\^s0smm(r) =7rrisht(r)\^s°smm(r) cfc^N} (9.8.8.9) But now t = sn => (J - */)*_1(0) C Qn{X) C P(X)-1- (9.8.8.10) Thus if tea(r + K)\ ^°smm(r) then the operator (T + K - tl)* must be not one-one, and its null space must coincide with that of (T — tl)*. By (9.8.8.8) and (9.8.8.10), however, this null space lies in the intersection of P(X) and P(X)-1-, which is {0}. This contradiction proves (9.8.8.2). For (9.8.8.1) use Theorem 9.8.7 to find K0 £ KL(X,X) for which Cm? + K0) = cess(r) = wess(r + K0) (9.8.8.11)
9.8 Essential Spectra 377 and then apply (9.8.8.2) to find Kx £ KL(X,X) for which a(T + K0 + Kx) C a;ecs°smm(r + K0) Then (9.8.8.1) holds with K = K0 + K1. m (9.8.8.12) It would be nice to be able to report equality in (9.8.8.2) for a compact operator K commuting with T. This, however, may not be possible even when T is a Riesz operator, in the sense that °ess(T) = {0} (9.8.8.13) so that by Theorem 9.8.4 also u;ess(r) = w|^mm(r) = {0}. If we take X = Y xY with Y = l2 and put T = W U 0 -W (9.8.8.14) where U is the forward shift of (7.3.6.12) and W the compact weight of (7.3.6.15), then K = W 0 0 -W and S = 0 U 0 0 (9.8.8.15) give Ke KL(X,X) and 0 and K + S = T (9.8.8.16) In particular, T satisfies the Riesz condition (9.8.8.13). It is clear also, since U is not compact, that T is not compact. Notice that W has lots of one-dimensional eigenspaces: (W - (l/n)/)"1^) = C6n for each n e N (9.8.8.17) We observe that for arbitrary A € BL(/2,y OO , v -1 ^_1(0) 2 |J [W --IJ (0) => A = 0 (9.8.8.18) since linear combinations of the Kronecker deltas 6n are dense in l2. We then claim that T has a similar property: for arbitrary S € BL(X, X), s~1(o)d |J [t-±i) (o)u |J fr+^/J (0)^5 = 0 n=l ^ ^ n=l ^ ' (9.8.8.19)
378 9. Liouville's Theorem and Spectral Theory Indeed, for each nGNwe have (0) = C and T + i/f(0) = (9.8.8.20) n(n+l)6n+l -(2n + l)*n so that if S € BL(X, X) is quasinilpotent and commutes with T then the operator induced by S on each of the one-dimensional eigenspaces (T — (l/n)/)-1^) and (T + (l/n)/)-1^) is quasinilpotent and therefore 0. It is now clear that (9.8.8.2) cannot hold with compact K commuting with T: for if so then T + K is quasinilpotent and commutes with T, therefore is 0, which would make T compact. One last observation is the extent to which the upper and lower semi- Fredholm spectra mimic left and right approximate eigenvalues: 9.8.9 THEOREM If X is a Banach space and T e BL(X, X) then ^ess(T)=at33(T)Ua-3(T) and aaess(r)cae+ss(r)na-s(r) Proof: Theorem 6.11.3 and (6.12.4.7) say that 7rleftp(r) = ae+ss(r) = f leftp(r) and 7rleftp(r) u 7rriehtp(r) = aess(r) = aP(r) and 7rrightp(r) = ae-ss(r) = frightp(r) Now (9.8.9.1) follows from (9.3.3.11) and (9.8.9.2) from (9.3.3.1). ■ 9.9 HILBERT ALGEBRA The alert reader may have noticed an omission in Definition 8.8.3: we offered no analogue in a general "Hilbert algebra" of the "positive" operators of Definition 8.6.1. We begin by making an observation: (9.8.9.1) (9.8.9.2) (9.8.9.3) (9.8.9.4) (9.8.9.5)
9.9 Hilbert Algebra 379 9.9.1 THEOREM If a G A for a complex Hilbert algebra and if k > \a\a, then the following conditions are equivalent: cA{a) CR+ = [0,oo[ (9.9.1.1) aA(o) C R and \k - a\a < k (9.9.1.2) Proof: If (9.9.1.1) holds, the first part of (9.9.1.2) is clear, and by rather a trivial part of the spectral mapping theorem 9.2.4 we have cA{k — a) = k — cA{a) C [0,A;] giving the second part of (9.9.1.2). Conversely, if (9.9.1.2) holds, then k — crA(a) = oA[k — a) C [—k,k] so that aA(a) C k-[-k,k] = [0,2k] C [0,oo[ ■ We are now prepared to make 9.9.2 DEFINITION If A is a Hilbert algebra, then aGAis called positive iff a = a* and aA(a) C R+ (9.9.2.1) We write A+ = {a€ A:a positive} (9.9.2.2) Since (9.6.6.1) self-adjoint elements have real spectra, a £ A will be positive iff a = a* and IIH -a|| < ||a|| (9.9.2.3) The positive cone A+ of a Hilbert algebra A satisfies the conditions of Definition 1.11.4: 9.9.3 THEOREM If A is a complex Hilbert algebra, then A+ n (-A+) = {0} (9.9.3.1) A+ + A+ C A+ (9.9.3.2) R+A+ C A+ (9.9.3.3) cl(A+) = A+ (9.9.3.4)
380 9. Liouville's Theorem and Spectral Theory Proof: If a G A+ and —a (E A+, then cA{a) = {0} and hence \a\a = 0. By (9.5.3.1) and the "conservative" property (9.6.6.4) it follows that a = 0, giving (9.9.3.1). If a and b are in A+ then (9.9.2.3) gives llll^ll H- 11*11 — C« H- *)|| < ||ll«ll — «H H- llll*H — *ll < H«H H- H*H and now (9.9.1.2) with k = \\a\\ + ||6|| gives a + b € A+, hence (9.9.3.2). For (9.9.3.3) we use (9.9.2.1) together with another trivial instance of the spectral mapping theorem 9.2.4, and for 9.9.3.4 we combine the definition (9.9.2.1) with the "upper semicontinuity" result Theorem 9.2.6. ■ It is rather easy for a positive element to be invertible: 9.9.4 THEOREM If a e A+ and b £ A+ in a Hilbert algebra A, then aU'^a + ^r1 (9.9.4.1) and max(||a||,||6||)<||a + 6||<||a|| + ||6|| (9.9.4.2) Proof: Using the notation (1.11.4.2) for the partial order induced by A+, which is transitive by (9.9.3.2), we can argue that if a £ A+ D A~l then k = inf aA(a) => 0 < k < a < a + b => 0 < k < inf aA(a + b) (9.9.4.3) which excludes 0 from crA(a + b) and gives (9.9.4.1). For (9.9.4.2) we argue 0 < a < a + b < \\a + 6|| => ||a|| < \\a + 6|| using (9.9.1.2) with k = \\a + 6|| at the end. This gives one quarter of (9.9.4.2). Interchanging a and b gives another quarter, and the remaining half is just a restatement of the triangle inequality (1.1.1.4). ■ If a = a* is self-adjoint in the Hilbert algebra A, and B is the closed subalgebra of A generated by a and a*, then (9.6.6.1) and (9.4.2.1) give B**aB(a)=aA(a) (9.9.4.4) since (9.6.6.2) says that crB(a) C R coincides with crA(a). Thus the commutative Gelfand-Naimark Theorem 9.6.6, applied to B, can be turned inside out to yield a massive extension of the functional calculus (9.7.0.1): if / € C^aM) ^s arbitrary, then f(a) € A is defined by setting <£(/(a)) = /(4(a)) for each <j> G Bx (9.9.4.5)
9.9 Hilbert Algebra 381 If in particular a £ A+ is positive, so that cA{a) C [0,oo[, then (9.9.4.5) applies to the square root function z1!2 : [0,00[ —► [0,oo[. This gives us another characterization of the positive elements of A: 9.9.5 THEOREM If A is a complex Hilbert algebra and a £ A+ is positive then there is a unique positive element a1/2 £ A for which a1/2 e A+ and (a1/2)2 = a (9.9.5.1) Proof: To show that a1/2 exists apply (9.9.4.5) with / = 21/2. To see that it is unique suppose that b £ A+ and b2 = a: since a1/2 £ comm^(a) it follows that 6a1/2 = a1/26, and now the double commutant D of {a1/2,6} in A is a commutative Hilbert subalgebra of A. Evidently a1/2 and b have the same Gelfand transforms in C(jD*), and therefore coincide. ■ Of course it is pretty clear that the square of a self-adjoint element is always positive: we can prove more. 9.9.6 THEOREM If a £ A for a Hilbert algebra A, then a*a£A+ (9.9.6.1) Proof: We begin, rather mysteriously, with an auxiliary result: for arbitrary a £ A there is implication -a* a U+=^a = 0 (9.9.6.2) To see this write a = b + ic and a* = b — ic, so that b and c are self-adjoint, and aa* = 262 + 2c2 - a*a (9.9.6.3) Since b and c are hermitian, and hence have real spectra, the spectral mapping theorem 9.2.4 says that b2 and c2 are both positive in the sense of Definition 9.9.1. If -a*a £ A+ then (9.9.6.3) and (9.9.3.2) imply that aa* £ A+, and now (7.1.3.7) says that also a*a £ A+. An application of (9.9.3.1), with aa* in place of a, finishes the proof of (9.9.6.2). Toward (9.9.6.1) we now write a*a = b-c with 6,c £ A+ and be = cb = 0 (9.9.6.4) In terms of the functional calculus we may take b= \(a*a+{{a*a)2y!2) and c = \{-a*a+ {{a*a)2)^2) (9.9.6.5)
382 9. Liouville's Theorem and Spectral Theory We now have (ac)*ac = c*a*ac = c{b — c)c = — c3 £ —A+ which by (9.9.6.2) gives ac = 0 and hence c3 = 0 and hence c = 0, so that finally a = b £ A+. ■ Theorems 9.9.6 and 9.9.4 can be used to extend the spectral permanence from self-adjoint elements to arbitrary elements: 9.9.7 THEOREM If a £ A for a Hilbert algebra A, then *Aft(a) C {s £ C: (a - s)*{a - s) <£ A"1} C ^{a) and <^lght(a) C {s £ C: (a - s){a - s)* <£ A'1} C f^ght(a) (9.9.7.2) Proof: If a£ A and a*a£ A~l then certainly a is left invertible and hence not a topological left zero divisor. Conversely, if a is not a topological left zero divisor, and if k > 0 satisfies ||a6|| > k\\b\\ for each & G i, then, recalling the B* condition (8.8.1.1), ||6||||a*a6|| > \\b*a*ab\\ = \\ab\\2 > k2\\b\\2 for each b £ A so that a* a is also not a topological left zero divisor in A. Since a* a is self-adjoint, therefore (9.6.6.2) has real spectrum, which therefore coincides with its boundary, (9.3.3.1) tells us that if a*a is not a topological left zero divisor then it is invertible. Applying this argument to a — s with s £ C finishes the proof of (9.9.7.1), and applying (9.9.7.1) with a* in place of a gives (9.9.7.2). ■ Rather late in the day, we are able to prove that the quotient of a Hilbert algebra by a closed two-sided ideal is a Hilbert algebra: 9.9.8 THEOREM If A is a complex Hilbert algebra and if J C A is a closed two-sided ideal of A, then J* =J (9.9.8.1) and inf \\a*a - d\\ = ( inf \\a - d\\ J for each a £ A (9.9.8.2) (9.9.7.1)
9.10 States and Representations 383 Proof: If b € A+ is arbitrary and c = 6(1 + 6)_1 then, using (9.9.4.5), c£A+ and ||c|| < 1 and ||1 - c\\ < 1 (9.9.8.3) If K C J is a finite set, with nK elements, write bK = ^2 x*x and CK = bK^ + hK)~1 (9.9.8.4) xeK Evidently these are elements of J for which, in the sense of (1.9.0.5), \\d-dcK\\ —► 0 as K —► J for each d G J (9.9.8.5) This means that if d € J then d* G cl(J) = J, giving (9.9.8.1). Also if a £ A is arbitrary then, in the same sense, \\a - ack\\ —v dist(a, J) as K —► J (9.9.8.6) This now transfers the B* condition (8.8.1.1) from A to A/ J. m In particular, the Calkin algebra is a "Hilbert algebra": if X is a Hilbert space then for each T (= BL(X, X) dist(r*r, kl(x, x)) = dist(r, kl(x, x))2 (9.9.8.7) 9.10 STATES AND REPRESENTATIONS The "states" of Definition 7.13.2 can be used to define another kind of spectrum: 9.10.1 DEFINITION If A is a normed algebra, then the numerical range of an element a £ A is the set V{a) = VA(a) = {/(a): H/ll = 1 = /(l)} = {/(a):/ € State(A)} (9.10.1.1) and the pure numerical range is the set a* {a) = {/(a): / G Extreme (State (A))} (9.10.1.2) The numerical range is closed and convex, and includes the almost spectrum:
384 9. Liouville's Theorem and Spectral Theory 9.10.2 THEOREM If a £ A for a normed algebra A, then *a(«) £ Va{°) = cl(cvx(aj(a))) C {|*| < ||a||} (9.10.2.1) and for arbitrary s,t £ C there is equality VA(sa + t)= sVA(a) +1 . (9.10.2.2) If also b £ A there is inclusion VA(a + b)C VA(a) + VB(a) (9.10.2.3) If T £ HBL(A, B) is a bounded homomorphism, then T relatively open => VB(Ta) = f] VA(a + d) (9.10.2.4) Td=0 Proof: Since State (A) is closed in the "weak* topology" of pointwise convergence on A induced by the closed unit disc in A*, which by the proof of Theorem 9.6.4 is compact, it is clear that VA(a) C C is the continuous image of a compact set, and therefore closed. It is obvious that VA(a) is contained in the disc {\z\ < \\a\\} C C; to see that it is convex observe that if f,g £ State(A) and 0 < t < 1 then \\(l-t)f+tg\\ < [l-t+t)l = {l-t)f{l)+tg{l) < ||(l-«)/+^|| (9.10.2.5) From (5.11.3.1) it is clear that the numerical range is contained in the closed convex hull of the pure numerical range. To see why it includes the almost spectrum suppose 5 £ ^a{°) ls arbitrary, and define a linear functional /0 : Ca + C —> C by setting f0(ra + t) = rs + t for each r,t £ C (9.10.2.6) By (9.2.3.1) from Theorem 9.2.3 this is well defined and satisfies the condition (7.13.2.1), and by the Hahn-Banach Theorem 5.3.2 can be extended to / £ State(A). Equality (9.10.2.2) and inclusion (9.10.2.3) can be left to the reader. With no restriction on T £ KBL(A,B), if 5 = g{Ta) with g £ State(B) then f = gT £ State(A), giving VB(Ta)CVA(a) (9.10.2.7) and, hence, forward inclusion in (9.10.2.4). Conversely, if 5 £ C is in the right-hand side of (9.10.2.4) we can define g0 : CTa + C —> C by setting g0{rTa + t) =rs + t for each r,t £ C (9.10.2.8)
9.10 States and Representations 385 Since T is relatively open, g0 is continuous, and of course well defined if 5 is in the right-hand side of (9.10.2.4). We can now extend by Theorem 5.3.2 to0GState(5). ■ One small consequence of (9.10.2.4) is that the numerical range of an element is independent of the algebra: if a £ A C B then (9.10.2.4) gives VB(a) = VA(a). We shall write Extreme(State(A)) = A* (9.10.2.9) In a Hilbert algebra, the numerical range of an element determines whether it is positive, self-adjoint or zero. 9.10.3 THEOREM If a, b € A for a Hilbert algebra A and / e State(A), then a£A+=> /(a) £ R+ (9.10.3.1) f(a*)=f(a)- (9.10.3.2) |/(6*a)\2 < f(a*a)f(b*b) (9.10.3.3) Proof: For (9.10.3.1) suppose that f(a) = s + it with s,t £ R. We must show that t = 0 < s. If k > 0 is sufficiently small then o~A(l — ka) C [0,1], giving || 1 — A;a|| = |1 — ka\a < 1 and hence |1 - ks\ < |1 - k(s + it)\ = |/(1 - ka)\ < 1 so that 5 >0. Also 6 = a +5 +Ogives ||62|| = ||6*6|| = ||(a + s)2 + A;2*2|| < ||a + s||2 + A;2*2 so that (A;2 + 2k + l)*2 = |/(6)|2 < \\a + 6||2 + A;2*2 for all k > 0 Allowing k —> oo forces t = 0. This gives (9.10.3.1), and hence, by the argument of (9.9.6.4) the numerical range of a self-adjoint element is always real. Writing a = b + ic and a* = b — ic now gives (9.10.3.2). Finally, (9.10.3.1) and (9.9.6.1) show that f{a*a) > 0 if a € A and / € State(A) (9.10.3.4) Now (9.10.3.3) follows by exactly the same argument used to prove the Schwarz inequality (8.1.2.4). ■ Theorem 9.10.3, together with a certain amount of "pulling oneself up by the bootstraps," now gives
386 9. Liouville's Theorem and Spectral Theory 9.10.4 THEOREM If a £ A for a complex Hilbert algebra A, then of (a) = {0} <=> VA{a) = {0} <=> a = 0 (9.10.4.1) a*[a) C R <=> VA(a) C R <=> a = a* £ h+(A) (9.10.4.2) of (a) = R+ <=> VA(a) CR+^fla+ (9.10.4.3) Proof: Theorem 9.10.3 suggests that at least a = a* and VA{a) = {0} =>► a = 0 (9.10.4.4) but this is clear, since (9.10.2.1) and (9.6.6.5) give °a (<0 C VA(a) and ||a|| = \a\a (9.10.4.5) To generate the second part of (9.10.4.1) from this write a = b + ic with a* = b — ic, and recall from (9.10.3.2) that f(b) and /(c) are real for all / £ State(A): thus if VA(a) = {0} then for arbitrary / £ State(A) we have 0 = f(a) = f(b) + if(c) => /(&) = /(c) = 0 (9.10.4.6) Now (9.10.4.4) gives b = c = 0 and hence a = b + ic = 0. This proves the second part of (9.10.4.1), which in turn gives the second part of (9.10.4.2): for if VA(a) C R then for arbitrary / £ State(A) we have, using (9.10.3.2), f(a - a*) = /(a) - /(a*) = /(a) - /(a)" = 0 (9.10.4.7) giving a = a* £ h^{A) by (9.10.4.1). This proves the second part of (9.10.4.2), which then combines with Definition 9.9.2 to give the second part of (9.10.4.3). Finally the first part of each part of Theorem 9.10.4 follows from the middle of (9.10.2.1) from Theorem 9.10.2. ■ From (9.10.4.5) it is also clear that a normal in A => ||a|| C i\s\:s £ <jf{a)\ C {\s\:s £ VA(a)} (9.10.4.8) Theorem 9.10.4 says, among other things, that a £ A is "positive" iff /(a) > 0 for every state f £ A*; dually a linear functional f £ A* with /(l) = 1 will be a state iff /(a) > 0 for every positive element a £ A: 9.10.5 THEOREM If A is a complex Hilbert algebra and / £ A^, then /(A+)CR+«=>||/||=/(1) (9.10.5.1)
9.10 States and Representations 387 Proof: Backward implication is (9.10.4.3). Conversely, if the left-hand side of (9.10.5.1) holds, then (9.9.6.4) gives inclusion f(h+(A)) C R. Also, if a (E h+(A) is arbitrary, then ||a|| — a and ||a|| + a are both in A+, so that a = a*=>-||a||/(l)</(a)<||a||/(l) Applying this to the element a*a for arbitrary a £ A, and using (9.10.3.3), gives |/(a)|2 = |/(l*a)|2 < /(a*a)/(l * l) < ||/||||a||2/(l) ■ Linear functional f £ A* which satisfy either side of (9.10.5.1) make up the "positive" elements of the dual space A*: (A^)+ = {fe A:f{A+) C R+} (9.10.5.2) The reader should verify the analogue of Theorem 9.9.3, showing that this satisfies the conditions of Definition 1.11.4. If A is a normed alegbra then, rather trivially, the mapping a —► La : A —► BL(A, A) (9.10.5.3) shows that A can be regarded as a closed subalgebra of the bounded operators BL(X, X) on a normed space X. Rather less trivially, the "noncom- mutative Gelfand Neumark theorem" says that if A is a Hilbert algebra then we can do this with a Hilbert space: 9.10.6 THEOREM If A is a complex Hilbert algebra, then there exists an isometric *-homomorphism tta# : A — BL(tfA#,tfA#) (9.10.6.1) from A into the bounded operators on a Hilbert space HA#. Proof: This is acheived by the pure states on A. We begin by showing how each / € State (A) induces a cyclic representation ivf:A —► BL(Hf,Hf) (9.10.6.2) from A into a Hilbert space H^ in the sense of a *-homomorphism of bound < 1 for which there is a "cyclic vector" £f e Hf for which Hf = cl {irf(a)Zf: a € A} (9.10.6.3) Specifically, if we write [taking advantage of (9.10.3.3)] Af = {a£ A: f(Aa) = {0}} = {a £ A: f(a* a) = 0} (9.10.6.4)
388 9. Liouville's Theorem and Spectral Theory for the "left kernel" of /, then we can take Hf = (A/A,)" (9.10.6.5) to be the completion of the quotient space A/A, with respect to the norm induced by the inner product defined by setting, for each a, b £ A, (a + A, ; b + A,) = f(b*a) (9.10.6.6) Evidently the coset 1 + A, of the identity 1 £ A is the cyclic vector £,. If we now take a nonempty subset K C State (A) then we can combine the cyclic representations 7r, associated with / £ K in the crudest possible way, by setting HK = l2(Hf)feK = \x £ n Hf' £ H*/H2 < °° [ (9-10'6-7) [ feK feK J and then defining *K = *2(T/)/ejc : A —> BL(HK,HK) (9.10.6.8) in the obvious way: {nK{a)x)f = irf(a)(xf) for each a£A,f £ K,x£ HK (9.10.6.9) Evidently nK is a *-homomorphism of bound < 1 whenever K C State (A) is nonempty: to make it isometric it will be sufficient, using (9.6.6.5), to ensure that it is faithful in the sense of being one-one. Now (9.10.4.8) shows that nK is faithful whenever Extreme(State(A)) = A* C K ■ (9.10.6.10) When the Hilbert algebra is commutative then the pure states A# coincide with the characters A*: 9.10.7 THEOREM If A is a Hilbert algebra there is inclusion A* C A* (9.10.7.1) with equality whenever A is commutative.
9.10 States and Representations 389 Proof: If / £ A* is a character and if / = (l-t)/o+«/i (9.10.7.2) with 0 < t < 1 and states /0 and fx then certainly A+ n r1 (o) c /o"1 (o) n /f1 (o) (9.10.7.3) and, hence, using first the decomposition (9.9.6.4) of a hermitian element as the difference of positive elements and then the decomposition (8.8.4.3) of an arbitrary element as a linear combination of hermitian elements, /"'(O) C f-'iO) and r^O) C /^(O) (9.10.7.4) Note that we need the explicit formula (9.9.6.5) for the "positive" and "negative" parts of a hermitian element here. From (9.10.7.4) and (6.1.2.2) it follows that /0 and f1 are both scalar multiples of /, and hence that fo = fi = /• Conversely, if A is commutative and / is a pure state we claim that if a € A there is implication 0<a<l=>foLa = f(a)f (9.10.7.5) Indeed, if 0 < f{a) < 1 we have (9.10.7.2) with < = /(*) /iW"VoLa /0 = (l-/(a))-1(/-/oLa) (9.10.7.6) giving the right-hand side of (9.10.7.5) by the pure condition. If instead /(a) = 0 then for arbitrary b = b* £ h+(A) there is k £ R for which —k < b < k, which means (using the commutivity of A) that — ka < ab < ka, and hence, that —kf(a) < f(ab) < kf(a). Thus / o La annihilates hermitian elements and is therefore 0, giving again (9.10.7.5). If instead f(a) = 1 then the same argument shows that foLa = f. Since every element of A is a linear combination of elements a with 0 < a < 1 it follows that f°LA = f for all a, which means that / is a character. ■
10 Comparison of Operators and Exactness The various kinds of invertibility and singularity have "relative" analogues, in which one operator is compared to another. If we mix both left and right comparisons and then specialize, we come down to concepts of "exactness." 10.1 MAJORIZATION AND FACTORIZATION If T £ L(X,Y) and S (E L(X, Z) are linear operators between normed spaces with the same domain space X then there seem to be five basic comparisons between S and T: 10.1.1 DEFINITION S e L(X,Z) is called a left multiple of T e BL(X,y) if there is U £ BL(Y, Z) for which S = UT (10.1.1.1) and an almost left multiple ofTe BL(X,Y) if there is (Un) in BL(Y,Z) for which ||5 - UnT\\ —► 0 and sup \\Un\\ —► oo (10.1.1.2) n and an approximate left multiple of T if there is (Un) in BL(Y, Z) for which \\S-UnT\\^0 (10.1.1.3) We shall say that S is majorized by T if there is k > 0 for which \\Sx\\ < k\\Tx\\ for each x £ X (10.1.1.4) 391
392 10. Comparison of Operators and Exactness and that S is determined by T if there is inclusion r_1(0) C5_1(0) (10.1.1.5) For example, S (E L(X, Z) is bounded if and only if it satisfies (10.1.1.4) with Y = X and T = J; if instead S = I then (10.1.1.1) means that T is left invertible, (10.1.1.2) and, using Theorem 3.7.2, (10.1.1.3), mean that T is almost left invertible, (10.1.1.4) means that T is bounded below, and (10.1.1.5) means that T is one-one. It is not clear how to extend the "closed" condition (3.3.1.2) from i" to more general S. We recall from (6.9.2.11) how (10.1.1.3) holds with S = P when T is upper semi-Fredholm and P = P2 € BL(X,X) satisfies P_1(0) = T_1(0). Each of the conditions in Definition 10.1.1, with the exception of (10.1.1.3), is transitive: 10.1.2 THEOREM Each of the conditions (lO.l.l.j) with j ^ 3 is transitive. If T G BL(X,y) and S G BL(X,Z) there is implication (10.1.1.1) =>► (10.1.1.2) =>► (10.1.1.4) | | (10-1-2.1) (10.1.1.3) =>► (10.1.1.5) If T is proper with complemented range there is implication (10.1.1.5) =)> (10.1.1.1) (10.1.2.2) Proof: If S = UT and R = VS then R = {VU)T, so that (10.1.1.1) is transitive. If \\S - UnT\\ -> 0 and \\R - VnT\\ -> 0 then, provided supn||Vn||<oo, ||JZ - VnUnT\\ < \\R - VnS\\ + (sup ||VJ|) ||S - UnT\\ —> 0 (10.1.2.3) m and, if in addition, supn \\Un\\ < oo, then also supn ||Vnl7n|| < oo. This proves that (10.1.1.2) is transitive. If ||5(-)|| < *||T(-)|| and ||-R(-)|| < h\\S{-)\\ then ||iE(.)|| < ^||r(-)||, so that (10.1.1.4) is transitive, and the transitivity of (10.1.1.5) is obvious. Towards (10.1.2.1) it is clear that if (10.1.1.1) holds, then so does (10.1.1.2) with Un = U for each n £ N. If (10.1.1.2) holds, then (10.1.1.3) is obvious, and (10.1.1.4) also holds with k = supn \\Un\\: for if x € X is arbitrary then \\Sx — UnTx\\ —► 0 and ||C/nTx|| < fc||Tx||. We leave the last two implications to the reader. Finally, if (10.1.1.5) holds we can define a linear mapping U0 : TX —► Z by setting U0(Tx) = Sx for each xeX (10.1.2.4) Now if T is relatively open then U0 is bounded, and if T(X) is closed and
10.1 Majorization and Factorization 393 complemented, with Q = Q2 £ BL(Y,Y) satisfying Q(Y) = T(X)9 then we can satisfy (10.1.1.1) by setting U = U0Q ■ (10.1.2.5) The argument for (10.1.2.2) also shows that if TX is closed and complemented, then (10.1.1.4) =>» (10.1.1.1). The proof of the transitivity of the relation (10.1.1.2) also shows that if S is almost left invertible and T is approximately left invertible then the product ST is approximately left invertible. If instead T £ L(X, Y) and S € L(Z, Y) have the same codomain space Y then there are seven basic comparisons between S and T: 10.1.3 DEFINITION S e L(Z,Y) is called a right multiple of T e BL(X,y) if there is U € BL{Z,X) for which S = TU (10.1.3.1) and an almost right multiple of T if there is (Un) in BL(Z,X) for which \\S-TUn\\ —>0 and sup \\Un\\ < oo (10.1.3.2) n and an approximate right multiple of T if there is (Un) in BL(Z, X) for which \\S-TUn\\ —>0 (10.1.3.3) We shall say that S is comajorized by T if there is k > 0 for which Sz e TDisc(0 ; k\\z\\) for each z e Z (10.1.3.4) and almost comajorized by T if there is k > 0 for which Sz £ clTDisc(0 ; A;||^||) for each zG Z (10.1.3.5) We shall say that S is range-included in T if there is inclusion S(Z) C T{X) (10.1.3.6) and almost range-included in T if there is inclusion S{Z) C cl(rX) (10.1.3.7)
394 10. Comparison of Operators and Exactness For example, if Y = X and T = J, then each of (10.1.3.4) and (10.1.3.5) reduce to the condition that S be bounded: if instead Y = Z and S = I, then the conditions of Definition 10.1.3 reduce to T being, respectively, right invertible, almost right invertible (twice, using Theorem 3.7.2), open, almost open, onto, and dense. We recall from (6.9.3.18) how (10.1.3.5) holds when T is lower semi-Fredholm and S = Q = Q2 with Q(Y) = cl(TX). With the exception of (10.1.3.3), each of the conditions of Definition 10.1.3 is transitive: 10.1.4 THEOREM Each of the conditions (10.1.3J) with j ^ 3 is transitive. If T G BL(X,y) and S G BL(Z,Y) there is implication (10.1.3.1) => (10.1.3.4) => (10.1.3.6) i i (10.1.3.2) => (10.1.3.5) i \ J L (10.1.3.3) > (10.1.3.7) (10.1.4.1) If T is proper with complemented null space there is implication .(10.1.3.7) => (10.1.3.1) (10.1.4.2) Proof: Using the corresponding arguments for Theorem 10.1.2 as a guide, we leave the transitivity arguments to the reader. If (10.1.3.1) holds then so does (10.1.3.2), with Un = U for each n € N, and so also does (10.1.3.4), with k = \\U\\. If (10.1.3.2) holds then (10.1.3.3) is obvious, and (10.1.3.5) also holds, with k = supn \\Un\\. We leave the rest of the implications of (10.1.4.1) to the reader. Finally, if (10.1.3.6) holds we can define a linear mapping U0 : Z —► X/T_1(0) by setting, for each z € Z and x £ X, U0z = x + T-1(0) ifSz = Tx (10.1.4.3) If T(X) = c\(TX) is closed then of course (10.1.3.7) implies (10.1.3.6). If also T is relatively open then U0 is bounded, and finally if T_1(0) = P_1(0) with P = P2 £ BL(X, X) we may replace U0 by U : Z -> X for which Uz = Px ifSz = Tx m (10.1.4.4) 10.2 MIXED INTERPOLATION If linear operators T G L(X,Y) and S G L(Y, Z) satisfy S-1{0)=T{X) (10.2.0.1)
10.2 Mixed Interpolation 395 then the pair (S, T) is sometimes described as exact This is a rather special case of the common generalization of the comparisons of Definitions 10.1.1 and 10.3.1: 10.2.1 DEFINITION If T £ L(X,y), S e L(W,Z) and R e L(W,Y), then we shall call R a (left,right) multiple of (S,T) if there are U € BL(W,X) and V e BL(Z,Y) for which R = VS + TU (10.2.1.1) and an almost (left,right) multiple of (S, T) if there are (Un) in BL(W,X) and (VJ in BL(Z,Y) for which \\R-VnS-TUn\\^0 and sup ||iy +sup \\Vn\\ < oo (10.2.1.2) n n and an approximate (left,right) multiple of (S,T) if there are (Un) in BL{W,X) and (VJ in BL(Z,Y) for which \\R-VnS-TUn\\-+0 (10.2.1.3) We shall say that R is (left,right) majorized by (S, T) if there are k > 0 and h > 0 for which iZiu G Disc(0 ; /i||5t(;||) + T Disc(0 ; Jb||u;||) for each w G W (10.2.1.4) and almost (left,right) majorized by (S, T) if there are k > 0 and h > 0 for which iZiu G cl(Disc(0;fe||5u;||)+rDisc(0;ik||t(;||)) for each w £ W (10.2.1.5) We shall say that R is (left,right) determined by (S,T) if JR(S'-1(0)) CT(X) (10.2.1.6) and almost (left,right) determined by (S,T) if RiS'1 (0)) C cl(rX) (10.2.1.7) The reader may notice that we have used just seven of a possible 35 combinations of the conditions of Definitions 10.1.1 and 10.1.3. If, for example, 5 = 0 then Definition 10.2.1 reduces to Definition 10.1.3 with R in place of 5; if instead T = 0 then this all reduces to Definition 10.1.1 with (R,S) in place of (5,T). If W = Y and R = I, and if ST = 0, then (10.2.1.5) reproduces the exactness condition (10.2.0.1). Thus, if R = I and ST = 0 we can think of each of the conditions of Definition 10.2.1 as
396 10. Comparison of Operators and Exactness some kind of "topological exactness." For example the condition (10.2.1.5) is satisfied if R = I and T = J : X —► Y is the natural embedding of a subspace X C Y, with S = K : Y —► Z = Y/c\(X) the corresponding quotient; in this situation the condition (10.2.1.1) says that X is closed and complemented in Y. The conditions of Definition 10.2.1 satisfy the same pattern of implication as those of Definition 10.1.3: 10.2.2 THEOREM If T £ BL(X, Y), S £ BL(W, Z) and R £ BL(W, Y) there is implication (10.2.2.1) (10.2.1.1) => (10.2.1.4) => (10.2.1.6) i i (10.2.1.2) => (10.2.1.5) i \ J (10.2.1.3) > (10.2.1.7) If S and T are both regular there is implication (10.2.1.7) => (10.2.1.1) (10.2.2.2) Proof: If (10.2.1.1) holds, then so does (10.2.1.2), with Un = U and Vn = V for each n £ N, and so also does (10.2.1.4), with k = \\U\\ and h = \\V\\. If (10.2.1.2) holds, then (10.2.1.3) is obvious, and (10.2.1.5) also holds with k = supn \\Un\\ and h = supn ||Vn||: for if w £ W is arbitrary then \\Rw — VnSw - TUnw\\ -> 0 with VnSw £ Disc(0 ; h\\Sw\\) and TUnw £ T Disc(0 ; k||w||). The remaining implications of (10.2.2.1) are left to the reader. For (10.2.2.2) suppose T = TT'T and S = SS'S with T £ BL(Y,X) and S' £BL(Z,W) and write P = S'S and Q = TT' (10.2.2.3) for the projections of (3.10.1.3): then if (10.2.1.7) holds we have (/- Q)"1^) = Q(Y) = c\(TX) D R(S-l(0)) = R(P-1(0))=R((I-P)(W) which, is equivalent to (J - Q)R{I - P) = 0 (10.2.2.4)
10.2 Mixed Interpolation 397 Thus R = QR- QRP + RP = (RSf - TT'RS')S + TlT'R) V (10.2.2.5) = [RS')S + T{T'R - T'RS'S) giving us two candidates for a pair (V, 17) to satisfy (10.2.1.1). ■ The transitivity properties of the relations of Definition 10.2.1 are more complicated, and fall into two sets. The first six are fairly obvious: 10.2.3 THEOREM Suppose T £ BL(X,y), S £ BL(W,Z) and R £ BL(W,Y): then if T £ BL(X',Y) and S' £ BL(W,Z') we have the following implications: R a (left,right) multiple of (S, T) and S a left multiple of S' and T a right multiple of T' (10.2.3.1) =^i2a (left,right) multiple of (S',T') R an almost (left,right) multiple of (S, T) and S an almost left multiple of S'and T an almost right multiple of Tf (10.2.3.2) => R an almost (left,right) multiple of (Sf,Tf) R (left,right) majorized by (5, T) and S majorized by Sf and T comajorized by T' (10.2.3.3) => R (left,right) majorized by (S',T') R almost (left,right) majorized by (5, T) and S majorized by Sf and T almost comajorized by Tf (10.2.3.4) => R almost (left,right) majorized by (Sf,Tf) R (left,right) determined by (S,T) and S determined by S' and T range-included in T' (10.2.3.5) => R (left,right) determined by (S',T') R almost (left,right) determined by (S, T) and S determined by Sf and T almost range-included in T' (10.2.3.6) => R almost (left,right) determined by (Sf,Tf)
398 10. Comparison of Operators and Exactness Proof: lfR = VS + TU and S = V'S' and T = T'U' then R = [VV')S' + T'{U'U), giving (10.2.3.1). If \\VnS + TUn - R\\ -> 0 and \\V^S' - S\\ -> 0 and ||T'i74 - T|| -> 0, and if supn ||V^|| < oo and supn ||C/J| < oo, then \\VnV^S' + T'U'nUn - R\\ -> 0, giving (10.2.3.2). If for each w e W and each xGX we have J)isc{Rw; h\\Sw\\) nTJ)isc{0; k\\w\\) ^ 0 and ||5u;|| < h'\\S'w\\ and Tx £ {T'x'i \\x'\\ < k'\\x\\} then also D'isc(Rw ; hh'\\S'w\\) D r'Disc(0 ; ikik'Hwll) ^ 0, giving (10.2.3.3). The derivation of (10.2.3.4) is very similar, and then (10.2.3.5) and (10.2.3.6) are left to the reader. ■ Not quite so obviously, 10.2.4 THEOREM Suppose T £ BL(X,Y), S £ BL(W,Z) and R £ BL(W,Y): then if S' £ BL(Y,Y') and T £ BL(W',W) satisfy S'T = 0 = ST' (10.2.4.1) We have the following implications: R a (left,right) multiple of (S,T) => S'R a left multiple of S (10.2.4.2) and RT' a right multiple of T R an almost (left,right) multiple of (5, T) => S'R an almost left multiple of S (10.2.4.3) and RT' an almost right multiple of T R (left,right) majorized by (S, T) (10.2.4.4) => S'R majorized by S and RT' comajorized by T R almost (left,right) majorized by (5, T) => S'R majorized by S (10.2.4.5) and RT' almost comajorized by T R (left,right) determined by (5, T) (10.2.4.6) => S R determined by S and RT range-included in T R almost (left,right) determined by (5, T) => S'R determined by S (10.2.4.7) and RT' almost range-included in T
10.2 Mixed Interpolation 399 Proof: If R = VS + TU then (10.2.4.1) give S'R = (S'V)S and RT' = r(UT'), which proves (10.2.4.2). If instead \\R - VnS - TUn\\ -► 0 then (10.2.4.1) gives \\S'VnS-S'R\\ -► 0 and \\TUnTf-RT'\\ -► 0, which proves (10.2.4.3), and indeed the analogue for approximate (left,right) multiples. If for each w £ W we have Disc^w; ; k||Su;||) D TDisc(0 ; k\\w\\) ^ 0 so that there is x £ X with \\Rw — Tx\\ < h\\Sw\\ and ||x|| < k\\w\\ then (10.2.4.1) says that ||«S"iZiu|| = \\S'(Rw - Tx)\\ < h\\Sw\\ and also that if w = T'w' then \\Rw - Tx\\ = \\RT'w' - Tx\\ < h\\ST'w'\\ = 0 with ||x|| < Jb||u;|| = ik||rV|| < fc||r'||||u/||. This proves (10.2.4.4), and the argument for (10.2.4.5) is the same. We leave (10.2.4.6) and (10.2.4.7) to the reader. ■ The implication (10.2.2.2) at the end of Theorem 10.2.2 can be extended to more general S and T if we specialize R: 10.2.5 THEOREM If T £ BL(X,y), S £ BL(W,Z), and R £ BL(W,Y) satisfy S relatively almost open and R of finite rank (10.2.5.1) then there is implication (10.2.1.6) => (10.2.1.1) (10.2.5.2) Proof: If R £ BL(W, Y) is of finite rank then by Theorem 6.3.2 we may write n+m R=YlkJ® yj (10.2.5.3) y=i where (t/y)?^^ is a basis for the finite dimensional space R(W) C Y, and by Theorem 6.1.1 we can arrange that n J^ Cyy = R(W) n T{X) (10.2.5.4) y=i Thus there is (x;)y=1 in X for which Txj = t/y for each j £ {l, 2,..., n} (10.2.5.5) We claim, using condition (10.2.1.6) that iE(5_1(0)) C T(X), that m S-1^) C fl fc,-+m(0) (10.2.5.6) t=l
400 10. Comparison of Operators and Exactness because if Sw = 0 then R(w) £ T(X), which means that kj(w) = 0 if j > n + 1. As in the proof of Theorem 5.5.3 this means that there is (ky)*+£+i in Z* for which kj = hjS for each j e {n + 1,..., n + m} (10.2.5.7) Define tf : S(W) -► C by setting h^Sw) = kj(w) for each w £ W, using the relatively almost open property of S to see that h% is well defined and bounded, and then extend to h- by the Hahn-Banach Theorem 5.3.2. But now n n+m U = Y,kjQxj,V = Y, hj® Vj =>VS + TU = R (10.2.5.8) y=i y=n+i giving (10.2.1.1). ■ Since the totally bounded operators KL(X, Y) form a "two-sided ideal" in the sense of (3.9.6.7) it is clear that if (10.2.1.1) holds then T £ KL(X, y), S £ KL(W, Z)=>R£ KL{W, Y) (10.2.5.9) We can prove more: 10.2.6 THEOREM If T G BL(X,y), S G BL(W,Z), and R £ BL(W,Y) satisfy Rw G c\(D'isc(0 ;h\\Sw\\) + TDisc(0;A;||w;||) for each w G W (10.2.6.1) then \\R\\'~<4k\\T\\'m + 2h\\S\\'„ (10.2.6.2) In particular (10.2.1.5) =>► (10.2.5.9) (10.2.6.3) Proof: By definition (6.13.1.6) there are for each e > 0 finite subsets Ke C X and He C W such that {Tx: ||s|| < 1} C Disc(T(Ke) ; 2||r||'ess + e) and (10.2.6.4) {Sw: H| < 1} C Disc(S(2T.); 2||5||US + e) Now if w € W is arbitrary there is w' £ W for which w' e \\w\\H£ and \\Sw - Sw'\\ < {2\\S\\'m + e)\\w\\ (10.2.6.5)
10.3 Exactness 401 and then for arbitrary 6 > 0 there is x £ X for which \\R(w - w') - Tx\\ < h\\Sw - Sw'\\ + S\\w\\ and ||x|| < fc||tc; - w'\\ (10.2.6.6) Finally there is xf (E X for which x'€\\x\\Ke and \\Tx-Tx'\\ < (2\\T\\'eaa +e)\\x\\ (10.2.6.7) Thus \\Rw - Rw' - Tx'\\ < h\\Sw - Sw'\\ + \\Tx - Tx'\\ + 6\\w\\ < h(2\\S\\'ess + e)\\w\\ + (2||r||'ess + e)||x|| + 6\\w\\ <HnS\\'ess + s)\\w\\ + (2\\T\\'ess + s)k\\w-w'\\ + 6\\w\\ < (2&||5||U + 4||T||U)lhll + {he + 2ke + 6)\\w\\ noting that \\wf\\ < \\w\\. This gives {Rw:\\w\\ < 1} C Disc(L,„2fc||5||^, +4*11111^ + ^) (10.2.6.8) with Le, = R(He) + T(kKe) C W (10.2.6.9) as soon as he + 2ke + 6 < e'. ■ 10.3 EXACTNESS When W = Y and R = I the conditions of Definition 10.2.1 can be thought of as various kinds of exactness, particularly when ST = 0. We shall write BL(X,y,Z) = {{S,T) GBL{Y,Z) x BL(X,Y):ST = 0} (10.3.0.1) for the set of these chains from X to Y to Z: note that they form a closed subset, not necessarily linear, of the product space BL(Y,Z) x BL(X, Y). 10.3.1 DEFINITION The chain (5, T) € BL(X, Y, Z) is called (hft,right) invertible, or decomposably exact, if there are U £ BL(Y,X) and V € BL(Z,Y) for which VS + TU = I (10.3.1.1) It is called almost invertible, or almost decomposably exact, if there are (Un) in BL{Y,X) and (VJ in BL(Z,Y) for which \\VnS + TUn-I\\^>0 and sup \\UJ +sup ||VJ| < oo (10.3.1.2) n n
402 10. Comparison of Operators and Exactness and is called approximately invertible, or approximately decomposably exact, if there are (17J in BL{Y,X) and (VJ in BL{Z,Y) for which \\VnS + TUn-I\\^>0 (10.3.1.3) We shall call the chain (S, T) exact if there are k > 0 and h > 0 for which for each y £ Y, y £ Disc(0 ; h\\Sy\\) + TDisc(0 ; fc||y||) (10.3.1.4) and almost exact if there are k > 0 and h > 0 for which for each y £ Y, y £ cl(Disc(0 ; h\\Sy\\) + TDisc(0 ; ife||y||)) (10.3.1.5) We shall call the chain (S, T) linearly exact if S_1(0) CT(X) (10.3.1.6) and almost linearly exact if S_1(0) Ccl(rX) (10.3.1.7) As a matter of notation we shall write BL_1(X,y,Z) = {{S,T) £ BL(X,Y,Z):(S,T) invertible} (10.3.1.8) aieft,right BL^ y? ^ = BL^Xj y? Zj \ BL-i (x, Y, Z) (10.3.1.9) aleft'rightBL(X,y,Z) = {{S,T) £ BL{X,Y,Z): (S,T) not almost invertible} fleft'rightBL(X,y,Z) = {(S,T) £ BL(X,Y,Z): (S,T) not almost exact} 7rleft'rightBL(X,y,Z) = {{S,T) £ BL(X,Y,Z): (S,T) not almost linearly exact} Finally we shall write (10.3.1.10) (10.3.1.11) (10.3.1.12) rleft>rightBL(X,y,Z) = {(S,r) £ BL(X,Y,Z):(S,T) not strictly exact} (10.3.1.13) where we call (S,T) £ BL{X,Y,Z) strictly exact iff (S,T) is exact and S(Y) is closed (10.3.1.14) If the chain (S,T) is invertible in the sense of (10.3.1.1) then it is also
10.3 Exactness 403 regular in the sense that each of S and T are regular: rBT(x, y, z) = bl(x, y, z) n fBTfy, z) x rBT(y, x)) (10.3.1.15) In this case we can arrange that its "generalized inverse" is another chain. Indeed if ST = 0 and T = TT"T and S = SS"S (10.3.1.16) with T" = T"TT" and S" = S"SS" and we take (T', 5") = (T", (J - TT")S") or (T', S') = (T"(I - S"S),S") (10.3.1.17) then T = TTfT and S = SS'S and r'S' = 0 (10.3.1.18) Definition 10.3.1 extends to "chains" of length 3 and longer. If for example, T G BL(X,Y), S £ BL(Y,Z) and R G BL(Z,W) satisfy ST = 0 = RS (10.3.1.19) we shall write (R, S, T) £ BL(X, y, Z, W) (10.3.1.20) and again speak of a chain. To extend Definition 10.3.1 we simply impose corresponding conditions on each of the subchains (S,T) and (R, S). For the first two conditions, the result can be improved: 10.3.2 THEOREM If (R, S, T) € BL(X, Y, Z, W) is invertible, then there is a chain (T',S',Rf) e BL(W,Z,Y,X) for which S'S + TT' = I = R'R + SSf (10.3.2.1) If (R, S, T) € BL(X, y, Z, W) is almost invertible, then there are (T'n), (S'n), and (R'n) for \\S'nS + TT'n - I|| + \\R'nR + SS' - I\\ — 0 and (10.3.2.2) 8Up||^||+8Up|K||+8UP|K||<00
404 10. Comparison of Operators and Exactness Proof: If ST = 0 = RS and S'S + TT' = I = R"R + SS" (10.3.2.3) then, multiplying on either side by S' or S", (J - SS') (J - R"R) = 0 = (J - TT') (J - S"S) (10.3.2.4) which gives (10.3.2.1) with R'=(I- SS')R" (10.3.2.5) If we assume only that (R,S,T) is almost invertible then there are (T£), (5^), (54'), and (#£) furnishing "almost inverses" for (S,T) and (£,£), respectively. Now the reader can confirm that (10.3.2.2) holds with R'n = (I - SS')R% for each n £ N ■ (10.3.2.6) If we observe how Theorem 10.3.2 is proved, it is clear how the result extends to sequences (T1? T2,..., Tn) of operators in which 2y+1Ty = 0 for each j. Regularity is again the bridge between both ends of Definition 10.3.1: 10.3.3 THEOREM If (S,T) e BL(X,Y,Z) is invertible then it is also regular, with an inverse chain (17, V) (E BL(Z, Y, X) for which VS + TV = I and UV = 0 (10.3.3.1) Necessary and sufficient for (S,T) £ BL(X, Y, Z) to be invertible is (S,T) is regular and almost linearly exact (10.3.3.2) Necessary and sufficient for T £ BL(X,Y) to be regular is that there are Z and S £ BL(Y, Z) for which ST = 0 and (S,T) is invertible (10.3.3.3) Necessary and sufficient for S £ BL(Y, Z) to be regular is that there are X and T £ BL(X,Y) for which (10.3.3.3) holds. Necessary and sufficient for T £ BL(X,Y) to be decomposably regular is that there is S £ BL(Y,X) for which ST = 0 = TS and (S,T) and (T, S) are invertible (10.3.3.4)
10.3 Exactness 405 Proof: If (S, T) £ BL(X, Y, Z) satisfies the condition (10.3.1.1), then multiplying on the left by 5, and multiplying on the right by T, give S = SVS and T = TUT (10.3.3.5) which tells us that S and T are both regular. If (10.3.1.1) and (10.3.3.5) both hold then U' = UTU and V = VSV give V'S + TU' = 1 and U' = U'TU' and V = V'SV' and U'V = 0 (10.3.3.6) since U'V = U(TU)(VS)V and (TU)(VS) = {TU){I - TU) = 0. This establishes (10.3.3.1), and shows that the first part of (10.3.3.2) is necessary. The second part of (10.3.3.2) is necessary, from Theorem 10.2.3, which also shows that the whole condition (10.3.3.2) is sufficient. If T = TT'T is regular then the condition (10.3.3.3) holds with Z = Y and S = I - TT'\ if instead S = SS'S is regular then (10.3.3.3) holds with X = Y and T = I - S'S. Finally if the condition (10.3.3.4) holds then there are U',V,U" and V" for which / = u"T + SV" = V'S + TU' Now, if we take U = U"TU' and V = V'SV", then we get, with P = UT, R = T + V(I - P) and R' = R + PU, P = P2 and RP = T and R'R = I = RR' (10.3.3.7) By (3.8.5.2) this is the condition that T is decomposably regular: conversely if (10.3.3.9) holds then so does (10.3.3.4) with S = {I - P)R'. m If (10.3.3.1) holds we shall call (U,V) £ BL{Z,Y,X) a (left,right) inverse for the chain (5,T) e BL(X,Y,Z). Almost invertible chains come in open sets; we begin with an auxiliary result: 10.3.4 THEOREM Each of the following conditions is sufficient for J G BL(y,y) to be a (left,right) multiple of {S + H,T + K) G BL(Y,Z) x BL(X,y): VS + TU = I and VSTU = V(S + H)(T + K)U and (10.3.4.1) H'{I + VH) =/=(/ + KU)T'
406 10. Comparison of Operators and Exactness V(S + H) + (J + K)U e BL-\Y,Y) and (10.3.4.2) V(S + H){T + K)U = (T + K)UV(S + H) V(S + H) + (T + K)U and U(T + K) are invertible and (10.3.4.3) V(S + H)(T + K) =0 ^(5 + H) + (J + K)U and (5 + ^)7 are invertible and (10.3.4.4) (5 + ^)^ + ^)17 = 0 Proof: The reader can verify the identity V(S + H)(I+KU) + {I+VH)(T+K)U-(I+VH){I+KU) = VS + TU-I (10.3.4.5) This means that the first two parts of (10.3.4.1) together imply V(S + H)(I + KU) + (/ + V#)(r + K)U = (I + VH)(I + KU) and this with the last part of (10.3.4.1) gives (H'V) {S + H) + {T + K) [UK1) = I (10.3.4.6) If instead (10.3.4.2) holds then, with R = V(S + H) + (T + K)U R^ViS + H) + (T + K)UR~l = I (10.3.4.7) If instead (10.3.4.3) holds, then .R-1^^ + H) + (J + K)(U(T + K))-1 = I (10.3.4.8) If instead (10.3.4.4) holds, then ((S + HW-^S + ^ + iT + K^R-1 =1 m (10.3.4.9) The open set result now follows: 10.3.5 THEOREM If X, Y and Z are normed spaces, then aleft>riehtBL(X,y,Z) is closed: {(5,T) £ BL(X, y, Z): (5, T) is almost invertible} (10.3.5.1) is open in BL(X, y, Z).
10.3 Exactness 407 Proof: If (S,T) G BL{X,Y,Z) satisfies (10.3.1.2) and if (S',T') = {S + H, T + K) € BL(X, Y, Z) is sufficiently close to (S, T) that sup||J+Vn#|| < 1 and sup \\I + KUn\\ < 1 (10.3.5.2) where of course (Un) and (Vn) are taken from (10.3.1.2), then by the argument of Theorem 3.7.2 there are (Hfn) and [K'n) for which \\H'n(I + VnH) - I|| + ||(/ + KUn)K'n -1\\ — 0 and (10.3.5.3) sup|K||+sup|K||<oo n n Using the identity (10.3.4.5) we now argue \\H'nVn(S + H) + (T + K)UnK'n-I\\ < \\H'n(VnS + TUn - I)K'J + \\H'n(I + VnH)(I + KUn)K'n - I\\ + \\H'nVn(S + H)(I-(I + KUn)K'J + \\(I-H'n(I + VnH)(T + K)UnK'J^0 Bsn^n noting that for each n, VnSTUn = 0 = Vn(S + H)(T + K)Un m The fundamental examples of exact and almost exact chains come from subspaces, closed or not, and their associated quotients. Recall the natural injection Jw : W —► Y of (2.3.0.3) induced by a subspace W C Y, together with the quotient mapping Kw : Y —► y/cl(W) induced by cl(W) as in (2.3.0.4). Examples are the mappings ker(T) and coker(T) of (2.3.0.6) and (2.3.0.7): 10.3.6 THEOREM If W C Y is a subspace, then (KW,JW) is almost exact (10.3.6.1) and if W is closed, then (KW,JW) is exact (10.3.6.2)
408 10. Comparison of Operators and Exactness If T e BL(X, Y) and S G BL(y, Z), then the following are equivalent: J is almost (left,right) majorized by (coker(T),ker(5')) (10.3.6.3) ST = 0 (10.3.6.4) J is (left,right) majorized by (coker(r),ker(S)) (10.3.6.5) Proof: If W C Y is a subspace then the Riesz lemma Theorem 1.5.1 says that (10.3.1.5) holds with (S,T) = (KW,JW) and k = 2 and h = 1. If, in particular, W is closed then the auxiliary result (1.5.1.7) says that (10.3.1.4) holds with (S,T) = (Kw, Jw) and k = 2 and h = 1 + e > 1. Implications (10.3.6.5) =>» (10.3.6.3) => (10.3.6.4) follow from Theorem 10.2.2. We complete the proof by using the Riesz lemma result (1.5.1.7) to show that (10.3.6.4) implies (10.3.6.5). Indeed if y € Y and e > 0 are arbitrary then (1.5.1.7) gives w £ Y for which Sw = 0 and ||y-w|| < (* + £) dist^S"1^)) (10.3.6.6) and ||w;|| < 2\\y\\ The first part of this condition says w = kei(S)w (10.3.6.7) while if ST = 0 then the middle part of this condition gives \\y - w\\ < (1 + e) dist(t/,cl(rX)) = (l + e)\\ cokei(T)y\\ ■ (10.3.6.8) From Theorem 10.3.6 we are able to resolve exactness and almost exactness of chains into components: 10.3.7 THEOREM Necessary and sufficient for (S,T) e BL(X,Y,Z) to be almost exact is that there are k > 0 and h > 0 for which, for each t/^7, dist(t/,rX) <h\\Sy\\ (10.3.7.1) and Sy = 0 => y G cl {Tx: \\x\\ < k\\y\\} (10.3.7.2) Necessary and sufficient for (S, T) € BL(X, Y, Z) to be exact is that in addition Sy = 0 => y G {Tx: \\x\\ < k\\y\\} (10.3.7.3)
10.3 Exactness 409 Proof: If J is almost (left,right) majorized by (5, T) then, whether or not ST = 0, both (10.3.7.1) and (10.3.7.2) follow at once, without change of constants. If in particular J is (left,right) majorized by (S, T) then also the condition (10.3.7.3) holds. Conversely, if ST = 0 then (10.3.6.3) says that for each y GY there is (wn) in Y for which Swn = 0 and limsup ||y + iun|| < dist(t/,Tx) (10.3.7.4) and KJ| < 2||t/|| By (10.3.7.2) there is (x J in X for which ||u;n-rsn||—0 and ||xn|| < A||u;n|| and now (10.3.7.1) says limsup ||2/ — Txn\\ = limsup ||y — wn\\ < d\st(y,TX) < h\\Sy\\ n n and (10.3.7.5) Kll < *KJ| < 2*||y|| But this means that (5, T) satisfies the condition (10.3.1.5). If instead of (10.3.6.3) we use (10.3.6.5) then for each y GY and arbitrary e > 0 there is w £ Y for which Sw = 0 and \\y - w\\ < (1 + e) dist(y,TX) and \\w\\ < 2\\y\\ (10.3.7.6) If (10.3.7.3) is assumed then there isxGX for which w = Tx and ||x|| < fc||tu|| Going back to (10.3.7.1) now gives ||y - Tx\\ = \\y - w\\ <(l + e) dist(y,TX) < h\\Sy\\ and (10.3.7.7) ||*|| <*HI<2*||y|| which means that (S,T) satisfies the condition (10.3.1.4). ■ The reader will of course have noticed that the "additional" condition (10.3.7.3) actually includes the condition (10.3.7.2). The conditions (10.3.7.1) and (10.3.7.2) say that coker(T) is majorized by S, in the sense of (10.1.1.4), and that ker(S) is almost comajorized by T in the sense of (10.1.3.5). The proof that these conditions together give (5, T) almost exact is just the transitivity argument (10.2.3.4) with (coker(T),ker(5)) in
410 10. Comparison of Operators and Exactness place of (S,T) and {S,T) in place of (Sf,Tf); the reverse implication is the other transitivity argument (10.2.4.5) with T' = ker(S) and Sf = coker(T). We have however preferred to prove Theorem 10.3.7 explicitly; we use it to show that almost exact pairs come in open sets: 10.3.8 THEOREM If X, Y and Z are normed spaces, then fleft>riehtBL(X,y,Z) is closed: {{S, T) € BL(X, y, Z): {S, T) is almost exact} (10.3.8.1) is an open subset of BL(X, Y, Z). Proof: Suppose that k > 0 and h > 0 satisfy the condition (10.3.1.4) from Definition 10.3.1: then if (S',T') € BL(X,Y,Z) satisfies k\\T' - T\\ + h\\S' - S\\ + e = 6 < 1 with e > 0 (10.3.8.2) we claim that (10.3.7.1) and (10.3.7.2) hold with (S',T') in place of (S9T) and h/(l — 6) and k/(l — 6) in place of h and k. Indeed if y £ Y is arbitrary then by (10.3.1.5) there is xx € X for which llv-TsJ^fcHSyll + cllyll and \\Xl\\ < k\\y\\ (10.3.8.3) and hence \\y-T'x1\\<\\(T'-T)x1\\+h\\Sy\\+e\\y\\ < h\\S'y\\ + 11(2- - T)xx\\ + h\\(S' - S)y\\ + e\\y\\ so that, with yx = y — Tfxx, y = yx+ Tfx1 and S'yx = S'y and (10.3.8.4) llyill^fcll^yll + tfHyll and KH < %|| Iteration gives sequences (t/n) in Y and (xfn) = [xx + x2 + • • • + xn) in X for which y = yn + T'{x'n) with S'yn = S'y (10.3.8.5) and ||yn|| <h(l + 6 + --- + S^WS'yW + 6n\\y\\ < h'\\S'y\\ + Sn\\y\\ (10.3.8.6)
10.3 Exactness 411 and \\x'n\\<hk(n + (n-l)6 + --- + 6n-1)\\S'y\\ + *(1 + 6 + • • • + 6n~l)\\y\\ < nk"\\S'y\\ + k'\\y\\ (10.3.8.7) with h' = h/(l-6),kf = k/(l-S) and k" = hk/{l-S). From (10.3.8.5) and (10.3.8.6) it follows that (S",T") satisfies the condition (10.3.7.1), in which no control is needed on the norm of xfn = x1+x2H hxn, and together with (10.3.8.7) they show that (S",T') satisfies the condition (10.3.7.2), since our lack of control of the norm of x'n is confined to a term which is multiplied by ||5"t/||. By Theorem 10.3.7 the pair (S",T') is almost exact. ■ The openness just proved, and also that of Theorem 10.3.5, is relative to the closed subset BL(X, Y, Z), and does not hold in the larger space BL(Y,Z) x BL(X,y). For example let X = Y = Z = C2 and look at 1 0' 0 0 T = 0 0 0 1 (10.3.8.8) Evidently (5, T) is an invertible chain in the sense of (10.3.1.1), but if (10.3.8.9) st = 1 t 0 0 Tt = 0 t~ 0 1. then ||5t -S\\ + \\Tt - T\\ -> 0 as t -> 0, while if 0 ^ t G C the pair (St, Tt) is not even almost linearly exact in the sense of (10.3.1.7). We conclude with a boundary result, generalizing Theorem 3.5.1: 10.3.9 THEOREM If {S,T) and {Sn,Tn) in BL(X,Y,Z) satisfy S almost open and T bounded below (10.3.9.1) and* {Sn,Tn) almost exact (10.3.9.2) and \\S - Sn\\ + \\T - Tn\\ —► 0 as n — oo (10.3.9.3) then (5,r) is almost exact (10.3.9.4)
412 10. Comparison of Operators and Exactness Proof: By (10.3.9.1) there are k > 0 and h > 0 for which if x € X and z G Z then ||x|| < *||Tx|| and z G cl {Sy : \\y\\ < h\\z\\} (10.3.9.5) and hence by Theorem 3.3.3 and Theorem 3.4.3 \\x\\ < kn\\Tnx\\ and *€cl{SBy:||y||<fcn||*ll} (10.3.9.6) with K = r^ ^77 and hn = —^— (10.3.9.7) n l-*||Tn-r|| n 1-Sn y ] assuming k\\Tn — T\\ < 1 and h\\Sn — S\\ < Sn < 1. In particular, taking z = Sny, there is for arbitrary (en) > 0 a sequence (y'n) = [y — yn) for which PJJ < «»ll»ll with ||y - y'J < hJSny\\ (10.3.9.8) By (10.3.9.2) there are (possibly unbounded) sequences (k£) > 0 and (h£) > 0 for which, for arbitary (e:*) > 0, there is a sequence (xfn) in X with \\y'n - Tnx'J < hjSny'J + ejy'j with ||<|| < *B||yB||. (10.3.9.9) Thus if y € Y is arbitrary we have ||y - Tx'J < \\y - y'J + \\y'n - Tnx'J + ||r„x'n - Tx'J < h - y'J + KWSM + <WJ + \K - n\K\\ < (1 + eB)||y - y'J + fcBeB||y|| + <||y|| + ||r„ - T||fc||rx'J| <(l + <)hJSny\\ + h^Jy\\+eJy\\ + ||TB-r||*(||r«'B-y|| + ||y||) giving (1 - *||rB - T||) ||y - Tx'J < (1 + <)hjS\\ + <rB||y|| (10.3.9.10) where e'B = fc||TB - T|| + (1 + <)hjSn - 5|| + Ken + < (10.3.9.11) The coefficient of ||y — Tx'n\\ is bounded below as n —► oo, the coefficient of 11 St/11 is bounded as n —► oo if we assume e^ —► 0, and finally we are free to arrange that h^en —► 0, which makes e:" —► 0; thus the chain (5, T) is
10.3 Exactness 413 almost linearly exact in the sense of (10.3.1.7). At the same time (1 - k\\Tn - T\\) K|| < (1 - k\\Tn - T\\) (k\\TJn - y\\ + ||*||y||) < *Z||y|| (10.3.9.12) giving (10.3.9.4). ■ Theorem 10.3.9 generalizes Theorem 3.5.1, and hence furnishes a generalization of Theorem 3.5.2: if a chain (S, T) £ BL(X, Y, Z) is in the topological boundary (among chains) of the set of almost open chains then either S € BL(Y,Z) fails to be almost open or T £ BL(X,Y) fails to be bounded below. We have been unable to settle a "dual" result: if chains (S,T) and [Sn,Tn) satisfy the conditions (10.3.9.4), (10.3.9.5) and Sn almost open and Tn bounded below (10.3.9.13) does it follow that (S,T) satisfies the condition (10.3.9.1)? If z £ Z is arbitrary then the first part of (10.3.9.13) gives (yn) in Y for which \\z — Snyn\\ —► 0, and then the condition (10.3.9.4) gives k > 0 and h > 0 and a sequence (xn) in X for which ||yn - TxJ < h\\Syn\\ + hz\\ with ||»„|| < *||y„|| (10.3.9.14) n Now \\z - Syn\\ < \\z - Snyn\\ + \\Sn - S\\\\yn - TxJ + \\(Sn - S)TxJ <\\"-SnyJ + \\Sn-S\\(h\\SyJ + ^\\z\\j + \\{Sn-S)TxJ giving (l-fc||SB-S||)||*-SyB|| / i\ (10.3.9.15) < W* ~ SnyJ + \\Sn - S\\ [h + -J \\z\\ + \\(Sn - S)TxJ If we knew that [Txn) were bounded this would show that S is dense; without this information we are unable to proceed, and the reader is invited to see if he can succeed where we have failed. For example if X, Y, and Z are Hilbert spaces then we will see below that the result follows from Theorem 3.5.1. The alert reader will possibly have noticed that specialization of Theorem 10.3.9 to the case 5 = 0 does not yield quite the full strength of Theorem 3.5.1: for if S = 0 = Sn then our assumption (10.3.9.2) says that
414 10. Comparison of Operators and Exactness each Tn is almost open, while for Theorem 3.5.1 we need only assume that Tn is dense. Inspection of the proof of Theorem 10.3.9 however shows that the constants (k£) play no part in the argument, so that we are actually working with an assumption weaker than (10.3.9.2). We cannot similarly weaken the assumptions on (Sn), as we see by specializing instead to T = 0 and recalling Theorem 4.7.4. 10.4 COMPOSITION OPERATORS AND DUALITY The various relationships of Definition 10.2.1 are transmitted to composition operators, and also some of them have alternative expressions in terms of composition operators. For example the condition (10.2.1.1) says that R C U{E) with U = iow{Rs,LT) and E = BL(Z,Y) x BL{W,X) (10.4.0.1) while the condition (10.2.1.3) says that R C c\(U(E)) with U = iow{Rs,LT) and E = BL(Z,Y) x BL(X,Y) (10.4.0.2) The reader is invited to offer a formulation in these terms of the important condition (10.2.1.2). If (10.2.1.1) holds then it also holds for induced left and right compositions: LR = LVLS + LTLV and RR = RVRT + RSRV (10.4.0.3) where we may take LR = BL(E,R) for arbitrary E and then define the other compositions consistently. Similarly (10.2.1.2) is transmitted: if (5, T) satisfy it then also \\LVnLs + LTLUn-LR\\^0 and \\RUnRT + RsRVn - RR\\ — 0 (10.4.0.4) and of course the sequences {LUn), (-^yn)» i^un) an(^ G^vn) are bounded. The condition (10.2.1.5) has an interesting expression in terms of composition operators: 10.4.1 THEOREM If T € BL(X,Y) and S £ BL(Y,Z) and R £ BL(W, Z) the following are equivalent: R is almost (left,right) majorized by (S,T) (10.4.1.1) for arbitrary U G BL(E,W) and V G BL(y,F), \\VRU\\<h\\V\\\\SU\\+k\\VT\\\\U\\ (l0-4-L2) for arbitrary w € W and g € Y^, \gRw\ < /i||^||||5t(;|| + A;||^r||||w;|| (10.4.1.3)
10.4 Composition Operators and Duality 415 Proof: If (10.4.1.1) holds and E and U £ BL(£, W) are arbitrary, then for each u £ E we have (xn) in X with supn ||xn|| < fc||C/u|| and limn ||i2!7u — Txn\\ < h\\SUu\\, so that for arbitrary F and V £ BL(Y,F) there is inequality \\VRu\\ lim\\V(RUu - Txn)\\ + sup ||7r*B|| n n < \\v\\h\\suu\\ + \\vt\\ sup kii < fe||F||||rc;«|| + *||7T||||CTu|I n Taking sup||u|i<:l gives (10.4.1.2), and hence by specializing to E = F = K, (10.4.1.3). Finally, the implication (10.4.1.3) =^ (10.4.1.1) is the separation Theorem 5.5.1: if (10.4.1.1) does not hold there is w £ W for which Rw <£ c\(K) = (K°)0 with K = Disc(0 ; h\\Sw\\) + TDisc(0 ; k\\w\\) giving g £ Y^ for which (10.4.1.3) fails. ■ The analogue of Theorem 10.4.1 holds for the condition (10.2.1.7): 10.4.2 THEOREM If T £ BL(X,y), S £ BL(W,Z), and R £ BL(W,Y), then the following are equivalent: R is almost (left,right) determined by {S,T); (10.4.2.1) for arbitrary U £ BL(£,W) and V £ BL(Y,F), SU = 0 = VT^ VRU = 0 for arbitrary w £W <md g £Y\Sw = 0 = gT ^>> gRw = 0 (10.4.2.3) Proof: If (10.4.2.1) holds, so that i?^-1^)) C cl(TX), suppose that SU = 0 = VT: then for arbitrary u £ E we have SUu = 0, and hence, RUu £ cl(TX), giving VRUu = 0. This gives (10.4.2.2) and hence (10.4.2.3). Finally, if (10.4.2.1) does not hold then there is y £ R{S~1{0)) C cl(TX), so that y = Rw with Sw = 0 and also y £ cl(TX): now by the Hahn-Banach theorem (5.4.1.3) there is g £ Y^ for which gT = 0 ^ g{y). Evidently g and y together violate (10.4.2.3). ■ If in Theorem 10.4.1 and Theorem 10.4.2 we specialize by taking 5 = 0 then we have in particular R almost comajorized by T <£=$► R* majorized by T* (10.4.2.4) and R almost range-included in T <£=> R^ determined by Tt (10.4.2.5)
416 10. Comparison of Operators and Exactness The dual of (10.4.2.5) is only partially valid, while the dual of (10.4.2.4) is even stronger: 10.4.3 THEOREM If T e BL{X,Y), S € BL{W,Z), R € BL{W,Y) the following are equivalent: R is almost (left,right) majorized by (T],S]) (10.4.3.1) R is almost (left,right) majorized by (5,T) (10.4.3.2) R* is (left,right) majorized by {T\S^) (10.4.3.3) Proof: If (10.4.3.1) holds then by Theorem 10.4.1 we have for arbitrary gGY^wG W^9 \wR^g\ < h\\wS*\\\\g\\ + *||r^||||o7|| (10.4.3.4) Specializing to w = w (= W C W^ gives (10.4.1.3), and hence again by Theorem 10.4.1 we get (10.4.1.1), which is (10.4.3.2). If (10.4.3.2) holds then for arbitrary w £ W there are y GY and (xn) in X for which \\Rw-Txn-y\\ —► 0 with \\y\\ < h\\Sw\\ and ||xj| < Jb||u;|| (10.4.3.5) This enables us to define, for each g £ Y"l", a mapping /0 : SW —► K by setting f0{Sw) = g{y) if {w,y, (xj) satisfy (10.4.3.5) (10.4.3.6) Since \g(y)\ < ||^||||y|| < fc||<7||||Siu||, ft is clear that /0 is well defined, evidently linear, and then bounded with ||/0|| < k\\g\\. By the Hahn-Banach Theorem 5.3.1 we can extend /0 to / (E X^ with ||/|| = ||/0||: now \fSw - gRw\ = limllffTzJ < ||0T||*|HI (10.4.3.7) n and taking supy^n^ gives (10.4.3.3). Finally the implication (10.4.3.3) =>• (10.4.3.1) is obvious. ■ Taking T = 0 in Theorem 10.4.3 gives Ri almost comajorized by 5' =>• S majorized by R «+ ..,,„+ (10.4.3.8) => R] comajorized by S] 10.4.4 THEOREM If T £ BL(X,Y), S € BL(W,Z), and R € BL(W,Y) then there is implication R] almost (left,right) determined by (T^S^) (10.4.4.1) => R almost (left,right) determined by (S,T)
10.4 Composition Operators and Duality 417 Proof: If the first part of (10.4.4.1) holds then by Theorem 10.4.2 there is implication for arbitrary g eY^w€ W^,wS^ = 0 = T*g => wR^g = 0 (10.4.4.2) Specializing to C7 = w gives condition (10.4.2.3), and hence again by Theorem 10.4.2 the condition (10.4.2.1), which is the second part of (10.4.4.1). ■ Taking T = 0 in Theorem 10.4.4 gives Ri almost range-included in S* =>• R determined by S (10.4.4.3) Even when R = I the implication (10.4.4.3) is not reversible, as we saw following Theorem 5.4.1. Theorem 10.4.2 and Theorem 10.4.3 help us to a result complementary to Theorem 5.6.4: 10.4.5 THEOREM If T € BL(X, Y) and S e BL{Z, W) then (iovt(—RT,Ls),co\(Ls,RT)) almost linearly exact =>• (S or T one-one and S or T dense) (10.4.5.1) and (iow(—RT,Ls),co\(Ls,RT)) almost exact =>• (S or T bounded below and S or T almost open) Proof: We are looking at the diagram (10.4.5.2) BL(X,W) [ —Rt Ls BL(y,W)l [Rt\ BL(X, BL(Y,Z): (10.4.5.3) the reader should note that there are two different operators Ls and two different operators RT. If the right-hand side of (10.4.5.1) fails we claim that there are x £ X, f eX\yeY,g eYi, zeZ,heZi,w eW and k € Wt for which [k <g) y h <g) x] lirp - [ -RT Ls} g ®w fQz t^ [fc® y h®x] = 0 g ®w fez (10.4.5.4)
418 10. Comparison of Operators and Exactness for if neither S nor T are one-one we may take Tx = 0 ^ f(x) and Sz = 0 ^ /i(^) and (y, g,w,k) = (0,0,0,0) (10.4.5.5) and if instead neither S nor T are dense we may take gT = 0^g(y) and kS = 0 ^ k(w) and (x, /, 2, h) = (0,0,0,0) (10.4.5.6) By Theorem 10.4.2 the condition (10.4.5.4) gives the failure of the left- hand side of (10.4.5.1), and hence proves the implication. The argument for (10.4.5.2) is the same, working with sequences of vectors and functionals and using Theorem 10.4.3. Details are left to the reader. ■ Decomposable exactness for composition operators, as in (10.4.0.3) but without that specific "(left,right) inverse," is not at first sight sufficient for the condition (10.2.1.1). For a flow of information in that direction we have a generalization of Theorem 3.6.5: 10.4.6 THEOREM If [R,S,T) G BL(X,Y,Z,W) and if (R,S) G BL(y, Z,W) is (left,right) invertible then there is implication (LS,LT) linearly exact =>• (S,T) (left,right) invertible (10.4.6.1) r [ljQ)ljrp] exact If instead (5, T) £ BL(X, Y, Z) is (left,right) invertible then there is implication (RS,RR) linearly exact => (R,S) (left,right) invertible => (RS,RR) exact Proof: The second implication in (10.4.6.1) is part of (10.4.0.3), with no restriction on (R, S). Conversely if RfR + SSf = i" then S(I - S'S) = {I- SSf)S = R'RS = 0 (10.4.6.3) and if (LS,LT) is linearly exact this gives I-S'S £ Lsx (0) C LT BL(y, X) so that I-S'S = TT' for some T' € BL(Y, X) (10.4.6.4) This proves (10.4.6.1), and the argument of (10.4.6.2) is exactly the same. ■
10.5 Enlargement and Completion 419 The analogue of Theorem 10.4.6 holds for almost exactness, generalizing Theorem 3.7.2: if (R, S) is almost (left,right) invertible then (LS,LT) almost linearly exact =>• (S,T) almost (left,right) invertible (10.4.6.5) =>• (LS,LT) almost exact while if instead (5, T) is almost (left,right) invertible then (RS,RR) almost linearly exact =>• (R,S) almost (left,right) invertible (10.4.6.6) =>• (RS,RR) almost exact The details are left to the reader. Both Theorem 10.4.6 and its analogue have immediate extensions to more general chains (Tn,... ,T2,T1) and (0,rn,...,7^,0). 10.5 ENLARGEMENT AND COMPLETION The enlargement process can be used to test for majorization and co- majorization: 10.5.1 THEOREM If T £ BL(X,Y), S £ BL(W,Z) and R £ BL(W,Y) then the following are equivalent: Q(R) is almost (left,right) majorized by (Q(S),Q(T)) (10.5.1.1) R is almost (left,right) majorized by (5,T) (10.5.1.2) Q(R) is (left,right) majorized by (Q(5),Q(r)) (10.5.1.3) Proof: If (10.5.1.2) holds, so that for each w £ W and each e > 0 there is x£X with H-Rw - Tx\\ < h\\Sw\\ + e||tc;|| and ||x|| < fc||tc;|| then for each w = (wn) £ l^iW) there is x = (xn) in X with, for each n£ N, \\Rwn - TxJ < h\\SwJ + illtt,^ and ||xj| < *||«,n|| n Evidently x £ /^(X), and also limsup \\Rwn — Txn\\ < /ilimsup ||^w;n|| n n with limsup ||xn|| < klimsup \\wn\\ n n
420 10. Comparison of Operators and Exactness which is (10.5.1.3). Trivially (10.5.1.3) =>> (10.5.1.1). Finally, if (10.5.1.1) holds then for each w € W and each e > 0 there is xe £ l^ (X) for which limsup \\Rw - Txen\\ < h\\Sw\\ + e||u;|| and limsup \\xen\\ < k\\w\\ n n Thus, if k' > k there is x' — x£N for which ||-Rt£7 - Tx'e\\ < h\\Sw\\ + 2e|H| and \\x'e\\ < k'\\w\\ giving (10.5.1.2). ■ When R = I and ST = 0 then we can improve: 10.5.2 THEOREM If (S, T) € BL(X, Y, Z) then the following are equivalent: ((Q(S),Q(T)) is almost linearly exact (10.5.2.1) (S,T) is almost exact (10.5.2.2) ((Q(S),Q(r)) is exact (10.5.2.3) Proof: If (10.5.2.3) holds then so by Theorem 10.5.1 does (10.5.2.3), which trivially implies (10.5.2.1). To see that (10.5.2.1) implies (10.5.2.2) we use the subspace-quotient characterization Theorem 10.3.2. If (10.5.2.2) does not hold then one of the two conditions (I0.3.2.y) must fail. If (10.2.3.1) fails then there is (yn) in Y for which ||yj| = l and dist(yn,TX) > \ and ||SyJ| — 0 (10.5.2.4) and then by the Hahn-Banach theorem (gn) in Y"l" for which ll*»ll = lff»(y»)l = l ^d ||flBr||=0 (10.5.2.5) Evidently y € l^Y) and Sy € c0(Y) and (x € l^X) => dist(y - Tx,c0(Y)) > |) (10.5.2.6) If instead (10.2.3.2) fails, then using the Hahn-Banach separation theorem, there is (gn) in Y for which IbJIs-Ho) = 1 and \\snT\\^0 (10.5.2.7) and then (yn) in Y for which ||y„|| = l and Syn = 0 and \gn(yn)\ > \ (10.5.2.8)
10.5 Enlargement and Completion 421 Now for arbitrary x € Ioq{X) we argue l|y» - TxJ = ||ffB|lllv» - TxJ > \gn(yn - Txn)\ >lfl„(y„)l-llff»r||KH>|- so that limsup ||yn - TxJ > limsup (§ - ||*|U|ffBT||) n n Thus again we get (10.5.2.6), which violates (10.5.2.1). ■ We cannot extend Theorem 10.5.2 to more general R, as in Theorem 10.5.1. If for example X = Y = Z = l2 and T = S2 where S = W : (x J -> (xjn) is the operator of (7.3.6.15), then (10.1.1.5) holds with (Q(S), Q(T)) in place of {S,T), but (S,T) does not satisfy (10.1.1.4): ||5x||2 < ||x||||rx|| for each xGX (10.5.2.9) 10.5.3 THEOREM If T € BL(X, Y) and S € BL(X, Z), and if either S is bounded below or T is relatively open, then Q(S) determined by Q(T) <=> S majorized by T (10.5.3.1) Proof: Backward implication follows from Theorem 10.5.1 with (5, T,0) in place of (R, S,T). Conversely, if S is not majorized by T, then there is a sequence (xn) in X for which \\Sxn\\ = 1 > n||rxj| for each n £ N (10.5.3.2) Provided we know that (xn) is in l^X) this will show that Q(S) is not determined by Q(T). If S is bounded below this is certainly the case: ||xn|| < k\\Sxn\\ = k for each n £ N (10.5.3.3) If instead T is relatively open then we can arrange to replace (xn) by (xn + un) in /^(X), but we have to argue by contradiction. Indeed, if T is relatively open then there is k > 0 for which dist(x,r_1(0)) > ik||rx|| for each xGX (10.5.3.4) and hence, if (10.5.3.2) holds there is a sequence (un) in X for which Tun = 0 and ||xn + un|| —► 0 (10.5.3.5) Evidently the sequence x + u = (xn + un) is in l^X). If we now assume that Q(S) is determined by Q(T), so that also S is determined by T, then IMUIff»r|| _ 1. — 2
422 10. Comparison of Operators and Exactness we also have 5(tt„)=0 and ||5(xB + ttB)|| = ||5(*B)|| = l (10.5.3.6) which means that (10.5.3.2) holds with x + u in place of x. Since x + u £ /^(X), this contradicts the assumption that Q(S) is determined by Q(r). ■ Argument similar to Theorem 10.5.1 shows that Q(S) almost range-included in Q(T) =>• S almost range-included in T (10.5.3.7) If the spaces X, Y, and Z are complete, then exactness and almost exactness for (S,T) £ BL(X,Y,Z) coincide: 10.5.4 THEOREM If X, Y, and Z are Banach spaces and (S,T) £ BL(X, Y, Z), then the following are equivalent: (S,T) is almost exact (10.5.4.1) 5_1(0) = TX and S(Y) = c\(SY) (10.5.4.2) (S,T) is exact (10.5.4.3) Proof: If (5,r) £ BL(X,Y,Z) is almost exact then by Theorem 10.3.7 the conditions (10.3.7.1) and (10.3.7.2) both hold. By Theorem 4.4.3 and Theorem 4.4.4, applied to the operators S and T : X —► 5_1(0), the condition (10.5.4.2) holds. By the open mapping Theorem 4.6.2 the condition (10.5.4.2) gives (10.3.7.1) and (10.3.7.3), which by Theorem 10.3.7 again gives (10.5.4.3). Trivially (10.5.4.3) implies (10.5.4.1). ■ Baire's theorem gives us an interesting generalization of the open mapping theorem: 10.5.5 THEOREM If T £ BL(X, Y) and S £ BL(Z, Y) and if Z is complete, then S range-included in T => S almost comajorized by T (10.5.5.1) and S range-included in T =)> 5t majorized by Tf (10.5.5.2)
10.5 Enlargement and Completion 423 Proof: If S is range-included in T, then oo oo Z = S~1 [j {Tx: \\x\\ <n}= [j 5_1rDisc(0 ; n) (10.5.5.3) n=l n=l and if Z is complete then by Baire's Theorem 4.6.1 there is n £ N with 0 ^ int cl S~XT Disc(0 ; n) = int S"1 cl T Disc(0 ; n) (10.5.5.4) and hence 0 ^ int cl T Disc(0 ; 1) = (l/n) int S'1 c\T Disc(0 ; n) (10.5.5.5) As usual (1.4.1.1), this means that 0 £ int 5_1 cl{Tx: ||x|| < l}, giving the condition (10.1.3.4). This proves (10.5.5.1), and hence also (10.5.5.2), by the duality result (10.4.2.4). ■ It would be nice to extend Theorem 10.5.5 to the situation of Definition 10.2.1. The best we can do is to make the observation that if T G BL(X, Y), S e BL(W,Z), and R e BL(W,Y), and if the space W is complete, then (10.5.5.1) R{S-1{0)) CT(I) =>5-1(0) <Z{weW:Rwec\{Tx:\\x\\ <k\\w\\}} (10.5.5.6) The implication (10.5.5.1) is not in general reversible. If X = c0, Y = Z1? Z = K, and if S and T are defined by setting (Tx)n = n-*xn and (10.5.5.7) (St)n = tn~2 for each n € N, x € c0, t € K then the right-hand side of (10.5.5.2) holds but the left-hand side does not. If we combine Theorem 10.5.5 with Theorem 10.2.6 we find that if T e BL(X, Y) and S £ BL(Z, Y) for Banach spaces X, Y, and Z then S range-included in T compact => S compact (10.5.5.8) It would have been nice to report implication (10.3.1.2) =>• (10.3.1.1) for complete spaces; the best we have been able to do is the following: 10.5.6 THEOREM If X, Y, and Z are complete and T £ BL(X,Y), S £BL(Y,Z) satisfy \\VnS + TUn - I|| — 0 with sup ||tfj| + sup ||Vn|| < oo (10.5.6.1)
424 10. Comparison of Operators and Exactness then each of the following is sufficient for J to be a (left,right) multiple of (5,r): VnSTUn = TUnVnS for each n £ N (10.5.6.2) \\UnT -1\\ —> 0 and VnST = 0 (10.5.6.3) \\SVn — JT|| —^ 0 and STUn = 0 (10.5.6.4) If the spaces are complete, then BL_1(X, Y, Z) is an open subset of BL(X,Y,Z). Proof: Assuming only that the space Y is complete, (10.5.6.1) says that for sufficiently large n € N the operator VnS+TUn is invertible. Now (10.3.4.2) from Theorem 10.3.4 says that (10.5.6.2) makes i" a (left,right) multiple of (S,T). If also the space X is complete, so that eventually UnT — I is also invertible, then the sufficiency of (10.5.6.3) is given by (10.3.4.3). If instead the space Z is also complete, then the sufficiency of (10.5.6.4) is given by (10.3.3.4). Finally, if the space Y is complete and (S,T) G BL(X,Y,Z) is invertible, with VS + TU = J, then by taking (S", T') = (S + H, T + K) so close to (S, T) that i" + VH and i" + KU are both invertible we find using (10.3.4.1) that (S",T') is also invertible. ■ It would as we have suggested have been more satisfying to prove, in complete spaces, that almost invertible chains are invertible, and hence to derive the last part of Theorem 10.5.6 from Theorem 10.3.5. An application of (10.5.6.2) shows that if the space Y is complete then it is sufficient for the chain (S,T) to be invertible that there are [Un) in BL(Y, X) and [Vn) mBL(Z,Y) for which \\VnS + TUn-I\\^>0 and (10.5.6.5) sup||iy +sup||Vn|| < oo and UnVn = 0 n n In words we require that (S,T) have an "almost inverse" consisting of chains (Un, Vn). If we are unable to modify the argument for (10.3.2.1) to show that (10.5.6.5) holds for every almost invertible chain we might be tempted to replace the condition (10.3.1.2) by the condition (10.5.6.5) as the definition of almost invertibility: it would then be important to show that (10.5.6.5) defines an open subset of BL(X, Y, Z). The reader is invited to see if he can show this by modifying the proof of Theorem 10.3.4: in the notation of that proof he should try to arrange that UnK'nH'nVn = 0 for each n £ N (10.5.6.6)
10.6 Essential Exactness 425 The pair of operators (S,T) of (10.3.8.8) shows that BL"1 (X,Y,Z) need not be an open subset of the product space BL(Y,X) x BL(X, Y) even for complete spaces. 10.6 ESSENTIAL EXACTNESS The first three conditions of Definition 10.2.1 have obvious "essential" analogues in the spirit of Definition 3.9.1, and in particular relative to the finite rank operators as in Atkinson's Theorem 6.4.3. An "essential" analogue of the last condition of Definition 10.2.1 also generalizes the "zero divisor" condition of (3.9.4.7) and (3.9.4.8), and in particular the essential one-oneness and denseness conditions of Definition 6.4.1: 10.6.1 DEFINITION We will call the pair (S,T) £ BL{X,Y,Z) essentially (left,right) invertible, or spatially (left,right) Fredholm, if there are T' £ BL(y,X) and S' £ BL(Z,Y) for which S'S + TT' - I £ KL0(y, Y) (10.6.1.1) and essentially (left,right) almost invertible if there are (T£) and (Sfn) for which \\S'nS + TT'n -I\\ —> 0 and sup \\T^\\ + sup \\S'n\\ < oo (10.6.1.2) n n We shall call the pair (S,T) £ BL(X, Y, Z) essentially (left,right) one-one, or weakly (left,right) Fredholm, if 5"1(0)/cl(rX) is finite dimensional (10.6.1.3) Once again, regularity is the bridge between the two extremes: 10.6.2 THEOREM If T £ BL(X, Y) and S £ BL(Y, Z) then ST £ KL0(X,Z) <=► (S_1(0) +cl(rX))/5'-1(0) finite dimensional (10.6.2.1) and each of the following conditions are equivalent: UT = 0 = SV ^UV £ KL0(X', Z') (10.6.2.2) (S-^O) + cl(rX))/cl(rX) finite dimensional (10.6.2.3) UT £ KL0(X,Z') and SV £ KL0(X',Z) =>► UV £ KL0(X',Zf) (10.6.2.4) Necessary and sufficient for a chain (S,T) £ BL(X, Y, Z) to be (left,right)
426 10. Comparison of Operators and Exactness Fredholm is (S,T) is regular and weakly (left,right) Fredholm (10.6.2.5) Proof: For (10.6.2.1) observe that the mapping SA : Y/S'1^) -► S(Y) induces isomorphism (S-^O) +cl(rX))/5-1(0) S E with ST{X) CEC c\ST{X) (10.6.2.6) Towards the next part, note that both sides of (10.6.2.1) are equivalent to cl(rX) C S_1(0) + W for some finite dimensional W C Y (10.6.2.7) If (10.6.2.3) holds and if U £ BL(Y,Zf) and V £ BL(X',Y) are such that SV and UT are of finite rank, then there are finite-dimensional subspaces E, F, and G of Y for which cl(VX') C S_1(0) + E C cl(rX) + G + £ C J7"1 (0) + F + G + E which means by (10.6.2.7) that UV is of finite rank. Conversely, if (10.6.2.3) fails then (10.6.2.2) is violated by U = coker(T) and V = ker(S). This proves the three conditions equivalent. Towards (10.6.2.5), it is clear that a (left,right) Fredholm pair (S,T) is weakly Fredholm, whether or not it is a chain, and if also ST = 0 then the operators T - TT'T and S - SS'S are both of finite rank, hence regular by Theorem 6.3.4, which makes T and S regular by (3.8.3.1). Conversely, if (S,T) is regular and weakly (left,right) Fredholm, then we can apply (10.6.2.2) with U = 1- TT' and V = I - S'S to see that (S,T) satisfies (10.6.1.1): T = TT'T,S = SS'S => (I-TT')(I-S'S) £ KL0(Y,Y) m (10.6.2.8) The definition of Fredholmness extends to triples (R, S, T) and longer chains in the same way as the definition of invertibility, following Definition 10.3.1. The reader can show the analogue of Theorem 10.3.2, that if (R,S,T) is a Fredholm chain then there is another chain (T',S',Rf) providing essential inverses for (S,T) and (R, S). At the same time it is clear that we can always take the "essential inverse" of a Fredholm chain to be another chain. If, in addition, the essential inverse is itself "invertible" then we shall speak of a "Weyl" chain: 10.6.3 DEFINITION The chain (5, T) £ BL(X, Y, Z) will be called spatially (ltft,right) Weyl if there is a chain (T',5") £ BL(Z,Y,X) for which S'S + TT' - I £ KL0(y,y) and / £ SfBL(Y,Z) + BL(X,y)r' (10.6.3.1)
10.6 Essential Exactness 427 If (S, T) is spatially Fredholm we shall write Euler(5,r) = dim5_1(0)/cl(rX) (10.6.3.2) If more generally (Tn,..., ^jTq) is spatially Fredholm we shall write n Eu\ei(Tn,...,T1,T0) = X>l)P_1 Buler^,^) (10.6.3.3) P=i We leave the reader to see whether he can deduce (10.6.3.1) from the corresponding assumption using a pair (T", Sf) without the assumption that T'S' = 0. To extend the definition of "Weyl" to longer chains we make the obvious extension of (10.6.3.1), making explicitly the assumption that the "invertible essential inverse" is another chain. The most important examples of (10.6.3.3) are chains of the form (0, Tm,..., T1? 0), but we shall also encounter chains of the form (S,T,S) with T £ BL(X, Y) and S £ BL(y,X). When the spaces are complete, Weyl and Fredholm chains come in open sets: 10.6.4 THEOREM If chains (5, T) and (S + H, T + K) in BL(X, Y, Z) and (r',5") £ BL(Z,y,X) satisfy S'S + TT'-I <EKL0(Y,Y) and {/ + SfH,I + KT'} C BL'1 {Y,Y) (10.6.4.1) then (S + H,T + K) is (left,right) Fredholm. If in addition there are T" £ BL(X, Y) and S" £ BL(Y, Z) for which, with K' = (/ + KT)-1 and H'=(I + S'H)-\ S's" + T"T' = I and (10.6.4.2) {/ = t"t'(k' -/),/ + {Hf - i)s's"} c bl"1 (y,y) then (S + H, T + K) is (left,right) Weyl. Proof: If we just assume that there are Hf and K' in BL (Y,Y) for which {(/ + KT')K' - /, H'(I + S'H) - 1} C BL"1 (y, y) (10.6.4.3) then the proof of Theorem 10.3.4, and in particular the identity (10.3.4.5) with (T',5") in place of {U,V), shows that {H'S')(S + H) + (T + K)(T'Kf) -l£ KL0(y,y) (10.6.4.4)
428 10. Comparison of Operators and Exactness This shows that (S + H,T + K) is Fredholm. To see that it is Weyl we ought to replace the pair {T'K',H'S') by a chain, and show that that chain is invertible. Also, the only way we can hope to show such a chain to be invertible is to keep it close to the original chain {T',S'), and use Theorem 10.3.4 again. In fact the assumption (10.6.4.1) already tells us that {T'K',H'S') is a chain: for if J + S'H and J + KT' have two-sided inverses H' and K' then (3.1.3.1) says that J + HS' and J + T'K are also invertible, with H" = {I + HS')-1 =1- HH'S' and (10.6.4.5) K" = (I + T'K)-1 =1- T'K'K It now follows that also T'K' = K"T' and H'S' = S'H" (10.6.4.6) For example, T' = T'K'{I + KT') => T'{I - K') = T'K'KT' =^T'K' = [I-T'K'K)T' This at once makes [T'K', H'S') a chain: {T'K'){H'S') = {K"T'){S'H") = K"{T'S')H" = 0 (10.6.4.7) This together with (10.6.4.2) shows that the condition (10.3.4.1) is satisfied with (r',5") in place of (S,T) and {T'K',H'S') in place of {S + H,T + K), and now Theorem 10.3.4 says that {T'K',H'S') is invertible, so that (10.6.4.4) says that {S + H, T + K) is Weyl. ■ The openness follows when the space Y is complete: the condition (10.6.4.1) can be achieved by taking ||r'||||ii:|| < 1 and ||S"||||#|| < 1, and then the condition (10.6.4.2) by taking K and H if necessary even smaller. The reader should observe that ||i^' — i"|| —► 0 as \\K\\ —► 0, and that ||jy; - J|| -> 0 as ||#|| -> 0: he is invited to find bounds for ||ii:|| and ||#|| which will ensure \\I + ifr'HII-fr'H < 1 and \\I + S"#||||#'|| < 1, respectively. Theorem 10.3.4 and Theorem 10.6.4 both extend to triples {R,S,T) and longer chains. If for example S'S + TT' = I = R'R + SS' (10.6.4.8)
10.6 Essential Exactness 429 and S'STT' = S'(S + H){T + K)T' and (10.6.4.9) R'RSS' = R(R + L)(S + H)S' and H'(I+S'H) =1= (I+KT')Kf and L'(I+R'L) = 1= {I+HSf)H" (10.6.4.10) then Theorem 10.3.4 tells us that (H'S')(S + H) + (T + K)(T'K') = 1= (L'R')(R + L) + (S + H)(S'Hf) (10.6.4.11) and since (10.6.4.9) and again (3.1.3.1) tell us that both I+S'H and I+HSf have two-sided inverses we have, as in (10.6.4.6), S'H" = H'Sf (10.6.4.12) As for single operators, the index of a Fredholm chain (0, S, T, 0) vanishes when the chain is Weyl: 10.6.5 THEOREM If (5, T) e BL(X, Y, Z) is a regular chain, then necessary and sufficient for (0, S, T, 0) to be Fredholm is that r_1(0) and 5"1(0)/cl(rX) and Z/c\(SY) are finite dimensional (10.6.5.1) Necessary and sufficient for (0, S, T, 0) to be Weyl is that in addition Euler(0, S, T, 0) = 0 (10.6.5.2) Proof: The first part is clear from (10.6.2.5). If (0,5, T,0) is Fredholm then there is an essential inverse (0, T", Sf, 0) for (0, S, T, 0) for which T'S' = 0 and T = TT'T and S = SS'S (10.6.5.3) Now write T'T = Pr, TT' = Qr, S'S = Ps and SS' = Qs (10.6.5.4) As in (3.8.2.2) these are idempotents, and since PSQT = 0 we recall from Theorem 2.5.4 that (J — QT)(I — Ps) is another idempotent, with VP) = QAY) + ((J - QtW ~ PS))(Y) (10.6.5.5)
430 10. Comparison of Operators and Exactness Thus, we have isomorphisms Pf1(0)=T-1(0),((I-QT)(I-Ps))(Y) = S-\0)/c\(TX), Q^{0) = Zlc\{SY) Necessary and sufficient for Euler(5, T) = 0 is therefore ((/- QT){I- PS){Y) - Pf!(()) x Qs\0) = (I-PT)(X) x (I-Qs)(Z) (10.6.5.7) which is equivalent, by the definition of the cartesian product, to the existence ofUG BL(X,y), V £ BL(y,Z), U' £ BL(y,X), and V G BL(Z,Y) for which VU = 0 = U'V and (10.6.5.8) U,U = I-PT,V'V + UU' = (I-QT)(I-PS),VV' = I-QS If (10.6.5.8) holds, then we obtain an essential inverse (0, T',5',0) with an inverse (0,S"',r",0) by taking Tf = U\ Sf = V\ T" = T + U and S" = S + V (10.6.5.9) Conversely, if we already have Sf, T', S" and T" then we can solve (10.6.5.9) to get U, V, U' and V" satisfying (10.6.5.8). ■ The alert reader may noticed that the operators T' and S' of (10.6.5.9) need not be the same as the operators Tf,Sf of (10.6.5.3): the same projections PT and QT can be induced by more than one generalized inverse r'. Definition 10.6.1 does not include the common generalization of upper and lower Fredholmness, and exactness, or of almost upper and lower Fredholmness and almost exactness: we settle for an uneasy compromise. 10.6.6 DEFINITION We shall call a chain (5,T) £ BL(X,Y,Z) nearly essentially exact if 5"1(0)/cl(rX) is finite dimensional (10.6.6.1) and S is relatively open (10.6.6.2) and S has closed range (10.6.6.3)
10.6 Essential Exactness 431 and T is relatively almost open (10.6.6.4) Near essential exactness can be tested with the essential enlargement functors of Definition 6.7.4: 10.6.7 THEOREM If (5, T) £ BL(X, Y, Z) then (P^S^P^r) linearly exact (10.6.7.1) implies (5, T) nearly essentially exact (10.6.7.2) which implies (P(5),P(r)) exact (10.6.7.3) If X, y, and Z are complete then all three conditions are equivalent. Proof: The first implication is a synthesis of the arguments of Theorems 6.9.2 and 6.9.3, while the second follows the arguments of Theorem 6.10.3. If (10.6.6.1) fails, so that the quotient 5_1(0)/ cl(TX) is infinite dimensional, then by the Riesz lemma (Theorem 1.5.2) there is y = (yn) in Y for which Syn = 0 and ||yn|| = 1 and (10.6.7.4) disth/n+1,cl(rX) + ^Ct/y >I which, means y € JTO(y) and Sy € e0{Z) C m^Z) and (10.6.7.5) y^Tl00(X)+m1(Y) so that S + m1(7)€PI(S)-1(0)\P1(r)P1(X)CP1(7) (10.6.7.6) violating (10.6.7.1). If instead (10.6.6.2) fails then there is y = (yn) in Y
432 10. Comparison of Operators and Exactness for which ||yn|| = l and distfo^ST^O)) > \ and ||SyJ| — 0 (10.6.7.7) We claim that this also gives us (10.6.7.5). The first two parts are clear: If the last part fails then there is x € l^X) for which y — Tx has a convergent subsequence, so that there is (xf, y') < (x, y) for which y'n — Tx'n —► y'^ € Y: but now Sy'^ = 0 and distfy^Sr *(())) > \ (10.6.7.8) This contradiction verifies the last part of (10.6.7.5). To verify (10.6.6.3) we assume both (10.6.7.1) and (10.6.6.2): If z € c\(S(Y)) then by (10.6.6.2) there is yn in Y for which \\z - Syn\\ —> 0 with ||yj| < h\\z\\ (10.6.7.9) which means that y (E l^Y) and Sy (E c1(Z) C m1(Z). But now by (10.6.7.1) there is x € l^X) for which y—Tx € m1 (Y), giving subsequences (x',y') < (x,y) for which y'n-Tx'n ^y^GY; since \\z-S{y'n-Tx'n)\\ -> 0 it follows that z = Sy'^ G S(Y). If (10.6.6.4) fails then there is g = (gn) in y t for which \\9n\\ = Klls-i(0) = 1 and (10.6.7.10) dist^trt)"1^))^! and llfcTH—.0 We claim g <£ m^Y*) = m{Y^) (10.6.7.11) Equality at the end follows from the completeness of Y^: if in violation of the first part g^ £ Y^ were the limit of a convergent subsequence g' < g then we would have g'JT = 0 and dist(^, (rt)-1(0)) > \. Now if 0 < 6 < p(^) = inf {k > 0 : {gn : n £ N} (10.6.7.12) C Disc(# ; k) with finite H CY^} then there is gf ■< g for which, for each n (E N, S'n+i * U Disc(^ ->6) (10.6.7.13)
10.6 Essential Exactness 433 and then {yfn}m)n^m in Y for which n^m implies ||y;>m|| = l and \g'n(y'n,m) ~ ?m«m)l > 5 (10.6.7.14) We claim sup ||x'Bim|| < oo =► p {y'n>m - Tx'n>m : n ? m} > \S (10.6.7.15) Indeed if e > 0 and {yfn m — Tx'n m : n ^ m} C Disc(iJT ; e) with finite KCY then ^(<J-^(<J£ U Disc((^-^)(t/");-) + (^-^)(Tx'njm)CC (10.6.7.16) and if n>m>Nel^g'n(y'n)rn)-g'n(y'n)Je [j Disc(K;e + e') (10.6.7.17) y"€K then it follows that ^£ < e + e:' and hence, letting e1 —► 0, that j8 < e. If we now arrange the pairs {(n,m) : n ^ m} in a sequence and write y = (yn) = (y'n,m)n*m (10.6.7.18) Then (10.6.7.15) can be rewritten x € l^X) => p(y - Tx) > \6 (10.6.7.19) which means that (10.6.7.6) holds again. This finishes the proof of the first implication (10.6.7.1) =^ (10.6.7.2): toward the second observe that if (10.6.7.2) holds then there are k > 0, h > 0 and Q0 = Q20 £ BL(S,-1(0),S'-1(0)) for which (?o(5'"1(0)) = cl(TX) C S_1(0) C Y (10.6.7.20) and y £ cl(rX) => y £ cl {Tx : \\x\\ < k\\y\\} (10.6.7.21) and yG7=^ dist^S-1 (0)) < h\\Sy\\ (10.6.7.22) If now y £ loo{Y) then by the anti-Riesz lemma (Theorem 1.5.1) there is
434 10. Comparison of Operators and Exactness w = (wn) in Y for which and (10.6.7.23) \\yn - wj < distfo^ST^O)) + (l/»)||yj| and ||u;n|| < 2||yn|| and then by (10.6.7.20) and (10.6.7.21) there is x = (xn) in X for which \\Q0wn - TxJ < (±) \\wn\\ and ||xj| < k\\Q0wn\\ (10.6.7.24) Evidently w € l^Y) and x € /^(X); also Q0w -TxG c0(Y) C m^Y) C m{Y) (10.6.7.25) and, since by (10.6.6.1) Qq1^) is finite dimensional, w-Q0w€ l^Qo l{0)) = m^Qo'(0)) C m^Y) C m{Y) (10.6.7.26) thus P1(y-rx)=P1(y-u;) and P(y - Tx) = P(y - w) (10.6.7.27) To see that (10.6.7.3) holds we have to verify that the chain (P(5),P(T)) satisfies (10.3.1.4); but we have from (10.6.7.23) and (10.6.7.24) limsup||yn-Txn|| = limsup||yn-u;n|| <felimsup||5yn||+0 (10.6.7.28) n n n and limsup ||xj| < 2lfe||Q0|| limsup ||yn|| ■ (10.6.7.29) n n Theorems 6.9.2 and 6.9.3 suggest that we ought to be able to reverse the implication (10.6.7.1) =>• (10.6.7.2) for incomplete spaces X, Y, and Z. To do this we would have to be able to construct the sequence w of (10.6.7.23) in such a way that y-wGm^Y) (10.6.7.30) The reader can certainly do this if 5_1(0) is complemented in Y, taking w = Py with P = P2 G BL(Y, Y) for which P"1^) = S"1^). When X, Y, and Z are complete then "near essential exactness" looks much simpler, and also coincides with obvious notions of "essential exactness" and "almost essential exactness":
10.7 Algebraic Exactness 435 10.6.8 THEOREM If X, Y, and Z are Banach spaces then necessary and sufficient for (S,T) £ BL(X,Y,Z) to be nearly essentially exact is that dim5'-1(0)/r(X) < oo and S is proper (10.6.8.1) Proof: If (S,T) is nearly essentially exact then S is relatively open by (10.6.6.2) and therefore has closed range by Theorem 4.4.3, and hence is proper by Theorem 4.6.4, while T is relatively almost open by (10.6.6.4), therefore relatively open by Theorem 4.4.4, therefore has closed range by Theorem 4.4.3, which together with (10.6.6.1) gives the first part of (10.6.8.1). Conversely if (10.6.8.1) holds then (10.6.6.1) is immediate, and since S is proper so is (10.6.6.2). The first part of (10.6.8.1) together with Theorem 4.8.2 (and Theorem 6.2.2) says that T(X) is closed, therefore proper by Theorem 4.6.4, and hence relatively almost open as needed for (10.6.6.4) ■ 10.7 ALGEBRAIC EXACTNESS The first three conditions of Definition 10.2.1 have obvious "algebraic" analogues in the spirit of Definition 3.10.1; we will confine ourselves to looking at the analogue of Definition 10.3.1 in a normed algebra: 10.7.1 DEFINITION A pair of elements (6, a) £ A2 in a normed algebra A will be called (left,right) invertible if 1 £ Ab + aA (10.7.1.1) and almost (left,right) invertible if there are (afn) and (bfn) in A for which ||&'n6 + aa'n-l|| —>0 and sup ||a'J| +sup ||6'n|| < oo (10.7.1.2) n n We shall call the pair (6, a) £ A2 (left,right) bounded below if there are k > 0 and h > 0 for which, for arbitrary u,v £ A, \\uv\\ < &||u||||6v|| + fc||ua||||v|| (10.7.1.3) and (left,right) one-one if there is implication, for arbitrary u,v £ A, bv = 0 = ua => uv = 0 (10.7.1.4) Once again we shall call a pair of elements (6, a) for which ba = 0 (10.7.1.5) a chain.
436 10. Comparison of Operators and Exactness 10.7.2 THEOREM If (6, a) £ A2, there is implication (10.7.1.1) => (10.7.1.2) => (10.7.1.3) => (10.7.1.4) (10.7.2.1) Necessary and sufficient for (6, a) (E A2 to be (left,right) invertible is that (6, a) is regular and (left,right) one-one (10.7.2.2) Proof: The first and third implications of (10.7.2.1) are both obvious. To establish the second suppose that (10.7.1.2) holds, and see that for arbitrary u,v€Avte have |M| < ||«(6'n6 + aa'n - 1)„|| + ||«||K|||NI + IMIKIIIMI for each n (E N Allowing n —► oo gives (10.7.1.3) with k = supn \\a'n\\ and h = supn \\b'n\\. To prove (10.7.2.2) simply repeat the argument for (10.3.3.2) from Theorem 10.3.3. ■ It is left to the reader to see how the rest of Theorem 10.3.3 holds for pairs of normed algebra elements, and how to extend the first two conditions of Definition 10.7.1 to triples (c,6, a) and longer chains. For the record, we shall write out the proof that, if the algebra is complete, invertible chains come in open sets: 10.7.3 THEOREM If A is a normed algebra, then each of the following conditions is sufficient for the pair (6 + h, a + k) (= A2 to be (left,right) invertible: b'b + aa' = 1 and b'baa! = b'{b + h)(a + k)af and (10.7.3.1) ti{l + b'h) = 1 = (l + ka!)k' b'(b + h) + (a + k)a' GA'1 and b'(b + h)(a+k)a! = {a + k)a'b'{b + h) (10.7.3.2) {&'(6 + h) + (a + k)a',a'[a + k)} C A'1 and b'{b + h)(a + k)=0 (10.7.3.3) {b'(b + h) + (a + k)a', (6 + h)b'} C A~l and (6 + h) (a + fc)a' = 0 (10.7.3.4)
10.7 Algebraic Exactness 437 Proof: Repeat the argument for Theorem 10.3.4: for example the second part of the condition (10.7.3.1) gives the identity b'(b + h)(l + kaf) + (1 + b'h)(a + k)a' - (1 + b'h)(l + ka') = b'b + aa'-l ■ (10.7.3.5) Theorem 10.7.3 tells us that almost invertible chains come in open sets: 10.7.4 THEOREM If A is a normed algebra, then the set {(6,a) £ A2:ba = 0 and (6,a) is almost invertible} (10.7.4.1) is an open subset of the closed set of chains in A. If the algebra A is complete then also the set {(6, a) e A2: ba = 0 and 1 e Ab + aA] (10.7.4.2) is open in the set of chains. Proof: For the first part repeat the argument for Theorem 10.3.5, which uses the identity (10.7.3.5); for the second part repeat the argument for the last part of Theorem 10.5.6. ■ Even in the algebra of 2 x 2 matrices the openness of Theorem 10.7.4 is only relative to the closed set of chains: we recall again the pair of operators of (10.3.8.8). In a normed algebra A we would expect to find that the chains (6, a) which satisfy the boundedness below condition (10.7.1.3) form another open set: we encourage the reader to see if he can succeed where we have failed. If, in particular, A = BL(X, X) then this is the case, because then the bounded-below chains coincide with the pairs of operators which are almost exact in the sense of (10.3.1.5): 10.7.5 THEOREM If A = BL(X,X) for a normed space X and (6, a) € A2 is a chain, then (6, a) (left,right) bounded below -<=>• (6, a) almost exact (10.7.5.1) and (6, a) (left,right) one-one -<==> (6, a) almost linearly exact (10.7.5.2)
438 10. Comparison of Operators and Exactness Proof: If (6, a) is almost exact, then by Theorem 10.4.1 the condition (10.4.1.2) holds, in particular with the spaces E and F both equal to the space X = Y = Z, which says that (6, a) is (left,right) bounded below. Conversely, if this is so then we may further restrict the condition (10.4.1.2) to apply only to operators U and V of rank one: but this gives the linear functional condition (10.4.1.3), and hence by Theorem 10.4.1 the almost exactness condition (10.4.1.1). This proves both implications of (10.7.5.1), and the argument for (10.7.5.2) is the same, using Theorem 10.4.2 instead of Theorem 10.4.1. ■ If we apply Definition 10.7.1 in the seminormed algebra BL(X, X)/ KL0(X, X), then the first and last conditions applied to a pair (6, a) = (S + KL0(X,X),T + KL0(X,X)) reproduce the Fredholm and weak Fredholm conditions of Definition 10.6.1 for the operators (S,T). This is obvious for the first condition. To see it for the last condition we use Theorem 10.6.2 and argue as in Theorem 10.7.5. In the other direction, the exactness condition latent in Theorem 10.6.5 can be reproduced in the form (10.7.1.1): 10.7.6 THEOREM If (S, T) e BL(X, Y, Z), then the following are equivalent: (0,5, T,0) is (left,right) invertible "0 0 0" T 0 0 .0 S 0. ? "0 0 0" T 0 0 .0 S 0. is (left,right) invertible (10.7.6.1) (10.7.6.2) Proof: If (0,5, T,0) is (left,right) invertible, then, using Theorem 10.3.2, there is (r',S") £ BL(Z,Y,X) for which T'T = I and S'S + TT' = I and SS' = I (10.7.6.3) This gives OS' 0 0 0 T' 0 0 0 "0 0 0" T 0 0 .0 S 0. + "0 T .0 o o- 0 0 s o. 0 S' 0 0 0 0 o ■ r 0 . = -/ o o- 0/0 .0 0 /. (10.7.6. 4)
10.7 Algebraic Exactness 439 and hence 'I 0 0" 0/0 .0 0 I. / GBL V 'X- Y _Z. ? ■x- Y _Z_ \ 1 ; 0 0 0 TOO 0 S 0 + 0 0 0" TOO 0 S 0. ( BL V 'X- Y _Z_ ? -xi Y _Z_ (10.7.6.5) Conversely, if (10.7.6.5) holds, then there is U = (U{j) and V = (V{j) for which 0 0 0 TOO L0 S 0J + "0 0 0" T 0 0 l0 S 0. U = -I 0 0" 0/0 .0 0 /. and hence W "0 T .0 o o- 0 0 s o. + 0 T 0 0 0" 0 0 s o. w = "I 0 0" 0/0 .0 0 /. with W = U 0 0 0 TOO 0 S OJ But now (10.7.6.3) holds with and T' = W23 and S' = W12 (10.7.6.6) Similar representations exist for the exactness of chains of the form (0,Tn,...,T2,T1,0) withr^BLp^,*,.) (10.7.6.7) (0,an,...,a2,a1,0) with aj £ A ■ (10.7.6.8) Theorem 10.4.6 has an analogue for normed algebra elements: 10.7.7 THEOREM If (c,6,a) € A3 is a chain and if (c,6) e A2 is (left,right) invertible then there is implication (Lb,La) linearly exact => (b,a) (left,right) invertible =>• (Lb,La) exact. (10.7.7.1)
440 10. Comparison of Operators and Exactness If instead (6, a) £ A2 is (left,right) invertible then there is implication (Rb,Rc) linearly exact =>• (c,6) (left,right) invertible =>• (Rh,Rc) exact. (10.7.7.2) Proof: Repeat word for word the proof of Theorem 10.4.6 ■ The analogue of Theorem 10.7.7 also holds for almost exactness: analogous to (10.4.6.5), if (c,6) £ A2 is almost (left,right) invertible then (Lb,La) almost linearly exact =>• (6, a) almost (left,right) invertible (10.7.7.3) =>• (Lb,La) almost exact while, analogous to (10.4.6.6), if instead (6, a) £ A2 is almost (left,right) invertible then (Rb,Rc) almost linearly exact =>• (c,6) almost (left,right) invertible (10.7.7.4) =>• (Rb,Rc) almost exact Like Theorem 10.4.6, Theorem 10.7.7 and its "almost" analogue extend to longer chains (an,... , c^,*^) and (0,an,... ,a2,a1,0). The Fredholm and Weyl theory of a homomorphism of normed algebras extends to pairs of elements, giving another perspective on "essential exactness." 10.7.8 DEFINITION If T £ BBL(A,B) is a homomorphism of normed algebras then a chain (6, a) £ A2 will be called (left,right) T-Fredholm if l£BT(b)+T(a)B (10.7.8.1) weakly (left,right) T-Fredholm if there is implication, for arbitrary u,v £ B, uT(a) = 0 = T(b)v =)> uv = 0 (10.7.8.2) and (leftjright) T-Weyl if there are a', 6', a", b" in A for which b'b + aaf -1£T~1(0) and a'bf = 0 and b'b" + a" a' = 1 (10.7.8.3) This gives us back Definitions 10.6.1 and 10.6.3 if we take A = BL(X,X) and B = BL(X,X)/KL0(X,X), with T the "finite Calkin functor." For certain such homomorphisms, we recover the relationship between regularity, Fredholmness, and weak Fredholmness:
10.8 Hilbert Spaces 441 10.7.9 THEOREM If T £ HBL(A, B) satisfies T-^OJC^ and T(A) = B (10.7.9.1) then it is necessary and sufficient for a chain (6, a) € A2 to be (left,right) T-Fredholm that (6, a) is regular and weakly T-Fredholm (10.7.9.2) Proof: This is a rewrite of the proof of (10.6.2.5) from Theorem 10.6.2. ■ Theorem 10.6.4 and Theorem 10.7.3 have a common generalization which gives conditions sufficient for a pair (6 + h, a + k) to be T-Fredholm, or T-Weyl: we leave it to the reader to state and prove. 10.8 HILBERT SPACES For Hilbert spaces, most of the conditions of Definition 10.1.1 and of Definition 10.1.3 are the same: 10.8.1 THEOREM Suppose that X, Y, and Z are Hilbert spaces, and that T £ BL(X,y) and S G BL(X,Z): then there is implication S majorized by T =>► S a left multiple of T (10.8.1.1) If instead T £ BL(X,Y) and S £ BL(Z,Y), there is implication S almost comajorized by T =>• S a right multiple of T (10.8.1.2) and S range-included in T =)> S a right multiple of T (10.8.1.3) Proof: For (10.8.1.1) we need only assume that Y is a Hilbert space and that Z is complete, and follow the argument at the end of Theorem 10.1.2. If the condition (10.1.1.4) holds, then the mapping U0 : TX -► Z of (10.1.2.4) is well defined, bounded, and linear; if the space Z is complete then there is a bounded extension U1 of U0 to the closure cl(TX), and finally, if Y is a Hilbert space there is Q = Q2 € BL(Y,Y) with Q(Y) = cl(TX). If we now define U = UlQ:Y—> Z (10.8.1.4) analagously to (10.1.2.5) then we will satisfy the condition (10.1.1.1). For (10.8.1.2) we use instead the argument at the end of Theorem 10.1.4: if the condition (10.1.3.5) is satisfied then the operators U of (10.1.4.4) will
442 10. Comparison of Operators and Exactness satisfy the condition (10.1.3.1). Finally, since Hilbert spaces are complete, the condition (10.8.1.3) is derived from (10.8.1.2) using (10.5.5.1). ■ For example Theorem 10.8.1 and Theorem 9.9.5 combine to give the "polar decomposition" of a bounded operator between Hilbert spaces: If TGBL{X,Y) then T = sgn(r) . \T\ (10.8.1.5) with |r| = (r*r)1/2 e bl(x,x) (10.8.1.6) and sgn(r) = sgn(r)sgn(r)*sgn(r) e BL(X,y) (10.8.1.7) Here the positive operator \T\ is given by (9.9.5.1) with a = T*T and A = BL(X,X): since \T\ = \T\* and \T\2 = T*T we have |||r|||2 = (|r|2x;x) = {T*Tx;x) = \\Tx\\2 for each x £ X (10.8.1.8) so that T is majorized by \T\ in the sense of Definition 10.1.1. We shall define sgn(T) to be the operator U obtained in the analogue of (10.8.1.4) with T and \T\ in place of S and T, taking Q to be the orthogonal projection on the range of \T\. An operator U for which U = UU*U is called partial isomttry. It would have been nice to extend the argument of Theorem 10.8.1 to the conditions of Definition 10.2.1: the reader is invited to try his hand. In the particular case of the exactness of Definition 10.3.1 we have the equivalence: we first pass through an auxiliary result. 10.8.2 THEOREM If X, Y, and Z are Hilbert spaces and (S,T) e BL(X, Y, Z), then the following are equivalent: T is one-one and S is dense T*T and SS* are one-one and dense col(5,r*) is dense row(5*,T) is one-one Each of the following are also equivalent: T is left invertible and S is right invertible T*T and SS* are invertible col(5,T*) is right invertible row(S*,T) is left invertible (10. (10. (10. (10. (10. (10. (10. (10. 8.2.1) 8.2.2) 8.2.3) 8.2.4) 8.2.5) 8.2.6) 8.2.7) 8.2.8)
10.8 Hilbert Spaces 443 Proof: The equivalence of (10.8.2.1) and (10.8.2.2) follows from (8.6.5.1) and (8.6.5.2) from Theorem 8.6.5. In particular, (8.6.5.5) gives equality r_1(o) = (r*r)_1(o) (10.8.2.9) which establishes half of the equivalence (8.6.5.1), the other (8.6.5.2) following by taking adjoints. If we apply (8.6.5.1) with row(S*,T) in place of T, and apply (8.6.5.1) with col(5, T*) in place of T, and observe col(5,r*)row(5*,r) = S ' T* IS* L ri = J 'SS* 0 0 T*T (10.8.2.10) then we obtain the equivalence of each of (10.8.2.4) and (10.8.2.3) with (10.8.2.1). For the second part the equivalence of (10.8.2.5) and (10.8.2.6) is given by (8.6.5.3) and (8.6.5.4) from Theorem 8.6.5, and then the equivalence of (10.8.2.5) with each of (10.8.2.8) and (10.8.2.7) follows from (10.8.2.10). ■ More generally, 10.8.3 THEOREM If X, Y and Z are Hilbert spaces and (S,T) € BL(X, y, Z) then the following are equivalent: (S,T) is almost linearly exact (10.8.3.1) S*S + TT* is one-one and dense (10.8.3.2) col(5,r*) is one-one (10.8.3.3) row(S"\T) is dense (10.8.3.4) Each of the following is also equivalent: (S,T) is (left,right) invertible (10.8.3.5) S*S + TT* is invertible (10.8.3.6) col(S,r*) is left invertible (10.8.3.7) row(5*,r) is right invertible (10.8.3.8) Proof: Whether or not the operators T € BL(X,Y) and S € BL(Y,Z) form a chain, there is implication s-1(o) c d(rx) ^=> sr^o) _l (r*)-1^) => s-\o) n (r*)-1^) = {0} (10.8.3.9)
444 10. Comparison of Operators and Exactness while {ST = 0 and 5"x(0) n (T*)"1^) = {0}) =► 5"x(0) _L (T*)"1^) (10.8.3.10) This follows from (8.3.3.2) from Theorem 8.3.3 by taking K = c\{TX) and H = 5_1(0). Thus if ST = 0 and the condition (10.8.3.1) holds then so does the condition (10.8.3.3), which is therefore equivalent to (10.8.3.1). The equivalence of (10.8.3.3) and (10.8.3.4) is just (8.5.4.1). Finally, the equivalence of (10.8.3.3) and (10.8.3.2) is an application of (8.6.5.1) with col(5,T*) in place of T, noting that row(S*,T)col(S,r*) = [S* T] S = S*S + TT* (10.8.3.11) Towards the second part suppose that (5, T) is (left,right) invertible, with VS + TU = J: then it is also (left,right) bounded below in the sense of (10.3.1.4) and hence satisfies the condition (10.4.1.3). With k = \\U\\ and h = ||V|| we have for arbitrary y,w £ Y Ky;u»>|<fc|HH|5y|| + fc||y||||r*u»|| (10.8.3.12) and hence, taking w = y and then dividing across by ||y||, \\y\\ < h\\Sy\\ + &||r*y|| for each y eY (10.8.3.13) Thus if (10.8.3.5) holds then the operator col(5,T*) is bounded below, and hence by (8.4.4.1) is left invertible, giving (10.8.3.7). The equivalence of (10.8.3.7) and (10.8.3.8) is (8.5.4.2) together with Theorem 8.4.4: we finish the whole argument by showing that (10.8.3.6) => (10.8.3.5). This is given by the argument of Theorem 10.3.4: if ST = 0 then also T*S* = 0 and hence (5*5)(TT*) = (TT*)(5*5) (10.8.3.14) S*S + TT* =ReBL-\Y,Y) (10.8.3.15) If, in addition, is invertible then (jR_15*)5 + r(r*jR_1) = / (10.8.3.16) which is the condition (10.8.3.5). ■ Part of the second part of Theorem 10.8.3 gives the promised implication (10.3.1.4) => (10.3.1.1) for Hilbert spaces, and hence of course also (10.3.1.2) => (10.3.1.1). The essence of Theorem 10.8.2 follows from Theorem 10.8.3 by taking separately 5 = 0 and T = 0. As part of the proof
10.8 Hilbert Spaces 445 of Theorem 10.8.2 we noted in (10.8.2.9) that T £ BL{X,Y) is determined by T*T e BL(X,X) in the sense of (10.1.1.5). The alert reader may have wondered whether for the second part of the argument we could have shown that T was majorized by T*T in the sense of (10.1.1.4): but this fails for the operator T = W : (xn) —► {xn/n) of (7.3.6.14), defined on the sequence space S = Y = l2: for if Sn = (0,0,..., 0,1,0,...) is the "Kronecker delta" then, for each nGN, ll*»ll = 1 ^d IITSJI = n"1 and (10.8.3.17) \\T*TSn\\=n-i = n-l\\TSJ Together with Theorem 3.5.1, these last two results give us a boundary result dual to Theorem 10.3.9: 10.8.4 THEOREM If X, Y and Z are Hilbert spaces and if chains (5,T) e BL(X,y,Z) and (5n,Tn) G BL{X,Y,Z) satisfy (S,T) is (left,right) invertible (10.8.4.1) and Sn is dense and Tn is one-one (10.8.4.2) and ||5n - 5|| + \\Tn - T\\ —> 0 as n —> oo (10.8.4.3) then S is dense and T is one-one (10.8.4.4) Proof: By Theorem 10.8.3 the operator col(S,T*) is left invertible, therefore bounded below, while by Theorem 10.8.2 each of the operators col(Sn,T^) is dense, and also || col(5n,T^) - col(S,T*)|| -> 0. By Theorem 3.5.1 the operator col(5,T*) is also dense, which by Theorem 10.8.2 again gives (10.8.4.4). ■ The second part of Theorem 10.8.3 extends to the "Hilbert algebras" of Definition 8.8.1: 10.8.5 THEOREM If (6, a) G A2 is a chain in a Hilbert algebra A then the following are equivalent: (6, a) is (left,right) bounded below (10.8.5.1) 6*6 + aa* is not a topological zero divisor (10.8.5.2)
446 10. Comparison of Operators and Exactness 6*6 + aa* is invertible (10.8.5.3) (6, a) is (left,right) invertible (10.8.5.4) Proof: By Theorem 9.9.4 and Theorem 9.9.6 there is inequality, for arbitrary a, 6 £ A, max(||a||2, ||6||2) < ||6*6 + aa* || (10.8.5.5) If now (6, a) £ A2 is (left,right) bounded below in the sense of (10.7.1.3) then there are k > 0 and h > 0 for which, for arbitrary u £ A, H|2 = ||u*u|| < A||ul||fru|| + *N|||u*a|| = \\u\\(h\\bu\\+k\\a*u\\) (10.8.5.6) which with the elementary (1.6.1.1) and the Hilbert algebra condition (8.8.1.1) gives ll^ll < (A:2 -H ^2)1/2([[&^||2 -H H^^H2)1/2 = (k2 + h2)1/2{\\u*b*bu\\ + \\u*aa*u\\)1/2 and hence, using (10.8.4.5), INI2 < (A:2 + /i2)(||u*6*6u|| + ||u*aa*u||) < 2{k2 + h2)\\u*{b*b + aa*)u|| (10.8.5.8) and hence finally, INI2 < 2{k2 + /i2)||u||||(6*6 + aa*)u\\ for each u £ A (10.8.5.9) Dividing across by ||u|| gives (10.8.5.2), and hence by Theorem 9.9.7 also (10.8.5.3). Using for the first time the assumption ba = 0 and following the argument for the last part of Theorem 10.8.3 gives the condition (10.8.5.4). If c = 6*6 + aa* is invertible and (6*6)(aa*) = (aa*)(6*6) then (c-x6*)6 + a{a*c-x) = 1 (10.8.5.10) The final, obvious, implication (10.8.5.4) => (10.8.5.1) is part of Theorem 10.7.2. ■ Part of the argument for Theorem 10.8.5 shows that (6, a) (left,right) one-one =*> 6*6 + aa* not a zero divisor (10.8.5.11) Indeed whether or not ba = 0 there is implication, for arbitrary u £ A, (6*6 + aa*)u = 0 =^ u*(6*6 + aa*)u = 0=^bu = u*a = 0 (10.8.5.12)
10.8 HUbert Spaces 447 The reader is invited to see whether he can reverse the implication (10.8.5.12) when ba = 0, or indeed extend the implication (10.8.5.1) => (10.8.5.4) to the situation in which this is not assumed. This implication also gives a "spectral permanence" for chains (6, a) £ A2 in a Hilbert algebra A: if A C B is isometrically embedded in a larger normed algebra B, then any chain (6, a) which is (left,right) invertible in B will be (left,right) invertible in A, and any chain which fails to be (left,right) bounded below in B will also fail in A. Theorem 10.8.5 applies, in particular, in the Calkin algebra A = BL(X, X)/KL(X, X) on a Hlbert space X. If we argue similarly for the seminormed algebra BL(X, X)/ KL0(X, X) obtained by quotienting out the finite rank operators, and then relax the requirement that all three spaces be the same, we reach the following result. 10.8.6 THEOREM If (5, T) <E BL(X, Y, Z) for Hilbert spaces X, Y and Z then the following are equivalent: (S,T) is weakly (left,right) Fredholm (10.8.6.1) S*S + TT* is essentially one-one and essentially dense (10.8.6.2) col(5,T*) is essentially one-one (10.8.6.3) row(5*,T) is essentially dense (10.8.6.4) Each of the following is also equivalent: (5,T) is (left,right) Fredholm (10.8.6.5) S*S + TT* is Fredholm (10.8.6.6) col(S,T*) is left Fredholm (10.8.6.7) row(S*,T) is right Fredholm (10.8.6.8) Proof: This is left to the reader. ■ We can add to the list in the second part of Theorem 10.8.6 if we observe (S,T) (left,right) Fredholm <£=> (5,T) nearly essentially exact (10.8.6.9) To see this combine (10.6.2.5) with Theorem 10.6.8. We can also compute the Euler number of certain kinds of Fredholm chains in terms of the index of a related operator:
448 10. Comparison of Operators and Exactness 10.8.7 THEOREM If (S,T) <E BL{X,Y,X) and (T,S) <E BL{Y,X,Y) then (S, T, S) (left,right) Fredholm <^=> T + S* Fredholm (10.8.7.1) with Euler(S, T, S) = index(T + S*) (10.8.7.2) in particular (S, T, 5) (left,right) Weyl <=> T + 5* Weyl (10.8.7.3) Proof: The first part follows from (10.8.6.6) when we observe S*S + TT* = (5*+T)(T*+5) and SS*+T*T = (S + T*)(T + S*) (10.8.7.4) Towards (10.8.7.2) the argument of Theorem 10.8.3 says that if ST = 0 then 5"1(o)/ci(rx) ^5"1(o)n(r*)-1(o) = (5*5 + rr*)"1(o) (10.8.7.5) and hence, 5"1(0)/cl(rX) © (T* + 5)_1(0) = r"1(0)/cl(5y) © (T + 5*)_1(0) (10.8.7.6) When (S,T) and (T, S) are Fredholm we can add dimensions and get (10.8.7.2), and in particular (10.8.7.3) ■ 10.9 SKEW EXACTNESS Suppose T £ BL(X,y) and S £ BL(Y,Z): then each of the conditions T{X) H S"1 (0) = {0} (10.9.0.1) and T(X) + S"1 (0) = Y (10.9.0.2) is in some sense "orthogonal" to the exactness condition (10.2.0.1). We begin by recalling a relationship between these conditions and the index Theorem 6.5.4: 10.9.1 THEOREM If T <E BL(X, Y) and S <E BL(y, Z) there is implication T{x) n s_1(o) = {o} <=> r_1(o) = (5r)_1(o) (10.9.1.1)
10.9 Skew Exactness 449 and T{X) + 5_1(0) = Y <=> S{Y) = {S{T{X)) (10.9.1.2) Proof: The mapping TA : X/T~x{0) -> T(X) is one-one and onto, and induces the isomorphism used in the proof of (6.5.4.6): (sr)-1(o)/r-1(o) = [tx) n 5_1(o) (10.9.1.3) since if x <E X then Tx <E S_1(0) & {ST)x = 0. The mapping 5A : y/5_1(0) —> 5(y) is also one-one and onto, and induces the isomorphism used in the proof of (6.5.4.7): Y/{TX + 5_1(0)) = S{Y)/{ST{X)) (10.9.1.4) since if y <E Y then Sy <E S(TX) &yeTX+ S_1(0). ■ There is an interaction between exactness and skew exactness which generalizes our results on subspaces and quotients, Theorem 3.11.1 and Theorem 3.11.2, as well as our "ascent and descent" Theorem 7.8.2: 10.9.2 THEOREM If U e BL{W,X), T e BL(X,Y) and V <E BL(y,Z), then there is implication J7_10 = (rm_10 and (VT)_10 C U(W) V J ,~ , , (10.9.2.1) =► T_10 = 0 =J> U-x0 = (Ttf) 0 [TU)W D V-x0 and VY = [VT)X =^TX = Y =^VY = [VT)X (10.9.2.2) [TU)W D V'-^O) and T~l{ti) = 0 =► [VT)'1^) C tf(W) (10.9.2.3) {VT)-1^) C UW and T(X) = y =► (T^)(W) D y_1(0) (10.9.2.4) Also [VTU){W) = (VT)(X) and (VT)-1^) = r_1(0) =► (TJ7)ty = TX (10.9.2.5) and (VTUy'iO) = (TU)-1^) and (TU)W = TX q q g =>• (vr)_1o = r_1o
450 10. Comparison of Operators and Exactness Proof: If U-x0 = (TU)-1*) and (VT)_10 C UW suppose Tx = 0, so that also VTx = 0 giving x = Uw with w e (TU)-x0 = U~x0, which makes x = 0. This is the first implication of (10.9.2.1), and the second is clear from (10.9.1.1). If y e Y is arbitrary and VY = VTX then there isiGl for which y - Tx <E 7_10, and if also TU{W) D 7_10 then there is w e W for which y = Tx + TUw = T{x + Uw) G TX. This is the first implication of (10.9.2.2), and the second is clear from (10.9.1.2). If x e X satisfies VTx = 0 and V_10 C (TU)W then there is w <E W for which Tx = TUw, and if also T is one-one then x = Uw: this proves (10.9.2.3). If Vy = 0 and TX = Y then y = Tx with x £ (7r)_1(0), and if also (7r)_10 C UW then x = Uw, giving y <E [TU)W. This proves (10.9.2.4). To prove (10.9.2.5) argue y = Tx => Vy = VTUw with T(x - Uw) G^O, hence T(x - Uw) = 0. Finally, for (10.9.2.6) argue VTx = 0 and Tx = TUw => VTUw = 0=>Tx = TUw = 0. ■ Theorem 10.9.2 relates the weakest possible kind of "exactness" for the pairs {VT,U) and {V,TU): 10.9.3 THEOREM If U <E BL(W,X), T G BL(X,Y) and V <E BL(y,Z), then there is implication U majorized by TU and (VT, U) almost exact => T bounded below ==> U majorized by TU (V,TU) almost exact and V almost comajorized by VT => T almost open ==> V almost comajorized by VT (V, TU) almost exact and T bounded below => (VT, U) almost exact (VT, U) almost exact and T almost open =r> (V,TU) almost exact Also VT almost comajorized by VTU and T majorized by VT =r> T almost comajorized by TU and TU majorized by VTU and T almost comajorized by TU =^ T majorized by VT (10.9.3.1) (10.9.3.2) (10.9.3.3) (10.9.3.4) (10.9.3.5) (10.9.3.6)
10.9 Skew Exactness 451 Proof: We closely follow the pattern of argument for Theorem 10.9.2, substituting those parts of Theorem 3.11.1 and Theorem 3.11.2 which involve boundedness below and almost openness for those which involve one-one- and ontoness. Thus for (10.9.3.1) we suppose that there are k > 0 for which ||J7(.)|| < k\\TU{.)\\ on W and &',&" > 0 for which each x £ X has [wn) in W for which limsupn ||x - Uwn\\ < k'\\VTx\\ with supn ||u;n|| < k"\\x\\. Now for arbitrary iGlwe have ||s||<||*-CTWn|| + ||tf«,B|| <\\x-UwJ+k\\T{Uwn-x)\\+k\\Tx\\ <(l + *||T||)||s-CT«;J+*||T*|| and hence, taking limsupn, ||*|| < (1 + *||r||)A'||VTs|| + fc||Tx|| < (k + (1 + *||T||)A'||V|| Wyr)||T*|| This is the first implication of (10.9.3.1), and the second is obvious. Towards (10.9.3.2) we suppose there are k,k',k" > 0 for which limsupn ||y — TUwJ < k'\\Vy\\ with supn||Wn|| < fc||y|| and \\Vy - TxJ -» 0 with supn ||zj| < fc"||y||. Then if y e Y is arbitrary and \\V(y - Txn)\\ -»• 0 there is Km) in W with limsupm ||(y - Txn) - wnm\\ < k'\\V{y - Txn)\\ and supm ||Wnm|| < A||y - TxJ < fc(l + ||T||fc")||y||. This gives limsup||y - T{xn + Uwnm)\\ < limsupfc'||y(y - Txn)|| = 0 n,m n and K-^»«,ii<*"+*(i+imi*") This proves the first implication of (10.9.3.2), and the second is obvious. For (10.9.3.3) we have for arbitrary x £ X a sequence {wn) in W with limsupn||x - Uwn\\ < A;limsupn \\Tx - TUwn\\ < kk'\\VTx\\ with supn||w;n|| < ||Tx|| < A;"||T||||x||. For (10.9.3.4) we have for arbitrary y £ Y a sequence (xn) with ||y — Txm|| —> 0 and supm ||xm|| < A^'lls/H, and then sequences {wmn) for which limsupn \\xm — Uwmn\\ < k'\\VTxm\\ with supn \\wnm\\ < k\\xm\\9 giving limsupmjrl \\y - TUwmn\\ < limsupm \\y - Txm\\ + ||T|| limsupmjrl \\xm - Uwmn\\ < ||T||A;'||V<,|| using again ||y — Txm|| —> 0. Towards (10.9.3.5) we have for arbitrary x £ X a sequence (wn) with ||VTx - VTUwn\\ -> 0 and supn ||u;n|| < k'\\x\\9 while also ||T(.)|| < A;||7r(.)||. Thus, \\Tx-TUwn\\ < k\\VTx - VTUwn\\ -> 0. Finally, for (10.9.3.6) we have for arbitrary x £ X a sequence (wn) with supn ||Wn|| < fc'||x|| and \\Tx - TUwJ -» 0, while \\TU(-)\\ < k\\VTU{-)\\. Thus, ||Tx|| < ||r*-r^wB|| + ||Tl7u;n|| < \\Tx- TUwJ + k\\VTUwn\\ < \\Tx - TUwJ + jfc||Vri7u;n - VTx\\ + k\\VTx\\, and taking limits gives the result. ■
452 10. Comparison of Operators and Exactness Naturally, there is an analogue for the "decomposable exactness" condition (10.3.1.1): 10.9.4 THEOREM If U <E BL(W,X), T £ BL(X,y), and V <E BL{Y,Z), then there is implication U = T'TU and I = V'VT + UU' =^ I = T"T =► U = T"TU (10.9.4.1) I=V'V + TUU" and V = VTT" =► J = TT' =► V = VTT' (10.9.4.2) I=V'V + TUU" and I = T'T => I = V'VT + UU' (10.9.4.3) J = V'VT + UU" and I = TT' =^ I = V"V + TUU' (10.9.4.4) Also VT = VTUU' and T = V'VT =^T = TUU" (10.9.4.5) and TU = V'VTU and T = TUU' =J> T = V'VT (10.9.4.6) Proof: If U = T'TU and I = V'VT + UU' then 0 = (/ - T'T)UU' = {I- T'T){I - V'VT) giving (10.9.4.1) with T" = T'+V'V-T'TV'V. If instead J = V'V+TUU" and V = VTT", then 0 = V'V{I - TT") = {I- TUU") {I - TT") giving (10.9.4.2) with V = T" + UU" - UU"TT". For (10.9.4.3) we argue I = T'T = T'{V'V + TUU")T = [T'V')VT + U{U"T) and for (10.9.4.4) we argue I = TT" = T(V'VT + UU")T" = [TV')V = TU{U"T") Finally, if VT = VTUU' then V'VT = V'VTUU', giving (10.9.4.5) with U" = U, and if TU = V'VTU then TUU' = V'VTUU', giving (10.9.4.6) with V" = V. ■ It is left to the reader to show that the analogue of Theorem 10.9.4 holds with almost exactness and almost left or right multiples in place of exactness and left or right multiples. We conclude with another kind of analogue of the result which says that if a product of two operators is left or right invertible then so is one factor:
10.9 Skew Exactness 453 10.9.5 THEOREM If T e BL{X,Y), S e BL(Y,Z), R e BL(Z,W), U e BL(y,y), and V e BL{Z,Z) satisfy vs = su then there is implication (jR,5) and (S, T) nonsingular = R 0 V S S 0 -U T for each of the nonsingularities of Definition 10.3.1. Proof: If R'R + SS" = I = S'S + TT' then B! 0 -S'VR' S' R 0 V S + S 0 -U T S" 0 T'US" T' (10.9.5.1) nonsingular (10.9.5.2) (10.9.5.3) / 0" 0 /. (10.9.5.4) The only nontrivial detail is the bottom left hand corner, which in the left-hand side is given by -S'VR'R + S'V - US" + TT'US" = S'V{I - R'R) -{I- TT')US" = s'vss" - s'sus" = S'(VS -SU)S" This establishes (10.9.5.2) for the (left,right) invertibility of (10.3.1.1), and the argument for the almost (left,right) invertibility of (10.3.1.2) is almost identical. The same pattern of argument establishes (10.9.5.2) for the exactness of (10.3.1.4): if for each z £ Z there are y' and x' for which \\z-Sy"\\<h'\\Rz\\ with ||y'|| < fc'W and lly-Ts'll^fcUSyll with ||x'|| < *||y|| then there is also x" e X with ||x"|| < k\\y + Uy'\\ and \\(y + Uy')-Tx"\\<h\\S(y + Uy')\\ = h\\Sy + VSy'\\ < h\\Sy + Vz\\ + h\\V\\\\Z - Sy'\\ (10.9.5.5) (10.9.5.6) (10.9.5.7) Together (10.9.5.5) and (10.9.5.7) show that the right-hand side of (10.9.5.2) holds for exactness in the sense of (10.3.1.4). The argument
454 10. Comparison of Operators and Exactness for almost exactness in the sense of (10.3.1.5) is the same, and left to the reader. Finally, if jR_1(0) C S(Y) and 5_1(0) C T{X) then for arbitrary (10.9.5.8) ' z .y. e R 0" y s. -i "0" .0. we have, since Rz = 0, y' £ Y for which z = Sy' and Sy = -Vz = -VSy' = -SUy' Since this means S(y + Uyf) = 0 there must be x' £ X with z = Sy' and y + Uy = Tx' which means that ' z .y. = ' S 0' -u r. [y'l This establishes (10.9.5.3) for linear exactness in the sense of (10.3.1.6), and the argument for almost linear exactness can now be left to the reader. ■ Conversely, 10.9.6 THEOREM If T e BL{X,Y), S e BL{Y,Z), R <E BL{Z,W), u e BL(y,y), v e bl{z,z) then R 0 V S S 0 -U T nonsingular =^ (col(jR,7),5) and (5,row(T,C7)) nonsingular for each of the nonsingularities of Definition 10.3.1. Proof: If (10.9.6.1) R' W V S' R 0 V S + S 0 -U T s" w" -U' T' then [jR; W] + SS" = I = S'S + [-U T] I 0 0 / r (10.9.6.2) (10.9.6.3)
10.9 Skew Exactness 455 which proves (10.9.6.1) for (left,right) invertibility, and similarly for almost (left,right) invertibility. If instead the left-hand side of (10.9.6.1) holds for the exactness of (10.3.1.4) then there are h > 0 and k > 0 for which, for arbitrary z £ Z and y GY, there are yf GY and x' £ X such that S 0] \y' -U T <h R 0 V S with y] <k (10.9.6.4) Taking the cartesian product norm to be the maximum of the norms on the factors, and specializing in turn to y = 0 and to z = 0, ||s-Sy'|| < max{\\z-Sy'\\,\\Tx'-Uy'\\) < hmBx{\\Rz\\9\\Vz\\) withmax(||y/||l||a/||)<*H and \\y + Uy' - Tx'\\ < max(||S</'||, \\y + Uy' - Tx'\\) < h\\Sy\\ with maxdli/'lUlx'l!) <A;||y|| (10.9.6.5) (10.9.6.6) This proves (10.9.6.1) for exactness, and similarly for almost exactness. Finally, if ~R 0" -i 0' .0 c " S 0' -u r. Y~ then and jR_1(o) n v-^o) c 5(^_1(rx)) c s{Y) S~x(0) C T{X) + ^(5_1(0)) C T{X) + U{Y) (10.9.6.7) (10.9.6.8) (10.9.6.9) This proves (10.9.6.1) for linear exactness, and similarly for almost linear exactness. ■ If, in particular, RS = 0 = ST and VS = SU then there are well-defined operators colA(jR, V) : Z/c\(SY) —-> W x Z/c\(SY) (10.9.6.10) (10.9.6.11)
456 10. Comparison of Operators and Exactness and rowA(T,U) : X x 5_1(0) —-> S_1(0) (10.9.6.12) Then (10.9.6.5) and (10.9.6.8) say that colA(jR,V) is bounded below and one-one, respectively, while (10.9.6.6) and (10.9.6.9) say that rowA(T,J7) is open and onto, respectively.
11 Multiparameter Spectral Theory The various kinds of "spectra" of a normed algebra element have extensions to n-tuples, and more general systems, of elements: for systems of linear operators there is a more subtle concept of joint spectrum available. 11.1 LEFT AND RIGHT SPECTRA If a = (al9 a2,..., an) £ An is an ra-tuple of elements in a normed algebra A we shall write n n row(Lj : b —► ]T ay6y and row(jR0) : b —► ]T 6yay (11.1.0.1) y=i y=i for the "row" operators induced from An to A, and col(L0) : c —> (axc, a2c,..., anc) and (11.1.0.2) col(jRj :c—► (ca^caj^.^caj for the "column" operators induced from A to An. 11.1.1 DEFINITION If a e An in a normed algebra A, then the /e/* spectrum of a with respect to A is the set o*J*{a)= «eC»:lj!£x(«,.-,,.) I ,tl J (11.1.1.1) = {s 6 Cn:row(2E0_.) e rri*ht BL^",^)} 457
458 11. Multiparameter Spectral Theory and the right spectrum is the set oTleht{a)=\SeCn:lzJ2(a}.-Si)A\ { ,=i J (11.1.1.2) = {se Cn:row(L„_,) e r"*ht BL(An, A)} The almost left spectrum of a with respect to A is the set ar(a) = |«eC":l*cl£>(o,.-«,.)| I i=i J (11.1.1.3) = {« 6 Cn:row(i?a_3) e ?right BL{An, A)} and the almost right spectrum is the set af (a)= 5GCn^clEK--^ [ y=i J (11.1.1.4) = {se Cn:row(La_J £ fright BL(An,A)} Theorem 3.10.4 extends to ra-tuples a £ An, so that the row operator #a_5 is onto iff it is open, and is dense iff it is almost open: this has been incorporated in Definition 11.1.1. It is clear that the right and almost right spectrum of a £ An are the same as the left and the almost left spectrum of a with respect to the algebra derived from A by "reversal of products." We make similar extension to ra-tuples of eigenvalues and approximate eigenvalues: 11.1.2 DEFINITION If a £ An in a normed algebra A, then the left approximate eigenvalues of a with respect to A form the set f^\a) = \seCn: mf T\\(aj-Sj)x\\=o\ { M^ J (11.1.2.1) -left i = {sE Cn:co\(La_s) 6 fIrftBL(A», A)}
11.1 Left and Right Spectra 459 and the right approximate eigenvalues form the set f^(a) = ls€Cn: inf x £ ||x(a,- - *,-)ll = 0 y=i ) (11.1.2.2) = {se Cn:col(Ra_s) 6 fMtBL(An,A)} The left eigenvalues of a with respect to A form the set O) = 5 e C": f] (La, - ,yI)-i(0) ^ {0} I ;=i J (n.i.2.3) = {5 e C»:col(L0_.) 6 *uitBL(An,A)} and the rtyfa eigenvalues form the set ^ghtW = {*€ C»: H (*., - V)_1(°) * W [ I y=i J (n.i.2.4) = {s 6 Cn:col(J?0_s) e ?rleftBL(^n,A)} We have the obvious inclusions: 11.1.3 THEOREM If a e An for a normed algebra A, then *!?» £ f^W C 5f(«) C a™(a) (11.1.3.1) and right. -right w»«"(a) c f?<"(a) c ^rw c ,r» right / (11.1.3.2) Proof: Only the middle inclusion needs any attention, and (11.1.3.2) will follow from (11.1.3.1) by "reversal of products." By considering ra-tuples a — s £ An it is sufficient to test membership of s = 0 £ Cn in each set. Thus, if 0 £ <7l£{t{a) there is (6m) in An for which £y=1 ||6mj-ay - 1|| -> 0 with supmmaxy ||6my|| = k1 < 00, and hence if 0 < 6 < 1 we have for arbitrary x £ A x < 1-Y^bmiaj\X + supmax||6my||^||o,-x|| m } y=i (11.1.3.3) <*MI+*'£>y*||
460 11. Multiparameter Spectral Theory as soon as ||1 — X)y=i &my ay II < ^> giving for each x E A ||s|| < k JT \\ajX\\ with k = ^ (11.1.3.4) y=i We are of course following the argument of Theorem 3.10.5. This proves (11.1.3.1), and similarly (11.1.3.2). ■ Joint spectra behave under homomorphism in the same way as ordinary spectra: 11.1.4 THEOREM If T : A —► B is a homomorphism of normed algebras and aE An, then <7Bft(Ta) C alX{t{a) and a£ght(Ta) C a*ght(a) (11.1.4.1) ETG HBL(A, B) is a continuous homomorphism, then also a^{t{Ta) C alX{t{a) and agght(Ta) C a^ght(a) (11.1.4.2) If instead T is one-one, then in the other direction, ^W £ TBft(ro) and ^ght(a) C 7r£ght(Ta) (11.1.4.3) If T is bounded below and continuous, then also rlXU{a) C fjg^Ta) and f*ght(a) C r*eht{Ta) (11.1.4.4) Proof: We consider only the "left" inclusions, leaving "right" inclusions to reversal of products, and test only membership of 6 = 0 E Cn. Ifa = (ax, a2,..., an]) e An then of course Ta = [Tax, Ta2,..., Tan) e Bn. Thus, suppose for (11.1.4.1), that 0 £ <jj|ft(a), so that 1 = £y=1 6yay with 6 £ An, giving 1 = Y%=i T{bj)T{a>j) € B, which means that 0 £ aJfft(Ta). If, more generally, 0 £ a^ft(a), with || £y=1 6myay-l|| -> 0 and supm maxy ||6my|| < oo, then provided T is bounded we have || Ey=i T(&my)r(ay) ~ 1|| -+ °> giving 0 £ ofjgftpra), and proving (11.1.4.2). If 0 G ^{a) and if ajx = ° ^ x then, provided T is one-one, T(aj)T(x) = 0 ^ Tx, giving 0 £ 7r^ft(Ta) and proving (11.1.4.3). Finally, if 0 £ ^(Ta) and if T is bounded and bounded below, then for arbitrary iGAwe argue INI < *||T*|| < khJ2 ll(Tay)(T*)|| = khJ2\\T[a,-x)\\ < kh\\T\\ £ H*ll y=i y=i y=i giving 0 £ f!|ft(a) and proving (11.1.4.4). ■
11.2 Polynomials 461 11.2 POLYNOMIALS To generalize to ra-tuples the spectral mapping theorem for polynomials, we begin by looking at "noncommutative" polynomials. As a matter of notation we shall write Polyn = Alg{zuz2,...,zn) (11.2.0.1) for the free algebra with identity generated by n independent elements zl9z2, • • • ,zn. As usual we feel free to interpret the complex field C as a subalgebra of Polyn, the constant polynomials. The remainder theorem holds, and is proved by induction: 11.2.1 THEOREM If / <E Polyn and 5 <E Cn there are /', /" <E (Polyn)n for which / - /W = £ fr (*/ " 5>) = D*y" 5y)'" t11-2-1-1) y=i j=i Proof: If either / = c0 is a constant or / = z- is a coordinate, then (11.2.1.1) is clear. We claim that (11.2.1.1) holds if either / = g + h or / = g • h for polynomials g and h which satisfy (11.2.1.1). Certainly, if / = g + h we can satisfy (11.2.1.1) by taking f' = g' + ti and /" = g" + h"', where g', g" and h', h" come from (11.2.1.1) for g and h, respectively: to see how to extend (11.2.1.1) to a product / = g • h observe g- h - g{s)h{s) = h(s){g - g(s)) + g{h - h{s)) = [g - g{s))h +{h- h{s))g{s) noting that the scalars g(s) and h(s) commute with h — h(s) and g — g{s) respectively. We can now leave it to the reader to write down /' and /" in terms of g', h', g", and h". ■ If A is a linear algebra and / £ Polynj ^nen we write / : An —► A for the obvious mapping. Formally it is defined inductively, setting f(a) =seC if / = 6 G C (11.2.1.2) f(a) = ajeA if / = Zj <E Polyn (11.2.1.3) f(a) = g{a) + h(a) if / = g + /i (11.2.1.4) /(a) = ^(a) . /i(a) if / = g • h (11.2.1.5) If, for example, A = C is the complex field then the induced mapping / is just a "polynomial in n complex variables." This in turn determines
462 11. Multiparameter Spectral Theory / : An —► A if A is commutative. More generally, if / = (/l9 /2,..., /m) £ Poly™ = (PolyJm we shall write f : An -+ Am for the mapping a -> (/1(a),/2(a),...,/m(a)), in which each /^(a) is defined inductively as above. With these interpretations, we immediately obtain about half of the spectral mapping theorem: 11.2.2 THEOREM If a £ A2 for a normed algebra A and / £ Poly™, then /aJf(a)CaJf/M and /^ght(a) C <£*h7(a) (11.2.2.1) /^(aJCal^/H and /^ght(a) C a*ght/(a) (11.2.2.2) /ffWCfr/W and /ff'(a) £ ?f V(«) (11.2.2.3) Meft(a)C^ft/W and KJght(a) C ^ght/(a) (11.2.2.4) Proof: Suppose 5 G Cn is such that f(s) is not in <7^ft/(a): then by the remainder theorem, applied to each component fk, and the definition of left spectrum, we have m m n i = £ MAM - AW) = Eb*E A»K - «,•) fc=l fc=l j'=l 7 = 1 \A:=1 / y=l \fc=l which means that 5 is not in the left spectrum of a. This proves the first part of (11.2.2.1), and the second follows by "reversal of products." The argument for (11.2.2.2) is very similar, but messier: we leave it to the reader. If 5 £ Cn is in 7r^ft(a) then there is x £ A with x ^ 0 = (a;- — Sj)x for each j, giving n (/*(*) - fM)x = Y, f'kj(a)(aj - sj)x = ° for each k 3 = 1 so that f(s) is in 7r^ft/(a). This proves (11.2.2.4): we leave (11.2.2.3) to the reader. ■ The almost left and right spectra are closed and bounded, and upper semicont inuous:
11.2 Polynomials 463 11.2.3 THEOREM If a <E An then for each u <E {a^ft,a left -right -left -right A '"A oj(a) = clo;(a) C {s £ Cn: \sA < \\a-\\ for each j} If U C Cn is a neighborhood of u>{a) there is 8 > 0 for which (11.2.3.1) • u(a') C U Proof: As a special case of Theorem 11.2.2 there is inclusion oj(a) C (jj{ax) x oj(a2) x • • • x c<;(an) (11.2.3.2) (11.2.3.3) Simply take / = z- for each j. An application of the first part of Theorem 9.2.3 now proves the second part of (11.2.3.1) for each of aj|ft and <5^ght, and hence also for f^ft and f^ght. For the first part of (11.2.3.1) we need only observe that in each case uj(a) is the continuous counterimage of a subset of BL{An,A) or BL(A, An) which is closed, either by (3.4.3.1) or by (3.3.3.1). The same argument gives the upper semicontinuity (11.2.3.2). ■ Unlike the single element case Theorem 9.2.3, joint spectra are liable to be empty. If, for example, A = BL(X,X) with X = C2 and a = (a1,a2) with a, = 0 1 0 0 0 0 1 0 (11.2.3.4) then it follows from (11.2.3.3) that a||ft(a) and <j^lght(a) are contained in the set {(0,0)}, while the fact that 2 , 2 1 0 0 1 (11.2.3.5) excludes this point. This same example shows that we cannot expect Theorem 9.2.4 to extend to arbitrary ra-tuples: indeed, if / = z\ + z\ and a = (a1,a2) is as in (11.2.3.4) then for all possible w we have 0 = /(0) = fu(a) ± uf{a) = u(l) = {1} Alternatively, whenever u(a) =0 take / to be a constant. The one-way spectral mapping theorem extends to noncommutative rational functions, interpreted as products of polynomials and their reciprocals. If, for example, / £ Polyn is a polynomial and A is an algebra we define 1// on part of An by setting DomainA(l//) =/"1(r1)an (11.2.3.6)
464 11. Multiparameter Spectral Theory and (l//)(o) = /(a)"1 if o € rl{A~l) (11.2.3.7) 11.2.4 THEOREM If A is a normed algebra, if a 6 An and / 6 Polyn, then for each u, 6 {o^o^^o^o^^f™,^*™,^} a € rl[A~l) =► w(») C C" n /-1(A_1) and (l//)w(o) C w(/(a)_1) (11.2.4.1) Proof: We have first a "remainder theorem": if a S An and s£Cn are both in /_1(A-1) then /(a)"1 - /(s)-1 = /(a)"1/(a)"1 (/(a) - /(,)) (11.2.4.2) Adding in the remainder theorem for /, it follows as in Theorem 11.2.2 that (l//)Ma) n /-'(A-1)) C ^(/(a)-1) (11.2.4.3) It remains only to be shown that w (aJC/-1^"1) (11.2.4.4) It is sufficient to show this for w = a^ft U aAlg : but this follows from (11.2.2.1) and the assumption that /(a) £ A~l: a £ /-'(A-1) => 0 £ oAf(a) D f(o^(a) U a?ht(a)) so that if 5 £ <^ft(a) u ^Aght(a) then /(5) is invertible. ■ Equality does hold in the one-way spectral mapping theorem for certain special polynomials or rational functions: 11.2.5 THEOREM If a £ An and if / £ Poly™ satisfies either m = n and gof = z = fogfoi some g £ Poly^ (11.2.5.1) or m > n and / = (z,$) for some g £ Poly™~n (11.2.5.2) then /w(a) =w/(a) (11.2.5.3) for each a, £ {a™,o**\a™,a?ht,f^,f^^^A18"}-
11.2 Polynomials 465 Proof: For (11.2.5.1) we need only apply Theorem 11.2.2 twice: "/(a) = /(*("/(a))) C fojg(f(a)) = fu[a) C uf(a) Towards (11.2.5.2), observe that if a £ An and b £ Am are arbitrary then w(a,6) C a;(a) x u(b) (11.2.5.4) by two applications of Theorem 11.2.2 to the (m + 7i)-tuple (a, 6). Putting m — 7i in place of m and g(a) in place of 6 in this and applying Theorem 11.2.2 again gives {(5>0(5)):5 £ v(a)} C ct;(a,y(a)) C u;(a) x W0(a) To complete the proof we must verify that (M) <E w(a,y(a)) =► t = g{s) (11.2.5.5) This also follows from Theorem 11.2.2: define h <E Poly™~n by setting h(z,w) = w — g(z) and observe t-g(s) = h{s,t) e hoj{a,g{a)) = w(0) = {0} ■ We shall refer to (11.2.5.4) as the subprojective property: 11.2.6 DEFINITION If w is a subprojective system of mappings from An into subsets of Cn we shall write w6=t(a) ={se Cn: (s, t) e a; (a, b)} (11.2.6.1) and ub=b(a) = (J ub=t(a) (11.2.6.2) tecm For example, if a;(a) is the range of a mapping a : Q —> Cn, then o;6_t(a) is the range of its restriction to the level surface b~x(t) C H. The "relative" spectrum ub=b(a) measures the failure of equality (11.2.5.3): for if w satisfies the "one-way" Theorem 11.2.2 then (11.2.5.5) gives "a=af{") = M<0 (11.2.6.3) and also WM ="/(a)=/(a)(a) (11.2.6.4)
466 11. Multiparameter Spectral Theory 11.3 THE SPECTRAL MAPPING THEOREM For commuting systems of elements, equality holds in Theorem 11.2.2 for the almost spectrum and the approximate eigenvalues. Not surprisingly, we need Liouville's Theorem 9.1.3. We begin with some very simple observations: 11.3.1 THEOREM If U <E BL(X,X), T e BL{X,Y), and V <E BL(y,y) are bounded between normed spaces then there is implication col(T, U) one-one => U one-one on T-1(0) (11.3.1.1) col(T, U) bounded below =>• U bounded below on T_1(0) (11.3.1.2) col(T, U) closed =^ U closed on T"1 (0) (11.3.1.3) row(T, V) dense =^ V dense to Y/ cl{TX) (11.3.1.4) row(T, V) onto =J> V onto Y/ c\{TX) (11.3.1.5) row(r,y) almost open =>• V almost open to Y/c\{TX) (11.3.1.6) row(T, V) open =^ V open to Yj cl(TX) (11.3.1.7) Proof: If col(T, U) is one-one from X to Y x X then for arbitrary x £ X there is implication Tx = Ux = 0 => x = 0. Restricted to points x £ T_1(0) this gives implication Ux = 0 => x = 0, which is (11.3.1.1). If col(T, U) is bounded below on X then there are k,h > 0 for which, for each x £ X, \\x\\ < k\\Tx\\+h\\Ux\\: restricted to T_1(0) this gives ||x|| < h\\Ux\\, which is (11.3.1.2). If in addition there is implication ||Txn—2/|| + ||lfxn—z\\ —> 0 => {y>z) £ {(Tx, 17x):x £ X}, then restricted to T_1(0) there is implication ||[/xn-z|| -+0=> ze {Ux:x e T-^O)}, which is (11.3.1.3). If row(r,V) is onto Y then it is clear that (the mapping induced by) V is onto Y/ cl(TX), as required by (11.3.1.5): we leave (11.3.1.4), (11.3.1.6) and (11.3.1.7) to the reader. ■ We combine the elementary observations of Theorem 11.3.1 with Liouville's Theorem 9.2.3: 11.3.2 THEOREM If U <E BL(X,X), T <E BL(X,y), and V <E BL(y,y) satisfy ^(r_1(o)) cr^o) (n.3.2.1) and V{T{X)) C cl(T(X)) (11.3.2.2)
11.3 The Spectral Mapping Theorem 467 then there is implication T e 7rleft BL(X, Y) =* col(T, U-sI)e fleft BL(X, XxY) for some s <E C (11.3.2.3) and TG7rrightBL(X,y) (11.3.2.4) =* row(T, V -sl)e fnght BL(X x Y, Y) for some s <E C Proof: If T G 7rleft BL(X,y) so that (3.2.1.4) r_1(0) ^ {0} then Theorem 9.2.3 says that the operator U : r_1(0) -+ r_1(0) has bounded nonempty "almost spectrum" relative to the algebra BL(T~1(0),T~1(0)), which therefore by (7.10.3.8) has nonempty topological boundary, which by (9.3.3.1) lies in the set of approximate left eigenvalues of U. By (5.6.3.4) this means that there is 5 £ C for which the operator U—sI is not bounded below on r_1(0), and by (11.3.1.2) this means that col(T, U - si) is not bounded below on X. This proves (11.3.2.1), and the argument for (11.3.2.2) is very similar: if T G 7rrisht BL(X,Y) so that (3.2.1.5) cl(TX) ^ Y then Theorem 9.2.3 says that the operator V : Y/c\(TX) -* y/cl(TX) has bounded nonempty almost spectrum, which again by (7.10.3.8) has nonempty topological boundary, which by (9.3.3.1) lies among the right approximate eigenvalues of V on y/cl(TX). By (5.6.3.2) this means that there is 5 G C for which V is not almost open on Y/c\(TX), and by (11.3.1.6) this means that row(T, V — si) is not almost open to y. ■ Note that (11.3.2.1) says that the operator TU is determined by T in the sense of (10.1.1.5), while (11.3.2.2) says that the operator VT is almost range-included in T in the sense of (10.1.3.7). Theorem 11.3.2 only begins to bite when it is combined with the enlargement process: 11.3.3 THEOREM If U e BL(X,X), T e BL(X,y), and V <E BL(y,y) satisfy TU majorized by T (11.3.3.1) and VT almost comajorized by T (11.3.3.2) then there is implication T e fleft BL(X, y) =J> col(T, U-sI)e fleft BL(X, Y x X) for some seC (11.3.3.3)
468 11. Multiparameter Spectral Theory and refrightBL(x,y) . v (11.3.3.4) => row(T, V - si) £ f nght BL(X x Y, Y) for some seC Proof: If (11.3.3.1) holds then Theorem 10.5.1 (and Theorem 10.1.2) says that (11.3.2.1) holds with (Q(T),Q(17)) in place of (T,17), while if the first part of (11.3.3.3) holds then Theorem 3.3.5 says that the first part of (11.3.2.3) holds with Q(T) in place of T. By (11.3.2.3) there is 5 G C for which col(Q(T), Q(U) - sQ(I)) is not bounded below, and by Theorem 3.3.5 this means that (T, U — si) is not bounded below. This proves (11.3.3.3), and the argument for (11.3.3.4) is very similar: if (11.3.3.2) holds, then Theorem 10.5.2 (and Theorem 10.1.4) says that (11.3.2.2) holds with (Q(T),Q(V)) in place of (T, V), while if the first part of (11.3.3.4) holds, then Theorem 5.7.1 says that the first part of (11.3.2.4) holds with Q(T) in place of T. By (11.3.2.4) there is 5 £ C for which row(Q(r),Q(y) -sQ(J)) is not almost open, and by Theorem 5.7.1, or alternatively Theorem 3.4.5, this means that row(T, V — si) is not almost open. ■ 11.3.4 THEOREM If a £ An is commutative and commutes with b £ Am, then for each w £ {a^ft,a^ght,f^ft,f^ght}, there is equality "(*)="a=a(*) (11.3.4.1) and hence also w/(a) = /a; (a) (11.3.4.2) if a £ An is commutative and / £ Poly™. Proof: Equality (11.3.4.2) follows at once from (11.3.4.1) if we put b = f(a) and use (11.2.6.3). It is sufficient, for (11.3.4.1), by induction on n, to consider only the case of a single element a = ax commuting with each element bk, and of course by "reversal of products" we need only consider the almost left spectrum and the left approximate eigenvalues. To establish (11.3.4.1) with n = 1 and w = aj|ft we use the second part of the elementary Theorem 11.3.2: if t £ Cm is in a^ft(6) then T = iov/(Rb_t) £ 7rrightBL(X,y) with X = Am and Y = A, remembering (11.1.1.3) from Definition 11.1.1 and Theorem 3.10.4. If a = ax commutes with each bk then V = Ra satisfies the condition (11.3.2.2). By (11.3.2.4) there is 5 £ C for which (s,t) £ aj|ft(a, 6) = w(a, b) giving (11.3.4.1) in this case. To derive (11.3.4.2) for left approximate eigenvalues needs the enlargement process, funnelled here through Theorem 11.3.3. If££Cmin ^I|ft(6), then by (11.1.2.1) col(L6_t) is in fleft BL(A, Am), and hence by (11.3.2.3)
11.3 The Spectral Mapping Theorem 469 there is s £ C for which col(L6_t,L0_a) is in fleft BL(A, Am+1), which means by (11.1.2.1) that (s,t) is in fj|ft(a,6). ■ For an alternative derivation of (11.3.4.1) for the almost left spectrum suppose t £ £j|ft(6) and write m N = cl^Mh~h) (11.3.4.3) k=i for the induced closed left ideal of A, together with M = {c £ A: Nc C N} (11.3.4.4) Then M is a closed subalgebra of A, containing the identity 1 and the ideal iV, and N is actually a two-sided ideal of M: NM C M (11.3.4.5) Now if c £ M we claim, since N ^ M, 0 * d*M/N(c + n)Q f™}„{c + AT) C (<?!f)6=t(<) (11.3.4.6) If, for example, c'{c — s) + X^fcLi ^5c(^A: — *fc) = *> then x = c'(c — 5)x + Y^k=i ^kiPk — tk)x £ c'{c — s)x + N for arbitrary x £ M giving \\x + iV|| < ||c'j|||(c — s)x + N\\. If we assume only (s,t) £ aj|ft(c,6) the details are messier, and left to the reader. It is clear from the proof of Theorem 11.3.4 that equality (11.3.4.2) can be attained for certain noncommutative systems a £ An. If n = 1 then it is sufficient for (11.3.4.1) with u = aleft that ax be in M for each t £ Cm, where M is the algebra of (11.3.4.4); for more general n it is then sufficient to impose in succession the corresponding conditions on each a -+1 with (6, al9..., a;) in place of 6. If a £ An and 6 £ Am are arbitrary and £ £ Poly£+m then Theorem 11.2.2 says that for eight different w there is inclusion gu{a, b) C ug{a, b) (11.3.4.7) while if (a, 6) £ An+m is commutative then Theorem 11.3.4 say that for four different u> ug(a,b) = gu(a,b) C g(u(a),u(b)) = g{u{a) x v{b)) (11.3.4.8) Here we are writing u>(K,H) = u>(K x H): we shall also write ug{K,b) = (J $(«,&) and ug(a,H) = (J w^(a,*) (11.3.4.9)
470 11. Multiparameter Spectral Theory With this notation we can state and prove a result intermediate between (11.3.4.7) and (11.3.4.8): 11.3.5 THEOREM Suppose a £ An, g £ Poly£+m and u £ {a^ft, ^ng t^ieft^ng ty, ^^ ^ a jg commutative and commutes with b there is equality ua=3g{a, b) = ua=3g{s,b) ^ each 5 € Cn (11.3.5.1) and ug{a,b) =va=ag{a,b) = (J wfl=^(6,6)Cwff(w(a),6) (11.3.5.2) 3Gw(a) If instead 6 is commutative and commutes with a there is equality Ub=t9ia>b) = Vb=t9{a> *) for each t e c™ (11.3.5.3) and ug{a,b) =ub=bg{a,b) = (J u>h=tg(a,t) C wy(a,w(6)) (11.3.5.4) tGw(6) Proof: If a £ An commutes with 6 £ Am then for each s £ Cn and r £ Cp the (p + ra)-tuples (<7(a, 6) — r, a — s) and (0(5,6) — r, a — s) (11.3.5.5) generate the same left and right ideals in A, which establishes (11.3.5.1) for the left and the right spectrum and hence also, taking closures, for the almost left and the almost right spectrum. Passing to the "annihilators" of these ideals gives also (11.3.5.1) for the left and the right point spectrum, and hence, considering enlargements, for the left and right approximate point spectrum. This now gives the second equality of (11.3.5.2); for the first we must apply (11.3.4.1) with g(a,b) in place of 6. For (11.3.5.3) and (11.3.5.4) we just interchange a and b in what just proved. ■ 11.4 MANY VARIABLES The spectra of Definition 11.1.1 and Definition 11.1.2 have extensions to more general systems of elements: 11.4.1 DEFINITION If A is a normed algebra and a = (ay)jGJ £ A3 is a mapping from the nonempty set J into A then for each #. a /^-left „.right ;;left fright -left -right left right \ ill A 1 l\ Uje\aA i°A i°A *°A >TA >TA >*A >*A J (11.4.1.1)
11.4 Many Variables 471 we define oj{a) = js G CJ: (sy)jGJ, G o((ay)yGJ,) for each finite J' C j} (11.4.1.2) In each case the resulting set uj(a) can be represented by a definition formally very like the definition for ra-tuples. The extension of Theorem 11.1.3, Theorem 11.1.4, and Theorem 11.2.2 is mechanical: 11.4.2 THEOREM If a G AJ and u is in (11.3.1.1), then oj{a) o(j>C u{a o <f>) (11.4.2.1) for arbitrary mappings <f> : K —> J. There is inclusion *r (a) C f^a) C a™(a) C a™ (a) (11.4.2.2) and **"(a) C f*ht(a) C a*"(a) C a^ht(a) (11.4.2.3) If T G HBL(A, B) is a homomorphism, then uB{Ta) C wA(a) for each a; G {aleft,aright,aleft,aright} (11.4.2.4) If T is one-one, then also ^ W £ ^ft(Ta) and ^lght(a) C 7r£ght(Ta) (11.4.2.5) if T is bounded below, then also rlXft{a) C f^Ta) and f*ght(a) C T*ght(Ta) (11.4.2.6) If / G Polyj is a system of polynomials then there is inclusion fu(a) C w/(a) (11.4.2.7) for each a; in (11.4.1.1). Proof: This can be left to the reader. Note that each "polynomial" fk G Polyj is in the algebra generated by the coordinates z^(j G J). ■ The almost left and right spectrum, and the left and right approximate eigenvalues, of a G AJ form compact, but possibly empty, subsets of the cartesian product space C . When J = H is either a topological space or an algebraic system, and a G AJ is either continuous, or a homomorphism, then this is inherited by every 5 G CJ in the left or right spectrum of a:
472 11. Multiparameter Spectral Theory 11.4.3 THEOREM If H is either a topological space, or a semigroup, or a linear space, and if a £ An is either bounded, or continuous, or homo- morphic, or linear, then so is each 5 G Cn in ^Aft(a) u °Ag (a)* ^ m particular n = A and at = t for each t £ ft (11.4.3.1) then alX{t{a) = a^ght(a) = A* = HBL(A, C) (11.4.3.2) Proof: If £, £', and £" are in H and / £ Poly3 then st is in the spectrum of at, st, —st is in the spectrum of at, —at and f{st,st,,st,,) is in the spectrum of /(at,at/,at//), which means that \st\< \\at\\ and |st, "5tl< ||at, - at\\ and (11.4.3.3) |/(«t,«t/,«t//)|< ||/(at,at,,at„)|| The first of these ensures that 5 is bounded with a, the second that 5 is continuous with a, and the third that 5 satisfies any of the obvious homomorphic or linearity properties of a. In particular if a is given by (11.4.3.1) then this ensures that the left and the right spectrum of a are both included in the Gelfand space A* of A: conversely (11.4.2.4) ensures that the Gelfand space of A lies in both the left and the right spectrum of a. ■ If B C A is a subset and at = t for each t £ B, then we are tempted to write w(a) = u{B) (11.4.3.4) for each of the u> of Definition 11.4.1: when B = A this conflicts with some of our earlier notations. Thus we shall write in the situation of (11.4.3.1) w(a) = a; (A) = A* (11.4.3.5) Commuting systems of elements a £ A3 have the "spectral mapping property," in the sense that for arbitrary K and arbitrary / £ Polyj /w(a) = w/(a) (11.4.3.6) for the almost left and right spectrum w, and for the left and right approximate eigenvalues w: 11.4.4 THEOREM If a £ AJ is commutative and commutes with 6 £
11.4 Many Variables 473 AK, then for each w 6 {af.^.ff.f J1*} «(*)=««=«(*) (11.4.4.1) In particular, if a £ A*7 is commutative and / £ Poly^f oj(f(a)) = fu[a)) (11.4.4.2) Proof: This is left to the reader. ■ The reader should also be able to write out an extended version of Theorem 11.3.5. Together with (11.4.3.2), Theorem 11.4.4 gives an alternative derivation of Gelfand's Theorem 9.6.3: 11.4.5 THEOREM If A is a Banach algebra the following are equivalent: A has the left spectral mapping property (11.4.5.1) A has the right spectral mapping property (11.4.5.2) A/Radical(A) is commutative (11.4.5.3) Proof: If a £ AJ is arbitrary, then (7.2.5.4) and Theorem 7.2.6 say that ^eftW=^/tRad(A)(« + Rad(A)) and (11.4.5.4) ^8ht(«)=^^d(A)(« + Rad(A)) and then if (11.4.5.3) holds Theorem 11.4.4 gives the spectral mapping property for the system a + Rad(A) £ (A/Rad(A))J. This means that a £ A3 has the spectral mapping property, in particular, if it is the system given by (11.4.3.1). This means that both of the conditions (11.4.5.1) and (11.4.5.2) follow from (11.4.5.3). Conversely, if say (11.4.5.1) holds then for arbitrary a, 6, c £ A we have <j^eft(c(6a - aft)) = {0} (11.4.5.5) and hence, recalling the connected hull (9.3.3.8) of Definition 7.10.1, aA{c{ba - ab)) C 7yaleft((c(6a - ab)) = r\ {0} = {0} (11.4.5.6) Thus, if a, b £ A then c £ A =J> 1 - c{ba - ab) £ A'1 (11.4.5.7)
474 11. Multiparameter Spectral Theory By Theorem 7.2.3 and (7.2.3.10) this means that ba - ab £ Radical(A), which gives the condition (11.4.5.3). If instead we start with the assumption (11.4.5.2) the argument is the same. ■ We conclude with a converse to the "upper semicontinuity" result Theorem 9.2.6: 11.4.6 THEOREM If a^ £ A and a £ AN satisfy llaoo — an\\ —* 0 as n —► oo and {a^} U {an:n £ N} is commutative (11.4.6.1) then, for each a, £ {d™,e«*\?f,f«**}9 "(O = { lim sn:s £ c0 n J] "("») \ (11.4.6.2) I nGN J Proof: Theorem 9.2.6 says that the right-hand side of (11.4.6.2) is included in the left. Conversely, if t £ ^(a^) is arbitrary then Theorem 11.4.4 says that there is a sequence 5 £ C for which (M)^(a,aJ (11.4.6.3) By Theorem 11.4.3 it follows that \sn — t\ —> 0 as n —> oo, and by part of Theorem 11.4.2 we must have sn £ oj(an) for each n £ N. ■ 11.5 THE SILOV BOUNDARY We recall from Theorem 9.1.3 that if / : H —> C is holomorphic on an open set H containing a disc K C C, then it attains its maximum modulus on the circle dK: 11.5.1 DEFINITION If E C C(H) is a linear subspace which separates points of the compact Hausdorff space H, then the closed boundaries of E in H are the sets AE(Q) = Ix = cl(lf) C n:sup |/| = sup |/| for each f e e\ (11.5.1.1) I k n J The main result about boundaries is easily stated: 11.5.2 THEOREM If E C C(H) is a linear subspace of C(H) which separates points of the compact Hausdorff space H, then nAs(n) £/>F(n) (11.5.2.1)
11.5 The Silov Boundary 475 Proof: We actually prove that nAs(n) 2 dE(n) e AE(n) (11.5.2.2) where dE(n) = {t £ H: for each U £ Nbd(*) there is K £ A^H) with K\U £ A^H)} For the second part of this suppose that W = int(PV) is an arbitrary open neighborhood of dE(Q), so that for each t £ H \ W there is Ut £ Nbd(£) for which AE{n)\utcAE{n) (11.5.2.4) in the sense that K \ Ut is a boundary whenever K is a boundary: then by compactness there is a finite subset H C H \ PV for which n \ W C utGH^t = UyLify- But now m n\71GAB(n)=>(n\71)\72GAB(n)=>---=>n\|j7yGAB(n) and hence m ten y=i Since this holds for arbitrary neighborhoods W of d#(n) it follows that dE(Q) is also a closed boundary, as required by the second part of (11.5.2.2). For the first part of (11.5.2.2) suppose that t £ n is not in the intersection flA£7(n), so that there is Kt £ A^ft) for which t g Kt, and then for each point 5 £ Kt a function gs £ E for which gs(t) = 0 and gs(s) = 2. By compactness there is a finite set K[ C Kt for which Kt C UsGKi {\gs\ > 1}: now we claim that if we put Ut = H {1^1 < !> (11.5.2.5) then (11.5.2.4) will be satisfied. Certainly Ut £ Nbd(*) and 17t n ift = 0 (11.5.2.6) If (11.5.2.4) is not satisfied there will be K £ A^H) for which K\Ut <£ A£7(H), and then / £ E for which sup^^ |/| < 1 = supn |/|. Now for
476 11. Multiparameter Spectral Theory sufficiently large nGNwe have maxsup |/n<75| < sup |/| maxsup|<75| < 1 (11.5.2.7) sGK't q \K\Ut J sGK't q Since Kt G A^H) there is t' e Kt for which \f{t')\ = 1, which with (11.5.2.7) means that \g3{t')\ < 1 for each 5 £ K'v Thus> t'eKtn ut (n.5.2.8) The contradiction between (11.5.2.8) and (11.5.2.6) flows from the attempt to deny (11.5.2.4), which is therefore proved. This finishes (11.5.2.2), which gives (11.5.2.1). ■ From (11.5.2.2) it is clear that the set dE{Q) must actually equal the intersection C\AE{Q), and now we can also simplify (11.5.2.3): dE(Q) = {teQ: for each U <E Nbd(*), H \ U <£ AE{Q)} (11.5.2.9) Certainly the right-hand side of (11.5.2.9) is a subset of the left: for the opposite inclusion use (11.5.2.1) to see that the intersection of two boundaries is always a boundary, which gives implication n \ u c A^(n) =j> As(n) \ u c A^(n) (11.5.2.10) When H = A* is the maximal ideal space of a commutative Banach algebra A and E = AA is the image of A under the Gelfand transform then the Silov boundary looks rather like the approximate eigenvalues: 11.5.3 DEFINITION If A is a commutative Banach algebra and a £ AJ then the Silov boundary of a is the set &a(°) = \ s £ CJ: inf > \(a- — sAu\„ = 0 for each finite Jr C J } \ '"'-^ife ' "J (11.5.3.1) In particular, the Silov boundary of A is the set 8(A) = \<t> e Ax: inf ]T \au\a = 0 for each finite K C ^_1(0) > (11.5.3.2)
11.5 The Silov Boundary 477 As we are suggesting, the Silov boundary of A coincides with the set (11.5.2.3): 11.5.4 THEOREM If A is a commutative Banach algebra, then the Silov boundary of A is the intersection of the closed boundaries of AA in A^\ B{A) = 3AA (A*) = nAAA (A*) (11.5.4.1) Proof: Suppose <j> £ H = A* is in the set dAA (H) and use the formula (11.5.2.9): then if a £ An satisfies <£(ay) = 0 for each j and if 0 < e < 1 there is U £ Nbd(<£) for which n tf Gtf =>£|tf(ay)|<e (11.5.4.2) y=i and then by (11.5.2.9) there is u £ A for which \u\a = 1 and sup \ip(u)\ < ef with Y" \aAa e1 < e (11.5.4.3) Use the condition that n \ U is not a boundary, normalize to make {u^ = 1 and then raise it to a sufficiently high power: we claim n n |ttU = l and ^2\aj-u\a= sup^2\tp(aj-u)\<e (11.5.4.4) y=i ^Atj=i for look at hKtt)l£y=i May)l with ^> G 17 and with ^ £ ft \ 17. This shows that the intersection of the closed boundaries is a subset of the Silov boundary: conversely, if <j> £ A* and U £ Nbd(<£), then by the definition of the topology of A* there is s £ ]0,1[ and a £ An for which I 1> £ A*: X] W*y) " *(ay)| < * > Q U (11.5.4.5) and if ip £ 5(A) there is by (11.5.3.2) an element u £ A for which lu^ = 1 and Ey=i l(ay — ^(ay))ulo- < is* We cla™ that the right-hand side of (11.5.2.9) holds with <f> in place of t: iPeA*\U=^> \ip{u)\ < \ < 1 = \u\a (11.5.4.6)
478 11. Multiparameter Spectral Theory Indeed, iitp&U then £y=1 |^(ay) - ^(ay)| > e by (11.5.4.5), while y=i n n = £ WK - *(a/))*)l ^ E IK - <t>^M* < is ■ y=i y=i The Silov boundary of a system of elements is a subset of the approximate eigenvalues: 11.5.5 THEOREM If a £ An for a commutative Banach algebra A then ^WCr~rW=rTht(a) (11-5.5.1) Hence also 3(A)Cf™(A)=f***(A) (H-5-5.2) Proof: If 0 is not in f J|ft (a) then there is k > 0 for which n E Ha/ttll ^ *HttH for each u e A (11.5.5.3) y=i Consider the algebra B=\f: £ 6^(1)4(1)-<W:(OinA,f; £ ||6j<ool (^m=0|r/|=m m=0 |i/|=m J (11.5.5.4) normed as indicated, where we write \i/\ = 1/(1) + i/(2) H \-i/(n) for each multi-index v (11.5.5.5) Evidently A is a closed subalgebra of B: look at the element n c = Eay^ GjB (11.5.5.6) y=i and claim that, for each uGA, ||cu|| > fc||u|| and ||cr+1u|| > k\\cru\\ for each r eH (11.5.5.7)
11.5 The Silov Boundary 479 Indeed, writing a? = a^(1)4(2)- ••<(") (11.5.5.8) we have, using "multinomial coefficients," |i/|=r ^ ' |i/|=r ^ ' and then ||c'+1«|l = II £>,•*,■£ Q°"«*l y=i |„|=r v y = EEQ IK(1) • • • <(y)+1 • • • <w«r(1) • • • <w+1 • • • <(n) ii y=i H=r v y = e(:)eik^h \»\=r V y using (11.5.5.3). From (11.5.5.9) it follows, replacing uby ur, that ||(cu)r|| = ||crur|| > kr\\ur\\ for each u <E A and r <E N (11.5.5.10) and hence, using Theorem 9.5.2, and (9.5.3.5), n n /J l^y^U > | 2_J CLjZjU^ > k\u\a for each uGA (11.5.5.11) y=i y=i This proves (11.5.5.1): for (11.5.5.2) we can argue f Jjft(A) = f^ght(A) e AAA (A*) is a closed boundary for AA (11.5.5.12) using (9.4.2.1) from Theorem 9.4.2 and (7.10.3.3) from Theorem 7.10.3 to see that \c\a = sup \s\ = sup \s\ for each c £ A ■ (11.5.5.13)
480 11. Multiparameter Spectral Theory If A is commutative and a £ AJ is arbitrary, then there is inclusion *a(<0 £ fA(a) C f| aB(a) C aA(a) (11.5.5.14) ACB For a single element cGA there is equality in the middle of (11.5.5.14): 11.5.6 THEOREM If a £ A for a commutative Banach algebra A, then *a(*)= fl 'bW (11.5.6.1) ACB If jK" C A is a finite subset disjoint from the topological zero divisors, then there is a commutative Banach algebra B for which ACB and K C B'1 (11.5.6.2) Proof: Suppose 0 £ ^(a), so that there is A: > 0 for which ||ac|| > k\\c\\ for each c £ A (11.5.6.3) If t > 0 we shall write D{t) = \c0 + Yl cJzJ': cj e A> Hcoll + L Hc; II*"' < °° \ (H-5.6.4) normed as indicated, with the multiplication one would expect of "power series with coefficients in A," and then put B(t) = D{t)/J{t) where J{t) = clD(t)(l - az)J(t) (11.5.6.5) quotienting out the closed ideal generated by 1 — az. Evidently the mapping T : b —> b + J(t) from A to B(t) (11.5.6.6) is a homomorphism, and of course B{t) is a commutative Banach algebra. Also, the image T{a) = a + J{t) has an inverse in B{t), given by the coset z + J{t). We claim that for sufficiently large t the embedding T of A in B{t) is isometric. To see this observe that if b £ A and c0 + Sylj CjZ3 £ £)(£) then b- (l-az)(c0 + J^c3-A v y=i y = n + (** -1) (11.5.6.7)
11.6 Composition Operators 481 This is because b-{l-az)lc0 + Y, C3zJ 1 = b ~ co + D(acj-i " ci)zJ and hence \b — (1 — az )(co + X/y; v y=i ll*-*oll + X^K-i-'yll*y y=i oo > (11*11 - Ikoll) + £(%y-ill - Ml)* = 11*11 + £(*' " ^V y=i y=i We have proved (11.5.6.2) for the set K = {a} consisting of a single element: just ensure that kt > 1 and take B = B(t). For (11.5.6.1) apply what we have just proved to elements a — s for arbitrary sGC, and for (11.5.6.2) take a to be the product of all the elements in K. m The reader should compare this characterization of the topological zero divisors as "permanently noninvertible" with the characterization of generalized exponentials obtained in (7.11.5.8). 11.6 COMPOSITION OPERATORS It is instructive to compare the spectra of a system a £ An of normed algebra elements with the corresponding spectra of the systems La £ Dn and Ra £ Dn of left and right multiplications in the algebra D = BL(A, A), in particular when A is itself an algebra of operators: 11.6.1 THEOREM If A is a normed algebra and a £ An then, with D = BL{A,A), and Also and **?(*) = ^(La) and ^ght(a) = *™{Ra) ?!f M = W.) and ?^\a) = f™(Ra) aF(a)=f**\Ra)=*«*t(Ra) (11.6.1.1) (11.6.1.2) (11.6.1.3)
482 11. Multiparameter Spectral Theory and <#"(«) = r^\Ra) and a^\a) = r^(La) (11.6.1.4) Proof: This is just a restatement of Definition 11.1.2 and Definition 11.1.1: in (11.6.1.3) we are also using the second part of the remark immediately following Definition 11.1.1. ■ If we had extended Definition 11.1.2 to establish notation for the case in which the operators row(La) and row(jRa) were onto, as distinct from open, then we would have been able to similarly augment (11.6.1.4). We can also say something about systems (La, Rb) £ Dn+m associated with (a, 6) £ An+m, or more generally about systems (La,Rb) associated with (a,6) £ An x Bm in the situation discussed immediately following Theorem 3.10.4. If A and B are normed algebras then we shall call the normed space M a normed (left A,right B)-module if there are mappings L:A—>BL(M,M) and R:B —> BL(M,M) (11.6.1.5) linear and of bound < 1, which satisfy, for arbitrary a, a' £ A and 6, bf £ B, La'a = LafLa, Rb,b = RbRbi, L± = I = R±, and (11.6.1.6) LaRb = RbLa For example, this is the case if X and Y are normed spaces and A = BL{X,X) B = BL{Y,Y) and M = BL{Y,X) (11.6.1.7) As the reader will recall from (3.10.4.10) the general case can be interpreted by regarding A, B, and M as subspaces of BL(X, X), BL(y, Y), and BL(y, X). The other special case is of course A = B = M is a normed algebra (11.6.1.8) Once again (3.10.4.10) suggests that this is actually quite general. 11.6.2 THEOREM If A and B are normed algebras and M a normed (left A,right £)-module, and if a £ An and b £ Bm are arbitrary, then, with £> = BL(M,M), S™{La,Rb)ce™{a)xeW[b) and (11.6.2.1) af(Lfl)^)Caf(a)xaf(&)
11.6 Composition Operators 483 and <^(i.^)Cfff(a)xa3'ht(i) and (11.6.2.2) Proof: This is essentially Theorem 11.1.4 and Theorem 11.2.2. If, for example, (5, t) <E Cn+m is not in a^ft (a) x a£ght (6) then either £y=i aj (ay - 5y) = 1 G A for some a' £ An or ^2™=1{bk — tk)b'k = 1 E 5 for some 6' 6 Bm, giving E;=1 W«, - *y J) + Er=i f*(*fc - '*') =^P with either (17, V) = {La,,0) or (17, V) = (0,jR6/), which means that {s,t) is not in 0pft(La,jR6). This proves the first part of (11.6.2.2), and all the rest is left to the reader. ■ Specializing again to the case (11.6.1.8) and combining (11.6.1.3) with (11.6.2.1), and taking either b = 0, or a = 0 after first interchanging a and 6, gives and (11.6.2.3) and and (11.6.2.4) If instead we specialize to the case (11.6.1.7) then we can prove more: 11.6.3 THEOREM If X and Y are normed spaces and A = BL(X,X), B = BL(y,y), M = BL(y,X), and N = BL{X,Y) then for arbitrary a e An and b <E Bn we have, with D = BL(Af,Af) and E = BL(iV,iV), tJFW x *?"(*) Q *D{t(La,Rb) C ir?ht(iiolL6) (11.6.3.1) and f!2«(a) x frjght(6) C fj5»(L0,J?6) C f^ht(i?0,L6) (11.6.3.2) Proof: This is Theorem 5.6.4: if 5 G Cn and t £ Cm are arbitrary, set S = a — s and T = b — t and then use (5.6.4.1) to argue that row(jR5,Lr)
484 11. Multiparameter Spectral Theory- dense => co\{Ls,RT) one-one and that col(L5,jRr) one-one => (S one-one or T dense), giving (11.6.3.1). The same argument, using (5.6.4.2), gives (11.6.3.2). ■ Interchanging a and b gives of course ^ght(a) x ^eft(6) C *%\Ra,Lh) C *i&\La,Rh) (11.6.3.3) and f^ht(a) x ff\b) C rlf(Ra,Lb) C f^ht(La,Rb) (11.6.3.4) Theorem 11.6.3 extends to more general algebras and modules A, B, M, and N as in (3.10.4.10): for (11.6.3.1) it is sufficient that there is implication, for arbitrary u and t; in M, uwv = 0 for each w e N =^ u = 0 e M oi v = 0 e M (11.6.3.5) while for (11.6.3.2) it is sufficient that there are k > 0 and h > 0 for which fc||tt||||tyt;|| + /i||twy||||t;|| < ||myt;|| for each u,v G M and w £ N (11.6.3.6) For example, suppose that (11.6.3.5) holds and that a £ A and b £ B are such that the operator row(jRa,L6) : iV2 —> N is dense: then for arbitrary w G N there are (aJJ and (6JJ in iV for which ||ty — o!na — bb'n|| —> 0, and hence if u £ M is arbitrary ||uwu — ua'nau — ubb'nu||—► 0 as n—► oo (11.6.3.7) From (11.6.3.7) it follows that au = 0 = ub =^ uwu = 0 (11.6.3.8) This together with (11.6.3.5) says that u = 0, and hence that the operator col(La, Rb) : M —> M2 is one-one. This is the second inclusion of (11.6.3.1), or at least a special case; for the first inclusion argue that if co\(La,Rb) is one-one then for arbitrary u, v in M and w £ N we have au = vb = 0 =r> a(uiut;) = (uiut;)6 = 0 ==> uiuv = 0 (11.6.3.9) This together with (11.6.3.5) says that either u = 0 or t; = 0, and hence that either La or Rb is one-one. It is clear that there is implication (11.6.3.6) => (11.6.3.5). If A = BL(X,X) and B = BL(Y,Y), then it is sufficient for (11.6.3.6) that N
11.6 Composition Operators 485 contain all the operators of finite rank: KL0(X,y) C JV (11.6.3.10) The reader can verify this by inspecting the proof of Theorem 5.6.4. If, for example, N is equal to the finite rank operators, or their closure in the norm of BL(X, Y), then the compatibility conditions of (3.10.4.10) will be satisfied if either M = BL(Y,X) or M = KL0(y,X) or its closure. Theorem 11.6.3 and the special case of Theorem 11.6.2 have an essential analog: 11.6.4 THEOREM If X and Y are normed spaces and S £ BL(X,X)n, T £ BL(y, Y)m are arbitrary systems of operators, then ^(Ls,RT)^st\Rs,LT) C (aJ2(5)xari«ht(T))u(aM(5)xa2ht(r) (11.6.4.1) (ir™(S)xirri«ht^ (11.6.4.2) (*e+ss(*) x fri«ht(T)) U (f™(S) x a-s(T)) C a+.(L5,iir) n *+.(**, Lr) (11.6.4.3) Proof: If (s,t) £ Cn+m is not in the right-hand side of (11.6.4.1) then there are three cases to consider: either S — si is almost left invertible, or T — tl is almost right invertible, or both S — si is almost essentially left invertible and T — tl is almost essentially right invertible. In the first case Ls — si is almost left invertible and Rs — si almost right invertible, while in the second case RT — tl is almost left invertible and LT — tl almost right invertible. Thus, in each of the first two cases (Ls — si, RT — tl) is almost left and [Rs — si, LT — tl) almost right invertible. For the third case we apply the "almost" analog of Theorem 6.13.5, and note that the analog of the condition (6.13.5.1) is satisfied with col(5 — si) : X —> Xn and col(T - tl) : Y -► Ym in place of S : W -► Z and T : X -► Y: then the analog of the condition (6.13.5.2) says that (Ls — sI,RT — tl) is almost essentially left invertible and (Rs — si, LT — tl) almost essentially right invertible. In all three cases therefore (s,t) £ cn+m is excluded from the left-hand side of (11.6.4.1). For (11.6.4.2) and (11.6.4.3) we go to Theorem 6.13.6: if (*,*) £ Cn+m is not in the right-hand side of (11.6.4.2), or of (11.6.4.3), then condition (6.13.6.1), or condition (6.13.6.3), is satisfied with S — si and T — tl in place of S and T, so that we deduce that also condition (6.13.6.2), or condition (6.13.6.4), is satisfied, which excludes {s,t) £ Cn+m from the left-hand side of (11.6.4.2), or of (11.6.4.3). ■ The obstacle, in incomplete spaces, to a corresponding result for left
486 11. Multiparameter Spectral Theory and right essential spectra, as distinct from "almost" left and right essential spectra, is that we ought to work with actually compact operators rather than totally bounded operators in that context: unfortunately, we have only established (6.13.3.2) for totally bounded operators. Of course when we restrict attention to complete spaces these distinctions can be ignored. For Hilbert spaces, and for Banach spaces and single operators, Theorem 11.6.2 and Theorem 11.6.3 combine effectively, as do both parts of Theorem 11.6.4: 11.6.5 THEOREM Suppose X and Y are Banach spaces and S £ BL(X,X)n, T e BL(y,y)m: then if X and Y are Hilbert spaces there is equality aleit{Ls,RT) = aright(jR5,Lr) = aleft(5) x aright(T) (11.6.5.1) aright(L5,jRr) = cleit{Rs,LT) = aright(S) x aleft(T) (11.6.5.2) = (<£?(*) x *right(T)) U (aleft(5) x aHf *(T)) aesfht(^5^r) = aleJsiRSiLT) (11 6 5 4l = Wjf'P) x ^CH) U (aright(S) x aJ^(T)) If instead n = m = 1, so that S = S1e BL(X,X) and T = Txe BL(Y,Y) are single operators, then there is inclusion d(a(S) x a(T)) C <j(Ls,RT) C <j(S) x a{T) (11.6.5.5) d{a{S) x a{T)) C a{Rs,LT) C a{S) x a{T) (11.6.5.6) d((oe33(S)xo(T))u{o{S)xoe33{T))) C ce33(Ls,RT) C (aess(5) x a(T)) U (a(S) x aess(T)) d((<Je33(S)xv(T))u(<j(S)x<je33(T))) C ce33(Rs,LT) C (aess(5) x o(T)) U (a(5) x aess(T)) (11.6.5.7) (11.6.5.8) Proo/: Towards (11.6.5.1), Theorem 8.4.4 applied with col(5 - si) and with col(T — tl) in place of T gives fleft(5) x fright(T) = aleft(S) x <7right(T) (11.6.5.9) which combines with (11.6.2.2) and (11.6.3.2) to give (11.6.5.1), and now (11.6.5.2) follows from (11.6.5.1) if we interchange S and T. Towards (11.6.5.3), Theorem 8.7.3 gives *e+ss(S) x a-s(T) = aJ£(S) x a*f*(T) (11.6.5.10)
11.6 Composition Operators 487 which combines with (11.6.4.1) and (11.6.4.3) to give (11.6.5.3), and now (11.6.5.4) follows by interchange of S and T. Towards (11.6.5.5) begin by noting d(K xH) = {(dK) xH)U{K x (dH)) (11.6.5.11) in particular, with K = a{S) and H = c{T), and then recall Theorem 9.3.3, and (9.3.3.11), which say da{S) C fleft(5) H fright(S) and da{T) C rle{t{T) n fright(T) (11.6.5.12) and G(S) = fleft(5) U fright(S) and c{T) = rleit{T) U frisht(T) (11.6.5.13) The first part of (11.6.5.5) now follows: using (11.6.5.11) and (11.6.3.2) d{a{S) x a(T)) C (fleft(5) x fr'lght(T)) U (fright(5) x fleft(T)) C fleft(L5^r) Ufright(L5,.Rr) C c(Ls,RT) The first part of (11.6.5.6) follows by interchange of S and T, and in each case the second part is clear. The argument for (11.6.5.7) and (11.6.5.8) follows the same pattern, and will mostly be left to the reader: we supplement (11.6.5.11) with (7.10.3.6), which says that the boundary of the union is contained in the union of the boundaries, and we replace (11.6.5.12) and (11.6.5.13) with, using Theorem 9.8.7, ^ess(S) C ae+ss(S) H a-3(S) and aess(T) C ae+ss(T) n ^(T) (11.6.5.14) and *ess(S) = *e+ss(S) U ^(5) and <7ess(T) = ae+ss(T) U c~3(T) m (11.6.5.15) One byproduct of the proof of Theorem 11.6.5 says that if X and Y are Hilbert spaces and S £ BL(X,X)n, T G BL(y,y)m then aleft {Ls, jRr) = rleft [Ls, jRt) and aright (Ls, jRr) = fright (Ls, jRr) (11.6.5.16) and °™(I<sM = °L(Ls>Rt) and ***ht(Ls,RT) = g-„[LS9Rt) (11.6.5.17) Theorem 11.6.5 also gives us some more spectral mapping theorems: 11.6.6 THEOREM If S e BL(X,X)n and T £ BL(y,y)m for Hilbert
488 11. Multiparameter Spectral Theory- spaces X, Y and g £ Poly£+m then g(oie{t(S),cr>*ht(T)) C aie{tg(Ls,RT) and (11.6.6.1) ffK^(5),aleft(T)) C <Tri^(Ls,i?r) g(a^(S),a"^(T)) U ^""(S^a^T)) C aJ^(Ls,*r) (H-6.6.2) and <,(<f'(^.^"(T)) U ?(arieht(S),<£"(r)) C ae^htff(Ls, J?r) (11.6.6.3) If in particular S £ BL(X, X)n is commutative, then aIeft<7(LSli2r)=ari^<7(aleft(5),r) and (11.6.6.4) a^tyLs.lEj.) = alett<7(ari^(5),r) <£?*(£*.%) = ^r',ght<7(^(5),r) U<£f'^(S)^) (11.6.6.5) and aeriihtff(Ls,i?r) = aIeftff(o2f *(5),T) U a^g(a"^(S),T) (11.6.6.6) If instead T £ BL(y,y)m is commutative, then cMtg(Ls,RT)=aUttg(S,ar'^(T)) and (11.6.6.7) ari^ff(Ls, RT) = a^g^o^T)) a^g(Ls,RT) = ^(S.a^CT)) U o^g(Sto^*(T)) (11.6.6.8) and <£f*g(Ls,RT) = a^g{S,alett{T)) U ^htff(S,a^(T)) (11.6.6.9) If both S and T are commuting systems of operators, then cMtg(Ls,RT) = g(aM\S),c"^{T)) and (11.6.6.10) a^htg(Ls,RT) = g(a"^(S),aleii(T)) c£MLs,Rt) = g(^{S),a^(T))Ug(cleti(S),c^(T)) (11.6.6.11)
11.6 Composition Operators 489 and <7l^g(Ls,RT) = <7(aeriiht(5),aleft(r))U<7(a^ht(5),^t(T)) (11.6.6.12) Proof: This is mostly what we would have expected from the combination of Theorem 11.6.5 with the spectral mapping theorems (11.3.4.7), (11.3.4.8), and Theorem 11.3.5: in particular we are using the notation (11.3.4.9). For (11.6.6.1) we use (11.3.4.7) together with (11.6.5.1) and (11.6.5.2), and for (11.6.6.10) we use (11.3.4.8) together with (11.6.5.1) and (11.6.5.2). For (11.6.6.2) and (11.6.6.3), and for (11.6.6.11) and (11.6.6.12), we do the same, substituting (11.6.5.3) and (11.6.5.4) for (11.6.5.1) and (11.6.5.2). Towards (11.6.6.4) we use Theorem 11.3.5 to see cleitg(Ls,RT) = |J al^=3l9{sI,RT) = (J o^R^t) se<rle{t(S) 5Go-left(5) (11.6.6.13) and c"^g(Ls,RT)= (J ot?=si9{sI,RT) right (H.6.6.14) ~ U aLs=sIRg(sI,T) 5Go-right(5) and then use (11.6.5.1) and (11.6.5.2), together with (11.6.1.3), to see that s e aleft(S) =► c^=sIRg{sIiT) = vle{tRg{3l,T) = ari«ht*(a/,T) (11.6.6.15) and (11.6.6.16) This proves (11.6.6.4), and the argument for (11.6.6.7) is exactly the same, The arguments for the corresponding essential results follow the same pattern, and are left to the reader. ■ To exploit the other half of Theorem 11.6.5 we need an auxiliary result about polynomials. 11.6.7 THEOREM If K C Cn and H C C are compact and g <E Polyn+1, then g(K xH)= g{d(K x H)) (11.6.7.1)
490 11. Multiparameter Spectral Theory Proof: We have g(d{K x H)) = g{dK x H) U g{K x dH) (11.6.7.2) and g(K xH) = g{K x dH) U g(K x int(JET)) (11.6.7.3) We claim g{K x int(tf)) C g{dK x int(JET)) (11.6.7.4) To see this suppose r £ C is arbitrary and define iVr : if —> C by setting JVr(s) =#{te mt(H):g{s,t) - r = 0} for each sGX (11.6.7.5) the number of zeros of the polynomial ga — r = g(s, •) — r in the bounded open set H C C. Obviously, Nr(s) £ N U {0} is always an integer; it is also a continuous function of 5, in view of its integral representation N^ = hjWr~r *»<**•€* (H.6.7.6) Thus if iVr vanishes on dK it must vanish on K, and hence if r £ C is not in the right-hand side of (11.6.7.4) it must also be excluded from the left. ■ Using Theorem 11.6.7, the other half of Theorem 11.6.5 gives spectral mapping theorems for single operators on Banach spaces: 11.6.8 THEOREM If X and Y are Banach spaces, if 5 £ BL(y, Y) and T £ BL(X,X) are single operators, and if g £ Poly2 is a polynomial in two variables, then <jg{Ls,RT) = g{c{S),c{T)) (11.6.8.1) and *e3a9(Ls,RT) = g(oess{S),o(T))U9(*(S),c7ea3(T)) (11.6.8.2) Proof: By (11.6.5.5) there is inclusion gd(c(S) x g(T)) C gc(Ls,RT) C g{c(S) x c(T)) (11.6.8.3) and (11.6.7.1) means that there is equality throughout. By (11.3.4.8) this applies to the spectrum of g(Ls,RT), giving (11.6.8.1). The argument for (11.6.8.2) is exactly the same, using (11.6.5.7) in place of (11.6.5.5). ■ As a simple application of Theorem 11.6.8, look at sums and products:
11.6 Composition Operators 491 11.6.9 THEOREM If X and Y are nonzero Banach spaces and S e BL(X,X), T e BL(Y, Y), then Ls - RT invertible <=> o{S) n o{T) = 0 (11.6.9.1) and LSRT invertible <£=> 0 £ <j(S) U o{T) (11.6.9.2) Also Ls - RT Fredholm ^=> a{S) n aess(T) = aess(5) n c{T) = 0 (11.6.9.3) and LSRT Fredholm <=► 0 £ a(5)aess(T) U aess(5)a(T) (11.6.9.4) Hence, if neither X nor Y is finite dimensional, then LSRT Fredholm <==> LSRT invertible (11.6.9.5) Proof: If, more generally, M is a normed (left A, right B)-module and a e A, b e B, then with D = BL(M,M) aD(La - Rb) C aD(Lj - aD{Rh) C aA(a) - aB(6) (11.6.9.6) and aD(Ltti?6) C BD(La)5D(Rh) C <?A(a)aB(6) (11.6.9.7) so that aA(a) fl aB(6) = 0 =>• La - jR6 almost invertible (11.6.9.8) and 0 £ oA{a) U <7B(6) =>► LaRb almost invertible (11.6.9.9) When A = BL(X,X), B = BL(Y,Y), and M = BL(Y,X) and X, Y are complete then all the inclusions of (11.6.9.6) and (11.6.9.7) become equality, according to (11.6.8.1) from Theorem 11.6.8. This proves (11.6.9.1) and (11.6.9.2). For (11.6.9.3) and (11.6.9.4) we use instead (11.6.8.2). Finally, if neither 0e33(S) nor ^essl^O *s emPty, the right-hand side of (11.6.9.4) takes the form 0 £ a(5) U aesa(T) U aess(S) U o(T) = o(S) U o{T) u
492 11. Multiparameter Spectral Theory The implications (11.6.9.6) and (11.6.9.7) hold for left and right spectra separately: in particular a^ft(a) H a£ght(6) = 0 =► La - Rb one-one and bright. *a (a) n 5B V>) = ®=^La-Rb onto (11.6.9.10) (11.6.9.11) When A = B = M these two conditions have an unexpected consequence: we shall write D=L-Rn for each a £ A (11.6.9.12) 11.6.10 THEOREM If A is a normed algebra and a, b <E A satisfy rlZ{t{a) H f*ght(6) = 0 (11.6.10.1) then (a, 6) and (a + 6, a6) have the same commutator, in the sense that D^{0) n JV(o) = D~lb(o) n p^(o) (11.6.10.2) Proof: With no restriction on a and 6 there is inclusion D^io) n 1^(0) c P-^(O) n ^(o) (11.6.10.3) One way to see this is to observe r t t 1 r n i (11.6.10.4) (11.6.10.5) [Dab\ = ' I I ' Rb K. .Db. Formally taking the "adjugate" or "cofactor" matrix gives L„-Rh 0 L„-R b J -R, ^a+b Dab This gives implication La - Rb one-one =► D~lb(0) n D^(0) C D~X(0) n P^ (°) (11.6.10.6) Finally, the argument for (11.6.9.6), together with (11.6.1.2), gives fleft(L0 - Rb) C fIrft(L„) - T™\Rb) = f ™(a) - fr/ht(6) (11.6.10.7) Thus the condition (11.6.10.1) ensures that La—Rb is bounded below, hence one-one. ■
11.7 Tensor Products 493 11.7 TENSOR PRODUCTS Just as the cartesian product, or direct sum, of Theorem 1.6.1 enables certain kinds of two-variable functions to be represented as linear operators, so the "tensor product" enables "bilinear" mappings, such as the algebra multiplication (1.10.0.1) or the inner product of Definition 8.1.1, to be represented as linear mappings: 11.7.1 DEFINITION If X and Y are linear spaces over the field K then their tensor product is the linear space X®Y = X®KY = c00{X xY/ (ST1 (0)) (11.7.1.1) where (S)_1(0) is the linear subspace of c00(X <g> Y) generated by the set U {{**+*',* ~ S*,y ~ 6x',y>6x,y+y> ~ Sx,y ~ 6x,y" ? l 6sx,y ~ SSx,y>tx,sy ~ s6xiy}'^^ eX,y,y' EY^sE K} If X and Y are normed spaces then a crossnorm on X <8> Y is a norm for which ||rc <8> 2/|| = ||x||||y|| for each x <E X,y <E Y (11.7.1.3) A uniform crossnorm on X <8> Y is a crossnorm for which j=i £/(*yMyy) <ll/lllkll E*i®yy J=l (11.7.1.4) for each / G X\g <E Y\ ^iy®yyGX®y We shall call the normed space Z a tensor product of X and Y if it has a dense subspace isomorphic to the tensor product X <8> Y with a crossnorm, and a uniform tensor product if the crossnorm is uniform. Here c00{X xY) = c00(X x Y, K) is the space of finite or terminating functions c00(H) of (1.9.0.4), where H is X x Y with the bornology of finite subsets, and Sxy is the Kronecker delta of the point (x,y) £ X x Y. We are writing E xi ® Vi = I £ *»y.w + ®_1(°) e coo(^ x n/ ®_1 (0) (H.7.1.5) for the coset of a typical element of c00{X x Y). Later on, in a further
494 11. Multiparameter Spectral Theory abuse of notation, we may write X <S> Y to represent "a tensor product of X and Y" in the sense of the spaces Z at the end of the definition: in particular if X and Y are both complete then we will be able to take Z = X <S> Y also complete. If X = A and Y = B are linear algebras then the tensor product X <8> Y = A <8> B becomes an algebra if we define (a <g> b)(af <g> 6') = (aaf) <g> (66') (11.7.1.6) and extend to more general tensors by distributivity. We shall see below that this is a "good definition." If A and B are normed algebras we will usually look only at crossnorms on the product A <8> B that are compatible with this multiplication. If instead X is a right module and Y a left module over the linear algebra A then we can form the further quotient X(g)AY = (X(E)Y)/(g)^1(0) where ^a1!0) = \J2xjaj ®yj-xj®a>jyj:neN,xexn,ye Yn,ae An \ (11.7.1.8) Alternatively we may define ^^(O) as a subset of c00(X x Y) obtained by adding to the set (g)-1^) of (11.7.1.2) all the elements Sxay - Sxay. Naturally we write !>;®Ayy= ll>i®»i \+®A1(0)e{X®Y)/®^1(0) (11.7.1.9) It can be quite difficult to decide exactly when an element of X <8> Y is zero: 11.7.2 THEOREM If X and Y are linear spaces and 0 ^ YJjLi x'j ® v'j € X <8> Y then there are n £ N and linearly independent x £ Xn,2/ £ Yn for which m n ]T xj ® 2/y = Y, xj ®Vj£X®Y (11.7.2.1) y=i j=i If subspaces if C L(X, K) and jK" C L(Y, K) separate points of X and Y, respectively, then for arbitrary n £ N, x £ Xn, and y £ Yn, the following (11.7.1.7)
11.7 Tensor Products 495 are equivalent: n ]T x. (g) y. = o £ X <g> Y (11.7.2.2) y=l n Y, f(xj)yj = °£Y for each / € J5T (11.7.2.3) n ]T y (yy)xy = 0GI for each (/GX (11.7.2.4) j = i Proof: If xj = X)y^t 5yxy *s a linear combination of the remaining x'j then m E xy ® *>y = E xy ® fry + 8M) (11.7.2.5) y=i y** and if at the same time the sequence {y'j)JL1 is linearly independent then so is the sequence [y'j + Sy2/()y^i- Whether or not the sequence (y'J)JL1 is linearly independent we can repeat the same process with the sequence (xf) ^t-, arguing that if it is still linearly dependent then the tensor YJjLi x'j ® y'j can be written without the help of either x\ or x\ for some other index i'. Since not all the x'j can be 0 (remember YlT=i xj ® Vj ^ 0) this process will eventually yield a linearly independent subsequence x" of (xf)Jl_1. If the corresponding sequence y" which has evolved is also linearly independent then we have established (11.7.2.1); if not we now repeat the process with the sequence {yn), noting that we can never lose the linear independence of the sequence (x") (we made this comment immediately following (11.7.2.5)). This finishes the proof of (11.7.2.1). Toward the equivalence of the next three conditions observe that (11.7.2.2) can be rephrased XX^.e®-1^) y=i which makes the left-hand sides of (11.7.2.3) and (11.7.2.4) vanish for arbitrary / £ L(X, K) and g £ L(y, K). In turn, this observation guarantees that the left-hand sides of (11.7.2.3) and (11.7.2.4) are well defined as functions of the tensor Y%Li xj ®Vj €X®Y. We have proved that (11.7.2.2) is sufficient for each of the other two conditions; conversely if (11.7.2.2) does not hold then we are entitled by (11.7.2.1) to assume that x £ Xn and y £ Yn are linearly independent, and by what we have just noted above
496 11. Multiparameter Spectral Theory to use this x £ Xn and y £ Yn in the left-hand sides of (11.7.2.3) and (11.7.2.4). In particular, xx^0^yi and hence by the separating assumptions (6.1.2.10) there are / £ H and g £ K for which /K)^o^ff(yi) But now the linear independence of y £ Yn means that for this / the left-hand side of (11.7.2.3) does not vanish, and the linear independence of x £ Xn means that for this g the left-hand side of (11.7.2.4) does not vanish. ■ When X and Y are normed spaces then Theorem 5.4.1 tells us that we can take H = X^ and K = Y^: more generally, we shall usually always work with H C X* and K C yt. If X and y are normed spaces then an example of a crossnorm on X <8> y is the quotient norm on c00(X x Y)/ <8>_1 (0) induced by the norm of ^(IxF): 5>y ® J = inf J £ ||*;-||||^||:£*y ® 2/y = £ xj- ® ^ V y=i Hi [y=i j=i y=i J (11.7.2.6) The reader should verify that this is a crossnorm, a uniform crossnorm, and that it lives up to its title, the greatest crossnorm for X<8> Y. At the other extreme, the least uniform crossnorm is given by Ex;®2/; j=i = sup £/(*yMy,) y=i <i,IWI<i (11.7.2.7) The reader should verify that this also lives up to its name, and also that its value is unchanged if the supremum is restricted to functional / £ H C X^ and g £ K C yt belonging to subspaces H and K which are "norm- determining" in an obvious sense. Various specific spaces can be recognized as tensor products in the sense of the last part of Definition 11.7.1: 11.7.3 THEOREM If X and Y are normed spaces then the finite rank operators KL0(X,y) C BL(X,y) is a tensor product of X and Y (11.7.3.1)
11.7 Tensor Products 497 with y=i E/y®yy y=i and the linear functional 7f®IC BL(X, y)f is a tensor product of Yf and X with X>y®*y y=i X>y®*y (11.7.3.2) (11.7.3.3) (11.7.3.4) If also W and Z are normed spaces then the elementary operators (11.7.3.5) °£bl(x,y) CBL(BL(^,Z),BL(X,y)) is a tensor product of BL(P7, Z) and BL(X,y) (11.7.3.6) and, provided W ® X has a uniform crossnorm, BL(W, Z) <g> BL(X, y) C BL(W <g> X, Z <g> y) is a tensor product of BL(VF, Z) and BL(X, y) If H is a nonempty set then for each u £ {c00,c0,c1,c,/1,/2}J o;(n,X) is a uniform tensor product of oj{Q) and X (11.7.3.7) and m(H,X) is a uniform tensor product of ^(H) and X (11.7.3.8) If H is a topological space then C^HjX) n m(H,X) is a uniform tensor product of C^Q) and X (11.7.3.9) Proof: In each case it is obvious that there is an embedding of the tensor product, as a set, in the space concerned: to see that it is one-one we apply Theorem 11.7.3 with suitably chosen subspaces H and K, and it can then be left to the reader to see that the induced norm is a crossnorm, and that the image of the tensor product is a dense subspace. ■ The norm that best reflects the character of the tensor product is the greatest crossnorm of (11.7.2.6):
498 11. Multiparameter Spectral Theory 11.7.4 THEOREM If X, Y, and Z are normed spaces then there is isomorphism x 0! (y ®x z) = [x ®x y) ®x z (11.7.4.1) and x <£>! (y 0! z) = (x <£>! y) 0! (x ® x z) (11.7.4.2) and BL(X ® x y, Z) = BL(X, BL(y, Z)) = BL(y, BL(X, Z) (11.7.4.3) Proof: We may safely leave the first two parts of this to the reader, and recall that the second part of (11.7.4.3) was given by Theorem 2.9.3. Toward the first part of (11.7.4.3) we have a well-defined correspondence 0 <—► $ : BL(X <£>! y, Z) = BL(X,BL(y, Z)) given by the formulas (11.7.4.4) 0 HC xj® yj) = ^2 *(xy)yy for each X] xy ® ty G x ® Y \j=i J y=i y=i (11.7.4.5) and $(x)(y) = 0(x<g> y) for each x G X,y G y (11.7.4.6) If $ is in the right-hand side of (11.7.4.4) and 0 is given by (11.7.4.5) then ehr^.ay,. U=1 < 11*11 X>ylll|yyl j=i and hence, considering all possible synonyms for Yll=i xj ®Vj € X ®Y, (11.7.4.7) ©lX>y®yy < 11*11 I>y®yy y=i This proves that ||0|| < ||$||. If instead 0 is in the left-hand side of (11.7.4.4) and $ is given by (11.7.4.6) then llft^t^Hlieiillxsyil^neiiHiiyii and hence, considering all vectors y £ Y of norm < 1, ||$(*)||<||0||||*|| (11.7.4.8) This proves ||$|| < ||0||. ■
11.7 Tensor Products 499 As a special case of (11.7.4.3) we have isometric isomorphism [X <£>! y)f = BL(X, Yf) £* BL(y, Xf) (11.7.4.9) Thus, using Theorem 5.9.1, we have the option of realizing the tensor product X <8> Y as either a subspace of the dual of BL(X,yt) or a subspace of the dual of BL(y, X*). In this sense the linear functional f <2>y of (2.9.2.6) are closer in spirit to our "tensors" than the finite rank operators / © y of (2.9.2.5). The first two parts of Theorem 11.7.4 remain valid if the greatest crossnorm is replaced by a general crossnorm: if on either side of (11.7.4.1) or (11.7.4.2) we change the greatest crossnorm or the direct sum norm then it becomes an interesting calculation to see what norms are induced on the other side. If we try to replace the greatest crossnorm in the left-hand side of (11.7.4.3) then we must replace the spaces of operators on the other two sides by certain subspaces. For example, if we use the least uniform crossnorm and then specialize to Z = K we will get a tensor product of X and Y on the right, consisting of those operators from X to Y^ which can be approximated in "greatest crossnorm" by finite rank operators. Another way to reduce the right-hand sides of (11.7.4.3) is to replace the tensor product X ®>x Y by its quotient X <8>A Y, if X is a normed right and Y is a normed left module over an algebra A. Either the same argument as in the proof of (11.7.4.3), or an immediate deduction from the conclusion (11.7.4.3), gives BL{X ®AY,Z) ~BLA{X,BL{Y,Z)) = BLA{Y,BL{X,Z)) (11.7.4.10) Here we are writing, if $ <E BL{X,BL{Y,Z)) and # <E BL{Y,BL{X,Z)), * e Bl/(X,BL(y,Z)) «=► «(xa)(y) = *(*)(ay) (n.7.4.11) for each x E X,y £Y,a £ A and $ 6 BLA(Y,BL(X,Z)) ^ *(ay)(x) = *(y)(™) (n ? 4 n) for each x E X,y EY,a E A and of course using the greatest crossnorm of X <8> Y to define the norm of X <8>A Y. If we remark that X®AAS*X and A <g>A Y = Y (11.7.4.13) then we notice (generalizing Theorem 5.1.2!) the special cases BLA(y,Af) = yf and BLA(X,Af) ^ Xf (11.7.4.14)
500 11. Multiparameter Spectral Theory In particular, BLA{A\Af) = Aft and BLA(Af, Af) s Aft (11.7.4.15) The reader should notice carefully that (11.7.4.15) represents the second dual A^ of a normed algebra as a normed algebra, but in two different ways, depending on whether the first dual is regarded as a left module or as a right module over the algebra. For tensor products of normed algebras, we have analogs of Theorem 11.6.3 and Theorem 11.6.2: 11.7.5 THEOREM If A and B are normed algebras and A®B is a tensor product of A and B then for arbitrary a £ An and b £ Bm there is inclusion *Ah(°) x *l§h(b) C x™B(a ® 1,1 ® 6) ****(*) x n^(b) C ^g* (a ® 1,1 ® 6) (1L7-5'1) flf(a) x f^'(6) C f$B(a ® 1,1 ® 6) ^ght(«) x f^ht(6) C f&h* (a ® 1,1 ® 6) (11'7-5-2) *!&»(« ® 1,1 ® 6) C <#"((») x a£ft(6) <tf |b(« ® 1,1 ® 6) C a7ht(a) x a£ght(6) (U-7-5-3) If, in particular A <g> 5 is uniform then there is equality *aSb(* 3 1,1 3 6) = ^eft(a) x a^ft(6) *&"(* ® 1,1 ® 6) = *?ht(a) x a£ght(&) (11-7-5>4) Proof: If 5 £ *rjfft(a) and t £ 7r^ft(6) then there are 0 ^ u £ A and 0 7^ t; £ B for which (a;- — Sj)u = 0 = {bk — tk)v for each j and A:, giving ((ay - sy) <g> l)(u <g> v) = (1 <g> (6fc - tk))(u ®v)=0®0^u®v (11.7.5.5) for each j and A:. This proves the first part of (11.7.5.1), and the second is exactly the same (formally, derive it from the first by "reversal of products"). The argument for (11.7.5.2) is almost identical: if 5 £ fj|ft(a) and t £ f!Jft(6) then there are sequences {ur) and (vr) in A and B for which IM = 1 = \K\\ and ||(ay - 5j>r|| + \\{bk - tk)vr\\ - 0 for each j and k: but now, using the crossnorm property for A <8> B, and (11.7.5.6) || ((aj - 5y) ® 1) (ur ® t,r)|| + ||(1® {bk -tk)){ur®vr) II—0
11.7 Tensor Products 501 This proves the first part of (11.7.5.2), and similarly the second. Toward (11.7.5.3) suppose that either £^=1aJ(ay-sy) = 1 £ A or ^=1b'k{bk-tk) = leB: then n m ]T cy((ay - Sj) (g) 1) + ]T ^(1 (g) (6fc - **)) = l<g>l£A<g>5 (11.7.5.7) j = l fc=l with either (c,d) = (a' (g) 1,0 (g) 0) or (c,d) = (0®0,1® 6'). This proves the first part of (11.7.5.3), and hence also the second. Exactly similar argument shows that the analogue of (11.7.5.3) holds for the almost spectrum: whether or not the crossnorm is uniform there is inclusion one way in (11.7.5.4). For equality in the uniform case suppose that (s,£) £ aj|ft(a) xa£ght(6),sothat n m l£M = c\y^A{aj-sj) and 1 £ N = cl ]T B{bk-tk) (11.7.5.8) y=i fc=i Arguing as in Theorem 7.13.3 there are by the Hahn-Banach theorem <f> £ A*, tp £ £t for which <£(1) = 1 = ^(i) and 4>{M) = {0} = 1>(N) (11.7.5.9) By the assumption that the tensor product is uniform there is a well-defined bounded linear functional <f> <g> %j) : A <g> B —> K for which / p \ p p [4> ® ^) X] ^ ® 6*" = ^ ^(a<)^(6i) for each Y,ai®bieA®B \i=l J i=l i=l (11.7.5.10) If the finite rank tensors are only a dense subspace, <j> ® ip extends to the whole of "A <g> B" by continuity. Since [<t> <g> i>) (1 ® 1) = 1 and (<£ (g) ip) (M <g> 5 + A <g> TV) = {0} (11.7.5.11) it follows that the closed left ideal of A <g> B generated by (a — s <g> 1,1 <g> b — t) does not contain the identity 1®1, which means that (s,t) £ <7j|£B(a<g>l, 1(g) 6). This proves the first part of (11.7.5.4), and hence also the second. ■ The analog of Theorem 11.6.3 holds for normed algebras and uniform tensor products: 11.7.6 THEOREM If A and B are normed algebras and D = A <g> B is a uniform tensor product of A and B then for arbitrary a £ An, b £ Bm and
502 11. Multiparameter Spectral Theory 9 e Poly£+m ff(*!f M.^'ifW) C d^g(a ® 1,1 ® 6) <7(ar/hta),arjght(6)) C a^a ® 1,1 ® 6) (11-7'<U) If in particular a £ An is a commuting system then <?£ftff(a ® 1,1 ® 6) = a'jftff(^ft(a),6) <?£ght«7(a ® 1,1 ® 6) = o%shtg(o«eht(a),b)) i11-7*-2) If instead b £ Bm is a commuting system then a^t<7(a®l,l®6)=^ft<7(a,aIJft(6)) c^tya ® 1,1 ® 6) = ar/htff(a,agght(&)) (n-7-6-3) If both cGAn and b G Bm are commutative then agftff(o®l,l®6)=ff(a5ft(a),ajgft(6)) ar^htff(a® 1,1®6) =ff(^ht(a),aright(6)) (U-7'6-4) If in particular A, 5, and 2} are complete and m = n = p = 1 then <7D<7(a<g> 1,1 <g> 6) = 0(aA(a),aB(6)) (11.7.6.5) Proof: This is the argument of Theorem 11.6.6, using (11.7.5.4) in place of (11.6.5.16); for the last part we use the argument of Theorem 11.6.8. ■ The second or third part of Theorem 11.7.6 gives an amusing expression for the spectrum of an "operator matrix." If A is a complex linear algebra we shall write Aqq = A®cCqq = A*2 (11.7.6.6) for the algebra of q x q matrices over A, with obvious addition and multiplication, using (11.7.4.2) to recognize this as a tensor product: if A is a normed algebra then any one of the usual product norms on Aq will be a uniform crossnorm. If c = (a{) £ A we shall write det(c) = Y, {sgn(7r)ai7r(i)^(2) * * * V(g) : * e ^eim{^ 2,..., g)} G A (11.7.6.7) for the "determinant" of c: of course its significance is greatly diminished by the fact that we have had to make a specific choice in the order of the factors in each of the products. Here sgn(7r) is either +1 or —1 depending on whether the permutation is "even" or "odd." If in particular the matrix c has a commuting sequence a = (an,a12,...,alg,...,a ) of entries then
11.7 Tensor Products 503 the ordering of the factors in the determinant no longer matters, and we can go on to define a "cofactor" or "adjugate" matrix adj(c) with cadj(c) = adj(c)c = det(c)J (11.7.6.8) There is a multiplicative property for the determinant: if c' = {a'-) is another matrix and if the whole system (a, a') £ A2q is commutative, then det(cc') = det(c) det(c') = det(c') det(c) = det(c'c) <E A (11.7.6.9) and adj(cc') = adj(c') adj(c) <E Aq* (11.7.6.10) 11.7.7 THEOREM If c = (a{j) <E D = Aqq is a q x q matrix over the normed algebra A with a commuting sequence of entries a = {an,a 12, a^ft(c) = {reC:0e a^ det(c - r)} (11.7.7.1) and a£ght(c) = jrGC:0G ar/ht det(c - r)} (11.7.7.2) and aD{c) ={reC:0e<JA det(c - r)} (11.7.7.3) Proof: Writing B = Cqq let b={blub12,...,blq,...,bqq)eB* (11.7.7.4) obvious ba which be the obvious basis for the vector space B = C , namely, the one for <? q c = (a{j) eD^>c = g{a, b) = ]T ]T a{j ®bijeA®B = D (11.7.7.5) i=iy=i By (11.6.7.2) we have (11.7.7.6)
504 11. Multiparameter Spectral Theory and ^6htW = ^g[a«*\*),b) = U |^ght EE'oV* € ^ht(a) 1 (11.7.7.7) By "classical" determinant theory we have, for each 5 £ Cn, <#**(*. *) = a?hVa,6) = ^M) = *?"*(*, 6) = {r€ C: det($(s,6) - r) = 0} The first four expressions coincide by (3.10.4.1), (6.2.6.5), and (6.2.6.1), because B is a finite dimensional linear algebra, and then implication g(s,b)-re B'1 <£=> det{g{s, b) - r) ^ 0 (11.7.7.9) follows from (11.7.6.8) and (11.7.6.9). If we now apply Theorem 11.3.4 with the polynomial / = det(g(z, b) — r) then we find implication 0 £ aj|ft det(c - r) <<==> det{g{s,b) - r) = 0 for some 5 £ <J!j;ft(a) (11.7.7.10) and 0 £ <5^ght det(c - r) «=> &et(g(s, b) - r) = 0 for some 5 £ <5^ght(a) (11.7.7.11) From (11.7.7.6), (11.7.7.8), and (11.7.7.10) we get (11.7.7.1), similarly (11.7.7.2), and hence, if the algebra is complete, (11.7.7.3). Whether or not A is complete, it is clear from (11.7.6.8) that if det(c — r) £ A~l then c — r £ £>_1; also the reverse implication would flow from (11.7.6.9) if we knew that the entries of a two-sided inverse (c — r)-1 had to commute with one another and with the entries of c — r. To see that this is so observe that for arbitrary c = (ai;) £ D and arbitrary u £ A there is implication c{u <8> J) = (u ® I)c =>■ a{u = ua- for each i,j £ {1,2,..., q} ■ (11.7.7.12) Superficially similar to Theorem 11.7.7 is the spectrum of an upper triangular matrix, with no commutivity restriction on the entries: c = (aiy) £ D => aD{c) = (J dA{ajj) (11.7.7.13) where D is the algebra of upper triangular q x q matrices over a normed algebra A. To see this observe that D = A <8> B for the algebra B of upper triangular qxq matrices over the complex field C, and that for an arbitrary
11.7 Tensor Products 505 system b £ Bm «B(6)={6ii.622,-,6mm} (11.7.7.14) for the left and the right, and the almost left and right, spectra w: thus also ujBg{b) = gujB{b) if b £ Bm, g £ Poly^ (11.7.7.15) The reader should now mimic the argument for Theorem 11.7.7, interchanging the roles of A and B. The spectrum of a continuous vector-valued function is what it ought to be: If / : H —► A is totally bounded and continuous from a normal HausdorfF space H to a normed algebra A then D = ^(fl, A) => oD(f) = closure (J 5J(t) (11.7.7.16) ten We prove more: 11.7.8 THEOREM If D is a uniform tensor product of normed algebras A and B, where A is complete and B is commutative, then for each c £ D there is equality S%%)= U *aV(c) and **"(<:) = |J otf'VM (11.7.8.1) where n n ^A(c) = lim ^2 ^{bj)aj :^L,an® bj —* c for each ^ e B%> c ^ ^ y=i y=i (11.7.8.2) Proof: The uniformity of the tensor product ensures that ipA is well defined, bounded, and linear on the set A <8> B C £), and the completeness of A means that it extends uniquely to D: now we have il> £ Bx =► <PA £ HBL(£>, A) (11.7.8.3) For equality (11.7.8.1) we need the extension of Theorem 11.7.6 to infinite systems: If ip £ B* and r £ £jfft^A(c) then by the extended version of (11.7.5.4) we have tysr) £a£ft(l<g>£,^A(c)<g>l) CCflxC (11.7.8.4) and hence by the extended version of Theorem 11.3.5 tys r) £ a£ft (1 <g> B, c) C CB x C (11.7.8.5)
506 11. Multiparameter Spectral Theory In particular it follows that r £ <?pft(c). Conversely if this is assumed then by Theorem 11.4.4 there is a character ip e B* for which (11.7.8.5) holds, and hence again by (11.7.5.4) also (11.7.8.4); now Theorem 11.7.6 again gives r £ <j^ft^A(c). This proves the first part of (11.7.8.1), and hence also the second ■ To deduce (11.7.7.16) form Theorem 11.7.8, at least for compact Haus- dorff spaces, simply take B = C^Q). For Allan's Theorem take B to be the algebra Holo^) = {f eC(K): (11.7.8.6) / has a holomorphic extension to some open H D K} associated with a compact set K C C. Note that Ho1o(jK") is a subalgebra of C{K) but not in general closed in the usual norm; however, using a Cauchy integration, g £ closure Holo(ii:) =^ g G Holo(interior(X)) (11.7.8.7) 11.7.9 THEOREM Suppose that A is a Banach algebra, that K C C is compact, and that H C interior[K): then for each / £ Holo(iJT, A) there is implication /(*) £ Aj;Jt for each t £ K =► / £ Holo(H,A)fJt (11.7.9.1) and /(<) € AS*ht for each t £ K =► / £ Holo(H, A)^ht (11.7.9.2) Proof: If we take £> = closure Holo(jRT, A) C ^(jRT, A) = C(jK", A) (11.7.9.3) then Z? is a tensor product of A and B = Ho1o(jK"): for if / £ Ho1o(jK", A) then t£K^m = ±.ff{z){z_t)-idz K (11.7.9.4)
11.7 Tensor Products 507 giving n / = lim^6iOay (11.7.9.5) with v ; bJ = (SJ ~ *)"' and aJ = ^M5; " 5;-i) By Theorem 11.7.8 we have /(*) £ A^Jt for each t e K =^ f <£ cle{t(D) =>> / £ D^k (11.7.9.6) and /(*) € ^ht for each t € K => f <£ ar'*ht(D) =► / € I>^ht (11.7.9.7) using Theorem 4.4.5 and the completeness of D at the end. But now if g £ D is either a left or a right inverse for / then (11.7.8.7) extends to vector- valued functions to tell us that the restriction of g to H is holomorphic ■ Theorem 10.9.5 has an analogue for tensor products: 11.7.10 THEOREM Suppose T £ BL{X,Y), 5 £ BL{Y,Z), R £ BL(Z,W), U £ BL(y,y), and 7 <E BL(Z,Z) satisfy 75 = Stf, with (jR, 5) not almost exact and (5, T) not almost exact (11.7.10.1) and V not bounded below or U not almost open (11.7.10.2) then jR® J 000' 7(8)7 5(g)/ 5(g)/ 0<g>0l\ , , , is not almost exact (11.7.10.3) -I®U T®l\J K J Proof: We prove something quite different. If (R,S) and {S,T) are not (left,right) one-one then there are z e Z, h E Z^, y E Y, and g £ yt for which Rz = 0 = hS and Sy = 0 = gT and /i(z) ^ 0 ^ $(</) (11.7.10.4) If 7 is not one-one or C7 is not onto then there are z' £ Z, hf E Z^, yr E Y,
508 11. Multiparameter Spectral Theory and g' £ Y^ for which Vz' = 0 = g'U and (/i'(z') # 0 = q' or 9'W) ± 0 = z') (11.7.10.5) In either case R®I 0<g>0 /(g)^ 5(g)/ z <g> z' 0 <g> 0 L 0 (g) 0 2/0 2/' _ r/Kgj/i' o® o ~ [ 0(g)0 g®g'\ 0(g)0 0(g)0' 0(g)0 0<g>0. 1 5(g)/ 0<g>0" _-J<g>i7 r<g>/_ (11.7.10.6) and /i<g> /i' 0 (g> 0 0(g)0 y </'j z® z' 0(g)0 0(g)0 " y®yf ± "0 0' .° °. (11.7.10.7) so that the left-hand side of (11.7.10.3) is not (left,right) one-one. To convert this argument to a proof of the theorem as stated we now systematically replace vectors and linear functionals by bounded sequences, and equality to zero by convergence to zero. The details are left to the reader. ■ The reader might also like to note the following extension of Theorem 11.7.5: If a e An, a' e An\ b e Bm, and 6' G £m' then there are inclusions ^ett'right(a,a') x ^ft'right(6,6') C ^^ight((a,a') ® 1,1 ® (6,6')) (11.7.10.8) flf™ht(a,a') x f£ft'r*ht(6,6') C f^ht((a,a') ® 1,1 ® (6,6')) (11.7.10.9) <^t£ght(K a') ® 1,1 ® (6,6')) C <^ft >right (a, a') x a™'***(6,6') and equality (11.7.10.10) (11.7.10.11) 11.8 QUASICOMMUTING SYSTEMS The spectral mapping theorem extends to certain noncommuting systems of normed algebra elements. We begin by looking at "derivations": 11.8.1 DEFINITION A linear operator D e L{A, A) on an algebra A is
11.8 Quasicommuting Systems 509 called a derivation if D{ab) = a(Db) + (Da)b for each a,b <E A (11.8.1.1) We shall write DBL(A, A) ={De BL(A, A): D is a derivation} (11.8.1.2) for the bounded derivations on a normed algebra A. Evidently DBL(A, A) is a closed linear subspace of BL(A, A), and is also closed under the formation of "commutators": D,D' e DBL(A, A) =^ D'D - DD' <E DBL(A, A). (11.8.1.3) Each element a £ A gives rise to the "inner derivation" of (11.6.9.12): aeA^La-Rae DBL(A, A). (11.8.1.4) The ordinary product of derivations is not usually a derivation. We have however Leibniz9 rule: if D £ DL(A, A) and a,b G A then by induction L>»(a6) = J2 [) [Dn~ra){Drb) for each n <E N (11.8.1.5) r=0 ^ ' The Kleinecke-Sirokov theorem says that bounded derivations breed quasi- nilpotents: 11.8.2 THEOREM If D e DBL(A, A) and a <E A satisfy either D2a = 0 (11.8.2.1) or a(Da) = (Da)a (11.8.2.2) then ||(£>a)n||1/n—>0 asn—> oo (11.8.2.3) Proof: If D <E DBL(A, A) and x G AN satisfy D2(xn) = 0 for each n <E N (11.8.2.4) then we claim Z)n(x1x2 • • • xn) = (n!)(Z)x1)(Z)x2) • • • (Dxn) for each n <E N (11.8.2.5)
510 11. Multiparameter Spectral Theory and Dn+1{x1x2 • • • xn) = 0 for each n <E N (11.8.2.6) To see this inductively assume both (11.8.2.5) and (11.8.2.6) for n = k and apply (11.8.1.5) with n = k + 1, a = x1x2- — xk and b = xk+1 to derive (11.8.2.5) for n = k + 1, and hence also (11.8.2.6) for n = k + 1. Taking xn = a for each n and assuming (11.8.2.1) thus gives ||(Pa)"|| < ||i?|r||a|r/»! (11.8.2.7) and hence (11.8.2.3), since (n!)_1/n ->0asn->oo. For the other implication write Aa for the inner derivation on BL(A, A) obtained by substituting La for a in (11.8.1.4) and observe Aa{D) = -LDa (11.8.2.8) Thus if (11.8.2.2) holds then A2a(D) = 0 (11.8.2.9) We may now use this in the same way as we have just used (11.8.2.1) to see that \\(Da)n\\ = \\(-LDan < l|A0|H|2>ir/»! < 2»||a|r||2?|r/n! ■ (11.8.2.10) For example, either component of Theorem 11.8.2 gives implication Da = 1 =^ 1 quasinilpotent =^ A = {0} (11.8.2.11) The second part says that if 0 ^ 5 £ C then Da = sa =r> a quasinilpotent (11.8.2.12) Here we can say more: since Da = sa=^ D{an) = nsan (11.8.2.13) it is clear from Theorem 9.2.3 and Theorem 9.3.2 that a is actually nilpo- tent: Da = sa, n\s\ > \\D\\ =^ an = 0 (11.8.2.14) The second part of Theorem 11.8.2 also tells us that bounded derivations on commutative normed algebras are essentially trivial: A commutative, D e DBL(A, A) =^ D{A) C Radical(A) (11.8.2.15)
11.8 Quasicommuting Systems 511 In particular A = C^fl) =► DBL{A,A) = {0} (11.8.2.16) From either part of Theorem 11.8.2 it is clear that if a, 6 £ A then ba — ab £ comm(a) => ba — ab quasinilpotent (11.8.2.17) and ba — ab £ comm(6) =>• 6a — a6 quasinilpotent (11.8.2.18) When both conditions hold at once then (a, 6) £ A2 is called a "quasi- commuting pair": 11.8.3 DEFINITION b £ Am is said to quasi-commute with a £ An if 6a - a6 £ Anm commutes with (a, 6) £ An+m (11.8.3.1) in the sense that each bka^ — afik commutes with each a^, and with each bk,. If this holds with b = a then a £ An is called a quasi-commuting ra-tuple. Here we are interpreting the "commutator" 6a — a6 as a rectangular matrix of elements of A, rather in the spirit of Theorem 11.4.3; alternatively we can write it out (in one of two ways) as an (ram)-tuple. To prove spectral mapping theorems for quasi-commuting systems, we need another auxiliary concept: 11.8.4 DEFINITION The system 6 £ Am is said to be left invariant under a £ An if ( 2^ Abk ay C closure 2J Abk J for each j £ {1,2,..., n} (11.8.4.1) right invariant under a £ An if ay I 2~^ 6^ C closure I Y^ 6^ for each j £ {1,2,..., n} (11.8.4.2) and completely invariant under a G An if b — t is left and right invariant under a £ An for each t £ Cm (11.8.4.3)
512 11. Multiparameter Spectral Theory For example, if a £ An commutes with b £ Am then (11.8.4.3) holds; if t £ Cm is not in <jleft(6) then b — t e Am is left invariant under arbitrary a £ An. Quasi-commutivity induces this kind of invariance: 11.8.5 THEOREM Suppose a £ An and b £ Am: If ba - ab commutes with a then (6,6a — ab) is completely invariant under a (11.8.5.1) If ba — ab commutes with (a, 6) then for each system of polynomials / £ (/(6),6a — ab) is completely invariant under a (11.8.5.2) If ba — ab commutes with (a, 6) and a is commutative then for arbitrary g 6 Poly*+m (g(a,6),6a — a6) is completely invariant under a (11.8.5.3) Proof: Begin with an observation about derivations: if D £ T)L{A,A) and / £ Polyn then there are systems of polynomials /',/" £ Poly£ for which there is implication, for arbitrary a £ An, n n Da £ comm(a)'1 =► Df(a) = £ /;.(a)(D^) = ^(^O/y'W y=i y=i (11.8.5.4) This is trivial if / is either a constant or a coordinate and trivially inherited by sums: if (11.8.5.4) holds with f = g and with f = h then it must also hold with f = g + h. To see that this is also inherited by products observe (assuming that (11.8.5.4) holds with f = g and f = h) n D{g{a)h{a)) = {Dg{a))h{a) + g{a)Dh{a) = ]T(0y(a)/i(a) + g{a)h'j{a))Daj 3 = 1 noting that Daj commutes with h(a) for each j. Thus if / = gh then /y = 9'jh + gh'jy and similarly /j' = g'Jh + gh'j, for each j £ {l, 2,..., n}. Towards (11.8.5.1) we must prove that for each t £ Cm and r £ Cmn (6 — t, ba — ab — r) is left invariant under a (11.8.5.5) By (11.10.2.17) this is trivial if r ^ 0 £ Cmn, and if r = 0 and N is the
11.8 Quasicomnruting Systems 513 closed left ideal generated in A by (6 — t,ba — ah) we have for each j\k,i (11.8.5.6J {bka{ - a{bk)aj = a^b^ - a{bk) £ N so that Naj C N for each j. The argument for right invariance is of course the same. Towards (11.8.5.2) we use (11.8.5.4) to see that, since ba — ab commutes with 6, m f{b)aj - ajf{b) = ]T f'k{b){bkaj - ajbk) for each j £ {1,2,... ,n} whenever / £ Polym is a polynomial in m variables: now if instead / £ Poly^ and r £ Cp take N to be the closed left ideal generated by (/(6) — r, ba — ab) and see that m {fk{t>)-rk)*j = a,-(/*(6)-rfc)+E/**(6)(6*0>-°y6*) e * for each -?>fc 1=1 (11.8.5.7) which together with the second part of (11.8.5.6) gives again Na ■ C JV for each j. Finally, for (11.8.5.3), apply (11.8.5.2) with (a,6) in place of b to see that ((<7(a,6),6a — a&,aa — aa) is completely invariant under a (11.8.5.8) and then specialize to a commuting system a £ An. ■ The almost left and right spectra, and the approximate eigenvalues, of a quasi-commuting system have the "projective property": 11.8.6 THEOREM If a £ An and b £ Am are such that ba - ab is commutative and commutes with b then there is equality, for each u £ {aleft,arisht,fleft,frisht}, "(*)="6a-a6=o(*) (11.8.6.1) If instead a is quasi-commutative and commutes with ba — ab then there is equality <"»«-«»=o(6) =<".=. (*) (11.8.6.2) Hence if a is quasi-commutative and quasi-commutes with 6 then W(6)=a;0=0(6) (11.8.6.3)
514 11. Multiparameter Spectral Theory Proof: If c = ba- ab £ Anm then (11.3.4.1) from Theorem 11.3.4 and (11.8.2.8) give u[b) = uc=c(b) = uc=0(b) (11.8.6.4) for each of the four sets u: this is (11.8.6.1). Towards (11.8.6.2) we begin with the observation, using (11.2.6.3) and (11.2.6.4), that with no restriction on (a,6) £ An+m there is equality uj(a,b) = wba_ab=0(a,b) and hence inclusion <"«=«(*) C wto_ot_0(6) (11.8.6.5) To reverse the inclusion we may use induction on ra, and indeed it is sufficient to prove (11.8.6.2), and hence (11.8.6.3), when n = 1. If therefore a = ax commutes with ba—ab £ Am then by (11.8.5.1) the system (6, ba—ab) is completely invariant under a £ A; but now we may repeat the proof of the first part of Theorem 11.3.4. The complete invariance guarantees that with either T = row(jR6_t|6o_o6) or T = iov/(Lb_tM_ab), the conditions of either Theorem 11.3.2 or Theorem 11.3.3 are satisfied with V = Ra and U = La: by the argument of the first part of Theorem 11.3.4 it follows that u(b,ba - ab) = w0=0(6,6a - ab) (11.8.6.6) for each of the four sets oj. This proves (11.8.6.2) when n = 1, and hence for more general n: to make the induction observe that for each j = 1,2,..., n— 1 the element aJ+1 commutes with each commutator daj+1 — aJ+1rf, where d = (6, al9... ,a;). From (11.8.6.1) and (11.8.6.2) we then immediately get (11.8.6.3). ■ Theorem 11.8.6 is only the analogue of the first part of Theorem 11.3.4; unlike the commutative case the analogue of the second part is not here a trivial consequence. The point is that if a £ An is quasi-commutative and / £ Poly^1 is a system of polynomials then it is possible that a does not quasi-commute with b = f(a). In spite of this, with a bit of extra work, the spectral mapping theorem does follow: 11.8.7 THEOREM If a £ An is quasi-commutative and quasi-commutes with b £ Am then for each g £ Poly£+m and each w £ {aleft,aright,fleft, -righty there is equality ug{a,b) = ua=ag{a,b) = [j ua=3g{s,b) C ug{u{a),b) (11.8.7.1) aGw(fl) In particular if a £ An is quasi-commutative and / £ Poly^1 there is equality ojf(a) = foj(a) (11.8.7.2)
11.8 Quasicommuting Systems 515 Proof: If the commutator ba — ab is commutative and commutes with (a,6), hence also with g(a,6), then it is an application of (11.3.4.1) from Theorem 11.3.4 that c = ba- ab =>■ u>g(a, b) = u>c=cg(a, b) = wc=0g{a, b) (11.8.7.3) We claim that also u(g(a,b),ba - ab) = va=a{g{a,b),ba - ab) (11.8.7.4) and hence "ba-ab=0<l{a> b) C Ua=a<l{a> b) (11.8.7.5) This again is induction on ra, and it is sufficient to prove it in the case n = 1. In the case n = 1 the system a £ An is of course commutative, and since ba — ab commutes with (a, 6) then by (11.8.5.3) the system (g(a, 6),6a — ab) is completely invariant under a, but now (11.8.7.4) follows by the same argument as (11.3.4.1) from Theorem 11.3.4. This gives (11.8.7.5), which together with (11.8.7.3) gives the first part of (11.8.7.1), and hence of course (11.8.7.2). For the rest of (11.8.7.1) we observe that ua=8g{", *>) = "a=sd{s, *>) ^r each s £ Cn (11.8.7.6) Indeed if a — s £ An is left invariant under b £ Am then, with N = closure2y=i A(ay - s;) and M = {d £ A : Nd C N}, g{a,b) £ Mp and g{a,b) - g{s,b) £ Np (11.8.7.7) It is sufficient to prove this for p = 1, and this is established by verifying it if g(a,b) = /(6), if g(a,b) = a;-, and if g is the sum or product of polynomials for which it holds. Thus if a £ An is completely invariant under b £ Am then (11.8.7.6) holds for the almost left and right spectrum, since (g(a,b) — r,a — s) always generate the same closed left and right ideals; by taking annihilators of ideals and considering enlargements we also obtain (11.8.7.6) for approximate eigenvalues. Under the assumptions of Theorem 11.8.7 we have by (11.8.5.1) that (a, ba — ab) is completely invariant under b (11.8.7.8) and hence c = ba - ab =>• ^(ajC)=(5j0)y(a,6) = w(0|C)=( ,0)g{s,b) for each 5 £ Cn (11.8.7.9) applying (11.8.7.6) with (a, ba — ab) in place of a. But now (11.8.7.3) conveys f11.8.7.9) ^ fU.8.7.6), and finishes the proof of (11.8.7.1) ■
516 11. Multiparameter Spectral Theory The conditions (11.8.6.1) and (11.8.6.2) are neither necessary nor separately sufficient for the conclusions of Theorem 11.8.6. For example, in the algebra A of 3 x 3 matrices, take U = Then and ro o -0 1 0" 0 1 0 0. V = 10 0 0 0 0 0 0 -1 and W = VU-UV = U but a{V) ± au=u{V) 11 0 - 0 0-1 0 0 -1. (11.8.7.10) (11.8.7.11) W = VU + UV + V but <j{W) ^ <jv=v [W) (11.8.7.12) Thus if a = U and b = V then (11.8.6.2) holds but not (11.8.6.1); if instead a = V and b = U and g(a, b) =ba + ab + a = W then (11.8.6.1) holds but not (11.8.7.1), and with 5 = 0 not (11.8.7.6). If instead A is the algebra of upper triangular p x p matrices then equality (11.8.7.2) holds for arbitrary cGAn and / £ Poly™. It is clear from their proofs that Theorem 11.8.6 and Theorem 11.8.7 sit at the beginning of a chain of increasingly uninteresting generalizations: 11.8.8 THEOREM If a £ An and b £ Am then each of the following conditions is sufficient for equality u>(b) = o:a=a(b) : D[a is quasi-commutative and commutes with D[+1a for all r £ N U {0} and D^a commutes with b for some p £ N U {0} {Dab, D^b,..., Drab) is commutative for each r eM and a is quasi-commutative and commutes with D%b for some p £ N U {0} a is quasi-commutative and Dba = tb for some t £ Cn (11.8.8.1) (11.8.8.2) (11.8.8.3) Proof: The sufficiency of (11.8.8.1) is derived from Theorem 11.8.6 by repeatedly using the conclusion (11.8.6.3) in the derivation of (11.8.6.1). For the sufficiency of (11.8.8.2) observe that the system c = (b,Dab,Dlb,..., D*b) is completely invariant under the system a, giving wc=Q{b) C w0=0(6). Finally (11.8.8.3) is clearly sufficient if t = 0, and in any case there is inclusion uj(a) C ujb=0(a) Uuj(a + t) (11.8.8.4) Thus by the boundedness of u>(a) there is equality u>(a) = u>b=0(a) and hence, since u>(b) = {0} and uj(a) ^ 0, equality uj(b) = c<;a_a(6) again. ■
11.9 The Taylor Spectrum 517 11.9 THE TAYLOR SPECTRUM The "left" and "right" spectra of Definition 11.1.1 do not quite add up to an adequate "spectrum" for a system of elements a £ An: we need some "middle" terms. For commuting systems of linear operators, the right way to do this has been discovered by Joseph Taylor: we shall tentatively make an extension to noncommuting systems in normed algebras. For an ra-tuple a £ An we need as an auxiliary the "exterior algebra on n generators" An = A(dz) = Alg{dz1,dz2,...,dzn) (11.9.0.1) Whimsically perhaps we shall identify these generators with "differentials" induced by the coordinates z = (z1,z2,...,zn). Exterior multiplication is associative and anticommutative: dz- A dz{ = —dz{ A dzj for each i,j and dzx A dz2 A • • • A dzn ^ 0 (11.9.0.2) so that k{dz) is a linear space of dimension 2n, which is the direct sum of 7i + 1 subspaces kv{dz) of "homogeneous forms of degree p," of dimension (p). For each j the mapping Ay of multiplication by dz^ sends Ap~1(dz) into KP{dz) for each p = 1,2,...,ra, and sends An(dz) into 0. 11.9.1 DEFINITION If X is a linear space we write An{X) =A{X;dz) =X®k{dz) = I x0 + ]T ]T xj ® dz-xj <E X \ { p=1 IjI=p J (11.9.1.1) where J = Ui>J2> • • • >Jp) => \J\ = P and dzj = dzj\ A dzj2 A • • • A dzjp (11.9.1.2) If x £ Xn is an ra-tuple of elements we shall write n Ax(x) = Y,xj ® rf*y € A£(X) (11.9.1.3) If T £ L(X, X)n is an n-tuple of linear operators we shall write n A(T) = E Ti ® Ai e L(AnP0> An(X)) (11.9.1.4)
518 11. Multiparameter Spectral Theory for the Koszul complex of T, or equivalently the induced sequence (11.9.1.5) An(T) A»(X)< A^X) A2(T) , AX(T) For example if n = 2 the sequence (11.8.1.5) can be represented by the operator matrices [ -T2 Tx ] [';] (11.9.1.6) while the operator (11.9.1.4) is represented by the matrix 0 0 0 0 0 T2 0 0 0 Ti 0" 0 0 0 : 'X" X X _x_ — "XI x X _x_ (11.9.1.7) Both we and the reader will find it helpful to examine all theorems and definitions for the Taylor spectrum separately in this case. More generally we can express the Koszul complex of an (n + l)-tuple (T, S) in terms of the Koszul complex of the ra-tuple as a sort of lower triangular operator matrix: A(T,S) = A(T) 0 5V A(T) An(X) LAn(X) An(X) LAn(X) (11.9.1.8) where the matrix 5V is a 2n x 2n diagonal matrix whose diagonal entries are ±5: to see this represent the vectors x £ An+1(X) = A(dz, dw) in a column in which all the terms involving dw are kept to the last: the reader should experiment with the case n = 2. The commutativity of the operator n-tuple A(T) is equivalent to the chain condition (10.3.0.1) for the pair (A(T),A(T)), or its extension (10.3.1.20) for the sequence (11.9.1.5), which of course is tacitly assumed when we call it the Koszul "complex:" 11.9.2 THEOREM If T e L(X,X)n is an n-tuple of linear operators then the following are equivalent: T is commutative Ap(r)Ap_1(r) = 0 for each p <E {1,2,... ,n} A(T)2 = 0 (11.9.2.1) (11.9.2.2) (11.9.2.3)
11.9 The Taylor Spectrum 519 Proof: The equivalence of the first two conditions may safely be left to the reader: for the equivalence of the second two we compute Ap(T)(Ap-1(T)(xJ- <g> dzj = Y, (TiTi' ~ Ti'Ti)x3 ® dzi A dzi' A dzj l<i<i'<n (11.9.2.4) for each j = (j\,j2, • • • )Jp-i) an(i eacn x £ AP_1(X). ■ If X = A is a linear algebra and a G An then A1 (a) = Y^=i ai ® dzi can be treated as an element of the linear algebra An(A) = A <g> A(dz): the argument of Theorem 11.8.2 now gives a commutative <=> A1 (a)2 = 0 (11.9.2.5) More generally we interpret Ap(a) for each p £ {0,1,..., n} as certain kinds of rectangular matrix over the algebra A. A system T £ BL(X, X)n of bounded operators on a normed space can be classified as "Taylor nonsingular" if the pair of operators (A(T), A(T)) is nonsingular in the corresponding sense from Definition 10.3.1: 11.9.3 DEFINITION The n-tuple T £ BL(X,X)n will be called Taylor invertible if there are U and V in BL(An(X), A.n(X)) for which VA(T)+A(T)U = I (11.9.3.1) and called almost Taylor invertible if there are {Um) and (Vm) in BL(An(X),An(X)) with ||VmA(T) + A(T)Um - I|| — 0 and sup ||tfj| + sup ||VJ| < oo m m (11.9.3.2) The n-tuple T £ BL(X,X)n will be called Taylor nonsingular if there are k > 0, h > 0 for which for each x £ An(X),x £ Disc(0;/i||A(T)x||)+A(T) Disc(0;A;||x||) (11.9.3.3) and will be called almost Taylor nonsingular if there are k > 0, h > 0 for which for each x £ An(X),x £ cl(Disc(0 ; /i||A(T)x||) + A(T)(Disc(0 ; A;||x||)) (11.9.3.4) The ra-tuple T £ BL(X, X) will be called linearly Taylor nonsingular if A(T)-1(0)CA(T)An(X) (11.9.3.5)
520 11. Multiparameter Spectral Theory and will be called almost linearly Taylor nonsingular if ACT)"1^) C clA(T)An(X) (11.9.3.6) If T £ BL(X,X)n is Taylor invertible then (11.9.3.1) holds for operators U and V on An(X) of a special form: there are T',T" in BL(X,X)n for which A?{T")lL{T) + A{T)A^{Tf) = I (11.9.3.7) where we define, for arbitrary S £ BL{X,X)n, n At(S) = ]TSy<g>A; (11.9.3.8) i=i Here Ay is the Hilbert space adjoint of the operator Ay induced by the obvious inner product on the space A.(dz): n (a ; P) = a0P0 + J2J2 "jPjdzj (11.9.3.9) p=i |j|=p This is because, when U, V, and A(T) are represented as operator matrices, many of the terms of U and V do not participate in the action, and when these are replaced by zeros we find that (11.9.3.1) is reduced to (11.9.3.7). One simple observation is clear from (11.9.3.7): if T € BL(X,X)n has a left (equivalently a right) inverse S £ BL(X, X)n for which {Tjij e {1,2,...,n}} U {Sjij e {1,2,...,n}} is commutative (11.9.3.10) then T is Taylor invertible: if (11.9.3.10) holds then Y^ SjTj. = I=> Af(5)A(T) + A(T)Af(5) = I (11.9.3.11) j We can make the same comments about almost Taylor invertibility. Relative to the closed set of commuting n-tuples, almost invertible and almost nonsingular ra-tuples form open sets: 11.9.4 THEOREM If X is a normed space then {T £ BL(X, X)n:T commutative and almost Taylor invertible} (11.9.4.1)
11.9 The Taylor Spectrum 521 and {T G BL(X,X)n:T commutative and almost Taylor nonsingular} (11.9.4.2) are open subsets of {T G BL(X,X)n:T commutative}. Proof: Observe that the mapping T —► (A(T), A(T)) is continuous and takes commuting n-tuples T G BL(X,X)n into chains (A(T), A(T)) G BL(An(X),An(X),An(X)), and then apply Theorem 10.3.5 and Theorem 10.3.8. ■ When the normed space X is complete, several of the Taylor nonsin- gularities coalesce: 11.9.5 THEOREM If T G BL(X,X)n is a commuting n-tuple on a Ba- nach space X then the following are equivalent: T is Taylor nonsingular (11.9.5.1) T is almost Taylor nonsingular (11.9.5.2) T is linearly Taylor nonsingular (11.9.5.3) If X is complete then also {T G BL(X,X)n: T is commutative and Taylor invertible} (11.9.5.4) is an open subset of {T G BL(X,X)n : T commutative}. Proof: The first part is by application of Theorem 10.5.4 to the chain (A(T), A(T)), noting that in this case the second part of condition (10.5.4.2) follows from the first. The last part is the end of Theorem 10.5.6. ■ Once again we are unable to show that, for complete X and commuting TGBL(X,X)n, T almost Taylor invertible => T Taylor invertible (11.9.5.5) Under passage to enlargement, almost Taylor nonsingularity takes several forms: 11.9.6 THEOREM If X is a normed space and T G BL(X,X)n is com-
522 11. Multiparameter Spectral Theory mutative then the following are equivalent: Q(T) is almost linearly Taylor nonsingular (11.9.6.1) T is almost Taylor nonsingular (11.9.6.2) Q(T) is Taylor nonsingular (11.9.6.3) Proof: Apply Theorem 10.5.2 to the chain (A(T), A(T)). ■ Taylor nonsingularity is preserved under duality: 11.9.7 THEOREM If T e BL(X,X)n is arbitrary then the following are equivalent: T* is almost Taylor nonsingular (11.9.7.1) T is almost Taylor nonsingular (11.9.7.2) Tf is Taylor nonsingular (11.9.7.3) Proof: Specialize Theorem 10.4.3 to R = I and apply it to (A(T), A(T)). ■ Different definitions of "Taylor nonsingularity" give rise to different kinds of "Taylor spectrum": 11.9.8 DEFINITION If X is a normed space and T <E BL(X,X)n is arbitrary then the Taylor invertible spectrum of T is the set <jTaylor(r) = {seCn:T- si is not Taylor invertible} (11.9.8.1) The Taylor almost invertible spectrum is the set (7Taylor(T) ={seCn:T- si is not almost Taylor invertible} (11.9.8.2) The Taylor singular spectrum is the set rTaylor(T) ={seCn:T- si is not Taylor nonsingular} (11.9.8.3) and the Taylor almost singular spectrum is the set fTaylor(T) ={seCn:T -si is not almost Taylor nonsingular} (11.9.8.4) For commuting systems of operators in incomplete normed spaces the almost invertible and the almost singular spectrum are compact sets:
11.9 The Taylor Spectrum 523 11.9.9 THEOREM If T G BL(X,X)n is a commuting system on a normed space X then n aTaylor(T) = claTaylor(r) C JJ v{T) (11.9.9.1) and fTaylor(T) = clfTaylor(T) C claTaylor(r) (11.9.9.2) Proof: Both the almost singular spectrum and the almost invertible spectrum are continuous counterimages, by the mapping T — zl : Cn —► {S G BL{X,X)n: S commutative}, of closed subsets of {S G BL(X,X)n: S commutative}, namely, the complements of the open subsets (11.9.4.2) and (11.9.4.1). From (11.9.3.10) it follows that n aTaylor(r) Q ^(r) Q JJ ^ (11.9.9.3) 3 = 1 whenever A is a commutative subalgebra of BL(X, X) containing {T;-: j = 1,2,...,n}, and from the "almost" analog of (11.9.3.10) we have, in the same circumstances, n ^Taylor(r) Q -^ Q JJ ~(r) (11.9.9.4) y=i Finally, to finish (11.9.9.2), we have rTaylor(T) C aTaylor(T) and rT*yloT{T) C aTaylor(T) ■ (11.9.9.5) The almost singular spectrum is nonempty, and subject to the spectral mapping theorem for polynomials: 11.9.10 THEOREM If T G BL(X,X)n and S G BL(X,X)m form a commuting system (T,S) G BL(X,X)n+m on a normed space X then fTaylor^ = f£^>r(T) (11.9.10.1) and if / G Poly™ then fTaylor/(T) = frT^loT(T) (11.9.10.2) Proof: By induction on m it is sufficient to prove (11.9.10.1) when m = 1. To do this suppose that 5 G Cn and t G C satisfy \s,t) G fTaylor(T,5), so
524 11. Multiparameter Spectral Theory that /rA(T-sJ) 0 U(S-tOv A(T-sJ) is not almost exact and hence by Theorem 10.9.5, (A(T - si), A(T - si)) is not almost exact (11.9.10.4) Thus the right-hand side of (11.9.10.1) is included in the left. Conversely if 5 e Cn satisfies (11.9.10.4) then by Theorem 10.3.7 either A(T - sI)A : An(X)/cl A(T - sI)An{X) —► An{X) is not bounded below (11.9.10.5) or A(T - sI)A : An{X) —> A(T - 5/)_1(0) is not almost open (11.9.10.6) By Theorem 11.3.3 there is t <E C for which (11.3.3.1) either col(A(T - si), (S - tI)v)A is not bounded below (11.9.10.7) or row(A(T - si), (5 - tI)v)A is not almost open (11.9.10.8) By Theorem 10.9.6, in particular (10.9.6.5) and the analog of (10.9.6.6) for almost exactness, each of (11.9.10.7) and (11.9.10.8) imply (11.9.10.3). This proves (11.9.10.1) when m = 1, and hence for arbitrary m. Towards (11.9.10.2) suppose that A C BL(X, X) is a commutative subalgebra containing {T:j = 1,2,...,n}: then by (11.9.3.11) there is inclusion fTaylor(r) Q ^Taylor^ Q ^ft(T) = (J^T) (11.9.10.9) We claim that this gives equality fTaylor(T) = {4>{T):4> e fTaylor(A)} (11.9.10.10) where, as in (11.4.3.4), fTaylor(A) = {<t>e A*: for each m <E N and S <E Am,<f>{S) G fTaylor(5)} (11.9.10.11) Indeed fTaylor(A) is a compact subset of A*, and since by (11.9.10.1) the family {{4> e A*:4>{S) e fTaylor(5)})nGN)5GA, (11.9.10.12) A(T - si) 0 [(S-tI)v A(T-sI) (11.9.10.3)
11.9 The Taylor Spectrum 525 has the "finite intersection property," fTaylor(A) is nonempty. The right- hand side of (11.9.10.10) is obviously contained in the left; conversely if s e f Taylor (T) then the family Qf sets ({</> e A*:*{T) = s and <j>{S) G rTaylor(5)})mGN5GAm (11.9.10.13) has the finite intersection property, and if <f> £ A* is taken from its intersection then 4>{T) = s. Equality (11.9.10.10) now gives (11.9.10.2): if / e Poly;? then fTaylor/(r) = {^/(T): ^ ^Taylor (r)| = {f<t>{T):<t> e fTaylor(T)} = /fTaylor(r) m It is clear from Theorem 11.9.10 and the inclusion (11.9.9.5) that the almost invertible spectrum aT*yloT(T) of a commuting n-tuple T G BL(X,X)n is nonempty. To see that it has the spectral mapping theorem for polynomials we can use Theorem 10.4.5 and its analogues (10.4.6.5) and (10.4.6.6): 11.9.11 THEOREM If T e BL(X,X)n is a commuting n-tuple on a normed space X then aTaylor(r) = rTaylor(Lr) = rTaylor(i?T) (11.9.11.1) and <7Taylor(r) = fTaylor(Lr) = fTaylor(£r) (11.9.11.2) Proof: For (11.9.11.1) apply the extended version of (10.4.6.1) and (10.4.6.2) to the Koszul complex (11.9.1.5), augmented with zeros at either end. For (11.9.11.2) use (10.4.6.5) and (10.4.6.6) ■ The Taylor spectrum of a tensor product n-tuple is the cartesian product of the spectra of the factor operators: 11.9.12 THEOREM If T e BL(X, X)n is a commuting system of operators on a normed space X then n n II ?(*)) £ fTaylor(T®) C aTaylor(T®) C H a(ry) (11.9.12.1) y=i y=i where we write r® = (rf,rf,...,r®) withrf = /®---®rf ®---®/ (11.9.12.2)
526 11. Multiparameter Spectral Theory If the space X is complete then n fTaylor^®) = JJ ^j (11.9.12.3) Proof: The middle inclusion is just (11.9.9.5), and the final inclusion is (11.9.9.1) together with (11.7.5.4) from Theorem 11.7.5. Towards the first inclusion we claim that, if S G BL(X, X) is such that (T, S) G BL(X, X)n+1 is commutative then fTaylor(T) x t{S) C tTaylor(T® <g> /, 7® <g> 5) (11.9.12.4) To see this apply Theorem 11.7.10 with (A(T) - si, A(T) - si, A(T) - 5/, (5 - tl)v, -(5 - *J)V) in place of {R,S,T,U,V). The first inclusion of (11.9.12.1) is now (11.9.12.4) and induction. For (11.9.12.3) observe that if X is complete then f(Tj)=a(Tj)=a(Tj) for each j, giving equality throughout (11.9.12.1). ■ 11.10 ALGEBRAIC AND ESSENTIAL SPECTRA The various kinds of "Taylor spectra" for an n-tuple a G An of normed algebra elements are obtained by applying the conditions of Definition 10.7.1 to the Koszul complex chains (A (a — 5), A (a — s)) G An(A)2, where A (a — s) is given by (11.9.1.3) with x = a — s: 11.10.1 DEFINITION If a G An for a normed algebra A then the Taylor invertible spectrum of a with respect to A is the set is not (left, right)invertible in An(A)2} (11.10.1.1) and the Taylor almost invertible spectrum of a with respect to A is the set (11.10.1.2) a^aylor(a) = {s G Cn : (A(a - s), A(a - s)) is not almost (left, right) invertible in An(A)2} The Taylor almost singular spectrum of a with respect to A is the set f^aylor(a) ={seCn. (A(a _ fi) A(a _ fi)) A v v J v (11.10.1.3) is not (left, right) bounded below in An(A) }
11.10 Algebraic and Essential Spectra 527 The first two concepts are expressible in terms of the operators La G BL(A,A)n, or the operators Ra <E BL(A,A)n: 11.10.2 THEOREM If A is a normed algebra and a G An is a commuting n-tuple then <r£aylor(a) = TT^lor{La) = rT^loT{Ra) (11.10.2.1) and c£*ylor(a) = fTaylor(La) = fTaylor(£a) (11.10.2.2) Proof: For (11.10.2.1) apply the extended versions of (10.7.7.1) and (10.7.7.2) to the Koszul complex (0, An(a - 5),..., A2(a - s),A1{a - s),0) (11.10.2.3) while for (11.10.2.2) use (10.7.7.3) and (10.7.7.4) ■ From the second part of Theorem 11.10.2 we obtain the compactness and nonemptiness, and the spectral mapping theorem, for the "Taylor almost invertible spectrum" of a commuting system of normed algebra elements: for example, using (11.9.10.2), /<^aylorM = A~Taylor(£J = fTaylo7(£J = fTaylor(£/(a)) = *Aayl°7(a) (11.10.2.4) In the special case A = BL(X, X) Definition 11.10.1 reproduces Definition 11.9.8: 11.10.3 THEOREM If A = BL(X, X) and T e An then aTaylor(r) = aTaylor(r) (11.10.3.1) and ^Taylor (r) = ~Taylor(r) (11.10.3.2) and f Taylor^ = fTaylor^ (11.10.3.3) Proof: Equality (11.10.3.1) is given by the observation (11.9.3.2), and equality (11.10.3.2) by the analogue of (11.9.3.7) for almost Taylor invert- ibility. For equality (11.10.3.3) we specialise Theorem 10.4.1 to R = I ■
528 11. Multiparameter Spectral Theory We have no analogue of Theorem 11.10.2 for f Aay or in more general A: the reader will remember from our comments immediately preceding Theorem 10.7.5 that we have not even proved that fAayor(T) is a closed set. The obvious way to define an "essential Taylor spectrum" would be to use Definition 11.10.1 with the Calkin algebra A = B/J, where B = BL(X,X) and either J = KL(X,X) or J = clKL0(X,X); we will prefer to be more "spatial" and use the "essential exactness" of Definition 10.6.1 and Definition 10.6.6: 11.10.4 DEFINTION The Taylor-Fredholmspectrum of T G BL(X, X)n is the set aJsasylor(T) = {seCn : (A(T - si), A(T - si)) is essentially invertible} (11.10.4.1) and the Taylor almost Fredholm spectrum is the set ^ylor(r) ={^Cn: (A(T - si), A(T - si)) ess v ; i v v ;> v ;; (11.10.4.2) is essentially almost invertible} The essential Taylor spectrum is the set fJsaylor(r) ={seCn : (A(T - si),A(T - si)) is nearly essentially exact} (11.10.4.3) and the weak Taylor spectrum is the set *l7iot(T) = {s€Cn: (A(T - si), A(T - si)) (n.10.4.4) is essentially (left, right)one-one} Of course the first two sets coincide for complete spaces, and there is the obvious inclusion between them in general; also when X is complete then Theorem 10.6.4 says that T commutative => aJsasylor(T) = aJsaylor(T) is compact in Cn (11.10.4.5) For complete spaces the "near essential exactness" of Definition 10.6.6 used in the makeup of the essential Taylor spectrum coincides with "essential" and "almost essential" exactness; also by Theorem 10.6.7 T commutative => r^loT{T) = rTaylorp(r) (n.10.4.6) so that the compactness and nonemptiness, and the spectral mapping theorem, for the essential Taylor spectrum follow from the corresponding theorems for the Taylor spectrum applied to the essential enlargement.
11.10 Algebraic and Essential Spectra 529 We can also define an index for a commuting Taylor-Fredholm system of operators, and a Taylor- Weyl spectrum; we need to work with the more complicated version of the Koszul complex: 11.10.5 DEFINITION The Taylor-Weyl spectrum of T G BL(X,X)n is the set "S^'Cn = {^C": (0,A»(r-«I),...,A1(T-«J),0) ^ g is not spatially Weyl} If T G BL(X, X)n is a commuting Taylor-Fredholm system then the Taylor index of T is the number index(T) = Euler(0, An(T),..., A^T^O) (11.10.5.2) Theorem 10.6.4 says that, for complete spaces, the Weyl spectrum of a commuting n-tuple is a compact set; Theorem 10.6.5, extended to these longer chains, says that T Weyl <=> T Fredholm of index zero (11.10.5.3) The openness of the set of Weyl n-tuples in the set of all commuting n- tuples is consistent with the continuity of the index: we prove this only for operators on Hilbert space. We begin with the representation of the Taylor spectrum of a system of Hilbert algebra elements as the ordinary spectrum of a single linear operator: 11.10.6 THEOREM If a G An is a commuting system in a Hilbert algebra A then the following are equivalent: a G An is Taylor invertible with respect to A (11.10.6.1) A (a) + A (a)* is invertible with respect to An(A) (11.10.6.2) □(A) = A(a)*A(a) + A(a)A(a)* is invertible with respect to An(A) (11.10.6.3) a G An is Taylor nonsingular with respect to A (11.10.6.4) Proof: The equivalence of (11.10.6.1), (11.10.6.3) and (11.10.6.4) is by application of Theorem 10.8.5 to the chain (A(a), A(a)) G An(A)2, noting that An(A) = A ® An is also a Hilbert algebra. The equivalence of (11.10.6.2) and (11.10.6.3) follows from □(a) = (A(a) + A(a)*)2 (11.10.6.5)
530 11. Multiparameter Spectral Theory which holds because (A(a),A(a)) is a chain. We sometimes call D(a) the "Laplacian" of cG An. A special case of Theorem 11.10.6 is A = BL(X,X) for a Hilbert space X: more generally if A C B for Hilbert algebras A and B then Theorem 11.10.6 gives us the "spectral permanence" of the Taylor spectrum: if a £ An C. Bn is commutative then rJaylor(a) = r£aylor(a) = a£aylor(a) = a^aylor(a) (11.10.6.6) The reader should note that Theorem 11.10.6 does not say rTaylor(r) = ^-^ What it does say is that rTaylor(T) ={seCn:0e cU{T - si)} (11.10.6.7) There is a similar description of the "Taylor-Fredholm" spectrum: 'eTssyl°rCn = {s€Cn:0e <7essD(r - si)} (11.10.6.8) This is just part of the more comprehensive statement that all possible definitions of "Taylor-Fredholm" coincide on Hilbert space: 11.10.7 THEOREM If T e BL(X,X)n is a commuting system on a Hilbert space X then the following are equivalent: T is Taylor-Fredholm (11.10.7.1) T is essentially Taylor nonsingular (11.10.7.2) T + KL(X, X) is Taylor invertible in BL(X, X)/ KL(X, X) (11.10.7.3) T + KL(X,X) is Taylor nonsingular in BL(X,X)/KL(X,X) (11.10.7.4) P(T) is Taylor nonsingular (11.10.7.5) P(T) is Taylor invertible (11.10.7.6) A(T) + A(T)* is Fredholm (11.10.7.7) D(T) = A(T)*A(T) + A(r)A(T)* is Fredholm (11.10.7.8) Proof: Implication (11.10.7.1) =J> (11.10.7.3) =J> (11.10.7.6) =J> (11.10.7.5) is elementary, and the equivalence (11.10.7.2) <» (11.10.7.5) is (11.10.4.6). Equivalence (11.10.7.3) <» (11.10.7.4) <» (11.10.7.7) <» (11.10.7.8) is an application of Theorem 11.10.6 with A = BL(X,X)/KL(X,X), and finally (10.6.8.9) gives (11.10.7.2) <» (11.10.7.1) ■
11.11 Functional Calculus 531 The Weyl spectrum of a commuting n-tuple T G BL(X,X)n cannot be related to the normal operator D(T), in view of (8.8.5.4). Instead of assembling the Koszul complex of (11.9.1.5) into the single operator A(T) of (11.9.1.4), we put together its "odd" and "even" parts, writing A(T) = 0 Aodd(T) Aeven(T) 0 A°ddP0 LArnP0J where Arn(*). (11.10.7.9) Arw = © KM p even and and A°dd(X) = 0 A'PO p odd (11.10.7.10) Aeven(T) = 0 Ap{T) and Aodd(T) = 0 AP{T) (11.10.7.11) p odd p even where the operators AP(T) : AP_1(X) -► A£(X) are as in Definition 11.9.1. Evidently, according to Definition 11.10.5 and Definition 10.6.3 index(T) = Euler(Aodd(T), Aeven(T), Aodd(T)) (11.10.7.12) 11.10.8 THEOREM If T e BL(X, X)n is a commuting Taylor-Fredholm n-tuple on a Hilbert space X then index(T) = index(Aodd(r) + Aeven(T)*) Proof: This is Theorem 10.8.7 ■ (11.10.8.1) It is now clear, at least on Hilbert space, that the index is a continuous, therefore locally constant, function of commuting Fredholm n-tuples of operators. To extend the continuity of the index to Banach spaces seems to be harder: the boundary result Theorem 10.3.9 seems however to contain the essence of the argument. 11.11 FUNCTIONAL CALCULUS If a G An is a commuting system of Banach algebra elements and / : H —► C is a function "holomorphic" on an open set H C Cn then we can sometimes define an element f{a) G C. If for example the algebra A is commutative and the set H contains the spectrum cA{a) C Cn then the construction of f{a) is called the Silov-Arens-Calderon-Waelbroeck functional calculus; if instead A = BL(X, X) is the algebra of operators on a Banach space X and the set H contains the Taylor spectrum rTaylor(a) then the construction of
532 11. Multiparameter Spectral Theory f(a) is called the Taylor functional calculus. In the special case of a Hilbert space X we have rTaylor(a) = aTaylor(a), and a simplification of Taylor's calculus due to Vasilescu. In this section we attempt a description of the first and the third of these: we begin with a look at "polynomially convex" subsets of Cn. 11.11.1 DEFINITION The polynomially convex hull of a compact set K C Cn is polycvx(fQ = js e Cn : \p{s)\ < max |p| for each p G Polyn j (11.11.1.1) and its rationally convex hull is ratiocvx(fr) = p| {p~lp{K) : p G Polyn} (11.11.1.2) We shall call a set K polynomially convex iff K = polycvx(jK"), and rationally convex iff K = ratiocvx(fr) (11.11.1.4) It is clear that K C ratiocvx(fr) C polycvx(iC) (11.11.1.5) and that the mappings polycvx and ratiocvx are increasing and idempotent on the set of compact subsets of C; also the rationally convex hull of K is the same as the set obtained from (11.11.1.1) by replacing polynomials with rational functions whose denominators do not vanish on K. Something very similar is going on in Definition 11.5.1 leading up to the "Silov boundary". When n = 1 the polynomially and rationally convex hulls simplify: 11.11.2 THEOREM If K C C is a nonempty compact set then ratiocvx(fr) = K (11.11.2.1) and polycvx(fr) = rjK. (11.11.2.2) Proof: Equality (11.11.2.1) is obtained by looking at the particular polynomial p = z. Towards (11.11.2.2), in which rj represents the modified "connected hull" of (9.3.3.8), inclusion one way comes from the extended (11.11.1.3)
11.11 Functional Calculus 533 version of the maximum modulus principle Theorem 9.1.3: if H is a hole in K and s G H then for arbitrary polynomials p we have \p{s)\ < sup \p\ < sup \p\ (11.11.2.3) dH K Conversely suppose s G C \ rjK and suppose that (pn) is a sequence of polynomials for which sup \pn - (z - s)"11 —> 0 as n —> oo (11.11.2.4) K Then for sufficiently large n G N we will have |pn(s)|>sup|pj (11.11.2.5) To see that such a sequence of polynomials exists we need the Hahn-Banach theorem: if X is the normed space of continuous complex-valued functions on rjK and Y is the linear supspace consisting of the polynomials, and if (j> G Y° C X^ is arbitrary in the annihilator of Y, then the mapping 5 —+ <j>{{z - s)'1) : C \ K —+ C (11.11.2.6) is holomorphic, and vanishes for sufficiently large \s\: oo 4>((z - s)-1) = J2 H^s-"-1) = 0 (11.11.2.7) n=0 By (5.4.1.3) from Theorem 5.4.1 the function (z — s)-1 is in the closure of Y m If v = (i/x, i/2,..., vn) G R^ we shall write A(i/) = {seCn : |sy| < i/y for each jG {l,2,...,n}} (11.11.2.8) for "the closed poly disc of multiradius v"\ more generally a polynomial polyhedron is a set of the form (s.p)-^!/,!*) = {seCn: {s,p{s)) G A(i/,M) C Cn+m} (11.11.2.9) induced by multi-indices v G R^, /x G R^ and a polynomial p G Poly^1. As a matter of notation we shall sometimes write (z, w) for the identity on Cn+m = Cn x Cm, and temporarily regard z as the projection of Cn+m onto Cn obtained by keeping the first n coordinates. Polynomial polyhedra are polynomially convex, and can be used to approximate more general polynomially compact sets:
534 11. Multiparameter Spectral Theory 11.11.3 THEOREM If K = polycvx(ff) is compact and polynomially convex and if K C Q = int(H) C A{v) (11.11.3.1) then there are ra E N and \i G Rm and p G Poly™ for which K C (x^p^A^fi) C H (11.11.3.2) Proof: For each s G A(j/) \ H there is a polynomial q3 G Polyn for which \q3\{s)>l>sup\q3\ K and then a neighborhood U3 of 5 in Cn for which \q3{t)\ > 1 for each te U3 and then by compactness a finite subset H of A(i/) \ H for which AW\nc (J u. Now take m to be the cardinal number of the set H and write \xk = 1 for each k G {1,2,...,m} and p G (p1?p2,... ,pm) = {qa)aeH (11.11.3.3) The first inclusion of (11.11.3.2) is clear: to verify the second we consider two cases. If 5 G A{u) \ H then 5 G Ut for some t G H giving \qt{$)\ > 1; if 5 G Cn \ A{v) then |2--(s)| > Vj for some j G {1,2,..., n}. In either case (s,p(s)) ^ A(i/,m). ■ If we marginally increase the multi-radii u and \i we can replace (11.11.3.2) by K C int(2,p)_1A(i/,M) Q (2,p)_1A(i/,m) C Q (11.11.3.4) In one rather special situation the joint spectrum is polynomially convex: 11.11.4 THEOREM If a G An is a commuting n-tuple for which A = cl(alg(a)) (11.11.4.1) then At = cA{a) = polycvxaA(a) (11.11.4.2)
11.11 Functional Calculus 535 Proof: With no assumptions about either A or a G An the mapping aA : <f> —► <f>{a) from A to Cn (11.11.4.3) is continuous and takes A* into the joint spectrum cA{a)\ if a is commutative and generates A then this mapping is also one-one, and onto the spectrum, therefore topologically a homeomorphism. For the second part of (11.11.4.2) suppose that s G Cn is in the polynomially convex hull of crA(a): then for arbitrary p G Polyn we have \p(s)\ < max |p| = max |p| = max|p(a)A| < ||p(a)|| crA(a) aA(A*) A* The mapping p{a) — p{s) : Alg(a) —♦ C (11.11.4.4) is therefore a well-defined bounded linear homomorphism on a dense sub- algebra of A: thus there exists <f> G A* for which p{s) = <t>{p{a)) for each p G Polyn (11.11.4.5) In particular, taking p = z, s = 4>{a) G <JA(a) ■ We observed the isomorphism at the beginning of (11.11.4.2) in Theorem 9.6.6, for the special case of a Hilbert algebra. The next move, known as the "Arens-Calderon trick," partially reduces the joint spectrum of arbitrary commuting a G An to the situation of Theorem 11.11.4: 11.11.5 THEOREM If A is commutative and a G An and if cA{a) C n = int(n) C Cn (11.11.5.1) then there is b G Am for which B = cl(alg(a,6)) => cB(a) C H (11.11.5.2) Proof: If 5 G Cn is not in cA{a) there is c3 G An for which Y^=i C3jiaj ~ 5;) = 1, and, taking C3 = cl(alg(a,c3)), an open set U3 containing s for which teu3=>tgcCs{a) Taking i/- = ||a.|| for each j G {1,2,... ,n}, so that cA{a) C A(i/), the sets (U3) with 5 G A(i/) \ H form an open cover for the compact set A(i/) \ H,
536 11. Multiparameter Spectral Theory so that there is a finite set H C A(v) \ Q for which A(i/)\HC (J U3 (11.11.5.3) seh If we now put * = (0.6* and 5 = cl(*lgM)) = cl (alS ( U Cs) ) (H.ll-5.4) then (11.11.5.2) holds: cA{a) C aB(a) C (J aC-(a) CO I (11.11.5.5) sEH We are ready to start talking about the functional calculus. If H C Cn is open we shall call the function / : H —► C holomorphic if the partial derivatives df/dxj and df/dy^ exist continuously throughout H and satisfy the "Cauchy-Riemann conditions" of (9.1.2.1): df/dyj = idf/dx- throughout H for each j'g{1,2,...,n} (11.11.5.6) If iC C Cn is an arbitrary set, in particular compact, we shall write Ro\o(K) = {feC(K): K J X K J (11.11.5.7) / has a holomorphic extension to some open H 2 K} Thus HoIo(jK') is a subalgebra of C(K), not necessarily closed. When n > 1 the space HoIo(jK') sometimes coincides with Ho\o(K') for some larger Kf: this is known as Hartog's phenomenon. A specific example is given by 2f = A(lll)\intA(i|i) #' = A(1,1) We can extend / <E Holo(iC) to / G Holo(fr') by setting /(*i.*2) = (2^)"1 / ^I'O^i if (*i,0 e K\K' (11.11.5.8) 7 ^i — *i |*i 1=3/4 For commutative algebras, the functional calculus is defined in two stages: the first is very easy, essentially reducing to Definition 9.7.1: 11.11.6 DEFINITION If a e An for a commutative Banach algebra A and if / e Holo A(i/) with cA{a) C int A(i/) (11.11.6.1)
11.11 Functional Calculus 537 then /(a) = (2*t)-B J (■■■( J f(z){Z1-a1)-1..- \zn\=vn \z\\=v\ fo.o.o. {zn-an)~ldzl The reader can easily extend Theorem 9.7.2 to this context: the mapping / —► f{a) is a homomomorphism from Holo A (j/) into A which sends each coordinate function Zj to the corresponding a-. The full statement is more complicated, and relies not only on the Arens-Calderon trick and related matters, but also on the Oka extension theorem, which can be summarized by saying HoloK^p)"^!/,/*)) = (Holo A(i/,/x)) ° iz>P) (0.0.0.2) In words, whenever / is holomorphic near a polynomial polyhedron (^p)-1 A(j/,aO C Cn there is a function g holomorphic near the polydisc A(i/,/x) £ cn+m with /(*) = 9{Z>P{Z))- We offer no Proof of oka's tneo" rem, except to remark that it ultimately derives from Stokes' theorem and the Cauchy integral formula. 0.0.1 DEFINITION If a e An for a commutative Banach algebra A and if /GHoloaA(a) (0.0.1.1) then f{a) = G{a,b,p{a,b)) (0.0.1.2) where b G Am and p G Poly^+m and polycvxaA(a, b) C mi{z,w,p)~l A(i/,/j,, A) (0.0.1.3) and G e Holo A(i/,/x,A) and f(z) = G{z,w,p(z,w)) (0.0.1.4) To see that this is a good definition we suppose that / is holomorphic on an open set H D ^aM* an<^ use ^rs^ the Arens-Calderon trick Theo- dz„
538 11. Multiparameter Spectral Theory rem 11.11.5 to find b G Am for which (a, 6) G An+m is a commuting system with cB{a) C H, where B is generated by (a,6). By Theorem 11.11.4 the set aB(a, 6) C Cn+m is polynomially convex, therefore contains the polynomially convex hull of <rA{a,b). By (11.11.3.4) we can find a polynomial polyhedron {z,w,p)~1A{i/,fi,A) C Cm containing crA(a,b) in its interior, and by Oka's theorem (11.11.6.3), applied in Cn+m, we can find G G Holo A(i/,/j,, A) satisfying (11.11.7.4): then of course we use Definition 11.11.6 in (11.11.7.2). For a commuting system a G An in a noncommutative algebra A we can now write f{a) G A if / G HoloaD(a) with {ay} CDC commA(D) C A (0.0.1.5) Thus /(a) is defined whenever / is holomorphic near the spectrum of a with respect to any commutative subalgebra of A containing the a;-. For the largest family of such holomorphic functions one should make the spectrum crD{a) as small as possible, and hence the algebra D as large as possible: at the same time the algebra D should in some sense be determined by the system a G An. It would be logical to work with a maximal abelian algebra D containing a;-: but apparently different maximal abelian algebras are liable to induce different spectra. The most reasonable candidate would seem to be the double commutant of Definition 7.1.1: D = comm^(a) (0.0.1.6) When A = BL(X, X) is the algebra of operators on a Banach space then Joseph Taylor has discovered how to define f(a) for functions / G HolorTaylor(a): we make no attempt to reproduce his construction, which is based on the "Cauchy-Weil integral." If we specialize to a Hilbert space X then Vasilescu has descovered a simpler formula, based on the "Mar- tinelli kernel": we must extend An(X) to differential forms in dz- and dz -, and write f{a) = / {2m)-nf{z)M{z - a) A dzx A dz2 A • • • A dzn (0.0.1.7) E for a surface E surrounding rTaylor(a), where M{z-a) = {L{a-z)+k{a-zY)-l{dz{k{a-z)+k{a-zY)-1)n-1 (0.0.1.8)
11.11 Functional Calculus 539 It would be attractive to try to extend such a formula to functions holo- morphic near crTaylor(a) on a Banach space X: an extension of Allan's Theorem 11.7.9 to produce "holomorphic(left,right)inverses" might be helpful.
Notes, Comments, and Exercises CHAPTER 1 1.1.1 EXERCISE: Prove that (1.1.1.5) =J> (1.1.1.3). 1.1.3 COMMENT: These conditions characterize "topological vector spaces"; more generally if {Qj)j^.j is a family of seminorms on the linear space X and we declare "U G Nbd(x)" to mean that there is finite J0 C J and e > 0 for which \ y G X: max qAy — x) < e\ QU ^ jeJo J > then we satisfy the conditions (1.3.1). Necessary and sufficient for the "separated" condition Nbd(x) = {x} is CijeJ^i0) = W • 1.5.1 COMMENT: If 0 < t < 1 then it may not be possible to find (yn) in Y with \\yn -x0\\ - dist(x0,y) and ||yj| < ||x0|| + *dist(x0,y): this is unpublished work of Edward Bach. 1.5.3 REFERENCE: Bonsall (1967b). 1.8.0 COMMENT: Minkowski's inequality is the triangle inequality for II*||p (1 < P < o°)- it extends to mappings "NIP=(/l*lp<^ l/p 541
542 Notes, Comments, and Exercises on the space of V-measurable" functions x : H —► K associated with a "positive measure" \i\ S —► [0,oo], defined and "countably additive" on a "a-ring" % C P(Q) of subsets of the set H. The normed space induced by the quasi-seminorm ||-|| is written LJfi) or Lp{fi). More generally, "Young's inequality" leads to "Orlicz spaces." 1.8.2 EXAMPLE: C^{[a,b}) = {x e C{[a,b}) : 3x' <E C{[a,b}), the space of continuously differentiable functions on [a, b] (necessary and sufficient is uniform convergence of the difference quotient), with norm llxlloo^ — INIco+Mco- 1.8.2 EXERCISE: Write down the definition of C^{[a,b}) . 1.9.0 COMMENT: Analogs of c0 and c1 on a topological space H are the spaces C0(H) = C(H) fl c0(H) and C^H) = C(H) fl c^H), where the spaces c0(H) and c1(H) are induced by the bornology of "relatively compact" subsets of H: thus C00(Q) = C(H) flc00(n) comprises the continuous functions "of compact support." 1.9.1.9 REFERENCE: This observation is due to Wolff. 1.9.2 COMMENT: If H is derived from N by substituting a "maximal" bornology for the bornology of finite subsets then Qn{X) is called an "ultraproduct"—such a thing is more of a precision instrument than our crude but more elementary "enlargement": it can be used to "standardize nonstandard analysis." 1.10.0 EXAMPLE: If H is a group then ^(H) is a normed algebra with (a • b){u) = (a * b){u) = ^ a{s)b{t) st=u ("convolution"). The mapping t —► St is a semigroup homomorphism from the group H to the algebra /1(H). CHAPTER 2 2.1.0 EXERCISE: Prove (2.1.0.3). 2.1.1 EXERCISE: Prove that each of the following is continuous, and determine its bound:
Notes, Comments, and Exercises 543 (a) T = U: (i1,i2,i3,...)->(0,i1,i2,...)froml1to/1,froml2tol2 or from l^ to Z^; (b) T = V: (x1,x2,x3,...) -► (x2,x3,x4,...) from lp to lp {p = l,2,oo); (c) T = K: C([a,6]) -► C([a,6]) where (iCx)(0 = f*=ak{s,t)ds for each £ G [a,6] and each x G C([a,6]), where k : [a,6] x [a,6] —► K is continuous; (d) T = S: C([a,6]) -► C([a,6]) where {Sx){t) = f*=ax{s)ds for each (e) T = R: lp —»• /p where (ifo)^ = (xx + x2 + • • • + xn)/n for each nGN,xG/p. 2.3.3 COMMENT: Our slightly aggressive concepts of "kernel" and "co- kernel" come from the Cambridge notes of C. T. C. Wall on homological algebra (1962-3). 2.5.4 REFERENCE: Kato (1966). 2.8.0 EXERCISE: C^{[a,b}) = K x C([a,b}) under the correspondence y <-► (t,x) given by the formulas {t,x) = {y{a),x') and y = t + x CHAPTER 3 3.1.2 COMMENT: T"1 is the unique U for which UT = Ix, IY = TV. 3.1.4 COMMENT: BL_1(X,y) is a topological space in its own right: U e Nbd(T) means U = V fl BL_1(X,y) for some neighborhood V of T in the normed space BL(X, Y)\ it is not necessary that BL_1(X,y) should be an open subset of BL(X, Y). 3.1.4 EXAMPLE (W. H. Ruckle): If X = Y = c00 is the space of "terminating" sequences and T = I : X —► X then T is invertible but J — (l/n)U —► T where U(x1,x2,x3,...) = (0,x1,x2,.. .)> and I — (l/n)U is not onto, since S1 = (1,0,0...) is not in its range. Conclusion: The set of invertible operators need not be an open set in BL(X, Y). 3.2.2 COMMENT: Compare this "homological" characterization of one- one-ness, denseness with the "Wilson cloud chamber," in which unknown
544 Notes, Comments, and Exercises particles are described by being bombarded with other equally unknown particles. 3.3.2 EXERCISE: I - ST bounded below => I-TS bounded below I-ST closed => I-TS closed COMMENT: Usually "closed" means either "closed range" or "closed graph": here we are thinking of "bounded below" as "almost closed." 3.3.5 REFERENCE: Davis/Rosenthal (1974), Choi/Davis (1974). 3.4.2 EXERCISE: I - ST (almost)open => I-TS (almost)open. 3.4.2 EXERCISE: Define the "almost closure" of the range of T to be cl(T,X) = {limTxn : Tx e c{Y) and x <E /^PO} and verify TXCc\{T,X) Ccl(TX) and cl(T,X) = {ye Y:q(y) G Q(T)Q(X)}. 3.4.2 EXERCISE: Call T e BL{X,Y) "almost onto" iff cl(T,X) = Y, so that T onto => T almost onto =^ T dense and T almost open => T almost onto => T dense verify 5, T almost onto => ST almost onto => S almost onto 3.4.5 EXERCISE: Prove Q(T) dense =J> T dense. 3.5.4.1 COMMENT: The missing (3.4.5.8) is that Q(T) dense =J> T dense. 3.7.3 EXERCISE: I - ST left (right) invertible => I - TS left (right) in- vertible.
Notes, Comments, and Exercises 545 3.8.1 REFERENCE: Caradus (1974, 1977, 1978); Taylor/Lay (1980). 3.8.7 REFERENCE: The origin of this is an (incorrect) result of Treese/ Kelley (1977) which was corrected by Gonzalez and generalized by Harte (1987a): the further simplification presented here is due to Gerard Murphy. 3.9.3 EXERCISE: I - ST left (right) almost invertible => I - TS left (right) almost invertible. 3.11.1, 3.11.2 REFERENCE: These are inspired by the generalization of (3.11.3) due to J. L. Taylor (1970a). 3.12.3.2 REFERENCE: The concept of "relief" comes from Mandelkern (1983), who is however working in the context of "constructive analysis." 3.12.3.2 EXERCISE: If [Uj)jeJ is a finite open cover of a normal Haus- dorff space then there is {uj)jej in C(H, [0, l]) for which Y,uj = 1 and cl(n \ uJl(°)) ^ Uj (The set cl(H \ uj1^)) is called the "support" of u;-; the family (u) is called a "partition of unity" for H, "subordinate to" the cover {Uj)jeJ.) CHAPTER 4 4.1.4 REFERENCE: This is an unpublished argument of Edward Bach. 4.2.3.11 This idea is developed and exploited by G. J. O. Jameson (1974). 4.3.3 EXERCISE: C([0,l]) is not complete with respect to IHIx, ||-||2; lx is not complete with respect to ||-||2, ||*||oo C(1)([0,1]) is complete with respect to ||.||W but not y^ 4.4.6 REFERENCE: This argument is taken from H. G. Heuser (1982). 4.4.7 REFERENCE: Fuster/Marquina (1984).
546 Notes, Comments, and Exercises 4.5.1 REFERENCE: This is an unpublished argument of Edward Bach. 4.6.1 REFERENCE: R. P. Boas (1960). 4.7.4 REFERENCE: A. Wilansky (1977). 4.8.4 COMMENT: Much of the application of functional analysis to differential equations is achieved by extending the theory of bounded operators to "densely defined operators with closed graph." CHAPTER 5 5.3.2 EXERCISE: The Hahn-Banach theorem extends to seminorms, and in the real case to "sublinear functional," which may take negative values: for example, the functional limsup: l^ —► R. 5.3.2 EXERCISE (cf. Edwards (1964)): A linear functional on l^ which is dominated by limsup is necessarily monotonic and translation invariant, and is necessarily an extension of lim : c1 —► R: such things are called "Banach limits." 5.3.2 EXERCISE (cf. Edwards (1964)): Use the Hahn-Banach theorem to prove a "monotonic Hahn-Banach theorem," extending monotonic linear functional from subspaces of partially ordered linear spaces, in a monotonic linear fashion, and to prove an "invariant Hahn-Banach theorem," extending linear functional which satisfy f{x) = f{Tx) for certain families of operators T, in an invariant fashion. 5.5.1.2 EXERCISE: A locally convex topological vector space (one in which every neighborhood of 0 contains a convex neighborhood) has its topology given by a family of seminorms as in exercises to (1.3.1)—use the Minkowski functional of absolutely convex neighborhoods of 0. 5.5.1.3 REFERENCE: Taylor/Lay (1980). 5.5.6.2 REFERENCE: Wilansky (1978). 5.6.4 REFERENCE: Dash/Schechter (1970); Fialkow (1983, 1985); Car- illo/Hernandez (1984). 5.7.1 REFERENCE: Albrecht/Mehta (1984).
Notes, Comments, and Exercises 547 5.7.2 REFERENCE: This idea is inspired by Martin Mathieu (Harte/ Mathieu (1986)). 5.8.2 EXERCISE: Do the complex case explicity 5.8.2 EXERCISE: If H is a normal topological space (e.g., [a, b]), identify the quotient l^ (Q)^ / C ^(Q)0 with more conventional representations of Coo(n)t. EXERCISE: If H is a topological group then C^H)* is a normed algebra, where we define convolution multiplication {fi * v){x) = \i{y * x) where (i/*x){t) = v{xt) where xt{s) = x(ts) This contains ^(H) as a subalgebra. 5.9.2 REFERENCE: The most famous example is due to R. C. James (1951). 5.10.1 REFERENCE: Jameson (1974); other examples are in D. J. Newman (1960), Rudin (1983). CHAPTER 6 6.1.3 COMMENT: A quantitative version of this is stated without proof in Banach (1932): the simple proof of this "lemma of Auerbach," due to Ruston, is given by Jameson (1974). 6.4.2 REFERENCE: This argument is given by Taylor/Lay (1980). 6.4.3 REFERENCE: This proof is taken from Taylor/Lay (1980). 6.5.4 RLFERENCE: Variants of this argument are given by Taylor/Lay (1980) and by Caradus/Pfaffenberger/Yood (1974); an alternative "one diagram" ~>roof is h; Y*ng (1973).
548 Notes, Comments, and Exercises 6.7.2.7 EXERCISE: Verify that m(H,X)t ^ M^X*), and identify (mC)(n,X)t, where (mC)(H,X) = C(H,X) fl m(H,X). 6.7.4 REFERENCE: This idea is due to John Buoni. 6.8.3 COMMENT: A Banach space X is said to have the "bounded compact approximation property" if there is k > 0 for which, for each compact subset HCX and each e > 0 there is K G KL(X, X) for which sup \\Kx -x\\<e and \\K - I\\ < k xEH (Lebow/Schechter (1971), Lindenstrauss/Tzafriri (1979)). Necessary and sufficient for X to have this property is that for arbitrary Banach spaces Y the mapping T + KL(X,y) —+P(T) :BL(X,y)/KL(X,y) —+BL(P(X),P(y)) is bounded below (Astala/Tylli (1983)). 6.9.2 COMMENT: A Banach space X is said to have the "subprojective property" if each infinite dimensional closed subspace contains an infinite dimensional closed subspace which is complemented in X (Lebow/Schechter (1971)). If X is subprojective and Y is arbitrary and T G BL(X,Y) then T essentially bounded below => T upper semi-Fredholm with equivalence if in addition X has the bounded compact approximation property (Astala/Tylli (1983)). 6.8.3, 6.9.2 REFERENCE: Caradus/Pfaffenberger/Yood (1974) do the easy part of this with sets instead of sequences; proof of the hard part is due to Wickstead (Buoni/Harte/Wickstead (1977), Harte/Wickstead (1977)). 6.11.5 EXERCISE: Use this to prove (6.11.3) again for complete spaces. 6.12.3 EXERCISE: Extend (6.12.3.1) to "almost upper semi-Fredholm" and (6.12.3.2) to "almost lower semi-Fredholm." 6.12.4 REFERENCE: This comes from Lebow/Schechter (1971). 6.13.2 REFERENCE: This is the "quantitative" version of the compactness condition for the operator (RToU)A (Ls o V), which is given by Bonsall (1967b).
Notes, Comments, and Exercises 549 6.13.2 PROBLEM: Prove Theorem 6.13.2 directly with ||P(T)|| in place of ||T||gSS. Is it possible to improve on the constants? 6.13.2 PROBLEM: Find a converse to Theorem 6.13.2—a quantitative version of a generalization of Theorem 6.13.4? 6.13.5 PROBLEM: Find an analog for "almost essential invertibility." 6.13.5, 6.13.6 REFERENCE: Compare Carillo/Hernandez (1984). CHAPTER 7 7.2.5 COMMENT: For commutative algebras, Hull(J) is usually factored into the composition hull(kernel(J)). 7.3.2 PROBLEM: Is there implication 1-ab <E A'1 A =J> 1-ba <E A"1 A? (Harte (1986)) 7.3.2 PROBLEM: If A = BL(X,X) for a normed space X, write *~A = {a e A.-a-^O) = X/c\(aX)}, so that A'1 A = ^fl A. Is A closed under i i i i multiplication? Conversely, is there implication ST, TS G A =>• S,T G A ? If S,T,ST,TS e A, is there necessarily equality index(5T) = index(S) + index(T)? 7.3.2 EXERCISE: If J : A = BL(X,X) -► B = A/J with J = KLX(X,X) is the Calkin quotient then T"1^-^ U B^ht) C ^ and A"1 + r-1(o) = AJnr-1(51;f1tu5Tg1ht). 7.3.4 REFERENCE: Harte (1987a); the argument here represents an unpublished simplification due to Gerald Murphy. Compare also Rakocevik (to appear). 7.3.6.8 REFERENCE: This is derived from an example of Jameson (1974). 7.3.6.16,17 REFERENCE: These examples are presented by Caradus (1977, 1984). 7.4.5 REFERENCE: This argument is due to Hilden, as presented by Michaels (1977).
550 Notes, Comments, and Exercises 7.6.1 REFERENCE: This is done with J = socle(A) by Barnes/Murphy/Smyth/West (1982), where socle(A) is the sum of the minimal left ideals of A, provided that it is also the sum of the minimal right ideals. Compare also Pearlman (1974), Rowell (1984), and O'Searcoid. 7.6.1 EXERCISE: If A is semisimple, prove that socle(A) C 'A1 contains only "regular" elements, and decide whether or not they must all be "de- composably regular." 7.6.4.9 REFERENCE: This example comes from Gramsch/Lay (1971) and Berberian (1970). 7.6.4.9 COMMENT: If T e BL(X,X) is Weyl then it is possible to decompose T = S + K with invariable S and compact K for which (SK - KS)2 = 0? (Murphy/West (1981), Laffey/West (1982), O'Searcoid). 7.7.4 PROBLEM: Must the ideal socle(A) have the "Riesz property"? 7.7.6.10 REFERENCE: This was noticed by Trevor West. 7.9.2 REFERENCE: Grabiner (1978). 7.10.0.2 EXERCISE: Prove that the connected subsets of R are the intervals. 7.10.3 REFERENCE: Crimmins/Rosenthal (1967) attribute part of this to Parrot; compare also Salinas (1972) and Harte/Wickstead (1981). 7.11.2 REFERENCE: J. L. Taylor (1975); cf. also Harte (1976). 7.13.4 REFERENCE: This account of Zelazko's theorem is due to Roit- man/Sternberg (1981); cf. also Zelazko (1968) and Kahane/Zelazko (1968). 7.13.4 PROBLEM: Extend Theorem 7.13.4 to the "generalized characters" of Allan (I967a,b). CHAPTER 8 8.4.3 COMMENT: If (cy)yeJ and [fkikeK are orthonormal bases for a Hilbert space X then the sets J and K must have the same cardinal number,
Notes, Comments, and Exercises 551 which we can call dim(X), the "dimension" of X. Necessary and sufficient for Hilbert spaces X and Y to be isomorphic is dim(X) = dim(y). 8.5.1 EXERCISE: if K C X is a subspace then (If-1)* = K°. 8.8.1 COMMENT: The rest of the world calls these either "E*-algebras" or "C"-algebras"; for some of the rest of the world a "Hilbert algebra" is a Banach algebra which is also a Hilbert space, in the rather specialized sense of Ambrose (1969). 8.8.5.3,4 COMMENT: The converse of this inoffensive observation is the enormous work of Brown/Douglas/Fillmore (1977) (cf. Halmos (1983)). 8.9.1 EXERCISE: Verify (8.9.1.3)-(8.9.1.5). 8.9.1 REFERENCE: This is the original construction, due to Berberian (1959), which leads to our "enlargement." 8.9.1 EXERCISE: If A = BL(X,X) for a Hilbert space X then, with a" as in Problems 7.3.2, lA = {ae A: dim a"1 (0) = dim(X/cl(aX))} Hence if X is a "separable" Hilbert space—one with a countably infinite basis—and a G A then i i a regular and not semi-Fredholm =>• a G A CHAPTER 9 9.3.1 COMMENT: Compare Taylor/Lay (1980) and Goldberg (1966), who give an older division of the spectrum cr{T) of T G BL(X,X) for a Banach space X into the point spectrum 7rleft(T), the "residual spectrum" fright (T) \ tfieftjr), and the "continuous spectrum" rrieht(T) \ 7rrieht(T). 9.3.3 EXERCISE: Determine a{T) for T = U and T = V, the forward and backward shifts onX = /l5X = /2. What about the analogs on /X(Z) or Z2(Z), the "bilateral" shifts? 9.3.4 EXERCISE: If T = P = P2 g {0,1} then a(T) = {0, l}.
552 Notes, Comments, and Exercises 9.3.4 EXERCISE: The spectral mapping theorem fw = wf for polynomials / fails with w = da, w = r\o and u; = dr\o (Harte/Wickstead (1981)). 9.5.2 REFERENCE: Lorch (1962). 9.5.4 EXERCISE: If a G A'1 for a Banach algebra A then sup{e>0:{a-*:*eC,|*| < e} C A'1} = lira (dist(an, A \ A_1)1/n : cf. Makai/Zemanek (1983), Zemanek (1984, 1985). 9.6.5 EXERCISE: Show that it is sufficient if each element of A is "conservative" in the sense of (7.4.4.6). 9.7 COMMENT: The Silov Idempotent Theorem says that if a{A) is not connected then A g {0,1} (Silov (1955)). 9.7.3 EXERCISE: What do you understand by length(r)? 9.8.3 EXERCISE: If T = U*SU + K with unitary U {U* = U'1), compact K, and normal S then <?es3{T) = u;ess(T). 9.8.7 REFERENCE: This was first established for Riesz operators by West (1966b) and then extended by Stampfli (1974); generalizations are due to Apostol (1976), Davidson/Herrero (1986), and O'Searcoid. 9.8.8.14 REFERENCE: This example is due to Gillespie/West (1968). Chui/Smith/Ward (1976) show that a Riesz operator T on a Hilbert space X can be decomposed T = S + K with compact K and quasinilpotent 5, SK-KS. COMMENT: Call T e BL(X,X) "diagonal" if X = ciErecC71-*7)"1!0) with complemented null spaces (T-tl)-1^): then if X is a Hilbert space there is implication T diagonal =^ T normal. The converse is known as the Berg/Sikonia theorem (Berg (1971), Sikonia (1971)), a precursor of the result of Brown/Douglas/Fillmore (1977). 9.9.6 COMMENT: It is the sequence of events leading up to this theorem which is enshrined in the tentative-sounding terminology A*-, J5*-, C*-: cf. Dixmier (1977), Kadison/Ringrose (1983), Goodearl (1982). 9.10.1 REFERENCE: Bonsall/Duncan (1971/73, 1973).
Notes, Comments, and Exercises 553 CHAPTER 10 10.1.1, 10.1.2 REFERENCE: Douglas (1966), Embry (1971). 10.2.1 REFERENCE: "Interpolations" were christened by Coburn/ Schechter (1968). 10.2.5 COMMENT: These two special cases were noticed by O'Searcoid. EXERCISE: y G cl(T,X) ^ Ry almost co-majorized by T. 10.3.9 REFERENCE: Wrobel (1987) established this for complete spaces; Cho/Takaguchi (1981) and Curto (1981) for Hilbert spaces. 10.3.9 EXERCISE: Specialize (10.3.9) to give a direct proof of (5.5.6). 10.4.3 REFERENCE: Dash (1973b); Dash/Schechter (1970). 10.4.6 REFERENCE: Slodkowsky (1977). 10.5.2.9 REFERENCE: This example is due to Wickstead (Harte (1978)). 10.5.3 REFERENCE: Kitadani (1982), Harte (1978). 10.5.5 REFERENCE: Cf. Embry (1971). 10.5.5.6 COMMENT: The right hand here is another of the 35 possible combinations of the conditions of Definitions 10.1.1 and 10.1.2 which could have been added to the list of Definition 10.2.1. 10.5.5.7 REFERENCE: This example is similar to one given by Bouldin (cf. Embry (1971)). 10.6.7 REFERENCE: Albrecht/Mehta (1984); Fainstein (I980b,c). 10.8.1 EXERCISE: If A = BL(X, X) for a Hilbert space X then A'1 = Aq1 is connected (cf. Rudin (1973), Kuiper (1965)). This was used in (9.4.4.5).
554 Notes, Comments, and Exercises 10.8.1 EXERCISE: If A = BL(X,X) for a Hilbert space X then A C A~1A+ C cl(A_1) (since positive operators are Hermitian, therefore in cUA"1)). REFERENCE: Feldman/Kadison (1954), Bouldin (1982), Izumi- ne (1979). 10.8.3 REFERENCE: Vasilescu (1977, 1978); Curto (1981). 10.9.5,6 REFERENCE: Slodkowsky (1977). CHAPTER 11 11.1.1 REFERENCE: Coburn/Schechter (1968). 11.1.2 EXERCISE: Determine aleft(T), <7rieht(T), rleft(T), rrisht(T), 7rleft(T), 7rrisht(T), with T e BL(X,X)n given by (a) T = (ri5r2) = {U,V) where U and V are the unilateral shifts on lx or on l2, (b) the corresponding bilateral shifts, (c) if Z\ = S and T2 = P = P2 satisfies SP = PSP or PS = PSP. 11.1.4 EXERCISE: If A and B are commutative Banach algebras and TeBBL{A,B) then T^{BX) = Ax <=> for each n G N,a <E An,aA{a) = cB{Ta) (Corach/Suarez, 1987). 11.2.2 EXERCISE: Define the "commutant" and "double commutant" spectra of a <E An as <^omm(a) = aB{a) and <^bc(a) = aD{a) with B = commA(a), D = comm^(o), and verify that they are both compact, possibly empty, with /<^bc(a) C a^bc/(a) for polynomials / G Poly™; cf. Coburn/Schechter (1968), Dash (1973b), Dash/Schechter (1970), Harte (1973c). 11.2.6 EXERCISE: If T G BL(X,X) commutes with P = P2 <E BL(X,X) then 0>=1(T) is the spectrum of the restriction of T to its invariant subspace P{X). 11.3.3 REFERENCE: Slodkowsky (1977) gives essentially this argument; the idea of using the enlargement comes from Davis/Rosenthal (1974) and Choi/Davis (1974).
Notes, Comments, and Exercises 555 11.3.4 REFERENCE: Originally proved for operators on Hilbert space by Bunce (1971); extended to Banach algebras for the left and right spectrum by Harte (I972a,b). The argument for approximate point spectra was extended to operators on Banach spaces by Davis/Rosenthal (1974) and Choi/Davis (1974) and to Banach algebras by Zelazko/Slodkowsky (1974). 11.3.4 EXERCISE: Give another derivation of (9.5.3.5) and (9.5.3.6). 11.4 REFERENCE: Clarke (1975), Curto (to appear), Zelazko (1979), Slodkowsky (1977), Harte. 11.5 REFERENCE: Wermer (1971), Zelazko (1970), Arens (1961), Harte (1975a). EXERCISE: If a e A then daA{a) C dA{a) (Harte) 11.6.10 REFERENCE: Davis/Rosenthal (1970), Embry/Rosenblum (1974), Harte (1974). EXERCISE: Do the analog of (11.6.10) directly for 3x3 matrices (Harte (1971)). 11.7 REFERENCE: Grothendieck (1955); Shchatten (1950); Brown/ Pearcy (1966); Ichinose (I978a,b). 11.7.7 REFERENCE: Halmos (1982); Harte (1985). 11.7.7.8 COMMENT: This remark uses (3.10.4.1), (6.2.6.5) and (6.2.6.1). 11.7.7.13 COMMENT: The union of the cA (ayy) need not coincide with the spectrum of the upper triangular matrix c with respect to the full matrix algebra. 11.7.8 REFERENCE: Allan (I967a,b); this extends to m-tuples of functions. 11.8.2 REFERENCE: It is the first part of this which is really the "Kleinecke-Sirokov theorem" (Kleinecke (1957), Sirokov (1956)); the second part is due to Singer/Wermer (1955). This derivation (sic) of the Singer-Wermer result is due to Gerard Murphy. 11.8.2.7 EXERCISE: Prove that (n!)"1/n -► 0 as n -► oo.
556 Notes, Comments, and Exercises 11.8.3 EXERCISE: Extend (9.5.3.5) and (9.5.3.6) to quasicommuting pairs. 11.8.6 PROBLEM: Find a pair T = {Tl9T2) of quasi-commuting upper triangular 5x5 matrices and a system of polynomials / for which f{T) is not quasi-commutative. 11.9.8 REFERENCE: J. L. Tayor (I970a,b); Vasilescu (1977, 1982); Curto (to appear). 11.9.8 REFERENCE: Eschmeier calls our "Taylor invertible spectrum" the "split spectrum." 11.9.8 REFERENCE: Hamet Isaev (to appear) relates the Taylor spectrum to the "multi-parameter spectral theory" of Atkinson (1968, 1972, 1977), Browne (I972a,b; 1974a,b; 1977), and Binding (1980, 1981, 1982a,b,d). 11.9.9 EXERCISE: dfTaylor(T) C fle{t{T) U frisht(T) (Wrobel (1987), Cho-Takaguchi (1981). Curto (1986, 1987)); this fails for triples (Curto (1986)) and cannot be improved to replace the union of the left and right spectra by their intersection (Wrobel (1987)). 11.9.10 EXERCISE: Prove directly /u;Taylor(T) C u;Taylor/(T) with u = f,a and / e Poly^ (Harte (1981), Fainstain (1987)). 11.9.10 COMMENT: An alternative derivation of the spectral mapping theorem for the Taylor spectrum uses Zelazko's Theorem 9.6.8 (Zelazko/ Slodkowski (1974)). 11.9.12 REFERENCE: Rynne (1987); Wrobel (1986b); Eschmeier (1986, 1987). 11.10.1 REFERENCE: For noncommutative algebras consult J. L. Taylor (1973), M. E. Taylor (1968), Anderson (1969), Albrecht (1982), Nelson (1970). 11.10.3 EXERCISE: If r = (r1,r2,r3,r4) 0 d/dz1 0 0 0 d/dz2 0 0
Notes, Comments, and Exercises 557 on the space X = C{V) x C^{V) with V = closure ((3D x 3D) \ (D x D)) then <7dbc(T) ± rTaylor(T) and /adbc(T) ^ adhcf{T). Cf. J. L. Taylor (1970a), Slodkowsky/Zelazko (1974), Harte (1973c). 11.10.5 REFERENCE: If T G BL(X, X)n is "Taylor-Weyl" can we write T = S + K with "invertible" S and compact K1 11.10.5 PROBLEM: What should one mean by "Taylor-Browder" ? Compare Schechter/Snow (1975), Snow (1975), Buoni/Dash/Wadhwa (1981), Curto/Dash (1987). In particular, do any of these definitions admit "commuting decompositions" T = S + K with invertible 5, compact K1 11.10.5 PROBLEM: Prove that index(T) is a continuous function of commuting Taylor-Fredholm systems T G BL{X,X)n: this is proved for Hilbert spaces by Curto (1986, 1987) and for unbounded operators on Ba- nach spaces (using "gap theory") by Vasilescu (1979) and by Albrecht/Vasi- lescu (1974), but one might hope for argument as in Theorem 10.3.9 or Wrobel (1987). 11.11.6.3 REFERENCE: Wermer (1971) gives an account of the Oka extension theorem. For holomorphic functions consult Krantz (1982), Henkin/Leiterer (1984), Hormander (1975), Gunning/Rossi (1965); the functional calculus for commutative Banach algebras is expounded in Wermer (1971), Bourbaki (1967), Curto (to appear). The Taylor calculus for Hilbert spaces is given by Vasilescu (1982) and Curto (to appear). EXERCISE: If T = {T^T2) is given by Tx = V <g> /, T2 = V <g> (U - U2V) + J <g> U2V where U and V are the forward and backward shifts then (0,0 G iso(aleft(T) U aT'^ht(T)) \ accaTaylor(T): thus the union of the left and the right spectrum cannot possibly support a functional calculus (Curto (1986)).
References Albrecht, E. (1977). Generalized spectral operators. Proceedings Paderborn Conference on Functional Analysis. North Holland Math. Stud. 27: 259-277. Albrecht, E. (1979). On joint spectra. Studia Math. 64: 263-271. Albrecht, E. (1982). Several Variables Spectral Theory in the Non-commutative Case. Banach Center Publications Warsaw, Poland, vol. 8, Spectral Theory, pp. 9-30. Albrecht, E., and Frunza, S. (1976). Non-analytic functional calculi in several variables. Manuscripta Math. 18: 327-336. Albrecht, E., and Mehta, R. D. (1984). Some remarks on local spectral theory. J. Op. Theory 12: 285-317. Albrecht, E., and Vasilescu, F-H. (1974). Non-analytic local spectral properties in several variables. Czech. Math. J. 24: 430-443. Albrecht, E., and Vasilescu, F-H. (1983). Semi-Fredholm complexes. Op. Theory: Adv. Appl. 11: 15-39. Alexander, J. C. (1968). Compact Banach algebras. Proc. London Math. Soc. 3: 18. Allan, G. R. (1967a). On one-sided inverses in Banach algebras of holomorphic vector-valued functions. J. London Math. Soc. 42: 463-470. Allan, G. R. (1967b). Holomorphic vector-valued functions on a domain of holomorphy. J. London Math. Soc. 42: 509-513. Ambrose, W. (1969). Structure theorems for a special class of Banach algebras. Trans. Amer. Math. Soc. 137. Anderson, R. F. V. (1969). The Weyl functional calculus. J. Fund. Anal. 4: 240-247. Apostol, C. (1976). Inner derivations with closed range. Rev. Roum. Math. Pures Appl. 21: 249-265. 559
560 References Apostol, C, Fialkow, L., Herrero, D., and Voiculescu, D. (1984). Approximation of Hilbert Space Operators, II. Research Notes in Math., vol. 102. Pitman, Boston, London, Melbourne. Arens, R. (1960). Extensions of Banach algebras. Pacific J. Math. 10: 1-16. Arens, R. (1961). The analytic functional calculus in commutative topological algebras. Pacific J. Math. 11: 405-429. Arens, R., and Calderon, A. P. (1955). Analytic functions of several Banach algebra elements. Annals. Math. 62: 204-216. Astala, K. (1980). On measures of noncompactness and ideal variations in Banach spaces. Ann. Acad. Sci. Fenn. Ser. A I Math. Diss. 29: 1-42. Astala, K., and Tylli, H-O. (1983). On the bounded compact approximation property and measures of non-compactness, Universitat Jyvaskyla, Helsinki. Atkinson, F. V. (1951). The normal solvability of linear equations in normed spaces. Mat. Sbornik 28: 70. Atkinson, F. V. (1968). Multiparameter spectral theory. Bull. Amer. Math. Soc. 74: 1-27. Atkinson, F. V. (1972). Multiparameter Eigenvalue Problems, Vol. I: Matrices and Compact Operators, Academic Press, New York. Atkinson, F. V. (1977). Deficiency index theory in the multi-parameter Sturm-Liouville case, Proc. International Conference on Differential Equations, Uppsala, 1977, Sympos. Univ. Upsaliensis Ann. Quingent Celebr. No. 7. Almquist and Wiksell, Stockholm, pp. 1-40. Aupetit, B. (1979). Proprietes Spectrales des Algebres de Banach, Lecture Notes in Mathematics 735. Springer, Berlin. Aupetit, B., and Gardner, L. T. (1981). On a theorem of S. Berberian and I. Halperin. C. R. Math. Rep. Acad. Sci. Can. 3: 229-233. Banach, S. (1932). Theorie des Operations Lineaires. Monografje Matem- atyczne. Warsaw. Barnes, B. A. (1968). A generalised Fredholm theory for certain maps in the regular representation of an algebra. Can. J. Math. 20: 495-504. Barnes, B. A. (1969). The Fredholm elements of a ring. Can. J. Math. 21: 84-95. Barnes, B. A. (1983). Inverse closed subalgebras and Fredholm theory. Proc. R. Irish Acad. 83(A): 217-224. Barnes, B. A., Murphy, G. J., Smyth, M. R. F., and West, T. T. (1982). Riesz and Fredholm Theory in Banach Algebras. Pitman, London. Berberian, S. K. (1959). Note on a theorem of Fuglede and Putnam. Proc. Amer. Math. Soc. 10: 403-404. Berberian, S. K. (1961). Introduction to Hilbert Space, Oxford University Press, New York. Berberian, S. K. (1962a). A note on operators unitarily equivalent to their adjoints. J. London Math. Soc. 37: 403-404.
References 561 Berberian, S. K. (1962b). Approximate proper vectors. Proc. Amer. Math. Soc. 13: 111-114. Berberian, S. K. (1970). The Weyl spectrum of an operator. Indiana Univ. Math. J. 20: 529-544. Berg, I. D. (1971). An extension of the Weyl-von Neumann theorem to normal operators. Trans. Amer. Math. Soc. 160: 365-368. Bernau, S.J. (1965). The spectral theorem for normal operators. J. London Math. Soc. 40: 478-486. Bernau, S. J., and Smithies, F. (1963). A note on normal operators. Proc. Camb. Philos. Soc. 59: 727-729. Binding, P. (1980). Another positivity result for determinantal operators. Proc. R. Soc. Edinburgh 86A: 333-337. Binding, P. (1981). Multiparameter definiteness conditions. Proc. R. Soc. Edinburgh 89A: 319-332. Binding, P. (1982a). Multiparameter definiteness conditions. II. Proc. R. Soc. Edinburgh 93A: 47-61. Binding, P. (1982b). On a problem of B. D. Sleeman. J. Math. Anal. Appl. 85: 291-307. Erratum: (1982c). J. Math. Anal. Appl. 90: 270-271. Binding, P. (I982d). Left definite multiparameter eigenvalue problems. Trans. Amer. Math. Soc. 272: 475-486. Binding, P., and Browne, P. J. (1977). A variational approach to multiparameter eigenvalue problems for matrices. SIAM J. Math. Anal. 8: 763-770. Binding, P., and Browne, P. J. (1978a). A variational approach to multiparameter eigenvalue problems in Hilbert space. SIAM J. Math. Anal. 9: 1054-1067. Binding, P., and Browne, P. J. (1978b). Positivity results for determinantal operators. Proc. R. Soc. Edinburgh 81A: 267-271. Boas, R. P. (1960). A Prime of Real Functions. Carus Math. Monograph 13, Wiley, New York. Bollobas, B. (1984). To what extent can the spectrum of an operator be diminished under an extension. In Linear and Complex Analysis Problem Book, Lecture Notes in Math. 1043, V. P. Havin, S. V. Hru§£ov, and N. K. Nikolskij (Eds.) Springer-Verlag, Berlin, Heidelberg, New York, Tokyo. Bonsall, F. F. (1967a). Compact operators from an algebraic standpoint. Glasgow Math. J. 8. Bonsall, F. F. (1967b). Compact Linear Operators. Yale University Lecture Notes, New Haven, CT. Bonsall, F. F. (1969). Operators that act compactly on an algebra of operators. Bull. London Math. Soc. 1. Bonsall, F. F., and Duncan, J. (1971/73). Numerical Ranges of Operators on Normed Spaces and Elements of Normal Algebras, Vol. I and II, Cambridge University Press, Cambridge.
562 References Bonsall, F. F., and Duncan, J. (1973). Complete Normed Algebras. Springer, Berlin. Bouldin, R. H. (1981). The essential minimum modulus. Indiana Univ. Math. J. 30: 513-517. Bouldin, R. H. (1982). The instability of non-semi-Fredholm operators under compact perturbations. J. Math. Anal. Appl. 87: 632-638. Bourbaki, N. (1960). Theorie spectrales. Hermann, Paris. Browder, F. E. (1961). On the spectral theory of elliptic differential operators I. Math. Ann. 142. Brown, A., and Pearcy, C. (1966). Spectra of tensor products of operators. Proc. Amer. Math. Soc. 17: 162-166. Brown, L. G., Douglas, R. G., and Fillmore, P. A. (1977). Extensions of C*-algebras and ff-homology. Ann. Math. 105: 265-324. Browne, P. J. (1972a). A multiparameter eigenvalue problem. J. Math. Anal. Appl. 38: 553-568. Browne, P. J. (1972b). A singular multiparameter eigenvalue problem in second order ordinary differential equations. J. Differential Equations 12: 81-94. Browne, P. J. (1974a). Multiparameter spectral theory. Indiana Univ. Math. J. 24: 249-257. Browne, P. J. (1974b). Multiparameter problems, Proc. 1974 Dundee Conference on Ordinary and Partial Differential Equations. Springer- Ver- lag Lecture Notes in Math. 415: 78-84. Browne, P. J. (1977). Abstract multiparameter theory, I. J. Math. Anal. Appl. 60: 259-273. Browne, P. J., and Sleeman, B. D. (1985). Inverse multiparameter eigenvalue problems for matrices. Proc. R. Soc. Edinburgh 100A: 29-38. Bunce, J. (1971). The joint spectrum of commuting non normal operators. Proc. Amer. Math. Soc. 29: 499-505. Buoni, J. J., Dash, A. T., and Wadhwa, B. L. (1981). Joint Browder spectrum. Pacific J. Math. 94: 259-263. Buoni, J., and Klein, A. (1979). On the generalized Calkin algebra. Pacific J. Math. 80: 9-12. Buoni, J., Harte, R., and Wickstead, A. (1977). Upper and lower Fredholm spectra. Proc. Amer. Math. Soc. 66: 309-314. Calkin, J. W. (1941). Two-sided ideals and congruences in the ring of bounded operators in Hilbert space. Ann. Math. 2: 42. Caradus, S. R. (1966). Operators of Riesz type. Pacific J. Math. 18: 61-71. Caradus, S. R. (1974). Operator Theory of the Pseudo-inverse. Queen's Papers in Pure and Applied Mathematics no. 38. Queen's University, Kingston, Ontario, Canada. Caradus, S. R. (1977). Perturbation theory for generalized Fredholm operators II. Proc. Amer. Math. Soc. 62: 72-6.
References 563 Caradus, S. R. (1978). Generalized Inverses and Operator Theory. Queen's Papers in Pure and Applied Mathematics no. 50. Queen's University, Kingston, Ontario, Canada. Caradus, S. R., Pfaffenberger, W. E., and Yood, B. (1974). Calkin Algebras and Algebras of Operators on Banach Spaces. Dekker, New York. Carrillo, A., and Hernandez, C. (1984). Spetra of constructs of a system of operators. Proc. Amer. Math. Soc. 91: 426-432. Ceausescu, Z., and Vasilescu, F-H. (1978). Tensor products and the joint spectrum in Hilbert spaces. Proc. Amer. Math. Soc. 72: 505-508. Chadwick, J., and Wickstead, T. (1977). A quotient of ultrapowers of Banach spaces and semi-Fredholm operators. Bull. London Math. Soc. 9: 321-325. Cho, M. (1986). Joint spectra of operators on Banach space. Glasgow Math. J. 28: 69-72. Cho, M., and Dash, A. T. (1985). On the Joint Spectra of Doubly Commuting n-tuples of semi-normal operators. Glasgow Math. J. 26: 47-50. Cho, M., and Takaguchi, M. (1979). Joint spectra of matrices. Sci. Rep. Hirosaki Univ. 26: 15-19. Cho, M., and Takaguchi, M. (1981). Boundary of Taylor's joint spectrum for two commuting operators. Sci. Rep. Hirosaki Univ. 28: 1-4. Cho, M., and Takaguchi, M. (1982). Identity of Taylor's joint spectrum and Dash's joint spectrum. Studia Math. 70: 225-229. Choi, M-D., and Davis, C. (1974). The Spectral mapping theorem for joint approximate spectrum. Bull. Amer. Math. Soc. 80: 317-321. Chui, C. K., Smith, P. W., and Ward, J. D. (1976). A note on Riesz operators. Proc. Amer. Math. Soc. 60: 92-94. Cigler, J., Losert, V., and Michor, M. (1979). Banach Modules and Functions on Categories of Banach Spaces. Dekker, New York. Clark, D. N. (1970). On commuting contractions. J. Math. Anal. Appl. 32 590-596. Clarke, D. J. (1975). A joint spectrum applied to commutative Banach algebras. J. London Math. Soc. 10: 143-152. Coburn, L. A., and Schechter, M. (1968). Joint spectra and interpolation of operators. J. Fund. Anal. 2: 226-237. Colojoara, I., and Foias, C. (1968). Theory of Generalized Spectral Operators, Math, and its Applications Vol. 9. Gordon and Breach, New York, London, Paris. Conway, J. B., and Morrel, B. B. (1981). Operators that are points of spectral continuity, II. Integral Eq. Op. Theory 4: 459-503. Conway, J. B., and Morrel, B. B. (1987). Roots and logarithms of bounded operators on Hilbert spaces, J. Fund. Anal. 70: 171-193. Corach, G., and Suarez, F. D. (1987). Extensions of characters in commutative Banach algebras, Studia Math. 85: 199-202.
564 References Crimmins, T., and Rosenthal, P. (1967). On the decomposition of invariant subspaces. Bull. Amer. Math. Soc. 73: 97-99. Curto, R. E. (1980). On the connectedness of invertible n-tuples, Indiana Univ. Math. J. 29: 393-406. Curto, R. E. (1981). Fredholm and invertible n-tuples of operators. The deformation problem. Trans. Amer. Math. Soc. 266: 129-159. Curto, R. E. (1981). Spectral inclusion for doubly commuting subnormal n-tuples. Proc. Amer. Math. Soc. 83: 730-734. Curto, R. E. (1982). Spectral permanence for joint spectra. Trans. Amer. Math. Soc. 270: 659-665. Curto, R. E. (1983). The spectra of elementary operators Indiana Univ. Math. J. 32: 193-197. Curto, R. E. (1986). Connections between Harte and Taylor spectra. Rev. Roum. Math. Pures Appl. 31: 203-215. Curto, R. E. Spectral systems with the projection property (preprint). Curto, R. E. (in press). Applications of several complex variables to multiparameter spectral theory. Lecture Notes in Math., Pitman, Boston, London, Melbourne. Curto, R. E., and Dash, A. T. (1987). Browder spectral systems (preprint). Curto, R. E., and Fialkow, L. (1987). The spectral picture of left and right multiplications. J. Fund. Anal. 71: 371-392. Curto, R. E., and Fialkow, L. A. Elementary operators with symbols in a function algebra (preprint). Curto, R. E., and Herrero, D. A. (1985). On closures of joint similarity orbits. Integral Eq. Op. Theory 8: 489-556. Curto, R. E., and Muhly, P. S. (1985). C*-Algebras of multiplication operators on Bergman spaces. J. Fund. Anal. 64: 315-329. Curto, R. E., and Salinas, N. (1984). Generalized Bergman kernels and the Cowen-Douglas theory. Amer. J. Math. 106: 447-488. Dash, A. T. (1973a). Tensor products and joint numerical range. Proc. Amer. Math. Soc. 40: 521-526. Dash, A. T. (1973b). Joint spectra. Studia Math. 45: 225-237. Dash, A. T. (1976). Joint essential spectra. Pacific J. Math. 64: 119-128. Dash, A. T. (1985). Joint Browder spectra and tensor products. Bull. Austral. Math. Soc. 32: 119-128. Dash, A. T., and Schechter, M. (1970). Tensor products and joint spectra. Israel J. Math. 8: 191-193. Davidson, K. A., and Herrero, D. A. (1986). Decomposition of Banach space operators. Indiana Univ. Math. J. 35: 333-343. Davis, C, and Rosenthal, P. (1974). Solving linear operator equations. Can. J. Math. 26: 1384-1389. Dixmier, J. (1977). C*-algebras. North Holland, Amsterdam. Douglas, R. G. (1966). On majorization, factorization and range-inclusion of operators in Hilbert space. Proc. Amer. Math. Soc. 17.
References 565 Douglas, R. G. (1972). Banach Algebra Techniques in Operator Theory. Academic Press, New York. Dowson, H R. (1978). Spectral Theory of Linear Operators. Academic Press, London. Dunford, N., and Schwartz, J. T. (1958). Linear Operators, Part I. Wiley- Interscience, New York. Edwards, D. A. (1964). Introduction to Functional Analysis. Lehigh University Notes. Embry, M. R. (1971). A comection between commutativity and separation of spectra of linear operators. Acta Sci. Math. (Szeged) 32: 235-237. Embry, M. R. (1973). Factorization of operators on Banach spaces. Proc. Amer. Math. Soc. 38: 587-590. Embry, M., and Rosenblum, M. (1974). Spectra, tensor products and linear operator equations. Pacific J. Math. 53: 95-107. Eschmeier, J. (1982). Local properties of Taylor's analytic functional calculus. Invent. Math. 68: 103-116. Eschmeier, J. (1983a). On two notions of the local spectrum for several commuting operators. Mich. Math. J. 30: 245-248. Eschmeier, J. (1983b). Equivalence of decomposability and 2-decomposabil- ity for several commuting operators. Math. Ann. 262: 305-312. Eschmeier, J. (1984). Some remarks concerning the duality problem for decomposable systems of commuting operators. Operator Theory: Advances and Applications, Proceedings of the 8th Conference on Operator Theory. Timisoara and Herculane. Basel, Boston, Stuttgart. Eschmeier, J. (1986). Tensor products and elementary operators (preprint). Eschmeier, J. (1987). Analytic spectral mapping theorems for joint spectra. Operator Theory: Advances and Applications, vol. 24, Birkhauser Verlag, Basel, Boston, Stuttgart, pp. 167-181. Eschmeier, J. (in press). Analytic index formulas for tensor product systems. Proc. R. Irish Acad. Eschmeier, J. (in press). Are commuting systems of decomposable operators decomposable? J. Op. Theory. Eschmeier, J., and Putinar, M. (1984). Spectral theory and sheaf theory III. J. reine angew. Math. 354: 150-163. Famshtam, A. S. (1978). On the joint spectrum of families of linear operators, (Russian). Izv. Akad. Nauk Azerb. SSR 5: 11-15. Famshtam, A. S. (1980a). On the joint spectrum of families of linear operators (Russian). Fund. Analiz. Priloj. 14: 83-84. Famshtam, A. S. (1980b). Stability of Fredholm complexes of Banach spaces with respect to perturbations that are small in a-norm (Russian). Izv. Akad. Nauk Azerb. SSR 1: 3-8. Famshtam, A. S. (1980c). Joint essential spectrum of a family of linear operators. Funkt. Analie Ego Priloz. 14: 83-84. English translation in Fund. Anal. Appl. 14: 152-153.
566 References Fa?nshta?n, A. S. (1984). Fredholmness and index of functions of left and right multiplication operators (Russian). Dokl. Akad. Nauk Azerb. SSR 40: 3-7. Fainshtain, A. S. (1987). A spectral mapping theorem for the Taylor spectrum (Russian). Izv. Akad. Nauk Azerb. SSR 8: 212-236. Fainshtain, A. S., and Isaev, H. A. (in press). The Taylor spectrum and the multiparameter spectral problem for systems of operators. Feldman, J., and Kadison, R. V. (1954). The closure of the regular operators in a ring of operators. Proc. Amer. Math. Soc. 5: 909-916. Fialkow, L. A. (1978). A note on the operator X —► AX — AB. Trans. Amer. Math. Soc. 243: 147-168. Fialkow, L. A. (1980). Elements of spectral theory for generalized derivations. J. Op. Theory 3: 89-113. Fialkow, L. A. (1981a). Elements of spectral theory for generalized derivations. II: The semi-Fredholm domain. Can. J. Math. 33: 1205-1231. Fialkow, L. A. (1981b). Essential spectra of elementary operators. Trans. Amer. Math. Soc. 267: 157-174. Fialkow, L. A. (1983). Spectral properties of elementary operators. Acta Set. Math. (Szeged) 46: 269-282. Fialkow, L. A. (1985). Spectral properties of elementary operators. II. Trans. Amer. Math. Soc. 290: 415-429. Fialkow, L. A. (1986). The index of an elementary operator. Indiana Univ. Math. J. 35: 73-102. Fillmore, P. A., Stampfli, J. G., and Williams, J. P. (1972). On the essential numerical range, essential spectrum and a problem of Halmos. Acta Sci. Math. (Szeged) 33: 179-192. Fillmore, P., and Williams, J. P. (1971). On operator ranges. Adv. Math. 7: 254-281. Foias, C, and Mlak, W. (1966). The extended spectrum of completely non- unitary contractions and the spectral mapping theorem. Studia Math. 26: 239-245. Fong, C-K., and Soltysiak, A. (1985). Existence of a multiplicative functional and joint spectra. Studia Math. 81: 213-200. Fong, C. K., and Sorour, A. R. (1979). On the operator identity £ AkXBk = 0. Can. J. Math. 31: 845-857. Frunza, S. (1975). The Taylor spectrum and spectral decompositions. J. Fund. Anal. 19: 390-421. Frunza, S. (1976). Spectral decompositions and duality. Illinois J. Math. 20: 314-321. Frunza, S. (1977). A characterization for the spectral capacity of a finite system of operators. Czech. Math. J. 27: 356-362. Frunza, S. (1978). On the localization of the spectrum for systems of operators. Proc. Amer. Math. Soc. 69: 233-239.
References 567 Frunza, S. (1983). On the spectral semidistance and quasinilpotent equivalence for systems of operators. J. Op. Theory 10: 57-60. Fuster, R., and Marquina, A. (1984). Geometric series in incomplete normed algebras. Amer. Math. Monthly 91: 49-51. Gamelin, T. (1969). Uniform Algebras. Prentice-Hall, Englewood Cliffs, N.J. Gillespie, T. A, and West, T. T. (1968). A characterization and two examples of Riesz operators. Glasgow Math. J. 9 (2): 106-110. Gohberg, I. C. (1951a). On linear equations in normed spaces. Dokl. Akad. Nauk SSR 76: 477-480 (Russian). Gohberg, I. C. (1951b). On linear equations depending analytically on a parameter. Dokl. Akad. Nauk SSR 78: 629-632 (Russian). Gohberg, I. C. and Krein, M. G. (1957). The basic propositions on defect numbers, root numbers and indices of inear operators. Uspekhi Mat. Nauk SSSR 12(2): 74, 43-118 (Russian). Amer. Math. Soc. Transl. 13(2) (1960): 185-265. Gohberg, I. C, and Krein, M. G. (1969). Introduction to the theory of linear nonselfadjoint operators. Transl. Math. Monographs 18, Amer. Math. Soc, Providence, RI. Gohberg, I. C, Markus, A. S., and Feldman, I. A. (1960). Normally solvable operators and ideals associated with them. Bui. Akad. Stiince Rss Moldoven. 10(76): 51-69 (Russian). Amer. Math. Soc. Transl. 61(2) (1967): 63-84. Goldberg, S. (1966). Unbounded Linear Operators with Applications. McGraw-Hill, New York. Gonzalez, M. (in press). A perturbation result for generalized Fredholm operators in the boundary of the group of invertible operators. Proc. R. Irish Acad. Sect. A. Gonzalez, M. Null spaces and ranges of polynomials of operators. Goodearl, K. R. (1982). Notes on Real and Complex C*-Algebras. Shiva. Grabiner, S. (1978). Ascent, descent, and compact perturbations. Proc. Amer. Math. Soc. 71: 79-80. Gramsh, B., and Lay, D. (1971). Spectral mapping theorems for essential spectra. Math. Ann. 192: 17-32. Gravner, J. (1986). A note on elementary operators on the Calkin algebra. Proc. Amer. Math. Soc. 97: 79-86. Grosu, C, and Vasilescu, F-H. (1982). The Kiinneth formula for Hilbert complexes. Integral Eq. Op. Theory 5: 1-17. Grothendieck, A. (1955). Produits tensoriels topologiques et espaces nucleases. Mem. Amer. Math. Soc. 16. Gunning, R., and Rossi, H. (1965). Analytic functions of several complex variables. Prentice-Hall, Englewood Cliffs, N.J. Halmos, P. R. (1982). A Hilbert Space Problem Book. Springer, New York.
568 References Halmos, P. R. (1983). BDF or the principal axis theorem. Not. Amer. Math. Soc. 30: 387-391. Halperin, I. (1981). On a theorem of Sterling Berberian. C. R. Math. Rep. Acad. Sci. Can. 3: 33-35. Harte, R. E. (1965). A generalization of the Hahn-Banach theorem. J. London Math. Soc. 40: 283-287. Harte, R. E. (1966). A theorem of isomorphism. Proc. London Math. Soc. 16: 753-765. Harte, R. E. (1972a). The spectral mapping theorem in several variables. Bull. Amer. Math. Soc. 78: 870-874. Harte, R. E. (1972b). Spectral mapping theorems. Proc. R. Irish Acad. Sect. A 72: 89-107. Harte, R. E. (1973a). The spectral mapping theorem for quasi-commuting systems. Proc. R. Irish Acad. Sect. A 73: 7-18. Harte, R. E. (1973b). Relatively invariant systems and the spectral mapping theorem. Bull. Amer. Math. Soc. 79: 138-142. Harte, R. E. (1973c). Tensor products, multiplication operators and the spectral mapping theorem. Proc. R. Irish Acad. 73A: 285-302. Harte, R. E. (1973d). Spectral mapping theorems on a tensor product. Bull. Amer. Math. Soc. 79: 367-372. Harte, R. E. (I973e). The spectral mapping theorem in many variables. Proceedings of the Symposium Uniform Algebras, University of Aberdeen, Aberdeen, U.K., pp. 59-63. Harte, R. E. (1974). Commutivity and separation of spectra II. Proc. R. Irish Acad. 74 (A): 239-244. Harte, R. E. (1975a). A Silov boundary for systems? Proceedings of the Symposium uAlgebras in Analysis," pp. 268-271, University of Birmingham. Academic Press, New York. Harte, R. E. (1975b). A problem of mixed interpolation. Math. Z. 143: 149-153. Harte, R. E. (1976). The exponential spectrum in Banach algebras. Proc. Amer. Math. Soc. 58: 114-118. Harte, R. E. (1978). Berberian-Quigley and the ghost of a spectral mapping theorem. Proc. R. Irish Acad. 78(A): 63-68. Harte, R. E. (1981). Invertibility, singularity, and Joseph L. Taylor. Proc. R. Irish Acad. 81A: 71-79. Harte, R. E. (1982). Fredholm theory relative to a Banach algebra homo- morphism. Math. Z. 179: 431-436. Harte, R. E. (1984a). Spectral projections. Irish Math. Soc. Newsletter no. 11: 10-15. Harte, R. E. (1984b). Almost open mappings between normed spaces. Proc. Amer. Math. Soc. 90: 243-249. Harte, R. E. (1984c). A quantitative Schauder theorem. Math. Z. 185: 243- 245.
References 569 Harte, R. E. (1985). A matrix joke. Irish Math. Soc. Newsletter no. 14: 13-16. Harte, R. E. (1986). Fredholm, Weyl and Browder theory. Proc. R. Irish Acad. 85(A): 151-176. Harte, R. E. (1987a). Regular boundary elements. Proc. Amer. Math. Soc. 99: 328-330. Harte, R. E. (1987b). Almost exactness in normed spaces. Proc. Amer. Math. Soc. 100: 257-265. Harte, R. E,, and Mathieu, M. (1986). Enlargements of almost open mappings. Proc. Amer. Math. Soc. 96: 247-248. Harte, R. E., and O'Searcoid, M. (1986). Positive elements and the B* condition. Math. Z. 193: 1-9. Harte, R. E., and Wickstead, A. (1977). Upper and lower Fredholm spectra II. Math. Z. 154: 253-256. Harte, R. E., and Wickstead, A. W. (1981). Boundaries, hulls, and spectral mapping theorems. Proc. R. Irish Acad. Sect. A 81: 201-208. Hastings, W. (1978). Commuting subnormal operators simultaneously qua- sisimilar to unilateral shifts. Illinois J. Math. 22: 506-519. Henkin, G. M., and Leiterer, J. (1984). Theory of functions on complex manifolds, Birkhaiiser. Hernandez, C. (1985). A note on the spectral mapping theorem. Studia Math. 83. Herrero, D. A. (1982). Approximation of Hilbert space operators I, Research Notes in Math. 72, Pitman. Herrero, D. A. (1983). Economical compact perturbations. I: Erasing normal eigenvalues. J. Op. Theory 10: 289-306. Heuser, H. G. (1982). Functional Analysis. Wiley, New York. Hochwald, S. H. and Morrel, B. B. (1987). Some consequences of left in- vertibility, Proc. Amer. Math. Soc. 100: 109-110. Hogan, D. A., and Langenhop, C. E. (1975). Invertibility in a Banach algebra. Indiana Univ. Math. J. 24: 965-977. Hormander, L. (1973). An Introduction to Complex Analysis in Several Variables. North-Holland, Amsterdam. Ichinose, T. (1978a). Spectral properties of tensor products of linear operators. I. Trans. Amer. Math. Soc. 235: 75-113. Ichinose, T. (1978b). Spectral properties of tensor products of linear operators. II: The approximate point spectrum and Kato essential spectrum. Trans. Amer. Math. Soc. 237: 223-254. Isaev, H. A. (1976a). The separation of the numerical range of multiparameter spectral problems (Russian). Izv. Akad. Nauk Azerb. SSR 5: 2-13. Isaev, H. A. (1976b). On multiparameter spectral theory. Soviet Math. Dokl. 17: 1004-1007.
570 References Isaev, H. A. (1977). The numerical range and the spectrum of joint holo- morphic operator functions of several complex variables (Russian). Uspekhi Mat. Nauk. 32: 253-257. Isaev, H. A. (1979). Topics in the Theory of Self-Adjoint Multiparameter Problems, SpectraVnaya teorija operatorov. Trudy II. Bsesojnznoi let- nei shkoly. Zagulba, 1975 (Russian). "Elm", Baku. (1979), pp. 87-102. Isaev, H. A. (1980a). On root elements of multiparameter spectral problems. Soviet Math. Dokl. 21: 127-130. Isaev, H. A. (1980b). Introduction to General Multiparameter Spectral Theory, (Russian), Spectral Theory of Operators, vyp. 3, Baku, "Elm" (1980), pp. 142-201. Isaev, H. A. (1981a). Expansions in eigenfunctions of self-adjoint singular multiparameter differential operators. Soviet Math. Dokl. 24: 326-330. Isaev, H. A. (1981b). On the theory of deficiency indices for multiparamter differential operators of Sturm-Liouville type. Soviet Math. Dokl. 24: 580-583. Isaev, H. A. (1983). Genetic operators and multiparameter spectral theory. Soviet Math. Dokl. 27: 148-152. Isaev, H. A. Lectures on multiparameter spectral theory (preprint). Isaev, H. A., and Almamedov, M. S. (1984). On a class of multiparameter spectral problems, Materialy Confer, po pricl. matem. i mech., Kirovabad, 1984, "Elm," Baku, pp. 13-14. Isaev, H. A., and Famshtam, A. S. (1980). On Joint Spectra of Finite Commutative Families (Russian), Spectral Theory of Operators, vyp. 3, "Elm," Baku, pp. 222-257. Izumino, S. (1979). Inequalities on operators with index zero. Math. Japon. 23: 565-572. Izumino, S., and Kato, Y. (1985). The closure of invertible operators on a Hilbert space. Acta Sci. Math. 49: 321-327. James, R. C. (1951). A nonreflexive Banach space isometric with its second conjugate space. Proc. Nat. Acad. Sci. 53: 559-566. Jameson, G. J. O. (1974). Topology and Normed Spaces. Chapman and Hall, London. Janas, J. (1983). Spectral inclusion theorems for commuting subnormal pair. Soc. Math. Pol. Series I: Comment Math. 23: 219-229. Kaashoek, M. A. (1967). Ascent, descent nullity and defect, a note on a paper by A. E. Tayor. Math. Ann. 172: 105-116. Kaashoek, M. A., and Lay, D. C. (1967). On operators whose Fredholm set is the complex plane. J. Math. 21: 275-278. Kaashoek, M. A., and Smyth, M. R. F. (1972). On operators T such that f{T) is a Riesz or meromorphic. Proc. R. Irish Acad. 72(A). Kadison, R. V., and Ringrose, J. R. (1983). Fundamentals of the Theory of Operator Algebras I. Academic Press.
References 571 Kahane, J-P., and Zelazko, W. (1968). A characterization of maximal ideals in commutative Banach algebras. Studia Math. 29: 339-343. Kallstr6m, A., and Sleeman, B. D. (1976). Solvability of a linear operator system. J. Math. Anal. Appl. 55: 785-793. Kaniel, S., and Schechter, M. (1963). Spectral theory for Fredholm operators. Comm. Pure Appl. Math. 16: 423-448. Kato, T. (1958). Perturbation theory for nullity, deficiency and other quantities of linear operators. J. Anal. Math. 6: 261-322. Kato, T. (1966). Perturbation Theory for Linear Operators. Springer-Verlag, Berlin. Kitadani, S-I. (1982). On majorization of induced operators. TRU Math 18: 139-143. Kleinecke, D. (1957). Almost-finite, compact, and inessential operators. Proc. Amer. Math. Soc. 14: 863-868. Krantz, S. (1982). Function Theory of Several Complex Variables, Wiley, New York. Krein, M. G. (1958). Integral equations on a half-line with kernel depending upon the difference of the arguments. Uspekhi Mat. Nauk 13, no. 5 (83), 3-120 (Russian). A mer. Math. Soc. Transl. (2)22 (1962). Krein, M. G., Krasnosel'skii, M. A., and Mil'man, D. C. (1948). On the defect of linear operators in Banach space and on some geometric problems. Sbornik Trud. Inst. Mat. Akad. Nauk Ukr. SSR 11: 97-112 (Russian). Kuiper, N. H. (1965). The homotopy type of the unitary group of Hilbert space, Topology 3: 19-30. Laffey, T. J., and West, T. T. (1982). Fredholm commutators. Proc. R. Irish Acad. 82A: 129-140. Lange, R., and Shulberg, G. Spectra associated with invariant operator ranges (preprint). Laurie, C, and Radjavi, H. (1980). On the West decomposition of Riesz operators. Bull. London Math. Soc. 12: 130-132. Lay, D. C. (1968). Characterizations of the essential spectrum of F. E. Browder. Bull. Amer. Math. Soc. 74. Lay, D. C. (1970). Spectral analysis using ascent, descent, nullity and defect. Math. Ann. 184: 197-214. Lebow, A., and Schechter, M. (1971). Semigroups of operators and measures of non-compactness. J. Fund. Anal. 7: 1-26. Lindenstrauss, J., and Tzafriri, L. (1977). Classical Banach spaces I, sequences spaces. Springer-Verlag, Berlin, Heidelberg, New York. Lindenstrauss, J., and Tzafriri, L. (1979). Classical Banach spaces II, function spaces. Springer-Verlag, Berlin, Heidelberg, New York. Lomonosov, V. I. (1973). On invariant subspaces of families of operators commuting with a completely continuous operator. Fund. Anal. Apl. 7: 55-56 (Russian).
572 References Lorch, E. R. (1962). Spectral Theory. Oxford. Lumer, G., and Rosenblum, M. (1959). Linear operator equations. Proc. Amer. Math. Soc. 10: 32-41. MacNerney, J. S. (1962). Hermitian moment sequences. Trans. Amer. Math. Soc. 103: 45-81. Makai, E., and Zemanek, J. (1983). The surjectivity radius, packing numbers and boundedness below for linear operators, Int. Eq. and Op. Theory 6: 372-384. Manderkern, M. (1983). Constructive continuity. Mem. Amer. Math. Soc. 277. Martensen, E. (1987). Riesz theory without the axiom of choice. Proc. Amer. Math. Soc. 99: 496-500. Martin, D. H., and Swart, J. A characterization of semi-Fredholm operators defined on almost reflexive Banach spaces (preprint). Mathieu, M. (1983). Spectral theory for multiplication operators on C*- algebras. Proc. R. Irish Acad. 83A: 231-249. Mathieu, M. (1984a). A remark on Elementary Operators. Semesterbericht Funktionalanalysis WS 83/84, Tubingen, pp. 239-247. Mathieu, M. (1984b). A characterization of positive multiplications on C*- algebras. Math. Japon. 29: 375-382. Mathieu, M. (in press). Elementary operators on prime C*-algebras. Mathieu, M. (in press). On Approximate Point Spectra of Elementary Operators. Semesterbericht Funktionalanalysis WS 85/86. Mathieu, M. On rings of quotients of ultraprime Banach algebras. With applications to elementary operators (preprint). Mattila, K. (1978). Normal operators and proper boundary points of the spectra of operators on a Banach space. Ann. Acad. Sci. Fenn. Ser. A. I. Diss. 19: 5-48. McCoy, N. H. (1934). On quasicommutative matrices. Trans. Amer. Math. Soc. 36: 327-340. Mcintosh, A., and Pryde, A. (1985). A functional calculus for several commuting operators, CMA-R46-85, Australian National University. Mcintosh, A., Pryde, A., and Ricker, W. Comparison of joint spectra for certain classes of commuting operators (preprint). Michaels, A. J. (1977). Hilden's simple proof of Lomonosov's invariant sub- space theorem, Adv. Math. 25: 56-58. Muller, V. (1977). On discontinuity of the spectral radius in Banach algebras. Comment Math. Univ. Carolinae 18: 591-597. Muller, V. (1979). On domination problem in Banach algebras. Comment Math. Univ. Carolinae. 20 (3): 475-481. Muller, V. (1982). On domination and extensions of Banach algebras. Studia Math. 73. Muller, V. (in press). On quasialgebraic operators in Banach spaces. J. Op. Theory.
References 573 Murphy, G. J. (1980). The Cortex and the Approximate Zero Divisors of an Algebra. Proc. R. Irish Acad. 80A(l): 63-68. Murphy, G. J. (1982). Lifting Sets and the Calkin algebra. Glasgow Math. J. 23: 83-84. Murphy, G. J. (1982). Triangularizable algebras of compact operators. Proc. Amer. Math. Soc. 84: 354-356. Murphy, G. J. (1982). Diagonality in C*-algebras (preprint). Murphy, G. J. Derivations and quasinilpotents (to appear) Studia Math. Murphy, G. J., and West, T. T. (1979). Spectral radius formulae. Proc. Edinburgh Math. Soc. 22: 271-275. Murphy, G. J., and West, T. T. (1980a). Decomposition algebras of Riesz operators. Glasgow Math. J. 21: 75-79. Murphy, G. J., and West, T. T. (1980b). Removing the interior of the spectrum. Comment Math. Univ. Carolinae. 21: 421-431. Murphy, G. J., and West, T. T. (1981). Decomposition of index-zero Fred- holm operators. Proc. R. Irish Acad., Ser A 81: 49-54. Nashed, M. Z. (1976). Generalized Inverses and Applications. Academic Press, New York. Nelson, E. (1970). A functional calculus for non-commuting operators. Functional Analysis and related fields: Proceedings of a conference in honor of Marshall Stone U of Chicago 1968, Springer-Verlag. Newburg, J. D. (1951). On the variation of spectra. Duke Math. J. 18: 165-176. Newmann, D. J. (1960). The nonexistence of projections from L1 to H1. Proc. Amer. Math. Soc. 11: 692. Nussbaum, R. D. (1970). The radius of the essential spectrum. Duke Math. J. 37: 473-478. Oberai, K. K. (1974). On the Weyl spectrum. Illinois J. Math. 18: 208-212. Oberai, K. K. (1980). Spectral mapping theorem for essential spectra. Rev. Roum. Math. Pures Appl. 45: 365-373. O'Searcoid, M. (1985). A Note on a comment of Axler and Shapiro. Irish Math. Soc. Newsletter no. 15: 52-56. O'Searcoid, M. (in press). Nilpotent decomposition of left and right Fred- holm elements of a prime ring. Proc. R. Irish Acad. O'Searcoid, M. (submitted). A contribution to the solution of the compact correction problem for operators on a Banach space. J. London Math. Soc. O'Searcoid, M. (in press). The continuity of the semi-Fredholm index. Proc. Edinburgh Math. Soc. O'Searcoid, M. (in press). Economical finite rank perturbation of semi- Fredholm operators. Math. Z. Pearlman, L. D. (1974). Riesz points of the spectrum of an element in a semi-simple Banach algebra. Trans. Amer. Math. Soc. 193.
574 References Pelczynski, A. (1960). Projections in certain Banach spaces. Studia Math. 19: 209-228. Pietsch, A. (1978). Operator Ideals. Deutscher Verlag der Wissenschaften, Berlin. Potra, F. A., and Ptak, V. (1984). Nondiscrete Induction and Interative Processes, Research Notes in Math., no. 103. Pitman, New York. Putinar, M. (1982a). Some invariants for semi-Fredholm systems of essentially commuting operators. J. Op. Theory 8: 65-90. Putinar, M. (1982b). The superposition property for Taylor's functional calculus. J. Op. Theory 7: 149-155. Putinar, M. (1983a). Uniqueness of Taylor's functional calculus. Proc. Amer. Math. Soc. 89: 647-650. Putinar, M. (1983b). Spectral theory and sheaf theory. I. Op. Theory: Adv. Appl. 11: 283-297. Putinar, M. (1984a). Spectral inclusion for subnormal n-tuples. Proc. Amer. Math. Soc. 90: 405-406. Putinar, M. (1984b). Functional calculus and the Gelfand transformation. Studia Math. 79: 83-86. Putinar, M. (1985). Base change and the Fredholm index. Integral Eq. Op. Theory 8: 674-692. Putinar, M. Spectral theory and sheaf theory. II (preprint). Putinar, M., and Vasilescu, F-H. (1983). Continuous and analytic invariants for deformations of Fredholm complexes. J. Op. Theory 9: 3-26. Putinar, M., and Vasilescu, F-H. The local spectral theory needs Frechet spaces (preprint). Radjavi, H., and Rosenthal, P. (1973). Invariant Subspaces. Springer-Verlag, New York. RakoCevic, V. (1981). On one subset of M. Schechter's essential spectrum. Mat. Vesnik. 5: 389-391. RakoCevic, V. (1984). On the essential approximate point spectrum. II. Mat. Vesnik. 5: 89-97. RakoCevic, V. (1985). On a class of operators. Mat. Vesnik. 37: 423-426. RakoCevic, V. (1986). Approximate point spectrum and commuting compact perturbations. Glasgow Math. J. 28: 193-198. RakoCevic, V. (submitted). Operators obeying a-Weyl's theorem. Rev. Roum. Math. Pures Appl. RakoCevic, V. Moore-Penrose inverse in Banach algebras. Proc. R. Irish Acad, (to appear). Ransford, T. J. (1985). Generalised spectra and analytic multivalued functions J. London Math. Soc. Read, C. J. (1986). Inverse producing extensions of a Banach algebra which eliminates the residual spectrum of one element. Trans. Amer. Math. Soc. 286: 715-725.
References 575 Rickart, C. E. (1960). General Theory of Banach Algebras. Van Nostrand, Princeton, N.J. Ringrose, J. R. (1962). Superdiagonal forms for compact linear operators. Proc. London Math. Soc. 3(12): 41-56. Ringrose, J. R. (1971). Compact Non-Self-Adjoint Operators. Mat. Stud. 35. Van Nostrand Reinhold, New York. Roitman, M., and Sternberg, Y. (1981). When is a linear functional multiplicative? Trans. Amer. Math. Soc. 149: 111-124. Rosenblum, M. (1956). On the operator equation BX — XA = Q. Duke Math. J. 23: 263-269. Rosenblum, M. (1958). On a theorem of Fuglede and Putnam. J. London Math. Soc. 33: 376-377. Rowell, J. W. (1984). Unilateral Fredholm theory and unilateral spectra. Proc. R. Irish Acad. 84A: 69-85. Rowell, J. W. (1986). Spectral collapse. J. London Math. Soc. 34: 479-488. Rowell, J. W. Subdirect ascent in Banach algebras (preprint). Rudin, W. (1973). Functional Analysis. McGraw-Hill, New York. Rudol, K. (1983). On spectral mapping theorems. J. Math. Anal. Appl. 97: 131-139. Ruston, A. F. (1954). Operators with a Fredholm theory. J. London Math. Soc. 29. Ruston, A. F., (1986). Fredholm Theory in Banach Spaces. Cambridge Tracts in Mathematics 86. Cambridge University Press, Cambridge, MA. Rynne, B. P. (in press). Tensor products and Taylor's joint spectrum in Banach spaces. Proc. R. Irish Acad. Salinas, N. (1972). Operators with essentially disconnected spectrum. Acta Sci. Math. 33. Schatten, R. (1950). A theory of cross-spaces. Ann. Math. Stud. 26. Schechter, M. (1966). On the essential spectrum of an arbitrary operator. J. Math. Anal. Appl. 13. Schechter, M. (1967). Basic theory of Fredholm operators. Ann. Scuola Nomale Sup. di Pisa, Classe di Sci. 21. Schechter, M. (1968). Riesz operators and Fredholm perturbations. Bull. Amer. Math. Soc. 64: 1139-1144. Schechter, M. (1969a). On perturbations of essential spectra. J. London Math. Soc. 1(2): 343-347. Schechter, M. (1969b). On the spectra of operators on tensor products. J. Fund. Anal. 4: 95-99. Schechter, M. (1971). Principles of Functional Analysis. Academic Press, New York. Schechter, M., and Snow, M. (1975). The Fredholm spectrum of tensor products. Proc. R. Irish Acad. Sect. A 75: 121-128.
576 References Sikonia, W. (1971). The von Neumann converse of Weyl's theorem. Indiana Univ. J. 21: 121-124. Silov, G. (1955). On the decomposition of a normed ring into a direct sum of ideals. Amer. Math. Soc. Transl. 1: 37-48. Simmons, G. F. (1965). Introduction to Topology and Modern Analysis. McGraw-Hill, New York. Singer, I. M., and Wermer, J. (1955). Derivations on commutative normed algebras. Math. Ann. 129: 260-264. Sirokov, F. (1956). Proof of a conjecture of Kaplansky. Uspekhi Mat. Nauk 11: 167-168. Sleeman, B. D. (1973). Singular linear differential operators with many parameters. Proc. R. Soc. Edinburgh 71(A): 199-232. Sleeman, B. D. (1978a). Multiparameter spectral theory in Hilbert space. Pitman Press, London. Sleeman, B. D. (1978b). Multiparameter spectral theory in Hilbert space. J. Math. Anal. Appl. 65: 511-530. Slodkowski, Z. (1977). An infinite family of joint spectra. Studia Math. 61: 239-255. Slodkowski, Z., and Wojtynski, W., and Zemanek, J. (1977). A note on quasinilpotent elements of a Banach algebra. Bull. Acad. Polon. Sci., Sir. Sci. Math. Astronom. Phys. 25: 131-134. Slodkowski, Z., and Zelazko, W. (1974). On joint spectra of commuting families of operators. Studia Math. 50: 127-148. Smyth, M. R. F. (1974a). A note on the action of an operator on its cen- traliser. Proc. R. Irish Acad. 74(A). Smyth, M. R. F. (1974b). Ideals of algebraic elements of a Banach algebra, Trinity College, Dublin, School of Mathematics (preprints 11). Smyth, M. R. F. (1975a). Fredholm theory in Banach algebras, Trinity College, Dublin, School of Mathematics (preprints 8). Smyth, M. R. F. (1975b). Riesz theory in Banach algebras. Math. Z. 145: 144-155. Smyth, M. R. F. (1976c). Riesz algebras. Proc. R. Irish Acad. 76 (A): 327- 333. Smyth, M. R. F. (1980). On problems of Olubummo and Alexander. Proc. R. Irish Acad. 80 (A): 69-74. Smyth, M. R. F., and West, T. T. (1975). The spectral radius formula in quotient algebras. Math. Z. 145: 157-161. Snow, M. (1975). A joint Browder essential spectrum. Proc. R. Irish Acad. Sect. A 129-131. Stampfli, J. G. (1974). Compact perturbations, normal eigenvalues and a problem of Salinas. J. London Math. Soc. 2: 9. Taylor, A. E., and Lay, D. C. (1980). Introduction to Functional Analysis. Wiley, New York.
References 577 Taylor, J. L. (1970a). A joint spectrum for several commuting operators. J. Fund. Anal. 6: 172-191. Taylor, J. L. (1970b). Analytic functional calculus for several commuting operators. Acta Math. 125: 1-38. Taylor, J. L. (1972). A general framework for a multi-operator functional calculus. Adv. Math. 9: 183-252. Taylor, J. L. (1973). Functions of several noncommuting variables. Bull. Amer. Math. Soc. 79: 1-34. Taylor, J. L. (1975). Banach Algebras and Topology, Algebras in Analysis. Academic Press, New York. Taylor, M. E. (1968). Functions of several self-adjoint operators. Proc. Amer. Math. Soc. 19: 91-98. Treese, G. W., and Kelly, E. P. (1977). Generalised Fredholm operators and the boundary of the maximal group. Proc. Amer. Math. Soc. 67: 123-128. Vala, K. (1964). On compact sets of compact operators. Ann. Acad. Sci. Fenn. AI. Vala, K. (1968). Sur les elements compacts d'une algebre normee. Ann. Acad. Sci. Fenn. AI. Vasilescu, F-H. (1977). A characterization of the joint spectrum in Hilbert spaces. Rev. Roum. Math. Pures Appl. 22: 1003-1009. Vasilescu, F-H. (1978). On pairs of commuting operators. Studia Math. 62: 203-207. Vasilescu, F-H. (1979). Stability of the index of a complex of Banach spaces. J. Op. Theory 2: 247-275. Vasilescu, F-H. (1982). Analytic Functional Calculus and Spectral Decompositions. D. Reidel Publishing Company, Dordrecht and Editura Acl- ademiei, Bucure§ti. Vesentini, E. (1968). On the subharmonicity of the spectral radius. Boll. Union. Mat. Ital. Voiculescu, D. (1976). A non-commutative Weyl-von Neumann theorem. Rev. Roum. Math. Pures Appl. 21: 97-113. Volkmer, H. (1982). On multiparameter theory. J. Math. Anal. Appl. 86: 44-53. Waelbroeck, L. (1954). Le calcul symbolique dans les algebres commuta- tives. J. Math. Pures Appl. 33(9): 147-186. Waelbroeck, L. (1982). The Holomorphic Functional Calculus as an Operational Calculus. Banach Center Publications, Vol. 8 (Spectral Theory), pp. 513-552. Wermer, J. (1971). Banach Algebras and Several Complex Variables. Mark- ham. West, T. T. (1966a). Riesz operators in Banach spaces. Proc. London Math. Soc. 16(3).
578 References West, T. T. (1966b). The decomposition of Riesz operators. Proc. London Math. Soc. 16(3). West, T. T. (in press). A Riesz-Schauder theorem for semi-Fredholm operators. Proc. R. Irish Acad. West, T. T. (to appear). Removing the jump—Kato's decomposition, Rocky Mountain J. Whitley, R. I. (1964). Strictly singular operators and their conjugates. Trans. Amer. Math. Soc. 113: 252-261. Wilansky, A. (1964). Functional Analysis. Blaisdell. Wilansky, A. (1978). Modern Methods in Topological Vector Spaces. McGraw-Hill, New York. Wrobel, V. (1985). Superdiagonal forms for related linear operators. Glasgow Math. J. 26: 19-23. Wrobel, V. (1986a). Multi-dimensional spectral theory of bounded linear operators in locally convex spaces. Math. Ann. 275: 409-423. Wrobel, V. (1986b). Tensor products of linear operators in Banach spaces and Taylor's joint spectrum. J. Op. Theory 16: 273-283. Wrobel, V. (in press). The boundary of Taylor's joint spectrum for a pair of commuting Banach space operators. Studia Math. Yang, K.-W. (1973). Index of Fredholm Operators. Proc. Amer. Math. Soc. 41: 329-330. Yang, K.-W. (1976). The generalized Fredholm operators. Trans. Amer. Math. Soc. 216. Yang, K.-W. (1977). Operators invertible modulo the weakly compact operators. Pacific J. Math. 71. Yood, B. (1951). Properties of linear transformations preserved under addition of a completely continuous tranformation. Duke Math. J. 18: 599-612. Zame, W. (1976). Existence and uniqueness of functional calculus homo- morphisms. Bull. Amer. Math. Soc. 82: 123-125. Zelazko, W. (1968). A characterization of multiplicative linear functionals in complex Banach algebras. Studia Math. 30: 83-85. Zelazko, W. (1970). A characterization of Silov boundary in function algebras. Prace Mat. 14: 59-64. Zelazko, W. (1972). On a certain class of non-removable ideals in Banach algebras. Studia Math. 44: 87-92. Zelazko, W. (1973). Banach Algebras. Elsevier, New York. Zelazko, W. (1979). An axiomatic approach to joint spectra, I. Studia Math. 64: 250-261. Zemanek, J. (1977). A note on the radical of a Banach algebra. Manuscripta Math. 20. Zemanek, J. (1984). The semi-Fredholm radius of a linear operator, Bull. Polish Acad. Sci. 32: 67-76.
References 579 Zemanek, J. (1985). The stability radius of a semi-Fredholm operator, Int. Eq. and Op. Theory 8: 137-144. Zemanek, J. (in press). The essential spectral radius and the Riesz part of the spectrum, Functions, Series, Operators, Proc. Internat. Conf. Budapest, 1980. Colloq. Math. Janos Boylai.
Notation K, R, C ll-ll dist(x,y) Disc(x\S) Int, CI Nbd dK = {c\K)\{mtK) dist{x,K) X/Y Y®pZ ~ = Hoc Np ip(n,x) <f>0 x coo(n,x) c0(n,x) cx(n,x) QnPO Q(X) A-1 L(x,y) = BL{X,Y). LK(X,Y) = BLK(X,y) 1 2 3 3 3 3 3 4 6 11 12 12 14 15 15 15 17 17 17 19 19 21 26 26 Ix JY ker(T) Ky coker(T) core(T) col(5,T) row(5, T) ^U ^2 <, 7T2 Qn(r) *Z j-n ±jrp Q(T) <ln(*) X* jR/p ^x *x /0y /®y /.(*.) ^a 28 28 28 29 29 29 30 31 31 31 32 33 39 42 42 43 43 46 47 47 47 47 48 49, 457 581
582 Notation K HBL(A,J9) BL+(X,y) BL_1(X,y) aBL(X,y) 7rleftBL(X,y) 7:T^htBL{X,Y) fleftBL(X,y) rleftBL(X,y) TT{sht BL(X,y) TT{shtBL{X,Y) core0(T) <7leftBL(X,r) <7T{shtBL{X,Y) aleftBL(X,y) (7rightBL(X,y) rBL1(X,y) v)fBL{X,Y) cf*htBL(X,Y) RT/A LT/B o™BL{X,Y) <4ghtBL(X,y) (^t)m' (^t)m C7left(A) <7rieht(A) 7Tleft(A) 7TT{Sht(A) T\v,z> T/W}Z c{n,x) X~, T~ graph(T) xt rt BL£(X,y) K° H0 cvxK{K) 49, 457 50 51 54 54 57 57 61 61 65 65 72 73 74 76 76 80 86 86 88 88 92 92 94 95 95 95 95 98 108 125, 126 133 137 137 137 140, 296 140, 296 140 sgn cvx{K) Face(JC) Extreme(jK') KL0(X,y) *Js?BL(x,y) ^fhtBL(X,y) aessBL(X,y) index(T) 192 wessBL(X,y) m(H,X) ?7i1(n,x) p(*) P(X) PiW l|r||ess ll^U'ess KLx(X,y) KL(X,y) a+sBL(X,y) a-sBL(X,y) T'aT comm(T) comm(jK') = comm.A(K) comm2 [K) comm(o) MLI(A) MRI(A) Radical(A) Hull(J) I1 A a ax H<±K r-°°(o) T°°{X) ascent (T) 157 165 165 165 183 187 187 187 ,529 192 202 202 202 204 204 205 205 206 206 209 209 209 229 231 237 237 237 241 241 243 244 246 247 258 258 271 271 271
Notation 583 descent (T) acc(jRT) iso(jRT) CcmpK(£) Vt(K) Ifl \cun,n') = {\fl-.f ec(n,n')} winding number(<£) A-1 exp(a) Exp(A) sgn(<£) Re(A), Re(a) State(A) (x\y) y* y J- x K± r* /i+(a) *a(°) *!?"(") c*ht(«) *!|ftM a^ht(a) W") ??%) *!?'(*) *2ght(<0 eA(°) •7(10 u;r(a) cDr(a) W, A* WT °ess(T) 271 275 275 281 281 286 295,296 288 290 292 292 295 303 305 309 309 313 313 324 333 341 341,457 341, 458 341, 458 341, 458 345,459 345,459 345, 459 345, 459 345 347 350 350 352 357 362 370 "ess^) wr (a) "eCs°smm(r) *oo(T) V(a)=VA(a) «#(*) A* BL{X,Y,Z) crleft'r''ehtBL(X,r,^) BL_1 [X,Y,Z) 'W{X,Y,Z) Euler(5,T) Euler(Tn,...,r1,T0) Polyn Poiy^ w6=t(a) w6=6(a) g(K,b), g(a,H) (A)=Al AE(n) dE(n) dvA(a) d*(A) X®Y X®AY det(c) \q adj(c) Holo(jRT) DBL(A,A) kn{X)=k{X',dz) A\x) A(T) KM At(5) aTaylor(T) aTaylor(r) rTaylor/j.\ 370 370 370 371 383 383 385 401 402 402 403 427 427 461 461 465 465 469 472 474 475 476 476 493 494 502 502 503 506,536 509 517 517 517 518 520 522 522 522
584 Notation ~Taylor(r) Taylor/ •. <^lor(a) -Ta,lor(r) fTay.or(r) 522 526 526 526 528 528 528 *eTssyl°r(r) 528 index(T) 529 (cf. p. 192) D(a) 529 Aodd(T), Aeven(T) 531 polycvx(iC) 532 ratiocvx(iC) 532 A(*/) 533 [z.p)-1^) 533
Index Absolutely convex hull, 140 Absolutely convex, 6 Accumulation points, 275 Adjoint, 137, 324 Adjugate matrix, 493, 503 Allan's theorem, 506 Almost (left,right) determined, 395 Almost (left,right) invertible, 401, 435 Almost (left,right) majorized, 395 Almost (left,right) multiple, 395 Almost comajorized, 393 Almost decomposably exact, 401 Almost exact, 402 Almost Fredholm, 214 Almost left invertible, 76 Almost left multiple, 391 Almost left spectrum, 532 Almost linearly exact, 402 Almost linearly Taylor nonsingu- lar, 520 Almost lower semi-Fredholm, 214 Almost open, 65 Almost quasipolar, 257 Almost range-included, 393 Almost right invertible, 76 Almost right multiple, 393 Almost right spectrum, 532 Almost spectrum, 339 Almost Taylor invertible, 520 Almost Taylor nonsingular, 520 Almost upper semi-Fredholm, 213 Annihilator, 139 Approximate (left,right) multiple, 395 Approximate left multiple, 391 Approximate right multiple, 393 Approximate (left,right) invertible, 402 Approximately decomposably exact, 402 Arcwise connected, 287 Arens-Calderon trick, 536 Ascent, 271 Atkinson's theorem, 188 Axiom of choice, 22 Back annihilator, 139-140 Backward shift, 44, 250 Baire's theorem, 127 Banach limit, 335 Banach space, 107 585
586 Index Banach-Steinhaus theorem, 4, 135 Bessel's inequality theorem, 316, 321 Bolzano-Weierstrass property, 200 Bornological space, 17 Bornology, 17 Boundary operator, 71 Bounded, 26 Bounded below, 61 Bounded structure, 17 Browder, 266 Canonical factorization, 31 Cauchy integral formula, 363 Cauchy sequence, 108 Cauchy-Riemann equation, 338 Chain, 399, 435 Character, 305 Closed, 61 Closed boundaries, 474 Closed graph theorem, 133 Closure, 2 Comajorized, 393 Coarser, 13 Codomain, 29 Cofinal, 43 Cokernel, 29 Commutant, 237 Commutative Gelfand-Naimark theorem, 381 Commute, 237 Compact, 197 Compact operator, 206 Complemented, 12 Complete, 110 Completely convex, 113 Completely invariant, 512 Completion, 125 Conjugate linear, 139 Conjugate, 137 Connected, 280 Connected component, 289 Connected hull, 281 Conservative, 255, 358 Continuous, 4, 25 Contractible, 286 Convergent sequence, 4, 17 Convex, 112 Convex combinations, 165 Crossnorm, 493 Cyclic representation, 387 Cyclic vector, 387 Decomposably exact, 401 Decomposably regular, 84, 246 Dense, 57 Dense proper subspace of, 130 Derivation, 509 Derivative, 167 Descent, 271 Determinant, 503 Determined, 392 Different iable, 167 Dimension, 175-176 Dini's theorem, 301 Directed, 21 Disconnection, 280 Double commutant, 237 Dual Space, 137 Enlargement, 19, 41 Epimorphism, 57 Equivalence of norms, 13 Essential enlargment, 204 Essential Taylor spectrum, 528 Essentially (left,right) almost invertible, 425 Essentially (left,right) invertible, 425 Essentially (left,right) one-one, 425 Essentially dense, 187 Essentially one-one, 187 Euler number, 420, 426 Evaluation map, 47 Exact, 394, 402
Index 587 Exponential, 292 Exponential spectrum, 345 Exterior algebra, 517 Extreme point, 165 Face, 165 Faithful representation, 388 Finer, 13 Finite, 17 Finite ascent, 271 Finite descent, 271 Finite dimensional, 174 Finite rank operator, 183 Forward shift, 44, 250 Fredholm, 187 Fuglede's theorem, 360 Fundamental theorem of algebra, 340 Gelfand property, 356 Gelfand theorem, 357 Gelfand-Mazur lemma, 356 Generalized exponential, 292 Generalized inverse, 80 Greatest crossnorm, 496 Green's theorem, 339 Hahn-Banach theorem, 142 Hamel basis, 173 Hartog's phenomenon, 537 Hermitian, 326, 333 Hilbert algebra, 332 Hilbert space, 320 Hole, 281 Holomorphic, 337 Homomorphism, 49 Homotopic, 286 Hyperinvariant subspace, 255 Hyperkernel, 271 Hyperrange, 271 Ideal, 20 Idempotent, 247 Increasing, 50 Index, 192, 529 Index theorem, 194 Inessential hull, 244 Inner product, 309 Inside of a curve, 362 Interior, 2 Inverse, 20, 53 Invertible, 53 Invertible, relative to, 256 Isolated points, 275 Isometric, 12 Isomorphic, 12 Jordan property, 50 Kernel, 29 Kleinecke-Sirikov theorem, 510 Krein-Milman theorem, 166 Least uniform crossnorm, 497 Lebesgue covering property, 200 Left and right compositions, 46 Left and right multiplications, 46 Left approximate eigenvalues, 345, 458 Left eigenvalues, 345, 459 Left ideal, 241 Left invariant, 512 Left invertible, 73 Left invertible modulo, 86 Left M-invertible, 92 Left multiple, 391 (Left,right) bounded below, 435 (Left,right) determined, 395 (Left,right) invertible, 401 (Left,right) majorized, 395 (Left,right) multiple, 395 (Left,right) one-one, 435 (Left,right) T-Fredholm, 440 (Left,right) T-Weyl, 440 Left spectrum, 341, 457 Left topological zero divisor, 79, 95 Lift, 287
588 Index Linear algebra, 19 Linear functional, 47 Linear space, 1 Linear subspace, 5 Linearly dependent, 173 Linearly exact, 402 Linearly generating, 173 Linearly independent, 173 Linearly Taylor nonsingular, 520 Liouville's theorem, 340 Locally compact, 200 Lomonosov's lemma, 255, 374 Lower semi-Fredholm, 209 Majorized, 391 Martinelli kernel, 539 Maximal, 22 Maximal ideal, 241 Maximum, 22 Maximum modulus principle, 340 Measure of noncompactness, 202 Minkowski functional, 145 Monomorphism, 57 Monotonic, 50 Natural injection, 28-29 Nearest point theorem, 317 Nearly essentially exact, 430 Neighborhood, 3 Nilpotent, 251 Noncommutative Gelfand- Naimark theorem, 387 Noncommutative polynomials, 461 Nonsingular, 102 Norm, 2 Normal, 333 Normal space, 104 Normal subgroup, 291 Normed linear algebra, 20 Normed module, 482 Normed space, 2 Null, 17 Null space, 29 Numerical range, 383 Oka extension theorem, 537 One-one, 56 Onto, 56 Open, 65 Open mapping theorem, 128-129 Operator bound, 26 Orthogonal, 313 Orthogonal complement, 313 Orthonormal, 314 Outside of a curve, 362 Partial isometry, Partial order, 21 Partially ordered normed space, 23 Perpendicular, 313 Polar, 257 Polar decomposition, 442 Polarization, 310 Pole, 140 Polynomial polyhedron, 534 Polynomially convex, 532 Polynomially convex hull, 532 Positive, 326 Precore, 73 Proper, 60 Proper ideal, 241 Pseudo-inverse, 80 Punctured neighborhood theorem, 274 Pure numerical range, 383 Pythagoras' theorem, 314 Quadratic norm, 310 Quasi-commute, 571 Quasinilpotent, 251 Quasinorm, 2 Quasipolar, 257-258 Quotient, 6 Quotient map, 29 Radical, 243
[ndex 589 Radical element, 243 Range, 29 Range-included, 393 Rank one, 183 Rationally convex, 532 Rationally convex hull, 532 Reduced by, 256 Reflexive, 161 Regular, 80, 246 Relative spectrum, 465 Relatively almost open, 72 Relatively compact mapping, 202 Relatively Fredholm, 80, 246 Relatively open, 72 Relatively Weyl, 84, 246 Relief map, 105 Resolvent function, 342 Reversal of products, 21, 54 Riesz points, 371 Right approximate eigenvalues, 345, 461 Right eigenvalues, 345, 459 Right ideal, 241 Right invariant, 512 Right invertible, 73 Right invertible modulo, 86 Right M-invertible, 92 Right multiple, 393 Right spectrum, 341, 458 Right topological zerodivisor, 79, 95 Schwartz inequality, 310 Self-adjoint, 326, 333 Semi-essential enlargement, 204 Semisimple, 358 Seminorm, 2 Separated, 105 Separation theorem, 145 Sequentially compact, 198 Sequentially precompact, 198 Sesquilinear, 139 Silov boundary, 476 Simply connected, 287 Simply polar, 248 Spanning, 173 Spatially Browder, 266 Spatially Fredholm, 187 Spatially (left,right) Fredholm, 425 Spatially (left,right) Weyl, 426 Spectral mapping property, 472 Spectral mapping theorem, 343, 468 Spectral permanance theorem, 349 Spectral radius, 352 Spectrum, 341 Stampfli's theorem, 374, 375 State, 305 Steinitz replacement process theorem, 174 Stone-Weierstrass theorem, 302 Strictly exact, 402 Strictly weaker than, 130 Stronger, 13 Subalgebra, 20 Submodule, 304 Subprojective property, 465 Supplemented, 11 T-Browder, 267 T—Browder spectrum, 370 T-Fredholm, 261 T—Fredholm spectrum, 350 T-Weyl, 261 T-Weyl spectrum, 350 Taylor almost Fredholm spectrum, 528 Taylor almost invertible spectrum, 523, 527 Taylor almost singular spectrum, 523 Taylor-Fredholm spectrum, 528 Taylor invertible, 520 Taylor invertible spectrum, 523, 527 Taylor nonsingular, 520
590 Index Taylor singular spectrum, 523 Taylor-Weyl spectrum, 529 Tensor product, 493 Terminating, 17 Tietze's theorem, 297 Topological versus metric, 13 Topology, 3 Total order, 21 Totally bounded, 197 Totally bounded mapping, 202 Totally bounded operator, 206 Triangle inequality, 3 Tychonoff product theorem, 201 Uniform boundedness principle, 134 Uniform crossnorm, 494 Uniform tensor product, 493 Uniformly, continuous, 25 Unitary, 333 Upper bound, 21 Upper semicontinuity, 345 Upper semi-Fredholm, 209 Urysohn function, 105 Urysohn's lemma, 105 Usual axioms of set theory, 22 Vector space, 1 Weak Taylor spectrum, 528 Weak* topology, 384 Weaker, 13 Weakly (left,right) Fredholm, 425 Weakly (left,right) T-Fredholm, 440 Weyl operator, 192 Winding number, 288 Zelazko's theorem, 361 Zorn's condition, 23 Zorn's lemma, 22
about the book . . . Focusing on the various kinds of "singularity" which prevent an operator from being invertible, this introduction to functional analysis bases its presentation on the Open Mapping theorem, the Hahn-Banach theorem, the Dual Space construction, the Enlargement of normed space, and Liouville's theorem. Invertibility and Singularity for Bounded Linear Operators makes these concepts easily accessible to an elementary audience . . . develops the theory of open and almost open operators between incomplete spaces... builds the "enlargement" of a normed space and of a bounded operator . .. and sets up an elementary algebraic framework for Fredholm' theory. In addition, this new approach to elementary functional analysis extends from the definition of a normed space to the fringe of modern multiparameter spectral theory. It concludes with a discussion of the various kinds of "joint spectrum," including the complicated ideas of Joseph Taylor. Complete with exercises and a bibliography, this volume serves both as a text for advanced undergraduate and graduate mathematics students in functional analysis courses and as a monograph for mathematicians researching Fredholm theory, Banach algebras, and multiparameter spectral theory. about the author . . . Robin Harte is Professor of Mathematics at Cork University, a college of the National University of Ireland, where he has taught since 1968. Previously he taught at the University College of Swansea in Wales, U.K., and he was a visiting professor at the University of Iowa. His research interests include Fredholm and spectral theory for bounded operators and Banach algebras. Dr. Harte was educated at Trinity College, Dublin, and received the Ph.D. degree (1965) in mathematics from the University of Cambridge in England. Printed in the United States of America ISBN: 0—8247—7754—9 marcel dekker, inc./newyork • basel