Text
                    Handbook of Measure Theory
Edited by: E. Pap
Copyright © 2002 Elsevier
ISBN: 978-0-444-50263-6


Table of Contents Vol.1 Part 1 Ch. 1 Ch.2 Ch.3 Ch.4 Ch. 5 Ch. 6 Ch.7 Preface List of Contributors Classical Measure Theory History of Measure Theory Some Elements of the Classical Measure Theory Paradoxes in Measure Theory Convergence Theorems for Set Functions Differentiation Radon-Nikodym Theorems One-Dimensional Diffusions and Their Convergence in Distribution Endre Pap Djura Paunic Endre Pap Miklos Laczkovich Paolo de Lucia and Endre Pap Brian S. Thomson Domenico Candeloro and Aljosa Volcic James K. Brooks v-vi vii-viii 1-26 27-82 83-123 125-178 179-247 249-294 295-343
Part 2 Ch. 8 Ch.9 Ch. 10 Part 3 Ch. 11 Ch. 12 Ch. 13 Ch. 14 Part 4 Vector Measures Vector Integration in Banach Spaces and Application to Stochastic Integration The Riesz Theorem Stochastic Processes and Stochastic Integration in Banach Spaces Integration Theory Daniell Integral and Related Topics Pettis Integral The Henstock— Kurzweil Integral Set-Valued Integration and Set-Valued Probability Theory.. An Overview Topological Aspects of Measure Theory Nicolae Dinculeanu Joe Diestel and Johan Swart James K. Brooks M. Diaz Carrillo Kazimierz Musial Benedetto Bongiorno Christian Hess 345-399 401-447 449-502 503-530 531-586 587-615 617-673
Ch. 15 Ch. 16 Ch. 17 Ch. 18 Vol. II Part 5 Ch. 19 Ch.20 Ch.21 Ch.22 Ch.23 Density Topologies FN-Topologies and Group-Valued Measures On Products of Topological Measure Spaces Perfect Measures and Related Topics Order and Measure Theory Riesz Spaces and Ideals of Measurable Functions Measures on Quantum Structures Probability on MV- Algebras Measures on Clans and on MV-Algebras Triangular Norm-Based Measures Wladyslaw Wilczynski Hans Weber Stratos Grekas Doraiswamy Ramachandran Martin Vath Anatolij Dvurecenskij Beloslav Riecan and Daniele Mundici G. Barbieri and H. Weber D. Butnariu and E.P. Klement 675-702 703-743 745-764 765-786 787-825 827-868 869-909 911-945 947-1010
Part 6 Ch.24 Ch.25 Part 7 Ch.26 Ch.27 Ch.28 Ch.29 Ch.30 Part 8 Geometric Measure Theory Geometric Measure Theory.. Selected Concepts; Results and Problems Fractal Measures Relation to Transformation and Duality Positive and Complex Radon Measures in Locally Compact Hausdorff Spaces Measures on Algebraic—Topological Structures Liftings Ergodic Theory Generalized Derivatives Relation to the Foundations of Mathematics Miroslav Chelbik Kenneth J. Falconer T.V. Panchaan Piotr Zakrzewski W. Strauss, N.D. Macheras and K. Musial Frank Blume Endre Pap and Arpad Takaci 1011-1036 1037-1054 1055-1090 1091-1130 1131-1184 1185-1235 1237-1260
Ch.31 Ch.32 Part 9 Ch.33 Ch.34 Ch.35 Ch.36 Ch.37 Real Valued Measurability; Some Set- Theoretic Aspects Nonstandard Analysis and Measure Theory Non-Additive Measures Monotone Set Functions- Based Integrals Set Functions over Finite Sets.. Transformations and Integrals Pseudo-Additive Measures and Their Applications Qualitative Possibility Functions and Integrals Measures of Information Author Index Subject Index Aleksandar Jovanovic Peter A. Loeb P. Benvenuti, R. Mesiar and D. Vivona Michel Grabisch Endre Pap Didier Dubois and Henri Prade Wolfgang Sander 1261-1293 1295-1328 1329-1379 1381-1401 1403-1468 1469-1522 1523-1565 1567-1585 1587-1607
Preface The main goal of this Handbook is to survey measure theory with its many different branches and its relations with other areas of mathematics. Mostly aggregating many classical branches of measure theory the aim of the Handbook is also to cover new fields, approaches and applications which support the idea of the measure in a wider sense, e.g., the ninth part of the Handbook. Although chapters are written as surveys to special areas they contain many special topics and challenging problems valuable for experts and reach sources of inspiration. I hope that mathematicians from other areas as well as physicists, computer scientists, engineers, econometrists will find useful results and powerful methods for their research. The reader may find in the Handbook many close relations to other mathematical areas: real analysis, probability theory, statistics, ergodic theory, functional analysis, potential theory, topology, set theory, geometry, differential equations, optimization, variational analysis, decision making and others. Measure theory, as a classical mathematical area, is treated in many textbooks and monographs and new results are widespread in many different journals. In the last 30 years, traditional conferences on measure theory in Germany (Oberwolfach) and Italy (CARTEMI - Capri, Ischia, Maiori, Grado) were reach sources for new results and further development of many specific areas of measure theory. The increasing interest in measure theory (theory and applications) initiated the creation of the GEM working group (GEnerealized Measures) in the framework of the international EUSFLAT association, and parts of this Handbook are related to the GEM work. I mention also the specific approach by Bourbaki A965) and the recent work of D.H. Fremlin B000) with a systematic approach to measure theory of which volumes 1 and 2 are already published, and drafts of most parts of the other three volumes are available on web page at www.essex.ac.uk/maths/staff/ fremlin/mt.htm. Encyclopaedia of Mathematics, Kluwer Academic Publishers (especially the very useful CD-ROM version, 1997) contains also many measure theoretical items. The reader of this Handbook may see here one place where new results are obtained and new areas are developed. The Handbook preparation started in 1998 with many difficulties. Many discussions on the content and possible authors were undertaken with the measure theoretical community, well connected thanks to the previously mentioned conferences. I am obliged to the authors who agreed to contribute to the Handbook. In the cooperation with them it is the encouragement I was given and the nice personal relations with many of them not only through e-mail (more than 2000 messages), but also often in direct contacts, that have brought this project to the final stage. Although there was a great pressure on authors to make some unification, first of all because of common subject index and author index, they have succeeded in preserving their own scientific styles and approaches.
VI Preface Many mathematicians were contacted and involved, but several of them because of other obligations were not able to deliver their contribution themselves. The reader will also note that some areas are missing, or some areas are under-represented. Some previous issues of the Handbook series already cover some missing parts as Measure Algebras (D.H. Fremlin) in the Handbook of Boolean Algebras (J.D. Monk, ed.), Borel Measures (R.J. Gardner, W.F. Pfeffer) in the Handbook of Set-Theoretic Topology (K. Kunen, J.E. Vaughan, eds). For the readers convenience, the subjects covered by the Handbook in 37 chapters are organized in nine parts although there are close interactions between them. In editing of the Handbook, I received much help from the contributors, as well as many useful advices from Professors D.A. Fremlin, D. Kolzow, W.A.J. Luxemburg, and W.F. Pfeffer. I am grateful to the Johannes Kepler University, Linz, Austria, where I have managed a lot of related research. The project was supported during the visit of the editor to the University Federico II, Naples, as visiting professor for PhD students, by INdAM, Italy, in the period May-June, 2000, and by numerous visits to Naples, supported by MURST. I want to thank for the partial financial support of the Project in the Fields of Basic Research "Mathematical models of nonlinearity, uncertainty and decision" A866) supported by Ministry of Science, Technology and Development of Serbia. I gratefully thank for the fruitful cooperation and support of Dr. A. Sevenster, B. Lightfoot, A. Deelen from Elsevier Science Publishers. Finally, I would like to thank VTEX Typesetting Services and in particular Dr. Z. Kryzius for their fine work in converting the Handbook to its final typeset form. End re Pap
List of Contributors Barbieri, G., Universita di Udine, Udine (Ch. 22) Benvenuti, P., Universita degli Studi "La Sapienza ", Roma (Ch. 33) Blume, R, John Brown University, Siloam Springs, AR (Ch. 29) Bongiomo, В., Dipartimento di Matematica ed Applicazioni, Palermo (Ch. 13) Brooks, J.K., University of Florida, Gainesville, FL (Chs. 7, 10) Butnariu, D., University of Haifa, Haifa (Ch. 23) Candeloro, D., Dipartimento di Matematica, Perugia (Ch. 6) Carrillo, M.D., Universidad de Granada, Granada (Ch. 11) Chlebik, M., Comenius University, Bratislava (Ch. 24) de Lucia, P., Universita "Federico II", Napoli (Ch. 4) Diestel, J., Kent State University, Kent, OH (Ch. 9) Dinculeanu, N., University of Florida, Gainesville, FL (Ch. 8) Dubois, D., IRIT- UPS, Toulouse (Ch. 36) Dvurecenskij, ?., Slovak Academy of Sciences, Bratislava (Ch. 20) Falconer, K.J., University of St Andrews, Fife, Scotland (Ch. 25) Grabisch, M., University of Paris VI, Paris (Ch. 34) Grekas, S., University of Athens, Athens (Ch. 17) Hess, C, Universte Paris Dauphine, Paris (Ch. 14) Jovanovic, ?., University of Belgrade, Belgrade (Ch. 31) Klement, E.P., Johannes Kepler University, Linz (Ch. 23) Laczkovich, M., Eotvbs Lorand University, Budapest (Ch. 3) Loeb, PA., University of Illinois, Urbana, IL (Ch. 32) Macheras, N.D., University of Piraeus, Piraeus (Ch. 28) Mesiar, R., Slovak Technical University, Bratislava and Systems Research Institute PAN, Warszawa (Ch. 33) Mundici, D., University of Milan, Milan (Ch. 21) Musial, K., Wroclaw University, Wroclaw (Chs. 12, 28) Pap, E., University ofNovi Sad, Novi Sad (Chs. 2, 4, 30, 35) Panchapagesan, T.V., Universidad de los Andes, Merida (Ch. 26) Paunic, D., University of Novi Sad, Novi Sad (Ch. 1) Prade, H., IRIT- UPS, Toulouse (Ch. 36) Ramachandran, D., California State University, Sacramento, CA (Ch. 18) Riecan, В., Slovak Academy of Sciences, Bratislava (Ch. 21) Sander, W., Technical University of Braunschweig, Braunschweig (Ch. 37) Strauss, W., Universitat Stuttgart, Stuttgart (Ch. 28) Swart, J., University of Pretoria, Pretoria (Ch. 9) vii
viii List of Contributors Takaci, ?., University ofNovi Sad, Novi Sad (Ch. 30) Thomson, B.S., Simon Fraser University, ВС, Canada (Ch. 5) Vath, M., University ofWurzburg, Wiirzburg (Ch. 19) Vivona, D., Universita degli Studi "La Sapienza", Roma (Ch. 33) Volcic, ?., Dipartimento di Scienze Matematiche, Trieste (Ch. 6) Weber, H., Universita di Udine, Udine (Chs. 16, 22) Wilczynski, W., University of Lodz, Lodz (Ch. 15) Zakrzewski, P., University of Warsaw, Warsaw (Ch. 27)
CHAPTER 1 History of Measure Theory Djura Paunic* Institute of Mathematics. University of Novi Sad. Trg D. Obradovica 4. 21000 Novi Sad. Yugoslavia E-mail: djura®im.ns.ac.yu Contents Introduction 3 1. Beginnings 3 2. The Greeks 4 3. Archimedes (> 4. Infinitesimal methods 8 5. Loss of measure 1* 6. New beginning 1 & 7. Newly found measure 21 References 25 *The author wants to thank for the financial support of the Project in the Fields of Basic Research supported by Ministry of Science, Technology and Development of Serbia. HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1
This page intentionally left blank
History of measure theory 3 Introduction At the beginnings of civilization mathematics could be differentiated from science and technology as "rational art of solving abstract problems with numbers and geometrical figures". The accent in this definition is on "rational", "abstract", and "problem solving". It means that the solution has to be obtained from initial data with some kind of rational, logical mental process. To calculate the size of a geometrical figure was always regarded as one of the most basic aims of mathematics. Mathematically, size calculation of an object was divided into tree problems which were considered different: finding lengths of lines, areas of surfaces, or volumes of bodies. It was tacitly supposed that size of an object, which will be called measure, has the following properties: • Same objects have equal measure. • Part of an object has smaller measure than the whole. • If an object is divided into nonoverlapping parts, the measure of the whole object is equal to the sum of the measures of the parts. • Points have 0 measure in measuring lengths, lines have 0 measure in measuring areas, and surfaces have 0 measure in measuring volumes. The exposition of all results in this article will be in contemporary mathematical language and notations. It introduces the problem of interpretation of what many authors actually wanted to say since often their mode of expression is not sufficiently clear, or the results are not explicitly stated but only implied, or the original exposition does not exist any longer. 1. Beginnings The first civilizations rose in the fertile river valleys of China, India, Mesopotamia, and Egypt in the third millennium ВС. Unfortunately, in China and India very perishable material was used for writing so it is very little known of their mathematical knowledge at early times. The dryness of climate in Egypt preserved fair number of documents, so that two papyri are preserved, Rhind papyrus and Moscow papyrus, devoted entirely to mathematics. Even more is preserved in Mesopotamia since they used to write on clay tablets which are almost indestructible, especially by fire. In Egypt and Mesopotamia a lot of empirical rules were discovered for solving various mathematical problems. Some of them turned out to be mathematically correct, other only approximate. Rhind papyrus contains 84 mathematical problems. Approximately 20 problems are devoted to computations of areas and volumes. The area of parallelogram (product of the base by height), triangle (half product of the base by height), and trapeze (product of half sum of the bases by height) was computed correctly. For quadrangles it was used incorrect formula (product of half sum of one pair opposite sides by the half sum the other pair of sides). Very intriguing is the 50th problem which says that the area of round field of diameter 9 is equal to the area of square whose side is 8. This is equivalent to
4 D. Paunic Even more interesting are 10th and 14th problems in the Moscow papyrus. In the 14th problem it is drawn trapeze with base 4, height 6. and upper base 2. From the solution it follows that volume of truncated square pyramid is to be calculated. Volume is calculated as sum of 16,4, and 8 multiplied by 2. Is it application of V = -(a2+ab + b2)? It is very difficult to say yes, and equally difficult to say no. Much more unclear is the 10th problem. It is to be calculated the surface area of figure which resembles basket. It seems that the formula ( ?2 ? ? = I 1 - - I Bx)-x, where ? =4-, is applied. Since A - gJ is Egyptian equivalent for тг/4 one gets calculation of surface of a body related to circle. Half sphere or half cylinder or something else? In Mesopotamia there were many different civilizations but all of them used cuneiform writing on clay tablets so all will be considered as one civilization. Most mathematical tablets come from two libraries: Tiglatpilasars I in Assur and Assurbanipals in Ninevah. The sexagesimal system for writing numbers was used (without zero), so the calculations were simple to perform. They could solve much more difficult problems than Egyptians and were more interested in algebraical problems then geometrical. In Mesopotamia appear solution of quadratic equations and Pythagorean theorem. The area of regular polygon or circle was calculated that the length of the perimeter was multiplied by corresponding constants. For circle the constant was 1/12 which corresponds to 3 for ?. 2. The Greeks Greek philosophers and mathematicians made the great discovery that mathematical rules could be proved and that the whole of mathematics can be organized in axiomatic theory. The Greek geometry started with Thales of Miletos, but Pythagoreans made the initial development in VI and V centuries ВС. They investigated the properties of numbers, geometrical figures; especially regular polygons, regular solids and circle, and created the theory of rational proportions. One of the most important discoveries Pythagoreans made was the existence of incommensurable lengths, i.e., that there are segments whose lengths cannot be determined by the process which is today known as Euclidean algorithm. It seems that this discovery was made ca. 430 ВС, and there are two hypothesis how it was done. First hypothesis is that Pythagorean tried to measure the length of diagonal of regular pentagon by its side, since Pythagorean sign was regular pentagon with all diagonals drawn. The problem of measuring the diagonal with the side reduces to the problem of measuring the diagonal of smaller pentagon, formed from diagonals of the original pentagon, to its diagonal, and one gets infinite regress. This explanation is not documented in ancient literature. The other hypothesis is based on the parity of numerical values for sides of right angled triangle and Pythagorean theorem. It is the consequence of easy observations that square
History of measure theory 5 of even number of the form 4k, and that the square of odd number is of the form 4k + 1 (Knorr, 1975, Theorems 7, 8, p. 151). If not all sides are even, then hypotenuse cannot be even since its square is of the form 4k and the sum of two odd squares is of the form 4n + 2. So hypotenuse must be odd. Of the remaining sides one must be even the other odd. If hypotenuse is the diagonal of the square, then one side of the square is even and the other odd, so the length of the side of the square is number which is at the same time even and odd, which is absurd. This explanation can be much more easily defended with ancient sources. Theodoros of Kyrene (V century ВС) studied the problem of irrationality of square roots of numbers and proved that square roots of number 3, 5, etc. to 17 are irrational, "but in this one [17] for some reason he encountered difficulties" (Knorr, 1975, p. 62). The more complete theory of quadratic irrational numbers was created by Theaetetos of Athens (V- IV century ВС) and was included as Book X into Euclid's Elements. Democritos of Abdera (ca. 460-ca. 370 ВС) was good mathematician and proponent of atomic theory. He was pupil of Leuccipos of Miletos, and in philosophical circles it is understood that atomic theory originated from Leuccipos. But it is almost nothing known of Leuccipos. Atomic theory is very useful for mathematical applications since from it follows that any geometrical figure is made of atoms, and that one can use some kind of limiting process to determine lengths, areas or volumes. Archimedes attributes to Democritos discovery that a cone is one third part of the cylinder, and the pyramid one third of the prism, which has the same base and equal height, but that he did not prove it. The works of Democritos are lost. The discovery of incommensurable magnitudes showed that all lengths, all areas or all volumes cannot be measured by rational numbers, and that the existing Pythagorean theory of integral proportions is insufficient. The Greeks did not create the notion of real number but used complicated definition of proportionality of ratios of geometric magnitudes so that so called geometric algebra could be applied. It was rather cumbersome system of calculations with geometric magnitudes but which was, or could be made, completely rigorous. It can be regarded precursor of the definition of real numbers by Dedekind cuts, but they had no intention of developing it further. The originator was Eudoxos of Cnidos (first half of IV century ВС). Essentially his theory of proportions was presented in Book V of Euclid's Elements. In Book XII of Euclid's Elements are presented rigorous Eudoxean proofs of the following results of Democritos and Hypocrates of Hios in which two geometric magnitudes are compared: A) Areas of two circles are in the same ratio as the squares of their radii. This result was discovered by Hypocrates of Hios. B) Volumes of two cylinders with the same height are in the same ratio as the squares of their radii. C) The volumes of two pyramids with the same height are in the same ratio as the ratio of the areas of their bases. D) A cone and the cylinder with the same height are in the ratio of 1 to 3. E) The volumes of two spheres are in the same ratio as the cubes of their radii. For the proofs of these theorems the so-called method of exhaustion was used. It consists in showing that the area or volume of the difference between the given figure, and the
6 D. Paunic sequence of inscribed polygons (Р„)/16ь: or prisms, cylinders etc. {Уп)пчп can be made smaller than any magnitude of the same kind if ? is sufficiently large. 3. Archimedes The greatest mathematician of antiquity was Archimedes of Syracuse B877-212 ВС). Of his extant works 6 are devoted to calculations of lengths, areas, and volumes. These are: • Measurement of a Circle. It consist of two theorems and one corollary, and is only fragment of the original work. • On the Sphere and Cylinder. • Quadrature of Parabola. • On Conoids and Spheroids. • On Spirals. • Method. Only the first two works were very popular and well-known throughout the history of mathematics. The last one was discovered only in 1906, and it had no influence at the development of mathematics. The first Greek who is known to have tried to square the circle was Anaxagoras of Clasomenae. While he was in prison he occupied himself with this problem. Nothing further is known. Next, Hypocrates of Hios tries to divide circle into lunes (gr. meniskoi. lat. lunnulae), geometric figure bounded by two arcs. He succeeded to square 5 lunes, some greater than the half circle, some smaller, but this approach to the determination of the area of the circle was apparently dead end. Dinostratos tried to square circle using special mechanical curve which was used by Hippias of Elis to trisect an angle. This curve is obtained in the intersection of the upper side of square when it moves uniformly parallel to itself towards the lower side, and the left side when it rotates uniformly around lower vertex to the right for 90° till it coincides with the lower side. Both sides start moving at the same moment and reach lower side at the same time. The Nicomedes showed that the side of the square is the geometric mean of the quarter of the circumference (side of square is radius), and the segment of the side between left lower vertex, and the intersection point of the curve with the lower side of the square. This solution was objected that uses point on the curve which does not exist, since when both sides reach lower side of the square they coincide and there is no intersection point of the curve and the side. The first Greek who tried to square the circle by inscribing regular polygons in it was Antiphon, sophist from Athens, and contemporary of Socrates. His work is not extant, and there are only vague description of it by commentators. It seems that his idea was successively to double the number of sides of the inscribed polygons so that after finite number of iterations the polygon will coincide with the circle. Every polygon can be transformed into square so circle can be squared. The reason for coincidence is that after finite number of steps sides of the polygon will coincide with the circle (atom will be reached). This procedure criticized that the side of the polygon will never coincide with the circumference since the line can only touch circumference in one point. His idea, as presented by commentator, can be regarded more as a thought experiment than
History of measure theory 7 mathematical proof. If one understands it that circle is limiting case of inscribed polygons then it becomes true. Anyway Antiphon was the first to propose an idea that could be transformed into mathematical proof. One generation later Bryson, pupil of Socrates or Euclides of Megara, proposes that one has to inscribe square in the circle, and to circumscribe square around circle, and that the circle will be equal to the an intermediate square. It very unclear what he actually meant, but the most important in his construction was that area of the circle is bounded from above and from below. Eudoxos was the first who transformed vague ideas of Antiphon and Bryson into rigorous proofs, but it was Archimedes who succeeded to apply them to the circle. In Measurement of a Circle are proved two important facts: First the area of the circle is the same as the area of right-angled triangle in which one of the sides about the right angle is equal to the radius, and the other to the circumference of the circle. It is equivalent to the statement that the ratio of the diameter to the circumference is the same as the ratio of the area to the square of the radius. The second is that 3 Щ < ? < 3 j. This inequalities were determined by approximating the length of the circumference with the perimeters of inscribed and circumscribed regular polygons of 96 sides. These perimeters were obtained starting with regular hexagon and in each step doubling the number of sides of polygons and calculating their perimeters. The second work On the Sphere and Cylinder is the most interesting, and is divided in two books. The main result is that the surface area of the sphere is four times as the area of the greatest circle Dлт2, proposition 33 of the first book) and the volume of the sphere is for times of the volume of the cone which has its base equal to the greatest circle of the sphere and its height equal to the radius of the sphere (^??\ proposition 34 of the first book). In Quadrature of Parabola he calculated that the area of the segment of parabola is four thirds of the area of the triangle with the same base and equal height. Here the sum of geometric series was used to obtain the result. Conoid is solid which is obtained by rotation of parabola around its principal axis or hyperbola around its major axis. Spheroid is ellipsoid of revolution. On Conoids and Spheroids is devoted to the computation of volume of these solids and the Archimedean constructions for it are very similar to the modern definition of definite integral. To prove that the volume of paraboloid of revolution is half of the volume of the cylinder in which it is inscribed the paraboloid is divided into ? slices in which ? — 1 cylinders are inscribed and around which ? cylinders are circumscribed. In this work he computed that the area of an ellipse is abn, where a, and b are greater, and smaller semiaxis. The work On Spirals is devoted to the investigation of Archimedean spiral. It is very interesting because in it transcendental curve is investigated mathematically for the first time. The first twenty propositions are devoted to determination of the tangent line to the spiral, and played great role in the development of differential calculus. Last eight propositions are devoted to area computation. The main result is that the area of the figure bounded by the first turn of the spiral and the straight line segment joining its beginning to its end is equal to one third of the area of the circle whose radius is that line segment.
8 D. Paunic Archimedes succeeded to calculate the area of spiral sector bounded by radius vectors r\, and Г2 and which has central angle ? ? ( 1 Д a(S)= -I Г|Г2 + -(? -r\)-\. The problem is not important but his skill of the computation is fascinating. In the Method Archimedes showed how he has found some of his results with ingenious use of lever principle. For the sake of discovery he supposes that plane figure is composed of the parallel straight segments. Then these segments are balanced with another set of parallel segments which belong to known figure at the other end of the lever so that area of unknown figure can be determined. The decomposition of plane figure into straight line segments was extensively used in the XVII century. Method ends with volume computation of two new bodies. In proposition 14 Archimedes computes that the volume of the wedge cut from cylinder of diameter d, and height d/2 by a plane through a diameter of the base and one point on the circumference of the other base is d^/\2. In proposition 15 is computed that the volume of the body, which is obtained as common region of two cylinders with equal diameter d whose axes are intersecting and perpendicular, is 2t/3/3. In antiquity there was no new ideas in measure theory after Archimedes. Only Pappos in his Collection gives areas of some interesting surfaces. The most interesting is the area bounded by the spiral on the hemisphere. If the hemisphere is generated so that quadrant makes one revolution about vertical axis, and the point during that rotation moves uniformly from the pole to the equator then the path of the point describes spiral on the hemisphere. Pappos proves that the spiral, and quarter of the circumference which connects its starting, and end point bound the surface which has the same ratio to the hemisphere as has the sector of the quadrant to the quadrant. The other result is the theorem which anticipates Guldin's theorem. He says: "Figures generated by complete revolution of a plane figure about an axis are in a ratio compounded A) of the ratio of the areas of the figures, and B) of the ratio of the straight lines similarly drawn to (i.e., drawn to meet at the same angles) the axes of rotation from the respective centers of gravity" (Heath, 1921, II, p. 403). 4. Infinitesimal methods During middle ages interest for mathematics in Europe was very low, although some dim knowledge of mathematics was present, but without proofs and almost empirical. So, for instance, in the tenth century Adelbold von Utrecht knew that the volume of the sphere is obtained as 11/21 parts of the cube of the diameter (? = 22/7), but did not know how the area of equilateral triangle whose side is 7 can be 28 or 21. Gerbert explained to him that he has confused triangle with triangular number. Only in Islamic countries from VIII to XIV century appeared some interest in mathematics. They were not very interested in measure theory computations, but have obtained some very interesting results. The most interesting result obtained al-Haitham (ca. 965-1039). Islamic mathematicians admired Archimedes and studied On the Sphere
History of measure theory 9 and Cylinder but On Conoids and Spheroids was unknown. Thabit ibn Qurra (ca. 835-901) calculated the volume of paraboloid of revolution in a new way, in a rather complicated manner. His proof was simplified by al-Kuhi (X century), but al-Haitham succeeded to solve the problem in "full generality", i.e., he found that the solid obtained by rotation of segment of parabola around its chord has volume 8/15 of the volume of circumscribed cylinder whose height is the cord. The proof was standard exhaustion argument in the manner of Archimedes. Basis for his calculation was recurrent formula for sum of powers of numbers ?*/+1 +? (?*') = ("+ !)?*'· *=l j=\ \k=\ / k=\ Using this formula it was possible to obtain the fundamental limit ?—>oo ri 5 Archimedes used this limit for / = 1 and / = 2. The Renaissance men were interested in all things from antiquity and soon wanted to master ancient science and philosophy in which mathematics played prominent role. The level of mathematical knowledge gradually rose so that in the second half of the XVI century was high enough that discoveries of new mathematical results became possible. The mastery of Elements of Euclid and Conies of Apollonios prepared scientist for study of Archimedes. The Archimedean book On the Equilibrium of Planes on determination of centers of gravity of plane figures seems to be the work which spurned the imagination of mathematicians to apply infinitesimal methods to determination of centers of gravity of solids. Archimedes uses various results on center of gravity of solids but at that time no systematic work of his that contained proofs was known. Francesco Maurolico A494—1575), improved on Archimedean determination of center of gravity of solids in De momentis aequalibus from 1548, where the proofs were systematized, but it was not printed till 1685 so its influence was limited. Federigo Commandino A509-1575) in his work Liber de centro gravitatis solidorum, Bologna 1565, determines the center of gravity of conoid, but uses Archimedean proofs strictly. His work was continued by Simon Stevin A548-1620) who tried to simplify proofs. Stevin's book Beghinselen der Weeghconst, Leyden 1583 (Introduction to art of weighing) was written in Dutch so its influence was not so great till the beginning of 17th century when it was translated into Latin and French. He deliberately tried to simplify and modify Archimedean proof structure, but he did not try to make general theory of it and in every theorem repeated reasoning from the beginning. Luca Valerio A552-1618) continued of Commandino's work in Italy. Valerio published the book De centro gravitatis solidorum Rome 1604. He was very skillful geometer, and expounded his proofs in Euclidean manner. He had great influence especially upon Galileo and his pupils. The most influential work appeared at the beginning of the 17th century: Nova stereometria doliorum vinariorum, Linz 1615 (New solid geometry of wine barrels) by Johann Kepler A571-1630). Kepler has become famous few years earlier especially after
10 D. Paunic the publication of "Astronomia nova" (New astronomy), 1609, in which the motion of planet Mars from Tycho Brahe's observations was analyzed, and Kepler's first two laws of planetary motion were expounded. In Stereometria Doliomm he tries to calculate volume of wine barrel since he was fascinated that wine merchants in Linz, Austria, used single measuring rod to determine volume of the cask without respect to its size, and without any calculation. In order to explore the validity of this procedure he studies the solids of revolution. In the first part he presented Archimedean theorems on area of circle and volume of sphere but gave proofs based on intuitive application of indivisibles. He said: "We could obtain absolute and in all respects perfect demonstrations from these books of Archimedes themselves, were we not repelled by the thorny reading thereof" (Edwards, 1982, p. 103). Circle is regarded as infinitude of isosceles triangles whose infinitesimal base is on the circumference, and the third vertex in the center. Sphere is regarded as infinitude of cones whose base is on the surface of it and all vertexes in the center. Kepler found that if a segment of conic section rotates around an axis 92 different bodies can be formed. Dissecting torus with planes which pass through its axis of rotation he reduced calculation of volume of torus to that of cylinder; Any ring with circular or elliptic cross section is equal to a cylinder whose altitude equals the length of the circumference which the center of the rotated figure describes, and whose base is the same as the cross section of the ring, this theorem is found in Heron's Metrica where it is attributed to Dionysodoros whose works are lost. If the solid is formed by rotation of a segment of circle around its chord, he calls it apple if segment is greater then half circle or lemon if it is less, then he showed that its volume can be reduced to that of the wedge of cylinder whose base is the given segment of the circle, and height equal to the length of circumference whose radius is height of the segment. Kepler's work was much studied, and soon appeared mathematical books in which the infinitesimal methods were used to calculate areas and volumes. Around 1630 there was a lot of interest among mathematicians for finding areas and volumes, mostly using some sort of subdivision method. The ablest were in Italy Cavalieri and Torricelli, in France Fermat, Descartes, and Roberval, and Gregoire of St. Vincent, Guldin and Tacquet. They mostly communicated by letters or in direct contact and had very varied interests so it is very difficult to established the priorities. Galileo Galilei A564-1642) was interested in infinite and infinitesimal. He made some applications of them in his mechanics but it remained to his pupils and assistants Cavalieri and Torricelli to develop more fully their application to mathematics. Bonaventura Cavalieri A598-1647) tried systematically to apply infinitesimals to geometrical problems since approximately 1626, and wrote two books Geometria indivisibilibus continuorum nova quadam ratione promota (Geometry deduced by new calculations using indivisibles of continua) 1635. and Exercitationes geometricae sex (Six geometrical exercises) 1647. Cavalieri differed in two ways from Archimedes and Kepler. Firstly, his method consists in effort to connect two different objects, one known and one unknown. Then both objects are decomposed in the same way into indivisibles so that there is one to one correspondence between indivisibles and that indivisibles are in certain ratio. Then the area or volumes of the figures are in the same ratio as the ratio of indivisibles. Secondly, the indivisibles in which the figures are decomposed always have one dimension less then the figures. Using this method the limit processes were hidden.
History of measure theory 11 Cavalieri had still another technique, more arithmetical, and at the first sight not so general. If the lengths of the lines could be expressed arithmetically then he used "sums of power of lines" which is equivalent to integral Cavalieri's exposition was far from rigorous and very verbose so it was difficult to read him and understand what he exactly meant. Evangelista Torricelli A608-1647) fully recognized the advantages and disadvantages of Archimedean rigor and indivisible methods and presented his ideas much more clearly than Cavalieri. He published his account of indivisible methods in Opera geometrica (Geometrical works) in 1644 which was illuminating to many who found the work of Cavalieri too obscure and difficult to follow. Although he admired Cavalierean method and gave full account of it, demonstrations according to the methods of the ancients were also given. Torricelli's most interesting results are the transformation of arc of higher spiral rm =квп to the arc of higher parabola xm+" = k{,J^f-y)n. This reduction is equivalent to the transformation of the integrals for this curve. He was the first who succeeded to determine the volume of the infinite solid which is obtained by rotating hyperbola xy = a around jc-axis. For this he used cylindrical indivisibles. Gregoire de Saint-Vincent A584-1667) had very interesting ideas on calculating volumes geometrically. He developed his ideas during 1622-1629 but because of war his results were not published until 1647. Then many of his results were not new, and his geometric approach was not so promising as algebraic one proposed in 1637 in Geometry of Descartes. In Opus geometricum (Geometrical work) Gregoire established that the area under hyperbola xy = 1 has the property that A(at, bt) = A(a,b), a, b, t > 0, where by A(a,b) the area between hyperbola and .v-axis from ? = a to ? = b is denoted. His friend A. A. de Sarasa noticed that from it immediately follows that the area under hyperbola has logarithmic property since A(l,xy) = A(l,x) + A(x,xy) = A(l,x) + A(l,y), and published it in Solutio problematis a Mersenne propositi, 1649. Gregoire's geometric method for finding volumes was called "ductusplani in planum", and is presented in the 7th part of his Opus. It is applicable to the solids whose cross section are parallelograms, since they can be constructed by means of two plane surfaces standing on the same ground line, and parallel normals to that ground line determine cross sections. In that way he reduced the volume calculation to the properties of plane figures. His best results are application of that method to the volume calculation of wedges, formed by cutting right circular cylinder
12 D. Paimic by means of an oblique plane through a diameter of the base. Opus geometricum was carefully studied by B. Pascal, Chr. Huygens, and G. Leibniz. In France at the beginning of XVII century the center of mathematical life was around M. Mersenne A588-1648) who was interested in natural sciences and mathematics, who organized regular meetings, and corresponded with mathematicians and scientist throughout Europe. Thanks to his correspondence Galileo, Cavalieri, and Torricelli in Italy were acquainted with work of Roberval in Paris, Fermat in Toulouse, Descartes in Holland, and vice versa just to mention the most famous scientist which were kept in touch through him. One of problems posed was to investigate the path of a point at the rim of a rolling wheel. The investigation of this curve, called cycloid, was very popular in the XVII century. Origin of the cycloid can be traced to Bouvelle (XVI century) but Galileo posed it as a problem to Mersenne in 1630, and the first investigation was done by Roberval after it. Gilles Personne de Roberval A602-1675) was professor at College Royal in Paris since 1634. This post was renewable every three years. The professor had to propose problems to the candidates and author of best solutions becomes new professor. Roberval did not publish anything about his methods but succeeded to remain professor till his death. He solved many problems with infinitesimal methods which he developed around 1628. He formulated his ideas in the book Traite des indivisibles which was written 1634 but was not published till 1693. The indivisibles there were used intuitively and without rigorous proofs, and were presented as narrow rectangles. Roberval found that the cycloid can be geometrically presented in a way which is in modern notation ? = at — ? sin?. у = a — a cos t and to find the area under an arc of the cycloid he calculated the area under accompanying curve (in this case sinusoid ? = at, у = a — a cost) on which he has to add one circle and finally obtained that the area under one arc of the cycloid is 3 times the area of generating circle. His success provoked Descartes and Fermat to find their own solution to the same problem. In 1638 Descartes used for it an ad hoc method similar to the Archimedean use of triangles in the quadrature of parabola and infinitesimal reasoning. Fermat uses direct decomposition of the area into horizontal rectangles and compares all the lines of the cycloid to the all the lines of rectangle which contains it. Roberval also found the volume of the solid which is generated by rotation of sinusoid. He geometrically found following integrals 2a2/ sin21d(at) =??\ a2 I A + costJd(at) = —-a\ Jo Jo 2 In the next generation of geometers the influence of Descartes algebra begins to be felt and many results were discovered arithmetically. The greatest influence had the second edition of the Latin translation of his Geometry which appeared in 1659, since it contained many additions on arithmetical application of infinitesimal methods written by various authors. John Wallis A616-1703) published in 1655 his most famous book Arithmetica infinitorum. His aim was to present his own method of investigation instead of proving things, so that in it there is practically no proofs. After verifying some statement for 1,
History of measure theory 13 2 and 3 or so Wallis claimed that it holds for any rational k. He introduced fractional exponents and claimed that for any rational к the results equivalent to the Jo k+ 1 holds. This result was proved earlier by Fermat and Torricelli but they published later. Fermat commented about Arithmetica infinitorum: "all these propositions could be demonstrated via ordinaria, legitima et Archimedea (in ordinary, correct, and Archimedean way) with much less words then there is in his book". About 1655-1660 the determination of arc length was done. First W. Neil determined the arc length of semicubic parabola (y = x^l1) and after him Ch. Wren succeeded to determine the arc length of cycloid. In all arc length determination the calculation was reduced to the quadrature of auxiliary curve. This is clear since the length of the curve у = f(x) is calculated geometrically as the integral Chr. Huygens A629-1695) and R.F. de Sluse A622-1685) studied cissoid (y = хъ/2{2а - jc)~1/2). They found that area between the cissoid and the asymptote is 3?2?, that the volume of the solid which is obtained by rotating the curve around its asymptote is 2??2?3, and most surprising that the volume of the solid which is obtained when the curve and its asymptote rotate about y-axis is 10??2?3. Surprising about it was that this solid enclosed volume which was infinite. This result was the consequence that the center of gravity of the area between cissoid and the asymptote is in the point T( ja, 0). Huygens could not make up his mind how to present his results. For him indivisibles were not rigorous enough but Archimedean proofs were too long, so in his most important mathematical book Horologium oscilatorium which he wrote about 1656, and finally published 1673, he gave only results. The next three mathematicians Blaise Pascal A623-1662), James Gregory A638-1675), and Isaac Barrow A640-1677) succeeded to make a synthesis of earlier infinitesimal methods but they presented it in a geometrical language so that their presentations had very limited influence to the further development of calculus. Pascal wrote on cycloid Histoire da la roulette where he presented most things known about it, but his writing Lettre de Amos Dettonville had great influence on the development of analysis, especially on Leibniz. In the first letter in the part Traite des sinus du quart de cercle he very successfully applied the technique of characteristic triangle in solving geometrically r ??? d{r9) = r2(cosa - sin/3), and some generalizations to the higher powers of sine. Leibniz repeatedly stated that he was lead to the invention of calculus by a study of the works of Pascal especially his application of characteristic triangle. This is the use of similarity of infinitesimal triangle L
14 D. Paunic in the point (x, y) of the curve у = f(x) formed by ??, ?}\ and As with triangle formed by y, sn (subnormal-segment of Jt-axis between ? and intersection with normal in the point (x, y)), and ? (normal-segment of the normal to the curve between (x, y) and praxis), and the triangle formed by st (subtangent-segment on jc-axis between intersection of the tangent in (x, y) and x), y, and t (tangent-segment on the tangent between ? -axis and the point (x, y)). Pascal never seriously studied algebra so he missed the possibilities which algebraic suggestive symbolism offers. Gregory presented his version of general method in Geometriae pars universalis (Universal part of geometry) published in 1668. There he proves that determination of arc length is reduced to the quadrature of auxiliary curve, but also asks the converse question: to determine a curve (u(x)) whose arc length s has constant ratio to the area of another given curve (/(*)). He solves this problem in Proposition VI and uses the ordinate of the curve to determine the tangent to the second curve which represents the area of the first curve. So he comes to the Fundamental theorem of the calculus, but his proof are totally verbal and geometrical in the Archimedean manner. So to determine result equivalent to fQ dr/ cos t he had to make six transformations. This process can be fascinating to some but algebraically it is obtained in a couple of lines. Similar result obtained Barrow in his Lectiones geometricae (Geometrical lectures) published in 1670. This book is very similar to Gregory's. It can be explained that both were in Italy for considerable time so that they were well versed in infinitesimal methods of Cavalieri, Torricelli, and Roberval. Barrow's book is more profound than that of Gregory but he also used geometrical language, correct geometrical proofs so that it was even more difficult to understand. In the lecture X, Proposition 11 is explained how to construct tangent to the curve which represents area of another curve using ordinate of another curve. In lecture XI, Proposition 19 he proves the other part of Fundamental theorem, in modern notation ь f'(x)dx = f(b)-f(a), showing that large rectangle R(f(b) - /(a)) (R is unit of measure) is equal to the area formed by all infinitesimal rectangles under the curve Rf'(x)dx. But he was rather disappointed that his book passed practically without much influence in scientific circles. The ease of calculation in algebraic method was paramount and nobody was much interested in long chains of geometrical reasoning. Everybody wanted to apply mathematics as fast as possible to obtain explanations and discoveries of the natural laws. Rigorous geometrical methods went out of fashion in mathematics for more then a century and a half. 5. Loss of measure Between 1665 and 1685 Isaac Newton A643-1727) and Gottfried Wilhelm Leibniz A646- 1716) created calculus. It were two different versions of general approach to the problem solving using infinitesimal technique. The novelty consisted mainly in two things: First, ,
History of measure theory 15 discovery that integration and differentiation are inverse operation to each other so that integration can be done by the formula (in modern notation) ь F'(x)dx = F(b)- F(a). Second, making algorithmic procedure out of it so that it can be applied systematically and generally. Their procedure, specially Leibniz's, were very similar to ours but it should be remembered that both of them never used notion of function in modern sense since it was created much later. They used "quantities", "magnitudes" or "variables" which change with one another, one independently and the other depends of the first, or both depend on time. Newton made his most important discoveries in 1664-1666, his "anni mirabiles" (wonderful years). His first systematic work on calculus was the so-called October 1666 tract on fluxions, the next were De analyst per aequationes numero terminorum infinitas (On the analysis by equations with infinitely many terms) written in 1669, spurned by the publication of series development of logarithm by Mercator in 1668, and De methodis serierum et fluxionum (On the methods of series and fluxions) written in the winter 1670-1671. He tried to publish De methodis, but it did not go easily and later he was no more interested. The second work, De analyst, had limited circulation in the English mathematical community since Newton sent it to the Royal Society in order to establish himself as a scientist which had found powerful new method for solving mathematical problems. In De analyst he presented his discovery in three rules: A) \iy = axm'n then the area under ? is ? r('"/"')+'. 4 ' J J (m/n)+\ B) If у = y\ + у2 + ¦ ¦ ·, were the sum can be finite or infinite, then the area under у is the sum of the areas under every term. C) If the area under the curve f(x, y) = 0 is to be determined then у has to be developed into the sum of terms of the form ax'"/" and the first two rules applied. To solve f{x, у) = О Newton developed "Newton method" of successive approximations /(*„) Xn-\-\ —Xn — , f'(Xn) "Newton parallelogram" for solving implicit equations, and used reversion of series. In De methodis in the first part he developed his ideas from De analyst and applied them to 12 general problems. This paper remained unfinished and unpublished till 1711. Newton used this manuscript to send two letters to Leibniz in 1676 with review of his results, but Newton was not interested to continue mathematical correspondence with Leibniz. Leibniz did not publish any book on calculus but presented his ideas in papers and letters so he changed his point of view with time. Around 1680 he wrote "I represent the area of a figure by the sum of all the rectangles contained by the ordinates and the difference of the abscissae", and that he "obtains the area of the figure by finding the figure of its summatrix or quadratrix; and of this indeed the ordinates are the given figure in the ratio of sum of differences". L
16 D. Paunic He started with publishing on differentiation and determination of extrema in a paper with long name. The name is abbreviated as Nova methodus... (New method) which was published in 1684. After it he published papers on solution of various problems. In 1693 he made his idea of integration more explicit, and in a paper Supplementum geometriae... Leibniz says: "Now I shell show that the general problem of Quadratures can be reduced to the finding of a curve whose inclinations (declivitas) have a given law" (Leibniz, 1995, p. 263). Application of calculus used three facts which were supposed to be true: A) The so-called Fundamental theorem of calculus. For any function holds ь F'(x)dx = F(b)-F(a). B) Any function can be developed into a series of functions f(x) = u0(x) + u\(x) + ui(x)-\ , particularly into power series (Taylor's formula) and later into trigonometric series. C) If / is presented by a series then one can differentiate and integrate it term-wise f'(x) = u'0(x) + u\(x) + u'2(x) -\ . pb pb pb pb ? f(x)dx= I uo(x)dx+ I u\(x)dx+ I U2(x)dx-\ Ja J a Ja Ja Since Leibniz published his results, and his method was more easily understandable and applicable many able mathematicians soon became his followers. The most prominent were brothers Jacob and Johann Bernoulli, Jacob Hermann, marquis de I'Hopital, etc. After death of Leibniz (in 1716) the most famous representative of his calculus became Johann Bernoulli A667-1748). For analysis was important his introduction of the notion of the function in 1718: "The function of a variable magnitude is called the magnitude which is composed in some way or another from this variable magnitude and constants" (Struik, 1969, p. 368). Under "in some way or another" is to be understood as an expression in which algebraic operations are applied finite or infinite number of times. His most famous pupils were marquis de I'Hopital, L. Euler, M. de Maupertuis, G. Cramer, and A. Clairaut. This notion of function was used by Leonhard Euler A707-1783) in the most influential book on analysis in the XVIII century Introductio in analysin infinitorum. In it the function is defined: "A function of a variable quantity is an analytical expression composed in any way from this variable quantity and from numbers or constant quantities" (Edwards, 1982, p. 271). The next important moment for further development of function concept was discovery of partial differential equations by Jean le Rond d'Alambert A717-1783), i.e., his analysis of vibrating string. In 1747 he published that the "general" solution of the equation / Э2.У _ тд2у dt2 ~a дх1-'
History of measure theory 17 is y(t,x) = \<p(at + ?) + \?(?? — ?), where ? and ? are arbitrary functions. The only problem for him was whether any continuous curve which can be drawn with free hand ("libera manu ducta") over a finite interval can be expressed by a single analytical expression, his idea of function. In 1749 Euler also published on vibrating string and found essentially the same solution, but he was confident that any continuous curve "libera manu ducta" over a finite interval can be represented by analytical expression if necessary by discontinuous one. By this Euler understood that curve is defined arc-wise by different analytic expressions, in modern language it is continuous over the interval but its derivative is discontinuous in finite number of points. He also notes that special solution can be ??? ?? ???? 2?? 3??? . 3?? y(t,x) = acos sin Ь/Scos sin h ? cos—-— sin—- l· ¦ ¦ ¦, which is obtained when initial condition is . ?? . 2?? . ?? у (О, ? ) = a sin— + /3sin—- \- ? sin— ? , without any determination whether the sum is finite or infinite. In 1753 Daniel Bernoulli A700-1782) published his analysis of the vibrating string problem. Bernoulli based his work on previous analysis of Brook Taylor from 1713. Taylor found that any sinusoidal curve whose half period goes whole number of times into the length of the string can be solution. Bernoulli's solution was that every solution can be obtained by superposition of these Taylorean "elementary" solutions. According to Bernoulli any solution is ????? ??? b„cos—— sin-—, n = \ since any possible initial curve over the interval [0, /] can be represented by oo *-~» ??? f{x) = 2_ansm~—, n=\ because "there is enough constants to make this series fit any curve". Neither Euler nor d'Alambert admitted Bernoulli's solution as general. In 1759 Joseph Louis Lagrange A736-1813) studied the same problem in the similar fashion as Daniel Bernoulli. He supposed that string is approximated by massless string loaded by ? equal and equally spaced masses and then passed to the limit. But in the subsequent development he at one moment interchanged sum and integral and missed possibility to discover the law of formation of the Fourier coefficients. After long and complicated calculations which had some gaps he obtained result of Euler and d'Alambert. The only consequence of all these discussions was that Euler improved his definition of function in preface of his Institutiones calculi differentialis (Lectures on differential calculus) from 1755. "If some quantities so depend on other quantities that if latter are
18 D. Paunic changed the former undergo change, then the former quantities are called functions of the latter. This denomination is of broadest nature and comprises every method by means of which one quantity could be determined by others. If, therefore, ? denotes a variable quantity then all quantities which depend upon ? in any way or are determined by it are called functions of it" (Edwards, 1982, p. 271). 6. New beginning Fundamental step, which had overwhelming influence on development of analysis, was taken by Jean-Baptist-Joseph de Fourier A768-1830). In his paper on propagation of heath submitted to the French Academy of Science in 1807, he noticed that coefficient of the trigonometric series can be easily calculated using integrals. Fourier supposed that the arbitrary function / defined over interval [—?. ?] has the development into trigonometric series 1 x f(x) = -ao + \] (an cos ? ? + bn sin nx). n=\ It remains to determine coefficients. He proposed two proofs. In the first proof he solved infinite system of equations with infinite number of unknowns to obtain coefficients. The second proof was more formal, and the same idea was used earlier by Euler and Lagrange. In order to obtain coefficients one integrates f(x) or f(x)cosnx or f{x)smnx. If the exchange of sum and integral is permitted then one obtains ao 1 fn 1 ? = — I f(x)dx, a„ = — I f(x)cosnxdx, n = l,2 , ? J-? ? у_.т 1 Г bn = — I f(x)sinnxdx, n = l,2 ? J_„ Fourier's work was amplified and finally printed in 1822 in his book: Theorie analytique de la chaleur. It is interesting that in it Fourier obtained three different expansions for \x on the interval ]0, ?/2[: 1 1 ix 1 ix = = = sinjc 2 - — sinjc ? ? 4 ~ 2 ? 1 2 " sin 2x + - sin 3.v - 3 2 3^ COSJC - 2 sin 3?· ?—— 5-? 2 - -^—cos 3x - 32? . + ... sin5.v 2 ^ _ + .. cos 5 .? This result strongly opposed the notion of the function as analytic expression. The question is what are the integrals for an and bn in case of "arbitrary" function? Fourier did not answer that question, but some years later Augustin Louis Cauchy A789-
History of measure theory 19 1857) faced the same problem when started to lecture on calculus in the Polytechnic school. In 1823 he defined that /' J a f(x)dx is the limit of the sum S = }f(xj-\)(x, — .r,--i), i' = l over every division of the interval [a,b], a = xo < x\ < ¦ ¦ ¦ < *n-\ < x„ = b such that the maximum length of the subintervals tends to 0 as ? -> oo. He introduced the modern definition of continuity (?-?) and proved that for continuous functions, this limit exists. But "arbitrary" function obviously does not have to be continuous. Peter Gustav Lejeune Dirichlet A805-1859) proved in 1829 that a function / is represented by its Fourier series over the interval ]—?, ?[ "if the function f(x), which is assumed to be bounded and single valued ("finite and determined"), has only a finite number of discontinuities between limits —? and ?, and if besides it has only a finite number of maxima and minima between these limits..." (Birkhoff, 1973, p. 146). He expanded this article in 1837 and there defined function in modern sense. Dirichlet's pupil Bernhard Riemann A826-1866) in his Uber die Darstellbarkeit einer Function durch eine trigonometrische Reihe (On the possibility of representation of a function by trigonometric series) from 1854, but published in 1868 asks the same question and answers: "First of all: What is to be understood by Ja f(x)dx1 In order to establish this, we take the sequence of values x\.x2 xn-\ 1У'пе between a and b and ordered by size, and, for brevity, denote x\ — a by A \, .vi — x\ by Л1 b — xn~\ by ?„, and proper positive fractions by ?,. Then the value of the sum S= Л;/(а+?|Л|) + Л2/(?? + ???2) ? + Л„/(л„_| +?»Л») will depend on the choice of the intervals Л, and the quantities ?,. If it has the property that, however the Л, and the ?, may be chosen, it tends to a fixed limit A as soon as all the Л, become infinitely small, then this value is called r. b f(x)dx" (Birkhoff, 1973, p. 22). After it Riemann gives criterion when function is integrable. . Secondly, let us determine the extant of the validity of this concept, and ask: in what cases is a function integrable and in what is not? We first consider the concept of integral in the strict sense, that is. we assume that the sum S converges when all the Л, become infinitely small. Thus let us denote the largest variation of the function between a and x\, that is. the difference of its largest and smallest value in this interval, by D ?, that between x\ and jtt by Di tnat between x„ _ ? and b by D„; then Л;0| + ЛтОт ? + Л„0„ must become infinitely small with the quantities Л," (Birkhoff, 1973. p. 22).
20 D. Paunic After this Riemann constructs his famous example of the function which is integrable but has an dense set of discontinuities in the set of real numbers. Riemann's ideas were soon developed in Germany, France and Italy. Especially lucidly they were developed in France by Gaston Darboux A842-1917). Darboux in his paper Memoire sur les fonctions discontinue from 1875 analyzed only bounded functions over [a,b].lf f is bounded function then for every partition ? = xq < x\ < ¦ ¦ ¦ < x„~\ < xn=b he formed the expressions M(n) = M\S\ ? h M„S„, m(n)=m\S\-\ \-mn&„, ? = ?\&\ +--- + ?„8„, where ?,- = jc, — *,¦_ ?, ?,- the least upper bound for / over <5,, w, the greatest lower bound of / over <$,-, and ? = ?,- - w, the so-called "oscillation" of the function in the interval. He showed that M(n), m(n), and ?(?) converge to the unique limit which depends only on a, /), and /, as ? —> oo and <5, —> 0. Later the suggestive notation / f(x)dx= lim y^rriiSi and / f(x)dx= lim У^М,-5,-, Ja_ /=, J a ( = 1 was introduced by V. Volterra in 1881. With this preparation it easily follows that for Darboux sum ? S(n,S,e) = YjSif(xi-l+0lSi). 1 = 1 which depend only on ?, 8 = (<S|,.. ..<$„), ? = (?\, ...,??), 0 sC #, sC 1, fori = l,...,n, holds m(n)^S(n,S,e)^M(n), since w,- ^ f(xi-\ + ??&?) ^ Mi. This is his THEOREM 2. Darboux sums S(n, S, ?) tend to the unique limit iff ?(?) tends to zem or equivalently if Jfb rb I f(x)dx= / f(x)dx. a J a In that case Darboux denotes the limit by / f(x) dx. As consequences he proves: • Every continuous function is integrable in Darboux sense. • Mapping ? (->¦ f* f{t) dt is continuous in ?.
History of measure theory 21 • Let F(x) = f* f(t) dt. If / is continuous in xo then F'(jto) = /C*o)· • If the function F, defined over [a,b], has bounded and integrable derivative / = F' then F(x)-F(a)=l f(t)dt, for every ? e ]a.b[. Ja Similar construction of Riemann's integral were presented in the same year by J.K. Thomae A840-1921), G. Ascoli A843-1896), P. du Bois-Raymond A831-1889), and H.J.S. Smith. Georg Cantor A845-1918) in sequence of papers between 1879 and 1884 investigated the properties of infinite linear point sets. There he introduced notion of the first species set, set whose «th derived set is finite, and investigated dense sets, nowhere dense sets etc. He actually gave these names although these sets were recognized as important earlier. Most mathematicians of the period were impressed with Riemann's idea of integration and tried to developed them but soon turned out that knowledge of topology was insufficient to make much further progress. Dirichlet's example of the characteristic function of rational numbers f(x)= lim ( lim cos2"m\nx ), which is not Riemann integrable prompted the analysis of dense and nowhere dense sets, but these are topological notions which do not have great influence on measure theory since dense set (topologically large) can have zero measure and nowhere dense set (topologically small) can have positive measure. So H. Hankel, A. Harnack, and R du Bois-Raymond published some interesting examples but made wrong conclusions. One of their mistakes was that in their proofs it was used as fact that nowhere dense sets are equivalent to first species sets. Ulisse Dini A845-1918) showed 1878 that the first species sets are of zero content, so that they cannot have influence on the integrability of functions. English mathematician Henry John Stanley Smith A826-1883) was the first who, in his paper On the integration of discontinuous functions from 1875, constructed nowhere dense set, he called it "in loose order", of positive measure. But his paper remained without influence, probably since his main interest was in number theory, and the reviewer noticed only that in the paper the sharper version of Riemann's integration theory was presented. Vito Volterra A860-1940) published practically the same construction in 1881 but it also did not have much influence. When the same fact was discovered independently for the third time by R du Bois-Raymond in 1880 its importance was finally recognized. 7. Newly found measure In 1881 appeared the outer content of a set. It was introduced by Otto Stolz A842-1905) for subsets of an interval and bounded sets of the plane. Independently from him G. Cantor introduced equivalent definition of content and extended to the и-dimensional space, but
22 D. Paunic he made no distinction between a set and its closure. In 1885 Harnack proposed his version content but he was so fascinated with dense sets that he could not admit that dense set could have measure 0. He introduced infinite covers but had no clear idea of boundary of a set so his results remained incomplete. Giuseppe Peano A858-1932) using upper and lower integrals in 1883 introduced most natural definition of area by inscribed and circumscribed polygons. This theory was developed in Applicazioni geometriche del calcolo infinitesimale from 1887. In it he defined that if the greatest lower bound of the area of circumscribed polygons is equal to the least upper bound for the area of inscribed polygons then the set has an area. Although Peano introduced the notion of measurable set their importance for the integration theory was fully recognized by Camille Jordan A838-1922). He studied it in 1892 in the context of double integrals and applicability of Fubini's theorem, i.e., when double integral is equal to the iterated integrals and when the order of integration of iterated integrals can be interchanged. Jordan remarked that notion of function of two variables was clear enough but the notions of its domain of definition remained intuitive and not well-developed. So he carefully analyzed point sets in plane and defined interior, exterior, boundary, limit points, and closed set. He succeeded to use measurability of sets to construct double and multiple integrals in the Riemann sense with clarity, generality, and suggestiveness they never had before. This theory he included in the second edition of his celebrated Cours d'Analyse from 1893-1896 which was in three volumes. Further advances in measure theory were made in the work of Emil Borel A871-1956). In his thesis he studied the problem of analytical continuation, particularly the problem of analytical continuation over the boundary on which the singularities are dense, and analyzing them he used theorem which later got the name Heine-Borel theorem. Instead to approximate the size of bounded sets from outside by covering them by intervals he used Cantor's result that every open set of the real line is union of denumerable family of disjoint open intervals he simply took for its measure the sum of the lengths of the components. In his book on theory of functions from 1898 he writes: "When a set is formed of all the points comprised in a denumerable infinity of intervals which do not overlap and have total length s, we shall say that the set has measure s. When two sets do not have common points and their measure are s and s'. the set obtained by uniting them, that is to say their sum, has measure s + s'. More generally, if one has a denumerable infinity of sets which pairwise have no common point and having measure s |, si, s„ . their sum ... has measure s ? + si + ¦ ¦ ¦ + sn ? All that is consequence of the definition of measure. Now here are some new definitions: If a set ? has measure s and contains all the points of a set E' of which the measure is s', the set ? — ?', formed of all points of ? which do not belong to E', will be said to have measure s —s'... . The sets for which measure can be defined by virtue of the preceding definitions will be termed measurable sets by us ..." (Hawkins, 1970, pp. 103. 104). He described the family of sets which is obtained from open sets by infinitely iterating countable unions and difference and showed that over that family (later called Borel sets) completely additive measure can be defined, but nowhere in his book the notion of measure is connected to the integration. Henry Lebesgue A875-1941) announced his work on measure theory and integration in five research announcements from June 1899 to April 1901 in Comptes Rendu, journal of the French Academy of Science. This work is contained in his thesis which appeared
History of measure theory 23 1902. In it he developed Borel's ideas on measure with greater clarity and generality. He introduced Lebesgue measure, axiomatically as nonnegative function ? defined on bounded sets of the real line such that A) Two equal sets have the same measure. B) The measure of the set which is the sum of finite or countable infinite number of sets, which are pairwise disjoint, is the sum of measures. C) The measure of all points in ]0, 1 [ is 1. After it he introduced Lebesgue integral which is his totally new invention. In order to define integral Lebesgue partitioned range of the function / instead its domain. If m denotes the greatest lower bound and the ? the least upper bound of / over [a, b], ? denote the partition m = ко < k\ < ¦ ¦ ¦ < fc„_| < k„ = ?, of the interval [m, M], and Ei = {xe[a,b]: ?,_, </(*)<*,·}, / = 1,2,... . Let || ? || denotes the maximum of the differences fc, — fc,_i then rb def " '< / /= lim VV|M(?,)= lim Vfc;M(?,). J a II1-0^ IIPIHO^ Lebesgue's integral solved a lot of problems in analysis. Many results got their natural formulation and easy proofs since generalization to any set ? on which the measure is defined is immediate. In his thesis Lebesgue spent much effort to determine connection between integral and primitive function. He proved that if /' exists and is bounded on [a, b], then /' is summable and /д* /' = f(b) — f(a). He gave some conditions in the case /' is not bounded. The most useful result in Lebesgue's thesis was that if (/„),1€ц is a sequence of measurable functions defined on a measurable set ? such that |/«(-*) I ^ В f°r aU x ш ?, and all n, and if lim f„(x) exists, then / lim /„(*)= lim / f„(x). In 1908 Lebesgue generalized it and obtained his well-known "Dominated convergence theorem". This theorem came at the end of a long search for the solution of the problem whether it is permissible to integrate series term by term. Cauchy and earlier mathematicians thought that it is permissible. First counterexample gave Niels Henrik Abel A802-1829), but he died too young to pursue these ideas further. That additional sufficient condition is uniform convergence became clear to Karl Weierstrass A815-1897) around 1841, and he emphasized it in his lectures when he became the professor of mathematics at Berlin University in 1856. Since it is also the sufficient condition for the continuity of the sum
24 D. Paunic of continuous functions there was not much research in this direction. That the sum of a series of continuous function is continuous if the convergence of the series is uniform was discovered independently by Philip Seidel A821-1896) in 1850 and George Stokes A819-1903) in 1848. It became clear in the seventies of the XIX century that uniform boundedness is sufficient with some additional condition to insure integrability. P. du Bois- Raymond attacked the problem in 1886 and gave complicated and wrong additional conditions. Finally, the problem was solved by William Fogg Osgood A864-1943) in 1897 and Cesare Arcela A847-1912) in 1885 (Arcela result remained not widely known until republication in 1900), in a very complicated way and with some additional conditions which are difficult to check. Arcela's conditions are slightly more general. Lebesgue solved the problem efficiently, more generally and with easy proof. Measure-theoretic ideas of Jordan and Borel stirred other mathematicians to further developed ideas on measure. In Italy Giuseppe Vitali A875-1932) created theory of measure similar to Lebesgue's but without integration. When in 1904 became acquainted with Lebesgue's work Vitali improved on it and in 1905 proved that a function is an integral iff it is absolutely continuous. Using axiom of choice Vitali proved in the same year the existence of nonmeasurable set on real line, but Lebesgue did not recognize axiom of choice because it is too idealistic. William Henry Young A863-1942) was doing research on measure and integration independently of Lebesgue from 1902 to 1905. He defined the integral based on countable additive measure which was similar to Lebesgue's but not so general and Young did not prove all the important consequences which follow from it. Lebesgue's thesis came in very important moment in development of analysis. In a short note published in 1900 Ivar Fredholm A866-1927) has introduced his methods for solving integral equation f(s) = <p(s)+ / K(s,t)<p{t)dt, J a for unknown function ?, which showed that general theory of all integral equations can be made very simple. This work was completed in his subsequent paper published in 1903. Fredholm's theory was further developed by David Hubert A862-1943) who published six papers between 1904 and 1906 on integral equations. The fundamental was the fourth paper from 1906 where, in modern terminology, he presented spectral theory for bounded linear operators in Hubert space. Hubert's work on integral equations and Lebesgue's book on trigonometric series stimulated Fridyes Riesz A880-1956) in 1906 to analyze if (а„)п€щ is a square summable sequence whether there is a function such that a„ = f^ ??„ for some complete orthogonal system of square-summable functions (&>„);ieN· Не found that the answer is positive. The same theorem was independently discovered in the same year by Ernst Fisher A875-1959). This discovery opened the road to the generalizations which led to Lp spaces. F. Riesz also directed the development of analysis in another direction. In 1909 he proved that Stieltjes integrals / i-> /' / ?? are the most general continuous linear functionals on C[a,b]. Thomas Jan Stieltjes A856-1894) introduced his integrals in 1894. Stieltjes
History of measure theory 25 studied the convergence problem of continuous fractions and for its solution introduced generalization of Riemann integral rb " / fd<p = ,J}m У]/(c<)M*<)-?(¦*!-1)). Ja ||Р||->-ос*—' i = l where ? is partition a = xq < x\ < ¦¦¦ < jc„_i < x„ = b, f is continuous, jc, _ ? ^ c, ^ x,. Stieltjes noticed that his integral could be extended to larger class of functions, "but it is of no interest to accord complete generality to the function /(к)" (Hawkins, 1970, p. 181). In 1913 Johann Radon A887-1956) used Lebesgue's procedure to obtain an integral using arbitrary completely additive set function instead of Lebesgue's measure. This paved the road to generalization of measure theory to arbitrary sets via ?-rings and to general representation of measure by Otton Nikodym A887-1974) obtained in 1930. Translational invariance of Lebesgue integral has generalization with important application in the representation theory of groups. In 1897 Adolph Hurwitz A859-1919) used integrals to generate invariants for SO(n) and SU(n). In 1924 Issai Schur A875-1941), while studying rotation group in «-dimensional space, recognized that much of character and representation theory of finite groups remains valid if, instead of summation over group elements, suitable integration is applied over the compact manifold formed by the elements of the rotation group. Hermann Weyl A885-1955) generalized this work and obtained explicit expression for irreducible characters of compact Lie groups. This research culminated in Peter-Weyl theorem, perfect analog of decomposition of the regular characters in irreducible components for finite groups. It became clear that the Peter-Weyl theorem could be proved for a class of topological groups for which left invariant measure exists. In 1914 Felix Hausdorff A868-1942) proved that there is no translational invariant positive finite additive set function defined for all sets in R3. In 1923 Stefan Banach A892-1945) solved positively the same problem for R and R2, and in 1924 together with Alfred Tarski A902-1983) gave his famous counterexample for R3 that any two balls B\, Bi can be decomposed into finite number of congruent sets, so ? ? = (J"= \S,, ?? = U/'= ? d G?)'where C, are congruent transformations. In 1933 Alfred Haar A885-1933) constructed left invariant measure on separable locally compact topological group. His result was completed by John von Neumann A903-1957) who proved its uniqueness for compact groups in 1934 and in general case in 1936, and independently from him by Andre Weil A906-1998). The great success of measure theory was that it can be used as foundation for the axiomatization of probability theory, which was done by Andrei Nikolaevich Kolmogoroff A903-1987) in 1933. References Aaboe, A. A964), Episodes from the Early History of Mathematics, Random Hause, New York. Andersen, K. A985), Cavalieri's method of indivisibles. Arch. Hist. Exact Sci. 31, 291-367. Baron, M.E. A969), The Origins of the Infinitesimal Calculus, Pergamon Press, Oxford. Birkhoff, G. (ed.) A973), A Source Book in Classical Analysis, Harvard University Press, Cambridge. Bourbaki, N. A969), Elements d'Histoire des Mathe'mathques, Hermann, Paris.
26 D. Paunic Boyer, C.E. A959), The History of the Calculus and its Conceptual Developments, Dover, New York. Boyer, C.E. and Merzbach, U.C. A989), A History of Mathematics, Wiley, New York. Edwards, C.H., Jr. A982), The Historical Development of the Calculus, Springer, New York: I st edn 1979. Elstrodt, J. A996), Mass- und Itegrationstheorie, 2nd edn. Springer, Berlin. Grattan-Guinness, I. A970), The Development of the Foundations of Mathematical Analysis from Euler to Riemann, MIT Press, Cambridge. Grattan-Guinness, I. (ed.) A980), From the Calculus to Set Theory 1630-1910, Duckworth, London. Grattan-Guinness, I. A997), The Fontana History of Mathematical Sciences, Fontana Press, London. Hausdorff, F. A914), Grundzuge der Mengenlehre, Veit & Co., Leipzig. Hawkins, T. A970), Lebesgue's Theory of Integration, University of Wisconsin Press, Madison. Heath, T. A921), A History of Greek Mathematics I, II, Clarendon Press, Oxford. (Republished in 1981 by Dover.) Jahnke, H.N. (ed.) A999), Geschichte der Analysis, Spektrum AV, Heidelberg. Katz, V.J. A998), A History of Mathematics, 2nd edn, Addison-Wesley, Reading. Knorr, W.R. A975), The Evolution of the Euclidean Elements, Reidel PC, Dordrecht. Kolmogoroff, A.N. A933), Grundbegriffe der Wahrscheinlichkeitsrechnung, Springer, Berlin. Lebesgue, H. A904), Legons sur ? Integration et la Recherche des Functions Primitives, Gauthier-Villars, Paris. Bnd edn, 1927.) Lebesgue, H. A966), Measure and the Integral, Holden-Day, San Francisco. Leibniz, G.W. A995), La Naissance du Calcul Differentiel, Vrin, Paris. B6 articles from Acta Eruditorum.) Medvedev, F.A. A974), Development of the Integral Notion, Nauka. Moscow (in Russian). Medvedev, F.A. A975), Lectures on the History of the Functions of Real Variable, Nauka, Moscow (in Russian). Paplauskas, A.B. A966), Trigonometrical series from Euler to Lebesgue, Nauka, Moscow (in Russian). Pesin, I.N. A966), Development of the Integral Notion, Nauka, Moscow (in Russian). Pier, J.-P A994), Integration et mesure 1900-1950, J.-P Pier (ed.). Development of Mathematics 1900-1950, Birkhauser, Basel, 517-564. Saks, S. A930), Theorie de Vlntegrale, Warszawa. Serres, M. (ed.) A989), Elements d'Histoire des Sciences, Bordas, Paris. (German translation: Elemente einer Geschichte der Wissenschaften, Suhrkamp, Frankfurt A998).) Struik, D.J. (ed.) A969), A Source Book in Mathematics 1200-1800, Harvard University Press, Cambridge. (Republished in 1986 by Princeton University Press, Princeton.) van der Waerden, B.L. A961), Science Awakening, Oxford University Press, Oxford. Weil, A. A940), I'Integration dans les Groupes Topologiques et ses Applications, Hermann, Paris. Whiteside, D.T. A960-1962), Patterns of mathematical thought in the later 17th century. Arch. Hist. Exact Sci. 1 179-388. Zaanen, A.C. A967), Integration, North-Holland, Amsterdam.
CHAPTER 2 Some Elements of the Classical Measure Theory Endre Pap* Institute of Mathematics, University ofNovi Sad, Trg D. Obradovica 4, 21000 Novi Sad, Yugoslavia E-mail: pap@im.ns.ac.yu, pape@eunet.yu Contents Introduction 29 1. Measurable functions 30 1.1. Classes of sets 30 1.2. Set functions 32 1.3. Step functions 34 1.4. Measurable functions 34 2. Measures 35 2.1. Positive measures 35 2.2. Measure spaces. Extension of measures 36 2.3. Completion of a measure space 38 2.4. Measures with finite variation 39 2.5. The variation of real-valued measures on a ring 40 2.6. Measurability with respect to a positive measure 41 2.7. Examples of measures 43 2.8. Lebesgue measure on K1' 47 3. Integration 50 3.1. The immediate integral 51 3.2. Integral of positive step functions 53 3.3. Integral of positive functions 54 3.4. The classical integral 55 3.5. Lebesgue and Riemann integrals 59 3.6. Integration with respect to a real-valued measure 61 3.7. Density theorem 63 3.8. Absolutely continuous functions 63 3.9. Absolute continuity of measures 65 3.10. The Radon-Nikodym theorem 65 3.11. Conditional expectation 67 3.12. The Lebesgue-Stieltjes integral 68 *The author wants to thank for the partial financial support of the Project in the Fields of Basic Research "Mathematical models of nonlinearity, uncertainty and decision" A866) supported by Ministry of Science, Technology and Development of Serbia. HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 27
28 ?. Pap 3.13. Transformation of coordinates 68 4. Function spaces 69 4.1. Essentially bounded functions 69 4.2. The space LP (?) 70 4.3. Convergence of sequences of functions 71 5. Product measures and the convolution 72 5.1. Product measures 72 5.2. Repeated (iterated) integral 73 5.3. Infinite products 75 5.4. Convolution 77 5.5. Integral transforms 77 6. Regular measures 79 6.1. Topological measures 79 6.2. Convolution of measures 81 References 81
Some elements of the classical measure theory 29 Introduction In this chapter we shall give some basic notions and related results from the classical measure theory in a very condensed form. The main purpose is to make it easier to follow the whole Handbook. A measure space is a set in which some subsets may be assigned a measure. The usual notions of length or area or volume can be applied only to reasonably regular sets. Today measure theory is remarkably powerful and allows that a large class of sets are adequately regular to be measured. The class of sets which it measures have to be firstly specified. The classical abstract integral presented here is closely related to the main particular case - the Lebesgue integral based on the Lebesgue measure. The standard integral as defined here is an 'absolute' integral, i.e., if / is integrable so is |/|. This means that although the Lebesgue integral extends the 'proper' Riemann integral, there are functions with finite 'improper' Riemann integrals which are not Lebesgue integrable. We want to emphasize that the natural domain of a ?-additive measure is <5-ring or a ?-algebra - but not a ?-ring. In fact, the measurability and the integral involve only sets of finite measure, which form a <5-ring (or a ?-algebra, if the measure is finite). A finite, ?-additive measure on ?-ring can always be extended to a finite, ?-additive measure on the ?-algebra generated by the ?-ring. One of the goals is also the identification of those real functions which are indefinite integrals. One of the central results of measure theory is relating countably additive measures to indefinite integrals. The objective is to give a complete description of the set functions which can arise as indefinite integrals of integrable functions. These can be characterized as the 'truly continuous' additive set functions (Fremlin, 2000a), or more commonly used concept, is that of 'absolutely continuous' additive set function. One of the most important properties of the Lebesgue integral is related to the following problems. Given an integrable function /: [a, b] —» R, we can form its indefinite integral F(x) = /ax f(t)dt for ? e [a, b]. Two questions arise: (a) Can we expect to have the derivative F' of F equal to /? (b) Can we characterize those functions F which will appear as indefinite integrals? There are reasonably satisfactory answers for both of these questions: F' = f almost everywhere and indefinite integrals are the absolutely continuous functions. We present some important function spaces as L° (the space of equivalence classes of all real-valued measurable functions, in which all the other spaces of the chapter are embedded), L1 (equivalence classes of integrable functions), L°° (equivalence classes of bounded measurable functions) and V (equivalence classes of pth-power-integrable functions). One of the most important feature of the concept of product measure is the fact that we can use it to discuss repeated integrals. We give versions of Fubini's theorem and Tonelli's theorem. In the preparation of this auxiliary chapter there were used many different sources. The author wants to stress the useful Fremlin B000a, 2000b), where one can find the proofs and additional explanations. I thank N. Dinculeanu for his useful advice.
30 ?. Pap 1. Measurable functions 1.1. Classes of sets 1. Throughout the paper, S is a non-empty set, V, 7?, A, V, «S, ? are, respectively, a semiring, a ring, an algebra, a <5-ring, a ?-ring, a ?-algebra of subsets of S. A semiring of sets is a non-empty class 73 of subsets of 5" closed under intersection ??) ? and having the property that for any pair (A, B) of sets from V with А с S. there is a finite family (Ci)o^i^n of sets from V such that A = Co С C\ С · · · С C„ = ? and C,\C,-ie-p, forlsC;iC«. An important example of a semiring is the class of intervals of the form ]a,b]. A rwg is a non-empty class of subsets of S closed under union A U В and difference A\B. An algebra is a ring containing the whole set S. A S-ring is a ring closed under countable intersections. A ?-ring is a ring closed under countable unions. A ?-algebra is a ?-ring containing S. A monotone class is a non-empty class ? of subsets of S, closed under unions of increasing sequences and under intersections of decreasing sequences. Every ring is a semiring; every <5-ring is a ring; every ?-ring is a <5-ring and a monotone class. For any class ? of subsets of S there is a smallest semiring (respectively, ring, algebra, <5-ring, ?-ring, ?-algebra, monotone class) containing T, denoted by sr(T) (respectively, г{Т),а{Т),&г{^),аг(Т),аа(Т),М{Т)), and called the semiring (respectively, ring, algebra, <5-ring, ?-ring, ?-algebra, monotone class) generated by ?. Examples. (a) The ?-algebra of subsets of S generated by 0 is {0. S]. (b) The ?-algebra of subsets of N generated by {{«}: ? e N) is V(N) (the power set ofN). (c) The class V\ (respectively, V\ [Щ) of intervals of the form ]a, b] with a, b e R (respectively, a, b e R) is a semiring on R (respectively, R). (c') The class V\ of intervals ]a. b] С R with a. b rational, is a countable semiring on R. (d) The class Рт (respectively, P2M) of intervals of the form [a, b[ with a, b e R (respectively, a, fo e R) is a semiring on R (respectively, R). (d') The class V-, of intervals [a,b[cR with a, b rational, is a countable semiring on R. (e) If S, ? are non-empty sets and if V, Q are semirings on S, T, respectively, then the class V ? Qof rectangular sets ? ? ? with A eV and В е Q is a semiring on the product set S xT.lfV and Q are rings, the class PxQis not necessarily a ring.
Some elements of the classical measure theory 31 (f) If S is a topological space, the ?-algebra generated by the class of open sets is called the Borel ?-algebra of S and is denoted by B(S). The sets of B(S) are called Borel subsets of S. The ?-algebra B(S) is also generated by the class of closed subsets of 5". (g) Let D с К be a dense set (e.g., the set of rational numbers). The Borel ?-algebra В(Ш) is generated by each of the following classes: the class of intervals ]a, b] with a,b e D; the class of intervals [a,b[ with a,b e D; the class of open intervals ]a, b[ with a,b & D; the class of closed intervals [a, b] with a,b e D. In particular, B(R) is generated by any one of the semirings V\,V\,V^,V~,. (g') Let D с Ш be a dense set. The ?-algebra В(Ш) is generated by each of the following classes: the class of intervals ]a, b] with a e D and b e D U {+00}; the class of intervals [a, b[ with ? e DU{- 00} and/? e D; the class consisting of intervals ]a, b[ with a,b e D and {+00}; the class of intervals [a, fo] with a e D and fo e DL)[+oo]. In particular, B(R) is generated by the semiring V\ [M], or by the semiring PifR]. (h) Let S, ? be two non-empty sets and <S, ? ?-rings (respectively, ?-algebras) on S, T, respectively. The ?-ring (respectively, ?-algebra) generated by the semiring of rectangular sets ? ? ? with A e <S and В е T, is called the product ?-ring (respectively, product ?-algebra) of S and ? and is denoted by S <g> T. When no confusion is possible, <S ® ? is also denoted S xT. (h') One can define similarly the ?-ring (respectively the ?-algebra) ® |<,<„ «S, of ? ?-rings (respectively, ?-algebras) 5, on the set 5,-, 1 ^ /' ^ /;. Properties. If S is a countable class, then the ring r(S) generated by S is countable. The ring r{V) generated by a semiring V, consists of all finite, disjoint unions of sets from V. The ?-ring ar{T>) generated by a S-ring T>, consists of all countable, disjoint unions of sets from V. Let ?? be a ring of subsets of S and (A,),e/ a finite family of sets from 1Z. Then there is a finite family (Bj)j€j of disjoint sets from TZ such that each set A, is the union of a certain subfamily of (Bs )j€j. Let S be a class of subsets of 5" and A e 5". Denote 5ПА = {СПА: Се S]. Then гEПА) = гE)ПА and агEПЛ) = агE)ЛЛ. Definition. Let (?„)„ем be a sequence from T. Then we define DC DC DC DC limsup?„ = p| [J Ek and liminf?„ = [J P| ?ь n=\k=n n=\ k=n whenever the right parts are meaningful. If lim sup ?n = lim inf E„ ? » holds we denote this set by lim,, E„. MONOTONE CLASS THEOREM. Let 1 be a class of subsets ofS such that ? ? J e 1 for all I, J el. Let M. с V(S) be a family of sets (monotone class) such that
32 ?. Pap (a) SeM, (b) ? \ A e ? whenever А, В e ? and А с ?, (c) UneN ^« e -^ w/?e/iever (A„),ie[i й ? non-decreasing sequence in M, (d) I CM. Then ?4 includes the ?-algebra of subsets of S generated by I. Specially, let TZ be a ring on S. The monotone class M(TZ) generated by TZ is equal to the ?-ring ??(??) generated by TZ. EXAMPLE (Dynkin system). A family ? с V(S) is called a Dynkin system on S if (a) SeT, (b) Ac e ? whenever Aef, (c) UneN A" e -F whenever (А„)„е^ is a sequence of pairwise disjoint sets in T. A Dynkin system is a monotone class. For any class ?, we denote by Т\ж the class of subsets А с S that are locally in T. i.e., such that А П ? e J" for every В e T. If 4 is a ring, 7?.|OC is an algebra. If ? is a <5-ring and <S is a ?-ring, then T>\oc and <S|0C are ?-algebras. The characteristic function of a set А с S is denoted by хд, 1 д or /д. 1.2. Set functions Let J" be a class of subsets of a set 5 with 0 e f. We give now a list of basic notions related to the set functions. 2. Definition. A function m:T -> [-oo, +oo] is called extended real-valued set function. We say that (we suppose further on that w@) = 0) A) m is (finite) real-valued set function if —oo < m{A) < +oo for all Aef. B) m is monotone if А С ? implies m( A) ^m(B) for every А, В е Т. C) m is normalized if m(S) = 1 for S e .F. D) w is non-negative if it is finite and m(A) ^ 0 for all A ef. E) w : ? -> [0, +oo] is positive. F) w : 7? -> R is supermodular (convex) if for every ?, ? e Я we have m(AL)B)+m(AnB) ^m(A) + m(B). G) w : 4 -> R is superadditive if for every А.ВеЯ with ? ? ? = 0 we have w(AUfi) ^m(A) + m(fi). (8) m : TZ -> R is submodular (concave) if for every ?, ? e TZ we have »i(AUB)+m(Anfi)^m(A)+m(fi). (9) m : 72. -> R is subadditive if for every А, В eTZ with ? ? ? = 0 we have w(AUfi) iCm(A)+m(fi).
Some elements of the classical measure theory 33 A0) т:И-+ [0,+oo[ is totally monotone if for every ? e N and A\,...,A„ e 7? we have -(UA·)^ ? (-1)|/|+|-(ПА<)· \?=? / {/:/С{1 и)) Me/ ' A1) т : .F -> ? is additive (signed finitely additive measure) if we have m(AUS) = m(A)+m(S), for all A,B eF with ? U ? e ? and ? ? ? = 0. Specially for w : 11 -> [0, ooL and 5 = U^eT^ ^ we ca^ m a^so content. A2) m :T —* ? is finitely additive if we have In \ ? U?i)=5>(?i), w ? for every finite family of pairwise disjoint sequences (?j)"=i from .F, i.e., ?„ П ?m = 0 for и ^ w, and (J"_, ?, e J". A3) w : ? -> ? is ?-additive (signed measure) if we have oo m I u = l / i = l |j?, )=][>(?,¦), for all pairwise disjoint sequences (?,-),-efj from F, i.e., ?„ ? ?„, = 0 for /; ? m, andU",?ie^· A4) m :?? -> ? is a measure if it is nonnegative and ?-additive. E, .?7, ?), is aproba- bility space if ?E) = 1. In this case ? is called a probability or probability measure. A5) w: ? -> ? is continuous from below if for every sequence (?„),ieN such that ?i С Е2 С ···,?„ e ?, ?^=? ?n e F we have / ОС m I [JE„ I = lim m(E„). /1—>OC Mi = l / A6) w:? -> ? is continuous from above if for every sequence (?„)„eN from ? such that ?| Э ?2 Э ···,?„ e 7?, ?,%? ?» e -^ and tnere exists  such tnat \m(E„0)\ < 00 we have / 00 m I ОБ,, ) = lim w(?„). /1—>DC wi = l /
34 ?. Pap A7) m is exhaustive (s-bounded) if lim,,^^ m(E„) = 0 for every sequence (E„)n€n of pairwise disjoint sets from T. A8) A positive set function m is called ?-finiteon ? if for every set ? e ? there exists a sequence (E„ )„€щ such that DC [J En = ? and m(E„) < do, for every /; e N. »=i m:T -* [0, oo] for S e ? is totally finite if /n(S) < oo. 1.3. Step functions 3. Let J" be a non-empty class of subsets of S and F a vector space (for example, E", C, E). We denote by Sf(!F ) the vector space of JF-step functions /: 5 -> F of the form / = ?"=l xAjXi, with A, e J" and x, e F. If F = ? we write S(T) instead of <S;r(.F). Remark. There is no need to consider step functions with values in E. Any constant function is an .F-step function. We remark that if V is a semiring and Tl = r(V) is the ring generated by V, then Sr(V)= S/-(K). If ? is a ring, in the definition of .F-step functions the sets A, can be chosen mutually disjoint. LEMMA. Let H be a ring and f: S -> F an TZ-step function f = ???? ??,?" with A, e TZ and Xj e F. Then f can be written in the form f = J2jeJ XS/-vy· w'1^ J finite, Bj e TZ mutually disjoint and v, e F. Moreover f can be written uniquely in the form f = ^2j€j ??,??- w'th )'j ? 0 for every j e J and y-; ? y^ for j ? к in J. Then ?/? = ???,?>??· where for any function /: 5 -> F or E, we denote by |/| the function defined by |/|(?)=|/(?)|, fori e S. 1.4. Measurable functions Measurability is defined with respect to a ?-algebra. We shall consider, mainly, measura- bility of real-valued functions, with values in E. Let ? be a ?-algebra of subsets of S. The pair (S. ?) is called a measurable space. The elements of ? are called ?-measurable sets. In the sequel, E". ?) is a measurable space.
Some elements of the classical measure theory 35 4. Definition. A function /: 5" -> ? is said to be ?-measurable, if for every Borel set В С ? we have/-1 (A) e Г. If S is a topological space and Г = ^E), the Borel ?-algebra of S, a ?-measurable function /: S -> ? is called a Borel function. Any continuous function /: 5 -> ? is a Borel function. Examples. (a) Any constant function / :S -> ? is .^-measurable. (b) If А с 5, then the characteristic function хд : S -> ? is ?-measurable if and only if Л е 2Г. (c) Every iT-step function /: S -> ? is .^-measurable. 5. THEOREM. Let f :S -*Ж be ? ?-measurable function. Then there is a sequence (/«)neN of ?-step functions f„ : S -> ? .ягс/г ?/??? /„ -> f pointwise and |/„| ^ |/|/or еас/г и. //¦/ й positive (with values in [0, +oo]), ?/ге sequence (/„)„е[,- саи foe chosen to be increasing. If f is real-valued and bounded, the sequence (/„)„ек can foe chosen to be uniformly convergent. The following theorem gives a characterization of iT-measurability. .T-measurability is preserved by pointwise convergence: 6. THEOREM. //(/„)//eN is a sequence of Ж valued, ? -measurable functions converging pointwise to a function f, then f is also ? -measurable. 2. Measures 2.1. Positive measures 7. A positive measure m:TZ-* [0, +oo] has the following additional properties: If ? is positive and additive on TZ, then ? is increasing. If ? is positive and additive on TZ, then ? is subadditive. If ? is positive and ?-additive on TZ, then ? is ?-subadditive, i.e., for every sequence (A„ )neN °f sets from TZ with union in TZ, we have ?? |JA„ U]TM(A„). \/ieN / neN Conversely, if ? is positive, additive and ?-subadditive on TZ, then ? is ?-additive. A positive additive measure ?: TZ -> [0. +oo] on a ring 72. has the ^ш;'7е measure property (FMP) on TZ, if for every set A e TZ we have ?(?) = 5??{?(?): Ве7гПА, ?(?) < oo).
36 ?. Pap If ? does not have the FMP on Tl, we can replace it with the measure ? : TL -> [0, +00] defined for every set A e 7? by m'(A) = sup{m(B): ? е7гп ?. ?(?) < oo}. Then ? has the FMP and ? (A) = ?(?) if ?(?) < oo. If ? is ?-additive then ? is ?- additive. 2.2. Measure spaces. Extension of measures 8. The framework for this section is a measure space (S. ?, ?), consisting of a nonempty set 5", a ?-algebra ? of subsets of 5" and a positive, ?-additive measure, with finite or infinite values, ?: ? -> [0, +oo], i.e., such that (a) ?@) = ?; (b) if (?n)„eN is a disjoint sequence in ?. then Members of ? are called measurable sets, and ? is called a measure on S. ?, or (S1. ?1, ?), is totally finite if ?E) < oo. Remark. The '00 of measure' corresponds to the notion of infinite length or area or volume. The basic operation we need is addition: ?? + ? = ? + ?? = ?? for every a e [0, oof, and oc + 00 = 00. This makes [0, 00] a semigroup under addition. We take oc¦ — a = 00 for every a e E; but we avoid to interpret the formula 00 — 00. As for multiplication, it turns out that it is usually right to interpret 00 · 00, a ¦ 00 and 00 · a as 00 for a > 0, while 0 · 00 = 00 · 0 can generally be taken as 0. 9. Example. Let 5" be a set. If /: 5" -> [0. 00] is any function, then for every ?CS we define ?(?) = ??^? /(*) (taking Х]хец/<*) = 0), discrete measure. For infinite sets ? we have X](e?/U) = sup{]Tve/ /(¦*): / <? ? is finite), because every f(x) is non- negative. (S, V(S), ?) is a measure space. A special case of the above is when f(x) = 1 for every ? e S. Then ?(?) is the number of points of ? if ? is finite, and is 00 if ? is infinite. We will call this the counting measure on S. Another special case is with S = N, /(л) = 2-" for every и. Then ?E") = ?,^?2"" = 1. 10. Our purpose is to extend a positive, ?-additive measure ? from a semiring V to the ?-algebra ? = aa(V) and to obtain a measure space E, ? .?). This will be done in several steps. We consider first extensions of an additive measure from a semiring to the ring generated by it. THEOREM. Let V be a semiring, ?? = r(P) the ring generated by V and m:V —* F or [0, +00] a finitely additive measure. Then m can be extended uniquely to an additive measure in' :1?—> F or [0. +00]. Ifm is ?-additive on V. then m is ?-additive on TL.
Some elements of the classical measure theory 37 Example. Let /: R -> F (or R) be a function, and V\ be the semiring of the intervals ]a, b] с R. Define the additive measure m : V\ -> F (or R) by m]a, b] = /(b) - /(a), for ]?,?)]?7?. Then m can be extended to an additive measure m': 1Z -> F (or R) on the ring П = г(Р\) generated by P,. In this example we can replace V\ by the semiring Vi of the intervals [a, b[ с R. 11. Next, we consider extensions of a positive ?-additive measure from a <5-ring to the ?-ring generated by it. THEOREM. Let V be a S-ring, S = ar{V) the ?-ring generated by V and m : V -> [0, +oo] a ?-additive measure. Then m can be extended uniquely to a ?-additive measure m': <S -> [0, +oo]. /fm is finite and bounded on V, then m is finite and bounded on S. 12. The most important step of the extension is that from a ring to a ?-ring. We consider first the uniqueness of such an extension. THEOREM. Let TZ be a ring, S = or(R) the ?-ring generated by 1Z. Let m,n:S -> [0, +oo] be two ?-additive (finite) measures. Ifm = ? on ??, then m=n on S. The above theorem remains valid if we replace S by the <5-ring V = 8r(lZ) generated by П. 13. Finally we consider the extension of a positive, ?-additive measure from ?-ring to a ?-algebra. Let again ?? bearing, S = ог(Я) the ?-ring generated by TZand ? =??A?) = ??E) the ?-algebra generated by Tl or S. Let ?: TZ -> [0. +oo] be a ?-additive measure and ?': S -> [0, +00], the ?-additive extension of ? stated in Theorem in 12. In most cases, ? is finite on 7Z, hence, in this case, the extension ? is unique. If S is not a ?-algebra, we consider a second ?-additive extension ?": ? —> [0, +oo]. Namely, for every set A e ? we define ?"(?) = $\}?{?'(?): BeS, В С A}. The measure ?" is not necessarily the only ?-additive extension of ?' to ?, even if ?' is finite; but ?" is the smallest among all these extensions. 14. Example. Let S— {a, b], a set consisting of two elements, S = {0,{a}} and ? : S ^ [0, oof defined by ?'{?) = 1. The ?-algebra ? =aa(S) is ? = {0. {a}, {b}, S]. For any a e [0, +oo[ the measure ? : ? -> R defined by ?{?) = ?{?] = 1 and v{fo} = a, is a ?- additive extension of ?'. In this example ?"{^} = 0. 15. To distinguish ?" from other ?-additive extensions of ?', we call ?" the canonical extension of ?' (and of ?). In the sequel we shall continue to denote ?' and ?" by ?. We obtained a measure space (S, ?, ?).
38 ?. Pap The idea of the 'outer' measure of a set A is that it should be some kind of upper bound for the possible measure of A. It can happen that this is just the measure of A; but this is likely to be true only for sets with adequately smooth boundaries. 16. DEFINITION. Let S be a set. An outer measure on S is a function ? : V(S) -> [0, oo] such that (a) 4@) = 0, (b) if А С ? then ? (A) iC ?(?), (c) for every sequence (Е„)п€н of subsets of S, we have v({Jll€r:E„) ^ ?^=??(?„)- 17. THEOREM (Caratheodory's method). Let S be a set and ? an outer measure on S. Let ? = {E: ?CX, i)(A) = /)(An?) + f)(A\?)/oreven-ACS). Then ? is ? ?-algebra of subsets of S. If ? : ? -> [0. oo] is defined by ?(?) = ?(?) for ? e ?, then (S, ?, ?) is a measure space. 2.3. Completion of a measure space Let (S, ?, ?) be a measure space. 18. We define first the ?-negligible sets. A set А С S is said to be ?-negligible (shortly negligible) if there is a set В e ? with А С В and ?(?) = 0. A set А С 5" is ?-conegligible (shortly conegligible) if S \ A is ?-negligible. The following properties are immediate: 19. Every subset of a ?-negligible set is ?-negligible. 20. A countable union of ?-negligible sets is ?-negligible. 21. The measure space (?,?,?) is said to be complete if ? contains all the ?- negligible sets. If the measure space (S. ?, ?) is not complete, we can replace it with the complete measure space (S, ?(?). ?), which is obtained in the following way: ?(?) is the ?- algebra generated by ? and all the ?-negligible sets. ?(?) consists of all the sets of the form A = В U N with В e ? and N a ?-negligible set; we can take В and N disjoint. ? (?) consists also of the sets of the form A = B\N with В e ? and N. a ?-negligible set. A set A belongs to ? (?) iff there are sets B.C &? with ? с А с С and ?{0 \ ?) = 0. From the definition of ?(?) it follows that a set А с 5" belongs to ?(?), iff there is a set ? e ? such that the symmetric difference AAB - (A \ ?) U (? \ A) is ?-negligible. Then we extend the measure ? from ? to ?(?) and we denote the extension by the same letter ?. Namely, for every set A = В U N with В e ? and N a ?-negligible set, we set ?(?) = ?(?).
Some elements of the classical measure theory 39 It follows that if N is a ?-negligible set, then ?(?) — 0. The definition of ?( A) depends only on A. The measure space (S, ?(?),?) is called the completion of the measure space (S, ?, ?). The sets of ?(?) are called ?-measurableor simply measurable if it is no doubt with respect to ?. We denote by ?/ (?) the <5-ring of the sets A e ?(?) with ?(?) < oo. The sets of ?/ (?) are called ?-integrable. 22. THEOREM. Let TZbea ring generating the ? -algebra ?. Then for every set A e ?(?) and TZ is a ring generating ?(?). Then TZ is dense in ?(?) for the semidistance p(A, ?) = ?(???),^?, В е ?(?). This, in turn, is used to prove the extension theorem for real-valued or vector-valued ?-additive measures m, absolutely continuous with respect to a finite, positive, ?-additive measure ?. The above theorem is also used to prove that the set of 7^-step functions is dense in the space L' (?) of ?-integrable functions. 2.4. Measures with finite variation Let V be a semiring of subsets of S (in particular, V can be a ring, a ?-ring or a ?-algebra). Let ? be a Banach space and m : V —> ? (or R) a finitely additive measure. We shall define the variation of the vector-valued measure m, although, in this chapter, we shall be concerned especially with real-valued measures. 23. DEFINITION. For every set А с S we define the variation |w|(A) of m on A, by the following equality |w|(A) = sup/]w(A,-)|. where the supremum is taken for all finite families (A, ),e/ of disjoint sets from V contained in A. If V is a ring and A e V, in the above definition we can take the supremum for all finite families (A,),e/ of disjoint sets from V with union equal to A. We say m has finite (respectively, bounded) variation if |w|(A) < oo for every A eV (respectively, \m\(S) < oo). Properties. 24. (K |w |(A) ^+oo, and |w(A)K \m\(A), for A e V. 25. |m|(A) = Oifandonlyifm(B)=0, for every set В e V with В С A. 26. |/я | is finitely additive on V.
40 ?. Pap 27. If m is ? -additive on V, then | m | is ? -additive on V. 28. EXTENSION THEOREM. Let m :TZ-* ? {or Ж) be ? ?-additive measure with finite variation \m\ on a ring TZ and let V — Sr(TZ) be the ?-ring generated by TZ. Then m can be extended to ? ?-additive measure m' :T> —> ? with finite variation \m'\ and \m'\ is the unique ?-additive extension of\m\ from TZ to V. Ifm has bounded variation \m \ on TZ, then m can be extended uniquely to ? ?-additive measure m : ? -> ? (or ?) with bounded variation \m'\ on the ?-algebra ? = ar(TZ) generated by TZ, such that \m'\ is the canonical extension of\m\. 2.5. The variation of real-valued measures on a ring Let TZ be a ring of subsets of S and m : TZ -> ? an additive measure. 29. Ifm ^0, then \m\(A) =m(A). for A eTZ. 30. THEOREM. For every set Ac S we have sup |w(S)| ^ |w|(A) ^ 2 sup |w(B)|. BcA.BtTl BcA.BzTl Ifm is complex valued, we replace 2 by 4. 31. m has finite (respectively, bounded) variation \m\, ifandonlyifw is locally bounded (respectively, bounded) on TZ. We say m is locally bounded on TZ if for every set A eTZ, m is bounded on TZ ? A. 32. Assume m has finite variation \m\. We define the positive part m+ of m and the negative part m~ of m by: m+ = -(|m|+m), m~ = -(\m\— m). Then m+ and m~ are positive, additive measures on TZ. If m is ?-additive, then \m |, m+ and m~ are also ?-additive on 7?. We have w=w+—w~ and \m\= m+ + m~. 33. Every ?-additive, real-valued measure on a <5-ring is locally bounded and has finite variation. Every ?-additive, real-valued measure on a ?-ring is bounded and has bounded variation. 34. THEOREM (Hahn decomposition). Let S be a set, ? a ?-algebra of subsets ofS, and m: ? -> ? ? ?-additive measure. Then m is bounded and there is a set ? e ? such that m(F) ^0, whenever F e ? and F с Р,
Some elements of the classical measure theory 41 m(F) ^ 0, whenever F e ? and F П ? = 0. 35. PROPOSITION (Jordan decomposition). Let S be a set, ? ? ?-algebra of subsets ofS, and m: ? —» ? a real-valued measure. Then m can be expressed as the difference of two finite measures with domain ?. 36. Let m : V -> ? be a ?-additive measure with finite variation \m\ on a <5-ring P. Let 5 = ar{V) be the ?-ring generated by V and ?1 = ??(?>) the ?-algebra generated by V. Consider the ?-additive, positive measures \m\, m+, m~ on V. They have canonical extensions to ?1, denoted by the same letters. We still have |w| =w++w~. 2.6. Measurability with respect to a positive measure Let E, ?, ?) be a measure space and ?(?) the completion of ? with respect to ?. 37. We say that a property P(s) defined for every s e S is true ?-almost everywhere (?-a.e.) if the set of points s e S for which P(s) is false is ?-negligible. 38. A function / : S -> ? is said to be ?-negligible is /(s) = 0, ?-a.e. 39. A function /: S -> ? is called ?-measurable, if it is ? (?)-measurable, i.e., if for every Borel set В с! we have f~](B) e ?(?). The i7^)-step functions are also called ?-measurable step functions. 40. A function /: S -> ? is ?-measurable iff there is a .^-measurable function g:S^> ? such that f = g, ?-a.e. If /| = /?, ?-a.e. and if f\ is ?-measurable, then /2 is ?- measurable. It follows that / is ?-measurable iff there is a sequence (/n)„eN of iT-step functions such that /„ -> /, ?-a.e. 41. We extend the ?-measurability for functions defined ?-a.e. An ?-valued function defined ?-a.e. on S is said to the ?-measurable, if it has a ?-measurable extension g : S -> ? on the whole space S. Then any extension of / to S is ?-measurable. 42. PROPOSITION. Le? (S, ?, ?) foe ? measurable space, and D a subset of S. If ?? is the subspace ?-algebra of subsets of D, then for any function f: D —> ? the following assertions are equivalent, in the sense that if one of them is true so are all the others: (a) {x: f\x) < a] e ?D for even,· a e E; (b) {x: f(x) ^ a] e ?? far every a el; (c) {*: f(x) > a] & ?? for every a e E; (d) {jc: /U)^a}e ?? for every a e E. DEHNITION. Let Z) be a subset of S. A function /: D -> ? is called measurable (or ?-measurable) if it satisfies any of the conditions (a)-(d) of the preceding proposition.
42 ?. Pap 43. THEOREM. Let f and g be real-valued functions defined on domains Dom/, Dom g с S. (a) If f is constant it is measurable. (b) If f and g are measurable, so are f + g, f ¦ g, f/g, where (/ + g)(x) = f(x) + g(x), (/ · g)(x) = f{x) ¦ g(x), (f/g)(x) = f(x)/g(x)forx e Dom /ODomg and additionally g(x) ? Ofor f/g. (c) If f is measurable and eel, then cf is measurable, where (cf)(x) — с ¦ f(x)for ? e Dom/. (d) If f is measurable and ? Cl is a Borel set, then there is an F e ? such that f~\E) = {x: f(x)e E) is equal to FODom/. (e) If f is measurable and h is a Borel measurable function from a subset Dom/? of Ж to R, then hf is measurable, where (hf)(x) = h(f(x))for ? e Dom(/?/) = {v: у e Dom/, /(у) е Dom/?}. (f) If f is measurable and A is any set, then /|д is measurable, where Оот(/|д) = А ПDom/ and (f\A)(x) = f(x)forx e А П Dom/. 44. The main point of the theory of measure and integration, since Lebesgue, is that we can deal with limits of sequences of functions, and the set on which lim„^oc fn(x) exists can be irregular, even for a well-behaved functions /„, see Fremlin B000a, Vol. 1). THEOREM. Let (/„)„eN be a sequence of ?-measurable real-valued functions with domains contained in S. (a) Define a function lim,,^^ /, by (lim,,^^ fn){x) = \\mn^oc fn(x) for all those ? e Ц[екПт>иО°т/и for which the limit exists in R. Then lim,!en /, is ?- measurable. (b) Define a function supneN /, by (sup„et:/»)(·*) = sup„et;/»(-*) for all those ? e P|neNDom/„ for which the supremum exists in R. Then sup„et;/« '¦* ?- measurable. (c) Define a function inf„eK /„ by writing (inf„e;; /„)(*) = inf„ei>; fn(x) for all those ? e P|,!eNDom/„ for which the infimum exists in R. Then inf„ef; fn is ?- measurable. (d) Define a function limsup,,^^/,, by (limsup,,^^ fn)(x) = Hmsup,,^^ fn(x) for all those ? e U„eN Пт>„ Dom/,,, for which the limsup exists in R. Then limsup,!e^/„ is ?-measurable. (e) Define a function liminf,,^^ /„ by (liminf,,^^ f„)(x) = liminf,,^^ fn(x) for all those ? & UneN Пт>и Dom /„, for which the lim inf exists in R. Then lim inf„e^ /, is ?-measurable. PROPOSITION. Let f, g and /„, for ? e N, be ?-measurable real-valued functions whose domains belong to ?. Then all the functions f + g, f-g, f/g< sup/,, inf/,, lim /„, limsup/,, lim inf/„ ,,6f: net·. n — -x. n^-oc и-»зс have domains in ?. If h is a Borel measurable real-valued function defined on a Borel subset of Ш, then Dom/?/ e ?.
Some elements of the classical measure theory 43 45. For two ?-measurable functions / and g we write / ~ g if / = g a.e. Then ~ is an equivalence relation in the set of all ?-measurable functions ?° and we shall write L , or L°(m), for the set of equivalence classes. We denote by the same symbol / the equivalence class corresponding to the measurable function /. The set L° is a vector space with respect to the naturally transferred addition and multiplication with constant of functions. Let / be a measurable real-valued function defined almost everywhere on Er. Then there is a sequence (/n)„eN of continuous functions converging to / almost everywhere. This is a version of Lusin 's theorem. 46. DEnNlTlON. Let (S, ?, ?) be a measure space. (a) ?, or (S, ?, ?), is strictly localizable or decomposable if there is a disjoint family (Xi)iel of measurable sets of finite measure such that S = U/e/ ^' ??(^ ? = {E: ? с X, ? ? ?,¦ е ?, for every i e /}, ?(?)=^?(???,), for every E e ?. ;e/ Such a family (X/),e/ is called a decomposition of 5. (b) ?, or (S, ?, ?), is semifinite if whenever ? e ? and ?(?) = oo there is an F с Е such that Fel and 0 < ?^) < oo. (c) ?, or E, ?, ?), is localizable or Maharam if it is semifinite and, for every fcr, there is an // e ?1 such that (i) E\H is negligible for every ? eJ7, (ii) if G e ? and ? \ G is negligible for every F e JF, then // \ G is negligible. We call such a set // an essential supremum of .F in ?. (d) ?, or (S, ?, ?), is locally determined if it is semifinite and ? = {?: F с 5, EOF e ? whenever F e ? and m(F) < oo}, i.e., for any F e ?E) \ ? there is an F e ? such that ?^) < oo and F ? F ? X\ (e) A set F e ?1 is an atom for ? if ?^) > 0 and whenever F e ?, F ? ? one of F, F \ F is negligible, ?, or E, ?1, ?), is atomless if there are no atoms for ?. (f) ?, or (?, ?, ?), is purely atomic if whenever ?еГ and F is not negligible there is an atom for ? included in F. 2.7. Examples of measures 47. 77?e counting measure. Let 5 be an uncountable set, e.g., S = Ш, and let ? to be counting measure on S. Then ? is ?-additive, complete, it is not ?-finite, it is not a probability measure nor totally finite, it is strictly localizable, and ? is purely atomic. Obviously, ? is not atomless. 48. The countable-cocountable measure. Let S be any set. Let ? be the family of those sets ?CS such that either F or S \ ? is countable. Then ? is a ? -algebra of subsets
44 ?. Pap of S. ? is called the countable-cocountable ?-algebra of S. Let ? : ? -> {0, 1} be defined by ?(?) = 0 if ? is countable, ?(?) = 1 if ? is not countable. Then ? is a ?-additive measure, ? is called the countable-cocountable measure on S. If S is any uncountable set and ? is the countable-cocountable measure on S, then ? is a complete, purely atomic probability measure. 49. The Stieltjes measure. Let ? e R and / an interval, closed to the left, of the form [a, b[ with a < b sC +00 or of the form [a, b], with a < b < +00. We denote by V[I] the semiring of the subintervals of/, of the form [а, r],or]s, t] with ? < s. The ring generated by V[I] consists of finite, disjoint unions of sets from V[I]. The semiring V[I] generates the Borel ?-algebra ??(/). DEnNlTlON (Variation of a function). Let / be a real-valued function and D a subset of E. VarD(/), the variation of f on D, is defined by: if D ODom/ = 0, VarD(/) = 0, and otherwise, Varo(/) is sup|^|/(a,) - /(a,_|)|: a0,a\,... ,an e DnDom/. a() ^<3| ^ · · · ^ an \, allowing VarD(/) = 00. If VarD(/) is finite, we say that / is of bounded variation on D. We write also Var / for VarD0m f(f). The basic properties of the variation are collected in the following (see, e.g., Fremlin B000b), Vol. 2). Proposition. (a) If f, g are two real-valued functions then VarD(/ + g) <: Var0(/) + VarD(S). (b) If с e ? then Varp(c/) = |c| VarD(/). (c) Ifx e ? then Var0(/) ^ Varoni-ac.A-ii/) +??·?)?[?.?[(/), with equality ifx e D ? Dom /. (d) //?> cD'cl г/геи VarD(/) ^ Var/r(/). (e) |/(jc)-/()')I ^ VarD(/)/ora//.v, ? e DODom/. Consequently if f is ofbounded variation on D then f is bounded on D ? Dom / a/id (if D ?? Dom / ^ 0) sup |/(v)| si |/(.v)|+Var/>(/) veDnDom/ /or ever}' ? e D ? Dom /.
Some elements of the classical measure theory 45 (f) If f is monotone and Del meets Dom /, then VarD(/) = sup /(*)- inf f(x). .teDnDom/ .veDnDom/ THEOREM. The following are equivalent: (a) 77?ere are fwo bounded non-decreasing functions f\, /2: ? -> R skc/? ?/??? / = /? - /2 оя ?) ? Dom/; (b) / is of bounded variation on D; (c) There are bounded non-decreasing functions f\, /2 : R -> R smc/? rftaf / = /? — /2 ои ?) П Dom / a/id" VarD(/) = Var/, + Var/2. Let g : I -> R be a function. We assume g is r/'g/?i continuous at every point ? ^ a and that g has finite variation Vary (g) on every bounded interval J. The variation function of g is the function \g\:I -> R+ defined by |g|@ = Var[a.,j(g),forf e /.Then |g| is increasing and right continuous at every point t ? a. To the function g we associate the finitely additive measure mg : V[I] -> R+ defined by mg[a,t]=g(t) - g(a) and mg]s,t]=g(t)-g(s), ifa<s. The measure mg is positive if and only if g is increasing. Then to the variation function |g| we associate the positive measure m\g\, and we have \mg(A)\^m{g{(A), for A e ИЛ· Since |g| is right continuous at every t ? a, the measure m\g\ is ?-additive on V[I] and can be extended uniquely to a ?-additive, positive, finite measure on the ring TZ(I), still denoted by m\g\. From the above inequality, it follows that тц is ?-additive and with finite variation \mg\ ?; m\g\ on V[I], hence mg can be extended to a ?-additive measure on the ring7?(/), still denoted by mg, and we still have \mg\(A)^mlg](A), for A eTZ(I). In fact, we have the equality \mg\ =m\g\. The positive measure m\g\ can be extended uniquely to a ?-additive measure ?: ??(/) -> [0,+oo]. Moreover, ? is finite on the <5-ring T>(I) of bounded subsets of /. It follows that the measure mg can be extended to a ?-additive measure m :T>(I) -> R with finite variation \m\ = ?. If g has bounded variation /, then |g| is bounded on /, the measure m\g\ is bounded on TZ(I) and its extension ? is bounded on ??(/). It follows then that mg can be extended to
46 ?. Pap a ?-additive, finite measure m : B(I) -> R on the Borel ?-algebra, with bounded variation \m\ = ?. We shall continue to denote ? by /И|<,| and m by m4. Then we still have \mg\(A)=mlgl(A), forAeP(Z), or for A e ??(/), if g has bounded variation. The ?-additive measure ms on D(/) or B(I) is called the Stieltjes measure on / associated to the function g. 50. The Lebesgue measure. The Lebesgue measure on / is the measure mg corresponding to the continuous, increasing function g(s) = s, for s e /. In this case we have mg]s, t] = mg[s, t] = mg[s, t[=m4]s, t[= t — s for every s < t in /. 51. Let —oo ^ a < +oo and consider an interval /, open to the left, of the form ]a, b] with a < b ? +oo or of the form ]a,b[, with a < b < +oo. In particular we can have / = ]-oo, +oo[. This time we denote by V[I] the semiring of the subintervals of /, of the form ]s, t] with a < s. We still denote by Щ1) the ring generated by V[I]. The semiring V[I] generates the Borel ?-algebra B(I). Let g: / —> R be a function. We assume g is right continuous at every point t e / and that g has finite variation Var/(g) on every interval ]a,t] С /. The variation function of g is the function \g\:I -> R+ defined by \g\(t) = Varj(,.,j(g),forr e /.Then |g| is increasing and right continuous at every point t ??. To the function g we associate the finitely additive measure m4: V[I] -> R defined by mg]s,t]=g(t) - g(s). The measure m? is positive if and only if g is increasing. Then to the variation function |g| we associate the positive measure т\ц\. As in the previous case, the measure т\я\ is ?-additive, mg is ?-additive, with finite variation |/я?| = m\g\. We can extend m\g\ to a ?-additive ?:??(?) -> [0.+oo] and then we can extend mg to a ?-additive measure m : T>(I) -> R with finite variation |w | = ?. If g has bounded variation on /, then mg can be extended to a ?-additive measure m : B(I) -^Ron the Borel ?-algebra, with bounded variation \m \ = ?. As in the previous case, we continue to denote ? by W|?| and m by mg. The ?-additive measure mg on T>(I) or B(I) is called the Stieltjes measure on / associated to the function g. 52. The Lebesgue measure on / is the positive measure associated to the increasing, continuous function g(s) = s, for s e /. In this case, we still have mg]s, t] =mg[s, t] = m?[i, t[ =mg]s. t[ = t—s. 53. Remark. If / is an interval closed to the left or open to the left and if g : I -> R is a function with finite variation, we can consider the right continuous function g+: I —> R defined by g+(a) = g(a), if ? e / and g+(t) = g(t+), if a < t, and proceed as above.
Some elements of the classical measure theory 47 54. As a consequence of the Monotone Class Theorem we have the following result: if ?, ? are two measures on B(W), where r ^ 1, both defined, and agreeing, on all intervals of the form ]-oo,a] = {x: ? ??} = {(?\,...,??): ?? ????, for every/ ^r), for ? = (? ?,... ,ar) eW, ап<1д(Кг) < oo, then ? and ? are equal on all theBorel subsets ofRr. 55. THEOREM (Image measures). Let (S, ?,?) be a measure space, ? any set, and ?~] : S -> ? a function. Set T={F:Fcy,i)-|(F)ei|, v(F) =m(<p~'(F)), foreveryFeT. Then (Y, T, v) is a measure space, ? is called the image measure ??~'. 2.8. Lebesgue measure on W One of the most important examples of measure is the Lebesgue measure ?' on the Euclidean space W for some r e N. Therefore we consider in more details this measure, with a few of its basic properties. Let a = (? ?,..., ar), b = (fi\,..., ?,-), ? = (?\,..., ?,-) e W. Half-open interval in W is a set of the form [a, b[= {?: ?, ^ ?, < ?,, for every / $5 r), where a, be W. If / = [a, b[ с Rr, then either / = 0 or a, = inf{?,-: ? el}, ?,¦ = sup{|,: ? el], for every / ^ r, so that the expression of / as a half-open interval is unique. 56. DEHNITION. The r-dimensional volume ??(?) of a half-open interval / is defined by /· ??@)=?, br{[a,b[) = Y\(fr-a,), ;=i if ?, < ?/ for every /'. 57. LEMMA. If I cKr is a half-open interval and (//)yeN " a sequence of half-open intervals covering I, then ОС ?'?/???'?/;). 7=0
48 ?. Pap For the proof see Fremlin B000a, Vol. 1, Lemma 115B). 58. DEFINITION. We define ?: V(Rr) -> [0. oo] by ?(?) =mf{^2kr(Ij): (Ij)jef; of half-open intervals with А с [J I} 4=0 yef. Every set A can be covered by some sequence of half-open intervals, e.g., А с и„ем[-п'п[' where n=(n,n, и) е R''. We have a non-empty set A that ?(?) is always defined in [0, oo]. This function ? is called Lebesgue outer measure on R''. PROPOSITION, ? is an outer measure on W such that ?(?) = ?'"(/) for every half-open interval I cRr. 59. Using the Caratheodory's method we construct by Lebesgue outer measure a measure, denoted by kr, which is called Lebesgue measure on Rr. The sets ? for which ??(?) is defined, i.e., for which ?(???) + ?(?\?) = ? (A) for every А с R', are called Lebesgue measurable. Sets which are negligible for ?' are called Lebesgue negligible (these are just the sets A for which ?(?) = 0). Suppose that t e W. Then ?(? + ?) — ?{ A) for every A cR', where A + t = {x+t: ? e A}. If ? с W is measurable so is ? + r, and ?'(? +t) = У (Е). LetoO. Then/j(cA)=c/j( A) for every А с R', where cA = {c.v: ? e A}. If ? cR' is measurable so is cE, and ?' (с?) = crkr{E). Example. All Borel subsets of R' are Lebesgue measurable; in particular, all open sets, open intervals }a, b[, closed intervals [a,b], together with countable unions of them. We have for Lebesgue measure ??: /' кг(]а,Ь[)=кг([а,Ь]) = 1\№-сц). i=\ whenever a ^ b in W. Therefore every countable subset of W is measurable and of zero measure. ?? is complete, ?-finite and therefore is strictly localizable, localizable, locally determined and semifinite. ?'' is atomless. Remark that for kr as the restriction of Lebesgue measure to В the measure space (R, ?, ?') is atomless, ?-finite and not complete. 60. If S is R'\ and ? is its Borel ?-algebra B(W), a S(R')-measurable function is called Borel measurable. If S is R'\ and ? is the ?-algebra of Lebesgue measurable sets, a ?-measurable function is called Lebesgue measurable.
Some elements of the classical measure theory· 49 PROPOSITION. Let S be Rr for some r ^ 1, D a subset of S, and f: D -> R a function. If f is Borel measurable it is Lebesgue measurable. If f is continuous it is Borel measurable. Particularly, if r = 1 and f is monotonic it is Borel measurable. Let / be ?1-measurable real-valued function whose domain belong to ?. If h is a Borel measurable real-valued function defined on a Borel subset of R, then Dom h о f e ?. PROPOSITION. Let S be a set and ? ? ? -algebra of subsets of S. Let f\, ¦¦¦, /,· be measurable functions defined on subsets of S. Set D = C\i<r Dom f and for ? e D set f(x) = (Mx),...,fr(x))eW.Then (a) for any Borel set ? С Rr, there is an F e ? such that /~' (?) = ?) ? F; (b) // h is a Borel measurable function from a subset Dom h of Rr to R, then the composition h о f is measurable. 61. Not every subset of Rr is Lebesgue measurable. The construction of an example appeal to the Axiom of Choice, see Fremlin B000a, Vol. 1). It is in fact the case that very large parts of measure theory can be worked out without appealing to the full strength of the Axiom of Choice. The significance of this is that it suggests the possibility that there might be a consistent mathematical system in which enough of the Axiom of Choice is valid to make measure theory possible, without having enough to construct a non-Lebesgue- measurable set. Such a system has indeed been worked out by R.M. Solovay A970), see Fremlin B000a, Vol. 5). We remark that while we need a fairly strong form of the Axiom of Choice to construct a non-Lebesgue-measurable set, a non-Borel set can be constructed without using any form of the Axiom of Choice (Fremlin, 2000b, Vol. 4). 62. Proposition. (a) If А с R' is any set, then for Lebesgue outer measure ? ?(?) = mf{k'(G): G is open, G 3 A} = min{kr(H): ? is Borel, НЭ A}. (b) If ? с Rr is measurable, then ??(?) =sup{k' (F): F is closed and bounded, FC?), and there are Borel sets H\, № such that H\ с ? с //2 and kr(H2 \ //|) = kr(H2 \E)= ?' (? \ //|) = 0. (c) If А с W is any set, then A has a measurable envelope, i.e., a set ? e ? such that А с ? and k(F ? ?) = /j(F ? A) for all F e ?, which is a Borel set. (d) If f is a Lebesgue measurable real-valued function defined on a subset ofR', then there is a conegligible Borel set ? с R'~ such that /|я is Borel measurable.
50 ?. Pap 63. Examples, (a) Cantor set. The Cantor set С с [0, 1] is obtained by removing from [0, 1] first its middle third ]\., |[, then from the remaining two intervals [0. j] and [|, 1] again their open middle thirds ]g, f [ and ]? |[, respectively, and so on. We obtain a sequence (C„)»eN of sets, Co = [0, \],C\ = [0. ^] U [^, 1] C„ consists of 2" disjoint closed intervals each of length 3~". We haveX(C„) = B/3)" for each n. The Cantor set is C = n«eNC»-Then ?@= lim k(Cn)= lim ( - ) =0. n—*oc n—>зс\ 3/ We remark that С is uncountable. С itself is the set of numbers expressible as ??=\ ^'cj^ where every cj is either 0 or 2. (b) The Cantor function. We define a sequence (/„)„et: of functions /„ : [0, 1] -> [0, 1] by /»W = (j) Л(С„П[0,лг]), for each ? e [0. 1]. Since C„ is a finite union of intervals, /„ is a polygonal function, with /„ @) = 0, /„ A) = 1; /„ is constant on each of the 2" - 1 open intervals comprising [0, 1] \ C„, and rises with slope C/2)" on each of the 2" closed intervals comprising C„. We have \f„+\(x) - fn(x)\ ? \2~n for every ? e ?, ? e [0, 1]. Therefore (/„)„eN is uniformly convergent to a function /: [0, 1 ] -> [0, 1 ], which is continuous. / is the Cantor function or Devil's Staircase. Since every /„ is non-decreasing, so is / and the derivative f\x) exists and is 0 for every ? e [0, 1] \ C; so /' is zero almost everywhere on [0, 1]. /: [0, 1] -> [0, 1] is surjective and we have /(C) = [0, 1]. The Cantor function can be used to prove that the composition of a measurable function with a continuous function need not be measurable. 3. Integration There are numerous ways to construct the classical integral. Each approach has its advantages and disadvantages. Some of them are more laborious than others. We present first the "immediate integral", of bounded, real-valued functions with respect to a bounded, real-valued, additive measure m on a ring ??. This integral is used in the Riesz representation theorem and in the integral representation of the dual of 1°°(?). For the classical integral we shall sketch an approach as it is presented in W Rudin A973), which seems to be simpler and to lead faster to the definition of the integral. For the proof of some of the theorems the reader is referred to Halmos A950), Fremlin B000a, Vol. 1, 2000b, Vol. 2), Rudin A973). The framework for this section is a measure space (?,?,?).
Some elements of the classical measure theory 51 3.1. The immediate integral Let 1Z be a ring of subsets of S and SAZ) the set of 7^-step functions ? f = 2_] XAjXi, with A, e 7? and jc/ e E. i=\ We can always assume that the sets A, are disjoint, and then |/| = ?'!=\ ??, ?*? ?- 64. Definition. A function /: S -> ? is said to be totally 7^-measurable, if it vanishes outside a set A e 7? and if there is a sequence (/„)„ек from S(TZ) converging uniformly to/. If S e 7?, the condition that / vanishes outside a set of TZ is superfluous. The set of totally 7^-measurable functions f.S -> ? is denoted by TMAZ). It is a vector space consisting of bounded functions. If for every / e TMAZ) we set ||/||SUp = supveS |/Cs)|, then ll/Hsup is a norm on TM(TZ), defining the topology of uniform convergence. The space S(R) is dense in TM{K) for this topology. If ? is a ?-algebra, then ??{?) is the space of real-valued, bounded, .?7-measurable functions /: S -> E. Consider now a real-valued, additive measure m : 7? -> ? with finite variation |w |. This means that m is locally bounded on ??, i.e., for each set A eTZ, m is bounded on the ring 7гпА. Proposition. ?/??=\ ??,?; = 0, with A, e TZ and ?, e E, г/геи ?"=l ш(Аг).хг = 0. Corollary. ?/?'!=\ ??,?\ = ?)=\ Хв,У] with Аг, В} е ?? andxi,ys е Е, г/геи ? к ]Г /я (Аг) хг¦ = ]Г т (В j¦) у> · ;=? у=| This corollary leads to the definition of the integral of step functions. 65. Dehnition. For every 7?-step function / = ?'!=\ Хл,х* with A, e К and *,· е Е, we define the integral J f dm by the equality / Sdm = Y^m(Ai)Xi. J i=\ The definition of the integral is independent of the particular representation of / as a step function. Since the variation \m\ is additive on ??, we can define the integral
52 ?. Pap f fd\m\ with respect to \m\. If / = X]"=, XA,Xi with A, e ? disjoint and jc, e R, then I/1 = ?"=? ??, \xi\, hence У |/|^|m| = ^|m|(A,)|.r, i = l We have the following immediate properties of the integral of x-step functions: 66. f(f+g)dm=ffdm+fgdm. 67. / cf dm =cf f dm, for с el. 68. U fdmU J \f\d\m\^\\f\\sup\m\{{f?0}). 69. If (fn)neN is a sequence of functions from <S(X), vanishing outside a set A e ? and converging uniformly to a function / e ???(?). then the sequence (/ /„ dm)n€n is Cauchy in R. In fact, \f /„dm - J fhdm\ <: I \f„ - fk\d\m\ ? ||/„ - /*||5UpM(A). If (gn)neN is another sequence from 5(x) vanishing outside A and converging uniformly to /, then lim / /„ dm = lim / gn dm. n-»oc J n^ac J This property leads to the definition of the integral of totally measurable functions: 70. DEnNlTlON. For every function / e ???(?) we define the integral f f dm by the equality // dm = lim / /„ dm. n-*xj where (/„)neN is any sequence from «S(x) vanishing outside a set A e X and converging uniformly to /. From property 69 we deduce that the integral / /„ dm does not depend on the sequence (/n)neN- Properties 66, 67 and 68 remain valid for functions from ???(?). We have the following representation of the dual of TM{TZ), in terms of the immediate integral.
Some elements of the classical measure theory 53 71. THEOREM. There is an isometric isomorphism L <-> m between the continuous linear junctionals L e TM(TZ)* and the measures m e ba{1Z) = the space of real-valued, bounded, additive measures m : ?? —* R, endowed with the variation norm \\m\\ = \m\(S). The correspondence L +> m is given by and we have \\L\\ — ||w||. The measure m corresponding to L is defined by m(A) = L(xa), for A &1Z. 3.2. Integral of positive step functions 72. For a positive, ?-measurable step function / : S -> R+ of the form / = ]P ??,?-?, with A, e ?(?) and jc, e R+ we define unambiguously the integral / / ??. by the equality ? / i=l with the usual convention 0 · oo = 0. The definition of the integral / / ?? is independent of the particular representation of / as a step function. The integral of positive, ?-measurable step functions has the following properties: 73. 0^ffd?^+oo. 74. ?(/ + 8)?? = ?/?? + ?8??. 75. / cfd? = cf fd^, if с ^ 0. 76. H0sifsig,then0<:ffd?<:fgd?. 77. If / is a positive ?-measurable step function and A e ?(?), then /хд is a positive ?-measurable step function. We denote //^?=//??^?, ??????(?).
54 ?. Pap 78. If / is a positive, ?-step function, the indefinite integral v(A) = fAf ?? defined for A e ?(?) is ?-additive. 3.3. Integral of positive functions 79. We define now the integral f f ?? for positive, ?-measurable functions /: S [0, +00], by the following equality: /f ?? = sup / $??. where the supremum is taken for all positive, finite, iT-step functions s with 0 ^ s ? f. If / is a positive, ?-step function, the integral defined above is equal to the integral defined in the preceding section. In the above definition we can take the supremum for all positive, finite ?(?)-$??? functions. In fact, for each positive, finite ?(?)-$??? function s there is a positive ?-step functions' such that s' ? s everywhere and s =s', ?-a.e., hence f ??? = f s' ??. The integral of positive, ?-measurable functions, with finite or infinite values, has the following properties. The additivity of the integral has to be postponed until after the monotone convergence theorem is given. All functions considered below are positive, ?- measurable functions, with values in [0, 00]. 80. 0<: f /?????. 81. $cf ?? = ?$???, ifc^O. 82. If / ^g, ?-a.e., then// ?? ? f %??. If f = g, ?-a.e., then f /?? = f gd^. 83. If/^O, then/ /??=0 iff / = 0, ?-a.e. 84. If / > 0 and / f ?? < oo, then f < 00, ?-a.e, and the set {/ ? 0} is contained in a countable union of sets of finite measure. 85. If / ^ 0 and A e ?(?), we denote / ???= / ?????. Then ? /?? = sup / s??, JA O^s^fJA where the supremum is taken for all iT-step functions s with 0 ^ s ? f.
Some elements of the classical measure theory 55 86. If / ^ 0, the set function ?: ?(?) -> [0, +oo] defined by v(A) = f J A f ??, for A e ?(?), A is ? -additive. 87. MONOTONE CONVERGENCE THEOREM. If(fn),i€K is an increasing sequence of ?- measurable, positive functions f„: S —> [0, +00]. ?/геи /lim /„ ??? = lim / /„ ?? ?—>oc ?—*·? J or, equivalent!}/ / sup /„ <*? = sup / /„<*?. 88. ?/ + 8)?? = ?/?? + ????. 89. If O^g ^ /, and if $ f ?? < oo, then /(/ -g)<*M = J f ?? - $ ???. This property, is used in the proof of Lebesgue's theorem for the Bochner integral. 90. If (/,,)„eN is a sequence of positive, ?-measurable functions /„ : S -> [0. +00], then /OO ОС - ????? = ? ????. П= I !!= I 91. FATOU'S LEMMA. If (fn)„€N is a sequence of positive, ?-measurable functions f,:S -> [0, +00], ?/геи /lim inf/„ ?? ^ lim inf / /„ ??. л-»оо /г —> эс у We extend the definition of the integral j f ?? for R+-valued, ?-measurable functions / defined ?-a.e. on S. For any two extensions g\, gi: 5 -> K+ of / we have g\ = gz, ?- a.e., hence f g\ ?? = §g-???. We define then unambiguously f f?? = Jg??, where g: 5 -> R+ is any extension of /. The above properties remain valid for positive, ?- measurable functions defined ?-a.e. on S. 3.4. The classical integral At this stage we can define the classical integrability of numerical functions. This definition of the integral does not use the Lebesgue's dominated convergence theorem, or the density of the step functions in ?' (?).
56 ?. Pap Remark. An alternative way to define the classical integral is to wait until we define the Bochner integral of vector-valued functions / :S -> F, for a Banach space F, and deduce the classical integral as a particular case for F = R. But in order to define the Bochner integral we have to define first the space ?^ (?) of Bochner ?-integrable functions f :S —> F, equipped with a seminorm ||/||| = /\/\??, then prove the Lebesgue dominated convergence theorem and the density of the step functions in ?^(?), and then extend by continuity the integral f f ?? from the space of step functions to the whole space ?^(?). This will be done in the paragraph devoted to the Bochner integral in Dinculeanu B002) (Chapter 8 in this Handbook). We define first the integrability. 92. Dehnition. An R-valued function defined ?-a.e. on S is said to be ?-integrable if it is ?-measurable and f \f\ ?? < oo. From / l/l ?? < oo we deduce that |/| < oo, ?-a.e., hence, a ?-integrable function / is finite, ?-a.e. If / is ?-integrable, there is a function g: S -> E, defined everywhere and finite everywhere such that f = g, ?-a.e. Any two such functions g\, gi: S -> ? are equal ?-a.e. Example. If ? is the counting measure on N then a function /: N -> E, i.e., a sequence (f(n))n€N, is ?-integrable iff it is absolutely summable, and [ /?? = f /(?)??(?) = ?/(?). J Jn „=o (?, ?>(?),?) is invariant under permutations, i.e., ?(?(?)) = ?(?) for every А с N and every bijection ? : N -> N. Therefore every definition of integration which depends only on the structure (N, V(N), ?) have to be invariant under permutations, i.e., j /(?(?))??(?) = j /(?)??(?), for every integrable function / and every permutation ?. Consequently, a series such that ??=? /(?(?)) = ??=? /(") e K for апУ permutation ? have to be absolutely summable. Therefore for an integral on an abstract measure space (S, ?, ?) depending only on ? and ?, is obligatory to be an absolute integral. 93. For each ?-integrable function / we denote ||/|| ? = / |/| ??. 94. DEFINITION. We denote by ?'(?). or ?'(.?, ?), or C](S, ?,?), the set of real valued functions /: S -> ? which are ?-integrable. It follows that if / e ?' (?), then |/| e ?' (?), and then /+, /" e ?' (?).
Some elements of the classical measure theory 57 ?' (?) is a vector space and ||/|| | is a seminorm on ?' (?). The topology defined on ?' (?) by the seminorm ||/|| | is called the topology of the convergence in the mean. To say that /,, -> / in the mean, means that |/e-/Hi-*0, i.e., f\fa-f\dii^Q. 95. To define the integral of a ?-integrable function / from ?'(м), we remark that / can be written as a difference of two positive, ?-integrable functions, for example, / = /+-/-¦ If / = /? — fi = g\ — gi with /|, /2, g\, g2, positive, ?-integrable functions from ?'(?), then /, + g2 = /2 + g\. We have / ?\?? + / gid[i= / /2?? + / ?\??. hence / /? ??. - \ /2?? = / g\ ?? - / ^2^?- We can now define unambiguously, the classical integral j f ?? by the equality / f??= / ?\??- / /2??, where f = f\ — f2 with f\, /2 positive ?-integrable functions. Then J ??? = f f+??- J ???. The above definition of the integral / / ?? is valid for functions / defined ?-a.e. on S and finite ?-a.e.; in particular for functions of ?' (?). We remark that if / ^ 0, then/ = / — 0, and the new definition of the integral coincides with that of positive functions defined in Section 3.2. The integral of ?-integrable functions has the following properties: 96. ?/ + 8)?? = ?/?? + /8??. 97. /?/?? = ??/??,{???&?.. 98. If/^0, ?-a.e., then// ??^?. 99. If / ? g, ?-a.e., then j f ?? ^ f gd?. If / = g, ?-a.e., then j f ?? = jgd^.
58 ?. Pap 100. If / is ?-integrable, then \ j f ??\^ j \f\??=\\f\\\. From this property, we deduce that the integral is continuous for the topology of the convergence in the mean: if /„ -> / in the mean, then / /„ ?? -> f f ??. 101. If / is ?-integrable and A e ?(?), then f ?? is ?-integrable. We denote / f ?? = / ?????. 102. If / is ?-integrable, then the set function ?.?(?) -> ? defined by v(A) = JA f ?? for A e ?(?), is ?-additive. For ?-integrable functions, the monotone convergence theorem and the Fatou lemma have a special form. 103. The monotone convergence theorem for ?-integrable functions. If (/H)neN is an increasing sequence of ?-integrable functions and if sup / \f„ \ ?? < oo, then the function sup /„ is ?-integrable (hence finite ?-a.e.) and we have /„ -> sup,, f„ in the mean and / sup fn ?? = sup / ?„??. 104. Fatou's lemma for ?-integrable functions. lf(fn)„eN is a sequence of ?-integrable functions and if liminf,,^^ f \^\?? < oo, then the function liminf,,^^//! is ?- integrable and we have /lim inf/„ ?? ^ lim inf / /„ ??. ? —»ос ? —»эс J 105. Countable convexity. If (/„),ген is a sequence of ?-integrable functions and if ??=\ II/"II? < °°' men the series Y^=] fn is absolutely convergent ?-a.e., its sum ?-integrable and we have is ?/- я=1 I n=l 106. LEBESGUE'S DOMINATED CONVERGENCE THEOREM. Let (/я)яеР? be a sequence of ?-integrable functions, g a positive, ?-integrable function ana f : S —> Ш a function. Assume that, (a) fn -> /, ?-a.e. (b) |/h| ^ g, ?-a.e., for each n. Then f is ?-integrable, /„ —> / in the mean ana \fnd^\ /??. For the proof, we apply Fatou's lemma to the sequences of positive functions (|/„ /I + 2g)„eN and Bg - |/„ - /|)„6N·
Some elements of the classical measure theory 59 107. The set S(Ef) of finite ?1/--step functions is dense in ?' (?). We use the fact that for every function / e ?' (?) there is a sequence (/n)neN of finite .?/-step functions such that /„ -> / pointwise and |/„ | ^ |/|, for each n; we apply then Lebesgue's theorem. 108. The space ?' (?) is complete. More precisely: If (/„)„eN is a Cauchy sequence in ?' (?), then there is a function / e ?' (?) and a subsequence (/,uheN such that /„ -> / in the mean and f„k -> /, ?-a.e. and in the mean. From property 108 one can deduce the following properties: 109. For each function / e ?' (?) there is a sequence (/„) of IT/-step functions such that /,, -> /, ?-a.e. and in the mean. 110. If (/„)„6N is a sequence from ?'(?) such that /„ -> / in ?'(?) and /„ -> g, ?-a.e., then f = g, ?-a.e. 111. / e ?' (?) iff there is a sequence (f„)n€n of IT/ -step functions which is Cauchy in the mean and such that /„ -> /, ?-a.e. 112. Remark. The statement of property 11 lean be taken as a definition of integrability; then we define the integral f fd?Ъy the equality f /?? = lim,,^^ / /« ??. However, to prove that the integral J f ?? is independent of the sequence (/n)neN is more complicated. 113. Definition. Let (S, ?, ?) be a measure space. (a) If D с 5 and /: Z) -> С is a function, then / is measurable if its real and imaginary parts Re /, Im / are measurable. (b) If / is a complex-valued function defined ?-a.e., / is integrable if its real and imaginary parts are integrable, and then j /??= ???/?? + ? j Im/??. (c) Let ? e ? and / be a complex-valued function defined on a subset of S. Then fH f is /(/Ih) ??? if this is defined in the sense of (b), taking the subspace measure ?#, i.e., the restriction of ? to ??- 3.5. Lebesgue and Riemann integrals 114. When ? in Section 3.4 is the Lebesgue measure ?'' on W we say that J f ?? is the Lebesgue integral of /, and that / is Lebesgue integrable if this is defined. 115. Riemann integral. There are many ways of describing the Riemann integral; see Fremlin B000a, Vol. 1), Halmos A950). If [a,b] is a non-empty closed interval in R, then a partition (dissection) of [a, b] is a finite list ? = (ar>,a\,..., an), where ? ^ 1, such that
60 ?. Pap a = ?0 ^ ? ? ^ . · · ^ ?? = b. If / is a real-valued function defined and bounded on [a, b], the Darboux upper sum and lower sum of / on [a, b] depending on ? are defined by я Sp(f) =]?(?; -?;-?) sup /(л), /I ¦sp(/) = y^Sai -a,-\) inf /(*). / = l If ? and P' are two partitions of [a, b] and if ? с ?', then sp(f) ? Sp'(f). The upper Riemann integral and /ower Riemann integral of / are defined by J[a.h}(f) = inf{Sp(/): ? is a partition of [a, b]], ha.b\(f) =suP{sP(f)- ? is a partition of [a, b]}, respectively. We say that / is Riemann integrable over [a, b] if J[u.b\{f) = I{«-b}(f)' 'dn<^ in this case take the common value to be the Riemann integral /((' / of / over [a,b]. If /: [a, b] -> ? is Riemann integrable, then it is also Lebesgue integrable, with the same integral. Remark. For unbounded functions and unbounded intervals, one uses various forms of 'improper' integral. For example, the improper Riemann integral f^° ^ dx is defined by Ит.,_>оо /J ^T5" dx, while /J In ? dx is defined by lim.5 ю /,.' In* dx. The second exists as a Lebesgue integral, but the first does not, because f^° | ^ | rfjc = oo. There is a characterization of the Riemann integrable functions, as follows. THEOREM. A bounded function f : [a, b] -> ? й Riemann integrable iff it is continuous almost everywhere with respect to the Lebesgue measure on [a, b]. 116. As a consequence of Lebesgue dominated convergence theorem we have the following result. COROLLARY. Let (W, ?, kr) be a measure space and ]a,b[a non-empty open interval in E. Let f : S ? ]a, b[ —> ? be a function such that (a) the integral F(t) = f f(x, t) dx is defined for every t e]a, b[; (b) the partial derivative jr of f with respect to the second variable is defined everywhere in Er ? ]?, b[; (c) there is an integrable function g : Er -> [0, oof such that \ ^jj)'" I ^ g(x) far every xeW,te]a,b[. Then the derivative F'(t) and the integral f d'f^'n dx exist for every t e ]a,b[, and are equal.
Some elements of the classical measure theory 61 3.6. Integration with respect to a real-valued measure Let ? be a <5-ring of subsets of S and m:T> -> ? a real-valued, ?-additive measure with finite variation |w|. Let ? be the ?-algebra generated by V. The variation |w| is a positive, finite, ?-additive measure on V and has a canonical ?-additive extension ? : ? -> [0, +oo], defined by M(A)=sup{|m|(D): DeP, О С A}, for A e Г. We shall continue to denote ? by |w|. We obtained a measure space (S, ?, \m\). Instead of |w |-a.e. we shall write w-a.e. 117. ?????????. We say that an ?-valued function /, defined w-a.e. is w-negligible (respectively, w-measurable, w-integrable) if it is |m|-negligible (respectively, |w|- measurable, |w|-integrable). All considerations in these sections hold also for Banach space valued functions. It follows that an w-integrable function is finite w-a.e. We denote by C\m) the space of w-integrable functions /:S -» E, i.e., ?' (w) = Lx(\m\). We consider on ?'(w) the topology of convergence in the mean of ?'(|w|), defined by the seminorm \\f\\\= f\f\d\m\, for/e?'(w). Any w-integrable function is equal w-a.e. to a function from ?' (w). All properties stated before for the space ?' (?) remain valid for ?' (w). 118. We now define the integral j f dm with respect to w. For step functions / e S(Ef) of the form / = ]?'.'=| ????, with A, e ?( and x, e ? we defined the integral { " / /dw = ]Tw(A,)jt, el. We obtain a linear mapping L:8{?]) -> ?, defined by L(f)=jfdm, for feS(Zf). If we consider above the sets A/ mutually disjoint, then /1 l/l = ][>4,l-v;l
62 ?. Pap and \f fdm ^Yi\m(Ai)\\xi\^Yi\m\iAi)\xi\= j \f\d\ hence |адКн/Ц|. It follows that L is continuous on S(Ef) for the topology of ?'(m). Since <S(.?/) is dense in ?' (w), we can extend L to a continuous linear mapping L': ?' (w) -> R, and we still have |?'(/)|<ll/lli, for/e?'(m). 119. Dehnition. For any function / e ?' (m), the value L'(/) of L' at / is denoted by f f dm and is called the integral of / with respect to m. If / is m-integrable, with values in R and defined m-a.e., and if g is any function from ?'(w) such that f = g, w-a.e. we define the integral f f dm by the equality / /dm = f g dm, and the integral depends only on /. 120. For/,ge?'(m) and с е R we have / (f + g)dm = / fdm + / gdn I cf dm = с I f dm, and \j fdm Uj\f\d\m\ = \\f\U. From the last equality it follows that if /„ -> / in the mean, then / f,dm -> / / dm. The Lebesgue's dominated convergence theorem remains valid for real-valued measures m. 121. Remark. For real-valued m an alternative definition of integrability and of the integral with respect to m is obtained by means of the positive measures m+ and m~, and does not use the density of the step functions in ?' (m). Namely, we remark that a function /: S -> R is ?-integrable if and only if it is w+-integrable and m~-integrable and define the integral / / dm by the equality / fdm = I fdm+— ? fdm
Some elements of the classical measure theory 63 3.7. Density theorem 122. VlTALI'S THEOREM. Let A be a bounded subset of R and I a family of non- singleton closed intervals in R such that every point of A belongs to arbitrarily small members ofX. Then there is a countable set Iq с X such that (a) Io is disjoint, i.e., ? Л /' = 0 for all distinct I, Г е Io, (b) k(A\{JI0)=0. 123. THEOREM. Let I с R be an interval and /: / -> R ? monotone function. Then f is differentiable almost everywhere on I. 124. THEOREM. Let f be a real-valued function which is Lebesgue integrable over [a, b]. Then F(x) = f(x / exists in R/or every ? e [a,b], and the derivative F'(x) exists and is equal to f(x)for ?-almost every ? e [a, b]. 125. LEBESGUE'S DENSITY THEOREM. Let I be an interval in R, and let f be a real- valued function which is ?-integrable over I. Then 1 Cx+h 1 Г 1 fx+h /U)=lim-/ fdk = lim- fdk = \im— fdk for almost every ? e /. As a consequence we obtain THEOREM. Let I be an interval in R, and let f be a real-valued function which is Lebesgue integrable over I. Then lim— / \f(y)-f(x)\dy=0 for almost every ? e /. 3.8. Absolutely continuous functions For more details in this section see Fremlin B000b), Vol. 2. 126. THEOREM. Let I (ZRbean interval, and f : / -> R afunction of bounded variation. Then f is differentiable almost everywhere in I, f is integrable over I, and we have УИ^УагД/).
64 ?. Pap 127. THEOREM. Let (S, ?.?) be any measure space and f an integrable real-valued function defined ?-a.e. Then for any ? > 0 there are a measurable set ? of finite measure and a real numberS > 0 such that jF \f\dp ? ? whenever F e ? ? ? and m(F) ^ S. 128. DEHNITION. If [a, b] С R and / : [?, b] -> R we call / absolutely continuous if for every ? > 0 there is a 5 > 0 such that ]T'!=| |/(fo;) - /(?,)| ^ ? whenever ? ???\ ?? ? аг ^ Ьг ^ · · · ^ a„ ^ b„ ^ fo and ?"_, (fo, - ?,) ^ 5. 129. The basic properties of absolutely continuous functions are contained in the following Proposition. (a) An absolutely continuous function on [a, b] is uniformly continuous. (b) If f: [a,b] -> R is absolutely continuous it is of bounded variation on [a,b]. Consequently is differentiable almost everywhere on [a.b], and its derivative is integrable over [a,b]. (c) Iff, g : [a, b] -> R are absolutely continuous, so are f + g and cf.for every eel. (d) If f, g : [a, b] -> R are absolutely continuous so is f ¦ g. (e) If g : [a, b] -> [c, d"] a/id" / : [c, d"] -> R are absolutely continuous, and g is non- decreasing, then the composition f о g : [a, fo] -> R /s absolutely continuous. 130. The fundamental theorem of calculus is a part of the following THEOREM. Let [a, b] С R a/id" F : [a. fo] -> R foe a function. Then the following assertions are equivalent: (a) 77?ere is an integrable real-valued function f such that F(x) = F(a) + /(/l f dk for every ? e [a, b]. (b) f* F' dk exists and is equal to F(x) — F(a)for every ? e [a. b]. (c) F is absolutely continuous. As a consequence we obtain THEOREM (Integration by parts). Let f be a real-valued function which is integrable over an interval [a,b] с R, and g:[a.b] -> R an absolutely continuous function. If F(x) = f* fdk forx e [a,b], then b pb fgdk = F(b)g(b)-F{a)g{a)- I F ¦ g'dk. J и PROPOSITION. Let [a,b] С R· Let f:[a,b] -> R be a continuous function which is differentiable on the open interval ]a, b[. If its derivative f is integrable over [a, b], then f is absolutely continuous, and f(b) - f(a) = fj f'dx. Example. Continuous function of bounded variation which is not absolutely continuous: Let С с [0, 1] be the Cantor set. Recall that the Cantor function is a non-decreasing ,
Some elements of the classical measure theory 65 continuous function /: [0, 1] -> [0, 1] such that /'(-*) is defined and equal to zero for every ? e [0, 1] \ C, but /@) = 0 < 1 = /A). Then / is of bounded variation and not absolutely continuous. 3.9. Absolute continuity of measures 131. ?????????. Let (S, ?, ?) be a measure space and v:I^Ra finitely additive set function. (a) v is absolutely continuous with respect to ? if for every ? > 0 there is a ? > 0 such that \v(E)\ ^ ? whenever ? e ? and ?(?) ^ <5. We write ? <<C ?. (b) ? is truly continuous with respect to ? if for every ? > 0 there are ? e ?, S > 0 such that ?(?) < oo and \v(F)\ ^ ? whenever F e ? and ?(? ? F) ^ <5. (c) If ? is countably additive, it is singular with respect to ? if there is a set F e .?7 such that m(F) = 0 and v(E) = 0 whenever ? e ?, ? с 5" \ F. We write write ?_?_?. The basic properties of notions introduced in the previous definition are contained in the following Proposition. (a) Ifv is countably additive, it is absolutely continuous with respect to ? iffv(E) =0 whenever ?(?) =0. (b) ? is truly continuous with respect to ? iff (i) is countably additive, (ii) is absolutely continuous with respect to ?, (iii) whenever ? e ?, ?(?) ? 0 there is an F e ? such that m(F) < oo and v(EHF) ?0. (c) If(S, ?, ?) is ?-finite, then ? is truly continuous with respect to ? iff is countably additive and absolutely continuous with respect to ?. (d) If(S, ?, ?) is finite, then ? is truly continuous with respect to ? iff it is absolutely continuous with respect to ?. For more details see Fremlin B000b), Vol. 2, 232 A, B. 3.10. The Radon-Nikodym theorem 132. PROPOSITION. Let (S, ?, ?) be a measure space, and f a ?-integrable real-valued function. For ? e ? set v(E) = j fdti. Then ? : ? -> R is a countably additive measure and is truly continuous with respect to ?, therefore absolutely continuous with respect to ?.
66 ?. Pap The mapping ? н» fE f is called the indefinite integral of /. 133. The Radon-Nikodym Theorem. Let (S, ?, ?) be a measure space and ?: ? -+ ? a set function. Then the following are equivalent: (i) there is a ?-integrable function f such that v(E) = jE f ?? for every ? e ?; (ii) ? is finitely additive and truly continuous with respect to ?. COROLLARY. Let (S, ?,?) be ? ? -finite measure space and ?: ? -> ? a set function. Then there is a ?-integrable function f such that v(E) = fE f for every ? e ? iff ? is countably additive and absolutely continuous with respect to ?. COROLLARY. Let (S, ?, ?) be a finite measure space and v. ? -> ? a set function. Then there is a ?-integrable function f on S such that v(E) = fL f ????? every ? e ? iff ? is finitely additive and absolutely continuous with respect to ?. Remark. By the Radon-Nikodym Theorem, the question immediately arises: for a given v, how much possible variation is there in the corresponding /? The answer is: two integrable functions / and g give rise to the same indefinite integral iff they are equal almost everywhere. 134. Theorem (The Lebesgue decomposition of a signed measure, see Fremlin B000b), Vol. 2, 323 I). (a) Let (S, ?, ?) be a measure space and ?: ? -> ? a signed measure. Then ? has unique decomposition as ? = us + v.dC = us + цс + ve, where цс is truly continuous with respect to ?, vs is singular with respect to v, and vac is absolutely continuous with respect to ? and ve is zero on every set of finite measure. (b) If S = Rr, ? is the algebra ofBorel sets in W and ? is the restriction of Lebesgue measure to ?, then ? can be uniquely represented as vp + vcs + vac where v-aC is absolutely continuous with respect to ?, vcs is singular with respect to ? and zero on singletons, and vp(E) = X]vei vp{x] for every ? e ?. 135. For the next notion the following result is important. THEOREM. Let (S, ?. ?) be a measure space, and f a non-negative ?-measurable real- valued function defined on a conegligible subset of S. If v(F) = J f ¦ ?/.- ?? whenever F с 5" is such that the integral is defined in [0. oo], then ? is a complete measure on S, and its domain includes ?. Definition. Let (S, ?, ?) be a measure space, and ? another measure on S with domain ?. ? is an indefinite-integral measure over ?, or sometimes a completed indefinite-integral measure, if it can be obtained from some non-negative, finite measurable function / defined ?-almost everywhere on S. In this case we will call / a Radon-Nikodym derivative of ? with respect to ?.
Some elements of the classical measure theory 67 3.11. Conditional expectation 136. One of the important applications of the Radon-Nikodym Theorem is related to the conditional expectation. Let S be a set and ? a ? -algebra of subsets of S. A ?-subalgebra of ? is a ? -algebra ? of subsets of S such that Tcr. PROPOSITION. Let (S, ?, ?) be a measure space and ? a ?-subalgebra of ?. A real- valued function f defined on a subset of S is ?\?-integrable iff (a) it is ?-integrable, (b) Dom/ is ^\i-conegligible, (c) / is ?\?-measurable; and in this case f fd^\x) = ffdfi. 137. Dehnition. Let / e L'(m)- A conditional expectation of f (defined on S) with respect to ? is a function g defined ?-a.e. on 5* which is ? ??-integrable and T-measurable such that fEgd(?\?) = ff. f ?? for every ? e T. By the preceding lemma we see that for such a g we have / gd(p\T)= / g-XEd^\T)= / g- ????= gd?, for every ? e T. Then g is almost everywhere equal to a T-measurable function defined everywhere on 5" which is a conditional expectation of / on T, too. 138. The following two statements are closely related to the convex functions, which will give an important result related the conditional expectation. PROPOSITION. Let I c R be a non-empty open interval (bounded or unbounded) and ?: I -> R a convex function (?: I -> R, for an interval I с R, is convex if (p(ab + A — a)c) ^ oup(b) +A — <*)<p(c) w/?e/iever fo, с e / a/ida e [0. 1]) (a) For every a e I there is b e R лис/г ?/??? <р(лт) > <?(?) + fo(* - a) for every x e I. (b) If we take by (a), for each q e I nQ, fov e R iwc/г г/шг <p(;t) ^ <p(<7)+ Ь(/(.х —q)for every x e /, ?/геи <pU) = sup (<р(?) + ^и-?)). ^e/nQ (c) <p ? Bore/ measurable. PROPOSITION (Jensen's inequality). Let (S, ?, ?) be a measure space and ?-.Ш^- R a convex function. (a) Suppose that f, g are real-valued ?-measurable functions defined ?-almost everywhere on S and that g >0 а.е., f gd?= 1 and f ¦ g is ?-integrable. Then <pl g- /??.)? I g-(<pof)dp, where we may need to interpret the right-hand integral as oo.
68 ?. Pap (b) Specially for ?(?) = 1 and f a real-valued function which is ?-integrable over S, we have (p(f fd?)^f?of ??. 139. PROPOSITION. Let (S, ?, ?) be a probability space, and ? a ?-subalgebra of ?. Let f be a ?-integrable function and let h be ? (?\?) -measurable real-valued function defined (?|?)-?/№??? everywhere on S. Let g, go be conditional expectations of f and \f\ with respect to T, respectively. Then f ¦ h is ?-integrable iff go · h is ?-integrable, and then g ¦ h is a conditional expectation of f ¦ h with respect to T. 3.12. The Lebesgue-Stieltjes integral 140. Let / be an interval and let g : I -> R be a function with finite variation |g|, and right continuous on /, except, possibly, at the left end point of /. Let B(I) be the Borel ?-algebra of / and V(I) the ?-ring of bounded Borel subsets of /. Let ms : T>(I) -> R be the ?-additive measure with finite variation |/ms| = m|4], associated to g (see Section 2.7). Consider the space ?' (mg) = ?' (\т^\) of m^-integrable functions /: S -> R. The space C\mg) is also denoted by С[(g) and for any function / e ?'(ffls). the integral f f dmx is also denoted by f fdg and is called the Lebesgue-Stieltjes integral of / with respect to g. 3.13. Transformation of coordinates 141. We will give a generalization of the basic formula of calculus Jg(y)dy = j g(<p(x))<p'(x)dx in the context of a general transformation^ between measure spaces. THEOREM. Let (?, ?, ?) and (?, ?, v) be measure spaces, and ?:?? -> ?, J :Dj -> [0, oof functions defined on conegligible subsets ??, Dj ofX such that whenever left side exists and F e ? and v(F) < oo. Then / J -^??)??= / gdv whenever left side exists, for every v-integrable real-valued function g, where we put Ofor (J ¦ (g ? ?)){?) when J (?) = 0 and g((p(x)) is undefined. See Bauer A990), Elstrodt A996), Fremlin B000b), Vol. 2, for the proof and further generalizations. 142. Change of variables. Let Q С R' be a bounded open set with a boundary dQ of measure zero which is mapped with a function ? : Q -> R'\ ? = (??,..., ?,-), of class
Some elements of the classical measure theory· 69 C1 as a bijection on a bounded open set ? с К' with a boundary 3? of measure zero with the Jacobian 9(?|,...,?,.) J = —— ^—i^O. Э(Л|,...,ЛГ) Then the following holds. (a) The functions ? and ?^1 map every set of measure zero on a set of a measure zero; (b) The function /: ? -> R is (Lebesgue) integrable on the set ? if and only if the function (/ о <t>)\J\ is (Lebesgue) integrable on the set Q. Then the following equality holds ? f(y)dy= f (fo0)(x)\J\dx, Jn Jq where dx — dx\ ¦ ¦ ¦ dx, and dy = dy\ ¦ ¦¦ dyr. 4. Function spaces 4.1. Essentially bounded functions 143. Let (S, ?, ?) be any measure space. We recall that we denote by ?°, or ?°(?) the space of real-valued functions / defined on conegligible subsets of S which are ?- measurable, that is, such that f\n is measurable for some conegligible set ?CS, see 45. We define a partial ordering relation ^ on L° (see 45) by saying that / ^ g (as classes of equivalence) iff / ^ g a.e. as measurable functions. L° is a Riesz space or vector lattice, that is, a partially ordered linear space (a (real) linear space with a partial ordering ^ such that if и ^ ? then и + w ^ ? + w for every w, if 0 ^ и then 0 ^ cu for every с ^ 0) such that и ? ? = sup{w, ?}, и л ? = inf{w, и) are defined for all u, ? e L°. We introduce the normed (complete, Banach) vector space L'(m) as the set of equivalence classes of members of ?' (?), which is a Riesz space (even it is Dedekind complete, i.e., every non-empty subset which is bounded has a least upper bound in L' (?)). 144. Let (S, ?,?) be a measure space. Let Cx = ?:?(?) be the set of functions / e ?° = ?°(?) which are essentially bounded, i.e., there is some ? ^ 0 such that l/l ^ ??, ?-a.e., and we define L°° = L°°(M) = {/ (as class): / e ?^(?)} с ?°(?). THEOREM. Let (S, ?, ?) be any measure space. Then (a) L°° = ?°°(?) ? a linear subspace of L° = ?°(?). (b) Ifue Lx, ? e L° and \v\ ^ \u\ then ? e Vе. Then \u\, и ? ?, и л ?, и+ = и ? 0 an<i и~ = (-и) ? 0 belong to Lx for all и. ? e L^. (c) Taking e = 1, ?/?e equivalence class in L° of the constant function with value 1, ?/геи аи element и of I? belongs to L°° iff there is an ? > 0 ??/с/г f/гаг |и| ^ Me.
70 ?. Pap (d) If и, v e Lx then и ¦ ? e Lx. (e) IfueLx, veL] = L'(m), then и ¦ ? e L1. The property of L^ to be a Riesz space is dominated by the fact that it has an order unit e = 1 with the property (c). 145. Let (S, ?, ?) be any measure space. For / e Cx = ?:?(?), say that the essential supremum of |/| is esssup|/| =inf{M: ? ^ 0. \f\^M, ?-a.e.}. Then l/l ^ esssup |/| ?-a.e. We may define a norm || ¦ \\? on Lx = ?^(?) by setting ||м||зс = esssup |/| whenever и = f (equivalence class). We have for any и e Lx, \\u\\x = min{y: \u\ ? ye], where e = leL°°. If u, ? e L00 and \u\ ? \v\ then ||м||зс ^ IIf Hoc an(J 11^ * ^lloc ^ II и || эс || I'll oc , 11^ ^ ^ll^c ^тах(||м||зс, ||и||зс). for all u,v & L°°, i.e., Lx is a commutative Banach algebra and a Banach lattice. 146. If и e L> = ?'(?) and ?)?[? = ?^??), then и · и e L1 and |/и ¦ ?|<*? ^ II" II ? II I'll ос · Therefore, we have a bounded linear operator ? from Lx to the normed space dual (L1)* (space of continuous linear functionals) of L1, given by (Tv)(u) = J и ¦ ???, for every и e L], ? e L00. 4.2. The space Lp(?) 147. DEnNlTlON. Let E, 2?, ?) be any measure space, and ? e ]l,oo[. Then we define ???) = {?; MeL°, iHl'^i'tM)). THEOREM. Let (S, ?, ?) be a measure space, and ? ? [1. oo]. (a) ?7{?) is a linear subspace of ?/'(?). (b) //и e ?,''(?). ? e ^°(?) ?«? ? ^ |и|, then ? e ?/(?)- Therefore \u\, и ? ? and и ли belong to Lp^)forall и, ? e ?,''(?)· (c) The partial ordering inherited from ?°(?) makes ?/(?) ? /?/??: space.
Some elements of the classical measure theory 71 148. We define the norm ||и||р = (/ \u\p???/? for every и е L''(m)· PROPOSITION (Holder inequality). Suppose (?,?,?) isa measure space, and that p,q e ]1, oof are such that - + ^ = 1. Then и ¦ ? e Ll (?) and I/ и ¦ ??? for и eL'4M), ??^(?). 149. PROPOSITION. Let (?, ?, ?) be a measure space and ? e ]1, oof. V is a Banach space under the norm \\ ¦ \\. If q = p/(p — 1), then for every и e L''(M), ||h||p = max{/и ¦ ???: ? e ^(?), \\?\\^ ?. 1}. Specially, for ? = 2 is a Hubert space with the inner product f f ¦ gdp. 150. THEOREM (Duality in V spaces). Let (S, ?,?) be a measure space, and ? e ]l,oo[. Ifq = ?/{p- 1), then uv e ?-'(?) and \\u -v\\\ ^ ||?||?||????/ whenever и е ??{?) and ? e Lq (?). Therefore we have a bounded linear operator ? from L'1 (?) to the normed space dual (?''(?))* of ?7(?), given by (Tv)(u) = I u ¦ ??? for all и е ^(?), ? e L''(m) and the canonical map ? : ?>(?) -> Z/(M)* is a normed space isomorphism. 4.3. Convergence of sequences of functions 151. A sequence (/„),„=n of functions, f,:S-> E, is said to converge in measure on the set S to a measurable function /: S -> R if, for every ? > 0, we have lim ?({?: \f„(x) - f(x)\ ^ ?}) = 0. If the sequence (/„)„ем converges ?-a.e. to a function / and ?E) < oo, then it converges to / in measure as well, while if the sequence (/„),„=n converges to / in measure, then there exists a subsequence of (/„)„еь that converges to / ?-a.e. 152. The following theorem states the relation between almost-everywhere convergence and uniform convergence. THEOREM (Egorov). Let ? be ? ?-additive measure defined on ? ?-algebra ?, let ? e ?, ?(?) < oo, and let a sequence (/„);,er; of ?-measurable almost-everywhere finite functions f„: ? -> R converge almost-everywhere to a function f. Then for every ? > 0
72 ?. Pap there exists a measurable set ?? С ? such that ?(?\?() < ?, and such that the sequence (/u)/ieN converges to f uniformly on Ee. For the case where ? is the Lebesgue measure on the line this was proved by D.F. Egorov A911). Egorov's theorem has various generalizations, e.g., for a sequence of measurable mappings of a locally compact space into a metrizable space. 5. Product measures and the convolution 5.1. Product measures 153. Dehnition. Let (S\, ?\,?\) and (Si. ??,??) be two measure spaces. For А с 5? ? S2 we take /j(A) = inf ]???(?„)·?2(^): EneZi, F„ eb(neN), А с (J l;i=0 ueK (using the convention 0 · oo = 0). ? is an outer measure on S\ ? Si. 154. Dehnition. Let E|,2?|,??) and (Si. ?2. ??) be two measure spaces. The primitive product measure on S\ ? Si is the measure ? derived by Caratheodory's method from the outer measure ? from 153. 155. An alternative definition of the product of measures is given in the following Dehnition. If S\ and Sj are sets with ?-algebras ?\ с V(S\) and ?? с V{Si), respectively, we denote by ?\ § ?? the ?- algebra of subsets of 5? ? Si generated by the semiring of rectangular sets {? ? F: ? e ?\, F e ??]. 156. PROPOSITION. Lei E?, ?\, ??) and (S2. ?2, ??) be two measure spaces; let ? be the primitive product measure on S\ ? Si, and ? its domain. Then ?\®?? с Т and ?(? ? F) = ?\{?) ¦ V2{F) for all ? e ?\, F e ??. 157. The third type of product of measures is given by the following definition. ?????????. Let E?,2?|,??) and (Si. ??. ??) be two measure spaces, and ? the primitive product measure. The c.l.d. product measure on 5? ? Si is the function ?: Dom# -> [0, 00] defined by v(A)=sup{e(Af)(E ? F)): Eeli.Feb. ?\(?) < oo. ?2(F) < 00} for A e Dom ?.
Some elements of the classical measure theory 73 158. THEOREM. Let (S\, ?\, ?\) and (Si, ??, ??) be two measure spaces; let ? be the c.l.d. product measure on S\ ? Si, and ? its domain. Then (a) ?\ <g) ?? с ? and v(E ? F) = ?\(?) ¦ ??(^) whenever ? e ?\, F e ?? and ?\(?)-?2(?) < oo; ^ (b) for every AeT there is а В е ?\® ?2 such that ? с A and v(B) = v(A)\ (c) (S\ ? S2,T, v) is complete and locally determined, and in fact is the c.l.d. version of(S\ ? Si,T,9); in particular, v(A) =?(?) whenever ?(?) < oo; (d) if AeT and v(A) > 0 then there are ? e ?\, F e ?? such that ?\(?)<??, ?2{?) < oo and ?(? ? (? ? F)) > 0; (e) if A e ? and v(A) < oo, then for every ? > 0 there are Eq, ..., En e ?\, Fq, ..., F„ e ??, all of finite measure, such that v(aa|J(?, xF;)W. Example. Let r, ? ^ 1 be integers. Then there is a natural bijection ?: W ? Es -> W+s. This bijection identifies Lebesgue measure on W+s with the product of Lebesgue measure on Rr and Lebesgue measure on W. For more details see Fremlin B000b), Vol. 2. 5.2. Repeated (iterated) integral 159. Let E?, ?\,?\) and (So, ??,?2) be two measure spaces, and / a real-valued function defined on a set Dom/ с 5? ? S2. The repeated (iterated) integral 11 ??,?)???(?)??2(\·)= f(f ??.\)???(?))??2(}·). is the integral j^(?)??2(?) (if this is defined), where h(y)= ? /(?,?)??\(?), у e ? ?: ? e 5, / f(x,z)d?\(x) is defined in R\. Analogously, reversing the roles of 5? and S2, we can define a repeated integral Jf /(?,?)??2?)???(?)= {({ /(?,?)??2(}'))<1??(?). We can connect these repeated integrals to each other by connecting them both with the integral of / itself with respect to the (primitive) product measure on 5? ? S2.
74 ?. Pap 160. THEOREM (Fubini). Let (S\, ?\, ??) ???? (So, ??, ??) be two measure spaces, and ? the primitive product measure on S\ ? Si. Let f be ? ? -integrable real-valued function. Then ff f(x, ?)??\(?)???(?) and ff f(x, \)???(?)??\{?) exist and are both equal to ff(x,y)d0(x,y). 161. THEOREM (Tonelli). Let (S\, ?\, ??) and {Si, .???, ?:) be two measure spaces, ? the c.l.d. product measure on S\ ? Si, and ? its domain. Let f be a T-measurable real-valued function defined on a member of ?, and suppose that either ff \f(x, }>)\??\(?)???(?) or ff !/(¦*> У)\ац.г(у)ац.\(х) exists in R. Then f is v-integrable. 162. We have used the primitive product measure in Fubini's theorem and the c.l.d. product measure in Tonelli's theorem. When these two measures are the same we have the following COROLLARY. Let (S\, ?\,?\) and (Si, ??. ??) be two ?-finite measure spaces, ? the c.l.d. product measure on S\ ? Si, and ? its domain. Let f be a T-measurable real-valued function defined on a member ofT. Then if one of ? |/(*. JS, xSt y)\dv(x,y), f f \f(x,y)\dpdx)dp2(y), Js^ Js, ? ? ^(?^)\???(?)?\(?) Js, Js^ L S, xS-> exists in Ш, so do the other two, and in this case f(x,y)dv(x,y)= ? ? f(x.x)d?](x)d?2(y) Js2 Js{ = 11 ?&.>·)<??2(}·)<1?\(?). J St JS2 Example. Let E?, ?\,?\) be [0, 1] with theLebesgue measure, and let (Sb, ??, ??) be [0, 1] with the counting measure. Take the set A = {(;, ?): ? e [0, 1]} с 5? ? Si. We have neNjt=0 k + l n + l и + 1 k + l П + 1 /7+1 e ?\ ®?2- Then // ??(?.?)??\{?)???(?)= / 0??2(?)=0. // ??(?.?)???^)??\{?)= / ???(*) = 1. We see that the two repeated integrals differ although both repeated integrals exist and are finite. We remark that ?(?) ? v(A). For more details see Fremlin B000b), Vol. 2, 252 J.
Some elements of the classical measure theory 75 5.3. Infinite products For more details see Fremlin B000b), Vol. 2, 254. 163. Let {{Si, ??, M,)},e/ be a family of probability measure spaces. Let S = f]ie/ 5,- be the family of functions ? with domain / such that x(i) e 5,- for every i el. Denote by С the family of subsets of S of the form с = Пс>, ie/ IS where C, e IT,- for every /' e / and the set {/: C, ^ 5,-} is finite. For a non-empty С е С thi representation is unique. Members of С are called measurable cylinders. 164. We define a set function щ : С -> [0, 1 ] by 40(C) = f[/if (С,-) ie/ whenever C, e 2Г,- for every / e / and {/: С\Ф S,} is finite. Since only finitely many terms in the product can differ from 1, so that it can be treated as a finite product. If С = 0, one of the Cj must be empty, so щ(С) is surely 0, although the representation of С as F](.g/ Q is no longer unique. Now define ? :V(X) -> [0, 1] by setting »j(A) = inf ОС ? /?o(C„): C„ e С for every n e N, А с (J C„ ,u=0 ueN The set function ? is an outer measure on S. 165. Let {(Sj, ??, ?/)},-e/ be a family of probability spaces, and 5 the Cartesian product П,-е/ и The product measure on 5 is the measure defined by Caratheodory's method from the outer measure ? from 17. 166. Let {S,-};e/ be any family of sets, and 5" = П/е/ S- If -??? is a ?-subalgebra of subsets of Si for each/ e /, we denote by ®,-е/2Г,· the ?-algebra of subsets of 5 generated by {{x: xeS, x(i)eE): i el, Ее ??}. THEOREM. Let {(Si, ??,??)}?€? be a family of probability spaces, andlet ? be the pmduct measure on S = П/е/ ^ defined as in 165 vv/fft the domain T. Then (a) v(S) = l.
76 ?. Pap (b) // ?/ e ?? for every i e /, and {/: ?, ? S,} is countable, then J~I(.g/ ^' е ^~> and v(Y\j€l Ei) = Y\j€l /J-i(Ej). In particular, v(C) = щ(С) for every measurable cylinder C, and if i e / then ? н» ? (i):: S -> S/ /? inverse-measure-preserving. (c) ®,.е/Г,-сТ. (d) ? /? complete. (e) For every A e ?, ? > 0 ?/геге /? a finite family Co C„ of measurable cylinders such that ?(?? (J^,, Q) ^ ?. (f) For ever}' AeT ?/геге are A\, Ai e ^j€lEj such that A\ С А с At and ?(?2\?,)=0. 167. EXAMPLE (Theproduct measure on {0. 1}'). Let Sj ={0, l},/e/, and let each ?, be the 'fair-coin' probability, i.e., ?,({0}) =?,·({1}) = 1/2. The product S = {0, 1}' has a family {?,},,= / of measurable sets such that v|P|?,-|=2"|У|, if У с / is finite. where ? is the product measure on {0. 1}'. This ? is called the usual measure on {0. 1}'. If / is finite then v({x}) = 2_l" for each ? e S, and if / is infinite, then v({*}) = 0 for every ? e S. 168. There is a natural bijection between {0. 1}' and V(I), matching ? e {0, 1}' with Av = {/: / e I, x(i) = \]. In this way we obtain a standard measure ? on V(I), which is called the usual measure on V(I). We remark that for every finite Sc/, every С с ? we have v({A: АПВ = С}) = v{x: x{i) = 1 for/ e C. .*(/) =0for/ e B\C] _ 2-i«i 169. If / is countably infinite, then there is a very important relationship between the usual product measure of {0, 1}' and Lebesgue measure on [0, 1]. PROPOSITION. Let ? be the usual measure on S = {0. 1 }'\ and let ? be Lebesgue measure on [0, 1]. Let ? be the domain of ? and ? for the domain of ?. For ? e S set ?(?) = ?,??2~'~'-*(')¦ Then we have ?~\?) e ? and ?(?^(?)) = ?(?) for every ? e ?, and <p(F) e ? and X(<p(F)) = v(F) for every F e T. There is a bijection ?:?—* [0, 1] which is equal to ? at all but countably many points, and any such ? is an isomorphism between (S,T,v) and ([0, 1], ??,?). 170. If ? is the usual measure on {0. 1}'. then L1 (v) has a countable dense subset, for its norm topology, iff / is countable.
Some elements of the classical measure theory 77 5.4. Convolution 171. Definition. Let / and g be measurable complex-valued functions defined almost everywhere in R''. We denote by / * g the function defined by *g)(x)= / (f*g)(x)= I f(x-y)g(y)dy for every set for which the integral (with respect to Lebesgue measure) is defined. Then / * g is called the convolution of the functions / and g. 172. Basic properties. Linearity of the integration implies ((/l +f2)*g)(x) = (fl*g)(x)+(f2*g)(x), (cf * g)(x) = (f * cg)(x) = c(f * g)(x) whenever the right-hand sides of the formulae are defined. If /, g are measurable complex-valued functions defined almost everywhere in W, then / * g = g * /. in the strict sense that they have the same domain and the same value at each point of that common domain. If /?, /2, g\, g2 are measurable complex-valued functions defined almost everywhere in R'', and /1 = /2 a.e., g\ = gi a.e., then for every ? e W we have (/1 * g\)(x) = (/2 * gi)ix) in the sense that if one of these is defined so is the other, and they are then equal. If /, g are complex-valued functions which are integrable over W, then / * g is integrable, and we have j f*gdx=j fdx j gdx. j\f*g\dx^j\f\dxj\g\dx. 173. THEOREM. Let f, g, h be complex-valued measurable functions defined almost everywhere in W. If ? eW is such that one of (\f\ * (\g\ * \h\))(x), ((|/| * |g|) * \h\)(x) is defined in R. Then f *(g*h) and (/ * g) * h are defined and equal at x. 174. PROPOSITION. Suppose that f, g are measurable complex-valued functions defined almost everywhere in R, and that f e C~, g e Ci where p,q e [1, 00] and jy + - = 1 (writing-^ =0asusual). Then f * g is defined everywhere inR, and supveK|(/ *g)(x)\ ? WfWpWgWq- 5.5. Integral transforms 175. Dehnition. The Fourier transform ? of a function / e L1 (R') is defined by (rf)(x) = f(x) = —L- f e-i:xf(z)dz {xeW),
78 ?. Pap where ? ? = ?'*=??***- 176. Theorem. Let f eL'(R'). Then: A) /eC(R'); B) ifzk ¦ f(z) is also a function from l} (R''), then there exists Dkf(x) and we have Dkf(x) = -i?f(x); more generally, if z"k ¦ f(z) for к = 1, ..., r are also functions fют L'(R'"), then there exists Da f(x) and we have Daf(x) = Dtrf(x) for a = (<*],... ,<*,), ai-eNLJ {0}; C) ifDkfeLl(W)UC(Rr),then L\f(x)=ixkf(x); more generally, if D"k f e L' (Rr) U C(R') for к = 1,..., r, then D^f(x) = (-ix)af(x) for a = (<*],... ,<*,-), ai-eNU {0}. 177. THEOREM. For a linear differential operator L(D)= ? aaDa and a function f e C'"(R') with D" f e L] (W)for \a\ ^ m the following equality holds L(D)f(x) = L(ix)f(x). 178. The following exchange formula allows us to transfer the convolution in the usual product. Theorem. lff,geLl(W),then T7g(x) = Bn)"'2f(x)g(x). 179. Dehnition. The inverse Fourier transform T~l of a function / e L'(IR') is defined by •F-1 (/)(*) =/(-*)¦
Some elements of the classical measure theory 79 THEOREM. /// e L'(E'),/ e L'(R') and f(z) is a continuous function at the point z = z°, then f(z°)=T-l(f)(z°), i.e., f(z°) = Bn)-"'2 [ e-^f(y)dy. 180. Let / be a complex-valued function defined almost everywhere on [0, oof, endowed as usual with Lebesgue measure. Its Laplace transform is the function F defined by writing poo F(s) = e-"f(x)dx Jo for all those complex numbers ? for which the integral is defined in С Then we have (a) if ? e Dom F and Res' ^ Res then s' e Dom F; (b) F is analytic on the interior of its domain; (c) if F is defined anywhere then limRe.v_».x; F(s) = 0; (d) if/, g have Laplace transforms F, G then the Laplace transform of / + g is F + G, at least on Dom F П Dom G. 6. Regular measures 6.1. Topological measures 181. Dehnition. A measure ? defined on the Borel ?-algebra B(T) of a Hausdorff topological space ?, such that г с ? (? is the family of all open sets), is called regular if for any Borel set В and any ? > 0 there is an open set G С ? containing В, В С G, and such thatM(G\ ?) <?. An equivalent definition is as follows: For any В e B(T) and any ? > 0 there is a closed set F С В such that ?(? \ F) < ?. For example, the Lebesgue measure is regular. 182. An additive set function ? defined on a family of sets in a topological space is regular if its total variation \?\ satisfies the condition \?\(?) = inf |m|(G) = sup|/i|(F), Fc?cG, where G denotes the interior of a set G and F the closure of a set F (and E,G, F, are in the domain of definition of ?). THEOREM (Aleksandrov). Every bounded finitely additive regular set function, defined on a semiring of sets in a compact topological space, is countably additive. 183. We restrict now to the case ? = W.
80 ?. Pap Definition. Let ? be a measure on R'", where r > 1, and ? its domain, ? is a topological measure if every open set belongs to ?. ? is locally finite if every bounded set has finite outer measure. If ? is a topological measure, it is inner regular for the compact sets if v(E) = sup{v(K): К с ? is compact} for every ? e ? (because ? is a topological measure, and compact sets are closed, v(K) is defined for every compact set ?), ? is a Radon measure if it is a complete locally finite topological measure which is inner regular for the compact sets. If ? is a Radon measure on B(W). and ? its domain, then ? is ?-finite, and for any ? e ? and any ? > 0 there is a closed set F с ? such that v(E \ F) ? ?. For every ? e ? there is a set ? с Е, which is the union of a sequence of compact sets, such that v(E\H)=0. THEOREM. A measure ? on W is a Radon measure iff it is the completion of a locally finite measure defined on the ? -algebra В ofBorel subsets ofW'. THEOREM. Let ? be a Radon measure on W, with domain ?, and f a non-negative ?- measurable function defined on a v-conegligible subset ofW. Suppose that f is locally integrable in the sense that fEf< oo for every bounded set ? e ?. Then the indefinite- integral measure ? on W defined by v'(E) = I fdv whenever {?. ? e E, f(x) > 0} e ? is a Radon measure on W . Examples. (a) Lebesgue measure on W is a Radon measure. b) Let (f„)„epj be any sequence in R'\ and (а„)яек any summable sequence in [0, oof. For every ? с R set ?(?) = ]?{?„: t„eE]. Then ? is a (totally finite) Radon measure on R'. (c) Cantor measure. Recall that the Cantor set is a closed negligible subset of [0, 1], and that the Cantor function is a non-decreasing continuous function /: [0, 1] -> [0, 1] such that /@) = 0, /A) = 1 and / is constant on each of the intervals composing [0, 1]\C. It follows that if we set g(x) = ^(x+f(x)) for ? e [0, l],then g : [0, 1 ] -> [0, 1 ] is a continuous bijection such that the Lebesgue measure of g(C) is j; consequently g_1 : [0, 1] -> [0, 1] is continuous. Now extend g to a bijection h : R -> R by setting h(x) = ? for ? e R \ [0. 1]. Then h and /г-1 are continuous. Note that h(C) = g(C) has Lebesgue measure 5.
Some elements of the classical measure theory 81 Let v\ be the Radon measure on R obtained by applying the method in the last Theorem to Lebesgue measure ? on R and the function 2?(/,(?)- Then v\ (h(C)) = v\(R) = 1. Let ? be the measure v\h, that is, v(E) = v\(h(E)) for just those ? с R such that h(E) e Dom v\. Then ? is a Radon probability measure on R, and v(C) = 1, v(R \ С) = м(С) = 0. For more details see Fremlin B000b), Vol. 2. If ? is a Radon measure on R' then it is outer regular, i.e., v(E) = inf{ v(G): ? с G is open} for every set in the domain of v. THEOREM (Lusin). If ? is a regular Borel measure onW , ? is a Borel set offinite measure on R', and f is a Borel measurable function on E, then, for every ? > 0, there exists a compact set К С ? such that ?(? \ ?) < ? and such that f is continuous on K. 6.2. Convolution of measures 184. ?????????. Let v\, vi be two finite Radon measures on R'. Let ? be the product measure on W ? R'; then ? is also a (totally finite) Radon measure. Define ? : R' ? R' -> R' by ?(?, y)=x+y; then ? is continuous, therefore measurable. The convolution v\ * i>2 of ?>? and i>2 is the image measure ??~?. This is a Radon measure. Remark that if v\ and i>2 are Radon probability measures, then ? and ?>? * vi are also probability measures. 185. THEOREM. Let v\, vi be two finite Radon measures on R'; let ? = v\ * i>2 be their convolution, and ? their product on R' ? R'. Then for any real-valued function h defined on a subset ofW, we have / h(x+y)dv(x,y)= / h{x)dp(x) if either integral is defined in [—oo, oo]. 186. If ?>? and i>2 are finite Radon measures on R', then v\ * i>2 = V2 * v\ ¦ If vn v2 and V3 are finite Radon measures on R', then (?? * V2) * vj = v\ * (^2 * V3)- References Bartle, R.G. A966), The Elements of Integration. Wiley. New York. Bauer, H. A990), Mass- und Integrationstheorie. W. de Grayter & Co.. Berlin. Bhaskara Rao. K.P.S. and Bhaskara Rao. M. A983). Theory of Charges. Academic Press. London.
82 ?. Pap Bongiomo, B. and Dinculeanu, N. B001). The Riesz representation theorem. Extension of additive measures. J. Math. Anal. Appl. 261. 106-132. Bourbaki. N. A952-59). Integration, Chapitres I-VI. Hermann. Paris. Caratheodory, C. A917), Vorlesungen tiber reelle Funktionen, B.G. Teubner. Leipzig-Berlin. Cohn. D.L. A980). Measure Theory, Birkhauser. Basel. Diestel, J. and Uhl, J.J.. Jr. A977). Vector Measures. Amer Math. Soc.. Providence. RI. Dinculeanu. N. A967), Vector Measures. Pergamon Press. Oxford. Dinculeanu. N. B000). Vector Integration and Stochastic Integrations in Banach spaces. Wiley. New York. Dinculeanu. N. B002). Vector integration in Banach spaces and application to stochastic integration. Handbook of Measure Theory. E. Pap. ed„ Elsevier. Amsterdam. 345-399. Dudley. R.M. A989), Real Analysis and Probability. Wadsworth & Brooks/Cole. Dunford. N. and Schwartz. J. A958). Linear Operators. Part I. General Theory. Wiley, New York. Egorov. D.F. A911). Sur les suites de functions mesurables. C. R. Acad. Sci. Paris 152. 244-246. Elstrodt. J. A996). Mass- und Integrattonstheorie. Springer. Berlin. Engelking, R. A989). General Topology. Sigma Series in Pure Mathematics, Vol. 6. Heldermann. Berlin. Feller. W. A966). An Introduction to Probability Theory and its Applications. Vol. II, Wiley. New York. Fremlin. D.H. B000a). Measure Theory. Volume I. Torres Fremlin. Fremlin. D.H. B000b), Measure Theory, Volumes 2-5. Internet: www.essex.ac.uk/maths/staff/fremlin/mt.htm George. C. A984), Exercises in Integration, Springer. Berlin. Halmos. PR. A950). Measure Theory. Van Nostrand. New York. Halmos. PR. A960). Naive Set Theory. Van Nostrand. New York. Ionescu Tulcea. A. and Ionescu Tulcea, С A969). Topics in the Theory of Lifting. Springer. New York. Kelley, J.L. A955), General Topology. Van Nostrand. New York. Lebesgue. H. A904), Lefo/?i sur /'integration et la recherche des functions primitives, Gauthier-Villars. Paris. Lebesgue, H. A966). Measure and the Integral, Holden-Day. Munroe, M.E. A953). Introduction to Measure and Integration. Addison-Wesley. Reading. MA. Renyi, A. A970), Probability Theory, North-Holland. Amsterdam. Royden. H.L. A963). Real Analysis, Macmillan. New York. Rudin, W. A973), Real and Complex Analysis. McGraw-Hill, New York. Shiryayev, A. A984), Probability, Springer. Berlin. Solovay. R.M. A970). A model of set theory in which every set of reals is Lebesgue measurable. Ann. of Math. 92, 1-56. Widom, H. A969). Lectures on Measure and Integration, Van Nostrand, Reinhold. Williamson, J.H. A962). Lebesgue Integration. Holt. Rinehart & Winston.
CHAPTER 3 Paradoxes in Measure Theory Miklos Laczkovich Department of Analysis. Eotvos Lorand University, Pdzmany Peter setany I/C. Budapest 1117. Hungary E-mail: laczko@renyi.hu Contents Introduction 85 1. Paradoxical sets 86 2. Paradoxes in Ж" for ? > 3 and in non-euclidean spaces 89 3. Invariant measures and amenable groups 92 4. Decompositions and perfect matchings 97 5. The type semigroup 98 6. Nonamenable actions and local commutativity 101 7. Marczewski's problem 108 8. Tarski's circle-squaring problem 110 9. The problem of equidecomposability with measurable pieces 115 10. Countable equidecomposability and countably additive invariant measures 117 11. The nonconstructive element in the paradoxes 119 References 120 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 83
This page intentionally left blank
Paradoxes in measure theory 85 Introduction The term 'paradox' has two different meanings. The first sense of the word refers to a contradiction or antinomy without any apparent solution. Examples of paradoxes of this kind were the contradictions concerning the values of some infinite series in the seventeenth and eighteenth century (before the precise definition of convergence of infinite series was given by Cauchy), and the set theoretical antinomies, such as Russel's paradox, before the creation of axiomatic set theory. The other meaning of the word 'paradox' is a correct mathematical statement contradicting our intuition. In the sequel we shall use the term in this second meaning. The earliest, and one of the most important example of such a paradox is the 'paradox of infinity' stating that an infinite set can be mapped injectively into a proper subset of itself. This fact was first discovered and clearly explained by Galileo Galilei A638). About 250 years later R. Dedekind A932) realized that this property characterizes infinite sets and used it as the definition of infinity. In the period 1914—1929 several geometric variants of the paradox of infinity were discovered by Felix Hausdorff, the Polish school, and John von Neumann. Among these discoveries the most spectacular is the famous Banach-Tarski paradox: it states that in E3 any two bounded sets with nonempty interior are equidecomposable. Paradoxes are always appealing, but the special charm and beauty of these statements stems from the fact that they represent a mixture of set theory, measure theory, geometry and even discrete mathematics. In fact, some of these paradoxes were motivated by the set theoretical approach itself; that is, by the freedom of forming arbitrary subsets of R" and treating them as geometric objects. By imitating the procedures of geometric dissections, they also share the appeal of recreational mathematics. However, the importance of these paradoxes is due to their close connection to invariant measures, as discovered by Hausdorff, Banach, Tarski, and von Neumann. Hausdorff's paradox, for example, was created in order to prove the nonexistence of finitely additive invariant measures on the sphere. The exact connection is expressed by Tarski's famous theorem A929) stating that, under very general conditions, the nonexistence of paradoxes is equivalent to the existence of invariant measures. In this way a theory was taking shape that unified the investigations of paradoxical sets, invariant measures and equidecomposability. The paper of Banach and Tarski A924) is the first systematic exposition of this theory. Tarski's book A949) was mainly motivated by these investigations, and contains many important results of the topic. The next book that was devoted to the subject was Sierpinski's A954). The best and most comprehensive account of the topic was given by Stan Wagon in his book A986). Wagon's work immediately became the standard monograph of the subject, and the interested reader is urged to consult it concerning the results up to 1986. The survey papers by Laczkovich A991b) and A994) review some of the later developments; part of the present article is covered by these surveys. Since this area also involves amenable groups, we also mention the monograph by Paterson A988), and the recent article of Ceccherini-Silberstein et al. A999). In this article we shall not touch on the Ruziewicz problem asking the uniqueness of finitely additive probability measures on the Lebesgue measurable subsets of the sphere S". We refer to the monograph of A. Lubotzky A994), where the complete solution of the Ruziewicz problem is discussed.
86 ?. Laczkovich 1. Paradoxical sets The simplest geometric form of the paradox of infinity is exhibited by any set that is congruent to a proper subset of itself. Every half-line has this property. Also, the set N of nonnegative integers is congruent to N-, the set of positive integers. In this example only one element of N can be deleted in such a way that the remaining set is congruent to N. This is true in general: for every ? с R there is at most one point ? e ? such that ? \ {x} is congruent to H; see Sierpinski A954) andWagon A986, p. 92). E.G. Straus A957) proved the same for plane sets. In R3 the situation is different. It was proved by J. Mycielski A954, 1955) that there are infinite and bounded sets ? с R3 such that ? \ A is congruent to ? for every finite subset of ? (see also Wagon A986, pp. 92-95)). In the plane we may find bounded sets that are congruent to some proper subsets. Let С denote the set of complex numbers and select a number с е С such that \c\ = 1 but с is not a root of unity. Then the sets A = {c": ? e N) and В = {с": neN1") are bounded (subsets of the unit circle), congruent (a rotation maps A onto ?), and ? is a proper subset of A. These sets are Fn sets (countable unions of closed sets). We can also find Gs sets (countable intersections of open sets) with the same property. Namely, [x eC: |дг | = l} \ A and {x e C: \x \ = l} \ В are bounded Gs sets, they are congruent and the first is a proper subset of the second. These examples are simplest possible in the topological sense. Indeed, in R" no bounded and ambiguous set can be congruent to a proper subset of itself. (A set is called ambiguous, if it is simultaneously F„ and Gs.) This is a special case of the following theorem proved by A. Lindenbaum A926). THEOREM l. I. Let A be an ambiguous subset of a compact metric space. If В С A and B = Athen B = A. (A = ? denotes that the sets A and В are congruent; that is, there is a distance-preserving bijection from A onto ?.) The situation in R is much simpler: no bounded linear set can be congruent to any proper subset of itself. Indeed, if ?, ? are bounded linear sets, A z> В and A = B, then the isometry mapping A onto В must be a reflection and this easily implies that A = B. However, we can find bounded subsets of R exhibiting the paradox of infinity if we replace the notion of congruence by that of piecewise congruence. We say that the sets А, В с R" are equidecomposable if there are finite partitions A = (JL ? Aj and ? - (jf= ? ?/ such that A, = ?, for every /' = l k. We shall denote this fact by A ~ B. If we want to indicate that A and В are equidecomposable using к pieces, then we write A ~* B. (We remark that Banach, Tarski and Sierpinski used the term 'equivalent by finite decomposition', while other mathematicians used "piecewise congruent'. After Wagon's monograph A986) the term 'equidecomposable' became prevalent.) As Sierpinski observed A954, Theorem 16), we have [0, 1] ~.ч ]0, 1]. Indeed, let a e ]0, 1[ be an irrational number and put A = {{na}: ? e N), В = {{па): ? e N+), where {x} denotes the fractional part of the number x. Denoting ? ? = А П [0, 1 — a[. Ai = АП[1 -?, ?], ?| = ? ? [?, 1], ?2 = ? ? [0, ? [, one can easily check that B\ = A\ +a and ?2 = A2 + a - 1. Then, putting A3 = B?, = [0, 1] \ A = ]0, 1] \ B, we obtain the
Paradoxes in measure theory 87 partitions [0, 1] = (J,3=i A, ]0. 1] = ULi Bi such that Ai = Bi (' = 1-2, 3) and thus we have[0, l]~3]0, 1]. A stronger form of the paradox of infinity states that an infinite set can be decomposed into two subsets, each of which is equivalent to the original. As S. Mazurkiewicz and W. Sierpinski A914) realized, this paradox can also be exhibited geometrically. THEOREM 1.2. There are nonempty sets A, A\, A2cR2 such that A = A\ U A2, A\ П At = fl and A = A| = At. Proof. Let ? denote the set of polynomials with nonnegative integer coefficients (including the identically 0 polynomial), and let с be a transcendental complex number with \c\ = 1. We put A = {p(c): ? e P], A\ = A + 1, A: = с A. Using the fact that с is transcendental, it is easy to prove that ? ? ? A? = 0, A = ? ? U At and A = A\= ??. ? A. Lindenbaum A926) proved that a bounded plane set cannot have this property; for a simple proof, see Wagon A986, p. 196). The situation changes, however, if we replace congruence by equidecomposability. A set А с К" is called paradoxical if there are subsets ? ?, At с A such that A = ? ? U ??, A\ П ?? — 0 and A ~ ? ? ~ At. If ? ? ~„ A and At ~,„ A, then we say that A is (n, т)-paradoxical. Thus the Mazurkiewicz-Sierpinski theorem says that there are A, 1 )-paradoxical plane sets, and, as Lindenbaum showed, such a set cannot be bounded. But it was not known for more than 60 years whether or not bounded paradoxical plane sets exist. Stan Wagon A986, p. 232) listed this question under Problem 18 in his catalogue of 19 unsolved problems in the area of paradoxical sets, equidecomposability and invariant measures. The solution was given by W Just A988a). He proved that there exists a nonempty, bounded and A,3)- paradoxical plane set. It was shown by G.A. Sherman A990a) that Just's construction is best possible, in that a nonempty bounded plane set cannot be A.2)-paradoxical. Sherman also constructed a bounded B,2)-paradoxical plane set. Sherman proves in A991) that every paradoxical plane set has empty interior, moreover, has inner Lebesgue measure zero, and asks whether there is a paradoxical plane set with positive outer measure. The following example given by M.R. Burke B000) shows that the answer to this question is positive. Let с and A be as in the proof of Theorem 1.2. Using transfinite recursion it is easy to prove that there exists a set ? с С of positive outer measure such that the elements of ? are linearly independent over the field Q(c). Let В = {с"? +a: л e ?, ? e ?, a e A}. Then ? с В and thus В has positive outer measure. It is easy to check that В = (сВ) U (? + 1) and (с?) ? (? + 1) = 0, therefore ? is paradoxical. Burke B000) also constructs bounded paradoxical plane sets of positive outer measure. The situation in R1 is, again, much simpler: there are no nonempty paradoxical sets in E1. This was proved by Tarski A938a) and, independently, by Sierpinski in A946) and A954, Theorem 19). We shall present their proof in a more general setting. If X is a nonempty set then ?? will denote the symmetric group on X; that is, the group of bijections mapping X onto itself. If G is a subgroup of Sx then we say that G acts on X. ' Our definition slightly differs from Wagon's. The equivalence of the two definitions follows from Theorem 2.5.
88 ?, Laczkovich The sets А, В С X are said to be G-equidecomposable, if there are finite partitions A = U/=| A, and В = U/=i Я/ and gr0UP elements g,¦ e G (/' = 1,... ,k) such that S, = gi (A,) for every / = 1,... ,k. We shall denote this fact by A~S. The set А с X is called G-paradoxical, if there are subsets A\, A 2 с A such that A = A, UA2, A, HA2=0and A~A|~A2. In the sequel we shall denote by G„ the group of isometries of R". Thus two sets in R" are equidecomposable if they are G„-equidecomposable, and a set А с R" is paradoxical if it is G„ -paradoxical. If G is a group and Я is a subset of G then ун(п) will denote the set of elements gi ·' -gn, where gi e ? U ?~' for every / = 1,..., ?. The group G is called exponentially bounded, if lim,,-*;*, |yw(n)|''" = 1 for every nonempty finite ? с G. In other words, G is exponentially bounded if, for every finite ? с G and for every ? > 0, there is an щ such that the number of distinct group elements that can be represented as a word of length ? with letters from ? U H~' is less than A + ?)" for every ? > щ. Tarski and Sierpinski realized that G ? is exponentially bounded. (Proof: Each isometry of ? is of the form g(x) = ax + b (x e R), where a = ±1. If g\ gr e G\, gi(x) = a,x + bi, and g = g'"{ ¦¦¦g'"', then g(x) =ax +k\b\ -\ + krbr, where a = ±1 and \kj\ < |w,-| for ever /'. If the length of the word g is и, then |w,| -? ? for every / and thus the number of different b's is at most (In + 1/. Thus the number of different isometries of the formg' ¦¦¦g'"r with 53,- |w,| =и is at most 2 (In + l)r < A +?)" if ? is large enough.) The same argument shows that if G is Abelian or has an Abelian subgroup with finite index then G is exponentially bounded. Now the statement that R1 has no nonempty paradoxical subsets is a special case of the following theorem. THEOREM 1.3. If an exponentially bounded group G acts on X then X does not contain nonempty G-paradoxical subsets. PROOF. Suppose that 0 ? А с X is paradoxical, and let A - В U С, В П С = 0, С С А^ В^ С. Then there are partitions r s r s /=1 j=\ i=l y=l and maps У), gj e G such that //(A,) = S,, gj(A'-) = Cj (i = 1 г, у = 1 ,s). Lei F\(x) = fj(x) (x e A,, / = 1,..., r), and F2(x) =gj(x) (x e A'y, у = 1,. ..,?), then F| (A) = ?, /*2(A) = C. It is easy to see that the images of a fixed ? e A under the maps FhFh...FiH (i iw = 1,2) are distinct. Each of these images is of the form h\ ...hn(x), where h\,...,h„ e {/? fr,g\ gs}- Thus the words of length ? with the letters f.gj define at least 2" different maps of G, which contradicts the fact that G is exponentially bounded. D
Paradoxes in measure theory 89 Every group acts on itself by left multiplication. That is, if G is a group then we may identify every element g e G with the map ? н> gx (? e G), and thus we identify G with a transformation group acting on the set G. According to this identification, the sets А, В С G are equidecomposable (denoted by A ~ ?), if there are finite partitions A = UjL, ?,, ? = Uf= ? ?< and group elements g\ , gu such that B, = g, A,- for every / = 1 к. The set А С G is paradox, if A = A | U At, where ? ? ? ?? = 0 and A ~ ? ? ~ A2- A group is called supramenable, if does not contain nonempty paradoxical sets. By Theorem 1.3, every exponentially bounded group is supramenable. It is not known if the converse is true or not. (This is Problem 12 in Wagon's list A986, p. 231) of unsolved problems). However, in Theorem 1.3 we may replace 'exponentially bounded' by 'supramenable'. In other words, if a supramenable group G acts on X then X does not contain nonempty G-paradoxical sets. Indeed, it is easy to see that if А с X is a nonempty G-paradoxical set, then for any ? e A the set {g: g(x) e A} is a nonempty paradoxical set in G, which is impossible. Returning to E1 and R2, recall that (i) E1 does not have nonempty paradoxical subsets, and (ii) every paradoxical plane set is small in that its inner Lebesgue measure is zero (Sherman A991)). We shall see in the next section that in R" for ? ^ 3 and in non-euclidean spaces there are 'large' paradoxical sets. 2. Paradoxes in E" for ? ^ 3 and in non-euclidean spaces G. Vitali A905) proved that there are no nontrivial isometry-invariant and ?-additive measures defined on all subsets of R. Motivated by Vitali's theorem, F. Hausdorff raised the question whether or not finitely additive invariant measures exist in Ш". In 1914 he showed that the analogous problem on the sphere S2 = {x e ??: |jc| = 1} has a negative answer. He proved this by means of the following theorem (the so-called Hausdorff paradox). THEOREM 2.1. There are decompositions S2 = A | U A2 U C\ and S = A3 U A4 U A5 U Сг such that the sets A, (/' = 1,..., 5) are congruent to each other, and C\, Сг are countable. This implies that there is no finitely additive isometry-invariant measure m defined on all subsets of S2 satisfying m(S2) = 1. Indeed, such a measure vanishes on every countable set С С S2, since S2 contains ? disjoint congruent copies of С for every ? e N. Thus S2 = ?, U A2 U C\ implies w(A,) = 1/2, while S1 = ?? U A4 U A5 U C2 gives w(A|) = 1/3, a contradiction. An essential ingredient of Hausdorff's proof is the discovery that S03, the group of rotations of S2 contains a subgroup isomorphic to the free product ?? * Z.v Since ?? * ? у contains a free subgroup of rank 2 (if a and b are free generators of ?? * ?^ then bab and ababa freely generate such a subgroup), it follows that SO3 contains a free subgroup of rank 2. This fact has many direct proofs: see Wagon A986, p. 20). We may add the proof given in Laczkovich A991b, Theorem 3.2) that uses the stereographic projection and the fact that the group of linear fractional transformations contains free subgroups. In this way
90 ?. Laczkovic/? we even find free subgroups of SO3 with continuum many generators, a fact first proved by Sierpinski in A945b); see also Wagon A986, pp. 94-95). We remark that Hausdorff's paradox can be constructed without the countable sets C\, Ci, as J.F. Adams showed in A954). However, Adams has to use orientation-changing isometries, while Hausdorff only used rotations. The problem of the existence of invariant finitely additive measures in R' and R- was solved by Banach A923). THEOREM 2.2. For ? = 1 and « = 2 the Lebesgue measure can be extended to all subsets o/R" as a finitely additive and isometry-invariant measure. (These extensions are called Banach measures.) Banach's theorem also gives the following important corollary. THEOREM 2.3. Let ? = 1 or ? =2. If the Lebesgue measurable sets А. В С R" are equidecomposable, then ?„(?) = ?„(?). Indeed, let A = (J*=, A, and В = (J*_, S, be partitions such that A, = ?, for every / = \,...,k.lfm is a Banach measure then we have к к X„{A)=m{A) = Y^m(Ai) = Y^m(Bi)=m(B)=k,AB). i = \ i=\ The nonexistence of Banach measures in R3 was proved by Banach and Tarski A924), by making use of the following theorem, the so-called Banach-Tarski paradox. THEOREM 2.4. If А, В С R3 are bounded sets with nonempty interior then A ~ B. In particular, every bounded subset ofM? with nonempty interior is paradoxical. The same is true in R" for every ? ^ 3. We can see, using the argument of the proof of Theorem 2.3 that in R" (? ^ 3) there is no Banach measure. The proof of Theorem 2.4 given by Banach and Tarski uses Hausdorff's paradox and the following result. THEOREM 2.5. Suppose that G acts on the set X.IfAcX is G-equidecomposable to a subset of В С X using к pieces, and В is G -equidecomposable to a subset of A using ? pieces, then A ~ В using к -\-n pieces. This is an immediate consequence of the following statement, the so-called Banach- Schroder-Bernstein theorem: if f:A —> В and g: В —*¦ A are injections, then there are decompositions A = A\ U At and В = B\ U S: such that f{A\) = B\ and g(Bi) = Aj. See Banach A924) and Wagon A986. Theorem 3.5). The proof of the Banach-Tarski paradox can be slightly simplified if - instead of Hausdorff's paradox - we use the following statement, which is an easy consequence of
Paradoxes in measure theory 91 the fact that SO3 contains a free group of rank 2: There is a subset ? С S2 such that (i) S~ can be covered by four congruent copies of E, and (ii) 5"" contains infinitely many disjoint congruent copies ofE. (See, e.g., Laczkovich A991b, Theorem 3.3).) Now let А, В с R3 be bounded sets with nonempty interior. We may assume that both A and В contain the unit ball D = [x e R3: |jc| ^ 1). Suppose that A can be covered by к congruent copies of D and that В can be covered by ? congruent copies of D. Let ? be a set with the properties (i) and (ii) above, and let E* = \J{rE: 0 < r < 1). Then D is covered by 5 congruent copies of E* (one of them is needed to cover the center), and also contains infinitely many disjoint congruent copies of E*. From the fact that A is covered by к congruent copies of D it follows that A is equidecomposable to a subset ofDcS using 5k pieces, and, similarly, В is equidecomposable to a subset of D с A using 5« pieces. Therefore, by Theorem 2.5, we obtain that A ~ В using 5(it + /;) pieces. In particular we find that if D\ and Z>2 are disjoint balls of radius 1 then D ~ D\ U Di using 15 pieces. This number can be reduced to 11, since D is equidecomposable to D\ using one single piece, and thus, by Theorem 2.5, we only need 1 +5-2=11 pieces. In other words, D is paradoxical using 11 pieces. In Section 6 we will see that, in fact, 5 pieces suffice. Similar paradoxes exist in non-euclidean spaces, too. Let the elliptic and hyperbolic 11- space be denoted by L" and H", respectively. It is easy to see that the group of rotations of L"_l around a fixed point acts in the same way as SO,,, the special orthogonal group, acts upon R". This implies that L" is paradoxical for every ? > 2. Also, the action of the group of rotations of H" around a fixed point is isomorphic to the action of SO,, on R" and hence, for ? ^ 3, every bounded subset of H" with nonempty interior is paradoxical. It was shown by J. Mycielski and S. Wagon that H2 is paradoxical using Borel sets; see J. Mycielski and S. Wagon A984) and Wagon A986, Theorem 5.9). They used the fact that the isometry group of //- contains a subgroup isomorphic to the free product ? 2 * ? ? (Note that the same group appeared in the original proof of the Hausdorff paradox.) Later Mycielski proved A989) that the bounded subsets of H2 with nonempty interior are also paradoxical. Bounded paradoxical subsets of H2 and S2 were also investigated by G.A. Sherman A990b). In this paper Sherman determines those pairs (in. 11) for which there exists an (m,n) -paradoxical subset of a disc of given radius in either ? - от S-. The paradoxes discussed in this section depend on the fact that the transformation groups in question contain F2 (the free group of rank 2) as a subgroup. It was proved by von Neumann A929) that every such group is paradoxical. THEOREM 2.6. If a group G contains Fj as a subgroup then G is paradoxical. PROOF. First we show that Fi is paradoxical. If a, b are the generators of Ft, then every element of F2 is represented by a reduced word in the letters a, a~', b, b~'. Let A denote the set of those reduced words that begin with the letters a or a-1. Since Fj = A U a~ A and abA, a~]bA are disjoint subsets of A, it follows that Ft is equidecomposable to a subset of A. Therefore, by Theorem 2.5, F2 ~ A. Also, bA and b~] A are disjoint subsets of F2 \ A, therefore F2 ~ Fi \ A, and thus F? is paradoxical. (One can prove that F2 is B, 2)-paradoxical; see Wagon A986, Theorem 4.2).)
92 ?. Laczkovich Now suppose that Fj is a subgroup of G, and let U с G be a set containing exactly one element of each right coset of Fi. Then every g e G can be written uniquely in the form hu, where h e F2 and и eU. Let F> = A U ? be a decomposition such that Ft ~ A ~ B. Then G = (AU) U (Si/) is a decomposition with G ~ ??/ ~ BU, showing that G is paradoxical. D A similar argument shows that if a group G contains a free subsemigroup of rank 2, then G contains nonempty paradoxical subsets. Indeed, if S С G is a free subsemigroup generated by the elements a and b, then a S. bS are disjoint subsets of S, and thus S ~2 (S \ aS). Therefore 5 is A. 2)-paradoxical. (Note that the proof of Theorem 1.2 exhibits a free subsemigroup of G: generated by a rotation and a translation.) 3. Invariant measures and amenable groups It was realized by John von Neumann A929) that most of the previous results can be put into a more general context, which also explains the different properties of the action of G„ in the cases of ? ^ 2 and /; ^ 3. The central notions of this theory are finitely additive invariant measures and amenable groups. Let А С V(X) be a ring of sets. A set function m : A -*¦ [0. oo] is said to be a finitely additive measure if w@) = 0 and m(A U ?) = m(A) + m(B) whenever A and В are disjoint elements of A. If G acts on X then we say that m is invariant under G if g( A) e A and m(g(A)) =m(A) for every g e G and A e A. A group G is said to be amenable if there is a finitely additive probability measure on V(G) that is invariant under the action of G on itself. In other words, G is amenable, if there is a finitely additive measure m on V(G) such that m(G) = 1 and m is left invariant; that is, m(gA) — ?(?) for every А С G and g e G. John von Neumann discovered that the contrast between Theorems 2.2 and 2.4 is due to the fact that the transformation groups G\ and G: are amenable, while G3 is not. In fact, G\ and G: are solvable and, as von Neumann proved, every solvable group is amenable. (G\ is solvable because that group T\ of translations of R is an Abelian and normal subgroup of G1 such that the factor group G\/T\ is the two-element cyclic group. The group G: is also solvable. Indeed, let Г: denote the group of translation of R-, and let SGi denote the group generated by the translations and rotations. Then in the sequence of groups {identity}, Г2. SG2. G: each group is a normal subgroup of the next, and the factor groups are Abelian.) Since G| and Gi are amenable, the existence of Banach measures in R1 and R2 is a special case of the following result, called the invariant measure extension theorem. THEOREM 3.1. Suppose that G is an amenable group acting on X, and let m be a G- invariant finitely additive measure defined on a G- invariant field А С V(X). Then m can be extended to V(X) as a G-invariantfinitely additive measure. This important theorem, in this general form, was first stated and proved by Jan Mycielski A979) although in some special cases it was known to Banach A923) and von
Paradoxes in measure theory 93 Neumann A929). Since the group of translations is commutative and commutative groups are amenable by von Neumann's theorem, we obtain the following corollary. THEOREM 3.2. For every ? there exists a translation-invariant extension of the L^ebesgue measure ?„ toV(R"). PROOF OF THEOREM 3.1. First we need the measure extension theorem stating that if m is a finitely additive measure on the ring Л С V(X), then m can be extended to V(X) as a finitely additive measure. (Proof: Let ? denote the set of functions 53!'= ? c'Xa,· where c\,..., c„ e Ш and A\,. ..,A„ are pairwise disjoint elements of ? of finite m-measure. Then ? is a linear space, and Lf = 53!'= ? c,w(A,) defines a positive linear functional on M. We put M* = {f:X^R: 3/ieM, |/Ю}. and define p(f) = miLh, where h runs through all functions from ? satisfying / ^ h. It is easy to check that M* is also a linear space and p(f + g) ^ p(f) + pig) and p(cf) = cp(f) for every /. g e M* and с > 0. Also, if / e ? then Lf sC p(f). By the Hahn-Banach theorem, L can be extended as a linear functional to M* satisfying Lf ^ p(f) for every / e M*. Now we define v(H) = LxH if хн e M* and v(H) = oo otherwise. It is easy to see that у is a finitely additive extension of m to V(X).) We remark that the measure extension theorem is actually equivalent to the Hahn- Banach theorem; see Wagon A986, Chapter 13). We shall also need the notion of integral with respect to finitely additive measures. In the sequel we shall denote by B(H) the linear space of bounded real valued functions defined on the set H. Let m be a finitely additive measure defined on V(X). Let А с X be a set of finite m- measure. If /: A —*¦ ? only takes a finite number of values (that is, if / is a so-called step function), then there is a partition A = A \ U · · · U A» and there are real numbers с \...., ctl such that / = J3!'= ? ciXa, ¦ In this case we define f " / fdm = УЗс;/я(А,-). Ja ы\ If / e B(A) is arbitrary then we choose a sequence ft'-A -» R of step functions converging uniformly to /, and define / f dm = lim / fidm. J a k-*^JA It is easy to check that the limit exists and is independent of the choice of the sequence (fk). It is also easy to prove that integral defined in this way has the following properties, (i) inf/ · m(A) ^ fA f dm ^ sup/ · m(A) for every / e B(A); (ii) For every А с X with finite m-measure the map / н> fA f dm defines a linear functional on the space B(A);
94 ?. Laczkovich (iii) for every / e B(X), the integral Д f dm defines a finitely additive set function on the ring {А С X: m(A) < oo}. Now the proof of Theorem 3.1 is as follows. Suppose that m is a G-invariant finitely additive measure on the G-invariant ring Л С V(X). By the measure extension theorem, there is a finitely additive extension ? of m to V(X). Since G is amenable by assumption, there is a finitely additive and left invariant probability measure ? on V(G). If А с X then let /д(#) = v(g~f (A)) (g e G). Now we define ?(?) = fG fA dy if /д is bounded and ?( A) = oo otherwise. It is easy to show that ? is a G-invariant extension of m. D Let m be a G-invariant finitely additive measure on V(X). If A~Z? then necessarily m(A) = m(B). Indeed, let A = (jf=| A and ? = (J*=i ?/ be partitions, and let g,- e G be transformations such that ?, = g, (A,) for every /' = 1 k. Then we have m(A) = ??=? w(A,) = ?*=| m(B,)=m(B). С С If А с X is G-paradoxical, then A = ? ? U A?, where ? ? П At = 0 and A~ ? ? ~ At. Therefore m(A) = m(A\) + w(A:) = 2m(A) and thus either m(A) = 0 or w(A) = oo. If G is amenable and m is a left invariant finitely additive measure on V(G) satisfying m(G) = 1, then every paradoxical subset of G must be of measure zero. In particular, the group itself cannot be paradoxical. In Section 5 we will see that, by a theorem of Tarski, nonamenable groups are paradoxical and thus a group is amenable if and only if it is not paradoxical. Next we shall briefly review the basic facts about amenable groups. (i) If G is amenable then there is a finitely additive probability measure ? on V(G) which is bilaterally invariant; meaning that v(gA) = v(Ag) = v(A) for every А С G and g e G. (Proof: if m is a left invariant finitely additive probability measure on V(G) then we put v( A) = fG f:\ dm for every А с G, where fA(g) = m(A~]g) (g e G). It is easy to check that ? satisfies the requirements.) (ii) If G is amenable then so is every subgroup and homomorphic image of G. (Proof: Let m be a finitely additive invariant probability measure ? on V(G). If ? is a homomorphismof G onto ? then v(A) = т(ф~] (А)) (А с Н) defines a finitely additive invariant probability measure onV(H). Next suppose that ? is a subgroup of G, and let U с G be a set containing exactly one element of each right coset of H. Then ? (A) = m(AU) defines a finitely additive invariant probability measure on V(H).) (iii) If Я is a normal subgroup of G. and if ? and G/H are both amenable then so is G. (Proof: Let ? and ? be nonnegative finitely additive and bilaterally invariant probability measures on V(H) and V(G/H), respectively. We extend ? by ?(#?) = ?(?) (А с ?. g e G). This definition makes sense, since A\. A2 С Я, g|A| = #2^2 implies g^'gi e Я and thus, by the invariance of ?, ?(?|) = ?(g^?g\A\) = ?(?2). This extension defines ? on the power set of each coset of H. Let ?-.G^- G/H be the natural homomorphism, then 0_l (y) is a coset of ? for every у e G/H. Let А с G be fixed. Then g(y) = ?(? ? </>"' (У)) defines a bounded function on G/H. We put у (A) = fG/H gdv; it is easy to check that ? is an invariant finitely additive probability measure on V(G).)
Paradoxes in measure theory 95 (iv) Let G be a group. We say that L: B(G) ->· R is a left invariant mean, if L is a linear functional satisfying inf/ ^ Lf ^ sup/ and L/ = LTKf for every / e B(G) and g e G, where the function ?<,/ is defined by TKf(x) = f(g.x) (x e G). Then the group G is amenable if and only if there is a left invariant mean on B(G). (Proof: If L is a left invariant mean then m(A) = L/a defines a left invariant finitely additive probability measure on G, and thus G is amenable. On the other hand, if G is amenable and m is a left invariant finitely additive probability measure on G, then Lf = fG f dm is a left invariant mean on B(G).) We note that the existence of a left invariant mean on B(G) is sometimes used as the 'Bourbaki style' definition of amenability. (v) Let G be a group, and let N denote the set of functions of the form ]T"=, [THj f — f], where /?,..., /„ e B(G) and g\ g„ e G. Then G is amenable if and only if inf / ^ 0 ^ sup / for every / e N ('Dixmier's condition'). (Proof: If L is a left invariant mean on B(G) then Lf =0 for every /?? and thus inf/ ^ 0 ^ sup / by the definition of means. If the condition is satisfied then an easy application of the Hahn-Banach theorem gives a linear functional L on B(G) such that Lf = 0 for every f e N and Lf ^ sup / for every / e B(G). It is easy to check that L is a left invariant mean.) It is a remarkable fact that if G is nonamenable then not only inf / > 0 holds for some / e N, but actually N = B(G). In other words, G is amenable if and only if ? ? B(G). This was proved by G.A. Willis A988). For ? > 1 let lp(G) = {/: G -+ R: ? G |/(g)|'' < oo), and let N,, be the linear span of the functions Tsf — f (/ e lp(G). g e G). Willis also proves that G is amenable if and only if Np ? lp(G). For a generalization for graphs see G. Elek A998). (vi) Every Abelian group is amenable. The 'classical' proof runs by checking that Dixmier's condition holds: if / = ?"=, [??, / - / ] then the average of the values of / at the points g'^ ¦ ¦ ¦ g,'," @ ^ /1 i„ < ?) tends to zero if N ->· oo due to cancellations, and thus inf/ ^ 0 ^ sup /. (The same argument proves that every exponentially bounded group is amenable.) A nice functional analytic proof runs as follows: Let ? = LX(G)' with the weak* topology. Then К = \L e LX(G)': inf/ ^ Lf ^ sup /(/ e L^G))} is a compact convex subset of E. The translations Tg (g e G) form an Abelian semigroup of continuous affine transformations mapping К into itself. Then, by the Markov-Kakutani fixed point theorem, they have a common fixed point, which is a left invariant mean on B(G). See Wagon A986, p. 159) and Paterson A988, p. 14). (vii) Every solvable group is amenable; this is clear by (iii) and (vi). It is also easy to see that if G has a normal solvable subgroup of finite index then G is amenable. (viii) A group G is amenable if and only if every finitely generated subgroup of G is amenable. (Proof: if G is amenable then so is every subgroup by (ii). If G is not amenable then, by (v), there is a function / e N satisfying sup / < 0. Let / = ??=? lTx, fi ~ Я, where / /„ e B(G) and g, gn e G. and let
96 ?. LaczMorich ? denote the subgroup generated by gi gn. Since sup f\H ^ sup/ < 0, it follows from (v) that ? (a finitely generated subgroup of G) is not amenable.) (ix) A group G is amenable if and only if any of the following statements is true ('F0lner's conditions'). (F|) For every nonempty finite set А с G and for every ? > 0 there exists a nonempty finite В CG such that \(??)??\ <?\?\. (F?) For every nonempty finite set А с G and for every ? > 0 there exists a nonempty finite В с G such that \(??)??\ < ?\?\ for every ? e A. (Here ? denotes symmetric difference.) A particularly simple proof of the equivalence (F|) «=> (Fi) «=> G is amenable was found recently by T. Ceccherini-Silberstein et al. A999) using the result of W.A. Deu- beret al. A995) on paradoxical metric spaces. See also Laczkovich B001). There are several other characterizations of amenability. One can use recurrence of random walks on the group, or certain growth conditions (akin to the notion of exponential boundedness), and there are cohomological characterizations as well. For the details we refer to Chapter 12 of Wagon's book A986), to Paterson's monograph A988), and to the paper by Ceccherini-Silberstein et al. A999). All these notions and results can be generalized to topological groups. A topological group is called amenable if there is a finitely additive left invariant probability measure on the Borel sets of the group. Here we only mention Paterson's theorem A986): a locally compact group is amenable if and only if it is not paradoxical using Borel pieces. As for the theory of amenable topological groups, we again refer to Paterson's book. As we saw in Theorem 2.6, if a group G contains Ft as a subgroup then G is paradoxical and, consequently, is not amenable. It was not known for 50 years whether the converse is true or not. The problem was solved by A.Yu. Ol'shanskii A980), who constructed a paradoxical and periodic group. (A group is periodic if each of its elements has finite order.) It is obvious that a periodic group cannot contain Ft, and thus Ol'shanskii's group is nonamenable, but does not contain Ft. Later S.I. Adian proved A983) that the Burnside group B{m,n) is also paradoxical (and, of course, also periodic) if m ^ 2 and ? ^ 665 is odd. (The group B(m,n) has m generators and is defined by the relations w" = e, where w runs through all words.) Recently Ceccherini-Silberstein et al. A998) showed that for m ^ 2 and ? ^ 665 odd the group B(m. n) is paradoxical with at most 14 pieces. However, in some important classes of groups, amenability is equivalent to the nonexistence of free subgroups. By a theorem of J. Tits A972), for every subgroup G of GL„ (the group of nonsingular linear transformations of R") either G contains a normal subgroup of finite index, and thus it is amenable, or contains F> as a subgroup. The same is true for every subgroup of Gn. See Wagon A986, p. 151). We note that a group G contains Fn as a subgroup if and only if there is an infinite subset ? С G such that for every finite set D с ? there is an element g e G with gE = ? \ D (theorem of EG. Straus); see Wagon A986, pp. 92-94). A periodic group does not contain free subsemigroups either, and thus B(m,n) (for m ^ 2 and ? odd, ? ^ 665) is also an example of a non-supramenable group without free
Paradoxes in measure theory 97 subsemigroups. However, by a theorem of Rosenblatt A974), every solvable group is either supramenable or contains a free subsemigroup of rank 2. Rosenblatt conjectured that the same is true for amenable groups; this is still open. As we mentioned in the first section, it is also open, whether or not every supramenable group is exponentially bounded. See Wagon A986, Problems 11 and 12on p. 231), and PatersonA988, Chapter6). 4. Decompositions and perfect matchings It was discovered by D. Konig (the founder of modern graph theory) that equidecomposabiUty of sets can be formulated in terms of perfect matchings of bipartite graphs: see Konig A926). Tarski A938a) also used graphs in questions of equidecomposabiUty. Recently W. Just A988b) described this approach explicitly. Let А, В be arbitrary sets. By a bipartite graph on the pair of sets (A. B) we shall mean a multiset of unordered pairs (л-, у) such that ? e A and у e B. The pairs (x, y) are called lines connecting ? and y. Note that we allow multiple lines. Also, if А П ? ? 0 and ? e ? ? ?, then the loop (x,x) is allowed. We shall use the terms degree, walk, path, cycle and perfect matching in the usual sense; see Lovasz and Plummer A986, pp. xxix- xxxii). Thus Г is a perfect matching, if there is a bijection / of A onto В such that Г = ((х,/(х)):хеА]. Let G be a family of functions mapping subsets of X into X, and let А. В С X. Wc. define a bipartite graph on (A, B) by Гс(А,В) = {(х,у): хеА. у e B. 3/e G, ? eDom/, f(x) = y]. The connection between equidecomposabiUty and perfect matchings is explained by the following statement; it is an immediate consequence of the definitions. THEOREM 4.1. Let G act on X. For every А, В e X we have A ~ В if and only if there is a finite subsystem ? CG such that Гн(А, В) contains a perfect matching. By this theorem, in order to prove the equidecomposabiUty of two sets, we have to find perfect matchings in some bipartite graphs. The most important tool in the search of perfect matchings is the following theorem due to M. Hall A948) and R. Rado A949). A graph will be called locally finite, if the degree of each point is finite. If У is a subset of the points of a graph Г then we shall denote by ?(?) the set of those points of ? which are connected to at least one point of ?. THEOREM 4.2. A locally finite bipartite graph ? contains a perfect matching if and only if\r(Y)\ ^ |У| holds for every finite set ? of points of Г. This theorem is an immediate consequence of the Banach-Schroder-Bernstein theorem, and the following result (called marriage lemma): Let Г be a locally finite bipartite graph on the pair of sets (?, ?) and suppose that for every finite ? С A we have \?(?)\ ^ |У|. Then there is an injective map f: A —> В such that (x, f(x)) e Г for every ? e A.
98 ?. Laczkovich For a proof see Hewitt and Ross A979, p. 248) or Laczkovich A991b, p. 157). An important application of Theorem 4.2 is the following classical result of D. Konig and S. Valk<5( 1925). THEOREM 4.3. Let G act on X. Let ? ? Al. B\ Bk С X, and suppose ?? ?) A j = BjnBj=Q, Ai-Aj, ?,- Bj (i ? j). flndUi=i Aj?\Jki = \Bj.Then A\~ B\. PROOF. There is a finite subsystem ? с G such that the graphs Гц(A\. A,), ?)/(?|. В,) (/' = 1,—к) and Гн(и?=| ^bULi ?') contain the perfect matchings M,, N,- (/ = \,...,k) and />, respectively. We define a graph ? on (A\. B\) as follows: we put (x,y) in ? if and only if there is an /' and there are points xj e A,, y, e S, such that (x, .r,) e ?,, U/,}·,) e i" and (у,-, y) e N/. Then Г с Twi(A|, ?|) and ? is regular (the degree of each of its point is к). This easily i mplies that Г satisfies the Hall-Rado condition | Г (?) | ^ | У | and hence, by Theorem 4.2, Г contains a perfect matching. Obviously, Я3 is a finite subset of G and thus, by Theorem4.1, A| ~ ?|. ? In the applications of Theorem 4.1 the following simple facts are often useful; cf. Laczkovich A991b, 1992c). PROPOSITION 4.4. Let ? be a connected and locally finite bipartite graph. (i) If ? is a tree, and there is at most one point of ? of degree 1, then ? contains a perfect matching. (ii) If ? contains at most one cycle, and if the degree of each point of ? is at least two, then ? contains a perfect watching. 5. The type semigroup Now we come to the fundamental theorem of Tarski A929, 1938b) establishing the exact relation between paradoxical sets and invariant measures. THEOREM 5.1. Suppose G acts on X and ? С X. Then there is a G-invariant finitely additive measure defined on V(X) such that ?(?) = 1 if and only if ? is not G- paradoxical. As an immediate corollary we obtain that a group is amenable if and only if it is not paradoxical. Tarski's proof is based on the notion of the type semigroup: the free Abelian group generated by V(X) factorized by the equivalence relation of equidecomposability. The exact definition is the following. Let G act on X. Put X* = ? ? ? and let R be the ring of sets U"^0 A, x {/'} (n e N, A,¦ с X (i = 0 «)). If g e G and ? is a permutation of N then we define the map (g. ?) by (g,n)(x,n) = (g(x).n(n)) (xeX. neN).
Paradoxes in measure theory- 99 Obviously, the set G* of all these maps (g, ?) forms a group of bijections of X* onto itself, and R is a G*-invariant ring. The semigroup types are defined as the equivalence classes with respect to the equivalence relation ~ in R. If [A] and [?] denote the classes containing the sets А, В e R, respectively, then [A] + [?] is defined as the class containing the set g\(A) U gi(B), where g\, gi e G* are such that g\(A) ? gi(B) = 0. It is easy to check that this operation is well-defined and makes the set of types a commutative semigroup denoted by S. If we identify X with ? ? {0} then for every А, В с X we have A ~ В if and only if A ~ B. That is, [A] = [?] if and only if A ~ ?. Since 0 = [0] is a neutral element of S, it follows that S is a monoid (semigroup with a neutral element). We can introduce a partial order in S by defining x ^y «=> 3; e 5", jc + ; = y. If А, В с X then [A] ^ [?] if and only if A is equidecomposable to a subset of B. The fact that ^ is a partial order (reflexive, transitive, and satisfies ? ^ у. у ^ ? =>¦ ? = у) is an immediate consequence of Theorem 2.5. It is easy to see that ? ^ у implies ? + ? ^ у + ? for every ? e S. In the language of the type semigroup, the Konig-Valko theorem (Theorem 4.3) becomes the statement nx = ny =>¦ ? = у (х. у е S). That is, the cancellation law holds in the type semigroup S. One can also prove (using the marriage lemma instead of Theorem 4.2) that nx ^ иу =>¦ ? ?. у for every x, ye 5. It is clear that a set А с X is paradoxical if and only if [A] = 2[A]. We prove that А с X is paradoxical if and only if there is an ? e N such that (n + 1 )[A] ^ n[A]. The 'only if part is clear. Let [A] = x, and suppose (/; + 1 )л ^ nx. If я = 0 then ? = [A] ^ 0. A = 0 and then A is paradox. If ? > 0 then (n +k + \)x ^ (« + k)x for every A; e N, and thus 2nx ^ Bи — 1)jc ^ - - -^ (n + 1 )x ^ nx. Since я.г ^ 2«.v is also true, we have ? ? = 2nx. Then ? = 2x by cancellation, and thus A is paradox. Let m be a G-invariant finitely additive measure on V(X). As we saw in Section 3, ?~? implies m{A) = m(B) for every ?, ? с X. Therefore ? ([?]) = m(A) (А С X) defines a map ?-.S^- [0, oo]. It is clear that ? is a homomorphism from S into the additive semigroup [0, oo]. The converse is also clear: if ? is a homomorphism from S into the additive semigroup [0. oo], then m(A) = ф[А] (А С X) defines a G-invariant finitely additive measure on V(X). Therefore Tarski's theorem is equivalent to the following statement. THEOREM 5.2. Let S be an Abelian semigroup and a e S. Then either (n + 1 )a ^ na for some ? e N or there is a homomorphism ? of S into the additive semigroup [0, oo] such ????(?) = 1. PROOF (Sketch). Suppose that (n + \)a ^ na holds for no ? e N. Then the elements na (n e N) are distinct. Let Sa = [na: ? e N) and F = (x e S: ? + у = na for some у е S, ? e N). Then Sa, F are subsemigroups of S. Let ?(??) = ? (n e ?), then ? has the property that whenever ? ? л„, у ? jie Sa and x\ -\ \-x,i+z. = y\ ? \~Ук for some ? e F then ?(?\) -\ +?(?„) ?. ?(\\) -\ \-ф(ук). Then one proves, using
100 ?. Laczkovich transfinite induction, that ? can be extended to F preserving this property. Finally we define ?(?) = oo for every ? e S \ F. ? Tarski's theorem can also be deduced from general 'sandwich theorems' for monotone additive functions, see P. Plappert A995). As we saw above, if S is a type semigroup then (S, ^) is a partially ordered monoid satisfying ? ^ у => ? + ? ^ у + ? for every ? e S. The following, more general structure was introduced by Friedrich Wehrung. We say that (A. ^) is a positively ordered monoid (POM) if A is a commutative monoid, and ^ is a quasiorder (reflexive and transitive relation) on A satisfying 0 ^ ? and ? ^ у =? ? + ? $5 у + ? for every ?, у, ? e S. The investigations of Wehrung A992) were motivated, to a large extent, by Tarski's theorem. If a POM can be embedded into a type semigroup then it has to satisfy the axioms ? ^ у, у ^ ? =>¦ ? = у and nx ^ иу =>¦ д ^ y. Wehrung discovered three more axioms that, together with these two axioms, characterize those POMs that can be embedded into a type semigroup. An important consequence of this result is the fact that the set of all universal formulas of the language (+. ^) that hold in every type semigroup is decidable. See Wehrung A994a, Theorem 5.3 and Corollary 5.5). The following is an example of a formula that holds in every type semigroup: (x +z =y + z and ? ^ д. ? ^ y) =>- ? = }'¦ (Proof: Let ? = и + ?, y = v+z. Then ?+? = ?+ ?=? u+2z = ? + 2?=? 2(u+z) = u+u+2z = u + v + 2z = ? + и +2z = ? + ? + 2z = 2(v + z) => ? = u + z = v+z = }'-) In general ? +z =y +z does not imply л = у. For example, if X = K, G is the group of translations, ? = [N] = [N+], л· = 0, у = [{0}], then ? + ? = [Щ = [{0}] + [?+] = у + г, but obviously ? ? у. The implication л- + ? = у + ? => ? = у may fail even if ? < }'· Indeed, if a is the type of a nonempty paradoxical set, then 0 + a = a + a, but 0 ??. Moreover, it can happen that ? + ? = ? + ?- but д· ^ ? does not hold (see Wehrung A994a, Proposition 5.1)). However, Tarski proved that if the transformation group is exponentially bounded then (x + ? = у + ? and ? ^ у) =>· x = у. See Tarski A949, pp. 224-229). This theorem is a generalization of the fact that if the transformation group is exponentially bounded, then there are no nonempty paradoxical sets. As we saw above, if А, В с X are G-equidecomposable then m(A) = m(B) for every G-invariant finitely additive measure. The converse is not true, as the following example shows. Let X = Q be the set of rationals, and let G denote the group of all translations by rational numbers. Let m be any G-invariant finitely additive measure on Q. If m(]0, 1] П Q) = oo then obviously m([0. 1] П Q) = oo. If m(]0, 1] ? Q) < oo, then m(\x}) = 0 for every ? e Q and hence w([0. 1] П Q) = m(]0, 1] П Q). That is, /w([0. 1] П Q) = w(]0, 1] П Q) holds for every G-invariant finitely additive measure ?. On the other hand, it is easy to see that ]0.1 ] П Q and [0, 1 ] ? Q are not equidecomposable; see Sierpinski A954, Theorem 17, p. 48). It can also happen that ?(?) = ?(?) holds for every G-invariant finitely additive and finite valued signed measure without A and В being equidecomposable. Indeed, let X — Ъ and let G be the group of translations of ? by integers. If ? is a finite valued G-invariant
Paradoxes in measure theory 101 signed measure on ? then ?(?) = ?(?+) and hence /?({0}) = 0 = /j@). On the other hand, {0} and 0 are not equidecomposable. The situation changes, however, if the value oo is also allowed. By a G-invariant finitely additive signed measure we shall mean a map ?: V{X) ->-RU {oo} such that /j@) = 0, ?(? U ?) = ?(?) + ?(?) whenever ?, ? are disjoint subsets of X, and ?^(?)) = ?(?) for every А с X and g e G. The following theorem is proved in Laczkovich A991a). THEOREM 5.3. For every A,BcXwe have A~ В if and only ifn(A) = ?(?) holds for every G-invariant finitely additive signed measure ?. An equivalent formulation of this result is that every type semigroup can be embedded into a power ofM. U {oo}. Now we consider the more general situation where the sets A, B and the pieces used in the decompositions are restricted to be in a prescribed ring of subsets ofX. By a space we shall mean a triple (X. G, A), where X is a nonempty set, G is a group of bijections of X onto itself, and Л is a G-invariant ring of subsets of X. We say that the sets А, В e A are G-equidecomposable in A, if they are G-equidecomposable in such a way that the pieces used in the decompositions belong to A. The type semigroup of the space (X, G, A) is defined similarly (with the obvious modifications) as in the case of A = V(X). Laczkovich A991a) also proves the following generalization of Theorem 5.3: Let (X, G, A) be a space and let А, В е A. Then ?(?) = ?(?) holds for every G-invariant finitely additive signed measure on A if and only if there is a positive integer n such that n[A] = n[B]. Therefore the statement of Theorem 5.3 is true in a space if and only if the cancellation law holds in the type semigroup of the space. The problem, whether there are spaces for which the cancellation law fails, was formulated by Wagon A986, Problem 14, p. 231). The first examples were given in Truss A990) and in Gardner and Laczkovich A990). The transformation groups of these examples were noncommutative. Examples with Abelian transformation groups were constructed in Laczkovich A991a). For example, let X = E, let G be the group of translations, and let ?, ? be positive real numbers such that ?/? is irrational. It is shown in Laczkovich A991a) that the cancellation law fails in the space (X, G,A), where A is the translation-invariant field generated by the sets OO "V (J [na,Bn + \)a/2[ and |J [??, Bn + \)??[. ll= — OC П — — ОС Wagon also asks if the cancellation law holds for Borel equidecomposability in a locally compact topological group. This difficult and important problem is open even in E. 6. Nonamenable actions and local commutativity Suppose that G acts on the set X, and let ло e X be fixed. It is easy to check that the map Ah A = {geG: g(.x0)e A] (ACX)
102 ?. Laczkovich has the following properties: ? ? ? = 0 =>¦ АПЙ = 0. and ? ~ ? =>¦ A ~ B. From this observation it follows that (i) if X contains a nonempty paradox subset then so does G; and (ii) if X is paradox then so is G. In other words: (i) if G is supramenable then X does not contain nonempty paradox subsets; (ii) ifG is amenable then X is not paradox. The converse of these statements is not true. Moreover, as the following example shows, it can happen that X does not contain nonempty paradox subsets, but G is paradox, moreover, contains Fi as a subgroup. Let X„ (n = 1,2,...) be pairwise disjoint finite sets with \X„\ = n, and put X = \JT=\ X»· ^et ^ be ше set °f those elements of Sx that map each X„ onto itself. If А, В С X and A ~ В then | А П Xn \ = \ ? ? X„ | for every «, and it follows that X does not contain nonempty paradox subsets. Now we define two maps /, g e Sx such that they generate a free subgroup of G. Let ?\. ??, - - - be an enumeration of all reduced words in the letters /, /-l,g, g~\ and let «? < in < ¦¦¦ be an increasing sequence of positive integers such that ? ? is greater than the length of ?? (к = 1.2 ). It is easy to define / and g on the set X„k in such a way that they both map X„k onto itself, and ??, as a composition of/, /~'.g. g_l, is not the identity on X,u. If « ^ «^ (?=1.2—), then we may define / and g on X„ as the identity. It is obvious that the maps / and g generate a free subgroup of G. Nevertheless, under some extra conditions the converses of the statements (i) and (ii) will be true. Suppose that x$ e X is not a fixed point of any nonidentity element of G. It is easy to see that if ? с G is paradox then so is {g(.vo): g e E). Therefore, in this case G is supramenable if and only if X does not contain nonempty paradox subsets. The converse of (ii) is true if G acts without nontrivial fixed points; that is, if no nonidentity element of G fixes a point of X. Indeed, the orbits [g(x): g e G] (x e X) constitute a partition of X. Let ? С X contain exactly one point of each orbit. If G acts without nontrivial fixed points, then the map ? ^ H* = \J{g(E): geH] (HcG) has the following properties: #ПЛ' = 0=>#*ПЛ'* = 0, and Я~А'^Я*~ К* for every H. К с G. Then we obtain the following statement; cf. Wagon A986. Proposition 1.10). THEOREM 6.1. Suppose that G acts without nontrivial fixed points. Then G is amenable if and only if X is not paradox. In the actual applications the condition of Theorem 6.1 proves to be too strong: usually the fixed point are in abundance. However, the converse of (ii) can be obtained under substantially weaker conditions. Let G v denote the group [g e G: g(x) = .x) (the 'stabilizer' of x) for every л е G. The following theorem is due to Rosenblatt A981); see also Wagon A986, Theorem 11.25).
Paradoxes in measure theors 103 THEOREM 6.2. Suppose that G, is amenable for every ? e X. Then G is amenable if and only if X is not paradox. The special case when the groups G.v are Abelian is particularly important. The action of G is called locally commutative if this happens. In other words, the action of G is locally commutative if the following condition is satisfied: whenever two elements of G have a common fixed point then they commute. The role of local commutativity in the theory of equidecomposability was discovered by R.M. Robinson A947). In this paper Robinson finds the minimal number of pieces which are needed to duplicate a ball. Banach and Tarski did not specify the number of pieces to obtain a paradoxical decomposition of the ball. In 1929 von Neumann remarked that 9 pieces suffice. Sierpinski A945a) used 8 pieces; more precisely he proved that the ball is E,4)-paradoxical. Finally, Robinson A947) showed that the minimal number is 5; that is, the ball is B,3)-paradoxical. He also proved that S2 is B.2)-paradoxical. Robinson's proof is based on the fact that the group of the rotations of a sphere is locally commutative: if two rotations have a common fixed point then they have the same axis and hence they commute. Actually, Robinson proved a much more general statement concerning the existence of sets satisfying a prescribed system of congruences; see Wagon A986, pp. 43-51). If our only aim is to prove that S2 is B, 2)-paradoxical then a simpler proof is available using the following result of Laczkovich A992c). THEOREM 6.3. Let G bea locally commutative group acting on a set X, and suppose that G is freely generated by the transformations f\ /„. Let A. B. H\ H„ be subsets ofX such that (i) for every ? e A there are indices 1 ^ ;'. j ^ n. i ? j such that ? e Я, П Hj and f(x)eB, fj(x)eB;and (ii) for every у e В there are indices 1 ^ /. j ^ ?. ?? j and points x, e Я, П A. x} e Hj П A such that f (x,) = fj (x,-) = у. Then there are partitions A = (J"_| A; and В = (J' = i Bi sucn tnat &i c Д' an^ fi (A,) = B, for every i = 1 n. PROOF (Sketch). LetF= [fi\Hr. i = 1 «}andr = />( ?. ?). We have to prove that ? contains a perfect matching. It is enough to prove that there is a perfect matching in any connected component ?\ of Г. We show that Г\ satisfies condition (ii) of Proposition 4.4. Since the degree of each point of Г is at most ?, ?\ is locally finite. The conditions (i) and (ii) imply that the degree of each point of Г\ is at least two. Finally, the fact that Г\ contains at most one cycle, can be deduced from the local commutativity of G. ? The following theorem is due to T.J. Dekker A956): see also Wagon A986, Theorem 4.5 and Corollary 8.6). The proof is taken from Laczkovich A991b). THEOREM 6.4. if the action ofG on X is locally commutative, where G contains Fj as a subgroup, then X is B, 2)-paradoxical.
104 ?. Laczkovich PROOF. Suppose that / and g generate a free subgroup of G, and let g, = fg' f ' (/' = 1, ...,4).Thengi, ...,g4 generate a free group of rank 4. Let X; = ? ? {/'}(/' = 1,2), and define/; (/"= l,...,4)on ? = X\ UX, by M*J)={gi(x)J) (x eX. /=1,2. 7=1,2), and MxJ)=(giix).3-j) ixeX, /=3,4. y= 1,2). It is easy to check that f (/' = 1,..., 4) generate a free and locally commutative group on Y. Now we apply Theorem 6.3 with A=Y. B = X\. H^=H2 = X\, #з = Щ = X2. Thus we obtain partitions X\ = A\ UAi and X2 = A3 U A4 such that f\(A\) U /т(Аг) U /з(Аз) U /4(Ад) is a partition of X \. Taking the projections to X we obtain the partitions X = B\ UB2andX = ?3??4 such that g\ (B\) U g2(B2) U ?з(В3) U #4(?4) is a partition of X. D It is a remarkable fact, proved also by Dekker in A956), that the converse of Theorem 6.4 is also true: ifG acts on X and if X is B. 2)-paradoxical, then G contains two independent elements such that the action of the generated subgroup is locally commutative. See also Wagon A986, Theorem 4.8). Since SO3 contains independent rotations and its action on S2 is locally commutative, we find that S2 is B,2)-paradoxical. Let D = (x e R3: |лг| ^ 1} be the unit ball and let D° denote its interior. The map ? ?* Ял = \J{rH: 0 < r < 1} (H с S) commutes with the elements of SO3 and thus E2)? = D° \ {0} is B,2)-paradoxical. More precisely, we find a decomposition D° \ {0} = D\ U D2 U D3 U DA such that g\ (D\) Ug2(D2) = ЫД0 U g4(U4) = D° \ @), where g\,...,g4 are independent rotations. Now, in order to prove that D is B,3)- paradoxical, we have to 'steal a point' from S2. One can prove, using the argument of Theorem 6.4, that if ? e S2 is not the fixed point of any of the nonidentity elements of the group generated by g \,..., #4, then there is a decomposition S2 \ {?} = С \ U C2 U Cj U C4 such that gi(C|)Ug2(Ci) = gT,(Ci)U g^iC^) = S2. Then we obtain the decomposition D = (D\ UC| U{0})U(D2UC2)U(D3UC3)U(D4UC4)U{p} such that g\ (D| U Ci U {0}) U g2{D2 U C2) = g3(D3 U C3) U g4(U4 U C4) U {0} = D. Note that the point ? is mapped into 0 by an isometry (say, a translation), and thus we obtain that D is B, 3)-paradoxical. Local commutativity proved to be useful in other topics of equidecomposability as well. One of these originates in von Neumann's paradox concerning piecewise contractions. A function / defined on А С R is a contraction, if there is a q < 1 such that \f{x) — f(y)\ < q\x — y\ for every x,y e A. A map /: A -> R is called piecewise contractive if there is a finite partition A = A\ U ·· · U A„ such that the restriction /| A, is a contraction for every / = 1 , n. The following theorem - the so-called von Neumann paradox - was proved by von Neumann A929). See also Wagon A986, Theorem 7.12).
Paradoxes in measure theors 105 THEOREM 6.5. For arbitrary intervals I and J there is a piecewise contractive bijection from I onto J. The linear fractional transformations are the functions (ax + b)/(cx +d) (a,b,c,d e R, ad — be ? 0) mapping R U {oo} onto itself. The set of linear fractional transformations will be denoted by LFT. It is easy to see that LFT forms a group under the operation of composition of functions. The proof of Theorem 6.5, as given by von Neumann in A929), uses the fact that LFT contains free subgroups. For example, the maps a(.x) = ? + 2 and ?(?) = ? /B? + 1) generate a free subgroup of LFT. Indeed, let ? be a nonempty reduced word in the letters ?, ?-1, 0.0-'.Since a±\x) = ±2+x and 0±l(.v)= , ' ,, , ±2+ \/x it follows that ? can be expressed as a continued fraction whose value tends to a finite number as ? —> oo. Therefore ? cannot be the identity map. Von Neumann also proves that if the numbers a*, bk. Ck,dk (к е /) are algebraically independent over the rationals, then the linear fractional transformations c*k defined by otk(x) = (akx + bk)/(ckx + dk) generate a free subgroup of LFT. (Proof: Suppose that the reduced word ? = a' ¦ ¦ -акПг represents the identity map. It is easy to check that p(x) = (Ax + B)/(Cx + D), where A, B. C, D are polynomials of a*·,, bk, ¦ c*,, dk, with integer coefficients, and AD — ВС ^ 0. If ? is the identity map then В = С — 0 and A = D. Since ak,bk.Ck,dk are algebraically independent, these equations must be identities. Therefore, if ? denotes the number of different indices among k\,... ,kr, then for arbitrary 0| , 0„ e LFT, 0"'1 · · · 0^"' is the identity map. However, if a(x) = ? + 2 and ?(?) =x/Bx + 1) and ?? =</0</ (k = 1 и), then 0™' •••0™' is a nonempty reduced word in the letters a±i and 0±! and, consequently, is not the identity.) Theorem 6.5 easily implies the following: if ?. ? are bounded subsets of №. with nonempty interior, then A can be mapped, using a piecewise contractive map, onto B. Suppose that the set А с R is mapped, using a piecewise contractive map, onto B. If ?(?) and the number of pieces, n, are given, then ?(?) cannot exceed ??(?). The next theorem by Laczkovich A992c) gives the sharper estimate ?(?) < »?( ?)/2, and also shows that this bound is the best possible. Theorem 6.6. (i) Let А, В CK be measurable and suppose that there is a map f: A —> Ш. and a partition A = A\ U · · · U A„ such that В = f(A) and /|A, is a contraction for every i = ?,.,.,?. ThenX(B) <n ·?(?)/2. (ii) Let AcRbe measurable and let J be an interval with \J\ <n ¦ ?( A)/2, where ? is a positive integer. Then there is a map f : A —* R and a partition A = A \ U · · · U A„ such that f(A) — J and f\ A, is a contraction for every i = 1 n. If A is an interval, then f can be chosen to be a bijection between A and J.
106 ?. Laczkovich Let J be an interval with 1 < |7| < 3/2. Then (ii) of Theorem 6.6 implies that there is a bijection from [0, 1] onto J which consists of three contractions; that is, von Neumann's paradox can be realized using three pieces. On the other hand, such a paradoxical decomposition does not exist if only two pieces can be used, as (i) of Theorem 6.6 shows. The action of LFT on R U {oo} is not locally commutative. For example, the functions 1 /x and 2x — 1 have a common fixed point, but do not commute. However, it was proved in Laczkovich A992c) that if ал . Ьл, ?. ^ (к e /) are algebraically independent over the rationals, then the linear fractional transformations (a^x + bk)/(cux + dk) generate a free subgroup of LFT with a locally commutative action. The proof of Theorem 6.6(ii) is based on this result together with Theorem 6.3. The statement (i) of Theorem 6.6 is actually a special case of the following theorem proved in Laczkovich A988a). Let Lip / denote the Lipschitz constant of the map /: A ->· R"; that is, let Lip/ = sup{|/(.v)-/(y)|/|.r- v|: .v.veA. хфу\. Suppose that A = A \ U · · · U Ал is a partition of the set А С К", and f: A ->· R" is a map such that /| A, is a Lipschitz function with Lip(/|A;) ^ M, for every i = 1 к. Then the inner Lebesgue measure of f (A) is at most ? ¦ ?„(?), where М = тах(м!',...,М^]Гм;'У If the restrictions /|A, are contractions, then we may apply this estimate with M\ = ¦¦¦ = M„ = 1 - ?, and obtain (i) of Theorem 6.6, even in higher dimensions. On the other hand, it is not known whether or not Theorem 6.6(ii) can be generalized to R". The heart of the matter is the following question posed in Laczkovich A991b). PROBLEM 6.7. Is it true that every measurable subset ofR" of positive measure can be mapped by a Lipschitz map onto a ball? For ? = 1 the answer is yes, as the following simple argument shows. Let К be a compact subset of A having positive measure. Suppose К с [a. b] and ?(?) = d. Then the function f0 ifx^a. /(?)=\?(??[?,?]) ifxe[a.bl id ifx^b maps both К and A onto [0, ii], and satisfies the Lipschitz condition \f(x) - /(v)l ^ \x-y\(x,yeR). For ? = 2 the problem was solved by D. Preiss A992) in the affirmative. A different proof was found by J. Matousek A997). For ? ^ 3 the problem is open. Now we return to nonamenable actions and paradoxical sets. Let SL„[R] and SL„[Z] denote the groups of ? ? ? matrices of determinant 1 with real and integer entries,
Paradoxes in measure theory 107 respectively. For ? ^ 2 these groups contain free subgroups. Indeed, it is easy to check that \ ax +b J ex +d defines a homomorphism of SL2[R] into LFT. Since the maps <*(*)=¦*+ 2 and ?(?) = x/Bx + 1) generate a free subgroup of LFT, it follows that the matrices [,', ]] and [\ , ] generate a free subgroup of SL2[Z]. If // > 2 then SL„[R] and SL„[Z] contain SL2[Z] as a subgroup, therefore they also contain free subgroups. It was proved by von Neumann A929) that the unit square is paradoxical under affine transformations with determinant 1. Wagon proved the following generalization: Let ?. ? be two independent elements o/SL2[Z], and let G denote the group generated by ?. ? and by all translations. Then any two bounded subsets ofR2 are G-equidecomposable. See Wagon A986, Theorem 7.3). Wagon A986, Question 7.4, p. 101) asked if the interior of the unit square is paradoxical under SL2M, or even under SL2[Z] (translations are not allowed). Recently J. Mycielski A998) proved the following theorem. There exists a finitely additive measure m over all bounded subsets ofR" which is an extension of the Jordan measure and is invariant under SL„[Z], moreover, satisfies m(rH) = \r\"m(H)for every r eRand bounded ? с R". Mycielski's theorem implies that no bounded subset of R" with nonempty interior can be paradox under SL„[Z]. Mycielski A998) also proved that the interior of the unit square is paradoxical under SL2[R], subject to the following conjecture (C): there is a free group F acting on the set D = {x e R2: 0 < |.r| < 1} without nontrivial fixed points such that each transformation of F is the union of finitely many elements of SL2[R], restricted to subsets of D. In fact, Mycielski proves that conjecture (C) implies the following more general result: if A and В are bounded subsets of R2 \ {@.0)} with nonempty interior, and either they contain triangles with one vertex at @.0) or their distance from the origin is positive, then A and В are SL2[R]-equidecomposable. This result was proved, without using conjecture (C), in Laczkovich A999). The proof is based on the fact that the action of SL2[R] on the set R2 \ {@.0)} is locally commutative; see Wagon A986, p. 39). The question, whether conjecture (C) is true or not, remains open. We conclude this section with the following question motivated by Theorem 6.1: find conditions on the nonamenable action G that are necessary and sufficient for there to be no finitely additive, G-invariant measure on V(X) having total measure 1. (Greenleaf's problem; see Wagon A986, Problem 6, p. 230).) We mention that the following analogue of Dixmier's condition works. Let N denote the linear span of the functions (fog) — /, where / e B(X) and g e G. Then there is a G- invariant finitely additive probability measure on V(X) if and only if inf / ^ 0 ^ sup/ holds for every f e N. Indeed, if m is a measure with the required properties then fx f dm = 0 for every / e N and thus the condition is necessary. The sufficiency follows by an easy application of the Hahn-Banach theorem. a b с d
108 ?. Laczk/nich 7. Marczewski's problem When Banach proved that the Lebesgue measure can be extended to all subsets of R2 as a finitely additive invariant measure, E. Marczewski realized that a modification of Banach's construction yields a measure m with the following properties: m is a finitely additive invariant measure defined on all subsets of R-, m is an extension of the Jordan measure, and m vanishes on meager sets. (A set is meager, or first category, if it is the union of countably many nowhere dense sets.) Around 1930 Marczewski asked whether a measure with these properties exists on all Borel subsets of R-\ See Mycielski A979) and Wagon A986, Problem 1, p. 229). This problem remained unsolved for 60 years until, in 1991, R. Dougherty and M. Foreman presented a most surprising solution. A set is said to have the Baire property, if it differs from an open set in a meager set. A variant of Tarski's theorem (Theorem 5.1) shows that the following statements are equivalent: (i) Marczewski's measure does not exist in R3; and (ii) the Banach- Tarski paradox can be realized with pieces having the Baire property. Since (ii) sounded very unlikely, it was generally believed that Marczewski's measure exists in R3; see Wagon A986, p. 30). However, as R. Dougherty and M. Foreman proved in A992) and A994), (i) and (ii) are true. THEOREM 7.1. For every ? ^ 3, the ball B" = {.v e R": \x\ ? 1} is paradoxical with pieces having the Baire pmperty. Consequently, Marczewski's measure does not exist in R" for ? > 3. Dougherty and M. Foreman proved that ?3 is paradoxical using six pieces having the Baire property. This is best possible by the following theorem of Wehrung A994b): every paradoxical decomposition of a compact metric space with pieces having the Baire property contains at least six pieces. Theorem 7.1 is an easy consequence of the (classical) Banach-Tarski paradox and the following result called 'the main lemma' in Dougherty and Foreman A994). THEOREM 7.2. Suppose X is a separable metric space and G is a countable group of homeomorphisms of X acting without nontrivial fixed points. Suppose that {ri.g,: 1 ^ /' ^3)CG generate a free subgroup of G of rank 6. Then there are disjoint open sets Rj, G,¦ A ^ / ^ 3) such that U,3= ? П (/?/) = U/L ? Si (Gi) is a dense open subset ofX. Since Theorem 7.1 uses the Banach-Tarski paradox, its proof is nonconstructive. On the other hand, the proof of Theorem 7.2 does not use the axiom of choice. In fact, the proof of Theorem 7.2 runs like this. Fix a countable dense G-invariant subset D in X, and endow it with the subspace topology. It suffices to construct disjoint relatively open subsets Ri, G, (i = 1. 2, 3) of D such that (JiLi /·,(/?,) = U, = i gi(Gi) is dense in D. Indeed, if we let R- be the interior of the closure of /?, in X and G ¦ be the interior of the closure of d in X then /?,·, G, will satisfy the requirements of Theorem 7.2. Now, for the countable subspace D. the open sets /?,. G, are given by an explicit construction based on an intricate induction argument.
Paradoxes in measure theory 109 If A, Set" are open sets then we shall write A « B, if there is a pairwise disjoint collection {A \,..., А к} of open subsets of A whose union is dense in A and a collection {/?. · · ·. Л-} of isometries of R" such that [f\ (A\) /*(Ajt)} is a pairwise disjoint collection of open subsets of В whose union is dense in B. It is easy to see that ъ is an equivalence relation. The following theorem is a striking corollary of Theorem 7.2. THEOREM 7.3. Let ? > 3, a/id A a/id ? foe nonempty bounded open subsets ofR". Then A ^B. PROOF. Let n, g-, (/' = 1. 2. 3) be independent rotations of the open unit ball U = {x e R": \x\ < 1}, and let G denote the group generated by r, and g,. It is easy to see that the set F = {x e U: f(x) = ? for some / e G. f ? identity) is G-invariant and consists of a countable union of line segments. Thus X = U \ F is a dense G-invariant subset of U. By Theorem 7.2, there are disjoint relative open subsets /?,-. G, С X such that Ui = i r>'(Ri) = U/=i SiiG;) is a dense relatively open subset of X. Let /?(' and G- denote the interior of the closure of /?, and G,, respectively, and put C/| = Uf=i K('. ^ = U/=i ^?· ^ 's c'ear that U\C\U2 = 0, and U\^U2^ U. This easily implies that if an open set V can be covered by two open balls of radius 1, then V % ? for some open subset ? с U. Then an obvious induction argument shows that this is true for every open set that can be covered by any (finite) number of balls of radius 1; that is, for every bounded open set. Now let A and В be nonempty bounded open sets. We may assume that they both contain the unit ball. Then A « H\ с В and В ~ Н2С A and hence there are collections of disjoint open sets {A\, ?*}, {?| Bm} and isometries f\ fk. g\ gm such that Uf=i At is a dense subset of A. U'Li ^j is a dense subset of B. and the systems {/?(A|),..., fk(Ak)}, {g\(B\) gni(Bm)} consist of pairwise disjoint subsets of В and A, respectively. Let A' = ljf=| A,. B' = (J'j'=i Bj, and /(.v) = /,(л) (? e A,, /' = 1,...,k), g(x) = gj(x) (x e Bj, j=l m). Now we apply a modification of the argument proving the Banach-Schroder-Bernstein theorem. We put Co = int(A \ g(B')) and define by induction Q+? =(?o /)(Q·) for every A: = 0, 1,.... Putting С = (Jjtto C^ and D = int<A \ C)- one can check that C U D is a dense open subset of A, /(C) and g~] (D) are disjoint open subsets of B, and /(C) U g~l (D) is dense in B. Thus A^B, which completes the proof. D We gave this proof in order to make it clear that this amazing theorem is proved entirely constructively. As R. Dougherty and M. Foreman point out, we have a theorem, proved without the axiom of choice, and asserting that there is a collection of disjoint open subsets of the sun, that fill the sun (in the sense that there are no 'holes' of positive radius) and that can be rearranged by rigid motions to remain disjoint and fit inside a pea. When Marczewski constructed his measure, he also asked if the linear measure (one- dimensional Hausdorff measure) can be extended to all subsets of R2 as a finitely additive invariant measure; see Mauldin A981, Problem 169). J. Mycielski A979) showed that the answer is yes. Moreover, as Mycielski noted, for every 0 ^ s ^ 2, the s-dimensional Hausdorff measure has an invariant extension to ^(И2) by the invariant measure extension Theorem 3.1. The analogous statement in R3 is false: the 2-dimensional Hausdorff measure
по ?. Laczkovich does not have an invariant extension for all subsets of R3, since the existence of such an extension would make the Hausdorff paradox Theorem 2.1 impossible. On the other hand, it is not known whether or not the s -dimensional Hausdorff measure for 0 < s < 2 can be extended to all subsets of R3 as a finitely additive invariant measure. 8. Tarski's circle-squaring problem Tarski A925) asked whether a disc and a square of the same measure are equidecom- posable. (By Theorem 2.3, if two Lebesgue-measurable plane sets are equidecomposable, they must have the same measure.) The answer to Tarski's problem is negative if we impose some restrictions either on the pieces of the decompositions or the isometries of the rearrangement. L. Dubins et al. A963) proved that the disc is not 'scissor-congruent' to the square; that is, if the pieces are restricted to be Jordan domains (topological discs), then the disc and the square are not equidecomposable. The other negative result is due to R.J. Gardner A985). Gardner proved that the 'circle- squaring' is impossible, if the pieces are to be moved by isometries generating a locally discrete group; there are no restrictions on the pieces themselves. (A group G of isometries of R" is called discrete if for every compact subset С С R", we have С П g(C) = 0 for all but a finite number of g e G. The group G is locally discrete, if each finitely generated subgroup of G is discrete.) This negative statement is a corollary of the following positive theorem. See Gardner A985, Corollary 21). A generalization is given in Gardner and Laczkovich A990). THEOREM 8.1. Let G be a locally discrete group of isometries ofR". Suppose that K\ is a convexpolytope and Ki is a convex body in R". IfK\ and K2 are G-equidecomposable, then Кг is a convex polytope and К \ and K2 are G-equidecomposable with convex pieces. Tarski's original question was answered affirmatively in Laczkovich A990). As it turns out, the disc is equidecomposable to a square of the same area using translations alone. Here we shall discuss a generalization of this result to arbitrary dimensions and more general figures as given in Laczkovich A992b). For ?, ? с R" we shall write ?~?, if A and В are equidecomposable using translations; that is, if there are finite decompositions A = U'/ = i Aj- В = \J'j=\ Bj an(^ vectors ? ?,..., xd e R" such that Bj = Aj +xj (j = 1 d). By Theorem 3.2, if ?, ? are Lebesgue measurable and A~ ?, then ?„(?) = ?„(?). We shall prove the following partial converse of this statement: if A. В с R" are bounded Lebesgue measurable sets with ?„(?) = ?„(?) > 0, then A ~ B, provided that A and В have small boundaries. As it turns out, the relevant notion of smallness here is the box dimension defined as follows. Let Q" denote the set of cubes ??-1 m a\~ — m ? · • X a„ - 1 a» m m (a, eZ, /' = 1 и).
Paradoxes in measure theory 111 and let N (m, E) denote the number of cubes Q e Q", with Q ? ? ? 0. The box dimension of the bounded set ? с R" is defined as \ogN(m, E) ?(?) = limsup . hi->зс logw It is well-known that ?(?) = limsup 1-й, ?^?+ -log ? where U{E,s) = (x: dist(jr, ?) ^ ?). The following theorem is proved in Laczkovich A992b). THEOREM 8.2. Suppose that H\ and H2 are bounded measurable sets in R" such that k„(H\) = k„(H2) >0andA(dH\) <??. ?C?2) <?. Then H\~H2. It is well-known that if ? is bounded and convex then there is a constant С > 0 such that k„(U(dH, ?)) ^ С · ? for every 0 <? ^ 1; see Eggleston A963, Theorems 41 and 42, p. 86), Thus A(dH) ^ ? - 1 for every convex HcR". Also, if A is a Jordan domain in R2 with a rectifiable boundary, then Л(Э A) = 1. Therefore we obtain the following corollary. Corollary 8.3. (i) If A, B cR" are bounded convex sets with ?„(?) = ?„(?)> 0, then A ~ ?. (ii) If A, BcM2 are Jordan domains of the same area and with rectifiable boundaries, then A ~ B. Clearly, each of (i) and (ii) settles Tarski's question. C.A. Rogers asked whether or not the set " 1 2 . 3' 3 и 7 8 9' 9 U 25 26' 27' 27 , is equidecomposable to @, 1/2). See Wagon A986, pp. 119 and 230). It is easy to check that Л(Э А) = О and hence, by Theorem 8.2, the answer to Rogers' question is affirmative. More generally, if [a*, bk ] is a sequence of intervals such that 0 ^ au < bk <k~e for every к with a positive ?, then В = (Jbli [fl*> ^*] is equidecomposable to an interval. Indeed, it is easy to see that in this case Л(дВ) ? 1 /A + ?) < 1 and hence Theorem 8.2 applies. On the other hand, it is proved in Laczkovich A993) that there exist intervals converging to 0 such that their union is not equidecomposable to an interval. The proof of Theorem 8.2 is based on a sufficient condition for the equidecomposability of sets in terms of the discrepancy of some special sequences.
112 ?. Laczkovich We shall denote the unit cube {(r,,.... t„): 0 ^ t,¦ < 1 (/ = 1...., n)} by /". If F с /" is finite, \F\ = N, and ? с /" is measurable, then the discrepancy of F with respect to ? is defined as D(F: H) = ¦^\FnH\-k„(H)\ N If a e R then {a} denotes the fractional part of a, that is, {a} = a - [a]. For every z = (z\,...,Zk)eR" we denote (z) = ({? ?}...., U,,}) (i.e., U) e/" and с - (?) e ?"). If и, jti ,..., Xd e R" and N is a positive integer, then we put FN(u\x\, ....xti) = {(u+n\x\ ? VihiXdY- »/ =0 N- 1 (/' = 1 d)\. The following theorem was proved for plane sets and for d = 2 in Laczkovich A990), for subsets of R" and for arbitrary d, in Laczkovich A992b). THEOREM 8.4. Let H\, H2 be measurable subsets of I" with k„(H\) = ?„(?2) > 0 and suppose that there are vectors x\,..., xj eR" such that (i) the unit vectors e, = @, 0. 1,0 0) (/ = 1, n)andx\ , ?</ are linearly independent over the rational numbers, and (ii) there are positive constants ?. ? such that D{FN(M\x\,...,xd)\Hj)^K-N- ¦ \-t for every ueR". N =1,2 and j= 1.2. Then H\ ~ H2. This theorem can be formulated in a slightly simpler way if we identify the unit cube /" with the torus R" /Ъ". Indeed, in this case we do not have to bother with the fractional parts in the definition of Fn(u; x\ , x^) and with the unit vectors e, in condition (i). It is easy to see that this identification does not affect the equidecomposability of subsets of /" if we use only translations. To be precise, let ?: R" -> R" /Z" be the natural homomorphism. Then, for every А, В с I". A~S in R" if and only if ? (?) ~ ?(?) in R"/Z". (See Laczkovich A994, p. 172).) In this form Theorem 8.4 is not restricted to R"/Z", but can be formulated in every Abelian group as follows. Let G be an Abelian group with the group operation written additively. Let m be a finitely additive and invariant probability measure on V(G) (since Abelian groups are amenable, such a measure exists). If F с G is finite, \F\ = N and А с G is arbitrary, then we denote D(F;A) = -\FDA\-m(A) N
Paradoxes in measure theory 113 If u, x\,..., xd e G and NeN then we shall write FN(u;x\, ...,Xd) = {u+n\x\ ? \-ndXd'- nt = 0 ? - 1 (/' = 1, —d)}. The elements ? ?,..., Xd e G arc caWcd independent, ifn\x\ -\ YndXd = 0(ni , n</ e Z) implies n\ = ¦ ¦ ¦ = nd = 0. Then we have the following generalization of Theorem 8.4. THEOREM 8.5. Suppose that H\, H2 С G. m(H\) = m(H2) > 0, and there are independent elements x\,..., лг</ е G and positive constants ?, ? such that D(FN(u;xi,...,xd):HJ)<:K-N~i-F for every и e G, N= 1,2,..., and j = 1.2. Then H\ ~ H2. The core of the proof of Theorem 8.5 is a combinatorial statement proved in Laczkovich A992a, Remark 3.3). We shall say that a set S с M.d is discrete, if every bounded subset of S is finite. By a lattice cube we mean a set of the form Q = [a\,a\ + N [ ? · · · ? [?(/, ?</ + ?[, where ? e N+ and a, e ? (/' = 1 d). The length of the side of the cube Q is denoted by s(Q). THEOREM 8.6. Let 5?, S2 be discrete subsets ofW1, and suppose that there are positive constants ?, ?, ? such that \\SjnQ\-aXd(Q)\^K-s(Q)d-]~F for every lattice cube Q С W1 and j = 1,2. Then there is bijection ? from S\ onto S2 such that \?(?) —x\^ ? for every· ? e S\, where the constant ? only depends on d, ?, ? and a. Supposing Theorem 8.6, Theorem 8.5 can be proved as follows. We put ? =m{Hj) (j = 1, 2). If a = (a\,..., ad) e M.d then we shall abbreviate the linear combination <3|jti + h ad Xd by a ¦ ?. Suppose that ? \,..., Xd satisfy the conditions of the theorem and let ? denote the subgroup of G generated by jr 1, xcj. Let U с G be a set containing exactly one element of each coset of H, then every element g e G has a unique representation of the form g = и + ? ¦ ? where и e U and ? e Zd. Putting Sj(u) = [n eZd: u+n-x e Hj) (и е G. j= 1,2), it is easy to see that the condition of Theorem 8.5 implies that НЗД П Q\-aXd{Q)\ ? Ks{Q)d^~F
114 ?. Laczkovich holds for every lattice cube Q с К'' and j — 1. 2 (see the proof of Theorem 1 in Laczkovich A992b)). Then, by Theorem 8.6, there is bijection ф„ from S\ (и) onto S2(u) such that \<Pn(z) — z\^M for every с e S\ (/<). where the constant ? only depends ond. ?.? and a. The important point here is that ? does not depend on u. If g = u+nxeH\ (u eU. ? e Z''), then // e 5|(/<). Let <?„(") = "'· As "' e 5%(и), we have и + н' · лг е Я2. Let ?(#) = /< + ?' · .v. Then ? is a well-defined map from H\ into H2. As 0„ is a bijection from 5? (и) onto S2 (и), it is easy to see that ? is, in fact, a bijection from H\ onto #2 such that \n' — n\ ^ M. Thus for every g e H\ there is a vector a = (a\,... ,aj) eZ'1 such that |a, | ^ ? for every / = 1 d, and ? (g) = g + ? · ?. Let {^??,^? be an enumeration of the elements a ¦ л. where a e Z'1 and |?,| ^ ? for every / = 1,..., d. Let A, = [geHr. X(g) = g+d,} (t=\ ?). Since ? is a bijection from #i onto H2, it follows that Ur = i A/ and Ui=i(^' + ^') are disjoint decompositions of H\ and H2, respectively, and this completes the proof of Theorem 8.5. Now Theorem 8.2 can be deduced from Theorem 8.4 by making use of some results of discrepancy theory. If F с /" is finite then the (absolute) discrepancy of F is defined as D(F) = supD(F; J). J where the sup is taken over all subintervals J = [a\.b\[x- ¦¦ ? [a„ .b„[c I" ¦ The quantity D(F) can be estimated using exponential sums by a formula due to Erdos, Turan and Koksma (see Kuipers and Niederreiter A974, p. 116)). Using this estimate and some results of W. Schmidt A964), one can show that for almost every .? ? xct e /" there are positive constants С and К such that D (FN( u: ? ? ?,/)) ^C (logN)K ¦ N~cl for every и е R" and N = 2.3, Also, by a theorem of Niederreiter and Wills A975, Korollar 4, p. 133), Л(ЭА) < ? implies that the discrepancy D(F: A) can be estimated by a power of D(F). Putting these results together we obtain that if ? (9 H\) <n. А(дН2) <п, then, for dlarge enough, condition (ii) of Theorem 8.4 is satisfied for almost every choice of x\...., x<i, and hence we have H\ ~ H2. Theorem 8.2 leaves several interesting questions open. The first concerns the number of pieces in the decompositions. Let N be the minimal number such that D ~,v Q, where D is a disc, Q is a square and arbitrary plane isometries can be used. The proof of Theorem 8.2 gives a very large upper bound for N: a rough estimate is 104(). (The proof of Laczkovich A992b) gives a smaller number than that in Laczkovich A990); see Section 10 of the latter.) It would be desirable to give a reasonable estimate for N, not only for its own sake, but also because such an estimate would require new ideas. We remark that it is not known whether or not N > 3. By an unpublished (and nontrivial!) result of R.J. Gardner, we have N > 2. If only translations are allowed, then N > 3 follows from
Paradoxes in measure theon 115 Gardner's Theorem 8.1. Indeed, suppose that D and Q are equidecomposable using the translations t\,r2,r3. Replacing D by t\(D) we may assume that t\ is the identity. Then t\, t2, i3 generate a discrete group, which contradicts Theorem 8.1. Another important question concerns the measurability of the pieces. We shall discuss this problem in the next section. 9. The problem of equidecomposability with measurable pieces The following two problems are posed by S. Wagon A986, p. 229) as 'variations on Tarski's circle-squaring problem'. PROBLEM 9.1. Is a disc equidecomposable to a square of the same area using Borel measurable pieces! PROBLEM 9.2. Is a regular tetrahedron in R·1 equidecomposable to a cube using Lebesgue measurable pieces! We may add the following question posed in Section 10.2 of Laczkovich A990). PROBLEM 9.3. Is every polygon equidecomposable to a square using translations and Lebesgue measurable pieces! All these questions are open. Theorem 8.2 does not settle these problems, because its proof uses the axiom of choice and does not give any information about the measurability of the pieces. A more general conjecture was formulated by R.J. Gardner A989, p. 54). CONJECTURE 9.4. Suppose A and В are Lebesgue measurable sets in R". If A and В are equidecomposable under isometries from an amenable subgroup G of G„, then they are equidecomposable with measurable pieces. The condition of amenability is important, since, by the Banach-Tarski paradox, two balls of different radius are equidecomposable under the (nonamenable) group of isometries of R3, but are not equidecomposable with measurable pieces. If Conjecture 9.4 is true, it provides a positive answer to Problem 9.2. Indeed, by Theorem 8.2, the regular tetrahedron is equidecomposable to a cube of the same measure using translations. Since the translation group is amenable (even Abelian), Conjecture 9.4 applies. Clearly, Conjecture 9.4 would also imply that a disc is equidecomposable to a square of the same area with Lebesgue measurable pieces. If, in the conclusion of Conjecture 9.4 we insisted that A and В are equidecomposable with measurable pieces under the same isometry group G, then it would imply that the answer to Problem 9.3 is also positive. Motivated by Theorem 8.1, Gardner also formulates the following conjecture. CONJECTURE 9.5. Let ? be a polytope and К a convex body in R". If ? and К are equidecomposable with Lebesgue measurable pieces under the isometries g\ gk from
116 ?. ^czkovich an amenable group, then ? and К are equidecomposable with convex pieces under the same isometries g\,..., gk· However, at most one of Conjectures 9.4 and 9.5 can be true. Indeed, the disc and the square are not equidecomposable with convex pieces, and hence, if Conjecture 9.5 is true then the disc and the square cannot be equidecomposable with measurable pieces. Thus Conjecture 9.5 contradicts Conjecture 9.4. As Gardner remarks A989, p. 58), of Conjectures 9.4 and 9.5, probably the latter should be discarded. And, in fact, Conjecture 9.4 is true if we replace the pieces of the decompositions by functions. Let ? ? denote the characteristic function of the set H. The sets ?. ? с R" are said to be continuously equidecomposable, if there are functions f : R" —> [0, 1 ] and isometries g,¦ (/ = 1,... ,k) such that ?? = ?*=l f and ?? = ?., = \ f> ° Si- (This notion was introduced by F. Wehrung in A990).) Obviously, A ~ В implies that A and В are continuously equidecomposable (let /?,..., fk be the characteristic functions of the pieces in the decomposition of A). It is not difficult to show that the converse is also true, so that continuous equidecomposability is actually equivalent to the 'classical' equidecomposability; see Wehrung A994a, Lemma 1.3). However, the following result was proved independently by F. Wehrung and by the author; see Laczkovich A996). THEOREM 9.6. Suppose A and В are Lebesgue measurable sets in W. If A and В are equidecomposable under the isometries g\ gk /ют an amenable group, then A and В are continuously equidecomposable with Lebesgue measurable functions f\ fk and with the same isometries g\,.... gk. In addition, F. Wehrung proved a theorem containing the following result as a special case. See Wehrung A992, Corollary 3.23) and A993). THEOREM9.7. Assume that there exists a medial measure. IftheBorelsetsA, В С ?" are equidecomposable under the isometries g\, ¦. ¦. gk from an amenable group, then A and В are continuously equidecomposable with universally measurable functions f\...., fk and with the same isometries g\,.... gk· A medial measure is a universally measurable finitely additive measure on V(N) vanishing on singletons and normalizing N. It is known that the continuum hypothesis implies the existence of medial measures and thus the conclusion of Theorem 9.7 is consistent with ZFC. One is tempted to consider Theorems 9.6 and 9.7 as evidence for the truth of Conjecture 9.4. Moreover, these results even suggest the following: if the Lebesgue measurable sets A, В с ?" are equidecomposable under the isometries g\, ...,gk from an amenable group, then A and В are equidecomposable with measurable pieces under the same isometries g\,...,gk· Unfortunately, this is not true even for ? = 1, as the following example by Laczkovich A988b) shows. Let и be an irrational number, and let g\ (x) = x + u. gj{x) = x — u, gj(x) = и — x and g4(дг) = 2 - и - x (.t e R); then g\ g4 are isometries of E. It is shown in
Paradoxes in measure theory 117 Laczkovich A988b) that [0, 1] is equidecomposable to itself under g\,..., gA, but it is not equidecomposable to itself under these isometnes with Lebesgue measurable pieces. The group generated by g\,..., #4 is amenable, since it is a subgroup of the isometry group of R. We do not know if such an example exists with commuting isometnes, so we pose the following question. PROBLEM 9.8. Let A and В be Lebesgue (Borel) measurable sets in R" and suppose that A and В are equidecomposable under the commuting isometnes g\,..., gk- Is it true that A and В are equidecomposable under the same isometnes with Lebesgue (Borel) measurable pieces'? Since translations commute, a positive answer to Problem 9.8. would also provide positive answers to Problems 9.1, 9.2 and 9.3. 10. Countable equidecomposability and countably additive invariant measures We start with the following theorem of K. Ciesielski and A. Pelc A985). THEOREM 10.1. The Lebesgue measure in R" does not have a maximal, countably additive and isometry-invariant extension. This solves an old problem of Sierpinski. For a short proof and related results see Ciesielski A990) and Zakrzewski A990). Zakrzewski A995) also gave a somewhat improved version of the Ciesielski-Pelc construction. For more details see Section 6 of Zakrzewski B002) (this Handbook). Let G be a group acting on a set X. We say that the sets А, В с X are countably equidecomposable, A ~oo B, if there is a partition of A into countably many sets A; (/' = 1,2,...), and there are transformations g; e G such that U/?? gi(Ai) is a partition of B. This notion was introduced by Banach and Tarski A924). They proved, among other results, that any two subsets of R" with nonempty interior are countably equidecomposable. (For this topic, see also Wagon's book A986, pp. 135-145).) A set А с X is called countably paradoxical, if there are subsets A ?, ?? с A such that A = A1 U A2, ? ? П At = 0 and ? ~x ? ? ~x A?. In this section by the term measure we shall mean a countably additive, nonnegative extended real valued set function. It is natural to ask whether the analogue of Tarski's Theorem 5.1 holds for countably additive invariant measures and countably paradoxical sets. The exact analogue would say that for every ? с X the following statements are equivalent: (i) there exists a G-invariant measure that normalizes E\ and (ii) ? is not countably paradoxical. R.B. Chuaqui A969) conjectured that this generalization holds, but later he found the following counterexample (see Chuaqui A973) or Wagon A986, p. 136)). Let X be a set of cardinality ? ? and let G be the group of those permutations of X which fix all but a finite number of points of X. By a well-known theorem of Ulam (see Oxtoby A980, Theorem 5.6)), there is no nontrivial measure that vanishes on all singletons. Since
118 ?. Laczkovich all singletons are congruent under G's action, a G-invariant measure must vanish on all singletons. Thus, by Ulam's theorem, (i) of Chuaqui's conjecture does not hold for ? = X. However, X is not countably paradoxical. Indeed, suppose that there are disjoint subsets ? ?, Хг of X such that X ~x X \ ~x X2. Since the transformations involved in ? ~? ? ? and X ~зо Xi can move only a countable set of points, there is ? e X such that ? is a fixed point of all these transformations. This implies ? e X \ and ? e Xi,a contradiction. With this example in mind, Chuaqui modified his conjecture as follows. Let Nc denote the family of all sets А с X such that there are infinitely many pairwise disjoint sets А,- с X with ?, ~? A. Then the following are equivalent: (i) there exists a G-invariant measure that normalizes X; and (ii) there exists a measure that normalizes X and vanishes on Nq. (Note that a measure vanishes on No if and only if it vanishes on all countably paradoxical set. Indeed, it is easy to see that every countably paradoxical set belongs to Nc and every element of Nc is a subset of a countably paradoxical set.) This conjecture was proved by P. Zakrzewski A991). More exactly, Zakrzewski proved the following stronger result. THEOREM 10.2. Let the group G act on X and let ? С X. Then the following are equivalent. (i) There exists a G-invariant measure that normalizes E. (ii) There exists a measure that normalizes ? and vanishes on all countably paradoxical subset of E. Zakrzewski A993) also proved the following generalization. A set А с X is called weakly wandering under G if there are transformations g„ e G such that the sets gn(A) (n = 1, 2,...) are pairwise disjoint. THEOREM 10.3. Let G be a group of measurable transformations of a ?-finite measure space (?, ?, m). Then the following are equivalent. (i) There exists a G-invariant probability measure ? defined on ? such that m is absolutely continuous with respect to ?. (ii) There does not exist any set A e ? of positive measure which is weakly wandering under G. As an immediate corollary, Zakrzewski obtains the following. Let G act on X, and let ? be a G-invariant ?-algebra. Then there is a G-invariant probability measure on ? if and only if there is a probability measure on ? that vanishes on Nc- For further results on invariant measures see Zakrzewski B002) in this Handbook. It was proved by Becker and Kechris A993) that for Borel actions of a Polish group on a standard Borel space the perfect analogue of Tarski's theorem holds: There exists an invariant Borel probability measure if and only if the space is not countably paradoxical. See also Becker and Kechris A996, Chapter 4) and Sections 4 and 5 of Zakrzewski B002). We also mention the paper by Penconek A991) which contains several interesting results about countably paradoxical sets in Ш.". Penconek proves, among others, that every measurable set А С Ш." contains a countably paradoxical set В such that A\ В is null.
Paradoxes in measure theory 119 11. The nonconstructive element in the paradoxes The axiom of choice, formulated by Zermelo A904), aroused much controversy from the very beginning. As early as in 1905, the Bulletin de la Societe mathematique de France published a debate among Baire, Borel, Hadamard, and Lebesgue on Zermelo's axiom and, in the same year, several articles of the Mathematische Annalen were also devoted to this topic. Among those who opposed to the axiom most vehemently were Baire, Borel and Lebesgue. They did not realize that the theory itself they had developed contained nonconstructive elements in that the ?-additivity of the Lebesgue-measure cannot be proved in ZF alone. (For this early opposition to Zermelo's axiom see Sierpiriski A918), Moore A982) and Dauben A990, pp. 250-259).) The existence of nonmeasurable sets and the paradoxes intensified the criticism, and these discoveries were used as 'evidence' against the axiom of choice. Strictly speaking this is not justified, since either one has fundamental and philosophical objections against the axiom of choice (as the early opposers had) and then it does not really matter what actual consequences the axiom has, or one accepts the axiom and then has to accept the consequences as well. Justified or not, this criticism motivated the investigations to determine the exact relation between the Banach-Tarski paradox and the axiom of choice. A comprehensive account of these investigations (up to 1986) is given in the last chapter of S. Wagon's book A986). Here we only mention the most important facts. First, by a theorem of R.M. Solovay, the existence of the Banach-Tarski paradox is independent from the Zermelo-Fraenkel axioms. More exactly, Solovay proved that it is consistent with ZF that there exists an isometry-invariant and countably additive extension of the Lebesgue measure to all subsets of Ш."; moreover, supposing the axiom of choice for countable families of sets (or even a slightly stronger statement called the axiom of dependent choice), the existence of such a measure still remains consistent. Obviously, an isometry- invariant extension of the Lebesgue measure makes the Banach-Tarski paradox impossible. Thus the 'consistency-strength' of the Banach-Tarski paradox over ZF is not lower than that of the axiom of dependent choice. As M. Foreman, F. Wehrung and J. Pawlikowski showed, this consistency-strength can be estimated from above as follows. The axiom of choice, in its full strength, is almost never used in analysis. Some weaker statements, however, are needed in several applications. Such statements are, for example, the Boolean Prime Ideal theorem and the statement that the product of any number of compact Hausdorff spaces is compact. It is known that these statements are equivalent in ZF and are strictly weaker than the axiom of choice. An even weaker statement is the Hahn-Banach extension theorem which, in turn, is equivalent to the statement that all Boolean algebras bear a [0. 1 ] valued finitely additive measure; see Wagon A986, table on p. 214). M. Foreman and F. Wehrung A991) proved the following. THEOREM 11.1. The axioms of ZF together with the Hahn-Banach theorem imply that there exist non-Lebesgue measurable sets. Using some of their ideas, J. Pawlikowski A991) gave the following improvement.
120 ?. Laczkovich THEOREM 1 1.2. The axioms of ZF together with the Hahn-Banach theorem imply the Banach-Tarski paradox. These results say that rejecting the Banach-Tarski paradox we would also have to reject the Hahn-Banach theorem and the Boolean Prime Ideal theorem, and thus large parts of topology and functional analysis as well. But even the complete negation of the axiom of choice could not save us from paradoxes. Indeed, the existence of paradoxical plane sets (Theorem 1.2) and the Dougherty-Foreman paradox (Theorem 7.3) are proved without using the axiom of choice. The author is pleased to acknowledge the permission of Birkhauser to use portions of the material in the reference Laczkovich A994) for this chapter. References Adams, J.F. A954), On decompositions of the sphere. J. London Math. Soc. 29, 96-99. Adian, S.I. A983), Random walks on free periodic groups. Math. USSR Izv. 21, 425^434. Banach, S. A923), Sur le probleme de la measure. Fund. Math. 4, 7-33. Banach, S. A924), Un theoreme surles transformations biunivoques. Fund. Math. 6, 236-239. Banach, S. and Tarski, A. A924), Sur la decomposition des ensembles de point en parties respectivement congruents. Fund. Math. 6, 244-277. Becker, H. and Kechris, A. A993), Borel actions of Polish groups. Bull. Amer. Math. Soc. 28, 334-341. Becker. H. and Kechris, A. A996), The Descriptive Set Theory of Polish Group Actions. London Math. Soc. Lecture Note Ser., Vol. 232, Cambridge Univ. Press, Cambridge. Burke, M.R. B000), Paradoxical decompositions of planar sets of positive outer measure, submitted. Ceccherini-Silberstein, Т., Grigorchuk. R.I. and de la Harpe. P. A998). Decompositions paradoxales des groupes de Burnside. C. R. Acad. Sci. Paris 327. 127-132. Ceccherini-Silberstein, Т., Grigorchuk. R.I. and de la Harpe. P. A999). Amenability and paradoxical decompositions for pseudogroups and discrete metric spaces, Trudy Mat. Inst. Steklov. 224. 68-111 (in Russian). The web-site http://www.unige.ch/math/biblio/preprint/1998/ contains a summary in French including also a one page abridged English version. The file's name is cgh98.ps. Chuaqui. R.B. A969), Cardinal algebras and measures invariant under equivalence relation. Trans. Amer. Math. Soc. 142. 61-79. Chuaqui, R.B. A973), The existence of invariant cotmtably additive measures and paradoxical decompositions. Notices Amer. Math. Soc. 20. A636-A637. Ciesielski, K. A990), Isometrically invariant extensions of Lebesgue measure. Proc. Amer. Math. Soc. 110. 799- 801. Ciesielski, K. and Pelc. A. A985). Extensions of invariant measures on Euclidean spaces. Fund. Math. 125, 1-10. Dauben. J.W. A990), Georg Cantor, his mathematics and philosophy of the infinite. Princeton University Press, Princeton. NJ. Dedekind, R. A932), Was sind und Has sollen die ZahlenT. Gesammelte Mathematische Werke, Braunschweig. English translation: The nature and meaning of numbers. Essays on the Theory of Numbers, Dover. New York. Delcker, T.J. A956), Decompositions of sets and spaces. I and II. Indag. Math. 18, 581-589, 590-595. Deuber, W.A.. Simonovits, M. and Sos. V.T. A995). A note on paradoxical metric spaces. Studia Sci. Math. Hungar. 30, 17-23. Dougherty, R. and Foreman, M. A992). Banach-Tarski paradox using pieces with the property of Baire, Proc. Nat. Acad. Sci. USA 89 B2), 10726-10728. Dougherty, R. and Foreman. M. A994). Banach-Tarski decompositions using sets with the property of Baire. J. Amer. Math. Soc. 7 A), 75-124. Dubins. L., Hirsch, M. and Karush. J. A963). Scissor congruence. Israel J. Math. 1, 239-247. Eggleston, H.G. A963). Convexity. Cambridge Univ. Press, Cambridge. Elek, G. A998). Amenability. I?,-homologies and translation invariant junctionals, J. Australian Math. Soc. 65, 111-119.
Paradoxes in measure theory 121 Foreman, M. and Wehrung. F. A991), The Hahn-Banach theorem implies the existence of a non-Lebesgue measurable set. Fund. Math. 138. 13-19. Galileo Galilei A638), Discorsi e dimostrazioni matematiche. English translation: Dialoges Concerning Two New Sciences, Macmillan, New York A914). Gardner, R.J. A985), Convex bodies equidecomposable by locally discrete groups ofisometries. Mathematika 32, 1-9. Gardner, R.J. A989), Measure theory and some problems in geometry, Atti Sem. Mat. Fis. Univ. Modena 39, 39-60 (Proceedings of the Conference on Real Analysis and Measure Theory held in Capri, 1988). Gardner, R.J. and Laczkovich, M. A990), The Banach-Tarski theorem on polygons, and the cancellation law, Proc. Amer. Math. Soc. 109, 1097-1102. Hall, M. Jr. A948), Distinct representatives of subsets. Bull. Amer. Math. Soc. 54, 922-926. Hausdorff. F. A914), Bemerkung iiber den Inhalt von Punktmengen. Math. Ann. 75, 428^433. Hewitt, E. and Ross, K.A. A979), Abstract Harmonic Analysis. 2nd edn. Springer, Berlin. Just, W. A988a), ? bounded paradoxical subset of the plane. Bull. Polish Acad. Sci. Math. 36, 1-3. Just. W. A988b), Equivalence under decomposition. A graph-theoretical approach, Jahrb. Kurt-Godel-Ges., 111- 117. Konig, D. A926), Sur les correspondances multivoque des ensembles. Fund. Math. 8, 114-134. Konig, D. and Valko. S. A925), Uber mehrdeutige Abbildungen von Mengen, Math. Ann. 95. 135-138. Kuipers, L. and Niederreiter, H. A974), Uniform Distribution of Sequences, Wiley, New York. Laczkovich, M. A988a), Von Neumann's paradox with translations. Fund. Math. 131. 1-12. Laczkovich, M. A988b). Closed sets without measurable matching, Proc. Amer. Math. Soc. 103. 894-896. Laczkovich, M. A990), Equidecomposability and discrepancy; a solution ofTarski's circle-squaring problem, J. Reine Angew. Math. 404, 77-117. Laczkovich, M. A991a), Invariant signed measures and the cancellation law, Proc. Amer. Math. Soc. Ill, 421- 431. Laczkovich, M. A991b), Equidecomposability of sets, imariant measures, and paradoxes, Rend. Istit. Mat. Univ. Trieste 23, 145-176. Laczkovich, M. A992a), Uniformly spread discrete sets in K''. J. London Math. Soc. 46, 39-57. Laczkovich, M. A992b), Decomposition of sets with small boundary. J. London Math. Soc. 46, 58-64. Laczkovich, M. A992c), Paradoxical decompositions using Lipschitz functions, Mathematika 39, 216-222. Laczkovich, M. A993), Decomposition of sets of small or large boundary, Mathematika 40, 290-304. Laczkovich, M. A994), Paradoxical decompositions: a sur\ey of recent results. First European Congress of Mathematics (Paris, July 6-10, 1992), Vol. II, Progr. Math.. Vol. 120, Birkhauser, Basel. 159-184. Laczkovich, M. A996), Decomposition using measurable functions, C. R. Acad. Sci. Paris 323, 583-586. Laczkovich, M. A999), Paradoxical sets under SLt(R), Annales Univ. Sci. Budapest 42. 141-145. Laczkovich, M. B001), On paradoxical spaces. Studia Sci. Math. Hungar. 38, 267-271. Lindenbaum, A. A926), Contributions а Г etude de I'espace metrique I. Fund. Math. 8, 209-222. Lovasz, L. and Plummer, M.D. A986). Matching Theory. Akademiai Kiado, Budapest. Lubotzky, A. A994), Discrete Groups, Expanding Graphs and Invariant Measures, Progr. Math., Vol. 125, Birkhauser, Basel. Matousek, J. A997), On Lipschitz mappings onto a square. The Mathematics of Paul Erdos, Vol. II, R.L. Graham and J. NesetFil, eds, Springer, Basel, 303-309. Mauldin, R.D. (ed.) A981), The Scottish Book. Birkhauser, Basel. Mazurkiewicz, S. and Sierpinski, W. A914), Sur un ensemble superposables avec chacune de ses deux parties, C. R. Acad. Sci. Paris 158, 618-619. Moore, G.H. A982), Zermelo's Axiom of Choice, Its Origins, Development, and Influence, Springer, Berlin. Mycielski, J. A954), On a problem of W. Sierpinski concerning congruent sets of points. Bull. Acad. Pol. Sci. CI. Ill 2, 125-126. Mycielski, J. A955), About sets with strange isometrical properties (I), Fund. Math. 42, 1-10. Mycielski, J. A977), Two problems on geometric bodies, Amer. Math. Monthly 84, 116-118. Mycielski, J. A979), Finitely additive invariant measures I, Colloq. Math. 42, 309-318. Mycielski, J. A989), The Banach-Tarski paradox for the hyperbolic plane. Fund. Math. 132, 143-149. Mycielski, J. A998), Non-amenable groups with amenable action and some paradoxical decompositions in the plane, Colloq. Math. 75, 149-157.
122 ?. Laczkovich Mycielski, J. and Wagon, S. A984). Large free groups of isometrics and their geometrical uses, Enseign. Math. 30, 247-267. von Neumann, J. A929), Zur allgemeinen Theorie des Masses. Fund. Math. 13, 73-116. Niederreiter, H. and Wills, J.M. A975). Diskrepanz und Distanz von Massen heziiglich konvexer und Jordanscher Mengen, Math. Z. 144, 125-134. Ol'shanskii, A.Yu. A980), On the problem of existence of an invariant mean on a group. Russian Math. Surveys (UspekhiK5, 180-181. Oxtoby, J.C. A980), Measure and Category. Graduate Texts in Math., Vol. 2, 2nd edn.. Springer, Berlin. Paterson, A.L.T. A986), Nonamenability and Borel paradoxical decompositions for locally compact groups. Proc. Amer. Math. Soc. 96, 89-90. Paterson, A.L.T. A988), Amenability, Math. Surveys Monographs. Vol. 29, Amer. Math. Soc.. Providence. RI. Pawlikowski, J. A991), The Hahn-Banach theorem implies the Banach-Tarski paradox. Fund. Math. 138, 21-22. Penconek, M. A991), On nonparadoxical sets. Fund. Math. 139. 177-191. Plappert. P. A995), A sandwich theorem for monotone additive functions. Semigroup Forum 51, 347-355. Preiss, D. A992), Manuscript. Rado, R. A949), Factorization of even graphs. Quart. J. Math.. Oxford Ser. 20, 95-104. Robinson. R.M. A947). On the decomposition of spheres. Fund. Math. 34, 246-260. Rosenblatt, J.M. A974), Invariant measures and growth conditions. Trans. Amer. Math. Soc. 193. 33-53. Rosenblatt, J.M. A981), Uniqueness of invariant means for measure-pre ser\-ing transformations. Trans. Amer. Math. Soc. 265, 623-636. Schmidt, W. A964), Metrical theorems on fractional parts of sequences. Trans. Amer. Math. Soc. 110, 493-518. Sherman, G.A. A990a), On bounded paradoxical subsets of the plane. Fund. Math. 136. 193-196. Sherman, G.A. A990b), Paradoxical subsets of hyperbolic and spherical discs, Geom. Dedicata 34. 125-138. Sherman, G.A. A991), Properties of paradoxical sets in the plane, J. Geom. 40. 170-174. Sierpinski, W. A918), L'axiome de M. Zermelo et son role dans la theorie des ensembles et /'analyse. Bull. Intem. Acad. Sci. Cracovie A, 97-152 (Oeuvres Choisies, Vol. II. PWN, Warszawa. 1975, 208-255). Sierpinski, W. A945a), Sur le paradoxe de MM. Banach et Tarski. Fund. Math. 33. 229-234 (Oeuvres Choisies. Vol. Ill, PWN, Warszawa, 439-444). Sierpinski, W. A945b), Sur la paradoxe de la sphere. Fund. Math. 33. 2.35-244 (Oeuvres Choisies, Vol. III. PWN. Warszawa. 445^453). Sierpinski, W. A946). Sur la non-existence des decompositions paradoxales d'ensembles lineaires. Actas Acad. Nac. Ciencias Lima 9, 113-117. Sierpinski, W. A954), On the Congruence of Sets and Their Equivalence by Finite Decomposition. Lucknow. Reprinted by Chelsea, 1967. Straus. E.G. A957), On a problem ofW. Sierpinski on the congruence of sets. Fund. Math. 44, 75-81. Tarski, A. A925), Probleme 38, Fund. Math. 7. 381. Tarski, A. A929). Sur les fonctions additives dans les classes abstraites et leur applications au probleme de la mesure. С R. Seances Soc. Sci. Lettres Varsovie. CI. Ill 22. 114-117. Tarski. A. A938a), liber das absolute Mass linearer Punktmengen, Fund. Math. 30, 218-234. Tarski, A. A938b), Algebraische Fassung des Massproblems. Fund. Math. 31. 47-66. Tarski, A. A949), Cardinal Algebras, Oxford Univ. Press. Oxford. Tits, J. A972), Free subgroups in linear groups. J. Algebra 20. 250-270. Tricot. С Jr. A982). Two definitions of fractional dimension. Math. Proc. Cambridge Philos. Soc. 91, 57-74. Truss, J.K. A990), The failure of cancellation laws for equidecomposability types. Canad. J. Math. 42, 590-606. Vitali, G. A905), Sul problema delta mesura dei gruppi dipunti di una retta. Bologna. Wagon, S. A986). The Banach-Tarski Paradox, 2nd edn. Cambridge Univ. Press. Cambridge. First paperback edition, 1993. Wehrung. F. A990), Theoreme de Hahn-Banach et paradoxes continus ou discrets. С R. Acad. Sci. Paris Ser. I 310, 303-306. Wehrung. F. A992), Injective positively ordered monoids I. II. J. Pure Appl. Algebra 83, 43-82, 83-100. Wehrung. F. A993). La quadrature geometrique continue de cercle. Seminaire d'Initiation a l'Analyse Exp.. No. 6, Publ. Math. Univ. Pierre et Marie Curie, Vol. 104. Univ. Paris VI. Paris. Wehrung, F. A994a). The universal theory of ordered equidecomposability types semigroups. Canad. J. Math. 46, 1093-1120.
Paradoxes in measure theory 123 Wehrung, F. A994b), Baire paradoxical decompositions need at least six pieces. Proc. Amer. Math. Soc. 121. 643-644. Willis, G.A. A988), Continuity of translation invariant linear fiinctionals on Cq(G) for certain locally compact groups G, Monatsh. Math. 105. 161-164. Zakrzewski, P. A990). Extensions of isometricallv invariant measures on Euclidean spaces. Proc. Amer. Math. Soc. 110. 325-331. Zakrzewski, P. A991), Paradoxical decompositions and invariant measures, Proc. Amer. Math. Soc. 111. 533- 539. Zakrzewski, P. A993). The existence of invariant probability measures for a group of transformations, Israel J. Math. 83, 343-352. Zakrzewski, P. A994), When do sets admit congruent partitions. Quart. J. Math. Oxford Ser. B) 45, 255-265. Zakrzewski, P. A995), Extending isometricallv invariant measures on K" -a solution to Ciesielski's query. Real Analysis Exchange 21, 582-589. Zakrzewski, P. B002), Measures on algebraic-topological structures. Handbook of Measure Theory, E. Pap. ed.. Elsevier. Amsterdam, 1091-1130. Zermelo, E. A904), Beweis, dassjede Menge wohlgeordnet werden kann. Math. Ann. 59. 514-516.
CHAPTER 4 Convergence Theorems for Set Functions Paolo de Lucia Universita "Federico 11", Dipartimento di Matematica e Applicazioni "Renato Caccioppoli". 80126 Napoli, Italy- E-mail: padeluci@unina.it Endre Pap* Institute of Mathematics, University o/Novi Sad Trg D. Obradovica 4. 2I0O0 Novi Sad, Yugoslavia E-mail: pap@im.ns.ac.yu; pape@eunet.yu Contents 1. Introduction 127 2. Classical results 129 2.1. Exhaustive set functions 129 2.2. Cafiero uniform exhaustivity theorem 130 2.3. Nikodym convergence theorem 132 2.4. Vitali-Hahn-Saks theorem 133 2.5. Nikodym boundedness theorem 135 2.6. Drewnowski lemma 137 2.7. Cafiero theorem for additive set functions 138 2.8. Brooks-Jewett theorem and related results 139 2.9. Vitali-Hahn-Saks theorem for additive case 139 2.10. Counter-examples on algebras '43 2.11. Algebras with SCP and SIP 144 3. Related convergence theorems 146 3.1. Integration with respect to vector measures 146 3.2. Orlicz-Pettis theorem 146 3.3. Schur lemma and Phillips lemma 149 3.4. Rosenthal's lemma 150 3.5. Hewitt-Yosida theorem 152 *This chapter was written during the visit of the second author to the university "Federico II". Naples, as visiting professor for Ph.D. students, supported by INdAM, Italy, in the period May-June. 2000, and finished during the visit to Naples, supported by MURST. in March. 2001. The second author wants to thank for the partial financial support of the Project in the Fields of Basic Research "Mathematical models of nonlinearity, uncertainty and decision" A866) supported by Ministry of Science, Technology and Development of Serbia. HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 125
126 R de Lucia and E. Pap 3.6. Biting Lemma 153 4. The relation between boundedness and exhaustiveness '54 5. Diagonal theorem for triangular set function 156 5.1. ?-boundedness 157 5.2. Diagonal theorems 160 5.3. SCP and SIP algebras 162 6. Dieudonne type theorems '63 6.1. Triangular set functions 164 6.2. Convergence theorem 168 7. Convergences of measures related to partial order '69 7.1. Nikodym type theorems for lattice-valued measures '69 7.2. Convergences of measures on orthomodular posets '70 7.3. General Nikodym type theorems '71 References '74
Convergence theorems for set functions 127 1. Introduction Nikodym boundedness and convergence theorems belong to the most important theorems in measure theory. First of them says that a family ? of countable additive signed measures ?, defined on a ?-algebra ? of subsets of a set S, which is pointwise bounded, i.e., for each ? e ? there exists Me > 0 such that \?(?)\<?? (?? ?), is uniformly bounded, i.e., there exist ? > 0 such that |?(?)| <? (??? ?&?) (see, e.g., Dunford and Swartz A958, Section 2.5)). Nikodym convergence theorem, Section 2.3, says that a pointwise convergent sequence {?„}„,=[.: of countable additive measures defined on ana-algebra ?. i.e., ???,,^,? ?,,(?) = ?(?), ? & ?, (i) converges to a countable additive measure ?. (ii) {?,,?/ieN is uniformly ?-additive. The evolution of these theorems reflects the development of today integration theory. Namely, Vitali proved in 1907, that if {/„}„ei; is a sequence of functions from L](k) (? is the Lebesgue measure) such that lim/,-,.^ /„ = /, then lim„^.xfA f„ dk = fAfdk for all measurable set A if and only if {/ /„ dk}„€\; is an equi-X-continuous sequence. Lebesgue, in 1909, proved that if {fn}n€\; is a sequence of functions from ?-'(?) such that (*): НгПп^эс/д /„dk = 0 for every measurable set A, then {/ f„dk}„€i; is an equi-X-continuous sequence. In 1922 Hahn improved Lebesgue theorem removing the assumption (*) by the condition that lim,,^^ fA f„ dk exists for every measurable set A. O. Nikodym A931, 1933) extends this convergence result in an abstract settings as formulated above (the part (i)). Saks A933) has given a Baire Category proof, since till that the proofs were made by sliding hump method. The part (ii) was observed by Doubrovski in 1947 and Pettis in 1951 as a part of Saks proof from 1933. Further generalization of Brooks and Jewett A970) (stated first by Ando A961)) says that the limit of the sequence of finitely additive scalar and exhaustive set functions on ?- algebra is again additive and exhaustive, Section 2.8. The method of proof is going back to sliding hump method, since the Baire category methods are unsuitable for finitely additive case. As a generalization of finitely additive vector measures of bounded variation and countably additive measures on ?-algebra Rickart A942) introduced the notion of a strongly bounded vector measure (s-bounded), which we shall call exhaustive. This notion is closely related to countably additivity, Section 2.1. and boundedness, Section 4. Cafiero result A952) is related to uniform exhaustivity, Sections 2.2 and 2.7, as a very useful matrix type lemma. It is known that for additive set functions the Brooks-Jewett theorem is equivalent with Nikodym convergence theorem, and even more with Vitali-Hahn-Saks theorem (Section 2.4), Drewnowski A972b), which says that the limit of sequence of signed
128 P. de Lucia and E. Pap measures which are continuous with respect to a measure is also continuous with respect to this measure. As it is well-known, the Nikodym convergence theorem for measures in general fails for algebras of sets, Section 2.10. But there are convergence theorems in which the initial convergence conditions are on some subfamilies of a given ?-algebra; those subfamilies may not be ?-algebras. The considered algebras are with the Sequential Completeness Property (SCP) and the Subsequentional Interpolation Property (SIP) (see Constantinescu A981), Schachermayer A982), Freniche A984) and H. Weber A986, Section 2.11)). Adding the topology to the basic set S, a famous theorem of Dieudonne A951) states that for compact metric spaces the sequence of regular measures which converges on open sets, converges on all Borel sets, see Brooks and Chacon A980), Section 6. Many important convergence results are closely related to Nikodym convergence and boundedness theorems (Section 3): Orlicz-Pettis theorem, Shur lemma, Phillips lemma and Hewitt-Yosida theorem. In Section 3 we present also Rosenthal's lemma A968) and Biting lemma as very useful methods in convergence problems. The Nikodym boundedness theorem for measures in general also fails for algebras of sets, Section 2.10. But there are uniform boundedness theorems in which the initial boundedness conditions are on some subfamilies of a given ?-algebra; those subfamilies may not be ?-algebras. As a related result to the theorem of Dieudonne A951) we have that for compact metric spaces the pointwise boundedness of a family of Borel regular measures on open sets implies its uniform boundedness on all Borel sets. We shall present a generalization of Dieudonne's theorem on a wider class of set functions. The class of finitely additive regular Borel set functions gives nothing new, because each finitely additive regular Borel set functions (also in the case of vector measures) is necessarily countable additive. We present a generalized Dieudonne type theorem for the class of ^-triangular set functions (Pap, 1987), see also Pap A995), Section 6. Nikodym boundedness and convergence theorems were generalized not only with respect to the domain (Sections 2.11, 5.3, 7.2 and 7.3) of the measure but also with respect to the range and the properties of the considered measures. So there are vector, group and even semigroup valued measure versions of Nikodym boundedness theorem, Sections 5.2 and 7.1. On the other side, the interest for non-additive set functions (submeasures, ^-triangular set functions (Section 5.1), decomposable measures, null- additive set functions) is growing up mainly of the need of many different applications. There are recent investigations of set functions with values in sets endowed only with some topological structures without any algebraic operations: metric space (Pap, 1988), special topological spaces (Klimkin, 1989), uniform spaces (Pap, 1991), see Pap A995), Section 7.3. We shall present in this chapter many of the mentioned convergence theorems together with the related important convergence results and giving their quite elementary proofs based on diagonal theorems (matrix methods), Section 5. Diagonal theorems as a further extensions of sliding hump method were initiated by Mikusinski A970). Many results in measure theory and functional analysis can be found in the monographs by Pap A982), Antosik and Swartz A985), Pap A995), Swartz A996) (see also Diestel and Uhl A977)), where the method of diagonal theorems is used instead of the usually used Baire Category
Convergence theorems for set functions 129 Theorem (as, e.g., in Dunford and Swartz A958)), which is unsuitable for obtaining more general results. 2. Classical results 2.1. Exhaustive set functions The notion of a strongly bounded vector measure was introduced by Rickart A942) as a generalization of finitely additive vector measures of bounded variation and countably additive measures on ?-algebra. Let ?? be a ring and ? a family of finitely additive set functions defined on 1Z. A finitely additive set function m :7? —>¦ R is exhaustive (strongly additive or strongly bounded) if lim^oo m{Ej) = 0 for every sequence {Ej}j€n of pairwise disjoint elements from ??. A family ? of additive set functions is uniformly exhaustive if lim^oc m{Ej) = 0 uniformly in m e ?4 for every sequence {Е;}/^ of pairwise disjoint elements from ??. It is obvious that a finite measure on a ?-ring is exhaustive. PROPOSITION 2.1. Let ? be a family of finite additive exhaustive set functions on 7?.. The following statements are equivalent (i) M. is uniformly exhaustive, (ii) for every increasing sequence {?/heH the sequence {w(?'i)}ieN is a convergent sequence uniformly for m e ?4, (iii) for every disjoint sequence {?/}/ек of TZ the series ?,¦»!(?,¦) 's convergent uniformly for m e Ai. PROOF, (i) => (ii). Suppose that ? is not uniformly exhaustive, then there exists an increasing sequence {f/Ьек of 7^ such that lim m(Ej) =0 not uniformly in M. Then there exists ? > 0 such that for every r, q in N there exists ? ^ r and m eM such that \m(Ep) -m(Eq)\ > ?. By induction it is the possible to construct a subsequence (?Рл}/Fщ of {?,-},-ек and a sequence {w„}„epj of ? such that |m„(?p„ \ ?,,„_,) | > ? for every и е N, and this is a contradiction because {EPn \ El>n_i }„ер; is a disjoint sequence of 7?. (ii) => (iii). It is enough to observe that if {?,-};ек is a disjoint sequence of TL then ?/}/eN is an increasing sequence of ??. Therefore {53y=i w(^y))ieK f°r m ? Л4 are convergent sequences uniformly in m e Л4. (iii) => (i) is obvious. ?
130 ? de Lucia and E. Pap PROPOSITION 2.2. Let ? be a family of finite additive exhaustive set functions on К. Then (i) M. is uniformly exhaustive, implies (ii) for every decreasing sequence {?,-},-er: the sequence {m(Ei)}i€f: is a convergent sequence uniformly for m e ?4. If TZ is an algebra then (i) and (ii) are equivalent. PROOF. It is enough to observe {E\\ ?,-},· er; is an increasing sequence of 1Z an to refer to the Proposition 2.1. In a similar way we prove the second part of the proposition. ? Let ? be a ?-algebra and ? family of ?-additive set funciions defined on ?. A family ? of countable additive measures ? : ? —>¦ ? is uniformly countable additive if lim Vm(?;)=0 >DC uniformly in ? e ? for every sequence {Ej}j€r. of pairwise disjoint elements from ?. We have by Propositions 2.1 and 2.2. PROPOSITION 2.3. Let ? be a ?-algebra and ? family of ?-additive set functions defined on ?. The following statements are equivalent (i) ?4 is uniformly exhaustive. (ii) ?4 is uniformly countable additive. (iii) for every decreasing sequence {?;};ef: of ? the sequence {?(??)},??; is a convergent sequence uniformly for ? e ?. 2.2. Cafiero uniform exhaustivity theorem We present here a result of Cafiero A952) related to the uniform exhaustiveness. First we shall give an elementary matrix type lemma. LEMMA 2.4. Let (a,,,),,.,^· be an infinite matrix of real numbers such that A) for every ? e N and every subset I of ? there exists X];e/ a,u; B) for every sequence {//theN of pairwise disjoint subsets of ? and for every ? > 0 there exists к e N and «oeN such that ???? < ? for every ? > «о. Then lim a„i = 0 uniformly in ? e N.
Convergence theorems for set functions 131 PROOF. We note that if the matrix (a„,-)„.jeN has the properties A) and B) then also every its submatrix has the same properties. By A) we have lima,„-=0 (neN). A) /—>00 Then we need to prove that: for every ? > 0 there exist k,s eN such that |?„,-| < ? for every /' > к and every ? > s. Suppose that this is not true. Then there exists ? > 0 such that for every k,s eN there exist /' > к and ? > s such that \а,„ | > ?. By induction we can then construct two strictly increasing sequences {цк-ен an(J {i/theN of natural numbers such that \a„Kik | ^ ? for every к eN. Therefore there exists a submatrix of the starting matrix, which we denote by the same symbol (a„/)„.ier; such that it satisfies A), B) and for some ? >0. C) |?«| > ? for every к е N. Let {ar}repj be a decreasing sequence of real numbers such that reN from B) we have that for every r e N there exist /', and sr such that \a,ur\ <°r for every ? ^ mr. We can suppose that the sequences {/VlreN and {ir}reN are strictly increasing and so to construct a new submatrix of (am)w.ieN. which we denote by the same symbol (a„;),M-epj such that it satisfies A), B), C) and for every j e ?, |?„, | < ?, for all ? ^ st. B) Now we construct by induction two strictly increasing sequences {p;,b,eN and {г„}„ем of natural numbers with the property p„ > r„ > max{/o„_ ?, ???_,} for every ? e N, where po = so = 0, such that we have for every ? e N, \<>rki I < ?,„ for every / > p„ and к = 1 ?. C) Suppose that Г|,... ,r„_i; у0| , ??-? are determined and let be r„ an element of N such that r„ > max{pn-\,srn_l}, by A) we can find p„ such that p„ > r„ and C) is true. Consider now the submatrix (ar„r,)n./sH. for и e N and / = l,...,n — 1, we have r„ > ??; and then, by B) \аГпГ \ < ??, and for /' ^ ? + 1 it results r, > p,-_i, л ^ / — 1,
132 P. de Lucia and E. Pap and then, by C) \аГпП | < ???_,. Hence for an infinite subset / of N and ? e / we have byC) Y^ar„r, > KrJ- ? \??,?\>?-??? = 4- ie/ /e/. <?? /eK If {h }k€N is a disjoint sequence of infinite subsets of N we have for every ieN ??^ > — for all ? & h 2 and this contradicts B). Let ? be a ?-algebra in the whole chapter. D THEOREM 2.5 (Cafiero). A sequence {?„}„?? of countable additive real measures is uniformly exhaustive if and only if the following condition holds (a) for every sequence {En}„€^ of pairwise disjoint elements of ? and every ? > 0 there exist k, no e N such that \??(?0\ < ? for every ?^??- PROOF. Let {Еп}п€ц be a sequence of pairwise disjoint elements of ?. Then {??{??))„?€? is an infinite matrix of real numbers. By countable additivity of every ?„ we have that for every « e N and / с N there exists J2i€l ??(??). If {Ik)keN is a sequence of pairwise disjoint subsets of N, then {[Ji€l ?,-}/tei; is a sequence of pairwise disjoint elements of ? and then by (a) for every ? > 0 there exist k. «0 e N such that < ? for every и ^ no. Therefore the conditions A) and B) of Lemma 2.4 are satisfied and we have that lim ?,,(??) = 0 uniformly in ? e N. D 2.3. Nikodym convergence theorem THEOREM 2.6 (Nikodym convergence theorem). Let {?„}/<?? be a pointwise convergent sequence of countable additive measures defined on an ?-algebra ?, i.e., lim ?„(?) = ?(?). Ее ?. D) then
Convergence theorems for set functions 133 (? {?????? converges to a countable additive measure ?, (ii) {?/,}??? is uniformly ?-additive. PROOF. We consider first a special case. If {?„}/<?? is pointwise convergent to zero then condition (a) in Theorem 2.5 are satisfied. Then {??},?€? is uniformly exhaustive and by Proposition 2.3 it is uniformly ? -additive. The general case for (ii), i.e., under the condition D), easily follows by the fact that by D) [??(?)}„€^ is a Cauchy sequence. Namely, suppose that (ii) does not hold for {M/iL?eN, ie., that there is a sequence of pairwise disjoint sets {?,,?,,^? from ?, a subsequence {?*„}„?? of {?„}/,?? and ? > 0 such that |?*„(?/?·„)| > 2?. By exhaustivity of ??? there exists a subsequence {р„}„€н of {fc,ibieN such that \????(??>?^?)\ ^ ?. Taking m„ = ??„+? — ??„ we obtain a sequence {т„},1€^ of countable additive set functions which is pointwise convergent to zero and therefore by the previously proved part it is uniformly countable additive, but this is in a contradiction with \m„(EPn+i)\ > \???+?(???+?)\-\?,)?(?,,?+])\^? for all ? e N. To prove (i), we have to use (ii). Namely, by (ii) we have ? ?j ) = lim ?, ( ? ?j ) = lim lim ][>,(?,) D 2.4. Vitali-Hahn-Saks theorem Absolute continuity of functions was introduced by Vitali A905) who proved that a real valued function on the unit interval is absolutely continuous if and only if it is the integral of its derivative. Absolute continuity of set functions was introduced by Radon A919). Let ? be a ?-algebra. Let ?: ? -> [0, +oo] be a non-negative set function and ? a signed measure defined on ?. We say that ? is absolutely continuous with respect to ?, denoted by ? <<C ?, if for every ? > 0 there exists S > 0 such that \v(E)\ < ? whenever ? e ? and ?(?) < S. A sequence {v„}ll€i-: is uniformly absolutely continuous with respect to ? if for every ? > 0 there exists ? > 0 such that \v„(E)\ < ? whenever ? e ?, ? e N, and ?(?) < S. THEOREM 2.7 (Vitali-Hahn-Saks theorem). For any sequence {?„},?€? of signed measures v„ which are absolutely continuous with respect to a measure ?, i.e., v„ «д, and for which lim^oc v„(E) = v(E) exists for each ? e ?. hold (i) the limit ? is also absolutely continuous with respect to this measure, i.e., ?«?; (ii) {l7,}„i=n is uniformly absolutely continuous with respect to ?.
134 P. de Lucia and E. Pap As we already mentioned this theorem is closely related to integration theory. Namely, if {/wbieN is a sequence of functions from L'([0. 1]), where ? = ? is the Lebesgue measure, and lim / /„ dk = v(E) exists for each measurable set E, then the sequence {/ f„ dk},,^ is uniformly absolutely ?-continuous and ? is absolutely ?-continuous. Proof of Vitali-Hahn-Saks theorem. We shall prove (ii). First we consider a special case and we prove that if {Ay}yei; is a decreasing sequence with ?(?> ^y) = 0 then we have uniformly in /' lim vj(Aj) = 0. E) Let A — ?] ¦ Aj by the ?-absolute continuity of every v, we have v,(A) = 0. Then E), uniformly in /', follows from the Nikodym convergence Theorem 2.6. Consider now the general case for (ii). Suppose that (ii) is not true. Then there exists ? > 0 such that for each S > 0 there is ? e N and ? e ? such that |?„(?)|^? and ?(?)<8. Take <5| > 0, n\ = 1 and E\ = 0. Then we can define by induction a subsequence {/;,},ек and sequence {?,-};epj from ? and {5;};еь:. <5, > 0 such that <5, + i <<5,Д \v,,,+,(Ei+i)\>?. ?(?-? + \)<8?/2 and \vni+l(A)\<e/2 ????(?) <Si+[. Take A, = Utli ^ · Then by countably subadditivity ?(??\??)<??(?? + ?)<: ? ?(?? < ? | < ? у^ш = S" k=i + \ k=i k=i i.e., ?(?,\?,)<5, F) and therefore ? ? ? K,+ |(A'+I \^i + l)| < у
Convergence theorems for set functions 135 Then we have III I ? K+l(?/+l)| = K,*i<Ai + l) - v„,ri(Ai + \ \ ?/ + 1I > 2 and therefore \vn,{Ai)\>\- G) But {A/},-epj is decreasing and for A = ?); ?< we obtain by F) that ?(?) = 0. Then by the special case we have E) uniformly in /'. Contradiction with G). Obviously (ii) implies (i). ? Phillips A940) and Rickart A942) extend Vitali-Hahn-Saks theorem for measures with values in a locally convex topological vector space. There are further generalizations for functions defined on orthomodular lattices and with more general properties (see Antosik and Swartz A985), Pap A995)). 2.5. Nikodym boundedness theorem Nikodym boundedness theorem was obtained for countably additive scalar measures on ?-algebras by Nikodym A931, 1933). The proof by Baire category was given by Saks A933). It has been generalized by Grothendieck A957) for bounded vector measures. Theorem 2.8 (Nikodym boundedness theorem). A family ? of countable additive signed measures ?, defined on ? ?-algebra ?. which is pointwise bounded, i.e., for each ? e ? there exists Me > 0 such that \?{?)\<?? (?? ?). is uniformly bounded, i.e., there exist ? > 0 such that |?(?)| <? (? e ?, ? e ?). PROOF. It is clear that it is enough to prove the theorem for sequences of measures. We start to prove that for every disjoint sequence {E,},¦ e[; of ? the set {??(??): neN, /eN} is bounded. Then we need to prove that there exists ? ^ 0 and h such that |?„ (?,) I ^ ? for every ? ^ m and for every / ^ h.
136 P. de Lucia and E. Pap Otherwise there exist two strictly increasing sequences of ?: {?,,?,^? and {/,;},;i=n such that \????(???)\ ^ ? for every ? e N. (8) Let {a„}„epj be an infinitesimal sequence of positive real numbers. Clearly the sequence {?„?„}?€^ verifies Nikodym theorem 2.6 hypothesis then it is uniformly exhaustive. It follows that the set {?„?„(?,): ???,?'??) is bounded. By (8) it follows that the sequence {—} »eN is bounded for every infinitesimal sequence of positive real numbers. A contradiction. Now we consider the general case and suppose that the theorem is not true. Then sup{^„(?)|: ? e ?, ? e ?} =oo. Therefore for any ? > 0 there is a partition (A. B) of X and ? e N such that ????{|?„(?)|, |?„(?)|} > ?. Namely, by |?„(?)| > ? + sup^;(X)| we have also \?„(? \ A)| > |?„(?)| - \?„(?)\ > ?. Therefore there exists ? ? and a partition (? ?. B\) such that гтп{|мл,(А|)|, |m„, (??)|} > M- Then either sup{^„(Cn A|)j: ? eN. С е ?} = ?? or sup{|M„(CnB|)|: ne N. Ce ?) = oo. Taking whichever ? ? or ? ? satisfies one of the preceding condition and denote it by F| and put ?| = X \ F\. Taking now F\ as X in the preceding procedure we obtain a partition (Eo, Fi) of F\ and m > n\ such that |?„2(?2)|>2 and sup{^„(C ? F2)\: ? e N, С е Г} = oo.
Convergence theorems for set functions 137 Continuing this procedure we obtain by induction a subsequence {и,-},·ек and a sequence {?/};eN of pairwise disjoint sets from ? such that |м«,-(Е/)| > '· Contradiction with previously proved boundedness on disjoint sequences. D 2.6. Drewnowski lemma To extend the previous results to the finitely additive case we will prove a very useful lemma obtained by Drewnowski A972b). Lemma 2.9 (Drewnowski lemma for monotone set functions). If m:lZ—>- R is an exhaustive monotone set function with m@) =0 and {Е„}„ец is a sequence of pairwise disjoint sets from TZ, then there exists a subsequence {E^ }n€i; о/{?'„}„ек such that m is order continuous on the ?-ring 1Z\ generated by {Е^„ }„ем. i.e., limm(A„) = 0 for every sequence {An}„^ifrom TZ\ such that A„ \ 0. PROOF. Let {./,}/eN be a sequence of pairwise disjoint infinite subsets of N. Then {Utev, ?/tL'eN is a sequence of pairwise disjoint subsets of ?? and by the exhaustiveness of m there exists an infinite subset ? ? of N such that ™(U?*)<i In the same way, if {У,},ен is a sequence of pairwise disjoint infinite subsets of N\ \ {minN|}, we can find an infinite subset N2 of N\ \ {minN|} such that m(liEk)<^· Xk€N2 ' By induction we construct a decreasing sequence {N, },ек of infinite subsets of N such that W/+I ? Nj \ {minNj}, mi [J Ek\ < — forevery/' e N. Hew, ' Let kj = minNj and let TZ\ be the ?-ring generated by {E/t„}Hei·:· Let {A„},iei; be a decreasing sequence of elements of 7Z| such that С\11€^Ап = 0. If there exists m e N such that A,, = 0 for every ? ^ m then the theorem is proved. Therefore we suppose that A„ ? 0, и е N. For any и e N let /„ be a subset of N such that A„=\jEki.
138 P. de Lucia and E. Pap Obviously /,,+i с /„ and because none of the Ekj is enclosed in ПпеК^" we nave a'so ????,,^?? /„ = oo. For arbitrary ? > 0 we choose peN such that 1/2'' < ?. Then we have for ? ^ ? that A„ = (J Ek, с (J Ekl i'e/M ieN/, and therefore m(A„) < 1/2" < ? for every и ^ p. ? Lemma 2.10 (Drewnowski lemma for additive set functions). // ?,:7? —>¦ R is a sequence of exhaustive additive set functions and {E„}„ef. is a sequence of pairwise disjoint sets from 1Z, then there exists a subsequence {?*„}„еь: of' {E„}„€u suc^ that ?, is countable additi ve on the o-ring 7Z\ generated by {Ekfl }nei·· PROOF. Let for every ? e N denote by |?„ | the total variation of ?„. We introduce the set function m : 7? —>¦ R by ?1 IMhI . „??2+??" where M„ = sup{^„|(A): A e1Z]. Since the function m satisfies the conditions of Drewnowski Lemma 2.9 there exists a ?-ring 7?| generated by a subsequence {??„}«eK of {??}?,6? such that the restriction of m to Я ? is order continuous. Then easily follows that the restrictions of |?„| to TZ \ are order continuous and therefore every ?„ is countable additive on ?? \. ? 2.7. Cafiero theorem for additive set functions We will generalize Theorem 2.5 to the additive case. THEOREM 2.11 (Cafiero). A sequence {?„}??? of finite additive bounded set functions is uniformly exhaustive if and only if the following condition holds (a) for every sequence {E„}„€t: of pairwise disjoint elements of ? and every ? > 0 there exist к, щ e N such that | ?? (Ej.) | < ? for every ? ^ iiq. PROOF. Suppose that the {?„}„,=?; do not are uniformly exhaustive. Then there exists a disjoint sequence {?, };eN of ? such that lim,,^^ ?,,(??) =0 but not uniformly in и. Then there exists ? > 0 such that we can construct a subsequence of {?,},?=? and a subsequence °f {M/ilneN. that for simplicity, we will denote yet by the same symbols such that ?„ (?„) | > ? for every ? e N. (9)
Convergence theorems for set functions 139 By Drewnowski Lemma 2.10 there exists a subsequence {Ejk }^€щ of {?;},ек such that if ?\ is a ?-algebra generated by {?,t }^еь' the restriction of ?„ to ?\ are ?-additive. From Theorem 2.5 it follows that these restrictions are uniformly exhaustive but by (9) we have | ?,;. {Eik) | > ? for every к eN. A contradiction. ? 2.8. Brooks-Jewett theorem and related results A generalization of the Nikodym convergence theorem was obtained by Brooks and Jewett A970). THEOREM 2.12 (Brooks-Jewett). A pointwise convergent sequence {m„}„eN of finitely additive scalar and exhaustive set functions (strongly additive) defined on an ?-algebra ?, i.e., lim,, ^oc "i«(?) — w(E), ? e ?', (i) converges to an additive and exhaustive set function m, (ii) {w„}„epj is uniformly exhaustive. PROOF. If {ш„}„ем is pointwise convergent to 0 then the condition (a) of Theorem 2.11 is verified and so in this special case (ii) is true. The general case for (ii) follows in the same as in the proof of Nikodym theorem. Then (i) follows by (ii) lim m(Ej)= lim lim mj(Ej)= lim lim >tij(Ej) = 0. D j —* DC j —> DC ? —> DC I—> DC j —* DC Theorem 2.13 (Nikodym boundedness theorem for additive case). A family ? of finitely additive bounded set functions m. defined on an ?-algebra ?, which is pointwise bounded, i.e., for each ? e ? there exists Me > 0 such that \m{E)\ < Me (m eM), is uniformly bounded, i.e., there exist ? > 0 such that \m(E)\ < ? (m eM, ? e ?). The proof is same as the proof of Nikodym boundedness Theorem 2.8 only using the Brooks-Jewett Theorem 2.12 instead of Nikodym convergence Theorem 2.6. 2.9. Vitali-Hahn-Saks theorem for additive case For the proofs of Brooks-Jewett Theorem 2.12 and Nikodym boundedness Theorem 2.13 it was crucial Drewnowski Lemma 2.10. To prove finitely additive version of Vitali-Hahn- Saks theorem we need another useful lemma.
140 P. de Lucia and E. Pap THEOREM 2.14 (Vitali-Hahn-Saks for finite additive case). For any sequence {т„}п€ц of finite additive bounded set functions m„ which are absolutely continuous with respect to a non-negative finite additive set function ?. i.e., m„ <*C ?, and for which lim,,^^ m„(E) = m(E) exists for each ? e ?, hold (i) the limit m is also absolutely continuous with respect to ?, i.e., m <5C ?; (ii) {m„}nepj is uniformly absolutely continuous with respect to ?. The proof of Vitali-Hahn-Saks Theorem 2.14 will be based on the following two lemma. LEMMA 2.15. Let M. be a family of finite additive set functions uniformly exhaustive in the ring TZ. Then for every ? > 0 there exists ?4', finite subset ofM, and ? > 0 such that if \m(E(~)F)\<8 for every E. F e TZ and m e ?4' then \m(E)\<s foreverymeM.EeTZ. PROOF. If the thesis is not true there exists ? > 0 such that for every S > 0 and every finite subset M! of M. there exist ? e TZ and m e M. such that \m(E П F)\ < ? for every m eM' and F eTZ and m'(E) > ?. By induction it is so possible to construct a sequence {»u-}jteN of ? and a sequence {EjtheN ofTZ such that \mp(Ff)Ek)\< -^ for every ? ? к and F e 7г. A0) \mk+i(Ek)\ > ?. A1) If h e N, let к E[= [J ?, fork>h. i=h+\ {E^UeN is an increasing sequence of TZ; by the uniform exhaustivity {m(F П E'k)}k>h is a Cauchy sequence uniformly in m e ? and F eTZ. Then there exists /?'>/? such that |w(F П ?^)-w(F П ?;,.)|<-^?- for every m e M. F e 7г. ? ^ h'. A2)
Convergence theorems for set functions From A2), taking F П ? к instead of F, we obtain 141 Fn?t\ \J ЕЛ \ i=h+\ I < o/T+T fore\ery m eM, F eTl, k^h'. A3) Let {к„}п€щ be a strictly increasing sequence of natural numbers such that FDEk\ \J ?, V /=*„ + ! / <-?— forevery meM, F eK, k^k„ + 1. A4) Put for every ? e N *4. = (*4.n( (j ?,))u(?,„\ Q' E,\ \ \i=k„+\ II \ i=k„+\ I and denote ?*„ \ U, ="!', +1 ?, by T„, so we have mk, + i(Ek„) = m,„ + 1 ? Ek. П ? Q ?, ) ) + mf„ + l ( T„ П |J 7] »-i A5) i=l We will estimate all three summandson the right side of the equality A5). We have for the first summand, for every ? e N, ?*»П( U E'))= ? ffl^l№n?i)· where F,¦, i = k„ + 1,..., k„+\, are suitable elements of 7? mutually disjoint. Therefore observing that in our case is /' ^ k„ + 1, by A0) we have |hu„ + i(F,· П Ej)\ < -—j forevery/ = k„ + l,... ,kn+\. and so we obtain k„ \ \ ;=*„+i < Г - /=*„ + ! 2' +1 A6)
142 P. de Lucia and E. Pap We observe for the second summand that it is »u„+i \Tn? (J?? = ?^,+???,,\ U?? ??? ??-\ ?/ *,-1 \ \ = ]>>*„ + , [[Ekn\ (J eAdF, where F-, is a suitable element from TZ. and then by A3) n-\ r»k„ + \ T„ n\jT, i=l и—\ k„ ?? ^-Ц ? i=l ?' = ? (?) We obtain for the third summand by the uniform exhaustivity of ? that there exists «o e N such that H-l [?,? U ?, By A5)-A7) we have for every ? ^ no < — for every m e ?4 and /j ^ no. A8) l^„+i(?i„)| <^^ + |<^ and this contradicts to A1). D LEMMA 2.16. Let TZ be a ring. ? a family of finite additive set functions on TZ and ? a non-negative finitely additive set function on TZ. If ? is uniformly exhaustive and every element of ? ?-absolutely continuous, then ?4 is uniformly absolutely ?-continuous. PROOF. By Lemma 2.15 for every ? > 0 there exist S > 0 and M', finite subset of M, such that if \m(EC\F)\<8 for every m e ?', A9) then \m(E) \< ? for every m e ?. B0) But the elements of M! are ?-absolutely continuous, then there exists ? > 0 such that ?(?) <? implies A9) and B0). ?
Convergence theorems for set functions 143 Proof of Vitali-Hahn-Saks Theorem 2.14. By Brooks-Jewett Theorem 2.12 we have that the sequence [тп}п€щ is uniformly exhaustive. The thesis then follows by Lemma 2.16. D 2.10. Counter-examples on algebras We shall give three examples showing that generally Nikodym convergence, Nikodym boundedness and Vitali-Hahn-Saks theorems are not true on an algebra. Example 2.17. Let ?? be the ring generated by the right half open subintervals of [0, 1[, and let for every ? e N, be /„ : [0. 1 [ -> [0, 1 [ the function given by /„(x) = x". Let ?„ be the set function defined on the right half open intervals [a, b[ by the formula 4n{[a,b[) = fn(b) - /„(a) = b" -a" and extended on ?? in the natural way. We note that, for every /; e ?, ?„ is countable additive. It is clear that for every a, b with 0 ^ a < b < 1 we have lim,,^^ ?„([?. b[) = 0, and for every a such that 0 ^ a < 1 we have lim,,^ x ?„ ([a, 1 [) = 1. Then for every ? e 1Z .· //74 10 ifl??' hm ?„(?)= ? n—>oc I 1 if 1 e ?¦'. Then {?„}„?=? is a sequence of countable additive measures defined on ring, pointwise convergent to a function ?? = lim,,^^ ?„, but ?? is only finitely additive not countably additive. Example 2.18. Let J be the ring of the finite subsets of E, and let v„ for every /; e N be the function M?) = ( ? if ? e ?, 0 ????. Clearly vn is countably additive. If ? is the algebra, generated by J, of finite and cofinite subsets of R, we extend v„ to ? by the formula _iv„(E) if Ее J. ?"(?) -\-?„(?') ifE'eJ. Clearly {?„}„?? is a sequence of finitely additive measures. We observe that if {?, },ен is a disjoint sequence of elements of A such that U,eh' ^< e -^ tnen a" e'ernents of {?|},-eN excepts a finite number are empty and therefore we can say that every ?„ is countable additive. We have for every ? e ? . . |{0}U(Nn?) if Ее J, \?„(?): neN С ' 1 ' ~ @)U{/eR: -teNDE] ifE'eJ.
144 P. de Lucia and E. Pap Then, for every ? e A, the set {?„(?): ? e ?} is bounded and clearly ?„ is bounded, but M"({n}) = ? for every /; e N and therefore {?„(?): ? e ?, ? e A] is not bounded. Example 2.19. Let ? be the algebra from the preceding example. We define v„(E) = ? if ? e ? finite, —? if ? e ?' finite, 0 otherwise. Then every u„ is bounded, finitely additive and lim,,^^ v„(E) exists for each ? e A. Taking ?(?) = ?,,??? iff is finite, .?+????? if ?¦' is finite, we have that ? is bounded, finitely additive and v„ <K ? but {y„}„eN is n°t absolutely ?-continuous. 2.11. Algebras with SCP and SIP Let ? be an algebra of subsets of a nonempty set S. We have the following definitions by Constantinescu A981), Freniche A984) and H.Weber A986). Dehnition 2.20. An algebra Л has the Sequential Completeness Property (SCP) if each disjoint sequence {?„)„6pj from Л has a subsequence {ЕП] }jSn< whose union is in Л. Dehnition 2.21. An algebra Л has the Subsequentional Interpolation Property (SIP) if for each subsequence {Ау„}„ек of each disjoint sequence {А}}]€п from ? there exist a subsequence {Aju };.ещ and a set В е Л such that А]ч с В (* e ?) and A j П В = 0 for ; e N \ {;„t :JteN). Example 2.22. Let J be the ring of the finite subsets of N and let I be a maximal ideal of V(N) containing J. We note that by a well-known characterization of maximal ideals, we have Eel if and only if Ec il. (*) The ideal I is not a ?-ideal, because I is a proper ideal and therefore N = U„eN{n) ^ ^· Let {?H}„6N be a disjoint sequence of elements of I such that [Jn(z^ E„ does not belong
Convergence theorems for set functions 145 to I and let {N\, N2} be a partition of N such that N\ and N2 are infinite. We claim that U(ieyV] E„ or Une№ ?" belongs to I. Suppose the contrary. Then we have by (*) =((u44U?"))eI· This is a contradiction because I is a proper ideal. Then I is a ring which is not ?- complete, but with SCP. Example 2.23. Let N0 = {0} U N. Obviously if I is a ring of P(N0), then Л={?е-р(М0): ?eIorN0\?eI) is a subalgebra of V(Nq) and it is not a ?-algebra. We can observe that if ? e ? \ I then 0e? and we have also No \ F = ? \ F el and then ? П ? ? I. Therefore we have EeA\I if and only if ? = @)UF with F с N and F ? I. (**) Let {?n}„eN be a disjoint sequence of elements of A. We consider two cases: (i) There exists ? e N such that 0 e Ep. Then we have by (**) Ep = {0} U F with F с N and F i I, F„ e I. Then F„ с N for every ? ? p. Therefore |Jf„ = {0}u[fu(|Jf„)Y II ? ? where F U (Ц,^о En) is a subset of N that does not belong to I. By (**) we have (ii) Let0? E„ for every и e N. Then {F„}„eN is also a disjoint sequence of elements of I. We know by Example 2.22 that I has SCP and therefore there exists an infinite subset N\ of N such that \Jn€N E„ e Л We have by Pap A987). Theorem 2.24. Let A satisfy SCP and {m,,},,^;, m„ : A -+ [0, +oo[ (и e N), be a sequence of additive exhaustive set functions. If exists lim mn(E) = mo(F) n—>oo for each ? e A, then [m„}^t0 is uniformly exhaustive. Using Theorem 2.24 we can obtain more general statement.
146 P. de Lucia and E. Pap THEOREM 2.25. All assumptions and statement as in Theorem 2.24 only the algebra ? satisfy SIP instead o/SCP. 3. Related convergence theorems 3.1. Integration with respect to vector measures Related the integral of scalar valued measurable function with respect to a vector measure, see chapter written by N. Dinculeanu B002), Section 2.3, two crucial properties are related to Vitali-Hahn-Saks Theorem 2.7 vector case. First we recall the definition of the integral of a simple function / = ?"=\ XiXE, · where E\,..., E„ are sets in ?. I f(s)d?(s) = J2x:?(Er\E,). Je tt Now we have that a scalar valued measurable function / is integrable if there exists a sequence {/„}»eN of simple functions such that {/„}„ем converges ?-almost everywhere to / and the sequence {jE f„(x)d?(.x)}n€?; converges in norm in X for each ? e ?. Then the limit of this sequence of integrals is the integral of / with respect to ?. The following two crucial properties are strongly related to Vitali-Hahn-Saks theorem (see Dunford and Swartz A958)). If ? e ? and / is a scalar valued ?-integrable function, then the integral with respect to ? over ? is an uniquely defined element of X. Second, if / is a scalar value ?-integrable function, then v(E) = fE f(.x)d?(x) is a countable additive set function on ?. 3.2. Orlicz-Pettis theorem One of the basic tool of Banach space theory is given by Orlicz A929) and Pettis A938). THEOREM 3.1 (Orlicz-Pettis). Let X be a normed vector space and let ?„ ?? be sub- series convergent in X with respect to the weak topology. Then ?„?" 's subseries convergent with respect to the norm topology. Consequently, a weakly countably additive vector measure on ? ? -algebra is (norm) countably additive. We shall give now an elementary proof of Theorem 3.1 which is based on the following simple Diagonal theorem (see Antosik A976), Pap A982) and Pap A995)). Theorem 3.2 (Diagonal Theorem for non-negative matrix). Let x\s e E+, /', ;' e N, such that lim xjj = 0, lim ;c,y = 0, lim .v,, = 0. 7—>OC j—>3C i—>TC
Convergence theorems for set functions 147 Then for every ? > 0 there exists an infinite subset I o/N such that ? X'J < M- ,.j?l Moreover, the elements of the set I can be ordered in a increasing sequence {р,},€ы such that lim ^P-Xp,^ =0 and lim У~\????? = 0. PROOF. Let ;'o = 0. We shall choose a sequence {/,,},,^,' of natural numbers such that (i) /,,_i < /'„ for every и е N, (ii) ximik < 2~k~s for every l^s^n,l^k^n and « e N. The proof goes by induction. Since lim,--»^.*,·,- = 0 there exists an index r such that хц < 2~~2 for /' ^ r. Let /| = r. Then (i) and (ii) hold for ? = 1. Suppose now that we have already find /? ,ip such that (i) and (ii) hold for 1 ^ s ? ?, 1 ^ к ^ р. Since *,-,,¦ —> 0 and *,¦,¦, —>¦ 0 as /' —>¦ oo for ? = 1,..., ? and .?,, —>¦ 0 as / —>¦ ??, there exists an index ip+\ > ip such that x l\'p+\ <2-''-J-|M and .v,„ „¦<2-p-s-lM for /n = 1,..., ? + 1. Therefore (i) and (ii) hold for 1 ^ s ^ ? + 1 and 1 ^ к ^ ? + 1. So we have constructed a sequence {/„}„eri with the properties (i) and (ii). By (ii) we obtain OO ОС ??1'·''<?- ш=1к=\ Taking / = {/'?, /?,...} we obtain the desired conclusion. Since the sequence of summands of a convergent series converges zero, we obtain oo lim y^Xi,ik =0. Take pk = ? for к eN. D Proof of Orlicz-Pettis theorem. We can suppose without loss of generality that X is separable (replacing X by the closed subspace spanned by the sequence {-х^Ьем)· Let {xnj },epj be an arbitrary subsequence of {*„}/, eN· Taking 5, = Y^'k=\ x,n we nave t0 Prove that {Sj }iepj is a norm Cauchy sequence. We take an arbitrary increasing sequence {p, }iepj and n = J2k=Pi + \ х>ч- So we have t0 prove that lim,^^ ||z,|| = 0. Suppose that this is not true, i.e., for some ? > 0 there exists a subsequence of {?i}„eN (denoting it some) such that Hzill ^ ?. As a consequence of Hahn-Banach theorem for every ? there exists
148 P. de Lucia and E. Pap z* e X* such that ||z*|| = 1 and z*(z„) = \\z„\\. Then by Banach-Alaoglu theorem (for a neighborhood V of 0 the set {x* e X*: \x*(x)\ ^ 1 (x e V)} is weak* compact, see Swartz A992, 16.12) {z*L,eN has a subsequence (denote it same) such that it converges weak* to some z* e X*. Since {z„}„eN is weakly convergent we have \z*(z„)\ < ?/2 for large ? and therefore \(?*-?*)(??)\>?-\?*(?„)\^?/2 for large n. Taking a subsequence of {г*}„ек (again denoting it same) we have |(?;-?*)(?„)|>?/2 for all n. Taking v. i|(s*-z*)(zy)| for/*;. 0 for/ =;, we obtain by Diagonal Theorem 3.2 (take ? = ?/4) that there exists a subsequence {s, },ep such that ? ? x^j<e/4. /=1./= ?. ]?? We have by Diagonal Theorem 3.2 ?«-;*)^) ^ ?/2-?/4. ?«-^) ?' Hence 7=1 ^ ?/4 for every / e N. Contradiction with the weak subseries convergence of ?? zn ¦ a Theorem 3.1 implies directly the locally convex version, i.e., if X is a locally convex vector space and if the series ?? x" IS subseries convergent with respect to the weak topology, then ?? ?? ls subseries convergent with respect to the Mackey topology of X. Further generalization is obtained by Tweddle A970): if ???" IS subseries convergent with respect to the weak topology, then ?? x» IS subseries convergent with respect to the Mackey topology ?(?, G*), where G* is the vector space of all linear functionals y* on X such that ?*(????) = ?,,)'**-*") f°r a" series ?„?» which are weak subseries
Convergence theorems for set functions 149 convergent. Moreover, ?(?, G*) is the strongest locally convex topology on X such that every weak subseries convergent series is subseries convergent. As a consequence of Tweddle's result it can be given an interpretation of Nikodym convergence theorem in the spirit of Orlicz-Pettis theorem, see Antosik and Swartz A985), where also other types of Orlicz-Pettis theorems can be found. 3.3. Schur lemma and Phillips lemma In this part we present the classical lemmas of Schur and Phillips which have many applications in measure theory and functional analysis. One version of the classical Schur lemma (see Dunford and Swartz A958)) states that a sequence in i' converges weakly iff it converges strongly. We present here a generalization of Schur lemma to the group case obtained by Antosik and Swartz A985). Let G be commutative group endowed with a quasi-norm | · |, i.e., | · |: G —>¦ K+ such that |0| = 0, |-дс| = |*| and \х+у\Ц \x\ + \y\. THEOREM 3.3. Let Xjj e G for /', j e N. Assume that the rows of the matrix (-*,, ),.,,=? are subseries convergent and lim,^^ xjj = Xj exists for each j e N. If {?];<=o x4 beN IS convergent in G for each DcN, then (a) the series ]T ¦ Xj is subseries convergent; (b) lim,^oo Y^j€DXij = T,j€Dxj uniformly for D с N. The proof is based on Basic Matrix Theorem 5.10, see Antosik and Swartz A985). Theorem 3.3 is a generalization of the version of Schur lemma for Banach spaces obtained by Brooks A974, Corollary 2). We remark that Brooks' method cannot be transferred to the previous case since it is based on duality methods. The classical Schur lemma is contained in the following (for tj = 0). COROLLARY 3.4 (Schur lemma). Let (i,y)i.yeN be a real infinite matrix. If lim У tjj exists for each DCN JtD andiftj = lim/^oo i,y, then {iy}yeN belongs to l] and oo .lim ??'?-';? = °· 7—>?? * 7 = 1 Phillips lemma was obtained by Phillips A940). We present here a generalization of Phillips lemma to the group case obtained by Antosik and Swartz A985).
150 ? de Lucia and ?. Pap THEOREM 3.5. Let G be a sequentially complete normed group and let Vj: ? —>¦ G, / e N, be exhaustive. If\imj^x Vj(E) = v(E) exists for each ? e ?, then for each disjoint sequence {Ej}j€fqfrom ? lim ]P vi (Ej) = ]P v(Ej) uniformly for D с N. '^°°jeD j€D The proof presented in Antosik and Swartz A985) is based on Theorem 3.3, Basic Matrix Theorem and Drewnowski Lemma. As an easy consequence of Theorem 3.5 follows the classical Phillips lemma (for v(E) = 0). Corollary 3.6 (Phillips lemma). Let v, : V(N) -> R be bounded and finitely additive. Iflimj^oo Vj(E) = v(E) exists for each ?CN, then oo lim ?\?>№)-?(?)\=°· ?—>oc '—' We can interpret the preceding classical Phillips lemma in the following way. The dual space of ix is the space ba of all bounded finitely additive set functions on P(N) with the total variation as norm. Then for each ? eba, the series ? ¦ v({j}) is absolutely convergent and ? h*- MO'Db'eN gives a projection ? from ba onto Iх. By Phillips Lemma 3.6 for each Cauchy sequence {у,-},-ещ from ba with respect to the topology a(ba,mo) the sequence {Ри,},ем is norm convergent in €', where wo is the subspace of Iх containing all sequences with finite range. 3.4. Rosenthal's lemma One useful method close to diagonal theorems and a sharpening of Phillip's Lemma 3.6 was given by Rosenthal A968). We shall prove in this part in a quite elementary way a countable version of the important Rosenthal's lemma. It is used as the basis for a deep study of the behavior of operators on spaces of continuous functions, see Diestel and Uhl A977). The presented proof follows Antosik and Pap A981), where it is used an idea of KupkaA974). Lemma 3.7 (The countable case of Rosental's lemma). Let v„ for ? e N be additive non-negative set functions defined on V(N). If there exists a number ? > 0 such that v„ (?) < ? for each n= 1,2, then for each number ? > 0, there exists an infinite subset N\ ofN such that Vn(N\ \ {«}) < ? foreachneNi.
Convergence theorems for set functions 151 PROOF. Assume that, for some ? > 0, no such subset N\ exists. Let M\, ЛЬ.... be a sequence of infinite disjoint subsets of N such that (J Mp = N. For each ? there is an index к ? e Mp with vk (Mp \ {kp}) ^ ?. Otherwise there would exist an index ? such that v„(Mp \ [n]) < ? for all ? e Mp which is impossible. Let К be the set of all members of the sequence {кр}р€ц, i.e., К = {kp: ? e N}. We note that for each ? e К we have ?„(?) + ?„(?\?)? 1 (without loss of generality we may take 1 instead of ? > 0). Since Mp \ {kp\ с ? \ ? for all p, we have vn(K) + ? ^ 1, for all « e A-. Hence, for all ? e A-. vn(K) < 1 — ?. Next, apply the same argument to v„ with и е К and A-, by replacing v„ for « e N by v„ for и е К and N by A". If the process does not stop, then there is a new subset S ?? ? such that v„ (S) ^ 1 — 2? for each /; e 5. It now becomes apparent that the process must come to end before ? iterations, if /; is the smallest positive integer such that 1 — «? < 0. ? Let T(N) be the family of all finite subsets of the set N. COROLLARY 3.8. Let {9„}пе^ be a sequence of additive non-negative set functions defined on the family .F(N). If there exists a number ? > 0 such that ?„(?) < ? for each ? = 1.2 and each set A ef(N), then for each ? > 0 there exists an infinite subset N\ o/N such that e„(D\{n})<? for each finite subset D of N\ and each ? e N. PROOF. We introduce the functions v„ defined on V(N) whose values are given by v„(A) = J^ ?„({;}) for each ? = 1, 2,... and each A e V(N) and apply Lemma 3.7 to the sequence {v„}n€i;. ? Now it is easy to obtain the countable case of Rosenthal's lemma formulated in Diestel and Uhl A977). COROLLARY 3.9. Let {вп}п€ц be an uniformly bounded sequence of additive real valued set functions defined on an algebra Л of subsets of a set S. Then for each sequence {En }п(-н
152 P. de Lucia and E. Pap of pairwise disjoint members of ? and each ? > 0 there exits a subsequence {ЕП] }j^n of {?,,?,,^?? such that i^i( U Е'ч)<? for all finite subsets D ofN\ and all j e N. If in addition A is a ?-algebra, then the subsequence {E„ }y epj may be chosen such that i^i(U?'u)<? for all j= 1,2,.... PROOF. To prove the first part of the corollary we consider the functions v„ defined on .F(N) whose values are given by Vn(A) = Wn\(\jEj\ for each ? = 1, 2,... and each A e .F(N), and apply Corollary 3.8. To prove the second part, we consider the functions v„ defined on 7?(N) whose values are given by vn(A) = W„\(\jEj\ ??? for each ? = 1, 2,... and each A e V(N). and apply Lemma 3.7. ? 3.5. Hewitt-Yosida theorem In this part using the Nikodym convergence Theorem 2.6, we shall obtain an important decomposition theorem of finitely additive set function. For a real additive set functions v\ and vi on an algebra A we write v\ ^ vi if v\ —vi^-O. This ordering is a partial order on the family of all real valued finitely additive set functions on A, and they form a lattice with respect to this ordering. Dehnition 3.10. A bounded finitely additive set function ? : A -> Ш+ is purely finitely additive if 0 ^ ? ^ ? and ? is countable additive imply that ? = 0. We shall need the following lattice property of special spaces of set functions, see Dunford and Swartz A958).
Convergence theorems for set functions 153 THEOREM 3.11. The lattices ba(S.A) (bounded finitely additive set functions) and ca(S, A) (countable additive set functions) are complete lattices. Now we have the following version of Hewitt-Yosida decomposition theorem, Hewitt and Yosida A952), see also Dunford and Swartz A958) and Diestel and Uhl A977). THEOREM 3.12. If ? is a bounded finitely additive non-negative set function then there exists a unique decomposition ? = v\ + V2, Vj ^ 0, where v\ is countable additive and vi is purely finitely additive bounded set function. PROOF. Let С = {? e ca(S\ A): 0 ^ ? ^ v]. Take a sequence {??/??? from С such that lim ?„E) = 5ирдE) < ??. % Mi ^ ?'!=? My, » = 1,...,«, it follows by Theorem 3.11 that there exists sup^i,..., /J-n} = ~?~?~ in ca(S, A). We have ?^ ^ ?,,+ ?, ? e N. Let ?\ be the ?-algebra generated by A and denote the unique extensions of measures {д^},,ем on ?\ by the same symbol. The extensions are also non-decreasing (extension of non-negative set function from A to ?\ is also non-negative) and therefore for each ? e ?\ there exists \im„^0CJia(E) and we denote it by v\ (E). Then by Nikodym convergence Theorem 2.6 v\ is countable additive on ?\, and so also its restriction to ?. We introduce vi by vi(E) = v(E) — v\ (E) for ? eA. Then by the definition of v\ and {??? we have that vj ^ 0. If we suppose that V2 is not purely finitely additive then there would exists a non-zero countably additive ? such that ?' -?,? — v\. This would imply v\ ^ v\ + ?' ^ ? and therefore supMeC ?(?) = v\(S) < v\(S) + v'(S). Contradiction. Uniqueness follows by Theorem 3.11. D The Hewitt-Yosida theorem was extended by Uhl A971) to the vector measures with Stone space argument, and with a direct proof by Huff A973). Drewnowski A973) and Traynor A972) have extended Hewitt-Yosida theorem for group-valued measures. 3.6. Biting Lemma One useful method close to diagonal theorems are given by Brooks and Chacon A980). Biting Lemma is applied by Brooks and Chacon A980) for a proof of Vitali-Hahn-Saks theorem, in the proof (the part (b)) of Biting Lemma) of Dieudonne convergence theorem A978), and in a short proof of a result of Akcouglu and Sucheston A978) related the existence of an exact dominant for a superadditive process. LEMMA 3.13 (Biting Lemma). Suppose {v„b<eN is a uniformly bounded sequence of additive set functions defined on the measurable space (S, ?), each absolutely continuous with respecttoO, where ? is a positive, bounded and additive set function on (S, ?). Then
154 P. de Lucia and E. Pap (a) If ? and ? are positive, then there exists a set Cf.? and a subsequence {v,u}i€n {depending on ? and ?) such that: 0(CF^) < ?, and there exists S > 0 such that |u„,(A)| < ? for every i eN whenever ?(?) < ? and А С S\Cf^. (b) If all the v„ and ? are measures, then there exists a subsequence {v„; },<=pj such that for every ? > 0 there exists a set BF such that 0(BF) < ? and {vHl },ек is uniformly absolutely continuous with respect to ? on S\ Bf. (c) If all the v„ and ? are measures, then there exists a subsequence {vUl}i€fq that converges w, i.e., there is a sequence B„ \ ?. ?(??) -> 0 such that if А С S \ ?, for some i eN, then v„(A) -^ v(A), and \v\(B) = 0. PROOF. We only give a proof of (b). Suppose that the statement is not true. Then there exists ? > 0 and a subsequence {v,h },€\; such that for A e ? such that ?(?) < ?, there is no further subsequence of {v,u }ier; which is uniformly absolutely continuous with respect to ? on S \ A. Let D \ denote the set of all с > 0 such that for any к} > 0 there exists a set ?,? such that ?(?„) < ? and \vni(En)\ > с for infinitely many /'. Since D\ is nonempty and bounded there exists sup D\ which we denote by 2c\. Take a set C\ such that 9(C\) < ?/22 and |i>i.i|(C|) > c\, where {vuhep; is a subsequence of {vtli },-e^·. Let Di denote the set of all с > 0 such that for any ? > 0 there exists a set ?? such that ?,, с S \ C\, ?(??) < ? and | ?\?\(??) > с for infinitely many /'. Take a set Ci С S\C\,e(Ci) < ?/23, a subsequence {t^.iheN of {vuheN such that |i>2.j|(Ct) > ci for all /', where c? = ^supZK Continuing this procedure we obtain {^./}iei:, Q-.c; such that {Q}jteN forms a sequence of pairwise disjoint sets such that 0(Q) < ?/2*-1, |i>ju1(Q) > ck-' e N. If the sequence {cjtUeN would converge to zero, then the diagonal sequence {i^.jtheN would be uniformly absolutely continuous with respect to ? on S \ (J* Q · wnat 's a contradiction, since ^(Ujt Cjt) < ?. If the sequence {с;-}л-еь- would not converge to zero, this would contradict the boundedness of {i>n }„ ep,- ¦ ? 4. The relation between boundedness and exhaustiveness Let X be a Banach space. A set function m: ? -> X is exhaustive (strongly bounded, shortly s-bounded) if lim,,^^ m{En) = 0 for each sequence {?„}„ек of pairwise disjoint sets from ?-algebra ?. A sequence {w/},ef,- of set functions m^-.? —* X, i e N, is uniformly exhaustive if lim,,^^ ш,(?„) = 0 uniformly in /' for each sequence {?,??,??? of pairwise disjoint sets from ?-algebra ?. It is obvious that exhaustivity implies boundedness. As we have seen a real additive set functions on a ring is bounded if and only if it is exhaustive. For Banach space valued additive set functions this is no more true. Example 4.1. Let ? be the ?-algebra of Lebesgue measurable subsets of [0, 1] and let Lx be the Banach space of essentially bounded functions with usual essentially sup norm II · II. Define m '. ? —> Ly^ by m{E) = ?^, ? 6 ?. Obviously, m is additive but not ?-additive. It is bounded by ||№(?)|| = ||??|| ^ 1. But it is not exhaustive, e.g., for En = [j^,~[ we have ||w(?„)|| = 1 for all n e N. With some additional suppositions on the range we have for vector valued measures on a ring, see Diestel and Uhl A977).
Convergence theorems for set functions 155 THEOREM 4.2 (Diestel and Faires). Let TZ be a ring of subsets of a set S. A bounded finitely additive measure m : TZ -> X is exhaustive if and only if X does not contain a copy of cr,. We shall need in the proof the following theorem. THEOREM 4.3 (Bessaga-Pelczynski). A Banach space X is such that every weakly unconditionally convergent series in X is subseries convergent in norm if and only if X contains no copy ofcc,. PROOF. Suppose that the theorem is not true. Then for a weakly unconditionally convergent series ?^ *, there is a subseries J2?Li xn, which does not converge. Then there is a ? > 0 and an increasing sequence {p, },-е^ such that Pi+i k=p, + \ > ?. Take n = J2k='Pl + i xm- The series ]T. zi is weakly unconditionally convergent ({]Tief c,·: F с N finite} is norm bounded since J2kel·' x»k 's norm bounded). We can suppose without loss of generality that X is separable. Then as a consequence of Hahn-Banach theorem for every ? there exists c* e X* such that ||г,*|| = 1 and z*(zn) = \\z,i\\- Then by Banach-Alaoglu theorem {г*},/ек has a subsequence (denote it same) such that it converges weak* to some z* e X*. Then \(?*-?*)(?„)\>?-\?4?„)\>?/2 for large n. Taking a subsequence of {г*}„ек (again denoting it same) we have \(?*„-?*)(?„)\^?/2 for all n. Taking \(zf - Z*)(Zj)\ fOTi ? j. 0 for/=y, we obtain by Diagonal Theorem 3.2 (take ? = ?/4) that there exists a subsequence {A:,-},-epj such that ? ? ?^<??. Define ?: со -> X by OO i = l
156 P. de Lucia and E. Pap Then we have by Diagonal Theorem 3.2 2\\T({ti}ieN)\\^ \\(??-?*)?{{^]???)\\ ^|?,|?/2-||{?,},??||?/4. Taking supremum over / we obtain 2||Г({г,}/еК)|| ^ 2||{i,}jeN||?/4. Hence ? has a bounded inverse. ? PROOF OF Theorem 4.2. Suppose m is not exhaustive. Then there exist a disjoint sequence {Е„}„€щ and ? > 0 such that ||m(?„)|| > ? for every n. Then for every x* e X* the scalar measure x*m is bounded on К and therefore it has a bounded variation. Hence II ^|jc*w(?,)| <sup{|jt*m|(A): AeTZ} <oo. Therefore the series ?"_| x*m(Ei) is unconditionally convergent and by Theorem 4.3 X contains a copy of со. ? THEOREM 4.4. Let ? be a ?-algebra of subsets of a set S. A bounded finitely additive measure m : ? —* X is exhaustive if and only if X does not contain a copy ofix. PROOF. Repeating the part of the proof of Theorem 4.3 from the part "We can suppose without.. ."taking now ц =m(Ei) and applying Drewnowski Lemma 2.9 before defining the operator T, now in the form ?: ex(J) -> X, where J = [n,¦: /' e N) and {«, },ек is the sequence from Drewnowski Lemma 2.9, by ОС T({ti}ieji) = YltiZ*m(Ei). i = l we obtain that ? has a bounded inverse. ? 5. Diagonal theorem for triangular set function In this section we shall present a general version of very useful Diagonal theorem for triangular set functions. The first Diagonal Theorem was proved by J. Mikusinski A970), and first Diagonal Theorem for triangular set functions by Pap A976), then by H. Weber A986) and Saeki
Convergence theorems for set functions 157 A992). We present here Theorem 5.3, the version of Diagonal Theorem obtained in Antosik and Saeki A995), see Pap A995). Weber has proved Theorem 5.3 in H. Weber A986) only for the case lim lim m„(Bm) = 0. in—>DC It —>DC Thus the second possibility in Theorem 5.3 cannot occur in his setting unless lim,^x sup (d„ — a) ^ 0. On the other hand, in Saeki A992) this result was obtained only for a = 0 and d„ — Ш/;(В/7)/2. Corollary 5.5 with a = у = 0 is a generalization of the nontrivial part of Theorem 2.4 in H. Weber A986). Actually, a careful examination shows that his proof works if у =0. Corollary 5.7 (Diagonal Theorem) was proved for group case in Antosik A971) and for semigroup case in Pap A974). Diagonal Theorem 5.9 denoted by DT is proved by P. Antosik and the Basic Matrix Theorem 5.10 was obtained in Antosik and Swartz A985). 5.1. a-boundedness Let 1Z be a ring subset of a set S. For a given к e [ 1, oo[ the set function m: 7Z -» [0, oo[ is a ^-triangular (triangular for к = 1) set function if m @) = 0 and m(A)-km(B) ? m{A U В) ^ m{A) + km{B). for A, BeTZ, ??? = 0. Let m : ?? —* [0, oo] be a triangular set function. For each set А с S, let in(A) = sup{m(B): В е П, В С A]. DEHNITION 5.1. We say that m is ?-bounded @ ^ a < oo) if inf„ m(Bn) ^ a for each disjoint sequence {В„}„ек in ??. If (A„}„epj, is a sequence of sets and / С ?, we write A/ for Une/ ^"· LEMMA 5.2. Let m:Tl^> [0, oo] be ?-bounded. Then in is a-bounded on the subsets of X. Moreover, if [ B„ }n€n, is a disjoint sequence of subset of S and if ? > a. then there exists an infinite set I С N such that w(Une/ ^") < ?- PROOF. The first assertion is obvious. For the second, we represent N as an infinite disjoint union of infinite sets. ? THEOREM 5.3 (Diagonal Theorem for triangular set functions). Let {т„}„ен be a sequence of?-bounded triangular set functions on the ring ?? @ ^ a < oo), let {Sn},ieN be a disjoint sequence in TZ. and let {Sn}n€f>s be a sequence in [0, oo]. Then either there exists an infinite set I С N such that (i) mll(\Ji€,Bi\Bll)<S„ (nel), or there exists a finite set J С N which satisfies (i) with I = J and (ii) m,i{Bj) ^ <5„ —a for infinitely many ? e N.
158 ? de Lucia and ?. Pap PROOF. If <5„ ^ a for infinitely many ? e N, we can take 7 = 0. Suppose <5„ > a for all ? e N large enough. Let us call / с N a nice set if it satisfies (i). Suppose that no finite nice sets / с N satisfy (ii). We shall construct a sequence {/„}„,=? of infinite sets and a sequence {/цкеь; in N as follows. Let /o = N, and let n\ be any natural number with <$„, > a. Note that \n\) is a nice set. Let к &N, and suppose that infinite sets IqD ¦¦ ¦ D h-\ and a nice set {n \ in} have been chosen in such a way that m for l^j^k. B1) "j ( U ?»< ) < S»j ~a for К ./ < *. and B2) ч ^ 4- 'y'-i li' = l Я,, ? (J ?») < &nj - mnj I \J B„, \ for 1 <: ; ^ к - 1 B3) (ie/, \i = l By B2), the right-hand side of B3) with; = к is larger than ?. since each mn is ?-bounded, it follows from Lemma 5.2 that h-\ contains an infinite set h which satisfies B3) for j = k. Moreover, no finite nice sets J satisfy (ii) and {n\ и*} is a nice set. So /„ П]и*,оо[ has an element m+i which satisfies B2) with j = к + 1. The arguments in the next paragraph will show that {щ >n+1} is a nice set. Define / = {/«: к e N). To show that / satisfies (i), pick any j e N. Then к > j implies nk e /*-1 С I j by B1). It follows by the subadditivity of w„( that m„j ( (J B„ \ Bnj J < тП] i [J ВП] \ + mH] ( [j B„ J < Snj, which completes the proof. ? Corollary 5.4 (Webers Version of the Nikodym Boundedness Principle). LetH be a a-algebra, and let ? be a set of?-bounded triangular set functions on ?? for a < oo, such that sup{w(S): m e T} < oo for each В e ??. Then ? is uniformly bounded. PROOF. Suppose that the theorem is not true. Then there exists a disjoint sequence {?/ilneN ш "R- and a sequence {w„}„ef.; in S such that m„(B„) > 2n for each ? e N. Replacing ?? by a smaller ring, we may suppose ?? = {Une? ^»: ^ С N and U»e? ^" e ^)· If У С N is a finite set and m„(\JneJ B„) > ? - a for infinitely many ? e N, then J contains a set F such that w„(U,ieF ?„) > ? - ? for infinitely many ? e N. Hence sup,, w„(U„e/r S») = oo, a contradiction. It follows from the Theorem 5.3 with <$„ = ?
Convergence theorems for set functions 159 that there exists an infinite set / с N such that w»(U»e/\(n) ?») < n ^or eacn " e ^- Since ft is a ?-algebra, we may suppose U,,e/ ^" e ^- ^ut tnen " e ^ implies 1" ( U ?") ^ m"(?")-"'»( U ?» ) \ie/ ' \ie/\(/i) ' > 2n — ? = ?. /ie/\H Contradiction. ? COROLLARY 5.5. Let ?, ?,? e [0. oo]. and let {m„}nef; be a sequence of a-bounded triangular set functions on the ring ft. (i) Suppose that for each disjoint sequence {Z?„}„ef; in ft and each subsequence \т,ч кем of{m„}n(zn, we have infIm„k ({jB,\.keEcNand |J ?„ e ft ^ /S, and lim I lim w,u(Sy) ^ylimw„(By). j —»зс ' к —> эс ' Г/геи ?/ге wn are uniformly (a + ? + y)-bounded. (ii) //"ft й а ?-ring and if ? (A) = lim m„(A) G—>3C is ?-bounded, then the sequence {m„}n(.\\ is uniformly (a + 2fi)-bounded. PROOF. For (ii), we may suppose ? = ?. Suppose to the contrary that m'n are not uniformly (?+? + y)-bounded. Then there exists a disjoint sequence {б„}„ек in ft, ? > a + ? + ?, and a sequence {n*heN in N such that m,4(Bk) > ? for each к eN. Since each m„ is ?-bounded and ? > a, we may suppose n\ ^ in ^ · · · Case l. Suppose N contains an infinite set / such that w,u(Une/\jjt) ^") < (P + a + ? — ?)/2 for each к e /. Then к e ? с / implies i'u(Us"Msw"<(S'0~w"<( U ?") \ie? ' \ie?\(M ' > ?-(? + ?+?-?)/2 = (?-?-? + ?)/2>?. B4) So, {m„<,}jteN does not satisfy the first requirement in (i). Also B4) shows that ? (Be) > (p - ? - у + /3)/2 whenever ? с / is an infinite set with \Jn(zE B„ e ft. Therefore, if ft is a ?-algebra, then V is not /3-bounded.
160 P. de Lucia and E. Pap Case 2. Suppose that N contains no infinite sets / as in Case 1. Then, by Theorem 5.3 with 8k = S = (? + ? + ? — ?)/2, each infinite subset of N contains a finite set J and an infinite set / such that к е / implies т'ч ( U B" ) > (P + a + У - ?2 -? = {?-?+?- ?)/2 > ?. An inductive application of this fact yields a disjoint sequence {F/}yeN °f finite subsets of N, a sequence {Ay}yepj in 1Z, and a subsequence [mSj}j€n of {m^heN such that Aj с Bfj and m4(Aj) ^ {? — ? + ? — ?)/2 > ? for every к ^ j ^ 1. Hence the second requirement in (i) is not satisfied, and ? is not y-bounded. D 5.2. Diagonal theorems Let S be commutative semigroup with a neutral element 0 which is endowed with a triangular functional /, i.e., /: S -> [0, oof such that fix) - f(y) < fix + У) < fix) + /0') and /@) = 0 (Pap A974), see also H. Weber A976), Pap A982)). Remark 5.6. Let d.S -> [0, +oo[ be a pseudometric such that satisfies the following condition d{x +x\,y + y\) ^ d(x,y) +d(x\,y\) forevery x,x\, y, vi e S. B5) There was proved by H. Weber A976) that the uniformity in each commutative uniform semigroup is induced by a family of pseudometrics which satisfy B5), see Pap A982, 1995). If S is endowed with a pseudometric d which satisfies the inequality B5) then the pseudometric d induces a triangular functional / in the following way: f(x) = d(x,Q) (x e S). Specially, if S is a commutative topological group (it is a uniform semigroup) its topology is generated by a family of quasi-norms. Quasi-norm is a functional defined on a group S such that it satisfies the conditions: (i) |0| = 0; (ii) |jc| = |—*|; (iii) \x + y\ < 1*1+ M. Let (JCiy)i.yeN be an infinite matrix (indexed by ? ? ?) with entries in S. For each sequence {xj]jsm in S and each /cN, write Vye/ 7 \е/П[|.п] 7
Convergence theorems for set functions 161 Corollary 5.7 (Mikusinski-Antosik-Pap Diagonal Theorem). Suppose (i) linwoo/(*,-,-) = 0(/eN). Then there exists an infinite set I and a set J С I such that (a) ?/e/ f(x'j) < oo (/' e N), and (b) П%^хф>ПхнI2ЦЫ). PROOF. By (i), we may suppose that the U,;),.yeR, satisfy ?,??/<-*<'./') < °° f°r eaun /' e N. Define (?) = /(??,? (i€N,?cN). Then each w, is a continuous (hence exhaustive) triangular set function on V(N). If / С N is any set such that w,(/\{/'}) < /и,-({/})/2 whenever/ e /, then /' e / implies т/(/)^т,({/})-т,(/\{/})^т,({/})/2. Therefore the desired result follows by Diagonal Theorem 5.3 for triangular set functions with Si = mi ({/ })/2 and a = 0. ? Let G be a quasi-normed group, i.e, a commutative group endowed with a quasi- norm | · |. Corollary 5.8 (Antosik Diagonal Theorem). Let ?, ? e [0, oof, and suppose that (i) limy^ooU,y| = 0(/eN),_ (ii) for each finite set F С ?, limy-^ | ?/??^?'? ^ ?· and that each principle infinite submatrix of(xu)l, yepj //as a further principle submatrix („v/y) smc// //га/ (iii) Шп^зо??^?у/;Кути «i l'-»00 PROOF. Taking infinite submatrix of (-X/y)/.yeN- we may suppose that ? у era l-^/y I < °° for each /' e N and lim,-^^ \хц\ exists in [0, oo]. Let m-, be the continuous triangular set function on T-4N) defined as in the proof of last theorem. Fix any ?' > ?. Then (ii) implies that there exists no finite set У с N such that irij(J) ^ ?' for infinitely many i e N. So, by Diagonal Theorem 5.3 with a = 0 and Si = ?', there exists an infinite set / с N such that /и,-(/\{/}) < ?' f°r each i e /. Then, /' e /' с / implies \xii\=mi({i})^mi{l')+mi(l'\{i})<mi(l')+P'. Then by (iii) we obtain that lim \хц \??+?'. ? As an immediate consequence we obtain a Diagonal Theorem (which we will denote by DT) given in the next theorem.
162 P. de Lucia and ? Pap THEOREM 5.9. Let (G, | · |) be a quasi-normed group, and(xij)i,j€n an infinite matrix in G such that for every increasing sequence [kj }, <=t; in N there exists a subsequence {и/ },-epj °/{^iheN such that DC lim jc„,„ =0 for all j e N. and lim / .v,,,„ =0. i —> ОС 1 —> ЭС i-~l 7 = 1 Then lim,-^ ос хц = 0. As a consequence we obtain the Basic Matrix Theorem. THEOREM 5.10. Let (G, | · |) be a quasi-normed group, and (.viy)/.yen an infinite matrix in G such that lim Xi. = Xj (I) I—>OC exists for all j e N and for every increasing sequence {шу}уек /и ? ?/геге eiisii ? subsequence {иу}уем o/{wy}yeN i/zc/г ?/??? {?/1? ·??"; )iet·: '5 ? Cauchy sequence. Then liirii^jcxtj = Xj. uniformly with respect to j e N. PROOF. If we suppose that the theorem is not true, then there exist ? > 0 and a subsequence {s,}, ем of natural numbers such that sup \xSlJ —x} | > ?. By (I) we can choose sequences {uheN and {улЬек such that the matrix (ya-.s)л-.лег; for }\s = -*/u\ - Xh+u< satisfies the conditions of DT, but |vu| > ? (к e ?) (taking eventually a submatrix). Contradiction. ? There are many other types of diagonal theorems (see Antosik A971, 1978), Pap A982), Swartz A996)). There are many different applications of diagonal theorems in measure theory and functional analysis (see Pap A982, 1985a, 1985b), Antosik and Swartz A985), Pap A995), Swartz A996)) as Banach-Steinhaus theorem, Banach uniform boundedness theorem, Bourbaki theorem on joint continuity, Orlicz-Pettis theorem, Schur lemma, Phillips lemma, Swartz kernel theorem, Adjoint theorem, Closed Graph Theorem, etc. 5.3. SCP and SIP algebras THEOREM 5.11. Let ? satisfy SCP and [m„}ll€i;, m„ :A -> [0, +oo[ (/; e N), be a sequence of k-triangular exhaustive set functions. If exists lim m„(E) = mo(E) n—>oc for each ? e ? and wo is exhaustive, then {тп}^={) is uniformly exhaustive and wo is k-triangular.
Comergence theorems for set functions 163 Using Theorem 5.11 we can obtain more general statement, see Pap A987). THEOREM 5.12. All assumptions and statement as in Theorem 5.11 only the algebra ? satisfy SIP instead of SCR 6. Dieudonne type theorems We shall present a generalized Dieudonne type theorem A951) for the class of triangular set functions, see Pap A986). Let ? be a locally compact space. Then К and ? denote the family of all compact and open sets, respectively. Definition 6.1. Let Я be a ring of subsets of T. A set function m :TZ^]-oo, +oo[ is regular if for every AeTZ and every number ? > 0 there exist a compact set К с A and an open set V D A such that for every set В e 1Z with В с V \ К we have \m{B)\ < ?. To the rest of this section let m be always non-negative monotone set function. Then a set function m is regular if for each set A e В and each ? > 0 there exist К е К and VeO such that К с А с V and m(V\/0 <?. Remark 6.2. Obviously, that a non-negative monotone set function m is regular iff inf{w(V\^): KcAcV. К е fC, VeO}=0 for each set A e B. PROPOSITION 6.3. Regular positive monotone set function m is O-exhaustive, i.e., lim m(O„)=0 n—>oc for each sequence {Оп}п€ц of open sets from О which are pairwise disjoint. Proof. Since ЭС U 0„ e О and m is regular, we have that for any ? > 0 there exists a compact set К such that ЭС / ЭС \ KG [JO,, and m j (J 0„ \ К 1 < ?.
164 P. de Lucia and E. Pap {?,???? is an open cover of К and therefore exists /?o e N such that К с U"li ^»· The monotonicity of m implies m(Ok) < m ? Oi. U ( Q 0„ \ К j j ^ in ( Q 0„ \ ?? J for it > no + 1. ? 6.1. Triangular set functions Let В be the collection of all Borel sets of a Hausdorff locally compact topological space T. The disjoint variation in of m is defined by in(A) = sup/ J2iei lw(A')l. '4 С Т, where the supremum sets such taken over all finite families {Д }ie/ of pairwise disjoint sets such that Uie/ A = ?, see Pap A995). It is obvious that a triangular set function m with regular variation is itself regular. We formulate the main theorem. THEOREM 6.4. Let ? be a family of triangular set functions defined on В with regular disjoint variations. If the set {m{0): m e M} is bounded for every open set O, then {m(By. me ?, Be В] is a bounded set. We shall assume in the following proofs that ? is a compact Hausdorff space. Namely, we can replace ? with an Alexandrov one point ? compactification ? U {?}, taking m(a>) =0 (m e ?4). COROLLARY 6.5. Let ? be a family of regular scalar measures defined on B. If the set {\m@)\: m e ?4] is bounded for every open set O, then {\m(B)\: meM. В е В) is a bounded set. PROOF. Let v(B) = \m(B)\ (B e B, m e M). It is obvious that the family of all such set functions ? satisfies the conditions of Theorem 6.4 since ? = m is also regular (for real valued this follows from \m\ = m+ +m~ by Jordan decomposition and for complex valued from the inequality \m\ ^ |»i|| + |»i2|, where m =m\+imT). So we apply Theorem 6.4. D In the proof of Theorem 6.4 we shall use the following lemma.
Convergence theorems for set functions 165 LEMMA 6.6. Let m be a triangular set function defined on В with regular variation. Then m is ? -subadditive on each sequence of disjoint open sets {0„},ieN- '¦<'¦, @0 \ 00 PROOF. First we will prove that m is order continuous on open sets, i.e., for each sequence {tMneN of open sets such that Uj D Uj+\ (j e N) and Hyli ui = ^· we have lim m(Uj)=0. For each ? > 0 there exists a sequence of compact sets {?„b<eN such that Kj С Uj and m(Uj\Kj)<e/2J (j e N). B6) Then there exists «o e N such that ?'/=? Kj = 0 f°r aH n ^ "o· Let и > no- Then we have ? m(U„) = mlu„\f>\Kj\=ml \J(U„ \ Kj) 1 < m I |J(i/,,\^))· Hence, since m is subadditive (i.e., m(A U S) ^ w(A) + w(B) for every pair A, B of not necessarily disjoint sets from B), and nondecreasing. we obtain by B6) ? m(U„)^Yim(Uj\Kj)<e 7 = 1 for all ? ^ «o- Now, let {0„}„e^ be a sequence of disjoint open sets. Then we have (ОС \ /7 / 'yZ \ U°mE^)+m U °j\ 7 = 1 / y=l \y=H+l / Taking n^oowe obtain / ОС \ ОС (ОС 7=1 / 7=1 Proof of Theorem 6.4. It suffices to prove that every point in ? belongs to an open set О on which holds sup{m(A): А с О (A e B). m eM] < oo. B7)
166 P. de Lucia and E. Pap Suppose that this is not true. Then there exists a point ? e ? such that B7) does not hold for every open set О such that xeO. We shall prove that there exists a sequence of pairwise disjoint open sets {?/,}„ещ andasequence {/и„}нег; fromAI such that №,(?;) > /' (/' e N). For any open set О such that ? e О there exists a Borel set ? с О and m \ e ? such that W|(B)>4 + 2 sup m({x}). B8) It is easy to prove that the preceding supremum is finite. Since m \ has regular variation, there exists a compact set К с В and an open set О' С О, В с О' such that m\(B') < 1 for each В' с О' \ К. We have by the subadditivity of m \ ml(K)+ml(B\K)^m[(B). Using the preceding inequality, the inequality m \ (В \ K) < 1 and B8), we obtain mi(?)>3+2 sup w({jc}). Let ^i = KU{x]. Then the last inequality implies (directly for.? e К)Ъу the triangularity of m ? (for ? ? ?) that m\(K\)>3+ sup w({.v}). By the regularity of m \ there exists an open set U such that О D U Э K\ and m \ (B") < 1 for every B" cU\ K\. The preceding inequality together with the inequality mi(U)>mi(Ki)-mi(U\Ki) implies m\(U) >2+ sup m({x}). B9) Again by the regularity of m \ there exists an open set W such that {x} С W с U and w,(S'")<l C0) for every S'" с W\{;t}. _ _ Let ? be an open set such that ? e ? с Я с W (where H is the closure of the set H). Then we have m\{H)^ sup wi(A) +wi((.r}) ^ sup w i(S) + wi ({*}). ACtt\(.v| ВсИ'\(л-| Hence by C0) we obtain miG7) < 1 + sup iw({*}). C1)
Com'ergence theorems for set functions 167 Let E\ =U\~H. Then we have E\ С О and E\ ?77 = 0. By the inequality wi(?|)+w|G7) >wi(t/). B9) and C1) we obtain wi(?|) > 1. Using the preceding procedure, taking in the inequality B8) the constant 5 instead of 4 and taking into account the facts that ? e ? and the family ? is not bounded on H, we obtain open sets Ei, H\ (H\ с ?) and mj e ? such that ?? ? #? = 0, jc e H\ and ???(??) > 2. We have ?| П ?2 = 0· Continuing this procedure we obtain a sequence {w,-},-ei.; from ?4 and a sequence {?,},ef: of pairwise disjoint open sets such that «?,(?,)>/ (ieN). C2) We shall prove that m, (i e N) are exhaustive on the sequence {?„}„en of disjoint open sets, i.e., lim mj(Ej)=0 (i e N). C3) У — ЗС Since U/li ^У i-s an °Pen set and "»/ are regular, for ? > 0 there exists a compact set К' с UJli Ej such that w,(C) < ? for each / e N and each С С UjLi ?/ \ *'¦ Since {E/lyeN is an open cover of K\ there exists uo e N such that ?' С U"li Ej- Then we have for m ^ «o + 1 mj(E,„) ? supm,(C') ^ supw,(C) < ? (/' e N) с с where С С Em U ((J!, Ey \ K') and С с Uyli Ej \ K'¦ So we obtain <33)· Let m xn = (Ej)/y/i fori ? j, 4 JO for/=;·. We have by C3) lim,^^ ,vi; =0 (/' e N). We obtain by the boundedness assumption of the theorem that lim,^^ xt] = 0 (j e N). Applying Diagonal Theorem 3.2 for the infinite matrix (Xij)i.jeN we obtain a sequence {/„},ie?; from N such that DC 11тЕ*<л=о C4) k = \ for every ? e N. Using the triangularity of пц„ (? e ?) and Lemma 6.6 we obtain C0 \ DC (J Eik I > min (?,„) - ? '"'" (?'^) <" e N)· )t = l / )t=l. Ып
168 P. de Lucia and E. Pap Hence by C2) , / 00 \ . . DC V'^1 min I |J Eik 1 > y/i~] mhl(Eiu)- ??? ]? min(Ek ) U-=l I к=\.кфп Let ? -> oo. Then by C4) we obtain V'^VJ U?iJ ^°° but since Ut^i ?;* is an °Pen set we obtain a contradiction with the boundedness of {mi„ IneN on open sets. ? Let S be endowed with a pseudometric d which satisfies the inequality B5). Now we can extend the definition of the difference regularity of a set function ?: ? -> S taking only ? andd(v(A), ?(?')) < ? instead of m and \m{A) — m(A')\ < ?, respectively. Now we define the variation ? of a set function ?: ?? -> S with i>@) = 0 in the following way: v(E)=sup]T/(v(A)) (???). Лея where the supremum is taken over all partitions ? of ? into a finite number of pairwise disjoint members of ??. It is easy to see that ? is superadditive. THEOREM 6.7. Let ? be a family of semigroup valued triangular set functions with difference regular variations defined on B. If the set [f(v@)): v e M] is bounded for every open set ?, then \f(v(B)): veM, BeB] is a bounded set. PROOF. We take m(B) = f(v(B)) (B eB.veM) and we apply Theorem 6.4. D 6.2. Convergence theorem THEOREM 6.8. Let [тп}„€^ be a sequence of k-triangular set functions defined on В with regular disjoint variations. If lim m„@) =mo@) n—>oc
Convergence theorems for set functions 169 exists for all open sets О and wo is exhaustive with respect to the family О of all open sets, then {mn}^_0 is uniformly exhaustive on the whole O. PROOF. We have to prove that jm,,}^,, is uniformly exhaustive on the family О of all open sets in T. We note that m, (/' e N) are exhaustive on the family O, i.e., if {?n}neN is a sequence of disjoint open sets then lim m,(?.)=0 (/ eN). у— oc This follows by the regularity of w, (see the proof of Theorem 6.4). Suppose that theorem is not true. Then there exist ? > Oand a disjoint sequence {Еп}„€щ of open sets such that m„(E„)>2s (neN). Then repeating the proof of Theorem 5.11 (Pap A987)), taking Lemma 6.6 and the fact that arbitrary union of open set is again open set, we obtain that {m„}„€p- is uniformly exhaustive on the family O. ? The assumption on the regularity of variation is not so restrictive, because it turns out that in the case of regular additive set functions (then they must be also regular measures) variations are also regular. So we have as a consequence the classical Dieudonne's theorem. COROLLARY 6.9. Let {т,,}„€щ be a sequence of regular scalar measures defined on B. If lim m„@) = mo(O) 11 —>DC exists for all open sets O, then {w„}„ef. is uniformly exhaustive on the whole В and mo is exhaustive. PROOF. Let v„(B) = \m„(B)\ (B e ?, ? e N). The sequence {vn}„eK satisfies the conditions of Theorem 6.8, since |v| = \m \ is also regular. Applying Theorem 6.8 we obtain that {w„}„eK is uniformly exhaustive on O. ? Instead of the family of open sets it can be considered a general class so-called Wells class, KupkaA980). 7. Convergences of measures related to partial order 7.1. Nikodym type theorems for lattice-valued measures Let X be a real vector lattice (see Luxemburg and Zaanen A971)). If и is a positive element of X, then a sequence {.х„ЬеК in X is м-convergent to ? e X if there exists a scalar
170 P. de Lucia and E. Pap sequence {t„]ne^ such that t„ -> 0 and \x„ — x\ ^ t„u for all ? e N. A sequence {х,,},1€® is relatively uniformly convergent to ? e X if {jc„},,epj is м-convergent for some и е X. The sequence is relatively uniformly ^-convergent to ? e X if every subsequence has a subsequence which is relatively convergent to ?, and then we shall write lim,,^ ? xn = x (we shall consider here only such convergence). We say that X has the property (Y) (Antosik and Swartz A992)) if: (Y) If (xij ),-.yeN is an infinite matrix in X such that lim,--^ Xij = 0 for each jeN and limj-*ooXij = 0 for each /' e N, then there exists a principal submatrix (>·,,),., eN of (xij)i.jeN such that lim/^эс X\-e/li v</ = 0 for any sequence {A,},^ of finite subsets of N with /' ? A,. Example 7.1. The following vector lattices X satisfy (Y). (a) X is complete metrizable topological vector lattice. (b) X has the property ? (Luxemburg and Zaanen A971,70.1)). Now we have the following version of Nikodym convergence theorem, Antosik and Swartz A992). THEOREM 7.2. Let X be with the property (Y). Let ?„ : ? -> X be countable additive measures such that lim ?„(?) = ?(?). ? e ?. ?—>эс exists. Then (i) {^nbieN converges to a countable additive measure ?, (ii) {/x„L?i=n is uniformly ?-additive. We have the following version of Nikodym boundedness theorem, Swartz A989c). THEOREM 7.3. Let X be Archimedean and have an order unit u. Let ??:? -> ?, ? e /. be a family of finitely additive, order bounded set functions. If {?,(?): i e /} is order bounded for each ? e ?, then {?,(?): i e /, ? e ?] is order bounded. 7.2. Convergences of measures on orthomodular posets Nikodym convergence and boundedness theorems are extended for measures defined on orthomodular posets (see d'Andrea and de Lucia A991), de Lucia and Morales A988), Guarigla A990, 1991)). Theorem 5.12 was used in the proof of the following theorem (Guariglia, 1991). THEOREM 7.4. Let L be an orthomodular lattice with the (SIP), {w„}„er; a sequence of k- triangular and exhaustive functions from L to [0. +oo[ such that ???,,^+,? m„ (a) = mo(a) for every a eh and wo is an exhaustive function then mo is k-triangular and [m„}ll€^' is uniformly exhaustive.
Convergence theorems for set functions 171 7.3. General Nikodym type theorems Chovanec and Корка A994) have introduced the notion of difference poset (Z)-poset) equivalent with the notion of effect algebra introduced by Foulis and Bennett A994), see for more details the chapter written by Dvurecenskij B002). Definition 7.5. A D-poset (difference poset) is a partially ordered set L with a partial ordering ^, maximal element 1, and with a partial binary operation 9:LxL^L, called difference, such that, for a, b e L, b ? a is defined if and only if a ^ b, for that the following axioms hold for a, b, с е L: (DP|) bQa^b; (DP2) bQ{bQa)=a\ (DP3) a^b^c =3- cQb^cQa and (c ? ?) ? (с ? b)=bQ a. The previous axioms implies that there exists also a minimal element 0 (= 1 ? 1)· For an arbitrary but fixed element a e L we define a± = 1??. We have: (i) ?±?- = ?\ (ii) a^b =>· bL ^aL. The elements a and b from L are orthogonal iff a ^ bL (or b ^ a±). We define a partial binary operation ?: L ? L -> L for orthogonal elements ? and b such that b ^a®b and ? = (? ? b) ? b. This operation ? is commutative and associative. Let {? ?,..., a„} с L. We define ???···??,, =0 for«=0. ? ? ? · · · ? ?„ = a \ for ? = 1. ? ? ?···??„ =(?? ?···??„-?)??„ for» ^3. supposing that ? ? ? · · · ? ?„_ ? and ? ? ? · · · ? ?„ exist in L. We have DEHNITION 7.6. A finite subset {a\ a,,} of L is ©-orthogonalif a\ ? · -·??„ exists in L. We say that an ©-orthogonal subset {? ? a„} of L has an ©-sum, ?"=, ?,. defined by /? ??; =?? ?··??„. ? = ? The preceding ©-sum is independent of any permutation of elements. Definition 7.7. A subset G of L is ©-orthogonal if every finite subset F of G is ©- orthogonal.
172 P. de Lucia and E. Pap We say that an ?-orthogonal subset G = {a,: i e /} of L has an ©-sum in L, ф,е/ at if in L there exists the join ? ?, = sup ? ? щ: F finite subset of / i€l ^i?F Every subset of ?-orthogonal set is again ?-orthogonal. Dehnition 7.8. An D-poset L is complete D-poset (a(©)-D-poset) if, for every ©- orthogonal subset (every countable ?-orthogonal subset) G of L, there exists the ?-sum in L. Dehnition 7.9. A D-poset L is quasi-?-complete if for every ?-orthogonal sequence {a;}ieN m L there exists a subsequence {а,},ем such that ф(е/ ?( e L for each / с М. Let ? be an uniform space with the uniformity U. Definition 7.10. A subset ? of У is bounded (^-bounded) if for every U eU there exist a finite set К с В and a natural number ? such that BcU"[K], where t/1 = U, U" = U ? ?/"-1 (o-composition of the relations) and U[K] is the set of all ? e ? such that (x, y) e U for some у е К. A subset ? of a metrisable uniform space {Y,U) is ^-bounded if and only if it is d -bounded for every metric d generating the same uniformity U. The following known characterization of W-boundedness will be often used. THEOREM 7.11. A set В С ? is U-bounded if and only if it is d-bounded for every uniformly continuous pseudo-metric d defined on Y. We denote with Vm the family of all uniformly continuous pseudometrics defined on (УМ). Let L be a quasi-?-D-poset. Dehnition 7.12. For d e Vm the d-semivariation of a function m-.L-* Y. with respect to a point x0 & ? is m*"(b) = sup{d(m(c), xn): с ? b. с e L} (be L). We define for d e Vm and x„ еУ,а function m:L·—* ? aj"(a, m) =\imsup\d[m(a ? b).x„): m*,"(b)< -. b 11^ ОС [ " eL (oeL).
Convergence theorems for set fimctions 173 Dehnition 7.13. A function m : L -> ? is said to be x„-exhaustive, for x„ e Y, if for each d e Vm, lim d(m(an),x0) = 0 /l-»00 for each ?-orthogonal sequence {a„ }„е^ of elements from L. We have the following general Nikodym boundedness type theorem (de Lucia and Pap A995); see Pap A995)). THEOREM 7.14. Let ? be a family of xo-exhaustive functions m:L-> Y, where L is a quasi-?-D-poset and ? is a uniform space. Then the set \m{a): m e M, a eh] is U-bounded if and only if the following conditions hold (i) For each d e Vm, m e M. and each r e N there exists s(r) e N such that d(m(a), m{b)) > s(r) implies either for b ^ a d(m{a Qb),xo) > r. or for a ^ b d(m{b ??),??) > r: (ii) For each d eVm the set {ay (a, m): m e ?4] is bounded for each a e L; (iii) For each d e Vm {а(т(ац),хо): m e ?, ? e N} is bounded for every orthogonal sequence {a„ }/ieN from L. For a special important case, orthomodular lattice, we can relax the conditions in the previous theorem. THEOREM 7.15. Let L be ? ? -orthomodular lattice. Let ? be a family of functions m:L^> Y. Then the set \m{a): m e M, a eL) is U-bounded if and only if the following conditions hold: (i) For each m e ? and each r e N there exists s(r) e N such that for each a, b e L d(m(a),m(b))>s(r) (d eVm)
174 P. de Lucia and E. Pap implies either mx/{{b ? a')') > r or m'j0{(a ? b')') > r; (ii) For eachd e Vm theset {m{an): m e ?, ? e N} is d-boundedfor every orthogonal sequence {ап)Н€щ from L. We have the following general Nikodym convergence type theorem (de Lucia and Pap A994a); see Pap A995)). THEOREM 7.16. Let ? be an uniform space and L a quasi-?-complete D-poset. Let {"biLieN be a sequence of ?,,-exhaustive functions m,,, m„ :L -> Y, for an element x„ from Y, such that they satisfy the following conditions (i) for each d e Vm and for each ? > 0, there exists ? > 0 such that d(mn(a),x() < S andd(m,i(b),x„) < S for a ^b, a,b eL (n e N) implies d(m„(b ? a), x(>) < ?'. (ii) for each d e Vm and for each S > 0, there exists ? > 0 such that d(mn(a),x„) <?, ueL(«eN) implies aXJ' (a, m„) < S (n e N); (iii) for eachd e Vm we have \imn^ocd{mn{a),m{a)) =0 for each a e L. Then m is ?,,-exhaustive if and only ifmn (n e N) are uniformly ?,,-exhaustive. We remark that Dvurecenskij A996) have obtained a Nikodym boundedness type theorem for a family of completely additive measures defined on the system L(H) of all closed subspaces of a real or complex infinite-dimensional Hilbert space ?, see the chapter written by Dvurecenskij B002). References Abraham, P. A992), Saeki's improvement of the Vitali-Hahn-Saks-Nikodym theorem holds precisely for Banach spaces having cotype. Proc. Amer. Math. Soc. 116. 171-179. Akcouglu, M. A. and Sucheston. L. A978). A ratio theorem for superadditive processes. Z. Wahrscheinlichkeitsth. verw. Geb. 44, 269-278. Aleksjuk, V.N. A968), Two theorems on existence of quasibase for a family of quasimeasures. Izv. Vyssh. Uchebn. Zaved.. No. 6 G3), 11-18 (in Russian). Aleksjuk, V.N. A970), On the weak compactness of a family of quasimeasures, Siberian Math. J. 11. 723-738, (in Russian). Ando, T. A961), Convergent sequences of finitely additive measures. Pacific J. Math. 11, 395-404. Antosik, P. A971), On the Mikusinski diagonal theorem. Bull. Acad. Polon. Sci. Ser. Math. Astronom. Phys. 19. 305-310. Antosik, P. A976), A diagonal theorem for nonnegative matrices. Bull. Acad. Polon. Sci. Ser. Math. Astronom. Phys. 24. 955-959. Antosik, P. A978). Sur les suites d'applications. С R. Acad. Sci. Paris A 287. 75-77.
Convergence theorems for set functions 175 Antosik. P. and Pap. E. A981), ? simplification of the proof of Rosenthal's lemma for measures on fields. Proc. of the Conference on Convergence, Szczyrk. 25-30. Antosik, P. and Saeki, S. A995). A lemma on set functions and its applications. Different Aspects of Differentiability. Warsaw, 1993. Dissertationes Math. 340, 13-21. Antosik. P. and Swartz, C. A985), Matrix Methods in Analysis. Lecture Notes in Math.. Vol. 1113, Springer, Heidelberg. Antosik, P. and Swartz, C. A992). The Nikodym convergence theorem for lattice-valued measures. Rev. Roumaine Math. Pures Appl. 37, 299-306. Berberian, S.K. A965), Measure and Integration. New York. London. Bhaskara Rao. K.P.S.. and Bhaskara Rao, M. A983). Theory of Charges. Academic Press. London. Birkhoff, G. A967), Lattice Theory. 3rd edn.. Amer. Math. Soc. Colloq. Publ.. Vol. 25, Amer. Math. Soc., Providence, RI. Brooks, J.K. A969), On the Vitali-Hahn-Saks and Nikodym theorems. Proc. Nat. Acad. Sci. USA 64, 468^471. Brooks, J.K. A973), Equicontinuous sets of measures and applications to Vitali's integral convergence theorem and control measures. Adv. Math. 10. 165-171. Brooks. J.K. A974). Sur les suites uniformement convergentes dans un espace de Banach. C. R. Acad. Sci. Paris A 274, 1037-1040. Brooks, J.K. A980). On a theorem of Dieudonne. Adv. Math. 36. 165-168. Brooks. J.K. and Dinculeanu, N. A995), Integration in Banach spaces. Application to stochastic integration. Atti Sem. Mat. Fis. Univ. Modena 43. 317-361. Brooks. J.K. and Chacon, R.V A980), Continuity and compactness of measures. Adv. Math. 37, 16-26. Brooks, J.K. and Jewett, R.S. A970). On finitely additive vector measures. Proc. Nat. Acad. Sci. USA 67, 1294— 1298. Cafiero, F. A952), Sulle famiglie difunzioni additive d'insieme nnifonnemente continue. Rend. Accad. Lincei (8) 12, 155-162. Chang, C.C. A958), Algebraic analysis of many valued logics. Trans. Amer. Math. Soc., 467^490. Cook, A.T. A978). The Nikodym-Hahn-Vitali-Saks theorem for states on a quantum logic, Proc. of Mathematical Foundations of Quantum Theory. Academic Press, New York. Constantinescu, С A981), On Nikodym's boundedness theorem, Libertas Math. 1, 51-73. Constantinescu, С A984), Spaces of Measures. Walter de Gruyter, Berlin. Constantinescu, С A991), Some properties of spaces of measures. Atti Sem. Mat. Fis. Univ. Modena Suppl. 35, 1-286. Darst, R.B. A967), On a theorem of Nikodym with applications to weak convergence and von Neumann algebras. Pacific J. Math. 23, 1297-1298, 473^477. Darst, R.B. A970), The Vitali-Hahn-Saks and Nikodym theorems for additive set functions. Bull. Amer. Math. Soc. 76, 1297-1298, 123-138. Darst, R.B. A973), The Vitali-Hahn-Saks and Nikodym theorems for additive set functions. Bull. Amer. Math. Soc. 79, 758-760. Dashiell, F.K. A981), Non weakly compact operators for order-Cauchy complete C(S) lattices, with applications to Baire classes. Trans. Amer. Math. Soc. 266, 397^413. d'Andrea, A.B. and de Lucia, P. A991), The Brooks-Jewett theorem on an orthomodular lattice. J. Math. Anal. Appl. 154, 507-522. d'Andrea, A.B.. de Lucia, P. and Morales, P. A991), The Lebesgue decomposition theorem and the Nikodym convergence theorem on an orthomodular poset, Atti Sem. Mat. Fis. Univ. Modena 34, 137-158. de Lucia, P. A985), Funzioni Finitamente Additive a Valori in un Gruppo Topologico, Pitagora Editrice, Bologna, de Lucia, P. A998), Convergence theorems in orthomodular posets. Atti Sem. Mat. Fis. Univ. Modena Suppl. 46, 171-179. de Lucia, P. and Morales, P. A986). Equivalence of Brooks-Jewett. Vitali-Hahn-Saks and Nikodym convergence theorems for uniform semigroup-valued additive functions on a Boolean ring, Ricerche Mat. 35, 75-87. de Lucia, P. and Morales, P. A988), Some consequences of the Brooks-Jewett theorem for additive uniform semigroup valued functions, Conf. Sem. Mat. Univ. Bari 227. 1-23. de Lucia, P. and Morales, P. A992), A noncommutative version of a theorem of Marczewski for submeasures. Studia Math. 101 B), 123-138.
176 P. de Lucia and E. Pap de Lucia, P. and Morales. P. A994), Noncommutative decomposition theorems in Riesz spaces. Proc. Amer. Math. Soc. 120, 193-202. de Lucia, P. and Pap, E. A994a), Nikodvm convergence theorem for uniform space valued functions defined on D-poset. Math. Slovaca 45. 367-376. de Lucia, P. and Pap, E. A994b), Diagonal theorems and their applications. Preprint. de Lucia, P. and Pap, E. A995), Noncommutative version of Nikodvm boundedness theorem for uniform space- valued functions. Intemat. J. Theoret. Phys. 34. 981-993. de Lucia, P. and Traynor. T. A994), Non-commutative group valued measures on an orthomodular poset. Math. Japon. 40, 309-315. de Lucia, P. and Salvati, S. A996), A Cafiero characterization of uniform s-boundedness. Rend. Circ. Mat. Palermo B) Suppl. 40, 121-128. de Maria, L.J., and Morales. P. A994), A non-commutative version of the Nikodvm boundedness theorem, Atti. Sem. Mat. Fis. Univ. Modena 42, 505-517. Diestel, J.J. and Uhl, J.J. Jr A977), Vector Measures. Math. Surveys. Vol. 15. Amer. Math. Soc. Providence. RI. Dieudonne, J. A951), Sur la convergence des suites de measures de Radon. Anais Acad. Brasil. Ci. 23, 21-38, 277-281. Dinculeanu, N. A967), Vector Measures. Pergamon Press. New York. Dinculeanu, N. B002), Vector integration in Banach spaces and application to stochastic integration. Handbook in Measure Theory. E. Pap, ed., Elsevier, Amsterdam. 345-399. Dobrakov, I. A972). On subadditive operators on Q,(T). Bull. Acad. Polon. Sci. 20, 561-562. Dobrakov, I. A974), On submeasures -1, Dissertationes Math. 112. Dobrakov, I. and Farkova, J. A980), On submeasures II. Math. Slovaka 30, 65-81. Drewnowski, L. A972a), Topological rings of sets, continuous set functions, integration, I, II, III. Bull. Acad. Polon. Sci. Ser. Math. Astronom. Phys. 20, 269-276. 277-286. 439-445. Drewnowski, L. A972b), Equivalence of Brooks-Jelwett, Vitali-Hahn-Saks and Nikodym theorems. Bull. Acad. Polon. Ser. Sci. Math. Astronom. Phys. 20, 725-731. Drewnowski, L. A978). On the continuity of certain non-additive set functions. Colloq. Math. 38, 243-253. Dunford, N. and Schwartz. J.T. A958). Linear Operators, Part I. Interscience. New York. Dvurecenskij, A. A991), Regular measures and completeness of inner product spaces. Contributions to General Algebras, Vol. 7, Holder-Pichler-Tempski Verlag, 137-147. Dvurecenskij, A. A993), Gleason 's Theorem and Applications. Kluwer Academic, Dordrecht. Ister Science Press. Bratislava. Dvurecenskij, A. A996), A note on the Nikodvm boundedness theorem. Tatra Mt. Math. Publ. 8, 229-238. Dvurecenskij, A. B002), Measures on quantum algebras. Handbook of Measure Theory, E. Pap, ed.. Elsevier, Amterdam, 827-868. Faires, B. A974), On Vitali-Hahn-Saks type theorems. Bull. Amer. Math. Soc. 80, 670-674. Faires, B. A976), On Vitali-Hahn-Saks-Nikodym type theorems. Ann. Inst. Fourier (Grenoble) 26, 99-114. Foulis, D.J. and Bennett M.K. A994), Effect algebras and unsharp quantum logics. Found. Phys. 24. 1325-1346. Freniche, F.J. A984), The Vitali-Hahn-Saks theorem for Boolean algebras with the subsequential interpolation property, Proc. Amer. Math. Soc. 92, 262-266. Garcia, F. A997), Convergence theorems for topological group valued measures on effect algebras. Depart- tamento de Matematica Aplicada, E.U. Informatica, Universidad Politecnica de Madrid, Technical Report, 1-15. Grothendieck, A. A953), Sur les applications lineares faiblement compactes d'espaces du type C(K). Canad. J. Math. 5, 129-173. Grothendieck, A. A957), Un resultat sur le dual d'une C*-algebre. J. Math. Pures Appl. (A) 36, 97-108. Guariglia, E. A990a), On Dieudonne's boundedness theorem. J. Math. Anal. Appl. 145, 447^454. Guariglia, E. A990b), Uniform boundedness theorems for k-triangular set functions. Acta Sci. Math. 54, 391- 407. Guariglia, E. A991), k-triangular functions on an orthomodular lattice and the Brooks-Jewett theorem. Radovi Mat. 7, 241-251. Guselnikov, N.S. A975). On the extension of quasi-Lipshitz set functions. Mat. Zametki 17, 21-31 (in Russian). Habil, E.D. A993), Orthoalgebras and noncommutative measure theory. Ph.D. Thesis, Manhattan. KS. Hahn, H. A922), Ober Folgen linearer Operationen, Monatsh. Math, und Phys. 32. 3-88.
Convergence theorems for set functions 177 Halmos, PR. A950), Measure Theory, Springer, New York. Haydon, R. A981), A non reflexive Grothendieck space that does not contain l^_. Israel J. Math 40, 65-73. Hejcman, H. A959), Boundedness in uniforni spaces and topological groups, Czehoslovak Math. J. 9, 544-563. Iseki, K. and Tanaka, S. A978), An introduction to the theory of BCK-algebras, Math. Japon. 23, 1-26. Kalmbach, G. A983), Orthomodular Lattices, Academic Press, New York. Kalton, N.J. and Roberts, J.W. A989), Uniformly exhaustive submeasures and nearly additive set functions. Trans. Amer. Math. Soc. 278, 803-816. Kalton, N.J. and Montgomery-Smith, S.J. A993), Set-functions and factorization. Arch. Math. 61, 183-200. Klimkin, V.M. A989), Uniform boundedness of a family of nonadditive set functions. Mat. Sb. 180 C), 385-396 (Russian). Kli§, С A978), An example of noncomplete normed (K)-space, Bull. Acad. Polon. Sci. 26. 414-420. Корка, F. and Chovanec, F. A994), D-posets. Math. Slovaca 44, 21-34. Kupka, J. A974), A short proof and generalization of a measure theoretic disjointization lemma, Proc. Amer. Math. Soc. 45, 70-72. Kupka, J. A980), Uniform boundedness principles for regular Borel vector measures, J. Austral. Math. Soc. Ser. A 29, 206-218. Landers, D. and Rogge, L. A971a), The Vitali-Hahn-Saks and the uniform boundedness theorem in topological groups, Manuscripta Math. 4, 351-359. Landers, D. and Rogge, L. A971b). Equicontinuity and com-ergence of measures, Manuscripta Math. 5, 123-131. Luxemburg, W. and Zaanen, A. A971), Riesz Spaces. Vol. I, North-Holland, Amsterdam. Maharam, D. A947), An algebraic characterization of measure algebras, Ann. of Math. 2 D8), 154-167. Mikusinski, J. A970), A theorem on vector matrices and its applications in measure theory and functional analysis. Bull. Acad. Polon. Sci. Ser. Math. 18, 193-196. Molto, A. A981), On the Vitali-Hahn-Saks theorem, Proc. Roy. Soc. Edinburgh Sect. A 90, 163-173. Nikodym, O. A931), Sur les suites de functions parfaitement additives d'ensembles abstraits, C. R. Acad. Sci. Paris 192, 727. Nikodym, O. A933), Sur les suites convergentes de functions parfaitement additives d'ensembles abstraits, Monatsh. Math. 40, 427^432. Orlicz, W. A929), Beitrage zur Theorie der Orthogonalentwickliingn II, Studia Math. 1, 241-255. Orlicz, W. A968), On spaces L*^ based on the notion of a finitely additive integral, Comment. Math. 12, 99-113. Pap, E. A974), A generalization of the Diagonal theorem on a block-matrix. Mat. Vesnik 11 B6), 66-71. Pap, E. A976), Uniform boundedness of a family of exhaustive set functions. Mat. Vesnik 13 B8), 319-326. Pap, E. A982), Functional Analysis (Sequential convergences. Some principles of functional analysis), Novi Sad (with English Summary). Pap, E. A985a), Functional analysis with K-convergence, Proceedings of the Conference on Convergence, Bechyne, Czech, 1984, Akademie-Verlag, Berlin, 245-250. Pap, E. A985b), The adjoint operator and K-convergence, Univ. Novom Sadu Zb. Rad. Prirod.-Mat. Fak. Ser. Mat. 15B), 51-56. Pap, E. A986), A generalization of a theorem Dieudonne' for k-triangular set functions. Acta Sci. Math. 50, 159-167. Pap, E. A987), The Vitali-Hahn-Saks theorems for k-triangular set functions, Atti Sem. Mat. Fis. Univ. Modena 26,21-32. Pap, E. A988), Nikodym type theorem for metric valued XQ-exhaustive set functions, Univ. Novom Sadu Zb. Rad. Prirod.-Mat. Fak. Ser. Mat. 18, 101-109. Pap, E. A991a), On non-additive set functions, Atti Sem. Mat. Fis. Univ. Modena 39, 345-360. Pap, E. A991b), The Brooks-Jewett theorem for non-additive set functions, Univ. Novom Sadu Zb. Rad. Prirod.- Mat. Fak. Ser. Mat. 21 A), 75-82. Pap, E. A995), Null-Additive Set Functions, Kluwer Academic, Dordrecht. Ister Science Press, Bratislava. Pap, E. and Swartz, С A990a), The closed graph theorem for locallv convex spaces. Boll. Unione Math. Ital. G) 4-B, 109-111. Pap, E. and Swartz, С A990b), On the closed graph theorem, Proc. of the Conference on Generalised Functions and Convergence, Katowice 1988, World Scientific. Singapore, 355-360. Pap, E. and Swartz, С A994), A locally convex version of adjoint theorem. Univ. Novom Sadu Zb. Rad. Prirod.- Mat. Fak. Ser. Mat. 24 B), 63-68.
178 P. de Lucia and E. Pap Pap, E. and Swartz, C. A996), The adjoint theorem on ?-spaces. Novi Sad J. Math, (former Univ. Novom Sadu, Zb. Rad. Prirod.-Mat. Fak. Ser. Mat.) 26 A). 63-68. Pettis, B.J. A938), On Integration in vector spaces. Trans. Amer. Math. Soc. 49, 277-304. Phillips, R.S. A940), Integration in a convex linear topological space. Trans. Amer. Math. Soc. 47, 114-145. Preston. C.J. A971), A theory of capacities and its applications to some convergence results. Adv. Math. 6A). 78-106. Ptak, P. and Pulmannova, S. A991). Orthomodular Structures as Quantum Logics, Kluwer Academic, Dordrecht. Radon, J. A919), Uber partielle und totale Differentiziebarkeit von Fiinkionnen mehrerer Varieblen und uber die Transformation der Doppelintegrale, Math. Ann. 79. 340-359. Randall, С and Foulis, D. A981). Empirical logic and tensor products. Interpretations and Foundations of quantum Theory, Vol. 5, H. Neumann, ed.. Wissenschaftsverlag. Bibliographisches Institut. Mannheim, 9- 20. Rao, M.M. A987), Measure Theory and Integration. Academic Press. New York. Rickart, C.E. A942). Integration in a convex linear topological space. Trans. Amer. Math. Soc. 52. 498-521. Rickart. C.E. A943). Decomposition of additive set functions. Duke Math. J. 10, 653-665. Rosenthal, HP A968), On complemented and quasi-complemented subspaces ofС(S) for Stonian S, Proc. Nat. Acad. Sci. USA 60. 1165-1169. Saeki, S. A992), The Vitali-Hahn-Saks theorem and measuroids. Proc. Amer. Math. Soc. 114. 775-782. Saks, S. A933), Addition to the note on some functionals. Trans. Amer. Math. Soc. 35. 967-974. Sazenkov, A.N. A982), Uniform boundedness principle for topological measures. Mat. Zametki 31. 263-267 (in Russian). Schachermayer, W. A982). On some classical measure-theoretic theorems for non-sigma complete Boolean algebras, Dissertationes Math. 214. 1-33. Swartz, С A987), The closed graph theorem without category. Bull. Australian Math. Soc. 36, 283-288. Swartz, C. A989a), The uniform boundedness principle for order bounded operators. Intemat. J. Math. Math. Sci. 12 C), 487^492. Swartz, C. A989b), A generalization of Mackey's theorem, and the uniform boundedness principle. Bull. Australian Math. Soc. 49. 123-128. Swartz, C. A989c). The Nikodym boundedness theorem for lattice-valued measures. Arch. Math. 53. 390-393. Swartz, С A992), Introduction to Functional Analysis, Marcel Dekker, New York. Swartz, С A994). Measure, Integration and Function Spaces. World Scientific, River Edge, NJ. Swartz, С A996), Infinite Matrices and the Gliding Hump. World Scientific, River Edge. NJ. Traynor, T. A992), A diagonal theorem in non-commutative groups and its consequences, Ricerche Mat. 41. 77-87. Tweddle, I. A970), Unconditional convergence and vector-valued measures. J. London Math. Soc. 2. 603-610. Vitali. G. A905), Sulle funzioni integrali, Atti R. Accad. Sci. Torino 40. 753-766. Vitali, G. A907), Sull' integrazione per serie. Rend. Circolo Mat. di Palermo 23, 137-155. Wang, Z. A984), The autocontinuity of set function and the Fuzzy integral. J. Math. Anal. Appl. 99, 195-218. Weber. H. A976), Fortsetzung von Massen mit Werten in unifonnen Halbgruppen. Arch. Math. 27, 412^423. Weber, H. A984), Topological Boolean rings. Decomposition of finitely additive set functions. Pacific J. Math. 110.471^495. Weber, H. A986), Compactness in spaces of group valued contents, the Vitali-Hahn-Saks theorem and Nikodym's boundedness theorem. Rocky Mountain J. Math. 16. 253-275.
CHAPTER 5 Differentiation Brian S. Thomson Mathematics Department, Simon Fraser University. ВС, Canada V5A IS6 E-mail Thomson @ cs. sfii. ca Contents 1. Preface 181 2. Introduction 181 3. Differentiation in K" 183 4. Some motivation 184 5. Derivation bases 186 5.1. Covering relations 186 5.2. Derivation bases 187 5.3. The limit operation 188 5.4. The dual basis 190 5.5. Some properties of the dual 192 5.6. The variation 193 5.7. Growth estimates 194 5.8. Differentiation under strong Vitali assumptions 198 5.9. Topological considerations 200 5.10. Differential equivalence 202 6. Standard examples 203 6.1. A general construction 203 6.2. Derivation bases in a metric space 204 6.3. Measure contraction 204 6.4. Set contraction 205 6.5. Busemann-Feller bases 205 6.6. Federer's scheme 205 6.7. Net bases 206 6.8. Bases associated with a lifting 206 6.9. Self dual bases 207 7. Derivation bases in a metric space 207 7.1. Differentiation of the integral 208 7.2. The density property 212 8. Differentiation under strong Vitali assumptions 218 9. Strong Vitali conditions 222 9.1. Strong Vitali property 223 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V All rights reserved 179
180 B.S. Thomson 9.2. Classical proofs of the Vitali theorem 224 9.3. (Q)-Property 226 9.4. Besicovitch-Morse property 228 10. Weak Vitali covering properties 230 10.1. Weak halo properties 231 11. Derivation bases in К 231 11.1. The ordinary derivative 232 11.2. The symmetric derivative 237 11.3. The sharp derivative 237 11.4. Density bases on К 238 11.5. The approximate derivative 238 12. Derivation bases in R" 239 12.1. The cube basis 239 12.2. The basis of centered balls 240 12.3. The interval basis 240 12.4. Rectangle basis 241 12.5. Regular bases 242 12.6. Star bases 242 13. De La Vallee Poussin theorem 243 14. The Radon-Nikodym theorem 244 15. Some further remarks 244 References 245
Differentiation 181 1. Preface At the outset let us state the scope of this presentation. The topic is a large one, embracing many different, related ideas. We choose to focus only on the interaction between differentiation properties and covering properties. For example, all mention of the Hardy-Littlewood maximal operators and related inequalities are left out, although the reader should have little trouble finding numerous extensive accounts of such topics elsewhere, as they play a dominant role in many parts of modern analysis. The general presentation is limited to a sketch of an abstract framework within which the main ideas can be expressed, and an exploration of many of the main themes. Other authors have attempted to construct an elaborate abstract theory encompassing all the special applications. To this date none of this seems to have attracted the following needed to generate, what could be called, a "theory" of abstract differentiation. The monographs of Hayes and Pauc [31] and Kenyon and Morse [41], each in their own way, require a serious devotion to a viewpoint and an elaborate language in order to enter their universe. The merits of the monographs are thus easy to overlook. The former is accessible, with many rewards, but still requiring careful and detailed study that few might wish to commit to. The latter is apparently impenetrable. Thus there is no common structure that all would agree amounts to a theory of abstract differentiation in the way that most of us agree as to what constitutes metric space theory, topology, measure spaces, etc. The scheme offered here is meant to communicate the essentials of the various themes that have emerged in this area, but may well, too, find few followers who are willing to devote the necessary effort to mastering the terminology. In Sections 2-4 we give some motivating discussion. In Section 5 a general framework with the main concepts is introduced. Section 6 gives a number of instances of this general theory and how it may be applied. Sections 7 and 8 shows how the general theory may be applied to obtain the natural differentiation results desired in the theory. Section 9 discusses the strong Vitali property and standard methods for obtaining it. Sections 10 and 11 discuss some of the main derivation bases in R and R". 2. Introduction The fundamental theorem of the calculus, presented to elementary students, relates integration and differentiation as inverse processes. Specifically, in the setting of the Riemann integral, one proves that for any integrable function /, -f- f f(t)dt = f(x) A) ax Ja at any point of continuity of /, and for any function / with an integrable derivative /', h f\t)dt = f(b)-f(a). B) L
182 B.S. Thomson These two results are known collectively as the fundamental theorem of calculus and are proved by entirely elementary means. Much more remarkable and much deeper is the version for more advanced integrals. In Lebesgue's theory one shows that for any integrable function /, — / f(t)dt = f(x) C) dx Ja at almost every point jc, even for badly discontinuous functions. The integrability of the derivative /' in general requires going beyond Lebesgue's theory (there are derivatives that are not integrable) and leads to the integration theories associated with the names Denjoy, Perron, Henstock and Kurzweil. If we express this in the correct form the path to a generalization outside the setting of the real line suggests itself. The derivative statement in C) can be visualized, directly from its definition, as asserting that I'm- / f(t)dt = f(.x) a.e. More picturesquely, by thinking of / = [?, ? + h] or [x + h, x] as an interval whose length |/| = \h\ shrinks to zero, we could write C) in the more suggestive form lim — / '-* I/I J ? f(t)dt = f(x) a.e.. where / -> ? assumes some appropriate meaning (here, for example, merely that / is an interval, ? e / and length |/| -> 0). Indeed, expressed this way, we more clearly see the differentiation theorem as an averaging result. The reader should know that a simple measure-theoretic argument (cf. [24, pp. 156-157]) allows one to extend this to the sharper statement that lim — / /-* I/I J ? \f(t)-f(x)\dt=0 a.e. (The set of points ? at which this holds is called the Lebesgue set of /.) We might now ask whether, for any other measure ? on the real line, one should have also lim —— / /(?)??(?) = f(x) ?-a.e. /—.r ?A) J/ and, moreover, whether some such statement might be true in any appropriate measure space. For the latter, however, we must now agree on how to assign some meaning to the notion / -> ? of some sets / that are required to "shrink" down to a point x.
Differentiation 183 The other half of the fundamental theorem of calculus also suggests some problems in this more general setting. If ? and ? are measures and there is a reasonable interpretation of the assertion v(I) lim-—-=/(*) ?-a.e. I^x ?(?) then can one recover ? from an integral v{E)= j /(?)??(?)? Thus the fundamental theorem of calculus suggests, if only vaguely stated, some important problems in measure theory. Is there an analogue of differentiation in more general settings that will allow a generalization of familiar theorems on the real line connecting differentiation and integration. This is briefly our motivation for the subject that follows. We begin with some history of the early results in this topic. 3. Differentiation in R" To carry the Lebesgue differentiation theorem over to higher dimensions we could begin by considering it in the form of a symmetric derivative statement, ,limni f f(t)dt = f(x) a.e. The natural and most geometrically appealing analogue in R" would be to take sequences of open Euclidean balls B(x,rk) centered at ? and with radii rk tending to zero. Lebesgue proved in 1910 that for any integrable function / on R" and almost every point ???", lim ,D, ' ч, ? fiy)dy = f{x) JB(x.? k^oc \B(x, rk)\ n) for every sequence {B{x, rk)} of balls contracting to the point x. (Here |?| denotes the ? -dimensional Lebesgue measure of a set ? с R" ¦) The argument depends on a geometric covering property of Euclidean balls: If every point ? in a set ? С R" has associated with it some sequence B(x, rk) of balls centered at ? and with radii rk tending to zero then one may extract a collection of disjoint balls that can be used to approximate the «-dimensional Lebesgue measure of E. This Vitali covering theorem, it was realized in time, was the crucial property to having the differentiation theorem. The question then arose as to whether the balls shrinking to their center could be replaced by cubes, or intervals, or other geometric objects. This, in turn, leads to the question of whether the Vitali covering theorem (the key tool to proving the differentiation theorem) would remain valid with such replacements. Harold Bohr showed that the family of intervals (rectangles with sides parallel to the axes) in E2 behaved much worse with
184 B.S. Thomson regard to this Vitali property. In 1934 Saks [70] showed that the differentiation theorem fails for some integrable functions if the balls are replaced by intervals. A construction of Nikodym [65] showed that situation was even worse if the balls are replaced by sequences of rectangles shrinking in diameter. It is now realized that the Lebesgue differentiation theorem as stated for sequence of balls B(x,rk) shrinking to ? may not be true if the balls are replaced by other simple geometric objects. Thus any attempts to extend this theorem to a general measure space and to develop a differentiation theory in an abstract setting must confront this fact, that the theory will depend on the underlying geometry of the situation. To capture and express these ideas in a more abstract setting requires us to be familiar with the Vitali covering theorem and the way that it is used to prove the differentiation theorem. Almost all of the abstract theory is an attempt to use these tools and ideas in a more general way. Our goals then are this. In a fairly general setting of a measure ? on a space X we wish to assign some meaning to the limit lim / /(?)??(?· D) We wish to capture the notion of a Vitali cover relative to this limit, and we wish to express the essential ingredients of the arguments needed to make the Vitali covering theorem work so as to answer the question as to whether D) is equal to f(x) at ?-almost every point ? e X. Conversely we hope, too, to discover some conditions under which a relation v(I) ,hm -77T = /<*> E) would allow ? to be reconstructed as the integral of / with respect to ?. 4. Some motivation To express the notion of a derivative in an abstract setting, consider how the ordinary derivative might be viewed. If F'(x) = f(x) then for all sequences of intervals [ak,bk] with ak ^ ? ^ bk and bk -aj^Owe have F(bk)-F(ak) —z * f(x^- bk - ak Vitali arguments require just that there is given some sequence of such intervals at a point x. We can formulate this by considering collections of pairs (/, x) where / is an interval and ? e /. In particular, we expect to encounter the collection fie = {([a,b],x): xe[a,b], b-a <?\.
Differentiation 185 Let us define V to be the collection ?={??: ?>0]. Then note that ¦nev ч · r F(b)-F(a) DF(x)= inf sup and n*v ч ¦ r F(b)-F(a) IJF(x) = sup inf ^e-p([«.fr]..v)e/j fo-a expresses the upper and lower derivatives of F at any point x. How does the notion of a Vitali cover enter into this scheme? If we consider a Vitali cover as consisting of pairs (/, jc) (and not just intervals as is common) then we could say ? is a Vitali cover on R if for every ? e R there is at least one sequence ([a*, ?>*], .v) e /3 for all A: with a* ^ jc ^ ?>*, a*: -> jc, and bt -> x. Now let P* be the collection of all Vitali covers on R. This plays the role of a dual basis (and hence the notation) and captures the essence of the Vitali covering notion. The collection V* expresses the same derivative, but in a different orientation: ¦nir, ч · f F(b)-F(a) DF(x)= sup inf ?*&?>* (l«-ft]..T)e/i* b — a and n/r^ ¦ f F(b)-F(a) DF(x)= inf sup . ^*eu*([„.ft)..v)e/j. fo-a To grasp these two statements is to grasp the nature of the Vitali cover and to see a way to express these ideas more abstractly. The fundamental concept we use to express these ideas in a general setting is the notion of a covering relation on a set E. DEnNiTiON 1. Let X be an abstract set and let ? с X. We say that a collection ? с 2х ? ? is a covering relation on ? if corresponding to each ? & ? there is at least one pair In the definition we do not assume that ? e I, although for most applications that will be the case. Thus the phrase "covering relation" is a bit of a misnomer since there is no implied covering in general.
186 B.S. Thomson 5. Derivation bases We now develop an abstract framework for discussions of differentiation processes in very general settings. 5.1. Covering relations Our general setting is to assume we are given a space X (frequently a metric space, but for the moment no additional structure is imposed) on which we wish to construct a derivation basis. We always assume that there is given to us a covering relation a on X relative to which the basis is to be defined. Thus ? is a collection of pairs (I.x) with ? e X and / с Х- This describes, as a first step the objects within the derivation basis, but does not describe how the convergence is to be defined. On the real line R we might naturally, and almost invariably, start with the relation a = {([a,b],x): ? e [a. b], a <b\. To discuss, instead symmetric derivatives, we would prefer to narrow this to a = {([a,b],(o+b)/2): a <b\. In a metric space X a natural place to begin is with a = {(B(x,r),x): xaX, r > 0}. F) which relation associates each point ? with balls B(x,r) centered at x. (This is the analogue in a metric space of the symmetric derivation basis covering relation above.) The elements / in the pairs (I,x) e a play the role of the intervals in the one- dimensional case and we might think of them as "generalized intervals". Definition 2. A collection ? с a is said to be a covering relation on a set ? С Х provided for every point ? e ? there is at least one pair (/, x) e ?. The terminology was originally used with the additional requirement that the set / contains x, but this an unnecessary addition for most of our theory. Loosely the phrase "covering relation" without any mention of set simply means a collection, even possibly empty, of pairs (I,x) ea. From a covering relation ? we sometimes extract subsets that play a special role. These have been called "prunings" since they do, indeed, remove some unnecessary elements. DEnNiTiON 3. For any ? с X we write ?(?) = {(?,?)??: ICE] G)
Differentiation 187 ?[?] = {(?,?)??: xeE}. (8) 5.2. Derivation bases Definition 4 (Derivation basis). By a derivation basis relative to the pair X and a we mean a nonempty collection of covering relations ? с a on X. Thus ? is a derivation basis if every element ? e ? is a covering relation on X. We shall always assume the following properties of a derivation basis: (B|) If ? e X and /3 e В then there is at least one pair (/, x) in ?. (B2) If /51 e ? and /3| cftCa then necessarily /32 e ?· Frequently we shall need also the following properties, which will be specified as needed: (B3) If for every ? \, ?? e В it is true that ? \ ??2 eB then В is said to be filtering. (B4) The basis В is said to be decomposable or to have pointwise character if whenever there is given for every ? e X an element ?? e ? then it must follow that the collection xeX belongs to B. (B5) The basis ? is said to be ?-decomposable if whenever there is given a sequence of elements ?„ e В and a denumerable partition {X,,} of the space X it must follow that the collection 00 \Jfi„[X„] n=\ belongs to ?. Finally in the presence of a topology on X we shall usually require the following property (see Section 5.9 for a discussion): (Вб) We say a derivation basis ? on a space X furnished with a topology is compatible with the topology if for every ? с X and every open set G D ? /3(G) eB[E] for every ? e B[E]. Note. The condition (B|) just reasserts that each element of В should be a covering relation on X, thus no point in the space is excluded from discussion. The general scheme in Hayes and Pauc [31] fixes a subset ? с X in advance and all derivation statements are made relative to that set. This is typical of the subject: Each attempt to introduce greater generality adds a sometimes cumbersome level of detail.
188 B.S. Thomson The conditions (B2) and (B3) together assert that ? is a filter on a. This is the most natural setting for our theory and allows limits to be taken in the sense of the filter. If ? is a derivation basis on X then it convenient to consider also the collections ?(?) = [?(?): ???] and B[E] = {?[?]: ? e В} defined by means of the prunings in G) and (8). These play the role of restrictions of the basis В although we will not insist on interpreting them as derivation basis themselves, although the language could be tailored to allow such a usage. Note, too, that (B4) =>· (B5). The terminology "decomposable" comes from Hen- stock [36] and "pointwise character" from McShane [53] in a related context. 5.3. The limit operation There are essentially two basic operations we wish to take relative to a derivation basis for any extended real-valued function h defined on a. The first is a limit operation. Dehnition 5. If h :a -> lU {±эо} then we write ??limsup/2(/,x) = inf sup h(I.x) 1-х PeB{i.x)ep and Bliminfh(I,x) = sup inf h(I.x). ?-** ?(???.?)€? As usual in such theories, if the limit superior and limit inferior agree we can write the common value as #lim/_».( h(I,x). Since we assume so little in general, there is no assumption that the limit superior must exceed the limit inferior. Indeed, it is an essential feature of the theory that we do not impose any requirements implying this. We can prove the following lemma, but it should be considered special to the case where ? is a filter. LEMMA 6. If В is filtering [i.e., satisfies (B)), (B2), and (B3)] then Slim inf h(I,x) ? Slim sup h(I.x). (9) '^* l^x PROOF. If there are numbers t and s with t > Blimsuph(I, x) and s < Slim inf /?(/,. v), l^x '^v then, using the definition, there is a ?\ e В so that h(I, x) < t for all (/, x)e ?? and there is a ?2 e В so that h(I, x) > s for all (/, л) e ?2. By (B-,) we have ?\ ? ?? e В. But this
Differentiation 189 means, by (B|) that there is at least one pair (/, x) with the properties that h(I, x) < t and /?(/, x) > s. In particular, s < t. From this inequality (9) follows. ? In working with limits taken relative to a derivation basis, the following device should be noted. In the proof of Lemma 6 we observed that, for each ? e X, if it is the case that f(x) > Slim suph(I.x), then, using the definition, there is a ?? e В so that h(I,x) < f(x) for all (/.x) e ??. If the derivation basis is decomposable [i.e., satisfies (B4)] then there is an element ? so that h(I,x) < f(x) for all (/, x) e ?. By assumption (B2) then the collection {(/,*): h(I,x)<f(x)}eB. We express this observation in the form of three lemmas. LEMMA 7. Assume that В is decomposable (B4). Suppose that Blimsuph(I,x) < f(x) for all ? eE. Then {(I,x)ea: xeE, h(I,x) < f(x)} e B[E]. LEMMA 8. Assume that В is decomposable (B4). Suppose that Bliminf h(I.x) > /(л) for all ? e ?. /-».v Then {(I,x)ea: ? e E, h(I,x)> f(x)}eB[E]. LEMMA 9. Assume that В is filtering and is decomposable [i.e., satisfies (B|), (B2), (B^), and (B4)]. Suppose that Slim h(I,x) = f(x) for all ? eE. /-».v Then for all ? > 0 {(/,jt)ea: jc e ?. |/?(/..v) - /(.v)| <?} eB[?].
190 B.S. Thomson 5.4. The dual basis The notion of a Vitali covering enters our theory as a duality. Dehnition 10. Let В be a derivation basis on X. The family B* of all collections ? с ? with the property that ????{?]]??> for every element ? \ e В and every ? e X is called the dual of ?. Observe that the dual of a derivation basis is itself a derivation basis; this requires checking the two properties (B|) and (B2), both of which are immediate. The dual basis B* always is decomposable [i.e., satisfies the property (B4)]. In general when В is filtering, the dual basis may not be. The fundamental relationship between a derivation basis and its dual is expressed in the next lemma, which shows the essential role the dual plays in the study of limits. LEMMA 11. Let В be a derivation basis and B* its dual. Then B* lim sup h(I. x) = В lim inf /?(/. x) l—.x '—¦»' and B* lim inf h(L x) = B\im sup h(I. x). /^¦v /-л- In particular, B* lim h(Lx) = В lim /;(/..x) /-».V /-».? when the limit exists. PROOF. Fix x. Suppose that s < Slim sup /?(/..x) < t A0) /—.V if possible. Then there must exist an element ?\ of В with the property that h(I,x) <t for all (/, x) e ?\ and also for every element ?2 of В there must exist at least one element (?,?) (??2 withs <h(I,x). Let /S3 be the collection of all (/. y) e a for у ? ? and also all pairs {1.x) with h(I,x) > s. Note that, for every element ?2 of ?, ?? ? &[{*}] ? 0 because of our observation above. In particular, /З3 e B* so ?Tliminf/2(/.x) = sup inf /?(/..x)>s.
Differentiation 191 On the other hand if ?4 e B* then Д; ? ?\[{?}] ^0 so some pair (I.x) in ^4 must exist for which h(I,x) < t. Thus B*\im'mfh(I, x) = sup inf h(I.x)<t. ?^? ?€?^?.?)€? From these two inequalities, together with the fact that s and t are any numbers satisfying A0), it follows that B* lim inf/?(/, jc) = Slim sup h(I. x) as required. The other assertions follow similarly. ? Note. The lemma illustrates the role of the dual basis in capturing the limit. Some readers, encountering this for the first time, experience a small shock that should be lessened if one recalls the simple fact that an ordinary limit superior lim supv_T+ /?(y) on the real line can be written as an "inf-sup" lim sup h(y) = inf sup h(x) y-*.\+ 5><v-.v<S or equivalently, by taking a supremum over all sequences tending to x, as as a "sup-inf" lim sup/2(j) = sup infh(x„). ? —-V+ A„\.V " We can use the dual basis and Lemma 11 to offer some variants on Lemma 7 that will be useful in the sequel. LEMMA 12. Assume that В is decomposable (B4). Suppose that g(x) < B\imsuph(I, x) < f(x) for all ? e E. /—.V Then {(/,*): xe E, I el. h(I.x) <f(x)\ e B[E] and {(I,x): xeE, I el. g(x) < h(I.x)} e B*[E]. LEMMA 13. Assume that В is decomposable (B4). Suppose that g(x) <BYimmih(I,x) < f(x) for all ? e E.
192 B.S. Thomson Then {(/,*): xeE, I el. h(I.x) < f(x)} e B*[E] and {(I,x): xe E, /el, g(x) <h(I,x)} e B[E]. 5.5. Some properties of the dual For ease of manipulation and notation the dual basis allows us to express the limit properties of a basis with some economy. Here are several useful properties. LEMMA 14. // В is filtering [i.e., satisfies (B|), (B2), and (Вт,)] then for every ? \ eBand every ft e B*, it follows that ?\ ? ?2 e В*. PROOF. Let ? e В and ?2 e B* and ? e X. Take any ?3 e В and note that 081 П ft) П ?3[{?}] = (ft П ft) П &[{*}] ? 0, since/3, nfteB and ?2 e ?*. Thus /3, ??2 e ?*. ? LEMMA 15. //? is filtering [i.e., satisfies (B|), (B2), anii (B3)] then В С ?*. PROOF. If ? is filtering and ?\, ?2 e ? then ft = (ft ? ft) e ? by (fo,) ?\??2[{?}] = ?3[{?}]??. It follows that ft e ?*. ? Lemma 16. В с В** = (В*)*. Proof. If ft е ? and /3 е ?* then ?\??[{?]]?91 for all ? e X. But this holds for all ? e B* and so ft e (?*)*. ? Lemma 17. в*** = В*. PROOF. If ft e ?* then ft e ?*** by Lemma 16. Conversely if ft e B*** then ftnft{*}]^0 for all ? e X and all ? e ?**. In particular, this same statement holds for all ? e В since ? С ?**, again by Lemma 16. This proves that ft e ?*. ?
Differentiation 193 5.6. The variation In addition to the limit operation in a derivation basis we require also an operation that can be used to reconstruct the measure. We choose to do this by taking estimates that use sequences {(/*,&¦)} where the {Ik} are disjointed; this is not the only possible way, but it is the way that is associated with strong Vitali conditions which is one of our main concerns in this exposition. Suppose that h is a real-valued function on a. DEnNiTiON 18. If ? is a covering relation then by V(h. ?) we denote the supremum of the sums II J2\h(Ik,xk)\ k=\ where {h, jc^)} is any finite (or, equivalently, denumerable) subset of ? for which // and Ik are disjoint if j фк. Dehnition 19. If ? is a family of covering relations (in particular if it is a derivation basis) then V(h,B) = inf V (/?,?). ??? We refer to this as the variation of the function h over the basis. Some properties are immediate. V(ch,fi) = \c\V(h,fi). A1) У(Й|+Й2,0КУ(Й|,0) + У(Й2,0). A2) If В is filtering [i.e., satisfies (B|), (B2), and (B3)] then V(hi+h2,B)^V(hi,B) + V(h2.B). A3) (This is not true, in general, unless В is filtering.) THEOREM 20. If В is decomposable or a -decomposable [i.e., satisfies (B4), or (Bj)] then the set function E-> V(h,B[E]) is an outer measure on X.
194 B.S. Thomson PROOF. Let ? С Ц?| Ek. Write X0 = X \ Ц^, Ek and Xk = Ek \ U*=! E, for * > 1. Note that {Xk\ is a partition of X. Let ? > 0. Choose /5* e ? so that V(h,pk[Ek])^V(h,B[Ek])+2ks. By the ?-decomposable property (or by the decomposable property which is stronger still) there is a ? e В with ? [Xk ] с fik [Xk ] for all к = 0. 1. 2 It is easy to check that DC jt=l and consequently ЭС DC V(h, B[E]) ? V(h,fi[E]) <: ? V(h. pk[Ek]) ? J2(V(h, B[Ek]) + 2ke). k=\ k=\ This property establishes that the set function in the lemma is an outer measure. D 5.7. Growth estimates Let us establish now some connections between the limit operation and the variational measure. The following lemmas are fundamental to our study, and entirely elementary. In a less abstract setting [14, pp. 281-286] we have called these growth conditions in order to motivate a presentation of some applications of the classical Vitali theorem. We assume throughout this section that h and к are nonnegative real-valued functions defined on the fundamental covering relation a and that we are interested in the studying the limits h(I,x) h(I.x) ulimsup and Sliminf . /^.v k(I,x) i^x k(I.x) We shall use the following short notations for any set ? с Х '¦ h*{E)=V(h,B[E]), k*(E) = V(k,B[E]), h*(E) = V(h,B*[E]), and k+(E) = V(k,B*[E]). We recall that h*, k*, h* and k* are outer measures on X when В is assumed to satisfy (B4)or(B5).
Differentiation 195 LEMMA 21. Suppose that the derivation basis В is filtering and decomposable [i.e., satisfies (B|), (B2), and (B3), and (B4)]. Suppose that s > ?, ? С X and that ? lim sup < s for all x e E. Then h*(E)<isk*(E). Proof. If иг h(I'x) ? lim sup <s ,^/k(I,x) for all ? & ? then ?\ ={(I,x)ea: xe E, h(I.x)^sk(Lx)\ e B[E]. Let ?2 e ?· Then since the basis is filtering ? = ?\ ? /Ь е ?[?] and so ?(?,?[?])< V(M)OV(MKjV(*,ft>[?]). As this holds for all ?2 e ?, it follows that V(h. B[E]) <; s V(k, B[E}). D LEMMA 22. Suppose that the derivation basis В is filtering and decomposable [i.e., satisfies (B|), (B2), and (B3), and (B4)]. Suppose that s > 0, ? С X and that о lim mi > s for all ? & E. Then h*(E)^sk*(E). PROOF. The proof is similar to the proof of the preceding lemma. D LEMMA 23. Suppose that the derivation basis В is filtering and decomposable [i.e., satisfies (B|), (B2), and (B3), and (B4)]. Suppose that s > 0, ? С X and that В lim sup > s l^/k(I,x) for all ? e E. Then h*(E) ^sk^(E).
196 B.S. Thomson PROOF. If ? lim sup > s ,^/ k(I,x) for all ? e ? then ?\ ={(I,x)ea: ? e E. h(I,x) ^ sk(I,x)\ eB*[E]. (Note the use of the dual basis here; in every other respect the proof is following closely theproofof Lemma21.)Let02 e ?. Then by Lemma 14/3 = ?\ П ?2 e B*[E] and so sV(k,B*[E]) ? V(sk^) <: V(h.fi) ? V(h,p2[E]). As this holds for all ?2 e B, it follows that V(h. B[E]) > s V(k, B*[E]). D LEMMA 24. Suppose that the derivation basis В is filtering and decomposable [i.e., satisfies (B|), (Вг), and (B3), and (B4)]. Suppose that s > 0, ? С X and that hi· · fW·*) oliminf < s l^x k(I,x) for all ? & E. Then h*(E)^sk*(E). LEMMA 25. Suppose that the derivation basis В is filtering and decomposable [i.e., satisfies (B,), (B2), and (B3), and (B4)]. Then ifh*(E) < 00 it follows that hi· йG·*) В lim sup < 00 for all ? e ? except for a set N С ? for which k*(N) = 0. PROOF. By Lemma 23 we have that the set N of points ? in ? at which В lim sup = 00 1^/ k(I,x) must satisfy sV(k,B*[N])^V(h.B[N]) <эо for alb >0. It follows that V(k, B*[N]) = 0. ?
Differentiation 197 LEMMA 26. Suppose that the derivation basis В is filtering and decomposable [i.e., satisfies (B,), (B2), and (B3), and (B4)]. Then ifh*(E) = 0 it follows that „ . h(I,x) В lim = 0 l^x k(I,x) for all ? e ? except for a set N С ? for which k*(N) = 0. PROOF. By Lemma 23 we have that the set Ns of points ? in ? at which ? lim sup > s must satisfy sV(k,B*[N])^V(h,B[N])=0 for all s > 0. It follows that V(k, B*[NS]) = 0. Since the set N of ? in ? at which „ . h(I,x) В lim ' ? 0 is the union of the set Ns for s > 0 and s rational it follows, from Theorem 20, that V(k,B*[N]) = 0. ? LEMMA 27. Suppose that the derivation basis В is filtering and decomposable [i.e., satisfies (B|), (Вг), and (Вз), and (B4)]. Suppose that В im-^- = f(x) exists finitely for every point ? in a set E. Then, ifk*(E) < 00, V(h-fk,B[E])=0. PROOF. Under these hypotheses ? = {(I,x)ea: \h(I.x) - f(x)k(l,x)\ <c\k(I,x)\] e B[E] for every с > 0. Consequently, for every ?\ e B[E], V(h-fk,B[E])iZV(.h-fk.finfi\)^cV(k.pnP\)^cV(k,P\). From this we obtain V(h - fk, B[E]) ? cV(k, B[E]) = ck*(E).
198 B.S. Thomson Since this is valid for all с > 0 and k*(E) < oo, it follows that V(h-fk,B[E]) = 0 as required. ? 5.8. Differentiation under strong Vitali assumptions We note in the preceding section that many limiting properties of the quotient h/k of the two nonnegative functions h and it on a have been expressed in terms of the variational measures associated with the derivation basis В and its dual B*. Now we add some additional hypotheses on these measures. These can be considered as "Vitali-like" assumptions. See Sections 7.1 and 9 for more detailed accounts of the relation between this and the Vitali covering theorem. For now we just exploit what seem to be natural assumptions in this theory. As before we require a derivation basis on a space X supplied with a fundamental covering relation a. We assume throughout this section that the derivation basis В is filtering and decomposable [i.e., satisfies (B|). (Вт), and (B?), and (B4)]. The functions h, к: a —> R are assumed to be nonnegative valued. The needed "Vitali" assumptions are h*(E) = V(h. B[E\) = V(h. B*[E]) = h*(E) A4) and k*(E) = V(k,B[E]) = V(k,B*[E])=k*(E) A5) for all sets ? с X that arise in the discussion. Recall that h* and k* are outer measures on X generated by the functions h, and к and the derivation basis (because of Theorem 20). The force of the assumptions are that the basis В and its dual B* generate precisely the same outer measures. Lemma 28. // . h(I,x) В hm mi < с 1->? k(I,x) for all ? e ? then h*(E)sick*(E). PROOF. This follows immediately from Lemma 24 and the assumed identities A4) and A5). ?
Differentiation 199 Lemma 29. // В hm sup > с 1^/ k(I,x) for all ? e ? then h*(E) >ck*(E). PROOF. This follows immediately from Lemma 23 and the assumed identities A4) and A5). D Lemma 30. // hi· h{I'x) В hm sup = oo 1^/ k(I,x) for all ? e ? andh*(E) < oo then k*(E) = 0. PROOF. This follows immediately from Lemma 29. ? Lemma 31.// hi- · fh{Lx) ? В hm inf = 0 /^.v k(I,x) for all xe ? and k*(E) = 0 then h*(E) = 0. PROOF. This follows immediately from Lemma 28. ? LEMMA 32. Suppose that k*(E) < oo. Then hi· h{I'x) В hm l^x k(I,x) exists finitely or infinitely (i.e., it may be +oo)for h*-a.e. ? e ? and k*-a.e. ? e E. PROOF. The set of points where the identity hi- · *h{I<x) иг h{Lx) В hm mi = В hm sup l->x k(I,x) ,_l* k(I.x) is not valid is the union of the countable collection of sets [ h(I.x) „ h(I.x) Rah = ?? e E: Slim inf <a <b < ohm sup—-—- l /-л k(L.x) i^/k(Lx)
200 B.S. Thomson where a < b are positive rational numbers. Combining Lemmas 28 and 29 we must have a"*h*(R«i,) ^ k*(R„b) ^ b'lh*(R„i>) which, since a < b and k*(R„h) ^ k*(E) < oo, is only possible if h*(R„b) = k*(R„b) = 0. From this the lemma evidently follows. ? Lemma 33. Ifh*(E) < oo then В hm l->x k(I,x) exists and is finite for k* -a. e. ? e E. Proof. It follows immediately from Lemma 30 that h(I,x) В lim sup < oo 1^/ k(I,x) except for a set of points in ? of it*-measure zero. Thus the result follows from Lemma 32. ? 5.9. Topological considerations In most applications of a derivation basis ? on a space X the space X is already furnished with a topology. A case in point is if X is a metric space. In this situation we would expect that the convergence in the sense of the derivation basis is also the same as convergence in the sense of the metric, that is that Slim{diameter(/U{.v})}=0. A6) This idea can also be captured by expressing this in terms of pruning by open sets. (This repeats condition (Вб).) Dehnition 34. (B6) We say a derivation basis ? on a space X furnished with a topology is compatible with the topology if for every ? С X and every open set G D ? ?@)??[?] for every ? e B[E].
Differentiation 201 Note that if the topology is given by a metric and the basis is compatible with the topology generated by that metric then A6) would hold at every point x. An alternative approach has been suggested by a number of studies. Instead of assuming that a topology is given in advance and that the derivation basis pays some attention to the topological structure, we can obtain a topology directly from the basis itself. Let us suppose that ? is a derivation basis on a space X. Let ? с X. We say that a point ? e X is a B-interior point of ? if for every ? e ??[{.v}] it is true that ?(?) e ??[{.*}]. We say that a point ? e X is in the B-closure of ? if it is false that for every ? e B[{x}] we must have ?(? \E) e B[{x}]. Thus in this setting we have the notions of ?-interior, ?-closure, ??*-interior, and .Enclosure. These can be used to develop topological ideas and concepts useful in discussing a derivation basis when no natural topology is at hand. Ideas of this kind can be found in Hayes and Pauc [31 ] and in the papers of A.P. Morse [57] and [58], and in a different, but related, setting in Henstock [34]. We shall not pursue these notions as they add a level of complexity and detail that is not necessarily rewarding. Here is an example just to illustrate the particulars that can enter into the subject. In the space R2 let a be the covering relation consisting of all pairs (/. x) where / is an interval (i.e., rectangle with sides parallel to the coordinate axes) and ? & I. The derivation basis В is defined in the most natural way relative to the metric in R2: Let 8A, x) = diameter(Z) for every (I.x) e a and then a covering relation ? da belongs to В if for every ? e R2 there is a ? > 0 so that (I.x) e ? if ?(?,?) < ?. It is not difficult to check the topological notions for this example. Without giving the details, let us just give an illustration. Let ? = [0. 1 ] ? [0. 1 ] be the closed unit square in R2. The ?-interior points of ? are exactly the ordinary interior points of E, namely the points in the open square ?° = @. 1) ? @, 1). The ?-closure of ? is ? itself. The ?*-interior points of ? are exactly the points in ? itself. The ?*-closure of ? is also exactly ?. Contrast this with the situation for the open square ?° = @. 1) ? @. 1). The ?-interior points of ?° consists of ?° itself. The ?-closure of ?° is again merely the set ?° (and contains no points of the ordinary closure). The ??*-interior of ?° is also ?°. The .Enclosure of ?° is the closed square ?. Bruckner and Rosenfeld [15] approach the subject from a different viewpoint. If ? is a derivation basis on a measure space (?.?.?) then consider the class of functions / with the property that for every ? > 0 there is a ? e В so that J \/(?)-/?\??(?)^??(?) for all ? с /, ? e ?, (?,?) e ?. We consider such functions "continuous". The motivation comes from the case in R where continuous functions can be described as those / for which for every ? and ? > 0 there is a S > 0 with I \f(x)-f(y)\dy^ek(E)
202 B.S. Thomson for all measurable ? с (x — S,x + S). One then studies the smallest topology on X for which this class of functions are continuous in the sense of that topology. They obtain numerous natural results, among them an analogue of the Lusin theorem that measurable functions can be approximated by continuous functions. In the sequel we make no use of topological concepts obtained directly from the basis, but we do assume frequently that the basis being studied is compatible with the metric in the sense of Definition 34 (or equivalently condition (Вб)). The following properties then play a role. Let us suppose that X is a metric space. LEMMA 35. If a derivation basis В on X satisfies (Вб) then so too does the dual basis B*. LEMMA 36. Suppose that the derivation basis В on X satisfies (B|), (B:), (B4), and (Вб). Suppose two sets E\ and ?4 ore separated in the sense that d(E\. ?2) > 0. Then there exists ?\, ?? e В with ?\[?\]??2[?2] = 1». LEMMA 37. Let В be filtering and compatible with the metric on X [i.e., satisfies (B|), (B2), (B3), and (Вб)]. Suppose two sets E\ and Ei are separated in the sense that d(E\, ?2) > 0. Then there exists ? ? В with /*[?,] П/3[?:] = 0. LEMMA 38. Let В satisfy (B|), (B2), and (Вб) and suppose that h is a real-valued function on a. Define the set functions h* and /7* by h*(E) = V(h. B[E]) and /j*(?) = V(h, B*[E]) for all ? С X. Then h* and h* are metric outer measures on X. (Although the notation is suggestive of inner and outer measures, that is not the intention; both are outer measures and, if В is filtering, then In^h*.) 5.10. Differential equivalence Let ? be a derivation basis on X satisfying properties (B|)-(B4). For any functions h ,k : a -> R let us write h = k and say that h and к are differentially equivalent relative to the derivation basis В if V(h-k,B) = 0. Under these hypotheses this is an equivalence relation and if h = к then h and к behave very much alike with respect to many properties. For example, the outer measures h* and k* would be identical, as would /г* and it*. Also if g: a -> ? then the limits h(I,x) В lim and В lim l^xg(fx) l^x
Differentiation 203 would be closely related up to a set of g*-measure zero. Write dh for the equivalence class of all functions that are differentially equivalent to h. Then one can assign meaning to statements such asdh = fdk. For example if ? and ? are measures on X then the "differential notation" ?? = fdv now has a significance defined relative to the derivation basis. The method derives from Henstock's "variational equivalence" which he exploits in many arguments, and that in turn is derived from earlier ideas of A.J. Ward. S. Leader [46- 48] has shown how, in some settings, the classical notion of a differential and its traditional properties can be handled by this procedure. 6. Standard examples In order to provide some perspective we give a number of concrete constructions of derivation bases. A later section will explore some of these examples in greater detail. 6.1. A general construction In any space X let there be given a covering relation a and suppose that there is also given a positive, real-valued function S on a. We can express the limit lim h(I,x) S(/..t)—0 as a limit relative to a derivation basis as follows: By В we mean the collection of all covering relations ? С a with the property that for each ? e X there exists a positive ? (depending in general on ?) so that (/, jc) e ? whenever (I,x) ea and ?(?. ?) < ?. The dual basis B* can be described as containing all ? С a with the property that for each ? e X and for any positive ? there is at least one pair (/,x) e ? for which it is true that <$(/,-*) < ?. Hayes and Pauc [31] refer to <5 as a numerical index of contraction and call such bases D-bases (although defined in their scheme, not as here, using covering relations). In order for the method to work (so that В and B* are derivation bases in our sense) there must be for every ? and every ? > 0 at least one pair (/. x) e a with <5(/. ?) < ?. Perhaps the most natural instance of such a scheme occurs in a metric space where we would use <S(/,Jt) = diameter({;t}U/)
204 B.S. Thomson as an index of contraction (Section 6.2). It might also seem natural in the setting of a measure space to use instead ?(?,?) = ?(?). The discussion in Section 6.3 includes a simple example illustrating that this might not have nice properties. 6.2. Derivation bases in a metric space In the special case where X is a metric space one most frequently desires that the limit taken is in the sense of sets contracting in the metric towards the point x. This can be achieved in general by selecting, as usual, a covering relation a on X and then using the numerical index of contraction <5(/,;t) = diameter({jt}U/) to generate a derivation basis as in Section 6.1. Most derivation bases in metric spaces will be found to be of this type. 6.3. Measure contraction In some studies it is useful to consider a limit lim h(I,x) /i(/)^0. .re/ where ? is a measure on the space X and, so, the limit is understood in terms of shrinking values of ? (I). Here we can capture this by taking as a fundamental covering relation a on X such that for each pair (I,x) e a, x e I. I is in the domain of the measure ? and 0 < ?(/). Then an appropriate numerical index of contraction is ?(?,?) = ?(?). The reader should be immediately aware of some limitations in such systems. At first sight it might seem like an intuitively appealing notion of limit. Consider for example, however, X = R2, ? planar Lebesgue measure and let a be the covering relation of all pairs (/, x) where / is an interval (i.e., a rectangle with sides parallel to the axes) and ? e /. If ? denotes the derivation basis generated by taking <$(/,.*) = ?(/) as the numerical index of contraction then observe an unpleasant limit computation. Let ? be the upper half plane. It is easy to check that В lim sup = +эо and В lim inf — = 0 /^v M(/) /--v flU)
Differentiation 205 at every point ?. Thus this system does not have the density property, and should be lacking in most reasonable properties. 6.4. Set contraction In some cases some might wish to study a limit \im h(I,x) /-».v where the sets / are contracting in the sense of set inclusion. For this define an appropriate covering relation a on X such that for each pair (?,?) ea,x e / and take ? С a to belong to В if for every ? e X there is an element (J.x) e a so that every pair (I.x) e a for which J С I belongs to ?. It should be noticed that this notion seldom has very nice properties unless the class of sets involved has been tailored properly. (We shall see that in the presence of a net structure or a lifting this can be done quite successfully.) 6.5. Busemann-Feller bases In the euclidean space Ш" take I to be a fixed family of open bounded sets that has the property that if / e I then every set homothetic to / is also in I. The fundamental covering relation is simply a = I x Ш". Then the derivation basis is defined by using, as in the preceding discussion, the numerical index of contraction <S(/,;t) = diameter({.v}U/). The early study of Busemann and Feller [17] used, essentially, this basis. 6.6. Federer's scheme Federer [24] sets his study in a metric space X equipped with a locally finite metric outer measure ? for which all bounded sets have finite ? measure. The fundamental covering relation a is presumed to have some strong properties: (i) each (?,?) ea has ? e I and / a bounded Borel set; (ii) a is fine at each point ? in the sense that inf diameter(Z) = 0; (/..t)ea (iii) if ? С a is a fine cover of a set ? с X in the sense that ? is fine at each ? e ? then there is a countable family {(/,-,?,-)} С ? for which {/,·} is disjointed and so that v(z\\Ji,)=o.
206 B.S, Thomson The limit now is taken in the sense of the metric so that one can use the numerical index of contraction <5(/, x) = diameter(Z) as before. Note that the dual of the derivation basis here would be, in this language, the collection of all ? с a that are fine at each point ? e X. The extra assumptions on the covering relation a assure that the usual methods of the Vitali covering theorem can be applied. 6.7. Net bases The space R" can be carved up geometrically in a fashion that is useful in many investigations. By a dyadic cube in R" we mean a set of the form [(/? - lJ-*,/|2-*[ ? [(/2 - lJ-*./22-*[ ? ··· ? [(/„ - lJ-*,/n2-*[ where k, i\, /'2,..., /„ are integers. The collection of all dyadic cubes forms a net structure on R" that has the essential property that if N\ and N2 are any elements of the collection then one of the following must be true: N\ С N2, or N2CN\, or N\ ? ?2 = 0. Note, too, that (i) at most a finite number of sets of the net contain a given net set, (ii) the sets are simple Borel sets, in this case of type Fa, and (iii) each point is in sets of arbitrarily small diameter. Any family in a metric space with such properties is called a net. Besicovitch made much use of this in his investigations of Hausdorff measures and, in a more general form, the study of "comparable net measures" plays an important technical role. (SeeEdgar [20, pp. 58-61] and Rogers [69, pp. 101-122].)See also Saks [72, pp. 152- 156] and Hayes and Pauc [31, p. 110] in the setting of abstract derivation theory. The way in which the net structure carves up the space allows, as in the preceding section, for the usual methods of the Vitali covering theorem to be applied to any appropriate measure on the space. A similar structure is available in many measure spaces. The actual requirement here is that the measure space be separable. For measure spaces this means roughly that the space can be carved up into a finite or infinite sequence of small pieces; it is also the same as saying that L2(X), the square integrable functions on the space, is a separable Hubert space. A brief account, but with a large citation list, is given in Bruckner [13, pp. 34—35]. 6.8. Bases associated with a lifting In a general measure space (?, ?, ?) one cannot hope to carve up the space as neatly as does the net structure. In the presence of a lifting, however, this can be done. A lifting is
Differentiation 207 just an operation on measurable sets that picks out one for every family of sets with the same ?-measure and does it in a special algebraic way, preserving essential features. Let L be a lifting on the measurable sets (roughly L(M) picks out a measurable set with the same measure as ? in a way that preserves intersections and unions). Then interpret / =?> ? to mean that ? e / and / is lifted (i.e., L(I) = I) and "shrinking" is just meant as set inclusion shrinking. Thus the basis В is constructed by first taking a as all pairs (/, x) where / is a lifted set and ? e /. The basis is constructed using set contraction as in Section 6.4. This basis can be proved to have the strong Vitali property for ?. Kolzow [43] in 1968 showed more or less that there is a lifting exactly when the Radon-Nikodym theorem is available. Then, because of this fact, if the Radon-Nikodym theorem holds for the measure space there must be a differentiation basis for the space that has the strong Vitali property and so that the Radon-Nikodym derivative is a genuine a.e.-derivative for that structure. We omit any further discussion leaving the topic of liftings to the chapter prepared by W. Strauss and N.D. Macheras in this volume. 6.9. Self dual bases In any space X with a covering relation a on X let U be an ultrafilter of subsets of a. This means that U is filtering and that for every subset ? с a either ? e U or else a \ ? e U. Given any filtering derivation basis В there must be, by the maximal principle, an ultrafilter U that contains B; thus such objects are easy to come by, if rather difficult to conceive. The derivation basis U has the fundamental property that it is self-dual, that is that U = W. To see this suppose that ? e U. Then, since U is filtering, ? e U* by Lemma 15. If ? e U* then either ? e U or else a \ ? e U. But the latter is impossible since ??\(?\?) = 8 which is not allowed for an element ? e U*. Thus U = U*. Note that, because of Lemma 11 and the self-duality, for every function h : a -> R, Uhmsup h(I,x) = Uhm'mfh(I.x) (possible infinite) must hold at every point ? e X. Thus limits are particularly simple in such settings. 7. Derivation bases in a metric space For this section let us suppose that X is a metric space. We require that there is a Borel regular outer measure ? on X; thus all Borel sets are measurable and we need too that that all bounded sets have finite ? measure.
208 B.S. Thomson For the fundamental family a that gives the covering relation on which the derivation basis is defined we require in advance only that each pair (/, x) e a has / a bounded Borel set for which 0 < ?(?) < oo. It is not generally needed that ? e I. Let ? be a derivation basis on X. Although many results have versions under weaker hypotheses, for our exposition here let us assume that В satisfies all of the properties (? ?), (Вг), and (В-,), and (B4) and we ask also for a compatibility with the metric by assuming the property (B^). Associated with ? (which remember is itself an outer measure) are two further metric outer measures ?* and ?* defined for all ? с X by ?*(?)=?(?,?*[?]) and ?*(?) = ?{?. ?[?]). Because of our assumptions we can show that ?* ^?* ^ ?. (Note that while these two additional outer measures are generated by ? and the derivation basis they may not be related to the usual outer measures constructed in such situations.) 7.1. Differentiation of the integral We now return to one of our central problems, the differentiation of the integral. We do this in the setting of a metric space with all the metric and measure assumptions needed to produce a reasonable theory (as outlined already). Thus we seek conditions on a derivation basis ? on a metric space X furnished with a locally finite, Borel regular outer measure ? so that В lim / f du= fix) ?-a.e. /-,?(/)? Jy for any ?-integrable function /. We begin with a variational characterization of the integral that leads immediately to a way to estimate the measure of the Lebesgue set. THEOREM 39. Let f be ? integrable on X and write &(/,?)= ?\/(?)-/(?)\??(?) for all (?, ?) ea. Then V(h,B) = 0.
Differentiation 209 PROOF. Tosimplify the proof and clarify the arguments we present only the case f(x) ^ 0 for all ? e X. (To handle the set {x e X: f(x) < 0} requires a repeat of the same arguments.) Let ? > 0. Using the absolute continuity of the integral, we can determine ? > 0 so that, whenever A is a measurable subset of X with ? (A) < ?, then ?/??<?-. A7) Ja 4 Choose 1 < t < oo so that A-0 ? ???<- A8) Let Z = {x & X: f(x) = 0} and form a denumerable, measurable partition of the set X\Z (i.e., the set of points where / is positive) by writing Xm = {xeX: Г </(*)<;?'"+'} form = 0, ±l,±2, ±3,.... Note that each ?(?,„) < oo because of the inequality fi(Xm)^ ? ?/(?)??(?)??'" ? /??. JXm JX Thus we may, by the assumption on the measure ?, choose open sets G,„ D Xm so that each ???,? \ Xm) is as small as we please. We do this so as to satisfy the two inequalities oo ]? ?@,„\?,?)<? A9) and ]T r+V(G„;\^»)<J· B°) ;n=—oo Finally, again by our assumptions on the nature of the measure ?, we can choose an open set W D ? so that ?(? \?) <?. Take any element ?\ e В and, using the properties (B2) and (B4), construct from it an element ? e ?, ? С ?\ with the property that for all ? e ?, ?[{?\] С fi(W), and for all xeX\Z, ?[{x\\c?{Gml?)) B1) where m(x) is an integer selected so that ? e Xmu> С G,n(.V). In order to estimate V(h, ?) we now consider now a collection {(/,, ??)} contained in ? and with the pairs /,, // disjoint
210 B.S Thomson for i ? j. Corresponding to each ?, select an integer m(i) so that ?? e XB1|,) С Gm(,) and consider the sums T\h(Ii^,)\ = T ?\/(?)-/(?,)\??(?). B2) ? ^ The sum in B2) can be dominated by the four separate sums: ? = ?? \/(?)-/(?,)\<1?(.?), ? J /,???,,?,) 2 = ?? \?(?,)\<??(.?), ? = ?( \/(?)\??(?) I Jli\Xmii) all of which sums are taken only for points ?? e X \Z, and ? = ? ? \?{?)-?{?,)\??{?) which latter sum is taken only over points ?? e Z. The sum ? is simplest to handle since f(x) = 0 for jr e ? and /(?,) = 0 for each element in the sum. Thus jJi,\z Jw\z 4 because of A7). For ? note that if both points ?, ?? are in /, ? X,„(,) then, writing simply m = m(i), tm < f(x) <: r'"+l, t'" < /(??) <: r'"+l, and so |/U) - №-)\ < t'"+] - t'" = t'"(t - 1) iC (t - I)f(x). Consequently ?^?? (\-?)/(?)??(?)^(\-?) ? ???<- jJhnx,„U) J ? 4 by A8). For Q note that if ?, e XmU) then 0 < /(?,¦) ^ r'"(',+ l and /, \ Xm(i) С G„ni) \ ?,„(,,. Consequently DC ?<; ? ?'"+]?(?,„\?,„)<?- m=—oc by B0).
Differentiation 211 Finally for R, define A = (J(/,-\ *«(/)) and observe that the union is disjoint. Thus ОС ?(?) = ]??(/,\Xm{l))ii ? ?(?„\??,)<4 / /я=—зс because of A9). Thus we have R = y[ /(?)??(?)? ( fixUfiM^- because of A7). Thus we have proved that V(h, ?) < ?. Since ? is arbitrary it follows that V(h, B) = 0 as required. ? From this theorem we can deduce the differential equivalence of ? and fdv (in the language of Section 5.10). In particular, we have the following relation for the associated measures. COROLLARY 40. Write v(I, x) = /, / d^for every pair (I, x) e a. Then ?(/?,?[?])=?(?,?[?]) /oral! ? CX. PROOF. The corollary follows from the theorem along with the variational property A3). ? Finally we have our main result showing that the Lebesgue set of an integrable function / has zero measure in a certain special sense. THEOREM 41. Suppose that the derivation basis В satisfies properties (B|), (Вт), (В4), and (Вб). Let f be integrable with respect to ? on X. Then there is a set N С X with ???) = 0 so that Blim-^- i|/U)-/(y)|^(y) = 0 /^.Гд(/)У/' forallxeX\N. PROOF. This follows directly from Theorem 39 and Lemma 26. D
212 B.S. Thomson Finally now we can express our fundamental problem in this setting. What conditions on the derivation basis give us /^л ?(?) J, for any ?-integrable function /? We evidently require that ?*, ?* and ? are identical on the exceptional set. This identity is intimately related to what are known as strong Vitali conditions and, as we shall see in the sequel, can often be proved in important settings. 7.2. The density property The preceding section demonstrated that the integral in this setting (metric space under natural assumptions) can be differentiated outside an exceptional set of ?^-measure zero. It is possible, however, in some examples that the measure ?* is trivial so that the statement has no content. (The interval basis in R" [see Section 12.3 below] with ? the «-dimensional Lebesgue measure is such a case.) Using ideas of De Possel [64] we can refine the arguments. Rather than employing the variation, which uses estimates involving disjointed sequences, we can allow some overlap of the sets. To do this we must, though, give up the chance of differentiating the integrals of all integrable functions, reducing ourselves now to discussions of bounded functions. In particular, note that applying the differentiation problem to a function f = ??, the characteristic function of a measurable set A, we can ask more narrowly for conditions under which we can be assured that ?(???) B ,lim 777~ = *aW B3) ?-*.* ?(/) for ?-almost every x e X. Following common usage we say that the derivation basis has the density property for ? if B3) holds for every measurable set A. The analysis that addresses the problem of finding conditions under which the density property holds is due to De Possel and constitutes mainly a shift of viewpoint on the usual Vitali arguments, by considering approximating sequences that are not disjoint but have a small overlap. Fix s > 0 and let ? be a covering relation. By V(h,fi) for a function h : ? -> Ш the reader will recall that we computed SUp]P|/2(//,?/)| / where the supremum is taken over all collections {(/,-,?;)} С ? with the {/,} disjointed. Now we define \??.5(??, ?) to be the supremum of the same sums, taken overall collections {(//,?;)} С ? for which ]? ?(/,) - ? ((J/,)<*. B4)
Differentiation 213 It is immediately clear that if {/,} is disjointed then the condition B4) does hold. Thus ?^,?)^\??.?^,?). Finally we define, for any family В of covering relations, ??? Again it is clear that V(h,B)^ Wtl,s(h, B). LEMMA 42. Let В be any collection of covering relations. IfG is open then \?,5(?,?@)?;?@ + ?. PROOF. Let ? e B(G) and suppose that {(/,-. ?,)} с ? satisfies B4). Then, since also each /; С G, ]??(/,-) <C mMJ/,J +s ??@ + ; Thus ??.,?,?) iC m(G) + s and the result follows. ? Let now ? be a derivation basis on a metric space under all the same assumptions as before. By the lemma we see that ?(?) + 5>???.1(?,?[?]) for all s > 0 and all measurable sets. This is because, under our assumptions, ?(?) = inf{M(G): G open, G D E\. Thus in general we have \??.4(?, ?[?]) as an underestimate of the measure ?(?). For the density property we shall see that it is an accurate measure. We shall show that, under appropriate hypotheses, the derivation basis В has the density property relative to a measure ? if and only if ?(?)<?\??..,(?,?*[?]) for all measurable sets ? and all s > 0. Let us first show the following lemma which contributes half of this result. LEMMA 43. Suppose that 0 < ?(?) < oo. // mim^^Ul B5) /-* ?(?)
214 B.S. Thomson ?-a.e. in A for every measurable subset Ac E, then ?(?)<?\??.,(?,?*[?]) for each s > 0. PROOF. Fix ? e B*[E]. Choose 0 < a < 1 so that (? -1)?(?)<?. By our hypotheses В hm = 1 /-* ?(/) for most points ? of ? so 0| ={(/,*) e0: ?(/??)>??(/)} cannot be empty and, hence, in particular r|=sup{M(/): (?,?)??]}>0. (Recall we are assuming throughout these sections that ?(/) > 0 for all (/, x) e a.) Choose (/|,?i) e/3, 5???3??(/|)> 3r,/4. We have that ?(/, ? ?) >??(?\). Now write E\ = ?, ?2 = ?| \ I\. By our assumptions ?2 is a measurable subset of ? and so the hypotheses allow us to repeat these arguments applied to ?2 if ? (?2) > 0. If ?(?2) = 0 the process must stop. Thus, continuing inductively we shall find sequences En = E„-\ \ /,,_|, ?„ = {(?,?)??: ?(??)?„)>??(?)}. rn = sup^(/): (/, лг) e ?„} > 0, and (?„,?„)??„ withM(/„)>3r„/4. As long as ?(?„) > 0 the process continues to obtain the element (/„,?„); if ?(?„) = 0 then the process stops there. We arrive at a sequenced 1, ?1), (/2,?>), (/i,?0,... of elements of ? that is either finite (if ?(?„) = 0 at some stage) or is infinite. In the event of a finite sequence evidently ?(?\\^??=0.
Differentiation 215 But this equality is also true for the infinite sequence. To see this observe that ]??(/,) ^fl-' ]??(/,¦ n?,-)=fl-V(U(/,¦ П?,-Л = fl-V(?niU/iJJ^/i(?)<oo. In particular, ]?. ?(/,) < oo and so ?(/,) -> 0 as / -> oo. Since r, < 4?(/,)/3 this means r,- -> 0 as / -> oo too. If contrary to what we want to prove ?(?\\^??>0 then write Ex = ? \ (J,- /; with ?(??) > 0. Applying the density property to ?Oc as a subset of ? we would obtain, following the same line of reasoning as used above, ?? = {(?,?)??: ?(/??^)>??(/)}, and roc = sup^(/): (/, лг) e ?^ \ > 0. But then /Ззс с ?? for all / and it follows that 0 < rx ^ r,¦ -> 0 which is impossible. Thus we have established that ?(?\\^??=0. Now we argue that the sequence (?\.?\), (h,b)Ah-b)'¦¦¦ of elements of/3 satisfies condition B4). To see this we use the facts that ?(?M??]??(/,) and ?(?)^?([}?? to compute ? ? i which is precisely the condition B4) that we wished to verify.
216 B.S.Thomson Since it follows that and the lemma follows. ? Now we turn to the other direction, by establishing the density property under the measure hypothesis. LEMMA 44. Assume that for every measurable set ? that ?(?)<?\??.,(^?*[?]) for all s > 0. Then В has the density property with respect to ?. PROOF. Let ? > 0. Take any ? measurable and write ? = \x?M: Slim sup — >0 , ( T ,^/ ?(?) J Mk = \x<?M: Slim sup —-—- > \/k\. { ?^? ?(?) J Note that the sequence of sets Nt increases to ?'. By our regularity assumptions we may select Gk open so that Gk D Nk and ?(Gk \ Nk) < ?. Let Pk = {(I,x): ?(??)?)>?(?)/?<} and notice that & e B*[Nk]. Note also that ft(G*) e ?*[W*]. Take any {(/,·,&)} С fik(Gk) that satisfies condition B4) J>(/,-)-/*Aj7'" Note that this can also be written as у (??'·--??/,)^<?·
Differentiation 217 Then ]Гм(/,)<;*]Гм(/,пмк*[у (???, -xu/,W + ?(,?n^J/,^ <ks + k^(Gk\Nk) <ks + ks. From this it follows that W^s(h,pk(Gk))<ks + ke and hence that Evidently then ?(?) ^ ^ M(Njt) = 0 and we have proved that ?(??)?) В hm = 0 ?^? ?(?) at ? almost every point ? that is not in ?. Applying this observation to the complement of M, the set X \ M, and using the trivial fact that ?(?) = ?(? П /) +?((? \ ?) ? /) we obtain that кг М(/ПМ) . В hm = 1 /-^ м(/) at ? almost every point ? that is in ?. Since this is the desired density property the proof is complete. ? Thus the results of this section (due to De Possel, but in a different language and setting) show that the density property is equivalent to there being a weak Vitali estimate for the measure from the dual basis B*. In fact under fairly general hypotheses the density property is equivalent to the differentiation property that the basis satisfies Slim / /??= fix) ?-a.e. for any bounded ?-integrable function /.
218 B.S. Thomson 8. Differentiation under strong Vitali assumptions Let us generalize the setting by supposing merely that (?, ?, ?) is a measure space. We suppose that we are given a covering relation a so that / e ?4 and 0 < ? (/) < oo for every pair (/, jt) e a. Suppose that -/. <p(E)=J fdiL for all ? e ? where / is a nonnegative, ?-integrable function. If a filtering, derivation basis В is given on a, then what hypotheses will allow us to conclude that / is a genuine pointwise derivative in the sense that «Г Ф{1) ft ч /—* ?(/) for ?-a.e. point дг е X? The needed assumptions are ?*(?)=?(?,?[?])=?(?.?*[?]) B6) and 0*(?) = ?@,?[?]) = ?@.?*[?]) B7) for all ? с X to begin with. For now we do not assume that ?* and ?* are the actual outer measures generated by the measures ? and ? by measure-theoretic methods. For the first several lemmas the fact that ? and ? are related by ?(?) = JE f ?? plays no role. The assumptions B6) and B7) (along with the refinements B8) and B9)) are called strong Vitali properties. This repeats the development in the metric space setting of Section 7.1, but now in a measure space with these assumptions in force. Now without the topology present we seek a different method than the direct variational one given before. Lemma 45. If hi· · гфA) uliminf < с /-* ?(?) for all ? & ? then ?*(?)???*(?). PROOF. This follows immediately from Lemma 28 and the assumed identities B6) and B7). ?
Differentiation 219 Lemma 46. If ? lim sup > с for all ? e ? then ?*{?)>??*{?). PROOF. This follows immediately from Lemma 29 and the assumed identities B6) and B7). ? Lemma 47. // В lim sup = oo ?^? ?(?) for all ?? ?????*{?) < oo then ?*(?) = 0. PROOF. This follows immediately from Lemma 30. ? Lemma 48. // Sliminf— =0 for all ? e ? and ?* (?) = 0 then ?* (?) = 0. Proof. This follows immediately from Lemma 28. ? LEMMA 49. Suppose that ?*(?) < oo. Then the identity „.. . ?(?) ?(?) В lim ??? = В lim sup /-.t ?(?) ?^, ?(?) holds for ?* -a.e. ? e E. and ф*-а.е. ? e ?. PROOF. The set of points where the two limits are unequal is the union of the countable collection of sets [ </>(/) ~ ФИ) Rah = \ x e E: Sliminf <a <b < ? lim sup ?^? ?(?) ?^? M(/) where a < b are positive rational numbers.
220 B.S. Thomson Combining Lemmas 45 and ?? we must have 0~ V (Rab) < fl*(Rah) < &~ V(/U) which, since ? < b and everything is finite, is only possible if both <p*(Ruh) = ?*(?«/>) = 0. From this the lemma evidently follows. ? Lemma 50. 1/ф*(Е) < оо ?/?еи В hm /-*/*(/) emfs and is finite for ?*-?.?. ? e ?. PROOF. It follows immediately from Lemma 47 that ? hm sup < oo /^/ M(/) except for a set of points in ? of ?*-measure zero. Thus the result follows from Lemma 49. ? There remains now only the problem of determining whether /^.r ?(/) at most points. For this we need to add a few assumptions to B6) and B7) that allow the arguments to be used. The extra needed assumptions are ?(?)=?(?,?[?]) = ?(?.?*[?]) B8) and ?(?) = ?(?,?[?])=?(?,?*[?]) B9) for all ? e ? so that the outer measures ?* and ?* are indeed closely related to the usual outer measures generated by the measure space. Also we require that the measures are complete so that sets of measure zero that enter into the discussion are also covered by the assumptions in B8) and B9). Finally we shall assume that the limit g(x) = В lim is measurable C0) ?^? ?(/) so that we can use some recognizable measure-theoretic arguments on g to show it is precisely / almost everywhere. (Measurability can, in some settings, be obtained directly from strong Vitali assumptions; see [24, p. 155].)
Differentiation 221 THEOREM 5 1. Under these additional assumptions, if ? С ?, ?(?) < oo, and ?(?) < oo, then Slim ? /?? = /(?) for ?-a.e. ? e E. PROOF. We have established already, in our more abstract version of this (Lemma 33), the existence of this limit g(x) outside a set of д*-теа5иге zero in E. We shall show that ?(?) = ?/?? = ?8??. Since this shows this identity for all such sets the functions / and g can differ only on a set of measure zero. Let E\ = {x e E: g(x)=0} and E2 = {x e E: 0 < g(x) < oo}. We deduce from our usual growth estimates that ф(Е\) = 0 and that ?(?\(?]??2))=0. Thus we can focus on Ег. Fix 1 < t < 00 and partition Ei into a denumerable collection of disjoint measurable sets A„ = {xeE2: t" ? g(x)<tn+i} where ? is an integer (positive, negative or zero). Using Lemmas 45 and 46 along with B8) and B9) ?(?2) = ? ?(?»^ ? '" + ??(?„? ? 4 ?<1? = ? &??. ?=-? ?=-? ?=-? Ja" jEl Similarly ЭС Г^С -л- л г* ?(?2)= ? ?^?^> ? г"^А^> ? ?_' / &?? = ?? / &??. J A,, J E- ? = — ? н=— эс ??=—??. " Since ?(?\) = ?(?\) = 0 and ?(? \ (?, U ?2)) =?(? \ (?, U ?2)) = 0 it follows that / 8?? = ?(?) = j ??? and the theorem follows. ?
222 B.S. Thomson 9. Strong Vitali conditions The assumptions ?(?) = ?(?,?[?]) = ?(?,?*[?]) C1) and ?(?) = ?(?,?[?])=?(?,?*[?]) C2) that played a role in our theory are known as strong Vitali conditions. We now turn to the methods that have been used to establish such conditions. To keep the discussion to its most basic level and avoid abstractions and generalities that might obscure the main themes let us work in a metric space X. We assume as before that ? is a locally finite, Borel regular outer measure on X. The derivation basis В under consideration must be compatible with this metric in the sense of (Вб). As usual we require that В is filtering [i.e., satisfies (B|), (B2), (B.i)]. We need to recall some expression of the classical Vitali theorem. It asserts that, for the usual derivation basis on the real line and Lebesgue outer measure ?, the elements of the dual (i.e., the so-called Vitali covers) can be used to estimate the measure of sets. Specifically let ? be a set of finite Lebesgue outer measure. Let /3 be a fine covering relation on a set ? с К so that /3 consists of pairs ? e E, I an interval of ? with there being, for each x, at least one sequence {(h.x)} with ? e h ? ? and |h \ —* 0 as к —> oo. Then, for every ? > 0 the Vitali theorem provides the existence of a sequence {(/*,&)} С 0 such that A) The {/;·} are disjointed. B) The measure ??|/?<?(?) + ?. [Here ? is Lebesgue outer measure and there is no prior assumption on the measurability of E.] C) The measure k(E\\Jlk\=0. Expressed this way it is easy to fail to appreciate the role of item B). By the regularity of Lebesgue outer measure we can find an open set G D ? so that k(G) < HE) + ?. Recall that the pruned covering relation /3(G) is also a Vitali covering relation on E. Then any selection of {(/*,&-)} С /3(G) must always satisfy B). Since for any Vitali cover /3 it is also
Differentiation 223 true that /3(G) is a Vitali cover, item B) can be settled in advance. Thus one needs, in order to prove this theorem, only to obtain items A) and C). Moreover if ? is a measurable cover of ? then by choosing G D Ё so that X(G \ ?) < ? we can arrange for B) to be replaced by a sharper version B') The measure ?(?>\^)<?· Some versions of the theorem are expressed this way instead. Since we are faced with the need to find an abstract expression of this theorem in the setting of an abstract derivation basis we have a number of choices. We might simply, as do many authors, define a strong Vitali property by exactly these three conditions, expressed in an obvious way relative to elements of the basis B[E] or B*[E]. Alternatively we can do as many other authors have done and take only the conditions A) and C) for an abstract expression of the strong Vitali property and place our assumptions in a context which guarantees that B) will hold automatically. The latter is simpler and better expresses the nature of the theory, although there is some formal merit in taking all three properties as a starting point for developing a theory. Note, in the setting of the real line, this expression of the Vitali theorem shows that ?(?,?(?))!??@ and ?(?./8)>]??(/*)>?(?)-? к so that ?(?? ?(?, ?*[?]) <;?@<?(?) +? and it follows that ?(?) = ?(?, ?>[?]) = ?(?. ?>*[?]) giving our Vitali condition in the form C1) expressed at the beginning of the section. 9.1. Strong Vitali property As our definition we merely take the classical Vitali theorem expressed previously as a model. Dehnition 52. Let ? be a derivation basis on a space X. Then В is said to have the strong Vitali property with respect to ? if the following property holds for all subsets ? С of finite ? outer measure: For every /3 e B[E] there exists a sequence {(/*,&)} С/8
224 B.S. Thomson such that A) The {(h)} are disjointed. B) The measure ?(?\???=0. Since we are assuming throughout that ? is a Borel regular outer measure on a metric space X we have immediately that, for any derivation basis B, ?(?,?[?])???(?). C3) This follows from the fact that for any ? > 0 and any ? e B[E] there is an open set G D ? with m(G) ^ ?(?) + ? and so ?(?,???])??(?,?(?)??(?) + ?. Thus the strong Vitali property leads to the following two conclusions. LEMMA 53. Let В be a derivation basis on a space X that has the strong Vitali property with respect to ?. Then ?(?,?[?])=?(?). C4) LEMMA 54. Let В be a filtering derivation basis on a space X such that the dual B* has the strong Vitali property with respect to ?. Then ?(?,?[?]) = ?(?,?*[?]) = ?(?). C5) 9.2. Classical proofs of the Vitali theorem There is now a large literature devoted to covering theorems of the strong Vitali type, that is coverings consisting of countable disjointed families covering almost all points in some set. One theme that has emerged in the general theory is to take some standard technique that can be used to prove the classical theorem and then tailor an abstract version. The most popular versions can be traced back to Banach's [3] proof of the Vitali covering theorem. Here is one theme that we might follow. Let ? be a ball in R". By В we denote the ball with the same center but 5 times the radius of B. Then one can prove the following: Let ? be a collection of nondegenerate closed balls in E" with uniformly bounded diameters. Then there is a countable disjointed family To С Т so that ??? ????
Differentiation 225 For a simple proof and further discussions see Mattila [51, pp. 23-25]. Proofs using the Hausdorff maximal principle instead can be found in Ziemer [91, pp. 7-8], Gariepy [22, pp. 27-28], and Federer [24, 2.8.4-2.8.6]. On the basis of this geometric covering theorem one can then prove, using only measure-theoretic arguments, the Vitali theorem, where here ?" denotes the и-dimensional Lebesgue measure on Ш": Let ? be a collection of nondegenerate closed balls in K". Suppose that ? is a Vitali cover of a set E. Then there is a countable disjointed family iicf so that X"U\ [J в)=0. The theorem is special to Lebesgue measure, although not restricted to that measure. The key feature in the proof is that ?"(?) = 5"?"(?)· C6) In the discussion above let us refer to the set В \ В as the "halo" around the ball B. In order to apply these methods to a measure ? on R" other than ?" we would need to have some control on the ?-measure of the halo around a ball in proportion to the measure of the ball that would play the role that is played by C6) for Lebesgue measure. To generalize to measures on an arbitrary metric space takes some caution^since troubling details emerge. Since balls do not have unique centers the definition of В and hence the definition of the halo for a ball В requires some thought. Also something analogous to C6) for a general measure would be needed. A careful analysis of the conditions needed to extend these arguments to general setting was carried out by Morse [57] and [58]. He expressed his ideas as "halo properties", because of the terminology just used for В \ В. An extensive account of halo properties in relation to strong Vitali properties can be found in Hayes and Pauc [31, pp. 41-77]. See also Federer [24, pp. 141-145] for a discussion, but not using Morse's language. The expository account in Bruckner [13, pp. 32-34] also describes other treatments of these ideas. Here is a brief sample to illustrate the kind of language. The methods are technical adaptations of the classical methods. Let X be a metric space and ? a measure on X. Let ? be a nonnegative, bounded function defined on the class of generalized intervals. For some fixed positive number ? and any generalized interval J we denote by J the set J = \J{I: (I,x)ea, inS^O, 8(I)^tA(J)\ and refer to J as the ?-? enlargement of J. This enlargement is to play the same role as the enlargement obtained in the classical version by multiplying the radius by a factor of 5. We now need an assumption to play the role of C6). We assume, for some ? < ??, that ?G)????G). C7)
226 B.S. Thomson Under these assumptions any derivation basis В that is compatible with the metric on X will have the strong Vitali property with respect to ?. For an elegant proof (and full statement of hypotheses) see Federer [24, pp. 141-145]. Morse's original version appears in [57]. Hayes and Pauc [31, p. 42] call ? a disentanglement function and prove this and several other versions. These methods are known popularly as "halo methods". The most common choice of disentanglement function is to take ?(?) as the diameter of / but other interesting choices can be found in Federer [24, pp. 145-146] and Morse [57, p. 296]. In the spirit of abstract derivation theory we should ask whether there is anything that can be said in the converse direction: If a basis has the strong Vitali property or does differentiate integrals must there also be some kind of halo property as above? The key is the requirement in the strong Vitali property that the sets extracted be disjoint. By adding a countable "cloud" to the sets / in the pairs (/. .v) we can interfere with the Vitali property without interfering with the differentiation properties of the basis. (See Hayes [30] and de Guzman [28, pp. 32-33].) As we shall see, however, weak halo properties and weak derivation properties do go in both directions. 9.3. (Q)-Pmperty The strong Vitali property can be proved in some settings by establishing a variant which involves a proportional cover. To illustrate some aspects of the technique let us use the following definition. Definition 55. We say that a derivation basis В has property (Q) with respect to an outer measure ? on X if there is a constant q > 0 so that for every ? С X with ?(?) < oo and every ? e B[E] there is a collection (?\,?\)??2.?2)??3,?3) (/„.?„) e0 with {/,} disjointed so that ????\^??^??(?). C8) Conditions of this type are satisfied by many bases. For some applications the constant q can be made to be dependent on the choice of ? and ?. Also one usually needs some kind of estimate in the other direction, requiring the measure of the set in the inequality C8) not to exceed ?(?) by a small amount. In our development we normally take this from the topological assumptions. Let us assume that X is a metric space, that the basis В is compatible with the metric structure (Вб), and that ? is a Borel regular outer measure on X. To illustrate the simplest version, we assume that all sets / appearing in the pairs (/, x) e a are closed. Then under these assumptions we can prove the following implication.
Differentiation 227 THEOREM 56. If the derivation basis В has property (Q) with respect to ?, if ?(?) < oo, /3 e B[E], and G С X is open, then there is a a collection (/|,?l),(/2,fc),(/3.?0,··· from ? with {/,} disjointed so that ОС \JliCG C9) and ??(? HG) \\Jli\=0. D0) PROOF. Let apply the property (Q) to the set G ? ?. Since /3(G) e B[G ? ?] there is a finite collection (?\,??)??2.?2)??3.?3) (??,.??,) from /3(G) so that the {/,·} are disjointed and ?? (EDG)r)[jl,j >qp(EC\G). D1) Our general assumptions always require that the set Ui'ii ^ ^e ?-measurable so ?(? П G) = ?? (? ? G) ? \J I, \ + ? ? (? ? G) \ |J /,-J . Together with D1) this shows that /Л (E П G) \ U /,¦ W A - ?)?(? П G). Now let G2 = G \ (J"=| A· Since we are assuming that the sets /, are closed,1 it follows that G2 is open. We apply the same arguments just used to the set ? ? G2 and obtain in /HG2) a collection (/,,I + l,?n, + l), (??,+?-??,+2). (/??,+3.^!,+.?) (??,2,??2) 'it is here that the closed assumption is used, to allow the pruning /JiGi) by the open set Gj. There are ways around this in more abstract settings; see, for example. Alfsen [ I ].
228 B.S. Thomson with the property that the {/,} are disjointed and д (?лс2)\ Q /;?(?-?)?(???2). \ ! =/! 1 + I But that means that ??(???\?/,)=??(???2)\ (j /,J < A - qJ ?(? ? G). Continuing this process inductively we would obtain ? ? (? П G) \ Q I,? ? A - «7)V(? П G). Note that the entire sequence {I\,h, 1щ} is disjointed. From this the theorem evidently follows since 9.4. Besicovitch-Morse property In the period 1945-47 Besicovitch [5,6] and Morse [58] independently generalized the classical covering ideas of Vitali as part of an investigation of derivation in more general setting than those in which the usual Vitali arguments (of Caratheodory and Banach) could be used. Here is one expression of Besicovitch's original theorem. There is a constant N depending only on ? so that if ? is a collection of nondegenerate closed balls in the space R" with uniformly bounded diameters and ? is the set of all centers of the balls in ? then there exist subcollections Т\,Тг,.-ч ?'?, of ?, so that each T-, is a countable collection of disjoint balls from ? and EcUU*· /=l Be^", Because the theorem is an entirely geometric one, depending not at all on a given measure, it can be used to obtain strong Vitali properties of all Radon measures on R" relative to a derivation basis consisting of balls centered at points. For recent discussions of the constants N = N(n) appearing here see Boyvalenkov [11], Fiiredi and Loeb [25], Loeb [49,50], and Sullivan [73]. Proofs of the classical theorem abound in the literature and should be easy to find. For example try first Mattila [51, pp. 28-34], de Guzman [28, pp. 2-7], Ziemer [91, pp. 9-12] or Gariepy [22, pp. 30-37].
Differentiation 229 An abstract presentation of the Besicovitch-Morse and Vitali covering theorems can be found in Bliedtner and Loeb [7]. Note, in this covering theorem, the geometric position of the associated points at the center of the balls does play a role. A simple example (given in Mattila [51, p. 28]) illustrates that there may be no nice covering property or nice differentiation results. In that example the associated points are taken rather far from the centers, leading to a collapse of the properties. We give an abstract expression of the Besicovitch-Morse ideas that can be used in many settings: Dehnition 57. Let ? be a covering relation on a space X. We say a has the Besicovitch- Morse property if there is a positive constant N so that whenever ? с X and ? с a is a covering relation on E then we may select at most N subsets ?\, /Зг, · · ·, ? ? of ? so that ?cU U ' 1=1 (I.X)?fij and each ?\ consists of countably many pairs {(/;.;,?;.;)} with the {/,.7} disjointed. We say a derivation basis В has the Besicovitch-Morse property if the underlying covering relation a on which В is based has that property. For example, any derivation based on the covering relation expressed in F) in R" has this property because of the classical Besicovitch covering theorem expressed previously. Also the net derivation bases of Section 6.7 have the Besicovitch-Morse property. We can very quickly relate this property to the property (Q) and hence to strong Vitali properties relative to appropriate measures under additional hypotheses. THEOREM 58. Let В be a derivation basis on space X and suppose that В has the Besicovitch-Morse property. Then В has the property (Q) with respect to any outer Borel measure ? on X. PROOF. Let ? С X and ? e B[E], where ?(?) < ??. Let N be the constant in the Besicovitch-Morse property. Then ? is a covering relation on ? so that we may select at most N subsets ?\,?? ? ? of ? so that N Ec\J [j I D2) ; = l (/..t)e/j, and each ?-, consists of countably many pairs {(/,.?,)} with the {/,} disjointed. Let q = ?/?. For at least one value 1 ^ /' ^ N it must be true that ^ (/..OeA '
230 B.S. Thomson which is exactly the property (Q) that we wish to establish. If not then, because of D2), N I \ ?(?)???????) \J A < ???(?) = ?(?) / = l ^ (/..t)e/i, ' which is not possible. ? 10. Weak Vitali covering properties In an elegant and highly readable paper De Possell [64] initiated the abstract theory of differentiation by expressing the notion of an abstract differentiation and then seeking to find an analogue of the strong Vitali property that would supply instead the density property. Suppose ? is a derivation basis on a space X furnished with a measure ?. The density property asserts that for a ?-measurable set ? of finite ?-measure, В hm <^± D3) /-·* ?(?) is ?-a.e. equal to 0 or 1 depending on whether ? e ? or ? ? ?. Certainly the strong Vitali property relative to ? implies the density property merely because / ?(???)= / ???? and the strong Vitali property implies the differentiation theorem. A weaker version suffices. We have already used some closely related ideas in Section 7.2. Dehnition 59. A derivation basis В is said to have the weak Vitali property relative to a measure ? if whenever ? с ?, ? D ? is a measurable cover of E, and ? e B[E] and ? > 0, there is a countable collection {(/,, ??)} с ? for which m(e\ [J/,-) = (). m(U7'\M)<? and ?>(/,)-? (jj/,)<*. D4)
Differentiation 231 There are other expressions of this possible and the conditions can vary if the setting changes. "Countable" can be replaced by "finite"; the second condition is often dropped in settings where, for example, an appropriate open set G D ? э Е can be chosen and ? replaced by the pruning /3(G). If the sum ?? M(^) is not finite some other expression will be needed. The main point to observe is that the only essential change from the strong Vitali condition is that the generalized intervals {/,} that appear are not necessarily disjoint, but are instead required to satisfy the condition D4) which controls the amount of overlap allowed. What is most remarkable is that De Possel was able to prove not merely that if the basis B* has this property this implies the density property for B, but conversely that it is equivalent to it. The proof is just measure-theoretic (we have used the ingredients of the proof already in Section 7.2). De Possel proved under fairly broad assumptions that five properties are equivalent for a derivation basis ?, including (i) the density property, (ii) that the derivation basis differentiates the integrals of all bounded measurable functions, and (iii) that this weak version of the Vitali covering theorem must hold. 10.1. Weak halo properties Not too surprisingly there are weak versions of the halo methods that can be used to establish that a basis has the weak Vitali property. It is possible to describe a halo property that is exactly equivalent to the weak Vitali property, and hence equivalent in turn to the density property and the property of differentiating all integrals of bounded functions. There is also an equivalent formulation in terms of properties of the maximal function. This should be considered one of the remarkable successes of the methods of abstract derivation theory, that in the right setting such a variety of different techniques and methods can be identified as equivalent. Variants on weak Vitali and weak halo methods can be adapted to answering the following questions for a basis that is assumed to have the density property: A) If / is a given integrable function what condition on the basis is equivalent to the basis differentiating the integral of /? B) What condition on the basis is equivalent to the basis differentiating the integral of all functions in Lp(X)? The answers can be given as special kinds of weak Vitali properties where the overlap is measured in a different but related way to D4). Any discussion of these ideas would require much apparatus and time for development. The interested reader can consult the following for accounts of weak Vitali methods, including weak versions of halo methods: Alfsen [1], Bruckner [13], Busemann and Feller [17], de Guzman [28], Hayes and Pauc [31], H. Kenyon and A. Morse [41], Morse [57,58], De Possel [64], and Thomson [79]. The bibliographies in [13,28,41] also contain many further sources. 11. Derivation bases in № The study of derivatives on the real line is a huge one with a large and deep history. This might be a bit of a surprise to the novice reader who expects that a typical undergraduate
232 B.S. Thomson and graduate education has exposed pretty well all that is to be known about derivatives. The three monographs Saks [72], Bruckner [12], and Garg [26] should give a good idea of the nature of the problems and methods and accounts of many of the highlights. The process of differentiation on the real line can be studied from a more general point of view and, in some aspects, allows a clarification of ideas. In this section we outline some of the ideas that be developed. 11.1. The ordinary derivative The ordinary derivative fix) = iim /<>>-/(*) D5) _v->-.t У — X can be captured by any of several derivation bases. Let us define the following covering relations: A) ?, = {([?,/>[,?): a < b and ? = ? ?? ? = b). B) a2 = {([a, b[, ?): a < b and ? ?? ? ?? b). C) ?3 = {([?, b[, ?): ? < b and a < ? < b). Note that the use of the half-open interval [a, b) rather than [a, b] does not impose any lack of symmetry since the points ? for the pairs ([a,fo),?) are, in all cases, handled symmetrically. The choice is made only to allow disjointed collections of intervals to be more general. Any one of ? ?, «4, or ат, can be used to express the ordinary derivative; slight differences arise in application. Let us use the first covering relation a\ and define V to be the derivation basis generated by a\ using the real metric (as in Section 6.2). It can be checked that V satisfies all of the properties (B| )-(Вб)· 11.1.1. The Riemann integral. As a preliminary to studying V consider the following basis that describes the ordinary derivative in a more elementary fashion and lacks some of the properties of T>. By ?? we mean the collection of all ? с a \ with the property that for some ? > 0 all pairs ([a, b[, ?) e a\ for which b - a <S necessarily belong to ?. The basis 11 satisfies (B|), (B2), and (B3) but does not satisfy (B4), (B5), or (B6). It can be checked that K* = V* and that П** = V. This basis allows an expression of the ordinary derivative, the Riemann integral, and the Peano-Jordan content. If we pass instead to the basis V which does satisfy all of the properties (В|)-(Вб) we shall have expressions of the ordinary derivative, but also more powerful measure-theoretic tools. The key difference can be considered to be the property (B4). Over the years there have developed many slogans by which one can express why the Lebesgue theory of measure and integration surpasses the Riemann theory. Within the admittedly narrow setting of abstract derivation theory we might claim it rests on the property (B4) that is enjoyed by V but not by H.
Differentiation 233 11.1.2. Measure properties. For any function h:a\ —* ? we define the two outer measures h*(E) = V(h,T>[E]) and /?*(?)= V(h, V*[E]) (?cR). In general /2* ^ h*. As we have seen several times already, the identity /г* = И* is considered as a strong Vitali property. Let ?? denote the function defined on ? ? by Ax([a,b[^) = b-a. Then ??* and ??* are identical and are equal to the Lebesgue outer measure ?. This is a consequence of the classical Vitali covering theorem. It also can be obtained by establishing directly the property (Q) of Section 9.3. (Had we used instead ?(??, Tl[E]) we would capture the Peano-Jordan content of the set E; the expression ?(??, TV [?]) using the dual is, however, the Lebesgue outer measure of E.) An interesting variant is obtained by considering k*(E) = ?((???.?>[?]) and ???) = ?((???,?*[?]) for 0 < s ? 1. The case s = 1 we have seen. For 0 < s < 1 the two outer measures ?,? and ?? play a role in many studies. The latter (the smaller of the two) is closely related to the Hausdorff s-dimensional measure and the former to the s -dimensional packing measure of Tricot [89] (see also Taylor and Tricot [74,75]). An account, together with applications to the study of Lipschitz numbers (as first defined by Besicovitch [4]), can be found in Thomson [76] (but using the slightly modified basis that would arise from аз above). What is the situation for the variational measures associated with a function /: R -> R? As above, write ?/([?,?,?) = /?)-/(?). It is easy to show that the variation of the function / on an interval [a, b] (denoted as usual as V(/, [a, b])) can be expressed by the variation relative to the derivation basis as V(Af,T>([a,b[)) = V(f,[a.b]). More generally there are two outer measures which carry the information as to the total variation of/: kf(E) = V(Af,T>[E]) and kf(E) = V(Af,T>*[E]). If / is continuous and has locally bounded variation then it can be proved that these are identical and are the Lebesgue-Stieltjes measure associated with the total variation of /.
234 B.S. Thomson (This property, too, can be obtained from the property (Q) or from conventional Vitali arguments.) Thus again, this can be considered a strong Vitali property. The differentiation theory for functions of bounded variation can be based on the identity of ? -? and kf. For functions / that are not locally of bounded variation there is now an extensive literature studying these measures. We can anticipate that the identity of ?' and ?/ on all subsets of a set А с R (i.e., a strong Vitali property) should be intimately related to the differentiation properties of / on A. Indeed this is the case. Functions with this property were originally investigated by Saks and Denjoy in the 1920s and 1930s but by much different techniques. An account of their ideas can be found in Saks [72] expressed in the language of VB* and VBG* functions. It is now recognized that the most fertile expression of these notions is in the language of the variational measures instead. A sampling of some of the theory can be found in B. Bongiorno et al. [8,10], Ene [21], Pfeffer [62,63], and Thomson [78,79,81-84,86,87]. Note especially that these measures form an interesting subclass of the ?-finite measures on R, a subclass that has important implications in the study of derivatives and nonabsolutely convergent integrals on R. One other interesting theme that should not be overlooked is the relation between the classes of AC* and ACG* functions and the absolute continuity of the measures ?-1 and ?/ with respect to Lebesgue measure. 11.1.3. Differentiation of integrals. Let ? be any Lebesgue-Stieltjes measure on R (i.e., ? is a locally finite Borel measure). If / is locally integrable with respect to ? then }']? 1 ?[?,?] for ?-almost every jr. There are numerous accounts of this in the literature. Federer [24, pp. 163-168] has a clear exposition, obtained from some very general and useful principles. The result also follows in the setting of Section 7 above. The key in most presentations is that there is a strong Vitali property associated with all the measures that arise in the discussion. It should be remarked that the local finiteness of ? (that is, the fact that ? is finite on all intervals) cannot be replaced in general by ?- finiteness. For a certain class of ?-finite measures there is a strong Vitali property and hence there would be a differentiation theorem. See Thomson [86] for an account of the nature of these measures. 11.1.4. Integration of derivatives. The recovery of a function from knowledge of its derivative is one of the aspects of the fundamental theorem of calculus. One version of this can be obtained as a simple application of our variational methods. If F has a derivative at every point of a measurable set ? then the methods of Section 5.7 can be used to prove that kF(E) = V(AF. ?>[?])= V(F'Ax.T>[E]) = j \F'(x)\dx.
Differentiation 235 From this we see that we can recover the variational measure associated with F from the derivative. The classical expression of a special case of this can be found in many analysis books in the form fh V(F,[a,b])= / \F'(x)\dx Ja for an everywhere differentiable function F (where V (F, [a, b]) denotes the total variation of F). On the basis of this connection between the variation of a function F and the values of the integral of | F'\ one can, in many cases, recover F from F'. This is essentially the basic ingredient in all arguments in Lebesgue's measure theory. A problem arises for everywhere differentiable functions F for which h \F'(x)\dx = oo (and such functions do exist). For such functions the measure-theoretic arguments can not provide a method to recover F from the values of F'. e Historically this led to a number of different approaches. The simplest in concept (although arguably not a "method" of inversion of derivatives) was given by Perron using collections of major and minor functions. Since the collection of major and minor functions cannot themselves be constructed any more easily than the function F itself, this concept received harsh criticism from Denjoy, who was the proponent of another method. By analyzing the structure of differentiable functions Denjoy was able to give a countable, but transfinite, series of extensions of the Lebesgue integral that can recover the primitive F (up to a constant) from its derivative F'. In our present context of derivation bases there is a simpler presentation of a method (which cannot be considered constructive in the sense in which Denjoy's is). One notes first a very special and elementary covering property of the derivation basis V: COUSIN'S LEMMA. Let ? e T>[a, b). Then there are points a = xq < ? ? < x2 < ¦ ¦ ¦ < x„ = b and points ? e [.v,_i, лг,-[ so that eachpair ([*,-_ |..r,-[, ?,) e ?. We consider this a "covering theorem", although of a considerably more elementary nature than the Vitali covering theorem. Expressed more popularly, elements in V have "partitions" of every interval [a,b). Cousin's lemma can be made into a convenient method of proving compactness statements in real analysis (as we do in [88, Section 4.5.3]). Indeed it was periodically rediscovered as a method (see the account in Thomson [76]). More interesting for our purposes is the fact that it allows for a Riemann type integral to be defined (relative to the basis T>) that inverts derivatives. The idea is nearly trivial: If F' = f everywhere then for every ? > 0 the collection L ? = {([*,?[,?): \F {y)-F (?) - f {$){y-x)\ <?{?-?), ? = ? or ? = у}
236 B.S. Thomson belongs to V. Using now the partition using elements from ? promised in Cousin's lemma, F{b) - F{a) = ? F(Xi) - F(xi-i) = ]T/(?)(*, - *,·_,) + R ;' = l ; = l where certainly и \R\ < 2^?(дг/ — дг/_|) = ?? -?). i = \ Thus if one defines a Riemann-type integral as a limit of Riemann sums II i = \ taken in the sense of the filter V then that integral will have the feature that it generalizes the ordinary Riemann integral and satisfies h F'(x)d.x = ??)- F(a) for every differentiable function F. (Had we used instead the basis H in place of V then we would have defined exactly the classical Riemann integral; the advantage of properties (В4)-(Вб) becomes clear.) The integral was discovered and exploited independently by R. Henstock and J. Kurzweil around 1960. There is a relatively large literature devoted to the study of this integral and to a number of variants. In settings where a derivation basis can be constructed that has a property analogous to the Cousin lemma such an integral can be constructed. Mostly this is not the case. Pedagogical issues abound also around this integral. It is curious indeed that an extremely general integral on the real line should be so easily defined (in the "natural" Riemann manner) and yet have been overlooked until half a century after Lebesgue. Is it easier to learn than the Lebesgue theory? No, ultimately one develops the measure- theoretic tools needed in any case. And measure theory is more important as a general subject than this particular method, which applies only in very special situations. Still it is an interesting chapter in the study of derivation bases and it has led to many techniques and problems of interest on their own. Elementary accounts of the basic integral can be found in McLeod [52] and Gordon [27]. Henstock has written a number of accounts himself: see [32,35,36]. The broader perspective in W. Pfeffer [62] is recommended to the serious reader. The survey by W. Pfeffer on the Henstock-Kurzweil integral in this volume should be consulted first. ,
Differentiation 237 11.2. The symmetric derivative The ordinary derivative can be generalized in a simple and natural manner by replacing the familiar expression D5) by one that demands instead: t^o It Certainly if the ordinary derivative /'(x) exists so too does fs (x) and with the same value. The fact that symmetric derivatives arise in certain problems in trigonometric series should guarantee their study to some extent. From the point of view of measure theory we might take the viewpoint of Khintchine [42]: He proved that for a measurable function / the set of points ? where fs(x) exists and f'(x) does not is a set of Lebesgue measure zero. For that reason he felt that the symmetric derivative could be discounted as an object of study. Since pointwise considerations (as opposed to a.e. pointwise) do play a role in many problems this is not a completely fair comment. The symmetric derivation process has been studied in many aspects. From the point of view of Vitali covering properties there is nothing new to say, but there are some other interesting covering properties. An account of many aspects and a large bibliography can be found in Thomson [84]. 11.3. The sharp derivative The ordinary derivative can be narrowed by replacing D5) by one that demands more of the derivative: f*(x)= Hm /?~/(:). D7) This is known as the sharp or unstraddled derivative (the latter because the interval [y, z] shrinks to ? but need not straddle or contain x). Certainly when f#(x) exists so too does f'(x) and with the same value. For a differentiable function / to have a sharp derivative at a point requires that /' is continuous at x. The derivation basis that expresses this can be defined by using as a general covering relation a# = {([a,b[,?): a.fe^eBanda <b\. Again note that the use of the half-open interval [a, b[ rather than [a. b] does not impose any lack of symmetry since the points ? for the pairs ([a, b[, ?) are, in all cases, handled symmetrically. Define T># to be the derivation basis generated by <*# using the real metric (as in Section 6.2). This basis has some interesting and curious properties. While it expresses a very narrow kind of differentiation, if a Riemann-type integral is defined using it in
238 B.S. Thomson a natural way the resulting integral, which we would expect to be narrower than the Henstock-Kurzweil integral discussed above, is precisely the Lebesgue integral on the line. This was exploited by McShane [53] in a monograph and developed later by him for other purposes in McShane [54]. His expository account in [54] should also be consulted. A number of papers have pursued these ideas in various settings. 11.4. Density bases on R The basis T># does not have the density property with respect to Lebesgue measure. That is to say, using the limit in D7), hm D8) y.z—x z- у need not be a.e. equal to 1 for points in ? and to 0 for points outside E. Certainly the ordinary basis V does have the density property (it differentiates all integrals) as would any "straddled" basis. A natural question arises as to what natural unstraddled bases on the real line should have the density property with respect to Lebesgue measure. Let us formulate the problem more narrowly. Let [ak, bk} be a sequence of intervals converging to 0 in the sense that О <ак+\ <bk+\ <ak <bk <¦¦¦ and ak —* 0. At every point we consider the translation invariant version of this limit, namely [x +ak, ? + bk] -> x. (An expression in terms of derivation bases can be made but would be more cumbersome.) What conditions on the sequences {ak} and {bk} would require that the corresponding derivation basis has the density property with respect to Lebesgue measure. The answer was given by Aversa and Preiss [2] as an application of one of the basic halo properties in Hayes and Pauc [31, Theorem 2.2]: If bk+\ hm sup < oo k^oo bk - ak then the derivation basis has the density property with respect to Lebesgue measure. For other interesting variants on this same theme see [2]. 11.5. The app roxima te de riva tive This section can continue to spawn further subsections if we are prepared to account for the full variety of derivation processes studied on the real line. For example, the reader should
Differentiation 239 by now have little trouble in describing a derivation basis A that captures the approximate derivative and in describing its dual A*. It will have most of the Vitali properties of the ordinary basis. More interesting is to ask for which further class of functions / will there be the strong Vitali relations V(Af.A*[E]) = V(Af.A[E]). This seems not yet to have been investigated, although it should not require entirely new methods. There are other covering theorems; the Cousin lemma has an analogue in this setting (see the proof in [76]). Consequently there is a Riemann-type integral that can be defined which inverts approximate derivatives. Let us terminate here, however, and move on to higher dimensions. 12. Derivation bases in R" 12.1. The cube basis In the space R" let a denote the collection of all pairs (/, ?) where / is an «-dimensional cube and ? e /. The derivation basis T>(, generated from a using the ordinary euclidean metric is called the "cube basis" and differentiation relative to this basis is usually called "ordinary differentiation". Let ?„ denote Lebesgue measure on R". The basis cube basis V„ has the strong Vitali property relative to ?„. There are a number of ways to prove this. The methods of Section 9 can be used, and indeed some of the details of the method are given already in that section. It follows then that the differentiation theorem holds for this. Namely for every locally Lebesgue integrable function /, the set of points ? e R" at which V0 Jim —— f \f(x) - f{y)\dX„(y) = 0, D9) that is the Lebesgue set of /, includes ?,,-a.e. point of R". In particular, for ?,,-almost every point ? e R" V0 Jim —— f f(y)dX„(y) = f(x). E0) Thus, in many ways, this seems to be the most natural and important derivation basis on R". A variant on this that would be as natural and have these same properties is to take the balls instead: Thus a consists of all pairs (/, ?) where / = B(y, r) is a ball of radius r and ? e I (which requires that |лг — y\ < r). Note that in both cases the associated point ? in the pairs (/, x) is in the set / but not required to be in any special geometric position in that set. Consider, however, the question as to whether this same result holds true if ?„ is replaced by any locally finite Borel measure ? on R". An easy example shows that this is not the case. Let L be the line у = ? in R2 and let ?(?) be the linear measure of ? П L. Then there is no strong Vitali property for V„ relative to ? and there is no differentiation theorem. An examination of this example shows that the position of the point ? e / for the pairs (?, ?) seems to be a critical issue for the strong Vitali property.
240 B.S. Thomson 12.2. The basis of centered balls The issue raised in the preceding section suggests requiring a natural geometric condition on the position of the point ? e I for the pairs (/, ?) e a. Let us consider the basis obtained from centered balls. Now a denotes all pairs (/. x) where / = B(x, r) is a ball of radius r and x, the associated point, is the center of the ball /. The basis generated from a from the euclidean metric on R" can be proved to have the Besicovitch-Morse property. Consequently this basis has the strong Vitali property relative to any locally finite Borel measure and the analogous statements in E0) and E0) hold for any such measure. Thus, perhaps instead of the cube basis, this seems to be the most natural and important derivation basis on R". Certainly it is used as the main object in many investigations and many accounts can be found in the literature. Consult, for example, Gariepy and Evans [22], Mattila [51 J, and Ziemer [91] for full developments with proofs of the geometric covering properties and differentiation properties possessed by this basis and the role it plays in geometric measure theory and many important questions in analysis. As we have remarked, the fact that the associated point ? in the pair (/, ?) here is the center plays a crucial role: The Besicovitch-Morse property fails if ? is merely taken as any point in the ball /. But there is room for maneuver. Provided the point is not too far from the center, the same property can be proved. Also the balls may be replaced by cubes provided the points are, once again, not too far from the centers of the cubes. 12.3. The interval basis Having seen the success of the cube basis for the differentiation of Lebesgue integrals in Section 12.1 one might expect to be able to generalize to a basis consisting of intervals, rather than cubes. Take a to consist of pairs (?, ?) where / is an interval and ? e /. The basis Vs generated from a is known as the "interval basis" and the corresponding differentiation process, since it is clearly more demanding than the cube basis, is known as "strong differentiation". The strong differentiation fails to have the strong Vitali property with respect to Lebesgue measure and the Lebesgue differentiation property fails. The first indication of this appears in Caratheodory [18] where he presents a counterexample of H. Bohr. A simplified version of that appears in de Guzman [28, pp. 92-95] where he is able to use that construction to show both that the strong Vitali property fails and that the interval basis does not differentiate the integrals of all integrable functions. It fails somewhat dramatically. Saks [70,71] showed that for all functions / in the Banach space L \ (R") excepting only a first category set, P.Hmsup ffdk"=oo E1) at every point ? e R". This is sometimes known as the "Saks rarity theorem" and is an early version of what has become a large industry, showing that some curious behaviour of
Differentiation 241 functions holds "typically" or "generically" in a certain space of functions. Full proofs can also be found in de Guzman [28, pp. 96-99] and Hayes and Pauc [31, pp. 107-109]. If the strong differentiation basis T>s fails to differentiate the integrals of all Lebesgue integrable functions, what positive properties does it have? Even if the typical integrable function satisfies E1), for what class of functions does it differentiate their integrals? The first (and merely partial) answer to this question is obtained by showing that T>s has the weak Vitali property with respect to Lebesgue measure. This can be proved by weak halo methods. See Hayes and Pauc [31, pp. 78-79]. Consequently T>s differentiates the integrals of all bounded measurable functions. (Recall that that this differentiation property is equivalent to the density property and to the weak Vitali property and to the weak halo property.) But this is not the best possible. A remarkable theorem of Jessen, Marcinkiewicz and Zygmund [40] describes precisely (in a sense) what that class of functions is. It is larger than the class of bounded functions, but because of the Saks rarity theorem must form a first category subset of L \ (W). This is a truly wonderful story to recount but perhaps it is best left to others. There is much to relate and deep connections with other ideas in analysis. See Hayes and Pauc [31, pp. 83-94] and Guzman [28, pp. 50-51] for proofs of the theorem. 12.4. Rectangle basis While the cube basis has the best properties with respect to Lebesgue measure, we have seen that the interval basis still has very strong and useful properties. What happens, say in R2, if we replace intervals by rectangles, that is if we no longer require of our rectangles that the sides be parallel to the axes? The resulting derivation basis does not have the density property with respect to Lebesgue measure and so all reasonable Vitali or halo properties fail. An interesting account of the history and the properties of this basis can be found in Guzman [28, Chapter 5] along with a proof that the density property fails. There is a brief discussion also in Hayes and Pauc [31, pp. 104-106]. Other proofs can be found in Busemann and Feller [17] and Papoulis [61]. For our purposes we should be content with an observation that arises from considerations of the three bases here (the cube basis, interval basis, and rectangle basis). From a metric or measure-theoretic viewpoint they seem hardly distinguishable. Had we been inclined to think that the covering properties and derivation properties of a derivation basis should be approachable solely by measure-theoretic and topological arguments, these examples should serve to disabuse us of the idea. Certainly there are geometric arguments needed at many stages in our theory. For example, with an appropriate restriction, a derivation basis constructed from rectangles in different orientations could have the strong Vitali property relative to Lebesgue measure even though the full interval and rectangle bases do not. For example let the class of rectangles be restricted to one in which every pair of rectangles in the class has the property that one of them includes a translate of the other; then the basis constructed from such a family is an example of what Morse calls a "hive" and it does have the strong Vitali property relative to Lebesgue measure. See Morse [57] and Hayes and Pauc [31, pp. 111-112].
242 B.S. Thomson 12.5. Regular bases Intermediate between the cube basis and the interval basis are regular bases that are designed to supply the strong Vitali property with respect to Lebesgue measure in R". The key is the shape of the intervals compared to the cubes: Long narrow intervals interfere with strong Vitali property. For a set ? с R" define its index of regularity as р(Н) = %?клп where the supremum is taken over all cubes С that contain the set ?. This measures how far ? is away from being close to cube shaped, values close to zero indicating the unpleasant extreme. We define a variant on the interval basis as follows. For a take all pairs (/, x) where / is an interval and ? & I. The basis T>,- is defined so as to contain all ? с a with the following property. For each ? e R" there is a S > 0 and ? > 0 so that ? contains all pairs (/, x) e a for which diameter(/) < ? and p(I) > <5. (The numbers S and ? can depend on x.) This basis has the strong Vitali property. One need not use intervals here; any measurable sets with the same properties would work. The proof just uses a halo argument. See de Guzman [28, pp. 25-27] for an elementary and direct proof. For a generalized version of a regular basis and a proof that such bases have strong Vitali properties see Comfort and Gordon [19]. 12.6. Star bases Generally in any space, such as R" here, we would select a derivation basis to study that arises from some compelling geometrical consideration. Certainly the bases above, the cube basis, centered ball basis, and interval basis are entirely natural objects of study. But from the general viewpoint of abstract derivation theory we likely feel motivated to ask a problem in a converse direction. Exactly what derivation bases would have some specified property. Indeed let us ask what bases on R" would have the strong Vitali property relative to all locally finite Borel measures? We have seen that the centered ball basis does. The cube basis does not, although if we modified it by requiring that the associated points are at the centers of the cubes then it would have this property. This question was considered by Besicovitch [5] and by Morse [58]. The answer is suggested by the examples in the preceding paragraph. There must be some geometric relation between the associated point ? and its set / in the pairs (/, x). A basis is said to be a star basis if the appropriate condition holds, and for star bases the strong Vitali property will hold for all locally finite Borel measures. A brief expository account can be found in Bruckner [13, pp. 12-14] and a full account with proofs in Hayes and Pauc [31, pp. 114-119].
Differentiation 243 13. De La Vallee Poussin theorem De La Vallee Poussin obtained a version of the Lebesgue differentiation theorem that clarifies considerably the nature of the differentiation result for functions of bounded variation. If F is a function of bounded variation on [a, b] then most students learn that I b F'(t)\dt ? V(F,[a,b]) and that, if in addition F is absolutely continuous, then J \F'(t)\dt= V(F,[a,b]). Ja Elementary textbooks often do not give the best version of this collection of ideas. Let ?? be the signed Borel measure associated with F (i.e., the Lebesgue-Stieltjes signed measure), let f^and Е-ж denote the set of points at which F'(x) = ooand F'(x) = -oo. Then, assuming F is continuous and has bounded variation, = /'' Jb ??(?)= / ?'A)??+??(????)+??(??)?-^) Jb for all Borel sets B. A proof appears in [14, pp. 301-302]. One sees that the singular part of ?? is concentrated on the sets where the derivative exists but is infinite. How much of this can be expressed in a general setting for a signed measure ? and a measure ?? Are there general hypotheses that would lead to the relation ?(?)= ? /??+??(????)+??(???-?) Jb where f{x) = В hm —— ?^? ?(?) and ?oo and ?-? are the sets of points at which this limit exists infinitely as oo and —oo? Bruckner [13. p. 39] illustrates that this is not true in any generality by showing that for most natural bases В in E2 that it is possible that ? is singular and yet ? or ¦ ?·?(/) or ?(/) 0 = В hm ??? < В hm sup = oo /-.? ?(?) /_/ ?(?) on the set where ? is concentrated. Thus there is no infinite derivative anywhere. It is not known if there is any general version possible. Saks [72, pp. 155-156] shows that for a net structure basis in R" the analogue of the De La Vallee Poussin theorem does hold. See also the proof in [14, pp. 352-353].
244 B.S. Thomson 14. The Radon-Nikodym theorem In most applications of measure theory one can use the Radon-Nikodym theorem to express the relationship between a pair of measures ? and ? by ?(?)= ? fdti. This is true in many measure spaces (?,?,?) for all measures ? that are absolutely continuous with respect to ? and not too large. Commonly the function / whose existence is claimed by the theorem is called a Radon-Nikodym derivative of ? with respect to ?, although it is not obtained as a derivative in the sense of any limit. From the point of view of the abstract theory it becomes natural to ask whether any derivation basis В can be imposed on the measure space so that the function / can be expressed as a genuine pointwise limit, В lim = f(x) ?-a.e. This question may not be natural from all points of view. In most applications of a measure space we are more likely to construct a derivation basis from the geometry to hand and then ask for its properties, rather than ask for the existence of a derivation basis that supplies some need. What we require, apparently, is that the derivation basis we want have appropriate strong Vitali properties. Net structures always supply the strong Vitali properties needed and so whenever a net structure can be imposed this solves our problem. Recall, from Section 6.7, that if the measure space is "separable" in the sense given there then a net structure does exist. More generally any space that has a linear lifting also provides a derivation basis with adequate strong Vitali properties. A collection of necessary and sufficient conditions are given in Kolzow [43] for a measure space to permit this. Without going into details one can summarize popularly by saying that whenever the Radon-Nikodym theorem is available so too is a derivation basis that supplies that Radon-Nikodym derivative as a genuine derivative. For all the ingredients of a readable and accessible proof of the connection between linear liftings and bases with the strong Vitali property see the account in Bruckner [ 13, pp. 35-37]. This material is also developed in [14, pp. 355-362]. For a different perspective on the Radon-Nikodym derivative, this time appearing as a stochastic derivative, see Hayes and Pauc [31. pp. 177-178]. 15. Some further remarks If there is not a unified theory of abstract derivation, there is certainly a body of knowledge that deserves that name. The concepts and techniques of the subject (Vitali covering
Differentiation 245 properties, Besicovitch-Morse covering properties, halo properties, Hardy-Littlewood maximal operator methods, variational measures, liftings, etc.) are part of the general culture of analysis. The survey in this chapter has hardly served as a tour, merely an introduction to a selected variety of themes. The story of the Hardy-Littlewood methods will have to be told by someone else. There is also a connection between martingale methods and differentiation that should be explored elsewhere. The monograph of Hayes and Pauc [31 ] is half of it devoted to that subject. We should remark that martingale methods also seem to be able to replace Vitali methods in a number of differentiation results. Morayne and Solecki [55] and Morayne [56] have proved the Lebesgue differentiation theorem and the Jessen, Marcinkiewicz and Zygmund theorem by using powerful martingale methods in place of appeals to covering theorems. Bliedtner and Loeb [7] develop connections between martingale convergence theorems, covering properties, and differentiation of measures. The high level of technical apparatus needed to develop some of the ideas has been obscured by our simple presentation. The interested reader will have to go more seriously into the details of Hayes and Pauc [31] and de Guzman [28, pp. 2-7] to get a deeper view of the subject. Probably the monograph Ken yon and Morse [41] should be mastered, but only by the most devoted. References [1] E.M. Alfsen, Some covering theorems of the Vitali type. Math. Ann. 159 A965), 203-216. [2] V. Aversa and D. Preiss, Hearts Density- Theorem. Real Anal. Exchange 13 A) A987-88), 28-32. [3] S. Banach, Surle theoreme de Vitali, Fund. Math. 5 A924), 130-136. [4] A.S. Besicovitch, On linear points of fractional dimension. Math. Ann. 101 A929), 161-193. [5] A.S. Besicovitch, A general form of the covering principle and relative differentiation of additive functions. Proc. Cambridge Philos. Soc. 41 A945). 103-110. [6] A.S. Besicovitch, A general form of the covering principle and relative differentiation of additive functions 11, Proc. Cambridge Philos. Soc. 42 A946). 1-10. [7] J. Bliedtner and P. Loeb, A reduction technique for limit theorems in analysis and probability theory. Ark. Mat. 30A992), 25^43. [8] B. Bongiorno, W.F. Pfeffer and B.S. Thomson. A full descriptive definition of the Gage integral. Canadian Math. Bull. 39 D) A996), 390-401. [9] B. Bongiorno, L. Di Piazza and V. Skvortsov. A new full descriptive characterization of Denjoy-Perron integral. Real Anal. Exchange 21 A995/96). 656-663. [10] B. Bongiorno, L. Di Piazza and D. Preiss, Infinite variation and derivatives in K", J. Math. Anal. Appl. 224 A998), 22-33. [11] P. Boyvalenkov, On the Besicovitch constant in small dimensions. C. R. Acad. Bulgare Sci. 50 A997). 17-18. [12] A.M. Bruckner, Differentiation of Real Functions. Lecture Notes in Math.. Springer, Berlin A978). [Second edn., CRM Monograph Series, Vol. 5, Amer. Math. Soc.. Providence. RI A994).] [13] A.M. Bruckner, Differentiation of integrals. Amer. Math. Monthly 78 A971) (Slaught Memorial paper No. 12). [14] A.M. Bruckner, J.B. Bruckner and B.S. Thomson, Real Analysis, Prentice-Hall A997). [15] A.M. Bruckner and M. Rosenfeld, On topologizing measure spaces via differentiation bases. Ann. Scuola Norm. Sup. Pisa 23 A969), 243-258. [16] Z. Buczolich and W.F. Pfeffer, Variations of additive functions. Czechoslovak Math. J., in press. [17] H. Busemann and W. Feller, Ziir Differentiation der Lebesgueschen Integrate. Fund. Math. A934).
246 B.S. Thomson [18] С. Caratheodory, Vorlesungen iiber reelle Funktionen. Leipzig A927). [19] W.W. Comfort and H. Gordon, Vitali's theorem for invariant measures. Trans. Amer. Math. Soc. 99 A961) 83-90. [20] G.A. Edgar, Integral, Probability; and Fractal Measures. Springer. New York A998). [21] V. Ene, Real Functions - Current Topics. Lecture Notes in Math.. Vol. 1603. Springer, Berlin A995). [22] L.E. Evans and R.F. Gariepy, Measure Theory and Fine Properties of Functions. CRC Press. Boca Raton. FLA992). [23] K.J. Falconer, The Geometry of Fractal Sets. Cambridge University Press. Cambridge A985). [24] H. Federer, Geometric Measure Theory. Springer. Berlin A969). [25] Z. Furedi and PA. Loeb. On the best constants for the Besicovitch covering theorem. Proc. Amer. Math. Soc. 121A994), 1063-1073. [26] K. Garg, Theory of Differentiation. CRM Monograph Series, Vol. 24, Amer. Math. Soc.. Providence, Rl A998). [27] R.A. Gordon, The Integrals of Lebesgue. Denjoy. Perron and Henstock. Grad. Studies in Math.. Vol. 4, Amer. Math. Soc., Providence, Rl A994). [28] M. de Guzman, Differentiation of Integrals in K". Lecture Notes in Math., Vol. 481, Springer. Berlin A975). [29] M. de Guzman. A general form of the Vitali theorem. Colloq. Math. 34 A975), 69-72. [30] C.A. Hayes, Differentiation with respect to ?-pseudo strong blankets and related problems. Proc. Amer. Math. Soc. 3 A952). 283-296. [31] C.A. Hayes and C.Y. Pauc. Derivation and Martingales. Springer. Berlin A970). [32] R. Henstock, A Riemann type integral of Lebesgue power. Canad. J. Math. 20 A968). 79-87. [33] R. Henstock, Generalized integrals of vector-valued functions. Proc. London Math. Soc. C) 19 A969), 317-344. [34] R. Henstock, Generalized Riemann integration and an intrinsic topology, Canad. J. Math. 32 A980), 395- 413. [35] R. Henstock, Lectures on the Theory of Integration. World Scientific, Singapore A988). [36] R. Henstock, The General Theory of Integration. The Clarendon Press, Oxford University Press. New York A991). [37] E.J. Howard, Analyticity of almost everywhere differentiable functions, Proc. Amer. Math. Soc. 110 A990), 745-753. [38] K. Iseki, On relative derivation of additive set-functions, Proc. Japan Acad. 36 A960). 630-635. [39] K. Iseki, On decomposition theorems of the Vallee-Poussin type in the geometry of parametric cur\es, Proc. Japan Acad. 37 A961). 169-174. [40] B. Jessen, J. Marcinkiewicz and A. Zygmund, Note on the differentiability of multiple integrals. Fund. Math. 25A935). 217-234. [41] H. Kenyon and A. Morse, Web Derivatives, Amer. Math. Soc. Memoirs A973). [42] A. Khintchine, Recherches sur la structure des functions mesurables. Fund. Math. 9 A927). 212-279. [43] D. Kolzow, Differentiation von Massen, Lecture Notes in Math., Vol. 65. Springer, Berlin A968). [44] J. Kurzweil, Generalized ordinary differential equations and continuous dependence on a parameter, Czechoslovak Math. J. 82 A957), 418^*46. [45] J. Kurzweil and J. Jamik. Differentiability and integrability in ? dimensions with respect to a-regular intervals, Results Math. 21 A992). 138-151. [46] S. Leader, What is a differential? A new answer from the generalized Riemann integral, Amer. Math. Monthly 93 A986), 348-356. [47] S. Leader, A concept of differential based on variational equivalence under generalized Riemann integration, Real Anal. Exchange 12 A986-87). 144-175. [48] S. Leader, Integral and Differential on the Real Line: An Innovative Unified Theory, to appear. [49] PA. Loeb, An optimization of the Besicovitch covering. Proc. Amer. Math. Soc. 118 A993), 715-716. [50] PA. Loeb, On the Besicovitch covering theorem, SUT J. Math. 25 A989). 51-55. [51] P. Mattila, Geometry of Sets and Measures in Euclidean Space, Cambridge University Press. Cambridge A995). [52] R.M. McLeod, The Generalized Riemann Integral. Math. Assoc. Amer.. Washington, DC A980). [53] E.J. McShane, A Riemann-type integral that includes Lebesgue-Stieltjes. Bochner and Stochastic integrals. Amer. Math. Memoirs. No. 88, Providence. Rl A969).
Differentiation 247 [54] E.J. McShane, A unified theory of integration, Amer. Math. Monthly 80 A973). 349-359. [55] M. Morayne and S. Solecki. Martingale proof of the existence ofLebesgue points. Real Anal. Exchange 15 A989/90). 401^406. [56] M. Morayne. to appear. [57] A.P. Morse, A theory of covering and differentiation. Trans. Amer. Math. Soc. 55 A944), 205-235. [58] A.P. Morse, Perfect blankets. Trans. Amer. Math. Soc. 61 A947). 418^*42. [59] M.E. Munroe, Introduction to Measure and Integration. Addison-Wesley A953). [60] A. Novikov and W.F. Pfeffer, An invariant Riemann type integral defined by figures, Proc. Amer. Math. Soc. 120A994). 849-853. [61] A. Papoulis, On the density theorem. Proc. Amer. Math. Soc. 2 A951). 709-717. [62] W.F. Pfeffer, The Riemann Approach to Integration, Cambridge Univ. Press. New York A993). [63] W.F. Pfeffer. On variations of functions of one real variable. Comment. Math. Univ. Carolin. 38 A) A997). 61-71. [64] R. de Possel, Surla derivation abstraite des functions d'ensembles A936). [65] O. Nikod^m. Sur la mesure des ensembles plans dont tons les points sont rectilineairement accessible. Fund. Math. 10A027) 116-168. [66] D. Preiss, Gaussian measures and covering theorems. Comment. Math. Univ. Carolin. 20 A979). 95-99. [67] Ch. de la Vallee Poussin, Sur Vintegrate de Lebesgue. Trans. Amer. Math. Soc. 16 A915), 435-501. [68] Ch. de la Vallee Poussin, Integrates de Lebesgue. Functions d'Ensemble, Classes de Baire, Gauthier-Villars. Paris A934). [69] C.A. Rogers. Hausdorff Measures. Cambridge University Press. Cambridge A970). [701 S. Saks, Remark on the differentiability of the Lebesgue indefinite integral. Fund. Math. 22 A934). 257-261. [71] S. Saks. On the strong derivatives of functions of intenals. Fund. Math. 25 A935). 235-252. [72] S. Saks. Theory of the Integral. Dover A937). [73] J.M. Sullivan, Sphere packings give bound for the Besicovitch covering theorem. J. Geom. Anal. 4 A994). 219-231. [74] S.J. Taylor and С Tricot. Packing measure and its evaluation for a Brownian path. Trans. American Math. Soc. 288 A985), 679-699. [75] S.J. Taylor and С Tricot. The packing measure of rectifiable subsets of the plane. Math. Proc. Cambridge Phil. Soc. 99A986). 285-296. [76] B.S. Thomson. Covering systems and derivatives in Henslock division spaces. J. Lond. Math. Soc. 2 D) A971), 103-108. [77] B.S. Thomson, Covering systems and derivatives in Henstock spaces II. J. London Math. Soc. B) 9 A974). 4 pp. [78] B.S. Thomson. Measures generated by a differentiation basis. Bull. London Math. Soc. 9 A977). 279-282. [79] B.S. Thomson, On weak Vitali covering properties. Canad. Math. Bull. 21 C) A978). 339-345. [80] B.S.Thomson, On full covering properties. Real Anal. Exchange 6 A) A981), 77-93. [81] B.S. Thomson, Derivation bases on the real line (I). Real Anal. Exchange 8 A982-83). 67-207. [82] B.S. Thomson, Derivation bases on the real line (II), Real Anal. Exchange 8 A982-83), 278^142. [83] B.S. Thomson, Real Functions, Lecture Notes in Math., Vol. 1170, Springer, Berlin A986). [84] B.S. Thomson. Derivates of Internal Functions, Amer. Math. Soc. Memoirs, No. 452 A991). [85] B.S. Thomson, Symmetric Properties of Real Functions. Monographs and Textbooks in Pure and Applied Mathematics, Vol. 183, Marcel Dekker A994). [86] B.S. Thomson, ?-finite Borel measures on the real line. Real Anal. Exchange 23 A) A997-98), 185-192. [87] B.S. Thomson, Some properties of variational measures. Real Anal. Exchange 24 B) A998/99), 845-853. [88] B.S. Thomson, J.B. Bruckner and A.M. Bruckner. Elementary Real Analysis, Prentice-Hall B001). [89] С Tricot, Two definitions of fractional dimension. Math. Proc. Cambrigde Philos. Soc. 91 A982), 57-84. [90] A.J. Ward, On the derivation of additive functions of intenals in m-dimensional space. Fund. Math. 28 A937), 265-279. [91] W.P. Ziemer, Weakly Differentiable Functions. Springer, New York A989).
CHAPTER 6 Radon-Nikodym Theorems Domenico Candeloro Dipartimento di Matematica. via Pascoli. 06123 Perugia, Italy- E-mail; candelor@dipmat. unipg. it Aljosa Volcic Dipartimento di Scienze Matematiche, Piazzale Europa I, 34100 Trieste, Italy E-mail: volcic@univ.trieste.it Contents 1. Introduction 251 2. The ? -additive case 255 3. The finitely additive case 259 4. The Banach-valued case 265 5. Finitely additive Banach-valued measures 275 6. Further results 282 Appendix. Singularity and decomposition theorems 290 References 292 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 249
This page intentionally left blank
Radon-Nikodym theorems 251 1. Introduction Suppose ? is the Lebesgue measure on the real line and / is an integrable function. Then the measure ? defined for all the Lebesgue measurable sets v(E)= ? fdk is called the indefinite integral of / with respect to ?. It is obvious from the definition of the integral that if ?(?) = 0, then v(E) = 0 for any Lebesgue-measurable set E. This is expressed saying that ? is absolutely continuous with respect to ? and will be denoted by ? «?. It is not difficult to check (see [27, Section 30, Theorem B]) that under the assumption that ? is a finite measure, this condition is equivalent to another one which can be given in ?-? terms, which will be denoted by ? <gf ?: given any ? > 0, there exists a S > 0 such that if ?(?) < <5, then v(E) < ?, for any Lebesgue-measurable set E. We always have that ? <<cf ? implies that ? «?, but the converse implication does not hold in general, as it can be easily seen considering for instance the measure v(E) = fE \x\dk(x). In the ? -additive case, absolute continuity of a measure ? with respect to another measure ? implies, under mild conditions on ? and v. that ? is the integral measure of some function f e ? (?): such function is called Radon-Nikodym derivative of ? with respect to ?, and it is denoted by ^. Nikodym [44] was the first to prove this result in a quite general setting. If ? and ? are two finite measures on the same ?-algebra, and if ? <g ?, then there exists a Radon- Nikodym derivative ?-, i.e., a ?-integrable function / such that v(E) = J fdiL holds, for any measurable set E. Earlier, Radon [46] proved (using Vitali bases) the same implication under the assumption that the maximal element of the ? -algebra is a measurable subset of the ?-dimensional Euclidean space. For the real line, the corresponding result goes back to Lebesgue. However the situation is not so nice, either when the measures are }ust finitely additive, or when they take values in a Banach space of infinite dimension, and it gets even worse if both of the above cases occur. We shall outline a survey of the results known so far, involving the existence of a Radon-Nikodym derivative, in some appropriate sense, and with respect to suitable types of integrals. In this first section, we deal essentially with a general Radon-Nikodym theorem for nonnegative finitely additive scalar measures, from which the basic ideas of further results can be drawn. In the second section, we turn to the ? -additive measures, investigating conditions which permit that even unbounded measures have Radon-Nikodym derivatives. In the third section we face the finitely additive case for scalar measures, indicating how the existence of a Radon-Nikodym derivative dv/?? is
252 D. Candeloro and A. Volcic strictly related to some geometrical properties of the range of the vector valued measure (?, ?). The fourth section deals with the most beautiful (and difficult) results concerning those Banach spaces possessing the so-called Radon-Nikodym Property (RNP): the geometric concepts met in the third section here become essential tools for describing such Banach spaces. The fifth section is devoted to finitely additive measures taking values in Banach spaces, and to the research of weaker types of derivatives. Finally, the sixth section concerns a number of recent results, for locally convex-valued measures, for multimeasures, for Riesz space-valued measures, and finally also a brief outline of the so- called fuzzy integration: from this survey one can find that, though hidden by the abstract settings and the different kinds of integrals involved, the leading ideas of the first section are always at work. We end the chapter with an appendix devoted to some decomposition theorems for measures, which are relevant for the Radon-Nikodym theorem. We do not give all proofs, some of them being too long and technical: when it is possible, we give an outline of the main steps, trying to clarify the basic ideas. Also, we realize that it would be almost impossible to present here a detailed account of all the contributions given to this problem along almost 100 years, so we simply chose among the results in our knowledge, sometimes simplifying settings, in order to reach a sufficient variety within a relatively concise treatment. The general result, from which we start, is due to Greco [25]. Though the integration theory and the results by Greco involve more general set functions, we shall give the proof only for the finitely additive case. It should be noted that the idea to relate the Radon- Nikodym theorem to the existence of a "scaled" Hahn decomposition goes back, for the ?-additive case, to J.L. Kelley [33]. We need some definitions (see also [19]). Dehnition 1.1. Let (?, ?) be any measure space, where ? is a ?-algebra of subsets of ?, and let ? be any monotone nonnegative set function on ?, ?@) = 0. We say that a measurable function / : ? —>¦ E+ is integrable with respect to ? if the following integral is finite: f /??:= ? ?({?: f(x)>t\)dt. JQ JO Since {x: f(x) > t] is nonincreasing in t, the second integral has to be understood in the Riemann sense. If this is the case, for each set ? e ?, we put: /?(?):= [ /??:= ? /1???. Je Jq The set function /? is often called the integral measure of /, with respect to ?. The integral defined above is called Choquet integral, and coincides with the usual integral, in case / is nonnegative and ? is a ?-additive nonnegative measure, or a finitely additive one.
Radon-Nikodym theorems 253 THEOREM 1.2. Assume that ? and ? are two monotone nonnegative set functions, defined on the ?-algebra ?, such that ?@) = ?>@) = 0, satisfying: ?E) = ?E) = 0 =>· ?(? U S") = ?(?) (?) /or ?// ???. Г/геи, the following conditions are equivalent: (a) 77?ere exists a measurable nonnegative function f: ? —>¦ К smc/? ?/??? v(E)= ? /?? B) /ora// ?el (b) 77?ere emfs a (decreasing) family of sets {Ar}r>o in ?, satisfying: (bl) v(A) - ?(?) ^ ??(?) - ??(?), ?, ? e ?, ? С А С А,, г > 0; (Ь2) ?(?) - v(E П Ar) ^ ?(?(?) - ?(? ? A,)), F е Г, г > 0; (ЬЗ) lim i>(Ar) = 0, as r —> +эо. It is clear that A) is satisfied, as soon as ? and ? are additive. Moreover, in case ? and ? are finitely additive, conditions (bl) and (b2) above are equivalent respectively to (b'l) and (b'2) below: (b'l) v(F) ^^(F),forall Fc A,., r > 0; (b'2) v(F) ? ^(F), for all F С ? \ Ar, r > 0. Moreover, from (b'l) we deduce that lim ? (Ar) = 0 (since ?(??) ^ ?(?) < oo), hence, in case ? <<cf ?. (ЬЗ) is satisfied, too. PROOF. As already mentioned, we shall give the proof just in the case of finitely additive, nonnegative measures, such that ? <ce ?. We first assume (a) and prove (b). Set: Ar := {x e ?: f(x) > r\ and fix ? с Ar, ? e ?. As v(E) = f f\l·;d?,ч/egetv(E) ^ r f \??? = r?(F), so (b'l) is proved. If F is fixed, F e ?, F с ALr, then v(F) = j f\Fd^^ rp(F). so (b'2) holds. As already observed, (ЬЗ) is satisfied because of the absolute continuity, so the first implication is completely proved. Now, we assume that (b) holds, and construct a suitable derivative f. For all ? e ?, set: f(x) := sup{r > 0: ? e A,}. To see that / is measurable, fix any re]0,+oo[, and observe that {x: f(x) > t) =)J{Ar: r e D, r > t}, where D denotes the set of all dyadic positive numbers, i.e., D = {|r, h and к positive integers}. Now, for each element ^ e D, set B\ := ?,,^???+? }pj. We see that ? (?*) > ^?(?*), and also ?(?*) < ^-?(?^). For each positive integer ?, set ? = ?
254 D. Candeloro and A. Volcic We can easily see that Д. f ?? — liiru fE fk ??, for all ? e ?, and moreover ? /1=1 for all к and E. Therefore, we get: /????(?). L IE for all E. On the other hand, for each k, we have: >??{??)-?(?)?? ? = ? As 5??=? v(Bf,) = v(A\/20 ~ v(Ak), from (ЬЗ) we deduce that lim / fkd?^\?mv{A\l-L^) from which we also get lim* / ??,?? ^ ?(?), because v(A^2k) ^ 2~V(^i/2*) ^ 2"V(«)· So far, we have seen that the finitely additive measures ? and /? are in this relation: v(E) ^ /?(?), forall ? e ?, and ?(?) ^ /?(?). From this it follows immediately that the two measures agree on ? (simply considering the complements of the involved sets), and therefore the theorem is proved. ? COROLLARY 1.3. If ? and ? are finite and countably additive, and ? <? ? (or equivalently ? <<Cf ?), then there exists a Radon-Nikodym derivative ^. PROOF. Conditions (b'l) and (b'2) mean that there exists a Hahn decomposition for the measure ? — r?, for all r > 0: this is always the case, for ?-additive measures, and this concludes the proof. ? The importance of Theorem 1.2 rests not only on its generality, but also on the relative simplicity of the involved conditions, from which other similar criteria have been obtained. We shall see many of them in the sequel, dealing with finitely additive measures, with Banach- and nuclear-valued measures, and also with different kinds of integrals. Our dissertation however now focuses the ?-additive case for not necessarily bounded measures.
Radon—Nikodvm theorems 255 2. The ? -additive case In this chapter, we mainly deal with countably additive, possibly unbounded measures. In general, the Radon-Nikodym theorem fails to hold, for unbounded measures, as the following example shows. Example 2.1. Let ? = [0, 1], let 0 ^ ;' < j^ 1 and consider the Hausdorff ;'- and j- dimensional measures ?! and W. Then W «C ?! but dW /dH' does not exist. The reason for the failure of the Radon-Nykodym theorem in the previous example has been described already by S. Saks [50] who considered the case / = 0 and j = 1 (see also Volcic [59]). We will make some additional comments on this example after Theorem 2.12. However we will see that, under suitable conditions, Radon-Nikodym derivatives do exist, even if both measures, ? and v, range over [O.+oo]. Such conditions involve properties of the so-called measure algebra, which we are now going to introduce. As usual, (?, ?,?) denotes a measure space, where ? is a ?-algebra and ? is any nonnegative ?-additive measure, taking values in [0, +oo]. We shall assume that the Caratheodory extension has already been done, and so ? is actually the ?-algebra of all ?-measurable sets. In particular, all subsets of ?-null sets belong to ?. We recall the definition of the outer measure, ?* :?(?) —> [0. +эо]: /i*(E) = infljr/i(F„): F„e?. ?c|Jf„ . Definition 2.2. Given two subsets A and В of ?, we say that A and В are equivalent, if ?*(???) = 0, and write: А я» ? or A = В ?-a.e. In a similar fashion, if / and g are real functions defined on ?, we say that / and g are equivalent (and write / « g, or / = g a.e.) if ?*({???: /(х)фя(х)})=0. In the quotient ?(?)/^ we can introduce a partial order, as follows: [A] > [B] ifandonlyif?*(?\A)=0. Of course, ?(?)/?, contains ? /я», which is called the measure algebra. One can easily see that ?>(?)/~ and ? /я» are ?-complete lattices. We will see that completeness of ?/^ is important for our purposes, however in general the two lattices are not complete. We have the following facts (more details and related results can be found in a series of papers by Volcic [58-60]): Under CH, if ? is the usual Lebesgue measure on ? := [0, 1]. then 'P(i2)/~ is not complete. There exist measures such that ?/^ is not complete (see [27, Section 31, Exercise 9]).
256 D. Cande/ою and A. Volcic DEFINITION 2.3. Given any measure space (?, ?, ?), we say that ? is semifinite if ?(?) = supJM(A): Ac E, AeE, ?(?) < oo}, for all sets ? e ?. Such measures are called essential measures by N. Bourbaki [8]. PROPOSITION 2.4. Suppose ?<?, suppose that ? is semifinite and suppose moreover that v(E) = 0 whenever ?(?) < oo. Then ??/?? exists only if ? is identically zero. PROOF. Suppose dv/?? exists and let ? be any set of finite and positive ? measure. Then {x e ?: dv/?? > 0} ? ? = [j{x e ?: dv/?? > ^} ? ?. ? Each set {? e ?: ??/?? > (|} ? ? has finite ? measure and since v(E) > 0, at least one of them, A„ say, has positive measure ?, a contradiction. This shows that if v{E) < oo, then v(E) = 0 and from the semifiniteness of the measure we deduce that ? = 0. D Definition 2.5. A semifinite measure ? on (?, ?) is said to be strictly localizable, if there exists a family of sets [Еа}аел, such that (a) 0 < ?{??) < oo (b) Ea П ?? = 0 when ? ? ? (c) ?(? П ??) = 0 Vc => ?(?) = 0. THEOREM 2.6. If ?(?) < oo (or wore w general if ? is strictly localizable), then ?/^ is complete. PROOF. We will limit the proof to the case of a finite measure. Given any family Q of measurable sets, we shall show that there exists a measurable set Go such that: A) ?*(?\?0) = 0?>?3?????, and B) IfM*(G\G|) = 0forall G e Q, then m*(G0\Gi) = 0. Without loss of generality, we may assume that Q is a ?-ideal. Now, set: a:=sup{M(G): GeQ}. As ? is finite, ? < oo. Now, let (G;,)„e^ beany sequence in Q, such that ? (G„) > or— -, and put Go := \}G„. As ^ is a ?-ideal. Go e Q- So, we only have to prove that Go satisfies A) above. If this is not the case, then there exists ? eQ , satisfying ?(#\??) > 0. This implies that ? U Go e <?, and ?(# U Go) > a, a contradiction. ? Definition 2.7. A semifinite measure ? is said to be Maharam (or also localizable) if 2?/«? is complete (see [36]).
Radon-Nikodym theorems 257 Remark 2.8. According to Theorem 1, strictly localizable measures are Maharam. It was a long standing open question whether Maharam measures are strictly localizable. The problem has been stated explicitly for the first time in [34, Problem 1], and is relevant in lifting theory. The question has been solved in the negative by D. Fremlin [24]. Dehnition 2.9. Given two measures ? and ? on ? we say that ? and ? are strongly comparable if ? and ? are the Caratheodory extensions of ?\? and of v\s respectively, where S is the ideal of all sets ? e ?, such that ?(?) + v{E) < +oo. The following definition is related to the previous one. Dehnition 2.10. Given two measures ? and ? on ? we say that ? and ? are weakly comparable if ? and ? are the Caratheodory extensions of ?|? and of v\z respectively, where ? is the intersection of the two ?-algebras of ?- and ?-measurable sets (in the Caratheodory sense). The measures ?.' and W are not strongly comparable, if i ? j, but they are weakly comparable. In 1951, I.E. Segal proved the following version of the Radon-Nikodym theorem [51]. THEOREM 2.11. Given a ?-additive measure ?, the following properties are equivalent: A) ? is Maharam; B) for any ?-additive measure v, strongly comparable with ? and such that ? <? ?, there exists a Radon-Nikodym derivative dv/??. Such function is unique, up to equivalence. PROOF. We only prove the implication A) =>· B). So, assume that ? is Maharam, and ? is any strongly comparable, ?-additive measure, ?«?. We denote by S the ideal of all sets ? e ?, such that ?(?) + v(E) < oo. For every element ? e S there exists a Radon- Nikodym derivative of v\e with respect to ? It: we denote by /^ such derivative, and define fE on Ec as the null function. Now, for every positive real number t, set: A, :=Y(A,(?): EeS], where A,(E) = {x e ?: /fU) > t]. (The supremum exists as ? is Maharam.) It is clear that (A,) is a decreasing family in ?/^. Now, for all .tefiwe define: f(x) := sup{r > 0: ? e A,}. As in the proof of Theorem 1.2, one can prove that / is measurable. Let us show that / is the required derivative. In view of the strong comparability, it is enough to check that J fd? = v{E)
258 D. Candeloro and A. Volcic for all F e <S. So, fix any element F e <S, and compute: J /??= f ?{{?&?: f(x)>t\)dt. C) Now, ?{{? e F: f{x) > t)) = sup^(F ПЛ(+|): ? e ?). For every positive number r, we claim that ЕПАгъАг(Е). D) Indeed, Ar{E) С ? by definition, and Ar(E) с Ar up to a null-set, hence A,- (F) с ? ? Ar up to a null-set; conversely, ? ? Ar = v(F ? A,-(F)), where the supremum is taken as F eS. Now, ? ? A, -(F) = A,(F ? F) because of the essential uniqueness of Д-, hence F П Ar с A,-(F) a.e. and D) is proved. From D), we get immediately suP{m(F П At+l): ? e N} = sup{m(A,+i (?)): neN)= ?(?,(?)) for all positive numbers t, and so from C) we deduce /??= ?{{???: f(x)>t\)dt= / ?(?,(?))? ./? JO Л) = ? ???? = ?(?), by definition of /f. This concludes the proof. ? For weakly comparable measures, we have the following result. THEOREM 2.12. Given a ?-additive measure ? on (?, ?), the following properties are equivalent: A) ? is Maharam; B) for any ?-additive measure v, weakly comparable with ? and such that К<Д, there exists a decomposition ?\, ?? of ?, ?\ e ?, for / = 1,2, such that: • for any ? e ? and ? С ?\, ?(?) < эо implies that v(E) = 0; • if we denote by vj the restriction of ? to ? ?) ??, the Radon-Nikodym derivative '-^ exists. The decomposition ?\, ?? and the function '-r~ are unique, up to equivalence, Ju The proof follows from the combination of Theorem 2.11 and the decomposition Theorem A.4 from the Appendix. Remark 2.13. The previous theorem explains why Radon-Nikodym theorem fails when ? is the counting measure H° on [0. 1], and ? is the Lebesgue measure, in spite
Radon-Nikodym theorems 259 of the fact that ? is localizable. The measures ? and ? are in this case just weakly and not strongly comparable, and ?;«?. Theorem 3 applies here with ?\ = ?. We conclude this section presenting two interesting variants of the Radon-Nikodym theorem. We need first two definitions. DEFINITION 2.14. Given a measure space (?, ?, ?), we say that ? admits a monotone differentiation, if there exists a mapping -f-, defined on the family ?? of all the measures which are strictly comparable with ? and absolutely continuous with respect to ?, such that if Vj e ??, for / = 1,2, and ? ? ^ ??, then —? (?) ? —- (.?), ?? ?? for any ? ? ?. Definition 2.15. Given a measure space (?, ?,?), we say that ? admits a linear differentiation, if there exists a mapping j~r defined on the family ?? of all the measures which are strictly comparable with ? and absolutely continuous with respect to ?, such that if vj e ??, for / = 1,2, and a \, a^ are two real numbers, then d(a\v\ +ajvi) dv\ dv2 (x)=a\ —-U)+fl2 —U). ?? ?? ?? for any ? e ?. The following result is due to D. Kolzow [34, Theorems 4, 7 and 8]. THEOREM 2.16. Given a measure space (?. ?, ?), the following properties are equivalent: (i) ? admits a monotone differentiation; (ii) ? admits a linear differentiation; (iii) ? /s strictly localizable. For the long and sophisticated proof we refer to Kolzow's monograph, which also illustrates in great detail the strong relations between the monotone and linear versions of the Radon-Nikodym theorem and lifting theory. The Radon-Nikodym theorem for the Daniell integral is discussed by M.D. Car- rillo [15], Chapter 11 in this Handbook. 3. The finitely additive case When the measures (even one of them) are just finitely additive, the Radon-Nikodym theorem does not hold, in general: (we remark that, in this case, absolute continuity is to be intended in the ?-? sense).
260 D. Candeloro and A. Volcic It is appropriate to begin with the oldest result concerning the Radon-Nikodym theorem for finitely additive measures, attributed to Bochner [7]. THEOREM 3.1. If ? and ? are two finitely additive measures on ?, such that ? <?? ?, then far any ? > 0 there exists a ?-integrable function fn, such that ???? ? <?. for any ? e ?. In order to give an example, we introduce a definition. Dehnition 3.2. Let ?: ? -> Rj be a finitely additive measure, and let 3 denote the ideal of null sets. If 3 is a ?-ideal, we say that ? has the property ?. One can easily see that, if / ^ 0 and ? has property ?. then fd? = 0 / implies that / = 0 a.e. Similarly, one can deduce that, whenever ? has property ?, and / and g are two ?- integrable functions, then f = g ?-a.e. as soon as fA f ?? = fA gd? for all A e ? (this entails uniqueness of the Radon-Nikodym derivatives, up to a.e. equivalence). Example 3.3 ([10]). Let ? be the family of all subsets D с [0, 1], such that the following limit exists: ,. ?(?)?[0,?[) hm — -=8(D). Though ? is not an algebra, the function ? can be extended (using the axiom of choice) to the whole ? -algebra ? of the Lebesgue measurable sets, in such a way that ?(??[0,?[) ?(??[0,?[) lim inf ^ ? (?) ^ lim sup , f^o+ ? г_0+ ? for all sets В e ?, and so that ? is finitely additive on ?. Obviously, if ?(?) = 0 then S(B)=0. Now set ? := ? + ?. It is clear that ?(?) = 0 if and only if ?(?) = 0, hence ? enjoys property ?. Moreover, ? <«? ? also in the (?-<5) sense, however one can easily see that ??/?? does not exist. In fact, such a function / should satisfy ?(??[?, 1])= f ???=\ fdk+( fdS= [ fdk, JBr\\e.\\ Jen[e.\} JBD[fA] J ВП\еЛ]
Radon-Nikodym theorems 261 for all ? > 0, and all В e ?, i.e., / = 1 a.e. in [?, 1] for all ? > 0. Thus, by the property ?, it should follow that / = 1 ?-a.e., a contradiction. The property ? has a double face: on one hand, it looks like a strengthening of finite additivity, in the sense of ?-additivity. On the other hand, it prevents in some sense that a finitely additive measure might behave like a ?-additive one, at least with respect to the Radon-Nikodym property. This statement is clarified by the next theorem. THEOREM 3.4. Let ? : ? —>¦ Rq be any finitely additive measure, defined on the ?- algebra ?, and enjoying property ?. The following are equivalent: A) ? is countably additive; B) for every finitely additive measure ? : ? —>¦ Rj, which is (?-S) absolutely continuous with respect to ?, there exists ??/??. PROOF. Of course, since A) and the other assumptions imply that ? is also ?-additive, just B) =>· A) needs to be proved. Let (A„)„eI: be any increasing sequence of sets in ?, and denote by A their union. Set: v(B) := 1????(?„ ? ?), for all В e ?. It is clear that ? satisfies the condition B), hence there exists / = dv/??. Now, for every В e ?, we have v(B) = v(B ? A) = fB f\Ad?, so / can be replaced by /1A. On the other hand, it is clear that f\A„ = 1 д„ ?-a.e., hence / = 1 a ?-a.e. As a consequence, we have lim,,^^ ?(?„) = v(A) = fA f ?? = fA 1 ?? = ?(?). As the sequence (A„)neN was arbitrary, ? turns out to be ?-additive. ? In the spirit of the remark preceding Theorem 3.4, we can give an antithetic example. Example 3.5. Let ? be any infinite set, and let 3 denote the ideal of all finite subsets of ?. According to the Axiom of Choice, there exists a maximal ideal 3*, including 3. Define for ? ??(?): ?(?) :=|° ifAe3*· I 1 otherwise. According to a well-known theorem by Ulam [57], ? is a finitely additive measure, lacking the property ?. However, if ?: ?(?) —> R(| is any finitely additive measure, @-0) absolutely continuous with respect to ?, then the only possibility is that ? = кв, for some nonnegative constant k. In this case, obviously, к = dv/??. To find a characterization of the Radon-Nikodym property, we have to introduce the concept of completeness. In a recent paper by A. Basile and K.RS. Bhaskara Rao [1] an excellent presentation is given, mainly concerning finitely additive measures defined on algebras. We shall adapt here one of their results, dealing with the Radon-Nikodym property. Definition 3.6. Let ? : ? -> Rj be any finitely additive measure, on a ?-algebra ?. If A and В are elements of ?, we set: d,t(A, ?) = ?(???). It is clear that (?,??) is a
262 D. Candelom and A. Volcic semimetrizable space. We say that (?, ?) is complete, if the quotient ?/« is complete as a metric space, where « is the natural equivalence relation in ?. The characterization given in [1] can be stated as follows (we recall that in our assumptions ? is a ?-field, and contains all subsets of sets of measure 0). THEOREM 3.7. Let ? : ? —> R^ be any finitely additive measure, on ? ?-algebra ?. The following are equivalent: A) {?,??) is complete. B) For every increasing sequence of sets (Fw)„eh' in ?, there exists a sequence of ?- null sets (#„),>eN in ?, such that lim M(F„) = lim m(F„ \ H„) = ?( \\{F„ \ H„)\ «—>ос и—«-ос \ v-' / From Theorem 3.7 one can easily deduce that if ? <Kf ? and (?, d,t) is complete, then (?, dv) is complete, too. COROLLARY 3.8. ^(?.??) is complete, and ? is any signed finitely additive bounded measure such that ? <gf ?, then ? admits a Hahn decomposition. PROOF. We only have to show that there exists a set ? e ?, such that v(P) = ™??€??(?)- As ? is bounded, the number S := supAer ? (A) is finite, and there exists a sequence (^;i)«eN m ?, such that S = lim„^,c v(A„). Setting now v+(A)= sup ?(??)?) and v~(A) = - inf v(A ? ?), VA e ?, Ее ? i'er we readily see that v+(A„AA„t) —>¦ Oand v~(A„AA,„) -+ 0 as both m and// diverge. Now, i>+and v~ are both (?-?) absolutely continuous with respect to ?, and so is |i>| := v+ +v~, hence there exists a set ? e ?, such that lim,,^^ |?|(?„??) = 0. The set ? has the required property. ? COROLLARY 3.9. The following are equivalent: A) (?,??) is complete; B) For every nonnegative finitely aaaitive measure ? <Kf ?, there exists ??/??. PROOF. We only prove the implication A) =>· B). As ? <^F ?, we see that the signed measure ? — /·? is ?-? absolutely continuous with respect to ?, and therefore it has a Hahn decomposition. Due to Corollary 1.3, this implies the existence of ??/??. D The corollary above has been proved in [1]. A different approach to the problem consists in finding additional conditions on the two measures ? and v, besides absolute continuity, which ensure the existence of ??/??.
Radon-Nikodym theorems 263 One of the first consequences of Theorem 1.2 and Corollary 1.3 is the following (see also [3,12,13]): THEOREM 3.10. Let ?, ? be two nonnegative finitely additive measures, defined on the same ?-algebra ?, and such that ? <?e ?. If the range of the pair (?. ?) is a closed subset ofR2, then there exists ??/??. PROOF. In view of Corollary 1.3, it is enough to show that, for every real number r > 0, there exists a Hahn decomposition for ? — r\x. But it is clear from the assumption on the range of (?, ?) that the range of ? — ?? is a closed subset of R, hence there exists an element A e ? in which ? — r? attains its maximum: so (A, A') is a Hahn decomposition for ? — ??, and we are done. ? Of course, closedness of the range of (?, ?) is just a sufficient condition. In [3] and [12] necessary and sufficient conditions are given on the range of (?, ?) in order that dv/?? exists. In both papers, the condition involves the so-called exposed points of the range, according with the following definition. Dehnition 3.11. Let R be any convex, bounded subset of R". Let us denote by R its closure. We say that a point ? e R", Q e dR, is an exposed point for R if for every hyperplane H, supporting R at Q, we have RC\ ? = {Q}. THEOREM 3.12. Let (?, ?) be a pair of nonnegative finitely additive measures, defined on a a -algebra ? and such that ? <Kf ?, and let R denote the range of (?, ?). Then the following are equivalent: A) There exists dv/??. B) The convex hull of R contains its exposed points. The theorem above has been proved in [3]. In [ 12] a similar theorem is stated, for the case of continuous measures. A measure ?: ? —>¦ R^ is said to be continuous if for every ? > 0 it is possible to decompose ? into a finite number of subsets ? ?, At A„ belonging to ?, such that ? (A,) < ? for all /'. If ? and ? are both continuous, finitely additive and nonnegative, it is well-known that the range of (?, ?) is a bounded convex subset of the plane (see also [11]), hence condition B) of Theorem 3.12 can be expressed in a simpler way. However, in [12] the following result has been proved, which gives a full description of those bounded convex sets R CR2 that are the range of some pair (?. ?>) for which dv/?? exists. THEOREM 3.13. Let R be any bounded convex subset of [0. эо[:. Then the following properties are equivalent: A) There exists a pair (?. ?) of nonnegative continuous finitely additive measures, defined on a suitable ?-algebra, such that there exists dv/??, and such that R is the range of {?, ?).
D. Candelom and A. Volcic Fig. 1. Equation of the lower curve: ? = 1 — vO _ ·*")· 0.8 Fig. 2. Equation of the lower curve: ? = A — \/(l - x-))/2. B) R contains its exposed points, and for every segment L С BR, at least a (possibly degenerate) closed subsegment I C L is contained in R. (See Figure 5 (a) and (b).) Usually, if (?, ?) is a pair of nonnegative continuous finitely additive measures, the range looks like in Figure 1, or in Figure 3, when ? <Kf ?. In Figure 2 a particular situation is shown, where ? is obtained by adding to ? some measure which is singular with respect to ?. We observe that the range is always symmetric with respect to the midpoint (?(?)/2, ?(?)/2): hence in Figure 5 (a) and (b) just half of 37? has been drawn.
Radon-Nikod\m theorems 265 0.2 0.4 ? 0.6 0.8 Fig. 3. Equation of the lower curve: у = л6. 0.6 Fig. 4. Lower curve: у = A - у A - д-2))/2. for .? < 0.5. then linear. 4. The Banach-valued case The results from the previous sections can be easily extended to bounded signed measures, since the Jordan decomposition (even in the finitely additive case) always exists. Similarly a complex measure or function can be studied investigating separately its real and imaginary part. In the same way we can also extend the previous results to measures or functions taking values in a finite-dimensional space. A really different situation occurs with infinite-dimensional measures, as we shall see in this section. The research on this subject led, mainly in the seventies, to a variety of quite
266 D. Candeloro and A. Volcic 1- 0.8- 0.6- 0.4- 0.2: 1 ^ у X' ! / Л, ^~~— Line segment (a) 0.8 0.6 0.4 0.2- (b) Fig. 5. (a) Range closed, (b) Range not closed. Part of the segments is missing. interesting results, relating the Radon-Nikodym property to topological and geometric properties of the range space. Here, we shall confine ourselves to the Banach-valued ?-ad- ditive measures, and with respect to the Bochner integral. (As to absolute continuity, for the ?-additive case, the (?-?) definition is still equivalent to the @-0) one.) Maybe the richest survey on this subject is [20], so we refer the reader to that paper, in order to find the proofs missing here, and an exhaustive historical account. We start, recalling the definition of Bochner integral [6,21,22].
Radon-Nikodxm theorems 267 Dehnition 4.1. Given a finite measure space (?.?,?), where ? is a ?-algebra, and a Banach space X, a function /: ? —>¦ X is said to be simple if it is of the form: / = ?, = ? -?/ 1 a, , where x/ e X and A, e ?1 for all /' = 1,..., N. If this is the case, then the integral of / is the element: Г N / /??:=^2???(??)&?. J i = l In the same framework, if (/„)„en is any sequence of simple functions, we say that /, converges in measure to some function / if the following holds: lim ?*({???: \\f„(o)-f(o)\\ >?\) = 0 П—ЮО II I ' for all ? > 0 (here, ?*{?) is the outer measure defined in Section 2). Incase /: ? —>¦ X is the limit in measure of some sequence (/,),,ек of simple functions, then / is said to be measurable (also, strongly measurable). We say that /: ? —>¦ X is (Bochner)-integrable, if there exists a sequence (/H),ieN of simple functions, converging in measure to /, and such that lim [\ /,-/,„ Мм = o. It can be proved that, if f: ? —>¦ X is integrable, then the sequence (f /???)„?? is convergent in X, and the limit is independent of the particular sequence (/„ )„ еы ¦ Of course, the limit lim / /„ ?? is called the integral of /. Also, as in the real-valued case, one can prove that convergence in measure implies convergence a.e. for some subsequence, hence a strongly measurable function is essentially separably valued, i.e., there exists a separable subspace ? С X. such that /(?) e ? for all but a ?-null set of elements ?. Moreover, a strongly measurable function is also weakly- measurable, i.e., {x*, f) is measurable for all x* belonging to X*. Like in the scalar case, it can be proved that integrability of / implies integrability of 11/11, and that \\//?"?\\ ^ /\\/\\??. When / is integrable with respect to ?, we also say that / is in ??{?). The latter is a Banach space if we identify functions which coincide a.e. with the norm ||/|| = f ||/|| ??. Moreover, if / is integrable, then /If is also integrable, for all ? e ?, and the integral becomes a function of E: we set "//*-/ /?(?):= ? /??:= / /\???. One can see that f? is an X-valued measure, which is separably valued, whose variation is ||/||?. The variation of an X-valued measure ? is denoted by |i>| and is defined as: |y|(?)=sup^|v(?,)|.
268 D. Candeloro and A. Volcic where the supremum is taken over all finite partitions of E: one can also regard |v| as the least upper bound of {| (**, v) \: x* eX*. ||.v*||= 1), in the lattice of measures. Moreover, /? <«C ?. The converse question leads, of course, to the problem of the existence of a Radon- Nikodym derivative. The problem can be stated in various different ways: A) Given a measure ? : ? —>¦ X, which is absolutely continuous with respect to a scalar positive measure ?, does there exist a Bochner-integrable function /, such that ?=/?? B) Given a measure ? : ? —>¦ X, with finite variation. |?|. does there exist a function /, Bochner integrable with respect to |i>|. and such that ? = f\v\? Of course, a necessary condition for an affirmative answer to A), is that ? has finite variation. Moreover, if |?| is finite and К<Д. then it is clear that |?>| <? ?: hence, an affirmative answer to B) would yield an affirmative answer to A). Conversely, assuming the existence of dv/??, one can decompose ? into the sum ?? + ?:, where ?? <<? |?| and ?? -L ?· (See Theorem A4.) Hence |?>| <? ??. dv/?? = ??/??\, and dv/d\v\ = (??/???){???/?\?\). In conclusion, we can see that A) and B) are essentially equivalent. Moreover, in [17], Chatterji remarks that in A) above the ?-algebra ? can be equivalently replaced by the Borel ?-algebra on the unit interval, and (up to irrelevant renorming) the measure ? by the usual Lebesgue measure. Now, we give a couple of examples showing that the answer is not always affirmative. Example 4.2. Let X be the L'space over the unit interval in R. Also, let ([0, 1], ?,?) be the measure space, where ? denotes the Borel ?-field, and ? the Lebesgue measure. For all A e ?, set v(A) = 1A, i.e., the characteristic function of the set A. It is easy to see that ? is an X-valued countably additive measure, defined on ?, and that | v| = ?. However, if a Radon-Nikodym derivative / := dv/dk existed, we would have: f g(x)dx=(g, \A)= f (f(x).g)dx J A J A for all A e ?, and all g e L°° (= (L1)*). Therefore, for every fixed g e Lx, we would have: g(x)= (f(x).g) a.e. Hence, there would exist a ?-null set N e ? such that, for all ?, ? in [0. 1 ] ? Q, a < ?. we would have (/(*). Wi)=i for all ? e Nl' ? [?, ?]. So. if we fix ? ?_ ?, and choose (a„)„ef,, (/3„)„еь in such a way that a„ < ? < ??, <*„, ?? e Q, and ?? - a„ —>¦ 0, we get: |ilimc(/(Jc),lK.ft,])= 1 which is impossible, since f(x) e L'.
Radon-Nikodym theorems 269 Example 4.3. The next example concerns the same measure space, ([0, 1]. ?, ?), but with a different range: here, X is со, the space of all real sequences, vanishing at infinity. Let us define: ?(?):=(?,(?),...,?„(?)....). where ?„(?)= I sinBn nx)dx J a for all A e ?, and ? e N. As |?„(?)| ? ?(?) for all A and all /;. it is clear that ? has bounded variation, and |?|([0, 1]) ^ 1. Obviously, ?«?, however ??/?? does not exist. Indeed, such a derivative /,/=(/! /„ ), should satisfy: ?«(?)= / ??B? nx)dx = j f„(x)dx for all A and n. Thus, it would be /,,(л) = sinB7r//;c) a.e., which is impossible, because ??B???) is not vanishing, as /z —>¦ oo. The first affirmative answer to the Radon-Nikodym problem in infinite-dimensional spaces is due to Birkhoff. THEOREM 4.4 ([4]). IfX is a Hubert space, then every ?-additive measure ?:?^>? with bounded variation admits a Radon-Nikodym derivative with respect to \?\. We do not give the proof now, because we will see later some generalizations of this result. Remark 4.5. What happens if we try to adapt Theorem 4.4 to the example given in Example 4.2? One can always think of 1 д as an element of L2, and of ? as an L2-valued ?-additive measure, with ?»«?. However, by the same argument, one can still show that dv/dk does not exist. What is wrong? A simple calculation shows that ? does not have bounded variation, when considered with values in L2. This remark also shows the importance for a measure of having bounded variation. (However, more can be said on this example, see Remark 4.18.) From now on. we shall write BV to mean "bounded variation". DEHNITION 4.6. We say that a Banach space X has the Radon-Nikodym Property (RNP) if A) or B) above holds, i.e., every countably additive X-valued BV measure ?, defined on the measure space (?, ?), admits a Radon-Nikodym derivative with respect to |?|. Hence, according to Theorem 4.4. all Hubert spaces have the RNR Other spaces with RNP can be derived from another important sufficient condition, due to Dunford and Pettis [23].
270 D. Caiideloro and A. Volcic THEOREM 4.7. If X is a Banach space, whose dual space X* is separable, then X* has RNP. PROOF (Sketch). Let ? : ? -> X* be any BV measure, with ? «; ?. For any ? e X, and any A & ?, set vx(A) = (v(A), x). It is clear that vx <<C ?. Of course, there exists dvx /??. The idea now is to set /( = ???/??, and to show that /( defines, as ? varies in X, a linear continuous functional /; however one must take into account that fx is defined а.е., hence a "good" function / must be defined carefully, and this can be done, if X* is separable. D For example, the space L \ is not a dual space, because we know from Example 4.2 that it has not RNP, however /1 is a separable dual, hence it enjoys RNP. We will see in the sequel other classes of spaces with that property, but we will introduce first a very important definition. Dehnition 4.8. Given a bounded nonempty subset ? of a Banach space X, we say that В is deniable if for every ? > 0 there exists a point xe e В such that xe does not belong to the closed convex hull of B\B(xf. ?). In case it is possible to choose the same ? for all ?, then ? is said to be a denting point of B. Dentable sets are closely related to the existence of Radon-Nikodym derivatives, as Rieffel showed in [47,48]. We first remark that В is dentable whenever every countable subset of В has this property. This was proved by Maynard [39], and can be seen as follows: assume that В is not dentable; then there exists ? > 0 such that ? e co(B\B(x. ?)), Vjc e B. So, choose any element ? e B, and points y\ y„ e B, such that ||y,- — jc || > ? for all /.and some convex combination ? ? of these points belongs to ?(?,?/2). For the initial point x, and each of these points, say yj, one can choose a finite subset Fj с В, such that \\y - .y/|| > ?, for any у e Fj, and such that some convex combination Z2 of Fj belongs to B(yj,e/4). Now, the union of the sets Fj, together with ? and the points \j, gives a finite subset E\ С В. For each element s & E\ the same construction is possible, giving rise to a finite subset Ei. We continue by induction. The union of all the sets Ej is then a countable subset of B, which is not dentable. Now, let us state a lemma, where the construction of a Radon-Nikodym derivative is shown, under suitable assumptions. LEMMA 4.9. Let (?, ?,?) be any finite measure space, and ? : ? —>¦ X be any BV measure, \v\ <^?. Then dv/?? exists, whenever the following condition holds: (a) For every ? > 0 there exist sequences (xen)„&i in X, and (Н^)пец in ?, such that the sets Щ are pairwise disjoint, have positive measure, ?(?) = ]??(#^), and ( ^: Л ? Г. Л С «; I СЯК..).
Radon-Nikodym theorems 271 PROOF. Assume that (a) holds, and fix ? > 0. Consequently, a sequence (xf„)neN exists in X, and a corresponding sequence (#;f )„ек in ?, according to (a). Now choose ne large enough, so that ?(?\ \Jj^„f H]) < ?· and set ?·.= ?^4· It is easy (though tedious) to see that (fe) is Cauchy in ?]<·(?), hence converges to some function / in this space. Now, since jE f ?? = lim/? fe ?? as ? —> 0, for all ? e ?, it is enough to compare v(E) with fE fe ??. Of course, up to ?, we can assume ? с U^„f Щ, and so v(E) = ??,„ ?(? ? Яр, while /? /f ?? = ?^„, X)^E n Яр· Now, as ? П Я? С Я?, we have ||?(? ? Я") - ?'?(? ? Я?)|| ^ ??(? П Я|) by (a), therefore \\v{E) - /? /? ??\\ ^ ??{?), and this proves the lemma. ? THEOREM 4.10 (Rieffel). Let (?,?,?) be any finite measure space, and let ? : ? -> ? be any BV measure, with \v\ <<C ?. The following are equivalent: A) There exists ??/??. B) For еас/г .$« ?el, ?(?) > 0. there exists DcE, D e ?, ?(?) > 0, iwc/г ?/??? Л(О) = I -^-^-: А с О. A e ?, ?(?) > 0 й dentable. \?(?) J PROOF (Sketch), A) =>· B) We first observe that the result is obvious, ?? ??/?? is simple: for every set ? e ?, having positive measure ?. there exists DcE, with D e ?, and ?(?>) > 0, such that ??/?? is constant in D, and hence {-7^: AC D, Ae ?, ?(?) > 0} is a singleton. In the general case, let / denote the derivative ??/??, and let (s„ )„ек be any sequence of simple functions, converging to / in the L' -norm, and also almost uniformly. For any set ? e ?, such that ?(?) > 0, there exists a set D с ?, with D e ?, such that ?(?)) > 0, and s„ —>¦ / uniformly on D. Now, it is easy to apply the previous argument. B) =>· A) We shall use Lemma 4.9. To prove (a), we proceed as follows: first, we show that, for every set ? e ?, such that ? (?) > 0, and for any ? > 0, there exists DcE, with D e ?, and ?{?) > 0, such that the set A(D) is contained in B(x, ?) for some element xeX. Once we have proved this, it is possible to replace ? with Dr, and iterate the procedure, constructing a disjoint sequence as in (a). So fix ? e ?, with ?(?) > 0, and let ? > 0. We know that there exists ?0 С Е, with ?0 e ?, and ?(??) > 0, such that A(E0) is dentable. Let ? be an element of A(Eo), such that ? <? co(A(E0)\B(x,e)). Write ? = ?(?H)/?(?)). for suitable Do С ??· If .4(A)) С ?(?,?), we have finished. Otherwise, there exists E\ С A. with ?(?|) > 0, such that y(?i) ? > ?. ?(?\) As gfi| e Д(А)), we also have g^ e co(.A(Do)\B(*, ?))·
272 D. Candeloro and A. Vollic Let us denote by k\ the smallest positive integer for which such an element E\ exists, satisfying ?{?\) ^ y, and choose in this way E\. Set D\ = Dq\E\ . It is impossible that ?(?>?) = 0, because this would imply = v(Dq) = v(E\) ?(?H)_?(?|)' a contradiction. Now, if Л@|) с ?(?,?), we are done. Otherwise, continuing that way, we can produce a disjoint sequence (?„)„epj in ?1, an increasing sequence (k„)nen in N, such that ?(?„) < ?1, and ^Щ е гё(Л(^)\А(х,?)). Of course, we have it,, | oo. Setting \jEn, and ?) = ?H\ ?эс, we can deduce that D is the desired set, i.e., ?(?) > 0 and A(D) с B(x, ?). Indeed, if it were ?(?) = 0, we would have v(Do) v(Eoc) ?>(?„) ^ ?(?„) ?(?„) _/..„..„. ,ч x= = = — => eco(A(Eo)\B(x,E)), ?(?H) ?(??) ?(??0) ^ ?(?„) ?(??) V ; which is impossible. Now, to show that A{D) с ?(?,?), fix D' с D, with D' e ? and ?(?') > 0. If gg^^BU.e), then XJfy eco(A(EQ)\B(x.e)), because D' с O0\U"=i Ei for all n. But then, by the choice of (k„)nem, we must have ?(?') < r- for all n, and hence ?{?') = 0, a contradiction. ? Let us see some consequences of Theorem 4.10. For instance, we can deduce that if X has the RNP then every subspace of X has the same property. Indeed, the construction of Lemma 4.9 shows that, if ? takes values in a subspace ? С X, then the derivative can be constructed as a limit of У-valued simple functions. Another consequence is that X enjoys RNP as soon as every bounded subset of X is dentable. However, the converse is also true, as Huff in [29], and Davis and Phelps in [18], independently proved. THEOREM 4.11. Let X be any Banach space. The following are equivalent. A) X enjoys RNP. B) Every bounded subset ofX is dentable. We refer the reader to [20] for a proof of the implication A) =>· B). Another consequence is that Radon-Nikodym Property is separably determined, i.e., a Banach space X possesses RNP if and only if every separable subspace ? с X has RNP. This is now an easy consequence of Theorem 4.11, and the remark following Definition 4.8: a set is dentable as soon as its countable subsets are dentable. As weakly compact sets are dentable, it is now clear that every reflexive Banach space has the RNP. An interesting consequence of Theorems 4.7 and 4.11 is that X enjoys RNP as soon as each of its separable subspaces are duals. In [52], Stegall showed a stronger result:
Radon-Nikodvm theorems 273 THEOREM 4.12. The following are equivalent, for any Banach space X: A) X* has RNP; B) every separable subspace of X has separable dual; C) every separable subspace of X is embedded in some separable dual. A consequence of Theorem 4.12 is that the dual of a separable Banach space possesses RNP if and only if it is separable. Some of the previous results are also included in a "martingale-type result", obtained by Chatterji in [17]. For the sake of completeness, we give some definitions. Dehnition 4.13. Let (?, ?, P) be any probability space, i.e., any measure space, with ?{?) = 1. Given any Bochner-integrable function f: ? —>¦ X (here, X is any Banach space), and given any sub-a-algebra ?? С ?, the conditional expectation of the function / with respect to ?? is the Bochner-integrable function (defined P-a.e.), denoted by E(f\?o), which has the following two properties: A) E(f\Eo) is strongly .To-measurable; B) fFfdP = fFE(f\Z0)dPfoTiuiyFeZo. The existence of E(f\Eo) is independent of the RNP, and is proved first directly, for simple functions /, and then by approximation, in the general case. DEHNITION 4.14. Let X, and (?, ?, ?) be as above. An X-valued martingale is a sequence (/„, En)neN of Bochner-integrable functions /„ and sub-?-algebras ?„ с ?, such that: A) ?„??„+\, VneN; B) /„ = ?(/„+, |Г„),УиеМ. The martingale (/„, ?? )„epj is said to be convergent if there exists a Bochner-integrable function foo-.?^?, such that /„ = ?(/?\?„) for all ? e N. A typical way to obtain martingales is the following: assume ? = [0, 1], ? is the Borel ? -algebra, and ? the Lebesgue measure. Construct a sequence of decompositions (D„)«eN of the unit interval, first splitting [0, 1] into two sub-intervals of the same length, thus obtaining D\, and then, by induction, dividing each interval from D„-i into two disjoint sub-intervals of the same length in order to get D„. Let ?„ be the (?-) algebra generated by D„. Now, if ? is any measure defined on ?, set: f„(x) = ?(?„(?))/?(?„(?)), where I„(x) is the unique interval of D„ containing x. It is clear that /„ is constant on each interval of Z)„, hence it is ?1,,-measurable. It is also easy to see that ff fn+\ dk = Д /„ dk (= v(In)), which implies that (/„, 2Г„)„ещ is a martingale. If this martingale is convergent, and if ? is ?-additive, then the function fx satisfies the identity: J focdk=v(F), for any F e (J ??, and therefore for any F e ?, i.e., we have fx = dv/dk (and of course ? « ?).
274 D. Candeloro and A. Volcic In case ? is real-valued, a classical condition for the convergence of a martingale is particularly simple: as soon as 5ир,1€1;/^ \ft\dk < oo, the functions /„ are uniformly integrable, and pointwise converging to fx. When the martingale (or the measure v) takes values in a general Banach space X, this is no longer true, in general. Chatterji's result clarifies the situation. THEOREM 4.15 ([17]). Let (?, ?, P) be any probability space, and assume that X is a Banach space. The following are equivalent: A) X has the RNP with respect ??(?.?, ?). B) Every X-valued martingale (f„- ?,,),,^:, satisfying sup,ieH fn \f,\ dP < oo, is convergent, in the sense that f, converges P-a.e. and strongly in X to a Bochner- integrablefunction fx : ? -> X, such that E{fx\E„) = /„. V/; e N. We shall not give the proof here: for the implication B) =>· A), the construction above gives an idea of the main steps. The converse is less elementary. As a consequence of this result, one can easily deduce, besides the already mentioned classes of reflexive spaces, and separable dual spaces, another class of Banach spaces with the RNP, i.e., the weakly complete spaces with separable dual. We conclude this section with an analytic result, due to Moedomo and Uhl [41], connecting "weak" derivatives with "strong" ones. We need a further definition. (See Chapter 12.) Dehnition 4.16. Let (?, ?, ?) be any finite measure space, and X any Banach space. Given a strongly measurable function /: i? —>¦ X (see Definition 4.1), we say that / is Pettis integrable if for every set A e ? there exists an element J(A) e X, such that [x\J(A))=f{x\f)dvL for any x* e X*. The element J (A) is called the Pettis integral of / in A, and is denoted by{P)fAfdvL. THEOREM 4.17 ([41]). Let v. ? ^ X be any ?-additive measure, ? «; ?. The following are equivalent: A) There exists a Pettis integrable function f: ? —> X such that (?) ? fd^ = v{A) J A for all A e ?. (/ is the Pettis derivative of v.) B) For every ? > 0 there exists a set He e ?, such that ?(#/) < ?, and the set {—д|: А С ??, ? (?) > 0} is relatively compact. C) For every Ae ?, with ?(?) > 0, there exists В e ?, В С A, with ?(?) > 0, such that {44=}: ? <??, ?(?) > 0} is relatively compact.
Radon-Nikodym theorems 275 {In items B) and C) above, "relatively compact" can be equivalently replaced by "weakly relatively compact".) Moreover, f turns out to be Bochner integrable, if and only if ? is BV. We shall not give a proof of Theorem 4.17, however we emphasize again the fact that weak compactness of bounded sets ensures that they are dentable, so at least part of this theorem is a consequence of Theorem 4.10. Remark 4.18. In some sense, Theorem 4.17 tells us that bounded variation is not an essential requirement for ? to be an integral measure: if one of the properties B) or C) of Theorem 4.17 is satisfied, ? is at least the Pettis integral of some function /, with respect to ?. For example, if X is reflexive, the only requirement is that B) or C) above hold, with "relatively compact" replaced by "bounded"; however, if ? is not BV, this is not automatic, even for Hilbert spaces: if we consider the Remark 4.5, the example outlined there deals with an L2-valued measure v, absolutely continuous with respect to the Lebesgue measure ?, lacking even a Pettis derivative: indeed, the "averages" y^ fail to be bounded, as soon as ?(?) decreases to 0. 5. Finitely additive Banach-valued measures Of course, the problems concerning the existence of a Radon-Nikodym derivative are still harder, when one allows also finitely additive measures into consideration. As we already observed, even for real-valued finitely additive measures, there are examples in which the derivative does not exist, hence it makes no sense to look for spaces with a property which would take the place of RNP. This is clarified by the next theorem, which is a Banach-valued version of the approximate Radon-Nikodym-Bochner Theorem 3.1. We need first a definition. Dehnition 5.1. If X is a Banach space, we shall say that it has the "approximate finitely additive Radon-Nikodym property" (AFARNP), if for any measure space (?, ?,?), where ? is a finitely additive X-valued BV measure, and for any ? > 0, there exists a ?- integrable function /,; (which may be taken simple), such that -/. ?(?)- ?? < л- for any ? e ?. The following result has been communicated to us by Anna Martellotti. THEOREM 5.2. A Banach space X has AFARNP if and only if it has RNP. PROOF. Assume X has RNP and let ? be a BV finitely additive measure on (?. 2?), with values in X. Since ? is BV, it follows from [11] that it admits a Stone extension, i.e., there exists a countably additive measure ? on Qs, the Baire ? -algebra of the Stone space S associated to the quotient of ? with respect to ?? the family of all |?|-???11 sets.
276 D. Candeloro and A. Volcic It is also known that ? is BV and that By RNP there exists a Bochner-integrable function f:S->X such that H(F) = Jfd\H\, for any F &G$. A density argument shows that, given ? > 0, there exists a C/a-simple function g:S^>X such that j \\?-?\\?\?\<?. Let g = J2"=l Xj 1g, , with G, pairwise disjoint. Since G, e Qs, there exist pairwise disjoint sets ? [,..., E„ in ?, such that /?([?,]) = G,, where h is the natural embedding of ?/?? in 5. Put now ? = ?"=[ X[\ej, and let ? = ?\?\. It is easy to check that у = Jgd\jl\ and from [ 11 ] we have ??-?| = ^??/-*№|. and moreover ? — у = ? — ? and |? — ?\(?) = \? — v\(S), so we can conclude that |?- ?|(?) < /j. Conversely, assume X has AFARNP. Let ?: ? ->¦ X be a countably additive BV measure. For each ? > 0 there exists ^:?->? such that |M-/i)lMl| < »7· Consider ?„ — ^ and let /„ = f,h. We want to show that (/,, )„ец is a Cauchy sequence in ?'(|?|). Indeed, for any n and w, if we put ?„ = /,,|?|, we have ll/i. - /». II ? = /" ll/« - /». Ilx rflMl = |v„ - y,„ |. Since the total variation is a norm in the space of all countably additive BV measures on (?, ?), we have \vn -vm\ ^ |y„ -?| + |?- ?„,|. hence ||/„ - fm \\\ < ?„ + ?,„, and so (/„)„ек is Cauchy. Since |?| is countably additive, L'(ImI) is complete and therefore there exists / e L'(ImI) such that /„ -> / (in ?,'(|?|)). By definition, (/„)„eN is a defining sequence
Radon-Nikodym theorems 277 for /, namely / /„ ?? -> J f ??. From the definition of /„ it follows now the conclusion, namely that ? = f ?\?\. ? From the second part of the proof above, it follows also that an approximate Radon- Nikodym-Bochner theorem for Banach-valued ?-additive measures does not hold, unless the range space has RNP. From now on, we shall limit ourselves to look for conditions, necessary and/or sufficient, to ensure the existence of some kind of weaker Radon-Nikodym derivatives. To be able to do that, besides the Bochner and Pettis-type integral, we shall also deal with the so-called Gel'fand integral, and later on with the Bartle-Dunford-Schwartz integral. DEnNlTlON5.3. Let ?: ? -> Rj be a finitely additive measure, and let X beanyBanach space. Given a function /:?-> ?**, we say that / is Gel'fand integrable with respect to ?, if: (a) (/, x*} is ?-integrable, for all x* e X*. (b) For every Ae ? there exists an element Уд е X such that (x*,JA)= J (?,?*)??. for any x* e X*. (c) The function A -> JA is a ?-continuous finitely additive measure. (d) For every ? > 0 there exists ? <??, such that ?(??) <e,and sup(„eW ||/(?)|| < ос. When a finitely additive measure ?: ? -> X is given, with ?<? (throughout this section we use always the ?-? definition of absolute continuity), we say that ? has a Gel'fand-type derivative, if there exists a Gel'fand integrable function /:: ? -> X**, for which J a = v(A) for all Ae ?, i.e., j (/,?*)?? = (?*,?(?)), for any Ae ?, and x* e X*. An example of such a derivative can be seen in Example 4.3, where the function / takes values in /-?,, while the space X is cq. Another interesting situation is described in Example 4.2, where the existence of a Gel'fand derivative is a consequence of the Lifting Theorem [43,36,30,31] (see also Chapter 28 in this Handbook [53]), in particular for the Lebesgue measure. Let us denote by ? the lifting map. For any ? e ? = [0, 1], /(?) can be defined as the element of L^, which associates 0(/?)(?) to every h e Lx. So we see that (/, h) turns out to be simply the function <p{h), which is bounded and measurable, for each h e Lx = X*, hence (a) is satisfied, and fA(f,h)dk = jAhdk= (h, \A) - {h, v(A)), for any A e ? and h e X*. This proves (b), (c), and the fact that / is the Gel'fand derivative dv/dk. Dealing with finitely additive measures, the existence of Gel'fand derivatives can be derived from the following theorems, which include Example 4.3 as a particular case.
278 D. Candeloro and A. Volcic THEOREM 5.4 ([13]). Let ? be any separable Banach space, and assume that (?, ?, ?) is any finitely additive measure space, with ? nonnegative (and finite). Let us suppose that a bounded finitely additive scalar measure T(y) is associated to every у е Y, such that there exists ??(?)/??, and moreover \T(y)\(A) ?. \\\\\?(?) for every A e ? and у е Y, where \T(y)\ denotes as usual the total variation ofT(y). Then, there exists a function f-.?-?-?* such that: A) f {¦)(}·) = ??(}·)/?? for every у eY. The proof of Theorem 5.4 is in some sense reminiscent of the sketch of proof given for Theorem 4.11. We skip the details, because they are too technical. A direct consequence is the existence of bounded Gel'fand derivatives, under some conditions. THEOREM 5.5 ([13]). Assume that X is any Banach space, with separable dual, and let (?, ?, ?) be as above. Suppose that ? : ? —> X is a finitely additive measure, such that d{x*, ?)/?? exists, for every x* e X*. Assuming moreover that the set \?(?) is bounded, then there exists a bounded Gel fand type derivative ??/??. PROOF. It is sufficient to apply Theorem 5.4, setting ? : = X*, and T(x*) = (x*, v). D COROLLARY 5.6 ([13]). Let ?, (?, ?, ?), ? be as above. For the existence of a Gel'fand derivative dv/?? it is necessary and sufficient that the following two conditions hold: (i) d(x*, ?)/?? exists, for every ?* e X*, and (ii) for every ? > 0 there exists ? e ?, ?(?') < ?, such that the set ( v(A) S(H) := I ——-: ?<??. AcH. ?(?) > 0 \?(?) is bounded. PROOF. The necessity is clear, by definition of Gel'fand integrability. To see the converse, let us take an increasing sequence of sets (Я„)„ек, such that ?(//,') 4- 0* an<J with S(H„) bounded for each n: then, applying Theorem 5.5 to H„ one gets a Gel'fand derivative /„ on #„, and pasting together the functions /, gives the required derivative. D It is interesting to compare Corollary 5.6 with the result due to Moedomo-Uhl, quoted in Theorem 4.17: in some sense, if compactness for the average sets is replaced by boundedness, then Gel'fand derivatives take the place of Pettis derivatives. The requirement concerning the existence of d(x*, ?)/?? in the previous theorems may look too restrictive; however, we can see that it is satisfied under rather mild conditions. One of these has been outlined in [13], where the following well-known theorem by Rybakov is used [49].
Radon—Nikodym theorems 279 Let us recall that a measure ? on ? is said to be .s-bounded, if for every disjoint sequence of sets E„ e ?, we have lim„ ?(?„) — 0. THEOREM 5.7 ([49]). Given an s-boundedfinitely additive measure ? : ? -» X (here, ? may be just an algebra), there always exists some element x* e X*, such that ? <<C \{x\v)\. Definition 5.8. If v. ? -> X is .s-bounded and л* e X* is such that ? « \(x*. v)\, then the measure \(x*, v)\ is called a Rybakov control for v. We can (and do) assume that ??*?? = ?· THEOREM 5.9 ([13]). Let X be any Banach space with separable dual, and let (?. ?) be any measurable space. Assume that ?: ? -> X is any s-bounded finitely additive measure, and let ? be any Rybakov control for v. If ? has weakly closed range, then a Gel'fand derivative dv/?? exists if and only if condition (ii) of Corollary 5.6 holds. PROOF. Having in mind Corollary 5.6, it only remains to show that d{x*. ?)/?? exists, as soon as x* e X*. Let us denote by y* the element of X*, such that ? = \(y*, v}\. As the range of ? is weakly closed, we see that the measure {sx* — ry*, v) =s(x*, v) — r(v*, v) has closed range, hence it admits a Hahn decomposition, for all real numbers s and r, and all elements x* e X*. Now, an easy consequence of Corollary 1.3 gives the desired derivative. ? A more general formulation can be found in the next statement. THEOREM 5.10 ([13]). Assume that X* has the RNP, and let ?: ? -> X be any s- bounded measure, with separable weakly closed range. Then (ii) of Corollary 5.6 is a necessary and sufficient condition for the existence of a Gel fand derivative ??/??, where ? is any Rybakov control for v. PROOF. If ? denotes the separable subspace of X, generated by the range of v, from Theorem 4.12 we find that Y* is separable, hence the conclusion follows from the previous results. ? In order to obtain Bochner (i.e., strong) derivatives in the finitely additive case, one has to make different assumptions, as Hagood in [26] showed. The definition of Bochner integrable function in the finitely additive setting is formally the same as in the ?-additive one (see also [22]). For the sake of simplicity, we shall give here just a particular form of Hagood's result, also related to a previous work by Maynard [40]. THEOREM 5.11. Let (?, ?, ?) be any nonnegative finite and finitely additive measure space, and let ? : ? -> X be any Banach-valuedfinitely additive measure, K</i. Then the following are equivalent: A) there exists dv/?? in the Bochner sense: B) for every ? > 0 and S > 0, there exist С e ?, ?@ > 0, and a e ]0, 1 [ such that:
280 D. Caiideloro and A. Volcic Bi) ?(?)<?. Bii) the set S(C) := {~$-}: A e ?, А С С, ?(?) > 0} is bounded, Biii) for all ? с С, ? е ?, ?(?) > 0, ?/геге ех/ш F с ?, F e ?, such that M(F) > ??(?) and diam(S'(F)) < ?. PROOF. First, let us prove that A) =>· B). We denote by / the derivative dv/??, and fix ?, S > 0. By strong measurability of /, there exists a simple function g = ?"=\ ?? 1?,-, such that the sets ?, form a finite partition of ?, and ?*({? e ?2: ||/(?) - g(o>)|| > ?/4}) < 5. Let us choose a set Я е Г, such that {we ?: ||/(?)-#(?)|| > ? /4} с Я, and ?(#) < ?. Set now: С = Нс. Then, Bi) is satisfied. Moreover, if A e ?, Ac C, we have ||/(?) - g(cu)\\ iC ?/4, for all ? e A, hence ||v(A)|| ^ /д ||g||d/i + |?(?) ^ ???(?), where ? = тах{||/3,|| + ?/4). So, also Bii) is satisfied. Finally, choose a = ^: if ? is any fixed element of ?1 contained in C, denote by ?* one of the elements ?, such that ?(? ? ?*) > ^?(?), and put F = ?* ? ?. So, m(F) > ??(?). Now, let us prove that the diameter of S( F) is less than ?. Let /3 * denote the value of g in ? *, and choose any set Ae ?, with А С F, and ?(?) > 0. We get v(A) ^!???? ^?*?(?) | !A(f-g)dn ?(?) ?(?) ?(?) ?(?) from which we deduce that || —^ - ?*\\ < ?/2, and therefore diamE"(F)) < ?. We now turn to the converse, i.e., B) => A). Fix ? > 0, and ? = ?. Then, let С and a be the corresponding elements, obtained from B). Now, apply Biii) to ? = С: we find a set F| e ?, with F\ С С, and m(F|) > ??@. and such that diamE"(F|)) < ?. If ?№) = ?@ we are finished; otherwise, apply again Biii) to ? = C\F|, thus finding F2 С C\F|, with F2 e ?, such that ?№) > «m(C\F|), and diamE(F2)) < ?. So, F| and F2 are disjoint members of ?, both satisfying diamtSXF,)) < ?. By an exhaustion argument, it is possible to get a (finite or countable) family (F„)„epj of subsets of С of positive measure, each satisfying diam(S'(F„)) < ?, and such that ]??(^,) = ?@. Now define fE = ]T/j„ lfi, where /3„ is, for each n, any element of S(F„). It is now easy to show that fe is Bochner-integrable (indeed, ff is bounded, by Bii)), and L v(A)- / ?,?? A < ?. for any A e ?, AcC. A routine argument now gives an increasing sequence (Ck)keK, with ?{€[) i 0, and a corresponding convergent sequence (/; )??€??, whose limit is the required derivative. D To conclude the section, we turn to the so-called Bartle-Dunford-Schwartz integral, which allows to integrate a scalar function with respect to a vector-valued measure. We shall refer to the paper [37], which in turn was inspired at Musiat's paper [42]. Actually, in [37] locally convex-valued measures were considered, but here we shall limit ourselves to the special case of Banach-valued ones, for simplicity.
Radoii-Nikodvm theorems 281 For the rest of this section, X will denote a Banach space, ?, у: ? -> X will be s- bounded finitely additive measures. Dehnition 5.12. We denote by V the space of all measurable functions /: ? -> R. such that / e L|(|(jc*,m)|), for any x* e X*. Given a function / e 73, we say that / is ?-integrable if for every A e ? there exists an element v(A) e X such that {x*,v(A))= ? /?·(?*?) J A for every x* e X*. We then set: v(A) = fA f??. In general, when / is ?-integrable, the function у is a finitely additive measure, but it satisfies a much stronger condition than absolute continuity with respect to ?. This is evident even in two-dimensional spaces: one can choose the measure space ([0. 1], ?. ?), X := ?2, and then define: ?(?) = (fAxdx.X(A)). v(A) = (?(?), fA xdx). It is clear that ? and у are both equivalent to ?, hence у <<c ?, but there is no function /: [0, 1 ] -> ? such that v(A) = fA f ??: indeed, such a function should satisfy: ?(?)= / xf(x)dx and / .rd.v= / /(jc) ^jc ^A J A J A for any Ael, thus f(x) — ? a.e. and xf{x) = 1 a.e., which is clearly impossible. The key property is introduced in the following definition. DEHNITION 5.13. We say that у is scalarly uniformly absolutely continuous with respect to ?, and write: у <k< ?, if for every ? > 0 there exists S > 0 such that for every x* e X* and every A e ? one has the implication: \(x*, ?)\(?) < S => |(.v*,y)|(A)<e. We say that у is scalarly dominated by ? if there exists a positive number ? > 0 such that |<jc*. v)|(A) ^ Af |<jc*, /i)|(A) holds, for any x* e X* and any A e ?. It is easy to see that, in case /: ? -> ? is bounded and ?-integrable, the measure y, defined in Definition 5.12 above, is scalarly dominated by ?, and satisfies у <sk ?. The Radon-Nikodym theorem stated in [37] asserts that, under certain conditions, depending essentially on the finite additivity of the involved measures, scalar domination is equivalent to scalar uniform absolute continuity, and that each of the two conditions is also sufficient for the existence of a bounded derivative dv/??. The conditions we shall mention here are somewhat stronger, for the sake of simplicity.
282 D. Candeloro and A. Volcic THEOREM 5.14 ([37]). Let ? be any Rybakov control for ?. Assume that for every x* e X*, the set 1 ?(?) J is bounded. Assume also that the ranges of the measures ? and ? are closed. Then the following are equivalent: A) у <sk ?; B) у is scalarly dominated by ?; C) there exists a bounded ?-integrable function f: ? -> Ш, such that (x*,v)(A)= f /?(?*,?), J A for any A e ?, and x* e X*. 6. Further results In this section, we shall see some recent results, concerning different situations: one of them concerns Radon-Nikodym derivatives for measures taking their values in locally convex spaces, and more particularly in nuclear spaces; another situation concerns multivalued measures, taking values in Banach or locally convex spaces; yet a different result concerns Riesz space-valued measures; finally, we shall mention some results concerning Radon-Nikodym derivatives for a different kind of integral, the so-called Sugeno integral. As usual, (?, ?, ?) denotes a measure space, where ? is a ?-algebra, and ? is any nonnegative, finitely additive measure, taking values in [0, +oo[. We start with some results concerning (finitely additive) measures, taking values in a locally convex Hausdorff space X. Some definitions are needed, in order to put appropriate conditions on X. Throughout this section, X* denotes the strong dual space of X. Dehnition 6.1. Let X be any locally convex space, and let В denote any bounded, nonempty subset of X. В is said to be bipolar if В = В00, according with the duality (X, X*). Given any bipolar set В с X, we denote by Хв the subspace of X which can be absorbed by В (i.e., the space of those elements ? e X such that rx e В for some positive scalar r); thus the Minkowski functional рв of В can be defined on Хв (we recall that рв(х) = inf{r > 0: f e B], for ? e Хв)- We can endow Хв with the semi-norm рв and then consider the normed space Хв/ %, where the equivalence relation is defined by: x s» ?' «=> pB(x - x') = 0. We shall denote by X(B) the completion of such normed space. We say that X satisfies the property (SP) if X(B) is separable, for every bipolar set ? CX. Obviously, a separable locally convex space X satisfies (SP).
Radon-N\kod\m theorems 283 DEnNlTlON 6.2. We say that a locally convex space X has the property (SP)' if X*(B°) is separable, for all bounded subsets В с X. If X* is separable, then X has (SP)'. When у takes values in a space X with property (SP)', then a Radon-Nikodym theorem in the Pettis sense has been proved in [9]. The proof is too long and technical to be presented here. THEOREM 6.3. Suppose X has property (SP)', and ?: ? -> X is any ?-continuous finitely additive measure, satisfying the following two conditions: A) there exists d{x*. v)/d^,forany x* e X*: B) the set S := {^щ: A e ?, ?(?) > 0} is weakly relatively compact in X. Then there exists a bounded weakly measurable function g : ? —> X, such that: ('(?*,8(-))?? = (?\?(?)). J A for any x* e X* and all Ae ?. This theorem is applied in [9] to the case of dual-nuclear spaces. In order to state other results, we need some more definitions (see also [45]). Dehnition 6.4. Let X and ? denote two locally convex Hausdorff spaces, and let ?: X -> У be any continuous linear map. We say that ? is nuclear if there exist: (a) a sequence (?,,),,^ in l\; (b) an equibounded sequence (x*)„m ш X*· (c) a bounded sequence (yn)nen in Y, such that Ф(х) = ^2^Уп(х*,.х) for any ? e X. Dehnition 6.5. A locally convex Hausdorff space X is said to be nuclear if every linear continuous map ?: X -> У is nuclear, for every Banach space Y. This definition is not the original one, due to Grothendieck, but we have chosen this equivalent property, because we think it is easier to work with. Nuclear spaces enjoy very interesting properties: for instance, it follows from the classical definition of a nuclear space that it is the projective limit of Hubert spaces. Another important property is that every bounded set in a nuclear space is pre-compact. A useful condition for nuclear spaces is quasi-completeness, i.e., every closed bounded subset is complete. Thus, if a nuclear space X is quasi-complete, all closed bounded subsets of X are compact (hence, X is Montel). If the strong dual X* is nuclear, then the locally convex space X is said to be dual-nuclear. It turns out easily that a quasi-complete dual- nuclear space X is semi-reflexive, i.e., the canonical embedding с: X -> X** is onto.
284 D. Candeloro and A. Vollic From all these remarks, one easily realizes that many interesting results can be found on measures taking values in spaces of this kind. In [9] there are listed some, for measures taking values in a dual-nuclear space. We just recall those results, which are strictly connected with Radon-Nikodym derivatives. THEOREM 6.6 ([9]). Let X be a quasi-complete and dual-nuclear space. Then every bounded finitely additive measure ?:?—>? is s-bounded, and admits a Rybakov contml. THEOREM 6.7 ([9]). Let X be as above, and assume that ?: ? -> X is any bounded finitely additive measure, such that: A) (x*, y) «? , and there exists '^ff1 in L^, for all x* e X*; B) the set S := [j^: ?&?, ?. (A) > 0} is bounded. Then there exists a Bochner-Гуре derivative j-. We remark here that "Bochner" means here that the function j^ is the limit in measure of a sequence of simple functions, and the integral is the limit of the corresponding integrals. This looks like a strong conclusion, and it is worth mentioning how it is derived: from the assumptions we see that the mapping ?: X* -> ?.^, defined as T(x*) = ' ")?? , is nuclear. (For more details we refer the reader to [9], where also completeness of Z-oc(m) is proved.) Therefore one can write: T(x*) = J2^(x\e„)y„ ?/ef. where (A„)„epj e /|, (е„)пец is a bounded sequence in X** = X, and (v„)„ек is a bounded sequence in L^. Choosing a bounded representative /„ from the class of y„, and putting g„(o>) := k„e„f„(w) for all ? e ?, the series ]?#„ converges strongly to the desired derivative. An even stronger result holds for ?-additive measures. THEOREM 6.8 ([9]). Let X be as above, and assume that ? is ?-additive. Ifv-.?-* ? is any measure, К<Д, there exists a Bochner-type derivative ??/??. We now turn to multimeasures, according with the definitions of [16] and [38]: we will just recall the notations and the most relevant results in that setting. DehnitiON 6.9. Given a Hausdorff locally convex space X, the family of all nonempty, closed, convex, bounded subsets of X will be denoted by CC(X)\ if Q denotes the set of all continuous seminorms on X, for every ? e Q and every pair of elements А, В е CC(X), we set: ep(A, B) = sup inf p(x - y). .veA.veB
Radon-Nikodym theorems 285 and dp(A, B) = max{e,,(A. B), ep(B. A)}. For any A e CC(X), we set also: hp(A) — dp(A. {0}). As dp is a pseudo-distance, the family {hp: ? e g) defines a topology on C((X), which is compatible with the following addition: A + В := [a + b: a e A. be B]. We remark that the same topology arises if g is replaced by some absorbing subset go· Also, if X is complete, so is C((X) with this topology. We shall assume, from now on, that X is complete. Given an absorbing subset g0 С ?, a subset В с C( (X) is go- uniformly bounded if sup eg supAeiJ hp(A) < oc. Any finitely additive measure ? : ? -> C( (X) is said to be a multimeasure in X. Dehnition 6.10. Given a multimeasure у: ? -> С, (X), for every p e g the p-varia- tion of у is defined for all ? e ? as: vp(E)= sup УЧДуСА,)), (A,-)eP(?)/e/ where P(E) denotes the family of all finite partitions of ? into sets A, e ?1. We say that у has bounded variation if yp(i2) < oo, Wp e g. Dehnition 6.11. Given a multimeasure у : ? -> C( (X), and a finitely additive measure ? : ? -> Rj, we say that у is absolutely continuous w.r.t. ? ( у <? ?) if for every ? > 0 and every peg there exists a <5(?. ?) > 0 such that ?(?) < S implies vp(E) < ?. We now turn to integration of multifunctions. Our presentation is necessarily concise, and restricted to the Radon-Nikodym problem; we refer to Chapter 14 in this Handbook by C. Hess [28], for a more detailed exposition. Dehnition 6.12. Let ?: ? -> R(| be any finitely additive measure. A map F: ? -> C( (X) is simple if it can be written as: where the C, 's are elements of C, (X), and the A, 's are disjoint elements of ?. The integral of F with respect to ? is defined as: /^М = ]ГМ(А,)С,. •^ / = l As each C, is convex, the definition of integral does not depend on the representation of F.
286 D. Candeloro and A. Volcic In case F is not simple, F is said to be totally measurable if there exists a sequence (Fii)neN of simple multifunctions such that (al) the function dp(F„, F) is measurable for every ? e N and every ? e Q, (a2) the sequence (dp(F„, F)) ?-converges to 0 for every ? e Q. Moreover, we say that F is ?-integrable if there exists a sequence (F„)„eh- of simple functions, satisfying (al) and (a2) above, such that (a3) lim„1.,1_3C/d/,(F„,F„1)^=Oforallpe Q. Such a sequence will be called a defining sequence for F. In case <2o is any absorbing subset of Q, and the conditions (a2) and (a3) above hold uniformly with respect to ? e Qq, then F will be said to be strongly integrable (with respect to <2o, if confusion can arise). In case F is integrable, and (F„)„ei; is any defining sequence for F, then for every F e ?1 the sequence (fE F„ ??),,^; is Cauchy in CC(X), and hence convergent to some element of CC(X), which will be called the integral of F on F, and denoted by /? F ??. This integral does not depend on the defining sequence (see [38] for details). PROPOSITION 6.13. If F :? -> С, (X) й ?-integrable, then ? -> Д- Fi/? ife/i/ies ? multimeasure, which is absolutely continuous with respect to ?. In order to state a Radon-Nikodym theorem, we need one more definition. DEFINITION 6.14. Given a multimeasure ?: ? -> C((X). we say that ? hboundedii sup hp(v(A)) = Mp < oo for every ? e Q. If this is the case, set: <2,, := {-jj~: ? e Q. Mp > 0). If ? : ? -> Rj is finitely additive, for every set ? e ?. and every ? > 0. we put: 5(?):=I^S: Fer· M(F)>o): lM(F) J Sp(?. ?) := {С e C< (X): dp(v(F). Cm(F)) ^ e/i(F) VFeT. F с F}: S(E,e):= P| Sp(?.e). Finally we can state one of the Radon-Nikodym theorems from [38]. The proof is based upon the technique of Hagood, see Theorem 5.11. THEOREM 6.15 ([38]). Let v. ? -> С, (X) be a multimeasure, ? «i ?, where ? is any nonnegativefinitely additive measure on ?. Assume that A) ?(?) is Qv-uniformly bounded: B) for every ? > 0 and every ? e ?, ?(?) > 0. there exists a sequence of pairwise disjoint subsets (E„)„ey: of E, with E„ e ?, such that ?(?) = ]?„?(?„), and such that S(E„. ?) фЪ for any п.
Radon-Nikodym theorems 287 Then there exists a Qv-strong!y measurable multifunction G: ? -> CC(X), with Qv- uniformly bounded range, such that: / Gdix = v(E), for all Ее ?. A quite different problem arises when the measures are both vector-valued. An example was given in the last part of Section 5, with Banach-valued measures; but sometimes it is interesting to consider a richer structure, and to assume that the vector measure takes values in a Riesz space: this is convenient, when the linear spaces have a natural order structure, and the "order" convergence is not topological, like in the space Lq, with the a.e. convergence. We shall present here a very simplified version of the results, and refer to [5 ] for further details or generalizations. Assume that R is an order-complete Riesz space (here, order-complete means that every nonempty subset F С R which is bounded from above has a least upper bound in R). Order-completeness allows to define sup, inf. lim sup and lim inf. We first mention the so-called Riemann-type integral for bounded functions with values in a Riesz space. Dehnition 6.16. Given a bounded function /: [a. b] -> R, we say that / is Riemann- integrable (with respect to Lebesgue measure) if sup / s(x)dx= inf / t(x)dx veS/ J teT, J where Sf (Г/, respectively) is the class of all /?-valued step functions s ?. f (t > /, respectively), and the elementary integral of a step function is defined in the obvious way. This integral is well-defined, and is a linear monotone /?-valued functional. Moreover, one can see (by usual techniques) that monotone functions are integrable. Definition 6.17. Given a positive finitely additive measure ?: ? -> R, we say that ? is ?-additive if, for every decreasing sequence (En),ie\\ in ?, E„ \. ? implies inf{ ? (?„): ? e ?} = ?(?). Given a positive finitely additive measure ?:?^> R, and a bounded measurable function f-.?-* Rq", we say that / is integrable with respect to ? if the following Riemann-type integral is finite: f fdii = [ ^ей: f(x)>t})dt. Jn Jo This means that the set {/()M ?({? e ?: f(x) > t})dt: ? > 0} is bounded in R, and fn f ?? is its least upper bound; we observe that the function g(t) = ?({? e ?: /(л) > t}) is monotone decreasing, and hence Riemann integrable in every interval [0. M].
288 D. Candeloro and A. Volcic If / is integrable, then /1 ? is integrable too, for every ? e ?, and ? -> / /1 ? ?? := §Ef ?? defines a measure ?, which is absolutely continuous with respect to ?, in the sense that limsupv(A„)=0 as soon as (A„)„€f: is any sequence from ?. such that limsupM(A„) = 0. The Radon-Nikodym theorem for Riesz space-valued measures can be formulated exactly like Theorem 1.2, with (bl) and (b2) replaced respectively by (b'l) and (b'2), and (b3) replaced by absolute continuity. THEOREM 6.18. Let ? and ? be positive finitely additive measures defined on ? and taking values in R, with ?«?. Then the following are equivalent: (a) There exists a measurable and ?-integrablefunction f : ? —> [0. oof, such that j fd? = v{E) for all ? e ?. (b) there exists a family of sets (A,-) in ?, for r > 0, such that, for any r > 0: (bl) v(E) > r^(E),forany Ее ?, ? с Аг: (Ь2) v(E) <; r^(E),forany ? e ?, ? с Acr. Though the formulation (and the proof) of this theorem is quite similar to Theorem 1.2, the consequences are not so strong: in fact the same example given before Definition 5.13 shows that even for ?-additive measures, absolute continuity is not sufficient to ensure the existence of the derivative. A different approach to the idea of integration was presented by Sugeno in [54]. We shall give here quite a short outline of the involved concepts, and also a very simple Radon-Nikodym theorem for this integral. For similar topics, see also Chapter 33 in this Handbook [2]. Definition 6.19. Let (?, ?) be any measurable space. A fuzzy measure on this space is a mapping m : ? -> [0, +oo[, such that: A) m@)=O; B) m(A)^m(B), whenever А, В e ?, А С S; C) if (A„) is any monotone sequence in ?, then w(lim F„) = Yimm(F„). If m is a fuzzy measure on (?, ?), then the triple (?. ?, m) is called a fuzzy measure space. Dehnition 6.20. Let (?, ?. m) be a fuzzy measure space, and let h : ? -> Ej be any measurable function. For each A e ?, define: (S) hdm :=sup{<* лш(АП Fa)\. JA a>() where Fa := [? e ?: h(w) > a]. The number (S) fA h dm is called the Sugeno integral of h on the set A.
Radon-Nikodym theorems 289 (Notice that the definition of Sugeno's integral is reminiscent of the Choquet integral, one just replaces л with ordinary multiplication, and ? with addition.) From this definition, it follows immediately that (S) / hdm<:m(A), УАеГ. J A We list a number of results, concerning this integral. THEOREM 6.21 ([14]). Let (?, E.m) be any fuzzy measure space, and let h-.? -> R(j be any measurable function. A) If h is a constant function, h{x)=a. then (S) / hdm=a/\m(A) for any A e ?. J a B) Ifh and ti: ? -> Rj are measurable functions, with h' ^ h, then (S) / h'dm ? (S) hdm for any AeZ. C) Ifh = 1 д, for some Ae ?, then (S) / hdm =m(A). D) //'(/2„)„epj is any monotone sequence of measurable functions, such that h„ —> h. then lim(S) / h„dm = (S) / hdm. J A J A for any Ae ?. From part D) of the previous theorem, it follows that A -> (S)fAhdm is a fuzzy measure, not greater than m. A useful characterization of the Sugeno integral is the following: THEOREM 6.22 ([14]). Given a measurable function h-.? ->¦ Rj, define, for a > 0: Aa := {we ?: h(a>) > a\. Fa := {we ?: h(w) ><*}. If(Ba) is any decreasing family in E.fora > 0. such thatAa С Ва С Fa, then (S) / hdm := sup{<* Лт(А П Ba)} J A o>() for all A e ?.
290 D. Candeloro and A. Volcic Moreover, if I denotes the set of all a e E(|, such that a ^ m(Fa), then (S) / hdm = max/. This result allows us to state a Radon-Nikodym theorem. THEOREM 6.23 ([14]). Let m and ? be two fuzzy measures on (?, ?), with ? ^ m. The following are equivalent: (a) There exists a measurable function /7: i? —» R(|, such that y(A) = (S) / hdm for any A ? ?. (b) There exists a decreasing family (Bu) in ?, with a e M.q, such that: (bl) ?(??\??) ^? лт(АГ)Ва), and (Ь2) у(А)^т(АПВу(А)). for all A ? ?, and all a e Rj. For a richer treatment of Sugeno integral, and its properties, we refer the reader to [54] and [55]. Appendix. Singularity and decomposition theorems In this appendix we will present some known facts about singularity of measures, the Lebesgue decomposition theorem and some related decomposition theorems, which are relevant in connection with the Radon-Nikodym theorem. We assume that all the measures have been already completed in the Caratheodory sense. Definition A.l. We say that a measure ? is degenerate, if it takes no finite, nonzero values, i.e., its range is contained in {0. со). In connection to this definition, let us first mention the following result, due to N.Y.Luther [35]. THEOREM A.2. Let ? be a measure on the ?-algebra ?. Then there exists a unique decomposition ? = ? \ +?2, where ? \ is semifinite and ?? is degenerate, with the property that if ?' = ?', + ?'-,, with ?\ semifinite and ?'-, degenerate, then ?\ ^ ?\ and ?? ^ ?-,. This result and the fact that degenerate measures are not very interesting from the point of view of their representation as integrals, justify the restriction to semifinite measures in Section 2 of this chapter. We will make the same assumption in the Appendix. If ? is a measure on (?. ?), we say that ? is concentrated on A e ?, if ?(? \ A) = 0.
Radon-Nikodxm theorems 291 Definition A.3. Suppose ? and у are two measures on ?, and suppose there exists a pair of disjoint sets A and В in ?, such that ? is concentrated on A and у is concentrated on S. Then we say that у and ? are mutually singular (or that у is singular with respect to ?) and write ?±?. E) Obviously, this relation between measures is symmetric, so E) holds if and only if ? J_ y. The following properties are quite obvious: • If v\ J_ ? and vi -L ?, then v\ + yi J_ ?. • If У| <<C ? and Ут J_ ?, then v\ J_ ??. • If у <K ? and у J_ ?, then у = 0. THEOREM A.4. /f ? and ? are two measures on (?. ?) and ? is ?-finite, then there exist У0 <K ? and ?\ J_ ? such that ? = vq + ?\. The decomposition is unique. Let us sketch the proof. Suppose first that ?(?) < oo, and let a = sup{y(S): В е ?, ?(?) = 0). It is easy to prove, using arguments introduced in Section 2. that there exists С e ?, such that v(C) = a and ?@ = 0. Define now v\ (E) = v(E П C) and vq(E) = v(E \ C). It is easy to check that uo <?? ? and v\ J_ ?. If у is ?-finite, we consider a measurable decomposition {A„} of ? such that ?( ?„) < oo for all и, and apply the argument above to each A„. If у is not ?-finite, the conclusion of the previous theorem may fail. If for instance ? is the Lebesgue measure on [0, 1] and у is the counting measure, such a decomposition does not exist. We need now two definitions. The first is due to R.A. Johnson [32], the second one is taken from [59]. DEFINITION A.5. We say that у is S-singular with respect to ?, denoted ?$?, if given Ее ?, there exists F e ?, F с ?, such that v(E) = v(F) and m(F) = 0. DEFINITION A.6. We say that у is quasi-singular (Q-singular) with respect to ?, denoted у Q ?, if there exists Ae ? such that у is concentrated on A and moreover if у (?) < oo ?????(???) = 0. Singularity implies Q-singularity and the latter implies S-singularity. The three concepts are not on the other hand equivalent. The counting measure on [0, 1] is Q-singular with respect to the Lebesgue measure, but it is not singular. To show that S-singularity does not imply Q-singularity we need a more elaborate example. Let us consider the measure space defined in [27, Exercise 31.9], which provides a non-Maharam measure ? as sum of two measures ? and ?. As noted by R.A. Johnson, ySa anda S y,but it is not у ±?. On the other hand it can not be yQa (or ? Q y) because of the following proposition [59].
292 D. Candeloro and A. Volcic Proposition A.7. /f vQm and ?$?, then ? ±?. R.A. Johnson proved the following decomposition theorem [32]. THEOREM A.8. If ? and ? are two measures on (?. ?), then there exist щ <К ? and у ? ?? such that ? = vo + ? ?. Always v\ is unique. Moreover, ifv = v'0 + v\ is another such decomposition, vq ^ v'Q. The advantage of the previous theorem, compared to Theorem A.2 is that no assumption is required on ? and у (not even semifiniteness). The disadvantage lies in the weakness of the concept of S-singularity. Q-singularity stays in between and shares some good and bad aspects of both concepts. A reasonable assumption (specially if we deal with the Radon-Nikodym theorem) provides the following decomposition theorem (Theorem 2.1 in [59]). THEOREM A.9. If ? + ? is Maharam, then ? admits ?(? + v)-unique decomposition ?,·, i = 1, 2,3, such that if we put y,(?) = y(? ? ?,) and ?,(?) = ?(? ? ?,) for i = 1,2,3, then viQM, F) M2Qv, G) V3«M3«V3· (8) Moreover, щ and ??, are strongly comparable. The set ?\ is defined as the least upper bound in the measure algebra associated to (?, ?, ? + у) of the family of sets {?: ? e ?, y(?) < oo. ?(?) = 0}. Similarly we define ??. The major difficulty in the proof is to show that ?/, for /' = 1,2 are both ?- and у-measurable. References [1] A. Basile and K.P.S. Bhaskara Rao. Completeness of Lp-spaces in the finitely additive setting and related stories. J. Math. Anal. Appl.. to appear. [2] P. Benvenuti, R. Mesiar and D. Vivona. Monotone set functions-based integrals. Handbook of Measure Theory. E. Pap. ed.. Elsevier. Amsterdam B002). 1329-1379. [3] P. Berti, E. Regazzini and P. Rigo. Finitely additive Radon-Nikodym theorem and concentration function of a probability with respect to a probability. Proc. Amer. Math. Soc. 114 A992). 1069-1078. [4] G. Birkhoff. Integration of functions with values in a Banach space. Trans. Amer. Math. Soc. 38 A935). 357-378. [5] A. Boccuto. On the de Giorgi-Letta integral with respect to means with values in Riesz spaces. Real Anal. Exchange 21 A995/96). 793-810.
Radon-Nikodvm theorems 293 [6] S. Bochner. Integration von Funktionen deren Werte die Elemente eines Vectorraumes sind. Fund. Math. 20 A933). 262-276. [7] S. Bochner, Additive set functions on groups, Ann. of Math. B) 40 A939). 769-799. [8] N. Bourbaki, Elements de nmthematique, Liyre VI, Integration. Hermann. Paris A956-65). [9] J.K. Brooks. D. Candeloro and A. Martellotti, On finitely additive measures in nuclear spaces. Atti Sem. Mat. Fis. Univ. Modena 46 A998), 37-50. [10] D. Candeloro and A. Martellotti, Su alcuni problemi relalivi a misure scalari sub-additive e applicazioni al caso dell'additivita finita. Atti Sem. Mat. Fis. Univ. Modena 27 A978), 284-296. [11] D. Candeloro and A. Martellotti. Sul rango di una massa vettoriale. Atti Sem. Mat. Fis. Univ. Modena 28 A979). 102-111. [12] D. Candeloro and A. Martellotti. Geometric properties of the range of two-dimensional qua si-measures with respect to Radon-Nikodym property. Adv. in Math. 93 A992). 9-24. [13] D. Candeloro and A. Martellotti, Radon-Nikodvm theorems for vector-valued finitely additive measures. Rend. Mat. Roma 12 A992), 1071-1086. [14] D. Candeloro and S. Pucci, Radon-Nikodym derivatives and conditioning in Fuzzy Measure Theory. Stochastica 11 A987). 107-120. [15] M.D. Carrillo. Daniel integral and related topics, Handbook of Measure Theory, E. Pap, ed.. Elsevier. Amsterdam B002), 503-530. [16] C. Castaing, A. Touzani and M. Valadier. Theoreme de Hoffman-Jorgensen et application aux a mart multivoques, Seminaire d'analyse convexe Montpellier A984), Expose No. 7. [17] S.D. Chatterji, Martingale convergence and the Radon-Nikodvm theorem in Banach spaces. Math. Scand. 22A968), 21-41. [ 18] W.J. Davis and R.R. Phelps, The Radon-Nikodvm property and deniable sets in Banach spaces. Proc. Amer. Math. Soc. 45 A974), 119-122. [19] D. Denneberg. Non-Additive Measure and Integral. Kluwer. Dordrecht A994). [20] J. Diestel and J.J. Uhl, The Radon-Nikodym theorem for Banach space valued measures. Rocky Mountain J. Math. 6A976), 1-46. [21] J. Diestel and J.J. Uhl, Vector Measures. Math. Surveys, Vol. 15. Amer. Math. Soc., Providence. RI A977). [22] N. Dunford and L. Schwartz. Linear Operators. Vol. 1. Interscience. New York A958). [23] N. Dunford and B.J. Pettis, Linear operations on summable functions. Trans. Amer. Math. Soc. 47 A940). 323-392. [24] D. Fremlin, Decomposable measure spaces. Z. Wahrsch. Verw. Gebiete 45 B) A978). 159-167. [25] G.H. Greco, Un teorema di Radon-Nikodvm per funzioni d'insieme subadditive, Ann. Univ. Ferrara 27 A981), 13-19. [26] J.N. Hagood, A Radon-Nikodym theorem and Lp-completeness for finitely additive vector measures, J. Math. Anal. Appl. 133 A986). 266-279. [27] PR. Halmos. Measure Theory, Van Nostrand, Princeton A956). [28] C. Hess, Set-valued integration and set-valued probability theory: An overview. Handbook of Measure Theory, E. Pap. ed., Elsevier, Amsterdam B002). 617-673. [29] R.E. Huff. Dentability and the Radon-Nikodym property. Duke Math. J. 41 A974). 111-114. [30] A. Ionescu Tulcea and C. Ionescu Tulcea. On the lifting property I.}. Math. Anal. Appl. 3( 1961), 537-546. [31] A. Ionescu Tulcea and C. Ionescu Tulcea. On the lifting property II. Representation of linear operators on spaces L'E, 1 sC r < oo, J. Math. Mec. 11 A962), 773-795. [32] R. A. Johnson. On the Lebesgue decomposition theorem. Proc. Amer. Math. Soc. 18 A967). 628-632. [33] J.L. Kelley, Decomposition and representation theorems in measure theory. Math. Ann. 163 A966), 89-94. [34] D. Kolzow, Differentiation von Mafien. Lecture Notes in Math.. Vol. 65. Springer. Berlin A968). [35] N.J. Luther, A decomposition of measures. Canad. J. Math. 20 A968). 953-958. [36] D. Maharam. An algebraic characterization of Measure Algebras. Ann. of Math. 48 A947), 154-157. [37] A. Martellotti, K. Musial and A.R. Sambucini. ? Radon-Nikodym theorem for the Bartle-Dunford- Schwartz integral with respect to finitely additive measures. Atti Sem. Mat. Fis. Univ. Modena 42 A994). 343-356. [38] A. Martellotti and A.R. Sambucini. A Radon-Nikodym theorem for multimeasures. Atti Sem. Mat. Fis. Univ. Modena 42 A994), 579-599.
294 D. Candeloro and A. Volcic [39] H.B. Maynard, A geometrical characterization of Banach spaces with the Radon-Nikodfm property. Trans. Amer. Math. Soc. 185 A973), 493-500. [40] H.B. Maynard, A Radon-Nikodvm theorem for finitely additive bounded measures. Pacific J. Math 83 A979), 401-413. [41] S. Moedomo and J.J. Uhl. Radon-Nikodvm tlieorems for the Bochner and Pettis integrals. Pacific J. Math. 38 A971), 531-536. [42] K. Musiai, A Radon-Nikodvm theorem for the Bartle-Dunford-Schwartz integral. Atti Sem. Mat. Fis. Univ. Modena 41 A993), 227-233. [43] J. von Neumann, Algebraische Reprasentanten der Funktionen "bis auf eine Menge vom Mafie Null". J. Crelle 165 A931), 109-115. [44] O.M. Nikodym, Surune generalisation des integrities de M.J. Radon, Fund. Math. A930), 131-179. [45] A. Pietsch, Nuclear ика11у Convex Spaces. Springer. Berlin A972). [46] M. Radon, Theorie und Anwendungen der absolm additiven Mengenfunktionen, Sitzungsber. der Math.- Naturwiss. Klasse der Kais. Akademie der Wiss.. Bd. 112. Abt II a/2. Wien A913). [47] M.A. Rieffel, Deniable subsets of Banach spaces, with applications to a Radon-Nikodvm theorem, in: Functional Analysis, B.R. Gelbaum (ed.). Washington, Thompson Book Co. A967). [48] M.A. Rieffel, The Radon-Nikodym theorem for the Bochner integral. Trans. Amer. Math. Soc. 131 A968), 466-487. [49] V. Rybakov, Theorem of Bartle, Dunford and Schwartz on vector-valued measures, Math. Notes 7 A970), 147-151. [50] S. Saks, Theory of Integral. New York A937). [51] I.E. Segal, Equivalences of measure spaces. Amer. J. Math. 73 A95 I). 275-313. [52] С Stegall, The Radon-Nikodvm property in conjugate Banach spaces. Trans. Amer. Math. Soc. 206 A975), 213-223. [53] W. Strauss, N.D. Macheras and K. Musiai, Liftings. Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam B002), 1131-1184. [54] M. Sugeno, Theory of fuzzy integrals and its applications, PhD Thesis, Tokyo Inst, of Technology A974). [55] M. Sugeno and T. Murofushi, Pseudo additive measures and integrals. J. Math. Anal. Appl. 122 A987), 197-222. [56] J.J. Uhl, A note on the Radon-Nikodvm property for Banach spaces. Revue Roumaine Math. 17 A972), I 13-1 15. [57] S. Ulam, Zur Mafitheorie in der allgemeinen Mengenlehre. Fund. Math. 16 A930), 140-150. [58] A. Volcic, Sul teorema di Radon-Nikodvm net caso поп a-finite. Rend. 1st. Mat. Univ. Trieste 2 A970), 42-53. [59] A. Volcic, Teoremi di decomposizione per misure localizzabili. Rendiconti di Mat. B), Ser. VI 6 A973), 307-336. [60] A. Volcic, Sulla differenziazione di misure. Rend. 1st. Mat. Univ. Trieste 6 A974), 156-177. [61] A. Volcic, Localizzabilita, semifinitezza e misure esterne. Rend. 1st. Mat. Univ. Trieste 6 A974), 178-197.
CHAPTER 7 One-Dimensional Diffusions and Their Convergence in Distribution James K. Brooks University of Florida, Gainesville, FL 32611-2082. USA E-mail: BmokudPinath.ufl.edu Contents Introduction 297 1.Brownian motion 297 Introduction 297 1.1. Definitions 298 1.2. Mappings of Brownian motion 298 1.3. Markov times and the strong Markov property 300 1.4. Further properties of Brownian motion 301 2.One dimensional diffusions 304 Introduction 304 2.1. The setting 304 2.2. Definitions and properties 305 2.3. The scale function 306 2.4. Speed measure 307 2.5. Local time for Brownian motion 309 2.6. Time changing Brownian motion 310 3.Weak convergence of diffusions 313 Introduction 313 3.1. Preliminaries 313 3.2. The general convergence theorem 314 3.3. Convergence of diffusions 321 4.Diffusions as a limit of stretched Brownian motions and stretched random walks 330 Introduction 330 4.1. Stretched Brownian motion 330 4.2. Diffusions as a limit of stretched random walks 335 4.3. Natural time 340 References 342 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 295
This page intentionally left blank
One-dimensional diffusions and their convergence in distribution 297 Introduction The phenomenon of the irregular movement of pollen grains suspended in water was observed by Robert Brown in 1827. Brownian motion, the name given to this process, was not given a rigorous mathematical treatment until almost one hundred years later by N. Wiener in 1923. Then the stochastic processes called diffusions, which possessed some properties akin to Brownian motion, appeared shortly after Wiener's work on harmonic analysis in 1930. The real starting point in the theory of diffusions was the series of fundamental papers by W Feller A952, 1954, 1955, 1957), in which a semi-group approach was employed. The modern form of the theory of one-dimensional diffusions, regarded as the most complete chapter in probability theory, was developed in Dynkin A955, 1959) and Ito and McKean A974). In this Handbook Series we have attempted to introduce to measure theorists some of the beautiful results in this field; the only probability prerequisite is the early chapters of Breiman A968). Our treatment in the early sections leans toward an intuitive approach, which is the only possible approach in this limited format. Sections 1 and 2, Brownian motion and real valued diffusions, follow the presentation in the fine book of Freedman A971). Instead of proofs, informal discussions of the concepts involved are given, although precise definitions are presented so that all the ingredients are included to reach Feller's theorem which characterizes a regular real valued diffusion as a space change and time change of Brownian motion. In Sections 3 and 4 proofs are given in detail and the reader has the opportunity of working with the nuts and bolts of one-dimensional diffusions: scale functions, speed measures, time changes, Brownian local time and weak convergence. The results in these two sections are due to Brooks and Chacon A982, 1983) and hopefully will provide some insight into the structure of diffusions and their weak (distributional) convergence. One path leading to a study of diffusions is the following: start with Freedman A971) for an elegant introduction, cover Ito and McKean A974), the bible for the one-dimensional case, and then Stroock and Varadhan A979) for the multidimensional case (which is far from being a complete chapter). The reader can construct other paths by consulting the references. In any case, the advice given in the preface to Loeve A963) is germane: ".. .the reader will have to be armed permanently with patience, pen, and calculus". The reward? An excursion into one of the most fascinating areas of Analysis. 1. Brownian motion Introduction The botanist Robert Brown in 1827 focused a microscope on a drop of water which had been trapped in a piece of igneous rock millions of years ago. He observed tiny particles continually moving with an irregular motion-a motion he had observed before while studying pollen grains in water. The entrapment of the water in the rock forced Brown to rule out his earlier explanations causing the phenomenon now referred to as Brownian motion. Bachelier in 1900 in his doctoral dissertation presented the first mathematical formulation of Brownian motion as a result of studying stock price fluctuations. Einstein gave a mathematical description of this phenomenon from the molecular-kinetic theory
298 J.K. Brooks of heat in 1905. The first existence proof was given in Wiener A923, 1924). The fine structure of the sample paths of Brownian motion can be found in the deep work of P. Levy A939, 1948). 1.1. Definitions First we start off by describing the most basic one-dimensional diffusion: Normalized Brownian Motion. Definition 1.1. Normalized Brownian motion ? is a real valued stochastic process {B(t): 0 ^ t < oo} on a probability space (?, ?, P) with the following properties: A) For each fixed t, B(t): ? -> R is (?, B\ )-measurable (B„ denotes the Borel subsets ofR"). B) fi@)=0. C) ?{·)? is continuous on [0. oof for each ? e ?. D) For 0 < 11 < · · · < r„_ ? <t„, the increments ?(?|),?(?2)-?(?,) ?(?„)-?(?„_,) are independent and normally distributed with means 0 and variances t\,tj — t\, —1„ —t„-\. In this section, (?, ?, P) will refer to the above probability space. We regarded ? as a space of sample paths ?, where ?-.t -» ?(?)(?). It is helpful to visualize ? e ? as a small particle released at the origin in R3 at time t = 0; this particle is constantly buffeted about by molecular bombardment. To obtain one-dimensional Brownian motion, fix an axis in R3 and project the motion of the particle ? on this axis. The projection of the position of ? at time t on this axis is ?(?)(?). Briefly, the process B(t) can be constructed for t running through the binary rationals, for a suitable probability space. After showing that for almost all ?, the sample paths are uniformly continuous on bounded time intervals (the hard part), extended these ? to [0, oof, and define the other ?/s to be the constant function 0. Assume now that normalized Brownian motion exists and let us list some of the beautiful properties associated with this fascinating process - a process which has been used in practically every branch of science. We shall transfer the setting (?, ?, ?) to С (I), the set of continuous functions /on the interval / с [0, oo). Most of the time / is [0. oc[. [0, 1] or ]0, oof. For such an /, define the functions ?,, t e /, by ?,(/) = f(t). We then form the ?-field В (or ?(?), which is the smallest ?-field for which all the ?, : С (I) -> R, t e I, are measurable. 1.2. Mappings of Brownian motion 1. With the measurable spaces (?, ?) and (C(/). B) in hand, we shall examine various measurable maps from ? to C(I). We set B(t, ?) = ?(?)(?). In the following, ? > 0, s ^ 0, and ? e R.
One-dimensional diffusions and their convergence in distribution 299 THEOREM 1.2. The following maps from (?, ?) to (C[0, oof, B) are measurable. (a) B, which sends ? e ? to the function ?(-,?) e C[0, oof. (b) —B, which sends ? & ? to —B(, ?). (c) ?~[ ?(?2, ·), which sends ? e ? to ?~? ?(?2(·), ?). (d) ? + ?, which sends ? e ? to ? + ?(·, ?). (e) B(s + ·), which sends ? e ? to the function t -> B(s + t, ?), t e [0, oof. (f) B(s + ·) - ?(?), which sends ? to the function t -> ?(? + ?.?) — ?(.?.?), t e [0,oo[. 2. Definitions. Let ? be a measurable map from (?. ?) to (C(/). ?). We define the P-distribution of ? to be the probability PM~] on B. In particular, for ? e R, set P, = V(x + ?)". Thus, we can now think of our sample paths as belonging to C(I); Px handles paths starting from ? at time t — 0. For 0 < ? < oo, define E(s) to be the ?-algebra generated by B(u), for 0 ^ и ^ s that is, the events of Brownian motion up to time i. 3. We now list some important properties of В. (a) (symmetry) P(-B)~[ = P0. (b) (scaling) ?(?-l ?(?2(¦)))-l = PQ. (c) (Markov property) B(s + ·) - B(s) is independent of E(s) and has P-distribution equal to Po· Property (a) is intuitively obvious-remember, the particle (viewed in one dimension) has equal probability of being pushed to the left or right. Property (b) shows that if we adjust the time by the factor ?2 and the state space, i.e., the range of B, by the factor ?, we retain our Brownian motion. Tampering with ? in a slightly different fashion will lead to "stretched Brownian motion", which is an interesting process different from B, such processes will weakly converge to a general diffusion-more about this in Section 4. 4. Property (c) is an introduction to a key property belonging to B. Let us examine this in more detail. Obviously the P-distribution of B(s + ¦) — B(s) gives the full weight to paths starting from ? = 0; but the interesting aspect is that the particle governed by B(s + ·) — B(s) forgets about the past times before ? and starts from scratch at t = 0, like B. Analytically, we have the following. Fix s; then the function Pa,s. .,(·) :flxB-»[0,l] has the following properties: A) For each ? e ?, ??(.5.?)(·) is a probability on B. B) For each A e B, the function ? -> Pb(,.W)( A) is a measurable function on ?. C) Рвц.шМ) = P(B(s + ·) e ?|2?(?))(?), a.a. ? e ?, A e B, that is P(G, B(s + ·) e A) = [ PBls 0))(A)P(da>). G e Г(л) (write P(G, F) for F(G ? F)). We say ??(.?)(·) is a regular conditional ?-distribution for ?(? + ·) given ?(.v). This is the mathematical statement concerning the Brownian particle starting afresh. Another form
300 J. K. Brooks is, for fixed t > i, P(B(t) e T\E(s))(a>) = PBls.o»(B(t - s) e T), a.a. ?, ? e ?, where (?(? —s)eT) is the set of functions in C[0. oo) whose range at time t — s lies in T. Thus the event (B(t) e T), given the information of the process up to time i, is independent of events prior to time i. But Brownian motion is even richer - it has this property even when the constant time is replaced by a random time. Of course, peeking into the future is not allowed in this game, so we restrict our random times accordingly. To illustrate this point, fix b e E. Let ? (?) = inf{r: B(t,w) = b\, ?&?. Thus ? (?) is the first time the Brownian path hits b. Note that ? is a random variable with the property that the event (? ^ t) belongs to ?(?), that is, the random time ? does not look into the future. (Compare this to the random time ?? (?) = last time the ?-path hits b for t e [0, 1], which does look into the future). Now let ?(?,?)=?(? + ?(?),?) -?(?(?).?). t ^0 (where we may assume ?(?) < oo on ?). It can be shown that ? is normalized Brownian motion and that ?(?(?), t > 0) is independent of the "events up to time ?", where a precise meaning will be given to the last phrase. Thus our Brownian traveler starts afresh even with respect to path shifts induced by (suitable) random times-this is the so-called strong Markov property, examined below. 1.3. Markov times and the strong Markov property Let ? : ? -> [0, oo] be a random variable such that {? < t} e ?(?), for all t > 0. We call such a ? a Markov time or simply Markov (other terminology-? is optional with respect to E(t+) := ?\[?(? +?), ? > 0}). Define ?(?+) to be the ?-algebra of all events A e ? such that А П {? < t} e E(t), for all t > 0. Then ?(?+) is thought of as being the collection of events prior to time ?. On {? < oo} define the process ?(? + ·) — ?(?) as in 4.D) in the previous subsection. This process is a map from [0, oo] ? {? < oo} into R which is measurable with respect to ??([0, oof) ? ? ? {? < oo}; furthermore, ?(?) on {? < oo} is measurable with respect to ?(?+). Note that if t is fixed and we let ? =t, then ?(?+) and ?(?+) agree. The following important theorem says that Brownian motion has the strong Markov property. THEOREM 1.3. Assume ? is Markov. Given {? < oo}, the ?-algebra ?(?+) is conditionally ?-independent of the process ?(? + ·) — ?(?), and this process is conditionally normalized Brownian motion. Furthermore, a regular conditional ? -distribution for ?(? + ·) given ?(?+) is ????)(·) on ? < oo. The above theorem says P(G, ?(? +¦)- ?(?) e D) = P(G)P(B(z + ¦)- ?(?) e D), for G e ?(?+), G с {? < oo} and D e B.
One-dimensional diffusions and their convergence in distribution 301 Also, P(G, ?(? + ¦)- ?(?) e D) = ( ??{?.?»@)?(??), Jg G e ?(?+), G С {? < oo} and D e B. Go back to the example where ? is the first time the path hits b. Then Pb(t.w){D) = Pb(D), that is, starting from time ?, the paths start afresh, and are Brownian motions starting from b at t = 0. These results concerning the strong Markov property are due to Hunt and Blumenthal, but were used by Levy prior to its rigorous establishment; as was often the case. Levy simply forged ahead with his marvelous intuition. This property is essentially the defining property of a diffusion as we shall see. To quote from Dellacherie and Meyer A978) "... the seemingly trivial notion of a stopping time (due to Doob) is the cornerstone of the 'general theory of processes' ". 1.4. Further properties of Brownian motion A few results are recorded in this subsection in order to strengthen the readers intuition concerning Brownian motion. Consult Freedman A971) for proofs of these results. The first result is the so-called reflection principle of Andre. Consider the simple case where b is a positive number and a Brownian traveler B(t,co) hits b at the Markov time ?. Form a new process by defining an associated path after time ? (?) by reflecting the path ?(·, ?) about the horizontal line through b. Because of the symmetry of ?, by now we should expect this modified process to still be Brownian motion. It is!, and the justification for this fact lies in the strong Markov property. More generally, let ? be Markov. Define the process У as ? reflected at ? by defining Y(t,w) = B(t,w) for all t if ?(?) = oo; otherwise, define Y(t, ?) — ?(?,?), for t ? ?(?) and ?{?,?) = 2?(?(?),?) -B(t.w), fori > ?(?). Thus В is reflected about the horizontal line through (?, ?(?)). The reflection principle is that ? is normalized Brownian motion. Next, let M(t) = ma\{B(s): 0 ^ ? ^ ?}. It turns out that, for у ^ 0 and b ^ 0, we have P(B(t) <b-y, M(t) >b) = P(B(t) >b + y). From this, it follows that P(M{t) > b) = 2P(B{t) > b). Not only do these identities aid our intuition, but they are essential tools in calculations involving B. One of the best ways to appreciate the wilderness that is C[0, oof. which is filled with such exotic forms, is to delve into the study of the sample path properties of Brownian motion. Perhaps one is initially discouraged when first discovering the futility in trying to visualize the path of a Brownian traveler:
302 J.K. Brooks For almost all ?, the sample path ?(·, ?) is monotone in no interval. The reader is invited to attempt a sketch of continuous function having this property. If we set ??(?) = {t: B{t,w) = y), then for each у and almost all ?, S\ (?) is closed, unbounded, of Lebesgue measure zero and dense itself. Finally, by now it should come as no surprise that almost all paths are nowhere differentiable. Is it any wonder that Brownian motion has captured the imagination of so many research workers? A proof of the fact that for a.a. ?, ?? := {t: B(t,co) = 0} has no isolated points illustrates the use of the strong Markov property. The idea is that the behavior at t = 0 is duplicated every time the path returns to the origin. To handle the action at t = 0, we use Khintchine's law of the iterated logarithm, which states that limsupB@(V2ilog(log(l/0))~' = 1 a.s. no Note that if we replace sup by inf, and 1 by — 1, equality holds. Hence, due to the fluctuation of the paths close to t = 0, by the above result, we conclude that t = 0 is a limit point, from the right, for a.a. ?. Let r be a positive rational and set ?, (?) to be the first t ^ r such that B(t,w) = 0. Since ?(?, + ·) — ?(?>) behaves like normalized Brownian motion starting at 0, by the strong Markov property, we see that ?, (?) is a limit point, from the right, for a.a. ?, of zeros of ?(·, ?). Intersecting events over all the positive rationals r, conclude that for a.a. ? we have: the first zero of the path after every positive rational time is a limit point of ?? from the right, hence no point of ?? is an isolated point. The case 5?(?), у ф 0, is handled in the same fashion. Fig. 1. The above illustration is a close approximation to the plot of a microscopic particle suspended in water by the French physicist Jean Baptiste Perrin in 1912. The position of the particle was plotted every 30 seconds.
One-dimensional diffusions and their convergence in distribution 303 Let us turn to an example, one among many, which reveals the universal nature of Brownian motion in the sense that this process can be used to capture random walks; it is called the Skorokhod representation for a distribution. Assume (?, ?, P) also supports a random variable which has uniform distribution over [0, 1] and is independent of B. Let F be a distribution function on R with mean 0 and variance ?2 ^ oo. Then there exist random variables 0 = ?? < ? < · · · < ос such that ?(?„+\) — ?{?„) are independent and identically distributed and have common distribution function F, for /; = 0, 1,2, Moreover the {?„+? — ?„}^() are independent and identically distributed with mean ?~. This means that {?(??)}^0 is a random walk stemming from random variables having the given distribution F. As an aid to feeling more comfortable with Brownian motion, we mention an intuitive but weak approach to arriving at B. Let (?1, ?', ?') be a probability space that supports a process (Y„ )^=] of identically distributed independent random variables with mean 0 and variance ?2 < oo. Set ? ?? = ??>- i=\ Consider the following construction of functions ?„(·,?) eC[0, 1], we ?', by linearizing: 1 r-^J- 1 ?„(?,?)=—?5/_?(?) + j-= F У,-(?). ? J ? 1 ? J ? ? ifre[^,i]. ThusX„:i2'^· C[0,1 ]. One can show that the probabilities P' Xn ' onC[0, l]converge weakly to B, in symbols, P'X~l => Д>, that is f fd{P'X-')^j fdP0. for all real valued bounded continuous functions / on C[0. 1]. As a result, the scaled random walk derived from fair coin tossing converges weakly to normalized Brownian motion. This not only gives us insight into Brownian motion, but it displays the power of weak convergence: to reach such a rich process by a random walk generated by such simple random variables. Note that X„ in fair coin tossing has the property that the path speed is the reciprocal of the distance. In Section 4 we shall vary the speed in order to weakly approach general diffusions by "stretched random walks".
304 J.K. Brooks 2. One dimensional diffusions Introduction We shall now examine processes of a more general nature called diffusion processes. Such processes can be observed in common occurrences such as the spreading of color in a glass of water in which powdered dye has been added. The diffusion of the color is a consequence of the continuous motion of all the substances involved and it is the process by which irregularities in the concentration of the solution eventually disappear. There are many other examples of diffusions: diffusions of gases and liquids through solids, mixing in turbulent fluids, thermal diffusions, etc. These processes are somewhat reminiscent of Brownian motion. What are the significant characteristics of Brownian motion to define a general diffusion? As evidenced in Section 1, to deduce the fundamental properties of Brownian motion, the property constantly used is the strong Markov property: we want to retain the property that the past, given a Markov time, is independent of the future-a starting afresh at these random times. Let us keep this feature and specify that the diffusion traveler should possess no memory in this sense. What else? Staying in the space of continuous paths is not only useful analytically, but it is a reasonable requirement to impose on a particle traveling in what we might imagine to be a diffusion. And that's it! These two requirements are enough to define a real valued diffusion. Now, can we expect that there is any relationship between this new, very abstract process and that process endowed with so much structure and symmetry - Brownian motion? Feller's deep theorem provides a sharp, strong connection between the two processes. Here is a very brief sketch of this connection. A general diffusion will not possess the symmetry of Brownian motion, that is, if the diffusion particle is at a certain point in the state space, we expect it to move to the left or right with a probability (not 1/2 in general), which will depend upon its location in space. A function S, called the scale function, will govern this aspect of a diffusion's movement. Next, if we view Brownian motion as a weak limit of random walks, say, generated from fair coin tossing, these walks have a certain speed. If we tamper with this speed at each stage and location, we would expect the weak limit, if it exists.to be a process which travels at a different rate than Brownian motion, and this rate can vary from path to path. Surprisingly, pathwise a diffusion behaves like a state-changed Brownian path which is time changed by a function T(t.w), where the time change is derived from a "speed measure" m defined on the state space. The existence of such a measure, which can capture this aspect of a diffusion, requires quite a bit of analysis - but it does exist! Feller's theorem states that, in distribution, S and m characterize the given diffusion. We shall begin to sketch these essential components of a general diffusion. 2.1. The setting From now on, our sample space ? will be C[0. oof. Thus, the sample points ? belong to C[0, oof, with sample paths t -> co(t). Sometimes we also refer to ? as a sample path for brevity. Let X be the coordinate process defined on ? by X(t)(w) = w(t), for ? e ? and 0 <r < oo. Write ?(?,?) for ?(?)(?). Let B, or ?(?), be the ?-algebra of subsets
One-dimensional diffusions and their convergence in distribution 305 of ? such that all the X(t) are measurable. B(t) is the ?-algebra of events generated by X(s), for 0 ^ ? ^ t. Define B(t+) to be f][B(s), s > t]. We say that the random variable ? : ? -> [0, oo] is a Markov time or Markov if [? < t} e B(t), /or еас/г ?. 77?e ?-field of events prior to ?, ?(?+), consists of all A eB such that ? ? {? <t} eB(t), for each t. If ? is Markov define the shift operator #: ? -> ? by #?(?) =?(?(?) + ·). that is, ??{?) is the path whose value at time ? is ?[?(?) + s]. Note that ?? is a measurable map. 2.2. Definitions and properties For each jc e R, let P, be a probability on (?, ?). Then {Pv} reK is a diffusion if (i) Pv(X@)=jc)= 1, for each jc eR; (?) {?-heR has the strong Markov property, that is, if ? is Markov, the conditional Px- distribution of ??, given ?(?+) is ??? on {? < oo}. This means that for any ? e R, A e #(?+), А с {? < oo}, and В е ?, we have P,(A, ????)= [ ????10)).?)(?)??(??). J A The above is the precise statement of the intuitive notion that the diffusion particle starts afresh at the random times ?. We shall simply write {Px} in place {Px }x€&. Brownian motion, which of course is a diffusion, is denoted by [Bx] (here S, is the Px of Section 1.1). If ? > 0, then (??() is called Brownian motion with variance a-. Note that if {Px} is a diffusion and S: R -> R is continuous, strictly monotone, and onto, then, if Qx is the Ps-i(v)-distribution of {SX(t), 0 < t < oo}, it follows that [Qx] is also a diffusion. Now we shall introduce some definitions and results concerning diffusions, which will enrich our intuition and cast some light on their behavior. We shall always assume in this section that {Px} is a diffusion. A handy tool in establishing properties of a diffusion is the Blumenthal 0-1 law. which states that the ?-algebra ?@+) is ? -trivial, that is. PX(A) = 0 or 1 if A e ?@+). The proof follows easily from the strong Markov property applied to the trivial Markov time, ? ? 0. Indeed, for this ?, and A e B@+) since PX(A) = PX(A, ?? e A), and the later is equal to ? ?????)?)??? = [ PAA)dPx = PX(A)\ Ja J a since ??(?: ?(?(?)) = ?) = 1. Hence, an event which depends on arbitrarily small t > 0, happens or does not happen a.s. for each starting position ?. What about the behavior of the diffusion particle in the vicinity of ? in the state space? Three events are useful in this situation. The event {stay at ? awhile} is the collection of paths ? such that w(t) = .x, for t e [0, c]. for some positive с depending on ?; since this event belongs to B@+), it has Px measure 0 or 1. Call ? instantaneous if this value is 0,
306 J.K. Brooks and call ? absorbing otherwise; in the latter case, the paths are ? = ?, a.s. Px. The event {greater than ? near 0} is the collection of paths ? such that ?(?) > ? for i's arbitrarily close to 0. Define {less than ? near 0} similarly. By the continuity of paths, these events belong to ??@+), and thus they have Px probability 0 or 1. The diffusions we shall study here are the regular diffusions, that is, those diffusions such that P,(hit y) > 0 for all ? and y. Thus these diffusions get to visit any y, starting from any ?. From now on in this section, Px is a regular diffusion. 2.3. The scale function We shall now present a loose argument to show the existence of a continuous, strictly increasing, onto function S: ? -> R, such that for a < b, r,(hit b before a) = . for a ^ ? ^ b, S(b)-S(a)' where {hit b before a} is the collection of paths ? that take on the value b before taking on the value a. As in Section 1, if we define ?-(?) to be the least time t such that co(t) = z, and +oo otherwise, then {hit b before ?} = {?/, < ?(,}; this event belongs to ?(??,+). Before beginning, what would be a scale function for Brownian motion? Obviously, for a < b, Ва+ь (hit b before a) = 1/2, due to the symmetry of Brownian motion. More generally, by partitioning [a, b] by ? equally spaced points a = .x\ < ¦ ¦ ¦ < xn = b, if we release the particle at any x^, the particle will behave like a symmetric random walk on x\,..., ;t„, and this allows us to conclude that ?? — a Bx (hit b before a) = . b — a In general, one can show that S(x) = ? is a scale function for Brownian motion. If a diffusion has a scale function S(x) = x, we say that it is in natural scale. Back to the general regular diffusion {Px}. The idea is to define, for J = ]a, b[, a < b, the function Sj(x) = Px (hit b before a), for ? e J. By the strong Markov property, for a < ? < у < b, Px(hit b before a) = Px (hit у before a) PY(hit b before a), A) which is very intuitive, since you must hit у before ? if you hit b before a, but since the diffusion starts over again at y, the above equation holds. Hence it follows that Sj (x) < Sj (y) for such ? and y, after showing 0 < Px (hit у before a) < 1.
One-dimensional diffusions and their convergence in distribution 307 Let ? e ]a, b[; here is an indication why Sj is continuous at .x. If v„ | x, v„ e (a,b), then A) shows that Sj(x) = Pv(hit}>, before a)Sj(yn). Intuitively, P, (hit y„ before a) ->„ 1. hence Sj(yn) ->« Sj(x). This shows right continuity at ?. Left continuity is similar. So we have at our disposal, a strictly increasing, continuous function on J, which has the correct properties on J. The next job is to show that if J\ = }a\, b\ [ D J, then Sj] = Sj on J. After this, we obtain the desired function, which is denoted by S. Any other scale function for the diffusion is a linear function of S. We record for future reference the following. THEOREM 2.1. Let {Px} be a diffusion with scale function S. Let Qx be the Ps-i ^.^distribution of the process {SX(t): 0 ^ t}. Then Qx is a diffusion in natural scale. In view of this theorem, the characterizations of diffusions reduces to characterizing diffusions in natural scale. A diffusion in natural scale has the same state space features as Brownian motion. Now the task - a difficult one-is to show that a diffusion in natural scale is a time change of Brownian motion. To accomplish this task, we first approach the problem of describing the speed of a diffusion in natural scale in terms of its "speed measure" m. Once this measure is in hand, the desired time change is defined in terms of m and Brownian local time X(y,t,w): the time the Brownian traveler ? spends in (v. y + dy), up to time t, is ?(>\ t, ?) dy. Local time, worthy of study in its own right, will enhance our understanding of Brownian motion. 2.4. Speed measure In order to measure the speed of a diffusion particle, we need the following definitions. In this subsection, {Px} is always a regular diffusion in natural scale. Let J denote the interval [a,b], where —oo < a < b < oo. Define ? least time t, if it exists, such that w(t) = a or b, and ?/(?)={ I +oo otherwise. Then Ex(tj):= I rjdPx, xeJ, gives an indication as to the speed of the diffusion, that is. it seems reasonable to expect that if we know the expected times for a particle at ? to exit J. then this should characterize the "speed" of our diffusion. It turns out that there exists a unique measure m on B\. called the speed measure of {Px} with the following properties: m is finite on compact subsets
308 J.K. Brooks of R, m has strictly positive measure on all non-empty open subsets of R, and for each bounded interval J = [a, b] we have "/, Ex(tj) = / Gj{x,y)m(dy). for.xeJ. B) where Gj = Gab, is the Green's function defined on [a, b] ? [a,b] by: (x-a)(/)-y)(fc-a) -'. a Gab(x,y)=\ , l(y-a)(b-x)(b-a)-1, a ? !? у ^х ^ b. How does the Green's function make its appearance in the definition of speed measure? For an intuitive look into this, we sketch a proof for the existence of a measure m satisfying B) from Breiman A968). The proof presented there is for J = ]a, b[, and there is an extra factor of 2 in that setting, nonetheless, here is what is going on. Partition ]a, b[ by points a = xq < x\ < ¦ ¦ ¦ < x„ = b, for Xk = a +kS. Let Л = ]хк-1, *к+\ i ¦ Then Еч (?^) tells us how fast the particle is moving in a neighborhood of Xk- Introduce a process (Z„) only at the exit times from the ijt's. Then (Z„) is a symmetric random walk moving on the points xq,...,x„. Set n(xj\Xk) to be the expected number of times the random walk visits Xk, starting from xj, before hitting Jto or x„. Argue that ? EX/ (?/) = ^2n(xj;xic)E.4(Tjk). k=\ which is certainly intuitively obvious. Since nixj'.xk) is equal to \/8Gj(xj,Xk), by examining this random walk, it follows that EXj(xj)= / Gj(Xj,y)m(dy), C) where in is the measure that assigns mass \E4 (zjk) to x^. Now C) has the approximate form of B). Next, construct partitions ?„ of R of successive refinements with mesh Sn -> 0. Define m„ as in was defined and obtain a subsequence (шП|) which converges weakly to a measure m on B\ which ultimately satisfies B). Another defining feature of m is that for any a < a ^ ? < b, we have m(]a, ?]) = /+(?) — /+(?), where /+ is the right derivative of the function* -> ?, (ry). Now we have in hand a measure m connected with the diffusion {Px} which does in fact indicate the speed of the process. Crucial is the result that if {Px} and [Qx] are diffusions in natural scale, each of which has the common speed measure w, then Px — Qx for each ? e R. Thus, speed measure determines the diffusion, if it is in natural scale. This comes about by examining the infinitesimal generator ? of the semigroup {Tt}t^o of {Px}, where T,: Q,(R) -> Q,(R) is given by (Tlf)(x) = Ex(f(X(t))).
One-dimensional diffusions and their convergence in distribution 309 Here, Q,(R) denotes the space of bounded continuous functions on R. For / in the domain of ?, we have ? f = h, where f+(x)-f+(y)=f' hdm. Thus m determines ?, which determines Tt. which, in turn, determines {Px}. Now, how does m induce time changes of Brownian motion in such a way that the resulting process is the given diffusion {P,}? This is where Brownian local time enters. 2.5. Local time for Brownian motion In this section we shall introduce the fundamental notion of local time for Brownian motion - a tool used in defining time changes of paths. Let L denote Lebesgue measure; for A eB\, andi ^ 0, set ?(?,?)(?) = ?(?: 0? ^i. w(s) e A). Note that ?(?, ?)() is a measure on B\, with ?(?, ?)(?) = t. Since the image law will be used in the sequel, we mention that ?(?.?) is the image measure of L on [0. t] under the mapping ? -> a>(s). Trotter's important theorem (Trotter, 1958) states that for each ?, ?(?, ?>)() has a continuous density relative to ?: THEOREM 2.2 (Trotter). For each ? e ?, there is a jointly continuous nonnegative function (x,t) -> ?(?, t, ?) such that ? -> ?(?, t, ?) is measurable and ?(?,?)(?)= / k(y,t,a>)dy, J a for all t > 0 and AeB\,for every ? e ??, where ?? e В and Bx (??) = 1, for all x. This says k(y, t, ?) dy is the time Brownian traveler ? spends in (у, у + dy) up to time t. ? is called Brownian local time. All the following properties listed below hold for a.a. ?, that is, there is a set ?? e В such that ??(?0) = 1 for all jc, and the properties hold for ? e ??- These properties provide additional insight into Brownian motion. THEOREM 2.3. Properties of local time. A) ?(?+?,?)(?) = ?(?,?)(?)+?^,?,(?)(?), A e ?,). B) ?0\ t+s,w) = k(y, t, ?) + k(y, s, ?,{?)). C) k(y,0,w) = 0. D) k(y, ·, ?) is nondecreasing. E) //?@) eJ = [a,b], then k(y. ?/(?), ?) = 0 //у 5$аог .у > b. F) For each ?, lim k(x,t, ?) := k(x, oo, ?) = ос, а.е. ??;. t—>oc
310 J. K. Brooks G) For each ?, ?(?, ?,?) > 0 for all t > 0, a.e. Bx. (8) For each ? and y, lim,^^ k(y, t. ?) = oo, a.e. Bx. Again, the Green's function appears in an important context, and will connect m to our desired time change via ?. THEOREM 2.4. Let J = [a, b] and assume ? and у are in J. Then E,(X(y,Tj,))=2Gj(x,y). A useful property concerning ? is the following: Theorem 2.5. Iff e Q>]-oo,oo[ and F(t)(o):= f fX(s)(w)ds= [ f{yWy,t,o)dy, J0 J-oc (the last equality by the image law) then on {??, ?@) e J} we have F(tj(o)) = j f(y)k(y,Tj((o).(o)dy. 2.6. Time changing Brownian motion At this point, we are in a position (after glossing over a large number of technical lemmas) to obtain a diffusion {Px\, which is in natural scale, from a time change of Brownian motion. Assume {Px\ has speed measure m. Our goal is to define a time change ?:[0,??[?? -> [0,oo[, which is measurable and such that: T(-, ?) is a strictly increasing, continuous, 7@, ?) =0, and Hindoo T(t, ?) = oo (of course, all the properties are to hold outside a measurable set of Bx -measure 0). For an appropriate 7, which should, and does, depend upon m, we define a process ? by ?(?,?) = ?(?(?,?),?), that is, the Brownian path ? is time changed into the path t -> ?G(?, ?)). If {Qx} is the Bx-distribution of ?, we want \QX) to be a regular diffusion in natural scale with speed measure m, that is, Px = Qx for all x. How do we define such a Tl Let us back into the definition as follows. Let J — [a,b], and set rj := the exit time from J of the ? process; Tj(a>) will remain the exit time of
One-dimensional diffusions and their convergence in distribution 311 Brownian motion. Obviously, T~](tj(w),w) = ?) (?). Denote ? [(?,?) by ?(?,?). To show {Q?} has speed measure m, we must have Ex(A(tj))= ? Gj{x,y)m{dy), D) in view of B) in Section 2.4, where expectations are taken with respect to Bx. If we work with the right hand side of D), we see by Theorem 2.4 in Section 2.5 that j Gj{x, y)m(dy) = j l-Ex (?(.?. ту, (-))m(dy). but by Fubini's theorem, the latter integral is E,\l-Jk(y,Tj.())m(dy)\. which, in turn, equals, in view of Theorem 2.3E), eV-J k(y,TjA))m(dy)\. Thus, for D) to hold we need Ex(A(tj)) = eA1- ? k(y.Tj.(-))m(dy)\. Now is clear (cf. Theorem 2.5) what the definition of T(t, ?), or equivalently A(t, ?), should be. Define 1 ?0 ?(?,?)=-? k(yj,w)m(dy), ?&??}. E) ^ J-ос Then define T(t, ?) = A'1 (?, ?), for ? e ?() such that lim^oc A(t, ?) = oo and ?(·, ?) is strictly increasing on [0, oof. The set of such ? has В., -measure 1, for each ? eR. Note that if m is absolutely continuous with respect to L and rh = ^, then, by the image law, 1 f A(t,w)=- m(X(s)(w))ds. F) 2 Jo Here is another way to see that F) is the correct definition, in the case m <<C ? and rh is continuous. If ? < 0 < b and J = [a, b], then for J small, observe that ?0(?])= / Gj@,y)m(dy)= Gj@, y)m(vRv « —\a\b.
312 J. K. Brooks Since Eq(tj) = \a\b, we see that ??(?] ) ~ i«i@)?o(Ty). More generally Thus if it takes Brownian motion time At to exit from J, it takes ? roughly jm(x)At to exit from J. Since ? = XT, this means / rh(x)At \ 7 Г + -—^—,? - T(t,w)^ At. With smoothness, we have dT or iii w(jc) rTU-?) rh(X(S)(w)) which yields F). To further imagine the behavior of ? at time t and positioned at у е Е, let ?? =?(?,?) and?2 = T(t +At,?).Thus m(y) so Y(t + At, ?) - Y{t,a>) is governed by Brownian motion starting at у with the time increment ?2 - ?i ^ 2At/m(y). Hence y(i + At, ?) - Y(t) is distributed like у + BBAt/m(y)), that is, normally distributed with mean у and variance 2At/m(y). Now that we have our expression for T, what remains is to prove that ? (t) has the strong Markov property. To be precise, call ? Markov for ? if {? < t] e A(t) := a(Y(s): s ^ f); Л(г+) is defined in the obvious way. Then to establish the strong Markov property for Y, show ??(?} Y~[ is a regular conditional Bx-distribution for ?(? + ·) given ?(?+) on {? < oo}. On the way to establishing this, a number of technicalities must be dealt with, for example, for a.a. ?, show that 7" (? (?) + ?,?) = ? (? (?), ?) + T(t, ??{?(?),?)(?)). But in the end, all is well, and ? satisfies all our expectations. Finally, consider a general regular diffusion {Qx} with scale function S. Let {P,} be the associated diffusion in natural scale associated with [Q.x], that is, Px is the Qs-iurdistribution of {SX(t)},^Q. Let m be the speed measure of {?,? and let ? be
One-dimensional diffusions and their convergence in distribution 313 the associated time change induced by m according to E), where A = ? . Define ?(?)(?) = ?~??(?(?,?),?), and write ? = 5_1X7\ Then \Z{t)) describes the path functions associated with Qx(t -> 5_1(?G(?,?)))). In other words, Qx is the Ssi.rr distribution of Z. This gives the promised representation of a diffusion in terms of Brownian motion. 3. Weak convergence of diffusions Introduction In this section we shall examine the relationship between weak convergence of processes 5,7' В T„, where В is Brownian motion, the convergence of the time changes T„ and the scale functions 5„ (all terms are defined below). In Section 2, we shall show that if the T„ 's are inverses of functional A„ (not necessarily additive) and A„(t) -^ Ao@ for each ? = Bx, ? eR, then the processes BT„ converge weakly to BTq. In Section 3, we shall consider real valued regular diffusions in canonical form on C[0, oo[ and present necessary and sufficient conditions for weak convergence of these diffusions in terms of pathwise convergence of their time changes, weak convergence of their speed measures and pointwise convergence of their scale functions. The main theorem in this section is the following. If (X„),^l() is a sequence of diffusions in canonical form, then X„ converges weakly to Xo if and only if the diffusion paths converge uniformly on compact time intervals. All of these results are due to Brooks and Chacon A982). 3.1. Preliminaries As before, ? = C[0, oo[, and ? will be given the topology of uniform convergence on compact sets; ? = ?(?) was previously defined as the ?-algebra generated by the coordinate processes. Again, Bx denotes Wiener measure on ?{?) starting at x. A set of full measure ?? e В is a set such that ??(??) = 1 for every x. A functional (not necessarily additive) is a mapping A from [0, oof ? ? onto [0. oof such that there exists a set of full measure ?? such that A@, ?) = 0 and ?(·,?) is continuous and strictly increasing for ? e ?. ? will denote the inverse of A. In this section and the next, ? will be twice the ? in Section 2, to conform to the definition in Ito and McKean A974). B(R) will denote B\. All diffusions are real valued and regular. Concepts involving weak convergence can be found in Billingsley A968). Donsker's theorem is proved in Breiman A968). All additive functionals appearing in this chapter are of the form A(t,w)= I k(t,y,w)m(dy), G) where m is a speed measure defined on B(M.) and ? is Brownian local time. The diffusion X on ?, having 5 as a scale function and m as its speed measure, will be written as 5~' ВТ if 5X has ? = A-1 as its time change relative to Brownian motion, where A is given by G).
314 J. K. Brooks In general, without usually stating it explicitly, X„, S„. A„,m„ have the same relationship. The path function Y„(t) associated with X„ is given by Y„(t)(w) = S~l{co(T„(t, ?))}. The measure Q" associated with X„ is the Z?s;i (v)-distribution of Y„. If A„ is a (not necessarily additive) functional, then the process Yn, relative to {S,}, is still denoted by S~l BT„. It is not, in general, a diffusion. Write X„ =>· Xo if Q"x =>· Q°x, for each ? e E, where =>· denotes weak convergence. If (m„) is a sequence of measures on В(Ш),т„ =>? wo means j f dm,, —* f fdmK), for every continuous function on R having compact support. ? will always be a generic symbol for ? ?; write /„ —»p / or lim - ? /„ = / to mean convergence in probability. A sequence of processes Z„(?) is said to converge in probability uniformly on compacts to Zo@ if p( sup \Z„(t)- Z()@| >? ->;?0. foreach<i.<S > 0. Ve[0.</] / Say Z„(r) converges uniformly on compacts to Zo@ on i?o (almost surely) if ?„(·, ?) -> ??(·, ?) uniformly on compact sets for each ? e i?o (a.a. ?). The operator У*: ? ->¦ ? is defined by J," (?) = S„(x) + co. If jc is understood, write J„. Sometimes J. T, A. Q will be written in place of Jo, To, Ao, Qo- 3.2. The general convergence theorem Theorem 3.1. (a) Let (A„) be a sequence of junctionals (not necessarily additive) such that A„ (t) —» p A0(t) for each t > 0 and ? = ??, ? e ?. Then ВТ,, =>¦ ВТо, where T„ = ?,?1. (b) Suppose (S„) is a sequence of scale functions converging pointwise to So- Assume in addition, that A„(t, JJ) -> A0(i, J*), for each ? e К. Г/геи S S7„ =>· S^1 S70. PROOF. First we shall prove (b), since (a) follows from the special case S„(x) =x. As before, Q'{ is the Ss„(.,}-distribution of Yn. We must show that for h : ? -> R bounded and continuous and for each jc, we have j hdQ'[ -> f hdQ®.. We shall do this in several stages. We shall need the following lemma. LEMMA 3.2. Lei {(^„(?,?)},,^ be a sequence offunctionals. IfC„(t) -+p Co(t),for each t < ??, then lim„ P(sup,^,() |C„(r) - Co(OI ><5) = 0, for each ? > 0. PROOF. For simplicity, let ?? = 1 · Let qk(w) = maxCo((; + 1)/*, ?) - Coij/k, ?) > 0. .К* Since Co(-,oi) is uniformly continuous on [0, 1], qk(oS) —*k 0 for each ?. Let ? > 0 be given. By Egoroff's theorem, there exists a measurable set ? such that ?(?') < ?/2 and
One-dimensional diffusions and their convergence in distribution 315 <7„ converges uniformly to zero on A. Let ? > 0 be given. There exists a Icq = ko(S) such that 0 ^ <7„ (?) < ?/2 for all ? ^ ко and ? e A; hence />(?„ > 5/4) <: P(AC) < ?/2 for all n^- ко- Thus for и > &o, p(max(Co((;· + l)/n) - C{){j/n)) > ?/2] < ?/2. (8) Let Ak(l = (max(Co((; + l)/*o) - Co07*o)) > ??). Since C„(j/ko) -»? Со(;До), j = 1,2,... До» there exists an ? ? = ?\(?,?), ?\ > ко, such that if ??, = (|C„07*o) - Co07*o)|) > Я/2, then P(Aj,„) <<5/B?0),for/i >«|. ;' = 1,2 ко- We now assert that pfmax|C„@-Co@| > s) <?, forn>n\. (9) To see this, fix t e [;До, 0' + l)Mo] for an appropriate j. Let *0 Q„ = AkaulijAjA note that P(Qn) < ?. Fix ? e ?^,'. Case 1.0^ C„(i, ?) - Co(f, ?). In this case <КС„(г,<ы)-Со(г,<ы) < C„@· + l)/*0, ?) - Co07*o, ?) = С„(ОЧ- 1)/?0,?) -Co@4- 1)/*?,?) + Со@' + 1 )/*о, ?) - Со07*о, ?) ?? |С„@" + D/ko,co) - Co((j + 1)/*?,?) + |С0@"+ 1)/*?,?) - Со07*о.ш)| < ?/4 + ?/4, smccwe(Acj+lM)n(Alo).
316 J. K. Brooks Case 2. 0 ^ Co (?, ?) — C„(f, ?). In this case <КСо(г,ю)-С„(г,ш) <Со(ОЧ-1)/*о,ш)-С„07Ло,ш) = Co@" + 1 )/*o, ?) - Co07*o. ?) + C007*o. ?) - C„(j/k0, ?) ^\Co((j + l)/ko,a>)-CQ(j/ko.a>)\ + \CoU/ko,(o)-C„(j/kQ,a>)\ < ?/2 + ?/2, sincewe(/V:„)n(/i<:(]). As a result, if ? e ??, where ?(?„) < ?, we have \C„(t, ?) - CoO. ?)I < 5. for all t ^ 1. Hence pfmax|C„(i)-Co(i)| >г) <?, forn>«i, A0) which proves Lemma 3.2. ? We now continue the proof of the theorem. To show convergence in probability uniformly on compacts of the T„, we need the following lemma. LEMMA 3.3. Let (A„)„^,0 be a sequence offunctionals such that A„(t) -+p Ao@ for each t. Then if (Tn)„^o are the associated inverses, T„(t) -+p To(t) for each t\ hence, by Lemma 3.2, Tn —» 7"n uniformly on compacts. PROOF. Step 1. Fix t. For each ? > 0 there exists a constant К and sets ?(?,?) such that ?(?(?,?)) ^ 1 - ? and ?,,(?,?) ^ ? for each we ?(?,?), for each n. To see this, note that for every ? > 0, there exist sets E„ such that P(E„) ^ 1 — ?/2 and an integer no such that \?„(?0B?,?))-??(??B?,?),?)\?? (?) for n > щ and ? e E„. This follows since TBt, ?) is bounded on a set of measure close to 1, and given the bound large enough, say K, use the fact that A„ -> p Ao uniformly on compacts, in particular [0, K]. Furthermore, since AoGbBf, ?), ?) = 2?, |?„(?0B?,?))-2?| <Cjj, ???,,,???- U2) Hence, 2t - ? ^ A„GbBf, ?), ?) ^ 2? + ? for ? e ?„. n ^ ??· Since /j is arbitrary, we may conclude that t ^ ?„G?B?,?),?) foroief,,. //>//().
One-dimensional diffusions and their convergence in distribution 317 Thus T„(t, ?) <: T„(A„(T0Bt, ?), ?)) = T0Bt, ?). Since ToBt) is bounded on sets of measure arbitrarily close to 1, there exists a set С such that P(C) ^ 1 — 5/2 and 70B?. ?) ? ?, for some К, whenever we С. Note that К does not depend upon n. Let ?(?, ?) = ?„ ? С; hence P(/i(n, 5)) > 1 - 5 and for ? e /}(«,<$), ?,,(?,?) $: ?,?^ no. This completes Step 1, since we can assume ? works for и ^ «o- Step 2. Let К be the К appearing in the statement of Step 1. Define D'n(K) = (\A„(u) - A0(u)\ < ?, и ? ?). Since А„(и) ->р Ао(и) uniformly on compacts, lim„ P(Df,(K)) — 1. This implies that |?„G„(?,?))-?0G;,(?.?))| <? for ? e Dj(/f) ? ?(?. ?) and P(Dj(i)n4(n,{))> 1 -2?, ????^?(?). This in turn implies that |?-??G·„(?,?).?)|<? for ? e Dfn = Dfn(K) ?) ?(?, ?). A3) Note that P(D^) ^ 1 - 2<$. Now suppose that 7„(f) -^p 70(i). Then there exists an infinite number of integers (и,-) and /j > 0 such that ?(\???(?)-?0(?)\>?)>?. (?) In a view of A3), we see that |Ao(j, ?) - ?| < ?. we D^, ? >/1(?). A5) for s belonging to an interval having 7b (f, ?) and 7„(f, ?) as its end points, since Aq is increasing and ??G?(?,?),?) = ?. Now let ? e Een. = Df,. ? F„,, where F„, = {?: |7"„, (r) - F(,(i)| > /?}. Take ? < ?/4. Then |?0E,?) - t\ < ? for s ?]70(?,?), 7"„,(?, ?)[ or s ?]7"„,(?,?), ?0(?,?)[ and ? e ?*., where P(E?.) > ?/2. For each ?, let ]af (?). /3f (?)[, &(?) ^ К, be the largest interval such that \Aq(s, ?) - t\ < ? for s e ]af (?). ?((?)[. The above argument shows that ??(?) - ??(?) > ? for ? e Qe = U„,>»(f)?», and p(Qe) > l/2> where 1 was independent of ?, and ? was arbitrary. Take ?(;) { 0 and Q = limsup<2f("; we also have ]a?li)(co),PHi)(w)[l.Let [?(?),?(?)] = [>\[???)(?),?,0)(?)]
318 J.K. Brooks and note that ?(?) - ?(?) ^ ?. But ??. ?) s t for s e [?(?), ?(?)] and ? e Q, which is a contradiction since there is a set Д>, P(i2o) = 1, on which A() is a strictly increasing. This establishes Lemma 3.3. ? Next, let h : C[0, 1] -> Ш be uniformly continuous and bounded, where C[0, 1] is given the metric induced by the uniform norm || ¦ || on C[0, 1]. In the sequel, S~]TJ will denote the mapping defined on C[0. oof as follows: (S-lTJ)(a>)(t) = S~l {J(co)(T(t. УМ))} = S~l {S(x)+c(T(t, J (?)))}. S~' T„ J„ is defined in a similar fashion. We now need the following lemma. Lemma 3.4. lim„ -BQh[S~{T„J„] = h[S~^TJ]. PROOF. Step 1. We assert that given ? > 0 there exists a constant К and an integer h such that for и > й, ?0{(?: \\(T„J„)(w)\\> ?)?(?: \\(TJ)(co)\\>K)}<a. A6) To see this, find a constant L such that bJw. sup 17" (r, J(w))\ > L-\) < a/8. A7) Such a constant L exists since the path T(-, J (?)) is bounded on [0, 1] for each ?. Note that T„(t, J„) -^-Ba T(t. J) uniformly on compacts. Hence there exists an integer и ? such that if ? ^ ? ?, ?0(?: sup\T„(t.J(w))-T„(t. J(a>))\ > l) < a/8. A8) Let Z, „ = f sup|7(i. ?)| > L - \) U (sup|Г,,(г. Л) - 7"(i, J)\ > l); A9) then ??(?|.„) < a/4. Define <Sn:C[0, L]^R by 8,,(?) = sup{|w(i) -co(s)\: \t -s\^h. 0???. s ? L\. Since ????_???) = 0, by Egoroffs theorem we can find an ? > 0 such that if Zi = (8e > l),then B0(Zi) < a/4. B0)
One-dimensional diffusions and their convergence in distribution 319 Next, define Z3.,, = (supi<:| \T„(t, Jn) - T(t. J)\ > ?). There exists an rn_ such that if ?0(?3.„) < ?/4. B1) Finally, since the curve (TJ)(co) is bounded on [0, 1] there exists a constant К such that if Z4 = (?: sup,^|G7)(i4)(f) > К - 2), then S0(Z4) < a/4. B2) Define Z„ = Z|.„UZ2UZ3.„UZ4. Let \S„(x) - S{x)\ < 1 for ? ^ щ. Define й = max(/j|. /г2,«3)- Note that Bq(Z„) < a for n^ ?. Pick ? e ?',; f ^ 1. Since ? e Z\ |7„(?,?(?))-7(?.7(?))|<;?. Also, ? e Z\ n implies that T(t, J „(?)) ? L - 1 and, by A9), 7„(f. J „(?)) ^ L. Now use the fact that ? e Z? П ?'? ;1 and the preceding inequalities to obtain \j„(u)[Tn(t,J„(w))]-J(cu)[T(t,J(cu))]\ ? \S„(x) - S(x)\ + \co(T„(t. ??))) - co(T(t, J(?)))\ ? 1 + 1=2. Thus |?„./„(?)| <;2 + \\TJ(w)\\ <^2 + K-2=K for ? ^ ? and ? e Z'ir Step 1 now follows. To complete the proof we need to show that given ? > 0. ? > 0 if Z„j is defined by Z„.s = {\h(S-]T„Jn) - h{S-]TJ)\ > S). B3) then there exists an n such that whenever n ^ n, Bo(Z„.s) < ?/?(?„?) < ?. B4) As we argued earlier, there exists a constant L and integer n \ such that if Z\.„ = (?: sup7„(f. J„(co)) > L) U (?: sup7(f. J (?)) > LI.
320 J. K. Brooks then for ? ^ ? ?, So(Z|.„)</j/8. B5) Next, by Step 1, there exists a constant К and an integer ni such that if Z2.„ = (||7-„У„|| > A") U (Ц7-УЦ > A1), then for и > пг, Bo(Z2.„) < ?/%. B6) Note that since 5" is increasing and the 5",, 's are onto E, S~' (y) -> 5""' (y) for each y. The monotonicity of S~' and S_l, for each и, implies that S~] -> 5" uniformly on [-AT, AT]. Since /г is uniformly continuous, there exists an ? > 0 such that ||a)| — ?>2||<? implies |/г(а>|) — h(coi)\ < S. B7) There exists an /73 and a ? > 0 such that \?\-??\??/2, \?[\^?, IjoI^AT, On.i B8) implies l^'Ui)-^1^)^?. B9) Let S/, be defined as in Step 1 on C[0. L]. By Egoroff's theorem there exists a p > 0 such that S0(Zo)</?/8, where ?0 = (^ > 0/2). C0) Define Z3.„ = (?: sup|7„(f, 7„(?)) - Г(г, 7(?))| > ?). C1) V ?? There exists an «4 such that ??(?3.„) < /j/8 whenever n > 114. C2) Let h = max(/i ?, /72, из, «4, ?.?, where /75 is an integer such that \S„(x) — S(x)\ <?/2 whenever/j >/is. C3) Set ?,? = Z|.„ U Zi.„ U Zs.n U Z().
One-dimensional diffusions and their convergence in distribution 321 Note that Bo(Z„) < ?. Fix ? e Zln and n > n. Since ? e Z\ n, sup17), (?, У,, (?)) - 7"(?, 7(?))| ? ?. Also note that || 7„У„ (?) || ^ К and \\TJ(a>)\\ ^ ?.Now we Z\' „ implies that T„(t, J„ (?)) ? L and T(t, J (?)) ? L, for every t ? 1. Fix t ^ 1; since we Zc0, \M(o)(T„(t, J„(a>))) - J(o)(T(t. J(a>)))\ ? \S„(x) - S(x)\ + \o(T„(t, J„(a>))) - a>(T(t, J(w)))\ ??/2 + ?/2 = ?. Hence that is, \?„??)(?)-5"' \TJ(w)(t)\ <?, |5,7?7"„7(?)-5"?7(?)| ^ ?. This in turn implies IMS',;1 T„J(w)) - h(S~] TJ(?))| < 8, which completes the proof of B4), and establishes the lemma. ? Continuing the proof of the theorem, we now show that, for fixed x, Q" =>· Qx. To this end, it suffices to show, if h: C[0, ?] -> R, ? < oo, is bounded and continuous, then f hdQ" -> f hdQx. We may assume /? is uniformly continuous on C[0, M]. For simplicity, set ? = 1. Observe that I/hdQ1'- J hdQx\ = \j h{S;]T)dBs„lx)- fh(S-]T)dBSlx] [h(S-^TJ„)~h(S-]TJ)]dBQ. /1 By Lemma 3.4, the integrand converges to zero in So-probability. The Lebesgue dominated convergence theorem implies f hdQ" -> fhdQx; hence Q" =>· Qx. This completes the proof of the theorem. ? 3.3. Convergence of diffusions The associated path function Y„ for the diffusion 5~' BT„ was defined in Section 3.1. All functionals in this section are additive. Since the scale function is determined up to an affine transformation, for convenience set 5@) = 0 and 5A) = 1. A diffusion 5 ВТ. where 5 is normalized in this fashion, is said to be in canonical form.
322 J. K. Brooks THEOREM 3.5. Let (X„)^_0 be a sequence of regular diffusions in canonical form. Then (X„) converges weakly to Xo if and only if the associated paths converge a.s. uniformly on compact time intervals. The proof of the above theorem follows from the following theorem and its corollaries and will be presented following the remark after Corollary 3.8. Theorem 3.6. (A) Let (?„)«5?? be a sequence of regular diffusions such that Xn =>· Xo- Suppose S„ is a scale function for X„. Then for a ^ .? ^ b, [Ь,(х)-T„(a)][J„(b)-Jn(a)]~l -»· [J(x)-J(a)][J(b)-J(a)]~l. Suppose that we chose definite representatives S„ from the equivalence class of scale functions S„ for X„, say Sn@) = 0 and 5„A) = 1, ? = 0, 1, 2 Then S„(x) -> Sq(x) for each x. If Xn = S~l BT„, where T„ is the appropriate time change of S„X„, relative to Brownian motion, induced by the speed measures m„, then m„ =>·(- wo. Also T„ (t) -> 7b(i) and Y„ (t) -> Yo(t) uniformly on compacts a.s. Bx, for xeR. (B) Suppose {Sn)„^o and (T„)n^Q are sequences of scale functions and time changes, respectively. Assume that S„{x) -> 5?(.?) for each ? and T„(t) -^-p 7b(i) for each ? = Bx. Then mn =>·<. тц. Hence T„(t) -> 7b(i) and Y„(t) -> Yg(t) uniformly on compacts a.s. Bx, ? e R. Also S~l BT„ => S~] BT0. PROOF. Before proceeding with the proof, we shall outline some of the main steps leading to the fact that weak convergence implies mn =>·< m and the time changes T„ converge a.s. uniformly on compact time intervals. To avoid technical problems in this discussion, assume that all the diffusions are in natural scale. Our job is to show BT„ =>· ВТ implies m„ =>·<. m. First of all note that (m„) is bounded on compact sets. To see this, suppose that mn([a, b]) -> oo. Argue that there is a point у and an interval /. у e /. and a time ?? such that the set E\ of all ? such that ? (??. л", ?) > с for some с > 0 and for all ? e / has Bo measure greater than 1 — ?, where ? is arbitrary. Furthermore, m„(I) -> oo. This shows that ?„(??, ?) -> oo, for ? e E\. Assume that ?(??, ?) > ? for ? e E\. Then choose a constant ? large enough so that if Ej is the set of ? such that sups^,() \w(s)\ < M, then B(Ei) > 1 - ?. Let ? be the set of ? such that sups^ |?(?)| < ? for a fixed p; we then show that if Q is the measure representing Xq, then Q(dZ) = 0; hence 6(J(Z) -> Q(Z). But on E\ П ??, ?„(?,?) < ??, which means that X„(n) has its range bounded by ? with probability greater than 1 — 2?, which contradicts the recurrence of Xo· The contradiction comes from the fact that on a large set, X„. for large times n, behaves like Brownian motion up to time ??. Obtain a measure ? such that m„k =>·< ? for some subsequence (in)', assume пи = к. Next we show ц.([с,а]) > 0 if с < d, that is. ? is a speed measure. Suppose m„([c.d]) -> 0. Let ? be the midpoint of [c. d]. Obtain a set E\ of B: measure greater than 1 — ? such
One-dimensional diffusions and their convergence in distribution 323 that for a>'s in this set, ? (??, у. ?) < ? for some ? and ??. for all v e [с, d]. Obtain a number ? such that if Ei = ? ?: sup|<w(i) — ;| > ?I, then Bz{Ei) > 1 - ?. Then for ? in ?| П ?2.01 = ?„(??. ?) -> 0. Let L„ be the set of ? such that sup,,^,, \?{?) - z\ > ?. Choose no large enough so that Q-(L) < 1/2, where L = Lll(); set ? = l//;o- Consider only /;'s large enough so that c„ < t. Since Qz(dL) = 0, Q"-{L) -> <2-(L). Note that T„(c„,w) > ?? on a set of measure greater than 1 — 2?, which means that on a large set the processes X„ in a small time interval [0, c„], behaves like Brownian motion in a time interval at least as large as [0. ??]. This means that the process X„ travels quite a distance in a very small time interval. On a large set, sup5^, I X„ (s, ?) | > ? when ? is a large, but the limit of the Q". measure of L must be less than 1/2, which is a contradiction. Thus m„ =>·<. ?, where ? is a speed measure. If ? denotes the time change induced by ?, it follows that ? = m, since ВТ is equal to ВТ in distribution. Hence m„ =>? m, which, since ? (?. ·,?) is continuous and has compact support, implies ?,,(?.?) -> ?(?.?), where the convergence is uniform on compact time intervals; this implies the uniform convergence of the paths. Additional estimates have to be made if the diffusions are not in natural scale. Now we begin the proof. To prove the first part of (A), by definition of scale function, we need to show that Q'l(hit b before a) -> ?°(hit b before a), C4) where Q" is the measure on ? corresponding to the diffusion X„. Let ? = (?: ? hits b before ?). The above limit holds if we can prove that Q(](dA) = 0. To show this, let ?? be the first time ? hits ?; ?/, is similarly defined. Let A ? = (?: ? hits b before a and max ? = b on [?. ?(/(?)]); ?? — (?: ? hits a before b and mina) = a on [?. ?/,(?)]). We assert that 3? с A\ U Aj. Suppose ? e (A| U Aj)c and ? hits b before a. Since ? e A,, maxoi = &o > b on [0. ?(/(?)]. Obtain ?? e [0. ?„(?)] satisfying ?(??) = bo. Let ao = mina» on [0, ??]. Set 0 < ? < min(ao —a, bo - b).To show ?(?,?) С /i, where S(w,e) = (a/: sup|a/(i) - ?(?)| < ?. ? e [?. ?(,(?)]). let ?>? e 5"(?,?). Since ??(??) > b, but a>| does not hit a in [0. ??], ?? e /i; hence ? e (dA)c. In a similar fashion, А, С C/i)'. Since 34 С A| U A2, it suffices to show that <2°(A| U At) = 0. Define D = (?: ?@) = fo and ?(?) ^ b for all .s ^ ? (?) for some ? (?) > 0). Let ? = ?/,. Since Xo is recurrent, ? < oo a.s. Q°. If ft denotes the shift operator, then A1 С (?? e D). By the strong Markov property ?.?(??) = ??(?<??.?,)< ? Ql(D)dQl = Q[l{D).
324 J. K. Brooks Since ?0 its regular, Q°b(D) = 0. Likewise Q°f(A2) = 0; hence Q°x(dA) = 0. If 0 <; ? <; 1, C4) implies 5„U) = [S„(x) - 5„@)][5„A) - ^(O)] converges to 50U), from the definition of scale function. Using C4) again, and combining either 0 < ? < 1 < у or у < 0 < ? < 1, we see that S„(у) -> 5o(y). Next we assert that the (m„) are uniformly bounded on compact subsets of Ш. If not, suppose that supw„([— 1, 1]) = oo. Using a nested interval argument and passing to a subsequence (m„{k)), if necessary, we can find a point у e [-1, 1 ] such that for every open interval / containing у we have mn{k)(I) -*k oo. Since ?(?, y,a>) -> oo a.s. Bo, where ? is local time, we can find a ??, for any given К and ? > 0, such that ? (??, }', ?) > К for ? e ?(??), where Bq(A(K)) > 1 - ?. By the continuity of ? (??, ·. ?), for ? e A(K), there exists a symmetric interval /(?) about у such that ?(??. у. ?)^ ?/2 for all * e /(?). Take ay I 0 and consider Dj = (?: ?(?0, ?, ?) > .?/2 for all jc e //), where /y = [y - ay-, у + ay]. Note that Dj is measurable since it can be expressed as an intersection of sets (?(??. r. ?) ^ AT/2) for rational r e /y by the continuity of ?. If ? e A(AT) and ;' is large enough so that ? (?) D Ij, then ? e D;. Since Z)y \ we have lim Bq(Dj) > 1 - ?; hence there exists an integer ;'@) such that Bo(?>/@)) > 1 - ?; hence there exists an integer ;'@) such that /?о@,@)) > 1 - ?. This implies that AlHk)(to,o))= I k(tQ,z.w)mlHk)(dz)^ / k(t0,z.(i))m,Hk)(dz) ¦I JljM\ > (K/2)m„(k)(Ij@)) -»· oo. for ? e Dj@). Assume, without loss of generality, that A„(??. ?) > н, н = 1, 2,..., for we E\ = Dj(Qy This in turn implies that ?„(?,?) < ??. ? = 1.2 Now choose a constant Mo large enough so that Bo(Ei) > 1 — ?. where ; — ? ?: sup|a)(i)| < Mo I 1 -?. Thus B0(E\ П E2) > 1 - 2?. Choose ? such that ? is larger than \Sq] (-M0) - 11 and 5(y' (M0) + 1. We may and shall assume, since Qq has only countably many atoms, that 2?(?: ?(/?) =-M or M) = 0. for each n. Let ? be a fixed positive integer and define ? = (?: sup|a)(i)| < ? I.
One-dimensional diffusions and their convergence in distribution 325 Assume for the moment that Q^idZ) = 0. With this assumption, we have Qq(Z) -> 2q(Z). Let и > max(p,/ц), where [S~l (-Mo), S,;1 (Mo)] С [So1(-Mq) - 1 · Sq1(Mq) +1] for ? > ??. Fix ? e ?| П ?2· Then 7",,E, ?) < ?? for all s ? p. Since ? e ?2. |?G"„(?, ?))| < Mo, for s ^ p. Hence S-l(w(T„(s,w)))e]-M,M[. Since Qq is the So-distribution of the mapping Y„ :? -> 5~' (?G"„(·. ?))), we have ??(?)^?0(????2)> ? -2?. This in turn implies that Q®(Z) ^ 1 — 2?. Now let ? -> oo and conclude that ??(?: |?(?| <; ? for alb) > 1 - 3?. which is a contradiction since Q° is recurrent on R. To show that Q$(dZ) = 0, define the following sets. A\ is the set of paths ? such that ? reaches ? in the time interval [0. p] and stays less than or equal to ? for a positive time ? (?) after hitting M. Ai is the set of paths that hit —? in the time interval [0. p] and stays greater than or equal to -M for a positive time ?(?) after hitting -M. A3 is the set of paths that hit - ? or ? at time p. We assert that dZ С A| U A2U A-,. Choose ???,? A, U A'v If ? hits ? in [0. p] but goes above M, and attains a maximum, say M\ in [0, p], then if S(w,e) = la/: sup|ai'(i) — ?(?)| < ? ?. where 2? = M\ - M, then ^(?, ?) П ? = 0. Hence ? ? 9Z. Similar reasoning applies if ? goes below — ? in [0, p]. Suppose that ? always stays below ? and above —? in [0, p]. Let 2? be the minimum of ? - M\ and m\ + M, where M| and mi are the respective max andminof ? in [0, p].ThcnS(w.s)(~)A' = 0; hence ? ? дА. We know ?0'(?;0 — 0 from the choice of M. To show 60'(A|) = 0. let ?(?) be the first time ? hits ? in [0, p], if it exists, and +00 otherwise. If С is the set of all paths ? such that ? @) = ? and there exists a ?(?) > 0 such that ?(?) ^ ? for .s e [0. ? (?)], then, if ?, denotes the time shift operator, we have Qo(A\)= Q°0(t < 00. A,) ^ ?((>(? < ??.?? e C) Bм(с)^еоЧем(с)=о. ? (т<ос)
326 J. K. Brooks since Q° represents a regular diffusion. Likewise. Q^iAi) = 0. Hence Q^(dZ) = 0. This completes the proof of the fact that (mn) is bounded on compact sets. As a result, we can obtain, by a diagonalization process, a subsequence (m„a)) and a measure ?, finite on compact sets, such that mll(k) =$c ?. Now we shall show that ?[?, b] > 0, whenever a < b. If we deny this, there exists a further subsequence m^y, such that mk{j)[a,?>]-»/ 0. For notational convenience, assume that m„[a,fo] -> 0. Let г = (? + b)/2. Let ? > 0 be given. There exists a io such that BZ(E\) > 1 — ?, where ??: sup|a)(i) — z\ < (b — a)/3j. Using the continuity of the distribution of the maximum appearing in ?|, there exists a ? \ such that if \z' - z\ < 8\, then S-*(?|) > 1 - 2?. For a.a. ?, sup ?(??, ?.?) < oo; уe[«.ft) hence there exists a constant ? such that B-(Ei) > 1 - ?, where ?2= (?: sup ?(??. ?.?) < Ml. ye[«.fc) ' ' Using the continuity of local time, there exists a <5i such that if |;' - z\ < <5i, then ?:·(?2)> ? -2?. Finally, choose a p > 0 such that В (?-0 > 1 — ?. where 3 = ??: sup |?(?) — z\ > ?). Obtain a <$-, such that if \z - z\ < <5ч, then S-*(?;0 > 1 - 2?. Now let ? e ?| ? ?2. We have ?„??,?) = I k(h),y,w)m„(dy)= ? ?(?(). v.o))m„(dv) ^ Mm„[a, b] = c„ -> 0. For0>O, ?°_, (L„)->0, where L„ = ??: sup |?(?) — 5() ' (z)\ > ?). «<!/;:
One-dimensional diffusions and their convergence in distribution 327 by the continuity ofthe paths ?. Choose an но@) so that Q° . (?„,,(«>) < l/2.Wechoose S() (,) ? as follows. Let 3? be the minimum of S~' (z) - S~' (z - p) and S(y' (z + p) - 5(y' (z). Choose и ? so that for ? > и ? we have min{5,71(z)-5,7l(c-p).5,7l(c + p)-5,7l(c)}>2^. With this value of ?, let F = 1/но@). Set L = L„o(«). Assume for the moment that ?° (9L) = o. Hence we have ?"_, (L) -> ?0,., (?). Choose и2 > n\ large ¦>„ (.-) ^ S„ (;) 5() (:) enough so that c„ ^ Ffor all ? > hi. If w e E\ ? ??. ?.ч, we have sup|wG"„E,ii;)) - z| > sup |?G„(?.?)) - z\ > sup |?(?) - z| > p. because T„(c„,w) > ??, when ? e ?| ? ??· Hence, for ???|???(? ?.i and /7 > «2, we have sup|5,71(wG1,(i.w))-5,7l(z)| ^min{5,7lU)-5,7l(z-p).5,71(c + p)-5,71(;)}>2^. Since S~l(z) -> 5G' (z), we can find an integer/73 > «2 such that if ? e ?| П ?2 П?3 and ? > из, we have и <Г If Y„ denotes the function C[0. oof -> C[0. oof, defined by ^,,(?) = S~l {w(T„(-.w))}, we see that if w e E\ ? ?? ? ?3 and /7 > «3. then sup\Y„(w)(u)-S-\z)\>e. «? that is, ?| П ?2 П ?-, С У,~' (L). Choose «4 >/м such that IS^ST1 (;))-;I < min(<S|. S2. 83) for alb; ^«4.Now ?" , is the ?,, ,c-i... -distributionof У„; hence ?5|7,<:'?) = ??»(?,<:)>?"??')^?^'\-,'.-''(????2???)^1?· for и > ид. Hence 1/2 > ?" (L) = lime" , (L)>l-6e. which is a contradiction since ? was arbitrary.
328 J.K. Brooks The fact that <2"_, (9L) = 0 is proved in a fashion similar to the one used in the S() (;) preceding arguments concerning boundaries of subsets of ?. We omit the details. Returning now to ?, we have shown that ? is finite on compact sets, and is strictly positive, that is, on non-empty open sets, ? is greater than zero. Hence ? is a speed measure. For notational convenience, assume that mn =>·<- ?. Let ?(?,?) = ?(?,?,?)?(??). Since ? is a speed measure, A is strictly increasing and continuous on a set of full measure. Now A„(t,J„(cu)) = /„(?) + /„(?). Note that |/„(?)| < / |?(?, у - S„(x),a>) - k(t, у - SOU), o))\m„(dy). By Trotter's Theorem 2.2, ?(·, ·, ?) is jointly continuous in t and у for ? belonging to a set of full measure ?®. Since ?(?, ·, ?) is uniformly continuous on compacts, (w„) is uniformly bounded on each compact set, and k(t. ·, ?) vanishes outside of a compact set, we see that ?„(?) -> 0. Since m„ =>·<· ?, /„(?)-»· A(t,J0(a>)). Hence An{t, J „(?)) -+ A(t, Jq(co)) for each t and ? e ??- ?? a similar fashion, ?,,(?,?) -> ?(?.?). ? e i?o· Hence by Theorem 3.1, we conclude that S~lBTn =>· Sq1 ВТ. By the uniqueness of traversal times for a Markov process, since 5~' BTn =>· Sq1 BTq, we conclude 7 = 7b a.s. P. Alternately, since SQ] ВТ is distributionally equal to S^1 BTq, and ? is the speed measure for the diffusion S^ ВТ, m = ? by the uniqueness of speed measures. Since the above argument was valid for any subsequence of (mn), we conclude that the full sequence (w„) satisfies mn =>·< w.Thus A„(t) -> Ao@ uniformly on compacts on i?o, and the same is true for T„(t) -> 7b(f). Since 5,7' -> 5^' uniformly on compact subsets of E, one can show that Y„(t) -> У(>@ uniformly on compacts on i?o· This establishes (A). / k(t,y.S„(x)+w)m„(dy) J [k(t, у, 5„U) + ?) - ?(?, у, 50U) + ш)]|я„(dy) + / k(t,y,S0(x) + cu)m„(dy)
One-dimensional diffusions and their convergence in distribution 329 We now turn to the proof of (B). Assume that T„(t) -*p ??) and S„(x) -> S(x), ? = Bx, for each x. Using Theorem 3.1 and the lemmas appearing in its proof, we see that BTn =>· BTo- Then by (A) above, m„ =>·(. тд. By arguments used in the proof of (A), we see that A„(i,y,f)-»· A0(i.y0v) ???0· By Theorem 3.1 we conclude 5~' BT„ =>· S,^ BT<). Also wj„ =>¦<· /wo implies 7„(f) -> 7b@ and y„(i) —>¦ Yo(t) uniformly on compacts on i?o· This completes the proof of Theorem 3.6. ? Corollary 3.7. // A„(t) -^p Ao@ for each ? = Bx and t ^ 0, then the additive junctionals A„ converge uniformly on compact sets on a set of full measure. PROOF. If A„(t) -^p A0(t), then by Lemma 3.2, T„(t) ->p T0(t). Thus by (B) in Theorem 3.6, mn =^f mo· The proof of Theorem 3.6 shows that A„(t) -+ Ao@ uniformly on compacts on a set of full measure. ? Corollary 3.8. S~x BT„ =>· S~[ BT0 if and only ifT„(t) -> 7@ and Y„(t) -> Y0(t) uniformly on compacts on a set of full measure, assuming the S„ are normalized as in (A). In particular, BTn =>· BTq if and only if T„(t) -> 7o(i) ?/iii ?(?„(?,?)) -> ?G?(?,?)) uniformly on compacts on a set of full measure. Remark. Dynkin A965, Chap. VIII) shows that for processes in RUI\ where time changes, relative to Brownian motion in R(i", are given by measures m„, if mn =>·<¦ m, then A„(t) —>p Ao(t), for each t. The above analysis shows that for d = 1. A„(t) -> Ao@ uniformly on compacts on a set of full measure. The above results provide a converse to Dynkin's theorem when d = 1, since if A„(t) -+p A(t), this implies by Lemma 3.2 that Tn(t) -^-p To(t); hence by Theorem 3.6(B). m„ =>,. w. Conclusion of the proof of Theorem 3.5. Suppose that S~ ]BT„^S~^BT. Since the S„ are normalized, (A) of Theorem 3.6 yields the convergence of the paths Y„. Conversely, suppose we have the stated convergence of (Y„). Note that since У„@) -> Y@) a.s. Bx, this implies S~^{x) -> 5~'U). which implies uniform convergence of (S„) on compacts. Consider \S„Y„(t,w)-SY(t,w)\ <: |5?^,(?.?)-5^,(?,?)| + |5^(?.?)-5^(?,?)|. The first term goes to zero uniformly on compacts by the uniform convergence on (S„) and (Y„) on compacts. The second term goes to zero by the uniform continuity of S on compacts and the convergence of (У„). Hence ?(??(?.?)) —* ?(?(?.?)) uniformly on compacts a.s. Bx. But then it follows that BTn =>· ВТ. By Corollary 3.8, this implies that T„(t) -> T(t). By (B) of Theorem 3.6, this implies that 5,7' BT„ =* 5,7' ВТ.
330 J. K. Brooks 4. Diffusions as a limit of stretched Brownian motions and stretched random walks Introduction We shall show that every linear regular diffusion is a pathwise limit of processes which are rather simple in nature. These simple processes, which may be considered to be the basic building blocks of a general diffusion, are called stretched Brownian motions. They behave like Brownian motion except that the variance is a constant which depends on the region of the state space in which the particle is located, and there are a finite number of such regions. The precise description of stretched Brownian motion, its characteristics, and the convergence theorem are presented in Section 4.1. Processes called stretched random walks are defined in Section 4.2. These processes have the characteristics of a symmetric random walk with variance af when they are located in ]*,-, *,-+| [, where — oo < ? ? < · · · < лч < oo, and when they reach Xj they have probability p, of moving to the right of x, and probability q,¦ = 1 - Pi of moving to the left of Xj. It will be shown that every stretched Brownian motion is a weak limit of stretched random walks, and this in turn will imply the existence of a sequence of stretched random walks that converge weakly to a general given diffusion. The last topic deals with the time change ? (called natural time), relative to Brownian motion, associated with a given scale function S. A particle undergoing the diffusion X = S~[ ВТ then moves as closely as possible to a particle undergoing motion which imitates Brownian motion in the sense that its traversal time is quadratic. The limit theorem in Section 4.1 complements the deep results obtained by Knight A962) and Stone A963), who also used local time in considering the problem of obtaining diffusions as a limit of simple processes. All the results presented here are due to Brooks and Chacon A983). 4.1. Stretched Brownian motion First of all, we shall give a loose description of the simplest type of stretched Brownian motion. Let a > 0 be fixed. Imagine a particle whose movement is governed in the following manner: If the particle is situated strictly to the left of zero in the state space, let it undergo ordinary (standard) Brownian motion B{t,a>). When the particle is strictly to the right of zero, stretch the state space by a and alter the time by the factor 1 /a2, so strictly to the right of zero, it behaves, due to scaling, like ordinary Brownian motion. Let Xa denote this process. At first glance, it appears as though Xa is ordinary Brownian motion, however, this is not the case, as can be seen by computing its scale function. A moment's reflection shows that the probability of Xa hitting b before -b. b > 0 starting at zero, is the same as ordinary Brownian motion starting at zero and hitting b/a before —b. Thus Po(Xa hits b before -b) = or/(l + a). This shows that Xa, for ? ? 1, is not Brownian motion. Presently, we will give a rigorous description of X", but from the above, it is clear that zero (any other point in the state space could have been selected) acts like an internal boundary point. The scale function of Xa consists of two straight lines with a break or corner at the origin; the
One-dimensional diffusions and their convergence in distribution 331 speed measure is Lebesgue measure on ] —oo, 0] and a multiple of Lebesgue measure on ]0, oo[. Intuitively, Xa is a Brownian traveller to the left and to the right of zero, and when it hits zero it receives a kick, due to the stretch a, which gives the particle a probability of a/(l + a) of moving to the right: more precisely (see Corollary 4.3), P0(Xa(t) >0)=?/A+?). for every t > 0. Consider now a more general diffusion. Let .vo < ? \ < ¦¦¦ < Xk be given. We can generalize the above construction to obtain a diffusion whose scale function is linear in the regions ]—oo,x\], [xj,Xj+\], [jtjt, oof and whose time change, relative to Brownian motion, is obtained by multiplying Brownian time by a constant, where the constant depends upon the region [*,-,xi+\]. Ito and McKean A974) defined a process called skewed Brownian motion which has attracted much attention. This process is derived from В by randomly flipping the excursions of \B\ by means of a biased coin. When the particle undergoing skewed Brownian motion hits zero, its probability of moving to the right is p, the probability of obtaining a head from the biased coin on that particular excursion; when away from zero, skewed Brownian motion behaves like ordinary Brownian motion. However, as pointed out by Walsh A978), Ito and McKean's argument that skewed Brownian motion is a diffusion is circular. Walsh mentions that a pathwise construction can be given by using Ito's theory of point processes, but the construction is intricate. The convergence theorem stated in this section in a sense justifies the interest in Xa since, loosely speaking, every stretched Brownian motion is made up of processes Xa. We believe that the processes X" captures the essence of skewed Brownian motion since Po(Xa(t) > 0) = or/(or + 1), and it has the advantage of clearly being a diffusion since it arises from a simple scale and time change of В (?), as will be shown. The special case Xa of stretched Brownian motion has also been observed, in different contexts, by Rosenkrantz A975), Portinko A976), Harrison and Shepp A981). Harrison and Shepp originally arrived at Xa by considering a process which satisfies X(t) = B(t) + al^(t), where /?(·) is the local time at zero of the unknown process X(t). They discuss X(t) in terms of a solution of a stochastic differential equation. We shall first examine the construction of Xa. If we stretch the state space by a when the particle is to the right of zero, the time must be altered by a factor of \/or for proper scaling, hence the additive functional we need to obtain the process is ?(?,?)= / f2(B(s,co))ds. Jo where f(x) = a if ? ^ 0 and f(x) =1 if ? < 0. ?(-,?) is continuous and strictly increasing on a set of full measure. If T(t, ?) — A~~' (t, ?), where all paths are restricted to the above set of full measure, then it is clear that the process we described above is given by Xa(t,w) = h{B(T(t,w),w)}.
332 J.K. Brooks where h(x) — ax if ? ^ 0 and h(jc) = ? for ? < 0. Note that the fact that X" is a diffusion follows from the general theory of time changes induced by additive functionals (see DynkinA965)). In general, let x\ < xi < ¦ ¦ ¦ < Xk- Define k-\ f = aol]-oc..v,[ + ]Ti*i l[.v,.v, + ,[ + (*k: l[.u.oc[ , 1 = 1 )t-l g = A)l]-oc..v,[ + ? &'??*,¦·¦»,*,[ + /М[Ц.зс[ · i = \ where ?,, ?, > 0. Next, let Jo Finally, define X(t,w) = h{Bz(T(t,co),co)}. where T(t, ?) is the inverse of the additive functional A(t, ?), h(x) = fQ f(y) dy, and B- is Brownian motion starting at z. X(t) is called stretched Brownian motion (starting at h(z))- Xa corresponds to the case к = 1, jc? = 0, ?\ = a] = a2, a0 = 1. The scale function of X(t) is /г, whose graph is the polygonal line having slope \/a-, between h(jc, ) and h(xj+i). When /3, = a-, call X(t) natural stretched Brownian motion. We compute the speed measure m of X" as follows: Since the process behaves like Brownian motion away from zero, the expected time, starting at jc > 0, to leave the interval (c, d), 0 < с ^ ? ^ d, is cd + ex - x2 - xd, hence the derivative, with respect to h~' (x), of the expected time is a(—d + c — 2x); this implies the speed measure of ]c,d] is 2a times Lebesgue measure. Since the particle does not spend any positive time at zero, m({0}) = 0. Непсет(А) = 2/д/. To calculate the speed measure of a stretched Brownian motion given by X = hBT. h= I fdx. T-\t) = A(t)= J g(Bs)ds, where / and g have been defined earlier, let us first calculate the expected time X(t) exists from the interval [a,b], given that X is at v. a ^ у ? b. Assume h{x) = у and jci < jc < JC2. x\ < h~](a) ? h~l(b) < .in. Since scale functions are determined up to affine transformations, assume that the scale function h~' of X passes through 0 at h{x\) and has slope \/a\ between h{x\) and h{xi)- Now we are interested in the expected time ?(?/?\) exists from [h~' (a). h~' (b)] starting from h~' (v), which is equal to the expected time B(t) exists from [?^/?\/?\, Ьу/Щ/а\\ starting at у^/Ща\; this expected time is
One-dimensional diffusions and their convergence in distribution 333 Pi/a2(by — y2 + ay — ab). Taking the derivative of this expression with respect to the scale function h~[, we obtain (?\/?\)(-2\ + b +a). This function generates 2/5|/ori times Lebesgue measure in the interval [h(x\). ft(jn)]· The speed measure for X in other regions are handled in a similar fashion. Feller's theorem states that if ? e ?>(??), then Гаф=(а/ат)(а+ф/(аИ-[)), where Га is the infinitesimal generator of X". If ?" is continuous and bounded on R \ {0}, then (Гаф)(х) = A/2)ф"(х) since the process is locally Brownian at л ? 0. Continuity of ??? implies ф"@+) = ф"@-). Note that the measure induced by ?+??(?\?~') has value ??'(8) - ?'(-?) on ] - 5, <5], and this value is also given by f^s(raf)(x)m(dx). Letting ? -> 0, we conclude that аф'@+) = ф'@-) if ? e ?>(??). Note that reflected Brownian motion \B\ is, in a sense, the limit of X"asa^ oo; hence for ? to belong to Z?(/]bi), we must have ф'@+) = 0 and (Г\В\ф)@) = (\/2)ф"@+). The calculations for the scale function and speed measure for the general stretched Brownian motion are carried out in a similar fashion. Suppose now that Z(t) is a regular linear diffusion with the real line as its state space and scale function S. Then SZ(t) is a regular diffusion in natural scale with a unique speed measure which we denote by m. Let ? = C[0. oof be endowed with the topology of uniform convergence on compact sets. Bx is Wiener measure, starting at x, defined on the Borel algebra of ?. Let ?(?, у, ?) denote local time for Brownian motion. Define the additive functional A(t,w) = k(t,y,oj)m(dy), ?&?, t > 0. and let ? denote the inverse of A. By Trotter's theorem there exists a set ?? С ? such that ??(??) = 1 for each ? and ?(·, ·, ?) is jointly continuous in t and ? for each ? e i?o; also ?(·, ?) is continuous and strictly increasing for each ? e ??. Henceforth, all paths ? will be confined to ??- If Qx is the Bs(X) distribution of the mapping ?(·) -> S~[ {?(?(-, ?))}, then {?,??? is a diffusion with a scale function S and speed measure m. There exists a sequence of measures mn on the real line satisfying the following conditions: (i) m„(A) = fA f„(x)dx, where/,, = ??<?<>„ < Ь", and ?," > 0, -oo = a'0' < a'{ < ¦¦¦<a": =oo,/;'= ? ,,?"[, with <=?'? =1; Jn I I — 1 1 1 Jn (ii) mn =>·? m, that is, J ????? -> J фат, for every continuous function on R having compact support. Let S„ be a sequence of scale functions each of which is piecewise linear such that Sn{x) —>· S{x) for each x. Without loss of generality we may and shall assume that each S„ is linear on the intervals J',' J", and that these intervals refine the intervals corresponding to /„. Let Q" be the diffusion which is defined by Sn and w„, that is, if A„(t,w)= / f„(y)X(t,y,w)dy= I k(t,y,w)m„(dy),
334 J. K. Brooks and T„ is the inverse of A„, then Q"x is the Z?s„m distribution of the mapping ?(·) -> S~' {?G„(·,?))}. Note that {<2"}.,eR is a stretched Brownian motion for each n. To show that F");Ji| converges weakly to Qx, we need to prove that f gdQ,=\imf gdQ'i for every bounded function g which is continuous on ?. This is equivalent to showing lim jg((S;lTnJ„)(w))Bo(dw)= f g((SulT0J0)(o)) B{)(dw), where J„ :? -> ? is defined by 7„(?) = ?(·) + S„(x) and (S 7„./„)(?) is the curve whose value at t is S~' {Jn(w)(T„(t, У,,(?))}. Here we set S = SO, 7" = 7b- Since g is bounded, it suffices to show that g(S~' Tn J„) -> ?(| giS^1 7ЬУо)· However, a much stranger result holds, namely, (S-lT„J„)(w)(-)^ (Sq1T0Ji))(o)(-) C5) uniformly on compact time sets. To see this, observe that A„(t,J„)(w))= / k(t,y,S„(x) + w)m„(dy) = J[k(t. y, S„(x) + ?) - k(t, у, So(x)+a>)]m„(dy) + / k(t,y,So(x)+a))m„(dy) = /„(?) + /„(?). Now |/„(?)| < ? \k(t, у, S„U) + ?) - k(t, у. Sq(x) + ?) |m„(dy). Since ?(?, ·, ?) is uniformly continuous on compacts, (m„)n is uniformly bounded on each compact set, and ?(?, ·, ?) vanishes outside of a compact set, we see that /,,(o>) -> 0. Since m„ =>·<· m, /,,(?) —* A(t, Jq(co)), hence T„(t,Ma>))^To(t,Jo(a>)). C6) Each Tn (·, ?) is continuous and strictly increasing, thus the convergence in C6) is uniform on compact time intervals. Finally, since each S~' is continuous and strictly increasing, C5) follows. A similar argument shows that if the processes (X„U))%L0 are defined on ? by Xn(t)(w) = S~]{w(T„(t,w))}, then ?,,(-,?) -> ??(-,?) uniformly on compact time
One-dimensional diffusions and their convergence in distribution 335 intervals for each ? e ??. Note that Xo(t) is distributionally equivalent to Z(t). With the convention that we refer all linear diffusions to its canonical representative on ?, we have THEOREM 4.1. Every regular diffusion is the almost sure pathwise limit of a sequence of stretched Brownian motions, where the limit is uniform on compact time intervals. 4.2. Diffusions as a limit of stretched random walks In this section we shall consider a random walk approximation to stretched Brownian motion. Our detailed description will deal with Xa; once this is done, it will be clear how the general stretched Brownian motion is to be approximated. Harrison and Shepp A981) have also obtained a random walk approximation to Xa by methods somewhat different from those presented here. Walsh A978), under the assumption that Ito and McKean's skewed Brownian motion is a diffusion, derived the transition probabilities of X", Our method uses a random walk argument since we do not have Corollary 4.3 as an immediate consequence of the definition of Xa. As we mentioned in Section 4.1, since P0(Xa(t) = b before -b)=a/(a+ \) = p, it is natural to approximate Xa by a random walk which, when away from zero, is symmetric and when it hits zero it has probability ? to go to the right and probability q = 1 - ? to go to the left. Consider a fixed probability space (?, ?, ?) which supports the processes described below. ?\, ?2, ¦ ¦ ¦ are i.i.d. random variables with ?(?\ = 1) = ?(?\ = -1) = 1/2; ?\, ??,... are i.i.d. random variables, independent from the first process, satisfying ?(?\ = 1) = ? mdP(f] =-l)= 1 -?, 0^/?< 1. Let V be the partition of ? defined by the points {x,k/n, k=0, ±1,±2,...}, with partition intervals {8k, к = ±1, ±2,...}. where S\, 82, ¦ ¦ ·, are the first, second, ... partition intervals to the right of x, and <5_i, <5_2, · · ·, are the first, second, ... partition intervals to the left of ?. \?,¦ | denotes the length of the partition interval <5,. Define for ?„«)=? + ?\\??,\+?2\?№2\ + ---- + <Jk+\pk+\?<4+??' у ~ -I-Pit 141 ??4?2 + + I4I21 Ok . t e [|ie, |2/?? + · · · + \Sef/ah, |ie, |2/?, + · · · + |i„t+I |2/<t*+i].
336 J. K. Brooks where Ok = p\-\ hpjt, and Uk+\ ifx + p]\Sel\ + --- + pk\S0,\^0. \ Va+? otherwise. We interpret p\ \??? I + · · · + pi,-\&цК\ = 0 if к = 0. Next, we define the ?* appearing above as follows: if Mk+\ = x +pi|5fl,|H hpjt+i|5^+I| >0, then set au+\ = ??/УДГ; if Mk+\ < 0, then ?*+? = ??/?^· Finally, if M/,+ \ = 0, set <fk+\ = &k- This defines the process Xn(t) starting at every ? e E, and X„(t) has continuous trajectories. When we set <*o = /So = 1 and <*? = ?, ?\ = a2, we obtain an approximation tor. In general, given a stretched Brownian motion X = hBT, where h and ? are given in Section 4.1, one can construct a random walk approximation in a similar fashion-for example, in the state regions jc,_i < .v, < .v, + i, determine the different variances for X viewed as a Brownian motion in ]xj,xi+\[ and ]jc, ?,.x,[; when constructing the random walk, when the particle moves in either of these regions, the speed of the particle should be the corresponding standard deviation times the reciprocal of the distance traveled, and the ?? are used. When the particle hits jc,, then ?(,) are to be used, ?(?{-') = 1) = pt and ?(?['] = — 1) = 1 - pi, where pt is determined by the break in the graph of h at (Xi ,h(xi)). Call these random walks stretched random walks, and natural stretched random walks if the variances are all one. We construct now X„(t) in detail in the case of three space regions (the general case is similar) given by R\ —]—oo,x\[, /?? =]-??.*2[· ^3 =]*2, °°[, where —oo < x\ < X2 < oo. Again consider a fixed probability space (?, ?, ?) which supports the following processes described below. Let ?\,??,... be i.i.d. random variables with ?(?? = 1) = ?(?? = — 1) = 1/2 and let (?^), (?,\) be sequences of i.i.d. random variables independent of each other and of (?„), with ?(?[„') = 1) = Pj, P(flJ) = -1) = 1 - p}, where p; = ajl(otj + ?/+?), j = 0, 1. Define the partition V = [x,x\,X2,k/n, к = 0, ±1, ±2,...}, and the 8^ partition intervals indexed with respect to their position to the right and left of ? as before. Define X,,(t) exactly as before, for t belonging to the appropriate time interval, where Pk+i =?k+\ ifx + p\\Stil| ? l·Pk\Sв^\ e R<, i = 1,2, 3, and where _ | V°+l if x + P\\8e,\-\ l·Pk\S?^\ =x\, [?? if ? +?\\??,\-\ V Pk\&t)k\ = хъ
One-dimensional diffusions and their convergence in distribution 337 We still need to indicate the definition of ?*, which is done as follows: if M^+\ e /?,¦, then ak+\ = otj/y/??, / = 1, 2, 3, and if Mk+\ = x\ or jo, thena^+i = ?*·. THEOREM 4.2. Every stretched Brownian motion is the weak limit of stretched random walks. Hence every regular diffusion is a weak limit of stretched random walks. Proof. Consider first the simple stretched Brownian motion X" with the corresponding stretched random walks X„(t) defined above. Let (Q")„ be the measures induced on C[0, 1] by Xn(t) starting at x. Observe, first of all, that (Q"x) is tight. To prove this, consider |X„(OI· This is the polygonal process associated with a reflected symmetric random walk. Using Donsker's theorem, we see that |X„(OI =>· \B(t)\, where В is standard Brownian motion and =>· denotes weak convergence. We shall use the following THEOREM. (Q"x)„ is tight if and only if (see Billingsley A968)) (i) For every ? > 0 there exists some b such that (?';)(?: |?@)| >&) ^jj for each n\ and (ii) For each ?, ? > 0, there exists an h > 0 and an «o such that Q"(co: Sh(co)^s) !??, form^iiQ. Here, Sh(w) = sup(\w(s) - ?(?)\: \s -t\ ?. h], h ?. 1. Obviously, (i) holds in our case. To prove that (ii) holds, simply observe that <5/,(?) ^ 2<5/,(|a>|), and apply the above theorem to the measures induced by |X„(?)|. Next, let S„(t) denote the symmetric random walk, where pu = &· We will use the following equality: For А С [0, oo[, Po{X„(t) eA) = P(X„(t) e A, Xn(fl) = 0) = 2pP0(S,,(t) e A). C7) This can be deduced by observing that, if ? = time of the last visit to 0 in [0, t], then Po{X„(t)eA, T=l) = 2pP0(S„(t)eA. z = l). In particular, Po(X„(t) ^ 0) = p, for t > 0, since the zeros of S„ and X„ are the same. Using Donsker's theorem again, we have limfb(Xii@ e A) = 2PB0(x(t) e A), where x(t)(w) = ?(?), and А с [0, oo[. If A c]-oo,0], we have \imP0(Xn(t) e A) = 2A - p)BQ(x(t) e A).
338 J.K. Brooks Let <2o be a weak limit point of (Qq),<- The <2o distribution of x(t) can have only countably many atoms, hence for all but at most a countable number of 0 < a < b, Qo{x(t) e [a,b]) = 2pB0(x(t) e [a, b]). A similar result holds for intervals to the left of zero. Hence ^=exp(-y2/Br)Kv. y^O, 6o(*@edy) = < y. 2л ? ^^ exp(-y2/B0) dy. у < 0. 2.?? C8) Next, take jc > 0. If ? is the time of the last zero of X„ in [0. t], if it exists, ? = +oo, otherwise, then if А с ]0, oo[ and Ax is A reflected, P,(Xn(t) e A) = P,(X„(r) e ?, ? = oo) + Px(Xn(t) e ?, ? < oo). By the reflection principle, /\(X„@ e ?, ? = oo) = PtE„(i) e A. S1,, positive on [0, t]) = Pt(S„(t)eA)-Px(S„(t)eA±). Also P,(X„@ e ?, ? < oo) = ]Грл(Х„@ e ?, ? =/) / = 2р]Г>гE11(г)ед, т = /) / = 2pPx(S„(t) e A, S„ hits 0 before f) = 2??E„(?)??±). If Qx is a weak limit of (?"), passing to the limit along a subsequence we obtain Qx(x(t) e A) = 1/^2*7 / (exp(-(y - jcJ/Br)) - exp(-(y +*J/Br))) dy + 2p/Vba I exp(-(y+xJ/Bt))dy. J A Hence, for jc, у > 0 Q,(x(t)edy) = l/Vba [exp(-(y - jcJ/B0) + Bp - 1) exp(-(y -xJ/Bt))] dy. C9) Similar reasoning yields, for jc, у < 0, the above formula with A — 2p) replacing Bp — 1) in C0).
One-dimensional diffusions and their convergence in distribution 339 If x < 0 and А с ]0, oo[, with ? defined above, then P,(X„(t) e A) = P,(X„(t) e ?, ? < oo) = ]??,(?„(?)??|?=/)??(? = /) / = J]2pfb(Sll(i-/)eA)Pr(T=/) / = 2pP4S„(t)eA). Hence, if ? < 0 and у > 0, ?*(*? e dy) = Ipjsfba ехр(-(дг - ?J/B?)· D0) If jt > 0 and у < 0, the formula D0) holds with 2A — p) replacing 2p. The finite dimensional distributions of x(t) relative to Q can be calculated in a similar fashion, and it seen that x(t) is a Markov process. The above formulas for the transition probabilities show x(t) has a Feller-continuous semigroup, hence {Qx} is a strong Markov process, that is, a diffusion. Also, ?? = ??. For ? ? 0, (??/)(*)=1/? if /" is bounded and continuous on R \ {0}. For / described above, /@) = 0, l-E0(f(x«))) = lE. ? N,{y)dy + ^—^ F Ndy)dy, D1) ' t JO t J-oc where N,(y) = (\/¦Jbu)exp(-y1 /2t)\~N,(y) = G/Vf )M (у)· Equation D1) is equal to, for a > 0, 2??-^/'@+)+$? J yH,(.y)dy+j /(}-)'" 4(}') <M -2A-?)!^/'@-) + ??|"??,(}·)?·-| /(>·)?_??(>')^'|, D2) where <5Я -> 0 and Sa -> 0 as ? -> 0. For each a, the second and fourth terms converge as t -> 0. The sum of the first and third terms in D2) is /2P/@+)-2(l-P)/'@-)+gfdj,\ r'^yNl{y)dyt which shows that p/'@+) = A - p)f'@-), or a/'@+) = /'@-), where p = a/(\ +<*), if (Ге/)@) exists. Hence ?? = TQ.
340 J. K. Brooks Alternately, we could have argued that since Qx is locally Brownian, ? ? 0, and since Po(X„(t) hits 1 before - l) = ? (hence Qo(x(t) hits 1 before - 1) = p), that a scale function for Q is S(x) — ?, ? ^ 0, S(x) = (A - p)/p)x, x^O, which is also a scale function for Xa; the speed measure of Q can be shown to be equal to the speed measure of X" by observing that away from zero Q is locally Brownian. This shows that Q and Xa have the same scale functions and speed measures and are therefore equal. In the case X @ is a general stretched Brownian motion, one can prove in a similar fashion, tightness and finite dimensional convergence of the distributions of the corresponding stretched random walks. This proves the first part of the theorem. If X(t) is a regular diffusion, there exist X„(t), stretched Brownian motions such that X„(t) =>· X(t), by Section 4.1. But, for each n, there exist W"' =3-,,, X„(r), where W,'"(f) is a stretched random walk. Since weak convergence is metric, it follows that X(t) is the weak limit of stretched random walks. ? COROLLARY 4.3. For any t, P0(Xa (t) ^ 0) = <*/(<*+ 1). PROOF. Corollary 4.3 follows from C7) of the proof of Theorem 4.2. D 4.3. Natural time Let S be a given scale function S: R -> R, where the mapping is onto. If a diffusion has S as its scale function, what would its time change, relative to Brownian motion, be in order that a particle undergoing the diffusion move, as much as possible, in a Brownian fashion? To motivate this, let X = S~' В Т. We want to find conditions on ? so that X(t) locally has Brownian characteristics. It can be expected, neglecting infinitesimals, ?? - ?? = (X(t\) ~ X(to)J = [S~] B(T(t\ ,?)) - S B(T(t0, ?))? Dividing both sides of D3) by [B(T(t\, ?)) - B(T(tQ, ?)]2, and using T(t\, ?) - T{t0, ?) = [B(T{t\ ¦ ?)) - B(T(t0, ?))]2, we obtain, by allowing ?| -> ??. dS(z) dz z=S-lB(T(i0.a>)). '¦ _???(?,?) V1 = "V dt I=J dA(s) ds D3) D4) D5) i=.V() where ?? = T(to, ?), and A is the inverse of T. Hence, A should satisfy ?(?,?)= / \S'(S~4B(u,w)))] 2du. Jo D6)
One-dimensional diffusions and their convergence in distribution 341 Given a scale function S, the time change ? = A-1 defined by D6) is called the natural time associated with 5". As a further illustration, we show that for a piecewise linear scale function, the time change for stretched Brownian motion corresponding, in the notation of Section 4.1, to the ?? = af case, leads to D6). Let S be piecewise linear scale function with breaks at У\ < У2 < ¦ ¦ ¦ < Ук with slopes 1/<*?, l/a2 l/<*jt in the corresponding regions. Define k-\ / = aol]-3o.S(y,)[ +^ail[S(y,).s<.v,-,)[ +<**l[S(yt).^[ and k-\ « = O!01]-3o.S(yI)l + ^Q!i:i[S(.v,).5(v,-I)[+«il[S(yU.3c[ ¦ i = \ Let X = hBT, where h(x) = /Ql /, S = fc-', T~] = A, where A(t)= f g(B,)ds. Jo This gives a process which behaves like Brownian motion in the state space regions ]~oo,y][, ]yit yi+\[,]ykl oof. Note that g(S(z)) = J2[S'(z)]-h[SiXi.^l){(S(z)), or g@-[5'E-'@)]. Thus A satisfies D6). Let S be the class of all scale functions S having a second derivative existing such that there exists a sequence of polygonal piecewise linear approximations S„ converging pointwise to S satisfying ? ?,? f /о on R, D7) J(.) J(.) where /„ = [S'„(S~' )]-2, So — S, and /(, /o is finite on compact sets. We shall show that if S e S and X = S ] ВТ, where ? is natural time associated with S, then there exists a sequence of natural stretched Brownian motions and natural stretched random walks converging weakly to X. Intuitively, we interpret the condition S e S as
342 J.K. Brooks follows: when S becomes flat in a region, it is harder for the particle under any time change to move as if it were a Brownian traveller, so one would expect some conditions on S'. Let 5" = 5? e S and assume that (S„) is the sequence of piecewise linear scale functions converging to S satisfying D7). As in Section 4.1 we shall work on ??· Define A„(w,t)= / k(t,y,w)m„(dy). where mn = f f„, and the /„ are defined by D7). Since m„ =>· w0, we see, as in Section 4.1, that if X„(t,co) = S-]{w(T„(t,cu))}, where T„ is the inverse of An, then the natural stretched Brownian motions X„ (t) converge pathwise to Xo(t) uniformly on compact time intervals, for ? e ??. Since each natural stretched Brownian motion is the weak limit of natural stretched random walks, by the results in Section 4.2, we have THEOREM 4.4. Let S be a scale function belonging to S. There exists a sequence of natural stretched Brownian motions X„ and natural stretched random walks Y„ such that Xn converges pathwise almost surely to Xq, uniformly on compact time intervals, and Yn converges weakly to Xq, where Xq is a diffusion with a scale function S and time change, relative to Brownian motion, the natural time associated with S. The author is pleased to acknowledge the permission of Academic Press, Inc. to use portions of the material in references J.K. Brooks and R.V. Chacon A982, 1983) for this chapter. References Bachelier, L. A900), Theorie de la speculation. These. Paris. Billingsley, P. A968), Convergence of Probability Measures. John Wiley & Sons, New York. Blumenthal, R.M. and Getoor, R.K. A968), Markov Processes and Potential Theory, Academic Press, New York. Breiman, L. A968), Probability, Addison-Wesley. Reading, MA. Brooks, J.K. and Chacon, R.V. A982). Weak convergence of diffusions, their speed measures and time changes. Adv. Math. 46 B), 200-216. Brooks, J.K. and Chacon, R.V. A983), Diffusions as limit of stretched Brownian motions. Adv. Math. 49 B), 109-122. Brown, R. A828), Philosophical Magazine. Dellacherie, С and Meyer. P.-A. A978), Probabilities and Potential. North-Holland, Amsterdam. Dynkin, E.B. A955), Continuous one-dimensional Markov processes. Dokl. Akad. Nauk SSSR 105. 405^408. Dynkin, E.B. A959), Principles of the Theory of Markov Random Processes. Moscow-Leningrad. Dynkin. E.B. A965). Markov Processes, Springer. Berlin. Einstein. A. A905). On the movement of small particles suspended in a stationary liquid demanded by the molecular-kinetic theory of heat, Ann. Physik 17. Feller, W A952). The parabolic differential equations and the associated semi-groups of transformations. Ann. of Math. 55. 133-160.
One-dimensional diffusions and their convergence in distribution 343 Feller, W. A954), The general diffusion operator andpositivity preserving semi-groups in one dimension, Ann. of Math. 60, 417-436. Feller, W. A955), On second order differential operator, Ann. of Math. 61, 90-105. Feller, W. A957), Generalized second order differential operators and their lateral conditions, Illinois J. Math. 1, 459-504. Freedman, D. A971), Brownian Motion and Diffusion, Holden-Day, London. Harrison, J.M. and Shepp, L. A. A981), A noted on skewed Brownian motion, Ann. Prob. 9, 309-313. Ito, K. and Mckean, HP. Jr. A974), Diffusion Processes and Their Sample Paths, Springer- Verlag, New York. Karatzas, I. and Shreve, S. A991), Brownian Motion and Stochastic Calculus, Springer, New York. Karlin, S. and Taylor, H. A968), A Second Course in Stochastic Processes, Academic Press, New York. Knight, F.B. A962), On the random walk and Brownian motion. Trans. Amer. Math. Soc. 103, 218-228. Knight, F.B. A981), Essentials of Brownian Motion and Diffusion, Math. Surveys No. 18, Amer. Math. Soc., Providence, RI. Levy, P. A937), Theorie de I'addition des variables aleatories, Gauthier-Villars, Paris. Levy, P. A939), Sur certains processes stochastiques homogenes, Compositio Math. 7, 283-339. Levy, P. A948), Processus stochastic et mouvement Brownian, Gauthier-Villars, Paris. Loeve, M. A963), Probability Theory, Van Nostrand Crd edn.). Mandl, P. A968), Analytical Treatment of One-Dimensional Markov Processes, Springer, Berlin. Portinko, N.I. A976), General diffusion processes. Lecture Notes in Math., Vol. 550, Springer, Berlin, 500-523. Rogers, L.C.G. and Williams, D. A944, 2000), Diffusions. Markov Processes and Martingales, Vols 1, 2, Cambridge University Press. Rosenkrantz, W A975), Limit theorems for solutions to a class of stochastic differential equations, Indiana Math. J. 24, 613-625. Stone, C.J. A963), Limit theorems for random walks, birth and death processes and diffusion processes, Illinois J. Math. 7, 638-660. Stroock, D.W and Varadhan, S.R.S. A979), Multidimensional Diffusion Processes, Springer, New York. Trotter, H.F. A958), A property of Brownian motion paths, Illinois J. Math. 2, 425^433. Walsh, J.B. A978), A diffusion with discontinuous local time, Asterisque 52-53, 37^45. Wiener, N. A923), Differential space, J. Math. Phys. 2, I 31-174. Wiener, N. A924), Un probleme de probabilities enombrables. Bull. Soc. Math. France 52, 569-578. Wiener, N. A933), Generalized harmonic analysis. Acta Math. 55, 117-258.
CHAPTER 8 Vector Integration in Banach Spaces and Application to Stochastic Integration Nicolae Dinculeanu University- of Florida. Gainesville, FL 32611, USA E-mail: nd@math.ufl.edu Contents Introduction 347 1. Preliminaries 348 1.1. Banach spaces 348 1.2. Measurable functions 348 1.3. Integral of step functions 349 1.4. Measurability with respect to a positive measure 350 2. The Bochner integral 351 2.1. The seminorm 351 2.2. Bochner integrability 352 2.3. The Bochner integral 353 2.4. The spaces ?.?(?) 354 3. Integration with respect to measures with finite variation 356 3.1. Measures with finite variation 356 3.2. Integration with respect to a measure with finite variation 357 3.3. The indefinite integral 359 3.4. The Radon-Nikodym theorem 361 4. Semivariation of vector measures -364 4.1. The semivariation 364 4.2. Semivariation and norming spaces 365 4.3. Semivariation of ?-additive measures 366 4.4. The family nip.z of measures 367 4.5. Extension of measures 367 4.6. Extension of positive measures 368 4.7. Extension of ?-additive measures * 369 4.8. Canonical extensions 369 4.9. Canonical extension of additive measures 370 5. Integration with respect to a measure with finite semivariation 371 5.1. Measurability with respect to a vector measure 371 5.2. The seminorm mf.cif) 372 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 345
346 N. Dinculeanu 5.3. The space ^DimF.c) 374 5.4. The integral 375 5.5. Convergence theorems 376 5.6. The indefinite integral of measures with finite semivariation 377 5.7. Integral representation of linear operations on L''-spaces 378 5.8. The Riesz representation theorem > 379 6. The Stieltjes integral 379 6.1. The variation and the semivariation of a function 380 6.2. Semivariation and norming spaces 381 6.3. The measure associated to a function > · ¦ 382 6.4. The Stieltjes integral 383 7. The stochastic integral 386 7.1. Notations and definitions 386 7.2. The measure ? ?. Summable processes 387 7.3. The stochastic integral 388 7.4. Convergence theorems > 390 7.5. Summability of the stochastic integral 391 7.6. Local summability and local integrability 391 8. Processes with integrable variation or integrable semivariation 392 8.1. Processes with finite variation or semivariation 392 8.2. Optional and predictable stochastic measures 393 8.3. The measure ?? 394 8.4. Summability of processes with integrable variation or integrable semivariation 395 9. Martingales 396 References 398
Vector integration in Banach spaces and application to stochastic integration 347 Introduction This chapter is devoted to the theory of integration with respect to vector measures with finite semivaration and its applications. This theory reduces to integration with respect to vector measures with finite variation, which, in turn, reduces to the Bochner integral with respect to a positive measure. The Bochner integral, itself, is based upon the classical integral of real-valued functions with respect to a positive measure. The above presentation is a description, in reversed order, of the four stages in the development of the integration theory. We shall present these four stages in their natural order (the first stage in Chapter 2). Among many approaches to the classical integral we have chosen one which seems to be simpler and is presented, for example, by W, Rudin A973), see Chapter 2. Any one of these approaches leads to a vector space L' (?) of integrable functions, equipped with a seminorm ||/|| ?, for which it is complete and in which Vitali and Lebesgue convergence theorems are valid. We shall impose the same requirements to any kind of integration theory. An integration theory satisfying these requirements is called a "satisfactory" integration. Here, "satisfactory" refers to the possibility to use the integral to a wide range of applications. The other three stages of the integration theory yield satisfactory integrals. The Bochner integrability of vector valued functions / reduces to the classical integrability of |/|, The Bochner integral is obtained by extending by continuity the integral of step functions. There are other types of integrals for vector valued functions, for example the Pettis integral. These integrals are not satisfactory in the above sense. The integrability with respect to a vector measure m with finite variation \ m |, is defined as Bochner integrability with respect to the positive measure \m\. The integral is again obtained by extending by continuity the integral of step functions. This stage of the integration theory is presented in detail by Dinculeanu A967), Finally, integrability with respect to a measure with finite semivariation reduces to integrability with respect to a family of measures m- with finite variation. The integral jfdm is defined then as the linear operation си f fdm-. This last stage is very important. In fact, the most interesting measures do not have finite variation, but may have finite semivariation. Among the applications of the last stage of integration theory presented in this chapter we quote: the integral representation of linear operations on V-spaces, the Riesz representation theorem, and the Stieltjes integral with respect to a vector valued function with finite semivariation. The most important application is the Stochastic integral in Banach spaces, with respect to summable processes: in particular with respect to square- integrable martingales and with respect to processes with integrable variation or integrable semivariation. For a detailed presentation of vector integration and its applications, the reader is referred to Dinculeanu B000). In this chapter we give complete statements of definitions and theorems; but no proofs of theorems are given. For proofs, the reader is referred to different books. The numbering of definitions and theorems starts anew in each section.
348 N. Dinculeanu 1. Preliminaries In this section we establish the notations that will be used throughout the chapter, 1.1. Banach spaces 1. Throughout the chapter, E,F,G,D will denote Banach spaces. For any Banach space ? the norm of an element ? e ? is denoted by |jc|. the dual of ? is denoted by M* and the unit ball of ? by M\, The duality between ? and M* is denoted by (x,x*), (x*,x), x*x oreven.vjc*. L (F, G) is the space of bounded linear operations from F into G. We write ? с L(F, G) to mean that ? is continuously embedded into L (F, G). i.e., |jcy| < \x\\y\ for* e ? and ? e F. For example. ? = L(R. ?). ? С ?(?*. ?) = ?**. if ? is a Hubert space, ? = ?(?. R). We write со ?! ? to mean that ?? does not contain a subspace which is isomorphic to the Banach space со. If ? is a Banach space, a subspace ? с ?* is said to be norming for M, if for every ? & ? we have |*| = sup{|(*,z)|·. zeZ|), 1.2. Measurable functions 2. Throughout the chapter, S is a nonempty set, V, 7?, A P, 5, ? are respectively a semiring, a ring, an algebra, a <5-ring, a ?-ring, a ?-algebra of subsets of S. For any class С of subsets of S we denote by r(C), a(C), Sr(C), ar{C),aa{C), respectively the ring, the algebra, the <5-ring, the ?-ring, the ?-algebra generated by C. If ?? is a ring, we denote by Sf(R) the vector space of 7^-step functions f:S^> F of the form / = ?]·'=| Хд,.*,, with A, e 7? and .v, e F. The sets A, can be chosen mutually disjoint. Then ? For any function /: S -> F or R, we denote by | /| the function defined by |/|(?) = |/(?)|, fori e S, We emphasize that a positive 7^-step function takes only finite values, Measurability is defined with respect to a ?-algebra. Let ? be a ?-algebra of subsets of 5". 3. Definition. A function /;S -» F or R is said to be ^-measurable, if there is a sequence (/„) of iT-step functions /„ :S->F such that /„ -> /, pointwise.
Vector integration in Banach spaces and application to stochastic integration 349 The iT-step functions are .^-measurable. It follows that the set of .^-measurable functions is a vector space. If / is .T-measurable, then |/| is also .T-measurable. In the above definition we can choose the sequence (/„) with additional properties, 4. THEOREM. Let f ;S -> F orR be ? ?-measurable function. Then there is a sequence (/„) of ?-step functions f„\S-> F or R such that fn -> f pointwise and |/„| < l/l for each n. If f is positive (with values in [0.+00]). the sequence (/„) can be chosen to be increasing. If f is real-valued and bounded, the sequence (/„) can be chosen to be uniformly convergent. The following theorem gives a characterization of .T-measurability, 5. THEOREM, A function /: S1 -» F orR is ?-measurable iff it has separable range and f~\B)e ? for every Borel set В С F or В С I. iT-measurability is preserved by pointwise convergence: 6. THEOREM. //(/,,) is a sequence ofF or R-valued, ?-measurable functions converging pointwise to a function f then f is also ?-measurable, 1.3. Integral of step functions 7. Let H be a ring of subsets of S and m ; Tl -> ? с L(F, G) be an additive measure. For any F-valued. 7^-step function ? f = 2_] ??,?? with A, e 7? and л, е F, 1 = 1 we define the integral f f dm by the equality / fdm = Y^m(A,)x, e G. J 1 = 1 The definition of the integral is independent of the particular representation of / as a step function. In fact, one can prove that if ??-\ ??,?? = 0, then ?'?=\ m(^i)xi — 0; consequently, if ? in / = l .,= 1
350 N. Dinculeamt with A;, Bj e TZ and x,, yy e F, then /1 HI ]T w (A;,)xi = J^ m (S,)y/. and J f dm is defined unambiguously. We have the following immediate properties of the integral of TZ-step functions: 8· f(f + g)dm = f fdm + f gdm, 9. cf dm = c f f dm, for с el. 10. If ? is a positive, finite, additive measure on TZ and if f.S -> F is an 7^-step function, then f /?? e F and 11. If we want to extend the integral for a larger class of functions, we have to impose additional conditions on TZ and m, for example, that 7? is a <5-ring or a ?-algebra and m is ?-additive and with finite semivariation. In particular, m can have finite variation or can be positive. There are four main stages in the development of the integral / / dm: (I) The classical integral, with m ^ 0 and / real-valued, (II) The Bochner integral, with m ^ 0 and / vector-valued, (III) The integral f f dm, where m is a vector measure with^m/te variation and / is vector- valued. (IV) The integral J f dm, where w is a vector measure with^ш/'fe semivariation and / is vector-valued. The most important stage is the first one. The others stages can be performed to the extent that they can be reduced to the first one. In the following paragraphs we shall present succinctly each of the above stages, except (I) which was already presented in Pap B002) (Chapter 2 in this Handbook). 1.4. Measurability with respect to a positive measure The framework for this section is a measure space (S, ?, ?), consisting of a nonempty set S, a ?-algebra ? of subsets of S and a positive, ?-additive measure, with finite or infinite values, ? : ? -> [0, +oo], such that ?@) = 0. We shall assume that ? has the finite measure property (FMP), i.e., for every set A e ? we have ?(?) = $??{?(?); Be ?, В С ?. ?(?) < ??}.
Vector integration in Banach spaces and application to stochastic integration 351 12. A function / :S -> F or R is said to be ?-negligible is f(s) = 0, ?-a.e, 13. Measurability with respect to the ?-algebra ?(?) is called ?-measurability. The ?(?)-$??? functions are also called ?-measurable step functions. A function f:S -> F or R is called ?-measurable if it is .?^)-measurable, i.e., if there is a sequence (/„) of F or R-valued ?-measurable step functions, such that /„ -> / pointwise. 14. A function / :S -> F or R is ?-measurable iff there is a .^-measurable function g:S^> F or R such that f = g, ?-a.e. If f\ = /2. ?-a.e. and if f\ is ?-measurable, then /2 is ?-measurable. It follows that /is ?-measurable iff there is a sequence (/„) of iT-step functions such that /„ -> /, ?-a.e. 15. We extend the ?-measurability for functions defined ?-a.e. An F or R-valued function defined ?-a.e. on S is said to the ?-measurable, if it has a ?-measurable extension g:S -*¦ F or R on the whole space S. Then any extension of / to S is ?-measurable. 2. The Bochner integral The framework for this section is a measure space (S, ?,?) and a Banach space F. The definition of the Bochner integral is very similar to the classical integral, with some differences. 2.1. The seminorm 1. For every F-valued ?-measurable function / defined ?-a.e. on S we set ||/lh = /\/\??. For each ?-measurable function / defined ?-a.e., there is a ?-measurable function g defined everywhere such that f = g, ?-a.e., therefore ||/|| ? = ||g|| ?. || /|| ? has the following properties: 2. o< 11/111 <;+oo. 3. 11/111 = 0 iff / = 0, ?-a.e. 4. If / = g, ?-a.e.. then ||/||, = ||g||,. 5. Il/ + g||i<||/||i + ||g||i. 6. ||c/|h = |c| 11/111, forceR. 7. If|/K|g|,then||/||i^||g|||. 8. If 11/111 < 00, then |/| < oo, ?-a.e.
352 N. Dinculeanu 9. Countable convexity. If (/„) is a sequence of ?-measurable functions and if the series ??=? f" IS convergent ?-a.e., then ?/- 77=1 DC <?«/·? I 77=1 2.2. Bochner integrability The Bochner integrability is similar to the classical integrability. 10. Dehnition. We say that an F-valued function / defined ?-a.e. on S is Bochner ?-integrable, if/ is ?-measurable and ||/|h < oo, i.e., / |/|^? < oo. If II/77 ~ /II1 ""*¦ 0- we say that /„ -> / in the mean. 11. DEHNITION. We denote by Ь\-(ц.) or ^(.??.?) or LF(S, ?,?), the set of all Bochner ?-integrable functions /: S1 —» F. defined everywhere on S. The Bochner integrable functions have the following properties: 12. A step function / = ]T"=| xAiXj with A, e ?(?) and *,· e F is ?-integrable iff A, e ?/(?) for each /'. 13. If feLFfo), then l/l eL1 (?). 14. ?-|?(?) is a vector space. 15. The mapping / ?» ||/|| ? is a seminorm on L\ (?). The topology defined by this seminorm is called the topology of convergence in the mean. 16. If / e LF{p) and g:S -> F is ?-measurable and satisfies |g| < |/|. ?-a.e., then geLj.-(/i). We can prove Lebesgue's theorem, before defining the Bochner integral; at this stage one can conclude only convergence in the mean. 17. Lebesgue's dominated convergence theorem. Let (/„) be a sequence from LF(p), /:S-> F a function and g e L] (?). Assume that (a) /, -> / ?-a.e, (b) |/n|<g, ?-a.e., for each ?. Then f & Ьр(ц) and fn -> / ш f/ге wea/г. For the proof, we apply Fatou's lemma to the sequences (|/„ - /| + 2g) and Bg — \fn-f\).
Vector integration in Banach spaces and application to stochastic integration 353 After we define the Bochner integral, we can add to the conclusion that / /„ ?? -> 18. The set Sf{Ej) of F-valued, ?/ -step functions is dense in ?^(д). In fact, for each /??[·(?) there is a sequence (/„) of ^/-step functions such that fn~*f pointwise, ?-a.e. and |/„| < |/|, ?-a.e. for every n; then we apply Lebesgue's theorem. 19. LF(?) is complete. Properties 2.37 through 2.41 in Chapter 2 remain valid for Bochner integrable functions. 2.3. The Bochner integral 20. For a Ef -step function f:S-> F of the form / = ]?"=| ??: jc, with A, e ?/ and Xi e F, we defined the integral j f ?? by the equality f " / /?? = ^2?(??)?? i = l Taking the sets A, mutually disjoint we have /1 \?\ = ????\??\, i=\ hence f " J |/|?? = ]??(?, )|л',| i = \ It follows that \j ??? <|?/|?? = |?/???· This inequality shows that the linear mapping L:Sf-~(Ef)^>F defined by L(f) = j ???, iorfeSy(Ef), is continuous for the seminorm ||/|| ?. Since the space Sy(E/) is dense in Lf(m) for the topology defined by the seminorm || /|| ?. one can extend L to a continuous linear mapping L*:L]F(u) -> F, and we still have |?*(/)|<ll/lli. for/eL^(M).
354 N. Dinculeanu For each function / e L\(?), the value L*(f) e F is denoted by / f ?? and is called the Bochner integral of / with respect to ?. The Bochner integral has the following properties: 21. ?/ + 8)?? = ?/?? + ?8??. 22. fcfdp = effdv,ioTceR. 23. |//^?|</|/??=||/|||. 24. If /„ -»¦ / in L[-(M), then / /„ ?? -»· / /??. This property allows us to add / /„ ?? ^> f /?? to the conclusion of the Lebesgue's theorem. If / e LF(u) and A e Г(д). then /?? e L j..(?). We denote, as usual, / /?? = / /????. 25. If / e LiF(?), the set function w : ?(?) -> F defined by m(A) = / /??. for Ae ?(?) J A is a ? -additive measure. 26. We can extend the Bochner integral / f ?? for F-valued, ?-integrable functions / defined ?-a.e. on S, If g; S -> F is a function such that / = g, ?-a.e,, and g e ?[·(?). then we define J f ?? = f g??, and the integral f /?? is independent on g. 2.4. The spaces ^{?) 27. Definition, Let 1 < ? < oo. We denote by Lt(?) the set of ?-measurable functions f:S -> F such that |/K> e L'(m), i.e.. such that /\/\??? < oo. For / e ?//-(?), we denote 11/11/, = (j\f\pd^ 28. ?^(?) is a vector space and the mapping / н» \\f\\P is a seminorm on it. The topology defined by this seminorm is called the topology of convergence in the mean of order p.
Vector integration in Banach spaces and application to stochastic integration 355 29. LPF (?) is complete and Sy(Ef) is dense in Lpt^), 30. Definition. We denote by ?^(?) the set of ?-measurable functions such that II/IIoc = inf{a: 0 ^ a ^ +oo, | f(s)\ < a, ?-a.e,} < oo, 31. L^?(?) is a vector space and \\f\\oc is a seminorm on it, for which it is complete. We have /„ ->¦ / in ?^(?). iff there is a ?-negligible set N С S such that /„ -> / uniformly on S\N. 32. If F = R. the set S(Z) of real valued. ?-step functions is dense in Loc(?) := If F is an infinite-dimensional space, the set Sy(E) is no longer dense is L^(?). The following property asserts the uniform ?-additivity and the uniform absolute continuity of the indefinite integrals of functions in a Cauchy sequence. 33. THEOREM. Let (/„) be a Cauchy sequence in V\ (?) with \ ?. ? <oo. Then: (a) The measures ?„ : ?(?) —» R defined by f \fn J A ?«(?)= / \/?\???. forAeE(u), J A are uniformly ? -additive, for n e N. (b) lim,J(A)^(,/A ???'^? = 0. uniformly for ? e N. (c) For every ? > 0, there is a set Sf e ? with MEf) < oo, such that / \&\???<?, for all ? eN. Js\sf Using Theorem 33, one can prove the Vitali theorem. 34. THEOREM (Vitali). Let (/„) be a sequence from ?^-(?) with 1 ^ ? < oo and f :S -> F a function such that f, -» f, ?-a.e. Then f e ?,?-(?) аиЛ /„ -»· / /n ?^-(?) /# the following conditions are satisfied: (b) Нт^д^о/д \fn\p ?? = 0. uniformly for ? e N. (c) For every ? > 0, i/itre /s a set SF ? ? with ?(??) < oo, such that ? |/„|/??<?, for all ? eN. Js\sf If ?E) < oo, condition (c) is automatically satisfied with Sf = S. From the Vitali theorem, we can deduce the Lebesgue theorem in Lp (?). 35. THEOREM (Lebesgue). Let 1 ^ ? < oo, (/„) a sequence from Lp(?), f: S -> Fa function and g e Lp^), such that
356 N. Dinculeanu (a) fn -> /, ?-a.e,, (b) l/«l^, ?-a.e., for each ?. Then f e L'^(m) and fn -> / /и ?/^(?). //> = 1, ?/геи / /„ ??? -> f f ??. Finally, we state Holder's inequality. 36. Let 1 < p, q ^ +oo be conjugate numbers, i.e.. ^ + ^ = 1. Let E, F, G be Banach spaces such that, ? ci(F,G). (a) If / e ЬрЕ(ц) and g e ?*-(?), then /g e ?^(?) and we have I/ /g^M Here, the product function /g: S" —» G is defined by (fg)(s) = f(s)g(s), iorseS. (b) Let TZ be a ring generating the <5-ring ?/. Then for every / e ?-^(?) we have l/ll /> = sup / \fg\d? = sup \f\\g\dp the supremum being taken for g e Sf (TZ) with ||g||9 ^ 1. (c) If G = R and / e ??-(?), then l/ll/. = sup / />?? / forge5/-(^)with||g||i/< 1. 3. Integration with respect to measures with finite variation In this section we present the third stage in the development of the integral. We study measures with finite variation and integration with respect to such measures. Integrability with respect to a measure with finite variation reduces to Bochner integrability with respect to the variation measure. We then study the indefinite integral and the Radon- Nikodym theorem for vector measures. For a more detailed account, the reader is referred to Dinculeanu A967) and B000). 3.1. Measures with finite variation The framework for this section is a ring TZ of subsets of S, a Banach space ? and an additive measure m : 7? —» E.
Vector integration in Banach spaces and application to stochastic integration 357 1. Definition. For every set Ac S, the variation of m on A is a number, finite or +00, denote by var(w, A) or m(A) and defined by var(w, A) = sup2_] \m(A-i)\ i€l the supremum being taken for all finite families (A,);e/ of disjoint sets from 1Z contained in A. If A e 7Z|0C, we write |m|(A) = var(w. A). We say that m has finite (respectively bounded) variation if var(w, A) < 00 for every A e TZ (respectively var(w, S) < 00). The variation of m has the following properties: 2. \m(A)\ <; |w|(A), for AeTZ. 3. If А С S, then var(w, A) ^ var(w, ?). 4. var(w, A) = 0iff m(?) = 0 for every В е П with ? С A. 5. w is superadditive: for an arbitrary family (A,)(e/ of disjoint subsets of S we have т(\^АЛ^^г}1{А,). ^i€l ' i€l 6. var(w -f ?, ?) ^ var(w. A) -f var(/z. A). 7. var(aw, A) = |or| var(w. A), for ? e R. 8. The variation \m\ :ll\oc -> [0, +00] is additive. If m is ?-additive on H, then |w| is ?-additive on Ti\ac. 9. Let ?: 72. -> R be a real-valued, additive measure. Then for every set А с S we have sup{^(fi)|: ВеП, BcA}^m(A)^2sup{|m(B)|: ? e 7г. ficA}. If ? is complex-valued, we replace 2 by 4. 10. A real valued, ?-additive measure ? : S -> R on a 5-ring ? has finite variation; if ? is a ?-ring, ? has bounded variation. 3.2. Integration with respect to a measure with finite variation The framework for this section and for the following sections of this paragraph consists of a 5-ring V of subsets of S, three Banach spaces ? с L(F, G) and a ?-additive measure m:V —* ? with finite variation |w|.
358 N. Dinculeanu Then I/я I is ? -additive on the ? -algebra P|OC in particular on the ? -algebra ? = ??(?>) generated by P, possibly with infinite values. Integrability with respect to w of vector-valued functions f:S—* F is reduced to Bochner integrability of / with respect to the positive measure |w |. 11. DEnNlTlON. We say that a set А с S is w-negligible (respectively w-measurable) if A is |w|-negligible (respectively |w|-measurable). We denote E(m) = E(\m\). We say a property ? is true w-a.e. if it is true |w|-a.e. We say that an F or R-valued function defined w-a.e. on S is w-negligible (respectively w-measurable, w-integrable) if it has the same property with respect to |w |. 12. For 1 $5 ? ^ +oo we denote LF(m) = LF{\m\) and endow LF(m) with the seminormof Lft-(|m|): \\?\\?=\\\?\\\?={\\?\? d\m\ for / e L^(w), if 1 ^ ? < oo and IIу Hoc = II I/1 Hoc- 13. The vector space LF(m) is complete for the seminorm Ц/Ц,,. 14. If 1 ^ ? < oo, the set Sf(T>) of P-step functions f:S^> F is dense in LF(m). However, if F is infinite dimensional, St (?) is no longer dense in Lf{m). 15. The Vitali and the Lebesgue convergence theorems are valid in L^(w). 16. The following assertions are equivalent for an w-measurable function /: S —> F: (a) / is w-integrable; (b) / is |w|-integrable; (c) l/l is |w|-integrable. 17. We can now define the integral j f dm for functions / e LF(m). For a P-step function / = ?''=| ?,??/ with A, e ? and х-, е F we defined the integral /· " / fdm = Yum(A-)xibG. J ?=? >//'
Vector integration in Banach spaces and application to stochastic integration 359 If the sets A, are mutually disjoint, then ? 1/1 = ? ??, I*, I i=\ and I/ fdm = Y^m(Ai).Xi k]T|w(A,)||.v,| ^Yi\m\(Ai)\xi\= { \f\d\m\ = \\fh. It follows that the linear mapping L : Sy{V) -> G defined by Ufi-ff*.. tofts*!». is continuous for the seminorm ||/lh . Since Sy(T>) is dense in LJ. (m), the mapping L can be extended to a continuous linear mapping L*: LF(m) —* G. The value L*F(f) of the extension for a function / e L j. (w) is denoted by j f dm and is called the integral of f with respect to m. 18. We still have \J fdm U f\f\d\m\=\\fh, for feL\(m). If fn -> / in L[-, then / /„ dm-> j fdm. 19. If / e L^(w) and Л e E(m), we have /хд е L\(m). We denote, as usual, / fdm= / fxAdm. 20. If / e L^(w), the mapping Л ы- fA fdm from 2T(iw) into G is ?-additive and lim [ \f\d\m\ = 0. 3.3. 77?e indefinite integral Let w : V -+ ? С L(F, G) be a ?-additive measure with finite variation |w | on a 5-ring P. and let ? be the ?-algebra generated by V.
360 N. Dinculeanu 21. ?????????. Let g e L\{m). We denote by gm : ? -> G the ?-additive measure defined by (gm)(A) — I gdm, for A e 2?. and we call it the indefinite integral of g with respect to m, or the measure with density g and base m, or the product of g and m. 22. If g e Lf(w), then |g| e L'(|w|) and we can consider the indefinite integral Ы|ш|:Г^К+: (Ы|ш|)(А) = f \g\d\m\, ??????. The measure gm has finite variation \gm\ satisfying \gm\ < |g||w|. 23. THEOREM. Let g e LF(m). If either g or m is real-valued, then we have equality \gm\ = \g\\m\. In particular, if ? is a positive measure and if g e L\ (?), then I#mI = IglM- For the proof, see Dinculeanu B000), Theorem 2.29. 24. Letg e L]F(m). ? \gm\ = |g||m| and if J gdm = 0, for every A e ?, then g = 0, w-a.e. We now state some theorems concerning integrability with respect to the measure mg. We consider first the case of real-valued measures. 25. THEOREM. Let ?:?> -> Ш be a ?-additive measure on a S-ring V with finite variation |?|. Let g e L]E^) and f: S —> Fa ? -measurable function. Then (a) / is \g\\?\-integrable iff f is g\x-integrable. (b) If f is g[L-integrable, then fg is ?-integrable and we have J fd(gp) = f fgd/i
Vector integration in Banach spaces and application to stochastic integration 361 and the associativity formula f(8H) = (/S)M- (c) If g is real valued, then f is g\x-integrable iff fg is ?-integrable. For the proof see Dinculeanu B000). Theorem 2.30. For vector valued measures we have a similar theorem: 26. THEOREM. Let m :T> -> ? С L(F', G) be ? ? -additive measure with finite variation \m\ on a S-ring V. Assume that either (i) g e L' (m) is a real-valued function and f: S -> F is ?-measurable, or (ii) g e L\{m) is a vector valued function and f:S -> R is a real-valued, ?- measurable function. Then: (a) / is \g\\m\-integrable iff fg is m-integrable. (b) If fg is m-integrable, then f is gm-integrable and we have / fd(gm)= / fgdm, and the associativity formula f(gm) = (fg)m. (c) If g is real valued, then f is gm-integrable iff fg is m-integrable. For the proof see Dinculeanu B000). Theorem 2.31. 3.4. The Radon-Nikodym theorem In this section we state a Radon-Nikodym-type theorem for vector measures with finite variation. This theorem is very useful in proving the equality \??\ = n ? for the measure ?? associated to a process X with integrable variation, in Section 8. 27. Let ?: Tl -> R+ be a positive, additive measure defined on a ring 7? and m :!?_--* ? an additive measure. We say m is absolutely continuous with respect to ? or д-^Ьяокиегу continuous and we write m <^?, if lim m(A) = 0. We state first the classical Radon-Nikodym theorem for real valued ? -additive measures on a X'-algebra.
362 N. Dinculeanu 28. THEOREM (Radon-Nikodym). Let ?:? ^ Ш+ be a positive ?-finite, a-additive measure and m : ? -> R a real-valued, ?-additive measure, such that m«/i. Then there is ? ?-measurable function g e L' (?) such that m = g?. i.e., L m(A)= I gdii, forAe?. 'a Then \m\(A)= J \g\dii. for ?&?. J A 29. Remarks, (a) The theorem is valid under more general conditions, for measures having the direct sum property, in particular for regular measures on a locally compact space. (See Dinculeanu A967), Theorem 5. p. 182.) (b) The theorem remains valid for vector-valued measures m: ? —> ? with finite variation \m\, provided that the space ? has the Radon-Nikodym property (RNP). A Banach space ? has the RNP. if for every finite measure space (S, ?, ?) and every ?- additive measure m : .?7 —» ? with finite variation \m\ such that m«/i, there is a function g e ??{?) such that m = g?, i.e., m(A)= I gd?, for A e ?. J a Then -L \m\(A)= / Igld/x. forAeT. 'a Examples of Banach spaces with the RNP are reflexive spaces and separable duals of Banach spaces. For a detailed study of the Radon-Nikodym theorem for vector measures, the reader is referred to Diestel and Uhl A977). We shall state now a more general and very useful version of the Radon-Nikodym theorem, without imposing the Banach space ? to have the RNP. But the density g is no longer measurable; it is only weakly measurable. 30. THEOREM (Radon-Nikodym). Let m: ? -> ? С L(F.G) be ? ?-additive measure with finite variation \m\on ? ?-algebra ? and ?: ? -> R+ a positive, ?-finite, ?-additive measure such that ш«д. Let ? С G* be a space norming for G. Assume F and ? are separable. Then there is a function Vm : S -> L(F,Z*) having the following properties: (a) For every ? e F and ? e Z, the function (V,„.x, z) is ?-measurable, ?-integrable and [m(A)x.z)= I {?,„?,?)??, for A e ?. J A
Vector integration in Banach .spaces and application to stochastic integration 363 (b) |V,„| is ?-measurable, ?-integrable and \m\ — \ ?,„\?. i-e., -L \m\(A)= / \?„\??, forAeE. 'a (c) ?/\??\ = ?, then\V„,\=\. (d) For every function f e L[(m) andevery ;e ? the function (Vmf, z) is ?-integrable and (/'H=/ (??,/.?)??. For the proof see Dinculeanu B000), Theorem 2.34. The theorem is valid without assuming F and ? separable; but the proof involves the lifting theory (Dinculeanu A967, Theorem 5, p. 269) and A. and С Ionescu Tulcea A969)). The following theorem is in a certain sense the converse of the preceding one. It is used to prove the existence of dual predictable projections of processes with integrable variation. 31. THEOREM. Let ?: ? -> R+ be a positive, ?-finite, ?-additive measure, ? С G* a space norming for G and U: S -> ? С L(F', G) a function having the following properties: (i) The function (Ux.z) is ?-measurable for every ? & ? and ? e ?; (ii) The function \U\ is ?-integrable. Then there is a ?-additive measure m: ? -> L(F, Z*) with finite variation \m\ satisfying the following conditions: (a) For every A e ?. ? e F and ? e Z, the function (pA(Ux, z) is ?-integrable and L·"-- [m(A)x,z)= / {??,?)??. (b) \m\(A)^fA\U\dii,fi>rAtEJ.e., \m\^\U\ii. (b') If F and ? are separable, then \m\ = |?/|?. (c) For f e 0?(\?\?) and ;e Z, the function (Uf, z) is ?-integrable andwe have {ifdm-?l (?/.?)??. (d) The measure m has values in L(F.G) if either G is separable, or G is the dual of a Banach space ? and we take ? = H. For the proof see Dinculeanu B000, Theorem 2.35) and Dinculeanu A967. Theorem 6, p. 274).
364 N. Dmculeami 4. Semivariation of vector measures In this section we study measures with finite semivariation, which will be used in the next section to define the integral with respect to such measures. We also study extension of measures from a given class to a larger one. For the proof of the results in this section, the reader is referred to (Dinculeanu B000), §4 and §7). 4.1. The semivariation The framework for this section is a ring ?? of subsets of S, three Banach spaces E, F,G such that ? с L(F, G) continuously and an additive measure m : Ш -> ? с L(F. G). 1. Definition. For every set А с S, the semivariation of m on A. relative to the embedding ? с L(F, G) (or relative the pair (F, G)) is a number, finite or +oo, denoted svar/-.c(w, Д) or /Mf.c(A) and defined by the equality ту.с(А) = sup ]Pm(A,).v, ie/ the supremum being taken for all finite families (A,),e/ of disjoint sets from TZ contained in A and all families (*,¦),¦ e/ of elements of F\. We say m has finite (respectively bounded) semivariation wu.g on 1Z of my.c(A) < oc for every A e1Z (respectively my.ciS) < oo). 2. Equivalently, the semivariation m y.c can be defined by the following equality: I ( I mf.c,(A) = sup / sdm\ where the supremum is taken for all 7^-step functions s: S -> F with \s\ ^ <рл- This alternative definition of the semivariation will be extended for any function / instead of ?д. The semivariation has the following properties. 3. If А с S then/и/-.с(А)^ш/-.с(А). 4. w/..G(A) = 0iff w (?) = 0 for every set ? e TZ with fie A. 5. mr.G(A) ? var(w. A). 6. If the embedding ? с L(F, G) is an isometry, then \m(A)\ ^fny.G(A), for A e 7г.
Vector integration in Banach spaces and application to stochastic integration 365 7. If m, ? : 1Z —> ? are additive measures and ael, then (m+n)F.G(A) ? ту.с(А)+пу.с,(А). (am)t G(A) ^ \a\mF.G(A). 8. The set function my.c is finitely subadditive on 1Z\0c- If m is ?-additive on 1Z, then ffif.G is ?-subadditive on TZ\ac. 9. Assume ? с L(F,R) isometrically. Then ™fm(A) — var(w. A), for any А с S. 10. We have If the embedding ? с L(F, G) is an isometry, then mR.? ^mF.G- 4.2. Semivariation and norming spaces We maintain as a framework an additive measure m : 7? —» ? с L(F,G) defined on a ring 7?, and a space ? с G* norming for G. 11. For each ? e ? we define the set function m:: 1Z -> F* by the equality (jc, w-(A))= (m(A)jt, c), for.? e F and AeK. Then w, is additive. If we consider the embedding F* = L(F, R). then by property 9 we have (w;)f.K = inz. The semivariation inF.G can be computed by means of the variations m-\ 12. PROPOSITION. For any space ? С G* norming for G and for every- set А С S we have wf.c(A) = sup m-(z). If F = R, the semivariation mm.? has special properties:
366 N. Dinculeanu 13. If m: 1Z -> ? — L(R, E) is an additive measure, then: (a) for every set А с S we have mK.?(A)^2sup{|m(B)|: Bell, В С A). (b) m is locally bounded (respectively bounded) iff m^jr(A) < oo for every A elZ (respectively wKi-( S) < oo). To say that m is locally bounded on 7? means that for every set A e TZ, m is bounded on the ring 72. П A. For the proof, we use a corresponding property for the variations m- and then we apply Proposition 12. 4.3. Semivariation of ?-additive measures The semivariation of ?-additive measures has additional properties. 14. Let m:T> -> ? С L(F, G) be a ?-additive measure on a <5-ring P. Then for any ? e G*, the measure /и- is ?-additive on ? and the variation \m:\ is ?- additive on the ?-algebra ? — ??(?>) generated by V. REMARK. It is possible that \m-\ is ?-additive for every с e G*, but the measure m is not ?-additive. 15. Let m-.Ti^f ? с L{F,G) be an additive measure on a ring TZ. Assume m can be extended to an additive measure m : V —> ? с L(F,G) on the <5-ring V generated by 1Z. If \m-\ is ?-additive for every ? in a space ? С G* norming for G, then svarf.c(w', A) = svar/-.c('«. A), for A elZ. The ?-additive measures on a <5-ring have always finite semivariation wjR.t. 16. THEOREM. Let m:T>^ ? be a ?-additive measure on a S-ring V. Then (a) m has finite semivariation wj..?. (b) /fS = (Jn S„ w/f/? S„ e P, then there is a positive, finite, ?-additive measure ? on V such that wr.? <? ? locally and ? ^ /и?..?. Remark. The local absolute continuity in ? ? <? ? means that for every set A e V we have lim wK.?(B) = 0. ?(?)->-0. BCA If ? is a ?-algebra, then taking A = 5, we obtain the usual absolute continuity.
Vector integration in Banach spaces and application to stochastic integration 367 4.4. The family m f z of measures 7. Let m : TZ —>¦ ? с L(F, G) be an additive measure with finite semivariation my.G on a ring TZ and ? с G* a space norming for G. We denote by w/. z the set of positive, finite measures {\m-\: z& Z\] defined on TZ. If m is ?-additive, then all measures \m:\ are ?-additive. We notice that the converse is not true. However, the converse is true provided that m y.z consists of uniformly ?-additive measures. 18. THEOREM. Let m:TZ—> R С L(F.G) be an additive measure with finite semivariation in f.G on a ring TZ and ? С G* a space norming for G. The following assertions are equivalent: (a) m ? c* is uniformly ? -additive on TZ. (b) »i f.z is uniformly ?-additive on TZ. (c) mFG{An) -^ Oas A„ j 0 in TZ. (d) fnFc(An) -> fflf.c(A) for any decreasing sequence (An) of sets from TZ with intersection A = P| A„ eTZ. (e) fflf,c(\) —>· 0 for any sequence (An) of disjoint sets from TZ with union A — Each of these assertions imply that m is ?-additive on TZ. The following three theorems give sufficient conditions for the uniform ?-additivity of mFG. 19. THEOREM. Let m:V -> ? = L(R. E) be a ?-additive measure on a S-ring V. Then the semivariation wr.? is finite and the set of measures mjj.p is uniformly ? -additive. 20. THEOREM. Let m : V —>¦ ? — L(F.G) be a ?-additive measure with finite semivariation in f.g on a S-ring V. If со (?_ G, then in f.G is uniformly ? -additive. 21. THEOREM. Ifm : TZ -> ? С L(F, G) is a ?-additive measure with finite variation \m\ on a ring TZ, then m f.g* is uniformly ?-additive. 4.5. Extension of measures In many cases, a measure is defined initially on a semiring and is first extended to the ring generated by the semiring. This is the case, for example, of a function g: R -> ? and the finitely additive measure mg defined for each interval of the form [a, b[ by mg([a,b[) = g(b)-g(a). The class of the intervals [a, b[ is a semiring V and we extend w? to an additive measure on the ring TZ = r(V) generated by V- This extension is done by using the following proposition:
368 N. Dinculeami 22. PROPOSITION. Let m: V -> ? or Ш+ be a finitely additive measure defined on_a semiring V. Then m can be extended uniquely to an additive measure m': TZ -> ? or M.+ on the ring TZ = r{V) generated by V. Ifm is ?-additive, then m' is also ? -additive. If A e TZ, then A = (J?=| ?, with ?, e V mutually disjoint. The measure m' is defined by /1 m'(A) = ]Pm(B,). i = l We present now some theorems about the extension of a ?-additive measure from a ring TZ to the <5-ring V = 8r(TZ) or to the ?-algebra ? = ??(??) generated by TZ. 4.6. Extension of positive measures 23. A positive, ?-additive measure ?:1?—> [0, + oo] on a ring TZ can be extended by the Caratheodory procedure to a ?-additive measure ?': S -> [0, + oo] on the ?-ring 5 = ar(TZ) generated by TZ. If supAe-^(A) = c, then 5????5?'(?) = с. If 5 is not a ?-algebra, we can perform a second ?-additive extension ?": ? —> [0, +oo] on the ?-algebra ? = ??G?) generated by TZ: ?"(?) = $\??{?'(?): AeSDB). for В е Г. If ? is finite on 7?, then ?" has the FMP. If S = Uw=i S« with Si e 7?, then ? = S and the second extension is no longer necessary. If ? is ?-finite on TZ, then ?' is the only ?-additive extension of ? from TZ to 5. However, ?" is not necessarily the only ?-additive extension of ? to ?. Example. Let #[о.ц and #[о.ц be the class of Borel subsets of [0. 1[ and [0, 1], respectively. Then ?[?. ? [ is a ?-ring in [0, 1 ] and the ?-algebra generated by it is B[o. ? ]. Let ? be a positive, finite, ?-additive measure on B\q. \ \. For any a ^ 0 we obtain a ?-additive extension ? : B[o, ? ] -> ? of ?, be setting ?{ 1} = a. To distinguish ?" from other extensions of ? to ?, we shall call ?" the canonical extension of ? to ?. To unify the language, we shall call ?' also the canonical extension of ? to S. The canonical extension of ? to X1 is the smallest ? -additive, positive extension of ?' to X\ We shall extend below the notion of canonical extension to vector valued, additive measures. We shall continue to denote ?' and ?" by ?. If ? is finite on 7?, then ? is finite on the 5-ring ? generated by 7?; if ? is bounded on 7?, then ? is bounded on S and on ?, and there is a set So e 5 such that ?(?) = 0 for every В e S with ? П 5? = 0 (Dinculeanu B000), Theorem 3.1). We mention the following property.
Vector integration in Banach spaces and application to stochastic integration 369 24. For every set A e ? with ?(?) < oo and for every ? > 0, there is a set В eTZ with ?(???) <?. This property is no longer true for an extension different from the canonical one. This property means that if ? is bounded, then TZ is dense in ? for the topology defined by the semidistance p(A, ?) = ?(???), which allows extensions of vector valued, ?-additive measure from TZ to ? (Theorems 25 and 26 below). 4.7. Extension of ?-additive measures A ?-additive vector measure m:TZ-* ? does not necessarily have a ?-additive extension on the <5-ring V = 8a (TZ). We present some theorems giving sufficient conditions for the extension of m. We consider first the extension of a measure that is absolutely continuous with respect to a positive measure. 25. THEOREM. Let m:TZ-* ? be ? ? -additive measure on a ring TZ and assume there is a positive, finite, ?-additive measure ? : 7U. —» Ш+ such that m <g ?. Then: (a) m and ? can be extended uniquely to ?-additive measures m' and ?' respectively, on the S-ring V — Sr(TZ) generated by TZ and we still have m' <K ?'. (b) If ? is bounded on TZ, then m and ? can be extended uniquely to ? -additive measures m' and ?' respectively, on the ? -algebra ? = ??(??) such that ?' is the canonical extension of ? and we still have m' «?'. If m has finite variation \m\, we can take ? — \m\ and obtain the following theorem: 26. THEOREM. Let m;lZ-* ? be a ?-additive measure with finite variation \m\. Then (a) m can be extended uniquely to a ?-additive measure m :T> —> ? with finite variation \m'\ on the S-ring V — Sr(TZ) and we have \m'\(A)=\m\(A), for A eTZ. (b) If m has bounded variation \m\ on TZ, then m can be extended to a ?-additive measure m': ? -> ? with finite variation \m'\ on the ?-algebra ? = ??A?) such that the variation \m'\ is the canonical extension of the variation \m\. 4.8. Canonical extensions In order to shorten the language, we give the following definition. 27. Dehnition. Let m: TZ -> ? с L(E, F) be an additive measure and ? с G* a norming space for G, such that for every ? e F and zeZ, the real-valued measure (m(-)x, z) is ?-additive and has finite (respectively bounded) variation \(mx, z)|(-)-
370 N. Dinculeami An additive extension m' :V -* L(F. Z*) (respectively m': ? -> L(F, Z*)) of m is called the canonical extension of m, if for every ? e F and ? e Z, the real-valued measure (m'(-)-*, z) is ?-additive and has finite (respectively bounded) variation \(m'x,z)\(-) which is the canonical extension of this variation \(mx.z)\(-)- One can prove easily that the canonical extension is unique, if it exists. If m is ?-additive and has finite (respectively bounded) variation \m\ and if m has a canonical extension m': V -» ? (respectively m': ? -> ?), then m' has finite (respectively bounded) variation \m'\ which is the canonical extension of \m\. 4.9. Canonical extension of additive measures Given an additive measure m:lZ -> ? С L(F,G) on a ring TZ, in order to apply the integration theory of Section 5 below, it is enough to extend m to ? finitely additive (not necessarily ?-additive) measure on the <5-ring V or on the ?-algebra ? generated by TZ. We have the following extension theorem for additive measures. We defined a measure m:TZ-* ? to be locally bounded, if for every set A eTZ, the restriction m : 1Z П A -> ? is bounded. 28. THEOREM. Let m :H -> ? с L(F.G) be an additive measure and ? С G* anorming space for G** (for example ? = G*). (i) Aiiwwe ?/?a? w is locally bounded (respectively bounded) and that for every ? e F and ? e Z, ?/ге real-valued measure (m(-)x. z) is ?-additive. Then there is a unique locally bounded (respectively bounded) additive measure m':T> -> L(F,Z*) (respectively m': ? -> L(F,Z*)) which is the canonical extension ofm. (ii) Aiiwwe f/га? w has finite (respectively bounded) semivariation m^.c a«^ that for each z& Z, the measure m-\TZ^- F* is ?-additive. Then m has a canonical extension m' :T> -> L(F, Z*) (respectively m': ? -> L(F,Z*)) with finite (respectively bounded) semivariation m'FZ,, such that for each ? e Z, ?/ге measure m'.:V -* F* (respectively m'.: ? -> ?*) /? ?-additive and has finite (respectively bounded) variation \m'.\ and we have m'FZ,(A) = in f.g(A), forAelZ. If ? = G*, then m' takes on values in L(F, G**) and we have mFG**(A) = ту.с(А), forAeTZ. If G = D*, where D is a Banach space and if we take ? = D, then m' takes on values in L(F, D*) and we have ™'f.d*(A) = ™f.d*(A). forAeTZ.
Vector integration in Banach spaces and application to stochastic integration 371 For the proof see Bongiorno and Dinculeanu B001), Theorems 3.6 and 3.7. The following theorem gives conditions that ensure that the canonical extension is ?- additive. 29. THEOREM. Let m :?? -> ? С L(F.G) be an additive measure and ? С G* a space norming for G. Assume cq <? E, that m is locally bounded (respectively bounded) and that for every ? e F and ? e Z, the real-valued measure {m(-)x.z) is ?-additive. Then m is ?-additive and has ? ? -additive canonical extension m :T> -> L(F,G) (respectively m : ? -> L(F.G)). For the proof, see Bongiorno and DinculeanuB001). Corollary 3.9. 5. Integration with respect to a measure with finite semivariation 1. The framework for this section is an additive measure m.V -> ? С L(F,G) with finite semivariation mF.G on a <5-ring V and a space ? с G* norming for G, such that for each ? e Z, the measure m::V -> F* is ?-additive. ? = ??(?>) is the ?-algebra generated by V. We do not assume that m is ?-additive. In this section we present the fourth stage in the development of the integral J f dm, for functions /: S -> F. For this purpose we define a seminorm my g (/) for such functions, then the space TF G(m) of measurable functions / with w/-\g(/) < сю and then the integral f fdm e Z* for functions / e Ty,c(m). This is the most important part of the chapter. In fact, most interesting vector measures do not have finite variation, but may have finite semivariation. This is the case, for example, of the stochastic measure ?? associated to a summable process X, even if X is real-valued. We shall apply the integration theory of this paragraph to obtain the stochastic integral. Some of the results are valid under additional conditions such as: (a) mF C(S) < oo, or (b) S=\JS„, with S„ eP, or (c) т is ?-additive. Conditions (a) and (b) are satisfied if ? is a ?-algebra. But most results are valid without imposing these restrictions and can be used in a wider range of applications, such as the integral representation of Gaussian measures or the Riesz representation theorem. If the reader is not interested in this generality, he or she can assume from the very beginning that ? is a ?-algebra and in is ?-additive. For the proof of the results stated in this section, the reader is referred to (Dinculeanu B000), Section 5). 5.1. Measurability with respect to a vector measure We define first negligible sets and functions with respect to m.
372 N. Dinciileanu 2. A set A e ? is said to be m-negligible if m(B) = 0 for every В e ? with В С A. If follows that a set Л e ? is m-negligible iff |m|(A) = 0, iff w/. c(A) = 0, iff A is m;-negligible for every jeZ. 3. A set А С S is said to be m-negligible if it is contained in an m-negligible set В е ?. A countable union of m -negligible sets is again m -negligible. If А С S is m-negligible, then A is m--negligible for every ? e Z. Conversely, if А С S is mc-negligible for every ? e ? and if ? is separable or if the measures \mz\ with ? e Z| are uniformly ?-additive, then A is m-negligible. 4. A property valid outside an m-negligible set is said to be valid m-almost everywhere (m-a.e.). 5. A function /: S —>¦ D or R+ is said to be m-negligible if / = 0, m-a.e. An m-negligible function / is m_--negligible for every ;eZ. Conversely, if / is m-- negligible for every ? e Z, then / is m-negligible if either / is ^-measurable, or ? is separable or the measures \m-\ with ;eZ| are uniformly ?-additive. We define now measurability with respect to m. 6. A function /: S —>¦ D or R+ is said to be m-measurable if it is equal m-a.e. to a ?- measurable function, i.e., if there is a sequence (/„) of D or R+-valued, ?-step functions, converging to /, m-a.e. Moreover, we can choose the functions /„ such that |/„| ^ |/|, m-a.e., for each n. If / is m-measurable, then it is m;-measurable for every ze Z. Conversely, if / is m-- measurable for every ? e Z, then / is m-measurable, provided that ? is separable or the measures \mz\ with ? e Z\ are uniformly ?-additive (Dinculeanu B000), Proposition 5.5). 5.2. The seminorm m pc(f) The alternative definition of the semivariation given in 2 of Section 4 is extended now for functions. 7. DEnNlTlON. For every function f:S^> D or R we define I f I wF.c(/) = sup / sdm\ where the supremum is taken for all iT-step functions ?: S -> D or R with |i| ^ |/|. Compare with Definition 8 of Section 2 in Chapter 2, of the integral of positive functions. We can compute mFG(f)m terms of the measures \m-J. Compare with Proposition 12 of Section 4.
Vector integration in Banach spaces and application to stochastic integration 373 8. If /: S —>¦ D or ? is w-measurable, then wV.c(/) = sup{ j\f\d\mz\: zezA. If the spaces F and G are understood, we shall write w instead of w f.g ¦ We have the following properties of wV.c(/) for w-measurable functions /: 9. (Kw(/)^oo. 10. w(/) = w(|/|). 11. w(/) = 0 iff / = 0, w-a.e. 12. If l/l = |g|, w-a.e., then w(/) = w(g). 13. If l/l ^ |g|, w-a.e., then w(/) ^ w(g). 14. m(f + g)^m(f) + m(g). 15. w(a/) = |a|w(/),fora eK. 16. MONOTONE CONVERGENCE THEOREM. // (/„) is an increasing sequence of positive, m-measurable functions /„ : S —» E+, then w(sup/„) = supw(/„). 17. For every sequence (/,) of positive, w-measurable functions /„ : S -> E+ we have (ОС \ DC n=l / n=\ 18. FatOU'S LEMMA. It (/„) /? ? sequence of positive, m-measurable functions, then wiliminf/,,) ^ liminfw(/„). \ n^oo / л^зс 19. If / is w-measurable and с > 0, then w({|/|>c})<~w(/). 20. If /: S -> E+ is w-measurable and w(/) < oo, then / < oo, w-a.e. 21. If /: S -> D or Ё is w-measurable, then the set {/ ? 0} is contained in the union of a sequence (A„) from ? with w(A„) < oo.
374 N. Dinculeanu 5.3. The space ??(??у q) We define now a space similar to the space L' of integrable functions. 22. Definition. We denote by Toinn.G)· the set of all m -measurable functions /: S -> D with yhf c(f) < oo. If the spaces F and G are understood, we shall write To(m) instead of fo(mf.c). The functions / e ? Dim) are called w-integrable functions. 23. For each г eZ| we have /|/|d|m:| ^ mt.c(f)· hence То(т) С L[,(w:), therefore ^о(»я)Ср|^{,Aя:.). If ? is a closed subspace of G*. then we have the equality TD{fu)=^L^D(m:). ;eZ 24. To(m) is a vector space and mt.G(f) is a seminorm on it. We shall consider on To(m) the topology defined by this seminorm. 25. THEOREM. // (/„) is a Cauchy sequence in L}-,(/??). then there is a function f e Td(™) and a subsequence (flk) such that: (a) fn~* f in Fn(m). (b) fk —> f m-a.e. and in To{m). 26. COROLLARY. The space То(т) is complete for the seminorm in. 27. Definition. Let /,,, /: S -> D be w-measurable functions, ? e N. We say that the sequence (/„) converges in m-measure to /, if for every ? > 0 we have nlim)i»i({|/B-/|>e})=0. 28. Proposition. Let f„,f e TD(m),n e N. Ifm(f„ - f) -> 0 then f„ -> / in m- measure. 29. Remark. The set of ?-step functions of То(т) is not necessarily dense in JFD(m). If the measures (m-) with ;eZ| are uniformly ?-additive, then the set of iT-step functions of To{m) is dense.
Vector integration in Banach spaces and application to stochastic integration 375 30. We shall denote by Td(B, in) the closure of the set of bounded functions of Td(™)- This set has properties similar to the usual L' -spaces, especially concerning the Lebesgue theorem. 5.4. The integral In the particular case D = F, we can define the integral j f dm for functions / e ^Fimr.G) and the integral belongs to Z*. To simplify the notation we shall write Tf.c(m) or Ту.с(т) instead of ^"f(w/-.g)· 30. The construction of the integral / f dm is done in the following way: Let / e TF.c(m)- Then / e L\(\m:\) for every ;eZ. Since m-.? -* F* has finite variation |m-|, the integral f fdmr is defined in the sense of the third stage and J f dm- e R. The mapping ? н> / / dm- is a continuous linear functional on Z: \ifdm-\4 fdm-\<i I \f\d\m:]<:\z\m(f). We denote the linear mapping / h-> f f dmz by / f dm and call it the integral of with respect to m. We have j f dm e Z*. (//<».*)-/. fdm- and I/ /iiw ^mF,G(f). From this last inequality it follows that the mapping / h->- f f dm from Ту,с(т) into Z* is continuous for the topology of TF.dfn). 31. If we take ? = G*, we have / f dm e G** for / e Ту.о(т). Let us denote, for the moment, by (Z) j f dm the integral corresponding to the norming space ? с G*. If Z, Z' are two subspaces of G*, norming for G, and if ? с ?', then the integral (Z) f f dm is the restriction to ? of the integral (Z') / dm. We have therefore /(?) ? /Лш, z\ = /(?') ? /Лш, zl for z e ?. In particular, /(?) ? fdm,z\ = l(G*) f fdm,z\, for ? e ?.
376 N. Dinculeami 32. We are particularly interested in the case when j f dm e G. This is evidently the case if G is reflexive and we take ? = G*. If the measures \m-\ with ? e Z\ are uniformly ?-additive, then f f dm e G for every bounded function / e Тг.в(т). Assume m has finite variation \m\. Then m has also finite semivariation mr.G- We can consider the space LF(m) = LF(\m\) of functions /:S -» F which are m-integrable, in the sense of stage 3. For each ? e G* we have ]w:] ^ \m\, hence mF^{f) ^ / |/]d]m| = 11/111, the norm in LF(m). It follows that LF(m) с TF.c(m) and the embedding is continuous. If / e i^(m), then J f dm is the same, whether we consider / in LF(m) or in •^f.g('w). 5.5. Convergence theorems We state first the analog of Egorov's theorem. 33. THEOREM (Egorov). Assume that the measures \mz\ with ? e Z\ are uniformly ?- additive. Let /„, /: S —» D, ? e N, be m-measurable functions such that /„ -> / m-a.e. Then: (a) For every ief A e ? w/7/г m(A) < oo and for every ? > 0, ?/геге ?'? a set В e ? with В С A such that m(A \ ?) < ? and fn -> / uniformly on B. (b) /n —> / /и m-measure. Uniform convergence implies convergence in To(m). 34. THEOREM. Lei (/,) foe ? sequence from To{m) converging uniformly to a function f-.S^D. Assume there is a set A e ? with m(A) < oo iwc/г г/шг all functions f„ vanish outside A. Then f e TD(m) and m(f„ - f) -> 0. If D = F, ?/геи j fndm-> f f dm. 35. THEOREM (Vitali). Lei (/„) foe ? sequence from Td(™) and f;S-*Danm- measurable function such that: (a) /n —> / /и m-measure, or (a') /„ -> / m-a.e. and the measure \m-\ with z& Z\ are uniformly ?-additive; and (b) lirrim(A)^o rn(fn<PA) — 0, uniformly for ? e N. 77геи / e То(т) and m(fn — f) —»¦ 0. //D = F, ?/геи f fndm-> f f dm. For the Lebesgue's theorem, we have to restrict ourselves to the space Td(B, m). 36. THEOREM (Lebesgue). Assume S eT>. Let (/„) foe ? sequence from TD(B,m), f e То(т) and g e Jk(i3, m) ? positive function. Assume that: (a) /„ -> / /и m-measure, or (a') /„ -> / m-a.e. and the measures \m-\ with ? e Z\ are uniformly ?-additive; (b) |/n| ^ g, m-a.e., for each ? e N. Г/геи / e .FD(fi, w) шЛ w(/„ - /) -> 0. If ? = F, ?/геи / /„ dm -> / /dm.
Vector integration in Banacli spaces and application to stochastic integration 377 5.6. The indefinite integral of measures with finite semivariation The definition of the indefinite integral is the same as for measures with finite variation. 37. Dehnition. Let g e TFG(m). We denote by gm: ? -> ?* the additive measure defined by (gm)(A) — I gdm, for A e ?, and we call it the indefinite integral of g with respect torn, or the measure with density g and base m. The measure gm is not necessarily ?-additive; but for each ? e Z, the measure (gm)z is ?-additive and we have (gm)z = gmz. The following proposition gives sufficient conditions for the ?-additivity of gm. 38. Proposition. (a) Assume the measures \m:\ with ? e Z\ are uniformly ?-additive. Then f f dm e G, for every f e Ty.ciB, m). In particular f f dm e ? for every f e T^,yiB,m). (b) Assume со <t G. Then f fdm e G. for every f e Ty.Gi.rn). In particular, if со <? ?. then f f dm e E, for every f e T^.yim). (c) If g e Ty,cim) and fAgdm e G for every A e ?, then the measure gm is ?- additive. In particular, if g e Jr.?(w) and fAgdme ? for every A e ?, then the measure gm is ?-additive. We state first the associativity formula in case the density is real-valued. 39. THEOREM. Assume m is ?-additive and S = (J?L| Sn with Sn eT>. Let ?: S -» К be a real-valued, m-measurable function such that wr,?(<p) < сю and my ?(?) < oo and assume that fA ? dm e ? for every A e ?. Then: (a) The measure <pm is ? -additive and has finite semivariation i<yrn)y.G- (b) If f ^ 0 is ?-measurable, then i<pm)m.Eif) = m'm.Ei<pf) and i<Pm) y.Gif) = m F.Gi<Pf)· (c) We have f e Ty.cifm) iff?? e Ty.cim) and m tms case we have / fd(<pm)= / ?????
378 N. Dinculeanu and the associativity formula f((pm) = (f<p)m. Now we state the associativity formula for vector-valued densities. 40. THEOREM. Assume m is ?-additive and S = U«t ? S" with Sn eV.Let f e :Ff G (m) and assume fA f dm e G for every A e ?. Then: (a) The measure fm is ?-additive and has finite semivariation (frriMG on ?. (b) If ? ^ 0 is ?-measurable, then (c) If ? is real-valued and ?-measurable and if ?? e Tf.g(™), then ? e Jk.c(/W) and we have I tpd(fm) — I (pf dm and the associativity formula (p(fm) = (<pf)m. 5.7. Integral representation of linear operations on L'' -spaces As a first application of the integration theory presented in the section, we state the following theorem. 41. THEOREM. Let (S, ?, ?) be a measure space, ? t the S-ring of the sets A e ? with ?(?) < oo, F, G, Banach spaces, 1 ^ ? < oo, and U : ?^-(?) -> G a continuous linear operation. Then there is a ?-additive measure m : ?/ -> L(F, G) with finite semivariation mF.G such that: (a) mF.G(A) < ||t/||M(A)'/>\/0r A e ?,; (b) fnFG(f) ^ \\U\\ \\f\\p ^ +00 for ?-measurable functions f: S -> F; (c) LPF^)cTF.G(my, (d) U(f) = ffdm,forfeLpFfa). If U is an isometry, then: (a') wf.G(A) = M(A)l//,,/orAeX'/; (b') mfc(f) — WfWp ^ +oo,/or ?-measurable functions f: S -> F; (c') LF^) = !FF.G(m), isometrically. The measure w is defined for every A e ?/ by m(A)* = 1/(^дл), for * e F.
Vector integration in Banach spaces and application to stochastic integration 379 5.8. The Riesz representation theorem The integration theory of this section can also be used to state and prove the Riesz representation theorem: 42. THEOREM. Let К be a compact Hausdorff space, B(K) the ?-algebra of the Borel subsets of К and Cf(K) the space of continuous functions f : К -> F. endowed with the sup norm. Let U :Cf(K) -> G be a continuous linear operation. Then there is an additive measure m :B(K) -> L(F, G**) such that U(f)=ffdm, forfeCy(K) and \\U\\ = mFmC,.(K). Moreover, for each ? e G*, the measure m- :B(K) -> F* is regular, ?-additive and with finite variation. For the proof see Foias. and Singer A960), Dinculeanu A967) and Diestel and Uhl A977). There are cases when the measure in in the above theorem is neither ?-additive, nor regular, nor with values in L(F, G). It is an open problem to give a characterization of the linear operations U for which the corresponding measure has one or more of the above- mentioned properties. There are partial answers to this problem. 43. If U :Cu(K) -> G is weakly compact, then the corresponding measure m is ?- additive, regular and has values in G. For a detailed presentation of this case see Diestel and Uhl A977). 44. An operation U: Cf (К) -> G is said to be dominated, if there is a positive, regular Borel measure ? on К such that |?/(/)|< ?\?\??, for/e СИ*)· If U :Cf(K) -> G is dominated, then the corresponding measure m is ?-additive, regular, with values in L(F, G) and with finite variation. For a complete presentation of this case see Dinculeanu A967). 45. A continuous linear functional U:Cy(K) -> ? is dominated, therefore, the corresponding measure m : B(K) -> F* is ?-additive, regular and with finite variation. 6. The Stieltjes integral In this section we study the functions with finite variation or semivariation on the real line and define the Lebesgue-Stieltjes integral with respect to such functions, as an application of the integration theory presented in Section 5.
380 N. Dinculeanu 6.1. The variation and the semivariation of a function Let g :E -> ? с L(F, G) be a function. 1. Definition. (a) For any interval /, the variation of g on / is a number denoted var(g, /) and defined by var(g, /) = sup]T|g(r, + |)-g(r,)|, the supremum being taken for all finite divisions d: fy < f ? < · - · < f„ consisting of points from /. (b) For any interval /, the semivariation of g on /, relative to (F, G), is a number denoted by svarf.c(g, /) and defined by svarf.cC?. /) = sup J>(',- + i) - *('¦¦)>¦¦ the supremum being taken for all finite divisions d: tQ < t\ < - - ¦ < t„ consisting of points from / and all families (xq, x\ ?»-?) of elements from F|. 2. It follows that for any interval / we have 0 ^ svarf.G(g, /) ^ var(g, /) ^ +oo. 3. If / С J are intervals, then var(g, /) ^ var(g, J) and svaiF.Gig-1) < svar/.-.c(g. ·/)· Sometimes the variation and the semivariation are equal: 4. PROPOSITION. Assume ? С L(F, R) isometrically. Then, for every interval I we have svarf.R(g,/) = var(g,/). The relationship between the semivariations corresponding to different embeddings is stated in the following proposition: 5. PROPOSITION. For any interval I we have s\aiF.G(g, I) < svar?..p.(g, /) = var(g, /). If the embedding ? С L(F, G) is an isometry, then for any interval I we have svarR.?(g, /) ^ svarf.cC?. /)-
Vector integration in Banach spaces and application to stochastic integration 381 6.2. Semivariation and norming spaces Let g: ? -> ? с L(F,G) be a function and ? С G* a norming space for G. For every ? e ? consider the function g;: ? —» F* defined by [x,gz(t)) = [g(t)x.z), fori eEandjc e F. Considering g-: ? -> F* — L(F,G), by Proposition 4. for any interval / we have svarf.R(g-, /) = var(g;, /). The semivariation of g can be computed by means of the variation of the functions g:: 6. PROPOSITION. For any interval I we have svarf.cC?, /) = sup var(g-, /). In particular, for any t e ? we have svarf.G(g, ]-oo, t]) = sup (gz, ] - oo, i]). I--KI 7. It is convenient to denote |g|@ = var(g,]-oo,r]) and g/.-.c@ = svarf.G(g,]-oo.r]). |g| is called the variation function of g and g/.-.c is called the semivariation function of g, relative to (F, G). If F and G are understood, we shall write g instead of gr.G- We have the following properties: 8. |g| and g are increasing. 9. gf.c^lgl· 10. If ? CL(F,E) is an isometry, then gFM=\g\- 11. If g is right continuous, then \ar(g,[a,b]) = vai(g,(a,b]) and svar(g, [a, b]) = svar(g. (a. b]). 12. If a < b, then \g\(b)=\g\(a)+v<iT{g,[a,b]).
382 N. Dinculeanu 13. If g has finite variation function |g| and if a < b, then \g(b) - g(a)\ < \g\(b) - \g\(a) = var(?, [a, b]). 14. If g has finite semivariation function g and if a < b, then g(b) - g(a) ^ svar(g, [a, b]). 15. If g: R —> R is increasing, then |g|(fe)-|g|(fl) = g(fe)-g(fl). An important property of g is the right or left continuity, which ensures that the measure m4 (defined in the next section) is ?-additive. The right or left continuity of g is inherited by its variation function |g|. 16. THEOREM. Assume g has finite variation function \g\. Then g is right (respectively left) continuous iff\g\ is right (respectively left) continuous. 6.3. The measure associated to a function We denote by TZ the ring generated by the intervals of the form ]a,b]. Each set A e TZ is a finite union of disjoint intervals ]a,·, bj], 1 ^ i ^ n. Letg:R^ ? С L(F, G) be a function. 17. We associate to g a finitely additive measure mK : H -+ E, defined first for intervals ]a, b] by mg]a,b] = g(b)-g(a) and then extended by additivity to the whole ring TZ. We have the following properties: 18. For two functions g. g': R -> ? we have ms = m4· iff g — g' is a constant. 19. If g : R -> R is real-valued, then m K > 0 iff g is increasing. 20. For every ? e G we have In particular, for every л* е ?* we have i*m( = mx*K.
Vector integration in Banach spaces and application to stochastic integration 383 21. If g is right continuous, then for every interval / we have var(w?, /) = var(g, /) and svar/.-.G(«i(., /) = svar/.-.c(g, /)· 22. If g has finite variation function \g\ and if g is right continuous, then \mg\ = mlgl. The following two theorems ensure the extension of the measure nig to the <5-ring V or to the ?-algebra В{Ш). We consider first the case where g has finite variation function |g|. 23. THEOREM. Assume g:R -> ? is right continuous and has finite (respectively bounded) variation function \g\. Then ffij can be extended uniquely to ? ?-additive measure m.V —> ? on the S-ring V of bounded Borel sets (respectively in :B(R) -> E), with finite variation \m\ and the variation \m\ is the unique extension of the variation \m,,\ = m\g\. For functions with finite semivariation we have a similar result. 24. THEOREM. Let g : R -> ? с L(R. G) be a function with finite (respectively bounded) semivariation gu.c ???" ? С G* a norming space for G** (for example, ? = G*). Then the measure m ч :??—* L(F, ?*) has finite (respectively bounded) semivariation my.G- (i) Assume that for every ? e F and ceZ, the real-valued function (g(-)x, z) is right continuous. Then: (a) For every ? e F and ? e Z, the real-valued measure (m4(-)x, z) is ? -additive and has finite (respectively bounded) variation. (b) The measure m(, has a canonical finitely additive extension m:T> —> L(F,Z*) (respectively m:B(R) -> L(F.Z*)) with finite (respectively bounded) semi- variation m r.z- (ii) Assume that for every ;eZ, the function g:: R -> F* w n'g/?? continuous. Then, for every ? e Z, the measure m- :T> -> L(F. Z*) (respectively in: :B(R)-> L(F.Z*)) is ?-additive and has finite (respectively bounded) variation. IfZ = G*, then m takes on values in L(F. G**). IfZ = G = D*, where D is a Banach space, then m takes on values in L(F, D*). Ifco<?E, then mg is ?-additive. For the proof, see Bongiorno and Dinculeanu B001), Theorem 4.21. 6.4. The Stieltjes integral Let g:R -+ ? с L(F,G) be a function and let mg:Tl -> ? be the finitely additive measure associated to g.
384 N. Dinculeanu 25. We define first the Stieltjes integral f fdg in case g is right continuous and has finite variation function |g|. In this case, by Theorem 23, the measure mx can be extended to a ?-additive measure m:V -> ? with finite variation \m |. We shall denote m still by wg; we have We can consider the space LF(mg) = L\(\mg\), in the sense of stage 3 of the development of the integral. We shall denote LF(mg) by LF(g). For every / e LF(g) we define the Lebesgue- Stieltjes integral f fdg by the equality / fdg= / /d»v If feLF(mR), then |/| e L1 (wi|.?|) and we have \f\dm\g\. therefore \J fdgU f\f\d\g\. 26. We consider now the case where g has finite semivariation function (g)-$..E and there is a space ? с G* norming for G** (for example ? = G*), such that for each ? e Z, the function gc: R -^· F* is right continuous. Then, by Theorem 24, ш, can be extended to an additive measure mg :T> -> L(F, Z*) with finite semivariation (w?)/.-.z* such that for each ? e Z, the measure (m?): is ?-additive. We can consider then the space TF.z*(m$) defined in Section 5. We denote J7F.z*(m<>) by Fy.z*(g) and for every function / e Fy.z'ig) we define the LebesgueStieltjes integral f f dg by the equality / fdg= / /dwi? ,, eZ*. We have then fdgU(m!!)F.z.(f). If ? = G*, then I/ j fdg = J fdmxeG**.
Vector integration in Banach spaces and application to stochastic integration 385 If, in addition, cq <? G, then J fdgeG. 27. Assume g is right continuous and has finite variation function |g|. Then g has also finite semivariation function gr.G relative to any embedding ? с L(F, G). The measure mg has finite variation |w?| = m^ and finite semivariation (in^r.G- We have LxE(mg) CTV.c^i;)· For / e LF(mg), the Stieltjes integral f f dg is the same, whether we consider / in LF(mg) or in TF.c(mg). 28. We could consider the semiring V of the intervals of the form [a, b[ and the ring Ti! generated by V, and define the measure т':Т1'—> Е by m'g[a,b[ = g(b)-g(a). Then m' is ?-additive and has finite variation (respectively finite semivariation) iff g is /e/r continuous and has finite variation (respectively finite semivariation). If w'? has finite variation, then w' can be extended to a ?-additive measure m' with finite variation on the <5-ring V. If m has finite semivariation and if for every ? e Z, the function g,- is left continuous, then w' can be extended to an additive measure m' with finite semivariation on V, such that for every ? e Z, the measure m'_ is ?-additive. If we start from a function g: R -> ? with finite variation but not necessarily left or right continuous, it is more appropriate to define the measure m:TZ —>¦ ? by m]a,b] = g(b+) - g(a+) and the measure m': 1Z -> ? by w'[a>[ = g(fo-)-g(a-). Then both measures are ?-additive and can be extended to the same ?-additive measure on V. In fact, the function g+(t) = g(t+) is right continuous and the function g~(t) = g(t—) is left continuous and we have m]a,b] = mg+]a,b] and m'[a,b[ = mg_[a,b). Similar considerations can be made in case g has finite semivariation. But in this case. g does not necessarily have lateral limits in G. However, there are elements G(t+) and g(t—) in G** such that for every ? e F and ? e G* we have lim(g(s)x,z)=(g(t+)x,z) and \imig(s)x,z}= (g(t-)x,z). sit iff
386 N. Dinculeanu 7. The stochastic integral The main application of the integration theory with respect to a measure with finite semivariation is the stochastic integral. 7.1. Notations and definitions The reader is supposed to be familiar with the general theory of stochastic processes, as presented in any book, for example, in Dellacherie and Meyer A975-1980). We present below a few definitions and notations that will be used in the sequel. 1. E, F, G are Banach spaces with ? с L(F, G). 2. (?,?, ?) is a probability space. The P-negligible (respectively P-integrable) sets or functions are called, simply, negligible (respectively integrable). Instead of P-a.e. we shall write a.s. (almost surely). The space LF(P) with 1 ^ ? ^ oo will be denoted by L!'F. A set ? с R+ ? ? is called evanescent if it is contained in a set of the form R ? A with А С ?, negligible. {Tt)t€r+ is a filtration, i.e., each Tt is a ?-algebra contained in ? and ? С Tt if s ^ t. We assume the filtration satisfies the usual conditions, i.e., Tt = C\s>t ^ for every t ^ 0 and each Tt contains all the negligible sets. _ A stopping time (or optional stopping time) is a function ? : ? -> E+ such that {T ^ t) e T, for every t > 0. If 5 ^ ? are two stopping times, we define the stochastic interval ]S,T] = {(t,w)eRx ?: S(w) < t ? T(w)\. Other stochastic intervals, [S, T[, ]S, T], [S, T] are defined similarly. The graph of a stopping time ? is [?] := [?, ?]. 3. 1 ^ ? < oo. From ? с L(F, G) we deduce L? с L(F, Lpc). 4. 7? is the ring of subsets of R+ ? ? generated by the semiring of predictable rectangles {0} ? A with A e Tq and (s,t] ? A with Л е Ts. The ?-algebra generated by TZ is called the predictable ?-algebra and is denoted by V¦ The predictable ?-algebra is also generated by the adapted, left continuous, real valued processes (adapted processes are defined below). A T-measurable process is called a predictable process. A stopping time ? is said to be predictable if the stochastic interval [T, oo) is predictable. The optional ?-algebra О is the ?-algebra generated by the adapted, right continuous processes. It is also generated by the stochastic intervals ]S, T] with S ? ? simple stopping times. An 0-measurable process is called an optional process. Other ?-algebras of interest are Tt and Tt-, where ? is a stopping time. Tt is the ?-algebra of the sets A eT with А П [T ^ t] e T, for every t ^ 0. The ?-algebra Tt- is generated by the sets of the form А П {t < ?} with t ^ 0 and A e Tt.
Vector integration in Banach spaces and application to stochastic integration 387 5. A function F:I+xi2-»fl, with values in a Banach space D is called a process, or a stochastic process. We say ? is adapted if for each t > 0, the random variable Y, is ^-measurable. ? is cadlag if for each ? e ?, the path t -» Yt (?) is right continuous and has left limits. In the sequel, X : E+ ? ? -> ? will denote a cadlag, adapted process with X, e i,? for every t ^ 0. We consider X automatically extended ????? with X, = 0 for t < 0. Then Xo- = 0. We extend also the filtration with Tt = JT() for t < 0. Let ? : R+ ??-> D be a process and ? a stopping time. We denote by Hj the function defined by ??{?) = #7-(?)(?) for ? e ? with ?(?) < ??, and by HtI\t<oc) the function equal to Яг on {T < oo) and equal to 0 on {T = oo}. The stopped process HT, obtained by stopping the process ? at the time ? is defined by #,r = Н,лТ. The process #r~ obtained by stopping the process ? before ? is defined by Hj" = H, if t < ? and Я,7^ = HT^ iit^T. A process ? is said to be a modification of a process ?, if for every t > 0 we have ^ = ?,, a.s., the negligible set depending on t. A process ? is said to be evanescent if the set {? ? 0} is evanescent. Two processes ? and Z, with values in the same Banach space, are said to be indistinguishable, if ? — ? is evanescent. 7.2. The measure ??. Summable processes Let 1 ^ ? < oo. Let X :R+ ? ? -> ? с L(F. G) be an adapted, cadlag process with X, e L? for every t ^ 0. 6. We associate to X the additive measure ??.??-* L!'E с L(F, LPG) defined, first, for predictable rectangles by /*({()} ? A) =lAXo, forAeJb and Ix{]sj]xA)=lA(Xt-XJ, foTAeT,, and then extended by additivity to the whole ring 1Z. We have lx{[0,t] x A) = lAXt, for t ^ 0 and A e To- In particular, ??([0.?]??) = ?,, forr>0. If the process X is understood we shall write / instead of ? ?.
388 N. Dinculeanu 7. Since LPE с L(F, L^), we can consider the semivariation of ?? relative to the pair (F, Lq). To simplify the notation, we shall write /or (Ix)f.g instead of (Ix)F Li>: If.g(A) = sup ?/?(A,); /e/ , for A e TZ, where the supremum is taken for all finite families (Аг )i€/ of disjoint sets A, from ?? contained in A and for all families (jc/)ie/ of elements of F\. 8. Dehnition. We say that the process X is p-summable relative to the pair (F, G), if ?? has a ? -additive extension ? ? : V -> L^ with finite semivariation relative to (F, L^). If ? = 1, we say that X is summable relative to (F, G). The stochastic integral Я · X will be defined with respect to p-summable processes X. Examples of p-summable processes are: (a) Processes with integrable variation. (b) Processes with integrable semivariation, provided that cq <?_ ? and cq <? G. (c) Square integrable martingales X, in case ? and G are Hubert spaces. We have the following useful criterion for the extension of the measure ? ? to a ? -additive measure on V. 9. THEOREM. If cQ (? E, assertions (a)-(d) below are equivalent. If ? is any Banach space, assertions (b), (c) and (d) are equivalent and (a) implies (b). (a) ??.??—* LPE can be extended to ? ?-additive measure ?? : V -> L!E. (b) ?? is bounded on TZ. Let ? С L?t, ^ + ^ =\, be a closedsubspacenormingfor LE- (c) For every g e Z, the real-valued measure (??, g) is bounded on 1Z. (d) For every g e Z, the real-valued measure {Ix,g} is ?-additive and bounded on ??. If ?? has ?-additive extension to V and has finite semivariation on 1Z relative to (F, G), then the extension of ? ? has finite semivariation on V relative to (F,G) hence X is P-summable relative to (F, G). 7.3. The stochastic integral 10. Let 1 ^ ? ^ oo and X: R+ ? ? -> ? с L(F, G) be a cadlag, adapted process. Assume X is p-summable relative to (F, G). Consider the ?-additive measure /x:-p-i.?ci.(F.i.?) with finite semivariation /f G relative to (F, L?). We can apply the integration theory presented in Section 5, replacing S, ?, m with ???,?, /x and E, F, G with LE. F, LPG. respectively.
Vector integration in Banach spaces and application to stochastic integration 389 LetZcZ^,, \ + \= 1 be a norming space for Lpc. For ? e Z, consider the measure (/?): -V ^ F*, defined for A e ? and у е F by (у,(/х);(А)) = (/х(АK\г)= ?(??(?)(?)?,?(?))??(?), where the bracket in the integral represents the duality between G and G*. Then (/x)f.z.? = sup{|(/x);|:zeZ, ||г||,<1}. If ? and X are understood, we shall write I = ?? and /f .c = IF ^/. ¦ For a Banach space D we denote TdUf.g) = FdUf Lr_), the space of predictable processes Я: R+ ? ? -> D. such that If.g(H) = sup) /"|tf|d|(/x),|: ||~||„ < 11 < oo. Then TdUf.g) is a vector space and /f.c is a seminorm on TdUf.g), f°r which it is complete. The simple processes are not necessarily dense TdUf.g)· 11. If D = F we shall write J>G(X), TF,p(X), TfgUx) or TF ,?(??) instead of Tf(Ax)f.g)- In this case we can define the stochastic integral ? ¦ X for processes Я eJy.c(X) as follows. Let ? e :Ff.G(X); then Я e L^((/.v);) for every с e Z, hence the integral j Hd(Ix)- is defined and is a scalar. The mapping ? ь* j ??(??): is linear and continuous on Z. We denote it by / ????. We have, therefore J ?dlx e Z*. If Hdlx,z)= f Hd(Ix):, for: eZ and I/ tfd/x ^If.g(H). IfwetakeZ = (L?)*, then / ?dlx e (L?)**. We are interested in those processes ? e Tf.g(X) for which the integral /[(O] Я^/? belongs to Lg, for every t ^ 0. In this case we denote by the same symbol the equivalence class f[0l]HdIx in Lpc, as well as any random variable belonging to this equivalence class. We obtain in this way a process (L· . ?dIx),^o with values in G. This process is always adapted; but it is not necessarily cadlag. This leads to the following definition: 12. Dehnition. We denote by L\- g(X) the set of processes ? e Ff.gW) satisfying the following two conditions:
390 N. Dinculeanu (a) /[0 t]Hd1x e LG, for each t ^ 0. (b) The process (L· t, HdIx)t^o has a cadlag modification. The processes ? e L]h G(X) are said to be integrable with respect to X. If ? e LF G(X), any cadlag modification of the process (/](),| H ???),^? is called the stochastic integral of ? with respect to X and is denoted by ? ¦ X or / ? dX: (??),(?)=( ? HdXJ (?) = (? ? dlx\w), a.s., for each t ^ 0. It follows that the stochastic integral ? ¦ X is defined up to an evanescent process, and is a cadlag, adapted process. The following theorem gives the jumps of the paths of a stochastic integral. 13. THEOREM. For any process ? e L\- g(X) we have Л(ЯХ) = ЯЛХ, where &X, = X, -X,^. The relationship between summability and stopping times is stated in the following theorem: 14. THEOREM. Assume X is p-summable relative to (F, G). Let ? e L\ g(X) and ? a stopping time. Then: (a) XT is p-summable relative to (F. G). (b) HaLFG{XT). (С) 1|0.ПЙ?^С(Х). (d) (HX)T = HXT = (\[0.T]H)X. If ? is predictable, then: (a') XT~ is p-summable relative to (F, G). (b') HeL[.G(XT~). (c') l[Q.T[H e L^iX). (d') (?-?)?- = ?-??- = (???.?\?)-?. 7.4. Convergence theorems Let 1 ^ ? < oo and X : R+ ? ? -> ? с L(F.G)a p-summable process relative to (F. G). 15. THEOREM. 77?е space L^ G(X) ;'i complete for the semivariation h,c- 16. THEOREM (Lebesgue). Lei (#") foe ? sequence from LFG(X), converging pointwise to a process H. Assume there is a positive process ? e Tj.Uf.g ) such that ]#"] ^ ?, for each ?.
Vector integration in Banach spaces and application to stochastic integration 391 Assume, in addition, that ? can be approximated in Tr{1f.g) by bounded processes and that the measures |(/?)-| are uniformly ?-additivefor ? e LG,. — + - = 1 · Then ? e LFG(X), H" -> ? in L\- g(X) and {Hn ¦ X), -> (H ¦ X), in LG for each A Vitali-type theorem can be stated along similar lines. 7.5. Summability of the stochastic integral If X is a p-summable process, the stochastic integral ? ¦ X is not necessarily p-summable. We state below two theorems giving sufficient conditions for the summability of the stochastic integral ? ¦ X. We consider first the case when ? is real-valued. Let 1 ^ ? < oo and X : E+ ? ? —> ? с L(F, G) be a p-summable process. 17. THEOREM. Let ? e ?,? ?(?). Assume JAHdIx e L^ for each AeV. Then (a) Я · X is p-summable relative to (F, G). (b) К e LF G(H ¦ X) iff ? ? e L\g(X) and /? ??/'? cave we have the associativity formula K(HX) = (K- H)X. Then we consider the case where ? is vector-valued. 18. THEOREM. Let ? e LF G(X). Assume fA Hdlx e LG for every AeV. Then (a) Я · ? /? p-summable relative to (F, G). (b) If К is a real-valued, predictable process and if KH e L]FG{X). then К е Ljj G (H ¦ X) ?/iii we have K(H ¦ X) = (KH)- X. 7.6. Local summability and local integrability Let 1 ^ ? < oo and X :R+ ? ? -> ? с L(F. G) be a cadlag, adapted process with X e LPE for each ? > 0. We shall define now the local summability of X and the stochastic integral ? ¦ X. 19. DEHNITION. (a) We say X is locally p-summable relative to (F. G), if there is an increasing sequence G„) of stopping times with T„ \ эс. such that for each n. the stopped process XT" is p-summable relative to (F. G).
392 N. Dinculeanu (b) Assume X is locally p-summable relative to (F, G). A predictable process ?: E+ ? ? —>¦ F is said to be locally integrable with respect to X, if there is an increasing sequence G„) of stopping times with T„ \ oo, such that, for each n, XT" is p- summable relative to (F, G) and 1\q.t„]H is integrable with respect to XT". Then, for each n, the stochastic integral (?^.?,,?^) ¦ XT" IS defined. The following theorem states the existence of the pointwise limit of the above sequence of stochastic integrals. 20. THEOREM. Assume X is locally p-summable relative to (F, G) and let //:К+хй-> F be a predictable process, locally integrable with respect to X. Let (T„) be an increasing sequence of stopping times with T„ f oo. determining the local integrability of ? with respect to X. Then, the limit \im(\{0.T)l]H)XT" exist pointwise, outside an evanescent set, is cadlag, adapted and independent of the sequence (T„). The limit in the above theorem is called the stochastic integral of ? with respect to X and is denote by ? ¦ X or / ? dX. For each Tn we have (H-X)T"=(l{0.Tn]H)-XT". The stochastic integral with respect to locally p-summable processes has the main properties of the stochastic integral with respect to summable processes. 8. Processes with integrable variation or integrable semivariation In this section we state the summability of processes X with integrable variation or integrable semivariation. For this purpose, we associate to X a stochastic ?-additive measure ??. The stochastic integral can be computed pathwise as a Stieltjes integral. We shall denote ? = B(R+) ? ?. We say that a process X with values in a Banach space is measurable, if it is measurable with respect to M. Every process X : E+ ? ? —>¦ ? is automatically extended with Xt (?) = 0 for t < 0 and ? e ?. A measurable process which is not adapted is called a raw process. 8.1. Processes with finite variation or semivariation Let X : R+ ? ? -+ ? с L(F, G) be a process. 1. Definition. We say that the process X has finite variation (respectively finite semivariation relative to (F,G)), if for every ? e ?, the path t ь* ?((?) has finite
Vector integration in Banach spaces and application to stochastic integration 393 variation (respectively finite semivariation relative to (F, G)) on each interval [0, t], or, equivalently, on each interval ]—oo, t]. We denote |X|,M = var(X.(w),boo,r]) and (Xf.gMw) = svarf.G(X.M, ] - oo. t]). We say that X has p-integrable variation |X| (respectively p-integrable semivariation Xf,c)if |Х|эо е L] (respectively (Xf.g)tc e L]). 2. Proposition. (a) Assume X is right continuous and has finite variation |X|. If X is measurable {respectively optional, predictable), then so is \X\. (b) Assume X is right continuous and has finite semivariation X^.c- Assume also that either F or G is separable. If X is measurable (respectively optional, predictable), then so is Xf.G- 8.2. Optional and predictable stochastic measures We state first the existence of optional and predictable projections of bounded processes. 3. THEOREM. Let ? : R+ ? ? -н>- R be a real-valued, bounded, measurable process. Then there is a real-valued, optional process "? and a real-valued, predictable process !'? satisfying: Е{Фт1\т<ос}\Тт\) = "ФтЦт<ос). a.s., for every stopping time T, and Е(.Фт1\т<эс\\Тт-) = рФтЦт<^\- a.s., for every predictable stopping time ?. For the proof, see Dellacherie and Meyer A975-1980), VI 43. The process "? is called the optional projection of ? and the process ГФ is called the predictable projection of Ф. Using the optional and predictable projections of processes, we can define the optional measures and the predictable measures. 4. Dehnition. Let m : ? -> ? be a ?-additive measure.
394 N. Dinculeanu (a) We say that m is a stochastic measure if it vanishes on evanescent sets. (b) We say that m is optional (respectively predictable) if for every set ? e ? we have m{M)= J "(\M)dm (respectively \"(\м)ат). In this definition, the characteristic function 1 м can be replaced by real-valued, bounded processes Ф. 5. THEOREM. ? ?-additive measure ш : ?1 -> ? is optional (respectively predictable) iff for every real-valued, bounded, measurable process ? we have I 0dm = I "Фат (respectively I рФат). The measure m has automatically finite semivariation in?..?-, therefore the above integrals are defined. 8.3. The measure ? ? To a process X with integrable variation or integrable semivariation we can associate a stochastic measure ??, which is used to prove the summability of these processes, as well as the summability of square integrable martingales. We consider first processes with integrable variation. 6. THEOREM. Let X: R+ ? ? -» ? be a measurable, right continuous process with integrable variation (i.e., \X\X e L1). Then there is a stochastic measure ??.?—> ? with finite variation \??\ satisfying the following conditions: (a) ??(?) = E(j \MdXs), for ? e M, and \??\(?) = E(f \Md\X\s),for MeM. i.e., \??\ =?\?\- (b) // ? С L(F; G) and ? :R+ ? ? ^ F is a process, then ? e ?,,(??) iff ? is ??- measurable and E(J \Hs\d\X\s) < oo. In this case E(f HsdXs) is defined and we have f ???? = ?(? HsdX,\ and f ???,?\ = e( f Hs d\X\s (c) The measure ?? is optional (respectivelypredictable) iffX is optional (respectively predictable). (d) If X is real-valued, then ?? ^ 0 iff X is increasing.
Vector integration in Banach spaces and application to stochastic integration 395 A similar theorem is valid for processes with integrable semivariation, but we have to impose cq<? E. 7. THEOREM. Assume со <? E. Let X : R+ ? ? -> ? be a right continuous, measurable process with integrable semivariation Xp_.? {i.e., (?^.?)? € ?-')¦ Then there is a stochastic measure ??.?,^? ? with finite semivariation (??)?,.? satisfying the following conditions: (a) ??{?) = E{f\MdXs),for ? eM. (b) if ? : R+ ? ? —» R is a measurable process with ?(?*.?(\?\)) ¦= Е((тХш)-.^(Ф((о)) < эо. then ? e 3-?..?{??) and we have [???? = ?([?!!??? eE. (b') Assume, in a addition, that X has integrable semivariation X e.g and со <t G. Then ?? has finite semivariation {/1x)f.g- If ? :R+ ? ? -> F is a measurable process with E(XF.G(\H\)) := E{mXiU!))FG(H{w)) < oo. then ? е^г.с(дх) and we have J ???? = ?( ?HsdXA eG. (c) The measure ?? is optional {respectively predictable) iffX is optional {respectively predictable). 8.4. Summability of processes with integrable variation or integrable semivariation The following theorem asserts the summability of processes with integrable variation or semivariation. Moreover, in this case, the stochastic integral can be computed pathwise, as a Stieltjes integral. 8. THEOREM. ^гХ:М+хй->?с L{F, G) be a cadlag, adapted process. Assume that X has integrable variation \X\ {respectively со <? ?, со <? G and X has integrable semivariations Xr.? and X f.g)- Then: (a) The measure ? ? : 7? -> L^ can be extended to ? ?-additive measure ? ? :V -> LfE with finite variation \??\ {respectively with finite semivariations {Ix)r.e and Ux)f.g)- (b) X is summable relative to (F, G).
396 N. Dinculeanu (c) For every M eV we have ??(?) = ?(??(?)). If X has integrable variation, then \??\(?) = \??\(?) = ?]?](?) = ?(???(?)). (d) If ? e LFG(X) and if f \Hs(w)\d\X\s(w)) < oo, a.s., (respectively the function ? —>¦ Xf.g(H(co)) is integrable), then (??),(?)=\ Hs(w)dXs(w). a.s., fort^O. (e) If E(f \Hs(co)\d\X\s(co)) < oo (respectively the function ? ь* XFG(H(a>)) is integrable), then j ???? = e( I Hdlx\ =?(? Hs(aj)dXs(oj) 9. Martingales A martingale ? :R+ ? ? —>¦ ? с L(F. G) is not necessarily summable; but if it is, the stochastic integral ? ¦ X is again a martingale. If ? and G are Hilbert space and if ? is a square integrable martingale, then ? is 2-summable. 1. THEOREM. Let ? :Ш+ ? ? -> ? С L(F,G) be a martingale. Assume ? is p- summable relative to (F, G) and let ? e Ту.с(М). If L)t,H dlM e ^? /or every t ^ 0, ?/?en ? e L\ g(M) and the stochastic integral ? ¦ ? is a uniformly integrable martingale, bounded in L!'G. IfLG is reflexive, then UF G(M) = TFG(M). 2. THEOREM. Assume ? and G are Hilbert spaces and let ? : Ш+ ? ? -> ? С L(F, G) be a square integrable martingale. Then ? is 2-summable relative to (F,G). We have L^Li(M) = FFLi (M) and for every ? e L1 , (?), the stochastic integral ? ¦ ? is a square integrable h.L-G martingale. The proof of summability can be done either by using the summability criterion (Theorem 9 of Section 7), or by using the Doleans measure ?^?) associated to the sharp
Vector integration in Banach spaces and application to stochastic integration 397 bracket {M) (defined below), which is an increasing process with integrable variation, and proving that IM <<??<??>· Let ? : R+ ? ? -> ? be a square integrable martingale. Assume ? is a Hubert space. Then \M\2 is a submartingale of class (D) and has a Doob-Meyer decomposition \M\2 = N + {M), where N is a martingale of class (D) and (M) is a predictable, integrable, increasing process, called the sharp bracket of ?. Consider the Doleans measure ? ?? defined for every A e 1Z by ?(?)(?) = E(I{M)(A)). 3. PROPOSITION. The measure ???) can be extended to a ?-additive measure on the predictable ?-algebra V and we have ?{?}(?)=\\??(?)\\~? forAeV. 4. COROLLARY. The process (M) is summable relative to ? ?{?)(?) = ?(?{?)(?)), forAeV. and we have 5. THEOREM. Let ? :R+ ? ? -> ? С L(F,G) be a square integrable martingale. Assume ? and G are Hilbert spaces. Then (a) ?^(?(??)) С 0F G(M) and for ? e ?^-(?(??)) we have I/ ЯЛ/М ^ ^(/м)^-ЧЯ)^||Я|Ц(;,л, (b) For the particular embedding ? = L(R, E) we have ¦ 2 Lr(jhm)) = Lr_lif(M) and for ? ? ?-flj(M(Ai)) we have I/ HdIM „=(/«)к.^(«) = 11«111:(м,и, (с) // ?? /? real-valued and D is a Hilbert space and we consider the embedding M.CL(D,D), then L2D(M(M)) = ZA 2(M) U.LD
398 N. Dinculeanu and for H e L' ·, (M) we have D.L-D JHdlA =(iM)DLJ)(H) = \\H\\L])Ul„y Remark. Here is the classical approach of the stochastic integral of real valued semimartingales. A semimartingale is a process of the form X = ? + A, where ? is a local martingale and ? is a process with finite variation. We define separately the stochastic integrals ? ¦ ? and ? ¦ A and then define the stochastic integral HX = HM + HA. The stochastic integral ? ¦ A is defined pathwise, as a Stieltjes integral: (??),(?)=\ Hs{w)dAs{w), for t > 0 and ? e ?. J\o.t] The stochastic integral ? ¦ ? reduces to the case where ? is a square integrable martingale. So, assume ? is a square integrable martingale. We identify the space M.- of cadlag, square integrable martingales with the space L2(P), by identifying a martingale ? e M2 with Мое e L2(P) and endow M2 with the norm of L2(P). For a simple process ? # = ??1? + ]?]?,1]?? we define the stochastic integral ? ¦ ? by /1 (? ¦ ?), = ?0??0 + ]??,(????, - ???)?,). ;"=? Then ? ¦ ? & ?2- If we consider the simple process Я as an element of ?,-(?(??)) = L2{V, ?.(?)), then one can prove that the mapping ? ь* ? ¦ ? is an isometry: ||Я-А#||м2 = ||Я||/.2(/1м). Since the simple process are dense in L2(?/л?)) one can extend the above isometry to an isometry of the whole space ?,2(?(??)) into M2- The value of this extension for a process ? e ?,2(?(??)) is called the stochastic integral of ? with respect to M. The isometry between the spaces M2 and ?,2(?(??)), which is the starting step for the classical approach of the stochastic integral with respect to a square integrable martingale, is also obtained in Theorem 5 above, using the measure-theoretic approach presented in this chapter. References Bartle, R.G. A956), A general bilinear vector integral, Studia Math. 15, 337-352.
Vector integration in Banach spaces and application to stochastic integration 399 Bochner, S. A933), Integration von Funktionen. deren Werte die Elemente eines Vektorraumes sind, Fund. Math. 20, 262-276. Bourbaki, N. A927-1959), Integration. Chapitres I-VI, Hermann, Paris. Bongiorno, B. and Dinculeanu, N. B001), The Riesz representation theorem and extension of additive measures, J. Math. Analysis and Appl. 261, 706-732. Brooks, J.K. and Dinculeanu, N. A976), Lebesgue type spaces for vector integration, linear operations, weak completeness and weak compactness, J. Math. Analysis Appl. 54. 348-389. Brooks, J.K. and Dinculeanu, N. A991), Stochastic integration in Banach spaces. Seminar on Stoch. Proc., Birkhauser, 27-115. Dellacherie, С and Meyer, P.-A. A975-1980), Probabilities et Potentiel, Hermann, Paris. Diestel, J. and Uhl, J.J. Jr. A977), Vector Measures, AMS Math. Surveys, Vol. 15, Amer. Math. Soc., Providence, RI. Dinculeanu, N. A967), Vector Measures, Pergamon Press, Oxford. Dinculeanu, N. B000), Vector Integration and Stochastic Integration in Banach spaces, Wiley, New York. Dobrakov, I. A970a), On integration in Banach spaces I, Czech. Math. J. 20(95), 511-536. Dobrakov, I. A970b), On integration in Banach spaces 11, Czech. Math. J. 21 (96), 680-685. Dunford, N. and Schwartz, J. A958), Linear Operators. Part I. General Theory, Wiley, New York. Foias, С and Singer, I. A960), Some remarks on the representation of linear operators in the space of vector- valued, continuous functions. Revue Roumaine Math. Pures Appl. 5, 729-752. Gowurin, M. A936), Ober die Stieltjessche Integration abstrakter Funktionen, Fund. Math. 27, 255-268. Halmos, PR. A950), Measure Theory; Van Nostrand. New York. Ionescu Tulcea, A. and Ionescu Tulcea, С A969), Topics in the Theory of Lifting, Springer. New York. Kluvanek, I. A961), On the theory of vector measures I, Mat. Fyz. Casopis 11, 173-191. Kluvanek, I. A966), On the theory of vector measures II, Mat. Fyz. Casopis 16, 76-81. Kussmaul, A.U. A977), Stochastic Integration and Generalized Martingales, Pitman, London. Metivier, M. A982), Semimartingales, de Gruyter, Berlin. Metivier, M. and Pellaumail, J. A980), Stochastic Integration, Academic Press, New York. Pap, E. B002), Some elements of the classical measure theory. Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 27-82. Pellaumail, J. A973), Sur I'integrale stochastique et la decomposition de Doob-Meyer, Asterisque, Vol. 9. Rao, M.M. A979), Stochastic Processes and Integration, Sijthoff and Noordhoff, Alphenaan den Rijin. Rao, M.M. A996), Stochastic Processes: General Theory, Kluwer Academic Publishers, Dordrecht. Rudin, W. A973), Real and Complex Analysis, McGraw-Hill, New York.
CHAPTER 9 The Riesz Theorem Joe Diestel* Banach Center, Kent State University: Kent. OH 44242-0001, USA E-mail: j_diestel @ hotmail. com Johan Swart* Department of Mathematics and Applied Mathematics, University of Pretoria, 0002 Pretoria, Republic of South Africa E-mail: jswart@math. up.ac.za Contents Introduction 403 1 .The classical Riesz Theorem 404 1.1. A brief historical discussion 404 1.2. The abstract characterization of C( A") as a Banach lattice 406 1.3. The abstract characterization of С (A") as a Banach algebra 406 1.4. The Josefson-Nissenzweig Theorem 408 2.The Riesz Theorem and the Hahn-Banach theorem 411 2.1. The dual of the injective tensor product of two Banach spaces 411 2.2. Choquet's representation theorem 413 2.3. The Stone-Weierstrass theorem 417 2.4. The Pietsch domination theorem 419 3.The Riesz Theorem for operators 421 3.1. Weakly compact operators on C(K) 421 3.2. The Dunford-Pettis property and property V 422 3.3. Absolutely summing, integral and nuclear operators on C(K) 424 4.The Riesz theorem for vector-valued continuous function spaces 430 *The work of this author on the report was done while visiting the Mathematics Department of the University of Missouri at Columbia; sincere thanks must be extended to the members of the faculty and staff there for providing such a good atmosphere in which to work. Also appreciative thanks must be extended to Kent State University's Research and Sponsored office and the powers-that-be at KSU who provided support for a sabbatical year at Missouri. +The work of this author on the report was done while on sabbatical leave at Kent State University; thanks are extended to the Mathematics Department there for providing a pleasant working environment. The author also wants to thank the South African NRF as well as the Mellon Foundation for financial support. HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 401
402 J. Diestel and J. Swart 5.Notes and remarks 435 5.1. Notes and remarks to Section 1 436 5.2. Notes and remarks to Section 2 437 5.3. Notes and remarks to Section 3 439 5.4. Notes and remarks to Section 4 440 5.5. Notes and remarks on tensor products 441 References 444
The Riesz Theorem 403 Introduction The objective of this essay is a discussion of the following representation theorem, henceforth referred to as 'The Riesz Theorem'. THEOREM 0.1. The dual С (K)* of the space С (К) of continuous scalar-valued functions defined on the compact Hausdorff space К (equipped with the supremum norm ||/|| = sup{|/(&)|: к e K\) is the space M(K) of all regular Borel measures defined on K. That is, there is a correspondence x* eC(K)* <s> ???(?) between members x* ofC(K)* and regular Borel measures ? on К given by **(/)= ? f(k)dv(k); Jk this correspondence is a linear isometric isomorphism between C(K)* and M(K), equippedwith the total variation norm \\?\\ \ = \?\(?), where \?\{?) — sup] У^ |?(?,)|: (?,),€?; is a Borel partition of К \. We open with a brief discussion of the history of this theorem tracing its development from R Riesz's earliest papers through Kakutani's finishing touches. Along the way we give several applications that are more or less classical. Next, we investigate the Riesz Theorem when used in tandem with the Hahn-Banach theorem. Surely this has been the most successful partnership in abstract analysis. The representation theory of Banach space-valued linear operations on C(K) is our next task. Here the wish to give The Riesz Theorem direct generalization meets with considerable success and some elegant, far-reaching applications; such generalizations have rather severe limitations though. Once these limitations are understood, the Riesz Theorem - scalar-valued case - takes center stage once again and directs the action, motivating the pertinent questions. The Riesz Theorem for continuous Banach space-valued functions is investigated in our fourth section. Again, despite some initial successes, the applicability to understanding spaces of continuous vector-valued functions is shown to be fraught with difficulties. The structure of spaces of Banach space-valued continuous functions on a compact Hausdorff space is considerably more complicated than the scalar theory. We'll make some observations about why this is so and discuss the situation vis-a-vis some of the most important invariants arising in the study of C(KYs. Finally, we make a few (very few) choice comments about bimeasures and polymea- sures, as well as about variations on The Riesz Theorem for other tensor norms. Alas many have contributed to our understanding of these topics. We must acknowledge the following: Witold Bogdanowicz, Jean Bourgain. Pilar Cembranos, Bill Davis, Leonard
404 J. Diestel and J. Swart Dor, Paddy Dowling, Barbara Faires, Tadek Figiel, Ben Garling, Bob Huff, Hans Jarchow, Bill Johnson, Nigel Kalton, Chris Lennard, Dan Lewis, Paul Lewis, Victor Lomonosov, Jose Mendoza, Olek Pelczynski, Albrecht Pietsch, Elias Saab, Paulette Saab, Chuck Seifert, Anton Stroh, Andrew Tonge and last, but far from least, Jerry Uhl. Each of these mathematicians explained things to us that added to our understanding and clarified much of our thinking. 1. The classical Riesz Theorem 1.1. A brief historical discussion The study of the dual of C(K) started, naturally enough, with the case of К = [a, b], a closed bounded interval of the real line. The main player was F. Riesz who returned to the problem on several occasions. By 1914 his increasing awareness of the measure-theoretic underpinnings of the subject was plain. J. Radon extended Riesz's work to describe the dual of C(K) when К is a compact subset of E". It was Radon who first laid bare the measure theoretic character of C(K)*. The first to successfully attack the subject of C(K)* for abstract K's was S. Banach, in an appendix to S. Saks' classic monograph "Theory of the Integral". To be sure, Banach's note relies on his earlier work in an annexe to Chapter XII of his famous "Theorie des Operationes Lineaires". Saks was not one to let the grass grow underfoot and he soon presented a more 'standard' proof constructing the representing measure via Caratheodory's procedure. The finishing touch was supplied by S. Kakutani who established the Riesz Theorem in a form more-or-less as stated in Theorem 0.1. It's important that we mention that these results did NOT happen in a mathematical vacuum. Rather, they were part and parcel of the rapid development of functional analysis before World War II and played a central role in that development. We take a moment to discuss just a few of the ramifications of the study of С (К )-spaces and their duals in the development of functional analysis. One of the first applications of the Riesz Theorem is in characterizing weakly convergent sequences in C(?)-spaces. Proved by Banach in the case of compact metric spaces К (without use of the Riesz Theorem!), the following is an excellent example of how measure theory can be used effectively in abstract analysis. THEOREM 1.1. Let (/„) be a sequence in C(K). Then in order that (/„) be weakly convergent to f e C(K) it is both necessary and sufficient that the sequence be uniformly bounded and f(k) = limbec fn(k)for each к e К. Of course, if (/„) converges weakly to / \x\C{K), then {/„: ? ^ 1}U{/} is bounded in C(K) by the Banach-Steinhaus principle of uniform boundedness; further, if к е К, then Sk(g) :=g(k), forg e C(K), is a member-in-good-standing of С (К)* and so
The Riesz Theorem 405 f{k) = Sk(f) = lim Sk(f,)= lim /„(*) holds for each к е К. On the other hand, if (/„) is a uniformly bounded sequence in C(K) and / e C(?) satisfies /(*) = Mm f„(k) 77^ ОС for each к e K, then / is the weak limit of (/„). In fact, one needs to test the convergence of (**(/„)) to **(/) for each x* e C(K)\ But the Riesz Theorem tells that any x* e C(K)* is represented by a regular Borel measure ? on К with jr*'s action given by **(*)= I g{k)dvL{k) JK for g eC(K). Hold on! (/„) is uniformly bounded on К and pointwise convergent to /; Lebesgue's bounded convergence theorem tells us that **(/)= [ f{k)dii(k)= [ lim fn{k)dii{k) = lim [ ????&)= lim x*(fnl And this shows that (/„) converges to / weakly in C(K)L As mentioned above Banach proved Theorem 1.1 without recourse to the Riesz Theorem, but deduced it nevertheless at least in the case of compact metric space domains. Along the way, Banach also uncovered when uniformly bounded sequences of scalar- valued functions converge weakly to zero on any set; his result, which seems to have been forgotten by abstract analysts, follows from delicate manipulation with lim suprema and lim infima and his own secret weapon: Banach limits. The result? THEOREM 1.2 (Banach). Let (/„) be a uniformly bounded sequence of scalar-valued functions defined on a set S. In order that (/„) converge weakly to zero in the space B(S) of bounded functions on S (equipped with the supremum norm) it is both necessary and sufficient that for any sequence (s^) of points in S we have lim lim inf I/„ (ii) I =0. 77^00 k^-OC This is still a marvelous theorem. The appearance of С(?)-spaces m arguments of functional analysis became more frequent. Often such appearances were a simple consequence of the Hahn-Banach theorem. Here's how the reasoning went: take a Banach space X and look at the closed unit ball ??* of the dual X* of X; when equipped with the so-called weak* topology, the
406 J. Dieslel and J. Swart topology of pointwise convergence on ?, ??* is a compact Hausdorff space (it is compact and metrizable precisely when X is separable). X always is isometrically isomorphic to a closed linear subspace of the space C(Bx·. weak*) of continuous scalar-valued functions defined on ? ?* with respect to this weak* topology. The map jc i—> x(-) of evaluation of xeXat member x* of ? ?* affects an isometric isomorphism with a tip of the hat to the Hahn-Banach theorem. Banach and S. Mazur used this idea to show that every separable Banach space is isometrically isomorphic to a closed linear subspace of C(/i), ? the Cantor discontinuum, or of C[0, 1], results that've been employed by many in the more than half century since their discovery. In fact,the identification of C(?)-spaces among the Banach spaces was a cause celebre in each of the real and complex cases. 1.2. The abstract characterization ofC(K) as a Banach lattice To characterize the C(?)-spaces among the real Banach spaces, it is natural to call on the notion of a Banach lattice. Recall that a real linear space X is called ordered if there's a partial ordering "<" that's compatible with the linear structure: if л ^ у then .v + с ^ .v + с for all ? and ax ^ ay for all scalar a > 0; an ordered linear space is a Riesz space if its order is a lattice ordering. In a Riesz space, if ? is a given element, then ? is the difference of two non negative disjoint elements ? = x+ — x~ where x+ = sup{.x,0} and x~ = sup{— .v.0); the absolute value of x is given by |jc| =x+ +x~. A real Banach space is a Banach lattice if it's a Riesz space for which ||.v|| ^ ||>? whenever |jc| ^ \y\. Here's how Kakutani characterized the C(?)-spaces among the (real) Banach lattices. THEOREM 1.3 (Kakutani). A real Banach lattice X is isometrically isomorphic, as a Banach lattice, to a space C(K) of all continuous real-valued functions on some compact Hausdorff space К if and only if X is an "M-space with unit", that is, for x, у е X with x,y^0, \\sup[x,y}\\ = sup{M-хЦ. ||_v|l} and there is a largest element (a unit) in the closed unit ball ofX. 1.3. The abstract characterization ofC(K) as a Banach algebra The characterization of С (^)-spaces among the complex Banach spaces followed close on the heels of Kakutani's gem and has became one of the fundamental results in all analysis.
The Riesz Theorem 407 It is due to I.M. Gelfand and M.A. Naimark. The structure that gives the suitable framework within which to formulate the Gelfand-Naimark theorem is that of a Banach algebra. A complex Banach space X that's also an algebra is a Banach algebra if \\x - >'|| ^ ||-v||||y|| whenever x, у е X; should the algebra have an identity we suppose that the identity has norm = 1. An involution on the Banach algebra X is a map .т и .т* of the algebra onto itself for which (x + y)* = x* + y*, (??)* = ?.?*. (? у)* = y*x*,x** = x and ||.v*|| = ||jc||; here x's and y's range over X and ? is an arbitrary complex number. If in addition, \\x*x\\ = \\x\\2 for each ? then X is called a C*-algebra. Here's the remarkable theorem of Gelfand and Naimark. THEOREM 1.4 (Gelfand-Naimark). Suppose X is commutative C*-algebra with identity. Then X is isometrically isomorphic, as a C*-algebra, to the space C{K) of all continuous complex-valued functions on some compact Hausdorff space К with the identity of X corresponding to the constantly = 1 function on K. In each case, be it Kakutani's characterization of real C(K)'s or the Gelfand-Naimark characterization of complex C(K)'s, the volatile mix of algebraic and analytic structures is key. In Kakutani's proof, the space К is the collection of norm one linear functional on X that're lattice homomorphisms while Gelfand and Naimark use the norm one linear functional that're multiplicative; in each case, the weak* topology serves to make К compact. Unquestionably, the Gelfand-Naimark theorem quickly paid handsomer dividends and continues to do so. Here's the basis of many of the theorem's most penetrating applications: start with a normal (bounded linear) operator ? acting on a Hubert space H. Look at the smallest closed *-subalgebraCr inside the algebra B(H) of all bounded linear operators on ? that contains ?, ?* and the identity; Ст is a commutative C*-algebra with identity and so, thanks to Gelfand and Naimark, we know that Cr is isometrically isomorphic, as a C*- algebra, to the space C{K) of all continuous complex-valued functions on some compact Hausdorff space K. It's part of the magic of C*-algebras that one can show that the К under question is a (non-empty) compact subset of C! In fact, ? = ?(?), the spectrum of ? as an operator on //!! It's plain and easy to consider p(T) if ? is a polynomial but now, thanks again to Gelfand and Naimark, we can consider f(T) where / e C{K)\ For / e C(K) we define f(T) as the operator in Ст that corresponds to / under the above mentioned isometric isomorphism. For the polynomial p, this correspondent will just be p(T), exactly the polynomial in T. After all the Gelfand-Naimark theorem guarantees that the identity on ?(?) corresponds to ? itself. Here's what's so: (f\ + fi)(T) is just /? (?) + f2(T), (/? · f2)(T) = /? (T) ¦ f2(T) - where /? (?) ¦ f2(T) is the product of two operators - and f{T)* = f(T). In other words the interpretation of f(T) is faithful. What's remarkable about the above interpretation of f(T) is that it leads to a reasonable interpretation of ?(?) for bounded Borel functions <p defined on ? =?(?). Here's where the Riesz Theorem enters the field of action. To find an operator in Ст that behaves like ?(?) is a bit too much to ask. But one knows that whatever the operator ?(?) is to be, it's necessary to know how its values at x's act vis-a-vis y's, that is, we need to understand what (?(?)?, у) looks like for any ?, у e ? where (, ) is H's inner product. This in mind, let S range over members of Ст (which we know is C(a(T))) and for x, у e ? consider the functional Sh> (Sx, y) onC(a(T)). Each such functional on C(a (T)) is continuous
408 J. Diestei and J. Swart and linear, with norm ^ ||?||||}??. by the Cauchy-Schwartz inequality. It follows that each pair x, у е И generates a regular Borel measure ??,? ???(?). Of course, /?{?)????{?) makes sense for any bounded Borel function ? acting on ?(?). To make sense of ?(?) is now a (non-trivial) matter of uncovering an operator - we'll call it ?{?) - on ? that satisfies (<p(T)x,y) = ?(?)????(?) foranyj, ve H. Jo(T) Just how this is done is well-documented in many texts in functional analysis starting with L.H. Loomis' classic "Abstract Harmonic Analysis", where it seems this idea was first shown the light of day, on up to W. Rudin's masterful "Functional Analysis". Among the many byproducts of the above approach one funds the Spectral Theorem for normal operators and generalizations thereof. All this is classical. 1.4. The Josefson-Nissenzweig Theorem An application of the Riesz Theorem of more recent vintage is due to B. Josefson and A. Nissenzweig. To put their theorem into perspective recall that for any infinite dimensional reflexive Banach space X there is always a sequence of norm-one functionals (x*) in X* that is weakly null. Since the dual of a reflexive space is also reflexive, this sequence (x*) is weak* null as well. What happens in general Banach spaces? This is answered by the Josefson-Nissenzweig theorem; we indicate a proof of this below, a proof shown to us by Leonard Dor many years ago. It emphasizes how regularity comes into play and along the way makes use of the Lebesgue space L2. THEOREM 1.5 (Josefson-Nissenzweig). If X is an infinite dimensional Banach space, then there exists a sequence (x*) in X* of norm-one functionals that go to 0 in the weak* topology. Outline OF the proof. Nissenzweig used an observation of A. Pelczynski to construct a measure-theoretic Cantor set. A bit of notation clears the way to describing the results of Nissenzweig's combinatorial designs. If<pe{0, 1}", и e N, then we'll denote by (?, ?) the member of {0, 1}"+I obtained from ? by tacking on ее {О, 1}. Following Pelczynski's generous advice, Nissenzweig found an a > 0, a sequence (z,*) of norm one elements in X*, a sequence (x„) of norm one elements in X and for each ? e {0, 1}" (any и) an infinite subset ?? of {?*,- ? e N} and a yv e X in such a way that (i) for any ?, A№.0) and A№j > are pairwise disjoint infinite subsets of ??; (ii) inf(A(V.o) - A№.|))(yv) > 0, for each ?; (??) ?" ?<?€[?.?)"-' mf(^(<p.0)-*H ~ ?(?.\ )xn) ^ ?· Once Nissenzweig had done this (admittedly highly non trivial) chore, he proceeded as follows: for each ? let ?? ~ weak*closure A,
The Riesz Theorem 409 By (ii) ?(?.?) ? E(ipj) = 0. It follows that if ?, ? e {0, 1}" and ???, then ?? П ?? = 0. From this and (iii) we see that — ]P ??{(?(?,0}?„ - ?(?. ? }?„) ^a. <pe[0.1)"-' For any ? e {0, 1}" choose и* е ?? and let <pe[0.l)" Each ?„ is a regular Borel probability measure on ? = (??*, weak*). The Banach- Alaoglu theorem assures us of the existence of a weak* limit point ? e BCiK)* in the closed unit ball of C(K)*. ? is a regular Borel probability on ?. It's more, much more. If ? ^ m and <p e {0, 1}'" then 1 ?„(??) = —; after all, there are exactly 2"~m ? 's in {0,1}" the first m of whose entries match ? exactly. It follows that for any ? e {0, 1}'" 1 Indeed, if ?{??) < ^, then there'd be an open set G с К containing ?? so that ?(?) < 2^. It follows that there's a g eC(K), (Kg ^ 1, so that g(x*) = 1 for x* e ?? and g(x*) = 0 for ? <? G. Of course, J 8??^?@)<^. But / g ?? must be a limit point of the sequence (/ gd?,,) and so must be ^???^??,??^ —.
410 J. Diestel and J. Swart Think about it: 1 С 1 — > M(G) > / %?? > limsupM„(?v) = — · *¦ 7 /1-»3C ^ OOPS! We conclude that ?(??) > 4r for each ? e {0. 1}'". But there are precisely 2'" pairwise disjoint E/s as ? ranges over members of {0. 1}'" so ?(??) = ^ is all that remains as a conclusion - after all, ? is a probability. Now define /„ e L2(K, ?) by <pe|0.l]»-' Then (/„)„epj is an orthonormal sequence in ?2(?,?). Let ;' :C(?) ^ ?-(?,?) be the natural inclusion jx = jc(). For any ? e X ? I I- |0'-*- /'?)?.-(???)? I Г 2 ? / ;дс(дс*)/й(дс*)^(дс*) If Jt* e X* is given by <M)= ? ??(?*)??(?*)??{?*), JK then we just saw that for any ? e ? ?, ?>,??|4?. What about ||**||? ?? * и -^ X S> <(*„) = ? ??*)???*)<??(?*)= ? ?*(?„)?,,(?*)<1?(?*) Jk Jk = ? ?? ?*(?„)??(?*)- ? **(*„) ??(**) <ре|0.1] > ? <ре[0.|)"-' 1 1 ¦ г * ! inf ? ?„ — — sup ? x„ 2" л*€?,<.„, 2" i*e?. ?.1) = 2^ ?! ???(?[?,0)?„- ?{??)?„) (pe|0.l)"-' ^? >0.
The Riesz Theorem 411 In tandem ||**|| ^ a > 0 and ]P |-v,?(*)|~ < oo. ? for each ? e X and this finishes the proof of Theorem 1.5. ? 2. The Riesz Theorem and the Hahn-Banach theorem The partnership of the Riesz Theorem with (various forms of) the Hahn-Banach theorem is the most powerful in all abstract analysis. In this section we offer a glimpse of just how this one-two punch can act. We'll start with the most basic form of the Hahn-Banach theorem: extending bounded linear functionals from a subspace of a Banach space to the whole space without changing the norm. Our application will be the wonderful characterization by A. Grothendieck of the dual of the injective tensor product of two Banach spaces. Next, we apply Banach's version of the extension theorem wherein we extend a linear functional that's dominated by a positively homogeneous, subadditive functional to the whole space with domination remaining. Here the objective is the ever-fresh theorem of G. Choquet to the effect that each point of a compact metrizable convex subset of a locally convex space is the barycenter of a probability Borel measure supported by the set of extreme points of that set. We follow this with L. de Branges' famous proof of the Stone-Weierstrass theorem. We use here the complex version of the Hahn-Banach theorem. Finally, the separation theorem is applied to derive A. Pietsch's profound characterization theorem for absolutely /^-summing operators. 2.1. The dual of the injective tensor product of two Banach spaces If X and ? are Banach spaces (over the same scalar field), then their tensor product X <g> ? can be normed via the so-called projective tensor norm in such a way that the dual of the completion Х®У under this norm is (isometrically isomorphic to) the space B(X, Y) of all continuous bilinear functionals on ? ? ?. The norm? If и e X <g> ?, then the projective norm |и|л of и is 1л = 1пП]Г]||.х,|||Ы|·. и = ]Г\х, ®y,- |. This was known to J. von Neumann and his co-workers F.J. Murray and R. Schatten. They used this fact to considerable advantage when X and ? were both Hubert spaces. What they could not do was identify the dual of the injective tensor product. If и e X <g> Y, then the injective norm \u | v of и is given by |M|v = sup{|u*®}>*)(M)|: x*eBx*.y*eBY*}.
412 J. Diestel and J. Swart where **® y*(j> ®y«) = J>*(*«)>'*0'«). It's easy to see that | |v is a norm on X <g> ? which is a uniform cross-norm: \x <g> _y|v = ||*|| ||.y|| and if и ? : ? ? -> У ? and и:: ?: -> ^2 are bounded linear operators then и ? <g> i<2 is a bounded linear operator from X| <g>X2 to Y\ ®?? with bound the product of the individual norms when each is equipped with the injective norm. This norm is in fact the least such norm and the completion of X <g> ? with this norm is called the injective tensor product of X and ? and denoted by Х®У. For any и & X <g> Y, it's easy to see that |k|v ^ |к|л- So each member of (Х®У)* is a member of the dual B(X, Y) of Х®У. The question is: just which continuous bilinear functionals on ? ? У are continuous on Х<Й>У? Here's what Grothendieck saw: given X and Y, the dual balls Bx> and By are each compact in their respective weak* topologies and so the product space К = (Bx*, weak*) ? (??>, weak*) is compact, too; if и e X <g> Y, then и can be viewed as a member Л of C(?)-just define й at a typical point (**, y*) of К by й((х*.У*)) = (х*®У*)(")· It's easy to see (especially once it's been told us) that the map МИ Й is well-defined linear isometry from (X <g> ?. \ |v) into C(K)l This map extends to an isometric embedding of X<g>? intoC(^). Okay, so take ? e (Х®У)*. Extend ? to a member of C(K)* with the extension having the same norm as ?. The Hahn-Banach theorem will come in handy at this juncture. The extension is naturally identifiable with a regular Borel measure ????; the Riesz Theorem strikes! If we take ? e X and у e Y, then on realizing ? as a member of B(X, Y) we see that 4>{x,y) = (p{x®y) = ? ?®?(?*.?*)??(?*.?*)= f ?*(?)?*()')<1?(?*, y*). Jk Jk ? is an integral bilinear form whose norm in (X®Y)* is just the norm \?\(?) of ? as a member of C(K)*. To put it the way Grothendieck did, we have the following.
The Riesz Theorem 413 THEOREM 2.1 (A. Grothendieck). The dual (X®Y)* of the injective tensor product X®Y of two Banach spaces can be identified with the space ??(?; ?) of integral bilinear forms on ? ? Y; under this isometric identification a bilinear functional ? in (Х<8>У)* corresponds to a measure ? on К (as above) via the formula <p(x,y)= ? **0r)v*(v)d/i(**.v*). Jk 2.2. Choquet 's representation theorem Next we wish to talk about Choquet's representation theorem. We need some words of introduction. Our setting is a locally convex (Hausdorff) linear topological space E; inside of ? we have a (non-empty) compact convex set K. Each such set К has extreme points: ? e К is extreme if ?\{.x} is convex. In fact, it's a celebrated result of M. Krein and D. Milman that any non-empty compact, convex subset К of a locally convex space ? has extreme points and, in truth, К is the closed convex hull of its set of extreme points. If К is a non-empty compact convex subset of a locally convex space ? and ? is a regular Borel probability on К then ? has a barycenter ?(?) e К: .?(?) is the unique point of К for which ?*(?(?)) = ( **(?)??(?) Jk for each x* e E*. Here's a stunning fact-of-life discovered first by G. Choquet; we follow F. Bonsall's advice about the truth of Choquet's representation theorem. THEOREM 2.2 (G. Choquet). Let К be a non-empty compact metrizable convex subset of the (real) locally convex space E. Then each point of К is the barycenter of a regular Borel probability measure ? on К which is supported by the set e\(K) of extreme points ofK. To be sure, when К is metrizable then the set cx(K) is a Borel set, a Ga-set, and so our claim is that given jco e К there is a regular Borel probability ? on К whose barycenter is xq for which ?(&?(?)) = 1. Outline of Bonsall's proof. We start with the closed linear subspace A(K) of C(K) consisting of affine functions; f eC(K) is affine if given ?. ? e К and 0 ^ ? ^ 1 we have f(Xx + A - X)y) = ?/(.?) + A - ?)/(.?). Of course we also consider convex and concave functions on К: /is convex if given x,y e К and 0^? ^ 1 we have f(kx + A - k)y) <: kf(x) + A - ?)/(?);
414 J. Diestel and J. Swart while / is concave if — / is convex. To help us in our search for a suitable dominator, we consider the concave envelope / of / e C(K): J(x):=M{h(x): heA(K). /(v)O(y), foryeK]. It's not hard to verify that • / is concave, bounded and Borel; • / ^ / with / = / precisely when / is concave; • f_+g^f + gand7f = tfift^0: • i/-gmi/-giioc • f + a = f+a ifaeA(K). Take xq & K. Define ? onC(K) by />(/) =/(*>)¦ ? is positively homogeneous and subadditive. Since К is compact and metrizable, C{K) is separable and so is A(K). Hence there's a sequence in the closed unit sphere Sa = {a e A: \\a\\x = 1} of A(K) that's dense therein; if (a„) is the sequence let /о е С (К) (by Weierstrass's M-test) and /o is strictly convex. Define F on A{K) + E/o by F(a + ?/?) = a(x[)) + a/0(.v0). Notice that on A(K) + R/0, F is dominated by p! If ? ^ 0, then a + a/0 =_a + a/0; if a < 0, then a + a/o is concave so that a + ?/? = a + ?/? which is ^ ? + ?/(). In either case, F(a+a/0) s?p(a+a/b). We can extend F from A(K) + R/o to all C(K) in a linear fashion with its extension, which we'll call ?, still dominated by p; that is, for any g e C(K) 4>{g) ^P(g) =?(¦*())¦ Of course, for any ? e A(?). <p(a + ?/?) = ?(.??) + ?/(.го) = F(a + ?/0). If g e C(?) is non-positive, then 0 > g(xo) = Pig) > <p(g)\
The Riesz Theorem 415 so ? is non-positive on non-positive members of C(K); <p is non-negative on non-negative members of C(K). ? is non-negative member of C(K )*; there is a non-negative measure ?, a regular Borel non-negative measure on ?, so that -/.¦ <P(g)= I gd^- 1 eA(K) and \ = \(xQ) = F(\) = <p(\)= ? \?? = ?{?). Jk ? is a probability. A look at our strictly convex friend /o reveals that / /??? = <?(/0) = /0(?·0): but /o ^ /0 so / /?^?^ / /?^?- On the other hand, if a e A and ? > /?, then ? > /0, too; so a(*0) = .F(a) = <p(a) = / ?<*?> / /0^? ^ / /?<*? = ?(/?) =/()Uo)· A look at the definition of /o(*o) shows all above are equal! So / /0??= ?()??. Jk Jk Wow! _ /o is strictly convex so any point л e К where /o(.v) and /o(.v) are the same must be extreme: if у ? ?, then f(y + z\ /o(.v) , /o(c) . 7o(v) + 7o(c) . 7 ?? + ¦ Hence {jteK: /o(*) = 7(>W}?ex(K). However, /o ^ /(, and fK /0?? = fK ????.
416 J. Diestel and J. Swart It follows that 1 = м([/о = 70]Нм(ех(К))«:1. This proof is complete. ? One remarkable consequence of Choquet's theorem is the following characterization of weakly null sequences in Banach spaces COROLLARY 2.3 (J. Rainwater). Let (x„) be a bounded sequence in the Banach space X. Suppose that lim;t*(jt„) = 0 ? for each extreme point x* of ? ?*. Then (xn) is weakly null. PROOF. Suppose (x„) is a bounded sequence in X satisfying the extremal condition: for each x* e cx(Bx*) lim;t*(jt„) = 0. ? If we could just suppose X were a separable Banach space, then ??* would be weak* compact and weak* metrizable and so Choquet's theorem would be "on call". So what? Well, if Xq e ??* is arbitrary then there must be a regular Borel probability measure ? on (??*, weak*) such that ?(??(??*) = 1 and for each ? e X x%(x)= ? ?*(?)??(?*)= ? ?*(?)??(?*). J Bx* ??(??») Of course, the hypothesis that lim„ x*(x„) = 0 for each x* e ex(Sx>) just says that as functions on ??*, (jc„) tends to zero ?-almost everywhere. But (x„) is also uniformly bounded on ? ?*, so Lebesgue's still-wonderful bounded convergence theorem says \imx*(x,,)= [ ?*(?„)??(?*)= ? ?„(?*)??(?*)=0. " J вх* Jbx, (?,,) is weakly null. How we can deduce the general situation from the separable one? Well, the closed linear span S of the sequence (x„) is separable and (x„) is weakly null in S precisely when (x„) is weakly null in X. Does (x„) satisfy the hypotheses that lim„.s*(.x„) = 0 for each s* e ex(Ss*)? If so, then all's right and the Corollary follows. Here's the punchline: if s* is an extreme point of Bs*. then one ofs*'s Hahn-Banach extensions to all X is an extreme point of ??*. This keen observation of Ivan Singer is easy-to-see (once it's been pointed out!): if you consider the set ? of all Hahn-Banach extensions of a given extreme point s* to all of X, then ? is a non-empty weak* compact
The RieSi Theorem 417 convex subset of Bx- arid so has at least one extreme point x*\ it's easy to see x* is also an extreme point of ? ?*. ? 2.3. The Stone-Weierstrass theorem Joined by the Krein-Milman theorem and playing under the brilliant direction of L. de Branges, our dynamic duo soon produces a famous abstract proof of the Stone-Weierstrass theorem in its complex setting. Recall the theorem's statement. THEOREM 2,4 (Stone-Weierstrass). Let К be a compact Hausdorff space and X be an algebra inside C(K), the space of all continuous complex-valued functions defined on K. Suppose X is 'self-adjoint', that is, f e X /// e X. Assume that A) ifkeK, then there's an f e X so f(k) ? 0; B) if p,q e К and ? ? q, then there's an f e X so f(p) ? f{q). Then X is dense in C(K). Suppose that X is not norm dense in C{K). The crux of the de Branges's argument is in the careful study of the family of measures U{X) = {? e BC(K)*'- ? vanishes on X and is real}. a weak* compact convex set which contains a non-zero member, thanks to the Hahn- Banach and Riesz partnership. By all that's holy (that is, the Krein-Milman theorem), U(X) has many non-zero extreme points. Let's take a look at such. LEMMA 2,5. If ? e cx(U(X)) and ? ? 0 and if g: К -> С is a ?-essentially bounded Borel function such that f /?<?? = 0 for each f e X, then g is ?-essentially constant. PROOF. The proof of this lemma involves masterful manipulation of measures and of integrals having complex values. It's worthy of the closest attention. It's plain that ||?|| = 1. Further, jlgd? = 0 for each / e X; after all, X is 'self-adjoint'. But each ? e U{X) is real so for any ?- integrable h, I hd? = I hd?\
418 J. Diestel and J. Swart from this we see that if / e X, then / 0 = j fgdix = j fg-?? = j fgd/l and so for each / e X fgdv = 0. Ah ha! If / e X, then 0 = J fgdix + j fjdix = j f(g + g)d^ = 2 j /Re^M. We can assume that g is real valued. We do so. Adding real constants to g doesn't affect g's Borel measurability nor its ?-essential boundedness nor its real-valued nature. So if we add С to g we have for any / e X that by ?^ membership in U{X) 0= j fgd^ + C j /??= ? fig + ???. In short, we can assume g is positive ?-almost everywhere. Normalize and we can assume as well that / 8?\?\=\. ? ?' then |g| ^ 1 so g, being real-valued, satisfies Suppose |g| ^ ?, ?-almost everywhere, where 0 < ? < ??. Could it be that ? > 1? If so, J \\-8\?\?\ = j ??\?\- j 8?\?\=1-1=0 and 11 - g\ = 0, ?-almost everywhere! This says g is 1 ?-almost everywhere which we've supposed not to be the case. So 0 < ? < 1. The finale is in sight. Define ?? and ?? by ?,(?)= ? &??, ?:(?)= ? \— J в J в ' — — ??. ? for ? a Borel subset of K. Both ??, ?: e U{X). Since g is not ?-essentially constant, neither ?? nor ?? is ?. Yet ? = ??? + A -?)??, contradicting ?'5 extremality. The lemma's proof is complete. ?
The Riesz Theorem 419 On to the proof of the Stone-Weierstrass theorem. Suppose X is a self- adjoint subalgebra of C{K) satisfying B) yet failing to be norm-dense in C{K). Then U{X) ? {0} and so U{X) contains a non-zero extreme point ?. Let g ? X, For each / e X, f · g e X, too. Hence for each / e X, Since g e X. g is a bounded Borel measurable function and so our Lemma 2.5 kicks in to tell us that g must be ?-essentially constant. But this says that g is constant on ?'5 support; after all, g e C(K). Since \?\(?) = 1. f gdM = g(k) is the constancy achieved by g on ?'5 support, g does not change anywhere throughout ?^ support. Look what we have done; any g e X is constant on ? 's support; B) says that support must be a singleton {ко}. There must be an a e C, |a| = 1 so that ? = aSin, where <5*„ is the 'point charge' or 'Dirac delta' at ко, ?*„(/) = f{h)) for / e C(K). It follows that for any g e X 0= / gd? = ag{k()) and so A) fails. This completes the proof, ? 2.4. The Pietsch domination theorem In the late sixties, A. Pietsch initiated the study of absolutely p-summing operators between Banach spaces. A bounded linear operator и : X -> ? between the Banach spaces X and ? is absolutely p-summing A ^ ? < oo) if there is a constant ? > 0 so that for any finite collection {jc ? xn} of vectors in X ???,??") <? sup ??>*?'') ; (О if и is /^-summing then we define the /^-summing norm np(u) to be inf{Ai; A) holds}. The importance of p-summing operators lies in large part in the way it measures m's size. The comparison found in A) is not at all classical and it's a fact that being able to compute the right-hand side often leads one to classical notions, be they from harmonic analysis, operator theory or probability theory; as a result, p-summing operators are offen unexpected players in a serious mathematical morality play.
420 J. Diestel and J Swart While the definition of /^-summing says nothing about measures, be assured - measures are always nearby, everready to make a difference. Pietsch made sure of that by proving the following THEOREM 2.6 (Pietsch domination). Let и: X -> ? be p-summing. Then there exists a regular Borel probability measure ? defined on compact space (??*, weak*) such that for any ? e X \\ux\\ <л>(и)( ? \?%?)\???(?*) V Jbx, An aside; if we wish we can view each ? e X as a member of C{(B\*, weak*)); this has been our schtick on several occasions already. The point of Pietsch's Theorem 2.6 is that и being a p-summing operator allows и to be extended in a natural fashion from X viewed as inside C((Bx*, weak*)) to be bounded linear operator (with bound ^ n,,(u)) on the ??(?) closure of X. PROOF, Let's look at just how this theorem is proved. Our proof follows Pietsch's spirit but we let B. Maurey lead the way. For simplicity we denote by К the compact space ? := (??*, weak*). For ? ? ,x„ ? X define /v, ,„ e C(K) by ?-, ??**) = <(") ? ?**(*;)? - ? ""¦*; II"· ]?? j ?? Let C be C=\fXl h,eC(K):xu,...xneX}; it's a pleasant surprise to find out that С is a convex cone. More, by the very definition of np(u), each member of С is somewhere non-negative. Hence СпЛ^=0, where Л/"= {g e C(K): g is everywhere negative}. But ? has a non-empty interior so the separation theorem assures us of the existence of a non-zero member ? e C(K)* such that ?(??(??(/), forgeWand/eC, B) The left-hand side of B) ensures that ? is a non-negative measure; B) is impervious to positive scaling, so we can assume the non-zero ? is, in fact, a probability. Finally, if ? e X '//'
The Riesz Theorem 421 then <??(/?)= ? (n^(u)\x*(x)\p -\\??\\?)??(?*) J ? ?* = ?';(??) ? \x*(x)\"dv(X*)-\\ux\\P. Jbx, since ? is a probability; A) follows from this. ? 3. The Riesz Theorem for operators 3.1. Weakly compact operators on C(K) The first, and as yet the most successful, line of generalization of the Riesz Theorem, is to operators as opposed to functionals. In the main, here's what's involved. Let и :C(K) -> X be a bounded linear operator from the Banach space C(K) to the Banach space X, Then и** is bounded linear operator from C(K)** to X**; if В is a Borel subset of K, then ?? acts continuously and linearly on C(K)* via the formula ??(?) = ?(?). An X**-valued measure is borne: F„(B) = и**(хв). The basic properties ofF„? • Fu is bounded and finitely additive into X**, • Fu is weak*-countably additive. • F„ is weak*-regular. It's a wonderful discovery of R.G. Bartle, N. Dunford and J.T Schwartz that tells us when Fu is norm-countably additive. THEOREM 3,1 (Bartle-Dunford-Schwartz). Let u:C(K) -> X be a bounded linear operator and Fu:Bq{K) -> X** be it's 'representing measure', as above. Then the following are equivalent: A) и is a weakly compact linear operator, that is, u(BciK)) 's relatively weakly compact in X, B) Fu takes all its values in X. C) F„ is norm-countably additive. D) Ftl is norm-regular. The point of the correspondence is just this: for any operator и : С (К) -> X we have that x*uf= f f(k)dx*Fu(k) Jk holding for each / e C(K) and each x* e X*. If и is weakly compact, then the x*'s are
422 J. Diestel and J. Swart extraneous, the integral exists in the norm topology and the integral [ fdF Jk is a powerful ally in the study of и. 3.2. The Dunford-Pettis property and property V To mention but one application of the Bartle-Dunford-Schwartz Theorem 3.1 we cite the following basic result due to A. Grothendieck that's found in his fundamental paper about weakly compact linear operators on C(K )'s. THEOREM 3.2 (Grothendieck). Any weakly compact linear operator u:C(K) -> X carries weakly convergent sequences to norm convergent sequences. L[-spaces share with the C(?)-spaces the phenomena described above, a fact discovered by N. Dunford and B.J. Pettis; in light of this, Grothendieck called such spaces 'spaces with the Dunford-Pettis property'; after Grothendieck's discovery that C(?)-spaces have the Dunford-Pettis property it was to be several decades before other significant examples were uncovered and when they were it was with the aid of the Riesz theorem in subtle, analytically-delicate situations. A word or two about how the Bartle-Dunford-Schwartz description of weakly compact operators provides an approach to Theorem 3,2. If (/„) is a weakly null sequence in C(K), then the Banach-Steinhaus theorem tells us there's a constant ? > 0 so that \f,,(k)\ ^ ? for all ? e N and all к e K; further, Banach tells us that (f„(k)) tends to 0 for each к е К. But the point of a norm countably additive integral is that the Lebesgue bounded convergence theorem holds, for much the same reason as for scalar integrals: Egoroff's theorem. So (/ /„ dFu) tends to 0 in X's norm and with it lim„ \\uf„ || = 0, too. Before you get carried away with the apparent elegance of this proof, we rush to warn you that Grothendieck's proof, while never explicitly naming names or specifying a representing vector measure, follows pretty much the same line of attack as that of Bartle- Dunford-Schwartz. He does more. His deep analysis of weak compactness in C(K)* permits him to prove the converse of Theorem 3.2. In fact he shows the following. THEOREM 3.3 (Grothendieck). Let К be a compact Hausdorff space and X be a Banach space. Suppose u:C(K) —> X is a bounded linear operator. Then the following are equivalent. A) и is weakly compact. B) и is completely continuous, that is, и takes weakly convergent sequences to norm convergent sequences, or, what's the same, и takes (relatively) weakly compact sets to (relatively) compact sets. C) и is weakly completely continuous, that is, и carries weakly Cauchy sequences onto weakly convergent sequences. D) и takes weakly Cauchy sequences into norm convergent sequences.
The Riesz Theorem 423 It is Grothendieck's supple handling of regularity that wins the day; paving the way is his stunning improvement of a result of his mentor, J. Dieudonne. The result? THEOREM 3.4 (Dieudonne-Grothendieck). For a bounded subset В of С (К)* to be relatively weakly compact it is necessary and sufficient that given any sequence (G„) of pairwise disjoint open subsets of К we have lim sup |m(G„)| =0, Grothendieck was not only one who had something to say about interesting variants of weakly compact operators on C(K). Soon after Grothendieck, A. Petczynski introduced the notion of an unconditionally converging operator, и : X —* ? is unconditionally converging if whenever ?„ ?? is a series of terms in X for which ?„ ?-**(?'?)? < oo for each x* e X*. then ?„ u(xn) is unconditionally convergent in Y. The celebrated theorem of W. Orlicz and B.J. Pettis assures that weakly compact operators are unconditionally converging regardless of their domain/codomain. Petczynski showed the converse for operators acting on C(K)'s. THEOREM 3.5 (Petczynski). A bounded operator и :C(K) -> X is weakly compact if and only if и is unconditionally converging. Again this special result about operators onC(^)'s leads to the isolation of an important Banach space invariant. With Petczynski, we say that a Banach space X has property V if any unconditionally converging operator и : X -> У is weakly compact. It's an elegant piece of functional analysis that there is but one possible obstruction to an operator being unconditionally converging: the classical Banach space cq of all null sequences of scalars; indeed, as noted by Petczynski, a bounded linear operator и ; X -> ? fails to be unconditionally converging if and only if there is a subspace Xo of X that's isomorphic to со such that u's restriction to Xq is an isomorphism. This leads to a fundamental consequence about the structure of Banach spaces of C(K) ilk. THEOREM 3.6 (Petczynski), If X is a complemented (closed linear) subspace ofC(K) and X is infinite dimensional, then X contains an isomorphic copy o/cq. Complementation means there is a bounded linear projection ? :C(K) -> C(K) whose range is X. Were X to be without a subspace isomorphic to со, then by what we're said above, ? is unconditionally converging, after all, P's range has no cq's in it. But Theorem 3.5 tells us that ? is weakly compact. Theorem 3.3B) tells us that ? is also completely continuous. Let's take stock: start with a bounded sequence (/„) in C(K); apply ? and the resulting sequence (Pf,) has a weakly convergent subsequence (Pg„)\ apply ? again and the result (P2g„) = (P(Pg„)) is norm convergent. P2 (= P) takes bounded sets to relatively compact sets; ? is a compact linear operator! The only closed
424 J. Diestel and J. Swart linear subspaces that could possibly serve as the range of a compact linear operator are finite-dimensional ones. We rush to point out that Pelczynski called on the Bartle-Dunford-Schwartz Theorem 3.1 to prove a version of Theorem 3.5 that led him to Theorem 3.6; more precisely, he showed that // X contains no copy of со, then even,· и : C(K) -> X is represented by an X-valued Fu and so is a weakly compact operator. As yet no one has proved Theorem 3.5 by purely vector measure theoretic techniques using Theorem 3.1. Before leaving this aspect of the Riesz theorem for operators we'd like to point out that Bartle, Dunford and Schwartz based much of their analysis on a basic feature of vector measures discovered during their work: If ? is ? ?-algebra, X is a Banach space and F: ? —> X is norm-countably additive, then there is a countably additive scalar valued ? on ? so that the family {x*F: \\.x*\\ ^ 1} is uniformly absolutely continuous with respect to ?. Consequently, if и :C(K) —» X is a weakly compact linear operator then there is a regular Borel probability ? on К so that u*X* с Ll(?); of course, u* :X* -> L'(m) Я: C(K)* is still a weakly compact operator and u*B\* is bounded and uniformly integrable. An old chestnut of de la Vallee Poussin now provides us with a convex increasing ? : [0, oof-» [0, oof such that Ф(х)>0 (ifjc>0), г ?(?) ? г ?(?) , lim = 0, hm = +oo and .v->-0 X л->-+зс ? f *(|iiV(*)|)d/i(*Kl JK for each x* e Bx*\ u* is actually a bounded linear operator into the Orlicz space L0 (?), A bit of tender love and care allows us to factor и through L* (?) where ? is the N- function conjugate to ?, in fact through the 'absolutely continuous' part of L* (?). Much can be derived from this; we mention but one analytic consequence: if и :C(K) -> X is a weakly compact linear operator and (/„) is a bounded sequence in C(K), then (u(f,)) has a subsequence with norm convergent arithmetic means. What of other classes of operators? Naturally, compact linear operators are weakly compact; what of their representing measures? Bartle, Dunford and Schwartz were up to the task: a bounded linear operator u;C(K) —> X is compact precisely when the representing Borel measure Fu is X-valued and has a relatively compact range. 3.3. Absolutely summing, integral and nuclear operators on C(K) More intriguing is the following THEOREM 3.7. The operator и ,C(K) -> X is absolutely {-summing if and only if the representing Borel measure Fu is of bounded variation; in this case, и is integral and
The Riesz Theorem 425 admits the following factorization C(K) *¦ X L\\FU\) where a :C(K) -> Ll(\Fu\) is the natural inclusion and b ; L[ (|Fu\) -> X is given by b(f)= f f(k)dF«(k). We take note of the easily-verified fact that a bounded linear operator between Banach spaces is absolutely summing precisely when its biadjoint is, with both sharing the same sized absolutely summing norm; it's not hard from this to verify that the representing measure Fu of an absolutely summing operator u:C(K) -> X is of bounded variation. If F„ has bounded variation, then the factorization above is quickly verified and with it the absolutely summing nature of и; after all, the natural inclusion of С (К) into L](\FU\) is absolutely summing, thanks to Beppo Levi, and an operator with absolutely summing factor is absolutely summing. Such a result is absolutely pregnant with possibilities. Here's one: when is Fu Radon- Nikodym differentiable with respect to | F„ | ? The answer is marked by its simplicity. THEOREM 3.8. Let и :C{K) —> X be an absolutely summing operator with representing Borel measure Fu. Then d Fu /d\ Fu \ exists as a Bochner integrable function precisely when и is a nuclear operator, that is, и admits a representation in the form "(/) = ][>„(/)*„. feC(K). ? where (д„)С С'(К)*, U„) С X andJ2„ IImJKII < °?· As an application of Theorem 3.8 we present a proof of a gorgeous theorem of Pietsch. It may be viewed as a natural generalization of the fact that in Hilbert spaces, the composition of two Hilbert-Schmidt operators is an operator of trace class. Recall from Section 2 that an operator и : X -> У is absolutely /^-summing if there is a ? > 0 so that if ? \, xn e X then (][>**||? <Msup{ (J] |.ir*(**) inf{Ai > 0: above obtains} is the absolutely /^-summing norm of u, denoted by ??(?). Here 1 < ? < oo. ЧР <1
426 J. Diestel and J- Swart Now it is relatively painless exercise to see that и: X -> ? is absolutely /^-summing precisely when и carries weakly p-summable sequences in X to absolutely p-summable sequences in Y. The sequence (x„) is weakly p-summable if for each x* e X* ?^\?*{?„)\? <??; ? this linear space of X-valued sequences is a Banach space (if X is) when equipped with the Carnorm |(^|^к=5ир{(]Г|.х*(;с,,)|Л :дг*еВх.|. The sequence (y„) in ? is absolutely p-summable when ?>???"<??; ? naturally, ||(jn)llt'' is just (?„ 113"cr 11я)'/ я - Again, €^1гопа(У) is a Banach space (if У is). It's part and parcel of the theory of absolutely /^-summing operators that и : X -> ? is p- summing when (u(x„)) e t'^mm„(Y) whenever (л„) е ^eak№ an(J, in fact, лр{и) is just the operator norm of the induced operator from ^.eak(X) to iflronB(i'). The p-summing operators are an Operator ideal' so the composition of two operators one of which is p- summing is /^-summing with an estimate of the form np(uv) ^ тг/,(ы)||и|| or ?^(?)||?|| available depending on the /^-summing factor. Finally, if и is /^-summing and ? < r < oo, then и is r-summing, too, with ?, (и) < ?,, (и). The /^-summing operator scale behaves much like the L''(?)-scale, a fact that in light of Pietsch's Domination theorem is not surprising. But surprises do exist. Here's one. THEOREM 3.9 (Pietsch). If u:X -> ? and v.Y -> ? are absolutely 2-summing operators, then ? о и is nuclear First, we'll write и in factored form. By the Pietsch Theorem 2.6, there is a regular Borel probability measure ? on К = (??*, weak*) so that for any ? e X \\UX\\ ^n2(u)\\x(-)\\L2uty If we temporarily denote by ?? the closure of {.v(-): ? e X} in ?2(?), then this inequality says that и can be (linearly) extended to a bounded linear operator й: ?? -> Y\ we can even
The Riesz Theorem 427 assume ||?|| ^ Л2(и)- But Hubert spaces being Hubert spaces, й can be extended to all of ?2(?) in a linear, norm-preserving manner just by letting ? vanish on X^. The result: -* ? C(K) - L2(M) where a : X -> C(?) is evaluation and /' :C(K) -> ?2(?) is the natural inclusion. Our first task will be to show that и о и is absolutely 1 -summing and so we need a handle as what и does to sequences (x„) that're in CakW- Of course, if (*„) e ^,eak(X), then (jc„) e ^weak^)'too; s0 ("¦*'<) 's absolutely 2-summable. More can be said. Leta„ = (^ |jc„(^*)|iiM(.v*)I/2.Then ???"?2= ? ?\??(?*)\??(?*)= ? ^i^*(^)i^u*)^ii(^)ii(. (?)" So(a„)er and||(a„)||2iC||(.v„) |1/2 *'....l(A·)' Now we call on the duality implicit in our factorization of и through ?~(?): for y*eY*, й*(у*)е?2(м)* = ?2(м); |й*(У)| < 11й*И1Ь-*И = ???????.??? «? л-2(и)||у*Н- Moreover, for ? e X and у* e ?*, й*у* е L2(m)* and х(-) e ?2(?) so у*(и(дс)) = й*0'*)(дг(-)) = (*(-). "Y(-))t2lA) = ( дс(дс*)й*у*(дг*)^(.г*). ./к |у*(и(дс))|< ( |**(дО||й*у*(дг*)|<*д(дг*) ./к = ( |.х*и)|1/2|.х*(.х)|'/2|й*у*(л-*)|^м(.х*) ./к which by Cauchy-Schwartz is, <|к(-)Г/2|2||дс(-)Г/2|й*у*(-)||2 = |U-(-)|!/2||.v(-)il/2|i7V(-)i|U.
428 J. Diestel and J. Swart If we let y„ = и {x„ /?„) then \У*(У„)\2 = \y*{u(x„/a„))\2 = -i-| y*M(.v„)|2 < ^||k,(oi1/2iuV(-)i|; = \\\х,л-)\1,1\й*y*l·) \?„\? = f \х„(х*)\\й*у4-х*)\2<1?(?*). JK It follows that for any NeN, ??,·?:«?/?-'*>??*?<·<·>|^·> = /(EI*-^l)l--vu-)|2^) <|(^)|tiiiik(x,ii5Viii2(/11: from this we see that (y„) e ^^еак(У) and l(}'")lt-;eUk(>')^ll(":")lli\i,ak^'7r2(")- So? Well if (x„) e ^eak(-O then we can write u{x„) in the form ?(?„) = ?„}'„ where (a„)ei2 and II(cr„)II ^ ||(-v„)II {" and (Уп) ? tleak With ||(y„)| 2 (n<|Ui)|ii2 ,v,T2(«). "'weak1'' («еак(Л' Now let's apply the absolutely 2-summing operator ? to (u(x,,)) = {u(a„y„))\ the result is the sequence (a„v(y„))-a sequence which is plainly absolutely summable! To be sure, (}'„) e ^Суеак(^) and и is absolutely 2-summing so (u(.v„)) is absolutely 2-summable, that is, (||f(}'„)ll) e i2; it follows that (?„?(\„)) is in ?l with the i[ -norm gauged as follows: им MDI, = 11Aк^(л)||)||, = 1A^111^11I, < ||K)|,|(ikv„ii)|, < IMI'i2 .x^i(v)\\(y„)\\t2 Y («еак,л' «сак1
The Riesz Theorem 429 This shows that not only is и о и absolutely summing but ? ? (v ou) ^ ?2(??)?2(?) as well. Now we look at the figure of и о и; * ? C(KY) —^ L2(My) and concentrate on the following factor of ? о и: C(KX) —^*- L2(MA·) —^ ? >¦ C(KY) —^- L2(My). It's easy to see that ix and /y are absolutely 2-summing and so iY o(Y^ C(KY)) ouoix:C(KX) -> L:(Mr) is absolutely 1-summing. Theorem 3.8 says this operator is represented by an ?-(ду)- valued measure that's countably additive and has finite variation; but L_^y)-valued measures of finite variations always have Bochner integrable Radon-Nikodym derivatives with respect to their variations (Hubert spaces have the "Radon-Nikodym property'); so ? о и has a nuclear factor and it's easy to see that this ensures us that и о и is itself nuclear. It is relatively easy to show that between Hilbert spaces, the 2-summing operators and Hilbert-Schmidt operators are precisely the same while the trace-class coincides with the nuclear operators. So the above result is a generalization, a Banach space generalization, of a familiar and fundamental feature of operators on Hilbert spaces. No doubt the role of Theorem 3.8 in Banach spaces has been most profound when used in tandem with the theory of differentiability of vector-valued measures. Without going into details (they can be found in the monograph of J.J. Uhl and J. Diestel A978)) we mention a couple of the most striking such applications • If и : X -> ? is an integral linear operator (that is, the bilinear functional ?„ on X x Y* given by <Р(х,У*) = У*(и(х)) is integral in the sense as defined earlier) and v:Y->Zisa weakly compact linear operator, then ? о и is nuclear. • IfX* has the approximation property and the Radon-Nikodym property, then X* has the metric approximation property. C(KX) *· 1?(??)
430 J. Diestel and J. Swart Recall that a Banach space У has the approximation property if regardless of the Banach space X, the finite rank bounded linear operators from X to У are dense in the space of all bounded linear operators from X to У relative to the topology of uniform convergence on compact subsets of X. У has the metric approximation property if for any Banach space X and any bounded linear operator и : X -> У. и is in the closure of all finite rank bounded linear operators between X and У of norm ^ ||u||, again relative to the topology of uniform convergence on compact subsets of X. We hasten to observe that the Radon-Nikodym property's role in the proof may be extraneous; as yet, it's unknown whether dual spaces with the approximation property cannot be renormed to have the metric approximation property. 4. The Riesz theorem for vector-valued continuous function spaces We turn now to the Riesz Theorem in a vector-valued setting; more specifically, we describe the dual Cx(K)* of the space of continuous X-valued functions defined on the compact Hausdorff space K, The basic result is due to I, Singer. THEOREM 4.1, The dual, C\(K)*, of the space of all continuous X-valued functions defined on the compact Hausdorff space К (and equipped with the supremum norm 11/11 = sup{||/(?)||: к e K}) is (isometrically isomorphic to) the space of all countably additive, regular X*-valued Borel measures on К that have finite total variation where the norm of such an F is given by \\F\\ = total variation of F over K. Key to establishing this theorem is a result of J, Dieudonne which says that Cx(K) is just the closed linear span of the 'elementary tensors'/(О* (= / <g> x), where / e C(K) and ? e X. Using this by-now-classical result, we are able to recognize Cx(K) as the injective tensor product С(КЩХ discussed in Section 2, Building on the discussion of Section 2, most particularly, Grothendieck's description of the dual of any injective product, we soon see that Cx(K)* = (C(K)<&>X)* is precisely the space of the integral bilinear forms on C(K) ? ?. But each such bilinear form gives use to an integral linear operator и : С (К) -> X* which through the good grace of Theorem 3.7 in Section 3 forces a representing measure F„ on us. F„ = F is the culprit we're looking for and it fills the bill: Theorem 3.7 of Section 3 tells us that F is an X*-valued countably additive, regular Borel measure of finite total variation; what's more, the total variation of F, the integral norm of u, the absolutely summing norm of и, the integral norm of the bilinear functional, the norm of the same bilinear functional as a member of (C(K)®X)* - they're all the same. Finally, if / e C(K) and ? e X, then [ f(k)dF(k)(x)= f f®xdF Jk Jk defines the fundamental action of a member F of the space of measures at an elemental f <8>x &Cx(K). The natural extension of this action to all of Cx(K) defines f [f(k),dF(k))= f fdF. Jk Jk
The Riesz Tlieorem 431 Theorem 4.1 provides some with the smug feeling that the structure of Cx(K) is much like that of a convenient mix of C(K) and X, Woe befalls possessors of such naivete. The structure of C\(K) is subtle and, while it a fortiori must reflect that of C(K) and X, it also has more than a few hidden secrets. Here's one small bit warning: in the case of C(K), once you know that К is an infinite compact Hausdorff space, C(K) contains an isomorphic copy (an isometric copy even) of со, Indeed, under such circumstances, you can build a sequence (/„) in C(K) of norm- one elements so that /„, · /„ = 0 whenever тфп; it's easy to show that the closed linear span [/„] in C(K) of the /,,'s is isometrically isomorphic to c(>. Sometimes, the result is complemented in C(K) and sometimes it's not. In fact, it's so that for C(K) to contain a complemented copy of с о it is necessary and sufficient that there exists a weak*-null sequence (?„) in C(K)* that's not weakly null. This delicious byproduct of Theorem 3.4 is due to Pilar Cembranos. Rest assured that there are C(K)'s for which weak*- null sequences in C(K)* are always weakly null: the first examples of this phenomenon were uncovered by Grothendieck (and so Banach spaces sharing in this property are called Grothendieck spaces) who showed that // К is extremally disconnected (the closure of any open set in К is open), then weak*-null sequences in C(K)* are weakly null. Thanks to Kakutani we know that the 2nd dual space C(K)** is a C(K**) for some (extremally disconnected) compact Hausdorff space K** and so, in a sense, the C(K)'s that're Grothendieck spaces are in the majority. It's still not known precisely which compact Hausdorff spaces К produce C(K )'s that're Grothendieck spaces; be forewarned, there are some connected spaces to be accounted for in any characterization. Take heart: even among the O-dimensional compact Hausdorff spaces no usable characterization, be it topological or Boolean, exists. In case of vector-valued functions, not only can you find co's, under minimal conditions there are complemented co's! Here's what's so: THEOREM 4.2 (Cembranos-Freniche). If К is any infinite compact Hausdorff space and X is any infinite dimensional Banach space, then Cx(K) contains a complemented copy of c0. (We follow F, Bombal in the proof of Theorem 4.2) How can such a thing be? Well it's because of X's involvement in the make-up of Cx(K). X is infinite dimensional so the Josefson-Nissenzweig Theorem 1,5 provides us with a sequence (x*) in Sx* (so all have 11-11 = 1) that's weak*-null. For each x* there's an x„ e X so that x* (x„) = 1 and ||jr„|| ^ ?, say. If we pick the sequence (/„) in C(K) to be disjointly supported and each of norm-one, then (f„(-)x„)„ spans а со in Cx(K); if k„ e К satisfies \\fn(k„)x„\\ = \\f„(-)xn\\cX{.K), then the linear operator ? ;Cx(K)^> Cx(K) given by />(/) = ?>,;(/(M)/„(-).v„ ? is a bounded linear projection, whose range is the closed linear span in Cx(K) of (fn(-)xn)n, an isomorph of со.
432 J. Dieslel and J. Swart Remark 4.3. It is important to understand that in the above proof the fact that for any / e C\(K), the sequence (f(k„)) is relatively compact ensured that (x*,(f(k„))) is in со so ?„ ?*?/^?))/,?-)?? makes sense. Since weak* null sequences in the dual Z* determine operators from ? to cq while weakly null sequences in Z* determine weakly compact operators from ? to c<), if weak*- and weakly null sequences in Z* are the same, then every operator from ? to cq is weakly compact. Whatever else is so, a weakly compact linear operator between Banach spaces cannot have cq as its range - Banach's open mapping theorem ensures the reflexivity of any Banach space that's the image of a weakly compact linear operator acting on a Banach space. So, the Cembranos-Freniche Theorem 4.2 has the following consequence. COROLLARY 4.4. if К is an infinite compact Hausdorff space and X is an infinite dimensional Banach space, then C\(K) is NOT a Grothendieck space. After all, Cx(K) always contains a complemented subspace isomorphic to c(> and such containment is forbidden to Grothendieck spaces. What about operators from C\(K) to any other Banach space Y? Formally, the basic representation of operators from Cx(K) to ? in terms of measures is much like that of Section 3. If we start with a Borel subset В of K, an ? e X and a v* e ?*, then define FU(B) to be the member of ?(X; Y**) given by Fu(B)(x)(y*)=u*y*(B)(x) - keeping in mind the fact that u*y* takes its values in C\ (K )* a space of X* -valued Borel measures - then it's plain that we have the right object to represent u. THEOREM 4.5. Bounded linear operators и : Cx(K) -> ? correspondto bounded, finitely additive measures Fu defined on the Borel subsets of К with values in ?(X; Y**). For any y* e Y*, Fu(B)(x)(y*) is countably additive in B, X*-valued with finite variation, regular; for any x, Fu(B)(x)(y*) is a weak*-countably additive, weak*-regular Borel measure having values in Y**. For any f e Cx(K), we have Ы,у*)= f (/(*), <*fi, (*)(/))¦ J к An excellent place to read of this theorem is in it's discoverer N. Dinculeanu's book "Vector Measures" [34] Дп important a.lly in the study of operators и on С ? (?) is the 'Operator semi-variation', Fu, of Fu. Foj a 8o/e.l set В с К, FAB) = ||F„||0p(S) - supj ?? Fu(E)(Xi) where the sypiemum is tafen ?.??? all finite Bo/el partitions я ©f В and. ail members ц of the close4 unit ball ?,? of X.
The Riesz Theorem 433 Here the theory diverges drastically from that of Section 3. In fact, it's still unknown when Fu takes all its values in C(X; Y) - this is a long-standing question of N. Dinculeanu. In a similar vein, a usable characterization of the weak compactness of и in terms of properties of Fu remains open. There have been several serious assaults on Dinculeanu's question. No doubt the starting point of the ultimately successful solution will be the following artful variation of the theorem of Bartle-Dunford-Schwartz discussed in Section 3. THEOREM 4.6 (J.K. Brooks and P.W. Lewis). Let u:Cx(K) -> ? be a bounded linear operator and Fu be the C(X; Y**)-valued representing Bore I measure. Then the following are equivalent statements. A) Fu takes all its values in C{X; Y). B) For each ? e X, Fu(-)(x) is norm-countably additive. C) Each operator ux:C{K) -> Y, ux(f) = u(f()x) is a weakly compact linear operator. In their extensive study of operators и : С ? (?) -> ?, Brooks and Lewis also isolated important special cases in which properties of Fu itself ensure that it takes all its values in C(X; Y). The most interesting of these is the continuity of Fu at 0; // FU(B„) -» 0 whenever (B„) is a decreasing sequence of Borel sets with f\ B„ = 0, then all of Fu 's values lie in C(X; Y). While sufficient, this condition is not necessary. A modified example of I. Dobrakov due to P.W. Lewis and J.P Ochoa serves as a warning to any 'quick-fix artist'! Let К be the one-point compactification of N, X = i] and ? = cq. Set Fu([oo}) = 0. If ? e Iх and В is a Borel subset of K, put F„(B)(k) = ?„???""«. where u\ = e\, иг = щ = ег/2, щ — и$ — иь = еу/Ъ Fu is ?(€'; co)-valued and countably additive but Fu{{\\) = Fu({2, 3}) = F„({4, 5, 6}) = · · · = 1 so Fu is not continuous at 0 (to be continuous at 0 it is necessary and sufficient that Fu(Bn) -> 0 whenever (B„) is a sequence of pairwise disjoint Borel sets). This example is covered by a treasured result of Lewis and Ochoa. THEOREM 4.7 (Lewis and Ochoa). Suppose X is a Banach space with a Schauder basis, and that u:Cx(K)->Ya bounded linear operator with representing measure Fu. F„ takes all its values in C(X; Y) if and only if there is a sequence (u>„) of weakly compact linear operators w„ :Cx(K) -> ? such that (i) u(f) — \\mn w„(f), for each f e Cx(K) and (ii) // Fw„ represents w„ then for each x e X and Borel set В <ZK, F«(B)(x)=limFWn(BXx)- ? This brings us to the issue of weakly compact operators и: Cx(K) -> У-how to tell when и is weakly compact (if possible) in terms of its representing measure Fu. Here one comes face^to-.face with the problem of what do the weakly compact subsets of
434 J. Diestel and J. Swart Cx(K)* look like? This problem is open. In fact, it's not that long ago that the question of weak compactness in the considerably less-intricate Lebesgue-Bochner space ??(?) was resolved by A. Ulger with finishing touches provided by J. Diestel, W. Ruess and W. Schachermayer. As of yet this result has not lead to any new insight into C\{K)*, even when X* has the Radon-Nikodym property - new insights that might help resolve the question of what weakly compact operators on С ? (К) look like. While the general study of weakly compact operators on С ? (?) remains in its infancy, there are results in case X is a special space and the objective is to deduce something about weakly compact operators. For instance, there has been considerable work of substance about when Cx(K) has the Dunford-Pettis property. The following result is by no means all that's known, just the sharpest. Theorem 4.8. (P) J. Bourgain: if X = ?'(?) or X = CL\Ul)(J) or X = L[ ] (J)(v) or ..., then Cx (K) and all its duals have the Dunford-Pettis property. D. Contreras/S. Diaz, N. Randrianantoanina: ifX is the disc algebra A, then Cx(K)* (and so Cx(K) ) has the Dunford-Pettis property: (?) ?. Talagrand: There is a Banach space X so that in X* weakly convergent sequences are norm convergent yet Cx([0, 1]) does not have the Dunford-Pettis property. Talagrand's example is particularly noteworthy for the warning it contains: even when the weakly compact sets in X* are easily understood, those in Cx(K)* need not be. In Section 3 we told of Pelczynski's theorem to the effect that unconditionally converging operators on C(^)-spaces are weakly compact; stated formally, 'C(^)-spaces have property V. In light of the Cembranos-Freniche theorem, it is natural to ask whether or not Cx(K) has property V if X does. Natural to ask but difficult to know. Nevertheless the question has a pleasing answer. THEOREM 4.9 (Randrianantoanina). If X is a separable Banach space with property V, then Cx(K) has property V, too. The proof is a deliciously delicate mix of Banach space ideas, adroit handling of liftings and weak*-measurable maps with a pinch or two of descriptive set theory. Randrianantoanina gets around the question of weakly compact sets in С ? (? )* by calling on M. Talagrand's proof that Cx(K)* is weakly sequentially complete if X* is - a highly non-trivial result of its own. It comes into play because X* is weakly sequentially complete whenever X has property V. Now it's not enough to just call on Talagrand: there's a need to know that in weakly sequentially complete Banach spaces bounded sets are relatively weakly compact precisely when they contain no sequences that act like U 's unit vector basis (a classical result of H.R Rosenthal), and what this means in the case of Cx(K)*; here a deep understanding of Talagrand's work is called on. But Randrianantoanina has such understanding and more, much more. Whether or not the separability assumption on X is needed is not known. For instance, the conclusion of Theorem 4.9 holds if X is any reflexive Banach space - something
The Riesz Theorem 435 first observed by Pelczynski, himself. In truth though even this fact can be deduced from Theorem 4.9 thanks to the property of reflexive spaces that separable subspaces are contained in separable complemented subspaces. Curiously, while weak compactness of operators on Cx(K) are a quagmire not easily escaped, the situation with absolutely summing and integral operators is clear. THEOREM 4.10 (C. Swartz). The operator и :Cx(K)-> ? is absolutely summing if and only if F„ takes its values in ?\ (?; ?), the space of absolutely summing operators from X to Y, in a countably additive fashion with finite variation (each with respect to the absolutely summing norm). THEOREM 4.11 (P. Saab). The operator u:Cx(K)-> ? is integral if and only if Fu takes its values in I(X; Y), the space of integral operators from X to Y, in a countably additive fashion with finite variation (each with respect the integral norm). While in the case of scalar valued functions, absolutely summing operators and integral operators coincide, such is not the case of С ? (?). Indeed, it's a penetrating theorem of JR. Retherford and С Stegall that says that absolutely summing operators on a Banach space ? are integral precisely when ? is a Cx -space, that is, Z's finite dimensional structure is much like that of a C(?)-space; it follows that for most X's there are absolutely summing m's on Cx(K) that are not integral. To continue the path set in Section 3, we mention what the case is with nuclear operators; again, things are unclear: Here's the best that's been said so far. Theorem 4.12 (P. Saab and B. Smith). (I) If u:Cx(K) -> ? is nuclear, then the representing measure F„ takes all of its values in the space N(X\Y) of nuclear operators from X to Y, is countably additive and has finite variation with respect to the nuclear norm. (II) и : Cx(K) -> ? is nuclear, whenever the operator u# : C(K) -> N(X\ Y) given by u*(f)(x) = u(f®x) is nuclear (into the space N(X; Y)). (Ill) In order that regardless of К and Y, u's nucleariry implies that ofu# it is necessary and sufficient that X* have the Radon-Nikodym property. 5. Notes and remarks This last section is a sort of catch-all. We provide bibliographical data. We discuss results related to those of the previous sections. We broach several topics that were left unattended earlier. To be sure, the Riesz Theorem is too broad a subject to be given complete coverage, even in the limited compact Hausdorff context. Our choices of topics were driven by our interests and tempered by our deep, unabiding ignorance.
436 J. Diestel and J. Swart 5.1. Notes and remarks to Section I To say that F. Riesz started the ball rolling with regards to the 'Riesz Theorem' is an understatement of vast propositions. After discovering the proper description of C[a, b]* in [84], he set upon applying the result in [85] as well as detailing its proof. Riesz was to return to the result on several occasions; of particular interest to any serious student of the evolution of ideas is his elegant proof presented in [86] - no one reading this classic piece of mathematics can doubt that the time for a measure-theoretic treatment was rapidly approaching. Soon this promise came to fruition in J. Radon's ground breaking work [79]. As mentioned in Section 1, S. Banach [4] was to provide the first abstract version of the Riesz Theorem. As if to cement the importance of this result, Banach [5] then derived the existence of Haar measure on locally compact separable metric groups; Banach's proof was to serve as a model for the most general results on the subject of invariant measures on homogeneous spaces as witnessed for example in R. Steinlage's [95] characterizations of existence and uniqueness of measures invariant under group actions and С Bandt's [7] work on metric invariance. S. Saks [91] must have felt obliged to answer Banach's functionally derived characterization of C(K)*, for К a compact metric space, with a measure-theoretic approach; in this he was eminently successful. In the early forties, S. Kakutani [59,60] embarked on a program of characterizing C(K)- and L1-spaces among Banach lattices. It was out of this efforts that he was inspired (compelled?) to obtain a usable description of C(K)* for any compact Hausdorff space K. Our formulation of the Riesz Theorem is but one consequence of Kakutani's efforts. The paper of S. Banach and S. Mazur [6] highlights the central role that C(?)-spaces are bound to play in the general theory of Banach spaces. Though Banach and Mazur restricted their attention to separable Banach spaces, their result on universality is a model for general spaces; indeed, it's this fact that forces C(?)-spaces (and with them, the Riesz Theorem) into our analytic consciousness. After all, if X is any Banach space then the closed unit ball ??* is weak* compact and norming; hence evaluation of x e X on Bx» is a linear isometry of X into C((BX*, weak*)). Hence, with the Hahn-Banach theorem in hand, analysis of functionals on X can often be turned to analysis of members of a C(K)*. that is, a 'help!' call to the Riesz Theorem. This being said, we must to add that the Banach-Mazur theorem is special and much of its value lies in its special character. That every separable Banach space is linearly isometric to a closed linear subspace of C(A), where ? is the Cantor set, is important not least of all because C(A) has a Schauder basis, for instance. This feature of the Banach-Mazur universality theorem has been used by many abstract analysts. The Josefson-Nissenzweig theorem is due, independently, to B. Josefson [57] and A. Nissenzweig [69], who gave different proofs that were hauntingly similar in spirit, if not in execution. It was after H.R Rosenthal's [87] profound I' -theorem that the similarity was explicated; we owe a great debt to J. Hagler and W.B. Johnson [55] for such an explanation. The part played by regularity in the proof Josefson-Nissenzweig seems to be indicative of a general theme: the closer one is to a direct application of the Riesz Theorem the more
The Riesz Theorem 437 likely that the regularity of the measure will be a real player in the proof. Sometimes its principal use is to ensure that a space of continuous functions injects continuously onto a dense linear subspace of an L''-space and sometimes, as in Section 1, it is used more bluntly. 5.2. Notes and remarks to Section 2 The subject of tensor products of Banach spaces is widely shunned yet possesses a certain elegance once the arid preparations have been overcome. The earliest work by J. von Neumann, F.J. Murray and R. Schatten was inspired by the Hilbert space situation and so doomed to limited success. To be sure, A. Grothendieck was the first to penetrate the finer structure of tensor products, first in his 'Memoir' [54] and later in much greater depth in his 'Resume' [53]. The integral operators of Section 2, be they linear or bilinear, were to form the cornerstone to Grothendieck's theory of Banach spaces. The proof of Theorem 2.1 can be found in his Memoir, wherein several fundamental consequences are drawn; however, it wasn't until his Resume that the pivotal part played by the integral operators in his view of Banach spaces became clear. For those adventuresome souls who'd like to learn more about tensor products, we recommend R. Ryan's forthcoming Cambridge introduction [88]; a detailed expose of the Resume by the authors and J. Fourier [29] might be helpful for a fuller understanding of Grothendieck's game plan for the study of Banach spaces and their structure. Last but far from least, the monograph [26] of A. Defant and K. Floret gives a clear overview of tensor product techniques vis-a-vis operator ideal notions. Choquet's theorem made its debut in [23]. It soon found applications in diverse areas of mathematical inquiry such as C* -algebras, ergodic theory, non commutative harmonic analysis and Banach space geometry. As befitting such a gorgeous theorem, immensely successful efforts were made to simplify its proof, generalize and expose. F Bonsall's proof [12] probably provided the last word in elegance with regards to Choquet's theorem in the metrizable case. With regards to generalization, probably the most well-known (and as of yet the most often used) is due to E. Bishop and K. de Leeuw [11]. THEOREM 5.1 (Bishop-de Leeuw). If К is a non-empty compact convex subset of a locally convex Hausdorff linear topological space E, then each point ? of К is the barycenter of a regular Borel probability measure ? on К that's supported by the extreme points of К 's in the sense that ? (В) = Ofor any Baire subset В of X that contains none of К 's extreme points. G. Edgar [43] surprised the mathematical world when he provided a generalization that relaxed the compactness condition. THEOREM 5.2 (Edgar). Let К be a non-empty closed bounded convex subset of the Banach space X. Suppose К is separable and has the Radon-Nikodym property. Then each point is the barycenter of a regular Borel probability measure supported by the set of К 's extreme points.
438 J. Diestel and J Swart Separability plays the role of providing access to the methods and results of descriptive set theory, most particularly, the Kuratowski/Ryll-Nardzewski selection theorem; the Radon-Nikodym property enters the foray through ingenious use of the Martingale Convergence Theorem. The monograph of R. Bourgin [15] should be consulted here. Another fascinating effort at generalization (albeit in a different direction) was expanded by S. Khurana [61,62] whose aim was a finitely additive representation theory. As yet, this promising approach has been just that: promising. The Choquet theorem has been the inspiration of several brilliant treatises. First, R.R. Phelps' still-wonderful pocket-book [73] is a must; it's too bad it's out-of-print. Next, E.M. Alfsen's monograph [ 1 ] presents the subject in a clear and compelling manner. Finally, the master himself puts his theorem (and, oh. so much more) into perspective in his treasured three volumes [20-22]. Rainwater's theorem is due, of course, to the ubiquitous John Rainwater [80]. It has descendants of outstanding pedigree, to be sure. John Elton [44] showed the following. THEOREM 5.3 (Elton). In order that every series ???" m a ^anac^ space X for which ?„\?*( xn)\ < oo for each x* e ??(??*) be unconditionally convergent it is both necessary and sufficient that X contains no isomorphic copy o/cq. Be forewarned: if you recall the classic result of С Bessaga and A. Pelczynski [9] which asserts that it's precisely in spaces X without copies of cq that ???" 's unconditionally convergent when J2„ \?*(??)\ < °° f°r eac'1 -v* e %*· men be aware that Elton's Theorem 5.3 is considerably more subtle, relying on ideas related to the Radon- Nikodym property. In particular, adroit use is made of another classical result of Bessaga and Petczynski [10] this time to the effect that in separable duals every non void closed bounded convex set has extreme points and is the closed convex hull of the set of such. J. Bourgain and M. Talagrand [14] also entertained questions involving extremal convergence. The applications of the Riesz Theorem in tandem with the Hahn-Banach theorem often involve extreme point considerations. Nowhere is this more powerfully demonstrated than in de Branges's proof of the Stone-Weierstrass theorem. This paper [25] was somewhat typical of many beautiful proofs from the 1960's involving function algebras. The monographs of ? Gamelin [47] and L. Stout [96] provide clear exposition of many of the best results of the times. De Branges's proof still has a lot of kick in it. Most recently, Victor Lomonosov [66] used ideas implicit in de Branges's proof to show that there exists a uniform algebra A and a non-void closed bounded convex subset К of A such that the set of members of A* that attain their maximum modulus on К is not norm dense in A*. Thus one of the most beloved consequences of the Bishop-Phelps theorem is not so in complex Banach spaces. Now Lomonosov is just the kind of mathematician who knows a good thing when he sees it and soon he turned his counter-example into a positive result [67] showing that all dual uniform algebras in which the full Bishop-Phelps result holds are self-adjoint. This is startling mix of algebraic and geometric thinking, and the Riesz Theorem is again at the center of the action.
The Riesz Theorem 439 A. Pietsch's stunning Domination Theorem appeared in the paper [74] in which he first introduced the class ?? of absolutely /^-summing operators. Our proof is a small modification and special case of an argument of Bernard Maurey [68]. Pietsch's isolation of the absolutely /^-summing operators was a singular event in abstract analysis in general and Banach space theory in particular. It's true that in one guise or another Grothendieck recognized ?\ and ?? but in the latter case he did not have the same norm as Pietsch did. This is important, as witnessed by the remarkable result discovered by D.J.H. Garling and Y. Gordon [48], and, independently, ?.?. Kadec' and M.G. Snobar [58]: // ? is a finite dimensional Banach space, then 7n(id?) = vdimf. Precise computations like this are frequently encountered in the literature on absolutely summing operators, particularly as they relate to other aspects of mathematical enquiry. We cannot overstate our respect for Pietsch's accomplishment. Our attitude is buttressed by the many monographs that center on the Pietsch theorem and its ramifications. To be precise let us mention: [72] (where connections are made with spaces of analytic functions), [77] (which shows the role played by factorization schemes in harmonic analysis and operator theory), [63,75] (each of which draws deep conclusions about eigenvalue distributions for classical operators from the theory of /^-summing operators and their relatives), [99] (where Minkowski spaces are the object of attention from a ??-perspective), [78] (which studies volume ratios and the geometry of convex bodies with a ??,,-helper) and [76] (where classical studies on orthonormal systems are given a refreshing new look). In addition the Cambridge books of [30] and [102] give ample overview to how summing operators 'fit' within the fabric of abstract analysis. 5.3. Notes and remarks to Section S The original proofs of Theorems 3.1 through 3.6 in Section 3 are to be found in the classics of A. Grothendieck [52], R.G. Bartle, N. Dunford and J.T Schwartz [8] and A. Pelczynski [70,71 ]. We can think of no better advice to the students of the Riesz Theorem than to study these papers closely. The observation about weakly compact operators and arithmetic means was made by J. Diestel and C.J. Seifert [32]. R. Anantharaman and J. Diestel [2] were able to improve this result by dealing directly with the range of a countably additive vector measure. Tracking the origins of Theorems 3.7 and 3.8 is a bit touchy, in large part because we've taken a vector measure theoretic stance. However, the results are pretty enough that there's plenty of credit to share so we mention L. Schwartz [92], J. Gil de Lamadrid [49,50], A. Tong [100] and J. Diestel [28]; the book [33] gives these results careful discussion. Theorem 3.9 appears already in Pietsch's original paper. It is a good example of measure theory in action. Thanks to Pietsch's Domination Theorem, tools like Cauchy-Schwartz and the Holder inequalities are made available for direct use in operator theory. It is so (and somewhat more complicated to show) that if и : X -> ? is /^-summing and ? : ? -> ? is <7-summing where ? and q are conjugate indices between 1 and oo then vu is 1 -summing. Our parting comments about the composition of weakly compact and integral operators and their relevance to approximation properties goes back to A. Grothendieck [54]. For a modern read on these issues the paper [65] is highly recommended.
440 J. Diestel and J. Swart 5.4. Notes and remarks to Section 4 Theorem 4.1 is due to Ivan Singer [93]. [56] should also be consulted. Our comment following Theorem 4.1 regarding C{K)'s that contain complemented copies of со is due to Pilar Cembranos [17] who also noted that Cx(K) cannot be a Grothendieck space if К is infinite and X is an infinite dimensional reflexive Banach space. The stunning Theorem 4.2 was discovered independently by Cembranos [18] and Francisco Freniche [46]. Their result soon lead to a spate of papers with interesting variations, twists and turns on the same basic theme. Cembranos, Nigel Kalton and the Saabs [19] showed that the injective tensor product of any two infinite dimensional Banach spaces contains a complemented copy of со once one coordinate space contains a copy of со. G. Emmannuelle [45] looked at the Lebesgue-Bochner spaces, Paddy Dowling [42] considered the Hardy-Bochner spaces and vector-valued spaces of harmonic functions and Diaz, Fernandez, Florencio and Paul [27] looked at spaces of Pettis integrable functions. In each case that a complemented copy of со emerges in the function space/tensor product after just appearing in one or another 'coordinate', it comes about as a result of the following phenomenon: anytime a Banach space contains an unlimited copy of the unit coordinate vector basis of со, it contains a complemented copy of со. Unlimited? Well, a set К in a Banach space X is limited if any weak* null sequence in X* goes to zero uniformly on К; К is unlimited if К is not limited. If (x„) is an unlimited unit vector basis of со in X then there's a weak* null sequence (x*) in X*. an ?? > 0 and a subsequence (*„.„) of (x„) so that xm (x'ht< ) > ? for all m; if we define и: X -> c(> by u(x) = (x*n(x)) then the series ?,? ?">„ satisfies ?», \x*(x>h„)\ < oo for each x* e X* (?,,,.?,,,,, is a 'wuC') yet since \\ux„m\\ > I*m (¦*"»,) I > ?< Hmux"m ls not unconditionally convergent. We can apply the Bessaga and Petczynski [9] result alluded to above to conclude that there is a subsequence (vk) of (x,i,„) such that (f/t) and (u(vk)) are each equivalent to the unit vector basis of со and m's behavior on the closed linear span [и* ] of the и* 's is that of an isomorphism. If we look at ? = u([vk]) we (should) see an isomorph of со inside со; but со is 'separably injective' and so there is a bounded linear projection ?: со -> со with Pco = ?- The operator Q : X —> X defined by is a bounded linear projection on X with QX = [v„], an isomorph of со. A space related to Cx(K) that's found application in the study of the invariant subspace problem is the space WCx(K) of weakly continuous functions from К to X equipped with the supremum norm. Though there is no known case in which WCx(K)* has been described (unless К is finite and dimX is, too) it has been shown [3] that Cx(K)* is complemented in WCx(K)*. This follows from the somewhat surprising fact that each member ofWCx(K) is integrable with respect to each member ofCx(K)*. Theorem 4.5 is due to N. Dinculeanu [34].
The Riesz Theorem 441 Theorem 4.6 comes from the paper of Jim Brooks and Paul Lewis [16] which contains much of interest with regards to to the Dinculeanu problem. The paper of Lewis and Ochoa [64] has a refreshing new look - hypotheses on a Banach space lead to conclusions of a special and gratifying nature. Go figure. The papers of Ulger [101] and Diestel, Ruess and Schachermayer [31] 'settle' the problem of relative weak compactness in the Lebesgue-Bochner spaces Vx for 1 ^ ? < oo. Regarding weak compactness in C\{K)*, only the paper of N. Randrianantoanina and E. Saab [83] contains any real progress. This being so, [83] is a paper that should be closely studied. None of the papers that gave birth to Theorem 4.8 are 'easy reading'. Bourgain [13] uses some techniques rooted in Riesz products; Contreras and Diaz [24] and Narcisse Randrianantoanina [82] each need random versions of a nearest point technique that're non-trivial and Talagrand's [98] is typical Talagrand - interesting but hard. Theorem 4.9 is from [81]. Theorem 4.10 is found in [97], Theorem 4.11 in [89] and Theorem 4.12 in [90]. The paper of Retherford and Stegall [94] contains a number of characterizations of ?? -spaces; even when applied to C(K )-spaces there's much to be learned and enjoyed from Retherford and Stegall. 5.5. Notes and remarks on tensor products The Riesz Theorem takes on different guises when describing the dual (C(K) <8>„ X)* for tensor norms a that're stronger than the injective norm. The most important and best known of these tensor norms is the projective tensor norm 'л' (often called '?'). Here if и is a typical member of C(K) <g> X then l"L = infj]P||/,'||ocll-*/l|: и = ]Г/, ®Xi and C(K)®X is the completion of (C(tf)® X, I U). It is known that (C(K)®X)* is the space B(C(K), X) of continuous bilinear functional on C(K) ? ?; in linear forms, (C(K)®X)* = C(C(K);X*) and so can rely on the Bartle-Dunford-Schwartzframework to deal with (C(K)®X)*. The result is that members of (C(K)®X)* are bounded, additive, weak* countably additive measures from the Borel ?-algebra of К to X*. Should X* contain no copy of со, then each such measure is, in fact, countably additive and regular in the norm topology of X*. So life isn't so bad. Now a most interesting case is when X is itself a C(?)-space and we're looking at (C(K\)®C(K2))*. In this case, C(K2)* is known to be weakly sequentially complete - it's, of course, the space of regular Borel measures on ??-??? so does not contain an isomorphic copy of cq. It follows that any и* е (С(К\)®С(.Кг))* corresponds to a (weakly
442 J. Diestel and J. Snarl compact) linear operator и : C(K\) —» C(/o)*. which in turn corresponds to a countably additive regular Borel measure Fu acting on the Borel subsets of ? \, and with values in C(K2)* ¦ If we let B\ and Bi be Borel sets in K\ and Kz, respectively, then F.,(Bi)(xb2) makes sense: a mapping from the Cartesian product of the collection Bo(K \) and Bo(Ki) of Borel subsets of ? ? and K2 is borne. This mapping, call it ?, acts on B0(K\) * B0(K2) via M(S,,S2)=F„(S|)(xb:). and for fixed B\ is countably additive and regular in Bi and vice-versa. It is a bimeasure. Bimeasures are objects worthy of close scrutiny and close scrutiny has been paid them. In a series of papers, I. Dobrakov [35-41] and his collaborators detailed the theory of integration with respect to bimeasures and multimeasures, even if they're vector-valued. Much remains to do and this is in large pact due to the role spaces like C(?|)<g)C(/o) (and, more generally, C(K\ )<g> · · · ®C(Kn)) play in affairs of harmonic analysis or rather harmonic synthesis. A word or two about this last comment. Part of the charm of the projective tensor product ('charm' being a word rarely appearing near the phrase 'tensor product'!) is the fact that the projective tensor product of Banach algebras is a Banach algebra; in case we're dealing with C(^i)<g)C(^2). the Banach algebra is commutative, has an identity and, remarkably enough, a recognizable state space: ? ? ? ?2. This last fact was uncovered by N. Varopoulos who laid bare the deep connections between these 'tensor algebras' and questions in harmonic analysis. Let us whet the appetite of more ambitions readers. Let G be a compact Abelian topological group and ? be its (discrete) dual group. A(G) is the linear space of all / e C(G) for which there's a member F e Ь](Г), the convolution group algebra (= ?' (?) in our present situation), so / = F, the Fourier transform of F. If we norm A(G) by II/IIa@ = ????.?(?) = ll^ll(i(D then A(G) is a Banach space. THEOREM 5.4 (N. Varopoulos). Define the operators ?, ? as follows M:C(G)-> C(G ? G), P:C(G ? G) -+ C(G), (Mf)(x,y):=f(x+y) and (Pf)(z):= f f(z-x.x)dx, Jg where dx is the normalized Haar measure on G.
The Riesz Theorem 443 A) ? is an isometric embedding of A(G) into C(G)®C(G). B) ? takes C(G)®C(G) into A(G) with norm ?. 1 action. C) PoM = idA{G). D) ? ? P:C(G)®C(G) -> C(G)<g>C(G) ? ? norw о«е linear projection of C(G)®C(G) onto A(G). Some words-to-the-wise are in order. First, using the fact that C(K) has the approximation property of Grothendieck, the inclusion C(G)<g>C(G) <-+ C(G)®C(G) is one to one; as a consequence members of C(G)®C(G) may be viewed as members of C(G)<g)C(G), which we're already seen is CciG)(G) or C(G ? G). This might clarify some of the above theorem's claims and bona-fides. Next, for members of C(G)®C(G) we can compute the norm as follows: if / e C(G)<g)C(G), then ' II ' where ?„ II/и II ос II gn Hoc < oo, and и = ?? f» ® gn- This is a famous characterization of the norm in the completed projective tensor norm. The proof of this theorem is a sequence of direct computations. Remark 5.5. If the statement of the above theorem warms the cockles of your mathematical soul, then you can't go wrong reading the essays of the stimulating monograph of С.С Graham and O. McGehee [51], wherein the proof of the above theorem and much more is given complete and clear exposition. Sadly we are neither here to discuss the many wonders of spaces like C(K\)®C(Ki) (which are called Varopoulos algebras and usually denoted by V(?|, ?2)) nor are we qualified to do so. Rather, it's to point out the relationship with the Riesz Theorem through bimeasures and to note that there are a fine gradation of other related spaces of 'measures' between CX(K)* and (C(K)®X)*. In his Resume, Grothendieck isolated a dozen other 'natural' tensor norms ? which, with their associated continuous linear/bilinear forms, fit neatly in the general theory of Banach spaces. In each case, the dual of C(K) <g>„ X is some sort of space of measures; in many cases the precise determination of just which measures arise is not clearly understood. For instance, the precise nature of which measures represent operators (all of which are weakly compact) from a C{K) to an abstract L-space is unknown. Similarly, the determination of which operators from C{K) to X have absolutely summing adjoints is open.
444 J. Diestel and J. Swart References [1] E.M. Alfsen. Compact Convex Sets and Boundary Integrals. Ergebn. Math. Grenzgeb.. B. 57. Springer. New York A971). [2] R. Anantharaman and J. Diestel. Sequences in the range of a vector measure. Comment. Math. Prace Mat. 30B) A991) 221-235. [3] J. Arias de Reyna, J. Diestel. V. Lomonosov and L. Rodriguez-Piazza. Some obsenations about the space of weakly continuous functions from a compact space into a Banach space, Quaestiones Math. 15 D) A992L15^425. [4] S. Banach. Theorie des Operations Lineaires. Monografje Matematyczne. Vol. 1 A932). [5] S. Banach, The Lebesgue integral in abstract spaces. Note II in S. Saks' "Theory of the Integral", Dover Publications A964). [6] S. Banach and S. Mazur, Zur Theorie der linearen Dimension. Studia Math. 4 A933) 100-112. [7] C. Bandt, Metric invariance ofHaar measure, Proc. Amer. Math. Soc. 87 A) A983) 65-69. [8] R.G. Bartle, N. Dunford and J. Schwartz. Weak compactness and vector measures, Canad. J. Math. 7 A955J89-305. [9] C. Bessaga and A. Petczynski, On bases and unconditional convergence of series in Banach spaces, Studia Math. 17A958) 151-164. [10] С Bessaga and A. Petczynski, On extreme points in separable conjugate spaces, Israel J. Math. 4 A966) 262-264. [11] E. Bishop and K. de Leeuw, The representations of linear functionals by measures on sets of extreme points, Ann. Inst. Fourier Grenoble 9 A959) 305-331. [12] FF. Bonsall, On the representation of points of a convex set. J. London Math. Soc. 38 A963) 332-334. [13] J. Bourgain, On the Dunford-Pettis property. Proc. Amer Math. Soc. 81 B) A981) 265-272. [14] J. Bourgain and M. Taiagrand, Compacite extremale. Proc. Amer Math. Soc. 80 A) A980) 68-70. [15] R.D. Bourgin, Geometric Aspects of Convex Sets with the Radon-Nikodym Property. Springer, Berlin A983). [16] J.K. Brooks and P.W. Lewis, Linear operators and vector measures. II. Math. Z. 144 A) A975) 45-53. [17] P. Cembranos, Una caracterizacion de cuando C(K, E) es un espacio de Grothendieck para espacios ? reflexives, Actas de las VIII Jomades Luso Espanholas Mathematica, Univ de Coimbra A979-1982). [18] P. Cembranos, C(K, E) contains a complemented copy ofcu, Proc. Amer. Math. Soc. 91 D) A984) 556- 558. [19] P. Cembranos, N.J. Kalton, E. Saab and P. Saab, Pelczynskis property (V) on C(i2, E) spaces. Math. Ann. 271A) A985) 91-97. [20] G. Choquet, Lectures on Analysis. Vol. I: Integration and Topological Vector Spaces, W. A. Benjamin, New York A969). [21] G. Choquet, Lectures on Analysis. Vol. II: Representation Theory. W.A. Benjamin, New York A969). [22] G. Choquet, Lectures on Analysis. Vol. Ill: Infinite Dimensional Measures and Problem Solutions, W.A. Benjamin, New York A969). [23] G. Choquet and P.-A. Meyer, Existence et unicite des representations inte'grales dans les convexes compacts quelconques, Ann. Inst. Fourier Grenoble 13 A963) 139-154. [24] M.D. Contreras and S. Diaz, C(K. A) and C(K. H^) have the Dimford-Pettis property. Proc. Amer. Math. Soc. 124 A1) A996) 3413-3416. [25] L. de Branges, The Stone-Weierstrass theorem. Proc. Amer. Math. Soc. 10 A959) 822-824. [26] A. Defant and K. Floret, Tensor Norms and Operator Ideals. North-Holland, Amsterdam A993). [27] S. Diaz, A. Fernandez, M. Florencio and P.J. Paul, Complemented copies of cq in the space of Pettis integrable functions, Quaestiones Math. 16 A) A993) 61-66. [28] J. Diestel, The Radon-Nikodym property and the coincidence of integral and nuclear operators. Rev. Roumaine Math. Pures Appl. 17 A972) 1611-1620. [29] J. Diestel, J.H. Fourie and J. Swart, The metric theory of tensor products, Grothendieck s Resume revisited B001), in preparation. [30] J. Diestel, H. Jarchow and A. Tonge. Absolutely Summing Operators, Cambridge University Press. Cambridge A995).
The Riesz Theorem 445 [31] J. Diestel, W.M. Ruess and W. Schachermayer, On weak compactness in ?.'(?, X). Proc. Amer. Math. Soc. 118 B) A993) 447^453. [32] J. Diestel and C.J. Seifert, The Banach-Saks ideal. I Operators acting on C(Q), Comment. Math. 1 A978) 109-118. (Special issue dedicated to Wtadystaw Orlicz on the occasion of his seventy-fifth birthday.) [33] J. Diestel and J.J. Uhl, Jr., Vector Measures, Math. Surveys. No. 15, Amer. Math. Soc., Providence, RI A977). (With a foreword by B.J. Pettis.) [34] N. Dinculeanu, Vector Measures, Pergamon Press. Oxford A967). [35] I. Dobrakov, On integration in Banach spaces. VIII. Polymeasures, Czechoslovak Math. J. 37A12) C) A987L87-506. [36] I. Dobrakov, On extension of vector polymeasures. Czechoslovak Math. J. 38A13) A) A988) 88-94. [37] I. Dobrakov, On integration in Banach spaces. IX, X. Integration with respect to polymeasures, Czechoslovak Math. J. 38A13) D) A988) 589-601. 713-725. [38] I. Dobrakov, On integration in Banach spaces. XI. Integration with respect to polymeasures, Czechoslovak Math. J. 40A15) A) A990) 8-24. [39] I. Dobrakov, On integration in Banach spaces. XII. Integration with respect to polymeasures, Czechoslovak Math. J. 40A15) C) A990) 424-440. [40] I. Dobrakov. On integration in Banach spaces. XIII. Integration with respect to polymeasures, Czechoslovak Math. J. 40A15) D) A990) 566-582. [41] I. Dobrakov, On extension of vector polymeasures. 11. Math. Slovaca 45 D) A995) 377-380. [42] P.N. Dowling, Complemented copies ofc() in vector valued Hardy spaces, Proc. Amer. Math. Soc. 107 A) A989J51-254. [43] G.A. Edgar, A noncompact Choquet theorem, Proc. Amer. Math. Soc. 49 A975) 354-358. [44] J. Elton, Extremely weakly unconditionally convergent series. Israel J. Math. 40 C^4) A982) 255-258. [45] G. Emmanuele. On complemented copies o/c(> in U'x. 1 < ? < ос. Proc. Amer. Math. Soc. 104 C) A988) 785-786. [46] EJ. Freniche, Barrelledness of the space of vector valued and simple functions. Math. Ann. 267 D) A984) 479^486. [47] T.W. Gamelin. Uniform Algebras. Prentice-Hall. Englewood Cliffs. NJ A969). [48] D.J.H. Garling and Y. Gordon, Relations between some constants associated with finite dimensional Banach spaces. Israel J. Math. 9 A971) 346-361. [49] J. Gil de Lamadrid, Measures and tensors. Trans. Amer. Math. Soc. 114 A965) 98-121. [50] J. Gil de Lamadrid, Measures and tensorv II. Canad. J. Math. 18 A966) 762-793. [51] C.C. Graham and O.C. McGehee. Essays in Commutative Harmonic Analysis. Springer, New York A979). [52] A. Grothendieck, Siirles applications lineaires faiblement compactes d'espaces du type C(K). Canadian J. Math. 5A953) 129-173. [53] A. Grothendieck, Resume de la theorie metrique desproduits tensoriels topologiqnes, Bol. Soc. Mat. Sao Paulo 8 A953/1956) 1-79. [54] A. Grothendieck, Produits tensoriels topologiqnes et espaces nucle'aires. Mem. Amer. Math. Soc.. No. 16 A955) 140. [55] J. Hagler and W.B. Johnson, On Banach spaces whose dual balls are not weak* sequentially compact, Israel J. Math. 28 D) A977) 325-330. [56] W. Hensgen, A simple proof of Singer's representation theorem. Proc. Amer. Math. Soc. 124 A0) A996) 3211-3212. [57] B. Josefson, Weak sequential convergence in the dual of a Banach space does not imply norm convergence, Ark. Mat. 13A975O9-89. [58] M.I. Kadec' and M.G. Snobar, Certain functional s on the Minkowski compactum. Mat. Zametki 10A971) 453^457. [59] S. Kakutani, Concrete representation of abstract [D-spaces and the mean ergodic theorem. Ann. ot Math. BL2A941M23-537. [60] S. Kakutani, Concrete representation of abstract (M)-spaces. (A characterization of the space of continuous functions). Ann. of Math. B) 42 A941) 994-1024. [61] S.S. Khurana. Barycenters. extreme points, and strongly extreme points. Math. Ann. 198 A972) 81-84. [62] S.S. Khurana. Barycenters, pinnacle points, and denting points. Trans. Amer. Math. Soc. 180 A973) 497- 503.
446 J. Diestel and J. Swart [63] H. Konig, Eigenvalue Distribution of Compact Operators. Birkhauser, Basel A986). [64] P.W. Lewis and J.P. Ochoa. The range of a representing measure. Math. Proc. Cambridge Philos. Soc. 124 B) A998) 365-369. [65] A. Lima, O. Nygaard and E. Oja. Isometric factorization of weakly compact operators and the approximation property. Israel J. Math. 119 B000) 325-348. [66] V. Lomonosov, A counterexample to the Bishop-Phelps theorem in complex spaces. Israel J. Math. 115 B000) 25-28. [67] V. Lomonosov, The Bishop-Phelps theorem fails for uniform non-selfadjoint dual operator algebras, to appear. [68] B. Maurey, The'oremes de factorisation pour les ope'rateurs line'aires a valeurs dans les espaces L1'. Asterisque, Vol. 11. Societe Mathematique de France, Paris A974). (With an English summary.) [69] A. Nissenzweig, W* sequential convergence, Israel J. Math. 22 C^4) A975) 266-272. [70] A. Petczynski, Projections in certain Banach spates. Studia Math. 19 A960) 209-228. [71] A. Petczynski, Banach spaces on which every unconditionally converging operator is weakly compact. Bull. Acad. Polon. Sci. Ser. Sci. Math. Astronom. Phys. 10 A962) 641-648. [72] A. Petczynski. Banach Spaces of Analytic Functions and Absolutely Summing Operators. CBMS Regional Conf. Series in Math.. Vol. 30, Amer. Math. Soc, Providence, RI A977). [73] R.R. Phelps, Lectures on Choquet's Theorem. D. Van Nostrand, Princeton A966). [74] A. Pietsch. Absolut p-summierende AbbiUhmgen in normierten Raumen. Studia Math. 28 A966/1967) 333-353. [75] A. Pietsch, Eigenvalues and s-Numbers. Cambridge University Press. Cambridge A987). [76] A. Pietsch and J. Wenzel, Orthonormal Systems and Banach Space Geometry. Cambridge University Press, Cambridge A998). [77] G. Pisier, Factorization of Linear Operators and Geometry of Banach Spaces. Conference Board of the Mathematical Sciences. Washington, DC A986). [78] G. Pisier, The Volume of Convex Bodies and Banach Space Geometry. Cambridge University Press, Cambridge A989). [79] J. Radon, Theorie und Anwendungen der absolut additiven Mengenfunktionen. Sitzungsber. Akad. Wiss. Wien 122A913) 1295-1438. [80] J. Rainwater. Weak convergence of bounded sequences. Proc. Amer. Math. Soc. 14 A963) 999. [81] N. Randrianantoanina. Pelczynski's property (V) on spaces of vector-valued functions. Colloq. Math. 71 A) A996) 63-78. [82] N. Randrianantoanina, Some remarks on the Dunford-Pettis property. Rocky Mountain J. Math. 27 D) A997I199-1213. [83] N. Randrianantoanina and E. Saab, Weak compactness in the space of vector-valued measures of bounded variation. Rocky Mountain J. Math. 24 B) A994) 681-688. [84] F. Riesz, Sur les operationes fontionelles lineaires, C. R. Acad. Sci. Paris 149A909)974-977. [85] F Riesz. Sur certains systemes singuliers d'e'quations integrates. Ann. Ecole Norm. Sup. C) 28 A911) 33-62. [86] F. Riesz, Demonstration nouvelle dun theoreme concernment les operationes fonctionelles lineares. Ann. Ecole Norm. Sup. C) 31 A914). [87] H.P. Rosenthal. A characterization of Banach spaces containing /'. Proc. Nat. Acad. Sci. USA 71 A974) 2411-2413. [88] R. Ryan. An introduction to tensor products of Banach spaces, to appear. [89] P. Saab. Integral operators on spaces of continuous vector-valued functions. Proc. Amer. Math. Soc. Ill D) A991) 1003-1013. [90] P. Saab and B. Smith, Nuclear operators on spaces of continuous vector-valued functions. Glasgow Math. J. 33B)A991J23-230. [91] S. Saks. Integration in abstract metric spaces. Duke Math. J. 4 A938) 408^411. [92] L. Schwartz, Produits tensoriels topologiques. d'espaces vectoriels topologiques. Espaces vectoriels topologiques nucleaires. Applications. Seminaire Schwartz 1953-54. Faculte des Sciences de Paris A954). [93] I. Singer. Linear functionals on the space of continuous mappings of a compact Hausdorff space into a Banach space. Rev. Math. Pures Appl. 2 A957) 301-315.
The Riesz Theorem 447 [94] C.P. Stegall and J. R. Retherford. Fully nuclear and completely nuclear operators with applications to L\- and Loc-spaces. Trans. Amer. Math. Soc. 163 A972) 457^492. [95] R.C. Steinlage, On Haar measure in locally compact ?? spaces, Amer. J. Math. 97 A975) 291-307. [96] E.L. Stout, The Theory of Uniform Algebras, Bogden & Quigley, Inc.. Tarrytown-on-Hudson, NY A971). [97] С Swartz. Absolutely summing and dominated operators on spaces of vector-valued continuous functions. Trans. Amer. Math. Soc. 179 A973) 123-131. [98] M. Talagrand, La propriete de Dunford-Pettis dans C(K. E) et Ll(E), Israel J. Math. 44 D) A983) 317-321. [99] N. Tomczak-Jaegenmann, Banach-Maiur Distances and Finite-Dimensional Operator Ideals. Longman Scientific & Technical, Harlow A989). [100] A.E. Jong, Nuclear mappings onC(X). Math. Ann. 194A971) 213-224. [101] A.Ulger, Weak compactness in Li(?,X), Proc. Amer. Math. Soc. 113A)A991) 143-149. [102] P. Wojtaszczyk, Banach Spaces for Analysts. Cambridge University Press. Cambridge A991).
CHAPTER 10 Stochastic Processes and Stochastic Integration in Banach Spaces James K. Brooks University of Florida. Gainesville. FL 32611-2082. USA Contents Introduction , 451 1. Stochastic integration in Banach spaces , 451 Introduction 451 1.1. Notation and definitions , . . . , , 452 1.2. The stochastic measure , 453 1.3. The summability question , 454 1.4. The stochastic integral 458 1.5. Closures of subsets in 7fC(J[) 461 1.6. Completeness of the space L^ c(x) 4f>2 1.7. Stopping the stochastic integral 464 1.8. Convergence theorems in L^- G(X) 466 1.9. Weak completeness and compactness of L^ C(B. X) 467 1.10. Local summability and local integrability 468 I.I I. Ito's formula 469 1.12. Further extensions of the stochastic integral 477 2. Regularity and the Doob-Meyer decomposition of abstract quasimartingales 478 Introduction 478 2.1. Notation 479 2.2. The Doleans function 480 2.3. The mean variation of X and quasimartingales 480 2.4. Regularity of quasimartingales 481 2.5. Cadlag modification without RNP 493 2.6. The Doob-Meyer decomposition of quasimartingales 496 Appendix A 497 A.l. The Doleans function 497 A.2. Mean variation 498 A.3. Quasimartingales 499 A.4. Right continuous quasimartingales 500 References 501 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 449
This page intentionally left blank
Stochastic processes and stochastic integration in Banach spaces 451 Introduction In this chapter, the following two topics are presented; A) stochastic integration in Banach spaces; B) regularity of Banach valued processes and the Doob-Meyer decomposition of these processes. In Section 1, the stochastic integral in Banach spaces is developed, with the aid of a vector bilinear integral. Convergence theorems are given and applied to establish Ito's formula - the essential tool used in stochastic calculus. Section 2 deals with regularity problems. It is of the upmost importance to establish when a process (Yf)t^o is right continuous. For example, if this regularity condition is absent, we have no guarantee that Yj is measurable, when ? is a stopping time - a disastrous state of affairs! Moreover, as we shall see, regularity is equivalent to the Doob-Meyer decomposition of the process, a very useful property. We show that essentially a process is regular if and only if it is weakly regular. To establish these results, quasimartingale theory (in Banach spaces) is needed; some of the background is given in the Appendix of Section 2, The material in this chapter is based on a series of papers by Brooks and Dinculeanu (see references). If the reader is familiar with classical stochastic analysis, this chapter is a quick tour of the machinery used in the infinite-dimensional case. For the reader lacking this background, the presentation can serve as a (very!) brief introduction to some aspects of stochastic analysis - just assume that all Banach spaces are the scalar field, but even in this case the stochastic measure is infinite dimensional, so a (hopefully enjoyable) detour through vector bilinear integration theory is necessary. We recommend, for the first account of Banach space processes. Metivier's impressive book A982), but be warned that at that time even the regularity of Banach valued martingales was not known. (It was proved in Brooks and Dinculeanu A987).) In Dellacherie and Meyer A978, p. 65), the existence theorems for regular versions of real-valued supermartingales are regarded as "possibly the most important single result in the whole theory of continuous parameter stochastic processes". 1. Stochastic integration in Banach spaces Introduction Pellaumail A973) had the wonderful idea of representing a stochastic integral /HdX as genuine integral J ? dm, where m is a "stochastic measure" on the ?-algebra of predictable subsets of R+ ? ??, where (?. ?. ?) is the underlying probability space. The measure m is L' -valued, that is. it is an infinite-dimensional vector measure. Due to the lack of a satisfactory integration theory, his goal could not be completed - even the existence of a cadlag modification of the process (j'0 ? dX), was not established. Kussmaul A977), using Pellaumail's approach, was able to define a measure-theoretic stochastic integral for real valued processes. Various definitions of a stochastic integral have been given (Yor, 1973. 1974; Kussmaul. 1978; Gravereaux and Pellaumail, 1974; Metivier, 1982; Metivier and Pellaumail, 1980; Pratelli, 1988), however either the Banach
452 J. K. Brooks spaces were too restrictive, or the construction did not yield the convergence theorems necessary for a full development of stochastic integration theory. A general theory of the stochastic integral in Banach spaces was developed in Brooks and Dinculeanu A991), using the bilinear vector integration (Brooks and Dinculeanu, 1976) with respect to a Banach valued measure with finite semivariation. This theory seems to be tailor-made for applications to the stochastic integral. Let W:R+xi2-> F be a predictable process, and assume Х:Е+хй-> ? с L(F. G) is a cadlag, adapted process, with X, e LE, where F, G and ? are Banach spaces. The stochastic integral fQ ?dX is defined to be j \[Q.t\Hdlx, where ?? is a stochastic measure on V, the predictable ?-algebra, with values in L[E с L(F. LlG). We call X summable if ??, defined on the ring ??, generated by the predictable rectangles, can be extended to a ?-additive measure on V, and the extension has finite semivariation. The space of ? ? -integrable processes is a Lebesgue-type space with a seminorm, and the space is complete. Unlike classical Lebesgue spaces, the simple processes are not necessarily dense; this causes considerable difficulty since most properties of the integral are usually proved first for simple functions and then extended by continuity. To overcome this difficulty, a Lebesgue-type theorem is proved, which ensures the convergence of the integrals (rather then convergence in the Lebesgue space itself). The Vitali and Lebesgue convergence theorems are established; weak compactness criteria and weak convergence theorems for stochastic integrals are presented, and are new even for the scalar case. The reader is referred to Brooks and Dinculeanu A991) for a quick treatment of the bilinear integration theory. The book Dinculeanu B000) is for the most part based on the works of Brooks and Dinculeanu, and covers among other topics the bilinear integration theory, stochastic integration in Banach spaces, regularity of processes, strong additivity, weak compactness, Ito's formula, etc. - see the references to the joint work listed in this chapter. 1.1. Notation and definitions The spaces E, F, G will denote Banach spaces. The dual space of ? is denoted by ?". We write ? с L(F, G) to mean that ? is isometrically embedded in L(F, G). the bounded linear operators from F to G. Let (?, ?, P) be a probability space. LPE = L''(i2. ?. P), 1 < ? < oo, is the space of .^-measurable, ?-valued functions such that / \f\p dP < oo. If ? = oo, L^ denotes the space of ?-valued essentially bounded .^-measurable functions. We say that (Et)t-^Q, an increasing family of sub-?-algebras of ?, is a filtration. Set ?,+ to be f][Es: s > t}; the family is right continuous if ?,+ = ?, for each t. A process O't)t^o is adapted to (?,) if Y, is XV-measurable for each t. We say that the filtration satisfies the usual conditions if it is complete, that is (?. ?. ?) is complete and all the P-negligible sets belong to ??, and if the filtration is right continuous. We shall always assume that the filtration satisfies the usual conditions. A process (Yt) is said to be cadlag if it is right continuous on [0, oof with finite left hand limits on ]0, oof. Let (Xt) and (Yt) be two processes with values in the same space. We say that (Yt) is a modification of (Xt) if Xt = Yt a.s. for each t ^ 0. We say (Xt) and (Yt) are indistinguishable if
Stochastic processes and stochastic integration in Banach spaces 453 for almost all ? e ?, Xt(o>) = Yt(a>). If А С ?* ? ? (R+ = [0, oof) and the process Aд)ь t ^ 0 is indistinguishable from the zero process 0, we call A evanescent - that means that the projection of A on ? is contained in a P-negligible set. Thus (Xt) and (Yt) are indistinguishable if and only if the set {(t. ?): Xt(co) ? Yt{u>)) is evanescent. We shall always assume in Section 1 that (X,) is a process; X :R+ ? ? —* ? с L(F, G) and X is cadlag, adapted, with X, e i,fp for each t, where 1 < ? < oo. The terminology of Dellacherie and Meyer A978) will be used. We shall denote by ?? the ring of subsets of R+ ? ? generated by the predictable rectangles [Од], with A e Eq and ]s, t] ? A, with 0 < s < t < oo and A e Х\. The ? -algebra V of predictable sets is generated by TZ. 1.2. The stochastic measure Now we shall define the crucial concept used in the construction of the stochastic integral. The stochastic measure Ix:Tl-> LPE. where ? С L(F.G) and X : E+ ? ? -> ? is first defined on the predictable rectangles by Ix([0A]) =lAX0 and /x(]i.r]xA) = lA(X,-XJ. and then we extend it in an additive fashion to ??. Frequently we write / in place of ? ?. We consider LPE с L(F, L?) and we define the semivariation of / with respect to (F, L?), denoted by If.g, on R by If.g(A) = sup\^2 I (Ai)xj for A e R. where the supremum is extended over all finite families of .v, e F\ and disjoint sets A, from ?? contained in A. If / can be extended to V, extend the semivariation to V in an analogous fashion. We say that / has finite semivariation on TZ if it is finite on each set AelZ. Definition 1.1. We say that X is /?-summable relative to (F. G) if ?? has a ?-additive L^-valued extension, which will be unique, to V, and ?? has finite semivariation on V relative to (F, Lp). If ? = 1, simply say X is summable relative to (F.G). If we regard ? = L(R, E), and X is p-summable relative to (E. E). we say that X is p-summable. Observe that X is p-summable if and only if ?? has a ?-additive extension to V, since in this case ? ? in bounded in L? on V, which implies bounded semivariation relative to (R, L?). If 1 </>'</>< oc, and if X is /?-summable relative to (F. G), then it is p'-summable relative to (F.G). In particular, p-summable relative to (F, G) implies summable relative to (F. G), hence most results proved for summable processes remain valid for p-summable processes. If X is p-summable relative to (F,G), then X is summable relative to (R, E). If X is a process with integrable variation, then X is p-summable relative to any pair (F,G) such that ? с L(F,G). If ? and G are
454 J. K. Brooks Hubert spaces, then any square integrable martingale X with values in ? с L(F.G) is 2-summable relative to (F, G). Note that X is p-summable relative to (F. G) if and only if ? ? has a ? -additive extension to V and /? has bounded semivariation on H. rather than on V. Hence the problem of summability reduces to a great extent to that of obtaining a ? -additive extension of ? ? from ?? to 73. Once the summability of X is assured, we can apply the theory of vector bilinear integration theory (Brooks and Dinculeanu, 1976) and define the stochastic integral as the bilinear integral with respect to vector measure ? ?. Suppose now ? ? has been extended to V. It is convenient to extend it further to sets of the form {00} ? A where A e ??. by setting /({00} ? A) = 0. Then V U ({00} ? ?^) is the ?-algebra ^[0. 00] of predictable subsets ofR+ ?? on which / is still ?-additive, and I(]S, T]) has the same value whether (S. T] is regarded as a subset of R+ ? ? or R+ ? ?. Similar considerations hold for other types of predictable stochastic intervals, and in particular for I{[T}) if ? is a predictable stopping time. The following theorem extends the computation of / from predictable rectangles to stochastic intervals. THEOREM 1.2. Assume that X is p-summable relative to (F.G) and regard Ix as the unique extension of ? ? to V. Then (a) There is a random variable, denoted by Xoc· belonging to Lh-, such that ???,-эсХ, = Хэс in LP. and Ix(]t.oo[ ? A) = l.A(X^ - X,) for A e ?,. If X has a pointwise left limit X~^ then X-*.- = X^ a.s. Consider now X extended at 00, by a representative ofX-^, and define X^- to be ??. (b) For any stopping time T, we have ?? e Lp and ??([0. ?]) = Хт- (c) If T is a predictable stopping time, then Xj- e Lpy and ??([0. ?)) = ??- and ??([?]) = ???. (d) If S < 7 are stopping times, then Ix((S. T]) = Xj - X$. If S is predictable, then IX([S, T]) = XT- Xs-· If both are predictable then IX([S. T) = XT- - X.s- 1.3. The summability question Obviously the main question is given X :R+ ? ? -> ?, when is X /?-summable? The answer to this question in the case ? ^> cq is unexpected: the mere boundedness of ?? on 1Z implies that X is p-summable relative to (R. E) and bounded semivariation on Tl, relative to (F, L^) implies that X is /?-summable relative to (F. G). We state this main result below. THEOREM 1.3. Assume that ? ^> c(>. If ?? is bounded on ??, then X is p-summable relative to (R, E). If Ix has bounded semivariation on ??, relative to (F, L?), then X is p-summable relative to (F,G). COROLLARY 1.4. Assume X is real valued and regardR С L(F. F). Then X is summable relative to (F, F) if and only if ?? has bounded semivariation on 1Z relative to (F. LP). What are the steps leading to a ?-additive extension of ? ? by just requiring boundedness. if ? 2$ со? The first step is to establish a new extension theorem for vector measures
Stochastic processes and stochastic integration in Banach spaces 455 m : ?? -> V requiring a very weak hypothesis, that is, an extension theorem requiring only ?-additivity of zm, for ? belonging to a norming subset of V'. where V is Banach space. This is necessary since we do not have a representation of (L^)'. but we can work with the norming set of simple G'-valued functions. The extension is the following theorem (Brooks and Dinculeanu, 1991). THEOREM 1.5. Let m:TZ—*V be finitely additive, where V is a Banach space. Suppose that ? С V is norming for V. Then conditions (a), (b) and (c) are equivalent. ?? is a ring of subsets. (a) m is strongly additive on ?? and for each ? e Z. zm is ?-additive on 1Z. (b) m is strongly additive and ?-additive on TZ. (c) m can be extended to ? ?-additive measure on ?(??) which we denote by ?. (d) Assume V ~fi cq. Ifm is bounded on ?? and if zm is ?-additive on IZfor each ;e Z, then m can be extended uniquely to ? ?-additive V-valued measure on ?. (e) Assume V 2$ cq. Let (?, ?. ?) be a measure space and let m :TZ —*¦ ??(?) be finitely additive and bounded. For each ? e Z, define the measure zm : K-> ?/(?) by (zm)(A) = (m(A), z),for AelZ. If zm is ?-additive on TZfor each ? ? Z, then m can be uniquely extended to a ?-additive measure m \ : ? —> Lv (?). The next step is the following fundamental summability extension theorem (Brooks and Dinculeanu, 1991). THEOREM 1.6 (Summability extension theorem). If ? ^> со. then thefollowtng assertions (l)-F) are equivalent. If ? is any general Banach space, then assertions B)-F) are equivalent and A) implies B). A) ?? can be extended to a ?-additive measure on V. B) ?? is bounded on ??, the ring generated by the predictable rectangles in R+ ? ?. Let ? С LqE, be any closed norming sub space for LPE, where l/p + l/q = I. For g e Z, let G denote the process defined b\ G, = E(g/?,). XG is the real valued process «X,,G,)),^>. C) For each g e Z, the real measure (??, g) is bounded on TL. D) For each g e Z, XG is a quasimartingale on ]0, oof. E) For each g e Z, XG is a quasimartingale on ]0, oo]. F) For each g e Z, the measure (??. g) is ? additive and bounded on 1Z. PROOF (Sketch). Some of the quasimartingale theory used here is given in Section 2 infra. We shall proceed in the following order: A) =>· B) 4» C) O· D) О E) О F)=>A). The implication A) =>· B) is evident since ?-additivity on a ?-algebra implies boundedness. Also B) =>· C) is evident. To prove C) =>¦ B), note that for each AeK, the linear functional g -» (IX(A), g) on ? is continuous. Since ? is norming for LPE, we can embed LE с ?' isometrically. If we assume C), then sup{|(/x(A),g}|: AeTZ) < oo for each g e Z.
456 J. K. Brooks Then by the Banach-Steinhaus theorem we deduce that sup{\lx(A)\p. А еП} < oo. which is B). Now prove C) o· D). Let g e L\, and consider the real measure {Ix,g} on ?? defined by f{lx(A), (Ix,g)(A)= I (lx(A),g)dP forAeft. We shall show (Ix,g) is bounded on 7? if and only if XG is a quasimartingale on ]0, oof. To prove this, we first show that (??^)(?) = ???(?), for A eft, where ??0 is the Doleans function of the process XG. In fact, for В е ?? we have {lx,g)([0B]) = f 1b(Xq, g)dP=f UXoGodP = ???([0?]). For ]s, t] ? ?, where В е Es, we have (Ix,g)((s,t]xB)={lB{(Xt-X>),g)dP = f (X,,Gt)dP- [\xs.Gs)dP Jb Jb = ^xc{\s.t] ? ?). Hence, (Ix, g) is bounded on A]0, oof, the ring of all finite unions of predictable rectangles ]s, t] ? A, where 0 < s < t < oo and A e Es, if and only if ??? is bounded on A]0, oof, which is true if and only if XG is a quasimartingale on @, oo). It follows that 3 o· 4 since TZ=A[0]UA]0,oo[,(ix,g}= ??? on A]0, oof and /? is bounded on A[0], where Д[0] = {0} ? ??. We now show D) <?> E). Obviously E) =>· D). If D) holds, then from B) <?> C) <?> D) proved above, we deduce that ?? is bounded on ??. Thus for g e ? we have |X,G,|, = J \X,G,\dP < |g|,|X,|„ = |g|,|/x([0,r] ? ?)\?, hence sup|X,G,|i <|g|,sup{|/x(A)| : A e ft} < oo. t ' Thus XG is a quasimartingale on [0, oo], which is E).
Stochastic processes and stochastic integration in Banach spaces 457 Next we show E) о F). The implication 6 =>· 5 is evident. Assume E) and let g e Z. Then XG is a quasimartingale on ]0, oo] where (XG)^ = 0 by definition. For each n, define the stopping time T„ = inffr: \X,\ > n). Then T„ / oo and \Xt\ < ? on [0. 7),]. At this stage we do not know if Xjn belongs to l?, but since XG is a quasimartingale on @, oo], we know Xt„Gt„ e L1 and I^GrKnlCH^rn+I^GrJli,^)· Since G is a uniformly integrable martingale, it follows that XT" GT" is a quasimartingale of class (D) on ]0, oo], hence the corresponding measure д(ХС|7„ is ?-additive with bounded variation on A]0, oo] (where A[0, oo] allows ]s, t] ? A, where t < oo), therefore it can be extended to a ?-additive measure with bounded variation on V]0, oo]. Now for each predictable rectangle ]s, t ] ? A, s < t < oo, A e Es, we have ^(Л-СO" (К ']хЛ)= ???(]?, П х А П [0. 7,,]). therefore и(ВIХС)т„ = ???{? П [0. ?,,]) for S e 7>]0. ??]. It follows that ?(?? is ?-additive on the ?-ring V]0, oo] П [0, 7),]; consequently д^с is ?-additive on the ring В = Ui<„oc 7-?0· °°] n №, 7",,]. On the other hand, ??? is bounded on A]0, oof since XG is a quasimartingale on ]0, oo], hence ??? has bounded variation on A]0, oo]. It follows that ??? is ?-additive and has bounded variation on the ring В П A]0, oo], which generates V]0, oo); hence ??? can be extended to a ?-additive measure with bounded variation on V]0, oo]. Since (??. g) = ??? on A]0. oof, it follows that (Ix,g) is bounded on A]0, oo]. Since (Ix.g) is bounded and ?-additive on A[0], it follows that {Ix,g) is bounded and ?-additive on 7? = Д]0. oo]; hence F) holds. To prove F) =>· A), assume ? ^ cq. If we assume F), then (Ix,g) is bounded and ?-additive on ?? for g e Z. By Theorem 1.5, /? can be extended to a ?-additive measure on ? = ?(??), and this is A). D Since Hilbert-space valued stochastic processes are so important, let us turn our attention to this setting and see how the summability theorem can be applied to these processes. When one examines the Hilbert space case it is easy to understand how important the summability theorem is. Without the geometry of Hilbert space, the above theorem allows us to replace the condition that we have a square integrable martingale at our disposal. The summable processes in this theory play the role of the square integrable martingales in the classical theory. It turns out that every Hilbert valued square integrable martingale is summable. One can show by an example that for any infinite-dimensional Banach space E, there is an ? valued summable process which is not even a semimartingale. Suppose ? and G are Hilbert spaces and F is a Banach space such that ? с L(F,G). For example, ? = L(R, ?), ? = L(E. R) or ? с L(G. ? §>ws G), where HS indicates that the Hilbert-Schmidt norm is used on ? <g> G. The main result in this setting is that
458 J.K. Brooks any ?-valued, square integrable martingale ? is 2-summable relative to any embedding ECL(F,G). We say that ? is a square integrable martingale if ? : R+ ? ? —*¦ ?. ?, e LE for each t and sup, \M,\i < oo; in other words, there exists a random variable ?? e LE such that M, = E(Mx\E,),t^0. Our task is to show that Im -V -> ^^- is the ?-additive extension of Im on 7?. having finite semivariationrelative to (F, LG). To show this, we begin with properties of Im . (A) If ? and N are square integrable martingales taking values in E, then if A and В are disjoint sets from 4 and if ?, ? e F. then /m(A)±//v(S) in L| and Im(A)x 1 1ы(В)у mL2G. (B) If ? is a square integrable martingale, then Im is bounded on ??. Indication of proof. Let A e Tl and let A be a disjoint union of predictable rectangles {0} ? A0, A0 e Tq and ]s,, t, ] ? A,, for A, e ?,. , 1 < / < и. Let 7 be the max of the i, and set В = [0, 7] ? ?. Since /M(A)±/M(S - A) by (A), we have \Im(B)\\ = \Im(B-A)\; + \Im(A)\]. which implies \Im(A)\~2 < \Mt\j <sup|M,|5 < oo. t (C) ? ? can be extended to a ? -additive measure on V by the Summability Theorem, since M, being a Hubert space, does not contain a copy of со· (D) ? ? has finite semivariation relative to (F. LG) for any embedding ? с L(F,G). a 1.4. 77?e stochastic integral Now that we have our vector measure ?? in hand, we can apply the vector bilinear integration of Brooks and Dinculeanu A976) (see also Diculeanu B000) and Dinculeanu B002), Chapter 8 in this Handbook, for details regarding this theory). We shall freely use this theory in the sequel. We shall assume in this section that X:R+ ? ? ^ E. where ? с L(F, G) is a cadlag, adapted p-summable relative to (F,G). We consider ?? as extended to V[0, oo], with /x({oo} x A)=OforAer:=V^o^· Our vector measure space is (R+ ? ?. V. ??) where ?? : V -> LE С L(F. LG). If l/p + l/q = 1, and the space ? с LG, which contains the simple functions is norming for L?. For convenience, we let m denote ??. Define the measure m- = (??):: V[0, oo] —> F' for ? e ? by (y, m-(A)) = (m(A)y, z)= I [lx(A)(co)y. ?(?))??(?).
Stochastic processes and stochastic integration in Banach spaces 459 Then, by the definition of the semivariation m = Ix, we have (I'^F.Lp=ffiF.L''. :=sup{|iw:|: ;eZ. |c|i;<l}. where \m-\ denotes the total variation of m -. If ? is fixed, write If.,? to be the of all positive ?-additive measures |w-|, ? e ? and |;|(/ < 1. For any Banach space D, we denote by Td(If.g) or To(IF Lr) the space of all predictable processes ? :R+ ? ? ->¦ D such that < 1 \ < oo. iF.G(H)=mh. L^H) = supl j\H\d\m:\: \z\4 < 1 even if Я is defined on R+ ? ?, since the value is the same. We remind the reader that TdUf.g) is a complete vector space with seminorm Iy.c an(J is contained in the intersection of all the spaces 0D(\m:\), for ; e Z. For any set С С Tody.с), we denote the closure of С in this space by Td(C. If.g)- Obviously, the most important case is when D = F, then we can define j1\о.цН ??? := fQ ?dX e LPC for all t ^ 0, and we also demand that the process (L· ,, ?dI),^o has a cadlag modification, for ? belonging to a subspace of Ту {Aу,о)У. this will be our Lebesgue space of functions denoted by Ц, G(X). the space in which our stochastic integration theory lies. But, a lot of work must be done before we reach this stage. Now we begin work on constructing the stochastic integral. Let ? с (L^)' be norming for Lpc and let ? e TF.C{X) := Ff(Ux)f.g)· Then since ? e L\-(\m:\) for each ; e Z, we have that the mapping /"¦ m ? dm- A) is a continuous linear functional on Z. Definition 1.7. Denote by f Hdlx the element of Z' defined by A). Note f Hdlxe (LPG)" if ? = L4C.. Also note, if ? с Z|, then / ? dlx is a restriction to ? of the element defined by Z|. We now present a useful type of Lebesgue Theorem for pointwise convergence H" -> ? of elements in Tf g(^) relating to the convergence of J H"dlx -> f Hdlx, when fH"dIxeL'Gc(L'G)". THEOREM 1.8. Let (H")n^0 be a sequence of elements from Tf.g(x) s"ch that \H"\ < \H°\ for each ? and H" -> ? pointwise. Assume that f H" dlx e L!G for each ? ^ 1, and f H" d]x —* j ? dlx converges pointwise on ?, weakly in G. Then f Hdlx eLG and f H"dlx -> f Hdlx in the a(L!G.LqG.) topology of L'G, as well as pointwise, weakly in G. If (f H" dlx)n converges pointwise strongly in G, then f H" dlx -> / ? d1x strongly in LG.
460 ? ?. Brooks Indication of proof. Since |//K \H°\, we deduce that ? e Ту.с(Х). Let ? e i/JL. Apply Lebesgue's theorem in space Lj. (\m-\), and deduce that H" -> Я in L^(|m.-|), and thus J H" dm- ->¦ f ? dm-, that is /»¦ 'rf/? K-),c(-))J -> ( / Hdlx,z ? h e LX(P), then hz e L^,, so we obtain E\h()((j H,dIxX),z{)^^n Hd1x,hz\ Thus the sequence (((/ Hn dlx)(),z{))) is weakly Cauchy in Ll(P), and consequently the indefinite integrals of the above sequence are uniformly absolutely continuous with respect to P. Let ?(?) be the weak limit in G of (J Hndlx){w). The Vitali Convergence theorem implies that (?(-),?(-)) e Ll(P) and ((/ H" dlx)(-),z(-)) converges to (Ф(),г(·)) in L'(P), hence the expectations ?(((/H"dlx)(-),z(-))) converge to ?({?,?)). Since ? e ?-g. is arbitrary, we see that ? e LG. Consequently (Ф,г) = ?((*(¦), г(-))) = (f Hdlx.z), hence /ffrf/x = ? e LG and jH"d1x -»· (fHdIx) pointwise, weakly in G. From the above, it follows that f H"dlx -*¦ f Hdlx in the a(Lg, L^,) topology of Lji; in particular in the ?(?^, L^,) topology of L^. This in turn implies that the indefinite integrals are uniformly ?-additive on ?. If (/ H"dlx)(w) -> ?(?) strongly, the Vitali convergence theorem for LG implies f H''dlx -> J Hdlx mOG. ' D Now we are in a position to describe our space of integrable functions ? for which we can define the stochastic integral. First of all, it can be shown that if X is p-summable relative to (F, G) and ? e Ty.G(X), then if / \iQj]HdIx belongs to Ll'E for each t > 0, the process t н» il[0.,]HdIx is adapted, that is f 1[0j\Hdlxis ?1,-measurable, for each t. But this process may not be cadlag, which is necessary for any viable stochastic theory. So we make the following definition. Definition 1.9. We denote by L\ G(X) the set of processes ? e JFf.cih) satisfying the following two conditions: A) j\[0j]Hd1x eL?, for each? > 0; B) the process (f lw.t]H dlx),^t) has a cadlag modification.
Stochastic processes and stochastic integration in Banach spaces 461 We say the processes ? e LlF G(X) are integrable with respect to X, and any cadlag modification of the process in B) is called the stochastic integral of ? with respect to X, denoted by ? ? X or J HdX: (? ??),(?)=( f HdXj (?)= ( I 1\(,.,\H dlx )(?), a.s. If ? = oo, set (? ? X)x = f Hdlx. We note that the stochastic integral is defined up to an evanescent process. If ? is an 7^-step process taking values in F, then (? ? ?),(?) = / Hs(w)dXs(w), that is ? e LFG(X) and its stochastic integral is a Stieltjes integral. Also if X has finite variation |X| and if X is p-summable relative to (F, G), and if ? belongs to Tf.g(X) and satisfies f[Q r]\Hs(w)\d\X\s(w) < oo for each t and ?, then Я е Lf.G(X) and the stochastic integral (? ? ^),(?) equals the pointwise integral /[() ,| Hs(tu)dXs{w). Furthermore, if ? e LF G, then for every t e [0, oo], we have (//xX),_eL? and (HxX),-= Hd1x. •/I0.il In particular, (Я ? ?)-?- = (? ? ?)^ = f ?dlx and the mapping f -> (Я ? ?), is cadlag in L^,. 1.5. Closures of subsets in T(-\g(X) As before, if С С TF.G{X), then Ty.c(C, X) denotes the closure of С in Т/.,С(Х). If С consists of processes ? such that f Hdlx e LG, then by continuity of the integral, we have f Hdlx e L'G for all ? e TF.G(C, X)- It turns out that if С С L\G{X), then Ff.g{C,X)cLfg{X). In this case, we write LF G(C,X) = Tf-.c(C, X). Here are some spaces С of particular interest: A) The space BF of bounded predictable processes with values in F; B) The space SFD) (respectively SF(V)) of simple, F-valued processes over К (respectively V); C) The space ? of predictable, elementary, F-valued processes of the form // = //0l|o)+ ? ?*\?,.?,-^ B)
462 J. K. Brooks where (?,) ? ^,^„+i is an increasing family of stopping times with 7b = 0 and //, is bounded and ??; measurable, for each /'. One can show that SfCR) and ?f are contained in LFG(X). More generally, if the family of measures If.g is uniformly ?-additive then Тг.сE{П),Х)=Ту.с(Ву,Х) = 1\,с{Вь,Х), and this is always true if F is replaced by ? and G is replaced by E. If X has integrable variation, or if X is a square integrable martingale in a Hubert space E, then J>.c(S(ft), *) = ?'-.c<X)· The need for the family of measures /f.c to be uniformly ?-additive arises from the fact that Egorov's theorem is not valid, in general, without this condition - a condition that markedly differs from the classical setting. It turns out that if G ^> cq or if G is weakly sequentially complete, then /f.c is in fact uniformly ?-additive. We note that it can be shown that if ? is an elementary process of the form listed in B), then ? e LFG(X) and (? ? ?), can be computed pathwise: (H xX), = HQXQ+ ? //.-(??-,.,?,-??-?,). 1.6. Completeness of the space L h G (X) It is of the upmost importance that our Lebesgue space L\. c(X) of functions ? integrable with respect to X is complete. It is the cadlag aspect which is the difficult point, and this is handled in the next theorem, which will imply that LF G(X) is in fact complete. THEOREM 1.10. Let (//") be a sequence from L\- g(X) and assume that H" -> ? in Ff.g(X)- Then we have ? e LF G and for every t we have (//" ? ?), -> (? ? X), in LG and there exists a subsequence (nr) such that (H"r ? X), -> (? ? X), a.s. as г -> oo, uniformly on every bounded time interval. Indication of proof. Note, to get ? e L]F G(X). we use the uniformly convergent subsequence to ensure that we have an adapted cadlag modification of (/ l[o.,|# dlx),. (//") is a Cauchy sequence in LF G(X) converging in Ту.в(Х) to H. By passing to a subsequence, if necessary, we can assume that /V.c(#" - Hn+[) ^ 4~" for each n; let Z" = //" ? X, and for ?? > 0, define the stopping time t/„=inf{r: \Z* -?;,+1|>2-"}??().
Stochastic processes and stochastic integration in Banach spaces 463 Let G„ —{Un< ?()}¦ For each stopping time v, we have Z" = / l[o.v\H"dIx, hence E(\Znv-Z4, + l\) = E(\ f (H"-H"+])dIx) \\J[0.v\ / (H"-Hn+l)dIx I/ ./[Од·] \f (//" - H"A \JU).v\ l)dlx L'c, <;/>.с(я"-я"+')^4-". /[Од In particular, for ? = Un, we have E(\Z"u„-zauV\)^-n- We assert P(G„)^2nE(\ZnUn-Z»+l\)^2-n. To see this, note if ? e G„, then [/„(?) < ?<>; take a sequence ?* \ ?/„(?), with ^ < ?? such that |??(?) - ?"+1(?)| > 2"", for each it. Then, by the right continuity of the Z', conclude that |?^(?) - Z?+' (?)| > 2"", and thus E(\Z4jn-Z'^\)^2-"P(Gn), and the desired inequality follows. NowletGo = limsupG,,.ThenP(Go) = 0. ????? Go, there is a ? such that if и ^ ?, we have ? ? G„, hence [/„(?) = ??· Thus sup|z;'M-z;,+1|s;2-", '<? and we have the desired uniform Cauchy condition of (Z"(a>)) for t < to- Define ?,(?) = lim„ ?"(?), ? ? Go. This process is adapted and cadlag and ?/ — Zf ? ? ^2 -n + l It follows that for t < t0 we have Z, e L? and Z" -> Z, in L?. On the other hand, 1[0.,)Я" -»¦ 1[0.,)Я in TF.G{X), hence Z'' -»· f[0j]HdIx in (?.«,)'. Thus / ¦ Hd1x = Z, eL1' Since ? is cadlag, we deduce that Я e L\ g(X) where we extend Z, consistently, for r e [0, oof, and we also have (? ? ?), = ?, for each t. Thus LlFG(X) is complete, and
464 J. K. Brooks since ?? was arbitrary, we have the desired uniform convergence for a suitable subsequence on each bounded time interval, ? Corollary 1,11, LF G(X) is complete, COROLLARY 1.12, If IF,G is uniformly ? -additive (for example, if G ^> со or G is weakly- complete), then LFG(X) contains all the F-valued, bounded, predictable processes (in particular, if F = Й). In fact, in this case Zf, the space of elementary processes is dense in Ff.g(Bf, X)· Since EF с LrG(X), we have Ty.c(Br\ X) С L\G(X), One can also show that if X has integrable variation or if ? and G are Hubert spaces, then LF G(X) = Рг'.вФ, X) = Ff.g(X). 1.7. Stopping the stochastic integral We present in this section a few results to show that the usual rules to stopping apply to LlFG(X), THEOREM 1,13, Let ? e L[FG(X) and let ? be a stopping time. Then 1[о.т\Н е LlFG(X) and A[0tj#) xX = (H xX)T, If ? is predictable, then 1[о.;г[# е LF G(X) and (ll0JlH)xX = (H xX)T~, COROLLARY 1,14, For every stopping time T, we have l[o.T] x X = X ¦ If ? is predictable, then l[o.r[ x X = XT~. THEOREM 1.15. Let S ^T be stopping times and assume that either (a) h : ? -> ? is bounded, ? ^-measurable and ? e LF G (X) or (b) h; ? —> F is bounded, ?'^-measurable and ? e L^ E(X)· Then (i) 1M 7-)Я and hl]s.T]H are integrable with respect to X and (hl]S.T)H) ? X = h[(l]S.TiH) ? X], (ii) If S is predictable and h is ?<;--measurable, then l[s 7-)// and h\[s.T\H are integrable with respect to X and (hl[S.T]H) ? X = h[(l[S.T]H) ? X].
Stochastic processes and stochastic integration in Banach spaces 465 The next theorem gives a more complete description of the properties of X for a p-summable process X, THEOREM 1,16, Let ? be a stopping time, (a) XT is p-summable relative to (F,G) and /хг(А) = /х([0,Г]ПА), /orAe-PfO, oo], (a') If ? is a predictable, then XT~ is p-summable relative to (F, G) and we have ???- (A) = Ix ([0, T] П A), for A e V[0, oo]. (b) For every predictable, F-valued processes H, we have s\arF L^IXT(H) = s\aiF LrIx(l[0.T]H), (b') If ? is predictable, then s\aiF L?lXT-(H)=s\aiF L?lx(llo.T]H). (c) We have ? e TF.G(XT) if and only if\[i).T]H e TF.G(X) and in this case ? Hdlxr= j ll0,T]HdIx. (c') If ? is predictable, then ? e Ту,с{Хт~) if and only ifl[o.T[H e Tf.g(X) and in this case f HdIXT=fl[0.T[HdIx. (d) We have ? e LF G(XT) if and only ifl{o.T]H e LF G(X), and in this case ? ? XT = (ll0.T]H) xX. IfHe LFG(X), then ? e LFG{XT), 1?0.7?^ e lf.gW> and (? ? X)T = ? ? XT = (Iio.t-)//) ? X. (d') Assume ? is predictable. We have ? e L\ g(Xt~) if and only if l[o.T[H e LF G(X), and in this case ? xXT~ = (l[l).T)H) xX.
466 J. K. Brooks (e) If the set of measures (??) F L·/, is uniformly ? -additive, then so is (Ixi) FL4, (e') If ? is predictable and if (? ?) h- L4 is uniformly ?-additive, then so is Aхт-)у L4 , Recall, that if (Yt) is a process, AY, = Y, — Y,-, assuming У,_ exists, THEOREM 1,17, For any process ? e L\ - g(X), we have A(H ? X) = HAX. 1.8. Convergence theorems in L^ G(X) To be a genuine Lebesgue space, the Vitali theorem and the Lebesgue convergence theorems should hold. These theorems are now stated for LF G(X). THEOREM 1.18 (Vitali). Let (//") be a sequence from J7f.g(X) and let ? be an F-valued predictable process. Assume that (i) 1F.G(//" 1A) -> 0 as IF.G(A) -> 0, uniformly in ? and that any one of the conditions (ii) or (iii) below is true; (ii) H" -> ? in Ifg-measure; (iii) H" -> ? pointwise and I/. ,G is uniformly ? -additive (for example, if F = R or G is a Hilbert space). Then (a) ? e TF.G(X) and H" -* ? in FF.G(X), Conversely if H'\ ? belong to FF.G(BF, X) and H" -+ ? in TF.G(X), then (i) and (ii) hold. Under the hypotheses (i) and (ii) or (iii), assume, in addition, that H" e LhG(X) for each n. Then (b) ? e LF C(X) and H" -> ? in L[- g(X)\ (c) For each t e [0, oo], we have (//" ? X)l -> (? ? ?), in L'G; (d) There is a subsequence («,·) such that (?"' ? X), -> (? ? X), a,s., uniformly on bounded time intervals. We remark that if H" -> ? pointwise uniformly, we have a similar result, THEOREM 1.19 (Lebesgue). Let (//") be a sequence from TF.G(X) and let ? be an F-valued predictable process. Assume that (i) There is a process ? e Fr(Bvx, X) such that |Я"КФ for each n\ and that either (ii) or (iii) hold; (ii) //" —> ? in IFG-measure; (iii) H" -> ? pointwise and IF G is uniformly ? -additive (for example, if F = RorG is a Hilbert space).
Stochastic processes and stochastic integration in Banach spaces 467 Then (a) ? e TF.G(B, X) and H" -> ? in Ty.ciX). Assume in addition that H" ei,|- G{X) for each n. Then (b) ? eL]F G(X) and Hv -> ? in L\- g(X); (c) For each t e [0, oo], we have (//" ? X), -> (Я ? ?), /? L?; (d) 7/геге is a subsequence (nr) such that (//"' ? X), -> (Я х X), a.s., uniformly on bounded time intervals. 1.9. WeaA; completeness and compactness of LF G(B, X) The advantage and purpose of establishing a Lebesgue space for the bilinear vector integral is the possibility to examine weak completeness and weak compactness - a major consideration for any theory. In this section we address this topic. We write В for By. THEOREM 1.20. Assume that F is reflexive and G ~J> cq. Then L\ g(B, X) is weakly sequentially complete. Now we shall apply the general theory of weak compactness in the bilinear integration theory to LF G(B, X). Recall that a subset К in a Banach space is said to be conditionally weakly compact if every sequence of elements from К contains a subsequence which is weakly Cauchy. THEOREM 1.21. Let X be p-summable relative to (F, G). Assume F is reflexive and If.· G is uniformly ? -additive. Let К С L\g (?, ?) be a set satisfying the following conditions: A) К is bounded in LFG(B,X)\ B) H\An -> 0 in LFG(B, X) uniformly for ? eK, whenever A„ e V and A„ \ 0. Then К is conditionally weakly compact in Ly G(B, X). If in addition, G ^> cq, then К is relatively weakly compact in LF G(B, X). In the last case, for every sequence (H")from K, there exists a subsequence (H'h) such that (f H"rdX), converges weakly in LG,for each t. THEOREM 1.22. Let X be ?-valued and p-summable relative to (?, ?). Let К С Ljjj E(B, X) be a set satisfying the following conditions: A) К is bounded in L^ E(B, X); B) fA ? dlx -> 0 in LPE uniformly for ? e K, whenever A„ e V and A„ \ 0. Then К is conditionally weakly compact in L~ E(B, X). If in addition, ? ^ cq, then К is relatively weakly compact in L^ E(B, X). In the last case, for every sequence (H")from K, there exists a subsequence (//"') such that (//"' ? X), converges weakly in LE, for each t.
468 J. K. Brooks THEOREM 1.23. Let X be an ?-valued process, p-summable relative to (?. ?). Let (//")„^0 be a sequence of scalar processes from L~ E(B.X). Suppose that ? ^> cq. IfflAH"dX -> flAH°dX, for every AeV, then H" -> H° weakly in LlEE(B. X), hence ( I H" dx\ -> ( I H°dx\ weakly in LPE, for each t > 0. 1.10. Local summability and local integrability As usual, X : R+ ? ? -> ? с L(F, G) is a cadlag, adapted process with X, e L[ for each f >0. Definition 1.24. (a) We say X is locally p-summable relative to (F, G), if there exists an increasing sequence (T„) of stopping times, with T„ / oo, such that for each n, XT" is p-summable relative to (F,G). (b) Assume X is locally p-summable with relative to (F,G). A predictable process ? : R+ ? ? -> F is said to be locally integrable with respect to X if there is an increasing sequence (T„) of stopping times with T„ / oo, such that, for each n, XT" is p-summable relative to (F, G) and l[o.r„)" is integrable with respect to X1". Note that in the above situation, the stochastic integral A|о.7"„]н) х XT" is defined. The following theorem states the existence of the pointwise limit of the above sequence of stochastic integrals. THEOREM 1,25. Assume X is locally p-summable relative to (F,G) and let ? be an F-valuedpredictable process, locally integrable with respect to X, Let (T„) determine the local integrability of ? with respect to X (as above). Then the limit lim(l[o.:r„itf) x X ?„ exists pointwise outside an advanced set. This limit process is cadlag, adapted and independent of the sequence (T„). The limit in the above theorem is called the stochastic integral of ? with respect to X and is denoted by ? xX or / HdX /»¦ We have that (? ? X)T" = A[0???]?) ? ??" for each n. This stochastic integral has the main properties of the stochastic integral with the respect to a summable process.
Stochastic processes and stochastic integration in Banach spaces 469 1.11. Ito's formula In order to compute the Riemann integral, a calculus had to be developed; the stochastic counterpart of evaluating stochastic integrals is the Ito stochastic calculus, founded on the famous Ito formula or change-of-variable formula. Ito established this formula for continuous semimartingales, which was later extended by Kunita and Watanabe. In this section we shall present the full extension in the Banach case setting to processes X which are semi-locally summable (defined below). This was developed in Brooks and Dinculeanu A990), In order to motivate Ito's formula, consider the very concrete, but important, case when (X,) is a Brownian motion. Assume С and g are suitable real-valued functions on R~ and that f(x, t) is a continuous function on R ? [0, oof together with its derivatives /,·, /,,, /,, Let (Y,) be a processes such that for any 0 ^ t\ <fiwe have: Yn-Yt]=i C(t)dt+[ g(t)dXt. J[t]-4] J\']-':\ Set dY, = C(t)dt +g(t)dX,. Then the differential form of Ito's formula is df(Y,,t) = [?,(?,,?) + fAY,,t)C(t) +{f,AY1J)gZ(t)]dt + fAY,,t)g(t)dXt. Note that if X, were continuously differentiable in ?, then by the standard calculus formula for total derivatives, the term 1 /2fxxg- dt would be absent. It is this term that distinguishes the Ito calculus from the classical case. Brownian motion, being a square integrable martingale on [0, t], with continuous paths, yields this "nice" version of Ito's formula. The general case requires quite some preparation. In the general Banach space setting, dealing with processes of a much more complicated nature, we must study jumps (AXSJ and processes [[X]], the quadratic variation, and impose the condition that the quadratic variation is finite. Now we shall state Ito's formula in its general form; all terms will be defined later, and some theorems involving the relevant concepts will be presented in order to contribute to the reader understanding of the framework of the theory. THEOREM 1.26 (Ito's formula). Let X: R+ ? ? —> ? be semi-locally summable with respect to the bilinear mappings B\ (x. y) = л <g> у of ? ? ? into ? <g)T ? and Bj(x, у) = y(x) of ? ? L(E, G) into G (that is, we regard X, in this instance, as taking values in L(L(E,G),G)). Assume that X has finite quadratic variation [X]. Let f : ? —*¦ G be a function of class C~ such that f"': ? -> L(E ®? ?, G) is uniformly continuous and bounded on bounded subsets of E. Then for every t > 0, we have a.s.: f(X,)=f(Xo)+ f f'(Xs~)dXs + \S f"(Xs-)d[[X]]s J\Q.i\ l J]0.t]
470 J. K. Brooks = /(^?) + ? f'(X,-)dXs + \ ? f"(X*-)d[[X]]\ J]0.i] J\0.i\ + ? [уда-я*.-)-/'<?.-)??,]. 0<.?<? 1.11.1. Preliminary results. Throughout this section, assume ? С L(F. G) and X : E+ ? ? -> ? is a cadlag, adapted process with X, e LJ.¦(/>), for ? > 0. We shall reserve the notation (и(нД))^о, н = 1.2 , for a family of stopping conditions, which we indicate by (*): (i) for each n, v(n, 0) = 0 and v(n, k) / oo, as к —> oo; (ii) Нт„5ирА.(и(иД + 1) - u(/;, A:)) =0; (iii) there is a sequence a„ \ 0 such that for t e [>(//, k), v(n. к + 1)], we have \%t - Xv(„.k)\ ^ an- Suppose a„ \0. An example of such a family (v(n,k)) satisfying (*) is u(n, 0)=0 and v(n,k + 1) = inf{/ > v(u,k): \X, - XHn.k)\ > a,,} A(v(n,k) +a„). For each n, we define X" = / ,-Уг(п.;-I1г(п.;-).1'(п.;-+1I- *^o From condition (ii) we deduce that X" -> X_ pointwise, and uniformly on М+хй. Define Уо- = 0, for any process Y. The next theorem is important in proving Ito's formula. THEOREM 1.27. Assume that F = L(E,G)\ hence ? С L(F,G). Assume X is locally summable relative to (F,G). Let f: ? -> G be a function of class C1. Then /':?-> L(E, G) = F is continuous and f'(X-) is locally integrable with respect to X. Assume further that either the set of measures (I)r.G " uniformly ?-additive (for example, ifF = RorG^> c0) or the second derivative f":E-> L (? ®T E.G) is bounded on bounded subsets of E. Then for every t > 0, we have f'(X,-)dXs =lim}f'(Xl(„.k)At)(^Hn.k+\)A! - Хцп.к)лО- к>{) / Ло
Stochastic processes and stochastic integration in Banach spaces 471 1.11.2. Vector quadratic variation. In this section ? and D are Banach spaces and ?. ? ? ? -> D is a continuous bilinear mapping, denoted by ?(.?, y) = лгу, such that 1-х:| = sup{|*y|: |y| ^ l},for? e E. Denote by ?': ? ? ? -> ?) the bilinear mapping defined by B'(.x, y) = ?( ?, л). Write 2 ?:? = ? ¦ ?. Important examples of such bilinear mappings are: A) The tensor product B(x, y) = ? <g> у from ? ? ? into ? <g>.T ?. We write д·®- = ? <8>?. B) The inner product S(jc, у) = {?, у), if ? is a Hilbert space. We define a process Z, taking its values in ?, to be semi-locally p-summable relative to the pair (?, G), if it is of the form ? = X + ?, where X is locally p-summable relative to (F, G) and ? is an ?-valued cadlag, adapted process with finite variation. If5:?x?->Disa bilinear mapping as above, we can embed isometrically ? с L(E, D). We say a process X : R+ ? ? -> ? is locally summable or semi-locally summable relative to the bilinear mapping ?, if we regard X as taking values in L(E, D) and X is locally summable or semi-locally summable relative to (?, D). If X is semi-locally summable relative to both В and B\ then we can integrate ?-valued processes H,and f ? dX and / dX ¦ ? have values in D. In particular, if X is semi-locally summable relative to В and B', then X_ is locally integrable with respect to X for both В and S', and the stochastic integrals / Xs-dXs and / dX,-Xs- J[0.t] J\Q.i\ are defined and have values in D. Note that if В is the tensor product or the inner product, then local summability relative to В implies local summability relative to ?'. Definition 1.28. Assume Х:1+хй-> ? is semi-locally summable relative to В and ?'. The D-valued, cadlag, adapted process [[?]]? defined by [[X]]f = Xj- [ (*,_¦</*,+</*.,-Хд._) J [O.i] is called the vector quadratic variation (or vector square bracket) of the process X with respect to B. Note that [[X]]B = XBr If B(x, y)=x®y and D = ?®? ?, we denote [[X]] = [[?]]?, and call [[X]] the tensor quadratic variation of X. Hence, if X is semi-locally summable relative to (x, у) н> ? <g> у, then [[X]], = Xf2- f (XA_®i/X(+i/Xv®X.,_). JI0.t \ If ? is a Hilbert space and X is semi-locally summable relative to the inner product ?, we denote [X] = [[?]]?, and call [X] the scalar quadratic variation of X. In this case we have [X], = |X,|2-2 [ Xs-dX„. J[0.t\
472 J. K- Brooks Shortly we shall present an extension of [X], when ? is a general Banach space. Note that if ? is a Hubert space and X is a semimartingale, then both [[X]] and [X] can be defined since in this case X is semi-locally summable relative to the corresponding bilinear maps. The following theorem gives us some insight into the nature of [[?]]?. THEOREM 1.29. Assume X:R+xl2->?is semi-locally summable relative to В and B'. For each t ^ 0, we have [[X]]f = X5+limprob^(Xl.(„.jt+i)A, - Хш.к)л,J- k^O If X is locally summable then there is a subsequence such that the limit is a.s. uniform on bounded time intervals. Remark 1.30. If ? is a Hubert space and X is semi-locally summable relative to the inner product, then [X] is an increasing, positive process. 1.11.3. Quadratic variation. In order to ensure that [[X]] has finite variation paths, we need to examine the quadratic variation [X]. Definition 1.31. We say that X has finite quadratic variation if there exists a family of stopping times (v(n,k)) satisfying condition (*) such that [X], := |Xol" + hm / \Xvt„.k+\)Ai ~ Ху{п.к)л,\~ ? *¦—' k^O exists and is finite a.s. for every t ^ 0. The process ([X]t) is called the quadratic variation ofX. Note that [X] is positive, increasing and adapted. If ? is a Hubert space and X is a semimartingale. then X has finite variation [[X]] which is equal to the vector quadratic variation [[?]]? relative to the inner product B. An important fact is that if X is semi-locally summable relative to В and B' and if X has finite quadratic variation, then [[?]]? has finite variation and \[[X]]f-[[X]]f\^[X],-[Xh, for s^r. When [[?]]? has finite variation and takes values in Z), one can integrate pathwise, with respect to [[X]]B, process ? with values in F, if F с L(D, G). Although [[X]]f has an integral representation, we cannot use this formula in evaluating / Hsd[[X]]f, since we do not know if the processes j Xs- ¦ dXs and fdXs ¦ Xs_ have local finite variation or if they are locally summable relative to the bilinear mapping D ? F -> G. The following theorem studies this problem.
Stochastic processes and stochastic integration in Banach spaces 473 THEOREM 1.32. Assume that X is semi-locally summable relative to the bilinear maps В and B' and suppose X has finite quadratic variation [X]. Let F = L(D, G) and let ? be an F-valued cadlag, adapted process. Then [[X]]B has finite variation, hence the Stieltjes integral L· t.Hs-d[[X]]f is defined pathwise. Let (v(n,k)) define the quadratic variation [X]. Assume that for each ? e ? there is a sequence b„(w) 4- 0 such that for t e [v(n. k), v(n,k + 1)[ we have \?,(?) - Я,,,,,.*,(?) | ?,,(?). Then there is a subsequence {m} of{n] such that a.s. Hs-d[[X]] = lim> Hl.lm_ii)(Xv{m^^.\)Al — Xi-im.k)M)'· m *—' The above theorem has a corollary which is used in the proof of Ito's formula. COROLLARY 1.33. Assume that X is semi-locally summable relative to the tensor product and has finite quadratic variation [X], and let f: ? -> G be a function of class C~, such that f" is uniformly continuous on bounded subsets of E. Then the Stieltjes integral f\p ,|/"("К.«-)^[["К]]? " defined pathwise and there is a family of stopping times (which can be used in the definition of[X]) such that a.s. f"(Xs-)d[[X]]s = \imyif"(Xrl„.k))(Xl.i„.k+i^l - Х|.<п.*>л,)®2- Jt^O 1.11.4. The process of jumps. In this section, we shall state some results about the jumps of X, which will enable us to prove Ito's formula. LEMMA 1.34. Let X:R+ ? ? -> ? be cadlag and adapted. Let (v(n,k)) satisfy condition (*). Then for every ? and ?, we have {s: \AXA >2an} c{v(n,k): k^O}. THEOREM 1.35. Assume that X is ?-valued, adapted, and has finite quadratic variation [X], and let (v(n, k)) be a family of stopping times defining [X]. Then A) thefamily {\??^\-\ s ^t] is summable for each t; B) the process of jumps defined for any t ^ 0 by J, :=J2\A*s\2 is increasing, cadlag, adapted and satisfies J, ^ [X]t\ C) J, = lim„ Хл>о |A-Kr<n.it)l2l|r(H.Jt)^f) uniformly for t on any bounded time interval; / / J]0
474 J. K. Brooks D) let V," = J2k^o \Xvin.k+\)/\t - Хш.к)лА2 and assume that for every ? there exists a subsequence («,-), which may depend on ?, such that V'h (?) converges uniformly to [?],(?) — \Xq(w)\2 on bounded time inten'als. Then [X] is cadlag and the process [?]?;·.= [?], -?|дх.,|2 is continuous. Note that the assumption D) is satisfied if ? is a Hilbert space and X is locally summable relative to the inner product. Now we examine the regularity of [[?]]?. THEOREM 1.36. Assume X is ?-valued, cadlag, adapted and has finite quadratic variation [X]. Let (v(n,k)) be a family of stopping times defining [X]. Then A) the family {(Xs)-: s ^t] is summable for each t; B) the process of vector valued jumps ? J, := ?(???J is cadlag, adapted and has finite variation \vJ\ which satisfies \vJ\, <?\??,\2\ C) we have ^2(AXsJ = \im^2(AX,llKk)Jl[,ilukKl] uniformly for t in any bounded time interval; D) assume that X is semi-locally summable relative to В and B'. Then the process [[x]f; ¦= [[xnf - ?(???J s<i is continuous; E) let ? :R+ ? ? -> L(D, G) be a cadlag, adapted process. The family {//,_(??,J: s ^ t} is summable and we have J, ¦= ? Hs-(AX.sJ = lim^2Hv{,Kk)-(AXm.k)Jl[H„.k)^t} 0<s%i k^O uniformly on bounded time intervals. The process J,(H, X) is adapted and right continuous.
Stochastic processes and stochastic integration in Banach spaces 475 The following corollary is used in establishing Ito's formula. COROLLARY 1.37. Assume X is ?-valued, cadlag, adapted and has finite quadratic variation [X]. Let (v(n, k)) be a family of stopping times satisfying (*) defining [X], Let f ; ? -> G be a function of class C2\ hence /":?-> L(E <§>.T E, G). Then the family {f"(Xs-_)(???)-: s ^ t] is summable and V /"(X,-)(AX,)®2=limV/"(X1.,n.A)-)®2l|,.,n.*)<0 1—1 n 1—/ 0<.5<i i>0 uniformly on bounded time intervals. 1.11.5. Comments on the proof ofltd's formula. Now that we have the machinery in place, we shall make some comments on the outline of the proof of Ito's formula, stated in Theorem 1.26. If / is the function in Corollary 1.37, Taylor's formula holds, that is, there exists a function RE ? ? -> L(E ®? ?,G), with R(x,x) = 0, for ? e E, such that limv_>..v R(y, x) = 0 uniformly with respect to .v belonging to any given bounded set and such that for ?, у e ? we have f(y) = /(*) + /'(*)<>· - -v) + fv/V)(y - -v)®2 + Жу, *)(y - ·*)®2- Returning to Ito's formula, by the above, we have f{X,) - f(Xo) = ? (/(^i-c.t+DAi) - /<X.-(,uw)) Jt^O = /__,/ (-^?·(??.I·)??)(-??·(??.I + ?)?? — -^i'(ii.i)Ai) C) jt>0 + - 2__,/ '(-??·<?1.*)??)(-??·(?!.* + ?)?? — -??·(?!.*)??) " D) )®2, E) where we set R„.k := /?(?,4„.*+?,?? - Л:г|„.*)л,). By Theorem 1.27, taking a subsequence if necessary, we see that the sum in C) converges pointwise to L· t] f'(Xs-) dX,. By Corollary 1.33, again by taking a subsequence if necessary, we deduce that the sum D) converges pointwise to , f,Qj, f"(Xs-) d[[X]]x. The remaining task - an arduous one - is to prove that lim > R,i.k(Xv{n.k + \)At — -Kr(njt)Ai) /1 * ' k>0
476 J. K. Brooks is equal pointwise to the sum ? (/<*¦«) " ¦№-) " /'(^.»-)??? - ?/(?,-)(??.?®2)- 0<?<? To do this, write Rn.k(Xi4n.k + \)At - Xr{n.k)At) ' = vn.k + Wij.k +Уп.к, where Vn.k = f(Xv(n.k + \)At) - f(Xi-(n.k+\)At-) — f (Xnii.k))&Xvln.k-<-UAt - ? /"("^ !¦("¦* ))(?-^ !'('!.*+? ??/)8"· W,uk = f(XvOi.k+\)/\t-) - f(Xv<n.k)) - f'(Xi4Kk))(Xvin.k + [)Ai- ~ Xv(n.k)) ~ j/ (Xv{n.k))(Xvin.k + \)At- — Xvln.k)) '· Уп.к = 2~/ (Xvln.k))((&XH,uk+\)At) ® (Xv(n.k+l)At ~ Xf{n.k)) + (Xrln.k+\)At- — Xt-(n.k)) ® ??,.(„.?+|)??)- First prove lim^|iun.jt|l|l.(„.jt)<,)=0 F) uniformly on bounded time intervals. Then prove, if Aluklv{n,k]<t = {???,,,,.^?? > b„}, where b„ I 0. a„ ^ b\ and 2a„ ^ b„, then Hm ^lji?.A|lA„.Al|f(n.*)<:i) = 0 G) k^O uniformly on bounded time intervals. Next prove that for t ^ 0 we have lim^ Я(Хн„д+|)л,, XV{n.k)Ai)(Xi-Ouk + \)Ai - ??(??)??) ~^A'nX =0 (^) k^O and lim J^ /?(X,(„.jt+i), Xv(„.k))(Xvt„.k^\ ? - Хг(«.*))®21д;а ¦ 1[?·(«.*?? = °- (9) k^o Finally, the last two steps are to show Ит?и„.аА„.;-1|,,,,Д)<,)= ? ?(??,^-)(??,)®2 A0) k>0 0<s<i
Stochastic processes and stochastic integration in Banach spaces 477 and Нгп?]и,,.*1и,,.*><,)= ? tf(Xs.Xs-)(AXs)®2. (И) k^O 0<.s<i All of the above are for the case X is bounded. For the unbounded case, obtain a sequence Tj ? oo such that XT'~ is bounded. Note that XT,~ still has finite quadratic variation and [XTi~], = [X],'~; therefore, for each XT<~ we have the formula. For a given ? and t > 0, т. _ choose /' such that ?,(?) > t. Then for every 0 < s ^ t, we have Xs' = X.s, and the equality holds for ?, (?). This completes the outline of the proof. 1.12. Further extensions of the stochastic integral In this section we shall briefly sketch some extensions of the theory which was presented in the previous sections. If ?: R+ ? ? -> ? and X: R+ ? ? -> ? are processes, ? is a Hubert space, X is a square integrable martingale and ? is assumed to be measurable (i.e., measurable with respect to B(R+) ? ?), a stochastic integral ? ? ? was constructed in Brooks and Neal A998), and properties of this integral were presented. In the case ? is an optional process (i.e., it is measurable with respect to the optional ?-algebra - the ?-algebra generated by right-continuous, adapted processes), ? ? ? agrees with the compensated (or optional) stochastic integral developed by Yor (see Dellacherie and Meyer A978)). Moreover, when ? is bounded and measurable, then ? ? ? = °H с X. where °H is the optional projection of Я. When ? is predictable, then ? ? ? agrees with the Hilbert-space stochastic integral defined by Kunita A970). In Brooks and Neal A998) a very general stochastic integral is developed when one of ? and X is real valued and the other is Hilbert-space valued, assuming that ? is just measurable; the construction uses H1' and BMOp spaces of Hilbert- space valued processes. A weak Pettis-type stochastic integral is presented in Brooks and CandeloroB000). Next we turn to the nuclear space setting. The space ? will denote a reflexive, complete, bornological nuclear space, whose dual ?' under the strong topology is nuclear. Since ? and ?' are projective limits of separable Hubert spaces, we can choose appropriate neighborhood bases of zero, U(E) and U(E'), respectively, and work with quotient spaces ?([/), E'(U'), U e U(E), U' e U(E'), that are Hubert spaces induced by the norms corresponding to U and U'; we have canonical mappings k(U):E—* E(U) and k(U, V):E(V) -> ?([/), if V с U (also for the E'(U') system). Ustunel A982) defined the concept of a projective system of stochastic processes in ?' as follows. Call the set X = {Xu'\ U' e U(E')} a projective system of martingales if X1' is an ?'([/')-valued martingale and if V" с U', then k(U', V')XV and Xй are indistinguishable. We say such an X has a limit in ?' if there exists a weakly adapted mapping X' :E+ ? ? —* ?' such that k(U')X' is a modification of XL', for each U'. Such a limit does exist if the projective system consists of square integrable martingales, and X' will be strongly cadlag in ?'. The predictable case is treated in Ustunel A982) and the general case in Brooks and Neal A998b). If ? is bounded and weakly optional, a stochastic integral ? ? ? is defined
478 J.K. Brooks in terms of ? о X u , where U' absorbs the range of ? in the sense that ? takes its values in the Hubert space E[(U')°]. It can be shown that if {//"} is a sequence of weakly optional E-valued functions on R+ ? ?, which converges pointwise in ? to Я and if each H" takes its values in a fixed E[(U')°], then {#" ? ?] converges in the Hubert space of square integrable real valued martingales, if X is a projective system of square integrable martingales. The general case X:R+ ? ? —> ? с L(F,G). where the spaces are nuclear are treated in Brooks A999), where a stochastic measure ? ? on V takes its values in h\: and a bilinear integration theory is constructed for certain processes X. We have the following summability result in this setting: X is summable if ? ? is bounded on 1Z, the ring generated by predictable rectangles. Finally, we construct a Pettis (weak) stochastic integral for predictable processes ? : R+ ??^?, where ? is a Banach space; X is scalar valued, adapted and X, e L', for each t. Call ? weakly integrable if there exists an E- valued process ? such that ? is cadlag and Z, = (((*', //)) ¦ X), for each t. This integral is developed in Brooks and Candeloro B000). The theory yields a norm for such H, and the space of weakly integrable processes strictly contains Ll(X), if ? is infinite dimensional. 2. Regularity and the Doob-Meyer decomposition of abstract quasimartingales introduction In this section we shall examine the structure of quasimartingales taking their values in a Banach space E. We are especially interested in the existence of cadlag modifications and in the Doob-Meyer decomposition of such quasimartingales. The motivation of this inquiry was twofold. First, the problems concerning regularity and the Doob-Meyer decomposition of quasimartingales arose naturally in the attempt to develop a stochastic integration theory for Banach space valued processes. Secondly, we wished to improve on the existing results on the regularity and structure of quasimartingales, as presented by Pellaumail A973), Metivier and Pellaumail A975) and Kussmaul A977,1978). The results in this section were developed by J.K. Brooks and N. Dinculeanu A988). Here is a sketch of the results. Let ? :?+ ? ? -> ? be a quasimartingale (not necessarily cadlag). A) Assume ? has the Radon-Nikodym Property (RNP), that is any ?-valued measure, with finite variation and absolutely continuous with respect to a probability measure P, has an ?-valued Radon Nikodym derivative with respect to P. It turns out, surprisingly, that X is strongly cadlag if and only if X is weakly cadlag B.12). Furthermore, X is cadlag if and only if X has a Doob-Meyer decomposition X = ? + A, where ? is a cadlag local martingale process and A is a predictable, cadlag process with finite variation and A() = 0. This decomposition is unique up to an evanescent set.
Stochastic processes and stochastic integration in Banach spaces 479 B) Assume that ? has the RNP and X is a quasimartingale. Then X has a cadlag modification if and only if X satisfies the following regularity condition: (R) lim?(lFX,) = ?(lFX() fori < oo and F e ?„\ il.v or even the following weak regularity condition: (WR) \im(E(lf.-X,),z) = {E(lf.-X<,z) for s < oo and F e ?,, il.v г in a subset ? of ?", which is norming for E. This is proved in Theorem 2.1 which gives 13 equivalent conditions for the existence of a cadlag modification; these conditions are new even in the scalar case. As a corollary (Theorem 2.13) we give a characterization of the ?-valued measure ??, which is associated with a quasimartingale X. C) Without any assumption on the Banach space E, we present results concerning the existence of a cadlag modification of X in terms of its right limit X+ along the rationals (Theorems 2.15-2.18). The fact that ?-valued martingales have cadlag modification was established by J.K. Brooks and N. Dinculeanu A987). D) Assume that ? has the RNR Then X is a quasimartingale of class (D) and satisfies condition (R) if and only if X has a Doob-Meyer decomposition X = ? + A, with ? a martingale of class (D) and A a predictable, cadlag process with integrable variation and A0 = 0 (Theorems 2.20-2.23). Conditions (D) and (R) together are also equivalent to the associated measure ?? being ? -additive with bounded variation. Various concepts used will be stated in Appendix A and sections in this appendix will be referred to as Sections A. 1, A.2, etc. 2.1. Notation In this section, we shall present notation and definitions which will be used to prove the main results in the next sections. Throughout this part, (?, ?, ?) is a probability space, (?,I€-^. is a filtration satisfying the usual conditions, ? is a Banach space with norm || ¦ ||, and dual space ?", and (Xt),€?__ is an ?-valued, adapted process with X, e L[(P), foreach t ^ 0. We shall always consider X to be extended to oo, with X^ = 0. If X has a limit at do, it will be denoted by X^_. For every a, with 0 < a ^ do, we shall denote by 1Z@. a], the ring generated by the predictable rectangles of the form ]s, t] ? F, with 0 ^ s < t ^ a and F e ??, and by 7?]0, <*[, the union of the rings 7?]0, ?], with ? < a. In particular, 7?]0, do[ consists of all finite, disjoint unions of bounded predictable rectangles ]s, t] ? F with t < oo, and this ring generates the ?-algebra V of predictable subsets of ]0, oof ? ?. Note that 7?]0, a] contains the stochastic intervals ]S, T], where S and ? are simple stopping times bounded by a. We notice that the above rings do not contain subsets of {0} ? ?. In fact, these sets have no role in problems of regularity.
480 J.K. Brooks 2.2. The Doleans function For each predictable rectangle ]s, t] ? F, we define fix(]s,t] ? F) = E(lr(X, - Xs)). In particular, ?-xQs, oo] ? F) = — E(lf.-Xs). Note that ?? is finitely additive on the semiring of predictable rectangles, and thus it can be extended uniquely to an ?-valued, finitely additive measure on the algebra 7?]0, oo]; this extension will still be denoted by ??. It is called the Doleans function of the process X. This function is invaluable in the study of quasimartingales. An immediate consequence of the definition of ?? is the equivalence of the following two conditions: YunE{\FXl) = E(\FX<), A2) lis ШпдлЧКг] x F)=0. A3) ? is As we mentioned previously, we refer to the regularity condition above as condition (R). Other properties of ?? will be stated in Section A.l. 2.3. The mean variation of X and quasimartingales For any a with 0 < a ^ oo, we define the mean variation of X on ]0, a] to be the number Varx]0,a] = sup53|?((X,,.1 " ?0\?',)\? < *>· i where the supremum is taken over all finite partitions 0 ^ ?i ^ · ¦ · ^ t„ = a. The mean variation VarxJO, a[ of X on ]0, a[ is defined similarly by taking t„ <a. It is important to be able to compute Vaix]0, a] by taking the supremum over different sets, for example, random partitions. Results concerning different representations of VarxJO, a] are presented in Section A.2, along with the relationship between Varx]0, a] and ????. the variation of ??. We say that X is a quasimartingale on ]0.a], or on ]0.a[, if VarxJO.a] < oo or Varx]0,a[< oo, respectively. We shall be interested in three types of quasimartingales. namely, those on ]0, oo], on ]0, oof, and those on every bounded interval ]0. a]. Elementary facts concerning quasimartingales. along with some convergence properties are given in Sections A.3 and A.4. In particular, X is a quasimartingale on ]0, a] or ]0, a[, with a ^ oo, if and only if the Doleans function ?? has bounded variation \??\ on TZ]0,a] or 7?]0, <*[, respectively. We say that X is of class (D) if the family {X/: ? simple stopping time) is uniformly integrable. We say X is of class (LD) if for each fixed a < oo, the above family, with ? ^ ?, is uniformly integrable.
Stochastic processes and stochastic integration in Banach spaces 481 2.4. Regularity of quasimartingales In this section we shall present one of the main theorems concerning the existence ?? cadlag modifications of a quasimartingale. Recall that a family ? с ?" is called total (for E) if ? e ? and (x.z)=0 for alU e Z implies that ? = 0. A family ? с ?" is called norming (for E) if for each xe?we have Evidently, any norming set is total. We remark that the RNP of ? in the next theorem is used only in steps 7 and 8 of the proof, and is not needed in the hypotheses of the lemmas established in the course of the proof, except for Lemmas 2.3(b) and 2.11. THEOREM 2.1. Assume that ? has the RNP and that X is a quasimartingale on each bounded interval and has a separable range. Let ? С ?' be any set which is norming for the range of X. The following assertions are equivalent: A) X has a cadlag modification; B) X has a right continuous modification; C) X is right continuous in the mean, that is, for the strong topology of LE; D) X is right continuous for the weak topology of LE; E) X is right continuous in probability; F) lim,j.j E(lFXt) = E(\FXs)fors < oo and F e Es; G) X = ? + A, where ? is a (not necessarily right continuous) local martingale and A is a predictable, cadlag process with finite variation and Aq = 0; A') (X, z) has a cadlag modification for each ? e Z; B') (X, z) has a right continuous modification for each ;eZ; C') {X, z) is right continuous in the mean, that is, in the strong topology of L', for each zeZ; D') (X, z) is right continuous in the weak topology ofO, for each zeZ; E') (X, z) is right continuous in probability for each ? e Z; F') \imlU{E(lrXt),z) = (E(lFX,),z) for s < oo, F e ?, andze Z; (T) (X,z) — M(z) + A(z), for each ? e Z, where M(z) is a (not necessarily right continuous) local martingale and A(z) is a predictable, cadlag process of finite variation with Aq(z) = 0. The decomposition in G) or (T) is unique up to an evanescent set. PROOF. The proof will proceed along the following implications: 7=>1=>2=>3·?>5=> 6 => 6', 7 => 7' => 1' => 2' => З'о 5' => 6' => 7, 3 => 4 => 6' and 3' => 4' => 6'. The only implications that require the space ? to have RNP are 6 => 7 and 6' => 7. All the other implications mentioned above are valid for any Banach space E. The implications 7 => 1 and 7' => ? follow from the fact that any local martingale has a cadlag modification (see Theorem 2.15 below). The implications 2 => 3 o· 5 and 2' => 3' о 5' are stated in Proposition A.4.1. We shall prove below the implication 6' => 7.
482 J.K. Brooks All the other implications are evident. The proof will be done in several steps. In each step it will be assumed that condition 6' is satisfied. We remark that the RNP is not needed in Steps 1-7. In the process of proving some of the steps, we shall establish properties which are valid in more general situations and these will be stated as independent lemmas. In the statements of these lemmas, no conditions will be assumed, unless otherwise specified. We shall now prove the implication 6' => 7 assuming first that X is a quasimartingale on ]0, oo]; therefore ?? has bounded variation on 7?]0, oo]. Without loss of generality we can also assume that ? is separable and that ? is norming for ? and countable. STEP 1. The measure ?? satisfies the following conditions: lim|MX|(]i.f] ? ?)=0 fors < ос A4) and limlux(]s,(]xF) = 0 fors < oo and F e 27,. A5) In fact, hypothesis F') is equivalent to Нт(дх(]5,г] xf),;) = 0 fors < oc, F e ?, and ;eZ. Then Step 1 will be a consequence of the following lemma. LEMMA 2.2. Let ? be any Banach space, let ? С ?' be a total set, and let ?: TZ]0, oo] —> ? be an additive measure with bounded variation \?\. Thenfor every s < oo, the following assertions are equivalent: A) Нт1Ь|д|(]л,г]хЯ)=0; B) lim, is ?(]?, t] ? F) = 0, for every F e Es; C) lim, u (?(]?, t] ? F), z) = 0, for F e ?, and ? e Z. PROOF. Obviously A) => B) => C). Assume now C) and prove B). Since ? has bounded variation |?|, it is strongly additive on 7?]0. oc], that is ?(?„) -> 0 for every sequence (?„) of disjoint sets from 7?]0. oo] (see Brooks and Dinculeanu A974) for results concerning strong additivity). As a result, for every decreasing sequence A„ 4- 0 from 7?]0. oo], the sequence (?(?„)) is convergent in ?. In particular, for A„ = ]s,t„] ? F, with tn i s and F e 2T.V, the limit ? = lim„ ?(]?,?„] ? F) exists in E. By assumption C), we conclude that (x, z) = 0 for every ;eZ, which proves B). Next assume B) and prove A) (cf. also Metivier A974). Let t„ I s and let ? > 0. Let (?;) be a finite partition of ]s,t\] ? ? consisting of disjoint predictable rectangles such that |?|(]?.?|]??)<53|?(?|+?. к
Stochastic processes and stochastic integration in Banach spaces 483 For any set A e 7?]0, t\ ] contained in ]?,?|]?? we have ? \\?(? П Rk)\\ 2 ? ?^?*)|| " ? У(ЛСП Rk)\\ к к к ? |?|(]0, ?,] ? ?) - ? - |м|(Л') = \?\(?) - ?. If we apply the above to A = ]s. t„] ? ?, we obtain \?\ (]s, t„] ??)^? \\?{?'.*?] x ? П ??) ? + ?. Now let ? -> ?? and obtain lim„ |?|(].?,?„] ? ?) ^ ?, which proves A). ? Step 2. (a) For each z&Z, the quasimartingale {X, z) has a cadlag modification. (b) There exists a cadlag negative submartingale ? such that \??\ = ?? on 7?]0, oo]. Assertion (a) follows immediately from Lemma 2.5 below, since {X,z) satisfies condition F') which is equivalent to condition A) of this lemma, for (X,z). Assertion (b) will be a corollary to the following two lemmas. In fact, from Lemma 2.3, we deduce first that \?? | satisfies condition (i) in Lemma 2.3, and then deduce that there exists a negative submartingale ? such that \??\ = ?? оп7?]0,оо]. By Step 1, \??\ satisfies condition (ii) of Lemma 2.5, which insures that ? can be chosen to be cadlag. Lemma 2.3. (a) Let ? be any Banach space and let ? be an adapted ?-valued process with Y, e L[E(P)forall t ^ 0. The ? is a quasimartingale on ]0, do] if and only if its Doleans measure ? = ?? satisfies the following condition: (i) For every t < oo, there is a function f, ^ 0 in L[(P) such that |?|(]?, эо] ? F) ^ E(\Ff)JorFeEt. (b) Assume that ? has RNP and let д:7?]0,эо] —> ? be an additive measure with bounded variation \?\ satisfying condition (i) above. Then there is an ?-valued quasimartingale ? such that ? = ??. PROOF, (a) Assume first that У is a quasimartingale on ]0,oo]. For any predictable rectangle ]u, ?] ? F, set IY(]u, ?] ? F) = lF(Yv - Yu). Then ? ? is an L}(P)-valued, finitely additive set function defined on the semiring of predictable rectangles; it can be extended to an additive measure on 7?]0, do]. For every A e TZ]0, oc], we have ?? (A) = E(IY(A)). Let t < oo and let ? be the set of all functions in L[(P) of the form ?\\?{??(?^\?,)\\
484 J. K. Brooks where (A,-) is a finite family of disjoint predictable rectangles contained in ]t,oo] ? ?. Note that for each such family, we have J]||/y(A,)|1^|MH(]0,oo]xi2)<oo, i hence the set ? is bounded in L' (?). We next show that W is directed for the usual order in L\P). If (A,) and (Bj) are two finite families of disjoint predictable rectangles contained in ]?, oo] ? ?, and if (Bj) is a refinement of (A,), then using the additivity of ?? we obtain ^|?(/y(A,-)|r,)|iC^|?(/y(S,)|r,)[[. i i This result then implies that ? is directed in Lx (?). ?. being bounded and directed in the space Ll(P) of equivalence classes of integrable functions, it has a supremum in L](P), and we choose a representative f, in this equivalence class. For any set F e ?, and any finite family (A,) of disjoint predictable rectangles contained in ]t,oo] ? F, we have J2\\E{lY(Ai)\E,)\\ iClf/, a.s.. hence Х)|ду(А/)|^?(Ь-/,); consequently lMH(]r,oo]xF)iC?(b-/,). Conversely, if (i) is satisfied for t = 0, then ?? has bounded variation on ]0, oo] ? ?, hence У is a quasimartingale on ]0, oo]. (b) Let ? be an F-valued additive measure on 7?.]0,oo] with bounded variation \?\ satisfying (i). For fixed t < oo, the mapping F н> ?(]?, oo] ? F) is a countably additive measure on ?, with bounded variation and is absolutely continuous with respect to P. Since ? has the RNP, there is a function Y, e L\. (P) such that ?(]?,??] ? F) = -E(\yY,) forFeXV It follows that ? = ?? and since ? has bounded variation on 7?]0,oo], У is a quasimartingale on ]0, oo]. ? Remark 2.4. In the scalar case, a property similar to (i) in Lemma 2.3 was proved in Kussmaul A977), in a different setting.
Stochastic processes and stochastic integration in Banach spaces 485 In the scalar case, the following lemma characterizes the right continuity of ? in terms of the corresponding measure ??. The vector case will be considered later (Theorem 2.13). LEMMA 2.5. Let ? be a real valued, additive measure on 1Z]0, oo] with bounded variation \?\ satisfying condition (i) in Lemma 2.3 and let ? be a real valued quasimartingale on ]0, oo] with ? = ??. Then ? has a cadlag modification if and only if lim^(]j,(]xF)=0 fors < oo and F e ?,. A6) r is If, moreover, ? ^ 0 on 1Z]0, oo], then ? is a negative submartingale; ? has a cadlag modification if and only if ????(]5,?] ??)=0 fors<oo. A7) tls PROOF. If ? has a cadlag modification, then, by Proposition A.4.1, ? is right continuous in the mean, hence lim,j,A Е{\уФ,) = Е(\уФн) for s < oo and F e Es, and this condition is equivalent to A6); if ? ^ 0, this condition is equivalent to A7). To prove the converse implication, assume first that ? ^ 0on7?]0, oo] and satisfies A7). Then ? is a negative submartingale and A7) is equivalent to Нт?(Ф,) = ?(ФЛ) fors<oo. lis By Brooks and Dinculeanu A987), ? has a cadlag modification. If ? is real valued and satisfies condition A6), then by Lemma 2.2, its variation |?| satisfies A7). By the first part of the proof, there are two cadlag negative submartingales ?+ and ?~ satisfying ?+ = ??+ and ?~ = ??-. Then ?? = ?,?+_?-), hence ?+ - ?~ is a cadlag modification of ?. D Remark 2.6. A similar result was proved in Kussmaul A977) for measures satisfying a certain "condition (S)". The above lemma will be extended to quasimartingales with values in a Banach space (Theorem 2.13). For every simple stopping time ?, ?? is an ? ? -measurable, integrable random variable. We shall extend, in Step 3 below, the definition of ?? by means of a random variable ??, where ? is an arbitrary stopping time T. To do this we need the following lemma. LEMMA 2.7. If ? is an ?-valued quasimartingale on ]0, oo], then for every stopping time ? and for every decreasing sequence (T„) of simple stopping times with T„ I T, the sequence (?(???\??)) is Cauchy in L\-. PROOF. We remark first that if ? ^ U ^ V, where U and V are simple stopping times, then E((YV -??)\??I<:\??\(]?.?]).
486 J. K. Brooks In fact. ? \\E((YV - ??)\??)\\?? = sup J2 ? \\?((??-??)\??)??\\ = ™?? l· \\(Yv-Yv)dP\\ = sup?|/i>-(]t/. V]nR+ xFj)\\ i ?\??\(]?.?]). where the supremum is taken over all finite families (F,) of disjoint sets from ?? ¦ Then, taking U = Tn+\ and V = T„, we obtain ? \\?{(??„ ~ ??„+?)\??I iC ^|му|(]Г/| + 1. Г,,]) iC |My|(E+ ??)<??. /1 /1 It follows that the series ?? ?((??„ - ??;;+] )|?Y) is convergent in L^. For any к we have ? ?((??? ~ ??„+] )\??) = ?(?? \??) - ?{??\??). П<к hence ?(??? \ ??) converges in L[-, and this proves Lemma 2.7. ? Remark 2,8. This lemma was proved in Kussmaul A978) under the assumption that ? is right continuous in the mean. STEP 3. For any stopping time ? we can chose ? ??-measurable random variable Хт е LF satisfying the following conditions: ___ (a) If(Tn) is a decreasing sequence of stopping times with T„ 4- T, then ?(??„ ? ??) -> Хт in the mean, (b) If ? is a simple stopping time, then Хт = Хт a.s. (c) If X is right continuous, then Хт = Хт a.s. (d) If X' is a modification of X, then X'T = Хт ci.s. (e) If ? eZ andY = {X, z), then ?? = {XT.j) a.s. (f) IfT„ I oo, then for any t < oo we have ??,,^? -*¦ %t я-*- We first take a decreasing sequence (T„) of simple stopping times Tn with T„ 4. T. By Lemma 2.7, the sequence (?(????\??)) is Cauchy in L\:, and has a limit и е L[. This limit is independent of the sequence (T„). In fact, let (S„) be another decreasing sequence of simple stopping times converging to T, and let ? = lim„ ?(?$„ \??) in LE. Let ; e Z. By Step 2, the real quasimartingale ? = {?,?} has a cadlag modification W. By Proposition A.4.2, we have WT e L1 and WT„ -> WT and W.Sn -> W/ in L1;
Stochastic processes and stochastic integration in Banach spaces 487 consequently both sequences (?(???\??),?) and (/?(?^,|.??), z) converge to Wt in L1. Thus {u,z) = Wt = (v, z) a.s. Since ? is countable, we have и = ? a.s. We denote by XT a representative of the element lim E(XtJEt) in L[. Then Xr is ?? -measurable. For ? e ? and У = (X, с), we then have (Хг,г} = (Нт?(Х7-„|Гг).;} = Нт?((Х7;,.;)|Гг) = ?г. and this proves (e). If X' is a modification of X, then X'T = ??„ a.s., for simple stopping times ?„ and (d) follows. Also (b) follows in a similar fashion. If X is right continuous, then (c) follows by Proposition A.4.2. Property (a) holds by definition if (Г„) is a sequence of simple stopping times. Now let (Г„) be an arbitrary decreasing sequence of stopping times converging to Г, and prove that E{Xj„ \??) -> Xr in L\:. For each « choose a simple stopping time Sn with ?„ ^ S„ ^ ?„ + I In and ?^?,,,?^-??,,?^?/". Hence llECXsJ-^-^xYj^r)!, <!/"¦ We can choose the S1,, decreasing to ?, hence ?(X.y„ ?-??7) -»¦ %r and then (a) follows. To prove (f), let ? e ? and let W be a cadlag modification of ? = (?, ?), which exists by Step 2, Then a.s., (??„?,, с) = ?7„л, and W, = (?,,?). Let N С ? be a ?-negligible set such that for all ? e /V, the above equalities are valid for a fixed f, for all integers ? and all г in a countable norming set Z. For ? e /V, choose « such that ?„(?) > ?. Then (??„?,(?), г) = WTnAI(w) = W,(co) = (?,(?), ?), hence ?7„?,(?) = ?,(?), which proves (f). Step 4. (a) For any stopping time ? we have \\??\\ ^ —?? a.s. (b) For any stopping time S ^ ? iff have \\E(Xr — Xs)\\ ^ ?(?? — ?s)¦ For ? & ? and Л е 7Z]0,00] we have \?{?.:)(?)\ = \(??(?),?)\ iC ||;||??(?). hence | < X,, с > | ^ —Ф, a.s. for each t ^ 0. Let (Г„) be a sequence of simple stopping times, T„ 4. Г. Then | {X7;,, c) | ^ -??„ a.s. for each «, therefore ?((Х7-„,г)|Г7-)^-?(Ф7„|Г/) a.s.
488 J.K. Brooks Let ? —» oo and obtain \{??,?)\^-?? as. Since ? is countable and norming, we have \\?? II ^ — ?? a.s. For 5 ^ ? simple stopping times we have \\?(??-?3)\\ = \\??(]?,?])\\^\??\(]?,?]) = ??{]?,?]) = ?(?? - <?s)- Then (b) follows by approximating general stopping times by simple ones. ^ For any stopping time ?, we define now the "stopped" process XT as follows: (XT), = ????, fori ^ 0. STEP 5. For each stopping time T, the stopped process XT is a quasimartingale on ]0, oo] satisfying the regularity condition \imE(lFXf) = E(lFxl) for s < oc, F e ?,. If 5? ^ Si ^ ¦ · ¦ < S„ is a finite family of simple stopping times, then by Step 4(b), ?№+, -xTs,)\\ = EHW, -xt.Si)\\ i i ^??(??^-< -*7-AS,)^Var*]0,oo]. The last inequality is valid by Proposition A.4.3 since ? is a cadlag quasimartingale. Taking the supremum for all families E,) as above, and using Proposition A.2.3, we obtain VarjrJO, oo] ^ Var<2>]0, do] < oo, hence XT is a quasimartingale on ]0, oo]. Since ? is cadlag, it is right continuous in the mean by Proposition A.4.1. Then by Proposition A.4.2, we have ]im Е(ФТмг -Фт^ч)=0, r Is therefore \imE(lFXj) = E(lFXj). and this proves Step 5.
Stochastic processes and stochastic integration in Banach spaces 489 STEP 6. If S and ? are stopping times with S ^ ?, then (XT)S is a modification of Xs. We remark first that, by Step 5, we can apply Step 3 to XT and define for every t ^ 0, (XT)sAt; for notational convenience, we shall abuse notation and we shall denote the latter simply by (??)$??- Hence the process (XT)S is define by Step 5 and is a quasimartingale satisfying the regularity condition (R). Let S„ I S, S„ simple stopping times, and let t ^ 0; then Sn At 4- S At and Sn л ? л t I S a t, hence, since the Sn are simple. = Xsai = X, in LE. It follows that (XT)f = X? a.s., hence (XT)S is a modification of Xs. Next, for each ? we define the stopping time r„ = inf{i: -?,^?). Sine ? is cadlag, we have Tn f oo. Set X"" = X7". STEP 7. X(,,) is a quasimartingale of class (D) on ]0,oo] a/id satisfies the regularity condition lim?(lfX,(")) = ?(l/-X!")) forsoo, FeE,. In fact, for simple stopping time ? we have by Step 4(a): \\Х?\\^-Фтлтп ^п+ФТ11еЬ1. since ? is cadlag (see Proposition A.4.2). This shows that X' is of class (D). The other assertion was proved in Step 5. STEP 8. X(,,) can be decomposed uniquely (up to an evanescent set) in the form X"" = Min) + A"", where M(n) is an ?-valued martingale of class (D) and A"" is an E-valued, predictable, cadlag process with integrable variation with Aj" = 0. This is a direct consequence of Step 6 and Lemma 2.11 below. In the proof of Lemma 2.11 we shall use the following lemma. LEMMA 2.9. Let ? be an ?-valued, adapted process such that Y, e L\(P) for each t. The ?? is countably additive and of bounded variation on 7?]0, do] if and only if ? is a quasimartingale of class (D) on TZ]0, do] and satisfies lim?(l/.y,) = ?(l/.ys) for s < oo and F e Es. A8) lis
490 J.K. Brooks PROOF. Assume first that У is a quasimartingale of class (D) satisfying condition A8). This condition is equivalent to Нтму(]5,г] ? F)=0 for s <oo and F e ?\. A9) ? l.s v ' We shall prove that ?? also satisfies the following condition: For any decreasing sequence //„ from ?, with H„ I 0 we have limsup{|My(]5,r])|}=0, B0) " л where ?„ is the family of stochastic intervals ]S, T], such that S and ? are stopping times, S^T and] 5, T] C]0,oo] ? ?„. In fact, for (#„) and S, ? as above, we have \\??(]?, T])\\ = \\E(lHn(YT - Ys))\\ iC E(lHn(\\YT\\ + \\YS\\)). and this last term tends to zero uniformly relative to ]S, T]. We now use the criteria in Metivier A982, Lemma 2, p. 83) and deduce that conditions A9) and B0) imply that ?? is countably additive on 7?]0, oo]. Conversely, assume that ? у is countably additive and of bounded variation on 7?]0, oo]. Then it is immediate that У is a quasimartingale on ]0, oo] satisfying A8). Since \??\ is also countably additive, we have Ит|ду|(]5, t] ? ?) = 0 for.v<oo. By Lemma 2.5, there is a cadlag, negative submartingale ? such that \??\ = ??. Then ||F(l/ry,)| <; -ЕAуФ,) fori ooandFeT,. From this, we deduce that || Y, \\ ^ -Ф, a.s. As a result, || Yj || ^ —Фт a.s. for every simple stopping time T. By Metivier A982, Theorem 13.3) ? is of class (D), hence ? is also of class (D). Remark 2.10. Metivier A982) proved the above lemma under the additional assumption that ? is right continuous. His result follows from the above lemma since right continuity implies right continuity in the mean (see Proposition A.4.1), which, in turn, implies A8). LEMMA 2.11. Assume ? has the RNP. The following two assertions are equivalent: (a) Y is a quasimartingale of class (D) on ]0, do] and satisfies limF(l/-T,) = E(lFY,) for s < oo and F e ?,. lis
Stochastic processes and stochastic integration in Banach spaces 491 (b) ? = ? + A, where ? is an ?-valued (not necessarily cadlag) martingale of class (D) and A is an ?-valued cadlag process with integrable variation and with The decomposition in (b) is unique (up to an evanescent set). PROOF. Assume first (a). By Lemma 2.9, ?? is ?-additive and has bounded variation on 1Z]0, oo]; therefore it can be extended to a ?-additive measure, still denoted by ? у on the ?-algebraof all predictable subsets of ]0, do] ? ?. Consider the measure у defined on the ?-algebra ? of all predictable subsets of ]0, oo[ ? ?, equal to ?? on the predictable subsets of ]0, ??[??, and equal to 0 on the predictable subsets of {0} ? ?. Then у is ?-additive and has finite variation on V. Since ? has the RNP, there is a predictable ?-valued process such that ||g|| = 1 and v(B)= I gd\v\ for В e V. f gd\v Jb Since ?? and у vanish on evanescent sets, there is a unique (up to an evanescent set), predictable, cadlag, increasing, integrable process V associated to \v\ (Dellacherie and Meyer, 1978,VI.65),by \v\(B) = e( J lBdVj forBeP. Since |y|({0} ? ?) = Owe have V0 = 0. Then v(B) = e( (\Bgsdv\ for Sep. The process A, = L· t,gsdVx is cadlag, predictable, with A(> = 0 and v(B) = e( \\BdA \=??(?) for Be P. The variation \A\ of ? is integrable, since it is predictable and satisfies \A\ ^ V. For В е 7?]0, oof we have ??(?) = ?(?) = ??(?). If we set ? = ? — A, then ?? = 0 on 7?]0, oof, hence ? is martingale. Moreover, ? is of class (D) since both ? and A are of class (D), and this proves (b). The uniqueness of the decomposition follows from the fact that, (as in the scalar case), an ?-valued, predictable, cadlag martingale with finite variation and vanishing at 0 is equal 0 everywhere (except on an evanescent set). Conversely, if we assume (b), then ? is of class (D) and ?? = ? a on 7?]0, oof and ?? = ?? on {oo} ? ?. Since ?« and ?.\ are ?-additive and of bounded variation on
492 J.K. Brooks {00} ? ? and 7?]0, ??[, respectively, it follows that ?>- is ?-additive and with bounded variation on 7?]0, 00]; hence, by Lemma 2.9. У satisfies condition (a). D STEP 9. We have X — ? + A as in assertion G) of the statement of the theorem. By Step 8, for each ? we have a unique decomposition X"" = M"" + A"", where M1 is a martingale of class (D) and A"" is a cadlag, predictable process with integrable variation and Aq" = 0. Let N' be a cadlag modification of M(n) (see (Brooks and Dinculeanu, 1987)). Then У("> = N(n) + A"" is cadlag modification of X("K By Step 6, (X("+1))r" is a modification of X""; (X<"+1>O'« therefore (y<"+1>O'» is a modification of У""; since they are cadlag, we have (У1"*"O" = y1 (outside an evanescent set). It follows that (N("+U)T" = N{,,) and (A<"+"O« = A<">. Then we can define the cadlag processes У, N, A such that these processes, stopped at T„ are equal, respectively, with Y{"\ N[n), A"". It follows that N is a cadlag local martingale and A is a cadlag, predictable process with finite variation and with A() = 0, and ? = N + A. We deduce, using Step 3(f), that У is a modification of X. Hence the process ? = X - A is a modification of the local martingale N, and for each « and ? we have (MT»), = X\n) - A™ = N™ a.s„ hence M7" is a martingale of class (D). It follows that ? is local martingale and this proves the implication 6' => 7 in case X is a quasimartingale on ]0, do]. STEP 10. Assume now that X is a quasimartingale on each bounded interval. Then for each a < 00, the stopped process X" is a quasimartingale on ]0, do] satisfying assertion F') of the theorem. By Step 9, we have X" = Ma + A" with M" and A" as in assertion G) of the theorem. By the uniqueness of this decomposition, taking a = 1,2,..., we have (X"+1)" = X" hence (M"+1)" = M" and (A"*1)" = A"; hence there is a local martingale ? and a predictable cadlag process A with finite variation and Aq = 0 such that ? and A stopped at ? are equal respectively with N" and A"; hence X = ? + A, and the theorem is completely proved. D The next theorem gives equivalent conditions for X to be actually cadlag (rather than having a cadlag modification). In particular, it states that X is a strongly cadlag if and only if it is weakly cadlag. THEOREM 2.12. Assume that ? has the RNP and that X has separable range and is a quasimartingale on each bounded interval. Let ? С ?" be a set which is normingfor the range ofX, The following assertions are equivalent (up to evanescent sets): (a) X is cadlag; (b) X is right continuous; (c) X is weakly right continuous;
Stochastic processes and stochastic integration in Banach spaces 493 (d) (X, z) is right continuous for every ;eZ; (e) X = ? + A, where ? is a cadlag local martingale and A is a cadlag predictable process of finite variation with Aq = 0. PROOF. The implications (e) => (a) => (b) => (c) => (d) are obvious and valid in any Banach space (not necessarily having the RNP). Now assume (d) and show (e) holds. We can assume ? is countable, since the range of X is separable. By Proposition A.4.1, {X, z) is right continuous in the mean for each ; e Z, hence it satisfies condition F') of Theorem 2.1: \imE((lFX,,z)) = E({lFX,,z)) fori <эо, F e ?,, ze Z. Then by Theorem 2.1, X = N + A where N is a local martingale and Л is a cadlag predictable process with finite variation and A(> = 0. Let ? be a cadlag modification of N (see Theorem 2.15). Then ? = ? + A is a cadlag modification of X. This implies that (Y, z) = (X, z) up to an evanescent set, for each fixed ; e Z. Since ? is countable, X = ? up to an evanescent set and the theorem follows. ? Lemma 2.5 can now be completed as follows. THEOREM 2.13. Assume that ? has the RNP. Let ?:??]0,??] -> ? be an additive measure with bounded variation |?|. Then there exists a cadlag quasimartingale ? on ]0, oo] such that ? = ?? if and only if the following two conditions hold: For every t ^ 0, there exists a function f, e L' (P) such that |M|(]f,oo]xF)iC?(lf/,) forFeE, B1) and ??????,?] ? F) =0 for s < oo and F e 2ГД. B2) ? is PROOF. Since ? has the RNP, by Lemma 2.3, ? satisfies B1) if and only if there exists an ?-valued quasimartingale W such that ? = ?\?. Then by Theorem 2.1, W has a cadlag modification У if and only if condition F) of Theorem 2.1 holds, which is equivalent to condition B2) above. ? 2.5. Cadlag modification without RNP In this section we shall investigate the existence of right continuous or cadlag modifications of X without assuming that ? has the RNP. The characterization of cadlag modifications of X is given in terms of its right limits X+ along the rationals (any countable set dense in E+ can be used). The following results are proved: A) If X is a local martingale then X+ exists (outside an evanescent set) and is a cadlag modification of X.
494 J, K. Brooks B) If X is a quasimartingale and if X+ exists (outside an evanescent set), then X+ is a modification of X if and only if the regularity condition (R) holds. If X is a real-valued quasimartingale, then X+ and X- exist (outside an evanescent set). C) If X is a process with integrable variation, then X+ (which exists everywhere) is a modification of X if and only if condition (R) holds. 2.5.1. Right and left limits. For ? e ? and t e R+, we denote (in this section only) ?,+(?) = lim ?,-(?), ?,-(?) = lim ?,-(?), r U с ?' for r rational, whenever these limits exist. We denote by X+ and X- the functions (t, ?) н> ?,+(?) and (t, ?) н> ?,_(?), respectively. We notice that if X+ is defined on a set ? = Ш+ ? F with F С ?, then X+ is right continuous on M. Moreover, X+ is cadlag if and only if X- also exists on ?. We know that if X is a real-valued supermartingale, then X+ and X_ exist a.s., i.e., outside an evanescent set, see Dellacherie and Meyer A978, VI.3.). It follows that if X is a real-valued quasimartingale, then X+ and X- exist a.s. But if X is an ?-valued quasimartingale, we do not know whether X+ and X exist. We list first some general properties of X+. Lemma 2.14. (a) IfX has a right continuous (respectively cadlag) modification, then X+ (respectively X+ and X-) exist a.s. and X+ is a right continuous (respectively cadlag) modification of X. (b) Let (T„) be an increasing sequence of stopping times Tn f oo, such that XT" has a right continuous (respectively cadlag) modification for each n. Then X+ (respectively X+ and X-) exists a.s. and X+ is a modification of X. (c) Let ? С ?" be a countably set, norming for the range of X. If X+ exists a.s. and if (X,z) has a right continuous modification for each ? e Z, then X+ is a right continuous modification of X. PROOF, (a) If У is a modification of X, the assertion follows from the equality Xr = Yr a.s. for each r rational. (b) For each и, (Хт")+ exists, by (a), and is a modification of XT". Then X+ exists a.s. and (XT,<),+ = XTt" a.s. for each t; letting ? -+ oo we get Xt+ = X, a.s. for each t. (c) For each ? e Z, (X, z)+ exists a.s. and is a modification of {X, z). Then (Xl+,z) = (Xl,z)+ = (X„z) a.s. for each ? e ? and t ^ 0. Since ? is countable, X,+ = X, a.s. for each t. О 2.5.2. Local martingales. We call X a local martingale if there is an increasing sequence (Г„) of stopping times with Г„ t oo a.s., such thatforeach n, XT" is a martingale (not necessarily right continuous). This means that for each ? and t, X," is integrable;
Stochastic processes and stochastic integration in Banach spaces 495 but Xj is not necessarily integrable if ? is an arbitrary stopping time. We note that a modification of a local martingale is not necessarily a local martingale. THEOREM 2.15. If X is a local martingale then X+ and X- exist a.s.\ X+ is a local martingale and a cadlag modification of X. PROOF. Let ?„ f oo such that XT" is a martingale for each n. By Brooks and Dinculeanu A987), each XT" has a cadlag modification. Then we apply Lemma 2.14, (a) and (b), to deduce that (XT")+ is a modification of XT" and X+ is a modification of X. Moreover, X+ is a local martingale, since for each ? we have (X+)T" = (XT")+ and (XT")+ is a martingale. ? 2.5.3. Quasimartingales We first state a new theorem on real-valued quasimartingales. In the sequel, condition (R) will be referred to as condition B3). THEOREM 2.16. If X is a real valued quasimartingale on bounded intervals, then X+ and X- exist a.s. and are quasimartingales. X+ is a cadlag modification of X if and only if condition B3) holds: \imE(lFX,)=E(lFX,) fors < oo and F еГ,. B3) PROOF. Without loss of generality we can assume that X is a quasimartingale on ]0, oo]. Then ?? has bounded variation \??\ on Щ0, oo] and ? + and ?? are positive additive measures on 7?]0, oo]. Let ? and ? be negative submartingales such that ? J = ?<*> and ?? = ??. Then ?+, ?+, ?+ and ?- exist a.s. and are submartingales (Dellacherie and Meyer, 1978, VI.3). From the equality ?? = ??-? on TZ]0, oo], we deduce that ? - ? is a modification of X. It follows that X+ and X- exist a.s. and are modifications of the quasimartingales ?+ — ?+ and ? — ?-, respectively; hence X+ and X- are also quasimartingales. If X+ is a right continuous modification of X, then X is right continuous in the mean (Proposition A.4.1) and condition B3) follows. Conversely, if condition B3) holds, then, by Lemma 2.5, X has a cadlag modification; by Lemma 2.14(a), X+ is a cadlag modification of X. ? THEOREM 2.17. Assume that X is a quasimartingale on bounded sets, has separable range and X+ exists a.s. Then X+ is a right continuous modification of X if and only if condition B3) holds. PROOF. Assume first that X satisfies condition B3) and let ? с ?" be a countable set, norming for the range of X. Then for each ? e Z, (X, z) is a real-valued quasimartingale satisfying condition B3). By Theorem 2.16, (X, z)+ exists a.s. and is a cadlag modification of (X, z). Then, by Lemma 2.14(a), X+ is a right continuous modification of X. Conversely, if X+ is a right continuous modification of X, then X is right continuous in the mean (Proposition A.4.1) and condition B3) follows. ?
496 J.K. Brooks 2.5.4. Processes with finite variation. The following theorem is new even in the scalar case. We remark that a modification of a process with finite variation does not necessarily have finite variation. THEOREM 2.18. Assume X has finite variation \X\ and that \X\, is integrable for every t. Then X+ {which exists everywhere) has integrable variation on bounded intervals. Moreover, X+ is a modification of X if and only if condition B3) holds. PROOF. The existence of X+ and X_ follows from the inequality \\X,-XS\\ ^ |X|,-|X|A fori < t. From this inequality we deduce \\Xt+ — Xs + W ^ \X\t+ — I-KL + fori < t. Since \X\, is increasing, it follows that X+ has finite variation |X+|, and that \X+\ ^ \X\ + . X+ is right continuous and adapted, hence its variation |X+| is right continuous and adapted. From \X+\, ^ \X\,+ ^ |X|,+i e L1 we deduce that \X+\, e L1 for every t ^ 0. Now, since X has integrable variation on bounded intervals, it is a quasimartingale on bounded intervals. By Theorem 2.17, X+ is a cadlag modification of X if and only if condition B3) holds, and the theorem is proved. ? Remark 2.19. If the variation \X\, of X on [0. t] is integrable, then it is ^-measurable. In fact, the set ? of functions of the form ]Г, !№, + , - %t, II with 0 = ?? < ¦ ¦ ¦ < ?„ = t is directed and bounded in L\Et), hence sup<? exists in L[(Et). Since \X\, = sup<? a.s. it follows that \X\, is ?1,-measurable. 2.6. The Doob-Meyer decomposition of quasimartingales Theorems 2.1 and 2.12 can be viewed as giving equivalent conditions for the Doob-Meyer decomposition X = ? + A of a quasimartingale, in terms of right continuity. Lemma 2.11 gives conditions for the Doob-Meyer decomposition of quasimartingales of class (D). The following theorem is an extension of both Lemmas 2.9 and 2.11 for quasimartingales of class (LD). THEOREM 2.20. The following two assertions are equivalent: (a) X is a quasimartingale on bounded intervals, is of class (LD) and satisfies B3); (b) ?? has bounded variation on each ring TZ]0, a] with a < oo, and is ?-additive on TZ]0,oo[. If ? has the RNP, the above assertions are equivalent to (c) X = ? + A, where ? is a (not necessarily right continuous or bounded) martingale and A is a cadlag, predictable process with Aq = О and with integrable variation on each bounded interval. The decomposition is unique up to evanescent set. For the correspondence X ++ ?? ++ ? +? established above we can state the following completions. THEOREM 2.21. X is a quasimartingale on ]0, oo[ of class (LD) and satisfies condition B3) if and only if ?? is ?-additive and of bounded variation and on Щ0, oo[ if and only if ? is a martingale and A has integrable variation.
Stochastic processes and stochastic integration in Banach spaces 497 THEOREM 2.22. X is a quasimartingale on ]0, oo] of class (LD) and satisfies condition B3) if and only if ?? has bounded variation on 1Z]0, oo] if and only if ? is a martingale bounded in L]E and A has integrable variation. THEOREM 2.23. X is a quasimartingale on ]0, oo] of class (D) and satisfies condition B3) if and only if ?? has bounded variation and is ?-additive on 1Z]0, oo] if and only if ? is a martingale of class (D) and A has integrable variation. PROOF. Theorem 2.23 is proven in Lemmas 2.9 and 2.11. To prove Theorem 2.20, we apply Theorem 2.23 for each ? e N to the stopped quasimartingale X". The implication (a) =>· (b) follows then by noticing that if ?? is ?-additive on each ring 7Z]0, и], then it is ?-additive on Щ0, oof. The implication (a) => (c) is proved by decomposing X" = M" + A" where M" is a martingale of class (D) and A" is a predictable, cadlag process with integrable variation and ? ? = 0. From the uniqueness of this decomposition we deduce the existence of two processes ? and A which stopped at ? are equal to M" and A", respectively. Finally, we remark that ? is itself a martingale. The implications (b) =>· (a) and (c) =>· (a) are evident. Theorems 2.21 and 2.22 are immediate. D Appendix A In this appendix we shall collect some properties of the concepts involved in this chapter. АЛ. The Doleans function The following properties hold for the Doleans function ? ? introduced in Section 2. АЛЛ. X is a martingale if and only if ? ? = 0 on 7Z]0, oof. АЛ.2. X is a submartingale if and only if ?? ^ 0 on 7Z]0, oof. X is a negative submartingale if and only if ?? ^ 0 on7Z]0, oo]. АЛ.З. If ? is another process like X, then ?? = ?? on 7Z]0, oof if and only if X - ? is a martingale. We have ?? = ?? on Щ0, oo] if and only if X is a modification of ?. АЛА If У is a negative submartingale and if \??(?)\ ^ ??(?), for A e 7Z]0, oo], then \X,\ ^ —Y, a.s. for each t. АЛ.5. For any stochastic interval ]S, T], with S and ? simple stopping times, we have ??]8,?]=?{??-?8). АЛ.6. Assume X is a quasimartingale on ]0, or], for each ? < oo and let ? be a total subset of ?". The following conditions are equivalent for any s < oo:
498 J. K. Brooks (a) ????.?|??|(]?,?] ??) = 0; (b) lim(iJMX(]j,f]x F) = Oforany F e ?,- (c) ????,?^???,?] ? F),z) = 0, for any Fel, and с e Z; (d) \imasE(lFX,)=E(lFX,), for F e ?„\ (e) lim,is(?(lFA:,), г) = (?A/- Xs), c), for F e 2Tt and ? e Z. The equivalence of (a), (b), (c) was proved in Lemma 2.2. The equivalences (b) о (d) and (c) o· (e) are evident and are valid even if X is not a quasimartingale. A.2. Mean variation The mean variation was introduced in Section 3. A.2.1. PROPOSITION. For any ? ? oo, we have Varx]0,a] = sup5]||?(lfI(X,i-X.fl)|2:(,)||1, where the supremum is taken over all finite families (]s,. ?,] ? F,) of disjoint predictable rectangles contained in ]0, ?] ? ?. To prove the above, arrange the endpoints л, and ?, in an increasing order, 0 ^ и \ ^ ¦ ¦ ¦ ^ u„ = a. Then ]j/,f/] ? F, = [J ]и;, иу_ц] ? A/y, where A,, = 0 if ]s,-, Г,] П ]иу, uJ + \] = 0 and Ai; = F, e 2Г„; if s, ^ uj < t,. For fixed /, the sets A,y and A^ are disjoint if ?'? к. Then Yd\E{\F,(X,i-Xs,)^,)\x^Yi\E{(Xlli^-Xl,l)\EU])\x^yax]0,a], and this inequality is preserved if we take the supremum in the left hand side. The converse inequality is evident. A.2.2. PROPOSITION. For any a ^ oo, we have |??|(]0.?]??) = ?8??]0.?] and |?? |(]0.?[??) = Varx]0,a[. For the proof, use Proposition A.2.1 above and |F(X,,+, -?,,|?,,)||,=5??|?((?(?,, + , -?,,\?,??))\ the sup being over the ?1,,-measurable step functions / with ||/||эс ^ 1- See Pellaumail A973,2D1) for the real case and Metivier A982, ex. E11.2).
Stochastic processes and stochastic integration in Banach spaces 499 A.2.3. PROPOSITION. For any a < oo, we have \arx]0,a] = sup^2\\E<,Xs,.l - XS,)\\ = sup J] \\E(Xs,.t i i the supremum being taken over all finite increasing sequences 5? ^ stopping times Si, with S„ ? a. The proof is similar to that of the scalar case (Kussmaul, 1977, Theorem 9.2). If X is a right continuous quasimartingale, the above equalities remain valid for arbitrary stopping times Si (see Proposition A.4.3). A.3. Quasimartingales For the definition of quasimartingales see Section 3. A.3.1. X is a quasimartingale on ]0, a[ or on ]0, a] if and only if ?? has bounded variation on 7?]0, а[ or 7?]0, a], respectively. A.3.2. Any martingale is a quasimartingale on ]0. oof; it is a quasimartingale on ]0, oo] if and only if sup, \\X,\\i < oo; in this case Var;e]0, oo] = sup, HXJi. A.3.3. Any negative submartingale and any positive supermartingale is a quasimartingale on]0, oo]. A.3.4. A process X with integrable variation (that is, E(\X\X-) < oo) is a quasimartingale on ]0, oo]. A.3.5. X is a quasimartingale on ]0, oo] if and only if X is a quasimartingale on ]0, oof and sup, \\Xi\\\ < oo. A.3.6. X is a quasimartingale on ]0,a] if and only if the stopped process Xa is a quasimartingale on ]0, oo]. A.3.7. IfX is a quasimartingale on ]0, a], then ||X|| is a quasimartingale on ]0, a]. A.3.8. PROPOSITION. X is a quasimartingale on ]0, oof (respectively on ]0, oo]) if and only if the following condition is satisfied: For every t < oo, there is a positive function /, e L' (P) such that for every F e ?, we have №x\(]t,oc[xF)^E(lFf,) (respectively \??\(]?,??] ? F)^E(lyf)). This is Lemma 2.3 for quasimartingales on ]0, oo]. -Х5,-1ЗД|Р ¦ ¦ ¦ ^ S„ of simple
500 J. K. Brooks A.4. Right continuous quasimartingales A.4.1. PROPOSITION. Assume X is a quasimartingale on each bounded interval. X is right continuous in the mean if and only if it is right continuous in probability. If X has a right continuous modification, then it is right continuous in the mean. The above proposition is due to Orey A965-66) for the scalar case. The proof for the vector case is similar. See also Kussmaul A977, Theorem 9.3). A.4.2. PROPOSITION. Assume X is a right continuous quasimartingale on ]0, oo]. Then (a) Xt is integrable for every stopping time T; (b) IfT„ is a decreasing sequence of stopping times converging to T, then Xjn —> Xt in the mean. This is proved by Orey A965-66) in the scalar case. See also Kussmaul A977, Theorem 9.4). A.4.3. PROPOSITION. Assume X is a right continuous quasimartingale. Then the equalities in Proposition A.2.3 remain valid for arbitrary stopping times Si, with S1, < <*. This follows from Propositions A.2.3 and A.4.2. See also Kussmaul A977, Theorem 9.2) for the scalar case. A.4.4. PROPOSITION. Assume X is a quasimartingale on each bounded interval, and let ? С ?' be a total set for E. We have B3) in case any one of the following conditions is satisfied for every ? e Z: A) (X, z) (in particular X) has a right continuous modification; B) (X, z) (in particular X) is right continuous in the mean; C) (X, z) (in particular X) is right continuous in probability; D) (??, ?) (inparticular ??) is countably additive. Indication of PROOF. By Proposition A.4.1, A) => B) and B) <s> C). Then B) implies, for every ? e Z, s < oo and F e ?^: ]imE(lF(X,,z)) = E(lF{X„.z)), which is equivalent to lim^x(]s,t]xF),z) = 0. lis Then apply Lemma 2.2 to deduce B3). Finally, D) => B3) is evident. D The author is pleased to acknowledge the permission of Springer-Verlag to use portions of the material in references Brooks and Dinculeanu A988, 1990, 1991) for this chapter.
Stochastic processes and stochastic integration in Banach spaces 501 References Brooks, J.K. A976), Stochastic processes in Banach spaces. Quasimartingales and stochastic integrals. Rend. 1st. Mat. Univ. Trieste 26 A994) 6-33. Brooks, J.K. A999), Stochastic integration in nuclear spaces. University of Florida. Preprint. Brooks, J.K. and Candeloro, D. B000), The foundation of weak stochastic integration theory in Banach spaces. Tech. Report N. 15, Perugia, 1^4. Brooks, J.K. and Dinculeanu, N. A974), Strong additivirv. absolute continuity and compactness in spaces of measures, J. Math. Anal. Appl. 45, 156-175. Brooks, J.K. and Dinculeanu, N. A976), Lebesgue-rxpe spaces for vector integration, linear operations, weak completeness and weak compactness, J. Math. Anal. Appl. 54. 348-389. Brooks, J.K. and Dinculeanu, N. A987), Projections and regularity of abstract processes. Stochastic Anal. Appl. 5, 17-25. Brooks, J.K. and Dinculeanu, N. A988), Regularity and the Doob-Mexer decomposition of abstract quasimartingales. Seminar on Stochastic Processes, Birkhauser. Basel, 21-63. Brooks, J.K. and Dinculeanu, N. A990), ltd's formula for stochastic processes in Banach spaces. Conference on Diffusion Processes, Birkhauser, Basel. 349-397. Brooks, J.K. and Dinculeanu, N. A991), Stochastic integration in Banach spaces. Seminar on Stochastic Processes, Birkhauser, Basel, 27-115. Brooks, J.K. and Neal, D. A988), The optional stochastic integral. Seminar on Stochastic Processes, Birkhauser. Basel, 45-54. Brooks, J.K. and Neal, D. A991), A characterization ofHilbert stochastic integrals. Atti. Sem. Mat. Fis. Modena 39, 113-129. Brooks, J.K. and Neal, D. A992), A vector measure approach to the optional stochastic integral. Ulam Quart. 1, 17-25. Brooks, J.K. and Neal. D. A998), Generalized stochastic integral for real and Hilbert processes, Atti. Sem. Mat. Fis. Modena 46, 189-247. Brooks, J.K. and Neal, D. A998b), Generalized stochastic integral on nuclear spaces, Atti. Sem. Mat. Fis. Modena 46, 83-98. Dellacherie, С and Meyer, P.-A. A978). Probabilities and Potential, North-Holland. Amsterdam. Dinculeanu, N. B000), Vector Integration and Stochastic Integration in Banach spaces. Wiley-Interscience. New York. Dinculeanu, N. B002), Vector integration in Banach spaces and application to stochastic integration. Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 345-399. Fisk, D.L. A965), Quasimartingales, Trans. Amer. Math. Soc. 120. 359-388. Folmer, H. A972), The exit measure of a supermartingale. Z. Wahrscheinlichkeitsth. verw. Geb. 21, 154-166. Gravereaux, B. and Pellaumail, J. A974). Fonnule de ltd pour des processes a valeurs dans des espaces de Banach, Ann. Inst. H. Poincare 10, 399^422. Kunita, H. A970), Stochastic integrals based on martingales taking their values in Hilbert spaces. Nagoya Math. J. 38, 41-52. Kussmaul, A.U. A977), Stochastic Integration and Generalized Martingales. Pitman, London. Kussmaul, A.U. A978), Regularitat und stochastische Integration von Semimartingalen mit Werten in einein Banachraum, Dissertation, Stuttgart. Metivier, M. A974), The stochastic integral with respect to processes with values in a reflexive Banach space. Theory Prob. Appl. 14, 758-787. Metivier, M. A982), Semimartingales, deGruyter, Berlin. Metivier, M. and Pellaumail, J. A975), On Doleans-Folmer's measure for quasimartingales. Illinois J. Math. 77, 491-504. Metivier, M. and Pellaumail, J. A980), Stochastic Integration. Academic Press, New York. Orey, S. A965-66), F-processes, Proc. 5th Berkley Symposium, Vol. 2, 301-313. Pellaumail, J. A973), Sur I'integrale stochastique et la decomposition de Doob-Meyer, Asterisque, Vol. 9, Soc. Math. France.
502 J. K. Brooks Pratelli, ?. A988), Integration stochastic et geometric des espaces de Banach, Seminaire de Probabilities, Lecture Notes in Math., Springer-Verlag. New York. Protter, P, A990), Stochastic Integration and Differential Equations, Springer-Verlag, New York. Rao, K.M. A969), Quasimartingales, Math. Scand. 24. 79-92. Strieker, C. A977), Quasimartingales, martingales locales, semimartingales et filtration naturelles. Z. Wahr- schenlichkeitsth. verw. Geb. 39. 55-64. Ustunel, S. A982), Stochastic Integration on Nuclear Spaces and Its Applications. Ann. Inst. H. Poincare 18 B). 165-200. Yor, M. A973), Sur les imegrales stochastiques a valeiirs dans mi espace de Banach. С R. Acad. Sci. Paris 277. 467^469. Yor, M. A974), Surles imegrales stochastiques ? valeiirs dans un espace de Banach, Ann. Inst. H. Poincare 10, 31-36.
CHAPTER 11 Daniell Integral and Related Topics M. Diaz Carrillo Departamento de Analisis Matematuo. Universidad de Granada, 18071 Granada, Spain E-mail: madiai@ugr.es Contents Introduction 505 1. Daniell integral extension 505 1.1. Daniell systems of functions and their first extensions 505 1.2. Daniell upper and lower integrals. Summable functions 506 1.3. Characterisations and extensions of integrable functions 508 1.4. Generalisations and related topics 511 2. Integral representations for linear functionals 514 2.1. Measures induced by integrals. Daniell-Stone theorem 514 2.2. Related results and applications 516 2.3. The interplay between measure and topology. Riesz representation theorem 519 3. The abstract Fubini theorem 521 4. The Radon-Nikodym theorem 522 5. Integral norms. Local integral metrics and Daniell-Loomis integrals 524 Acknowledgements 526 References 526 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 503
This page intentionally left blank
Daniell integral and related topics 505 Introduction The Riemann integral extension and its limit theorems have a grave drawback, all connected with the fact that it does not produce enough integrable functions. Reformulating Lebesgue's ideas, Daniell extension shows how to improve on this constructing the upper and lower integrals, analogs of Lebesgue's outer and inner measures. The present article is devoted to a description of the different formulations of the general Daniell scheme of extending certain linear functionals on a vector space, that is not based on a preliminary construction of a theory of measure, this appears as consequence of the theory of the integral. We offer the appropriate abstract forms of classic results as the Radon-Nikodym theorem and Fubini theorem. Moreover, we point out when the "functional approaches" are effective and we note the way to other generalisations. In particular, looking at the integral as an extension by continuity under a suitable seminorm or integral metric, matters simplify considerably. We shall use the following terminology and notational conventions: Ш denotes the extended reals, with the usual conventions concerning the arithmetic and order structure. An assertion ? about extended real numbers is said to hold, if ? is true provided that every sum appearing in ? is defined. When a set Л с К is not bounded from above or below in R, we write sup Л = oo or inf A = —oo, respectively; and if A = 0, then inf A = — sup A = oo. For a nonempty set X we let Rx consist of all functions /: X -> R. The algebraic operations, partial ordering and limits on Rx are defined pointwise. For a subset AcMxwe write +A = {/ e A: / ^ 0}. If a sequence (/„) С Шх converges to / we denote it by /«->/, and if it is increasing or decreasing, we write /„ f / or /„ 4- /, respectively. Given a real vector space В с Rx, we say that ? is a ring of functions if it is closed under taking products, ? is a vector lattice or a Riesz space of functions if it is closed under taking pointwise maxima and minima: f,g&B=$-fvgeB and / л g e В (or equivalently, if / e В also |/| e B, with |/|(.r) := |/(.r)|). В is Stonian if it contains / л 1 for any / e В (Stone's condition). 1. Daniell integral extension This section gives a brief discussion of the Daniell extension procedure. The extended function class L\ (I) of summable functions is defined via three classic methods: equality of the upper and lower integrals, difference of monotone limits with boundedness in integral / and closure of the class of elementary functions with respect to an L \ (I) type seminorm. We provide a simplified treatment of the former method and compare it with the main steps of the others. Well-known text books on this field are Pfeffer A977), Royden A968) and Weir A974), among others. 1.1. Daniell systems of functions and their first extensions Let X be an arbitrary set and 0 ? В С ?? a vector lattice (elementary functions). A linear functional /: ? -> ? is said to be nonnegative if 1(h) ^ 0 for every h e +B\ then it is
506 ?. Diaz Carrillo monotone increasing. In particular, \I(h)\ ^ /(|?|) f°r апУ h e В. A. nonnegative linear functional / on В is called a Daniell integral whenever (D): (/?„) С В, h„ I 0 implies /(/?„) —> 0. The triple (X. ?, /) is called a Daniell system. There are several conditions equivalent to (D) used by different authors (Daniell, Riesz, Loomis, Stone) that are basic for the construction of the integral starting from nonnegative linear functionals (e.g., see Bogdanowicz A967, Proposition 2)). Denote by ?? the class of all E-valued functions / on X for which there is a sequence (h„) С В such that h„ \ f. Let /+(/) = lim /(/?„). Since (D) is equivalent to (D'): if (/?„) and (g„) are increasing sequences in ?, and if lim/?,, < limg,,, then lim /(/?„) ^ lim/(g„), we see that /+(/) is well defined. We list the more relevant properties: В с В *, ?? forms a positive cone and is closed under taking limits of increasing sequences, finite infima and countable suprema. If / e B\ then /+(/) = sup{/(g): g e B, g ^ /} > —oo; /+ is additive, positively homogeneous and increasing on B'. If (/„) с ?? and f„ \ /, then /+(/) = lim I+(f„), that is, /+ is continuous along increasing sequences of ??. As a consequence, the fact that /+ commutes with sup (and series) when it is applied to an increasing sequence (respectively nonnegative functions) from ??, is also especially important. One introduces ?^ and /~ in complete analogy with ?? and /+. The properties of (?^, /~) derive from those of (??. /+) by means of the following elementary assertion: / eBl if and only if -/e fit, and in this case /"(/) = -/ + (-/). Denote ? = Bi ???. Order-continuous systems. ? \ f means that ? is a directed upwards system of functions on X such that / = supjg: g e ?] (with dual definition for ? I /). A nonnegative linear functional / on ? is called a Bourbaki integral on В whenever (B): ? С ?, ? ? 0 implies inf{/(/): / e ?} = 0. ?^ denotes the class of all functions on X for which there is a system ? с В such that ? f /. Put /г(/) = Нт/(Ф). ? (?*, /?) extension is constructed in the dual manner. Recall that B^ с B^ and every Bourbaki integral on В is also a Daniell integral on B, and that has a slightly superior integration theory. Bourbaki integral extensions can be found, e.g., in Pfeffer A977), as an easy kind of dualization. For an arbitrary linear functional /, we can reduce it to the nonnegative case by means of the functional Jordan decomposition (see, for instance, Papangelou A966), Gunzler A973, Lemma 1, p. 169)). Under some additional assumptions Adamski A983, 1986) gives several characterisations of Daniell lattices and strong Daniell lattices (i.e., vector lattices В с Rx, where every nonnegative linear functional ? : В —> ? is ?-smooth or net- continuous). Although, in contrast to Gould A964) and Jacobs A978), Adamski does not use any compactification in the latter case. 1.2. Daniell upper and lower integrals. Summable functions Section 1.1 provides a rich class of functions that will be used to play the role that open sets play in the construction of Lebesgue's measure. Now, the Riemann upper and lower integrals are replaced by slightly different ones, called the Daniell upper and lower integrals.
Daniell integral and related topics 507 For an arbitrary function /eRx one defines the Daniell upper integral 1(f) = inf{/+(g): / < g e ??}, and the Daniell lower integral /(/) := -l(-f). 7 is positively homogeneous, increasing and subadditive for /, g for which / +g is defined everywhere; /^Tonl^andif /e ?? nBi, then7(/) =/(/). Next, the following result, which is usually referred to as the "monotone convergence theorem", is fundamental to the whole theory of integration. THEOREM 1.1. Let(f„)<zRx, f, t f andl(f) > -co, then 7(/„) 11(f)- Now, using Theorem 1.1 we obtain the classic Fatous lemma: PROPOSITION 1. Let (/„) cR* such that l(inff„)> -oo, thenl(inff„) ^ 7(liminf/„) ^ liminf/(/„)· Theorem 1.1 and finite subadditivity of 7 imply its countable subadditivity, which is one of those features of 7 that the Riemann upper integral does not share. From now on we will use L\A) to denote the class of those f еШх for which 1(f) = /(/) e Ш (I-summablefunctions), and we define the common value of 1(f) and 1(f) as the integral of / e L\(I) and denote it by /(/). PROPOSITION 2. / e fif andl(f) < oo implies f e L\. L\ is "almost" a vector lattice of functions containing B, and I is a nonnegative linear functional on L\. The class L \ П R (which forms a genuine vector space on Ш), is not "much smaller" than L\, in fact: if f & L\ and g e Rx, g(x) = f(x) for each ? e X for which f(x) is finite, then g e L\ and I(g) = I(f). Observe that L \ contains functions with values oo and -oo, so L \ is not a vector space (one cannot define a "+" on it such that one gets an Abelian group), this is an error that appears already in Daniell A917/18, p. 288), and also in several books. Of course one can circumvent this difficulty, in Pfeffer A977, p. 61), Floret A981, pp. 81-82), Stroock A994, p. 132), the situation is handled correctly. Also, since the infinity values are attained only on null sets, one can form equivalence classes of functions which differ only on null sets. Another approach which avoids these difficulties is given by an der Heiden A970), where the Daniell method is modified in such a way that automatically a vector space arises. Now, an application of Proposition 1 and its dual, gives the Lebesgue dominated convergence theorem. Theorem 1.2. Let (/„) с L\, g,h e L\, and g ^ /„ ^ h, ? e N. /// = lim/„, then /eL, a^/(/) = lim/(/„). Then, with the above results we can establish that the functional / ???,? nRx is a Daniell integral that extends / on B, and L ? has the closure properties desired. A detailed presentation of the complete results as stated above can be found in Pfeffer A977, Chapters 4 and 5). Also along the same lines see also Royden A968), Taylor
508 ?. Dial Carrillo A965) and Stroock A994). Pfeffer's book contains an excellent collection of examples and counterexamples in measure and integration. In the remainder of this section we group some rich classes of well-known examples of Daniell systems. PROPOSITION 3 (Radon measures). В = Coo(X) (continuous functions with compact support on a locally compact Hausdorff space ?). ? is a Stonian lattice ring of bounded functions. Any nonnegative linear functional on Cqo(X) (it is customary to term such a functional a positive Radon measure on X) is a Bourbaki integral on Q)q(X). The first Daniell and Bourbaki extensions of Q)q(X) coincide if and only if every open subset of X is ?-compact (Pfeffer, 1977, p. 37). PROPOSITION 4 (Linear extensions of set functions). Let ? be an ?-additive measure on a ring R of subsets, Br denotes the lattice ring of step functions over R. The map f ??? = ?,.6??? ([h = r]), h e ?/?, is the unique linear extension of ? to ?/?. This extension preserves ?-additivity, at least for nonnegative measures. Riemann-Stieltjes integral. Let X = [a, b], В = C(X) (continuous functions on X), and let g be an increasing real-valued function on X. For /efi, let /(/) = fj f(x)dg(x), then / is a Bourbaki integral on В. 1.3. Characterisations and extensions of integrable functions The Daniell mean. A variant functional approach to integration of interest considers the integral as an extension by continuity under a suitable seminorm, as introduced by Stone A948/49) and Aumann A952). For a more general exposition of this subject see Section 5 in this chapter. A recent book by Bichteler A998) presents a complete treatment of integration using the Daniell mean || · || associated with the Daniell upper integral /, defined by ||/|| = 7(|/|) e +1 for all / e Rx. Bichteler point out that "already in the case of the Riemann integral, a simple absolute-value under the upper integral, simplifies matters considerably"; also, since the most frequently encountered class of functions (in the context of integration) are lattice rings, he starts with an elementary integral system (X, ?, /), where / is a Daniell integral on a lattice ring ? of bounded functions 0? a set X, and studies the structure of the closure of ? under || · ||. A sequence (/„) с ?? is said to be converging in || · \\-mean to / if ||/„ - /|| -> 0. The functional || · || is absolute-homogeneous, solid (|/| ^ \g\ => ||/|| ^ \\g\\), countable subadditive and finite on ? (elementary integrands). We remark that the utility of || · || lies in that it is upper bound for the elementary integral, \I(h)\ ?. \\h\\ for all /iei. A function / is called || · \\-negligible if its mean ||/|| is zero. A subset A of X is || · ||- negligible if its characteristic function ?? is negligible; and the notion || · \\-a.e. arises as usual. L* denotes the class of || · \\-a.e. defined functions with finite mean. Now, an a.e. defined function / e Rx is called || · \\-integrable if it belongs to the mean-closure off in L*. The class of integrable functions is denoted by L|(?, /). For / e L\ define 1(f) = lim/(/?„),
Daniell integral and related topics 509 where(/?„) с ? converges in mean to /. L\ is complete in mean; moreover, if (/„) C+L| satisfies supA. ||?„_| f„\\ < oo then ||/„|| -> 0, and the properties concerning limits (monotone and dominated convergence theorems) are formulated in terms of mean. For these results, and for more in the same context (on integrable sets, and their connection with integrable functions, measurable functions, L''-spaces, etc.) see Bichteler A998). In this way it is easy to prove the following characterisation: Theorem 1.3. A function f is || · \\-integrable if and only if 1(f) = /(/) e R. We remark that the above construction can easily be extended to functions with values in a Banach space (?, || · || ?¦). In fact, the class L\(E) (Bochner integrable functions) will be the || · ||-closure of ? <g> ? in L*t-. where L*. denotes the class of all ?-valued defined functions on which the mean ||/|| = l(\\f\\t) is finite (Schafke, 1970,1972; Hoffmann and Schafke, 1992). Finally, if signed elementary integrals / are considered, the finite variation /(/) = sup{|/(/?)|: h e ?, |?| <c /} e R for all / e +?, replaces the positivity. Here, L,(/) = L|(|/|), since |/(/)| <c |/|(|/|) <c Daniell mean for |/| (see Bichteler A998, p. 140), GunzlerA973,p. 169)). The closure of a Daniell system. Another functional approach, based essentially on the order structure of the Daniell system (X, B, I), is contained in Constantinescu and Weber A985). The notion of B-exceptional sets associated with the Riesz space В in Шх is introduced (i.e., А с ?, ???? e ?), which have the characteristics appropriate to exceptional sets in a sense of "almost every where", in terms of Riesz spaces, and the notion of B-a.e. also appears as usual. For an increasing functional / on a subset F с Шх, the triple (X, F, /) is said to be closed if, for every monotone sequence (/„) С F with (/(/„)) bounded in R, the function lim/„ e F and /(lim/„) = lim/(/„). For a closed Daniell system, the ?-exceptional sets are exactly those subsets of X whose characteristic function have integral zero. The explicit process for extending a given Daniell system to a closed Daniell system allows several ways of characterising L\ (/), constructed as above, with the notation S+ = {Д|1б1,: /„: (/„) с В. f„ ;}. THEOREM 1.4. Let (?. ?, /) be a Daniell system. Then the following assertions are equivalent, for every f e Шх, a e R (i) For every ? > 0, there exists an ?-bracket of f relative to I (i.e.. a pair (h. g) e ?/ ? ?? such that h <c / <c g. /-(/;). I+(g) e R and I'(g) - r(h) < ?), and 1(f) = L(f) = a. (ii) For every ? > 0, there exists an ?-bracket of f relative to I (h. g) ? Bi ? ?? such that ?(h) iCa iC I+(g). The equivalences are all consequences of the definitions of the space L\(I) and of the functionals/+ and /~ (see Constantinescu and Weber A985, p. 214)). The triple (X. L\(I), /) is the smallest closed Daniell system extending every Daniell system (X, B. I). In addition, the properties of the given Daniell system are characterised (Induction Principle, Constantinescu and Weber A985, p. 313)). We note that the closed Daniell systems constructed here are not yet "integrals" in the sense of the above authors;
510 ?. Diaz, Carrillo an additional step is required adjoining the Daniell system in an obvious way, and then taking the Daniell system closure. This method uses the notion of an "admissible extension" of a Daniell system (X. B, /), that is to say, a closed Daniell system (X, B, I) extending (X, ?, /) such that for every / e В there is a function g e L \ with f = g ?-a.e. In a more recent book by Constantinescu, Filter and Weber A998), the above point of view of integration theory via Daniell methods, is also treated. Difference of monotone limits. For a Daniell system (X, B, I) another standard functional approach to obtain the class of all /-summable functions (originally given by Daniell A917/18)), can be found in classic texts such as Weir A974) or Shilov and Gurevich A966). The main steps are: А с X is said to be null if there is an increasing sequence (/?„) с В for which /(/?„) is bounded and h„ (x) diverges for every ? e A. The corresponding notion of a.e. follows as usual, and if h e Rv, h = 0 a.e. (null function), then h e L\{I) and 1(h) = 0. Let the positive cone Lmc = {/ e ??: 3(?„) С В, h„ \ f a.e. and /(/?„) bounded}. For / e LC, define /(/) = lim/(/j„). One has that any function in L"K is equal a.e. to a function of Вт П L\(I); thus, if / e Rx such that f=h-g,h,ge L"K, then / e L|(/). On the other hand, a similar argument (?-bracket) used to prove ii) in Theorem 1.4, and with the monotone convergence theorem (in the L"u'-process), proves that every /-summable function can be expressed as the above difference h — g. A detailed explanation of the results as stated above can be found in Weir A974, p. 11). Note that the corresponding formulation of the monotone convergence theorem in terms of absolutely convergent series (Weir A974, p. 15)), allows the whole theory of Daniell integrals to be worked out in terms of absolute convergence; this method is considered by Asplund and Bungart A966). Finally, observe that if f e L\(I), there is a function h e B* with h ^ / and /(/) finite. The above definition of null set implies that h, and hence /, can only take the value ooona null set А с X (analogy, with the value — oo), and 1(хл) = 0 (so, it is a negligible set). This point is essential for the usual convention about "linear combination" in ? and "admissible" extension mentioned above. I-integrable functions. A final extension appears in Pfeffer A977, Chapter 6), to extend В U L\ (I) adding to it a large class of functions whose integrals are infinite. L^ denotes the class of all functions / e Rx such that / л h e L\ for all h e В (equivalently, for all h e L\)\ and L~ :— —L^. A function / on X is called I-integrable whenever /eli:=i.f Ui.7. One has ?? с L+ and L\ = LJ П LJ. It is consistent to define /(/) = /(/) if / e L^, and the extension process is closed. Note that an / e +LJ is sometimes called nonnegative /-measurable function (see, e.g., Taylor A965, p. 246), Royden A968, p. 295), Volcic A980)). This final extension has useful properties for studying extended real valued measures (respectively integrals) induced by integrals (respectively measures), as treated in Section 2. Note that if / e L^ and /(/) < 00, then / e L\\ and if 1 e L\, /-integrable sets (Xa & L+) coincide with /-summable sets. We mention that if / is a Daniell integral with corresponding space L\ (I) of summable functions, abstract LP{I) spaces containing all
Daniel! integral and related topics 511 the measurable functions (in the sense of Stone) for which \f\p e L\{I), ? > 1, can be studied (see, e.g.. Weir A974, p. 206)). 1.4. Generalisations and related topics The systematic search for more general basic systems (?,?,/), or certain particular systems, to obtain new applications, as well as to unify methods, has given special functional approaches to integration. Non-lattice integration. Daniell-Stone integration also makes sense if the underlying function vector space E, on which the nonnegative linear / is defined, is not a lattice. The first author to have this idea seems to have been Leinert A982); his aim was to work in abstract harmonic analysis (see Leinert A984) where he gives a proof of Plancherel's theorem). The space L\ is constructed as the closure of ?" in an L \ -type seminorm. Namely, for /eP, 7(/):=inf(?/(/„): f„e+E, ?/„>/); 11/11 = /(|/|) for every / e R*,and L\ is the norm closure of ?" = {/ e ?; ||/|| < oo} in the class of all extended real-valued functions on X with finite norm. Then, the Beppo Levi theorem is proven, and with some weak additional assumptions, the classic convergence theorems follow, which are proved in the usual fashion. These results seem to have been done essentially already by Aumann A952) with the same integral norm. He also proves a Beppo Levi theorem. It could be checked whether the lattice structure is here essential. A similar generalisation is investigated by Konig A982, 1992) with applications to complex algebras of continuous functions closed with respect to uniform convergence and to Hardy algebras. Starting with the sequential continuity condition (?): for each increasing sequence (h„) с ? with lim/?,, > 0 we have lim/(/j„) ^ 0, then L]n(I) is defined as the closure of ? in Rx with respect to the sequential upper integral I„(\ ¦ |) (or Stone's integral norm, see Aumann A952)), where In(f) = infjlim/f/j,^: (/;„) С Е increasing with / ^ lim/?,,}. Equivalently, h\ consists of all functions / e Шх such that h(f) + In(—f) = 0, i.e., it is the linearity space of the sublinear functional l„. To obtain convergence theorems / needs to fulfill a Beppo Levi condition; similar situation is treated with "summenbeschrankt" by Aumann A952), "starke integralnorm" by Schafke A977), and "upper S-norm" by Bichteler A973). Furthermore, in order to obtain the full results, one has to assume the fortified continuity condition (?_): In(f) = I~{f)ior all / ^ 0, where /+ (satellite functional associated with I„), is defined for all / e Rx, as above with (h„) с +Е. Now, the monotone representation theorem is proven: for / e L\ there are u, ? monotone limits from E, such that / = и — ? a.e.; which, combined with the previous description of Lj,, show that the three afore-mentioned classic extension methods to construct the spaces On continue to produce identical results under the present assumptions. Integration in the sense of Bourbaki (which is not covered by Leinert), is considered here as a parallel dualization. Another generalisation along these lines was investigated by Leinenkugel A992), who uses non-lattice integration procedures to specialize Denjoy and Perron-Li spaces. This
512 ?. Diaz Carhllo proves useful in some cases that could not be treated with the usual Daniell-Stone integration theory. Order theoretic, non-additive integration. The idea of an order theoretic presentation of integration was most extensively dealt with Alfsen A958, 1963) (see also McShane A953)). The complete "absence of linearity" requires modifications in the definitions and proofs of standard methods of integration. This question has been reviewed by Riecan A979), who gives a unified and simplified study, with the help of an ordering, of the analogy between the measure theory and the integration theory. He starts with the notion of full integral introduced by Alfsen: Let J: A —>¦ R be a monotone map defined on a sublattice A of a lattice H, satisfying the conditions: (i) J(x) + J(y) = J(x ? v) + J(x л v), for all jr. у e A (J is a valuation); (ii) (x„) с ?, ? & A, x„ ? ? implies J(x„) —>¦ J(x) (x„ \ ? means x„ ^ x„+\, ? e N and ? = \Jn x„)\ and the following Beppo Levi requirement·. (iii) (xn) с A, x e #, x„ | ? and (J(x,,)) bounded implies ? e A. If we assume that ? is a relatively ? -complete sublattice of an ? -continuous lattice H, then (A, p) is a complete pseudometric space, where p(x. y) = J(x ? у) — J (? л у) for all x,y e A (Alfsen A963, Theorem 2)). In particular, if A is the class of all ?-integrable functions, J(/) = j f ?? and p(fg) = /(/-#)??, one has the Riesz-Fischer theorem. Construction of a full integral can be solved in various ways. For instance, the first extension of the Daniell system given in Section 1.1, and the well-known Caratheodory method can be adapted (for results in this direction see Riecan A964, 1975), Futas A971), Brehmer A974)). Various generalizations are also discussed by Riecan with a general algebraic system (quasilinear structures and subadditive measures); in this setting, an interesting decomposition theorem is presented including the Jordan decomposition for Daniell integrals (Riecan A979, p. 227)). Also an der Heiden A970) considers Daniell integrals with values in abstract boundedly complete partially ordered vector space. A further generalized Daniell extension (using partially ordered groups, weakly ?-complete lattice-ordered groups) can be found in Wilhelm A988), where some theorems are adaptations to this more abstract setting of the results known for function spaces (see Schafke A971)), and in the more recent book by Riecan A997). For related results, using natural generalisations of a nonnegative linear functional ("prelinear", "strong sublinear", "monotone" functions) or of a nonnegative additive measure (subadditive measures, capacity functions) see also Sipos A979a, 1979b). Greco A977). Schmeidler A986), Denneberg A994) and Pap A995). Non-additive measures and integrals might be of interest to pursue with respect to future applications in problems that cannot be treated with additivity: problems in economic, fuzzy measures, artificial intelligence, Bayesian decision theory, subjective probability, among others. For an exposition of non-additive measure see Part 9 in this Handbook. Essential integration. A unified functional approach to integration based on the notion of the upper functional (an abstract version of the upper integral) is given by Anger and Porte- nier A992). The procedure is different from the more abstract one discussed by Aumann A952), Hoffmann and S_chafkeJ_1992), Schafke A977) or Bichteler A973) (see also our Section 5). A given v:Rx -> R is called an upper functional if ? = {/ e R: v(f) <
Daniell integral and related topics 513 00} is a lattice cone on which ? is positively-homogeneous with v(f +g)^v(f Ag) + v(f v g) ^ v(_f) + v(g) (strongly sublinear on 7"), satisfying v(f) = inf{u(g): / ^ g e T] for / e Wx (determined by 7), with R=]-oc, 00]. Then, the class of all v- integrable functions is defined J(v) = {/ e Rx: v(f) = u*(/) := -v(-f) e R}. i(u) is closed in Rx with respect to v(\ ¦ |). The special case where a function cone 6сК" that is closed under minima and ? : & —>¦ R is linear and regular (? = ?* on ?) gives ?*(/) = inf^(g): f ^ g e &} which is an upper functional and J (?*) is the abstract proper Riemann integral (see the particular situation treated by Bobillo and Diaz Carrillo A987)). Next, essential integration is investigated, which comprises abstract improper Riemann integrals as well as essential Radon integrals. It is based on the essential upper functional v*(f) = inf/,ey $upkeJ_ (v(f л(-к)) ? /г), where J- = {/ e i(u), / ^ 0}. v* corresponds to Schafke's local integral norm (Schafke A977)). J(v') defines the class of all essentially v-integrable functions. If / is a nonnegative linear functional on a vector lattice S, with ? = /*, Diaz Carrillo and Mufioz A990) have introduced J(v'). An upper functional ? is called an upper integral if it satisfies Daniell's property. For any increasing sequence (/„) с Rx, f(sup„ /„) = sup,, v(f„). Then, for upper integrals the usual convergence theorems are obtained; furthermore, measurability in the sense of Stone is considered. For related results (with the corresponding strong integral norm) see also Bichteler A973). The usual Daniell integration theory is a special case of the above general theory. In fact, if ? is a monotone linear functional and regular (always true for vector spaces) on a function cone ? closed under minima, then the usual extension ?" to monotone limits from ? gives upper functionals ? = ?&·. ?' satisfying Daniell's condition; and the Daniell-?-summable functions are obtained by the above closure J(v) for the cone of monotone limits from ? (see Theorem 9.10 in Bichteler A973)). Fubini theorems are not investigated for essential integration, which are treated with integral metrics (see Section 5). For the case of finitely additive set function essential integration has been introduced by Gunzler A973), extending the Dunford-Schwartz integral (Dunford and Schwartz A958)). For alternative points of view, we note that techniques of categories and functors are used to introduce in a new "axiomatic" way the notion of upper and lower integrals of "Daniell measures" and to prove extension theorems and generalised monotone convergence theorems (see Cicogna A976) and Bogdan A975)). Further generalizations and related results have been considered to Daniell-type integration theory: integrals having values in a ordered semigroup by Thiam A975); integrals having values in a Dedekind-complete vector lattice by Kluvanek A965) (see also Rudin A975), Muni A974) and an der Heiden A970)), with values in a Banach space by De Lia and Mikusinski A995); integrals defined on a vector lattice of functions having values in a quasi- complete locally convex Hausdorff space by Okada A984); and integrals defined in locally convex lattices by Bogdanowicz A971), among others. Non-continuity integration. Analogues to the Daniell extension process, with or without weaker continuity assumptions on the elementary integral, have been treated by Au- mann A952), Loomis A954) and Gould A966). More recently, Bobillo and Diaz Carrillo A987, 1989) discussed an integral extension of Lebesgue power, starting in this case with a vector lattice В с R* and a nonnegative linear functional / on B. The triple (X. B, I) is
514 ?. Diaz Carrillo called a Loomis system, for which we define a first extension BT = sup{M: 0 ? ? С В) and / + (/) = sup{/(g): / > g e ?} for all / e R*. Since /+ is not additive on ??, we introduce the class BT = {/ e ??: / + (/ + g) = /+(/) + /+(g) for all g e ??}· Now, using the classes ?? and — ??. for each / e Rx, the upper and lower integrals /(/)_and /_(/) are defined as usual (see Sections 1.1 and 1.2). The elements of ? = {/ e Rx: 1(f) = 1(f) e R) are called /summable (with respect to a Loomis system). This extension permits us to obtain results similar to those for the Daniell-Bourbaki case; for instance, convergence theorems are proven by Gunzler A991). We recall that "no continuity conditions" on the starting elementary integral allows us to subsume situations of integration with respect to finitely additive measures ?. Diaz Carrillo and Gunzler A997) generalise the above process by "localization", using an appropriate local convergence in measure, which is very useful for obtaining convergence theorems analogously to the classic ones (some of which are not true for В ). Namely, the set L = L(B. I) of /integrable functions is defined as the set of those / e Rx for which there exists an 7-Cauchy net (h„) с В with h„ -> /G), which means that 7(|/„ - /| л h) -> 0 for each fixed h e +B. L coincides with the Daniell L\ in the classic case. A similar R\-theory (abstract Riemann integration) replacing / by I~(f) := —/_( —/). was treated by Diaz Carrillo and Munoz A993), which subsumes the space of abstract Riemann-?-integrable functions R ? (?, R), defined as above, but with h„ -> / "?-locally" (Gunzler A985, p. 199, 70)). Then, given the Loomis system (X. Bq. /;,) induced by a finitely additive measure ?:?^- [0, oof, where ? is a semiring of sets in X, one has R^ (?. ?) (proper Riemann ?-integrable functions) с DS(X, ?. ?. R) (Dunford-Schwartz integral A958, p. 112)) С R\^,W) с L(Bn. //;), with coinciding integrals, and all С are in general strict. For general Loomis systems the above extension classes are specialized to abstract Riemann, Loomis, Daniell and Bourbaki integrals (see Diaz Carrillo and Gunzler A997, p. 1084)). 2. Integral representations for linear functionals For integral representation in the scalar case, one usually has the following general situation: Let us take an abstract set X, a vector lattice i,cMx, / : L —>¦ R linear and continuous in some sense, and a class ? of subsets of X in some way associated with (X, L, I). Then, one may ask if there is a set function ? on ?, possibly unique, such that / is representable by ?, i.e., all functions / e L are ?-integrable and satisfy /(/) = f f ??- The question is answered, in a very special and simple case, by the original representation theorem of F Riesz, which states that any nonnegative linear functional / on С ([a. b]) can be given by means of a Lebesgue-Stieltjes integral. This result has been generalised in several significant ways. For an overview of this subject see Batt A973), Weir A974, Chapter 10), an der Heiden A978) and Glicksberg A951). First we focus our attention on Daniell systems (X. B. I). Various generalisations and special cases will be considered later. 2.1. Measures induced by integrals. Daiiiell-Stone theorem Given a Daniell system, the main purpose of this section is to derive conditions for the existence of a measure ? with the property stated above. (?. ?. ?) denotes a measure
Daniel! integral and related topics 515 space, whenever ? is a ? -algebra in X and ? an extended nonnegative measure on ?. The elements of ? are called ?-measurable sets. The following theorem summarizes the well-known facts. THEOREM 2.1. Let (X, B, I) be a Daniell system, and suppose that L\(I) is Stonian. Let A consist of all I -integrable sets А С X, and let v(A) = ?(?,\) for each A e A. Then (X, A, v) is a measure space (induced by ?), ? is complete and saturated (every locally ?-measurable set is A-measurable). The proof essentially uses the fact that +LJ is closed under taking limits and the countable additivity of 7 on+L+ (see PfefferA977, pp. 122, 126)). PROPOSITION 5. If M(A± denotes the family ofall ?-measurable functions in X (i.e., /"' (B(R)) с Л with В(Ш) Borel ? -algebra in I), one has for any f eRx: f e ? (A) if and only if /^ e L+, and in particular, f e L\ if and only if f e ? (A) and I(f+) — l(f~) has meaning. Note that many of the results in Section 1.2 can be strengthened using the notion of Ao-a.e., with the null ideal An = {A e A: v(A) = 0}. Integrals induced by measures. Given a measure space (?.?,?) we shall define a Stonian Daniell system Bq on X and a Daniell integral ?? on Bq that we extend to L \ (??) (/?-integrable functions) (see Proposition 4 in Section 1.2). Namely, the Stonian vector lattice Bq consists of all i2-simple functions on X (i.e., immeasurable functions / that assumes only finite number of values and with ?([f ? 0]) < oo). Now, given the usual canonical form of / e Bq, the map defined by /„(/) = ?/'=? c4fJ-(Aj) gives a Daniell integral on Bq . Next, using the notion of /-integrable function given in Section 1.3, we can extend the integral ?? to L \ (??) and according to Theorem 2.1, define a new measure space (X. A, v) induced by ??. The relationship between the measures ? and и (induced by ? via /;<), is given for the following assertions·. PROPOSITION 6. The measure ? is a complete and saturated extension of ?, i.e., ? С А and v(A) = ?(?) for each A e ?. If, in addition, ? is complete and saturated, then (?,?,?) = (?,?,?) (see Pfeffer A977'. p. 152)). Now, applying the process of inducing measures by integrals and vice versa, the following theorem is obtained. THEOREM 2.2 (Daniell-Stone). Let (X.B.I) be a Daniell system, and suppose that L\(I) is Stonian. Let (?,?,?) be the measure space induced by I (Theoren^lA), and let (X, /?_д, Iv) be the Stonian Daniell system induced by v. Then L\(I) = L\(IV) and Hf) = l,-(f) for all /el|(/).
516 ?. Diaz Carrillo The proof uses the fact that ? is complete and saturated; so, with Proposition 6, (?, ?, v) is the measure space induced by / and /,., and the classic approximation of nonnegative Д-measurable functions for nonnegative Д-simple functions and Proposition 7 (see Pfeffer A977, Theorem 12.18)). Abstract Lebesgue integral. Given a space of measure (?, ?, ?), for any nonnegative simple function /, f f ?? is defined as above. Now, for any nonnegative immeasurable function / on X, let f /?? = sup{f g??: g e ??, 0 ^ g ^ /}. An arbitrary immeasurable function / is called ?-integrable(o? abstract Lebesgue ?-integrable) whenever J f+ ??- f f~ ??:= f f ?? has meaning; and we let L\ (?) = {f ? L\ (?): f /????.) where L \ (?) denotes the class of all / ?-measurables for which J f ?? exists (see Pfeffer A977), Cohn A980, Chapter 2)). The relationship between the integral /;< and the abstract Lebesgue ?-integral can be obtained as follows: Proposition 7. ?,(?) = ?,(/;1) ? ? (?) and j f ?? = l „(f) for all f e ?\(?). If ? is complete ana saturatea, L\ (?) = L\ (/;1), so that, the abstract Lebesgue integral is a restriction of a certain Daniell integral. As a consequence, the following theorem tells us that the Daniell integral / on L \ (I) is equivalent to the abstract Lebesgue integral with respect to the measure ? induced by /. THEOREM 2.3. Let (X, B, /) be a Daniell (or Bourbaki) system. Assume that the class of I-summable functions is Stonian. Let (?, ?. v) be the measure space inaucea by I (Theorem 2.1). Then, L\(v) consists of all I -integrable functions ana f f dv = 1(f) for all feL^v). The proof follows directly from Theorems 2.2, 2.1 and Proposition 7 (see Pfeffer A977, p. 155)). Alternative expositions of Daniell-Stone integral representation can be found in general textbooks such as Royden A968, p. 297), Segal and Kunze A968, p. 57), Stroock A994, p. 144), Konig A997, Appendix V) and also Gunzler A975, p. 143). It is well known that one cannot dispense with "Stone's condition" (compare Fremlin A974) for a counterexample), but other substitute conditions can be considered (see also Kindler A983)). The study reported here has the advantage that it is an immediate consequence of the previous treatment of measures (integrals) induced by integrals (measures). The most frequently used general method of Caratheodory extension gives the same measures as the Daniell construction. Uniqueness and approximation theorems can be found, for example, in Weir A974, Chapter 13). For recent versions of the "traditional" Daniell-Stone theorem in terms of the abstract Bourbaki integral, we refer the reader to Leinert A995, Chapter 14) and Konig A992, Appendix). An interesting reference on these objects is the survey textbook by Konig A997), with bibliography notes and comparative descriptions of some previous works. 2.2. Relatea results ana applications We use the notation ?(?) to denote the smallest ?-algebra over X with respect to which all functions in the vector lattice ? are measurable, a(C) as usual denotes the smallest
Daniell integral and related topics 517 ?-algebraover С С P(X). Let (X, B, I) be a Daniell system, and assume that 1 e B. Note Lo = L\ f)Rx, then ?(?,?) = {А С X: хд е Lq}, which is the completion of ?(?) with respect to the finite measure ? defined as usual on ?(?0), and the corresponding integral representation holds. As a consequence, one has the following well known Caratheodory extension: Let ? be an algebra of subsets of X and suppose that ? : ? —>¦ [0, oc[ is a finitely additive measure such that ?(?„) 4- 0 whenever (А„)сй decreases to 0. Then there is a unique finite measure, ? on ?(?) with the property that ? coincides with ? on ?. Here one has ?(?) = ?(??), /? on Bq is a Daniell integral, and Theorem 2.3 gives the desired existence and uniqueness (see Stroock A994, p. 146), where an important application to the "probability product" is also proven). We can still establish the uniqueness of ? with a weaker assumption, e.g., there is an everywhere positive function / e L\ (Royden A968, p. 295)). Moreover, by an obvious localization procedure, one can reduce problems involving ?-finite situations to finite ones. We also recall that the extensions of Daniell integral serve as background for the study of probability. In fact, given a Daniell system (?, ?, /), we assume that 1 e В and /A) = 1, let ? = {А с ?: ?? e В*} and define ?(?) = ?(??) for all A e ?. Then ?*(?) = ???(?@): А С D e ?] is a probability measure on the ?-algebra A = [H С Х\ ?*(?)+?*(?€) = l},and ?* = ? on ?. Now, since for / e B\ [f > r] e ?, r e E, thena(B) = a(Si) = a(i?).Now,forany А С ?, ?*(?) = inf{/(/): f e Br, f > ??}, and ?* is a measure on ?(?2). Then, integral representations follow: There is a unique probability measure ? on ?(?) such that each / e В is P-summable and /(/) = f fdP- The results are identical under the hypothesis that / is a Bourbaki integral, with ?? replaced by Bl and sequences by nets (see Ash A972, p. 175)). For applications to probability theory see also Simonnet A996) and Neveu A964). It is interesting to note that the capacity approach can be used to directly deduce the Daniell method of integration and the main Daniell representation theorem. For instance. Chapter 7 in Rao A987) is devoted to a comparative study of an outer measure and an analogous capacity function. Some fundamental properties of measures of fitzzy sets are discussed by Qu A983), using the Daniell representation theorem; in this paper a fuzzy set is considered as a point of a segment in a Riesz space. A recent systematic treatment of measure and integration based on the notion of regularity is investigated in a survey by Konig A997). This development started in the work of Tops0e A976) and others, and the "main instruments are certain new (and unconventional) envelope formations which resemble the traditional Caratheodory outer measure". For a lattice cone ? с +ШХ let_r(?)= {Ac ?: ?.\ e ?}andL(?) = {[/ ^ r\. f e ? and r > 0}. If ?-.ЦЕ) —>¦ +K is a monotone set function with <p@) = 0, the "horizontal integral" is defined for all / e ? by f fd<p = J^ ?([/ > r])dr e +Ш (an integral of Choquet type). Let /: ? -> +R be a nonnegative linear and increasing functional (elementary integral); then, we define ? to be a representation (a source) of / if /(/) = § f ?? for all / e ?. Next, we define the crude outer envelope /*:+E* _> +K and the crude inner envelope h ¦ +ШХ -+ +K of / to be /*(/) = inf{/(g): / ^ g e ?} and /*(/) = sup{/(g): / ^ g e ?}, which induce the two set functions ?, ?:?,(?)-> +E defined by A(A) = ?*(??) and v(^) = /*(Xa) for all
518 ?. Diaz Carhllo A e L(E). In this context, ? с Шх is said to be Stonian if / e ? => / л r, (f - r)+ e ? for all r > 0. The following representation theorem is due in essence to Greco A977) and see also Denneberg A994). THEOREM 2.4. Let /:?—>¦ +R be an elementary integral on the Stonian lattice cone ? С +RX. Then, I admits "sources" if and only if it has the truncation properties: Co (/ continuous at 0): /(/лг) J, Ofor r 4- 0, a/id Cx (/ continuous at oc): /(/ Л г) | /(/) У»г r I oo, /or a// / e ?. The sources of / have a natural and simple characterisation; in fact, if / fulfills Co +CX, then a monotone set function ?: L(E) —>¦ +E is a source of / if and only if ? ^ ? ^ ?- If, additionally, ? is downward ?-continuous, then ? = ?. The above general statements can be seen in Koning A997, p. 149). Their proof use a known method of "lower-upper aproximant" for the characteristic function of certain "parameter spectral sets" of /, and the definition of Riemann integral. These results are already contained in Gunzler A973, Theorem 2), even for / nonnegative, there the conditions Co and Cx have been introduced. Results in this direction had been obtained by an der Heiden A978), investigating the representation of linear functionals by finitely additive measures. Integral representations with prescribed lattice В by some finitely or ?-additive measure, were also discussed by Gunzler A974, 1975) (e.g., if X is any topological space, В contains all continuous functions, or if В contains all bounded functions); in addition, these results can be generalised to abstract В with certain completeness conditions (see Gunzler A974) and Gould A966)). Moreover, generalizing the vector lattice ? to a "subtractive" subsemigroup of Kx, and I: В —> ? additive, К and ? Abelian partially ordered groups, sufficient conditions are given by Gunzler A975, p. 119), for the existence of an unique measure representing a nonnegative /. Still further main generalisations are discussed by Konig A997); we formulate a particular situation: A measure ? : ? —>¦ +R on an ?-algebra ? is called an ?-representation of an elementary integral / if, for all / e E, f is immeasurable and /(/) = j f ??. Then Konig A997, p. 159), gives the following substitute for the traditional Daniell-Stone theorem: THEOREM 2.5. Let I: ? —>¦ +R be an elementary integral on a Stonian lattice cone ? С +ШХ. Then the following are equivalent: (i) / admits ?-representations, (ii) I(v) = I(u) + ??(? - и) for all и ^ и in E. In this case. /„(/) = / / d? for all f e E„. where ? is the maximal inner ?-extension of A. Here, for all /e +RX, I„(f) = sup{lim /(//„): (h„) С ? with/j„ \, some function ^ /} (inner ?-envelope). This result includes the more classic theorems about representing a given linear functional / as an integral with respect to a measure. See also the earlier paper by Pollard and Tops0e A975), where / is defined on a cone E С +??, with some variant assumptions on the Daniell condition, and (/ - g)+ e ? for all /, g e E. This last condition is avoided by Tops0e A976), but similar results are obtained.
Darnell integral and related topics 519 Finally, since the Daniell extension scheme works for some non-linear functionals (see our comments in Section 1.4), integral representations are investigated in this case by Sipos A979c). 2.3. The interplay between measure and topology. Riesz representation theorem It is suitable to use the Daniell approach to obtain the integral representation of an important class of linear functionals. The main results are based on the notion of "regularity", which, for a set function, means to determine its values from a particular set system by approximation from above or below. In traditional measure theory this notion is linked to topology. We assume that X is a locally compact Hausdorff space. Denote by J and ? the classes of all compact and open subsets of X. respectively. Let / be a Bourbaki integral on Cqo(X) and let (?.?, v) be the measure space induced by / (see Theorem 2.1); we know that ? is complete and saturated. Furthermore, an extremely useful property is shown: ? is regular, i.e., (i) О С A (so, A contains the Borel ?-algebra B(X))\ (ii) К eJ=>v(K) <oc; (iii) О е 0 => v(O) = sup[v(K): К ej, К с О), and (iv) A e A =>v(A) =inf{u(C>): OeO. А с О} (outer regular v). Now, by Theorem 2.3 one obtains the following important result referred to in the literature as the "Riesz representation theorem". THEOREM 2.6 (Riesz). Let J be a nonnegative linear functional on Cqq(X). Then, there is a unique measure space (X, A, v) satisfying the following conditions: (i) ? is a regular complete and saturated measure, and (ii) J(f) = ffdvforallfeCw(X). The proof is essentially contained in the above explanation, only remains to be proven the uniqueness. To do so is essential for ? to be regular, together with Propositions 7 and 6. Here the fact that / is a Bourbaki integral is essential (see, in this sense, the proof of ? regular by Pfeffer A977, p. 164)). For instance, in Ash A972), the different results under hypothesis (D) and (B) in Section 1.1 are stated separately. A nonnegative regular Borel measure ? on X (defined on the Borel ?-algebra) is referred to in some literature as "Radon measure". Therefore, Theorem 2.4 establishes a one-to-one correspondence between nonnegative linear functionals on Cqo(X) and Radon measures in X. The reader should be warned that not everyone uses the same definitions for regular, Borel (and Baire) measures. We therefore explain what it means to us. A Borel measure ? is called inner regular if v(A) = sup{v(K): KeJ. К с A) for every Borel set A. Assume the hypothesis of Theorem 2.4, and suppose that X is ?- compact, then there is a unique Borel inner and outer regular measure ? on X, which represents to / on Cqo(X). We note that if О с Кп (countable union of compact sets) and и is a nonnegative Borel measure such that v(K) < oc for all KeJ, then ? is inner
520 ?. Diaz Carrillo and outer regular. As a consequence, if / is a nonnegative linear functional on Coo(X), there is a unique Borel measure ? such that Coo(X) С L\ (v) and /(/) = f fdv for all /eCoo(X). PROPOSITION 8. //X /5 compact Hausdorff and M(X) is the collection of all the outer and inner regular finite signed measures on B(X), the map I н> ? is a isometric isomorphism of the conjugate space ofC(X) (real valued continuous functions on X) and M(X), where the norm of ? e M(X) is taken as \v\(X)(= \\I\\). For X a metric space, not necessarily compact, and nonnegative measures. Ash A972) investigates a weak*-convergence type of sequence (v„) С M(X) to a measure ? : / fdv,, —>¦ / fdv for all / e C(X). The results form the starting point for the study of the central limit theorem of probability. The classic paper by Glicksberg A951) merits special mention, as it shows that a nonnegative functional / on Ci,(X) (bounded, real-valued continuous functions on a topological space X) is representable if and only if Lebesgue's monotone convergence theorem, restricted to elements of Ci,(X), holds for / (Dini's theorem works here). If X is a completely regular topological space, Ascoli's theorem gives such a representation. There are proofs of Proposition 8 using compactification theory (see Varadajan A958), and the complete survey by Flachsmeyer and Terpe A977)). For the historical development that leads to the traditional Riesz representation theorem, we refer the reader to the survey article of Batt A973), as well as Anger and Portenier A992), Dunford and Schwartz A958) and Weir A974). A treatment of the extensions of measures in topological spaces, and related results in this line, can be seen in the classic books of Halmos A950), Berberian A965), Royden A968), Taylor A965), Valdivia A979) and Weir A974). For adiscussion of the different definitions of "Radon measures" (on arbitrary Hausdorff spaces), see Schwartz A973) and Berg, Christensen and Ressel A984), notes and remarks in Chapter 2. For an abstract Riemann-type approach given by Anger and Portenier A992) (see Section 1.4 on essential integration), if &(X) denotes the lattice cone of all lower semi- continuous R-valued functions on X, which are nonnegative outside some compact subset of an arbitrary Hausdorff space X, and if ? is a regular linear functional on &(X) ("Radon integral on X"), a related refinement on representation by Radon integrals is contained in the following theorem by Anger and Portenier A992, p. 131). THEOREM 2.7. Let ? be a lattice cone of lower semi-continuous functions such that &-(X) С Тф (upper envelopes of upward-filtering families in T). If ? is a regular linear functional on T, which is &(X)-tight (i.e., ?*(/) = inf{r*(/ vs): se ?_}), then ? is a Bourbaki integral and the restriction of f dr (= ??') to &(X) is the only Radon integral representing ?. We note that, according to Konig A997, p. 168), the corresponding extended Riesz representation theorem is obtained as a specialization of the above Theorem 2.5, in the order continuous version. Finally, some historical notes and relations to previous works, concerning to Riesz representation theorem, have been considered by Anger and Portenier A992, Appendix §16). See the related Chapter 9 in this Handbook by Diestel and Swart B002).
Daniell integral and related topics 521 3. The abstract Fubini theorem The Daniell method developed in the previous sections allows us to establish a general form of the powerful result that evaluates "multiple integrals by iterated integration". This was originally proven by G. Fubini in 1907 for R"+"' and some improvements were provided by M. Stone in the late 1940s. Throughout this section we assume that for / = 1,2, (X, B,, Ij) are Daniell (or Bourbaki) systems, and L\ denotes the class of all /j-summable functions on Xt. Let X = X\ ? X:; for any function / on X we consider the section functions as usual and, if fx e Lj, we define (h_f)(x) = h(f.x) for each ? e X |. Thus, we have defined functions hf and /| / on ? ? and Xi, respectively. A vector lattice В on X is called a product system (with respect to (S|, /|) and (Z?2, /2)) whenever, for each / e В the following conditions are satisfied: (i) fx e B2 for each ? e X \ and /, e B\ for each у е Xi\ (ii) hf e S| and /| / e Вт, and (iii) /|(/2/) = /:(/i/). Next, let /: В -> R be defined by /(/) = /| (hf) for each / e B. One has that / is a Daniell and Bourbaki integral (product). L\ denotes the class of all /-summable functions on X. The (iterated) upper and lower integrals are defined as above; then, one has the following main preliminary result: Proposition 9. /// e L,, then /2/,72/ e L\. /,/,7|/ e L] and 1(f) = l\(hf) = /|(/2/) = /2(/|/) = /2G|/). The key for the proof is the following relations: For any / e Rx. 7(/) ^ max{7| G:/), 72G|/)}, and its dual. Let (Xi,Ai,Vj) be the measure space induced by /,-,/ = 1,2 (Theorem 2.1). The following well-known theorem holds. Theorem 3.1 (Fubini). Let f e L\, then /v e L.} for v\-a.e. ? e X\, and /, e L\ for vi-a.e. у e Xi. Moreover, if F and G are functions on X\ and X?, respectively, such that F(x) = fr(fx) and G(y) = /|(/,), whenever the integrals on the right side exist, then F e L\, G e L] and 1(f) = /, (/2/) = h(I\ /)¦ The proof follows the standard steps, using the function h = 11 / - I_, / on X2- which belongs to +Lj and h = 0 u2-a.e., since 72(/;) = 0. Then, reiterated applications of Proposition 9 yield the result (see Pfeffer A977. p. 186)). We recall that the "tensor product" of the vector spaces B\ and ?2, ? ? <g> ?2, such that I /I e ?| <g> ?2 whenever / e B\ <g> ?2, satisfies that B\ <g> ?2 С В (ample product system), and it is a product system (see Pfeffer A977) and Floret A981, §13)). Starting with two complete measures ?? and ?2 on X\ and X2, respectively, the above results are used to construct a product measure in X\ <g> X2. and to obtain the corresponding Fubini (and Tonelli) theorem, via the induced integrals. In this context, a new product measure in a topological space, which is regular whenever the factor measures also (topological product or measure topologically induced by the product measure) is also treated by Pfeffer A977, Chapters 16 and 17).
522 ?. Dial Carrillo In Bichteler A998), products of elementary integrals are studied in the following way: For/' = 1,2, (Xj, ?¦, /,) elementary integrals (see Section 1.3, Daniell mean; remember, ?, are rings of bounded functions on X,), the elementary integrals onX| xXiare the finite sums of the form h(x, y) —^2':i hJi(x)hJ2(\). hj e ?,; such a collection is denoted by ?3 or ?| <g> ?2 (ring of bounded functions on X \ xXi. but it is not, in general, a vector lattice; this fact justifies the integration theory "of means" for rings). The elementary integral of h e?, is well defined by /(/?) = (/| ? /2)(?) = ?/=? 1\(h\) ¦ h(h2). Products of additive set functions and "Radon measures" are specialized. Let /''(/) = I \ (hf) (iterated upper integral) and ||/f = 7,G2/), then || · ||ft is an upper bound for I = I\ ? h and that agrees with / on +?, so we close ? with respect to || · ||л to obtain L\ (|| · ||);') and we extend the elementary integral by continuity to obtain the extension / on L\(|| · \\h). Then, one can evaluate the integral by iterated integration, but this uses a mean that refers to the order of integration, which leads to choosing a new mean || · ||*: the largest mean that agrees with / on +?. Let L\(I\ ? I2) be the closure of ? in T( || · ||*) (integrable functions for the product), and extend /| ? I2 to L1 by continuity. In this framework, the classic Fubini and Fubini-Tonelli theorems (with measurable functions with ?-finite carrier) are established (Bichteler A998, pp. 130-131)). Here, some applications to convolution and interpolation are studied. Order continuous situations can also be treated. The Daniell extension is used by Kelly and Srinivasan A976) to give a unified proof of the Fubini theorem for "Baire and Borel measures" on locally compact Hausdorff spaces. Konig A997) investigates the product formations based on inner regularity, which can be specialized to "Radon measures" (in the sense given here in 2.3). He made an extensive study of the "sectional representation" and "monotone approximation of functions" to present several versions of Fubini-Tonelli theorem; his bibliography notes (p. 229) are worthy of consideration. See the related Chapter 17 in this Handbook by Grekas B002). Fubini theorems are investigated by de Amo and Diaz Carrillo A995) for proper and abstract Riemann integration with "non-continuity integration" discussed in Section 1.4. The results obtained can be developed with Proposition 9 and they contain those of the Riemann^-abstract integral due to Eisner A975). Iterated (local) integral norms and corresponding abstract Fubini theorems are studied by Hoffmann A977a, 1977b) and by Hoffmann and Schafke A992, Chapter 3). 4. The Radon-Nikodym theorem Throughout this section we shall assume a Daniell system (X.B.I) which has been extended to (X, L1 (/),/). A function h e +RX is said to be locally integrable with respect to / if fh e L1 (/) for all / e В (note that if ? is the Lebesgue measure on the line, then / is locally ?-integrable if and only if it is ?-integrable on every bounded interval). Then, the functional defined by J (/) = I(fh). for all / e B. is a Daniell integral (sometimes called the "measure with base / and density /7". written J = hi). Consider two assumptions about the vector lattice В: В is Stonian and X is ?-finite with respect to B. These require the existence of an increasing sequence (X„) of elementary sets X„ = {.v e X: h„(x) ^ 1}. h„ e B, whose union is X, and ensures the ?-finiteness of the measure derived from any
Daniell integral and related topics 523 elementary integral on В (see Zaanen A958), where the above assumptions are discussed in some detail). THEOREM 4.1 (Radon-Nikodym). Let (X.B.I) be a Stonian Daniell system, and suppose that X is ?-finite with respect to B. Let J be a Daniell integral on B, and (i): let h e +RX such that J(g) = I(gh), for all g e B. Then, f e L\(J) if and only if fh e L|(/), and in this case (ii): J(f) = Hfh). Condition (i) is equivalent to (i'): every /-null subset of X is У-null (write J <Cl)Af h, к both satisfy (i) then h — к is /-null. To prove that (i') => (i), one uses that for a bounded linear functional J on L2(/), there is h e L2(I) such that J(f) = I(fh) for all / e L2(I) (Riesz's theorem). The proof can be found in Weir A974) and is based on four assertions: (a) A 7-null => ? ? [h ? 0] is /-null; (b) if/eL,G), then//? e L\(I) and (ii) holds; (c) A is /-null => A is У-null; and (d) if fh e L\ (/), then feL\(J) and (ii) holds. The classic measure-theoretical Radon-Nikodym theorem (see, for example, Weir A974, p. 216)) is a special case of Theorem 4.1. Also in the same context, an excellent outline of the complete results (?-finite and general case), as well as the different methods of proving it and the usual topics on duality of L''-spaces and conditional expectation, can be found in Rao A987). In the sense of extension of the integral, via the "Daniell mean" by Bichteler A998), the measures with densities and the Radon-Nikodym theorem are also treated. In fact, the ?-additive elementary integrals of finite variation form a band M*(?) in the class of the elementary integrals of finite variation on ?. If / is ?-finite (a similar condition to the above one), every J belonging to the band generated by / can be obtained by the formula J = hi. As application conditional expectation is also treated (p. 155). An interesting generalization to the differentiation theory of Daniell integrals (in the sense of Radon-Nikodym derivatives) has been considered by Volcic A980), without any use of measure theoretic methods; hence, it does not suppose that Stone's condition holds. Volcic proves a Hahn-type decomposition theorem and a Radon-Nikodym theorem for Daniell integrals, using the notion of localizability of the basic integral /. We say that / is localizable if the lattice M^ is complete, where M^ is the lattice of all extended real- valued functions к for which kg is Stone-measurable, for any Stone-measurable g. For a signed integral У, a set H+ locally-J-integrable is said to be positive (respectively H~ negative) if J(fxH+) > 0, for each / e +L\ (J) (respectively ^ 0 for H~)\ the disjoint sets H+ and H~ with union X are said to form a Hahn decomposition of X with respect to J. In this context (compare Zaanen A961). Floret A981) and Bichteler A973)), the following main results are proven; THEOREM 4.2. A) I is localizable if and only if each J that is absolutely continuous with respect to I (J <<C /), has a Hahn-decomposition.
524 ?. Diaz Corrilh B) /// is localizable and J «; /, then there exists g e M^(I) such that J(f) = I(fg) forallfeLi(J). The function g is given by max{/? e M^_: I(fh) < J(f)}. Assertion A) is a Kelley- type characterization of localizability. For the measure case see Kelley A966). See also the related Chapter 6 in this Handbook by Volcic and Candeloro B002). Using similar methods, Volcic A981) proves that the lifting theory can be extended to Daniell integrals. Essentially the same general setting is considered to investigate the "localizable upper gauge" by Bichteler A973), where Radon-Nykodym theorems, for scalar and Banach-valued "measures" are given. The existence of liftings is also investigated. Finally, we note that in the context of non-continuity integration, an approximate functional Radon-Nikodym theorem is proven by de Amo, Chictescu and Diaz Carrillo A999). In fact, for two nonnegative linear functionals J and / on a lattice algebra В сШх. such that J is "absolutely continuous" with respect to /, one can express J as follows: J(f) = lim/(/u„) for all /efl, where (i'„) is a fixed sequence in B. This result is the "functional" similar to a previous fundamental result by Fefferman A967). In the same spirit, the remaining question is whether we can find an exact Radon-Nikodym derivative. We know some partial results using a notion of "exhaustion" of / on elements of a function algebra lattice (for the finitely additive case, see the classic results by Maynard A979) and Hagood A986)). See de Amo, Chictescu and Diaz Carrillo B001). 5. Integral norms. Local integral metrics and Daniell-Loomis integrals The classic book by Bichteler A973) gives an excellent and extensive study of integration for Banach space-valued functions and measures with upper integral norms and their generalizations to upper gauges, "motivated by the desire to carry M.H. Stone's unified treatment of set functions and Radon measures as far as possible". These questions were first raised by the pioneering work of Aumann A952), some more general results and variants were also treated by Schafke A970, 1972). In Bichteler, local integral norms are not investigated, only "Daniell-continuous" integral norms (?-subadditive), and only with the Stone condition assumed. The finitely additive case is not considered. The basic assumptions and definitions in Bichteler are as follows: The pair (R.m) is called an ?-valued elementary integral on X, whenever R is a Stonian vector lattice (integration lattice), and m (integral or measure) is only a linear map from R to some Banach space E. The linear operator U : R® ? —>¦ F (Banach space) and its semivariation \\U\\ are defined as usual; U has finite variation if there is a seminorm/;: +RX -> +RX such that ||?/(j)|| ^ n(\g\), for all g e R ® ? (see Dinculeanu A966)). An upper norm ? on X (M : +RX -> +RX increasing, subadditive and positively homogeneous) is said to be an upper-S-norm on X if additionally satisfies: (/,) с +ШХ, f„i and supM(/,) = M(sup/„) (equivalent to Aumann's "Summenbeschrankt" A952). Next, the space L]f(R. M) is defined as the M-closure of R ® ? in FE(M) := [k e Ex: M(\k\) < oo} ((R. M)-integrable functions or "Bochner integrable" functions is also used). The scalar case is obtained similarly. Then, the following main result is proven (Bichteler A973. Theorem 7.12, p. 79)):
Darnell integral and related topics 525 THEOREM 5.1. Let R be an integration lattice and ? an upper-S-norm on X, that is finite on +R. Then LE(R, M) is an M-complete seminormed vector space and for any M-Cauchy sequence (f,) С LE, there is a subsequence (f,k) that converges M-a.e. to lim/,. In particular, with a Jordan upper norm mJ(/) = {m(h): \f\ ^ h e R\ for m : R —>¦ К nonnegative and linear, the Jordan-w-integrable functions space, L E (R. m J) is defined and it agrees with /?'(/?, m), ? = R. but the abstract Riemann-integrable functions space R\ (R,mJ) is not treated (see Diaz Carrillo and Mufioz A993), Diaz Carrillo and Giinzler A993, p. 421)). With the notion of weak upper gauge: ? an upper-S1-norm finite on R, such that +R э f, <: f„+i ? /? e LlE(R, M) implies /,->¦/ e LE(R. M) and M(\f, - f\) -> 0 (converges in mean), weak convergence theorems are proven (pp. 83-84). Moreover, with the additional condition that: +R э f„ ^ /,+i, supM(f,) < oo implies (f,) M-Cauchy (upper gauge), monotone convergence theorem and Lebesgue's dominated convergence theorem (with only \f„ | ^ к е +ШХ, M(k) < oo) are obtained (pp. 88-89). Next, it is shown how the upper norms can be used to extend an elementary "measure": Given U . R <g> ? —> F linear, ? upper norm on X with the condition that it majorizes U i.e., ||?/(й)|| ^ M(\h\) for all h e R <g> E. then there is a unique M-continuous extension of U to LE(R.M) (this corresponds with Aumann's extension A952) or Schafke's extension A977)); but the above condition is stronger than Aumann's assumption). Given a "measure" M, a necessary condition for the existence of a majorizing upper gauge is that it be 5"-continuous. Standard upper integrals are constructed by Bichteler A973. Section 9); in particular, the Daniell mean has been already considered in our Section 1. Also, projective limits, martingales (from the projective limit point of view), lifting theorems, integration of measures and Radon-Nikodym theorems are discussed by this author. See also Bichteler A975), where "measures" with values in non-locally convex spaces are considered and interesting applications are discussed. Note that Schafke's approach A970. 1971, 1972) is more general, but one comparison of conditions on integral norms, results and possible applications should be an interesting task for the future. The book by Hoffmann and Schafke A992) could give a good introduction in this field. With the aid of Schafke's local integral metrics (Schafke A977)), the "non-continuity integration process", given in Section 1.4, is generalized by Diaz Carrillo and Giinzler A993). One has the usual convergence theorems using a suitable local mean convergence, which can be tracked back to Loomis A954). More precisely, the functional q : +ШХ —> +ШХ is called an integral metric on X if q@) = 0 and q(f) ^ q(g) +q(k) if / ^ g +k, fg,k? +ШХ. If В С Шх, we define the class of all q-integrable functions Bq as the closure of В in Rx with respect to q; if additionally an / : В —>¦ ? is given, which is uniformly continuous on В with respect to q, the unique ^-continuous extension of / to Bq will be denoted by 1q. In all of the following (X, B. I) will be a Loomis system. For any integral metric q, the corresponding local integral metric of Schafke is defined ЬУ ?/(/) = sup{<7(/ Ah): he +B) for all / e +RX; q\ is again an integral metric with q\ -?, q, and one has В С Bq С Bqi. Г1 = Iе" on Bq. Moreover, we introduce q- measurability, defined by the property that truncation by integrable functions leads to
526 ?. Diaz Carrillo integrable functions. The relevant convergence properties with respect to q or qp (where ?/>(/) = [<7(//')]l//', / e +??, ? > 1) are developed. This allows us to introduce the abstract Daniell-Loomis spaces Rp as usual, and to prove that Rp = B'1'' (see Diaz Carrillo and Gunzler A996)). In this framework. Giinzler A996) discusses Bq-improper integrable functions < = (/ e Rx: /П/ie Bq for all h e+B and lim+? /(/ ? h) exists e E}, where / П h = (/ л h) ? (-/г). If / and q are equivalent on B, such improper integration turns out to coincide with Schafke's local integral extension. The results hold also in the finitely additive case, and it is shown that as special cases proper- and abstract Riemann-, Loomis-, Lebesgue-, Daniell- and Bourbaki-integrals are subsumed. In fact, for a general Loomis system (X, B, I) and q(f) = /"(/) :=inf{/(/?); / ^ h e +B) for all / e +RX, one has Bq = /?prop(S, /) = two-sided completion of Loomis A954, p. 170) с S'" = R\(B.I) (abstract Riemann-/-integrable functions of Diaz Carrillo and Mufioz A993)), containing the one-side completion of Loomis A954, p. 178). Moreover, it is also true that with q(f) = 1(f) := inf{/+(g); / ^ g e ??] (as in Section 1.4), Bq = ~B (/-summable function by Bobillo and Diaz Carrillo A987)) and Bq< — L(I,B) (local extension by Diaz Carrillo and Gunzler A997)) = improper B- integrable functions. Given the Daniell condition (D), then one can use Aumann's /? as_<7, where /?(/) =???(?? I(h„): f ij Y^ h„,h„ e +B), then B'< = Daniell's L\(I,R) and Bqi = L\ + {local L?-malfunctions}. Finally, in the Bourbaki situation, where / is net-continuous, let q = 1+ with /+(/) := -/"(-/), then LT = Bq = L\. For a related result concerning essential integration (see Section 1.4 and Anger and Portenier A992)), one_has essential-u-integral functions J'(v) = ?1/, = ?'11, with q an upper functional ? on +RX; which give new characterisations of R\ (B. I) and L(I, B), for instance. Acknowledgements The author would like to express his gratitude to Prof. Hans Gunzler for his valuable comments and suggestions. References Adamski, W. A983), Some characterizations of Daniell lattices. Arch. Math. 40 D). 339-345. Adamski, W. A986), ?-smooth linear junctionals on vector lattices of real-valued functions, Studia Math. 83 B). 105-117. Alfsen, E.M. A958), On a general theory of integration based on order. Math. Scand. 6 (S). 67-79. Alfsen, E.M. A963), Oder theoretic foundations of integration Math. Ann. 149. 419^461. an der Heiden, U. A970), Integration in Vektonerbanden nach Daniell, Diplomarbeit Univ. Gottingen. an der Heiden, U. A978). On the representation of linear functionals by finitely additive set functions. Arch. Math. 30, 210-214. Anger, B. and Portenier, С A992), Radon Integrals. Progress in Math.. Vol. 103, Birkhauser, Basel. Ash, R.B. A972), Measure, Integration and Functional Analysis. Academic Press. New York. Asplund, E. and Bungart, L. A966), A First Course in Integration. Holt. Reinhart and Winston. New York. Aumann, G. A952), Integralerweiterungen mittels Normen. Arch. Math. 3. 441^450. Batt, J. A973), Die Verallgemeinerungen des Darstellungssatzes von F Riesz und ihre Anwendungen, Jahresber. Deutsche. Math.-Verein. 74, 147-181.
Daniell integral and related topics 527 Berberian, S.K. A965). Measure and Integration. Macmillan. New York. Berg, C, Chistensen. JR.P. and Ressel. P. A984). Harmonic Analysis on Semigroups. Graduate Texts in Math.. Vol. 100. Springer. Berlin. Bichteler, K. A973), Integration Theory (with Special Attention to Vector Measures). Lecture Notes in Math.. Vol. 315, Springer, Berlin. Bichteler, K. A975), Measures with values in non-locally comex spaces. Lecture Notes in Math.. Vol. 541. Springer, Berlin, 277-285. Bichteler, K. A998), Integration. A Functional Approach. Birkhauser. Basel. Bogdanowicz, W.M. A967), An approach to the theory of integration generated by daniell fimctionals and representations of linear continuous functionals. Math. Ann. 173. 34-52. Bogdanowicz, W.M. A971), Minimal extension of Daniell functional.·, to Lebesgue and Daniell-Stone integrals. Bull. Acad. Polon. Sci. Ser. Sci. Math. Astronom. Phys. 19. 1093-1100. Bobillo Guerrero, P. and Diaz Carrillo. M. A987). Siimmable and integrable functions with respect to any Loomis system, Arch. Math. 49, 245-256. Bobillo Guerrero, P. and Diaz Carrillo. M. A989). On the summability of certain ?-integrable functions. Arch. Math. 52, 258-264. Bogdan, V A975). A new approach to the theory of probability via algebraic categories. Lectures Notes in Math.. Vol. 541, Springer, Berlin. 345-367. Bourbaki. N. A952), Integration. Elements de Mathematique III. Livre VI. Hermann. Paris. Brehmer, S. A974). Verbandtheoretische Charakterisierung des Mass und lntergralbegriffs von Caratheodory. Potsdam Forsch. В C). Candeloro, D. and Volcic, A. B002). Radon-Nikodym theorems. Handbook of Measure Theory. E. Pap. ed.. Elsevier, Amsterdam, 249-294. Choquet. G. A953/54), Theory of capacities. Ann. Inst. Fourier (Grenoble) 5. 131-295. Cicogna, G. A976), Un nuovo approcio alia teoria dell'integrazione reale. Bull. Un. Mat. Ital. A E) 13 A). 142-151. Cohn. D.L. A980), Measure Theory, Birkhauser. Boston. Constantinescu, С and Weber, K. A985). Integration Theory. Vol. I: Measure and Integral. Wiley-Interscience. New York. Constantinescu, C, Filter, W. and Weber. K. A99%). Advanced Integration Theory. Kluwer Academic. Dordrecht. Daniell, P.J. A917/18), A general form of integral. Ann. of Math. B) 19. 279-294. Daniell, P.J. A919/20), Furthers properties of the integral. Ann. of Math. 21, 203-220. de Amo, E. and Diaz Carrillo, M. A995), On abstract Fubini theorems for finitely additive integration. Proc. Amer. Math. Soc. 123 (9), 2739-2744. de Amo, E. and Diaz Carrillo, M. A997), Fubini-integrals metrics. Rend. Circ. Mat. Palermo B) 44. 161-174. de Amo, E., Chictescu, I. and Diaz Carrillo, M. A999). An approximate functional Radon-Nikodym theorem. Rend. Circ. Mat. Palermo B) 48. 443^450. de Amo, E., Chictescu. I. and Diaz Carrillo. M. B001), An exact functional Radon-Nikodym theorem for Daniell integrals, Studia Math. 148 B), 97-110. De Lia, A. and Mikusinski, P. A995), A Daniell-type integral with values in a Banach Space, Arch. Math. 65 E), 417-423. Diestel. J. and Swart, J. B002). The Riesi theorem. Handbook of Measure Theory. E. Pap, ed.. Elsevier, Amsterdam, 401^*47. Denneberg, D. A994), Non-Additive Measure and Integral, Kluwer Academic. Dordrecht. Diaz Carrillo, M. and Munoz Rivas, P. A990), Integral extension with locally-integral seminorm, Acta Univ. Carolin. Math. Phys. 31, 25-27. Diaz Carrillo, M. and Munoz Rivas. P. A993), Finitely additive integration: Integral extension with local- convergence, Ann. Sci. Math. Quebec 17, 145-154. Diaz Carrillo, M. and Gunzler, H. A993), Local integral metrics and Daniell-Loomis integrals. Bull. Austral. Math. Soc. 48, 411^426. Diaz Carrillo, M. and Gunzler, H. A996). Abstract Daniel l-Loomis spaces. Bull. Austral. Math. Soc. 53. 135- 142. Diaz Carrillo, M. and Gunzler, H. A997). Daniell-Loomis integrals. Rocky Mountain J. Math. 24 D). 1075-1087. Dinculeanu, N. A966), Vector Measures, VEB, Berlin.
528 ?. Dial Carrillo Dunford, N. and Schwartz, J.T. A958), Linear Operators. Port I: General Theory. Interscience. New York. Eisner, J. A975), Zum "Sati von Fubini" fiir ein abstraktes Riemann-integral. Math. Z.. 265-278. Fefferman, C.H. A967). A Radon-Nikodvm theorem for finitely additive set functions. Pacific J. Math. 23 B). 35^45. Flachsmeyer, J. and Terpe. F A977), Some applications of the theory of compactifications of topological spaces and measure theory, Russian Math. Surv. 32 E). 133-171. Floret, K. A981), ???- und Integrationstheorie. Teubner. Stuttgart. Fremlin, D.H. A974), Topological Riesi Spaces and Measure Theory. Cambridge University Press. Cambridge. Futas, E. A971), Extensions of continuous functionals. Math. Cas. 21. 191-198. Glicksberg, I. A951), The representation of junctionals by integrals, Duke Math. J. 19. 253-261. Gould, G.G. A964), A Stone-Cech-Ale.xandroff-type compactification and its application to measure theory, Proc. London Math. Soc. C) 14, 221-224. Gould, G.G. A966), The Daniell-Bourbaki integral for finitely additive measures. Proc. London Math. Soc. C) 16, 297-320. Greco, G.H. A977), Integrate monotono. Rend. Sem. Mat. Univ. Padova 57. 150-166. Grekas, S. B002), On products of topological measure spaces. Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 745-764. Giinzler, H. A973), Linear functionals which are integrals. Rend. Sem. Mat. Fis. Milano 43, 167-176. Gunzler, H. A974), Stonian lattices, measures and completeness. Linear Operatoren und Approximation II, Herausg. Butzer und Sz. Nagy, Intern. Ser Numer. Math., Vol. 25. Birkhauser, Basel. 113-126. Gunzler, H. A975), Integral representations with prescribed lattice. Rend. Sem. Mat. Fis. Milano 45, 107-168. Gunzler. H. A985), Integration, Bibliogr. Instit., Mannheim. Gunzler, H. A991), Convergence theorems for a Daniell-Loomis integral. Math. Pannon. 2, 77-94. Gunzler, H. A996), Improper integrals and local integrals. Benefit 96-14, Mathematischen Seminars Kiel. Revue Rumaine. Hagood, J.W. A986), A Radon-Nikodym theorem and Lp completeness for finitely additive vector measures, J. Math. Appl. 113. 266-279. Halmos, PR. A950). Measure Theory. Van Nostrand. New York. Hoffmann, D. A977a), Zum "Satz von Fubini". J. Reine Angew. Math. 298. 138-145. Hoffmann, D. A977b), Integralerweiterung durch Intergralnonn mil Werten in priigeordneten Halbgruppen, J. Reine Angew. Math. 295, 187-201. Hoffmann, D. and Schafke, FW. A992), Integrate. Bibliogr. Instit.. Mannheim. Jacobs, K. A978), Measure and Integral. Academic Press, New York. Kelley, J.L. A966), Decomposition and representation theorems in measure theory. Math. Ann. 163, 89-94. Kelley, J.L. and Srinivasan, T.P A976), A unified proof of Fubini theorems for Baire and Borel measures. Lectures Notes in Math., Vol. 541. Springer, Berlin. 25-29. Kindler, J. A983), A simple proof of the Daniell-Stone representation theorem. Amer. Math. Monthly 90 F). 396-397. Kluvanek, I. A965). Integrate vectorielle de Daniel/. Mat. Fyz. Casopis Sloven. Akad. 15. 146-161. Konig, H. A982). Integraltheorie ohne Verbandspostulat, Math. Ann. 258. 447^458. Konig, H. A992), Daniell-Stone integration without the lattice condition and its application to uniform algebras, Ann. Univ. Sarav. Ser Math. 4. 1-91. K6nig,H. A997), Measure and Integration. An Advanced Course in Basic Prodecures and Applications, Springer, Berlin. Leinert, M. A982), Daniell-Stone integration without the lattice condition. Arch. Math. 38, 258-265. Leinert, M. A984), Plancherel's Theorem and integration without the lattice condition, Arch. Math. 42, 67-73. Leinert, M. A995), Integration und ???, Vieweg. Weisbaden. Leinenkugel, С A992), Daniell-Stone approach to the general Denjoy integral, Proc. Amer. Math. Soc. 114 A), 39-52. Loomis, L.H. A954), Linear functionals and content, Amer. J. Math. 76, 168-182. Maynard, H.B. A979), A Radon-Nikodym theorem for finitely additive bounded measures. Pacific J. Math. 83 B), 401^413. McShane, E.J. A953). Order-Preserving Maps and Integration Processes, Ann. of Math. Stud.. Vol. 31, Princeton University Press, Princeton, NJ.
Daniell integral and related topics 529 Muni, G. A974), Intorno adalcuni aspetti della teoria dell'integrate xectoriale. Rend. Mat. F) 7, 239-268. Neveu, J. A964), Bases mathematique du calcul des probabilites, Paris. Okada, S. A984), Vector Daniell integrals. Math. Slovaca 34 A), 35-65. Pap, E. A995), Null-Additive Set Functions. Kluwer Academic, Dordrecht. Papangelou, F. A966), Note on the functional analogue of the Jordan decomposition of measures. Arch. Math. 17 C), 253-255. Pfeffer, W.F A977), Integrals and Measures, Marcel Dekker, New York. Pollard D. and Tops0e. F. A975), A unified approach to Riesz type representation theorems. Studia Math. 54 B), 173-190. Qu, Y.S. A983), Measures of fuzzy sets. Fuzzy Sets and Systems 9 C), 219-227. Rao, M.M. A987), Measure Theory and Integration, Wiley-Interscience, New York. Riecan, B. A964), Sur une extension continue des certaines fonctions monotones, Spisy Prer. Fak. Univ. J.E. Purkyne 457, 481^483. Riecan, B. A975), An extension of the Daniell integration scheme. Mat. Cas. 25 C), 211-219. Riecan, B. A977), On the Caratheodory method of the extension of measures and integrals. Math. Slovaca 27 D), 365-374. Riecan, B. A979), On the unified measure and integration theory. Acta. Fac. Res. Nat. Univ. Camen. Mathem. 27, 217-237. Riecan, B. A983), On measures and integrals with values in ordered groups. Math. Slovaca 33 B), 153-163. Riecan, B. A997), Integral, Measures and Ordering. Math. Appl., Vol. 411, Kluwer Academic, Dordrecht. Royden, H.L. A968). Real Analysis, 2nd edn.. MacMillan, New York. Rudin, V.N. A975), On the abstract theory of the Daniell integral. Math. Questions C). Trudy Tomsk. Gos. Univ. 220,57-61. Schafke, FW. A970), Integrationstheorie I, J. Reine Angew. Math. 244, 154-176. Schafke, FW. A971), Integrationstheorie II, J. Reine Angew. Math. 248, 147-171. Schafke, FW. A972), Integrationstheorie und quasinormiert Gruppen, J. Reine Angew. Math. 253, 117-137. Schafke, FW A977), Lokale Integralnormen und verallgemeinerte uneigentliche Riemann-Stieltjes Integrale, J. Reine Angew. Math. 289, 118-134. Schmeidler, D. A986), Integral representation without additivity. Proc. Amer. Math. Soc. 97 B), 255-261. Schwartz, L. A973), Radon Measures on Arbitrary Topological Spaces and Cylindrical Measures, Oxford University Press, Oxford. Segal, I.E. and Kunze, M.A. A968), Integrals and Operators, McGraw-Hill, New York. Shilov, G.E. and Gurevich, B.L. A966), Integral, Measure and Derivative: A Unified Approach, Prentice-Hall, Englewood Cliffs, NJ. Simonnet, R. A996), Measures and Probabilities, Springer, Berlin. Sipos, J. A979a), Integral with respect to apre-measure. Math. Slovaca 29 B), 141-155. Sipos, J. A979b), Nonlinear integrals. Math. Slovaca 29 C), 257-271. Sipos, J. A979c), Integral representation of nonlinear fimctionals, Math. Slovaca 29 D), 333-345. Stone, M.H. A948/49), Notes on integration I-IV, Proc. Nat. Acad. Sci. USA 34 A948), 336-342, 447^455, 483^490; 35 A949), 50-58. Stroock, D.W. A994), A Concise Introduction to the Theon of Integration, Birkhauser, Basel. Taylor, A.E. A965), General Theory of Functions and Integration, Blaisdell. New York. Thiam, D.S. A975), Integrale de Daniell a valeurs dans un semi-groupe ordoenne, С R. Acad. Sci. Paris Ser. A-B 281 E-8), Aii, A 215-A 218. Tops0e, F. A970), Topology and Measure, Lecture Notes in Math., Vol. 133, Springer, Berlin. Tops0e, F. A976), Further results on integral representations, Studia Math. 55 C), 239-245. Valdivia, M. A979), Analisis Matemdtico V, Unidades Didact'icas. Vols. 4, 5. 6, UNED, Madrid. Varadajan, VS. A958), On a theorem of F Riesz concerning the form of linear fimctionals. Fund. Math. 19, 209-220. Volcic, A. A975), An extension of the Daniell integral scheme, Mat. Cas. 25 C), 211-219. Volcic, A. A977), Sulla differenziazione degli integrali di Daniell-Stone, Rend. Sem. Mat. Univ. Padova 61, 251-258. Volcic, A. A980), Differentiation of Daniell integrals, Proceedings Measure Theory. Oberwolfach 1979, Lectures Notes in Math., Vol. 794, Springer, Berlin, 284-294.
530 ?. Diaz Carrillo Volcic, A, A981), Liftings and Daniell integrals. Measure Theory. Oberwolfach 1981. Lectures Notes in Math.. Vol. 945, Springer, Berlin. 180-186. Weir, A.J. A974). General Integration and Measure, II. Cambridge University Press. Cambridge. Wilhelm, M. A988). Integral extension procedure in weaklv ?-complete lattice-ordered groups II. Studia Math. 89C), 231-239. Zaanen, A.C. A958), An Introduction to the Theory- of Integration, Interscience. New York. Zaanen, A.C. A961), The Radon-Nikodym theorem, I, II. Indag. Math. 23. 157-187.
CHAPTER 12 Pettis Integral Kazimierz Musial Institute of Mathematics, Wroclaw University, 50-384 Wroclaw, PI. Grunwaldzki 2/4. Poland E-mail: imisial@math.imi.wroc.pl Contents Introduction 533 1. Preliminaries 534 2. Measurable functions 535 3. Scalar integrals, basic properties 538 4. Pettis integral 541 5. Limit theorems 550 6. The range of the Pettis integral 553 7. Universal integrability 557 8. Pettis integral property 558 9. Weak Radon-Nikody'm property and related properties 560 10. Conditional expectation 568 11. Differentiation 570 12. Fubini theorem 572 13. Spaces of Pettis integrable functions 573 13.1. Space of all Pettis integrable functions 573 13.2. Functions satisfying the strong law of large numbers 575 13.3. Functions with integrals of bounded variation 577 13.4. 11.?(?, X) equipped with the variation norm of integrals 578 13.5. Bounded Pettis integrable functions 579 References 580 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 531
This page intentionally left blank
Pettis integral 533 Introduction Until 1968, there has been no significant progress in the theory of the Pettis integral since Pettis's A938) original paper. The Pettis integral was known to be countably additive and absolutely continuous. Due to Pettis A938) it was also known that the space of Pettis integrable functions defined on the unit interval, integrated with respect to the Lebesgue measure may be non-complete in the semivariation norm. There were also known some facts concerning Pettis integrability of strongly measurable functions and a few examples of non-strongly measurable but Pettis integrable functions but almost nothing more. Then, Rybakov A968) proved that the Pettis integral is of ?-finite variation. In 1975, Thomas presented some negative aspects of Pettis integrability. He proved that the space of Pettis integrable functions with respect to a nonatomic measure in never complete if the range space is infinite dimensional and that for each infinite dimensional Banach space X there exists an X-valued function on [0, l]2 such that not almost all its section are Pettis integrable on [0, 1]. Thomas also suggested that probably one could find a Pettis integrable function / on [0, 1] such that lim/,^o II7^ fs+' f(t)dt\\ = 00 on a set of positive Lebesgue measure. Then started a series of papers developing the theory of Pettis integration. Edgar A979) investigated Banach spaces with the property that each scalarly integrable function with values in the space is Pettis integrable (so-called PIP) and proved several non-trivial facts about these spaces. He was also the first to consider the Lindolof property (so-called (C) property) for closed convex sets in the context of Pettis integration. Fremlin and Talagrand A979) constructed an example of a Pettis integrable function with non relatively norm compact range of its integral, answering a question of Grothendieck formulated in the language of operators. The same paper contains a proof of Stegall theorem saying that on perfect measure spaces the range of a Pettis integral is always norm relatively compact. Musial A979) (preprint version was published in 1976) called attention to the problem of the existence of Pettis integrable Radon-Nikodym derivatives of Banach space valued measures proving that there is a strong connection between conjugate spaces with all measures of finite variation possessing Pettis integrable densities and Banach spaces not containing any isomorphic copy of l\. I have tried to include all papers concerning Pettis integrability, but the text presents my own point of view on the subject. As a result some papers are only mentioned without details. There are some topics completely absent in this survey. In particular I consider only Banach spaces, in spite of several papers dealing with locally convex spaces, where the notion of the Pettis integral is quite natural. I also left out of account the approach via Stone transform due to D. Sentilles, for the benefit of liftings, which seem to be more measure theoretical. That approach however was fully exploited by Talagrand A984). I skipped also some important geometric properties of Banach spaces possessing WRNP type properties (containing trees, extreme points), mainly because of limited amount of space. Totally overlooked is the theory of strongly measurable functions, which is discussed in several other papers and books (cf. Diestel and Uhl A977). I also decided do not write anything about Fremlin's generalization of McShane integral (Fremlin A995)), which is intermediate between Bochner and Pettis integral, in spite of thinking that the integral of Fremlin may turn out to be more suitable for several applications than the Pettis
534 A". Musial integral. Also the approaches of Phillips A940), Birkhoff A935), Dobrakov A970a) and Erben and Grimeisen A990) are beyond this paper. Most of the problems posed in the survey paper Musial A991) are still open, so I do not formulate new ones. 1. Preliminaries Throughout (?,?,?) is ? finite complete measure space with ? ? 0. If not stated differently, the conjectural measure space will always be denoted by (?.?.?). ?+ is the collection of all measurable sets of positive measure, ??(?) is the ?-ideal of ?-null sets and ?* is the outer measure generated by ?. ? will denote a lifting on (?, ?,?) (for the detailed information concerning liftings see Strauss, Macheras and Musial B002) in this Handbook). ?o(M) is the space of all real-valued measurable functions on (?,?,?), functions which coincide ?-a.e. are not identified. ?? is the topology of pointwise convergence in the space of all real-valued functions. C\ (?) is the space of all real-valued ?-integrable functions. A set ? с С ? (?) is uniformly integrable if limM(A)^0supye? jA \f\d? = 0. С is the ?-algebra of Lebesgue measurable sets on the unit interval [0, 1] or on the real line and ? is the Lebesgue measure on C. V(S) denotes the collection of all subsets of S. If Q is a non-empty set with a topology T. then the ?-algebra of Borel sets on Q is denoted by Bo(Q. T). Ba(Q. T) is the ?-algebra of Baire sets. Compact spaces are always assumed to be non-empty and Hausdorff. X is always an infinite dimensional Banach space (unless otherwise stated), B(X) is its closed unit ball and X* is the conjugate of X. If ? с X then ?? denotes the annihilator of ? in X*. If L с X then L** is the ?(?**. X*)-closure of L in X** and linL is the linear space generated by L in X, If /;? —>¦ X, then the composition of / with a functional x* is denoted as x* f or (x*, f). We say that X has Mazur's property if each sequentially weak*-continuous functional on X* is in X. X has the property (C) or X is a Corson space if each family of closed convex subsets of X with the countable intersection property has a nonempty intersection (see Corson A961), Pol A980) and Drewnowski A986)). X has the property (K) if each sequence of points x„ e X converging to zero contains a subsequence (x,n) with convergent series Y^L\ х,ц. By BC(X**) we denote the set of all x** e X** which are weak*-cluster points of countable subsets of B(X). If у : ? —>· X is a measure, then we say that у is ?-continuous if 1нп//(д)^о ?|?>(?)|| = 0. у is dominated by ? if there exists ? > 0 such that ||?>(?)|| ^ ??{?) for all ? e ?. As a rule we will assume in the proofs that ? = 1. |?| denotes the variation of у and cabv^, X) is the Banach space of all ?-continuous X-valued measures of finite variation endowed with the variation norm. са(ц. X) is the Banach space of all ?-continuous X- valued measures equipped with the semivariation norm. If у : ? —> X is a ?-continuous measure, then for every ? e ?+ the set ? v(F) ] A.(E) := I -J—-: F e ?? and F с ? is said to be the average range of у on E.
Pettis integral 535 AXIOM L (cf. Fremlin and Talagrand A979)). [0. 1] cannot be covered by less then the continuum closed sets of the Lebesgue measure zero. AXIOM К (cf. Fremlin and Talagrand A979)). There is a cardinal к possessing the following properties: (a) there is a set U с [0, 1] of cardinality к such that k*(U) = 1; (b) [0, 1] is not a union of к Lebesgue negligible sets. It is known (cf, Fremlin and Talagrand A979)) that Axiom L is a consequence of Martin's Axiom and Axiom К follows from the existence of real measurable cardinals, 2. Measurable functions Definition 2.1, Let Г be a linear total subset of X*. A function f :? -+ X is said to be ?-scalarly ?-measurable, if x* f is ?-measurable for each x* e ?. If ? = X*, then / is called scalarly ?-measurable. If X = Y* and ? = ? then / is called weak* scalarly ?-measurable. f is strongly measurable if there is a sequence of X-valued simple functions /„ = 53/J'i xihXE,„ with ?,„ e ?, which is ?-a.e. converging to /. If ? is fixed the reference to it will be suppressed. This will concern also all further definitions. The following theorem, due to Pettis A938), explains the relationship between the strong and weak measurability. Theorem 2.2 (Pettis's measurability theorem). A function f-.?^? is strongly ?-measurable if and only if (i) / is scalarly ?-measurable, and (ii) / is ?-essentially separably valued, i.e., there exists ? e ??(?) such that f(? \ ?) is a separable subset of X. DEnNlTlON 2.3. We say that two ?-scalarly ?-measurable functions /. g : ? -> X are ?-scalarly ?-equivalent if x* f = x*g ?-a.e. for each x* e ?. If ? = X*, we say about scalarly ?-equivalent functions and, if X = Y* and ? = ? then / and g are said to be weak* scalarly ?-equivalent. Two strongly measurable functions / and g are ?-equivalent if f = g M-a.e. Below are presented some classical examples of ?-measurable functions. EXAMPLE 2.4. A scalarly measurable function that is not strongly measurable but is scalarly equivalent to a strongly measurable function. Let a : [0, 1 ] —> [0, oof be a function such that the set ? := {t e [0. 1]: a(t) > 0} is of positive Lebesgue outer measure and let [et: t e [0, 1]} be the canonical basis for the nonseparable Hubert space Ь([0. 1]). Define fa : [0, 1] -> /t([0, 1]) by fa(t) = a(t)e,. It is a consequence of the Riesz Representation Theorem that x* f = 0 ?-a.e. for each x* e Ь([0, 1])* (i.e., / is scalarly ?-equivalent to
536 A". Musial the zero function). On the other hand, if ? с Я is of positive outer measure then fa(E) is nonseparable. In virtue of Theorem 2.2, fa is not strongly ?-measurable. If a is non- measurable then also the function ||/„||: t —>¦ ||/a@|| is not measurable. As a specific example one can take a = ?\>, where V is the Vitali set. EXAMPLE 2.5 (Ryll-Nardzewski). A weak* scalarly measurable function that is not scalarly measurable and not weak* scalarly equivalent to any scalarly measurable function. Define /: [0, 1] -> C*[0.1] by f(s) = <5S. / is obviously weak*-X-measurable, since for у e C[0, 1], we have (y, f(s)) = y(s). To see that / is not scalarly measurable denote by ?? the atomic part of ? e C*[0. 1], and let V be a ???-?-measurable subset of [0, 1]. Define x** e C**[0, 1] by ?**(?) = ?,,(?). Since x**(f) = ?? the function / is not scalarly measurable with respect to ?. Since C[0, 1] is separable, / is not weak* scalarly equivalent to any scalarly (hence also strongly) measurable function. It is worth to notice that the norm of / is a measurable function. EXAMPLE 2.6 (Hagler). A scalarly measurable function that is not scalarly equivalent to a strongly measurable one. Let (A„) be a sequence of nonempty subintervals of [0.1], such that: (i) A, =[0,1], (ii) A„ = Ain U ??„+? for each ? e N, (iii) А; П Aj = 0 if /'? j and 2" ^ij< 2"+1. (iv) ???„?(?„) = 0. Define /: [0, 1] —>· ?? by /(f) = (Хд„@) fori e [0, 1]. Then / is scalarly measurable (cf. Diestel and Uhl A977)). To prove that / is not scalarly equivalent to a strongly measurable function it is enough to show that / itself is not strongly measurable (because l\ is separable). This immediately follows from Pettis's Measurability Theorem. Indeed, if ?(?) > 0 and t. s are two distinct points of ? then there is ? such that t e A„ buts ^ A„. Hence ||/(f) — f(s)\\ = 1. One can ask when a scalarly measurable function is scalarly equivalent to a strongly measurable one. The first global non-trivial result of this type is due to D.R. Lewis (see Stegall A975/76a, 1975/76b)), who proved that scalarly measurable functions taking their values in a WCG space are scalarly equivalent to strongly measurable functions. Edgar A977) has undertaken an effort to characterize the Banach spaces with the property that each X-valued scalarly measurable function is scalarly equivalent to a strongly measurable function. In case of a single function we have THEOREM 2.7 (Edgar A977)). Let f: ? -> X be a scalarly measurable function. Then, f is scalarly equivalent to a strongly measurable function if and only if the image measure /(?): Ba(X,weak) —>· R is tight in the weak topology (i.e., for each ? > 0 there is a weakly compact W С X with ?*(\?) > ?(?) — ?). As a direct consequence of the above theorem, one obtains:
Pettis integral 537 THEOREM 2.8 (Edgar A977)). Let X be a Banach space. Given any (?, ?,?), each scalarly measurable f:? —> X is scalarly equivalent to a strongly measurable function if and only if (X,weak) is measure compact (i.e., each measure on (X,weak) is ?-additive, what in the case of the weak topology means the tightness of each measure, that is given ? > 0 there is a norm compact К С X with ?*(?) > ?(?) — ?). ????????? 2.9. A function f-.? -> ? is ?-scalarly ?-bounded provided there is ? ^ 0 such that for each x* e Г the inequality \x*f\ ? М||л:*|| holds ?-a.e. If ? = X* then we say about scalarly ?-bounded function, and in the case of X = Y* and ? = Y- about weak* scalarly ?- bounded function. An easy calculation proves that if /: ? —>¦ X is strongly measurable and scalarly bounded, then it is bounded (i.e., there is ? > 0 such that sup{||/(a>)||: ? e ?} ^ ? ?-а.е.). The following fact (usually presented in the context of a family of measurable scalar functions) permits often to reduce the general situation to the case of scalarly bounded functions. PROPOSITION 2.10 (cf. Musial A979)). // f-.? -> X is ?-measurable then there exists a non-negative measurable function ??, with the following properties: (i) For each x* e ? we have \(x*. /(?))| ;??^(?)\\?*\\ ?-a.e., (ii) ??(?) ? \\f(co)\\r (=sup{|<**, f(co))\: x* e Г П B(X*)}) ?-a.e., (iii) If ф : ? —> [0, oo) is a measurable function satisfying (i) and (ii) (with ?^. replaced by ф), then ??, ?. ф ?-a.e. PROOF. Consider the set ? ? R endowed with the ?-algebra ?(? ? ?), and the product measure ? ? ?, where к is any probability measure on С such that ??(?) = ??(?). Let S(x*) = {(?,?): \(x*,f(w))\ > л||.г*||} for x* e Г, and let a = sup{(M ? КЮТ=\ S(x*)l х*еГП B(X*), ?? e ?}. Since a < oo there are x*, х*_,...еГП В(Х*) such that a = (? ? k)[U,?Li s(x*)]. Now, it is enough to put <p^ = sup,, \x*,f\, where the supremum is taken pointwise. ? COROLLARY 2.11. /// : ? —>¦ X is ?-scalarly measurable, then there exists a sequence ofpairwise disjoint sets E„ e ? covering ? and such that for each ? the function f is ?-scalarly bounded on E„. It is easy to give an example of a scalarly bounded function that is not bounded (e.g., setting a(t) = t in Example 2.4). It turns out however that the things can be more complicated. Edgar (see Talagrand A984)) proved that there exists a scalarly bounded function which is even not scalarly equivalent to any bounded function. Weak*-scalarly bounded functions behave much better. To formulate the result we will introduce a notion that will be in constant use further.
538 К. Musial Let / be an X*-valued function which is weak* scalarly measurable and weak*-scalarly bounded. If ? is a lifting on L,c(M) then we denote by po(f) the unique X*-valued function defined by (x,p0(f)(w)): = p(xf)(w), where ? and ? run across X and ?, respectively. It is known (see A. and С Ionescu Tulcea A969)) that po(f) is weak*-Borel measurable (i.e., /o(/)_l (?) e ? for all weak*-Borel В с X*) and the measure ?? : = ????/)-1 is a Radon measure on the completion ?A of the ?-algebra of weak*-Borel subsets of X* (see von Weizsacker A978) and Edgar A978)). It is however totally not obvious whether po(f) is in general scalarly measurable. PROPOSITION 2.12. If f-.? -> X* is weak*-scalarly bounded and weak*-scalarly measurable, then for an arbitrary lifting ? the function f is weak*-scalarly equivalent to the bounded and weak* Borel measurable function po(f). 3. Scalar integrals, basic properties Definition 3.1. Let ? be a linear total subspace of X*. A function /: ? -> X is ?- scalarly ?-integrable if x*f e L ? (?) for each.v* e T.If ? = X*, then / is called scalarly ?-integrable, and in the case of / : ? -> X* and ? = X с X**, the function / is said to be weak*-scalarly ?-integrable. DEFINITION 3.2. A ?-scalarly ?-integrable f-.? -> X is ?-?-integrable if for each ? e ? there exists v/(?) e X such that ?*??(?) = ??*/?? for each jc* e ?. The set function v/¦: ? —>¦ X is called the indefinite Г-integral of / with respect to ?, and ? f(E) is called the Г-integral of / over ?el with respect to ?. An X*-integrable function is called Pettis ?-integrable and an X-integrable function (if / : ? -> X* and ? = X) is called vveaA:* ?-integrable (or Gelfand ?-integrable). The Gelfand integral of / will be denoted by *v/. If f-.? -> X is considered as an X**- valued function then its weak* integral in X** is called the Dunford integral and it is denoted by v**. It is clear that each Г-integral is uniquely determined and it is an additive set function (provided it exists). The Г-integral is also a ?-measure, i.e., x*v( is ?-ad- ditive for each x* e ?. Sometimes, we shall use the following notations: ? - fF f ??, weak* - fE f ?? and D- fEf ??. If f-.? -> X is scalarly ?-integrable, then an operator Tf.X* -> ?.|(?) associated with / is defined by Tf{x*) = x* f. From the integral point of view, the functions with the same indefinite Pettis integrals are non-distinguishable, they are scalarly equivalent. We shall denote by ?(?, ?) (or
Pettis integral 539 by ?((?, ?,?),?) if necessary) the space of classes of scalarly ?-equivalent Pettis ?-integrable X-valued functions. ?(?. ?) is a linear space with ordinary algebraic operations. One defines a norm on ?(?. ?) by |/| =sup\ f \(x*. /)\?·?: л* e B(X*)\ =sup{\X*vt\(Q): .x* e B(X*)\. It is known that ill f 1 sup I / / ?? : ? e ? J \\\Je I defines an equivalent norm on ?(?, ?). We will investigate also the space of all Pettis integrable functions with norm relatively compact range of their integrals: ?, (?, X) : = {/ e ?(?, ?): ?/(?) is norm relatively compact}. Besides ?(?, ?) we will be often considering its subsets ?(?, ?) consisting of functions taking their values in a set К с ?. It is one of the main problems in the theory of vector integration to find conditions guaranteeing the existence of the Pettis integral.We shall start with two classical results. PROPOSITION 3.3 (Gelfand A936)). Each weak* scalarly ?-integrable /:? -> X* is weak* ?-integrable. As an immediate corollary we get the following fact PROPOSITION 3.4 (Dunford A937)). Each scalarly ?-integrable function f :? -> X is Dunford ?-integrable. If X is reflexive then the Dunford and Pettis integrals coincide. When X is not reflexive, this may not be the case. EXAMPLE 3.5 (cf. Diestel and Uhl A977)). A Dunford integrable function that is not Pettis integrable. Define / : @, 1] —> c\) by /@ = BХC-1.||@,22ХB-:.2-.|(г),..-.2',хB-я.2-^1|(г),.-.)- If jc* = (??,«2,...) e /| = c*, then x*f = ?^??2"?0~„ 2_„*i| and / \x*f\dX ? ]? |?„| < oo. ,7=1 ^0 /7=1 It follows from Proposition 3.4 that / is Dunford ?-integrable. On the other hand, it is easily seen that for each ? e С D- [ fdk= {2k(E П B-', 1]) 2"?(? П B"", 2-"+l]),. -.}.
540 A". Musiat In particular D- f /<Я = A,1,1,...,1.-..)^С(, ??.?] and so / is not ?-Pettis integrable. At this place one should recall that such a phenomenon cannot happen if X is a separable Banach space not containing any isomorphic copy of cq. Unfortunatelly, the result does not extend in this form to non-separable Banach spaces (cf. Edgar A979)). THEOREM 3.6 (Dimitrov A971), Diestel A973)). If X is a separable Banach space without an isomorphic copy o/co, then each X-valued Dunford integrable function is Pettis integrable. The function fa considered in Example 2.4 is Pettis integrable with vfu =0. But there are also non-trivial examples of Pettis integrable functions. Example 3.7. Let / be the function considered in Example 2.6. Since ||/(r)ll ^ 1 everywhere, / is Dunford integrable and for each ? e С and ? = ?\ + ?? e l^c = ? ®co> we have DC j' ???? = j m(f)dk+ { rtf)dk = j ???„?\{{?})?? n= I DC = ]Г щ ({?})?(? ПА„) = (???, (?(? П ?,,))) = (?. (?(? П ?,,))). The last equality follows from the fact that lim„ ?( A„) =0 and so v(E) = {?(??\ A,,)) e C(>. But /ji considered as a functional on ?? belongs to c(| and so ???(?) = 0 for each ? e ?. It follows that / is Pettis ?-integrable. In case of a quite arbitrary ? nobody has been too much interested in describing the properties of the ?-integral. The most interesting case is when ? = X*. THEOREM 3.8. /// is Pettis ?-integrable, then v/ is a ?-continuous measure ofa-finite variation. Moreover, \vf\(E) = fF<pfdp for each ? e ? (we put here ? / instead of ?* for simplicity). In particular, the collection [x*f: ||.v*|| ^ 1} is uniformly ?-integrable. The ?-additivity and continuity of v/ is due to Pettis A938). The ?-finiteness of the variation was proved by Rybakov only in 1968. Remark 3.9. A result similar to that in Theorem 3.8 for an arbitrary total Г с X* is false. If ? is norming (i.e., for each ? e X the equality ||jc|| = sup{|(jc*,.r)|: ||л-*|| ^ 1,
Pettis integral 541 x* e ?} holds) and / is ?-integrable, then |v/1 is a ?-finite measure and \vf\(E) = §???(?? for all ? e ? (see Musiat A979)). In particular, if f-.? -> X* is weak*- bounded and satisfies for a lifting ? the equations xf = p(xf) for all ? e X, then | v/ |(E) = /? 11/11 ?? for all ? e ?. This clearly generalizes the well known equality for the Bochner integral. It may however happen that u/ is not countably additive in the norm topology of X, and it is not ?-continuous. It is so for the function presented in Example 3.5, when considered as an /^-valued function. Vf : С —>¦ lx is weak*-countably additive but not countably additive in the norm topology and not ?-continuous. 4. Pettis integral We shall start describing conditions equivalent to the Pettis integrability with a classical result that is always a starting point, when one wants to find conditions guaranteeing the Pettis integrability of a single function. THEOREM 4.1. Let f : ? —>¦ X be scalarly integrable. Then f is Pettis ?-integrable if and only ifT/ : X* —>¦ L\ (?) is weak*-weakly continuous ifandonly ifTf : B(X*) —>¦ L\ (?) is weak*-weakly continuous. The following corollary is a simple consequence of Theorem 4.1. COROLLARY 4.2. A scalarly integrable f :? —>¦ X is Pettis integrable if and only if the set {x* e X*: x* f = 0 ?-a.e.} is weak* closed. COROLLARY 4.3. // f-.? -> X is Pettis ?-integrable, then Tf.X* -> ? ? (?) is weakly- compact and the set Zf := {x* f: \\x* || ^ 1} is weakly closed in L \ (?). To formulate the next characterization of Pettis integrable functions we are going to use an idea which is a generalization of Huff's A986) conception of separable-like functions. Dehnition 4.4. A scalarly measurable function f-.? -> X is determined by a space ? с X if for every x* e Y^ the equality x* f = 0 holds ?-a.e. The characterization presented below has been discovered by Drewnowski A986) (without (iii)). Stefannson A992) proved then the equivalence of (i) and (iii). THEOREM 4.5. ///: ? —>· X is scalarly integrable and the induced operator Tj : B(X*) —> L\ (?) is weakly compact, then the following conditions are equivalent: (i) /eP(M,X); (ii) / is determined by a weakly compactly generated space ? ^ X; (iii) / is determined by a space ? С X possessing Mazur's property; (iv) / is determined by a Corson space ? С Х.
542 A". Musial PROOF (Sketch). If / e ?(?, X), then one can take as ? the space generated by ?/(?). Since each WCG space has Mazur's property (cf. Diestel A975, p. 148)) and is a Corson space, we have (ii) =>· (iii) and (ii) =>· (iv). To prove (iii) =>· (i) we are going to use a simple idea of Stefannson. Let /: ? —>¦ X be scalarly measurable and determined by a weakly compactly generated ? с X. First denote by Ту the operator Ту : Y* —>¦ Ь\(ц.) defined by 7>(y*) := Tt(y*x]), where y*xl is an arbitrary extension of y* to the whole X. In a standard way one proves that Ту is weak*-weakly continuous if and only if 7/ is weak*-weakly continuous if and only if Ту is ?(?*. y)-weakly continuous. Since 7/ is weakly compact and / is determined by Y, one can prove, applying Mazur's theorem on the equality of weak and norm closures of convex sets that 7/ is sequentially ?(?*, Y)- weakly continuous. Hence Ту is sequentially ?(?*. y)-weakly continuous. Now, since ? has Mazur's property and Ту is sequentially ?(?*. y)-weakly continuous, we get the weak*-weak continuity of Ту and hence the Pettis integrability of /. For the proof of (iv) =>· (i) we refer to Drewnowski A986). ? Notice that each of the conditions (ii)-(iv) of Theorem 4,5 can be equivalently written as ??(?) с ?**, where ? is respectively WCG. Mazur or Corson. Example 3.5 shows that the weak compactness of Tf in the above theorem is necessary. To formulate more general result, we need a notion introduced by Drewnowski A986). Definition 4.6. If ?? is the collection of all Corson subspaces of X then the Corson envelope of X is defined by X:= (J Г*. YeCx Drewnowski A986) strengthened the implication (iv) =>· (i) of the above theorem to the following form: THEOREM 4.7. Iff-.?^? is scalarly integrable, T,¦ : X* -> L\ (?) is weakly compact and ?**(?)(??, then /еР(д.Х). Geitz A981) introduced a notion related to Rieffel's essential range of strongly measurable function (Rieffel A968)). Definition 4.8. Let /: ? -> X be a function and let ? e ? be an arbitrary set. The core of / over E, denoted by cor/¦(?), is the set defined by cor/(?):=P|{conv/(?\yV): ????(?)}. Geitz A981) also noticed that if/??(?,?). then for each ? e ?+ сот/(Е) = convA ; (E). The following lemma is quite useful:
Pettis integral 543 LEMMA 4.9. /// is weakly measurable and cor /(?) ?? for all ? e ?+, then x* f = 0 ?-a.e. if and only ifx* = 0 on cor/ (?). Then Andrews A985) introduced the weak*-core of a weak* scalarly measurable function /; ? —>¦ X*\ cor*f(E) := P|{conv*/(? \ ?): ? ???(?)}. Drewnowski A986) introduced the weak**-core (he called it the weak*-core) setting cor**(?) = Q{conv**/(?\yV): ? ???(?)} and proved that if / e ?(?, ?), then cor}*(?) = cbrTTEj" = Л~{Ё~)**. THEOREM 4.10 (Talagrand A984)). Let f :? -> X be scalarly integrable. Then f is Pettis integrable if and only if ?t :X* —>· ?,|(?) is weakly compact and cor / (E) ?^/ifor each ? e ?+ PROOF. Assume that Tf :X* —> ?.|(?) is weakly compact, cor/ (?) ? 0 for each ? e 2?+ and / ? ?(?, ?). Then, according to Lemma 5-1-2 of Talagrand A984) (which remains true for functions with weakly compact Tf), there exist functionals y*. .v* e B(X*) such that ?{? e ?: ?*/(?) ??*/(?)} > 0 and у*/ is in the pointwise closure of the set ??/:={?*/???:?*/ = ?*/?-?*.}. Let {x*} с W/ be pointwise convergent to y*f. Without loss of generality, we may assume that x* —>¦ y* in ?(?*. X). Since for every a we have x*f = z* f ?-a.e., we have .х*|сог;Ш) = r*|Cor,№)· Hence y*|a)r,(i2) = c*L-or,<i?) and consequently y*f - z*f ?-a.e. This however contradicts our assumption, and so / e ?(?, ?). ? Another proofs of the above result were presented by Edgar A982) and Huff A986). Certainly the above theorem has been inspired by earlier characterizations of Pettis integrability obtained by Geitz A982) in case of a perfect measure (that was a little bit more technical) and by Sentilles's A981) characterization (also for perfect measures only) of Pettis integrability in terms of the Stonian transform. Talagrand A984) formulated also an equivalent condition guaranteeing Pettis integrability of /, which has been then reformulated and reproved by Drewnowski A986) in the following form: THEOREM 4.11 (Drewnowski A986)). Let f: ? -> X be scalarly integrable. If сот*,*(Е) П ? ? Я for each ? e 2?+ and Tf is weakly compact, then f is Pettis integrable.
544 A". Musial ????????? 4.12 (Riddle, Saab and Uhl A983)). A family ? of real-valued functions on (?, ?, ?) has the Bourgain property if for each ? e 2?+ and each ? > 0 there is a finite collection fcpffjn ?+ such that for each function h eH one can find Fef with suph(F) - infh(F) < ?. In case of a uniformly bounded ? an equivalent definition is as follows: for each ? & ?+ and each pair a < b of reals, there is a finite collection ? с V(E) П .?7+ such that for each function h eH one can find F e ? with inf{/?(o>): ? e F) ^ a or sup{/?(o>): ? e F} < b. It is easy to see that the Bourgain property of ? yields the same property of the pointwise closure of ?. Moreover the Bourgain property of a function constrains its measurability. PROPOSITION 4.13 (Riddle, Saab and Uhl A983)). If ? satisfies the Bourgain property, then each function in ? is measurable and each function in the pointwise closure of ? is the almost everywhere pointwise limit of a sequence from ?. PROOF. In order to prove the proposition take / e HT<' and an ultrafilter WonH which has / as a cluster point. Then, put for ? e 2?+ and ? > 0 ?(?,?)= {heH:suph(E)-mfh(E) <?). Each ? e ?+ contains F e ?+ with H(F. ?) e U. Using the Zorn-Kuratowski lemma, one can find for each positive ? a maximal family Ve of pairwise disjoint sets in 2?+ such that H(F, ?)eW for each F eVF. It is obvious that ?(? \ \JFeVf F) = 0. Moreover, if ??? is the family of all finite subcollections of Ve, then /e f) f) ??(?,?)'' eU. KeK, EeK Now let for each m e N the sequence {A,„.„: ? e N} be an enumeration of V\/„, and let ?,„„ e AmM be arbitrary. Taking for each meNa function in m /mep|p|W(At.(I;l/fc) * = l n=l such that |/m(Wjt.„)- /(Wjt.„)| < 1/* for each 1 ^ k, ? ^ m, we get a sequence (/,„) с ? that is ?-a.e. convergent to /, D Dehnition 4.14. Let S be a topological space and let ? be a positive finite measure defined on a ?-algebra В containing all Borel subsets of S. ? is said to be selfsupporting, if for each В е В+ there exists A eV(B)C) B^ such that ?\ A is strictly positive (i.e., for each open U, we have t/nA = 0ort/nAe #+).
Pettis integral 545 It is easily seen that if ? is selfsupporting, then for each В e #+ there exists A e V (?) ? ?+ such that ?\? is strictly positive and ?(?) = ?(?). We shall write A = supp(M|S). It is well known that Radon measures are selfsupporting. Also if ? is a lifting on ?, Tp is the topology defined by taking as its basis, the family {A e ?: Ac p(A)} (cf. A. and С Ionescu Tulcea A969, p. 59)), then ? is selfsupporting with respect to Tp. PROPOSITION 4.15. Let S be a topological space and let ? be a uniformly bounded family of real-valued continuous functions on S. Moreover, let ? be a selfsupporting measure on ? ?-algebra В 5 Bo(S). Then, if ? does not contain any sequence equivalent to the standard unit vector basis of I \ (in the sup norm), then ? has the Bourgain property. PROOF. Suppose that ? does not have the Bourgain property. Then, there exists ? e BjL such that ? — supp^|7) and there exist a < b such that for each finite collection П с V(T) П B+ there is / e ? with inf f(R)<a and sup f{R) > b, for every R e K. We shall now construct inductively a collection {A„.,„: m = 1 2"; ? e N} of sets from ?+ and a sequence (/,,) с ? satisfying the following properties: A„+\.2m-\ UA„ + |1,„ С Anj„. fn+\(s)<a ifi e A„+i.2„,_|. fn+\(s)>b ifs e ?,,+ ?.:,,,. Assume, we have already constructed {/„,: m = \,...,k) and {A„.,„: m = 1 ,2"; ? = 1,... ,k]. By the assumption, we can find for each m e {1 2k] a set Tk.meB+nV(Ak.,„) such that Tk ,„ = 5ирр(д|7А.,„) (for the first inductive step set 7bi = 7o? = T). Moreover, there is fk + \ &H. with inf fk+\ (TkJn) < a and sup fk+\(Tk.m) > b for every m e {1,..., 2k}. Put now Ak+\.2m-\ = [s eTk,,„: fk+\(s) <a), Ak+\.bn = {s e Tkjn: fk+\(s) > b). It follows that Ak+\.m e B+ for every m e {1 2*+'}. Rosenthal's argument A974) shows that the sequence (/„) is equivalent to the standard basis of l\ in the sup norm. D Dehnition 4.16. Let К с X* be a nonvoid set. A set W с X is weakly precompact with respect to К (or weakly ?-precompact) if each bounded sequence in (x„) с W has a subsequence (x,n) such that for each x* e К the sequence (x*,x„k) is convergent. If К = B(X*), then W is called weakly precompact (equivalently, no bounded sequence in W is equivalent, in the norm topology, to the unit vector basis of/|).
546 A". Musial As particular instances of Proposition 4.15 we get the following results: COROLLARY 4.17. Let ? be a lifting on ?^(?, ?, ?) and let ? be a uniformly bounded family of real-valued functions defined on ? and such that ?. is weakly precompact in ^cc(M)· Then p{H) has the Bourgain property. Corollary 4.18 (Riddle and Saab A985)). If{xf: \\x || ^ 1} has the Bourgainproperty, then f e ?(?, X*) and the range ofvf is norm relatively compact. PROOF. Fix ? > 0, ? e ? and ?** e B(X**). Then put H=[(x,f):xe B(X). \(x** - .v. *vf(E))\ <?]. Clearly ?. has the Bourgain property. It follows from the Goldstine theorem that (x**, /) is in the pointwise closure of ?. and so (Proposition 4.13) there are x„ e ?, ? e N, such that {xn, /)eW for each ? and lim„U„, /) = (***, /) ?-a.e. Hence, we get <2? by the Lebesgue Convergence Theorem, and this proves that / e ?(?, ?*). The norm relative compactness of ?/ (?) — *v/ (?) follows from the properties of Tj: if {x„) is weakly convergent, then (Tf(x„)) is a.e. convergent (cf. Riddle, Saab and Uhl A983)). D COROLLARY 4.19 (Haydon A976)). Let X be a Banach space containing no isomorphic copy ofl\. If ? is a complete finite Radon measure on B(X*) equipped with the weak* topology, then the identity function on B{X*) is ?-Pettis integrable. PROOF. In view of Proposition 4.15 the family ? = B(X) has the Bourgain property. Hence Corollary 4.18 yields the Pettis integrability of the identity function. D Among several questions concerning Pettis integrability the following seems to be quite interesting: assume that / e ?(?. ?*) is weak*-scalarly bounded and ? is a lifting on ^(?). When is the function po(f): ? —>¦ X* Pettis integrable? The next result is due to Talagrand A984) but his proof is different from that presented here. PROPOSITION 4.20. Let f :? —>· X* be a weak*-scalarly bounded and weak*-scalarly measurable function. Assume that for each S > 0 there is ? e ? with ?(? \ ?) < ? and such that the set {(/. jr)/?-: ||л|| < 1} is weakly precompact in ?,^(?). Then Po(/) e ?(?, ?*). In particular, if X does not contain any isomorphic copy of l\, then й(/)еПд,П PROOF. The assertion is an immediate consequence of Corollaries 4.17 and 4.18. D
Pettis integral 547 As a consequence we get a result due to Riddle, Saab and Uhl A983) and deduced by them from Theorem 9.10 via Theorem 9.7. COROLLARY 4.21. Let X be a separable Banach space and let f: ? -> X* be a weak*- scalarly bounded and weak*-scalarly measurable function. Assume that for each S > 0 there is ? e ? with ?(? \ ?) < S and such that the set {(/, x)xe- \\x\\ < ? " weakly precompact in ?,^??). Then f e ?(?, ?*). THEOREM 4.22 (Macheras and Musiat B000)). Let f :? -> X* be a scalarly bounded Pettis integrable function. If each x** e X** is ^-measurable, then A)(/) e ?(?. ?*)- PROOF. According to Theorem 4.5 and the weak*-Borel measurability of A)(/) we have only to show that for an arbitrary с e ?/-B?)? the equality (z, A)(/)) = 0 holds true ?- a.e. Suppose that there exists с e ?/ (?)^ such that ?{? e ?: (?, ?)(/)(?)) > 0} > 0 and ||z|| = 1 and let K°f be the the weak*-closureof ft)(f)№) in X*. Then, since ?0 is aRadon measure and ??(?°.) = ?(?), there exist a weak*-compact set L с {.?* e AT^-. (z.x*) > 0} and a positive real number a such that ?o(^)>0, г is continuous on L and (c, .x*} > a for each x* e L. Take now an arbitrary net (ха)аел in B(X) that is ?(?**, ?*)-convergent to с Then, the convergence to ? is uniform on ?/ (?). To avoid unnecessary complications, we assume at once that the initial net (ха)аел is Mackey convergent to с Then, for each ? e N there exists an e A such that \{xa,vf(E))\ < l//j for all ? e ? and all a ^a„. If A„ := {a e A: a ^ a,,}, then for each collection of points x* .v* e i, there is an index or.-* . * e A„ such that \(?,?*)-(???, ,...v,*}| < 1/" foreach/'^/j. Equivalently, L"ci{J{x*:\(z.X*)-(xa.x*)\<l/n}". aeA„ Now, as a consequence of the compactness of L and the continuity of z\L, there exists a finite set B„ с А„ such that the inclusion L"c (J{**: \{z,x*)-{xa.x*)\<l/n}n aeB„ holds true. It follows that z\L is a pointwise cluster point of the countable set {.va|L: xa e U^i S,,}. Consequently, there exists x** e X** that is a weak*-cluster point of the
548 К. Musial set {xa: a e Ц^, ?,,} and x*f\L = z\L. It follows from the construction of x™ that Xq* e Vf(EI and so f (x?*,po(f))d?>a?{po(f)-\L))>O = {x**.v/{p0(f)-](L))). J^frHL) On the other hand, one can easily show that if /: ? —>¦ X* is a scalarly bounded Pettis integrable function, then [x** ,?/(?)) = ?(?**,??{/))?? for each x** e BC(X**) and each ?el This gives the required contradiction. ? The next corollary is part of Proposition 4.20 but can be also derived from Theorem 4.22 if one takes into account Haydon A976) who proved that if l\ <? X isomorphically, then each functional x* e X* is universally measurable with respect to the weak* Borel structure of X* (this is in fact contained in the assertion of Corollary 4.19). COROLLARY 4.23. If l\ <? X isomorphically and f-.?^-?* is weak* scalarly measurable and weak* scalarly bounded, then for each lifting ? the function po(f) is Pettis integrable. Trying to overcome difficulties met while characterizing Pettis integrable functions Fremlin imposed an additional requirement besides the scalar integrability. The property was called by him "proper relative compactness" (Fremlin A982)) and then "stability" by Talagrand A984). Definition 4.24. Let ? be a collection of real valued functions defined on ?. ? is said to be stable if for each A e 2T| and arbitrary reals a < ? there exist k, I e N satisfying the inequality l4+i ( U (/ < «}* x (/ > ?)' n ?"+?) < H(A)k~ where ?*+/ is the direct product of к +1 copies of ?. One may assume in the above definition that к = I. Moreover (Talagrand A984)), if ? is stable and pointwise bounded then it is pointwise relatively compact in ??(?) and its pointwise closure is also stable. Definition 4.25 (Talagrand A984)). A function / : ? -> X is properly measurable if the set Zf:=[x*f: |И|<1} is stable.
Pettis integral 549 THEOREM 4.26 (Talagrand A984)). // f :? -+ X is properly measurable and Zf is uniformly integrable, then f e ?(?, ?) and the range of ? f is norm relatively compact. If ? is perfect, Axiom L holds and B(X*) is weak*-separable, then each scalarly measurable f: ? —> X is properly measurable (and hence it is Pettis integrable if Zf is uniformly integrable). PROOF (Sketch). We have to prove only that the map x* -> x*f from the unit ball of X* into L\(?) is weak*-norm continuous. Let (x*f)ses be a net that is pointwise convergent to x*f and ||jc*|| ^ 1 for all s e S. According to Talagrand A984, Theorem 9-5-2), the net (x*f)ses is convergent to x^f in ?-measure. Because of the boundedness of the net in ?,^?) the Lebesgue dominated convergence theorem may be applied, yielding the convergence in the norm of L \ (?). The second assertion is a direct consequence of Axiom L (Talagrand A984, Theorem 9-3-3)). D In case of a weak*-scalarly bounded and weak*-scalarly measurable function /: ? —> X* we have the following two results: THEOREM 4.27. Let ? Cl X be a bounded norming subset of X and let f : ? -> X* be a function. If each countable subset of the set {xf: ? e ?} is stable then for each lifting ? on Loc(?) the function po(f) is an element of ?(?. ?*) and the range of its Pettis integral is norm relatively compact. If ? is consistent, then po(f) is properly measurable. PROOF. If ? is a lifting, then each countable subfamily of {xpo(f): ? e ?} is stable. If moreover ? is consistent, then the whole collection {xpo(f): ? e A] is stable (see Musial B000)). The Pettis integration of po(f) and the norm relative compactness of its Pettis integral are now consequences of Theorem 4.26. Let now ? be an arbitrary lifting on Loc(?). Then again each countable subfamily of {xao(f): ? e ?} is stable. Due to the countability of the set we may apply Lemma 9-3-2 of Talagrand A984) to get the stability of each countable subset of ? with respect to ???(/)~' defined on the completion of weak*-Borel subsets of X* with respect to ??o(f)~]. But as all elements of ? are weak*-continuous, we may apply Theorem 9-4-2 of Talagrand A984) obtaining the ???(/)_? -measurability of all elements of X** (notice that in the proof of the implication (c) =>· (a) of 9-4-2 Axiom F is not applied). Since ??(/) is weak*-scalarly measurable, we get the weak scalar measurability of ??(/). The Pettis integrability of ??(/) follows now from Theorem 4.22. Since the Pettis integral of ??(/) coincides with that of po(f), this completes the whole proof. D THEOREM 4.28 (Talagrand A984)). (Axiom L) Let (?, ?, ?) be perfect, f : ? -+ X* be a function and let ? с X be a bounded norming set. Moreover, let A a be the union of the weak*-closures in X** of countable subsets of ?. Then the following conditions are equivalent: (a) For each x** e A a the function л**/ is measurable; (b) If ? is a consistent lifting on Lr^(?), then po(f) is properly measurable;
550 К. Musial (c) If p is a lifting on ?,^??), then po(f)e ?(?. ?*) and the range of its Pettis integral is norm relatively compact: (d) If ? is a lifting on Loc(?), then po(f) is scalarly measurable. PROOF. If ? is perfect, Axiom L holds true and (a) is satisfied, then each countable subset of {xf: ? e ?] is stable by Talagrand A984, Theorem 9-3-3). ? 5. Limit theorems It is the purpose of this section to prove the convergence theorems of Vitali and Lebesgue type for the Pettis integral. The theorems had been proved first by Geitz A981) under the assumption of perfectness of the basic measure space and then they were proved in full generality by Musial A985). Geitz's proofs were based on a James's characterization of weakly compact subsets of a Banach space (see James A964)) and on Fremlin's subsequence theorem (see Fremlin A975). Applying the classical Mazur theorem on the equality of weak and norm closures of convex sets instead of Fremlin's theorem I was able to get rid of perfectness of the measure space (Musial A985)). Below I am presenting still another proof based on a Grothendieck characterization of weakly compact sets that seems to be more elementary than James's characterization (cf. Holmes A975, p. 157 for a simple proof)). In fact one can obtain the result directly from Theorem 4.5 but the proof presented here seems to be more exciting. PROPOSITION 5.1 (Grothendieck A952)). A bounded nonvoid set W С X is relatively weakly compact if and only if for every two sequences (v„) С W and (x*) С В(Х*) the equality lim lim(x* ,?,? — Hmlim(r* . .v„) П 111 ' 111 ?7 ' holds true, provided all the limits exist. The theorem we are going to present is an analogue of Vitali's convergence theorem. Conditions (a) and (b) of this theorem guarantee that for each x* e X* and ? e ? the sequence {$??*^??: ? e ?} is convergent to fEx*fdp , and that the set {x*f: x* e B(X*)} is weakly relatively compact in ?.|(?). They may be replaced by any others guaranteeing the above weak compactness and the convergence of the appropriate sequence of scalar integrals. THEOREM 5.2 (Geitz, Musial). Let f :? -> X be a function. If there exists a sequence of Pettis integrable functions /„ : i? —>¦ X such that: (a) The set {x* f,: \\x*\\ ^ 1, ? e N} is uniformly ?-integrable, (b) lim„ x* fn =x*f in ?-measure, for each x* e X*, then f is Pettis ?-integrable and, lim / ?„??= \ f ?? weakly in X,for each ? e ?.
Pettis integral 551 PROOF. Fix ? e ?. Since the classical Vitali convergence theorem yields the convergence lim„ fEx*f„dp = fEx* f ??, for each ? e X*. we see that the sequence (fE f, ??) is weakly Cauchy. In order to prove our assertion, it is sufficient to show that given (/„,„)», and {x\)k С B(X*), we have a := limlim/^*, / /„,„ ??) = ? := limlim/.v^, / /„,„ ??? provided all the limits exist. It follows from (a) and (b) that the sequence (.v^, /)* is uniformly integrable and bounded in L\ (?). Hence it is weakly relatively compact. This yields the existence of a subsequence (z*k)k of (xf)k and a function h ??-?(?) such that z*kf ^ h weakly in ?,|(?). Mazur's theorem yields now the existence of functionals wf e conv{~*: /' ^ k} such that lim / \w*kf - ??\?? = 0 and limu^/ = h ?-a.e. If Wq is a weak*-cluster point of (wf)k, then ? = u>(*/ ?-a.e. Consequently, <* = wofdV- On the other hand, we have limU*. J . /«„ ??\ = lim/;*. ? /„,„ ??) = lim/n*. j /„„, ?? = (»'()- ( /«„ ^М) = J >'?· ??,„)??. It follows from the classical Vitali theorem that ? = \??[(??*0,?„,)?·?= /"?,/^?. '" Je Je This proves the required equality a = ?. ? As an immediate consequence of Theorem 5.2, we get the following generalization of the classical Lebesgue Dominated Convergence Theorem. THEOREM 5.3 (Geitz, Musial). Let f : ? -> X be a function satisfying the following two conditions: (a) There exists a sequence of Pettis ?-integrable functions /,,:i2 —>¦ X such that lim„ x*fn = x* f in ?-measure, for each x* е X*, (?) There exists h e ?.|(?) such that for each x* e B(X*) and each ? e N, the inequality \x*f,\ < h holds ?-a.e. (the exceptional sets dependon x*).
552 A". Musial Then f e ?(?, ?) and lim / fndix= \ ???. " Je Je weakly in X, for all ? e ?. PROPOSITION 5.4. Assume that the assumptions of Theorem 5.2 or 5.3 are satisfied. Assume moreover that ? is perfect and the functions f, are scalarly bounded. Then Hindoo /,, = / weakly in ?(?, ?). PROOF. According to Collins and Ruess A983) if/? л:*/<*M^ fE x* f ?? for all ? e ? and all x* being extreme points of B(X*), then the sequence (/„) is weakly convergent ??/???(?, ?). ? It would be interesting to know whether the unit ball of X* in the assumptions of Theorems 5.2 or 5.3 may be replaced by its extreme points. It is natural to ask when the condition (a) of Theorem 5.3 is sufficient for the Pettis integrability of a function. If X contains an isomorphic copy of cq, then (a) is too weak to guarantee the integrability. Indeed, let / be the function considered in Example 3.5. With the same notation, we have ОС ? f = 2_ian2 "XB-.-2-"- /1=1 If /,(?) = B?B-,ц(г),..., 2"??-„_2-„~?,@,0,0,0,...) then clearly lim / ?*^?? = ? ?*f ?? " Je Je foralljt*eX*and Ее ?, but / i ?(?. ?) It turns out however that со is the only exceptional Banach space. THEOREM 5.5. Let X be without any isomorphic copy of cQ. If f: ? ^ ? is scalarly ?-integrable and there are functions f, e ?(?, ?) such that lim / ?*^??= / ?*f ?? " Je Je for all ? ?? and ?* e X*, then f e ?(?. ?) ??? lim / ???? e / f ?? weakly in X for all ? e ?.
Pettis integral 553 PROOF. In virtue of Corollary 2.10 the set ? can be decomposed into pairwise disjoint set ?„ e ?, such that for each ? e N and x* e X* the inequality |.?*/??„ I ^ «II-"*"* II holds ?-a.e. It follows from the comments preceding Theorem 5,2 that / is Pettis integrable on each ?„. Now it is sufficient to apply the series characterization of a Banach space not containing cq (Bessaga and Pelczynski A958)). ? Notice that if X is separable, then the assumption of scalar integrability itself yields the required assertion (Theorem 3,6), 6. The range of the Pettis integral For several years it has been an open question of Pettis whether for each Pettis integrable function the range of v/ is always norm relatively compact, Fremlin and Talagrand A979) gave a negative answer to that question, Before providing their example let us first present a positive result of Stegall (see Fremlin and Talagrand A979)), THEOREM 6,1 (Stegall), If ? is perfect then for each scalarly integrable f : ? -> X such that Tf is weakly compact, the range ?**(?) of the Dunford integral of f in X** is norm relatively compact. PROOF. We may assume that / is scalarly bounded. It is obvious that for each ? e ? the equality ??-?? = v*f*(E) holds, so in order to prove the norm relative compactness of vj( ?) it is sufficient to show the compactness of Tt. To do it, choose any sequence (x„) с B(X*). Since / is scalarly measurable, (x*f) has a subsequence (x*Iltf) that converges a.e. Otherwise, we could apply Fremlin's subsequence theorem (Fremlin A975)), to get a subsequence without measurable cluster points, in the space of all real-valued functions endowed with the topology of pointwise convergence. If x* is a weak* cluster point of {(x* )), then x*nf -> x*f pointwise, and hence in L ? (?), because of the Lebesgue theorem. Thus, Tf is compact, and the assertion is proved. ? Since perfectness of the basic measure space in not necessary for a Pettis integral to have norm relatively compact range there is an obvious question when it can happen. It turns out that the answer depends on approximation of Pettis integrable functions by simple functions. THEOREM 6.2 (Musial A985)). If f e ?(?, ?), then vt (?) is norm relatively compact if and only if f is a limit of a sequence of X-valued simple functions, in the norm topology ?/?(?, ?). We say that X has the ?-PCP (Pettis compactness property) if for each / e ?(?, ?) the set ?/(?) is norm relatively compact. If this property is satisfied for an arbitrary (?, ?, ?), then X has the PCP No general description of PCP is known but there are interesting partial solutions. In particular we have the following result of Talagrand:
554 К. Musiai THEOREM 6.3 (Talagrand A980)). (MA) If l^ is not a quotient of X, then X has the PCP. Talagrand A980) presents also an example of X possessing PCP and having lx among its quotients. More can be said about the following stronger property. DEHNITION 6.4. A set 0 ? К с X has the ?-Compact Range Property (CRP) if every X-valued ?-continuous measure of finite variation with its average range contained in К has norm relatively compact range. The global property in this context (i.e., CRP of X or, equivalently, of B(X)) has been studied by Musiat A979). Then it was localized by Riddle, Saab and Uhl A983). They called К to be a set of complete continuity. Definition 6.5 (Riddle, Saab and Uhl A983)). A subset ? ? 0 of a Banach space X is called a ?-weak Radon-Nikodym set (respectively, a ?-weak**-RN set) if for every ?- continuous measure of finite variation у : ? —>¦ X satisfying for all sets ? e ? the inclusion Av(E) с К there exists a ?-valued (respectively, a AT**-valued) Pettis integrable density of у with respect to ?. In the above definitions the ?-continuity may be replaced by ?- domination. If AT is a ?-weak Radon-Nikodym set for all finite complete ?, then it is called a weak Radon-Nikodym set. Similarly for ?-weak**-RN set. Sometimes we will say also that К has the weak (or weak**) Radon-Nikodym property. If К = B(X) then we say about the weak RNP (or weak** RNP) of X. WRNP was introduced in Musiai A979) and W**RNP by Janicka (see Musiai A980)). It is an immediate consequence of the complementability of X* in X*** that X* has WRNP if and only if it has W**RNP Remark 6.6. L? [0, 1] is an example of a Banach space without the W**RNP Indeed, L,[0, 1] is complementable in L**[0, 1], so the W**RNP of L,[0, 1] would imply the RNP of the space, and it is well known that L | [0, 1 ] does not enjoy the last property. Since each measure space can be embedded as a thick subset of a perfect measure space (cf. Musiai A979)), we get the following conclusion from Theorem 6.1: THEOREM 6.7 (Musiai A979) in case of К = X). If К has the W**RNP (in particular WRNP), then К has also the CRP. In case of the separable range of Vf the complete characterization is given by the following theorem that has been obtained independently by Talagrand A984) and Musiai A985): THEOREM 6.8. /// e ?(?, ?), then the following are equivalent: (i) {x*f: x* e B(X*)} is a separable subset of ^?(?);
Pettis integral 555 (ii) There exists ? ?-algebra ГС1 such that (?, ?,?\?) is separable, and f is scalarly measurable with respect to ?; (iii) There exists a sequence (/„) of X-valued simple functions, such that {x*f,'- " ? ?, ||**|| ^ 1} is uniformly integrable and for each x* e X* the sequence (x*f,)„eU is ?-a.e. convergent to x*f; (iv) There exists a sequence (/„) of X-valued simple functions, such that for each x* e X* the sequence (x*f,}ne'::. is convergent to x*f weakly in L\ (?); (?) у / (?) is a separable subset ofX. PROOF, (i) =>· (ii) Assume that the set {x* f\ x* e B(X*)} is separable. Then there exists a sequence (x*) in B(X*), such that {x*f: ? € ?} is dense in {**/: x* e B(X*)}. If ? is the ?-algebragenerated by all x*f and by ??(?) then, clearly ?\? is separable and each x* f is ?1-measurable. (ii) =>· (iii) Assume that / is scalarly measurable with respect to a separable measure space (?, ??,?\??) and let ? = ?{(?„: ? e ?}) с Д, be a countably generated ?- algebra that is ?\ ??-dense in ?0. Moreover, let ?„ be the partition of ? generated by the sets E\,..., E„. Put for each ? _ ^ Vf(E) }"-^ ?{?)?? Еел„ with the convention 0/0 = 0. Since {x*f: \\x*\\ ? 1} is uniformly integrable, this yields the uniform integrability of {x*f„: ? e N, ||jr*|| ^ 1}. As by the assumption ? is dense in ??, we have E(x*f\E) = x* f ?-a.e., and so lim„ x* f, =x*f ?-a.e.. (iv) =>· (v) The condition (iv) means that for each ? e ? the sequence (\>jn(E)) is weakly convergent to vj(E). Hence, y/ (?) is contained in the weak closure of the set U,^| ?f„(?) and the last set is separable, since the ranges of all vj„-s are finite dimensional. (v) =>· (i) Suppose that [x*f: x* e B(X*) is non-separable and take an arbitrary jc* e S(X*) and h\ e ?.^(?) such that {h\,x*f) = 1. Assume then that we have already constructed for an ordinal ? < ?\ a family {(x*. ha): a < ?] with the following properties: (a) x*eS(x*), (?) ha e ?-??(?), (у) **/elin{**/: ? < у} for each ? < ?, «>«.,«>=(: ¦;::;;;*: Since {x*f: x* e S(X*)} is non-separable, one can find Xp e S(X*) such that .v^/ ? linjjc*/: ? < /3}. Then, applying the Hahn-Banach theorem we get hp e ?.^(?) such that (hft,x*pf) = \ ma (hp,x*f) = \ ??? -?\ ? < ?. Consequently, we get a net {(**, /?„): a < ?\} satisfying (?)-(?) for all ?, ?, ? less than ? ?.
556 К. Musiat It follows that \T}(hp)-T}(ha)\>\ whenever ? < ?, and so the set 7",*(?-^(?)) is non-separable in X**. But 1??{??: ? & ?) is norm dense in /^(?) and so linv(.?) is norm dense in ?*(?^(?)). It follows that ?(?) is non-separable. ? Stefansson A992) noticed that one can add an additional equivalent condition to Theorem 6.8: / is determined by a separable space. Combining Theorem 5.2 with Theorem 6.8 we get the following characterization of Pettis integrability: THEOREM 6.9 (Geitz, Musiat). Let f: ? ^ ? be a function. Then, f e ?(?, ?) and ?/(?) is a separable set if and only if there exists a sequence (/„) of X-valued simple functions, such that: (j) The family {x*f,: ? e N, x* e B(X*)} is uniformly integrable, (jj) For each x* e X* we have lim„ x* f„ = x* f ?-a.e. If f is scalarly bounded, then (/„) can be taken to be bounded (i.e., sup,, ||/,?(?)|| ^ ? ?-a.e.). The first result of the above type was obtained by Geitz A981) who proved it in case of a perfect measure ?. It was then generalized by Musial A985). Plebanek A993) introduced the following notion: a weak Baire measure ? on X (i.e., a measure on the ?-algebra of Baire sets in the weak topology of a Banach space X) is scalarly concentrated on a subspace ? of X if x*\Y = 0 implies x* = 0 ?-a.e., for all functionals x* e X*. Then he proved the following characterization of Pettis integrable functions with separable range of their integrals: PROPOSITION 6.10. /// : ? —>· X is scalarly bounded and scalarly measurable then f e ?(?, X) and ? [(?) is separable if and only if the measure /(?) is scalarly concentrated on a separable subspace of X. In the particular case of spaces of continuous functions on compact spaces a few further results are known. Rosenthal A970, Theorem 4.5) proved that if (and only if) a compact space К satisfies the countable chain condition (CCC), then all weakly compact subsets of C(K) are separable. Hence all C(?)-valued Pettis integrals have then norm separable ranges. In particular it is so in case of К carrying a strictly positive Radon measure. Another type of a sufficient condition was formulated by Plebanek A993). PROPOSITION 6.11. If every sequentially continuous function f : К —> R is continuous, then every Pettis integral of а С (К)-valued function has a separable range.
Pettis integral 557 EXAMPLE 6.12 (Fremlin and Talagrand A979)). Let / be an arbitrary nonvoid set and let ? be the product measure on ? ? := {О, 1}' of the measure 1/25[0| + l/25[ ц. If ? is the completion of the product ?-algebra, then let ? be the ?-algebra on ? ? defined in the following way: ? e ? if and only if there exists a free non-measurable filter W on I and a set F e ? such that EDW = FCiW.lt can be proved then that there is a unique extension of ? to a complete measure Jl on ? (cf. Talagrand A984, Theorem 13-2-1)). Let ? be the canonical injection of ? ? into loc(I)- Then, ? is scalarly /Z-integrable but it is not /Z-Pettis integrable. The detailed proof of these facts can be found in Fremlin and Talagrand A979) or in Talagrand A984, Chapter 13). EXAMPLE 6.13 (Fremlin and Talagrand A979)). Let ? be the measure described in Example 6.12 and let к = Jl <§> ? be its complete product on ? ? ? ??. If / :?/ ? ?/ —> /??(/) is defined by f(x,y) = (p(x)-(p(y) then / e P(/c,/oc(/)), but vt (? ® ?) is not relatively compact if / is infinite, and non- separable if / is uncountable. 7. Universal integrability Definition 7.1. Let ? ? 0 be a compact space. A function h:K -> R is said to be universally measurable if it is measurable with respect to the completion of each Radon measure defined on K. f.K —> X is ?-universally scalarly measurable if**/ is universally measurable for every x* e Г с Х*- THEOREM 7.2 (Riddle, Saab and Uhl A983)). Let ? be a Radon measure on a compact space К and let f : К —> X* be a scalarly bounded and scalarly universally measurable function. If X is weakly compactly generated and f takes its values in a weak*-separable subspace ofX*, then f e ?(?, ?*). PROOF. Assume first the separability of X and let ? > 0 be arbitrary. Since X is separable, there exists a compact set L с К such that ?(? \L) < ? and (/, x) is continuous on L for each ? e X. Let A=[(f,x)\L: II*K1} and Mr (L) be the set of all real-valued universally measurable functions on L equipped with the pointwise convergence topology. As / is universally measurable, the set A is relatively compact in Mr(L). According to Theorem 2F of Bourgain, Fremlin and Talagrand A978), every sequence in A has a pointwise convergent subsequence and so, it is weakly precompact in C(L). A direct application of Rosenthal's theorem (Rosenthal A974)) says that A contains no copy of the standard unit vector basis of /|. Since,
558 К. Musial the canonical embedding of C(L) into ?,^?^.?) is a contraction, the set {(/,x)xl'- ||jr|| ^ 1} contains no copy of the /1-basis in the L^(^^)-norm either. Thus, it is weakly precompact and Lemma 4.20 completes the proof. The non-separable case is a consequence of separable complementability of X. ? Assuming the continuum hypothesis, Plebanek A998) proved that the weak* separability of the range cannot be omitted. He proved namely that if ? is the Haar measure on 2W>, then L\ (?) is a non-separable WCG space and there is a bounded function /:[0, 1] -> Loc(?) such that (x*, f) is Borel measurable for every x* e L~c(?)* and /^?(?,^(?)). Applying some consequences of Proposition 4.20 Bator A988a) and Stefansson A995) have got the following decomposition properties of universally measurable functions: PROPOSITION 7.3. Let К ^ 0 be a compact space and let f : К -> X* be bounded and universally scalarly measurable. Then: (a) (Bator) for each Radon measure ? on К there exist functions g and h such that g has the Bourgain property for ?, h is weak* scalarly ?-null and f = g + h; (b) (Stefansson) if X is a WCG space, then for each Radon measure ? on К there exist functions /| and fi such that f\ is universally Pettis integrable, /i is weak* scalarly equivalent to zero and f = f\ + fi- As an immediate consequence of Proposition 7.3 and Corollary 4.17 we get the following result: COROLLARY 7.4. Let ? be a Radon measure on a compact space К and let f .K —>¦ X* be bounded and universally scalarly measurable. Then for each lifting ? on Loc(?) the function po(f) is Pettis ?-integrable. The following result of E. Saag generalizes Corollary 4.19 of Haydon A976). PROPOSITION 7.5. Let fi ? К С X* be a convex weak*-compact set equipped with the weak* topology. If B(X) is weakly precompact with respect to K, then the identity function on К is universally Pettis integrable and the integrals take their values in K. PROOF. In view of Proposition 4.15 if ? is a Radon measure on K, then the family ? = B(X) has the Bourgain property. Hence Corollary 4.18 yields the Pettis integrability of the identity function. The Hahn-Banach theorem applied to the weak* topology of X* proves that the integral takes its values in К. ? 8. Pettis integral property Dehnition 8.1. X has the ?-Pettis integral property if each X-valued, scalarly bounded and scalarly ?-measurable function is ?-Pettis integrable. If such a property holds true for all complete measures, then X is said to have the Pettis Integral Property (PIP). If ?
Pettis integral 559 is the Lebesgue measure on the unit interval, then we say about the Lebesgue PIP. If X has the ?-??? with respect to all Radon measures, then it is said to have the universal PIP. A measure space (?, ?, ?) has PIP if for every Banach space X each bounded scalarly ?-measurable X-valued function is Pettis integrable. X has the universal PIP if for every compact К each bounded universally scalarly measurable function /: К —>¦ X is universally Pettis integrable. The first example of a Banach space without the Lebesgue PIP was given by Pettis A938). He constructed an example of a bounded scalarly Borel measurable /: [0, 1] -> /oc[0, 1] that is not ?-Pettis integrable. Then, Edgar A979), assuming (CH), published an example of C[0, ? ? ]-valued function, which is also not ?-Pettis integrable. No complete characterization of Banach spaces possessing PIP is known, but there are several partial answers. THEOREM 8.2 (Fremlin and Talagrand A979)). If Axiom К is true, then every Banach space has the Lebesgue PIP. If either Axiom К or Axiom L is true, then l-^ has the ?-??? for every perfect ?. As observed by Plebanek A993) the original proof of the first part of the above result gives in fact more. To formulate it let us assume that ? is non-atomic and denote by ???(?/"?) the minimal cardinality of a set ? с ? which is not in ???. Then, let cov(Aflt) be the minimal cardinality of a subfamily of ??? covering ?. Moreover, let ?? be the standard product measure on 2K. THEOREM 8.3. //поп(Л/1,) < co\(M\k )¦ then the measure ?? has PIP. The main positive result is an immediate consequence of Theorem 4.10. THEOREM 8.4 (Talagrand A984)). If X is Corson, then X has PIP. As a direct consequence of Theorem 2.8 we get PIP of measure compact spaces (Edgar A979)). Since each weakly countably determined space X (see Vasak A981)) (in particular every K-analytic or WCG) is Lindelof in the weak topology of X (Vasak A981)), and every Lindelof space is measure compact (see Varadarajan A961)), all such spaces have PIP (Edgar A979)). If card ? = ? ? then, /| (?) is measure compact, hence it has PIP. On the other hand, /|(K|) is not Corson (see Edgar A979), where several other important observations concerning PIP can be found). It turns out also that there is a strong connection between PIP and real-valued measurable cardinals. THEOREM 8.5 (Edgar A979)). l\ (?) has PIP if and only //card ? is not a real-valued measurable cardinal. THEOREM 8.6 (Andrews A985)). If the least real-measurable cardinal is not less than the continuum, then 1\(?) has UPIPfor all ?.
560 К. Musial Andrews A985) presented also a few other conditions guaranteeing UPIP of conjugate Banach spaces with metrizable weak* compact and separable subsets. Edgar A979) observed also that Mazur's property implies PIP, and since each X with angelic (X*,weak*) is Mazur, also such spaces possesses PIP. As noticed by Plebanek A993), if К is a first countable compact space, then C(K) is Mazur. Consequently, we get the following result. THEOREM 8.7 (Plebanek A993)). If К is a first countable compact space, then C{K) has PIP and every C(K)-valued Pettis integral has a separable range. In case of UPIP certainly the most important result has been obtained by Riddle, Saab and Uhl and it is simply a reformulation of the main part of Theorem 7.2. THEOREM 8.8 (Riddle, Saab and Uhl A983)). IfX is separable, then X* has UPIP. No general characterization of UPIP even for conjugate spaces is known. 9. Weak Radon-Nikodym property and related properties The following two results due, independently, to Dinculeanu A967) in case of Banach space valued measures of finite variation and Rybakov A968) in case of ?-finite variation are the starting points of all non-separable Radon-Nikodym type theorems. THEOREM 9.1. Let ? : ? —>¦ X* be a weak* measure. If\v\ is a ?-finite measure, such that ??(?) ^??(\?\), then there exists a weak* scalarly integrable function f : ? —> X* such that /(i2)Cconv*A,(i2) and (x, v(E)) = ? (?, ?)?? for each ? e X and each ? e ?. If ? dominates ? and ? is a lifting on ??(?) then f can be chosen to satisfy also the equality (x, f) = p({x, f)) for all ? e X. PROOF. Assume first that и satisfies the inequality |u|(?) ^ Мд(?Кога11 ? e ?. Denote for ? e X by fx the Radon-Nikodym derivative of the measure (x,v) with respect to ?. Clearly, \fK\^M ?-a.e., and so |p(/v)| ^ ? everywhere. Defining / : ? -+ X* by {x,f(w)) = p(f,)(co) for each ? e ? and ? e X we get the equality J (x, f)dp={ ?(/?)?? = J ? ?? = {?, ?(?)) for each ? e ? and each ? e X.
Pettis integral 561 Consider now the family ? of all finite partitions ? of ? such that if ? = {? \,..., E„} then p(Et) = Ei for all i <^n. We say that ? \ < ?? if each element of ? ? is a union of elements of 7?2. Then, let fc€7T Applying Lemma VI.8.3 of Dunford and Schwartz A958), we see that xf(w) = lim^/y xfn(co) for all ? e ? and all ? e X (cf. Kupka A972)). The general case follows by decomposing of ? into pairwise disjoint sequence of sets ?„ e ?, such that \v\(E ? ?„) ?. ??(? ? ?„), for all ? e ? and ? e N. D The property saying that for each X*-valued measure ? of ?-finite variation that is ?- continuous there is a weak* measurable / : ? —>¦ X* such that {x,v(E)) = ?{?,/)?? A) for each ? e ? and ? e X, may be called the ?-weak* Radon-Nikodym Property (?-W*RNP). Without loss of generality one may assume that и is ? dominated. So it is a consequence of Theorem 9.1 that each conjugate Banach space has the W*RNP (please notice that we use this name in a different meaning than it is used in Talagrand A984)). / will be called a weak* density (or a weak* Radon-Nikodym derivative) of ? with respect to ?. More generally, a set 0 ? К с X* has weak* RNP if for each (?, ?, ?) and each ?: ? -> X* satisfying v(E) e ?(?) ¦ К for every ? e ? there is /: ? -> К such that A) is satisfied. Then weak* compact convex subsets of X* have W*RNP. As a consequence of Theorem 9.1 we obtain the following result: THEOREM 9.2. Let ? : ? —> X foe ? ?-continuous measure of ?-finite variation. Then, there exists a weak* measurable f : ? —> X** such that f^)^coiw**AvW) and {?*,?(?))= ? [?*, ??? for each x* e X* and ? e ?. If ? dominates ? and ? is a lifting on ?.^(?) then f can be chosen to satisfy also the equality {x*. f) = p((x*, f)) for all x* e X*. It is interesting and useful to know that the W**RNP and the WRNP are determined by a single measure space. THEOREM 9.3 (MusialA982)if К = B(X)). Let К =convK фЪ be a bounded subset of X. If К has the X-W**RNP (respectively ?-WRNP), then it has also the W**RNP (respectively WRNP).
562 К. Musial PROOF. Let (?, ?, ?) be an arbitrary complete probability measure space and let и : ? —> X be a nonatomic measure satisfying for every ? e ? the relation v(E) e ?(?) ¦ K. (A) Assume first that ? is the completion of a countably generated ?-algebra ГСГ with respect to ?\?. Let (?„) с ? be a sequence generating ? and let ? : ? —>¦ [0, 1] be its Marczewski function: ?(?) = 2]?3-"??-„(?). It can be easily checked that ?-1 :? ? ?(?) —>¦ ?(?) is a Boolean ?-isomorphism of ??[0.i] П ?(?) onto Г. Let ?:? -> [0, 1] be the image of ? under ? and, let ? :[0, 1]^· [0, 1] be the function defined by ?([0,9(t)]) = ?([0, t]). If ? = ? ? ?, then for each ? e ?, we have ?[?~' (?)] = ?(?) and the measure algebras of ? and ? are isomorphic. Let now ? : ? —>· X be given by ?(?) = ?[?'(?)]. We have ||?(?)|| ^?(?) for each В e ?. Hence, by the assumption, there is / ??(?,?**) (respectively ?(?, ?)), such that ?(?) = Д- f ??. It follows that for each ? e ? -L v(E)=P- ? /????. Moreover, in view of Theorem 6.7, the set ?(?) is norm relatively compact. (B) Assume now that ? is arbitrary and notice that ?(?) is a norm relatively compact subset of X** (respectively X). Denote by ? the collection of all complete measure spaces (?, ?, ?\?) with 4СГ being the completion of a countably generated ?-algebra ? ? with respect to ?| ? ? and order ? upwards by inclusion. In view of (A), for each ? there is fA e ?(?\?, ?**) (respectively ?(?\?, ?)), such that L v(E)=P- J /??? for each ? e ?. We shall prove that the net (fA) is Cauchy in the norm of ?(?, ?**) (respectively P(m.X)). To prove it, fix ? > 0 and take a simple function hF ·. ? —>¦ X , such that sup ЕеГ v(E)- ? ???? <? (cf. Musial A980)).
Pettis integral 563 Now fix ? e ? such that he is /i-measurable. Then, for each ? ^ ? \?? ~he\ < 4 sup ??? It follows that for ? \, /}2 ^ /i the inequality |/j, - /j,| < 2? holds, and so the net is Cauchy, as required. But ? is countably directed, so there exists /}(> such that for each ? ^ Д> we have |/л-/л„|=о. It follows that each such fA is scalarly ?-equivalent to fAn and so for each ? e ?, we get the equality v(E)=P- j /??)?·?. This completes the proof. ? Definition 9.4. Given a directed set (Я, <), a family of ?-algebras ?? с Г, and functions /? e ?((?, ??,?\??); X) with ? e ?, the system {/т,Гт;л· е Я) is a martingale if ? ^ ? yields Гт С Г,, and ?(/?\??) = /?. The martingale is bounded if there is ? > 0 such, that for each x* e X* and each ? e Я the inequality | (**, /T)| < Af ||.x*|| holds ?-a.e. The martingale is convergent in ?(?, X) if there is / еР(д, X) such that liin^ \f„ - /| = 0. The collection of all finite ^-partitions of ? into sets of positive measure is denoted by Яг- We order it in the following way: ?\ < ?? if each element of ? ? is, except for a null set, a union of element of ?2 - The following theorem is a martingale characterization of the WRNP and the W**RNP. THEOREM 9.5 (Musial A980) for К = S(X)). For a bounded set К = conv К ^0С X the following conditions are equivalent: (i) К has the WRNP (respectively W**RNP), (ii) Given any (?, ?, ?) and any bounded martingale {/„, ?„; ? e ?) of К-valued Pettis ?-integrable (simple) functions, then {/„,.?„; ? e N) is convergent in ?(?,?) (respectively ?(?, ?**)). PROOF. Assume (i) is satisfied and take a bounded martingale {/„,.?„; ? e N) in ?(?,?). Assume, that ? > 0 is such that |(.v*,/„)| ^ M||.v*|| ?-a.e. (the exceptional sets depend on**). Let ?? = U^L ? ?» and 'et ? ·" Д) —>¦ -^ be given by u(?) = lim / /,,??. " Je v(E) -I. hf ?? < ?. for each ? e ??.
564 A". Musiat We have ||?(?)|| < ? ?(?) and ? (?) e ?(?) ¦ ? for all ? e ?. The set function ? extends uniquely to a measure v\ : ? = ?(?^) —>¦ X satisfying the similar conditions for all ? e ?. Setting for each ? e ? v(E) = J E{XE\S)dv] (where the integral is in the sense of Bartle, Dunford and Schwartz A955)) we get an extension of t>| to the whole ? satisfying for all ? e ? the relations \\v(E)\\ ^ ??(?) and u(E) e ?(?) - ??. Since К has the WRNP (or W**RNP), we get / e ?(?, ?) (respectively ?(?, ?**)) being the density of ? with respect to ?. Since К has the W**RNP, it has the CRP (by Theorem 6.7). Thus, in a similar way, as it has been done in the proof of Theorem 9.3, one can show that {(/„, ?„); ? e ?) is a Cauchy martingale in ?(?, ?**). Since (?\?„)(?) = fEf„dp for each /; e N, we have for each x* e X* (x*. f„) = E((x*,f)\E„) and this gives limi \E(x*f\E)-x*f„\dii = Q. Together with the Cauchy condition, this yields lim|/„-/| =0. Assume now that (ii) is satisfied and take a measure ? : ? —> X satisfying for each ? e ? the relations |?(?)|^?(?) and u(E) e ?(?) ¦ AT. Define for each ? e ? ? the function /T by f - ? OIEI/F Jn — /^ c XL· and let E„ =?(?). {(/?, ??); ? e ? ?) is a bounded martingale in P( ?, ?)???\ ^ ?? < • - then, by the assumption, {(/T„, 2TT„); // e N) is convergent in ?(?, ?) (respectively ?(?,?**)). Let / : ? —> X** be a weak* density of ? with respect to ?: foreach;c*eX*, ? eE.
Pettis integral 565 One can easily see that there exists in ? ? a sequence ?\ < ?? < ¦ ¦ ¦ such that limsupl jfj(**, /- /*,)\??: ?* e B(X*)\ = 0 and so, in particular ]???[(?*,/?„)<?? = (?*,?(?)) " J ? for each x* e X* and ? e ?. Hence, if g = lim„ /^ e ?(?, ?) (respectively ?(?, ?**)), then for each ? e ?, we get the required equality u(E)= / #??. D PROPOSITION 9.6 (C.Ryll-Nardzewski, see Musi at A979)). lx does not have the WRNP. PROOF. Let ?„ be the dyadic partition of [0, 1 ] into 2" intervals and, let ?„ be the collection of all possible unions of elements taken from ?„. If (A„) is an enumeration of Ui^li ^n, then clearly lim„ ?(?„) = 0. Define a measure ?: С -+ со с /эс by setting ?(?) = (?(???„))?:. Then, f(?) is a norm relatively compact subset of со, ||v(?)|| ^ ?(?) for each ? e ?and ? is without Pettis ?-integrable derivative in ??. Indeed, let /: [0, 1] -> lx = I* be a weak* density of и with respect to ?. It means in particular that if (en) is the standard basis in /?, then к(ЕПА„) = {е„,у(Е))= ? (?„,/)?? for each ? e N. But the sequence (хд„) is pointwise dense in {0, 1}1°". Thus, if хд is a ???-?- measurable cluster point of (хд„), then хд is ?-a.e. equal to a pointwise cluster point of ((e„, /)). Such a point is of the form (x*,f) for a functional x* e lx. This means that / is not scalarly measurable and hence it cannot be a Pettis integrable density of ? with respect to ?. ? The next theorem has been first proved by Musiai A976) (see also A979)) for separably complementable X. Then, in full generality the necessity has been proved by Musiai and Ryll-Nardzewski A978) and the sufficiency by Janicka A979), Musiai A979) and J. Bourgain. THEOREM 9.7. X* has the weak Radon-Nikodym property if and only if X contains no isomorphic copy ofl\.
566 A". Musiat Assume that X contains no copy of l\. If ?: ? —>· X is a ?-dominated measure and ? is a lifting ?? Loc (?), then by the weak* Radon-Nikodym property of X*, there exists a weak* density /:?—>- X* of и with respect to ? such that p((*,/)) = (¦*,/) for each .veX. Since X contains no isomorphic copy of/|, it follows from Corollaries 4.17 and 4.18 that [x**ME)) = f[x*\f)dvi by the Lebesgue Convergence Theorem, and this proves the WRNP of X*. Assume now that X contains a subspace that is isomorphic to l\ and let ? :l\ —>¦ X be an isomorphic embedding. Then T* : X* —>· /^ is a surjection. If ? :C —>¦ lx is the measure constructed in the proof of Proposition 9.6, then according to Musial and Ryll-Nardzewski A978), there is a ?-dominated measure к : С -> X* such that ?*? = v. Since ? is not Pettis differentiable, also к cannot have a Pettis integrable density. At this place it is also worth to recall a characterization of WRNP of X* in terms of functions. The result is a direct consequence of the W*RNP of conjugate spaces. COROLLARY 9.8 (Musial A979)). X* has WRNP if and only if for every complete (?, ?, ?) and each weak*-bounded and weak*-scalarly measurable f: ? —>¦ X* there exists g e ?(?, ?*) and a weak*-scalarly null h-.?^-?* with f = g + h. More or less at the same time Rybakov A977) presented another characterization of Banach spaces not containing l\ (see also Musial A979)). THEOREM 9.9. X does not contain any isomorphic copy ofl\ if and only if X* has CRP. The next result is a generalization of Theorems 9.7 and 9.9. It was also an essential step towards localizing of the weak RNR THEOREM 9.10 (Riddle, Saab and Uhl A983)). For a given operator ?: X -> ? the following conditions are equivalent: (i) The set T(B(X)) is weakly precompact; (ii) ? factors through a Banach space containing no isomorphic copy ofl\; (iii) The set T*(B(Y*)) has WRNP; (iv) The set T*(B(Y*)) has CRP; (?) ? * factors through a Banach space with the WRNP. The most general description of sets possessing WRNP will be given in Theorem 9.15. In order to present an idea of its proof we need however yet a few additional facts. LEMMA 9.11 (Rosenthal A974)). Let (x„) be a pointwise bounded sequence of real-valued functions defined on a set S and having no pointwise convergent subsequence. Then there
Pettis integral 567 exists a subsequence (x„k) of (x„), a real number r and ? > 0 such that for every infinite subset Mof{nk: k^ 1}, there is a point s e S with Xm (s) > r + ? for infinitely many m e ? and x,„ (s) < r for infinitely many in e ?. THEOREM 9.12 (Rosenthal A978)). Let Q be an uncountable Polish space and let (A„, S„),i> ? be a sequence of pairs of closed and disjoint subsets of Z. Assume that the sequence (A„, B„ )„ ^ ? has no convergent subsequence. Then, there exists a compact subset L of Q, a homeomorphism h from L onto the Cantor set ? = {0, 1}'", and an increasing sequence (nk) such that A„KC\L = h~\vk) and B,„ ? L = /?"' (V[) for all к (here Vk = {t = (t„) e ?: tk = 0}). The next lemma is taken from Matsuda A985). LEMMA 9.13. Let X be separable and let Q ? 0 be a weak*-compact convex subset of X*. If B(X) is not Q-weakly precompact, then there is a Radon measure on (Q, weak*) and a measure ? : ? —>¦ X* such that (a) v(E) e ?.(?) ¦ Q for each ? e ?; (b) ?(?) is not relatively compact in the norm topology. PROOF (Sketch). Let (x„) be a sequence in B(X) without subsequence pointwise convergent on Q. Without loss of generality we may assume that (x„), r and ? > 0 satisfy the conclusion of Lemma 9.11. Setting A„ := (x* e Q: (x*,x„) ^ r +?] and B„ := [x* e Q: (jc*, jc„) < r] we get a sequence (A„, S„)„^i that has no convergent subsequences. In virtue of Theorem 9.12 there exists a compact subset L of Q, a homeomorphism h from L onto the Cantor set ? = {0, 1 }?, and an increasing sequence (nk) such that A„,nL = A"l(l/i) and ?„1??, = ??(??') for all к (here Vk = [t = (t„) e ?: tk = 0}). Let ? be the normalized Haar measure on ? and let ? be the Radon measure on L such that h(rj) = ?. ? ? is the extension of ? to the whole Q.Then ?(?„?) = ?^](?,)) = ?(??<)=\/2 = ?(??4) and ?(?„????4) = ?(?,??<) = 1/4
568 A". Munal whenever ?'? j. Let ? be the weak*-integral of the identity function on Q with respect to ?. ? is a vector measure on the ? -algebra of weak*-Borel subsets of Q. The conclusion (a) of the lemma is an easy consequence of the separation theorem. To see that (b) is also fulfilled notice that if /' < / then |u(A„,)- u(AHi)| >(u(A„,)-u(A„,),.r„,} > (r + ?)?(?„, П ВП]) - ??(?„? П ?,,,) = ?/4. ? Definition 9.14 (Talagrand A984)). A weak* compact subset ? of X* is a Pettis set if the identity function is universally scalarly measurable (with respect to Radon measures on (K, weak*)). The next theorem summarizes most of the results presented earlier in this paper and concerning WRNR THEOREM 9.15. Let К фЧ)Ьеа weak*-compact convex subset ofX*. Then the following conditions are equivalent: (i) B(X) is weakly precompact with respect to К; (ii) the identity function is universally Pettis integrable on K; (iii) К is a Pettis set; (iv) К has WRNP; (?) ? has ?-WRNP; (vi) К has CRP, (vii) К has ?-CRP PROOF (Sketch), (i) =>· (ii) follows from Proposition 7.5. (iii) =>¦ (i) is a consequence of Bourgain, Fremlin and Talagrand A978). (i) =>· (iv) follows from Corollaries 4.17 and 4.18. (iv) =>· (vi) follows from Theorem 6.7 and (vii) =>· (i) from Lemma 9.13. ? Applying the result of Talagrand A984) which says that if К с X* is a Pettis set then its weak* closure has WRNP, one gets further generalizations of Theorem 9.15. 10. Conditional expectation Let ? С ? be a ?-algebra. If / e ?(?, ?), then a function ?(/\?):? -> X is a conditional expectation of / with respect to ? if E(f\E) e ?(?, ?, ?) and J ?{/\?)?? = ?' ??? B) for all ? e ?. The first example of a Pettis integrable function without conditional expectation was published by Rybakov A971). It was an /4-valued function. Heinich A973) published then an example of /1-valued Pettis integrable function on [0, 1]-, which does not admit
Pettis integral 569 the conditional expectation with respect to a sub-? -algebra of the ? -algebra of Lebesgue measurable sets. These examples can be classed in the following pattern: If / e ?(?, ?) has infinite variation \Vf\ and ?-algebra ? С ? is such that |u|<a| is not ?-finite then the existence of E(f\S) would contradict Theorem 3.8. Since for each non-atomic ? and infinite dimensional X there is / e ?(?, X) with infinite \vf\, there are a lot of such examples. The following global result is an obvious consequence of the above considerations: PROPOSITION 10.1 (Musial, A976) for K = B(X)). Let К С X be a closed and convex set with WRNP. Iff e ?(?, ?) and ? is ? ?\?-complete sub-?-algebra ???, then f has the conditional expectation with respect to ? if and only if the measure ?/ \? is of ?-finite variation. First examples of scalarly bounded Pettis integrable functions without conditional expectations were presented by Talagrand A984, 6-4). One of them is / e F(k,Ix(I)) defined in Example 6.13 (see also Musiat A985)). Indeed, if ? is the completion of the Borel algebra of ?/ ? ?/ with respect to ? <g> ?, then ? is к -dense in ? ®?. If there existed a Pettis integrable E(f\S), then the equality B) would be true for all к -measurable sets. This is however impossible in case of infinite /, since according to Stegall's result (Theorem 6.1) the range of vFAj^) is norm relatively compact. Talagrand A984) obtained the following interesting result covering the particular case of the conditional expectations: THEOREM 10.2. (Axiom L) Let (?,?,?) be a complete probability space and let ? :?,|(?) —>¦ L\(v) be a bounded operator. Then, let f: ? —>¦ X* be a weak*-scalarly bounded function such that each countable subset of {xf: \\x\\ < 1} is stable. Then there exists a properly measurable function g :(9 —>¦ X* such that T(xf) = xg v-a.e. for all ? e X. /// e ?(?, ?*), then g = Tf e P(u, X*). Riddle and Saab A985) proved another sufficient condition guaranteeing the existence of conditional expectations. THEOREM 10.3. // / ??(?, X*) is scalarly bounded and the set {xf: \\x\\ < 1} is weakly precompact in ?,^??), then f has conditional expectation with respect to all sub-?- algebras of ?. PROOF. Assume that / e ?(?, X*) satisfies the above assumptions, let ? be a sub-?- algebra of ? and let ? bea lifting on L~x_ (?). Define g-.? -> X* by setting xg = p(Es(xf)), for each ? e X. Since {.xf: \\x\\ ^ 1} is weakly precompact in /.^(?) and the conditional expectation operator ?? :?^(?) —>¦ /^(??-?) is a contraction, the set {xg: ||jr || $5 1} is weakly precompact in L^c (?\?). In view of Corollaries 4.17 and 4.18 we have g e ?(?\?, X*). Now, since the equality fExgd? = jE xf ?? for all ? e X and the functions / and g are Pettis integrable, we get the same equality for all ? e X**. D
570 К. Musial 11. Differentiation Already Pettis A938) noticed that if / e ?(?, ?) then for each x* e X* there exists a set A(x*) e С of measure one, such that the equality h~* ?1+\?\??))??=?*?(?) lim /i — 0 holds true for all t e A(x*). Pettis asked then whether in case of a strongly measurable / e ?(?, ?) the sets A(x*) could be replaced by a single set of full measure, i.e., whether the Pettis integral of / is a.e. weakly differentiable. The answer is negative in case X = h (Phillips A940)) and X = C[0, 1] (Munroe A946a)). Thomas A976) conjectured that such a counterexample could be constructed in every infinite dimensional X. In A994) Kadets proved that for every infinite dimensional Banach space X there is a strongly measurable and Pettis integrable function /:[0, 1] —>¦ X for which g(t) := /o f(s)ds is non-differentiable on a set of positive measure. Then Dilworth and Girardi A995) generalized the above result proving that always there exist functions that have nowhere weakly differentiable Pettis integrals. To formulate the main result more precisely let ? be the collection of all increasing functions ? : [0, oo) —> [0, oo) satisfying the growth condition DC ]??B"/'"-,)?/2^<^· C) n=\ for some increasing sequence (p„);^l() of integers and such that ?@) = 0. THEOREM 11.1 (Dilworth and Girardi A995)). For each X and ? e ? there exists a strongly measurable f e ?(?, ?) such that II/fdk WJi ?(?(?)) for every non-degenerated interval I с [0, 1 ]. If moreover ОС n=\ thenf?L\(k,X), PROOF (Sketch). Let {/J': ? = 0, 1,2,...; к = 1,2, ...,2") be the dyadic intervals on [0,1], i.e., 4 'k-\ к 2" ' 2"
Pettis integral 571 Define inductively a collection {A'].: /7 = 0, 1,2 ; к = 1,2 2") of pairwise disjoint sets of positive measure such that А" с /". Fix К > 1. By a theorem of Mazur there is a basic sequence {.v„} in X with basis constant at most К. Take a blocking {F„} of {x„} with each subspace F„ of large enough dimension to find (using the finite-dimensional version of Dvoretzky's Theorem (Dvoretzky, 1961)) a 2"-dimensional subspace E„ of F„ such that the Banach-Mazur distance between E„ and /|" is less than 2. Let T„ :l;" —>· E„ be for each ? an operator such that ||7"„|| ^ 2 and ||?„"?| = 1. If {u'j.: к = 1,2 2") are the standard unit vectors in /;", then let <!:=Г„<:. Let (p„) be an increasing sequence of integers, with po = 0 satisfying DC ??D- 2~p"~t)-j2^ < oo. »=? Define/:[0, !]->¦ X by /(?) = 2*]?>D.2-''»-0]??<,,??") -',f. »=l jt=l One can easily check that for each ? e С ос 21'" f /^ = 2^^?D-2-/'-)??«")"' / XA["dkepk\ J E - ,? J ? Since (E„) form a finite-dimensional decomposition there is a projection of ?„ ?„ onto EPn of norm not exceeding 2K. Now, if / is an arbitrary interval in [0, 1] one can find first /"' С / such that 4?(/"') ^ ?(/) and then ? satisfying p„-\ ^m < pn. It follows that \l fdk > VD-2~/'""') _A = I |J/ \P- and so since A^" с lj!" С /'" С / we have I/ /?? > VD-2-',"~'). But ? is increasing and 4 ¦ 2 /,n~' ^4-2 '" > ?(/) and so II/ fdk\ ?{?(?)).
572 A". Musiat If ?~ ? tfrB-"«-'J"" = oo, then ??=? *<4 ¦ 2-"-')?*=? ?'?? = oo and so / ? L ? (?, X) (according to Diestel and Uhl A977, p. 55)). D As a corollary we get an answer to the original Pettis's question. COROLLARY 11.2 (Dilworth and Girardi A995)). There exists a strongly measurable f e ?(?, ?) that has nowhere weakly differentiable integral. PROOF. Taking in Theorem 11.1 ?(?) = f^4 we get a function / satisfying for each t e [0, 1 ] the equality lim ?—? /' 1 /"+" - / ?(?)?? ¦ oo. If the Pettis integral of / were weakly differentiable at some t, then the above limit would be finite. D As noticed in Dilworth and Girardi A995)) it follows from Theorem 11.1 the existence of a Pettis integrable function such that its integral is of infinite variation on every subinterval of [0, 1]. This generalizes earlier result of Janicka and Kalton A977) where the Banach space valued measure possessing that property could not be represented as a Pettis integral (A good reference to the circle of problems concerning vector measures of infinite variation on all sets of positive measure is the work of Drewnowski and Lipecki A995)). With some further effort one can get other results connecting the differentiability problem with absolutely summing operators and cotype of X (see Dilworth and Girardi A995)). 12. Fubini theorem The classical Fubini theorem holds true not only for real functions but also for Bochner integrable functions on an arbitrary measure space. Thomas A976) observed that for every infinite dimensional Banach space X does exist a strongly measurable and ?--Pettis integrable function /: [0, l]2 —>¦ X with the property that for every jte[0,l] the function у —>¦ f(x,y) is not ?-Pettis integrable. Recently Michalak B000) attempted to resolve the problem in case of arbitrary bounded Pettis integrable functions. THEOREM 12.1 (Michalak B000)). Let (?, ?, ?) and (<9, ?,?) be complete probability spaces and let X be a WCG space not containing any isomorphic copy of l\. If f e ?(?<§?, X*) is such that the set {\xf\: ? e B(X)} is order bounded in ?,|(?®?), then there exists a function j:flx0->X* which is scalarly equivalent to f and (i) the function g(-,0) ??(?, X*) for v-a.e. ? ев; (ii) the function g(w, ¦) eP(u, X*)for?-a.e.? e ?;
Pettis integral 573 (iii) fA ?? %?? ® ? = fA (fB g(co, ?)??(?)) ??(?) = jB(jA g(co, ?) ??(?)) dv(9) for all Ae ? and В еТ. The above theorem cannot however be extended to all Banach spaces. If ??(?, ?) is the space of all functions f :? —>¦ X such that ??€? ??/(?)||2 < oo, then one gets the following result: THEOREM 12.2 (Michalak B000)). Let (?, ?, ?) and (<9, ?, v) be complete probability spaces and let X be a Banach space. Assume that ? vanishes on points and there is a nonmeasurable subset of ?. Then for every bounded Pettis integrable function f:? ? ? -> 12(?, X) there exists a bounded function g-.? ? ? -> ??(?, ?) which is scalarly equivalent to f and ?({? e ?: g(a>, ¦) is not scalarly ?-measurable}) ^ 1/2. 13. Spaces of Pettis integrable functions 13.1. Space of all Pettis integrable functions Pettis A938) noticed that ?(?, ?-?(?)) is non-complete. Rybakov A970) proved that ?(?, со) is non-complete. Then Thomas A976) proved that ?(?, ?) is non-complete in case of an arbitrary not purely atomic ? and infinite dimensional X. Janicka and Kalton A977) proved the same in case of the Lebesgue measure on [0, 1]. Modifying the example of Pettis, Drewnowski, Florencio and Paul A992) showed that ?(?, ?-2(?)) does not have property (K). Then, Drewnowski and Lipecki A995) proved that ?(?, ?) is never ultrabarrelled if ? is non-atomic, and so it is neither Baire nor has property (K). However, the following holds true: THEOREM 13.1 (Drewnowski, Florencio and Paul A992)), ?(?, ?) is always barrelled. PROOF (Sketch). Let us say that a locally convex space ? admits an (?, ?, ?)-???1?3? algebra of projections (see Drewnowski et al. A992) for details) if there exists a set {Рд: A e ?] of linear projections in ? such that: A) ?? is the identity on Z, PAnB = Pa ¦ Рв for all ?,???, and РАив = Рл + Рв for all disjoint А, В е ?; B) Pa is continuous for every A e ?; C) for every ? e Z, the vector measure Fx: ? —>¦ ? defined by F,(A) := Pa(x) is ?-continuous (that is Рд(.х) = 0 if ?(?) = 0). Now we need the following PROPOSITION. Let ? be a metrizable locally convex space admitting an (?,?,?)- Boolean algebra of projections {Рл: A e ?]. Assume also that the projections satisfy the following condition:
574 A". Musial D) If(A„) is a sequence of pairwise disjoint elements of ? and (x„) is a null sequence in ? such that Pa„(x„) =xn. ? = 1,2 then there exists a sequence (и*·) in N such that the series ^ д-,ч is convergent. IfW is a closed subspace of ? such that Pa(W) с W for all A e ? and (?, ?, ?) is atomless, then W is barrelled. Now we are ready to present the proof. Assume first that (?, ?, ?) is atomless. For A & ? and / e ?(?, ?), define P,\(f) := хд · /. One can easily check that the conditions (l)-C) are fulfilled. We have to check yet the condition D) of the just formulated proposition. So let {A,,} be a pairwise disjoint sequence in ? and let (/„) be a null sequence in ?(?, X). Without loss of generality, we may assume that ? := ?„ |/»| < °°· Let /(?) := ]ГН /н (?) be defined for all we ?. Then, for every x* e X* we have f \{?*,??))\??= ? ?\(?*,/,??))\????\\?*\\ J ? Jn „ and so / is scalarly integrable. Since for every A e ?, we have ]T„ || fA f, ??\\ < oo, the series ]TH fA f, ?? is convergent to an element v(A) e X. One easily see now that (?*,?(?))=?? [?*,?)??=?[?*.?)??. „ Jada,, Ja This completes the proof in case of an atomless ?. If ? is atomic, then ?(?, ?) is a Banach space. In the general case one decomposes the measure space into atomless and atomic parts obtaining ?(?, ?) as a direct sum of a barrelled space and a Banach space, which is again barrelled. ? The above result has been then generalized by Diaz et al. A995) to the following form: THEOREM 13.2. ?(?, X) is always ultrabornological. PROOF (Sketch). The proof is based on the following fact (see Diaz et al. A995) for details). PROPOSITION. Let [PA: A e ?] be an (?, ?,?)-??????? algebra of projections in a metrizable locally convex space Z. Assume that (Рл: А е ?} is an equicontinuous family of operators and satisfies the following condition: E) // (?„) is a decreasing sequence in ? with ?(?\?„) = 0, (л„) is a bounded sequence in ? such that ??„(.?„) = x„ for all n, and (a„) e /|, then the series ^„а,д„ is convergent in Z. If Pa(Z) is ultrabornological for each ?-atom A, then ? is also ultrabornological. The rest of the proof is similar to the previous one. Рд(/) := Хд · / with / e ?(?, ?) and A e ?. This is an equicontinuous family of projections forming an (?. ?, ?)-???1?3?
Pettis integral 575 algebra of projections in ?(?, ?). One can easily check the validity of the condition E). If A is an atom then ??(?(?, ?)) is isomorphic to X. Hence the assumptions of the proposition are satisfied and so ?(?, ?) is ultrabornological. ? There was also an attempt to introduce a complete metric on ?(?, X) (necessarily not equivalent to the original one). Heilio A988) defined such a topology composing the original semivariation norm with the convergence in measure. Setting for an arbitrary X- valued function / \/\?:=?{?: ?*{???: |/(?)| ? a } ? a}. Heilio set then for each scalarly integrable / 111/111··= |/| +l/l„ and proved that ?(?, ?, ||| · |||) is complete. One of the consequences of this fact is another proof of the incompleteness of ?(?, ?). There is a wide class of problems concerning the possibility of embedding of a space ? into ?(?, X) provided ? is an isomorphic subspace of X. Diestel proved in 1988 (unpublished) that if the range of ? is infinite and X contains an isomorphic copy of со, then the completion of ?, (?, ?) contains a complemented copy of со. Emmanuele A992) generalized it to ?(?, X). Diaz et al. A993) proved that if X contains an isomorphic copy of со and ? is non-atomic and perfect, then ?(?, ?) contains a complemented copy of cq. This has been then generalized by Freniche. THEOREM 13.3 (Freniche A998)). If the range of ? is infinite, then the following are equivalent: (i) X contains a copy o/co; (ii) ?(?, ?) contains a copy of cq; (iii) ?(?, ?) contains a complemented copy of cq. The next result concerns ?(?, X) as a subspace of the space ca(C, ?. ?) of all X-valued ?-continuous measures equipped with the semivariation norm. THEOREM 13.4 (Drewnowski and Lipecki A995)). If X is separable then ?(?, ?) is an T„a but not T„ subset ofca(C,k, X). 13.2. Functions satisfying the strong law of large numbers Hoffmann-J0rgensen A985) and Talagrand A987) introduced the space ???(?, ?) of X- valued functions satisfying the law of large numbers. X: 3a, eX lim 1 af - -V/??,) ? L—' i = l :0 fo^^-a.e. (?;) &?~ where ?°° is the countable direct product ??????^- the countable product of ?.
576 A". Musiat Talagrand A987) defined the Glivenko-Cantelli seminorm on LLN(?, ?) setting for an arbitrary function / : ? —>¦ X ||/1|cc =Hm sup / ?,,??*, where ??)= sup -]?|.?*(/(?,))|- [|л*|[<1 « /<л It has been proved by Beck A963) that Bochner integrable functions satisfy the strong law of large numbers. If X is separable and a function /: ? —>¦ X satisfies the strong law of large numbers, then / e L \ (?. ?), i.e., / is Bochner integrable (Hoffmann-J0rgensen A985)). Talagrand A987) described completely the class of functions satisfying the law of large numbers. His proof is however too long to be presented here. THEOREM 13.5 (Talagrand A987)). For a function f-.?^? andw = (?„) e ?? set Sn(w) = (\/n) 53"_| /(?,). Then the following conditions are equivalent: (a) / satisfies the strong law of large numbers; (b) / is properly measurable and f? ||/|| ?? < oo; (c) For almost allwe ?^, the sequence (\/n)S„(co) converges in norm; (d) For each ? > 0 there is a simple function g : ? —>¦ X such that \\f — g|| gc ^ ?· It easily follows from the above theorem that the Glivenko-Cantelli norm coincides on ???(?, ?) with the ordinary | -1 norm and each function satisfying the law of large numbers is Pettis integrable. Functions in ???(?, ?) that are scalarly equivalent are not distinguishable by the GC-norm. This permits us to identify scalarly equivalent elements of LШ(?, X) and investigate the quotient space (denoted also by LLN (?. ?)). In the context of LLN(?, X) Talagrand A987) successfully applied the concept of stable sets to description of the so called Glivenko-Cantelli classes of functions. Dobric A990) posed a question about completeness of LLN(?, X). It turned out that the space is almost never complete and in general it is even not barrelled. THEOREM 1 3.6 (Musial B001a)). Assume that ? is not purely atomic. Then the space LLN (?, X) is non-complete. If moreover LLN (?. ?, var) is complete, then LLN(^, X) is even not barrelled. PROOF. ??·(?, ?) endowed with the Pettis norm is non-complete (see Thomas A976)). Let (/„) be a Cauchy sequence in ?((?, X) that is not convergent in ?,(?, ?). It follows from Musial A979, Proposition 3) that for each ? e N there exists a simple function /2„·.? —>· X with |/„ - h„\ < l/n. Since simple functions are properly measurable and LLN(^, X) с P( (?, X), the sequence (hn) is also Cauchy in LLN(?, X).
Pettis integral 577 Clearly, the sequence (h„) is divergent in ???(?, ?). This completes the proof of the noncompleteness. If ???(?, ?) were barrelled then applying the closed graph theorem to the identity map from ???(?, ?) to LLN(^, X, var) one would get its continuity. Hence the two norms would be equivalent. This is however impossible if X is infinite dimensional and ? is not purely atomic. ? If each X-valued Pettis integrable function is weakly equivalent to a strongly measurable function (it is in case of X possessing RNP or in case of a measure compact X), then ???^?, X, var) = L ? (?, ?) and so ???(?, ?) is not barrelled. 13.3. Functions with integrals of bounded variation We denote by PV(m, X) the set of those / e ?(?, ?) for which \vf\ is finite. ??(?, ?) can be considered as a subspace of саЫ(ц., X), endowed with the variation norm. We define a ||-1| у norm on ??(?, ?) by setting ||/||?:=|?>/|(?). Clearly |/| <: \\f\\v for each / e ?(?, X). Exactly as in the proof of Theorem 13.1 one can obtain THEOREM 13.7 (Musial B001a)). PV(M, X) is barrelled. PROPOSITION 13.8 (Musial B001a)). // X has the weak Radon-Nikodym property, then PV(M, X) is a Banach space. In particular if ? is a Banach space not containing any isomorphic copy ofl\, then ??(?, ?*) is complete. It is not known whether the space ??(?, ?*) is always complete. Assuming the validity of Axiom L, one can prove however the completeness of ? V (?. ?*) for an arbitrary perfect measure ?. THEOREM 13.9 (Musial B001a)). (Axiom L) If ? is perfect then FV (?, ?*) is complete. Proof (Sketch). Let (/„) be a Cauchy sequence in ??(?, ?*)) and let ? be the limit of (vfn) in cabv(\i, X*). The classical Radon-Nikodym theorem guarantees the existence for each ? e N of a function h„ e L\ (?) such that \\>-\>fn\{E) = f hndn. D) for each ? e ?. Moreover, since X* has the W*RNP, there exists a weak* measurable function /: ? —> X* such that for each .x e X and each ? e ? xv(?) = / xfd\x.
578 К. Miisial The theorem will be proved if we show that there exists a Pettis integrable function / that is weak*-equivalent to /. To do it let us notice first that for each ? from the closed unit ball of X and for each ? e N the equality D) yields the relation \(x,f-fn)\^h„ ?-a.e. E) It follows that for each /; e N and each x** e BC(X**) we have \(x**,f-f„)\^h„ ?-a.e. Since the sequence (h„) is convergent to zero in the norm of ?-?(?), we obtain the measurability of all functions x**f, with x** e BC(X**). If / is weak*-scalarly bounded, then the Pettis integrability of po(f) follows from Theorem 4.28 and we may put / = /}()(/)- If / is arbitrary, we apply the decomposition Corollary 2.11. This completes the proof. ? The next result shows that sometimes the assumptions concerning Axiom L and the measure space are superfluous. We omit a direct and easy proof. PROPOSITION 13.10 (Musial B001a)). Lei X be a separable Banach space and let ? be a closed linear subspace ofX*. Then ??(?, ?) is complete for an arbitrary ?. It is not known whether the space ??(?, ?*) is always complete. There is however an example due to D. Fremlin of a non-conjugate X, such that PV(A, X) is non-complete. Since ? is perfect, it follows that in general the structure of the Banach space X is more important, than the properties of ?. EXAMPLE 13.11 (Fremlin, see Musial B001a)). For each t e ]0, 1] let e, e /,J0, 1] be the unit vector at t: e,(s) = 0 if ? ? t and e,(s) — 1 if s = t. Consider ??).,] as an element of Loc[0, 1] and set for each t e]0, 1] x(t) = Xio.i\ + ei/r. Then, let X be the closed linear subspace of L-J0, 1] ? /^[0, 1] generated by all x(t), te]0, 1] and all (??,?) with ?? e C[0, 1]. THEOREM 13.12 (Musial B001a)). The space FV(X,X) does not have the (K) property. 13.4. LLN^, X) equipped with the variation norm of integrals It has been proved in Talagrand A987) that Pettis integrals of functions from LLN(^, X) are measures of finite variation. Thus, it makes sense to equip the space ???/(?, ?) with the variation norm of the integrals. It will be denoted by ???(?, X,var). Contrary to ??(?, ?*), the space LLN(^, X* ,var) behaves quite well.
Pettis integral 579 THEOREM 13.13 (Musial B001a)). ??(?, X*,var) is a Banach space. PROOF {Sketch). Let ? be a consistent lifting on (?, ?, ?) and, let (/,,) be a Cauchy sequence in LLN{p, X*, var). Moreover, let ? be the limit of (t>/„) in cabv(^, X*) and, let f-.?^ X* be a weak*-density of ? with respect to ?. We shall split the proof into two parts. Assume first that / is weak*-scalarly bounded. Without loss of generality, we may assume then that / = po(f). Moreover, let for each ? a function ?„ e L \ (?) be the RN- density of the measure \vtn\ and let ??„, := ?({? e ?: ? „(?) ^ m}). Since ?„ e L\(p), there exists for each ? e N a number m„ such that ?(?'? ,„ ) < \/n. Since f„xn„„, is scalarly bounded and properly measurable, it follows from the consistency of ? that g„ := A)(/hX#„,„„) is bounded and properly measurable, for each ? e N. Hence, g„ e ???(?, X*, var) and since f - gn= po(f - g„), we see that for each ? e N and some ? > 0 II/-««lice <|?-??|(?) + ???(??_ )- That is / is approximated in the Glivenko-Cantelli norm by elements of ???/(?, ?*). In virtue of Theorem 13.5, / satisfies the law of large numbers. The general case follows by an appropriate decomposition of ?. ? 13.5. Bounded Pettis integrable functions We denote by ??(?, X) the linear space (/??(?,?): ||/||^:= sup ||.v*/lk<^|. 1 iiv-iki ' where ||.х*/11зс is the /^(?^???? of x*f. One can easily check that || · \\p^ is a norm. Then, let ?^(?, ?) := {/ e ?^(?. ?): vt(E) is norm relatively compact). Identifying weakly equivalent functions - we denote by LZJVoc(M. X) the linear space \feLLNte,X):\\f\\P^:= sup \\x*f\\x < oo|. 1 ??.???? ' PROPOSITION 13.14. IfX has the WRNP then Лс (?, ?) is complete. THEOREM 13.15 (Musiat B001b)). (Axiom L) If ? is perfect, then ?^(?,?*) is complete. PROOF. Let (f„)%\ be a Cauchy sequence in ?^(?, ?*). Then for each m,n eN SUp \\x**f„ - X**f„, | = SUp ||-v/„ - Xf„, \\oc ll.v"Kl ' 11-vlKl
580 К. Musiat and sup ||*/„ - xf,„\\x = sup! ??/,,)(?) - ?)(/»;)(?)||, F) where ? is a lifting on Lx(^). Consequently, the sequence (po(f„)) is uniformly convergent to a function h-.? ^> X* such that /г = aj(O- Since for each x** e SC(X**) the functions A) (/«) are measurable, according to Talagrand A984, Theorem 6-2-1)(where the Axiom L is used), the functions A)(/i) are m ??(?·%*) and so ^ is scalarly measurable. Then, it is a consequence of Theorem 5.2 that h e ??(?, ?*). Thus, using F), with h rather then /,„, we get lim||/„ - h\\P =lim sup ||jv/„ — дсй||эс " ????? = limsup|A)(/i)(w)-?)(?)(?)| =0. This proves the completeness of ??(?, ?*). ? The above proof makes it obvious that in fact the following more general result holds true: THEOREM 13,16 (Musiat B001b)). Let ? and X be arbitrary: If for each countable family ? С ??(?, X*) there exists a lifting ? such that Art/) is ?-Pettis-integrable for each f e T, then ??(?, ?*) and ??(?, ?*) are complete. COROLLARY 13.17. // X is separable, then for each ? the spaces ?^(?,?*) and ??(?, ?*) are complete. Analysis of the proof of Theorem 13.15 when ? is consistent shows the validity of the next result. THEOREM 13.18 (Musiat B001b)). The space ????(?, ?*) is complete. Considering each X-valued function as an X**-valued function we get the following result in case of an arbitrary Banach space X: THEOREM 13.19 (Musiat B001b)). The completion of the space ???(?, ?) is a subspace ofLLN'?(?, ?**). If Axiom L is satisfied and ? is perfect then the completion of ??(?-, ?) is a subspace of ??(?, ?**). References Andrews, K.T. A985), Universal Pettis integrability, Canad. J. Math. 37, 141-159. Battle, R.G., Dunford, N. and Schwartz, J. A955), Weak compactness and vector measures, Canad. J. Math. 7. 289-305. Bator, E.M. A985), Pettis integrability and the equality of the norms of the weak*-integral and the Dunford integral, Proc. Amer. Math. Soc. 95, 265-270.
Pettis integral 581 Bator, E.M. A988a), A decomposition of bounded scalarh- measurable functions taking their values in dual Banach spaces, Proc. Amer. Math. Soc. 102, 850-854. Bator, E.M. A988b), Pettis decomposition for universally scalarh measurable functions, Proc. Amer. Math. Soc. 104, 795-800. Bator, E.M. and Lewis, P. A987), Some connections between Pettis integration and operator theory. Rocky Mountain J. Math. 17, 683-695. Bator, E.M. and Lewis, P. A989a), Weak precompactness and the weak RNP, Bull. Polish Acad. Sci. 37, 7-12. Bator, E.M. and Lewis, P. A989b), Remarks on completely continuous operators. Bull. Polish Acad. Sci. 37, 409^413. Beck, A. A963), On the strong law of large numbers, Ergodic Theory, Proc. Intemat. Sympos., New Orleans, 1961, Academic Press, New York. Bessaga, С and Petczynski, A. A958), On bases and unconditional convergence of series in Banach spaces, Studia Math. 17, 151-174. Bilyeu, R.G. A978), A metric inequality characterizing bancentres and other Pettis integrals, Proc. Amer. Math. Soc. 68, 323-326. Bilyeu, R.G. and Lewis, P.W. A985), Differentiability of convex functions, Pettis integrability, and Rybakov's theorem, Bull. Polish Acad. Sci. Math. 33, 581-585. Birkhoff, G. A935), Integration of functions with values in a Banach space, Trans. Amer. Math. Soc. 38, 357-378. Bongiomo, В., Di Piazza, L. and Musiat, K. B000). An alternate approach to the McShane integral. Real Anal. Exchange 25, 829-848. Bourgain, J., Fremlin, D.H. and Talagrand, M. A978), Pointwise compact sets of Baire-measurable functions, Amer. J. Math. 100, 845-886. Brooks, J.K. A969a), Representations of weak and strong integrals in Banach spaces, Proc. Nat. Acad. Sci. USA 2, 266-270. Brooks, J.K. A969b), On the Gelfand-Pettis integral and unconditionally convergent series. Bull. Acad. Polon. Sci. Ser. Sci. Math. Astronom. Phys. 17, 809-813. Brooks, J.K. A972). Sur la representation de I'espace dual de I'espace des functions integrables an sens de Pettis, C. R. Acad. Sci. Paris Ser A-B, A1099-A1101. Brooks, J.K. and Dinculeanu, N. A980), On weak and strong compactness in the space of Pettis integrable functions. Proc. Conf. on Integration, Topology and Geometry in Linear Spaces (Univ. North Carolina. Chapel Hill, NC, 1979). Contemp. Math., Vol. 2. 161-187. Brooks. J.K. and Dinculeanu. N. A982). On weak compactness in the space of Pettis integrable functions. Adv. in Math. 45, 255-258. Chakraborty. N.D. and Jaker, A.Sk. A993), On strongly Pettis integrable functions in locally convex spaces. Rev. Mat. Univ. Complut. Madrid 6. 241-262. Chatterji, S.D. A974), Sur Vintegrabilite de Pettis, Math. Z. 136, 53-58. Chictescu, I. A990), A parametrical example of Dunford, Pettis and Bochner integration. Stud. Cere. Mat. 42, 405^41 8. Collins, H.S. and Ruess, W. A983), Weak conipactness in spaces of compact operators and of vector-valued functions. Pacific J. Math. 106. 45-71. Corson, H.H. A961), The weak topology of a Banach space. Trans. Amer. Math. Soc. 101. 1-15. Diaz, S., Fernandez. ?.. Florencio, M. and Paul. P.J. A993). Complemented copies of cq in the space of Pettis integrable functions. Quaestiones Math. 16. 61-66. Diaz, S.. Fernandez. ?., Florencio. M. and Paul. P.J. A995). A wide class of uhrabomological spaces of measurable functions, J. Math. Anal. Appl. 190. 697-713. Diestel, J. A973), Applications of weak compactness and bases to vector measures and vectorial integration. Rev. Roum. Math. Pures Appl. 28, 211-224. Diestel, J. A975), Geometry of Banach Spaces - Selected Topics. Lecture Notes in Math.. Vol. 485, Springer, Berlin. Diestel, J. A984), Sequences and Series in Banach Spaces. Graduate Texts in Math., Vol. 92. Springer, Berlin. Diestel, J. and Faires. B. A974). On vector measures. Trans. Amer. Math. Soc. 198. 253-271. Diestel, J. and Uhl, J.J. A977). Vector Measures. Math. Surveys. Vol. 15, Amer. Math. Soc., Providence. RI. Diestel, J. and Uhl, J.J. A983), Progress in vector measures 1977-83. Proc. Measure Theory and Its Appl. (Sherbrooke, 1982). Lecture Notes in Math.. Vol. 1033. Springer. Berlin, 144-192.
582 A". Musial Dilworth, S.J. and Girardi. M. A993). Bochner vs. Pettis nann: Examples and results. Banach Spaces (Merida. 1992), Contemp. Math.. Vol. 144. Amer. Math. Soc., Providence. RI. 69-80. Dilworth, S.J. and Girardi, M. A995). Nowhere weak differentiability of the Pettis integral, Quaestiones Math. 18, 365-380. Dimitrov. D.B. A971). One remark concerning Gelfand integral. Functional Anal. Appl. 5. 84-85 (in Russian). Dinculeanu, N. A967), Vector Measures, Pergamon Press. Dinculeanu, N. A981). Uniform a-additivirs- and uniform comergence of conditional expectations in spaces of Bochner or Pettis integrable functions. General Topology and Modem Analysis (Proc. Conf. Univ. California. Riverside. 1980), Academic Press, New York. 391-397. Dinculeanu, N. A983). Uniform с-additivitx in spaces of Bochner or Pettis integrable functions over locally compact group. Proc. Amer. Math. Soc. 87. 627-633. Dinculeanu, N. and Uhl, J.J. A973), A unifying Radon-Nikodym theorem for vector measures. J. Multivariate Anal. 3, 184-203. Dobrakov, I. A970a). On integration in Banach spaces I. Czechoslovak Math. J. 20. 51 1-536. Dobrakov, I. A970b), On integration in Banach spaces II. Czechoslovak Math. J. 20. 680-695. Dobric, V. A990), The decomposition theorem for functions satisfying the law of large numbers. J. Theoretical Probab. 3, 489^496. Drewnowski, L. A986), On the Dunford and Pettis integral. Probability and Banach Spaces (Zaragoza. 1985). Lecture Notes in Math.. Vol. 1221. Springer. Berlin. 1-15. Drewnowski, L., Florencio. M. and Paul. P.J. A992). The space of Pettis integrable functions is barreled. Proc. Amer. Math. Soc. 114. 687-694. Drewnowski, L. and Lipecki. Z. A995). On vector measures which have everywhere infinite variation or non- compact range. Dissertationes Math. 339. 3-39. van Dulst, D. A980), Characterization of Banach Spaces not Containing /'. CWI Tract. Vol. 59. Dunford. N. A937), Integration of vector-valued functions. Bull. Amer. Math. Soc. 43. 24. Dunford. N. and Schwartz. J. A958). Linear Operators I. Pure and Appl. Math.. Interscience. New York. Dvoretzky, A. A961). Some results on convex bodies and Banach spaces. Proc. Intemat. Sympos. on Linear Spaces, Jerusalem A961). 123-160. Edgar, G. A977). Measurability in a Banach space I. Indiana Math. J. 26. 663-667. Edgar, G. A978), On the Radon-Nikodym property and martingale convergence, vector space measures and applications II, Proc. Dublin A977). Lecture Notes in Math.. Vol. 645. Springer. Berlin. 62-76. Edgar, G. A979). Measurability in a Banach space II. Indiana Math. J. 28. 559-579. Edgar, G. A982), On pointwise-compact sets of measurable functions. Measure Theory (Oberwolfach. 1981). Lecture Notes in Math., Vol. 945. Springer. Berlin. 24-28. Egghe, L. A982-83), Extensions of the martingale comergence theory in Banach spaces. Egghe, L. A984), Convergence of adapted sequences of Pettis integrable functions. Pacific J. Math. 114. 345-366. Emmanuele, G. A992), On complemented copies ofc^ in spaces of operators, Comm. Math. 32, 29-32. Emmanuele, G. A993), Precompactness in the space of Pettis integrable functions. Acta. Math. Hungar. 62, 333-335. Emmanuele, G. and Musial, K. A990), Weak precompactness in the space of Pettis integrable functions, J. Math. Anal. Appl. 148. 245-250. Erben, W. and Grimeisen, G. A990), Integration by means of Riemanii sums in Banach spaces I, Z. Anal. Anwend. 9. 481-501. Erben, W. and Grimeisen. G. A991), Integration by means of Riemann sums in Banach spaces II, Z. Anal. Anwend. 10, 11 -26. Farmaki, V A995), A geometric characterization of the weak Radon-Nikodym property in dual Banach spaces. Rocky Mountain J. Math. 25, 611-617. Fink, J.P. A979), The Pettis integration of a perturbed wave equation. Math. Proc. Cambridge Philos. Soc. 86, 145-159. Fremlin, D.H. A975), Pointwise compact sets of measurable functions, Manuscripta Math. 15, 219-242. Fremlin, D.H. A982), Properly relatively compact sets of measurable functions. Note of 1 July 1982. Fremlin, D.H. A994), The Henstock and McShane integrals of vector-valued functions, Illinois J. Math. 38, 471^479. Fremlin, D.H. A995). The generalized McShane integral, Illinois J. Math. 39. 39-67.
Pettis integral 583 Fremlin, D.H. and Mendoza, J. A994), On the integration of vector-valued functions. Illinois J. Math. 38. 127— 147. Fremlin, D.H. and Shelah, S. A993). Pointwise compact and stable sets of measurable functions. J. Symbolic Logic 58, 435^455. Fremlin, D. and Talagrand, M. A979). A decomposition theorem for additive set functions and applications to Pettis integral and ergodic means. Math. Z. 168. 117-142. Freniche, F.J. A998), Embedding cq in the space of Pettis integrable functions. Quaestiones Math. 21. 261-267. Geitz, R. A981), Pettis integration, Proc. Amer. Math. Soc. 82. 81-86. Geitz, R. A982), Geometry and the Pettis integral. Trans. Amer. Math. Soc. 269, 535-548. Gelfand, I.M. A936), Sur un lemme de le theorie des espaces lineaires. Comm. Inst. Math. Mec. Univ. de Kharkoff et Soc. Math. Kharkoff 13. 35^40. Ghoussoub, N. and Saab, E. A981). On the weak Radon-Nikodvm property. Proc. Amer. Math. Soc. 81, 81-84. Gordon, R. A990). The McShane integral of Banach-valued functions. Illinois J. Math. 34. 557-567. Gordon, R. A991), Riemann integration in Banach spaces. Rocky Mountain J. Math. 21, 923-949. Graves, W.H. and Ruess. W. A984), Compactness and weak compactness in spaces of compact-range vector measures. Canad. J. Math. 36. 1000-1020. Grothendieck, A. A952), Criteres de compacite' ge'neraux dans les espaces fonctionnels. Amer. J. Math., 168- 186. Gulisashvili, A. A983), Estimates for the Pettis integral in interpolation spaces with some applications. Banach Space Theory and Its Applications (Bucharest. 1981), Lecture Notes in Math., Vol. 991, Springer. Berlin, 55-78. Hashimoto, K. A983), On the sequential approximation of scalarly measurable functions by simple functions, Tokyo J. Math. 6, 153-166. Hashimoto, K. and Oharu, S. A983), Gelfand integrals and generalized derivatives of vector measures. Hiroshima Math. J. 13, 301-326. Haydon, R. A976), Some more characterizations of Banach spaces containing l\. Math. Proc. Cambridge Philos. Soc. 80, 269-276. Heilio, M. A988), Weakly summable measures in Banach spaces, Ann. Acad. Sci. Fennicae. Ser. A, I. Math., Dissertationes, Vol. 66. Heinich, M.H. A973). Esperance conditionelle pour les functions vectorielles. C. R. Acad. Sci. Paris Ser. A 276. 935-938. Helmer, D. A994). On the Pettis measurability theorem. Bull. Austral. Math. Soc. 50. 109-116. Hoffmann-j0rgensen, J. A971), Vector measures. Math. Scand. 28. 5-32. Hoffmann-j0rgensen, J. A985). The law of laige numbers for non-measurable and non-separable random elements. Coll. en l'Honneur Laurent Schwartz. Vol. 1. Asterisque 131. 299-356. Holmes, R.B. A975). Geometric Functional Analysis anil Its Applications. Graduate Texts in Math., Vol. 24. Springer. Berlin. Huff, R. A986). Remarks on Pettis integrability. Proc. Amer. Math. Soc. 96. 402^404. Ionescu Tulcea, A. and Ionescu Tulcea, С A969), Topics in the Theory of Lifting, Ergeb. Math. Grenzgeb. B. 48. Springer. Berlin. Jaker. A.Sk. and Chakraborty. N.D. A997), Pettis integration in locally coiwex spaces. Anal. Math. 23. 241-257. James, R.C. A964). Weak compactness and refiexivitv, Israel J. Math. 2, 101-119. Janicka. L. A979), Some measure theoretical characterizations of Banach spaces not containing l\. Bull. Acad. Polon. Sci. 27. 561-565. Janicka, L. and Kalton. N.J. A977), Vector measures of infinite variation. Bull. Acad. Polon. Sci. 25, 239-241. Kadets. V.M. A994), Non-differentiable indefinite Pettis integral, Quaestiones Math. 17, 137-139. Kluvanek. I. and Knowles, G. A975). Vector Measures and Control Systems. North-Holland/American Elsevier. Komlos, J. A967). A generalization of a problem ofSteinhaus, Acta Math. Acad. Sci. Hungar. 18. 217-229. de Korvin, A. and Roberts. Ch.E. A979). Interchange of vector-valued integrals when the measures are Bochner or Pettis indefinite integrals. Bull. Austral. Math. Soc. 20, 199-206. Kupka. J. A972). Radon-Nikodym theorems for vector valued measures. Trans. Amer. Math. Soc. 169. 197-217. Luu. D.Q. A997), Convergence of Banach-space-valued martingale-like sequences of Pettis-integrable functions. Bull. Polish Acad. Sci. Math. 45. 233-245. Macheras. N.D. and Musial. K. B000). Liftings of Pettis integrable functions. Hiroschima J. Math. 30, 215-219.
584 К. Musial Matsuda, M. A983), On Sierpinski's function and its applications to vector measures. Math. Japon. 28, 549-560. Matsuda, M. A985), On Saab's characterizations of weak Radon-Nikodym sets, Publ. Res. Inst. Math. Sci., Kyoto Univ. 21, 921-941. Matsuda, M. A986), A characterization of weak Radon-Nikodym sets in dual Banach spaces, Publ. Res. Inst. Math. Sci., Kyoto Univ. 22, 551-559. Matsuda, M. A991), A characterization of Pettis sets in dual Banach spaces. Publ. Res. Inst. Math. Sci., Kyoto Univ. 27, 827-836. Matsuda, M. A993), A characterization of'поп-Pettis sets in terms of martingales. Math. Japon. 1. 177-183. Matsuda, M. A994), Remarks on Pettis sets. Rep. Fac. Sci. Shizuoka Univ. 28. 17-23. Matsuda, M. A995), A characterization of Pettis sets in terms of the Bourgain property; Math. Japon. 41, 433- 439. Matsuda. M. A996), On localized weak precompactness in Banach spaces. Publ. Res. Inst. Math. Sci., Kyoto Univ. 32, 473^491. Matsuda. M. A998), On localized weak precompactness in Banach spaces II. Hiroshima Mat. J. 28. 399-418. Matsuda, M. A999a), ? generalization of the Radon-Nikodvm property in dual Banach spaces, fragmentedness, and differentiability of convex functions. Publ. Res. Inst. Math. Sci.. Kyoto Univ. 35. 921-933. Matsuda. M. A999b). A remark on localized weak precompactness in Banach spaces. Comment. Math. Univ. Carolin. 40. 271-276. Michalak, A. A996). Translations of functions in vector Hardy classes on the unit disk. Dissertationes Math.. Vol. 359. Michalak, A. B000). On Pettis integrabilin of translations of functions in L^. Bull. Polish Acad. Sci. Math. 48. 113-119. Michalak, A. B001), On the Fubini theorem for the Pettis integral for bounded functions. Bull. Polish Acad. Sci. Math. 49, 1-14. Moedomo, S. and Uhl, J.J. A971), Radon-Nikodvm theorems for the Bochner and Pettis integrals. Pacific J. Math. 38,531-536. Munroe, M.E. A946a), A note on weak differentiability of Pettis integrals. Bull. Amer. Math. Soc. 52, 167-174. Munroe, M.E. A946b), A second note on weak differentiability of Pettis integrals. Bull. Amer. Math. Soc. 52, 668-670. Musial, K. A976), The weak Radon-Nikodvm propertv in Banach spaces, Aarhus University. Preprint Series 1976/77, No. 7. Musial, K. A979), The weak Radon-Nikodym property in Banach spaces, Studia Math. 64, 151-174. Musial, K. A980), Martingales of Pettis integrable functions. Measure Theory (Oberwolfach, 1979), Lecture Notes in Math., Vol. 794, Springer, Berlin. 324-339. Musial, K. A982), A characterization of the weak Radon-Nikodvm property in terms of the Lebesgue measure, Proc. Conf. Topology and Measure III. Vitte/Hiddensee. 1980, Wiss. Bait. Emst-Moritz-Amdt-Univ. Greifs- wald, 187-191. Musial, K. A984), The weak Radon-Nikodym Property in conjugate Banach spaces. Proc. Conf. Topology and Measure IV, Trassenheide, 1983, Wiss. Bait. Ernst-Moritz-Amdt-Univ. Greifswald, 163-168. Musial, K. A985), Pettis integration. Proc. 13th Winter School on Abstract Analysis (Smi, 1985), Rend. Circ. Mat. Palermo, Ser II Suppl. 10, 133-142. Musial, K. A987), Vitali and Lebesgue convergence theorems for Pettis integral in locally convex spaces, Atti Sem. Mat. Fis. Univ. Modena 35. 159-166. Musial, K. A990), Pettis integration of dominated functions. Atti Sem. Mat. Fis. Univ. Modena 38. 261-265. Musial, K. A991), Topics in the theory of Pettis integration. School on Measure Theory and Real Analysis (Grado, 1991), Rend. Istit. Mat. Univ. Trieste 23, 177-262. Musial, K. A994), A few remarks concerning the strong law of large numbers for non-separable Banach space valued functions. Workshop di Teoria della Misura e Analisi reale (Grado, 1993), Rend. Istit. Mat. Univ. Trieste Suppl. 26, 221-242. Musial, K. B000), Lifting and some of its applications to the theory of Pettis integral. Workshop di Teoria della Misura e Analisi reale (Grado, 1997). Rend. Istit. Mat. Univ. Trieste Suppl. 31 A). 107-141. Musial, K. B001a), The completeness problem in the space of Pettis integrable functions. Quaestiones Math. 24 441^452.
Pettis integral 585 Musiat, K. B001b), The completeness in spaces of bounded Pettis integrable functions and in spaces of bounded functions satisfying the law of large numbers. Demonstratio Math. 30, 329-334. Musial, K. and Plebanek, G. A989). Pettis integrabilirx and the equalin of the weak* -integral and the Dunford integral. Hiroshima Math. J. 19, 329-332. Musial, K. and Popa, D. A997), The weak Radon-Nikodvm property in spaces of nuclear operators. Quaestiones Math. 20, 677-683. Musial, K. and Ryll-Nardzewski, С A978). Liftings of vector measures and their applications to RNP and WRNP, Vector Space Measures and Applications II. Proc. Dublin A977). Lecture Notes in Math.. Vol. 645. Springer. Berlin, 162-171. vanNeerven, J.M.A.M. A994). Remark on a theorem of Riddle, Saab, and Uhl. Quaestiones Math. 17. 183-187. Odell, E. and Rosenthal. H.P. A975). A double-dual characterization of separable Banach spaces containing l\. Israel J. Math. 20, 375-384. Pallares, A.J. and Vera. G. A985). Pettis integrability of weakly continuous functions and Baire measures. J. London Math. Soc. 32. 479^487. Pettis, B.J. A938), On integration in vector spaces. Trans. Amer. Math. Soc. 44. 277-304. Pettis, B.J. A939), Differentiation in Banach spaces. Duke Math. J. 5. 254-269. Phillips, R.S. A940). Integration in a convex linear topological space. Trans. Amer. Math. Soc. 47. 114-1456. Plebanek, G. A993), On Pettis integrals with separable range. Colloq. Math. 64. 71-78. Plebanek. G. A998). On Pettis integral and Radon measures. Fund. Math. 156. 183-195. Pol, R. A980), On a question ofH.H. Corson and some related problems. Fund. Math. 109. 143-154. Riddle, L.H. A982), The geometry of weak Radon-Nikodvm sets in dual Banach spaces. Proc. Amer. Math. Soc. 86, 433^438. Riddle, L.H. A984), Dunford-Pettis operators and weak Radon-Nikodym sets. Proc. Amer. Math. Soc. 91. 254- 256. Riddle, L.H. and Saab. E. A985). On functions that are universally Pettis integrable. Illinois J. Math. 29. 509-531. Riddle, L.H., Saab, E. and Uhl, J.J. A983). Sets with the weak Radon-Nikodym property in dual Banach spaces. Indiana Univ. Math. J. 32, 527-541. Rieffel, M.A. A968). The Radon-Nikod\m theorem for the Baclmer integral. Trans. Amer. Math. Soc. 131. 466- 487. Rosenthal, H.P. A970), On injective Banach spaces and the spaces L^- (?) for finite measures ?. Acta Math. 124, 205-248. Rosenthal, H.P. A974), A characterization of Banach spaces containing l\. Proc. Nat. Acad. Sci. USA 71. 2411- 2413. Rosenthal. H.P. A978), Some recent discoveries in the isomorphic theory of Banach spaces. Bull. Amer. Math. Soc. 84, 803-831. Rybakov, VI. A968). On vector measures. Izv. Vyssh. Uchebn. Zaved. Mat. 79. 92-101 (in Russian). Rybakov. VI. A970), On the completeness of the space of Pettis integrable functions. Uchebn. Zap. Moskov. Gos. Ped. Inst. 277, 58-64 (in Russian). Rybakov, VI. A971), Conditional mathematical expectation for functions that are integrable in the sense of Pettis. Mat. Zametki 10, 565-570 (in Russian). Rybakov, VI. A975), A generalization of the Boihner integral to locally convex spaces. Mat. Zametki 18, 577- 588 (in Russian). Rybakov, VI. A977), Some properties of measures defined on a normed space possessing RNP. Mat. Zametki 21, 81-92 (in Russian). Saab, E. A982a), On Dunford-Pettis operators that are Pettis representable. Proc. Amer. Math. Soc. 85. 363-365. Saab, E. A982b), Some characterizations of weak Radon-Nikodym sets. Proc. Amer. Math. Soc. 86. 307-311. Saab, E. A988), On the weak*-Radon Nikodym property. Bull. Austral. Math. Soc. 37. 323-332. Saab, E. and Saab, P. A983), A dual geometric characterization of Banach spaces not containing l\. Pacific J. Math. 105,415^425. Schachermayer, W., Sersouri, A. and Werner. E. A989). Moduli of non-dentability and the Radon-Nikodym property in Banach spaces, Israel J. Math. 65. 225-257. Sentilles, D. A981), Stonian integration of vector functions. Proc. Conf. on Measure Theory and Appl.. Northern Illinois Univ., 123-135. Sentilles. D. A983), Decomposition ofweaklx measurable functions. Indiana Univ. Math. J. 32, 425^437.
586 A". Musial Sentilles, D. (unpublished). Differentiation of measures on boolean algebras: Lifling and Radon-Nikodym, Preprint. Sentilles, D. A984), Some measure theoretic implications for the Pettis integral. Measure Theory (Oberwolfach, 1983), Lecture Notes in Math.. Vol. 1089. Springer. Berlin. 165-170. Sentilles, D. and Wheeler, R.F. A983), Pettis integration vie the Stonian transform. Pacific J. Math. 107. 473^496. Sierpinski, W. A934), Hypothese du continu. Monogratie Matematyczne. Sikorski, R. A964), Boolean Algebras, Ergeb. Math. Grenzgeb. B. 25, Springer, Berlin. Stefansson, G.F. A992), Pettis integrability. Trans. Amer. Math. Soc. 330, 401^418. Stefansson, G.F. A995), Universal Pettis integrability property. Proc. Amer. Math. Soc. 123, 1431-1435. Stegall, Ch. A975/76a). A result ofHaydon and its applications. Seminaire Maurey-Schwartz, Expose No. II. Stegall, Ch. A975/76b), Errata. Seminaire Maurey-Schwartz. Strauss, W.. Macheras, N.D. and Musial, K. B002). Liftings. Handbook on Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 1131-1184. Talagrand, M. A980), Sur les mesures vectorielles definies par une application Pettis integrable. Bull. Soc. Math. France 108, 475^483. Talagrand, M. A984), Pettis Integral and Measure Theorv, Mem. Amer. Math. Soc., No. 307, Amer. Math. Soc., Providence, RI. Talagrand, M. A987), The Glivenko-Cantelli pmblem, Ann. Probab. 15. 837-870. Talagrand. M. A996), The Glivenko-Cantelli pmblem, ten years later. J. Probab Theoret. Probab. 9, 371-384. Thomas, E. A974), The Lebesgue-Nikodym Theorem for Vector Valued Radon Measures. Mem. Amer. Math. Soc, No. 139, Amer. Math. Soc, Providence. RI. Thomas, ?. (?976), Totally summable functions with values in locally convex spaces. Measure Theory (Oberwolfach, 1975), Proceedings, Lecture Notes in Math., Vol. 541, Springer, Berlin, 117-131. Tortrat, A. A976), ?-regularite des lois, separation an sens <le A. Tulcea et pmpriete de Radon-Nikodym. Ann. Inst. H. Poincare 12, 131-150. Uhl, J.J. A972), ? characterization of stronglx measurable Pettis integrable functions, Proc. Amer. Math. Soc. 34, 425^427. Uhl, J.J. A972a), Martingales of stronglx measurable Pettis integrable functions. Trans. Amer. Math. Soc. 167, 531-536. Uhl, J.J. A973), ? sunvy of mean convergence of martingales of Pettis integrable functions. Vector and Operator Valued Measures and Applications (Proc Symp., Alta. Utah, 1972). 379-385. Uhl, J.J. A973a), Erramm to "Martingales of strongly measurable Pettis integrable functions", Trans. Amer. Math. Soc. 181, 507. Uhl, J.J. A976/77), Pettis mean convergence of vector-valued asxmptotic martingales. Z. Wahrscheinlichkeitsth. verw. Geb. 37, 291-295. Uhl, J.J. A980), Pettis's measurability theorem. Proc. Conf. on Integration, Topology and Geometry in Linear Spaces (Univ. North Carolina. Chapel Hill, NC, 1979). Contemp. Math.. Vol. 2, Amer. Math. Soc. Providence, RI, 135-144. Vasuk, L. A981), On one generalization of weakly compactly generated Banach spaces, Studia Math. 70, 11-19. Varadarajan. VS. A961), Measures on topological spaces. Mat. Sb. 55. 35-100. (English translation: Amer. Math. Soc. Transl., Vol. 48 A965), 161-228.) von Weizsacker, H. A978), Strong measurability, liftings and the Choquet-Edgar theorem. Vector Space Measures and Applications II, Proc. Dublin A977), Lecture Notes in Math., Vol. 645, Springer, Berlin, 209- 218.
CHAPTER 13 The Henstock-Kurzweil Integral Benedetto Bongiomo Dipartimento di Matematica ed Applkcrjoni, Via Archirafi 34, 90123 Palermo, Italy E-mail: bongiom @ unipa. it Contents Introduction 589 1. Partitions and Riemann sums 589 2. The Henstock-Kurzweil integral on the real line 590 2.1. The primitives 592 2.2. Convergence theorems 593 3. Further Riemann-type integrals on the real line 596 3.1. The McShane integral 596 3.2. The C-integral 596 3.3. Integrals induced by differentiation bases 597 4. Multidimensional Riemann-type integrals 599 4.1. The Henstock-Kurzweil integral 600 4.2. The ?-regular Henstock-Kurzweil integral 601 4.3. The regular Henstock-Kurzweil integral 602 4.4. The p-integral 602 4.5. The divergence theorem 603 4.6. Fubini's theorem 608 5. The Henstock-Kurzweil integral of vector valued functions 608 6. The Henstock-Kurzweil integral on general spaces 609 References 610 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 587
This page intentionally left blank
The Henstock-Kurzweil integral 589 Introduction The problem of recovering a function from its derivative is called the problem of primitives. In his thesis Lebesgue A902) remarked that his integral does not integrate all unbounded derivatives and left open the problem of defining a more general integral which solves the problem of primitives. This was done independently by Denjoy A912a. 1912b) and Perron A914). However, the advantages of Lebesgue's theory were so evident that only few mathematicians were interested into the Denjoy-Perron integral. In spite of the general conviction that no modification of Riemann's method could possibly give such powerful results as that of the Lebesgue's integral, independently Kurzweil A957) and Henstock A963) introduced a Riemann-type integral that is equivalent to the Denjoy-Perron integral. The fact that the Riemann integral has an intuitive appeal and it is well known to each mathematician, and the fact that a Riemann-type integral was as good as Lebesgue's attracted the interest of many scientists. So the theory of nonabsolutely convergent integrals has advanced considerably in the last 30 years. In this chapter we give an overview of this theory. It is far from complete, as research is still being carried out in the area; however we hope that this note gives resonable informations on the current status of the subject. 1. Partitions and Riemann sums The method of Kurzweil and Henstock is based on the notions of gauge and partition. Let [a,b] be a subinterval of the real line E. A gauge on [a,b] is, by definition, any positive function <5 defined on [a,b]. A partition of [a, b] is, by definition, any collection ? = {(A,·,*i)}f=1 of pairwise disjoint intervals A, and points .v, e A, such that [a,b] = U, Ai- If Ц Ai is strictly contained in [a, b]. then ? is called a partial partition of [a,b]. ? is said to be • S-fine whenever А, С ]л",- - <5,, xt¦ + S, [, /' = 1, 2,..., p, • anchored on a set ? whenever .x, e E, i = 1, 2 p. COUSIN'S LEMMA. For any gauge 8 on [a, b] there exists a S-fine partition of [a, b]. Given a function /: [a, b] -> R and a partition ? = {(A,, .v,)}f=1 of [a, b] the Riemann sum of / with respect to ? is defined as follows ? ]?/ = ]?/(*,)?(?,), ? i=l where ?( ?,) stands for the Lebesgue measure of A,.
590 В. Bongiorno 2. The Henstock-Kurzweil integral on the real line Dehnition 2.1. It is said that / is Henstock-Kurzweil integrable on [a, b] whenever there is a real number / satisfying the following condition: for each ? > 0 there exists a gauge S such that ?/-' < ?, for each <5-fine partition ? of [a. b]. Remark that Cousin's lemma implies that the number / from Definition 2.1 is determined uniquely by the integrable function /. In fact, assume that J ? I also satisfies Definition 2.1 and take ? = \I - У1/2. Then we can find two gauges <5| and <5i on [a.b] so that | J2p / - /| < ? for each <5|-fine partition ? of [a, b], and | ?? f - J\ < ? for each <52-fine partition ? of [a. b]. Now let S — min{<5|, <5i), and use Cousin's lemma to find a S-fine partition ? of [a, b]. Then I/-./K ?/ + ?/ <2e = \I-J\ which is a contradiction. The number / is called the Henstock-Kurzweil integral of / on [a.b] and we set / = fa f. It is said that / is Henstock-Kurzweil integrable on a set ? с [a. b] if /?? is Henstock-Kurzweil integrable on [a.b]. Here ?^ denotes the characteristic function of E. If / is Henstock-Kurzweil integrable on [a.b], then it is Henstock-Kurzweil integrable on each subinterval of [a.b] and the function F(x) = f* f is called the indefinite Henstock-Kurzweil integral or the Henstock-Kurzweil primitive of f. Fundamental theorem OF calculus. // F is a differentiable function on [a,b], then its derivative F' is Henstock-Kurzweil integrable on [a, b] and f F' = F(b)-F(a). We remark that in the definition of the Henstock-Kurzweil integral the gauge S can be chosen to be Lebesgue measurable (see Liu A987-88), Pfeffer A988), and Buczolich A997-88)).Moreover if there existsaBaire 1 function^ suchthatg > |/| (and only in this case) the gauge 8 can be chosen to be nearly upper semicontinuous (i.e., almost everywhere equal to some upper semicontinuous function) (see Buczolich A995)). A frequently used lemma in the theory of Henstock-Kurzweil integral is the following lemma.
The Henstock-Kurzweil integral 591 HENSTOCK'S LEMMA. // / is Henstock-Kurzweil integrable on [a,b], then for each ? > 0 there exists a gauge 8 such that ?/(*,-)(&¦-?,-)- ? ' f b, "' < ?. ?'=? for each S-fine partition {]a,, fr, [,-*<)}> ' = ', 2,.. ·, p, of [a. b]. A proof can be found, for example, in McLeod A980) and in Gordon A994). The Henstock's lemma is used, in particular, to prove that: The indefinite Henstock-Kurzweil integral F(x) = /(|* / is a continuous function, differentiable almost everywhere on [a, b] with F' = f almost everywhere on [a, b]. Notice also that (a) each Henstock-Kurzweil integrable function on [a, b] is Lebesgue measurable; (b) each nonnegative Henstock-Kurzweil integrable function on [a, b] is Lebesgue integrable and the two integrals coincide; (c) // a function is Henstock-Kurzweil integrable on every measurable subset ? of [a, b], then it is Lebesgue integrable on [a, b] and the two integrals coincide. From (b) it easily follows the monotone convergence theorem and the dominated convergence theorem. Monotone convergence theorem. If fi ^ fi ^ ¦ ¦ ¦ fn ^ ¦ ¦ ¦ is a sequence of Henstock-Kurzweil integrable on [a. b] functions such that lim„ fj f„ exists finite, then f = lim fn is Henstock-Kurzweil integrable on [a, b] and fa f = lim,, f(l /„. DOMINATED CONVERGENCE THEOREM. If {fn} " a sequence of Henstock-Kurzweil integrable on [a, b] functions such that (i) inf„ /„ and sup„ f„ are Henstock-Kurzweil integrable on [a, b]; (ii) lim„^oo fn = f almost everywhere on [a. b]; then f is Henstock-Kurzweil integrable on [a, b] and fa f = lim,, fa fn. The fact that the Henstock-Kurzweil integral includes the Lebesgue integral was proved by Henstock A968), McShane A969). Davies and Schuss A970). It is however interesting to note that each Henstock-Kurzweil integrable function / on [a,b] is Lebesgue integrable on some subinterval of [a,b] (see Buczolich A991a)), and that it is Lebesgue integrable on a sequence {X„} of measurable sets such that [a, b] = U„ Xn and /x / -»· /j" / (see Lu and Lee A990-91)). Liu A992-93) proved that each X„ can be chosen closed, and Gordon A991-92) proved that each measurable set X„ can be chosen such that fx f = f f. If / is a Lebesgue measurable function, then it is Henstock-Kurzweil integrable if and only if
592 В. Bongiorno (i) it has locally small Riemann sums; i.e., for each ? > 0 there exists a gauge <5 such that for each ? e [a, b], for each subinterval [c,d] of ]x — ?(?), ? + S(x)[ and for each <5-fine partition ? of [c, d] we have | ?? f\<?- or, if and only if (ii) it has functionally small Riemann sums; i.e., for each ? > 0 there exist a nonnegative Lebesgue integrable function g and a gauge <5 such that | ??. f\<?> f°r every S-fine partition ? (where PK denotes the subpartition of P) consisting of all (Ai.Xi) e ? such that |/(.r,-)| > g(xt). Condition (i) was proved by Schurle A986a. 1986b), Bullen and Vyborny A991), and condition (ii) by Lu and Lee A990-91). 2.1. The primitives Lusin A912) introduced the following notion of generalized variation to give a descriptive definition of the Denjoy-Perron integral. Definition 2.2. A function F is said to be AC* on a set ? wheneverfor each ? > 0 there exists ? > 0 such that ?? co(F, ]a,-. ?>,·[) < ? (where ? denotes the oscillation of F over ]a,-, bl¦[), for every sequence of non-overlapping intervals {]a,·. i>,-[} with ]T- \bi — ?,-1 < ? and a,·, fr, e E. Further, F is said to beACG* on [a, b] whenever it is continuous and there exists a sequence of closed sets {?„} such that [a,b] = U„ E„ and F is AC* on each En. Henstock A963), Kurzweil A978), Kubota( 1980), Lee and WittayaNaak-in A982), and Gordon A986-87) showed that the Denjoy-Perron integral is equivalent to the Henstock- Kurzweil integral by proving the following characterization of the Henstock-Kurzweil primitives. THEOREM 2.3. The family of all Henstock-Kurzweil primitives coincides with the class of all ACG ^-functions. A slight modification of the notion of ACG„-function, due to Henstock A963) (see also Gordon A989-90a) and Lee A989-90)) gives a different descriptive characterization of the Henstock-Kurzweil primitives. DEnNlTlON 2.4. A function F is said to be ACs on a set ? whenever for each ? > 0 there exist a positive number ? and a gauge ? such that | ]?;. F(A,-)| < ? for each у-fine partial partition {(Aj. jc,)} anchored in ? and such that ]?( | A, | < ?. F is said to be ACGs on [a, b] whenever there exists a sequence {?„} of sets such that [a,b] = (J,, F„ and F is АСц on each En. It is clear that an ACGs function is ACs on each set of Lebesgue measure zero. Bongiorno et al. A995-96) proved that the converse is also true. It is then possible to formu late a third characterization of Henstock-Kurzweil primitives in terms of the absolute continuity of an useful extension of the classical notion of variation of a function. Let
The Henstock-Kurzweil integral 593 F: [a, b] -> R, ? с [?, fr] and let 5 be a gauge on [a, b]. The 5-variation of F on ? is defined as follows Var(F, S, E) = sup ]T| F(AA)|, л where the supremum is taken over all <5-fine partial partitions {(A;. *<)}f=| anchored on E. The infimum over all gauges S of the 5-variations Var(F, S. ¦) is a metric measure called f/?e variational measure associated with F and denoted by ?/-; i.e., Mf(F) = infVar(F,<5, F). Remark that ?/? coincides with the classical variation of F if and only if F is absolutely continuous (i.e., a Lebesgue primitive). THEOREM 2.5 (Bongiorno et al. A995-96)), The family of all Henstock-Kurzweil primitives coincides with the class of all functions F whose associate variational measure ?/r is absolutely continuous with respect to the Lebesgue measure. Lee A989-90), and Kurzweil and Jarnik A991, 1992a), gave a characterization of Henstock-Kurzweil primitives equivalent to the previous one, but with the additional hypothesis that F is differentiable almost everywhere in [a,b]. Moreover the condition "?/? is absolutely continuous with respect to the Lebesgue measure" is displayed in terms of ? and <5 and it is called the strong Lusin condition of F in Lee (see also Lee and Vyborny A993)). The same property is called F is well behaved on sets of measure zero in Kurzweil and Jarnik A991), and F is variational^' normal in Kurzweil and Jarnik A992a). 2.2. Convergence theorems Let F(x) = jc2sin;c~2 when ? ? 0 and F@) — 0. The sequence /„ = F'x|„-i_|j, ? = 1,2,..., converges to / = F' everywhere on [0. 1] as ? -> oo, but it is not monotone and it is not dominated by any Henstock-Kurzweil integrable function. Yet each fn and / are Henstock-Kurzweil integrable on [0, 1] and we have [ f= lim f fn. Jo "^°°Jo A convergence theorem that includes the monotone convergence theorem, the dominated convergence theorem and covers the above case, is an extension of Vitali's theorem, called controlled convergence theorem. CONTROLLED CONVERGENCE THEOREM. Let{f,,} be a sequence of Henstock-Kurzweil integrable functions on [a, b] such that (i) /,,(jc) -> f(x) almost everywhere in [a, b]\
594 В. Bongiorno (ii) the primitives F„ of /„ are ACG* uniformly in n; i.e., [a.b] is the union of a sequence of closed sets {?<.} satisfying the following condition: for each к and for each ? > 0 there exists ? > 0 such that sup„ J2i co(F„, ]a,, fo, [) < ? for every sequence of non-overlapping inten>als {]a,, b,[] with J2i \bi ~ ai\ < Л ап<^ ai, bi e Ek; (iii) the primitives F„ converge pointwisely to a continuous function F: then f is Henstock-Kurzweil integrable on [a. b] and F is its primitive. This theorem (under the additional hypothesis of uniform convergence of the primitives), was proved by Chelidze and Djvarsheishvili A978), Lee and Chew A985), Lee and Chew A987) and Gordon A990-9la). In the present version it was proved by Chew A980), Liao A987) and Fleischer A996-97). A weaker version was given by Liao A989). If we assume that {F,,} is uniformly ACG& (instead of uniformly ACG*), then condition (iii) can be skipped (see Bongiorno and Di Piazza A991-92)). Some other suitable notions of uniform generalized absolute continuity has been used to skip condition (iii) in the controlled convergence theorem (see Lee A990) and Pujie A993— 94)). Remark that although the notions of function ACG* and ACGs are equivalent, a sequence {F,,} is uniformly ACG s if and only if it is uniformly ACG* and equicontinuous (see Bongiorno et al. A993-94, Theorem 3)). If a sequence {/„} satisfies conditions (i), (ii), (iii), then it is said that {/„} is control convergent to /. It is interesting to notice that for every Henstock-Kurzweil integrable function / on [a, b] there exists a sequence of step functions {sn} which is control convergent to / (see Chew and Lee A990)). A sequence {/„} of Henstock-Kurzweil integrable functions on [a, b] is said to be equi- convergent to / whenever f„{x) -» f(x) for each ? e [a, b], and for each ? > 0 there exists a gauge S such that sup n ? Jit < ?. for each <5-fine partition ? of [a, b], THEOREM 2.6 (Kurzweil A980)). Let {/„}, be a sequence of Henstock-Kurzweil integrable functions on [a,b]. It {/„} is equi-convergent to f, then f is Henstock-Kurzweil integrable on [a, b] and fj f = lim„ f'f„. Gordon A992-93) proved that the notions of equi-convergent sequence and control convergent sequence are equivalent (for everywhere convergent sequences). Further conditions equivalent to the equi-convergence have been given by Bongiorno et al. A993-94), Leader A993-94) and Wang A993-94). See also Leader A992-93) for more convergence theorems. Finally we give three necessary and sufficient conditions for the convergence of a sequence of Henstock-Kurzweil integrals.
The Henstock-Kurzweil integral 595 THEOREM 2.7 (Bartle A994-95)), Let {/„} be a sequence of Henstock-Kurzweil integrable functions on [a,b] such that /„(л) -> f(.x), for each ? e [a,b]. Then f is Henstock-Kurzweil integrable on [a, b] and rb rb / /= lim / /„ if and only if for each ? > 0 there exists a positive integer NF such that for each ? ^ NF there exists a gauge S„ with \ ? ? fn ~ ? ? /? < ? for aH S„-fine partition ? of [a, b]. THEOREM 2.8 (Gordon A995-96)). Let {/„} be a sequence of Henstock-Kurzweil integrable functions on [a,b] such that f„(x) -> f(x), for each ? e [a,b]. Then f is Henstock-Kurzweil integrable on [a, b] and pb pb / /= lim / /„ if and only if for each ? > 0 there exists a gauge 8 such that for each S-fine partition ? on [a, b] there exists a positive integer ? ? with \ ?? fn — fj fn\ < ? for all ? ^??. A sequence {F,,} is called asymptotically-ACG* convergent if there is a sequence of closed sets {X*} such that [a, b] = {jk Xt, and for each ? > 0 and each к there exists щ > 0 such that limy^a>(F„; Ah) < ?, ? '—? h for each collection {A/,} of nonoverlapping intervals with end points on Xk and with ?????? < ik. A non-negative function ? is said to be an essential gauge on [a, b] if the set {x e [a, b\. ?(?) = 0} is a Lebesgue null set. Let F: [a, b] -> ? and let ? с [a, b]. The essential variation of F on ? is defined as follows Vess(F,?) = infsupV|F(A„)|. ? ? ? where the supremum is taken over all y-fine partitions {(A,..v,)}-'=1 anchored on ? and the infimum over all essential gauges ?, THEOREM 2.9 (Bongiorno et al. A995)). // {/„} is a sequence of Henstock-Kurzweil integrable functions on [a, b] such that (!) /h ->¦ / a.e. in [a,b], B) the primitives Fn of'/„ converge pointwise to a continuous function F(x) everywhere on [a, b], then f is Henstock-Kurzweil integrable on [a, b] with primitive F if and only if. (i) {Fn} is asymptotically ACG* on [a, b], (ii) there exists a sequence of measurable sets {Xk} such that [a,b] = {J^Xk and Wfi, - F, Xk) -> O.for each к е N.
596 В. Bongiorno 3. Further Riemann-type integrals on the real line The method of Henstock and Kurzweil was adapted in many ways to different situations. The first and most striking modification was done by McShane A969). It concerns the notion of partition. A collection ? = {(A,, .v,)}^, of pairwise disjoint intervals A, and points Xi e [a, b] is said to be a McShane partition if [a, b] = Ц A,. Remark that each partition in the sense of Section 1 is a McShane partition, but not vice versa. 3.1. The McShane integral Dehnition 3.1. It is said that / : [a, b] -> ? is McShane integrable on [a, b] whenever there is a real number / satisfying the following condition: for each ? > 0 there exists a gauge S such that ?/-' < ?. ? for each 5-fine McShane partition ? of [a, b]. The fact that in a McShane partition {(?,, ?,)} the point x, may be chosen outside the interval A, increases the number of <5-fine partitions, so it is more difficult for a function to be integrable. In fact, THEOREM 3.2 (McShane A969)). A function f is McShane integrable on [a, b] if and only if it is Lebesgue integrable on [a. b]. 3.2. The C-integral Bruckner et al. A986) remarked that the solution of the problem of primitives provided by Denjoy, Perron, Kurzweil and Henstock possesses a generality which is not needed for this purpose. In fact the function F(x)=\XS'm^-- °<^'- lo, x = 0. is a primitive for the Denjoy-Perron-Kurzweil-Henstock integral, but it is neither a Lebesgue primitive, nor a differentiable function, or a sum of a Lebesgue primitive and a differentiable function. A Riemann-type definition for the minimal integral which includes Lebesgue integrable functions and derivatives was given by Bongiorno A996) and Bongiorno et al. B000), using the following small modification of the McShane integral.
The Henstock-Kurzweil integral 597 DEHNITION 3.3. A function / : [a, b] -> ? it is said to be C-integrable on [a, b] if there exists a constant / such that for each ? > 0 there is a gauge <5 so that ]?/(*,)?(?,)-/ / = l < ?. for each <5-fine McShane partition {(A\,x\), (??,??)} satisfying the condition Ef=1distU,, A,)< 1/?. The mentioned minimality of this integral is an immediate consequence of the next theorem. THEOREM 3.4 (Bongiorno et al, B000)), A function f:[a.b}^> ? is C-integrable if and only if there exist a derivative f\ and a Lebesgue integrable function /2 such that / = /1 + /2. 3.3. Integrals induced by differentiation bases We recall that a differentiation basis on [a, b] is a collection ? of pairs (A, x), where A is a subinterval of [a, b] and ? is a point of A, such that inf{X(A): (А. л) e ?} = 0 for each ? & [a, b]. A partition ? = {(Aj, xi)}l.= [ is called a /i-partition whenever (A,-,*;) e /i for J = 1, 2,...,p. Let ? be a family of differentiation bases {?(?): a el} satisfying the partitioning property; i.e., a /i(a)-partition of [a, b] exists for each or e I. A function f:[a,b]-> ? is said to be B-integrable, with integral /, on [a, b] if for every ? > 0 there exists a differentiation basis ?(?) e В such that | ]T/> / — /| < ?, for each ? (a)-partition ? of [a, b]. The number / is called the ?-integral of / on [a, b]. Examples of such integrals are the dyadic integral, the approximate integral, the symmetric integral and the approximate symmetric integral. 3.3.1. The dyadic integral. Let a and b be dyadic numbers and let S be a gauge on [a,b]. We denote by /id(<5) the class of all pairs (A,x) such that ? e А с [a,b], A = [;/2*, (; + \)/2k], for some naturals ; and k, and Ac].t- &{x),x + S(x)[. Let ??d = {Ad(S)}, where 5 runs on the family of all gauges on [a,b]. The Sj-integral is called the dyadic integral. It was studied in many papers, in particular in connection with some problem in Dyadic Harmonic Analysis (see Skvortsov A969, 1988, 1992-93), Kahane A988) and Gordon A990-9lb)). The primitives have been characterized by Bongiorno et al. B000) by an extension of Theorem 2.5 in which the variational measure is constructed using Bd partitions. 3.3.2. The approximate integral. Let {Sx: ? e [a. b]} be a family of measurable subsets of [a,b] such that ? e Sx and ? is a point of density of ??¦. The class of all pairs ([c,d], x) such that c,d e Sx and ? e [c.d] is, by definition, an approximate differentiation basis.
598 В, Bongiorno Let ??ap be the family of all approximate differentiation bases. The ??ap-integral is called the approximate integral. A Perron equivalent definition of the approximate integral was defined by Burkill A931), The approximate integral was studied by Henstock A991), Carrington and Pacquement A972), Bullen A983), Liao A987), Lee A989), Gordon A990-9 la), Skvortsov A992-93), Fu A993-94), Liao and Chew A993-94), Liao A993-94), Gordon A994), Lu A996-97), Bongiorno et al. B000). The problem of recovering an approximately continuous function from its approximate derivative is solved by the approximate integral (see Gordon A994)), while the Denjoy- Khintchine integral recovers the continuous primitives only (see Saks A964) and KhintchineA927)), A function F is the primitives of / with respect the approximate integral if and only if it is approximately differentiable on [a, b] except for a countable set with approximate derivative F' — f almost everywhere on [a, b]. A different characterization in terms of generalized absolutely continuous function was given by Gordon A994—95): THEOREM 3.5. A function F:[a,b]-+ Ш is a primitive of the approximate integral if and only if it is ACG&; i.e., there is a sequence of measurable sets {E„} such that [a, b] = Un E„, and for each ? > 0 and each ? e N there exist ?„ > 0 and an approximate differentiable base ?„ such that J2(F(di)-F(Cl)) v,e?„ < ? for each ?,,-partition {(]c/,d,-[, xj)\ i — 1. 2...., p\ with ?,,.^,,^/ — c<) < '!¦ Convergence theorems are due to Darmawijaya and Lee A988). 3.3.3. The symmetric integral. Given a gauge S on [a. b], we denote by /isym (<5) the collection of all pairs (I,x) such that / = [x -h,x +h], withO < h < S(x) and ? <x-h, x + h <Ь,от I = [a,a + h] with 0 < h < S(x), or / = [b - h, b] with 0 < h < S(x). Let fisym = {Asym(S)}, where <5 runs on the family of all gauges on [a, b]. The ??sym-integral is usually called the symmetric integral. The symmetric integral was studied by Henstock A963), Kurzweil and Jarnik A987, 1990a) and Thomson A994). In fact the integral considered by Thomson (called the (R[s )- integral) is somewhat more general than the /isvm-integral. It is defined by special partitions called symmetric partitions. For each gauge S, a /iSymE)-partition {(A/, Xj)} is said to be a symmetric partition if the conditions a e A-,,b e Aj imply ?(?,) = X(Aj). Remark that 4sym- (respectively G?s)-) integrability on [a.b] does not imply ? sym" (respectively (R[s)-) integrability on each subinterval of [a, b] and that /}Sym- (respectively (/?])-) integrability on contiguous intervals does not imply /iSym- (respectively (/?])-) integrability on their union. The primitives of the (tf')-integral are characterized by symmetrically ACG* functions defined on all [a, b] except possibly some countable set (see Thomson A994, Theorem 9.34)), or, in some particular cases, by an extension of Theorem 2.5 in which the
The Henstock—Kurzweil integral 599 variational measure is constructed using Bsym partitions (see Bongiorno et al. B000, Corollary 5)). Remark also that in some very simple situations the symmetric integral Asym (respectively (/?])) may be used to solve the coefficient problem for trigonometric series; i.e., if ОС f(x) = — + /J(fljt cosk.x + bk sinkx). for each ? e R. *=o how may the coefficients of the series be determined? Of course if / is Lebesgue integrable then the coefficients are determined by the usual Fourier formulas using Lebesgue integrals. But in general / is not Lebesgue integrable, neither Henstock-Kurzweil integrable. This is for example the case of each function f(x) = ?„ b„ ???? with b„ > 0 and ]Tn b„/n = +oo. Nevertheless, even in the most general case b„ \ 0, / is symmetrically integrable (in both senses) on any interval of length 2? and the coefficients are determined by the usual Fourier formulas using the symmetric integrals. 3.3.4. The approximate symmetric integral. A complete solution for the coefficient problem is given by the approximate symmetric integral, introduced by Preiss and Thomson A989). A differentiation basis ? is said to be a measurable approximate symmetric differentiation basis if there is a measurable set Г с R ? ]0, oo[ such that (i) ([x - t, ? +1], x) e ? whenever (x, t) e ?, (ii) limsup,lV) \{t e @, h); (?, t)<?T}\/h = 0, for each ? e R. Let ??as be the family of all measurable approximate symmetric differentiation bases. As before, in the definition of the ??as-integral the partitions are "symmetric partitions". The resulting integral is the approximate symmetric integral. Solutions of the coefficients problem have been also done by Denjoy A941^49), Marcinkiewicz and Zygmund A936), James A950) and Burkill A951). They involve the inversion of a second order symmetric derivative: en f/ ч r F(x+h) + F(x-h)-2F(x) SD2F(x)— lim . ft^o+ h2 A Riemann-type integral which gives F in function of SDtF was defined by Freiling et al. A997). The definition of this integral involves several elaborate partitioning arguments for rectangles in the plane. 4. Multidimensional Riemann-type integrals A subset A of R" is said to be an interval if it is the Cartesian product of ? no degenerate intervals of R. Given a gauge S: A -> R+ and a partition ? we say that ? is 5-fine if d(Aj) < S(xj), i = 1,2,..., p, where "d" stands for "diameter".
600 В. Bongiorno Given a function /: A -> Ш and a partition ? = {(A,, jt,)}''=1 of A, we set ? ? f = ??=? /(¦*/')?"(?,-), where ?"(?,) stands for the и-dimensional Lebesgue measure of A,. 4.1. 77?e Henstock-Kurzweil integral DEHNITION 4.1. Let A be an interval of E" and let /: A -> R. It is said that / is Henstock-Kurzweil integrable on A whenever there is a real number / satisfying the following condition: for each ? > 0 there exists a gauge S on A such that ? ?/> / — /| < ?, for each <5-fine partition ? of A. Many of the properties of the one dimensional Henstock-Kurzweil integral extends to the и-dimensional one (see Henstock A963. 1988)), in particular - the Henstock-Kurzweil integral includes the Lebesgue integral; - each nonnegative Henstock-Kurzweil integrable function is Lebesgue integrable; - monotone and dominated convergence theorems hold; - the Henstock-Kurzweil integrability on A implies the Henstock-Kurzweil integrabil- ity on each subinterval of A; - the Henstock-Kurzweil integrability on A implies the Lebesgue integrability on some subinterval of A (Buczolich A991a)); - the Henstock-Kurzweil integrability on A implies the existence of a sequence of closed sets [X„] such that A = U„ X» and /x / -> /j' / (see Lee et al. A996-97)), The indefinite и-dimensional Henstock-Kurzweil integral F(B) = fB f is an additive continuous interval functions, but in general the set Of of all points ? e A such that F is not differentiable at ? has positive Lebesgue measure; i,e„ for each ? e Of. there exits к &N such that for each m e N the condition F{B) ft ч - fix) ?» (S) >k holds for some interval В with ? e В and d{B) ^ \/m. For each teNwe set rk(F)= \(B,x): xe Of and —!—- - f(x) ^k\. ? ?»(?) ) THEOREM 4.2 (Lu and Lee A999)). An interval function F defined on the subintervals of A is the primitive of a Henstock-Kurzweil integrable function f on A if and only if for any к e N and ? > 0 there is a gauge S such that ]??"(?)<? and Y^\F(B)\<e, for each S-fine partial partition ? = {(A,. jc,-)}/Li with iA>-xi) e A-(F) and*; e 0F,for i = 1, 2,..., p.
The Henstock-Kurzweil integral 601 Other characterizations of the primitives of the «-dimensional Henstock-Kurzweil integral has been given by Lee and Ng A996-97), and Kurzweil A998). Integrals of Riemann-type which primitives are differentiable almost everywhere and absolutely continuous in some generalized sense are the ?-regular Henstock-Kurzweil integral, the regular Henstock-Kurzweil integral, the p-integral. 4.2. The ?-regular Henstock-Kurzweil integral The regularity of an interval В is the ratio of its shortest and longest side; it is denoted by r(B). Let 0 < ? ^ 1. A partition ? = {(A,,jc,)}''=1 of A is said to be ?-regular if r(Aj) ^ ? for each /'. Dehnition 4.3. A function /: A -* R is said to be ?-regularly Henstock-Kurzweil integrable on A whenever there is a real number / satisfying the following condition: for each ? > 0 there exists a gauge S such that | J2P f - I\ < ?. for each <5-fine ?-regular partition ? of A, Note that for each 0 < ? < 1 there exists a function / which is ? ?-regularly Henstock- Kurzweil integrable for each a\ e (a. 1), and is not ??-regularly Henstock-Kurzweil integrable for each aj e @, ?), An interval function F is said to be a-differentiable at a point ? e A with ?-derivative F^{x) if for every ? > 0 there exists <5(;t) > 0 such that F(B) F>, ч ?» (S) < ? for each interval В satisfying the conditions ? e B, d(B) ^ <5(;t) and r{B) ^ a. Remark that if 0 < a < ? ? 1 and F is ?-differentiable to / at x, then F is ?- differentiable to / at ? (see Kurzweil and Jarnik A992a)). Let ? С [a, b) and let S be a gauge on E. The ?-regular S-variation of F on ? is defined as follows V„(F,a,E) = sup?|F(Aft)|, li where the supremum is taken over all ?-regular S-fine partial partitions {(Ал ,*/,)} anchored on E. The ?-regular variation of F on ? is defined as follows V„(F. E) = mfVa(F,8,E), where the infimum is taken over all gauges S. V„(F, ¦) is a metric measure. If F is additive and V„(F. ¦) is absolutely continuous with respect to the Lebesgue measure on A then F is ?-differentiable almost everywhere on A (see Di Piazza B001)).
602 В, Bongiorno THEOREM 4.4 (Kurzweil and Jarnik A992a)). An interval function F defined on the subintervals of A is the primitive of an a-regularly Henstock-Kurzweil integrable function f on A if and only if (i) F is additive; (ii) the measure Va(F. ¦) is absolutely continuous with respect to the Lebesgue measure; (iii) F^{x) = f{x) at almost every ? e A. 4.3. The regular Henstock-Kurzweil integral Given a function ? ; A -> @. 1), a partial partition ? = {(Ai.Xi)}j'_] of A is said to be ?-regular if r(A;) >0 (*,·). An interval function F defined on the subintervals of A is said to be regularly differentiable at a point ? & A with derivative F'(x) if for every ? > 0, there exist S(x) > 0 and 0 < ?(?) ? 1 such that F (x) < ? ?» (S) for each interval В satisfying the conditions ? e B, d(B) ? S(x) and r(B) ^ ?(?). DEnNlTlON 4.5. A function f:A -> R is said to be regularly Henstock-Kurzweil integrable on A whenever there is a real number / satisfying the following condition: for each ? > 0 there exist a gauge <5 and a function ? : A —* @. 1) such that \?? f ~ I\ < ?> for each <5-fine /3-regular partition ? of A. Let ? с A. An interval function F is said to be AC* on ? if for every ? > 0 there exist a gauge 8, a function ?;? -* @, 1) and a positive constant ? > 0 such that 53,- |F(A,)| < ? for each <5-fine /S-regular partial partition {(A/,-*,)}f=1 anchored on ? such that ?], ?"(A,) < /j. A continuous function F is said to be ACG* on A if A is the union of a sequence of sets {?,} such that F is AC* on each ?,-. THEOREM 4.6 (Chew A989-90)). Аи interval function F definedon the subintervals of A is the primitive of a regularly Henstock-Kurzweil integrable function f on A if and only if F is ACG* on A and F'(x) = f(x) almost everywhere in A. 4.4. The p-integral Given a function p:AxR+->@,l) and a partial partition ? = {(A,, -*/)}''=| of A, it is said that ? is yo-regular whenever r( A,) > ? (.*,-, d( A,·)) for each /'. If p(x,d) < 1 for each jc e A and ii e E+, then given a gauge S on A there exists a <5-fine p-regular partition of A (see Jarnik and Kurzweil A995)).
The Henstock-Kurzweil integral 603 The /O-regular variation VP(F, E) of F on a set ? с A is defined as the ?-regular variation by using yo-regular partitions. If F is additive and VP(F,-) is absolutely continuous with respect to the Lebesgue measure on A, then F is regularly differentiable almost everywhere on A (see Di Piazza B001)). Dehnition 4.7. Let ?: ? ? R+ -> @. 1) with p(x.d) < 1 for each .v e A and d e К" and let / : A —* R. It is said that / is yo-integrable on A whenever there is a real number / satisfying the following condition: for each ? > 0 there exists a gauge S such that ? ?? / — /| < ?, for each <5-fine yo-regular partition ? of A. THEOREM 4.8 (Jarnik and Kurzweil A995)). Let ?: ? ? E+ -> ]0, 1[ be a function satisfying the conditions: A) p(x,d) < 1 for each ? e A a/idd e R+: B) 0 < liminfi/_0+ p(x,d) ^ limsup(/^(r p(x.d) < \ for each ? e A. Then an interval function F defined on the subintervals of A is the primitive of a p- integrable function f on A if and only if (i) F is additive; (ii) the measure Vp(F, ¦) is absolutely continuous with respect to the Lebesgue measure; (iii) F'(x) = f(x) at almost every ? e A. Each /O-integrable function is the limit of an equi-convergent sequence of step-functions (see Jarnik and Kurzweil A997)). 4.5. The divergence theorem The main force of the one dimensional Henstock-Kurzweil integral is the integrability of each derivative. A corresponding result in higher dimensions should be the integrability of the divergence of any differentiable vector field. Unfortunately this does not seem to be the case using the и-dimensional Henstock-Kurzweil integral. Modifications of the Henstock- Kurzweil method producing integrals for which the divergence of any differentiable vector field is integrable, are due to Mawhin A981a, 1981b), Jarnik et al. A983), Jarnik and Kurzweil A984, 1985, 1988), Pfeffer A986 1993), Jurkat A993), Jurkat and Nonnenmacher A994a, 1994b, 1994c), Nonnenmacher A994, 1995, 1995-96). 4.5.1. The GP-integral. DEHNITION 4.9. It is said that / : A -> R is GP-integrable on A whenever there is a real number / satisfying the following condition: for each ? > 0 and 0 < a ^ 1 there exists a gauge S on A such that I ?/> / — /| < ?. for each partition ? of A which is 5-fine and ?-regular. The GP-integral was introduced by Mawhin A98 lb). The divergence of a differentiable vector field is GP-integrable and the Gauss-Green formula holds. However, there are
604 В. Bongiorno examples when A and В are non-overlapping intervals, the GP-integrals fA f and fB f exist, but fAljB f does not (see Jarnik et al. A983)). Pfeffer A986) suggested to solve this problem by using special r-regular partitions. A partition ? = {(А,, .г,·), / = 1,2, — p} is called special if jc, is a vertex of the interval A, for each /'. Then the problem arises whether for any gauge S and for any 0 < ? ? 1, there exists a <5-fine ?-regular special partition of A. Buczolich A991a) answered positively this problem in R2 if ? — 1/1000, and Fejzic et al. A993-94) if ? < 1/-У2. In dimensions higher than two the problem about the existence of special partitions is still unsolved. 4.5.2. The M\-integral. Jarnik et al. A983) modified the notion of ?-regular partition, obtaining an additive integral which preserves the good properties of the GP-integral. Given a partition ? = {(A/, -*,¦)},'!= ? · we set /' 2T(P) = ??*„_, OA;)d(A;) ; = i where ЭА; denotes the boundary of A, and HH~\ (ЭА,) its (n — 1 )-dimensional Hausdorff measure. Definition 4.10. It is said that /: A -> R is M\ -integrable on A whenever there is a real number / satisfying the following condition: for each positive ? and С there exists a gauge <5 on A such that | ? ? / — /| < ?, for each <5-fine partition ? of A with ?(?) ^ C. 4.5.3. The Nonnenmacher's integral. Nonnenmacher A994) extended the M\ integral by considering only those <5-fine partitions for which the sum of the (n — 1 )-dimensional Hausdorff measures of the boundaries of the non-regular intervals occurring in the partition is bounded. DEnNlTlON 4.11. A function / : A -> R is said to be Nonnenmacher's integrable on A whenever there is a real number / satisfying the following condition: for each ? > 0, r > 0, К > 0 there exists a gauge S on A such that | ?? f — Ц < ?, for each <5-fine partition P = {(Ai,Xi)}?=i of A such that Eru,)<rHn-№Aj)) ??. The divergence theorem holds with respect to each vector field which may not be differentiable on countably many hyperplanes and its continuity may fail at contably many points where it is only assumed the vector field to be of Lipschitz type with a non-positive exponent ? > 1 — п. 4.5.4. The Pfeffer's integral. An extension of the Nonnenmacher's integral was obtained by Pfeffer A986) by using a relative regularity condition with respect to a finite family of planes parallel to the coordinate axes. Let 0 ^ к ? ? — 1. A k-plane is a A:-dimensional linear submanifold of R" which is parallel to к distinct coordinate axes. The regularity of an interval В relative to a family V of ^-dimensional planes is the number r(B.V) defined as follows: if V = 0,
The Henslock-Kurzweil integral 605 then ?(?,-?) = k"(B)/(d(B))u; if V consists of a single ?-plane H, then r(B.H) = ?*(? ? H)/{d{B))k (here ?*(·) denote the A:-dimensional Lebesgue measure) whenever ? П ? ? 0 and r(fi. ?) = k"(B)/(d(B))n otherwise; finally, if V ? 0 is arbitrary, then r(fi,-p) = sup{r(fi.{#}): ? eV]. Dehnition 4.12. A function / : A -> ? is said to be Pfeffer's integrable on A whenever there is a real number / satisfying the following condition: for each 0 < ? < 1/2 and each finite family of A:-planes V, with 0 ^ к ^ ? — 1. there exists a gauge <5 on A such that | ?? / — /| < ?, for each 5-fine partition ? = {(А/. л";)}''=| of A such that r(A(, V) ^ ?, i = 1,2,..., p. Using this integral it is possible to formulate a divergence theorem in which the vector field may not be differentiable on countably many (n — l)-planes and its continuity may fail at contably many (n — 2)-planes. Moreover the usual transformation formula holds for a continuous map ? such that A is the union of nonoverlapping intervals A,, /' = 1.2. p, and ? is affine on each A;. 4.5.5. The PV'-integral. A better transformation formula holds for the Pt/-integral. It was introduced by Jarnik and Kurzweil A985, 1988) with the use of С '-partitions of unity (br. Pt/-partitions) instead of the usual partitions. Given an interval А с ?", a Pi/-partition of A is a collection {(9j,Xi)}f=i of C1-functions ?, with compact supports and points ?? e A such that 0 ^ ?f=, ?, (t) ^ 1 for each t e E". and ?f=, ?, (t) = 1 on A. If S is a gauge on A, then a Pi/-partition {(#,, .v/M'Li is said t0 be <5-fine if ft vanishes outside the ball of center x-, and radius ?(?,). for each /'. Dehnition 4.13. A function /: A -> ? is said to be Pt/-integrable on A whenever there is a real number / satisfying the following condition: for each ? > 0 there exist a gauge S on A and two constants a > 0 and ? > 1 such that ? fix,) / Wdt-I < ?. for each 5-fine Pt/-partition {(ft..v,)}''=| of A such that 9,(t) < A +orH;(.x;) for each t e E", and / |D0/(O||dr < — / 0/(r)dr. Jr" Pi Jr» where || · || denotes the euclidean norm of E", D9, denotes the gradient of ?,, and Pi — sup{ || t - ?/1|: t e supp ft}. The usual transformation formula holds for С' -diffeomorphisms; therefore it is possible to extend the Pt/-integration to differentiable manifolds. The divergence theorem holds with respect to each continuous vector field which may not be differentiable on a hyperplane.
606 В. Btmgionio 4.5.6. The g-integral. The g-integral, introduced by Pfeffer A993), is an integral which uses a more traditional concept of interval partition, but a larger class of gauges. A function S : A -> R+ U {0} is said to be a weak gauge on A whenever its null set has ?-finite (n — 1 )- dimensional Hausdorff measure. Dehnition 4.14. Let A be an interval. A function / ; A -> R is said to be g-integrable on A whenever there is a real number / satisfying the following condition: for each ? > 0 there exist a weak gauge <5 and a sequence of positive real numbers {?;}, j = 1, 2 , so that | ?? f - I\ < ? for each 5-fine ?-regular partial partition ? = {(?,, .?,)}''=| such that A \ U(;L| A-j is the union of disjoint intervals B\. Zb Bk with 7i„-\ (SBj) < l/? and ??{?])<?]???] = 1,2,.... ?. The divergence theorem holds with respect to each continuous vector field which may not be differentiable on a set of ?-finite (n — 1 )-dimensional Hausdorff measure. However the g-integral is not rotation invariant (Buczolich A992-93)). This deficiency is removed by the ^"-integral. 4.5.7. The ?-integral. The :F-integral. introduced by Pfeffer A993). uses partitions with figures (finite union of intervals). Definition 4.15. Let A be a figure. A function /: A -> R is said to be JF-integrable on A whenever there is a real number / satisfying the following condition: for each ? > 0 there exist an essential gauge <5 and a sequence of positive real numbers {/jy}, j = 1.2 so that | J2p f - l\ <? for each 5-fine partial partition ? = {(A,. .v,)}''=1 such that A) A, isafigurewithX"(A,)/(i/(A,)W„-i(A,)) > ? for/= 1.2 p: B) A \ (Jf=| A, is the union of disjoint figures B\, B-> Bk with H„-\(dBj) < 1/? and ?"(?j) < ?, for j = 1.2 k. The J"-integral satisfies the change of variable formula for each injective Lipschitz map ? : A -> R" such that ?~] is also Lipschitz (lipeomorphism) and ?(?) is a figure. However, there is some lipeomorphism ?:?-> R" such that ? (A) is not a figure. This lack is removed by the ??V-integral. 4.5.8. The BV-integral. The SV-integral. introduced by Pfeffer A99la, 1993), is based on partitions which use BV-sets instead of figures. Given a set E, the essential closure of ? is defined as the set of all ? e R" such that ? is not a dispersion point for E. It is denoted by cL?. The set deE =cl,.? nd,.(R" \ E) is called the essential boundary of ?. A BV-set is a bounded subset ? of R" such that H„-\ (de(E)) < +oo. The intervals and the figures are particular В V -sets. Dehnition 4.16. Let A be a BV set. A function /:cl,.A -> R is said to be BV- integrable on A whenever there is a real number / satisfying the following condition: for each ? > 0 there exist an essential gauge S and a sequence of positive real
The Henslock-Kurzweil integral 607 numbers {/jy}, j = 1, 2 , so that | ]Tp / — /| < ? for each <5-fine partial partition {{A\,x\), (A2,X2), ¦¦¦, (Ap,xp)} such that (?) A, isaBVsetwithXtAO/O^A/iK-i^A,)) > ? for / = 1, 2,.... p; B') A \ Uf=| A, is the union of disjoint BV sets ?|. B2 Bk with H„-1 (9^бу) < ?/? and ?"(?,) < /jy for ; = 1. 2 к. The SV-integral is invariant for general lipeomorphisms. The divergence theorem holds with respect to each continuous vector field which may not be differentiable on a set with ?-finite (n — 1 )-dimensional Hausdorff measure. 4.5.9. The v-integrai The v-integral was introduced and studied by Jurkat A993). Jurkat and Nonnenmacher A994a, 1994b. 1994c). Nonnenmacher A995, 1995-96). It is a coordinate free integral for which the divergence theorem holds with respect to each unbounded vector field with singularities of order greater than 1 — /;. Dehnition 4.17. Let A be a compact BV set. A function / : A -* Ш" is said to be v- integrable on A whenever there are a real number /, a decomposition A = Eq U (U/? ? ?/') of A and a function ?: N -* [0, ?] ? {1,2} such that A) the sets ??> E\,... are disjoint; B) EQ is a subset of the interior of A, and ?" (? \ ?(>) = 0; C) if 00) = (?, 1) with a < n, then Ha(Ej) < do; D) if 0(/) = (n, 1), then ?"(?;) = 0; E) if 0(/) = (a. 2), then W„(?,) = 0; and such that, for each positive ?,?,?\ there exist a gauge S on A and a sequence of positive real numbers ?\,?? so that | ?]/>/ — /| < ? for each <5-fine partial partition ? = {(Ak, **)}?=| U {(AJr ^',)}^=| of A satisfying the following conditions: A) the {Ak} and the {A'h) are compact BV subsets of A; B) if jc* e ?0, then Л(А*)" <; K\Ak\„ and H„-1 (ЭЛА) ? Kd{Akf-]; C) for / = 1,2, the collection ?, of all A* such that xk e E-, satisfies one of the following conditions: (a) if 0@ = (?, 1) (respectively 00) = (?. 2)) with 0 ^ a < ? - 1, then ?, is a subcollection of a finite family {Cj]"-=i of compact SV sets such that 7in-\(B(x,r) П ЭСУ) ?? АГ,г"~' for all .r e R" and r > 0. Ey^C/)" ij AT, (respectively Zjd(Cj)a < 6»,). and ?y. Xc,(-v) < */; (b) if 0(/) = (n - 1. 1) (respectively 00) = (?/ - 1.2)). then П„-\(дАк) ? AT,, (respectively H„- \ (ЭС,¦) ^ 6>,). for each Aa e ?,; (c) if 0(z') = (a, 1) (respectively 00) = (a. 2)) with ? - 1 < a < л, thenrf(A^)" ^ AT, ?" (Л*), Нп-\(ЕдАк) ^ AT/, (respectively |Э(Су)„_| ^ 6),·) for each A* e r,; (d) if 00) = (и, 1) or 00) = 0/.2), then d(Ak)" ? K,\Ak\„ and H„-\(dAk) ? KidiAk)"-* for each Ак еГ,; D) ?Л W„_ ? (Э A;,) ^ AT, and for /; = 1. 2 <7 either x'h e ?0 either there exist / e N and a e ]n - 1, и] such that 0@ = (<*- ;')¦ j =1.2, and .v^( e ?,-.
608 В. Bongiorno 4.6. Fubini 's theorem One of the main tool of the multidimensional Lebesgue integral is the Fubini's theorem. This is not the case for many of the integrals considered in this section. In fact Fubini's theorem is incompatible with the divergence theorem, in the sense that for each multidimensional integral that satisfies Fubini's theorem and in dimension one coincides with the Henstock-Kurzweil integral, it is possible to construct a differentiable vector field that is not integrable (see Pfeffer A993)). Fubini's theorem holds for the и-dimensional Henstock-Kurzweil integral (see Kurzweil A973) and Henstock A988)) but it does not hold for the regular integrals (see Ostaszewski A986)). 5. The Henstock-Kurzweil integral of vector valued functions The Henstock-Kurzweil method is readily adaptable to provide a theory of integration for vector-valued functions. However in case the range space is an infinite dimensional Banach space the corresponding theory differs from the real-valued case in some important points: A) the Henstock's lemma does not hold (see Skvortsov and Solodov A998-99), and CaoA992)); B) there exist integrable functions which are not measurable (for example / : [0, 1 ] -> /oo[0, 1] denned by f(t) = ?[0,?]); C) there exists integrable functions which primitives are not ACG* neither ACGs (same example). But if the range space is a vector space endowed with nuclearity, then the Henstock's lemma holds (see Nakanishi A994)); therefore in particular Henstock's lemma holds if the range space is one of the spaces S, S', V and V occurring in the Schwartz's theory of distributions. As mentioned in Section 2.4 a slight modification of Henstock-Kurzweil method produces the McShane integral which is equivalent (for real valued function) to the Lebesgue integral. We recall that if X is an infinite dimensional Banach space the Bochner integral and the Pettis integral are the most known generalizations of the Lebesgue integral. A function /: [a, b] -> X is said to be Bochner integrable on [a, b] if there exists a sequence of simple functions {s,,} such that lim„ fh \\f — s„ \\ = 0. where || · || denotes the norm in X. A function /: [a, b] -> X is said to be Pettis integrable on [a, b] if the scalar function x* о f is Lebesgue integrable on [a. b] for each x* e X*, the dual of X, and there exists we X such that fix* of =x*(w) for each x* e X*. The class of all McShane integrable functions is strictly contained between the class of all Bochner integrable functions (see Gordon A990)) and the class of all Pettis integrable functions (see Fremlin and Mendoza A994)). A Pettis integrable on [a, b] function / is McShane integrable in the following cases: A) /is strongly measurable (see Gordon A990)); B) / is Henstock-Kurzweil integrable on [a. b] (see Fremlin A994)).
The Henstock-Kurzweil integral 609 Therefore it follows that even in the Banach-valued case a function / is McShane integrable on [a, b] if and only if //? is Henstock integrable for every measurable Ec[a,b]. Nakanishi A994) showed that the McShane and the Bochner integrals are equivalent for functions with range in a vector space endowed with nuclearity. 6. The Henstock-Kurzweil integral on general spaces The Henstock-Kurzweil method was developed in a very general situation by Henstock in many papers, in particular in A991). The basic elements of the theory are a generic fixed set X and a generic fixed family ? of non-empty subsets of X, called intervals. Every finite disjoint union of intervals is called an elementary set. A partition of an elementary set ? is as usual any finite collection ? = {(A,·,*,-)} of disjoint intervals A, and points ?? e A, such that [Jj A, = E. In this general situation there is no meaning to define a gauge, then Henstock defines the "<5-fine partitions" in the following axiomatic way. A collection S of pairs (A,x) with A e ? and ? e A is called a dividing collection. It is said that S divides an elementary set ? whenever there exists an <S-partition of E. A family ? = {Sa: a e T] of dividing collections is fixed. This family is the vital component of the theory, differences between the families being reflected in the differences between the corresponding integrals. To assure the unicity and the additivity of the resulting integral it is assumed that ? satisfies the following conditions: A) for each elementary set ? there exists a e ? such that Sa divides E; B) if a and ? are elements of ? such that Sa and Sp divide a given elementary set E. then there exists ? e ? such that SY divides ? and SY С Sa ? Sp; C) if E\ and Ej are elementary sets and Sa divides E\ U E2, then there exists a \, ai e ? such that Sa] ={(A,x)eSa: А с E\] and ?„, = {(A.jc) e <S„: Ac E2): D) if?| and Ej are disjoint elementary sets, <Sff| divides ? \, Sai divides Ei and А с Е, for each (A, x) e Sa,, / = 1. 2. then there exists ? e ? such that 5y divides ?|U?i and <Sy С <Sa, U <Sa2. Let h(A,x) be a real-valued function defined on the family of all pairs (A. x) e «S„, for each or e Г. It is said that h is T-integrable on an elementary set ? with integral fEdh if for each ? > 0 there is a e ? such that y^h(Ai,Xi) - dh\<s Je I for each Sa -partition of E. Notice that even in this general setting monotone and dominated convergence theorems hold, and that the corresponding integral on product spaces satisfies Fubini-type and Tonelli-type theorems. It is interesting that this integral applies to the unification of Wiener and Feynmar integration in a single mathematical framework, and to the availability of new and powerful techniques for handling some long-standing mathematical difficulties in formulating a
610 В. Bongiomo definition and calculus of the Feynman integral (see Muldowney A987) and Chapter 6 ofHenstockA991)). References Alexiewicz, A. A948), Linear junctionals on Denjoy integrable functions. Coll. Math. 1. 289-293. Bartle. R.G. A994-95), A convergence theorem for generalized Riemann integrals. Real Anal. Exchange 20. 119-124. Bongiomo, B. A994), Relatively weakly compact sets in the Denjoy space, J. Math. Study 27 A). 37^44. Bongiomo, B. A996), Un nuovo integrate per it problema delle primitive. Le Matematiche 51B). 299-313. Bongiomo, B. and Di Piazza, L. A991-92). Convergence theorems jor generalized Riemann-Stieltjes integrals. Real Anal. Exchange 17, 339-361. Bongiomo, В., Di Piazza, L. and Pfeffer, W,F. A993-94), A full descriptive definition of the BV-integral. Comment. Math. Univ. Carolin. 36. 461^469. Bongiomo, В.. Di Piazza. L. and Preiss. D. B000), A constructive minimal integral which includes Lebesgue integrable functions and derivatives. J. London Math. Soc. 62 A), 117-126. Bongiomo, В.. Di Piazza, L. and Skvortsov, V.A. A993-94). Uniform generalized absolute continuity and some related problems in H-integrability, Real Anal. Exchange 19 A), 290-301. Bongiomo, В., Di Piazza, L. and Skvortsov, V.A. A995). The essential variation of a function and some convergence theorems. Anal. Math. 22. 3-12. Bongiomo, В.. Di Piazza, L. and Skvortsov. V.A. A995-96). A new full descriptive characterization of the Denjoy-Perron integral. Real Anal. Exchange 21 B). 656-663. Bongiomo, В., Di Piazza, L. and Skvortsov. V.A. B000). On variational measures related to some bases, J. Math. Anal. Appl.. to appear. Bongiomo, В., Pfeffer, W.F. and Thomson. B.S. A996). A full descriptive definition oj the gage integral, Canad. Math. Bull. 39D), 390-401. Bruckner, A.M., Fleissner, R.J. and Foran. J., A986). The minimal integral which includes Lebesgue integrable junctions and derivatives, Colloq. Math. 50 B). 289-293. Buczolich, Z. A987-88), Nearly upper semicontinuous gauge junctions in K'", Real Anal. Exchange 13 B). 436-440. Buczolich, Z. A991a), A general Riemann complete integral in the plane. Acta Math. Hungar. 57C^4). 315-323. Buczolich, Z. A991b), Henstock integrable junctions are Lebesgue integrable on a portion, Proc. Amer Math. Soc. 111A), 127-129. Buczolich, Z. A992-93), The g-integral is not rotation invariant. Real Anal. Exchange 18. 437^447. Buczolich, Z. A995), Characterization oj upper semicontinuoiisly integrable junctions,]. Austral. Math. Soc. 59. 244-254. Buczolich. Z. and Pfeffer. W.F. A997). On absolute continuity, Czechoslovak Math. J. 47. 525-555. Buczolich. Z. and Pfeffer. W.F. A998). On absolute continuity. J. Math. Anal. Appl. 222 A). 64-78. Bullen. PS. A983). The Burkill approximately continuous integral, J. Austral. Math. Soc. Ser. A 35. 236-253. Bullen. PS. and Vybomy, R. A991). Some applications of theorem of Marcinkiewicz. Canad. Math. Bull. 34 B), 165-174. Burkill, J.C. A931), The approximately continuous Perron integral. Math. Z. 34. 270-278. Burkill. J.C. A951), Integrals and trigonometric series, Proc. London Math. Soc. 1. 46-57. Cao. S.S. A992), The Henstock integral for Banach-valued functions. Southeast Asian Bull. Math. 16. 35^40. Carrington. D.C. and Pacquement. A. A972), Sur line extension duproce'ded'integration de MR. Henstock,C. R. Acad. Sc. Paris Ser. A 274. 1901-1904. Chelidze. V.G. and Djvarsheishvili. A.G. A978). Theon of the Denjoy Integral and Some of Its Applications, Tbilisi (in Russian). Chew, T.S. A980). On the generalized dominated convergence theorem. Bull. Austral. Math. Soc. 37. 165-171. Chew. T.S. A989-90), On the equivalence of Henstock-Kurzweil and restricted Denjoy integrals in K". Real Anal. Exchange 15. 259-268.
The Henstock-Kurzweil integral 611 Chew. T.S. and Lee. P.Y. A990). The topology of the space of Denjoy integrable functions. Bull. Austral. Math. Soc. 42C). 517-524. Darmawijaya. S. and Lee, P.Y. A988). The controlled convergence theorem for the approximately continuous integral of Burkill. Proc. of the Anal. Conf.. Singapore. 1986. North-Holland Math. Stud.. Vol. 150. North- Holland, Amsterdam. Davies. R.O. and Schuss, Z. A970). Aproof that Henstock's integral includes Lebesgue's, J. London Math. Soc. 2B). 561-562. Denjoy, A. A912a), Une extension de I'integrale de M. Lebesgue, C. R. Acad. Sci. Paris 154. 859-862. Denjoy, A. A912b). Calcul de la primitive de la fonction derivee la plus generate. С R. Acad. Sci. Paris 154, 1075-1078. Denjoy, A. A941^49), Legns sur le calcul des coefficients d'une serie trigonome'trique. Hermann, Paris. Di Piazza. L. A994). Weak convergence of Henstock integrable sequences. J. Math. Study 27 A), 148-153. Di Piazza. L. B001). Variational measures in the theory of the integration in K'". Czechoslovak Math. J.. to appear. Fleischer. I. A996-97). A Vitali-like convergence theorem for the Henstock integral. Real Anal. Exchange 22 A), 174-176. Fejzic. H.. Freiling. С and Rinne. D. A993-94). Two-dimensional partitions. Real Anal. Exchange 19 B). 540- 546. Freiling. C. Rinne. D. and Thomson. B.S. A997). A Riemann-type integral based on the second symmetric derivative. J. London Math. Soc. 56. 539-556. Fremlin. D.H. A994). The Henstock and McShane integrals of vector-valued functions. Illinois J. Math. 38 C). 471^479. Fremlin, D.H. A995). The generalized McShane integral. Illinois J. Math. 39. 39-67. Fremlin. D.H. and Mendoza. J. A994). On the integration of vector-valued functions. Illinois J. Math. 38 A). 127-147. Fu. S. A993-94). The S-Henstock integration and the approximately strong Lusin condition. Real Anal. Exchange 19A). 312-316. Gordon. R.A. A986-87). Equivalence of the generalized Riemann and restricted Denjoy integrals. Real Anal. Exchange 12 B). 551-574. Gordon. R.A. A989-90a). A descriptive characterization of the generalized Riemann integral. Real Anal. Exchange 15. 397^*00. Gordon. R.A. A989-90b). Another look at the convergence theorem for the Henstock integral. Real Anal. Exchange 15 B). 724-728. Gordon. R.A. A990). The McShane integral of Banach-valuedfunctions. Illinois J. Math. 34. 557-567. Gordon. R.A. A990-91a). The inversion of approximate and dyadic derivatives using an extension of the Henstock integral. Real Anal. Exchange 16 A). 154-168. Gordon, R.A. A990-91b). Another approach to the controlled convergence theorem. Real Anal. Exchange 16. 306-310. Gordon, R.A. A991), A general convergence theorem for nonabsolute integrals. J. London Math. Soc. 44. 301 — 309. Gordon. R.A. A991-92). Riemann tails and the Lebesgue and Henstock integrals. Real Anal. Exchange 17. 789- 795. Gordon, R.A. A992-93). On the equivalence of two convergence theorems for the Henstock integrals. Real Anal. Exchange 18A). 261-266. Gordon. R.A. A994). The Integrals of Lebesgue. Denjo\. Perron, and Henstock. Graduate Stud, in Math.. Vol. 4. Amer. Math. Soc. Providence. RI. Gordon. R.A. A994-95). Some comments on an approximately continuous Khintchine integral. Real Anal. Exchange 20 B). 831-841. Gordon, R.A. A995-96), An iterated limits theorem applied to the Henstock integrals. Real Anal. Exchange 21 B), 774-781. Gordon. R.A. A997-98). Some comments on the McShane and Henstock integrals. Real Anal. Exchange 23 A). 329-342. Henstock. R. A961). Definition of Riemann type of the variational integral. Proc. London Math. Soc. 11 C). 402^418.
612 В. Bongiorno Henstock, R. A963). Theory of Integration. Butterworths. London. Henstock, R. A968), A Riemann-tspe integral ofLebesgue power, Canad. J. Math. 20. 79-87. Henstock. R. A968). Linear Analysis, Vol. 16. Butterworths. London. Henstock. R. A969). Generalized integrals of vector-valued functions. Proc. London Math. Soc. 19 C). 509-536. Henstock, R. A976). Integration in product spaces, including Wiener and Feynman integration. Proc. London Math. Soc. 27. 317-344. Henstock, R. A988). Lectures on the Theory of Integration. World Scientific. Singapore. Henstock, R. A991). The general theory of integration. Oxford Mathematical Monographs. Clarendon Press. Oxford. James. R.D. A950), A generalized integral (II). Canad. J. Math. 2. 297-306. Jarnik, J. and Kurzweil. J. A984). A nonabsolutelv convergent integral which admits С' -transformations, Casopis Pest. Mat. 109B). 157-167. Jarnik, J. and Kurzweil. J. A985). A nonabsolutelv convergent integral which admits transformation and can be used for integration on manifolds, Czechoslovak Math. J. 35. 116-139. Jarnik, J. and Kurzweil. J. A988). A new and more powerful concept of the PU-integral. Czechoslovak Math. J. 38. 8-48. Jarnik, J. and Kurzweil. J. A995). Perron type integration on ?-dimensional intervals and its properties. Czechoslovak Math, J. 45. 79-106. Jarnik. J. and Kurzweil. J. A997), Another Perron type integration in ? dimensions as an extension of integration of stepfunctions, Czechoslovak Math. J. 47 C). 557-575. Jarnik. J.. Kurzweil, J. and Schwabik, S. A983). On Mawhin's approach to multiple nonabsolutely convergent integrals, Casopis Pest. Mat. 108. 356-380. Jurkat. W.B. A993). The divergence theorem and Perron integration with exceptional sets. Czechoslovak Math. J. 43. 27-45. Jurkat. W.B. and Nonnenmacher. D.J.F. A994a). An axiomatic theory of non-absolutely convergent integrals in R". Fund. Math. 145 C). 221-242. Jurkat. W.B. and Nonnenmacher. D.J.F. A994b). A generalized ?-dimensional Riemann integral and the divergence theorem with singularities. Acta Sci. Math. 59. 243-258. Jurkat. W.B. and Nonnenmacher. D.J.F. A994c). A theory of non-absolutely convergent integrals in K" with singularities on a regular boundary. Fund. Math. 146 A). 69-84. Jurkat. W.B. and Nonnenmacher. D.J.F A995). The fundamental theorem for the v^-integral on more general sets and a corresponding divergence theorem with singularities. Czechoslovak Math. J. 45 A). 69-77. Jurkat. W.B. and Nonnenmacher. D.J.F. A995-96). A constructive definition of the ?-dimensional v(S)-integral in terms of Riemann sums. Real Anal. Exchange 21 A). 216-235. Kahane. J.-R A988). Une theory de Denjoy des martingales dyadiques. Enseign. Math. B) 34 C—4). 255-268. Khintchine. A. A927). Recherches sur la structure des-functions mesurables. Fund. Math. 9. 212-279. Kubota. J. A980). An elementary theory of the special Denjoy integral. Math. Japon. 24 E). 507-520. Kurzweil. J. A957), Generalized ordinary differential equations and continuous dependence on a parameter. Czechoslovak Math. J. 7. 418—446. Kurzweil. J. A973). On Fubini theorem for generalized Perron integral. Czechoslovak Math. J. 23 (98). 286-297. Kurzweil. J. A978). The Perron-Ward integral and related concepts. Appendix A in: Jacobs. K.. Measure and Integral, Academic Press. New York. Kurzweil. J. A980). Nichtalsolut konvergente Integrate, Teubner. Leipzig. Kurzweil, J. A998). The set of primitives of the Henstock-Kurzweil integrable functions on a convergence vector space, Tatra Mt. Math. Publ. 14. 29-37. Kurzweil, J. and Jamfk. J. A987). On some extensions of the Perron integral on one-dimensional intervals. An approach by integral sums fulfilling a svmmetrv condition, Funct. Approx. Comment. Math. 17. 49-55. Kurzweil, J. and Jarnik. J. A990a), On a generalization of the Perron integral on one-dimensional inteivals, Ann. Polon. Math. 51. 205-218. Kurzweil. J. and Jarnik. J. A990b). The PU-integral: its definition and some basic properties. New Integrals. Lecture Notes in Math., Vol. 1419. Springer-Verlag. Berlin. Kurzweil. J. and Jarnik. J. A991). Equi-integrabilitv and controlled convergence of Perron-type integrable functions. Real Anal. Exchange 17 A). 110-139.
The Henstock-Kurzweil integral 613 Kurzweil. J. and Jarnik. J. A992a). Differentiability and integrability in ? dimensions with respect to a-regular intervals. Results Math. 21 A-2). 138-151. Kurzweil. J. and Jarnik. J. A992b). Equivalent definitions of regular generalized Perron integral. Czechoslovak Math. J. 42 B). 365-378. Kurzweil. J. and Jarnik, J. A993), Generalized multidimensional Perron integral involving a new regularity condition. Results Math. 23 C^4). 363-373. Kurzweil. J.. Mawhin. J. and Pfeffer. W.F. A991). An integral defined by approximating BV partitions of unity, Czechoslovak Math. J. 41. 695-712. Kubota. Y.. A direct proof that the RC-integral is equivalent to the D*-integral, Proc. Amer. Math. Soc. 80. 293-296. Leader. S. A992-93). Basic convergence principles for the Knrzweil-Henstock integral. Real Anal. Exchange 18 A). 95-114. Leader. S. A993-94). Uniform Kurzweil-Henstock integrability. Real Anal. Exchange 19 A). 173-193. Lebesgue. H. A902). Integrate, longueur, aire, Ann. Mat. Рига Appl. 7. 231-359. Lee. P.Y. A989). Lanzhou Lectures on Henstock Integration, Series in Real Analysis. Vol. 2. World Scientific. Singapore. Lee. P.Y. A989-90). On ACG* functions. Real Anal. Exchange 15. 754-759. Lee. P.Y. A990). Generalized convergence theorems for Denjov-Perron integrals. New Integrals. Lecture Notes in Math.. Vol. 1419. Springer-Verlag. Berlin. Lee. P.Y and Chew. T.S. A985). A better convergence theorem for Henstock integrals. Bull. London Math. Soc. 17. 557-564. Lee. P.Y. and Chew. T.S. A987). Л short proof of the controlled convergence theorem for Henstock integrals. Bull. London Math. Soc. 19. 60-62. Lee. P.Y. and Ng. W.L. A996-97). The Radon-Nikodvm theorem for the Henstock integral in Euclidean spaces. Real Anal. Exchange 22. 677-687. Lee. P.Y. and Vybomy. R. A993). Kurzweil-Henstock integration and the strong Lusin condition, Boll. Univ. Mat. Ital. 7D). 761-773. Lee. P.Y. and Wittaya Naak-in A982). A direct proof that Henstock and Denjoy integrals are equivalent. Bull. Malaysian Math. Soc. B) 5, 43^47. Lee. T.Y. Chew. T.S. and Lee. P.Y. A996-97). On Henstock integrability in Euclidean spaces. Real Anal. Exchange 22A). 382-389. Liao. K.C. A987). A refinement of the controlled convergence theorem for Henstock integrals, SEA Bull. Math. 11.49-51. Liao. K.C. A989). The weak controlled convergence theorem for Henstock integrals, SEA Bull. Math. 13 A). 15-18. Liao. K.C. A992-93). On the descriptive definition of the Burkill approximately continuous integral. Real Anal. Exchange 18. 253-260. Liao. K.C. A993-94). Two types of absolute continuity used in Henstock and A P integration are equivalent. Real Anal. Exchange 19A). 212-217. Liao. K.C. and Chew. T.S. A993-94), The descriptive definitions and properties of the AP integral and their application to the problem of controlled convergence. Real Anal. Exchange 19 A). 81-97. Liu. G.Q. A987-88). The measurability of ? in Henstock integration. Real Anal. Exchange 13 B). 446-450. Liu. G.Q. A992-93). On necessary conditions for Henstock integrabilin. Real Anal. Exchange B). 522-531. Lu. S.R A996-97). Notes on the approximately continuous Henstock integrals. Real Anal. Exchange 22 A). 1-5. Lu. S.P and Lee. P.Y. A990-91). Globally small Riemann sums and the Henstock integral. Real Anal. Exchange 16 B). 537-545. Lu, J. and Lee. P.Y. A999). The primitives of Henstock integrable functions in Euclidean space. Bull. London Math. Soc. 31. 173-180. Lusin. N.N. A912). Sur lesproprietes de I'integral de M. Denjoy, C. R. Acad. Sci. Paris 155. 1475-1478. Marcinkiewicz. J. and Zygmund, A. A936). On the differentiability of functions and the summability of trigonometrical series. Fund. Math. 26. 1^4.4. Mawhin. J. A981a). Generalized Riemann integrals and the divergence theorem for differentiable vector fields, E.B. Christoffel. Birkhauser, Basel. 704-714.
614 В. Bongiomo Mawhin, J. A981b), Generalized multiple Perron integrals and the Green-Goursat theorem for differentiable vector fields, Czechoslovak Math. J. 31, 614-632. McLeod, R.M. A980), The Generalized Riemann Integral. Carus Mathematical Monograph. Vol. 20. Mathematical Association of America. McShane, E.J. A969), A Riemann-Tvpe Integral that Includes Lebesgue-Stieltjes, Bochner and Stochastic Integrals. Amer. Math. Soc. Memoirs, No. 88. Amer. Math. Soc.. Providence. RI. McShane, E.J. A973), A unified theory of integration. Amer. Math. Monthly 80. .349-359. McShane, E.J. A983), Unified Integration. Academic Press. New York. Muldowney, P. A987). A General Theorv of Integration in Function Spaces. Pitman Res. Notes in Math. Series. Vol. 153, Longman Scientific and Technical, Harlow. Nakanishi, S. A994), The Henstock integral for functions with values in nuclear spaces. Math. Japon. 39 B). 309-335. Nonnenmacher. D.J.F. A994). A descriptive, additive modification of Mawhin's integral and the divergence theorem with singularities. Ann. Polonici Math. 59 A). 85-98. Nonnenmacher, D.J.F. A995). Sets of finite perimeter and the Gauss-Green theorem with singularities. J. London Math. Soc. 52 B), 335-344. Nonnenmacher, D.J.F. A995-96), A constructive definition of the ?-dimensional v(S)-integral in terms of Riemann sums. Real Anal. Exchange 21 A). 216-235. Ostaszewski, K.M. A986), Henstock Integration in the Plane. Mem. Amer. Math. Soc., No. 35.4, Amer. Math. Soc. Providence, RI. Perron, O. A914), tlber den Integralbegriff. Sitzber. Heidelberg Akad. Wi.ss.. Math. Naturw. Klasse Abt. A 16. 1-16. Pfeffer, W.F. A986), The divergence theorem. Trans. Amer Math. Soc. 295 B). 665-685. Pfeffer. W.F. A987), The multidimensional fundamental theorem of calculus. J. Australian Math. Soc. 43. 143- 170. Pfeffer, W.F. A988). A note on the generalized Riemann integral. Proc. Amer. Math. Soc. 103 D), 1161-1166. Pfeffer. W.F. A991a), A descriptive definition of a variational integral and applications. Indiana Univ. Math. J. 40, 259-270. Pfeffer, W.F. A991b), The Gauss-Green theorem. Adv. Math. 87 A). 93-147. Pfeffer, W.F. A992), A Riemann type definition of a variational integral. Proc. Amer. Math. Soc. 114. 99-106. Pfeffer, W.F. A993), The Riemann Approach to Integration. Cambridge University Press, Cambridge. Preiss, D. and Thomson, B.S. A989), The approximate symmetric integral. Canad. J. Math. 41 C). 508-555. Pujie, W. A993-94), Equi-integrabilin and controlled convergence for the Henstock integral. Real Anal. Exchange 19 A), 236-241. Saks, S. A964), Theory of the Integral. Dover, New York. Schurle. A.W. A986a). A function is Perron integrable if it has locally small Riemann sums, J. Austral. Math. Soc. A 36, 220-232. Schurle, A.W. A986b), A new property equivalent toLebesgue integrability. Proc. Amer. Math. Soc. 96. 103-106. Skvortsov, V.A. A969). A certain generalization of Perron's integral. Vestnik Moskov. Univ. Ser. 1 Mat. Mekh. (English translation: Moscow Univ. Math. Bull. 24. 48-51). Skvortsov, V.A. A988). Some properties ofdvadic primitives. New Integrals. Lectures Notes in Math.. Vol. 1419. Spnnger-Verlag, Berlin. 167-179. Skvortsov, V.A. A992-93). On some questions of R. Gordon related to approximate and dyadic Henstock integrals. Real Anal. Exchange 18 A). 267-269. Skvortsov, V.A. A997). A variational measure and sufficient conditions for the differentiability of an additive interval function. Vestnik Moskov. Univ. Ser. 1 Mat. Mekh. B). 55-57 (English translation: Moscow Univ. Math. Bull. 52 B), 38-39). Skvortsov, V.A. A998), Variation and variational measures in integration theory and some applications. J. Math. Sci. (New York) 91 E). 3293-3322. Skvortsov, V.A. and Solodov. A.P A998-99). A variational integral for Banctch-valued functions. Real Anal. Exchange 24 B). 799-806. Skvortsov. V.A. and Thomson. B.S. A995-96). Symmetric integrals do not have the Marcinkiewicz property. Real Anal. Exchange 21 B), 510-520.
The Henstock-Kurzweil integral 615 Thomson. B.S. A971). Covering systems and derivatives in Henstock division spaces. J. London Math. Soc. B) 4. 103-108. Thomson, B.S. A985). Real Functions. Lecture Notes in Mathematics. Vol. 1170. Springer-Verlag. Berlin. Thomson. B.S. A994). Symmetric Properties of Real Functions. Monographs and Textbook in Pure and Appl. Math.. Vol. 183. Marcel Dekker. New York. Thomson, B.S. A998-99). Some properties of variational measures. Real Anal. Exchange 24 B). 845-854. Wang. P.J. A993-94). Equi-integrability and controlled convergence for the Henstock integral. Real Anal. Exchange 19 A). 236-241.
CHAPTER 14 Set-Valued Integration and Set-Valued Probability Theory: An Overview Christian Hess Universite Paris Dauphine, Viabilite, Jeux, Contrdle, 75775 Paris Cedex, France E-mail: hess@viab.ufniid.daupluiie.fr Contents 1. Introduction 619 2. Notations and preliminaries 620 3. Integration of strongly measurable multifunctions 622 4. Weakly measurable multifunctions and graph-measurable multifunctions 629 5. The Aumann integral 634 6. The set-valued conditional expectation of closed valued multifunctions 639 7. Set-valued measures 643 8. The probability distribution of a measurable multifunction 647 9. Set-valued strong laws of large numbers 652 9.1. Convergence in the Hausdorff metric topology 652 9.2. Convergence in the sense of Painleve-Kuratowski 654 10. Set-valued martingales 658 11. Gaussian multifunctions. The set-valued Central Limit Theorem 660 12. Set-valued versions of the Fatou Lemma 662 13. Epigraphical convergence 663 14. Concluding remarks 665 References 666 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 617
This page intentionally left blank
Set-valued integration and .set-valued probability theory: An overi'iew 619 1. Introduction The mathematical modelization based on multifunctions (alias set-valued maps, correspondences, etc.) has shown its great adaptability and relevance for a long time. It allows one to take into account the multiplicity of possible choices, the lack of information and/or the uncertainty in a lot of situations ranging from Optimal Control to Economic Theory. For example, this can be seen in the monographs by Hildenbrand [118], and Aubin and Frankowska [16]. In particular, measurable multifunctions (alias set-valued random variables, random sets, etc.) are of interest in Probability Theory and in Statistics, not only from the theoretical point of view, but also for applications. Here, our aim is to present the main results from the theory of measurable multifunctions. We will look more closely at integration, conditional expectation and convergence theorems such as strong laws of large numbers and martingales convergence theorems. The notion of random set is intuitive and can be considered from different points of view. It is possible to go back as far as the classical Buffon's needle problem (see, e.g., [125]) that implicitly involves this notion. In the twentieth century, the first papers dealing with random sets seem to be those of Robbins [172,173] in the middle forties. Robbins proved a celebrated formula related to the measure of a random set in an Euclidean space. Later, Kudo [135] and Richter [171], motivated by Mathematical Statistics, examined measurability and integration problems. The literature of the sixties and the early seventies was more abundant. At this time, a great deal of works were inspired by problems arising from Control Theory and Mathematical Economics. We can cite the papers by Aumann [17], Debreu [65], Castaing [33], Castaing and Valadier [40], Van Cutsem [203- 205] and Valadier [199,200]. During the seventies and the next decades, the number of works on the field increased rapidly. Let us mention, for example, the monograph by Castaing and Valadier [41], and the paper by Hia'i and Umegaki [117]; both were published in the late seventies and were highly influential. A lot of further references will be given in the sequel, but one can also cite general monographs on multifunctions (not especially devoted to integration theory), for example that by Klein and Thompson [130] in the eighties, and that by Aubin and Frankowska [16] in the early nineties. This paper is organized as follows. In Section 2, we collect some elementary facts and introduce our notations. Section 3 deals with the integration of strongly measurable multifunctions and, more particularly, with those that can be approximated by simple measurable multifunctions in the sense of the Hausdorff distance. Furthermore, these multifunctions are assumed to be integrably bounded, whence almost surely bounded valued. In Section 4, we present weakly measurable multifunctions in connection with graph-measurability, and we recall two versions of the Measurable Selection Theorem. Section 5 is devoted to the Aumann integral, whose construction is based on integrable selections and which has become the most popular set-valued integral. The Hia'i-Umegaki set-valued conditional expectation is examined in Section 6. Set-valued measures (alias multimeasures) are considered in Section 7; as we shall see, three distinct definitions can be given. A few basic results on the probability distribution of measurable multifunctions are presented in Section 8. Section 9 is devoted to set-valued strong laws of large numbers; there, two convergence concepts are considered on the space of subsets, namely the Hausdorff metric convergence and the Painleve-Kuratowski convergence. In
620 С. Hess Section 10, we introduce the definition and the main properties of set-valued martingales. In Section 11, we recall some basic facts about Gaussian multifunctions and the set- valued Central Limit Theorem. Section 12 is devoted to set-valued versions of Fatou's Lemma. Last but not least, we give a short introduction to epigraphical convergence, which derives from set convergence and has appeared to be an interesting functional convergence. There, we shall give applications in a stochastic context. Finally, a few concluding remarks indicate other topics of interest in the field. They are followed by a substantial reference list which shows how wealthy this domain is. 2. Notations and preliminaries We consider a separable Banach space X. whose norm is denoted by || · || and Borel ?-field by B(X). We define the following families of subsets of X • 2X: the family of all subsets of X. • Q: the family of open subsets of X, • B{X): the Borel ?-field (or Borel ?-algebra) of X, generated by Q, • C(X): the set of all closed subsets of X, • CC(X): the set of all closed convex subsets of X. • Сь(Х): the set of all closed bounded subsets of X. • /C(X): the family of all nonempty compact subsets of X • /CC(X)'· the family of nonempty compact convex subsets of X, • /CW(X): the family of all nonempty weakly compact subsets of X. The meaning of Ct>c(X) and ?*-с(Х) is clear. On 2X we consider the Minkowski addition, denoted by "+" and the scalar multiplication, respectively defined by C + C' = {x+x': ? eC. v'eC'}. aC = {ax: ? eC], where С, С е 2X and a e R. These operations satisfy the following properties aC + aC'=a(C + C), (a + /3)C с cC + /3C. B.1) Easy examples show that the inclusion in B.1) is strict when С is not convex. When С is convex, inclusion B.1) becomes an equality, namely (a+fi)C = aC + pC. ?, ? e R. Given С е 2х, the distance function d(-,C) and the support function s(-,C) of С are respectively defined by d(x,C) = mf{\\x-y\\: veC}. ? e X. i(;y,C) = sup{ (у.*): лес}, ? e X*. where (y,x) stands for the duality pairing and X* for the dual space of X. We denote by clC (respectively, coC. coC) the closure (respectively, the convex hull, the closed
Set-valued integration and set-valued probability theory: An oveniew 621 convex hull) of C. Recall that the distance function (respectively, the support function) characterizes a closed set (respectively, a closed convex set). Indeed, for every closed set С (respectively, every closed convex set C), we have C={x eX: d(x,C) = 0}, and C= f]{xeX: (y.x)^siy.C)}. B.2) vefl* where B* denote the closed unit ball of X*. Equality B.2) is a consequence of the Hahn- Banach Theorem. The support function satisfies the following simple properties that are stated for easy references. s(yX) = s(y,coC), Ce2x. s(y, C + C')= s(y, C) + s(y, C'). ? e X, С С е 2х \ {0}. It is positively homogeneous and subadditive. s(ay,C) = as(y,C), a > 0. ? e X, С е 2х. B.3) s{y + y',C)iis(y.C) + s{y'.C), y.y'eX. Ce2x. B.4) In particular, taking у' = —у in B.4) we obtain s(y,C) + s(-y,C)^s@.C) = 0, Ce2x\{0). B.5) For every pair (С, С) е Съ, we also define the Hausdorff distance between С and С by /j(C, C')=inf{a >0: CCC'+afiandC'cC + fffi), where ? denotes the closed unit ball of X. Equivalently, if the excess of С over C' is defined by e(C, C')=sup{d(x,C): .teCJ, we have AfC,C')=max{«(C,C').e(C',C)). These formulas also makes sense when С and C' are closed sets, possibly unbounded, but in this case, h is only a pseudo-distance in that it can take on the value +oo. The
622 С. Hess topology generated by h is called the Hausdorff metric topology and is denoted by гн. For every С е 2х \ {0} we set ||C|| = sup{||jc||: ? e C), which is the Hausdorff distance between {0} and С Some other properties of the Hausdorff distance are listed below for easy references. The proof can be found in [65, p. 362]. PROPOSITION 2.1. For every С, С', К, К' e Съ(Х) and a > 0 the following properties hold (a) h(c\(C + C), c\(K + K')) <; h(C, K) + h(C, K'); (b) h(aC,aK)=ah(C,K); (c) h(coC,coK)<:h(C,K). In particular, the maps (C. K) -^ cl(C + K), (a, C) -> <*C and С -> соС are continuous in the Hausdorff metric topology on Сь(Х). It can be shown that the space С endowed with the pseudo-distance h is complete, and that Сь, Cc, & are closed subspacesofC (see, e.g., [41. Chapter II]). Using a lemma due to Grothendieck (see, e.g., Lemma V.6 in [41]), the same property can be proved for /Cw. Finally, we recall the Hormander formula (see, e.g.. Theorem 11.18 of [41]) which expresses the Hausdorff distance of bounded convex sets in terms of support functions. For every C,C e Ct>c one has h(C,C')= sup \s(y, C)-s(y,C')|. B.6) vefl* The above equality admits several variants or special cases that will be useful. Because of B.3), the supremum can also be restricted to S={yeX*: \\y\\ = l]. the unit sphere of X*. In the special case where C' = {0}, it is readily seen by using B.5) and B.6) that ||C|| =sups(yX). yeB 3. Integration of strongly measurable multifunctions Let (?, ?, ?) be a probability1 space. A multifunction is a map, defined on ?, whose values are subsets of some given set. In this section, we shall restrict our attention to the space Съ(Х) (or Сь) of closed bounded subsets of X, endowed with the topology гн 'We have choose this setting forsake of simplicity. In fact, a lot of results of this Handbook remain valid when ? is a finite or ? -finite positive measure.
Set-valued integration and set-valued probability theory: An oven'iew 623 generated by the Hausdorff distance h. Further, we consider the Borel ?-field В(Съ(Х), гц) generated by the гн-ореп subsets of Съ(Х). A multifunction F-.? -> Сь(Х) is said to be stmngly ?-measurable (or simply, strongly measurable) if, for every member W of В(Съ, тц), one has F~l(W) e A. A more precise criterion is given in Proposition 3.3 below for the case where F takes on its values in a гн-separable subspace of Сь(Х). Multifunctions that enjoy some measurability property are also called "random sets". The proofs of the results of this section can be found in [110]. We begin by defining the set-valued integral or expectation of a simple multifunction, i.e., of a multifunction F assuming only a finite number of values, More precisely, let {? ?,,.,, Ak} be an Д-measurable partition of ? and let F: ? -> F (?) be the measurable multifunction taking on the value К, е Съ for any ? e A, (/' = 1,..,,k). This can be written к F = J24Ki· i = \ F is called a simple multifunction. Clearly, F is strongly measurable. In this equality, 1д, denotes the indicator function of A/, namely ? 1, if we Aj, In this case, the expectation (or integral) of F is the member of Съ defined by E(F) = cl ?>(Ai)*,-|, C.1) where the sum refer to the Minkowski addition. When X is finite dimensional, the closure operation is not necessary. The following lemma which is valid for a general metric space allows for identifying the class of strongly measurable multifunctions that can be approximated by a sequence of strongly measurable simple multifunctions. LEMMA 3.1. // F-.? -> С is a closed valued multifunction, then the following two statements (a) and (b) are equivalent. (a) F is the pointwise limit on ? of a sequence of strongly measurable simple multifunctions. (b) F(i2) is a тц-separable subspace of С and, for every К e ?(?), the map ? -> h(K, F(<w)) is measurable. Recall that /C and /Cc are гн-separable subspaces of Сь- However, when X is infinite dimensional, neither Сь nor Сьс is гн-separable. For example, consider the space X = I of sequences (?„)„^ ? such that ??>\ ??" I converges and, for any subset / of N, the set of positive integers, the subset С ? =co{e„: ? e /}.
624 С. Hess Recall that e„ denotes the sequence whose all terms are zero excepted the /zth equal to 1. It is readily seen that if / and J are two distinct subsets of N, one has h (C/, Cj) > 1 · Since the family is uncountable, the nonseparability of Сы. follows. In particular, there exists strongly measurable multifunctions that cannot be approximated by a sequence of strongly measurable simple multifunctions. Given a subspace С of Сы it is useful to introduce the following three classes (A), (B) and (C) of multifunctions: (A) ?' (С, Л) — the class of strongly Д-measurable multifunctions with values in С such that E||F|| < +oo, where "E" stands for the expectation. Such multifunctions are said to be integrably bounded; (B) S(C, A) = the subclass of ?' (С. A) whose members are strongly Д-measurable simple multifunctions; (C) С1(С, A) = the subclass of ?' (С. Л) of those F that can be approximated by simple multifunctions, i.e., such that one can find a sequence (F„)n^\ in S(C, Л) verifying lim h(F(cu),Fn(cu))=0 a.s. Further, given F,G e ?' (С. Л) we set /}(F,G) = E/?(F,G). Similarly to the case of random vectors, it is easy to show that ? is a metric on С1(Съ,Л), modulo the ?-almost sure equality. In other words, ? satisfies the following three properties, for every F, G, ? in ?' (Съ- Л) (i) /i(F, G) = 0 if and only if F(w) = G(w) a.s. (ii) /i(F,G) = /i(G.F); (iii) /i(F, G) ^ /i(F, ?) + ?(?. G). In addition, if we denote by F' and G' two other multifunctions in С1(Съ-А), the following three properties can be easily deduced from Proposition 2.1. (iv) /i(cl(F + F'), cl(G + G')) < /i(F. G) + /i(F', G'); (v) 4(aF,aG)=a4(F,G),a>0; (vi) /i(coF,coG)^/i(F, G). Further, it can be proven that С1 (Съ- Л) endowed with ? is complete. More precisely, if (Fn)n~z\ is a Cauchy sequence in (С\Съ-А). ?), i.e., if it satisfies lim /}(F,„,F„) = 0. /n.n—>зо
Set-valued integration and set-valued probability theory: An overview 625 then there exists a strongly measurable multifunction F such that lim A(F,Fn)=0. n—>oo Moreover, F is unique up to a modification on a null set. and there exists a subsequence (Fn(k))k>\ of (F„) such that lim h(F, F„(jt)) =0 a.s. The following lemma, inspired from Lemma V.2.4 of [157], shows that for every гн-separable subspace С of Съ, every member of C](C ,A) can be approximated by a sequence of strongly Д-measurable simple multifunctions. The limit exists almost surely (in fact for every ? e ?) in the sense of the quasi-metric ?. LEMMA 3.2. Let C' be a ??-separable subset ofCb(X). Then, for every F e ?' (C, A), there exists at least a sequence (F„) of simple multifunctions in С (С, A), satisfying the following three properties (a) lim^oc h(F(w), F„(w)) = 0,Vwei2: (b) ||F„(w)|| <; ||F(w)||, 4?&?: (c) \imn^ooA(F,F„) = 0: (d) Moreover, when the values of F are convex (respectively, compact, weakly compact), the F„ can be chosen so as to be convex-valued (respectively, compact- valued, weakly compact-valued). It is interesting to mention the following characterization of С)}(Съ, A) in the space Cl(Cb,A). PROPOSITION 3.3. For every F e С](Съ-А), the following two statements are equivalent: (i) F is a member of C)} (Съ, A); (ii) There exists a x\\-separable subspace C' ofCb such that F(a>) e С a.s. Now, let us explain how to construct the integral of a strongly measurable multifunction F :? -> С', where С is a rH-separable subspace of Cb- For the sake of simplicity, we limit ourselves to the case of a convex valued multifunction, i.e., when С с Ссъ- We also assume that С is гн-closed, and stable under the Minkowski addition and multiplication by positive scalar. This is not restrictive, because the closed convex cone generated by a ??-separable subset of Сьс is still гн-separable. The difficulties encountered in the case of a multifunction whose values are not convex are briefly discussed in Remark 3.8. So, given a multifunction F: ? -» С', we define the map ? : S(C, A) -> С by 0(F) =E(F), C.2) where E(F) is defined by C.1). The following result displays the main properties of the map ? on S(C, A).
626 С. Hess PROPOSITION 3.4. Let F and G be two strongly measurable simple multifunction with values in C, i.e., members ofS(C, A), and let a ^ 0. Then, the map ? defined by C.2) satisfies the following properties (a) 0(F + G) = c\[0(F) + 0(G)}, (b) <P(aF) = a<P(F), (c) h@(F), 0(G)) s: Eh(F, G), and, in particular, |?(F)|^?(||F||), (d) for every у e X, one has i(y,E(F))=ET(v.F). (e) if F с G a.s., then E(F) с E(G) (monotonicity property). The next result shows that the integral (or expectation) can be defined for every F e ?,',(?', A), and that the properties listed in Proposition 3.4 still hold in this more general framework. THEOREM 3.5. The map ? : S(C, A) -> С defined by C.2) can be extended to a map ? from ?' (C\ A) into C. It will be also denoted by ${F) = E(F), for F e ?' (C, A). This map still enjoys properties (a) to (e) of Proposition 3.4. More precisely we have (a) $(F + G) = c\[$(F) + $(G)}, (b) $(aF)=a$(F), (c) h($(F), $(G)) < E/?(F, G), and, w particular, |E(F)|^F||F||. Moreover, the map ? still enjoys properties (d) and (e) of Proposition 3.4. Remark 3.6. The above approach for defining the set-valued integral is explicit, in that it starts with simple multifunctions and uses only elementary operations such as the Minkowski addition and the scalar multiplication. It is also intrinsic in that it does not involve any auxiliary space. In spite of these nice features, it is interesting to sketch briefly an alternative approach for constructing the set-valued conditional expectation for a compact convex multifunction. This approach was adopted by Debreu [65]. Although of a more implicit and abstract nature, it is shorter. Indeed, it is based on the possibility of embedding isometrically the metric space (Сьо Ю. endowed with the Minkowski addition and the scalar multiplication, in a Banach space. More precisely, let C(B*) denote the set of real-valued continuous functions defined on ?*. Further, it is assumed that C(B*) is endowed with the uniform norm || · ||u, i.e., IN|u = sup{|u;(;y)|: yefl*), weC(B*).
Set-valued integration and set-valued probability theory: An oven-iew 627 As already mentioned, we assume that Сы,- is endowed with the Hausdorff distance h. Then, the map ? : Сы: -> C(B*) defined by ^@ = ?(·,0, СеСьс. is an isometry from (Cbc, h) onto (<^(Сьс)· II ¦ llu)· This is an immediate consequence of the Hormander formula. Moreover, ?@^) is a convex cone, and ? is an isomorphism with respect to the convex cone structure. In other words, it satisfies *(с(С) = а*(С), С.С еСьс- а>0. ? is called the Radstrom embedding. In addition, we have the implication Ccc4W)^(C). C.C'eCbc· If ? is restricted to a гн-separable subspace С of Cbc then ?@'), as well as c\(V(C) - ?(?')), the Banach subspace of C(B*) generated by V(C), are separable. Thus, it is possible to define the integral of a random vector /: ? -> C(Z?*). This is similar to the construction of the Bochner integral for random vectors. Let us recall that this integral is appropriate when one has to deal with functions taking on their values in an infinite dimensions Banach space (see, e.g., [68, p. 44]). Then, given an integrably bounded multifunction F :? -> С', the integral of F. can be defined by simply setting E(F) = «^-'(E(^(F))). C.3) In Debreu's approach, the properties of the set-valued integral are inherited from those of the integral of C(Z?*)-valued random vectors and from the properties of the map ?. Thus, our approach is a little more general than that of Debreu, because Equation C.3) is not required to define E(F), but is only a consequence of the definition. Remark 3.7. When the dimension of X is greater than one, the convex cone ?(€^) is infinite dimensional. Indeed, it contains an infinite sequence of linearly independent vectors. For example, assume that X = R2 and consider the sequence (C„ )„ ^ ? of compact convex subsets defined by C„ = the polygon whose vertices are the points of coordinates (cos2kn/n. пп2кл/n), where к = 0,1,... ,n. It is easily checked that, for every ? > 1, the support functions s(-, Ci), ·. -,s(-, C„), regarded as vectors in C(B*). are linearly independent. Remark 3.8. Let us say some words about a possible definition of the integral of a multifunction F: ? -> Съ(Х), whose values are not necessarily convex. For this purpose,
628 С. Hess we recall that a member A of Л is called an atom of (? .?. ?) if ? (A) > 0 and if for every В e A such that В с A, one has P(B) = 0 or P(A \ B) = 0. If ? ? and A2 are two distinct atoms, then P(A\ П A3) = 0. It follows that the collection of atoms of a probability space is finite or countable. Consequently, given a probability space (?, ?, ?), the set ? can be split into the purely atomic part i?pa and the nonatomic part i?na (see, e.g., [118, p. 45]). The purely atomic part ??? expresses as a finite or countable union of atoms A*, namely If the multifunction F: ??? -> Сь(Х) is defined by where Q e Сь (^ ^ 1), and if F is integrably bounded, i.e., ||F||d/i = J]/i(A*)||C*|| < +oo, then it is natural to define the integral of F over i2pa by ЕA^) = ]ГМ(А,)Сч-. C.4) where the series converges in the Hausdorff metric. It was shown in [110] that, for any F e ?J,(Cb, A), the integral can be defined by E(F) = d{E(lnp.F)+E(lnoacoFdM)}, where E(l^paF) is defined by C.4) and E(l^nacoF) is defined as in Theorem 3.5. When X is finite dimensional, the closure operation is no longer necessary. Remark 3.9. (i) Using the approach displayed in this section, several other properties of the set-valued integral could be proved. For example, it would not be difficult to prove set-valued analogs of the Dominated Convergence Theorem. On the other hand, the same construction also allows for defining the conditional expectation of strongly measurable multifunctions (see [110]). However, a other more general approach will be given in Section 6. (ii) Other approaches based on the Riemann integral were adopted by Hukuhara [122, 123], by McShane [149], and by Artstein and Burnes [8]. In Section 5, we shall present the Aumann integral, whose definition involves selections and which even makes sense for non integrably bounded multifunctions. /,
Set-valued integration and set-valued probability theory: An overview 629 4. Weakly measurable multifunctions and graph-measurable multifunctions This measurability concept is the most largely used, because it is not too strong, whence easy to check. It allows for a lot of stability properties and for general results of existence of measurable selections, especially in connection with the graph measurability that we shall also present. On 2X, we consider the Effros ?-field ? generated by the subsets U~ defined by U~ = {Ce2x: СГШ^0}, where U ranges over Q, the set of open subsets of X. Clearly, the restriction of ? to X, considered as a subset of 2X, coincides with B(X). The restriction of the Effros ?-field to some subspace S of Iх will be denoted by ? («S). We continue to denote by (?, A) an abstract measurable space. As already mentioned, a map F from ? into Iх is also called a multifunction. The domain and the graph of F are, respectively, defined by dom(F) = {we ?: F(w) ^ 0} and Gr(F) = {(?. .?) e ? ? ?: ? e F(w)\. F is said to be Effros measurable or simply measurable, if for every W in ?, F~](W) is a member of A ("weakly measurable" in the terminology of Himmelberg [ 120]). From the definition of the Effros ?-field, it follows that F is measurable if and only if, for any open subset U of X, the subset Г'(Г)=Ц«: F(w)ni/^0}. is a member of A- Thesub-a-field F~'(?) generated by F is denoted by Ar- Clearly, a multifunction F: ? -> 2х is measurable if and only if the multifunction cl(F) is measurable. Further, if F is measurable then its domain is a member of A- This is a consequence of the equality dom(F) = F-'(X-). A strongly measurable multifunction is measurable, which justifies the terminology. This follows from the relationship (F-1(i/-))c = F-'({CeC: Cct/e}) valid for all open sets U and from the гн-closedness of the set [С еС: С с t/c}, which shows that F~'(i/~) is a member of В(Съ, тн) (in the above equality, the superscript "c" stands for the complement operation). Conversely, if F is a weakly measurable multifunction taking on its values in a rn-separable subspace С of Сь(Х), then it is also B(C, TH)-measurable (see Remark 4.9(H)). On the other hand, there are measurable multifunctions which are not strongly measurable (see, e.g., Example 3.4 in [ 117] or [23]).
630 С. Hess The set of all measurable multifunctions with values in X is denoted by M(A, 2X) or simply M{2X). The set of closed-valued measurable multifunctions is denoted by M{C{X)). A multifunction F: ? —» 2х is said to be graph-measurable if Gr(F) is a member of the product ?-field A <g> B{X). The connections between Effros measurability and graph measurability are given by Theorem 4.1 below. The proofs can be found in [41]. First, the following notion from measure theory is needed. The ?-field of universally measurable subsets, associated with this measurable space, is denoted by A and is defined as the intersection of all ?-fields ?? (the ?-completion of A), where ? ranges over the set of all positive bounded (or probability) measures on (?. A). Consequently, one has the inclusion А с A and, when the ?-field A is ?-complete with respect to some fixed positive bounded measure ?, one has the equality For example, analytic subsets and coanalytic subsets of a complete separable metric space are universally measurable (see, e.g., [67]). Theorem 4.1. (a) If a closed valued multifunction F is Effros measurable, then it is graph-measurable. (b) Conversely, every graph-measurable multifunction F : ? —> 2х is Effros measurable with respect to A, the ?-field of universally measurable subsets of(?, A). (c) Consequently, if F is defined on a complete probability space (?. ?. ?), a closed valued multifunction F is Effros measurable if and only if it is graph-measurable. PROOF. Statement (a) is easily proved by relying on the equality Gr(F)= {(?,?)?? ? X: d(x, F(w)) = 0}. and by noting that the function (?. л) -> d(x.F(co)) is A ® ??(X)-measurable. The proof of statement (b) is a consequence of the Projection Theorem (see the proof of Theorem 4.6). ? From Theorem 4.1(c), one can deduce a lot of stability properties for measurable multifunctions. For example, we mention the following result concerning countable unions and intersections. THEOREM 4.2. Assume that (?. A. P) is complete. If (F„),i^i is a sequence of measurable multifunction, then the multifunctions F and G defined by F=\jF„ and G=f]F„. are measurable.
Set-valued integration and set-valued probability theory: An overview 631 A crucial notion for working with measurable multifunctions is that of selection and of Castaing representation. Dehnition 4.3. Let F: ? -> 2х be a multifunction. (i) A selection of F is a map / from ? into X such that /(?) e F(a>) for every ? e dom(F). (ii) A Castaing representation is a sequence (/„) of measurable selections, such that for every ? e dom(F), F(oj) is equal to the closure of the countable subset {/„(?): ? 1} (see, e.g., [41. Chapter III] or [1741). The following theorem provides two existence results of a measurable selection. It does not involve the linear structure of X. The proof of part (a) can be found in, e.g., [41,124, 136,174,208,209]; that of part (b) is contained in [41, Chapter III]. Theorem 4.4. (a) IfF e M(A, C(X)) and ifdom(F) is nonempty: then F admits at least one A-meas- urable selection. (b) If F is a graph-measurable multifunction with values in Iх and if Gr(F) is nonempty, then F admits at least one ?-measurable selection. In the above result, it can be noted that, when F is closed valued, it has at least one Д-measurable selection. When no particular hypothesis is made on the values of F, one can only assert the existence of an Д-measurable selection. As already mentioned, when (?, A) is endowed with a positive finite measure ?, this entails the existence of one ?? -measurable selection. As the next result shows, the measurability of a closed-valued multifunction can be expressed in terms of distance functions and is connected with the notion of Castaing representation. THEOREM 4.5. If F is a closed valued multifunction, the following three statements are equivalent (a) F is Effros measurable, i.e., F_l (U~) e Afor all open subsets U of X. (b) For every ? e X, the function d(.x. F(·)) is A-measurable. (c) F admits a Castaing representation. Equivalence (a) o· (b) is valid in a more general setting. Indeed, a multifunction F whose values lie in a separable metric space is weakly measurable if and only if, for every ? e X, the positive function d(x. F(·)) is measurable. This is a consequence of the following equality F-'(?(*,(*)) = {?&?: d(x.F(co)) <a\. valid for all ? e X and a > 0 (where B(x, a) denotes the open ball of radius a centered atjc).
632 С. Hess THEOREM 4.6. If F is a closed valued multifunction defined on a complete probability space (?,?,?), then each of the following statements is equivalent to anyone of Theorem 4.5. (d) F is graph measurable. (e) F'BeAforall BeB(X). (f) F~C e Afar all С eC(X). PROOF. We only sketch the main implications. Implication (d) => (e) is an immediate consequence of the Projection Theorem (Theorem 111.23 in [41]). Indeed, for every В е B(X), the subset F~ В = ???)? (Gr( F)A(fix8)), is a member of A = A- As to the implication (b) => (d), simply observe that, for any countable dense subset D of X, the following equality holds Gr(F)= []{???: d(.x. F(a>)) = 0}. Implication (f) => (a) follows from the fact that, in a metric space, every open subset U is the countable union of closed sets, namely u=\Jc„, which yields u- = {J c-. ? Remark 4.7. Consider the property (g) below. (g) F~K eA for all К е /C(X). When X is a metric space the implications (f) => (a) => (g) hold. The three properties are equivalent when X is ?-compact (i.e., a countable union of compact sets), especially when X is a finite dimensional Banach space (see, e.g., [120] or [41]). Further results on the measurability of multifunctions and counter-examples can be found in [121]. One of the advantages of considering the Effros ?-field in order to define the measurability of a multifunction is that this ?-field is equal to the Borel ?-field of a separable metrizable topology, namely the Wijsman topology. This topology was introduced by Wijsman [213,214] on the space of closed convex subsets in an Euclidean space, but it also makes sense in a general metric space (X, d). In this setting, the Wijsman topology is defined on C(X) as the weakest topology rWu/) determined by the family {d(x, ·): ? & X). Equivalently, rw(f/) is the topology of pointwise convergence of distance functions on X.
Set-valued integration and set-valued probability theory: An overview 633 For every С е C(X), the distance function d(-,C) is Lipschitz continuous (with Lipschitz constant one). It follows that, when the metric space (X, d) is separable, the Wijsman topology is metrizable and separable (see [101]). Further, Beer [26] proved that when (X, d) is Polish2 (C(X), rW(j,) is Polish too. The following result displays the Borel structure of the Effros ?-field. The proof can be found in [101, Proposition 3.1.1] or [27, Theorem 6.5.14]). THEOREM 4.8. If(X, d) is a separable metric space, then, one has ?(C(X))=B(C(X) Remark 4.9. (i) The equality of Theorem 4.8 is no longer true when the Wijsman topology is replaced with the Hausdorff metric topology гн- In general, the Effros ?-field is strictly included in B(C(X), ??) (see [23]). (ii) However, when we restrict to the space IC(X) of compact subsets of X, we have the equality ?(?(*)) =S(/C(X),th). The proof can be found, e.g., in [27,41,177]. More generally, according to a remark in the beginning of the present section, if С is a тн-separable subspace of C(X), then one has E(C')=B(C',tv/{j))=B(C',th). This follows from the second countability of the restriction of гн to С and from the fact that the /г-closed balls are members of the Effros ? -field. In turn, the latter is a consequence of the equality h(C,C') = sup\d(x,C)-d(x,C')\, valid for every pair (C, C') e C(XJ, where D denotes a countable dense subset of X. There are a lot of specific situations where one wants to find a measurable selection. For example, a result which can be useful in many situations, for example in Control Theory, is the so-called Measurable Implicit Function Theorem (Theorem III.38 in [41]). THEOREM 4.10. We consider another measurable space (T.T), a Suslin topological space ? (i.e., the continuous image of a Polish space), a multifunction F: ? -> 2Y such that Gt(F) eA<S>B(Y),a multifunction G : ? -> 2"'" such that Gr(G) e ? ? ? and a map g-.???^? which is measurable with respect to ? ? ?(?) and T. If it is assumed that g(w, F(w))nG(w)^0, 4???, -A topological space is said to be Polish if it is separable, metrizable and complete for a suitable metric.
634 С. Hess then there exists a map f: ? —> ?, whose graph is A <8> ? (?) -measurable, which is a selection of F and which satisfies g(w,f(w))eG(w). VweC. Remark 4.11. (i) Another useful notion for closed convex valued multifunctions is that of scalar measurability. A multifunction F : ? -> CL- is said to be scalarly measurable if for every у e X*, the function s(y, F(·)) is measurable. It is not hard to show that a measurable multifunction is scalarly measurable. Indeed, for every у е X* and every real a, one has {C eCc: s(y,C)>a} = W~. where W = {x e X: (y, x) > a). The converse implication does not hold in general, but it holds for the class of countably supported multifunctions (see Definition 5.5 below) satisfying a suitable analyticity property. The reader may consult [24] for further details and [23] for the connections with strong measurability. (ii) Let us also mention the works of Leese [141,142] who presented another approach of measurable multifunction, involving the Suslin operation and Suslin spaces. The case of multifunctions whose values lie in a nonseparable Banach space was considered by Barcenas and Urbina [25]. 5. The Aumann integral Consider a probability space (?. ?, ?) and a sub-a-field В of A. By ?\?, ?, ?; X) we denote the space of all (classes of) measurable functions from (?. B) into (X.B(X)). For every F e M{C(X)), we also define S(F, B)={fe L°(i2. ?, ?: ?): /(?) e F(co). for ?-almost every ? e dom(F)}. It is known that L°(i2, ?, ?; ?) (= L°(X)) endowed with the topology of convergence in probability is a metrizable topological vector space, provided one identify two functions that coincide ?-almost surely. Since a sequence converging in probability admits an almost surely converging subsequence, it is clear that, for any sub-a-field В of Л, the set S(F, B) is closed in L°(i2, ?, ?; X). We denote by ?(?,?,?.?) (or L](X) for short) the subspace of L°(X), whose members are Bochner integrable. Given a sub-?-field В of A and a multifunction F, we define the following L1 (X)-closed subset of ^(?, ?, ?; ?) S](F,B)= {/??\?,?,?:?): f(w)e F(w), for ?-almost every ? e dom(F)}. We begin by recalling a result on the measurability of infimums (respectively, supremums).
Set-valued integration and set-valued probability theory: An overview 635 PROPOSITION 5.1. Let F eM(C(X)) and ?: ? ? ? -> Ш be an A® B(X)-measurable function. If for every ? e ?, ?(?. ·) is upper semicontinuous, then the function тф defined by ? -» ???(?) = \??{?(?, ?): ? e F(a>)}. is measurable. On the other hand, if?(?.¦) is lower semicontinuous, >Пф is A-measurable. A similar result can be stated for the supremum by replacing ? with —?. If ?: ? ? ? -> R is A® ??(X)-measurable then we can define the integral functional ^опЬ\Х)Ъу Mf)= f ?{?,/(?))??, feC](X). The following result allows for interchanging the infimum and the integration operations, and plays a crucial role in set-valued integration. Its proof can be found in [117, Theorem 2.2]. PROPOSITION 5.2. Let ?: ? ? ? -> R be as in Proposition 5.1. Then one has inf{70(/): /eS'(F)}= [ тф{Ш)а^х. JQ Applying Theorem 5.2 to the function (?, ?) -» ? (?, ?) = \\?\\, it is not hard to see that when the following inclusions hold AF с В с А, the set S] (F, ?) is nonempty if and only if the positive function d@, F(·)) is integrable. In such a situation, we say that the multifunction F is integrable. Observe that when F is integrable, one has M(dom(F)) = 1. On the other hand, F is said to be integrably bounded if the function ? -> /?(F(o>)) = sup{ll-x||: xeF(a))} (also denoted by ||F(?)||) is integrable. In this case, every measurable selection of F is integrable, so that one has S](F,A) = S(F,A). Further, the values of F are almost surely bounded.
636 С. Hess The following notion of integral for multifunctions was introduced by Aumann [17]. For any measurable multifunction F and any sub-?-field В of A, the set-valued (Aumann) integral of F over ?, with respect to B, is denoted by I(F, B) and is defined by I(F,B) = U /??: feSl(F.B)\. I(F,A) is simply denoted by 1(F); it is nonempty if and only if F is integrable. The integral of F over A e A is denoted by I a (F). On the other hand, F is said to be Aumann-Pettis integrable if it admits at least a Pettis integrable selection. For the definition and elementary properties of the Pettis integral, the reader may consult [68, p. 52] and [155,156]. Let us only recall that in finite dimensional spaces the Pettis integral coincide with the usual integral, whereas in infinite dimensions there exist Pettis integrable functions that are not Bochner integrable. Example 5.3. An integrably bounded multifunction is integrable, whence Aumann- Pettis integrable, but the converse implication is false. Indeed, consider the measurable multifunction F defined by F(w) = ?@, r(o))) = the closed ball of radius r(w) centered at the origin, where r : ? -> @, +oo) is a given nonintegrable measurable function. In this case, the Aumann integral (see below) of F over ? is equal to X. Here are some properties of the Aumann integral, whose proofs can be found in [ 17]. THEOREM 5.4. Let F,G e M(C(X)). If we assume in addition that F and G are integrable, then the following properties hold. (a) I(c\(F + G)) = d(I(F)+I(G)). (b) cl/(coF) = co/(F). (c) ?/(?,?, ?) is nonatomic, then c\I(F) is convex and one has c\ I (co F)=c\ 1(F). (d) //(?, ?, ?) is nonatomic and if X is finite dimensional, then 1(F) is convex and I(F) = I(coF). Moreover, if F is integrably bounded, 1(F) is compact. (e) For all у е X* one has s(y,I(F))= [ s(y,F)d?. JQ
Set-valued integration and set-valued probability theory: An over\iew 637 (f) If F and G are integrably bounded then the following inequality holds fc(cl/(F),cl/(G))< f h(F,G)dp. (g) If F is integrably bounded and takes on its values in a тц-separable subspace of Със(Х), then one has E(F) = cl/(F) where E(F) denotes the integral as defined in Section 3 (Theorem 3.5). The first part of property (c) is a consequence of the Lyapunov Convexity Theorem (see [117, Theorem 4.2]). The first part of statement (d) was proved by Richter [171]. The proof of statement (g) can be found in [ 117 ]. The others properties are due to Aumann [17]. Other properties related to statement (b) will be given in Theorem 5.9. Theorem 5.6 below concerns the class of countably supported multifunctions, whose definition is recalled hereafter. DEFINITION 5.5. A multifunction F-.? -> CC(X) is said to be countably supported if one can find a countable subset D of B* such that F(w) = P| {x e X: (y,x) < s(y, F(co))}, for all ?? ?. If we want to emphasize the role of D, we say that F is D-countably supported. For example, if ? is a topology on X, compatible with the duality pairing, every multifunction whose values are weakly compact and convex is D-countably supported for every ?- dense subset of B*. However, the class of countably supported multifunctions is much wider (see [24]). In particular, a countably supported multifunction may have unbounded values. On the other hand, every member F of ?/'(Cc. Л) is countably supported with respect to a countable subset of X*. generally depending on F (Theorem 3.6 in [23] and Proposition 2.4(g) in [24]). The first part of the following theorem characterizes integrable selections of a countably supported multifunction, the second part provides a criterion for two such multifunctions to be almost surely equal. Theorem 5.6 remains valid for Aumann-Pettis integrable multifunctions (see [78]). THEOREM 5.6. Let F be a measurable, integrable, closed convex valued multifunction and f eL'(X). (a) If F is countably supported then feS](F)ifandonly if fd^?c\IA(F). for all A e A- I
638 С. Hess (b) Let G be another integrable, closed convex valued multifunction. If G is countably supported then F(co) = G (?) for almost every we ? if and only if clIA(F) = c\IA(G). for all A e A. The set-valued Aumann integral is not always closed. Thus, it is interesting to indicate situations where it is. This problem has been considered by several authors, such as Castaing [33,37], Hia'i and Umegaki [117], Byrne [31] and KM [128] among others. THEOREM 5.7. If F e M(C(X)) is integrably bounded and convex weakly compact valued then 1(F) is weakly compact. PROOF. Clearly, the set 51 (F) of integrable selections of F is L1 (X)-closed and convex. By Theorem 3.6(ii) of [129]. S'tF) is also weakly relatively compact, whence weakly compact. Further, since the map ?-.f —* §Qfdpi from L](X) into X is linear and strongly continuous, it is also continuous with respect to the weak topologies. The desired conclusion is obtained by noting that 1(F) = ^(S'(F)). ? Now, we turn to an interesting question: given a subset 5 of L' (X), is it possible to find a necessary and sufficient condition for the existence of an integrable multifunction F such that S = S*(F)? The answer is provided by Theorem 5.8 below, which is Theorem 3.1 of [117]. Recall that a subset S с L](X) is said to be decomposable (with respect to A), if for every A e A and every /. g e S. t.\f + 1дс? is a member of S. THEOREM 5.8. Let S be a nonempty closed subset of ZJ(X). Then, there exists an integrable multifunction F e M(C(X)) such that S = S](F) if and only if S is decomposable. The following result addresses the problem of interchanging the set-valued integration and the convex hull in finite dimensions. THEOREM 5.9. Assume that X = W1 and consider a multifunction F, whose values lies in R'l, the positive cone of X. IfGr(F) e A<S>B(X), then, one has со / Fd\x= I coFi^. Remark 5.10. (i) The equality of Theorem 5.9 was proved by Aumann [17] and was extended later by Wagner [210]. Another proof based on the theory of Young measures was given by Balder [20]. On the other hand, the case of multifunctions with a countable range was examined by Khan and Sun [126]. (ii) It is interesting to mention another area of research concerning the integration of both, single and set-valued functions, namely integration of random variables taking on their values in a metric space with no linear structure. This can be traced back to Frechet [84] and was taken further by S. Doss [72.73], Benes [28]. Herer [90-93],
Set-valued integration and set-valued probability theory: An oven'iew 639 H. Doss [71], Raynaud de Fitte [170], Es-Sahib and H. Heinich [79], and Hess [1121. In some of these works, one can find extensions of the strong law of large numbers and of martingales convergence theorems. (iii) Let us mention a result that has applications in Control Theory and Optimization. Given a measurable multifunction G defined on a product space ? ? ?. the integral over ? is a multifunction depending on the second variable, namely ?-.t -> / ??,?)?(??). In this situation, one may look for a measurable selection ? of the multifunction ?, which has the following form ?-.t -> / ?(?,?)?(??). where g is a suitable measurable selection of G. This question was raised and solved by Artstein [6] when X is finite dimensional. An infinite dimensional extension was established recently by Saint-Pierre and Sajid [178]. (iv) A notion of variance for random sets was introduced by Kruse [ 134] by means of selections. 6. The set-valued conditional expectation of closed valued multifunctions The construction of the set-valued expectation is a natural extension of the multivalued integral, but it is more delicate. It was examined by several authors. Let us mention Van Cutsem [203-205], Neveu [158], Bismut [30], Dynkin and Evstigneev [77], Castaing and Valadier [41], Hia'i and Umegaki [117]. Hia'i [114] and Papageorgiou [163]. Given a sub- ?-field В of A, and a integrable Д-measurable multifunction F. Hia'i and Umegaki [117] showed the existence of a ?-measurable integrable1 multifunction G such that S\G,B) = c\{E(f\B): /eX'(FJ)}, the closure being taken in L1 (?; X). The multifunction G is the (set-valued) conditional expectation of F relatively to В and is denoted by E(F\B). Clearly, it is defined up to a ?-null set. The existence of G is an immediate consequence of Theorem 5.8. Indeed, it is easy to check that the set (?(/|?): feS\F,A)}. is decomposable with respect to the sub-a-field В and that the L1 (X)-closure of a decomposable set is still decomposable. ¦^In fact, Hia'i and Umegaki assumed that F is integrably bounded, but their proofs easily extend to the integrable case (see [114]).
640 С. Hess One may observe that the above definition is global and only defines E( F\B) by mean of the set of its ?-measurable integrable selections. Thus, it is desirable to provide examples where a 'pointwise' characterization is available, i.e.. a formula allowing one to calculate E(F\B)(w) for almost every ? e ?. Several examples of that type are presented hereafter. The more general ones are examples (e) and (f)- Example 6.1. (a) If the multifunction F reduces to a single-valued function /:? -> X, the conditional expectation of F clearly coincides with the ordinary conditional expectation for vector-valued random variables. (b) If ? = {?, 0}, then for any integrable multifunction F we have E(F\B)(co) = c\I(F) a.s. (c) Let r be a real positive integrable function and /: ? -> X be a vector-valued integrable function. If the multifunction F is defined by F(w) = B(f(co),r(co)). then its conditional expectation with respect to В is given by E(F\B)(w) = ?(?(/|?)(?), ?(?·|?)(?)). This can be seen by using Theorem 6.2(c) below. (d) Let и be a real-valued integrable function and у е X. If we define the multifunction Fby F(co) = {xeX: (y,x)=v(w)}, then E(F\B)(w) = {x e X: (y, ?) = ?(?|?)(?)}. (e) Assume that X* is strongly separable and that F is an Д-measurable integrable multifunction whose values lie in Съ^Х)- Then, the following equality holds (see [117, Theorem 5.5]) E(F\B)(w)= f]{.xeX: (y.x) < E(j(v. F)\B)(w)}. A similar equality holds when F takes on its values in /CWC(X) and when D is a countable Mackey dense4 subset D of X*. It is proved by replacing the strong topology on X* by the Mackey topology. 4Such a countable subset exists in X* as soon as X is separable. For the definition and the basic properties of the Mackey topology, see, e.g.. [184. p. 1311.
Set-valued integration and set-valued probabilin theory: An overview 641 (f) Assume now that there exists a regular conditional probability distribution of ? given B, i.e., a map Q: ? ? ? -> [0, 1] satisfying (i) for every В e B, the map ? -> <2 (?|?) is Д-measurable, (ii) for every we ?, the map В -> Q (?\?) is a probability measure on (?. B), (iii) VAe AVBefi, ?(???) = /4 <2(?|?)?(??). In this case, it was shown by Valadier [201, Theoreme 1] that the set-valued conditional expectation is given by E(F|B)(fl>) = cl{J f($)Q(d$\w): feS\F,A)\ a.s. We end this section by recalling some basic properties of the set-valued conditional expectation, which are natural extensions of those of real-valued or vector-valued random variables. The proofs can be found in [117] for the integrably bounded case and in [114] for the integrable case. As the reader will see, the classical properties of the conditional expectation are recovered only for convex valued multifunctions. THEOREM 6.2. If F and G are two integrable multifunctions with closed values in X, and В is a sub-?-field of A, then the following properties holds. (a) E(cl(F + G)\B) = cl(E(F|B) + E(G|S)) a.s. (b) Ifr is a real В-measurable function such that rF is integrable, then E(rF\B) = rE(F\B) a.s. (c) If g is a bounded scalarly B-measurable5 function from ? into X* then s(g,E(F\B)) = E(s(g,F)\B) a.s. where s(-, C) denotes the support function of the subset С (see Section 2). (d) E(coF\B)=coE(F\B)a.s. (e) If F and G are integrably bounded, then A(E(F\B),E(G\B))^A(F.G). In other words, the map F —> E(F|??) is nonexpansive from С (Съ, A) onto C\Cb,B). (f) If F c, G a.s., then E(F|#) с E(G\B) a.s. (monotoniciry property). In the following theorem, the values of the multifunction F are required to be convex. Thai is, for every ? e X, the function ? —> (g(a>).x) is ?-measurable.
642 С. Hess THEOREM 6.3. Let F be an ?-measurable integrable multifunction, with values in Cc. (a) IfB\ ? ? ? A, thenE(E(F\B)\B\) = E(F\B\). (b) If F is B-measurable and if r is an ?-measurable positive bounded function such that rF is integrable, then E(rF\B) = E(r\B)F a.s. In particular, E(F\B) = F. (c) If F is B-measurable and if В] СбсД then the conditional expectation of F relative to B\ taken on the base space (?.?.?) is equal to the conditional expectation of F relative to B\ taken on (?. ?. ?). Remark 6.4. (i) The assumption of r being positive cannot be removed in Theorem 6.3(b) as the following example shows. Consider the measure space ([0. 1), ?, ?), where ? is the Lebesgue measure, В the trivial ?-field [?, 0} and F the constant multifunction defined by F(a>) =[-1,1] (here X = R). Also consider the function r such that r(w) = -l if0^w<l/2 and r(w)=l. if 1/2 ?? ? <1. Then E(rF\B) = F, but E(r\B)F = {0}. (ii) If F is not assumed to be convex valued, properties (b) and (c) of Theorem 6.3 are no longer true. Indeed, assume that X = R and that (?.?. ?) is nonatomic. Let ? = {?, 0} and F be the constant multifunction such that F(co) = {0. 1), ? e ?. Theorem 5.4(c) shows that E(F|?) = cl/(F) = [0. 1]. so that the equality E(F|??) = F does not hold. In view of the next result, we recall that for every G e MBX) and every sub-?-field ? of Л, we have set (see Section 5) h(G,T)= J f fdix: /6S'(G.f) ¦ BeT. THEOREM 6.5. Let F be an integrable multifunction with closed values. (a) For every В е В one has c\IB(E(F\B).B)=c\IB(F.A). (b) If F is convex valued, then for every В е В one has c\IB{E(F\B),A)=c\IB(F.A).
Set-valued integration and set-valued probability theory: An over\'iew 643 (c) If F is a member of C[(C^.A) (see Proposition 3.3), then E(F\A) is uniquely determined as a member of L\ (С, B) satisfying the equality in (b)for every В eB. Here is an extension of Theorem 5.5 of [ 117] and of our Example 6.1(e). The proof is similar, but is based on our Theorem 5.4. THEOREM 6.6. Let D be a countable subset of B* and F : ? -* CC(X) be a D-countably supported, integrably bounded multifunction. Then one has, for almost all ? e ?, E(F\B)(w) = p| {x e X: (y,x) <; E(i(y. F(c))\B)}. veD Remark 6.7. The multivalued conditional expectation is closely connected with the conditional expectation of random variables depending on a parameter. This problem is of great importance in many applications. See, e.g., [41,43,81,201,202,204]. 7. Set-valued measures The theory of set-valued measures (we shall rather use the synonym "multimeasures"), has been developed by Vind [206], Schmeidler [186], Debreu and Schmeidler [66], Art- stein [2], Godet-Thobie [86-88], Coste [48-50], Pallu de la Barriere [162], Drewnowski [74], Coste and Pallu de la Barriere [52-55], Hiai [113,114], Thiam [193-196], Papageor- giou [164] and Klei" [127] among others. One of the motivations was the applications to Mathematical Economics or to Statistics. Basically, multimeasures are measures whose values are subsets of some Banach space (or, more generally, topological vector space). As we shall see, there are at least three possible definitions depending on the series summability concept considered in the space of closed sets. We first recall that a sequence (x„ )„ ;> ? in X is said to be unconditionally convergent if for every one to one map ? from N onto itself the series ?,^\ -*0<»> 's convergent. An analogous definition can be given for a sequence (C„) in Със(Х) endowed with the Hausdorff distance. Given a sequence (C„)„^i in Iх, the infinite sum ?,?>\ С» is defined by J]C„ = {* e ?: ? = ??"· x" e Ся. О 11. G.1) where the convergence of the series ?]„> | ·*„ is unconditional. Further, recall that a map ?: A -> Iх \ {0} is said to be additive if for every pair (?, ?) of disjoint members of A one has M(ADB)=c\(M(A) + M(B)).
644 С. Hess where the "+" sign stands for the Minkowski addition. When the values of ? are weakly compact, the closure operation is no longer necessary. This definition can be extended to a pairwise disjoint family of members of A. The first definition of a multimeasure uses the Hausdorff distance on the space of closed sets. DEFINITION 7.1. An additive map ? : A -> C(X) is called an h-multimeasure if ?@) = {0}, and if for every sequence (A„)„^ ? of pairwise disjoint sets in A, we have Дт^ UMjA„.cl^M(A,) =0. The second definition refers to the summability notion introduced by equality G.1). DEFINITION 7.2. An additive map ?: A -> 2х \ {0} is called a strong multimeasure if ??@) = {0}, and for every sequence (A„)„^| in A of pairwise disjoint sets, we have mI\JaA=c\J2m(A„). The third definition involves support functions. DEFINITION 7.3. An additive map ? : A -> 2х \ {0} is called a weak multimeasure if for every у & X*, the map A -> s(y, M(A)) is an extended real-valued measure. Clearly, if ?: A -» 2X is a weak multimeasure, then so are the maps M' = c\M and M" = coM. The same property is valid for /г-multimeasures (for M", use Proposition 2.1(c)). The main connections between the above three definitions are provided in the next proposition. Proposition 7.4. (a) If M.A-* Съ(Х) is an h-multimeasure, then it is also a strong multimeasure. (b) IfM.A—* Съ(Х) is a strong multimeasure, then it is also a weak multimeasure. (c) IfM.A—* /CWC(X) is a weak multimeasure. then it is also an h-multimeasure. Thus, in this case the three notions of multimeasure coincide. Proposition 7.4(a) follows easily from the following result which provides another formulation of the countable summability for /г-multimeasures. It is due to Drewnowski [74]. PROPOSITION 7.5. //(C„)„^i is a sequence in Съ(Х), then the following two statements are equivalent. (i) The series (C„) is unconditionally convergent in Съ(Х) endowed with the Hausdorff distance.
Set-valued integration and set-valued probability theory: An overview 645 (ii) For every sequence (x„)„^ \ in X such that x„ e C„ for all ? ^ 1, the series (*,,) is unconditionally convergent. Moreover, the following equality holds cl ]T C„ = ell ??": x" e C» \ Let ? : A -> Iх \ {0} be a strong multimeasure. For every A e Л we define V^(A) by VM(A) = sup ]Г||М(А,)|| I /=1 where the supremum is taken over all finite Д-measurable partitions {? ?,..., A„} of A. From Proposition 1.1 of [113], it is known that Vm is a positive measure. If Vm(&) < +°o, then ? is said to be of bounded variation. Example 7.6. Let us give an example of strong multimeasure. For this purpose, consider Q e Със(Х) and the collection S of vector measures m : A -> X such that for all m e S and A e A, one has m(A) e ?(?)?). Now, define the map ?: A -> 2? \ {0} by ??(?) = y^w,(A,): w, e5, A, e Л (pairwisedisjoint), ? ^ 1 >. I i = l J It is not difficult to check that ? is additive and that it is a strong multimeasure. Example 7.7. A simple example of /г-multimeasure can be given in the following way. Let F e М(Съ(Х)). Assume that F is integrably bounded and define ?: A -> Съ(Х) by M(A) = c\IA(F). AeA. where Ia(F) denotes the Aumann integral of F over A (see Section 5). Example 7.8. If in Example 7.7 the multifunction F is no longer assumed to be integrably bounded, but only Aumann-Pettis integrable (i.e., the set of its Pettis integrable selections is nonempty), then the map ? is a weak multimeasure, but not necessarily an /г-multimeasure. Indeed, as Example 5.3 shows, there exist Aumann-Pettis integrable multifunctions that are not integrably bounded. Further, ? is said to be ?-continuous if ?(?) = 0 implies ? (A) = {0}. Given a weak multimeasure ?: A -> Iх \ {0}, a selection of ? is a vector measure m : A -> X such that m{A) e ? (A) for every A e A. The set of all selections of ? is denoted by <S,w· The multimeasure ? is said to be rich (of selections) if M(A) = cl{w(A): m eSM\,
646 С. Hess for all A e A. In this case, the multimeasure can be recovered from its selections. The following result is due to Coste [48]. THEOREM 7.9. Let M:A-* Съ(Х) be a strong multimeasure. (a) If A is countably generated and if ? is ?-continuous, then ? is rich. (b) IfX is separable and if ? takes on its values in Cbc(X), then ? is rich. (c) If ? takes on its values in /C« (X), then for every A e A, one has M(A)= \m(A): m e<S,w}. A member A of Л is called an atom of the multimeasure ? : A -> 2X \ {0} if ? (?) ? {0} and if either ? (?) = {0} or M(A\B) = {0} holds for every ? с А, В е A. A measure space having no atom is said to be nonatomic. The Banach space X is said to have the Radon-Nikodym property (RNP) if for each finite measure space (?, ?. ?) and each vector measure m : A -> X which is of bounded variation and ?-continuous, there exists f e L] (?,?.?: X) such that m(A) = I f ??. for all A e A. For example, it is known that reflexive Banach spaces and separable dual spaces have the RNP (see [68] or [69]). The following result concerns the convexity of the values of a multimeasure. The first part is due to Hiai [113], while the second part can be found in [2] or [49]. THEOREM 7.10. Assume that X has the RNP. Let ? : A -> 2?\{0) be a nonatomic strong multimeasure of bounded variation. Then for every A e A, cl ? (A) is convex. Moreover, the set cl \JM(AI 1ЛеЛ J is convex. Example 7.11. Let m : A -> X be a vector measure. Define ?: A -> 2х \ {0} by ? (A) = \m(B): В eA. B^A}. AeA. Then ? is a strong multimeasure, and ? is nonatomic if and only if m is so. Further, Vm(A) is equal to the variation of m and ?(?) is equal to the range of m. Now, we turn to versions of the Radon-Nikodym theorem for multimeasures.
Set-valued integration and set-valued probability theory: An merview 647 Definition 7.12. Let ? : А -> 2х \ {0} be a multimeasure. F eMBx) is said to be a set-valued Radon-Nikodym derivative of ? with respect to ? if M(A)= I Fd\x, AeA. JA and a generalized Radon-Nikodym derivative of ?? with respect to ? if clM(A) = cl / F<*m. AeA. J A Let us give a version of the Radon-Nikodym Theorem concerning multimeasures, whose values are weakly compact. It is due to Kle'i [127,129]. It provides a necessary and sufficient condition. Earlier versions of this result had been proved by several authors (see, e.g., [2,50,54,66,86-88,164,206]). THEOREM 7.13. Let X be a Banach space. Then, the following two statements are equivalent. (a) X has the Radon-Nikodym property: (b) For every probability space (?. ?. ?), for every multimeasure ? :?—> ?,?,(?) of bounded variation, there exists an integrably bounded Radon-Nikodym derivative F of ? with weakly compact values. Remark 7.14. Let us also mention the works of Thiam [193-196] who studied the Daniell integral for functions with values in an ordered semi-group and gave applications to the integration of real-valued functions with respect to a multimeasure. On the other hand, Puri and Ralescu [166] proved a version of the strong law of large numbers for vector-valued random variables with respect to a multimeasure. 8. The probability distribution of a measurable multifunction This section is devoted to the presentation of fundamental properties of the distribution of measurable multifunctions. This domain was explored by several authors for various purposes (applications to mathematical economics, stochastic optimization, etc.). Let us mention, for example, the works of Hart and Kohlberg [89], Hildenbrand [118], Artstein [3], Gine et al. [85], Artstein and Hart [10], Hess [96,97,103,111], Salinetti and Wets [182], Lavie [137,138,140] and Raynaud deFitte [169]. Here, we denote by M(X) the space of all probability measures on (X, B(X)). M(X) is endowed with the weak topology, also called the topology of narrow convergence (see, e.g., [29]). For each measurable function /: ? -> X, the distribution of / is denoted by ? f and defined on B(X) by ?/(?) = ?{/-'(?)}, BeB(X).
648 С. Hess Let ft be a ?-field on the space Iх and F: ? -> 2? an Д/ft-measurable multifunction. Like for real or vector valued random variables, the distribution of F is the measure ?^ on B*,ft) defined by Mf(tf) = M(F-' (/?)), tfeft. (8.1) Two measurable multifunctions F. G : ? -> 2? are said to be independent if for every R, S eTZ one has M(F-|(/?)nG-|E))=M(F-|(/?))M(G-|E)). This amounts to the equality M(F.G) =MF®MG, on the product space (C(X) ? C(X), 7г ® ft)). In the sequel, it will be sufficient to consider the case where ft is equal to the Effros ?-field ? (this will be explained in Section 9). It is natural to consider measurable selections that are measurable with respect to the ?- field Af = F~]{?) (i.e., the ?-field generated by the measurable multifunction F). This raises the question of the relations between the sets of distributions of ?h -measurable selections and those of Д-measurable selections. The answer is given in Theorem 8.3, and in Corollary 8.5(a) for integrable selections. In view of application to the SLLN, the multivalued integrals are considered in Corollary 8.5(b). On the other hand, we shall provide results allowing one to extract equidistributed and independent selections from sequences of measurable multifunctions having similar properties. For this purpose, appropriate characterizations of equidistribution are proved at the end of this section (Proposition 8.6), as well as a connection with the multivalued integral. First, we recall a simple and useful criterion for two measurable multifunctions to have the same distribution. PROPOSITION 8.1. If F and G are members of M(C(X)), then the following three statements are equivalent: (a) F and G have the same distribution on (C(X), ?). (b) For any open subset U of X, P{F~(U)) = P[G~(U)}. (c) For any finite subset ? = {.x\, л>} of X (or of some countable dense subset), the Rk-valued random vectors (d(x,F))x€y and (d(x,G))x€Y have the same distribution. PROOF. It follows from the definitions that statement (a) implies (b) and (c). To prove the implication (a) => (b), we define the subset W+ of C(X) by W+ = {CeC(X): CCIV).
Set-valued integration and set-valued probability theory: An oven'iew 649 for every subset W of X, and we observe that for each open subset U of X one has (U-)c = (Uc)+ = {CeC(X): C^UL}. where Uc denotes the complement of U. This class generates ? and is stable under finite intersections, which by a classical result yields the desired implication. For proving implication (a) => (c), remember that the Effros ?- field is also generated by the family of distance functions С ->d(x,C), where ? ranges over X (or over a countable dense subset). Like in the first part of the proof, it is enough to observe that the class, whose members are defined by (C eC(X): d(x,,C)<a,, V/= 1 it} (where к e N, x, e X and or,- e R), generates ? and is stable under finite intersection. D For every sub-a-field ? of A, consider the space ?\?. ?, ?; X) of all (classes of) measurable functions from (?,?) into (X.B(X)). Further, define the following subset associated with the multifunction F ?(?.?)=|? = ?/??(?); feS(F.F)}. where S(F,T) is the set of all J"-measurable selections of F (see the beginning of Section 5). So, M(F,T) the set of all probability measures ? on (X.B(X)) such that each ? e M(F,T) is the distribution of some ^"-measurable selection of F. Before stating the results on the set of selections and its associated set of probability measures, an example is useful. Example 8.2. Consider the case where ? = [0, 1], ? = ?(?), ? = Lebesgue's measure, X = ? and F = {0, 1} (i.e., F is a constant multifunction). Clearly we have ?? = [?,?), S(F,Af) = (f = Q,f=\) and M(F,AF) = [h.&\\. On the other hand, one has S(F,A) = (f = tA: AeA], This shows that the inclusions S(F,AF)?S(F.A) and M(F,AF)?M(F.A) M(F.A) = {p8{) + (\- p)o\: pe[0.\]}. may be strict.
650 С. Hess The following theorem is the main result of the present section. It was already stated in [96,97], but the proof contained a gap; the correct proof was given in [103,111]. It provides a fundamental equality, which is the starting point for the study of the distribution of multifunctions in connection with their measurable selections. Its consequences, especially Corollary 8.5(b) concerning the multivalued integral, play an important role in the proof of the multivalued SLLN for measurable multifunctions whose values may be unbounded. THEOREM 8.3. IfF is a measurable multifunction with closed values in X, then, in M(X) endowed with the topology of weak convergence of measures, the following equality holds true coM(F,A)=coM(F,AF). (8.2) where "со" denotes the closed convex hull operation. We denote by ?' (X) the subset of ? (?) whose members ? satisfy \?\\?? < +oo. Given a sub-a-field ? of A and a measurable multifunction F, we define the following subset of M(X) M\F,T) = {^,: feS\F.T)}. where 51 (F, T) was defined in Section 5. Now, consider an integrable multifunction F. Obviously, the inclusion M] (F,T) с M(F. JF) holds for any sub-a-field ? of A. The following simple lemma shows that the closure of both sides are equal in ? (?). LEMMA 8.4. For any sub-? -field ? ofA and any integrable multifunction F whose values are members ofC(X) the following equality holds true c\M\F,F) = c\M(F.T). the closure being taken in M(X) in the weak (narrow) topology. From Theorem 8.3 and Lemma 8.4, we easily deduce the following corollary, whose first part makes equality (8.2) more precise when F is integrable. Part (b) is an application of (a) to the multivalued integral. It was given by Artstein and Hart [10, Theorem 2.2] when F is an integrable measurable multifunction with closed values in a finite dimensional space. Let us note that Artstein and Hart's proof relies on a previous result of Hart and Kohlberg [89] valid for integrably bounded multifunctions. A self-contained proof was given in [111]. /
Set-valued integration and set-valued probability theory: An overview· 651 Corollary 8.5. (a) For every integrable multifunction F whose values are in C(X) the following equality holds true со M\F, А) = со M](F.Ah), (8.3) the closure being taken in M(X) in the weak topology. (b) For any integrable multifunction F whose values lie in C(X), one has со I (F, A) = со I (F,Ar). (8.4) The following result states that two measurable multifunctions F and G have the same distribution if and only if the sets of distributions of selections that are measurable relatively to the ?-fields Af and Ac respectively, are equal. As we shall see, this property plays a crucial role in the proof of the multivalued SLLN in an infinite dimensional Banach space. Considering the ?-fields generated by the multifunctions (instead of the ?-field A) allows for more precise results. Indeed, unlike in [10]. the equalities of statements (b) and (c) below do not involve any closure operation. This is important in view of the extension of the SLLN in infinite dimensional spaces. PROPOSITION 8.6. Let F and G be two measurable multifunctions with closed values in X. Then, the two following statements (a) and (b) are equivalent: (a) F and G have the same distribution on the measurable space (C(X), ?). (b) In M(X), the following equalit}· holds true M(F,Ar) = M(G,AG). Moreover, if F and G are integrable then each of the above statements is equivalent to (c) In ?' (X), the following equalit}- holds true M\F.Af) = M\G.Ag)- Consequently, if F and G have the same distribution one has I(F,AF) = I(G,AG). Remark 8.7. (i) An alternate proof of the last statement of Proposition 2.6(c) was given by Hia'i [116, Lemma 3.1B)] using the properties of the multivalued conditional expectation, as defined in Section 6. (ii) It is worthwhile to remark that Theorem 8.3 and Proposition 8.6 ((a) and (b)) remain valid when X is only assumed to be a complete separable metric space. (iii) Let us mention a result of Artstein [3], who considered the following problem: given a probability distribution ? on (K,(X). ?), is it possible to find ? e M(X) which is
652 С. Hess the distribution of some measurable selection / of a random set F : ? -» ?,(?) whose distribution is equal to ?? (iv) The study of the distribution of measurable multifunctions was initiated by Choquet [45]. Given F e M(C(X)), Choquet introduced the capacity functional redefined on fC(X) by ??(?) = ?(?-?(?-))=?{???: ?(?)?\??9). KelC(X). Choquet showed that, when X is finite dimensional, 7> characterizes the distribution of F. He also showed that if ? is a capacity satisfying suitable additional requirements then there exists a unique probability distribution ? on (C(X).E) such that T(K) = v(K-) = v({CeC(X): СПК^0}). К е ?,(?). Later, Matheron [148] gave a probabilistic proof of Choquet's result. See also Norberg [160,161] and Salinetti and Wets [182] for related results. A more abstract approach was considered by Ross [176] without topological assumptions. It was shown recently by H.T. Nguyen and N.T. Nguyen [159] that the above result on the existence of у may fail when X is not locally compact. Finally, let us note that the convergence in distribution of multifunctions was studied by Norberg [160,161], and by Salinetti and Wets [182]. The convergence in probability was examined by Salinetti and Wets [181], and Salinetti et al. [179]. 9. Set-valued strong laws of large numbers It is remarkable that the classical Kolmogorov strong law of large numbers (SLLN) makes sense in the set-valued case. Firstly, we shall present set-valued SLLN involving the convergence in the Hausdorff metric topology гн on Сь(Х). Secondly, we shall be interested in SLLN involving the Painleve-Kuratowski convergence, which allows for multifunctions taking on unbounded values. In both cases, we shall observe a convexification phenomenon. More precisely, in multivalued SLLN the limit is always convex, even if the given multifunctions are not convex-valued. On the average, the Cesaro sums tend to be convex. 9.1. Convergence in the Hausdorff metric topology As to the measurability issues, we have already noted in the beginning of Section 8 that the distribution and independence of multifunctions are relative to a given ?-field, say ??, on C(X). In this subsection, it is natural to take the ?-field ?? equal to В(Съ, ти). However, since we shall restrict to a гн-separable subspace, we can take TZ equal to the Effros ?-field as well (see Remark 4.9(ii)). THEOREM 9.1.1. Consider a strongly measurable multifunction F with values in a тц-separable subspace С of Съ(Х) and a sequence (F„)„~^\ of pairwise independent multifunctions having the same distribution as F. Then, the following two statements are equivalent
Set-valued integration and set-valued probability theory: An overview 653 (a) F is integrably bounded. (b) There exists С &Съ such that lim h I C, - V Fi (?) ) = 0 a.s. Moreover, when С exists it is equal to col(F) and the convergence also holds in the ?- metric, i.e., lim /}(C, -Vf,) =0. The proof of Theorem 9.1.1 can be done in two steps. When F is assumed to be convex valued, it suffices to use the embedding ? from Сы.-(Х) into C(B*) (see Remark 3.6) and to invoke the SLLN for vector-valued random variables. This is the easier step, but it requires the гн-separability of С The second step consists of a so-called convexification procedure, namely of a property of the following type, which is of a purely deterministic nature. The convexification property (#). Consider a pair (X,C), where X is a Banach space and С asubspaceofCb(X). For every sequence (C„)„~^\ in С and С eC satisfying (i) С is convex, (ii) Нт„^эсй(С,^5:!'=|СоС/) = 0. one has lim hlc-f^d) =0. Example 9.1.2. Let us mention three situations where property (#) holds. (a) X is finite dimensional and С = ?,(?) (Artstein and Vitale [11]). Using the Shapley-Folkmann inequality, it can be shown that for every sequence C|,..., C„, in K(X) one has hi yjcoC,, /Jc,- J ^ \idmax{r(Cj): i = 1 /?}. \/=i ;=i / where d is the dimension of X and r(C) denotes the radius of the subset C, i.e., r(C) — inf sup \\x — y\\. ¦te*veC An alternative proof in special cases was given by Cressie [59]. A variant of this theorem has been rediscovered recently by Uemura [198], and Taylor and Inoue [192].
654 С. Hess (b) X is a separable Hubert space and С = /C« (X). This was proved first by Cassels [32] using probabilistic arguments and later by Hess [95,99] by a geometric approach in the framework of (countable) Hubert spaces. This was extended later by Puri and Ralescu [166,167] when X is a Banach space of type ? e A, 2]. In all these papers, the proof is based on an inequality of the following form ]ГсоС,,]Гс,и ]Гг(С,У ( = 1 ( = 1 / I ( = 1 where к = 2 when X is Hubert and к = ? e A. 2] when X is of type p. (c) X is a separable Banach space and С = /C(X), the set of strongly compact subsets of X. In this case, property (#) was proved independently by Artstein and Hansen [9] and by Hia'i [115]. Further, it was shown by Hess [98,99] that it is not possible to replace /C(X) by /CW(X). It is interesting to mention a set-valued version of the Birkhoff ergodic theorem. Assume that a measurable map ?: ? -> ? is given. ? is said to be measure preserving if for every A e ? one has ?(?_? (?)) = ?(?). The ?-field I of invariant subsets of ? is the set of those A e ? such that A and T~l (A) are equal up to a ?-null set. The following result is due to Hess [94-96]. THEOREM 9.1.3. Let F be a strongly measurable and integrably bounded multifunction with values in a z\\-separable subspace С ofCb(X)- Then, for almost all ? e ? one has lim h ( coE(F|I), - Vfir?)) - 0. n^oc \ ? *—i y ' J As it is known for single-valued random variables, this result can be stated equivalently in terms of stationary sequences (see [94—96]). Extensions and variants of this result have been proved by Schurger [189] and, more recently, by Krupa [131]. On the other hand, assuming conditions on the moments, set-valued SLLN have been proved for nonidentically distributed, independent sequences of integrably bounded multifunctions. See, e.g., Lyashenko [146] and Hia'i [115]. 9.2. Convergence in the sense of Painleve-Kuratowski Now, we consider the extension of the SLLN to the case of measurable multifunctions whose values are unbounded. In this case, the Hausdorff distance is no longer appropriate to formulate the convergence, because it is too strong. Instead, we consider the Painleve- Kuratowski convergence, whose definition is recalled hereafter. Let ? be any topology on X. If (Cn)„^\ is a sequence in 2X we set ?-LiC,, = ? ? e ?: ? — ?- lim x„. xn e C„, ? ^ 11, I / Л
Set-valued integration and set-valued probability theory· An overview 655 and r-LsC„ = \x e ?: ? — ?- lim лг*. xt e C„{k). к > 1 . where (Cn^))k^\ is a subsequence of (C„). The set ?-LiC,, (respectively, ?-LsC,,) is called the lower limit (respectively, the upper limit) of (C„) relatively to the topology ?. These subsets are also called the inferior limit and the superior limit, respectively, which explains the notation. When ? is metrizable, these subsets are both ?-closed. Further, ?-Li C„ and ?-Ls C„ are unchanged if each C„ is replaced with its ?-closure. The sequence (C„) is said to converge to С in the sense of Painlevi-Kuratowski (PK), relatively to ?, if one has С = ?-Li C„ = ?-Ls C„ (see, for example, [ 14,27,45,180,190]). This is denoted by C = PK(r)- lim C„, n —>эс or simply С = PK- lim C„. when X is finite dimensional and ? is the usual topology on X. A convergence criterion in the general case was given by Couvreux and Hess (Corollary 1, p. 944, in [56]). The convergence in the Hausdorff distance entails the PK-convergence, but the converse implication is false, as the following simple example shows. Example 9.2.1. Let X = R2 be endowed with any norm and consider the subsets С = a straight line passing through the origin. С,, = a straight line such that the angle between С and C„ is \/n. It is readily seen that the sequence (C„)„j>i PK-converges to C, but does not гн-converges. In the present section, the ?-field TZ considered in Section 8 is taken equal to E, the Effros ?-field on C(X). Thus, the definition of the distribution and independence of multifunctions are relative to ? and we consider an Effros-measurable multifunction F: ? -> C(X). As to the integrability condition, integrable boundedness is no longer appropriate because an integrably bounded multifunction is almost surely bounded valued. Instead, we assume that F is integrable, namely that S] (F) is nonempty. The proof of the SLLN for multifunctions whose values may be unbounded are mainly based on the results presented in Section 8, especially on Corollary 8.5(b) and Proposition 8.6(c).
656 С. Hess THEOREM 9.2.2. Let X be a separable Banach space, F e M(C(X)) be an integrable multifunction and (F„)„^| be a sequence ofpairwise independent multifunctions having the same distribution as F. Then, there e.xists a null set N such that 1 " coI(F,A)=PK- lim - Vf,(w). ???\?. ?—>эс /; ?—1 ? = ? A converse to the above theorem can be given when the values of the multifunctions do not contain any line. They may be unbounded, but may contain half-lines only. For sake of simplicity, we only present the finite dimensional version of a result due to Hess (Theorem 9 in [98,99]). THEOREM 9.2.3. Consider a finite dimensional normed linear space X and a closed subset L of X which contains no whole line. Let (Fn)n^\ be a pairwise independent, equidistributed sequence in Л4(С(Х)). Moreover, assume that for almost all ? e ? the following inclusion holds Fd<o)QL. (9.1) Under the above assumptions, if F\ is not integrable, i.e., ifS(F\) =0, then 1 " PK- lim - Vf,(w)=0. 1 = 1 For example, if X = M.d, the subset L can be taken equal to the positive cone W+. When the F„ are single-valued, condition (9.1) simply means that the X-valued random variables F„ have positive values. Remark 9.2.4. (i) Theorem 9.2.2 has been proved by a lot of authors. It was proved first by Artstein and Hart [10] when X is finite dimensional, for mutually independent, identically distributed sequences. In the early eighties, infinite dimensional versions were proved independently by Hiai [116] and Hess [98,99], when the PK-convergence is replaced by the Mosco convergence, an infinite dimensional extension of the PK- convergence (see the next remark). Moreover, using a result of Etemadi [80], Hess [98,99] relaxed the hypothesis of mutual independence, assuming only the pairwise independence. Afterwards, the set-valued SLLN was proved with respect to other extensions of the PK- convergence such as the Wijsman convergence [111] and the slice convergence [107,111], that are of interest in infinite dimensional spaces. (ii) The above mentioned Mosco convergence is one of the infinite dimensional extensions of the PK-convergence. It was introduced by Mosco [153,154] and has interesting variational properties that was exploited in the areas of calculus of variations (see, e.g., [62,64]) and optimization [175]. If X is infinite dimensional and if the sequence (C„) PK-converges to С relatively to both the strong topology and the weak topology of X (denoted by s and w, respectively) then (C„) is said to be Mosco convergent to C. This
Set-valued integration and set-valued probability theory: An oven'iew 657 is denoted by С = M- lim„ C„. However, Mosco convergence has nice properties only in reflexive spaces so that several other topologies have been introduced. For an extensive study of convergences and topologies on spaces of subsets of a metric or a normed linear space, the reader is referred to the monograph by Beer [27] and to the paper by Sonntag and Zalinescu [190], who presented a classification of set-convergences. (iii) In some extent, it is possible to deduce the set-valued SLLN for unbounded valued multifunction from the integrably bounded case. This approach relies on a truncation argument. More precisely, let F e M(C(X)) be an integrable multifunction. For every integer к ^ 1, we define the multifunction Fk by Fk(w) = c\(F(w)DBk(w)), ?&?. where Bk(co) denotes the open ball of radius ?(?) = k(d@, F(a>)) + 1), centered at 0. Now, if (F„)„^ | is a sequence of pairwise independent measurable multifunctions having the same distribution as F, we set Fk(w) = d(F„(<u)r)Bk(co)), ???, where Bk{a>) is defined as above except that F(a>) is replaced by F„(a>). For every k, ? ^ 1, Fk is Af„-measurable and integrably bounded (because d@, F„(-)) is integrable). The Af„ -measurability follows from the Effros measurability of the map С->с1(СПВ@,г)). for every r > 0. Thus, when X is finite dimensional, Fk takes on compact values, which allows us to apply Theorem 9.1.1 to the sequence (Fk)n^\, for every к ^ 1. This yields 1 " coE(F*) = lim -Vf*(w) a.s. Then, using the simple equalities k^\ i = l / = l jt^l /'=| and Lemma 5.11 of [106], it is readily seen that this yields the PK-convergence as in Theorem 9.2.2. This technique can be used for proving other strong limit theorem for multifunctions. (iv) There are other kinds of set-valued SLLN where the multifunctions are no longer assumed to be identically distributed, but only independent (see, e.g., [115,116] and [169]). Let us also mention set-valued versions of the Komlos theorem (see [22,39,133,216]). Like in the SLLN, Komlos theorem involves Cesaro means, but is valid for L' -bounded sequences. An interesting application to fuzzy random sets was obtained by Colubi et al. [47].
658 С. Hess 10. Set-valued martingales In this section, X generally stands for a finite dimensional Banach space. Infinite dimensional extensions will be mentioned in Remark 10.4. Dehnition 10.1. Let (?,,)??^? be a nondecreasing sequence of sub-a-fields of ?. A sequence (F„)„^| of measurable multifunctions with values in CC(X) is said to be adapted to (Bn) if, for any /; ^ 1, F„ is B„-measurable, i.e., F~](U~) e B„ for every U e Q. An adapted sequence (F„) is said to be a set-valued martingale if the following two conditions (a) and (b) hold, for every integer /; ^ 1: (a) S] (F„,B„) is nonempty (i.e., F„ is integrable), (b) F„ =E(Fn+i\Bn). Further, if, instead of equality (b), one has the inclusion F„ с E(Fn+i |?„) (respectively, F„ 5 E(F„+| \B„)), we shall say that (F„) is a submartingale (respectively, supermartingale). Example 10.2. In the following examples (G„) denotes a sequence of integrable CC(X)- valued multifunctions and, for every integer// ^ \,B„=a(G\ G„) is the sub-a-field generated by the multifunctions G \ G„. (i) In this example, we assume that the G„ are centered in the sense that 0eE(G„+||i3„) a.s. for all О 1. (Ю.1) If we define the sequence F„ by F(I=cl j^G,¦[, (Ю.2) then (F„) is a submartingale. Indeed, from A0.1) and A0.2), and from Theorems 6.2 and 6.3, we obtain, for every ? ^ 1, E(F„+i|S„) = e(c1(^]g,+G„. Bn >i = l = cl{F(I+E(G(I+i|B(I)}2Fn. (ii) Now, we consider the sequence (G„) and (?„) as in (i). In addition, the G„ are assumed to be integrably bounded, but condition A0.1) is no longer assumed to hold. If the sequence (F„) defined by A0.2) is a supermartingale, then for all ? ^ 2 the multifunctions G„ are single-valued. Thus we have F, = G ? + g2 + h g„.
Set-valued integration and set-valued probability theory; An overview 659 where g-,: ? -> X are integrable random vectors with zero expectation. Only G\ can be set-valued. Indeed, for every ? ^ 1, one has E(F„+| \B„) = F„ + E(G„+| \B„) с F„. which implies E(G„+i|i3„) = {0}. for all « > 1. In turn, this yields the existence of a Bn+\-measurable random vector gn+\ such that G„+\ = [gn+\ }¦ This result is due to Ezzaki ([83, p. 17]). (iii) If we consider (G„) and (B„) as at the beginning of the present example, then the sequence (F„) defined by F„=co(|Jg,Y for all О 1. is a submartingale. Indeed, using Theorems 6.2 and 6.3 it is readily seen that for every ? ^ 1 and almost every we ?, one has E(F„+||i3„) = E(F„UG„+i|i3„) 2 E(F„\B„) UE(G„+| \B„) = F„ UE(G„+| |?„) Э F„. (iv) Let (G„) be a sequence of multifunctions as in example (iii). If we assume that OeG„ a.s. forallO 1, A0-3) then it is readily seen that the sequence (F„) defined by F„ = f]Gi, is a supermartingale. Indeed, for every ? ^ 1 one has almost surely E(F„+||i3„)=E(F„nG„+i|i3„) с E(F„\B,,) ПЕ(С,|Д,) - F„ ПЕ(С,|Д,) с F„. Condition A0.3) prevents F„ from taking empty values. (vi) Examples (iv) and (v) are special cases of more general situations. Indeed, consider an adapted sequence (F„) of multifunctions. If (F„) is a nondecreasing (respectively, nonincreasing) sequence of multifunctions, then it is a submartingale (respectively, a supermartingale). (vii) Finally, it is worthwhile to observe that the following property is an immediate consequence of our definitions: if a set-valued submartingale or supermartingale is single- valued (i.e., its values are points of the space X), then it is a martingale.
660 С. Hess The following result presents a property of almost sure convergence of set-valued martingales whose values may be unbounded. It can be seen as an extension of the classical Doob's Convergence Theorem. THEOREM 10.3. Let {B„)n^\ be a nondecreasing sequence of sub-?-fields of ? and (F„)„j,| an adapted sequence of multifunctions with values in CC(X). If (F„) is a set- valued supermartingale satisfying the condition sup Ed@, Fn) < +oo. then there exists an integrable multifunction F e M(C(X)) such that, for almost every ? e ?, F(w) = PK- lim F„(w). II —>OC Remark 10.4. (i) Theorem 10.3 is a special case of Theorem 5.12 in [106], where X is assumed to be an arbitrary separable Banach space (possibly infinite dimensional). There, the following additional condition was required. (WB) there exists a multifunction H, whose values are weak ball-compact, such that Fn(w)?H(w), OUefi. Recall that a subset С of CC(X) is said to be weak ball-compact if its intersection with every closed ball is weakly compact. Condition (WB) is automatically satisfied when X is reflexive or, more particularly, finite dimensional. The proof is based on a truncation argument and involves Lemma 5.11 in [106], already described in Remark 9.2.4(iii). On the other hand, existence results for martingale selections can be found in [105,106]. Here, for the sake of simplicity, we have only presented convergence results when X is finite dimensional. (ii) There are other references where other set-valued versions of the Doob convergence theorem are proved. For example, let us mention the works of Neveu [158], Daures [63], Hiai and Umegaki [117], Coste [51], Luu [144,145], Bagchi [18], Choukairi [46], Wang and Xue [211], Dong and Wang [70], Hess [109], Krupa [131,132] and Lavie [139]. A version of the optional sampling theorem can be found in [1]. An application to the convergence of fuzzy sets was given by Li and Ogura [ 143 ]. A lot of further references can be found in [131]. 11. Gaussian multifunctions. The set-valued Central Limit Theorem First recall that an X-valued random variable /: ? -> X is said to be Gaussian if for every finite sequence (y\,..., yy) e (X*O, where j' > 1, the Ry-valued random vector «3?,/>,-¦¦, <O./».
Set-valued integration and set-valued probability theory: An overview 661 is Gaussian. More generally, an integrably bounded multifunction F ; ? —* /C(X) is said to be Gaussian if, for every finite sequence (vi, v/) e (X*)J, the R^-valued random vector (s(y],F),...,s(yj,F)), is Gaussian. The following result characterizes Gaussian multifunctions with compact values. THEOREM 11.1. Let F-.? -> /CC(X) be an integrably bounded multifunction. The following two statements are equivalent. (a) F is Gaussian. (b) F has the following form F = K + f. where К e K,C(X) is nonrandom and /:?-> ? is a Gaussian vector with zero expectation. Theorem 11.1 was proved by Lyashenko [147] when X is finite dimensional and by Vitale [207]. It was extended later to the case where X is infinite dimensional by Puri and Ralescu [168], and by Meaya [150]. On the other hand, there exists a version of the central limit theorem for multifunctions whose values are compact. First, we need some notations. Let Sk ~' be the unit sphere of X = Жк and С (Sk~') be the space of continuous functions defined on S*_l, endowed with the uniform norm || · ||u. The dual space C(S*_I)* is the space of Radon measures on Sk~]. The duality pairing is defined by (k,u)= I udX, ue<C(Sk-^). XeCiS'*)*. Jsk-X Given a C(Sk~' )-valued random variable /, the expectation E(/) is defined as a Bochner integral. This is possible because CE'A^1) is separable. Further the covariance function rf is defined on the product space C^)* xCEi_l)* by ?/(??, ?2) = ?{(? ?, / - ?(/))(?2, / - ?(/))} = ? (?,. /(?) - ?(/))(?2· /(?) - ?(/)) ?(??). When F e _M(/C(X)) is an integrably bounded multifunction, the covariance function of F is denoted by /> and is defined as the covariance function of the C(Sk~' )-valued random variable y-> s(y, F).
662 С. Hess THEOREM 11.2. We assume that X = Rk. Let F e M(K(X)) be an integrably bounded multifunction and a sequence (F„)„^| of independent multifunctions having the same distribution as F. Then the sequence Vnhl -]Tf,,EcoF J, converges in distribution to \\g\\u, where g is a centered Gaussian C(Sk~])-random variable with covariance function Гу. The proof of Theorem 11.2 appeals to the central limit theorem for C(S*_l)-random variables, which allows for proving the result in the case where F e М(К,^(Х)), and to the Shapley-Folkmann inequality in the general case. Theorem 11.2 is due to W. Weil [212]. Previous versions had been settled by Cressie [60,61 ] in the case of multifunctions taking on a finite number of values. 12. Set-valued versions of the Fatou Lemma The classical Fatou Lemma of Integration Theory asserts that, given a sequence (/„)„^ ? of positive measurable functions, the following inequality holds / liminf/^P^liminf / f„dP. JR n^oc n^x J? Extensions of this inequality to the case where the /„ take on their values in X = R' (d ^ 2) or in infinite dimensional Banach spaces, were established in order to get the existence of equilibria in Economic Theory. In these extensions, the inequality must be changed into an inclusion. It is worthwhile to note that, because of the partial character of the order relation in W1, this sort of result cannot be deduced directly from the classical Fatou Lemma. For the same modelization purpose, the case where the /„ are replaced with measurable multifunctions F„ was examined by several authors (see, e.g., [118, Theorem 6, p. 68]). In the set-valued case, there are in fact two versions of the Fatou Lemma for multifunctions: one for the PK-lower limit and another for the PK-upper limit. Indeed, it is no longer possible to deduce one version from the other by merely changing Fn into — F„. On the other hand, it was shown by Balder and Hess [21 ] that there is a strong connection between the set-valued version and the vector version of Fatou's Lemma. In fact, one can be deduced from the other. As mentioned above, several authors have studied this subject. Let us mention the works by Aumann [17], Schmeidler [185], Hildenbrand and Mertens [119], Artstein [5], Balder [19], Yannelis [215],Castaing and Clauzure [38], and Balder and Hess [21] among others. We refer the reader to [21 ] for a more complete list of references and for a discussion of the various existing results. Basically, the first version of the following result is due to Aumann [17].
Set-valued integration and set-valued probability theory: An overview 663 THEOREM 12.1. Let X be a separable Banach space and (F„) be a sequence in M(C(X)). (a) If the sequence (d@, F„))n~^\ is uniformly integrable, then the following inclusion holds cl/(s-LiF„)CS-Li/(F„). (b) If the sequence (|| F„ \\ )„^\ is uniformly integrable and if there exists a multifunction G : ? -» /CW(X) such that F„(a))CG(a)), шей. ii^l. then one has w-Ls / (F„) с cl /(со w-Ls F„). Remark 12.2. (i) In statement (a) the multifunctions are only assumed to be integrable, whence can have unbounded values. In statement (b), the F„ are assumed to be integrably bounded, thus they are almost surely bounded valued. Versions of statement (b) for integrable multifunctions have been given, but they are of a more delicate nature. For example, they involves conditions on the asymptotic cones of the multifunctions (see [21,104]). On the other hand, results involving the convergence in distribution in connection with the approximation of the multivalued integral were given by Artstein and Wets [12]. (ii) Similar properties have been proved for the set-valued conditional expectation (see [102,104,116]). 13. Epigraphical convergence The epigraphical convergence, "epiconvergence" for short, is closely related to PK- convergence and its infinite dimensional extensions. Let и ·. X -> R be an extended real- valued function. Its epigraph (or upper graph) is the subset of ? ? Ш denoted by epi(w) and defined by epi(n) = {(x,a)e XxR: u(x) iCa}. The conjugate (or the polar) function of и is denoted by u* and defined on X* by K*00 = sup{()>,;0 -u(x): ? e X}. ye X*. The map и —» и* is often called the Young-Fenchel transform. Further, let (?,,),,^? be a sequence of extended real-valued functions. Consider a topology ? on X. The sequence (и„) is said to converge epigraphically (or "epiconverge" for short) to u, relatively to ?, if ерЦк)=РК(г)- lim epi(M„). A3.1) /?—?-ОС
664 С. Hess When X is finite dimensional and ? is the usual topology, this is denoted by и = lime и„, n—*oc where the subscript "e" stands for "epigraphical". Of course, a symmetric notion involving hypographs (alias lower graph) can be introduced. This type of convergence was considered by a lot of authors in order to study the convergence and the approximation of optimization problems, especially in the calculus of variations. Let us mention the works of De Giorgi [64], Mosco [153,154], Attouch [14], Dal Maso [62], Rockafellar and Wets [175] and Artstein and Wets [13], where further references can be found. Also, applications to the convergence of estimators, in Statistics, were given by Hess [108] and Choirat et al. [43]. Without entering into details, let us mention the following facts (see, e.g., [14] and [62]). If (V„) denotes the minimization problem mm[un(x): ? e X), /ieNU{oo), and if (и„) epiconverges to ux, then under additional compactness assumptions, it can be shown that infi/3C(;c)= lim inf m„(jc). .t€X ?-*???? Moreover, the solutions of (Voc) can be approximated by solutions of (Vn) for ? large enough. The connection of epiconvergence with that of measurable multifunctions is clear. Indeed, let be an extended real-valued function. The multifunction F defined by F(a>)=epi(<p(w. ¦)) = {(x,a)e XxR: ?(?.?) ^a\. A3.2) It is called the epigraphical multifunction associated with ?. When ? is ? <8> B{X)- measurable, it is called an integrand. It is not difficult to show the following equivalence (see Lemma VII. 1 in [41]) Gr(F)e A®B(X) <s> ? is ? ® ?(X)-measurable. Let us give an example involving epiconvergence in connection with the SLLN. First, we need the following simple auxiliary result, which is of independent interest. The proof can be found in [107].
Set-valued integration and set-valued probability theory: An oven-iew 665 Proposition 13.1. (a) The multifunction F defined by A3.2) is integrable if and only if there exists f e L1 (X) such that the function ? -> ?(?, /(?))+ is integrable.^ (b) In ? ? R, f/?e distance function from the origin to the epigraphical multifunction is given by w^d(@,0),F(w))=\nf{\\x\\ +?(?..?)+: xe X}. Theorem 13.2. (a) The stochastic discrete infimal convolution ?„ (?. ?) defined on ? ? X by II " 1 " ] -VWffl.i,): (.vi x„) e X", -У]*/ =x \ ? *—' ? i—1 1=1 1=1 J epiconverges to the convex lower semicontinuous regularization of the continuous infimal convolution ? which is the deterministic function defined by ?(?)=?\{ ?(?,/(?))??: feL](X). J /(?)??=?\. (b) For almost all ? e ?, the so-called mean functional defined by ?(?.?)??. ?? is the epi-limit of the sequence of the sample mean given by 1 " (?,?) -> -?^?(?,?). ? ?—1 /=? Statement (a) can be easily proved by reformulating Theorem 9.2.2 in terms of epigraphs. The proof of statement (b) is based on the continuity of the Young-Fenchel transform with respect to epiconvergence (see [14,62]). Remark 13.3. There is another approach allowing one to prove more general versions of Theorem 13.2(b). It is based on the formulation of the epigraphical limit in terms of Lipschitz approximation. It has applications to Statistics [76,108], Optimization Theory [13,57,58], Ergodic Theory and in Econometrics [43]. 14. Concluding remarks As illustrated above, the theory of measurable multifunctions raises a great variety of problems involving probability theory and mathematical analysis arguments. It allows For every real a. we set a+ = max@. a). L
666 С. Hess for new points of view and is often at origin of new approaches for classical questions. Of course, we could have presented other topics of interest. Let us mention, for example, the measurability issues concerning the extreme points of multifunctions (see, e.g., [41, Chapter IV]). Further, measurable multifunctions depending measurably or continuously on a second parameter play a important role in several applications. We refer the reader to [34,35] and [165] for continuous dependence (Caratheodory multifunctions), and to Godet-Thobie [87] and Artstein [7] for measurable dependence. Concerning convergence problems, one can mention the weak convergence of multifunctions studied by Artstein [4], using two different approaches: one based on support functions, the other on the embedding technique. A result on large deviations for sums of random sets was proved recently by Cerf [42]. The theory of random sets has also beautiful and rich geometric aspects, especially when X is finite dimensional. Let us mention, for example, the books of Kendall and Moran [125], Matheron [148], Santalo [183], and that of Stoyan et al. [191]. In this context, a lot of interesting probabilistic and statistical applications have been given, for example, to Sterology and to Image Processing. In fact, the Statistics of random sets, which has connections with spatial stochastic processes, is a very lively domain. One can find works dealing with theoretical aspects, and others with engineering and numerical applications. Let us mention [ 148,191 ], Molchanov [151,152], Schmitt and Mattioli [ 187], Aubin [ 15], and Choirat and Seri [44]. The interested reader may consult the proceedings of the international symposium on "Advances in Theory and Applications of Random Sets", edited by D. Jeulin (World Scientific, 1997), where a large variety of recent papers are available along this line. References [1] R. Alo. A. de Korvin and R. Roberts. The optional sampling theorem for convex set-valued martingales. J. Reine Angew. Math. 310 A979), 1-6. [2] Z. Artstein, Set-valued measures. Trans. Amer, Math, Soc, 165 A972). [3] Z, Artstein, Distribution of random sets and random selections. Israel J. Math, 46 D) A983). [4] Z, Artstein, Weak convergence of set valued functions and control. SIAM J, Control Optim. 13 D) A975), 865-878. [5] Z. Artstein, A note on Fatou's lemma in several dimensions. J, Math. Econom. 6 A979). 277-282. [6] Z. Artstein, Parametrized integration of multifunctions with applications to control and optimization. SIAM J. Control Optim. 27 F) A989), 1369-1380. [7] Z. Artstein, Relaxed multifunctions and Young multimeasures. Set-Valued Anal. 6 C) A998). 237-255. [8] Z, Artstein and J,A. Bumes. Integration of compact set-valued functions. Pacific J. Math, 58 B) A975). 297-307, [9] Z. Artstein and J.C, Hansen, Conve.xification in limit laws of random sets in Banach spaces. Ann, Probab. 13A985), 307-309, [10] Z. Artstein and S, Hart, Law of large numbers for random sets and allocation processes. Math, Oper, Res, 6A981). 482^492, [11] Z, Artstein and R.A, Vitale. A strong law of large numbers for random compact sets. Ann. Probab. 3 A975), 879-882. [12] Z, Artstein and R. Wets. Approximating the integral of a multifunction, J. Multivariate Anal. 24 A988), 285-308, [13] Z, Artstein and R, Wets, Consistency of minimizers and the SLLNfor stochastic programs. J, Convex Anal. 2 A-2) A995), 1-17.
Set-valued integration and set-valued probability theory: An overview 667 [14] H. Attouch. Variational Convergence for Functions and Operators. Applicable Mathematics Series, Pitman Advanced Publishing Program, New York A984). [15] J.P. Aubin, Mutational and Morphological Analysis. Birkhauser. Boston A999). [16] J.P. Aubin and H, Frankowska. Set-Valued Analysis. Birkhauser. Boston A990). [17] R.J, Aumann. Integrals of set-valued functions. J. Math, Anal. Appl. 12 A965). 1-12, [18] S, Bagchi. On a.s. convergence of classes of multivalued asymptotic martingales. Ann. Inst, H. Poincare. Probab, Statist, 21 D) A985). 313-321. [19] E, Balder, A unifying note on Fatou's Lemma in several dimensions. Math, Oper, Res, 9 A984), 267-275, [20] E, Balder. A unified approach to several results involving integrals of multifunctions. Set-Valued Anal, 2 A994). 63-75. [21] E, Balder and С Hess. On the unbounded multivalued version of Fatou's Lemma. Math, Oper Res. 20 A) A995), [22] E, Balder and С Hess, Two generalizations of Komlos' theorem with lower closure-type applications. J, Convex Anal. 3 A996). 1-20. [23] A, Barbati. G. Beer and С Hess, J. Convex Anal, 1 A) A994), 107-119, [24] A, Barbati and С Hess. On the largest class of closed convex valued multifunctions for which Effros measurability and scalar measurability coincide. Set-Valued Anal. 6 A998). 309-326, [25] D, Barcenas and W. Urbina, Measurable multifunctions in nonseparable Banach spaces. SIAM, J. Math, Anal, 28 E) A997). 1212-1226. [26] G, Beer. A Polish topology for the closed subsets of a Polish space. Proc, Amer. Math, Soc. 113 A991). 1123-1133, [27] G. Beer. Topologies on Closed and Closed Convex Sets. Mathematics and Its Applications, Kluwer Acad, Publ., Dordrecht A993), [28] V.E. Benes, Martingales on metric spaces. Theory Probab, Appl. 7 A962). 81-82. [29] P, Billingsley, Convergence of Probability Measures. Wiley A968). [30] J.M. Bismut, Integrates convexes et probabiite's. J. Math, Anal, Appl, 42 A973). 639-673. [31] C.L. Byrne. Remarks on the set-valued integrals of Debreu and Aumann. J, Math. Anal. Appl. 62 A978), 243-246, [32] J.W.S. Cassels. Measure of nonconvexity of sets and the Shapley-Folkman-Starr theorem. Math. Proc, Cambridge Phil. Soc, 78 A975), 433^436, [33] С Castaing. Sur les multi-applications mesurables. R.I.R.O. 1 A967). 91-126. [34] C, Castaing, Sur /'existence des sections separement mesurables et separement continues d'une multifunction. Sem. Anal, Convexe de l'Universite de Montpellier, No. 14 A975), [35] C. Castaing. A propos de /'existence des sections separement mesurables et separement continuez d'une multifunction separement mesurable et separement semi-continue infe'rieureinent. Sem. Anal, Convexe de l'Universite de Montpellier, No, 6 A976), [36] C, Castaing, Me'thode de compacite' et de decomposition, applications: minimisation, convergence des martingales, lemme de Fatou multivoque. Ann. Mat. Рига Appl, D) 164 A993). 51-75. [37] C, Castaing. Weak compactness and convergences in Bochner and Pettis integration, Vietnam J, Math, 24 C) A996), [38] С Castaing and P, Clauzure. Lemme de Fatou multivoque. Atti, Sem, Mat, Fis, Univ. Modena 39 A991), 303-320. [39] C, Castaing and F, Ezzaki. Convergence of convex weakly compact random sets in super-reflexive Banach spaces. Preprint. Universite de Montpellier II A996). [40] С Castaing and M, Valadier. Equations differentielles muttivoques dans les espaces vectoriels localement convexes. R.I.R.O, 16 A969). 3-16, [41] С Castaing and M. Valadier. Convex Analysis and Measurable Multifunctions. Lecture Notes in Math,, Vol, 580. Springer, Berlin A977), [42] R, Cerf, Large deviations for sums ofi.i.d. random compact sets. Proc. Amer, Math. Soc, A998). [43] С Choirat. С Hess and R, Seri. A functional version of the Birkhoff ergodic theorem for a normal integrand: A variational approach. Working paper No. 0101. Centre de Recherche Viabilite. Jeux. Controle, Universite Paris Dauphine B000). to appear in Ann, Probab, [44] С Choirat and R, Seri, Confidence sets for the integral of random closed sets, in: Proceedings of the 8th International Conference IPMU. Madrid. July 3-7 B000). 509-514.
668 С. Hess [45] G. Choquet, Theory of capacities. Ann. Inst. Founer 5 A953/1954). 131-295. [46] A. Choukairi-Dini. M-convergence et regularite des martingales multivoques: epi-martingales. J. Multivariate Anal. 33A) A990). [47] A. Colubi. M. Lopez-Diaz. J.S. Dominguez-Menchero and M.A. Gil. A generalized strong law of large numbers. Probab. Theory Related Fields 114 A999). 401^417. [48] A. Coste. Contribution a la theorie de {'integration multivoque. These d'Etat. Universite Pierre et Marie Curie, Paris A977). [49] A. Coste, Sur les multimesures ? valeurs ferme'es bornees d'un espace de Banach. C. R. Acad. Sci. Paris 280A975). 567-570. [50] A. Coste. La propriete de Radon-Nikodim en integration multivoque. С R. Acad. Sci. Paris 280 A975). 1515-1518. [51] A. Coste, Sur les martingales multivoques. С R. Acad. Sci. Paris 290 A980). 953-956. [52] A. Coste and R. Pallu de la Barriere. Un theoreme de Radon-Nikodym pour les multimesures a valeurs fermees localement compactes sans droite. C. R. Acad. Sci. Paris 280 A975). 255-258. [53] A. Coste and R. Pallu de la Barriere. Sur I'ensemhle des selections d'une multimesure a valeurs fermees, C. R. Acad. Sci. Paris 282 A976). 829-832. [54] A. Coste and R. Pallu de la Barriere. Radon-Nikodxm theorems for set-valued measures whose values are convex and closed. Roczniki PTM Prace Mat. 20 A978). 283-309. [55] A. Coste and R. Pallu de la Barriere. On set-valued measures with convex closed values and their set of selections. Comment. Math. Spec. Issue 2 A979). 43-57. (Special issue dedicated to W. Orlicz.) [56] J. Couvreux and С Hess. A Levy type martingale convergence theorem for random sets with unbounded values. J. Theoret. Probab. 12 D) A999). [57] J. Couvreux and C. Hess. Approximation an sens de Mosco d'integrandes et defonctionnelles integrates. С R. Acad. Sci. Paris Ser. I 321 A995). 159-164. [58] J. Couvreux and C. Hess. Mosco approximation of integrands and integral functionals. J. Optim. Theory Appl. 90 B) A996), 335-356. [59] N. Cressie. A strong limit theorem for random sets. Adv. in Appl. Probab. Suppl. 10 A978). 36-46. [60] N. Cressie. Random set limit theorem. Adv. in Appl. Probab. Suppl. 11 A979). 281-282. [61] N. Cressie. A central limit theorem for random sets. Z. Wahrscheinlichkeitsth. verw. Geb. 49 A978). 37- 47. [62] G. Dal Maso. An Introduction to Г-Convergence. Birkhaiiser. Boston A993). [63] J.P. Daures, Version multivoque du theoreme de Doob. Ann. Inst. H. Poincare 9 B) A973). 167-176. [64] E. De Giorgi, Convergence problems for functions and operators: Proceedings of International Meeting on Recent Methods in Nonlinear Analysis (Rene. 1978). E. De Giorgi. E. Magenes and U. Mosco, eds, Pitagora. Bologna A979). 131-188. [65] G. Debreu. Integration of correspondences. Proceedings of Fifth Berkeley Symposium on Mathematical Statistics and Probability. Vol. II. Part I A966). 351-372. [66] G. Debreu and D. Schmeidler. The Radon-Nikadym derivative of a correspondence. Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability A971). [67] С Dellacherie and P.-A. Meyer. Probability et Potentiel. Chapitres I a IV. Hermann. Pans A975). [68] J. Diestel and J.J. Uhl Jr.. Vector Measures. Math. Surveys. Vol. 15. Amer. Math. Soc.. Providence. RI A977). [69] N. Dinculeanu. Vector Measures. Pergamon Press. Oxford A967). [70] W.L. Dong and Z.P Wang. On the representation and regularity of continuous parameter multivalued martingales, Proc. Amer. Math. Soc. 126 F) A998). 1799-1810. [71] H. Doss. Esperance mathematique et martingales dans un espace metrique. Laboratoire d'Analyse et Modeles Stochastiques. annee 1988-99 A988). 23-27. [72] S. Doss. Sur la moyenne d'un element aleatoire dans un espace distancie. Bull. Sci. Math. 73 A949). 1-26. [73] S. Doss. Moyennes conditionnelles et martingales dans un espace metrique. С R. Acad. Sci. Paris Ser. A 254 B2) A962). [74] L. Drewnowski. Additive and countably additive correspondences. Ann. Soc. Math. Polon. Comm. Math. 19A976). 25-54.
Set-valued integration and set-valued probability- theory: An overview 669 [75] N. Dunford and J.T. Schwartz, Linear Operators. Parti; General Theory. 3nd edn. Interscience Publishers Inc. Wiley, New York A966). [76] J. Dupacova and R. Wets. Asymptotic behaviour of statistical estimators and of optimal solutions of stochastic optimization problems. Ann, Statist. 16 D) A988), 1517-1549. [77] E.B. Dynkin and I.V. Evstigneev. Regular conditional expectations of correspondences. Theory Probab, Appl. 21 B) A976). 325-338. [78] K, El Amri and С Hess. On the Pettis integral of closed valued multifunctions. Set-Valued Anal. 8 B000). 329-360. [79] A, Es-Sahib and H, Heinich. Barycentre canonique pour un espace metrique a courbure negative. Preprint. UPRES-A 6085, Analyse et Modeles Stochastiques A998), [80] N, Etemadi, An elementary proof of the strong law of large numbers. Z. Wahrscheinlichkeitsth. verw. Geb, 55A981). 119-122, [81] I.V, Evstigneev, Regular conditional expectations of random variables depending on parameters. Theory Probab, Appl, 31 C) A986), 515-518. [82] I.V, Evstigneev. Measurable selections theorems and probabilistic control models in general topological spaces, Math, USSR Sb, 59 A) A988), [83] F, Ezzaki. Convergence des esperances conditionnelles d'ensembles aleatoires et LFGN. These d'Etat, Universite de Rabat, Morocco, June A996), [84] M. Frechet, Les elements aleatoires de nature quelconque dans un espace distancie. Ann, Inst, H, Poincare 10D) A948). 215-310, [85] E, Gine, M.G. Hahn and J. Zinn. Limit theorems for random sets; an application of probability in Banach space results. Probability in Banach Spaces IV, A, Beck and K, Jacobs, eds. Lecture Notes in Math.. Vol, 990. Springer. Berlin A983), 112-135. [86] С Godet-Thobie. Sur les multimesures de transition. C. R, Acad. Sci. Paris 278 A974). 1367-1369, [87] C, Godet-Thobie, Selections de multimesures. Applications a un theoreme de Radon-Nikodym multivoque. C. R, Acad, Sci, Paris 279 A974). 603-606. [88] С Godet-Thobie. Some results about multimeasures and their selections. Measure Theory at Oberwolfach 1979, Lecture Notes in Math, Vol, 794. Springer, Berlin A980). 112-116. [89] S. Hart and E, Kohlberg. Equally distributed correspondences. J, Math, Econom, 1 A976). [90] W, Herer. Esperance mathematique au sens de Doss d'une variable aleatoire ? valeurs dans un espace metrique. С R, Acad, Sci. Paris Ser. I 302 C) A986). [91] W. Herer, Esperance mathematique au sens de Doss d'une variable aleatoire ? valeurs dans un espace metrique. С R. Acad. Sci. Paris Ser. I 305 A987). 275-278. [92] W. Herer, Esperance mathematique au sens de Doss d'une variable aleatoire a valeurs dans un espace metrique. С R. Acad. Sci. Paris Ser. I 306 A988). 681-684. [93] W. Herer, Martingales of random subsets of a metric space of negative cunature, Set-Valued Anal. 5 A997), 147-157. [94] C. Hess, Theoreme ergodique et hi forte des grands nombres pour des ensembles aleatoires. C. R. Acad. Sci. Paris Ser. A 288 A979). 519-522. [95] C. Hess, Sur les multi-applications ? valeurs fermees dans un espace de Frechet. Partie 4: theoreme ergodique et loi forte des grands nombres. Cahiers Math, de la Decision, No. 7932, Universite Paris Dauphine A979), 1-27. [96] C. Hess, Loi de probabilite des ensembles aleatoires a valeurs fermees dans un espace de Banach separable, C. R. Acad. Sci. Paris 296 A983). [97] C. Hess, Loi de probabilite et independance pour des ensembles aleatoires ? valeurs fermees dans un espace de Banach, Sem. Anal. Convexe de l'Universite de Montpellier, No. 7 A983). [98] C. Hess, Quelques theoremes limites pour des ensembles aleatoires home's он поп, Sem. Anal. Convexe de l'Universite de Montpellier, No. 12 A984). [99] C. Hess, Loi forte des grands nombres pour des ensembles aleatoires поп homes a valeurs dans un espace de Banach separable. С R. Acad. Sci. Paris Ser I 300 A985), 177-180. [100] C. Hess, Mesurabilite. convergence et approximation des multifonctions a valeurs dans un e.l.c.s.. Sem. Anal. Convexe de l'Universite de Montpellier, No. 9 A985). [101] C. Hess, Quelques re'sultats sur la mesurabilite' des multifonctions ? valeurs dans un espace metrique separable, Sem. Anal. Convexe de l'Universite de Montpellier, No. 1 A986).
670 С. Hess 102] С. Hess, Theoreme de la convergence dominee pour Vintegrale et I'esperance conditionnelle des ensembles aleatoires поп home's et des integrandes, C. R. Acad. Sci. Paris Ser. I 306 A988), 139-142. 103] С Hess, Sur la loi de probabilite d'un ensemble aleatoire a valeurs fermees dans tin espace metrique separable. Sem. Anal. Convexe de I'Universite de Montpellier, No. 19 A991). 104] C. Hess, Com-ergence of conditional expectations for unbounded random sets, integrands and integral junctionals. Math. Oper. Res. 16 C) A991), 627-649. 105] C. Hess, Sur I 'existence de selections martingales et la convergence des surmartingales multivoques. C. R. Acad. Sci. Paris Ser. I 312 A991). 149-154. 106] C. Hess, On multivalued martingales whose values may be unbounded: martingale selectors and Mosco convergence, J. Multivariate Anal. 39 A) A991). 107] С Hess, Multivalued strong laws of large numbers in the slice topology. Application to integrands. Set- Valued Anal. 2 A994), 183-205. 108] C. Hess, Epi-convergence of sequences of normal integrands and strong consistency of the maximum likelihood estimator. Ann. Statist. 24 A996). 1298-1315. 109] C. Hess. On the almost sure convergence of random sets: martingales and extensions. Cahiers Math, de la Decision, No. 9743, CEREMADE A997). ПО] С Hess, Conditional expectation and martingales of random sets. J. Pattern Recognition 32 A999), 1543- 1567. Ill] C. Hess. The distribution of unbounded random sets and the multivalued strong law of large numbers in nonreflexive Banach spaces, J. Convex Anal. 6 A) A999), 163-182. 112] С Hess, The Doss integral for random sets. Comparison with the Aumann integral. Proceedings of the 8th International Conference IPMU. Madrid. July 3-7 B000), 515-520. 113] F. Hia'i, Radon-Nikodym theorems for set-valued measures. J. Multivariate Anal. 8A978), 96-118. 114] F. Hiai, Multivalued conditional expectations, multivalued Radon-Nikodym theorems, integral representation of additive operators and multivalued strong law of large numbers. Conference of Catania 1983, working paper. 115] F. Hia'i, Strong Law of Large Numbers for Multivalued Random Variables. Lecture Notes in Math. Vol. 1091, Springer, Berlin A984). 116] F Hiai, Convergence of conditional expectations and strong laws of large numbers for multivalued random variables. Trans. Amer. Math. Soc. 291 A985), 613-627. 117] F. Hia'i and H. Umegaki, Integrals, conditional expectations and martingales of multivalued functions. J. Multivariate Anal. 7 A977), 149-182. 118] W. Hildenbrand, Core and Equilibria of a Large Economy. Princeton University Press A974). 119] W. Hildenbrand and J.P. Mertens. On Fatou's Lemma in several dimensions. Z. Wahrscheinlichkeitsth. verw. Geb. 17A971), 151-155. 120] C.J. Himmelberg, Measurable relations. Fund. Math. 87 A975), 53-72. 121] C.J. Himmelberg, T. Parthasarathy and F.S. Van Vleek, On measurable relations. Fund. Math. Ill A981), 161-167. ' 122] M. Hukuhara, Sur I 'application semi-continue dont la valeur est un compact convexe. Funkcial. Ekvac. 10 A967), 43-66. [123] M. Hukuhara, Integration des applications mesurables dont la valeur est compacte et convexe. Funkcial. Ekvac. 10A967), 205-223. [124] A.D. Ioffe, Sun-eys of measurable selections theorems. SIAM J. Control Optim. 16 A978), 728-732, Russian literature supplement. [125] M.G. Kendall and P.A.P. Moran, Geometrical Probability. Charles Griffin, London A963). [126] M.A. Khan and Y. Sun, Integrals of set-valued functions with a countable range. Math. Oper. Res. 21 D) A996), 946-954. [127] H.A. Klei. Le theoreme de Radon-Nikodym pour des multimesures ? valeurs faiblement compactes, С R. Acad. Sci. Paris 297 A983), 643-646. [128] H.A. Klei, Compacite faible de parties decomposables de Ll(E). C. R. Acad. Sci. Paris 296 A983), 965-967. [ 129] H.A. Klei, A compactness criterion in L[(E) and Radon-Nikodym theorems for multimeasures. Bull. Sci. Math. B) 112A988), 305-324. [130] E. Klein and A. Thompson, Theory of Correspondences, Wiley, New York A984).
Set-valued integration and set-valued probability theory: An overview 671 [131] G. Krupa, Limit theorems for random sets. Thesis. University of Utrecht. November A998). [132] G. Krupa, Convergence of unbounded multivalued supermartingales in the Mosco and slice topologies. J. Convex Anal. 5A) A998). 187-198. [ 133] G. Krupa. Komlos theorem for unbounded random sets. Set-Valued Anal. 8 B000). 237-251. [134] R. Kruse. On the variance of random sets. Math. Anal. Appl. 122 A987). 469-473. [135] H. Kudo. Dependent experiments and sufficient statistics. Natur. Sci. Rep. Ochanomizu Univ. 4 A954). 151-163. [136] K. Kuratowski and С Ryll-Nardzewski. A general theorem on selectors. Bull. Polish Acad. Sci. Math. 13 A965). 397-603. [137] M. Lavie, These, Universite de Montpellier A990). [138] M. Lavie, Fonction caracteristique d'un ensemble ale'atoire. Applications a la convergence de martingales et de sommes d'ensembles aleatoires. С R. Acad. Sci. Paris Ser. I Math. 314 A992). 399^402. [139] M. Lavie, An extension of Korzeniowski's theorem for bounded multivalued martingales, Actas de las III Jomadas Zaragoza-Pau de Matematica Aplicada. Universidad de Zaragoza B000). [140] M. Lavie, Characteristic function for random sets and convergence of sums of independent random sets. Acta Math. Vietnamica 25 A) B000). 87-99. [141] S.J. Leese. Miiltifiinctions ofSuslin type. Bull. Austral. Math. Soc. 11 A974). 395^411. [142] S.J. Leese. Measurable selections in normed spaces. Proc. Edinburgh Math. Soc. 19 A974). 147-150. [143] S. Li and Y. Ogura, Convergence of set-valued and fuzzy-set-valued martingales. Fuzzy Sets and Systems 2498 101 C) A997), 139-149. [144] D.Q. Luu, Multivalued quasi-martingales and uniform amarts. Acta Math. Vietnamica 7 B) A982), 3-25. [145] D.Q. Luu, Applications of set-valued Radon-Nikodsm theorems to convergence of multivalued L -amarts. Math. Scand. 54A984), 101-113. [146] N.N. Lyashenko, On limit theorems for sums of independent compact random subsets in the Euclidean space. Zap. Nauchn. Sem. Leningrad Otdel. Mat. Inst. Steklov 85 A979). 113-128. [147] N.N. Lyashenko. Statistics of random compact sets of Euclidean space. Zap. Nauchn. Sem. Leningrad Otdel. Mat. Inst. Steklov 98 A980). 115-139. [148] G. Matheron. Random Sets and Integral Geometry. Wiley A975). [149] E.J. Mc Shane. A Riemann-type integral that includes Lebesgue-Stieltjes. Bochner and stochastic integrals, Mem. Amer. Math. Soc. 88 A969). [150] K. Meaya, Characterization of Gaussian random sets. Afrika Mat. 7 C) A997). 39-59. [151] I.S. Molchanov, Limit Theorems for Unions of Random Closed Sets. Lecture Notes in Math., Vol. 1561. Springer, Berlin A993). [152] I.S. Molchanov, Statistical problems for random sets. Random Sets. Theory and Applications. J. Goutsias, R.P.S. Mahler and H.T. Nguyen, eds. IMA Volumes in Mathematics and its Applications, Vol. 97. Springer, Berlin A997). [153] U. Mosco. Convergence of convex sets and solutions of variational inequalities. Adv. Math. 3 A971). 510-585. [154] U. Mosco, On the continuity of the Young-Fenchel transform. J. Math. Anal. Appl. 35 A971). 518-535. [155] K. Musiat, Topics in the theory of Pettis integration, in: School of Measure Theory and Real Analysis. Grado. Italy, May A992). [156] K. Musial, Pettis integral. Handbook of Mesure Theory. E. Pap. ed.. Elsevier. Amsterdam B002). 531-586. [ 157] J. Neveu, Martingales ? temps discret. Masson et Cie. Paris A972). English translation: J. Neveu. Discrete- Parameter Martingales. North-Holland. New York A975). [158] J. Neveu. Convergence presque sure de martingales multivoques. Ann. Inst. H. Poincare В 8 D) A972), 1-7. [159] H.T. Nguyen and N.T. Nguyen, A negative version on Choquet's Theorem on Polish spaces. East-West J. Math. 1A998). 61-71. [160] T. Norberg. Convergence and existence of random set distributions, Ann. Probab. 12 C) A984), 726-732. [161] ? Norberg. Random capacities and their distributions, Probab. Theory Related Fields 73 A986). 281-297. [162] R. Pallu de la Barriere, Quelques proprietes des multimesures. Sim. Anal. Convexe de l'Universite de Montpellier. No. 11 A973). [163] N.S. Papageorgiou. On the theory of Banach space valued miiltifiinctions I. Integration and conditional expectation. J. Multivariate Anal. 17 A985). 185-206.
672 С. Hess [164] N.S. Papageorgiou, On the theon; of Banach space valued multifunctions 2. Set valued martingales and set valued measures,]. Multivariate Anal. 17 A985), 207-227. [165] N.S. Papageorgiou. On measurable multifunctions with applications to random multivalued equations. Math. Japon. 32 A987), 437^464. [166] M.L. Puri and DA. Ralescu. Strong law of large numbers with respect to a set-valued probability measure, Ann. Probab. 11 D) A983). 1051-1054. [167] M.L. Puri and D.A. Ralescu, Limit theorems for random compact sets in Banach spaces. Math. Proc. Cambridge Philos. Soc. 97 A985). 151-158. [168] M.L. Puri and D.A. Ralescu. Gaussian random sets in Banach spaces. Theory Probab. Appl. 31 C) A986), 526-529. [169] P. Raynaud de Fine. These. Universite de Montpellier A990). [170] P. Raynaud de Fine, Theoreme ergodique ponctuel et lois fortes des grands nombres pour des points aleatoires d'un espace metrique a courbitre negative. Ann. Probab. 25 B) A997). 738-766. [171] H. Richter, Verallgemeinerung eines in der Stati.stik benotingen Satzes der Masstheorie, Math. Ann. 150 A963), 85-90 (corrections pp. 440-441 in the same volume). [172] H.E. Robbins. On the measure of a random set I. Ann. Math. Stat. 15 A944). 70-74. [173] H.E. Robbins. On the measure of a random set II. Ann. Math. Stat. 16 A945), 342-347. [174] R.T. Rockafellar. Integral Junctionals, normal integrands and measurable selections. Non-Linear Operators and Calculus of Variations. Lecture Notes in Math., Vol. 543. Springer. Berlin A976). [175] R.T. Rockafellar and R. J.-B. Wets. Variational systems, in multifunctions and integrands, G. Salinetti, ed.. Lecture Notes in Math., Vol. 1091. Springer. Berlin A984). 1-54. [176] D. Ross, Random sets without separability, Ann. Probab. 14 C) A986), 1064-1069. [177] J. Saint-Raymond, Topologie sur I'ensemble des compacts поп vides dun espace topologique se'pare. Seminaire Choquet, annee 1969/70, No. 21. [ 178] J. Saint-Pierre and S. Sajid. Parametrized integral of multifunctions in Banach spaces. J. Math. Anal. Appl. 239 A999), 49-71. [179] G. Salinetti. W. Vervaat and R. Wets. On the convergence in probability of random sets (measurable multifunctions). Math. Oper. Res. 11 C) A986). 420-422. [180] G. Salineni and R. Wets. On the comergence of sequences of convex sets infinite dimensions, SIAM Rev. 21A) A979), 18-33. [181] G. Salinetti and R. Wets. On the convergence of closed-valued measurable multifunctions. Trans. Amer. Math. Soc. 266 A) A981). 275-289. [182] G. Salinetti and R. Wets. On the convergence in distribution of measurable multifunctions (random sets). normal integrands, stochastic processes and stochastic infimo. Math. Oper. Res. 11 C) A986). [183] L. Santalo. Integral Geometry and Geometric Probability. Addison-Wesley. Reading, MA A976). [184] H.H. Schaefer, Topological Vector Spaces. Springer. Berlin A971). [ 185] D. Schmeidler. Fatou 's Lemma in several dimensions, Proc. Amer. Math. Soc. 24 A970), 300-306. [186] D. Schmeidler. Convexity and compactnesss in countably additive correspondences. Differential Games and Related Topics, North-Holland, Amsterdam A971). [187] M. Schmin and J. Mattioli, Morphologic mathematique. Rapport ASRF-92-2. Laboratoire Central de Recherche Thomson - CSF. Domaine de Corbeville 91404 Orsay cedex A992). [188] J. Serra. Image Analysis and Mathematical Morphology. Vol. I. Academic Press. London A982). [189] K. Schiirger. Ergodic theorems for subadditive superstationary families of convex compact random sets, ? Wahrscheinlichkeitsth. verw. Geb. 62 A983). 125-135. [190] Y. Sonntag and С Zalinescu, Set convergences: An attempt of classification. Proceedings of the Intl. Conf. on Diff. Equations and Control Theory. Iasi, Romania. August 1990. [191] D. Stoyan, WS. Kendall and J. Mecke, Stochastic Geometry and its Applications, 2nd edn. Wiley A985). [192] R.L. Taylor and H. Inoue, Laws of large numbers for random sets, in: Random Sets, Theory and Applications. J. Goutsias. R.P. Mahler and H.T Nguyen, eds. IMA Volumes in Mathematics and its Applications. Vol. 97, Springer, Berlin A997), 347-360. [193] D.S. Thiam, Multimesures positives. С R. Acad. Sci. Paris Ser. A 280 A975), 993-996. [194] D.S. Thiam. Integrate de Daniell a valeurs dans un semi-groupe ordonne. C. R. Acad. Sci. Paris Ser. A 281A975), 215-218.
Set-valued integration and set-valued probability theory: An overview 673 [195] D.S. Thiam. Applications, a /'integration multivoque. de Vintegrale de Daniell dans un semi-groupe ordonne, C. R. Acad. Sci. Paris Ser. A 280 A975). 955-958. [196] D.S. Thiam, Integrates multivoques monotones. С R. Acad. Sci. Paris Ser. A 280 A975). seance du 15 decembre 1975. [197] M. Tsukada, Convergence of best approximations in a smooth Banach space. J. Approx. Theory 40 A984), 301-309. [198] T. Uemura, A law of large numbers for random sets. Fuzzy Sets and Systems 59 A993), 181-188. [199] M. Valadier, Integration de convexesfermes. notamment d'epigraphes. R.I.R.O. R2 A970), 57-73. [200] M. Valadier, Multi-applications mesurables a valeurs convexes compactes. J. Math. Pures Appl. 50 A971), 265-297. [201] M. Valadier, Surl'esperance conditionnelle nmltivoque поп convexe. Ann. Inst. H. Poincare 16 B) A980), 109-116. [202] M. Valadier, Conditional expectation and ergodic theorem for a positive integrand. J. Nonlinear Convex Anal. 1 E) B000), 233-244. [203] B. Van Cutsem, Martingales de multiapplications ? valeurs convexes compactes. C. R. Acad. Sci. Paris 269A969), 429^432. [204] B. Van Cutsem, Elements aleatoires a valeurs сот-exes compactes. Thesis, Grenoble A971). [205] B. Van Cutsem, Martingales de convexesfermes aleatoires en dimensionfinie. Ann. Inst. H. Poincare В 8 D) A972), 365-385. [206] К. Vind, Edgeworth-allocations in an exchange economy with many traders. Intemat. Econom. Rev. 5 A964), 165-177. [207] R.A. Vitale, On Gaussian random sets. Stochastic Geometry, Geometric Statistics, Stereology, R.V Ambartzumian and W. Weil, eds, Teubner, Leipzig A984), 223-224. [208] D.H. Wagner, Surveys of measurable selections theorems. SIAM J. Control Optim. 15 E) A977), 859-903. [209] D.H. Wagner, Surveys of measurable selections theorems: on update. Measure Theory at Oberwolfach 1979, D. Kolzow, ed.. Lecture Notes in Math., Vol. 794, Springer, Berlin A980). [210] D.H. Wagner, Integral of a convex-hull-valued function. J. Math. Anal. Appl. 50 A975), 548-559. [211] Z.P. Wang and X.H. Xue, On convergence and closedness of multivalued martingales. Trans. Amer. Math. Soc. 341 B) A994), 807-827. [212] W Weil, An application of the central limit theorem for Banach-space-valued random variables to the theory of random sets. Z. Wahrscheinlichkeitsth. verw. Geb. 60 A982), 203-208. [213] R.A. Wisjman, Ccm-ergence of sequences of convex sets, cones and functions. Bull. Amer. Math. Soc. 70 A964), 186-188. [214] R.A. Wijsman, Convergence of sequences of convex sets, cones and functions II. Trans. Amer. Math. Soc. 123A966), 32^45. [215] N. Yannelis, Fatou's Lemma in infinite-dimensional spaces. Proc. Amer. Math. Soc. 102 A991), 303-310. [216] H. Ziat, Convergence des ensembles aleatoires et applications ? ?? loi forte des grands nombres multivoque. These de doctorat d'Etat, Universite Sidi Mohamed Ben Abdellah, Fes, Maroc B000).
CHAPTER 15 Density Topologies Wladyslaw Wilczynski Faculty of Mathematics, University ofLodi. Bimaiha 22. PL-90-238 Lodz. Poland Contents Introduction 677 1. Points of density of linear sets 678 2. Density topology 682 3. Approximately continuous functions 686 4. Density topology in Euclidean space 691 5. ? -density topologies on the real line 693 6. Another local property of measurable sets 700 References 700 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 675
This page intentionally left blank
Density topologies 677 Introduction The purpose of this chapter is to present basic properties of the density topology on Euclidean spaces with the special emphasis on the one-dimensional case. The limited length of the chapter does not allow to consider numerous differentiation bases which appear if the dimension of the space is greater than one. so we restrict ourselves to ordinary basis consisting of squares with sides parallel to the axes and to strong basis consisting of intervals (rectangles with sides parallel to the axes) in the plane E2. A good reference on differentiation bases, Vitali covering and density theorem is Bruckner A971) and de Guzman A975). Abstract density topologies are treated in Martin A964), Lukes et al. A986), Zajicek A986-87). A density topology on Euclidean «-space for general measure is studied in Troyer and Ziemer A963). The existence of the lower density operator in a finite measure space has been proved in Maharam A958). The family of 2' non-homeomorphic density topologies is considered in Aniszczyk and Frankiewicz A986). A good reference on density topology is Lahiri and Chakrabarti A991). Topologies generated by liftings are treated in Ionescu Tulcea and Ionescu Tulcea A969). Other generalization of the density topology one can find in Rusu A995). The notions of density point and approximately continuous functions have been defined at the beginning of XX century. The density topology appeared much later and very quickly became an important concept. In this chapter we start the presentation with the properties of single density points of a set and the lower density operator, proceed to study these properties of density topology which can be formulated and proved using the sets of density points only, then go to the properties of density topology which need the notion of approximately continuous functions (all on the real line). The fifth section is devoted to the brief presentation of the ordinary and strong density topologies on the plane. The next is again one-dimensional: we present some results concerning a wide class of topologies similar to density topology and constructed with the use of theorems of Taylor A959. 1960). At last we present a local property of measurable sets discovered by Lusin. Although it does not generate a density- type topology, it is interesting by itself and probably will not appear in another place in the Handbook. Throughout this chapter С will denote the ? -algebra of Lebesgue measurable subsets of the real line, I - the ? -ideal of null sets, ?* and ?* - the outer and inner Lebesgue measure, respectively. ? + ? = {t + ?: t e E),xE = {xt: t e E} for ? с R and ? e R. ??, Ъ and ?? will stand for the ?-algebra of Lebesgue measurable sets, ?-ideal of null sets and the Lebesgue measure on the plane R-. If ? is a topology, then the interior of a set A, closure of a set A and so on shall be denoted by r-int(A), r-cl(A) etc. We shall omit ? for a natural topology. The rest of denotations is the same as in the whole Handbook.
678 W. Wilczynski 1. Points of density of linear sets Let ? с ? be a Lebesgue measurable set. We say that a point jc() e ? is a density point of ? if and only if k(En[xQ-h,xQ + h]) , lim = 1. A.0) ft—0+ 2h We say that *? e ? is a dispersion point of ? if and only if xo is a density point of the complementary set ?' = ? \ E. Equivalently, xq is a dispersion point of ? if and only if lim ЦЕП[*о-*.ДГо + *])=0. A.0? л—о+ 2й Let Ф(?) = {? е ?: .? is a density point of E} if ? e ?. Then we have ?(?) ? ?(? \ ?) = 0, hence Ф(?)\?с(Е\?)\Ф(Е\?). A.1) The most important property of Ф(?) is described in the following theorem, called the Lebesgue Density Theorem: THEOREM 1.1. For any measurable set ? С ?. ?(???(?)) = 0. PROOF (According to Oxtoby A980)). By virtue of A.1) it is sufficient to show that ?(? \ ?(?)) = 0. Assume also that ? is bounded. We have ? \ ?(?) = Uf >o Ae< where ?(??[? -lux + h]) ? e ?: lim inf < 1 — ? л^о+ 2/7 Observe that Af] С Ae, if ?? > ?:. Suppose that X*(AF) > 0 for some ? > 0. Then there exists a bounded open set G D A f such that ? (G) < л/_!*/'. Let f be the family of all closed intervals / such that / с G and ?(? П /) ^ A - ?) ¦ ?(/). Then ? includes arbitrarily short intervals around each point of Ae (which means that ? is a Vitali cover of Af) and for any disjoint sequence {/и}яен of elements off we have r(AAU7")>0- (L2) Indeed, ?*(?? П Ц, /») < ?„ ?(? П /,,) <: A - ?) ?„ Ml·,) ? A - ?)?@ < ?*(?). Inductively we construct a special disjoint sequence {/„}„ег; of elements of ?. Let do be the least upper bound of the measures (= lengths) of members of ? and choose /? e ? such that ?(/|) > do/2. After choosing /| /„. let Z„ be the set of all elements of ? that are disjoint to /| /„. From A.2) it follows that ?„ is non-empty. Let d„ = sup{k(I): I e ?„}. Choose /,,+ i e Zn such that ?(/„+?) > d„/2. Put В =
Density topologies 679 Ae \ U(J /„. We have ?*(?) > 0 by A.2). Hence there exists a positive integer m such that f>/„)<^. A.3) и=ш+| For each ? > m let J„ denote the interval concentric with /„ such that ?G„) = 3?(/„). The family of intervals {Jn}n>o do not cover В by virtue of A.3). Take ? e В \ U,TL,„-h J»- Since ? & Ae \ Un'=i Ai, there exists an interval / e ?,„ centered at ? (recall that ? is a Vitali cover of ??). There exists ? > m such that / П /„ ? 0. Indeed, in the contrary case ?(/) ^ d„ < 2?(/„+?) for all n, which is impossible since ?^1| ?(/?) ^ ?@ < +oo. Let A: be the least integer for which / П h ? 0. Then к > m and ?(/) ? dk- \ < 2k(h). Hence ? as the center of / belongs to ? - a contradiction with .v ? U«lm+1 ^« · So ?(??) = 0 for each ? > 0 and finally ?(? \ ?(?)) = 0. The "density theorem" is often proved similarly as above by means which are more or less equivalent to Vitali's covering theorem (see Blumberg A918-19). Hobson A927), Lusin and Sierpinski A917), Khintchine A922), Sierpinski A923) or Zajicek A979)). It is worth mentioning that Lusin attributes the theorem to A. Denjoy (see Lusin A951), p. 220). It is also possible to use the Lebesgue Differentiation Theorem (Lebesgue, 1904, p. 124)): If F(x) = /av f(t)dt, where /: [a. b] -» R is summable. then F'(x) = f(x) a.e. on [a, b]. Indeed, observe that ? is a density point of Л с [а. b] if and only if lim = 1 A.4 ) ft—o+ h and г 4AO[x~h,x]) lim = 1. A.4 ) Л—0+ h If A.4') (respectively, A.4")) is fulfilled we say that ? is a right-hand (respectively, left- hand) density point of A. Now apply the Lebesgue Differentiation Theorem to f = ??. From A.4') and A.4") it follows immediately that x is a density point of А с К if and only if к(АГ)[х-И:.х + 1ц]) lim = 1. A.5) hx — 0+ h\ + hi Л? —0+ h\ +/n >0 The condition A.5) is sometimes written in the form .. ?(??/„) lim =0. «-3C ?(/„)
680 W. Wilczyiiski for each sequence {/,;}„ек of intervals convergent to ? (which means that ? e P|„ /„ and ?(/„) ,,^c 0), For a detailed analysis of the exceptional set in the Lebesgue Density Theorem (that is, of the set where the density of a given set differs from 0 and 1), see Besicovitch A957). It is natural to understand limits in A,0') and A.0") in the sense of Cauchy (?-? language) or, if we assume AC in the sense of Heine: For each sequence {/г„}„е^ of positive numbers convergent to 0 k(An[x-h„,x+h„]) hm = 1, (lo) и-» ос 2/г„ for each sequence {/i„}„epj of positive numbers convergent to 0 k(An[x-h„,x + hn]) , hm =0, A.6 ) н-юс 2h„ THEOREM 1,2. //{?„}„,=n is a decreasing sequence convergent to 0 such that sup,, j^- = с < oo, then ? is a dispersion point of A if and only if k(An[x-t„,x+tn]) hm =0. A./) PROOF, If ? is a dispersion point of A. then A,7) is obviously fulfilled. Suppose now that A,7) holds and take an arbitrary sequence {/г„}леК of positive numbers convergent to 0. For each ? e N there exists w„ e N such that t,„n + \ ^ h„ < t„,n. We have also ????,,^?? mn = oo. Since 0 < ?<?? [*-*»¦* +h„]) < ЦАП[х-!,„„,?+ tm„]) 1hn ^ 2i,„,] + | _ k(An[x-tm,:,x + tmJ) r,„„ < ?(?? [*-?„,„,.?+ ?,„„]) c 2tm„ tin,, ¦*- ? 2?„,n we have lim,,^ ^n|.r-ft,,-v+ft„|) = о, so A,6") holds. Of course, similarly one can prove that a single sequence suffices for a density point. However, the condition sup,,^^, -^ < oo is essential, as the following example shows: Let {i,i}nepj be a decreasing sequence convergent to 0 such that sup,,^^ j^- = oo .Choose a subsequence {m-heN f°r which limbec r^- = oo and put A=( UK+l'A+i ·4])?( Ut"A+i -Ч.-4+?] ¦ Then we have (for ? = 0) ?(??[-?„,?„]) hm =0, ?— ? 2t„
Density topologies 681 while Hm ??[-?+? ¦4.A+i-'»J) = , i—oc 2y/t,lli + \ -t„k So Theorem 1.2 together with the above remark can be formulated in the following form: A.0") is equivalent to A.7) if and only if sup,, ™- < oo. From Theorem 1.1 if follows immediately that if ? is a Lebesgue measurable set, then Ф(Е) is also measurable. In fact, Ф(?) is a Borel set, moreover, Ф(?) е F„s- Indeed, ? e Ф(Е) if and only if for each ? e N there exists к eN such that for each h e ]0, k~' [ wehaveX(?n[;t-/2,jt + /2])/B/2) ^ 1 -и-1. Hence ад=Пи ? ?* zieN A-eN /re J0.A--' [ б^(ЕП[х-и + *])^_д., 2h Since the set ueR: ^ 1 - ? 2h is closed for each ? e N and /г > 0, the conclusion follows. If ? с К is an arbitrary set (not necessarily measurable), then we say that .vo e ? is an outer density point of ? if and only if ,. k*iEn[x-h.x + h]) , hm = 1. /1—0+ 2/? where ?* denotes, as usual, the outer Lebesgue measure. Let ?*(?) = {? e ?: ? is an outer density point of ?}. Then we have ?*(?) = ?(?), where D D ? is a measurable cover of ?. The above equality follows from the fact that for each interval / e ? we have ?*(? П /) = ?(?) П /). So taking into account that ? \ Ф*(?) сО\ ?{?) we obtain from Theorem 1.1 that ?(? \ Ф*(?)) = О, which means that almost each point of an arbitrary set ? с ? is the point of outer density of ?. However, one cannot expect that a\so ?(?*(?)\?) = 0, this is impossible for any non-measurable set ? С ?, since ?*(?) (= ?(?)) is measurable. Consider the following sequence of equivalent statements for a set A e C: (A) 0 is a density point of A; (B) Нт|,_>эсЛ(АП[-л-|,л-|])/B|1-|)= 1; (C) ?????_>00?((?·?)?[-1,1]) = 2; (D) {x(„.A)n[-i.i)},,eN convergestox[_U] in measure; (E) for every increasing sequence {итЬ»ег; of positive integers there exists a subsequence {птр}р€щ such that lim/,_>TCXi(H,„/),4)p,[-i.n) = X[-i.ij a-e- The equivalences of (A) to (D) are straightforward, while the equivalence of (D) and (E) follows from a classical theorem of Riesz concerning convergence in measure on a finite
682 W. Wilczynski measure space. Observe that in the condition (E) the measure is absent and the only important thing is the ?-ideal of null sets. This gives a possibility to define, among others, the category analogue of the density topology. For details see Wilczynski A984-85), Ciesiel- ski et al. A994). Compare also Tall A976) to see why it is not a "precise analogue" of the density topology. Using the condition (E) and taking into account that a sequence of non-measurable functions may converge almost everywhere to a measurable function one can define a density-type topology not included in С This was done in Ciesielski and Larson A993) and Wilczynski A984) leading to a topology with quite different properties. 2. Density topology Let ? (A) again denotes the set of all density points of a Lebesgue measurable set А С К. We shall also write A ~ ? if and only if ?(???) = 0 for measurable sets A.Scl. THEOREM 2.1. The mapping ? : С -> 2" has the following properties: @) for each A eC, Ф(А)еС, A) for each A eC, А~Ф(А), B) for each A, Be С, if A~ B, then Ф(А) = Ф(В), C) Ф@) = 0, Ф(К)=К, D) for each A, BeC, Ф(АП В) = Ф(А)ПФ(В). PROOF. @) In fact Ф(А) e F„s, so ? (A) is even a Borel set. A) This is the Lebesgue Density Theorem. B) If A ~ ?, then for each interval/ с Ш wehaveX(A„n/) =?(??\?) and the equality follows. C) Obvious. D) Observe first that it А, с A2, A,, A2 e C, then Ф(А\) с Ф(А:), so ?(? ? ?) С ? (А) П ?( ?). To prove the opposite inclusion, use the equality ?(/??) ?(/??) ?(/????) — - + — - 1 ^ — -. НП НП HD which holds for each interval / с К. The mapping ? having the above properties is usually called a lower density. Observe also that ? is idempotent, i.e., ?(?) = ?(?(?)) for A e С. This follows immediately from A) and B). Now we are ready to define the density topology. Put ? = {A e C: Ac ?(?)}. THEOREM 2.2. ? is a topology in the real line R.
Densit\ topologies 683 PROOF. 0, R e ? by virtue of C). From D) it follows that is closed under finite intersections. The only trouble is to prove that ? is closed under arbitrary unions (recall that ? с С and С is closed only under countable unions). Take a family {At}t€T с Т. We have obviously А, с ?(?,) for each t e T. Choose a sequence {г„}„ем such that for each r e ? we have ?(?, \ \J^L] aO = 0· It is possible because the Lebesgue measure on R fulfills the countable chain condition. Then we have Ф(А,) с ?I |Ja,„ J. for each re T. Hence ОС / ОС \ [JAtnc\jA, C\J0(A,)C<P( \JA,,X n=\ t?T teT \n=l / The first and the last set in the above sequence of inclusions differ by a nullset and both are measurable, so {jt€T A, e С Also LUr A' c ^'LUr A>) by virtue of the central inclusion and the monotonicity of Ф, so finally \Jt€T A, &T. ? The density topology was defined in Haupt and Pauc A952, 1954). However, these papers had almost no impact. Some years later two papers: Goffman et al. A961) and Goffman and Waterman A961) attracted the interest to the density topology. It is worth mentioning that approximately continuous functions (the topic of our next paragraph) were introduced by A. Denjoy A915. 1916) without relation to any topology on the real line. The density topology can be also described in the following form: ? = {Ф(А)\Р: А е С and ? e I}. Indeed, if A e T, then А с ? (A), so A = ? (A) \ (?(?) \ A) and ?(?(?) \ A) = 0. We have also ?(?) \ ? с Ф(Ф(А) \ ?), since ?(?(?) \ ?) = ?(?(?)) = ?(?) (compare Oxtoby A980)). THEOREM 2.3. ? is essentially stronger than a natural topology on the real line. PROOF. For each open interval ]a. b\ we have ]a. b[ с Ф( ]a,b[). To find a T-open set which is not open in the natural topology consider a set of the form R\ E, where ? is a nullset. THEOREM 2.4. A point ? e R is a T-limit point of a set А С R (not necessarily measurable) if and only if k*(An[x-h.x+h]) lim sup > 0. /1^0+ 2/2 PROOF. Follows from the definition of a T-limit point.
684 W. Wilczynski THEOREM 2.5 (Ostaszewski A982)). For an arbitrary set А С R IntTA = ???(?). where В is a measurable kerne! of A. PROOF. Let ? e IntTA. Then there exists a set U e ? such that и e U and U С A. So ? e Ф([/). Since С/ \ В с Л \ В and С/ \ S e ?, we have ?(?/ \ ?) = 0. Hence <P(U) = 0(U П ?) с ?(?) and ? e ? ? ?(?). Suppose now that jc e А П ?(?). Then ? U {x} С A and ? U {x} e ?, so ?(? U {.r}) = ?(?). Hence ? e (? U {*}) ? ?(? U {.?}). Since ?((? U {*}) ? ?(? U {.?})) = ?(? U {*}) П Ф(Ф(В U {*})) = ?(? U {*}), the set (? U {*}) ? ?(? U [?]) is T-open. So there exists a T-open set including ? and included in A. hence ? e Intx A. D THEOREM 2.6 (Scheinberg A971)). A set A e ? is 7 -regular open if and only if A = ?(?). PROOF. Let A be T-regular open, i.e., A = IntT(Clr(A)). Then A = ClT(A) ? Ф(С1Т(Л)) = Clx(A) П Ф(А) = ?(?). The first equality follows from Theorem 2.5, the second from the fact that C\T(A) ~ A (because C\T{A) = R \ IntT(R \ A) and Intx(R \ A) ~ R \ A) and from Theorem 2.1 B). the third from the inclusion Ф(А) с Clx(A) (compareTheorem 2.4). Suppose now that A = Ф(А). We have IntT(ClT(A)) = C1T(A) ? Ф(С1Т(А)) = Clx(A) П Ф(А) = ?(?) = ?. The verification of the first, the second and the third equalities is exactly as above, the fourth is simply our assumption. D The set ? e R is a T-neighbourhood of a point ? e R if and only if there exists a set Av e ? such that ? e AA and Ax с ?. From the inequality ?*(? П [? - h,x + h]) ^ ?( ?? П [? - h, ? + /?]), which holds for all h > 0, it follows that ? is a T-neighbourhood of ? if and only if ,. ?*(??[?-?,? +h]) THEOREM 2.7. ? is a T-neighbourhood of each of its point if and only if ? e Т. PROOF. Suppose that ? is a T-neighbourhood of each ? e ? and let AA e ? be the set such that ? e Av and А, с ?. Then ? = {JxeL Ax and ? e ? by virtue of Theorem 2.2. The converse is obviously true. THEOREM 2.8 (Oxtoby A980), Tall A976)). I = {А С R: A is 7'-nowhere dense set] = {А С R: A is ?-first category set] = {А С R: A is ?-closed ?-discrete set]. PROOF. The first equality. If A e I. then R \ A e T, so A is T-closed. We have Int(A) = А П Ф(А) = А П 0 = 0, hence A is T-nowhere dense. Suppose now that A is T-nowhere
Density topologies 685 dense. Then Cl-j-(A) is also T-nowhere dense, T-closed (and Lebesgue measurable). So IntT(ClT(A)) = C1T(A) П Ф(С1Т(А)) = 0, hence C1T(A) e I (compare Theorem 2.1 A)) and finally A el. The second equality. The family of T-nowhere dense sets is a ?-ideal, so it coincides with the family of T-first category sets. The third equality. If А с I, then for ? e A the set (R \ A) U {x} is a T-neighbourhood of x, so А П ((R \ A) U {?}) = {?} is а Тд-neighbourhood of ? (Тд is a subspace topology on A) and A is discrete. A is also closed, as we saw earlier. Suppose now that A is T-closed T-discrete and A ? I. Then there exists a non- measurable set ? с ?. ? is not T-closed (neither Тд-closed), because T-closed (and Тд-closed) sets are Lebesgue measurable - a contradiction. ? THEOREM 2.9 (Scheinberg A971)). ? ?-algebra ofT-Borel sets coincides with С PROOF. If A e ?, then A = (А П ?(?)) U (A \ ?(?)). The first set belongs to T. while the second is the nullset, hence a T-closed set. Hence each Lebesgue measurable set is a T-Borel set. Since T-open sets are measurable, the same is true for T-Borel sets. Finally the family of T-Borel sets coincides with ? -algebra of Lebesgue measurable sets. D THEOREM 2.10. If ? cRisa 7-compact set, then ? is finite. PROOF. Suppose that ? с R is infinite. Let ?() с ? be a countable infinite subset of E. Then {(R \ ?()) U {jrH.vet,, is a T-open cover of ? without finite subcover. D THEOREM 2.11. The space (R. T) is neither first countable, nor second countable, nor Lindelof, nor separable. PROOF. Take ? e R and let {En}„€\-, be a sequence of T-open neighbourhoods of .v. For each ? e N choose xn e E„ \ {x} and put ? = E\ \ {x„: « e N). Then ? is also a T-open neighbourhood of ? which does not include any E„, so (R. T) is not first countable. Hence it is not second countable, too. Let С be the Cantor set. Then {(R \ C) U {.v}}vec is a T-open cover of R without any subcover of the power less than continuum, so (R. T) is not Lindelof. Finally, it is not separable because all countable sets are T-closed. D THEOREM 2.12. (R, T) is a Baire space. PROOF. The countable union of T-nowhere dense set is the set of Lebesgue measure zero (by virtue of Theorem 2.8). A surprising and very useful (e.g., in proving monotonicity theorems) property of density has been discovered by O'Malley. THEOREM 2.13@'Malley( 1972)). //Ac]0, 1 [ is a set oftype F„ which has left density equal to 1 at all its points. Then there exists xq e [0. 1 [ \A such that A has right density 1 atx0.
686 W. Wilczymki Much more set-theoretic properties of the density topology one can find in Tall A976) and Lukes et al. A986). 3. Approximately continuous functions If / is a real function of a real variable, then we say that / is approximately continuous at a point xq if and only if there exists a set АЛA е С such that х0еФ(АХи) and /(*,,) = Шг^ /(л) C.1) ле.4,„ (Denjoy, 1915, 1916). The approximate upper limit of / at x(> (limsupapv_A.()/(*)) is the greatest lower bound of all the numbers v(+oo included) for which xq is a point of dispersion of the set {x e R: /(л) > у}. Similarly one can define liminfapA^V|i f(x). When these extreme limits are equal, their common value is called the approximate limit of / at Jto and denoted by limapv_v /(-*)· It is natural to say that / is approximately continuous at xq if and only if /(*o)= Um ap/U). C.2) x-*xu This definition was introduced also by Denjoy A916). THEOREM 3.1. The above two definitions of the approximate continuity at a point are equivalent. PROOF. Observe that f(xo) = limapv_V|i f(x) if and only if for each ? > 0 xo is a density point of the set {x e R: \f(x) - f(xo)\ < ?]. Suppose now that there exists a set AV|) e С fulfilling C.1), then for each ? > 0 there exists ? > 0 such that {x e R: \f(x) - f(xo)\ < ?] D AXoC\ ]xo - ?, xo + <5[ and C.2) follows. To prove the converse implication we shall need the following fact (which is also interesting by itself): if {Е„},,ец is a decreasing sequence of measurable sets and xq is a density point of each ?„, then there exists a strictly decreasing sequence {?„}„е^ of positive numbers convergent to 0 such that xq is also a density point of the following set: ОС A*u = U ( E" \ ^° ~k"-x0 + kn[)- n = \ This condition is called the condition (J2) of Jedrzejewski A973/74). Let { ?»}neN be a sequence of positive numbers less than 1 convergent to 0. For each ? e N there exists <5„ > 0 such that for h e ]0, <5„ [ we have k(E„n]xo-h,x0+h[) 2h >1"?"· Choose a decreasing sequence {h,,},,^; convergent to 0 such that h„ e]0, <5,,[ for each ? e N and put A:,, = ?„ -h„+\. If h e [h„^\.h„[ for some ? e N then we have
Density topologies 687 k(AXon]xo-h,x0 + h[) ?((?„ П ]дг() - h.xQ + h[)- ]xq - k,„xo + k„[) 2/2 " 2/2 _ ?(??„? ]x0- h.xo + h[)- 2s„h„+i 2/2 ^ ! ?„ > 1 - 2?„ 2/г and the conclusion follows. (See also Bruckner A978). Theorem 5.6, p. 23; Thomson A985), Ex. 14.1, p. 27.) Suppose now that for each ? > 0 xq is a density point of the set {x e R: \f(x) - f(x0)\ < ?}. Put E„ = {x e R: \f(x) - f(x0)\ < j;} and consider the set A.V|) described above. Then /(*o)= Hm /(*), .veA,() so / is approximately continuous in the sense of Denjoy. ? The second definition of approximate continuity has a strict connection with the density topology T. Namely, a real function / of a real variable is approximately continuous at a point xq if and only if it is continuous at a point xq as a function from the topological space (R, T) into a real line equipped with the natural topology. As usual, we shall say that a function /: R —> R is approximately continuous if it is approximately continuous at each point. In this case we have obviously /_l (G) e ? for each open (in the natural topology) set G с R. There is a strict connection between measurability of real function and approximate continuity on a big set. To this aim we shall need a classical theorem (Lusin A912); see also Oxtoby( 1980)). THEOREM 3.2. A real function f defined on R is measurable if and only if for each ? > 0 there exists a set ? e С such that ?(?) < ? and the restriction of f toR\E is continuous. We can require in the above theorem that the set R \ ? is closed (or perfect). THEOREM 3.3 (Denjoy A916), Stepanoff A924)). A real function f defined on R (or on a set ? e ?) is measurable if and only if f is approximately continuous almost everywhere on R (on E, respectively). PROOF. If / is a measurable function, then there exists a sequence {F,,},,^- of closed sets such that f\F„ is continuous for each ? e N andX(R\F„) < «"'.Then / is approximately continuous on the set U„eN(F„ ni»(F„)) and k(R \ \Jll€U F„ n<P(F„)) = 0. Suppose now that / is approximately continuous almost everywhere. Take a e R and put В = [?: f(x) < a}. We want to show that В is Lebesgue measurable. If С is the set where / is approximately continuous, then we have В = (В П С) U (? \ С) and ?(? \ С) = 0. So it is sufficient to show that В П С is measurable. If ? e В П С, then there exists a set Av e С such that ? e ?(??) and f(x) = lim;-. f(t). Since f(x) < a, we can choose
688 W. Wilczyiiski Ax С В (taking, if necessary, АхГ\]х — ?, ? + ?[ for sufficiently small ? > 0), moreover, we can assume that A.v с С (recall that ?(? \ С) = 0). Then В П С = \JxeBncAi and the measurability follows as in Theorem 2.2. ? Theorem of Lusin (Theorem 3.2) cannot be improved. The statement: "if /:R —>¦ R is measurable, then there exists a set ? such that ?(?) = 0 and /|R \ ? is continuous" is false. Indeed, let A e С be the set such that A and R \ A have positive measure in every interval. Then for / = ?? the restriction of / to the complement of any nullset is discontinuous at each x e R. To study further properties of the density topology we shall need following facts (Denjoy A916), see, for example, Bruckner A978)). We shall formulate theorems for functions defined on R but they are obviously true also for functions defined on intervals (closed, open or half-open). THEOREM 3.4. // /:R —>¦ R is appmximately continuous and locally bounded, then f(x) = F'(x) for each x e R, where F(x) = f* f (t) dt for fixed a e R. PROOF. Since / is approximately continuous, it is measurable by Theorem 3.3. From local boundedness it follows that F is defined on the whole real line R. Fix *? e R· We want to show that f(x0)=\im- f(t)dt. C.3) A-o h J_Xu Let AV() e С be the set such that xo e Ф(АХ{)) and /|AV|) is continuous at *o· Let M = sup{|/(f): ге[дг0-1,*о+1]}. Then we have I 1 [,x«+h | 1 f° f(t)dt-f(xQ)U | f \f(t)dt-f(x0)\dt •Avo I " ^[л-|,.Л|,+А]ПД1() / •/|.v()..V(. + ^/ |/(i)dr-/(jto)|dr. The first term on the right-hand side of the above inequality tends to 0 with h —>· 0, because |/(?) - f(xo)\ tends to zero, while the second tends to zero because ?([*?,^o + h]\ Axo)/h tends to 0 and the function under the integral sign is bounded by 2M (for h < 0 [xq, xq + h] means obviously [xq + h. xq]). ? THEOREM 3.5. /// is approximately continuous at xq and g is continuous at f(xo), then g о / is approximately continuous at xq. PROOF. Apply the definition of approximate continuity. THEOREM 3.6. ///:R —>¦ R is approximately continuous, then it is Darboux-Baire one function (/ eDB\).
Density topologies 689 PROOF. If / is locally bounded, then it is a derivative by Theorem 3.4. It is well-known that a derivative is Darboux-Baire one and the thesis follows. If / is not locally bounded, then let h be a homeomorphism of R onto @, 1). Since ho f is bounded and approximately continuous, by Theorem 3.5, so ho f e DB\.But / = /?_l o(hof) and finally / e DB\.D THEOREM 3.7. The family of ?-connected set coincides with the family of sets connected in the natural topology. PROOF. We shall prove that each bounded open interval (in the natural topology) is T- connected. For remaining types of intervals (bounded or not) the proof is similar. Suppose that ]a, b[ is not T-connected. Hence ]a, b[= A2 U A2, where A \, A2 e T. Put f = ??]. A function / is obviously approximately continuous and has not the Darboux property - a contradiction with Theorem 3.6. To finish the proof observe that since a density topology is strictly greater than a natural topology, the classes of connected sets fulfill the converse inclusion and the equality follows. ? It is obvious that (R, T) is a Hausdorff space. To establish that the space (R, T) is completely regular, we shall need the theorem which is traditionally connected with the names of Lusin and Menchoff. THEOREM 3.8 (see Bruckner A978)). Let ? С R be a Lebesgue measurable set (or a Borel set) and let F с ? be a closed set such that F с ? (?). There exists a perfect set ? such that F С ? С ? and F с Ф(Р). THEOREM 3.9 (Zahorski A950)). Let E\.E2, ? be pairwise disjoint subset of Ж such that E\ U F2 U # = R, E\ U #, F2 U # e ? and both sets are of type F„. Then there exists an approximately continuous function /:R —>¦ R such that f(x) = 0 for x e E\, 0 < f(x) < 1 for x e ? and f(x) = 1 for x e F2. The method of the proof reminds that of the Urysohn lemma. THEOREM 3.10 (Goffman et al. A961)). The space (R, T) is completely regular. PROOF. Let F с R be a T-closed set and xo i F. Let A be a G& set such that А э F and ?(? \ F) = 0. Put F| = A, E2 = {x0} and Я = R \ (F| U F2). Then F, U ? = R\ {x0} and F2 U ? = R \ A are both T-open and Fs. Hence by virtue of Theorem 3.9 there exists an approximately continuous function such that f(x) = 0 for x e E\ (E\ э F) 0 < f(x) < 1 for x e ? and f(x0) = 1. ? THEOREM 3.11. The space (R, T) is not normal. PROOF. Suppose that this space is normal. Let F|. F2 be disjoint sets which are dense in R and ? (F1) = ?( F2) = 0. Then E\.E2 are T-closed. By virtue of Urysohn lemma there
690 W. Wilczxmki exists an approximately continuous function / such that f(E\) = {0} and /(?2) — {1}· Then / is discontinuous everywhere - a contradiction with the fact that / is Baire one. D It is worth mentioning that if A, В с R are disjoint countable sets (possibly dense), then it is possible to find disjoint T-open sets C. D such that А с С and В с D (Foran, 1991). There are other topologies related to the density topology and to some subclasses of the family of approximately continuous functions. A set Л с К is called ambivalent if it is both an Fa set and a Gs set. Finite unions, finite intersections and complements of ambivalent sets are ambivalent sets. A function /: R —>¦ R is ambivalent if and only if for each a e R the sets {x: f(x) > a] and {x: f(x) < a] are ambivalent. Obviously each ambivalent function is Baire one, however, the converse is not true. A function / is Baire* one if and only if for every non-empty closed set С there exists an open interval (a. b) such that ]а,МПС^0 and the restriction of / to C, f\c, is continuous on ]a. b[ DC. A function / is approximately differentiable at xq if and only if there exists limA _>Л|1 ар -'(Л()~-/(Л|>) and is finite. Of course, if / is approximately differentiable at xQ, then it is also approximately continuous at this point. At last, let Л be the family of all sets from ? which are ambivalent. We shall denote by r (after O'Malley) the coarsest topology making the approximately differentiable functions continuous. THEOREM 3.12 (O'Malley A977)). The family ? is a basis for r. Moreover, r is simultaneously the coarsest topology for which either the Baire* one approximately continuous functions or ambivalent approximately continuous functions are continuous. O'Malley A977) has defined another topology in the following way: a set A e ? is called almost open if and only if ?(?) = X(Int(A)). THEOREM 3.13 (O'Malley A977)). The family of almost open sets forms a topology (calleda.e. topology). THEOREM 3.14 (O'Malley A979)). The a.e. topology is the coarsest topology for which approximately continuous functions which are almost everywhere continuous are continuous. Theorem 3.15 (O'Malley A977)). a.e. ^r^T. THEOREM 3.16 (O'Malley A977, 1979)). Both topologies r and a.e. are completely- regular but not normal. For further properties of topologies a.e. and r see the above quoted papers of O'Malley. One more "density type" topology has been studied by M. Filipczak and ? Fil- ipczak A988). They considered the family %? = {4>(G) \ P: G is open in the natural topology and Pel]. THEOREM 3.17 (Filipczak and Filipczak A988)). %,p is a topology essentially stronger than a natural topology, essentially weaker than a densit}· topology, essentially stronger than a.e. topology and incomparable with the r topology.
Density topologies 691 An interesting example of topology has been given by Wojdowski A989). He defined an operator Фс on the class of B„ sets having the Baire property in the following way: ФС(А) = <P(G(A)), where G(A) is a regular open part of A. Recall that the set A e B„ if and only if A = GAP, where G is open (in the natural topology) and ? is of the first category. Among all possible representations of this type there is one for which the set G is the greatest and then it is regular open set, which we have denoted above by G(A). The topology of Wojdowski equals the Hashimoto topology generated by the a.e. topology and the ?-ideal of sets of the first category (that is the topology of the form {A \ P: A e a.e. and ? is of the first category). Sarkhel and De A981) while studying integrals of the Perron type have introduced a notion of sparse set. A set ? с R is said to be sparse at a point ? e R on the right if and only if for each ? > 0 there exists к > 0 such that each interval ]a. b[ с ]x. ? + k[ with a - ? < к ¦ (b - x) contains at least one point у such that ?*(? П ]x. y[) < ?(? - ?). Sparseness on the left is defined similarly. A set ? is said to be sparse at ? if and only if it is sparse at ? on the right and on the left. We shall apply this notion to measurable sets only. Let Sr(E) = {x e R: R\E is sparse atjc). THEOREM 3.18 (Filipczak( 1988-89)). For each ? e ?, Ф(Е) С S,,(E). THEOREM 3.19 (Filipczak A988-89)). The operator S,, :? -> 2?- is a lower density operator and the family 7V, = {E e ?: ? С Sp (?)} is a topology stronger than a density topology. For further properties of the topology 7}, see Filipczak A988-89). Most of them are similar to that of the density topology. 4. Density topology in Euclidean space Tall A976) says: "Many of the topics we touch upon can be treated in more general measure-theoretic structures than the real line, but this does not appear to be particularly fruitful topologically". However, in Euclidean space of dimension greater than one several differentiation bases (see de Guzman A975)) are considered, from which at least two seem to be interesting for the construction of density topologies. For simplicity of denotation we shall present basic facts in R2, the generalization to R" (n ^3) being (in general) obvious. We say that a point jto, }'o e R1 is an ordinary density point of the set A e ?2 if and only if ,. k2(Ar)([xo-h,xo + h]x[y0-h.yi)+h])) lim ? = 1. D.U ) A—o+ 4h- The condition D.1) is equivalent to the following one к2(АГ\([хо-И„,Х(,+И„] lim n^oc 4h„k„ ?2(??)([?0 -h,,,xQ +h„] ? [yo-k„. yo+?„])) lim = 1. D.0 )
692 W. Wilczyiiski for each pair of sequences of positive numbers {й„}„еь:, {?»ЬеК tending to 0 and for which there exists a number a e]0, 1 [ (called the parameter of regularity) such that a < h„ ¦ k~' < a~' for each ? e N. Let ??(?) denotes the set of all ordinary density points of a set A e ?2 and let 7г = {A e ?2·' А с Фо(А)}. It turns out that Фо is a lower density operator and T> is a topology on the plane, which means that two-dimensional versions of Theorems 2.1 and 2.2 are true. The (ordinary) approximate continuity of a real function of two real variables at a point (xq, )?) defined in a similar way as in one-dimensional case either as a restrictional (or "path") continuity or as continuity with respect to Ъ- Again both definitions are equivalent. Also all remaining theorems from Section 2 and all but two (Theorem 3.6 and Theorem 3.7) from Section 3 have their exact analogues is Euclidean space of dimension greater than one. In Theorem 3.6 the first Baire class is the only one which can be similarly proved for Ш". ? ^ 2. In Theorem 3.4 one should consider the derivative of two-dimensional rectangle function (indefinite integral). THEOREM 4.1. If f :R- —> Ш is (ordinarily) approximately continuous, then it is Baire one function. In the literature there are numerous (not equivalent) definitions of Darboux property for functions of several variables (see, for example, Gibson and Natkaniec A996-97)). One of the most natural says that a function / has the Darboux property if and only if it transforms connected sets onto connected sets. However, the exact analogue of Theorem 3.7 does not hold. For example, the linear segment / = [0.1] ? {0} is connected in the natural topology on the plane, but it is not in topology Ъ. Indeed, / = (АП/)и(ВП/), where A = {(x, у): у ?. 0 or (у > 0 and (у < -?2 or у > ?2))} and В = {(?, у): у > 0 and —?2 < у < ?2} are disjoint 72-open sets. However, the following theorem holds: THEOREM 4.2 (Goffman and Waterman A961)). Every open (in the natural topology on R") connected set is T2-connected. We say that a point (jt0, y0) e R2 is a strong density point of the set A e ?2 if and only if Ь2(АГ\([хо-Ь,х0 + Ь]х[у0-к,уо+к])) Inn — 1. D.1) A—0+ Ahk Obviously if (jto, yo) is a strong density point of A. then it is also an (ordinary) density point of A. A point @,0) is a density point of the set A in the above decomposition of /, but it is not a strong density point of A. Let Ф.АА) denotes the set of all strong density points of a set A e ?2 and let % = {A e ?2: А с ФХ(А)}. Again Ф5 is a lower density operator and % is a topology on the plane (called the strong density topology) stronger than a natural topology and weaker than 72, so theorems similar to Theorems 2.1, 2.2 and 2.3 are true in this case. However, the proof of the Lebesgue Density Theorem is essentially different, (see Riesz A934), Saks A937)). The reason is that there exists a set A e ?2 and a family of intervals A such that A covers A in the Vitali sense (i.e., for each ? e A there exists a sequence {/,i}„eN
Density topologies 693 of intervals from ? such that ? e I„ for ? e N and diam(/„) -- 0) having the property that for each sequence {?-keN of pairwise disjoint intervals from ? ?(? \ \Jk Л) > О (see Caratheodory A927) or Jeffery A962)). Than means that the analogue of Vitali covering theorem does not hold for intervals (without the assumption on parameter of regularity). All counterparts of Theorems 2.4-2.12 are also true for Ts, essentially with similar proofs. The situation is different for analogues of theorems from Section 3. Again one can introduce two definitions of strong approximate continuity at a point - one restrictional (or "path") and the second topological (with respect to %). Observing that (л"п, vo) e K2 is a strong density point if and only if ?-,(??)?) lim — - = 1, diam(/)->O ?2(/) (л-0..П))е/ where / denotes the interval centered at (ло.уо). it is possible to repeat the proof of Theorem 3.1 with necessary changes. We have theorems similar to Theorems 3.3, 3.4 (the proof uses the theorem of Jessen et al. A935) on strong derivation of the indefinite integral), Theorem 3.5 (for a real function g of a real variable), Theorem 3.6 (only the part concerning Baire one class). Since % is included between the natural topology in the plane and T>, the same can be said on classes of connected sets with respect to these topologies (in the inverse order). However, % behaves much worse with respect to separation axioms than 7?. THEOREM 4.3 (Goffman et al. A961)). The topology % is not regular. Hence % is not the coarsest topology relative to which each strongly approximately continuous function is continuous. Let % = {A e dr. А с ФЛ(А) and for each л. у e ? the sets A.v = {у: (x.y) е A], Ay = {x: (x,y) e A) are Lebesgue measurable (as linear sets), moreover Ax с ?(??), ?? СФ(АУ)}. Theorem 4.4 (Goffman et al. A961), O'Malley A972)). % is a topology on E2 strictly coarser than % and such that each strongly approximately continuous function is continuous with respect to %. It is not known whether % is completely regular. O'Malley A972) observed, that the product of two density topologies on the real line (which is obviously coarser than %) is too small to make all strongly approximately continuous functions continuous. 5. ?-density topologies on the real line S. Ulam in the Scottish book in 1937 has posed the problem: if ? e ?, does there exist a real function ? :E+ —>¦ E+ (dependingon ?), increasing, such that lim,_o+ ^@ = 0 a°d
694 W. Wilczyiiski lim Mrn/) =0 E.1) ?,/,^?+?(/)·^(?(/)) .ve/ a.e. on ? (here / denotes a closed interval). The affirmative answer to this question has been given by Taylor A959, 1960). In the sequel С will denote a class of continuous increasing functions ? : R+ —>¦ R+ such that lim,^?+^@ =0. For technical reasons it is more convenient to consider the following condition: k(ET)[x-h,x+h]) n ,„ .,. lim =0. E.1) /1^0+ 2hVBh) It does not change the scope of studies because of the following fact THEOREM 5.1 (Wagner-Bojakowska and Wilczynski B000)). For each A e С and each ? eRthe condition E.1) is fulfilled for a function ? eC ifandonly //E.?) isfiilfilledfor a function ?\ e С defined by the formula ?\(?) = ?B~? ¦ t). Theorem 5.2 (compare Taylor A959), Terepeta and Wagner-Bojakowska A999)). For each ? e С there exists a function ? e С such that E.Г) isfiilfilledfor all ? e ? except for a subset of Lebesgue measure zero. PROOF. Observe that E.?) is fulfilled if and only if for each к eN there exists ? e N such that for each/2 e]0.p-'[ we have ;-(?'n|-'~/'-v+/l" sC k'2. If we put ? k(E'f)[x-h.x+h]) . , _,r Ek.,, = \x e E: ^ к - for each h e JO. ? [ then ? П Ф(?)=р|У ?*.,,. E.2) k=\ ,,= \ Since the set [ _ ?(?'?[? -h,x + h]) ,_-. , , п. t г 1 Ak.p =\xeR: — < к - for each /2 e JO, ? ' [ is closed for each k, ? e N and ?*.,, = ? ? ?*./;, all sets Ek.,, are measurable. Moreover, for each k, ? e N, ?*.,, с ?*.,,+ ! С ?. By virtue of E.2) we have ?({]^=, ?*.,,) = ?(?), so \im,,->x WEk.,,) = ?(?) for each к е N. Choose an increasing sequence {р*Ьек of Then we have natural numbers such that k(Ek ,,,) > ?(?) - 2 k for each ieN. ?( f) ?,-.„, ) > ?(?) - JT2-'= ?(?) - 2-(Ь w'=)t / ?=*
Density topologies 695 SO / DC DC - un- \A=I/=A k(\immfEk.pk)=k[ I Jf }Ej.lh )=k(E). Let^:R+^R+ be a function defined in the following way: ? B- pk . ') = (k - l)_l, linear and continuous on the intervals [2 · /»^i|.2 · pj~]] for к = 2.3 and any continuous increasing function on the half-line [2 · p-T1 · +oo[. Take jc e liminfA Fa.w· There exists кц e N such that ? e F,.№ for all / ^ ?(). if /г е [pr+'p ?"'], then ?B?) ^i'x and Л(?'П[дс-й,дс+й]) /-: ._, < r=/ . ???/^??. 2?·?B?) /-' Hence iimA^0+M?nj^±M=Oa.e. on F. ? At the end of this section we shall see also that the continuity of ? is not essential for the construction of a density-type topologies. However, the problem whether does exist a function ? independent on ? has the negative solution (Taylor, 1959) THEOREM 5.3. For an arbitrary function ? e С and a fixed number a e]0, \[ there exists a perfect set ? С [0, 1] such that k(E) =? and k(E'C\[x-h,x + h\) lim sup = +oo, ,;^o+ 2h4>{2h) for each ? e E. PROOF. Put ak = ?(? ¦ 2~k). A sequence {ak}k€\\ decreases to zero. Let {?·}?·???|0| be a decreasing sequence convergent to 0 such that ro = 1 - a and lirru-^ rk · ??~_!, = oo. Let Fq — [0, 1]. Suppose that we have constructed F(). F| Fa·, where i: e N U @) and the set Fk consists of 2* pairwise disjoint closed intervals of equal length. To obtain F;._n we subtract from each component of Fk a concentric open interval of length (r; — ?+?) ·2~ . So by induction we have defined a descending sequence of closed sets {?*?·}?-6??|??- Put ? = ????^·· ^ 's clearly a Cantor sets of measure a. Simultaneously we have k(Fk)=a + rkmdk(Fk\E) = rk. Take ? e F. Let P> be a component of Fk such that .v eft for к e N U {0}. Then {Fi-}*eN is a decreasing sequence of closed intervals. From the construction it follows that k(Pk) = (a+ rk) ¦ 2'k and ? (Fa \ F) = rk ¦ 2'k. Since Нгпа^-х П = 0, there exists A:, e N such that for к ^ к\ we have rk < a. Hence for к ^ к\ the following inequality holds: ?((? + rk)- 2'k) <: ? (a ¦ 2~ik-]') = a*.,. So ?(?'??*) гА-2"* ^ rk к(Рк)-Ф(к(Рк)) (а+гк)-2-к-Ф((а + гк)-2-к) 2а ¦ ak.
696 W. Wikzxn.sk) for к ^k ?. Let Pk = [x—hk,x + hk] be the smallest closed interval including Pk for к eN. Then {PjthgN is a decreasing sequence of closed intervals such that {?} = ??? ^ (since X(Pk) <^2X(Pk)).SoioTk^kl we obtain ЦЕ'ПРк) > к(Е'ПРк) 2hk-VBhk) 2к(Рк)-ФBк(Рк)) П ¦ 2-k rk 2 · (? + rk) ¦ 2~к ¦ ? ((а + ?) ¦ 2~к) ?? ¦ ак-\ Hence X(E'n[x-hk,x+hk]) hmsup =+oo. ,-. k^J 2hk-WBhk) ? Let ? eC.Wc say that ? is a ? -density point of ? e С if and only if ,. k{E'n[x-h,x + h]) n hm = 0. A^o+ 2h4>Bh) The set of all ? -density points of ? we shall denote ??(?). THEOREM 5.4 (Terepeta and Wagner-Bojakowska A999)). For each ? eC the mapping ?? : С —> 2Ш has the following properties: @) for each AeC, ?? (A) e ?, A) for each A e ?, ФФ(А)сФ(А), B) for each А, В е С, ifA~B, МепФФ(А) = ФФ(В), C) ??@) = 0, ??(?) = ?, D) for each ?,? eC, ??(?? ?) = ??(?)???(?). PROOF. @) similarly as for the operation ? one can prove that ??(?) e F„s, for each AeC (the proof does not use the assumption of continuity of ?). A), B) and C) follow from the definition. D) follows from the inequality ?((? ? ?)' ??[? - h,x + h] ? ?(?' ? [? - h, ? + h]) + k(A'n[x-h,x + h]). From Theorem 5.3 it follows that ?? is not a lower density operator for any function ? e C, because the Lebesgue Density Theorem is not fulfilled. However, with each function i^eCwe can associate a topology which we shall call ? -density topology in the following way: Theorem 5.5. For each ? e С the family ?? = {A e С: А С ?? (A)} is a topology on the real line, stronger than the natural topology and weaker than the density topology T. PROOF. From Theorem 5.4 A) we conclude that ?? с Т, which implies that the union of an arbitrary subfamily of ?? is a Lebesgue measurable set. The rest of axioms of topological space is easy to verify.
Density topologies 697 Each set of the form Ш \ E, where ? is dense in R and ?(?) = 0, belongs to ?? but not to the natural topology. The set ? ? ? (?), where E is constructed in the proof of Theorem 5.3, belongs to T, but not to ??. ? The set ? с К is a ?? -neighbourhood of a point ? e ? if and only if there exists a set Ax e ?? such that jc e Ал and Av С ?. Hence ? is a ?? -neighbourhood of ? if and only if ,. X*(ET)[x-h,x + h]) lim = 0. /i^o+ 2h4>Bh) Similarly as Theorem 2.7 one can prove: THEOREM 5.6. ? is ? ?? -neighbourhood of each of its points if andonly if? e ??. The formula for ?? -interior of an arbitrary set А с К is much more complicated than in the case of the density topology. To state the theorem we need the following denotation for a measurable set А: Ф$ (A) = A; if a ^ 1 is a countable ordinal having the predecessor, ? =/3+1, then Фф(А) = ФФ(Ф^(А)) and if ? is a limit countable ordinal, then ??(?) = ?[?<?<???(?). It turns out that a transfinite sequence {??(?)}\<??<?] of sets is descending. Moreover Theorem 5.7 (Wagner-Bojakowska and Wilczynski B000)). Fix ? e C. For each set А С К there exists a countable ore В С A is a measurable kernel of A. А С К there exists a countable ordinal ? ^ 1 such that Int^ (A) = А П ??(?) where An ordinal ? is the smallest ordinal such that ??(?) = ??(?) for each countable ordinal ? > ?. In connection with this we shall say that a countable ordinal <*o A ^ ao < ?\) is the ? -order of the set A e С if and only if ? ? (A) = ? ?'(A) for each a > a() and ???(?) \ ???°(?) ? 0 for each ?, 1 <; ? < a0. Theorem 5.8 (Wagner-Bojakowska and Wilczynski B000)). Fix ? e С For each countable ordinal a ^ 1 there exists a set A e С with the ?-order equal to a. Let ? с IR be a perfect nowhere dense set of positive Lebesgue measure such that ??(?) = 0 (as in the proof of Theorem 5.3). Then ? is ??-closed and Intx^(?) = 0, so ? is ??-nowhere dense, so Theorem 2.8 does not hold in this case. However, the following property is true (compare Theorem 2.4): THEOREM 5.9. A point ? e R is a ??-limit point of a set А сШ (not necessarily measurable) if and only if ?*(An[x-h,x + h]) lim sup > 0. ,,^o+ 2h4>Bh)
698 W. Wilcztfski For each ? e С the topology ?? has a lot of properties similar to that of T. We shall mention only some of them: Theorem 5.10 (Terepeta and Wagner-Bojakowska A999)). If ? cR is ? ??-compact set, then ? is finite. Theorem 5.11 (Terepeta and Wagner-Bojakowska A999)). The space (R. ??) is neither first countable, nor second countable, nor Lindelof nor separable. Theorem 5.12 (Terepeta and Wagner-Bojakowska A999)). The family ofTi* -connected sets coincides with the family of sets connected in the natural topology (and in T). PROOF. Follows from the inclusion: (top. nat.) с ?? с Т. There are also essential differences between the properties of ? and ??. THEOREM 5.13 (Wagner-Bojakowska B001)). The space (R. ??) is of the first category. PROOF. Let ? be a perfect nowhere dense set of positive measure such that ??(?) = 0. Put D = \J,.€Q(E + r). Then it is not difficult to show that ?(? \ D) = 0. So R = (R \ D) U U,- e q (? + r) is of the first category as the union of countably many ?? -nowhere dense sets. To establish that the space (R, ??) is completely regular for each ? &C we shall need the notion of ? -approximate continuity. As in Section 3 we can introduce the notion of ? -approximate continuity of a function ? : R —>¦ R either using the "restricted" definition or the "topological" definition (compare definitions of Denjoy). THEOREM 5.14. Both definitions oj ?-approximate continuity at a point are equivalent. The proof uses the fact that for ? -density points the condition (J?) of Jedrzejewski is also fulfilled (Wagner-Bojakowska B001)). Theorem 5.15 (Terepeta and Wagner-Bojakowska A999)). A function /:R -+ R is measurable if and only if there exists a function ? e С such that f is ? -approximately continuous almost everywhere. PROOF. Observe that if {f,,},^^· is a sequence of measurable sets, then there exists a function ? eC such that ?(?„ \ ФФ(Е,,)) = 0 for each ? e N. Indeed, from Theorem 5.2 for each ? e N there exists a function ?„ e С such that ?(?„ \ ??? (?,,)) = 0. If ? e С is a function for which there exists <5„ > 0 such that ?(?) > ?„(?) for all t e]0, <$„[, then ???(?) с ??(?) for each measurable set A and the conclusion follows. Now let {]йнА<[)неМ be a basis for the natural topology in R and put E„ = /~' (]a„,b„[). If D(f) is the set of all points where / is not ? -approximately continuous, then D(f) с U/JeN(?„ \?? (?н))· Hence ?(?)(/))=0 for a function^ е С described above. D
Density topologies 699 So the class of all measurable functions coincides with the union of all classes of functions which are ? -approximately continuous almost everywhere. However, the union of all ? -density topologies does not behave so nicely. Theorem 5.16 (Wagner-Bojakowska and Wilczynski B00..)). U* eC ?* § ?- Recall that the Hashimoto topology on the real line with respect to I is the family O* = {Up: U is open in the natural topology and Pel}. (Hashimoto, 1976.) Theorem 5.17 (Terepeta and Wagner-Bojakowska A999)). П*еСТ* = °*- The family {??}?^ of all ? -density topologies is partially ordered by inclusion. The structure of this partially ordered set is rather complicated. Observe that if ?\ (f) ^ ?2(?) fori e]0,<5[, where <5 is some positive number, then ??, с ??,. For each sequence {<^„}„е;; of functions from С there exist two functions ?. ? e С such that for each /; e N there exists ?„ > 0 such that ?(?) < ?„(?) < ?(?) fori e]0.<5„[. Hence for each sequence {??„}„€\., of ? -density topologies there exist two topologies ?? and T= such that ?? с ??„ с Т= for all /; е N. Simultaneously different functions from the class С can generate the same topology. This is the case, for example, for arbitrary function ? e С and a function ?\ e С defined by the formula ?\(?) = a ¦ ?(?), where aeR+. Let ?\, ?2 e С and к e N. Put A+ = {x eR+: ?{B?) < к'1 ?2B?)), Bk+ = {x eE+: ?2B?) <k~l ?{B?)}. Ak = ?^ U (~??), Ви = Bf U (-Bf). The following theorem gives necessary and sufficient condition for ??, = ??,. Theorem 5.18 (Wagner-Bojakowska and Wilczynski B000)). Let ?\,?2 e С The topologies ??, and ??: are equal if and only if\m\k-*xBk = linu·-»^ Щ = 0, where k(Akf)[-h,h]) j ,. k(Bkn[-h.h]) Sk = lim sup and m = lim sup — :———— · ft^o+ 2/2-^,B/2) A_o+ 2h4t2Bh) From the above theorem it follows (among others) that there is not need to use increasing (not necessarily continuous) functions ? : Ш^ -> R+ such that lim, _o+ ^ @ = 0, because in this way we shall not obtain any new topologies. Namely, we have the following THEOREM 5.19 (Aversa and Wilczynski B000)). // <^:E+ -> K+ is an increasing (not necessarily continuous) function such that lim,_*o-r ^@ = 0> then there exists two functions ?\,?2<?0 such that ?\ (t) ^ ? (t) sC ?2(?) for t e @, 2] and ??, = ??:.
700 W. Wilczynski 6. Another local property of measurable sets Lusin in this dissertation has proved the following theorem Theorem 6.1 (Lusin A951), Wheeden and Zygmund A977)). For each set A e C, А С [0, 2?], the integral ???(? + {)-??(?-{)?(= Hm rXA(X+t)-XA(X-t)dt F1) ? ' F-0+Л t exists and is finite for almost every ? e [0. 2тг]. The above theorem shows a kind of symmetry of a measurable set, because at the same time in Lusin A951, Commentary 117), one can find an example of a set A e ?, А С [0,2?], for which f '*'(*+0-*'<*-OI</, =+00. 1ога.е.л-е[0.2л]. Jo ' Obviously, if F.1) holds at some point x, then ? can be neither a density, nor a dispersion point of the set A. Conversely, there exists a set A e С and a point ? e ?(?), for which the integral F.1) is infinite (Lusin, 1951. Commentary 118). One can find interesting density theorems independent of Lebesgue Density Theorem in Cs6rnyeiA998). References Aniszczyk, B. and Frankiewicz. R. A986). Non-homeomorphic density topologies. Bull. Acad. Pol. Sci. 34 C-4). 211-213. Aversa, V. and Wilczynski. W. B000). ?-density topology for discontinuous regular functions. Atti Sem. Mat. Fis. Univ. Modena 48, 473^*79. Besicovitch, A. A957). On density of linear sets. J. London Math. Soc. 25. 170-178. Blumberg, A. A918-19). A theorem on linear point sets. Bull. Amer. Math. Soc. 25. 350-353. Bruckner. A.M. A971), Differentiation of integrals. Amer. Math. Monthly. No. 12 of the Herbert Ellsworth Slaught Memorial Papers. Bruckner, A.M. A978), Differentiation of Real Functions. Lecture Notes in Math.. Vol. 659, Springer. Berlin. Caratheodory, С A927), Vorlesungen iiber reelle Funktionen. Leipzig. Ciesielski, K. and Larson, L. A993). Refinements of the density and I-density topologies. Proc. Amer. Math. Soc. 118B), 547-553. Ciesielski, K., Larsen, L. and Ostaszewski. K. A994), 1-density continuous functions. Mem. Amer. Math. Soc. 515. Csomyei, M. A998), Density theorems revisited. Acta Sci. Math. (Szeged) 64. 59-65. Denjoy, A. A915), Memoire sur les derives des functions continues. J. Math. Pures Appl. 1. 105-240. Denjoy, A. A916). Sur les functions derivees sommables. Bull. Soc. Math. France 43, 161-248. Filipczak, M. and Filipczak. T. A988). On some topology related to metric density. Rad. Mat. 4. 299-307. Filipczak, T. A988-89), On some abstract density topologies. Real Anal. Exchange 14 A). 140-166. Foran, J. A991), Fundamentals of Real Analysis. Dekker. New York. Gibson, F.G. and Natkaniec. T. A996-97), Darboux like functions. Real Anal. Exchange 22 B). 492-533.
Density topologies 701 Goffman, C. Neugebauer, C.J. and Nishiura. T. A961). The density topology and approximate continuity. Duke Math. J. 28, 497-506. Goffman, С and Waterman. D. A961). Approximately continuous transformations. Proc. Amer. Math. Soc. 12, 116-121. de Guzman, M. A975), Differentiation of Integrals in R-. Lecture Notes in Math.. Vol. 481. Springer. Berlin. Hashimoto, H. A976), On the * topology and its application. Fund. Math. 91. 5-10. Haupt, O. and Раис. С A952), La topologic approximative de Denjoy em-isagee comme vraie topologic. C. R. Acad. Sci. Paris 234, 390-392. Haupt, O. and Раис, С A954). Uber die durch allgemeine Ableitungsbasen bestimmten Topologien. Ann. Mat. Рига Appl. Ser. IV. 36, 247-271. Hobson, E.W. A927). The Theory of Functions of a Real Variable and the Theory of Fourier Series. Vol. I. 3rd. edn, Cambridge. Ionescu Tulcea, A. and Ionescu Tulcea. С A969). Topics in the Theory of Lifting. Springer, Berlin. Jedrzejewski, J.M. A973/74), On limit numbers ofreal functions. Fund. Math. 83 C), 269-281. Jeffery, R. A962). The Theory of Functions of a Real Variable. University of Toronto Press. Jessen, В., Marcinkiewicz. J. and Zygmund, A. A935). Note on the differentiability of multiple integrals. Fund. Math. 25. 217-234. Khintchine. A. A922). А пек proof of the basic theorem on metric set theory, Proc. of Ivanovo-Vozn. Technical Institute. No. 6, 39^1 (in Russian). Lahiri, B.K. and Chakrabarti. S. A991), Density topology. Math. Student 59, 89-108. Lebesgue. H. A904). Legons sur Г integration et la recherche des functions primitives. Paris. Lebesgue, H. A910). Sur I 'integration des fonctions discontinues, Ann. Ecole Norm C) 27. 361^50. Lukes. J., Maly\ J. and Zajicek, L. A986), Fine Topology Methods in Real Analysis and Potential Theory. Lecture Notes in Math.. Vol. 1189. Springer. Berlin. Lusin. N.N. A912), Sur les proprietes des fonctions measurables. C. R. Acad. Sci. Paris 154, 1688-1690. Lusin, N.N. A951). Integral and Trigonometric Series. Moskwa (in Russian). Lusin, N.N. and Sierpiiiski, W. A917). Demonstration elementaire du theoreme fundamental sur la densite des ensembles. Rend. Circ. Mat. Palermo 42. 167-172. Maharam. D. A958). On a theorem of von Neumann. Proc. Amer. Math. Soc. 9. 987-994. Martin. N.F.G. A964), A topology for certain measure spaces. Trans. Amer. Math. Soc. 112. 1-18. O'Malley. R.J. A972), Density properties of functions and sets. Thesis. Purdue University. O'Malley. R.J. A977). Approximately differentiable functions: the r topology. Pacific J. Math. 72 A). 207-222. O'Malley. R.J. A979), Approximately continuous functions which are continuous almost everywhere. Acta Math. Acad. Sci. Hungaricae 33 C-4). 395^02. Ostaszewski. K. A982). Continuity in the density· topology. Real Anal. Exchange 7 B). 259-270. Oxtoby, J.С A980). Measure and Category. Graduate Texts in Mathematics. Vol. 2. Springer. Berlin. Riesz. F. A934), Sur les points de densite au sens fort. Fund. Math. 22. 221-225. Rusu, D. A995). Density-type topologies. An. StiinJ. Univ. "Al. I. Cuza" Mat. 41. 269-285. Saks. S. A937). Theory of the Integral. Monografie Matematyczne. Warsaw. Sarkhel, D.N. and De. A.K. A981). The proximalIу continuous integrals. J. Austral. Math. Soc. Ser. A 31. 26-45. Scheinberg, S. A971). Topologies which generate a complete measure algebra. Adv. Math. 7. 231-239. Sierpinski, W A923). Demonstration elementaire du theoreme sur la densite des ensembles. Fund. Math. 4. 167-171. Stepanoff. WW A924). Sur tine propietecaracte'ristique des fonctions measurable. Mat. Sb. 31 D). 87^89. Tall. F.D. A976), The density topology. Pacific J. Math. 62 A). 275-284. Taylor, S.J. A959), On strengthening the Lebesgue Density Theorem. Fund. Math. 46, 305-315. Taylor, S.J. A960). An alternative form of Egoroff's theorem. Fund. Math. 48. 169-174. Terepeta, M. and Wagner-Bojakowska. E. A999), ?-density topologies. Rend. Circ. Mat. Palermo 48. 451^76. Thomson, B.S. A985). Real Functions. Lecture Notes in Math.. Vol. 1170. Springer. Berlin. Troyer, R.J. andZiemer. WP. A963), Topologies generated by outer measures. J. Math. Mech. 12C). 485^94. Wagner-Bojakowska, E. B001). Remarks on ?-density topology. Atti Sem. Mat. Fis. Univ. Modena 49. 79-87. Wagner-Bojakowska. E. and Wilczynski. W. B000). The interior operation in ? ?-density topology. Rend. Circ. Mat. Palermo 49. 5-26.
702 W. Wilczynski Wagner-Bojakowska. E. and Wilczynski. W B000). Comparison of ?-density topologies. Real Anal. Exchange 25B). 661-672. Wagner-Bojakowska. E. and Wilczynski. W. B00..). The union of all ?-density topologies, in preparation. Wheeden. R. and Zygmund. A. A977). Measure and Integral. Marcel Dekker. Wilczynski. W. A984). Remarks on density topology and its category analogue. Rend. Circ. Mat. Palermo B) Suppl. No. 5. 145-153. Wilczynski. W. A984-85). A category analogue of the density topology, approximate continuity and the approximate derivative. Real Anal. Exchange 10. 241-265. Wojdowski. W. A989). Density topology involving measure and category. Demonstratio Math. 22 C). 793-812. Zahorski. Z. A950). Sur la premiere derivee. Trans. Amer. Math. Soc. 69. 1-54. Zajicek. L. A979). An elementary proof of the one-dimensional density theorem. Amer. Math. Monthly 86. 297- 298. Zajicek. L. A986-87). Porosity. X-densitv topology and abstract density topologies. Real Anal. Exchange 12. 313-326.
CHAPTER 16 FN-Topologies and Group-Valued Measures Hans Weber Universita di Udine, Dipartimento di Matematica e Infornuitica. via delle Science 206. 33100 Udine. Italy E-mail: weber@dimi.imiitd.it Contents Introduction 705 1. Definition and generation of FN-topologies 706 2. Exhaustivity 709 3. ?-submeasures and completeness 713 4. ?-order continuity and order continuity 716 5. Extension of FN-topologies and of measures 719 5.1. Extension of ?-order continuous FN-topologies and ?-additive measures 719 5.2. Extension of exhaustive FN-topologies and exhaustive measures 724 6. The completion of topological Boolean rings and the system FN,. (/?) of exhaustive FN-topologies . . 726 7. More on the ? -topology 732 8. Decomposition of exhaustive measures 734 9. Connectedness 738 10. Vitali-Hahn-Saks and Nikodym theorems 739 References 741 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 703
This page intentionally left blank
FN-topologies and group-valued measures 705 Introduction In this chapter we present the basic results about the FN-topologies on Boolean rings and the applications in the theory of group- or vector-valued measures. A Frechet-Nikodym topology (FN-topology) is a topology generated by a measure in such a way that the measure becomes continuous. As an illustration, let ? be a positive real-valued measure on a ring of sets. Then the topology generated by ? is induced by the semimetric defined by d(A, ?) = ? (A AS). Frechet A919/20) introduced a metric in the space of measurable functions such that convergence with respect to the metric is equivalent to convergence in measure. Identifying characteristic functions with sets the subspace of characteristic functions becomes a ?- algebra and the metric on this subspace is equivalent to the metric induced by a measure described above. Later on, this metric space was studied by Nikodym. This metric space is complete and therefore a Baire space. Saks was the first to use a Baire category argument to prove the Vitali-Hahn-Saks theorem. It was one of the first important application of FN-topologies. Since that time FN-topologies have now and then been used in measure theory. But only since the beginning of the seventies, the Frechet-Nikodym approach has been used in a systematical way starting with Drewnowski A972a). Today FN-topologies are considered an elegant and powerful tool in measure theory. The Frechet-Nikodym approach has a particular advantage when the range space of the measures plays a minor role. This is the case, e.g., for Vitali-Hahn-Saks and Nikodym theorems, for extension theorems and for decomposition theorems. In the most known proofs of the Vitali-Hahn-Saks and Nikodym theorems, these are obtained as an easy consequence of the fact that the domain of the measures is a Baire space with respect to a suitable FN-topology (see Section 10). As already mentioned this idea goes back to Saks. The extension theorems presented in Section 5 are obtained extending the measures by continuity. So the measure extension theorems are equivalent to extension theorems of FN-topologies. This approach was used among others by Drewnowski A972a), Herer A976), Lipecki A974,1987), Maharam A987) and Weber A976,1980) and was originated by M.H. Stone as pointed out by Maharam A987). A first unified approach to the Lebesgue decomposition theorem and the Hewitt-Yosida decomposition theorem was presented by Drewnowski A973b). Stimulated by a question posed by Drewnowski A973b), Traynor A976) proved that an exhaustive measure can be decomposed with respect to an arbitrary FN-topology; different choices of the FN- topologies yield different decomposition theorems. The method to get decomposition theorems presented here was given by Weber A982a, 1984a). The idea is to extend the measure to a suitable completion, to study the extensions and then to transfer the results to the original measure (for details see the beginning of Section 6). The same method is also useful to prove other results, e.g., the control measure theorem of Bartle-Dunford-Schwartz. Throughout let R be a Boolean ring. As usual, we denote the symmetric difference (addition), infimum (multiplication), supremum, difference, natural ordering by ?, ?, ?, \, ^, respectively. If R is a Boolean algebra, i.e.. Boolean ring with unit, we denote the unit by e and the complement of an ? e R by x' := e \ x. G stands for a commutative additively written group.
706 ? Weber 1. Definition and generation of FN-topologies FN-topologies are particular group topologies on boolean rings generated in a natural way by a measure or by a family of submeasures. Let us fix our terminology. A pseudonorm on G is a function | |: G —>¦ [0, +oo] satisfying |0| =0, |-*| = |.v|. \x + у I ^ |.v| + |y|. A seminorm is a real-valued pseudonorm. A norm is a seminorm satisfying |0| = 0 only if ? = 0. If | \:G-> [0, +oo] is a pseudonorm (seminorm, norm), then d(x.y) := |.v - y| defines a pseudometric (semimetric, metric), which induces on G a group topology, called the | |-topology. By the inequality ||jc| - |y|| ^ \x - y|, a pseudonorm is uniformly continuous; here [0, +oo] is endowed with the pseudometric (a. b) н> \a - b\ and |(+oo) - (+oo)| := 0. If | | is a pseudonorm on G, then ||jc|| := min{|.r|. 1} defines a seminorm on G generating the same topology as | |. DEHNITION 1.1. A function ? : R -> [0. +эо] is called a submeasure if ?@) = 0 and /j is monotone and , i.e., ?(? ? у) ^ /j(.v) + ?()-) for all .v. у е /?. Example 1.2. Let | | be a pseudonorm on G and ?-.R^-Ga. measure, i.e., a finitely additive function. Then the semivariation of ? defined by ||?||(?) := 5ир{|д(;с)|: Яэ.х^а} is a submeasure on R. The total variation of ?, defined by II lMl(a):=sup]P|M(A-j)|. i=l where the supremum is taken over all finite disjoint sequences xt in R with .?, ^ ?, is a [0, +oo]-valued measure and therefore a submeasure. It is easy to see that a function is a submeasure on R iff it is a monotone pseudonorm on (/?, ?). In particular, a submeasure on /? induces on (R. ?) a group topology, which has a 0-neighbourhood base ([/„)„ef; consisting of ,vo//iisets, i.e., .r ^ у e [/„ implies .v e Un. Definition 1.3. A group topology on (R. A) which admits a 0-neighbourhood base consisting of solid sets is called a Frechet-Nikodym topology or FN-topology. We denote by FNG?) the set of all FN-topologies on R. This now used terminology goes back to Drewnowski A972a). Vladimirov A972) used the term monotone topologies. FN-topologies can be characterized by their 0-neighbourhood systems.
FN-topologies and group-valued measures 707 PROPOSITION 1.4. (a) If и e FN(tf), then for any 0-neighbourhood U in (R. u) there is a 0-neighbourhood V in (R, u) such that x, у e V and ? ^x ? ? imply zeU. (*) (b) If U is a filter on R with the property that for any U el] there is a V e U with (*), then U is the 0-neighbourhood system for a unique FN-topology on R. PROOF, (a) Let U be a 0-neighbourhood in (R.u) and V a solid 0-neighbourhood in (R, u) with VAV cU.lf x.yeV and ; ^ ? ? у, then ? л ? ^ ? e V, z\x ^ у е V, hence ? л ?, ? \? е V and ? = (? ?.?)?(; \?) e VAV C U. (b) Let U, V e U with (*). Then VA V с U. Therefore there is a unique group topology и on (/?, A) such that U is the 0-neighbourhood system of (R. u). Moreover, V с U* : = {x e R: у e U for any у ^ х} с U, hence U* is a solid 0-neighbourhood contained in u. a PROPOSITION 1.5. (a) FNG?) is a distributive complete lattice with the discrete topology as the greatest element and the trivial topology as the smallest element. (b) If u,v e FNG?) and U and V are the 0-neighbourhood systems, respectively, in {R,u)andin (R, v), then ji/W: ?/eU, VeV} is a 0-neighbourhood base in (R,uav) and {U A V: ?/eU, V e V) is a 0-neighbourhoodbase in (R,uv ?). PROOF. The system of all group topologies on (R. A) form a lattice with the discrete topology as the greatest element and the trivial topology as the smallest element. If U and V are the 0-neighbourhood systems, respectively, for group topologies и and v on (R, A), then {UAV: U e U. V e V) and {U ? V: ?/eU, VeV) are 0-neighbourhood bases in (R, и ли) and in (R, и ? и), respectively. If U and V are solid subsets of R, then [/AV = [/vVand[/nv = [/AVare solid. Therefore FN(tf) is a sublattice of the lattice of all group topologies on (R. A). Moreover, using the proved description of the 0-neighbourhood bases of и v и and и л ? given in (b), one easily sees that FNG?) is distributive. (Observe that it is enough to prove one of the distributive laws.) FNG?) is complete: If, for a e A, ua e FNG?) and U„ is a 0-neighbourhood base in (R, ua) consisting of solid sets, then the finite intersections C\uef.- Uu (F finite с А, Ua e Uff) is a 0-neighbourhood base for supae4 ua- ? PROPOSITION 1.6. Any FN-topology on R makes the operations ?, ?, ?, \ uniformly continuous. PROOF. Let о be one of the operations ?, ?, ?, \. Let и e FNG?) and U. V 0-neighbour- hoods in (R, u) with (*) of Proposition 1.4(a). If х,. у, e R with .v|Ajt2,yiAy2 e V, then (x\ oy])A(jC2 °yi) € U since (,vi oyi)A(.V2 oy2) ^ (.vi ?.?2) ? (у|Ау2>· П In particular, any FN-topology is a ring topology. More precisely, a topology on R is an FN-topology iff it is a ring topology on (R, А, л) which makes the multiplication л uniformly continuous iff it is a ring topology on (R, А, л) which is bounded in the sense of topological rings (for the definition of boundedness see Warner A993)).
708 ?. Weber Corollary 1.7. Let и e FN(tf) and S a subring of (R,u). Then the closure of S in (R,u) is a subring of R. Nachbin A965, p. 25) called a partial order on a topological space (?, ?) closed if its graph {(x, y): ? ^ y] is closed in (?. ?J. COROLLARY 1.8. If и is a Hausdorff FN-topology on R, then the partial order of R is closed. PROOF. Let (xY)yer and (yY)yer be nets converging in (/?, и), respectively, to ? and у and suppose that xy ? yy for any у e ?. Then ? = limK xy = limy xy л yy = ? л у ^ у. D PROPOSITION 1.9. Let < be a closed order on a topological space (?, ?). (a) Then the intervals {x e ?: ? ^ a], {x e ?: ? ^ b) and [a, b] := {x e ?: ? ^ ? ^ b] are closed in (?, ?) for any a, b e X with a ^ b. (b) If(xy)yep is a decreasing or increasing net converging to ? in (?, ?), then xy ? ? or xy | ?, respectively. PROOF, (a) Let (xY)yer be a net converging to л· in (?, ?) and xY ^ a for у e ?. With yY := a (ye ?) we obtain ? = limy,, ^ lim.vy = x. Hence [?: ? ^ a] is closed. In the same way one sees that {?: ? ^ b} is closed. It follows that [a, b] is closed as an intersection of two closed sets. (b) We only consider the case that (ху)уег is increasing. For any ? e ? we have by (a) л = lim„^y xa ^ xy. Therefore ,v is an upper bound of {xy: ? e ?]. If b is any upper bound of {xy: ye ?), then ? ?. b by (a). Hence ? is the last upper bound of (xy )yer- ? We now define the FN-topology associated to a measure. Let ?: R —> G be a measure with values in a topological group. It is well-known that the topology of G is generated by a system (| \y)yer of seminorms, i.e., {{.v e G: \?\??}: ? e ?. ? > 0} is a subbase for the 0-neighbourhood system of (G, ?). Put \\?\\?(?):=$\??{\?(?)\?: R э ? ^ a} for у е Г (see Example 1.2). Then the ?-topology is defined as the FN-topology generated by the submeasures (||м||у)уег· Here is a description of the ?-topology without using pseudonorms. The sets {a e R: ?(?) e U for any.? ^ ?}, where U is a 0-neighbourhood in G, form a 0-neighbourhood base for the ?-topology. Any FN-topology is the ?-topology for some group-valued measure. In fact, if и е FN(tf), put ? := (R, ?, u) and choose for ? the identity map on R; then и is the ?-topology. Since a ?-topology is generated by a system of submeasures. it follows that any FN-topology is generated by a system of submeasures.
FN-topologies and group-valued measures 709 PROPOSITION 1.10. Let ?: R -> G be a measure with values in a topological group. (a) If и e FNG?) and ? is continuous at 0 with respect to u, then ? is uniformly continuous on (R,u). (b) The ?-topology is the coarsest FN-topology which makes ? continuous. PROOF. Let | | be a continuous seminorm on G and ||?|| the semivariation of ? with respect to | |. Then |?(-*)-???| = |m(jc\3')-m(.v\jc)| ^ |?(.?\ v)[ + |?(}'\-*)| ^2\\?\\(??>-). Therefore ? is uniformly continuous with respect to the ?-topology. Let и e FNG?) and ? be continuous at 0 with respect to и. If ? > 0 and U is a solid 0-neighbourhood in (R, u) such that \?(?)\ ^ ? for ? e U, then U С {? € R: \\?\\(?) ^?}. Therefore the ?-topology is coarser than u. Moreover, ? is uniformly continuous with respect to и since it is so with respect to the ?-topology. ? Let N be an ideal in R. Then ?^(?) = 0 (? e ?) and ?,?(.?) = \ (? e R \ ?) defines a submeasure on R. ? ? is a measure iff jV is a maximal ideal in R iff R \ N is an ultrafilter in R iff the quotient R/N has two elements. If N is a maximal ideal, we call the ?,?-topology an ultrafilter topology. A measure which induces an ultrafilter topology, is called an ultrafilter measure. If G is a Hausdorff topological group without elements of order 2, then a measure ?: R —> G is an ultrafilter measure iff it is a 2-valued measure. 2. Exhaustivity Exhaustive measures were introduced by Rickart A943) as a common generalization of bounded real-valued measures on algebras and ?-additive Banach space-valued measures on ?-algebras. Exhaustive measures are called in Rickart A943) s-bounded (strongly bounded) since exhaustivity is for Banach space-valued measures a stronger condition than boundedness (see Corollary 2.7 and Example 2.11). Exhaustivity is an essential condition for most of the basic results for finitely additive measures; it is a crucial condition, e.g., for decomposition theorems, see Remark 8.7. Diestel and Uhl A977) illustrated the importance of exhaustivity (there called strong additivity) in their book "Vector Measures", 1.6 Notes and remarks. ????????? 2.1. An FN-topology и on R is called exhaustive if x„ -> 0 (u) for any disjoint sequence (xn) in R. If ? is a submeasure or a measure on R, then a is called exhaustive if the ?-topology is exhaustive, or equivalently, if a(xn) —> 0 for any disjoint sequence (x„) in R. If G is a topological group and ?-R^-Ga measure, then ? is exhaustive iff the ?-topology is exhaustive. We give two examples of exhaustive measures.
710 ?. Weber Example 2.2. (a) If R is ?-complete, G a topological group and ?: R —>¦ G a ?-additive measure, then ? is exhaustive. (b) If | | is a seminorm on G and ? : R —» (G, | |) a measure of bounded variation, then ? is exhaustive. PROOF. Let (Xn)neN be a disjoint sequence in R. In (a), M(sup„eN^;i) = ??=\ ?(¦*«). hence the series ???? ?(*/?) converges and therefore ?(?„) —>¦ 0 (« —>¦ oo). In (b), ?,^? \?(?>>)\ <supne/j|M|(a) <+oo, hence ? (.*„)-> 0 (л -> oo). D Any ultrafilter topology and therefore any ultrafilter measure is exhaustive. This follows from the fact that, for a maximal ideal ? in R. at most one member of a disjoint sequence in R does not belong to M. Since for any a e R \ {0} there is a maximal ideal in R not containing a, the supremum of all ultrafilter topologies on R is Hausdorff. Since the supremum of exhaustive FN-topologies is exhaustive, we have proved: PROPOSITION 2.3. The supremum u\, of all exhaustive FN-topologies on R is exhaustive and Hausdorff. Proposition 2.4. Let и eFN(R). (a) и is exhaustive iff every increasing sequence is Cauchy in (R.u) iff every increasing net is Cauchy in (R, u). (b) If и is exhaustive, then every decreasing net is Cauchy in (R. u). PROOF, (a) (i) Let и be exhaustive and (л'^Ьел an increasing net in R. Suppose that (ха)аел is not Cauchy. Then there is a 0-neighbourhood U in (R, u) such that for every ? e A there is an a ^ ? with xu \ xy ? U. Therefore one can inductively define an increasing sequence (or,,) in A with dn := xa,l+l \ xu„ $ U. Then (dn)neti is a disjoint sequence not converging to 0. (ii) Suppose that any increasing sequence is Cauchy in (R, u). If (xn)nei; is a disjoint sequence in R, then s„ := V/'=i xi defines an increasing sequence and therefore a Cauchy sequence. It follows that x„ = sn \sn-\ converges to 0. (b) Observe that for any decreasing net (ха)аел in R and ? e A the net (?? \??)?^? is increasing and apply (a). ? PROPOSITION 2.5. Let и e FN(fi) and S a subring of(R, u). If the relative topology u\s on S is exhaustive and S is dense in (R, и), then it is exhaustive. PROOF. Let (а„)„ек be a disjoint sequence in R and U a solid 0-neighbourhood in (R.u). Let U„ be a sequence of 0-neighbourhood in (R. it) with Uq = U and Un vUn С U„-\ (n e N). Take b„ e 5" with b„ Aa„ e U„. Then c„ :=b„\ sup, <n fo, defines a disjoint sequence in S. Therefore eventually c„ eU. Moreover, anAc„ = (a„ \ supl<:„ ai)A(bn \ sup,,,,, b,) ^ supi^n(ajAbi) e U\ ? Ui ? · · · ? Un С U. It follows that eventually an e UAU. ? In the following we examine the relationship between exhaustivity and boundedness.
FN-topologies and group-valued measures 711 PROPOSITION 2.6. Let ?:/? —>· (G. | \) be a measure with values in a seminormed group. Then supve/? \?{?)\ = +oo iff there is a disjoint sequence (xn) in R with lim„^+0C|MU„)| = +oo. PROOF, of the nontrivial implication. Let supveK |?(*)| = +oo. Case 1: Suppose that the semivariation ||?|| of ? is finite. We define inductively a disjoint sequence xq,x\,xi, ¦ ¦ ¦ in R with \?(?„)\ ^ «. Put xo = 0. If .v, are chosen for /' < n, put ? = sup''^ x; and take an ? e R with |?(.?)| ^ IImII(c) + "· For .v„ := ? \z we have \?(?„)\ + \?(? ??)? > ????? > ??(·* ?;)| + /;, hence |?(.??)| ^«. Care 2: Suppose that /? contains an element a with ||?||(?) = +эо. We define inductively a disjoint sequence .Vo..V|,.V2 in R with |?(.?„)? ^ " and ||?||(? \ sup/'=0Xi) = +oo. Put jco = 0. Assume that .v, are chosen for /' < ? and ||?||(;) = +oo where z=a\ sup;<„ ?,. Let ? ^ ? with |?(.?)| ^ |?(;)| +«· Then |?(; \.?)| ^ |?(^)| - |?(?)| ^ ?· Moreover, ||?||(?-)+ ||?||(: \.?) = +эо. Put.v,, = ? if ||?||(; \jc) = +oo, and Xn =z \x if ||?||(?-) = +oo. ? COROLLARY 2.7. ?^? ?: R —>¦ G be an exhaustive measure with values in a topological group. Then supveK \?(?)\ < +oo for any continuous seminorm \ \ on G. A subset ? of a commutative topological group G is called bounded in G if one of the following equivalent conditions is satisfied: A) supveB |*| < +oo for every continuous seminorm | | on G; B) for every 0-neighbourhood U in G there is a natural number n and a finite subset F of G such that В с F+ [/<"> where U{U = U and Ulk+U = U[k) + U (keN). The equivalence of A) and B) is proved in Hejcmann A959, Theorem 2.5) (cf. also Weber A986), Section 6). We here use only the first condition. The following theorem is a reformulation of Proposition 2.6 and Corollary 2.7. THEOREM 2.8. Let ? : /? —>· G be a measure with values in a topological group. (a) Then ?G?) is bounded iff [?(?„): ? e ?) is bounded for every disjoint sequence (a„) in R. (b) If ? is exhaustive, then ?(R) is bounded. COROLLARY 2.9. If и isan exhaustive FN-topology on R, then R is boundedin (R, ?, u). PROOF. Apply Theorem 2.8(b) with G = (R, ?, u) and ?(?) = ? (? e R). ? Actually, Corollary 2.9 is equivalent to Theorem 2.8(b): Under the assumption of Theorem 2.8(b), R is bounded with respect to the ?-topology and therefore the uniform continuous image ?(/?) is bounded in G. THEOREM 2.10. Let ? e N. For a measure ?-.R^-M." the following conditions are equivalent: A) ? is exhaustive.
712 ?. Weber B) ? is bounded. C) ? is of bounded variation. PROOF. C) =>· A) =>· B) is contained in Example 2.2(b) and Corollary 2.7. B) =>· C). It is enough to show that the components ?, of ? = (??,..., ?„) are of bounded variation. So we may assume that ? = 1, i.e., that ? is real-valued. Let ? : = supveK ????? and x\,... ,x„ disjoint elements of R. Then ?]"=? |?(-*/)? = ? ?(??) ~~ ]?~ ?(??) = ?E\??+ Xj) - ?($\??~ Xj) ^ 2.?, where ]?+ and sup+ (]?~ and sup") is taken over those /' for which ?(?,) ^ 0 (?(.?,) < 0). Therefore |?|(?) ^ 2s for all a e R. D There are Banach space valued bounded measures which are not exhaustive: Example 2.11. Define ?: P(N) -> lx by ?( ?) = ?? where P(N) denotes the power set of N and ?? the characteristic function of A. Then ? is bounded, but not exhaustive. On the other hand, there are also infinite dimensional Banach spaces ? with the property that any ?-valued bounded measure is exhaustive; e.g., any reflexive Banach space, in particular any Hilbert space, has this property. This follows from the fact that a Banach space valued measure is exhaustive iff it has a relatively weakly compact range (cf. Diestel and Uhl A977), Theorem 1.5.2). By the Dvoretsky-Rogers theorem, for any infinite Banach space ? there is a ?-additive ?-valued measure ?:?(?) —>¦ ? whose total variation is not bounded. Therefore the equivalences A) ·?> C) and B) ·?> C) fail in any infinite dimensional Banach space. The following fact is used in Theorem 7.7 to compare the ?-topology and the |?|- topology. PROPOSITION 2.12. Let ?: R -> (G, | |) be a measure with values in a seminormed group. Then \?\ is exhaustive iff \?\ is bounded. PROOF. We will apply Theorem 2.10 A) о B) for the measure |?|. Therefore we only have to prove that |?| is real-valued if |?| is exhaustive. Let |?| be exhaustive. Then ? is exhaustive and therefore ? := supveK |?(.?)| < +oo by Corollary 2.7. We first show: If a e R and |?|(?) = +oo, then there are disjoint elements b,c e R with a = bvc, \?\?) ^ ? and \?\(?) = +oc. If \?\(?) = +oo, there are disjoint elements a\ a„ e R with a = sup,^„a, and ?!'=? 1м(а*I > 3?· Let к be the smallest natural number with ?]f=1 |?(?,)| > ?· We put b := supi</t a, and с :=a\b = sup^^a,. Then \?\?) > s and ? ? k—\ |д|(с) ^ ]? |?(?,)| = ?\?(?,)\ - |?(?*)| - ]?|?(?,)| > 3? - ? - ? = ?. i=k + \ i = l ; = l Moreover, \?\?) = +?? or \?\(?) = +?? since \?\?) + \?\(?) = \?\(?) = +??. Suppose now that |?|(??) = +°? for some ?? e R. Then, by the proved statement, we can inductively define sequences (an)„e\: and (b„)nei; in R with ?„-? = b„ va„.
FN-topologies and group-valued measures 713 bn л a„ = ?, |?|(?„) = +oo and |?|(?>„) ^ i. This contradicts the exhaustivity of |?| since (fo„) is a disjoint sequence. ? 3. ?-submeasures and completeness ? : R —>¦ [0, +00] is called ?-subadditive if ? = sup,!6f;a„ implies /j(a) ^ ??=\ ^(Ял) for any a,an e R. A monotone function ? : R —>¦ [0. +00] is ?-subadditive iff the implication DC a = supa„ =>· /?(a)^ VVa„) "eN „=i holds for any disjoint sequence (a„),!et; in /? and a e /?. A ?-subadditive monotone function /j: /? -> [0, +00] with /j@) = 0 is called a ?-submeasure on /?. Obviously, any ?-submeasure is a submeasure. Example 3.1. Let | | be a pseudonorm on G and ?: R -> (G.| |) a ?-additive measure. Then the semivariation ||?|| (defined in Example 1.2) is a ?-submeasure. PROPOSITION 3.2. For a submeasure ty.R -> [0,+00] the following conditions are equivalent: A) ? is a ?-submeasure. B) If(an)nefi is a Cauchy sequence in (R. ?) and an \ 0, then ?(??) —>¦ 0. C) If (an),,eN is a Cauchy sequence in (R, ?) and an la. then ?(?„??) —>¦ 0. D) If(a„)neN is a Cauchy sequence in (R. ?) andan f ?, ?/??>/? ?(????) —>¦ 0. PROOF. Using the fact that a„ 4, a implies a 1 Act,, 1 a\Aa. an \ a implies a Aa„ 4, 0, one immediately sees that conditions B)-D) are equivalent, A) =>. B), Let (a,,)ne^ be a Cauchy sequence in (R. ?) with an 4. 0. Choose a subsequence (fo„)weN of (а„)„ек with r)(bnAbn+\) ^ 2^" for /7 e N. Since fo„ = sup.^iA \ fy+?) one obtains ??„) ^ ??^?»7(*?' \ fo< + i) ^ 2""*1, Therefore ??„) -+ 0. Since (a„)nsN is a Cauchy sequence which has a subsequence converging to 0, also (a„)nem converges to 0 in (/?,/?). D) =>· A). Let ?,?„ e fi with а = sup„er:a„. We will show that ?(?) ^ Y^=\ ?(?„). For that we may assume that ??=\ ?(?„) < -boo. Then b„ := sup"=| a, defines a Cauchy sequence; in fact, III ??,,,?^,) ??( sup ?, W V] f?(a,) -> 0 (m > n ->¦ +00). Moreover, b„ f a. Therefore t)(aAfc„) -> 0 by D). Since a = (sup"=1 a,) v (aAb„), we have 77(a) < ]? ??(?,) + n(aAb„) ? ]? 7?(a,) + il(aAb„). 1 = 1 It follows that ??(?) ^ ]Р?1| /?(а/). D
714 ?. Weber Condition B) of Proposition 3.2 corresponds to the pseudo ?-Lebesgue property for locally solid Riesz spaces (see Aliprantis and Burkinshaw A978, p. 115)), A further characterization of ?-subadditivity is based on the following lemma. LEMMA 3.3. Let R be ?-complete and ? a ?-submeasure on R. If(xn)net< 's a sequence in R with ?(????„+\) ^ 2~" for n e ?, ? = liminfjt,, and у = limsupjt,,, thenx,, -> ? (?) andxn -> у (?). Proof, (i) For n e N we have sn := sup^j =xn vsupUi+? \xi). hence s„ Ax„ = sn \ xn ^ sup(.vj+i \ Xj), With the ?-subadditivity of ? we get ?E„??„) ^??(?1 + ] \Xi) iJ2-" + 1. Since (xn)neN is a Cauchy sequence and - as we have proved - SnAx,, —> 0 (?), also (sn)neN is a Cauchy sequence. Moreover, sn I y. Therefore sn —> у (?) by Proposition 3.2 A) =>· C). It follows that x„ -> у (?) since Sn ?.?„ -> 0 (?). (ii) Let ? := sup,!eN;t„ and ;* := ? \ ; for ; ^ J. Then limsup.x* = .v*. Since ?(?*??*+]) ^2~" we get from (i) that, ?* -> ?* (?). It follows that.xn -* ? (?). ? For a submeasure /;:/?—>¦ [0, -boo] and .?, ? e R we write л· ~,; у iff ?(???) = 0. It follows from Proposition 1.6 that ~,; is a congruence relation on (R. v. ?. ?, \). Sometimes we use the following fact: If л„ —>¦ ? (?), then xn —> у (/?) iff ? ~?; }'· PROPOSITION 3.4. Let ? be a submeasure on R. If R is ?-complete, then the following conditions are equivalent: A) ? is a ?-submeasure. B) // (a„)„eN 's a Cauchy sequence in (R, ?) and liminfa,, = limsupa,, =: a, then an -+ ? (?). C) //'(a,!),ieN is a Cauchy sequence in (R, ?), a e R and liminfa,, ~,; a ~,; limsupa,,, then an^r ? (?). PROOF. C) =>· B) is obvious, B) =M 1) follows from Proposition 3.2. A) =>· C). Let (a„)„eK be a Cauchy sequence in (/?./?) and liminfa,, ~,; a ~,; limsupa,,. Choose a subsequence (x„)net'. of (an)ner; with i)(Xn&Xn+\) ^ 2~". Since a ~^ liminfa,, ^ ? := liminf.v,, ^ limsupa,, ~,; a, we have ? ~,; .v. By Lemma 3.3, (xn)neN converges to ? and therefore also to a ? in (R. ?). So (а„)„ек is a Cauchy sequence which has a subsequence converging to a. It follows that also (an)nen converges to a. D
FN-topologies and group-valued measures 715 PROPOSITION 3.5. Let R be ?-complete and ? a ?-submeasure on R. If (а„)„ек is a sequence converging to a in (/?,/j). then (a,,)„eu has a subsequence (x,,)net-i with liminf.*,, ~4 a ~^ limsupjt,,. PROOF. Let (x„)neN be a subsequence of (а„)„еь: with ?(?„???+?) ^ 2'" for n e N, ? := liminfjt,,, у := limsupjt,,. Then x„ —>¦ ? (?) and л„ —>¦ у (?) by Lemma 3.3. Moreover, x„ —> ? (?) by assumption. It follows that ? ~,; у ~,; а. П THEOREM 3.6. Let R be ?-complete and ? a submeasure on R. (a) If ? is ?-subadditive, then (/?./?) is complete. (b) If(R, ?) is complete and Hausdorff, then ? is ?-subadditive. PROOF, (a) immediately follows from Lemma 3.3: If (а„)„ек is a Cauchy sequence in (/?,/?) and (xn)„eN a subsequence of (a„)„et; with ?(?„???+\) ^ 2~", then (jt„)„eN converges by Lemma 3.3. Hence (a„)net: converges. (b) We use the characterization of ?-subadditivity given in Proposition 3.2 A) о B). Let (a„)„epj be a Cauchy sequence in (R. ?) and an \, 0. We will show that a,, —>¦ 0 (?). Since (/?, ?) is complete, (а„)пе]ч has a limit a e R. By Corollary 1.8 and Proposition 1.9 we have a„ 4- ?. Therefore ? = 0 and a„ —>¦ 0 (?). ? COROLLARY 3.7. Le? /? be ?-complete and ? : /? —>¦ G ? ?-additive measure with values in a pseudonormedgroup. Then R is complete with respect to the ?-topology. We now give examples of finitely additive, non ? -additive measures inducing a complete FN-topology. In contrast to the first example, the measures in the second and in the third example are atomless. Example 3.8. Let ? be a maximal ideal in P(N) containing all finite subsets of N and ? the corresponding ultrafilter measure, i.e., ?(?) = 0 for A e ?4 and ?(?) = 1 for A e P(N) \ M. Then ? is a non ?-additive measure and (P(N). ?) is complete. Example 3.9. Let ? be the Lebesgue measure on the ?-algebra ? of Lebesgue measurable subsets of [0. 1] and ?:?([0. 1]) —>¦ [0. 1] be a measure extension of ? such that ? is dense in (P[0. 1].?). (Such a measure exists by Corollary 5.2.7.) Then (P([0, 1]),?) is complete since its dense subspace (?. ?) is complete. But under the continuum hypotheses ? is not ?-additive by Ulam's theorem. We now modify the last example to avoid the continuum hypotheses. Example 3.10. Let ? be the Lebesgue measure on the ?-algebra ? of Lebesgue measurable subsets of [0, 1] and (Nn)ne\-: be an increasing sequence of subsets of [0. 1] with inner Lebesgue measure 0 and Um=i N» = $· 4· Let ?? be the ideal in P([0. 1]) generated by {jV„: пеЩ and ?0:????^> [0. 1] defined by ??(???) = ?(?) (A e A, N e ??). Let ? :P([0, 1]) -> [0. 1] be a measure extending ?() such that ???? is dense in (P([0, 1]), ?). Then (P([0. 1]). ?) is complete. But ? is not ?-additive since ?(?„) = 0(neN)andM(U,?l| W„)=l·
716 ?. Weber The following problem is related to problem 2 on p. 101 of Aliprantis and Burkinshaw A978): If R is a ?-complete Boolean algebra and и an FN-topology on R generated by a family of ?-submeasures, does it follow then that (R.u) is sequentially complete? 4. ? -order continuity and order continuity An FN-topology и on R is called (a)-order continuous if every decreasing net (sequence) order converging to 0 converges topologically to 0 in (R, u). An FN-topology и on R is (a)-order continuous iff for every net (sequence) order convergence implies topological convergence in (R, u). A submeasure ? is called (?)-order continuous if the /j-topology is (a)-order continuous. PROPOSITION 4.1. Let ? be a submeasure on R. (a) If ? is ? -order continuous, then ? is ? -subadditive. (b) If ? is ? -subadditive and exhaustive, then ? is ? -order continuous. (c) If R is ?-complete, then ? is ?-order continuous iff ? is ?-subadditive and exhaustive. PROOF, (a) immediately follows from Proposition 3.2 B) =>· A). (b) Let R э an \, 0. Then (a„);ieH is a Cauchy sequence by Proposition 2.4 and therefore converges to 0 by Proposition 3.2 A) =>· B). (c) We have only to show that a ?-order continuous submeasure on a ?-complete Boolean ring is exhaustive. Let (an)„e>: be a disjoint sequence in R. Then sup^a, ^ 0, therefore ?{??) ^ /j(sup,^„ a,) —>¦ 0 (n —>¦ эс). So ?(?„) —>¦ 0 (n —>¦ oo). ? COROLLARY 4.2. If R is ?-complete, G a topological group and ? :/?—>¦ G a measure, then ? is ? -additive iff the ?-topology is ? -order continuous. PROOF. If the ?-topology is ?-order continuous, then R э an \, 0 implies that (a„)neN converges to 0 in the ?-topology. hence ????(?,?) = 0 by the continuity of ?. Vice versa, let ? be ?-additive and (| |,,)r6/' a system of seminorms generating the topology of G. Then the semivariation ||?||,, with respect to | \Y is a ?-submeasure by Example 3.1 and exhaustive by Example 2.2(a). hence ?-order continuous by Proposition 4.1. ? For any и e FN(tf), the closure N(u) := @)" of {0} in (R. u) is an ideal in R. и is Hausdorff iff N(u) = {0}. If и is generated by a submeasure ?. then N(u) = ?(?) := {? e R:n(x) = 0}. If ? and ? are submeasures on R, we write ? <? ? if ?{?„) -> 0 (n —>¦ oo) implies ?(?„) —>¦ 0 (/; —>¦ oo), i.e.. if the ?-topology is weaker than the /j-topology. THEOREM 4.3. Let R be ?-complete, ? ? ?-submeasure on R and ? ? ?-order continuous submeasure on R. Then ? <? ? iff ?(?) С ?'(?).
FN-topologies and group-valued measures 717 PROOF. =>·. The condition that ?(?„) —>¦ 0 implies ?(?„) -> 0 means for constant sequences exactly ?(?) с N (к). «=. Suppose that R contains a sequence (a;,)„er.; such that ?(?„) —> 0. but (?(??))„?? does not converge to 0. Passing to a subsequence we may assume that k(a„) ^ ? (? e ?) for some ? > 0. By Proposition 3.5, (а„)„ег; has a subsequence (.v„),ie[.: such that ? := limsupjt,, ~,; 0, i.e.. ?(?) = 0. Hence ?(?) = 0. On the other hand, ?(?) = limX(V,^„ ??) ^ ?, a contradiction. ? COROLLARY 4.4. If R is ? ?-complete and к ?/iii f? агг exhaustive ?-submeasures on R, then the ?-topology agrees with the ?-topology iff N(k) = ?(?). Proof. This follows from Theorem 4.3 and the fact that every exhaustive ?-submeasure is ?-order continuous. D Let ? and ? be measures on R and и e FN(R). Then ? « м or ? « ? means that the ?-topology is weaker than и or weaker than the ?-topology, respectively. Moreover, we put ?(?) = /V^-topology); if ? has values in a Hausdorff topological group, then ?(?) = {ae R: Wx e [0, ?], ?(.?) = 0). COROLLARY 4.5. Let R be ?-complete, ? a ?-submeasure on R and и а ?-order continuous FN-topology on R. Then и is weaker than the ?-topology iff ?(?) С N(u). PROOF. Let (ka)aeA be a family of submeasures generating u. Then и is weaker than the /j-topology iff ?? <^? for any ? e A iff ?(?) с N(ku) for any ? e A iff ?(?) С N(u). D COROLLARY 4.6. // R is ?-complete and ? and ? are ?-additive measures on R with values in pseudonormed groups, then the ?-topology agrees with the v-topology iff ?{?) = ?(?). This follows from Theorem 4.3 and the fact that the ?-topology and the ?-topology are ?-order continuous (see Proposition 4.2) or equivalently that the semivariations ||?|| and || ? || are ?-order continuous. To show the significance of the assumptions in Corollary 4.6, we give some examples of measures ? and ?, which induce different FN-topologies. although ?'(?) = N(v). Example 4.7. (a) Let ?:?(?)-> [?, +??] be the counting measure, i.e., v(A) = \A\ for А с ?, and ? :P(N) -> [0, 1] the probability measure defined by ?.(A) = ?„еД 2'" for Л С N. Then ? and ? are ?-additive, ?(?) = ?(?) = {0}. ? induces on P(N) = {0, 1}N the product topology, which is compact, whereas the ?-topology is discrete. Observe that ? is not exhaustive (cf. Corollary 4.4) and the range space of ? is not a group (cf. Corollary 4.6). (b) Let ? be a maximal ideal in P(N) containing all finite subsets of N and ? the ultrafilter measure defined by к\м = 0 and ?(?) = 1. Let ? be defined as in (a) and ? = ? + k. Then ?'(?) = ? (?) = {0}. ?(?) is compact with respect to the ?-topology, but not complete with respect to the ?-topology. Observe that ? is not ?-additive.
718 ?. Weber (c) Let ?, ? and ? be defined as in (b). Let ? be an uncountable set with ? ? ? = 0, ?? the ideal of all countable subsets of ? and A = {AUB: AeM, В eAf}U{AUB: А с ?, N \ A e .?, В с ?, ? \ В е ??]. Then A is an algebra with the property: if a sequence (A„)/ieN in A has a supremum A in Д, then A = U/Jli ^«· ?(??) := ?(? П ?) and v(M) := v(M П ?) define ?-additive measures on A with ?(?) = N(v). But Д and ? induce different FN-topologies on ?. Observe that A is not a ? -algebra. (d) Let A be the ?-algebra of Lebesgue measurable subsets of [0,1 ] and ? the Lebesgue measure on A- We denote by ?? the characteristic function of a subset ? of [0, 1]· Then ?(?) = (?(?), ?A) and ?(?) = @, ??) define ?-additive measures on A with values in G :=R xl|0J| endowed with the product topology. We have ?(?) = N(v) = {0}, but the ?-topology and the ?-topology are different. Observe that G is not metrizable. The measures Д and ? in Example 4.7(c) are not strictly positive. In fact, it can be shown that two ?-additive strictly positive real-valued bounded measures on R induce the same topology even if R is not ?-complete; for more precise information see Weber A994, Section 4). THEOREM 4.8. Let R be a complete Boolean algebra and и and ? order continuous FN- topologies on R. Then и is coarser than ? iff N(v) С N(u). Proof. =>· is obvious. ·?=. Let (??)??? be a net of submeasures generating v. We may assume that (??)?€? is increasing. Otherwise, replace (??)??? by (?^) where ?^ := J2aeF'?« ап<* Р is апУ finite subset of A. xa := supN(^u) e ?(??) since ?? is order continuous and therefore x := miaeAxa e N(v) с N(u). Since xa 4. x, we obtain xa -+ x (u). hence xa —> 0 (u) since x e N(u). Let U be a solid 0-neighbourhood in (R.u). We complete the proof showing that U ? U is also a 0-neighbourhood in (R,v). Let /3 e A with ?? elf and 5= [0,^] where jcl denotes the complement of xp. Since ?(??\?) = {0} С ^V(m|s), we obtain by Corollary 4.5 that и Is is weaker than /j/j|s-topology. Therefore there is a solid 0-neighbourhood V in (R, ??) such that V ? 5" с U ? 5". We show that V с C/ ? [/: Let jc e V.Thenjc = (jc \jc/j) v(jc л.х/j) e (V П 5) vi/ с С/ ? С/. It follows that С/? С/ is a 0-neighbourhood in (R, ??) and therefore in (R. ?). ? COROLLARY 4.9. A complete Boolean algebra admits at most one Hausdorff order continuous FN-topology. COROLLARY 4.10. Let R be a complete Boolean algebra. D a disjoint subset of R (i.e.. different elements of D are disjoint) and sup D = e. Let w be a Hausdorff order continuous FN-topology on R and w(i the relative topology induced by w on [0, d] for d e D. Then f(x) := (x л d)deD defines a topological isomorphism f : (R, w) -> П(/ед([0, d], w,i). Proof. Observe first that / is a Boolean isomorphism. Moreover, the product topology on ??/eD^· ^1 's Hausdorff and order continuous. Therefore its inverse image under / is Hausdorff and order continuous and agrees therefore, by Corollary 4.9, with w. This means that / is a topological isomorphism. ?
FN-topologies and group-valued measures 719 COROLLARY 4.11. Let R be a complete Boolean algebra and w a Hausdorff order continuous FN-topology on R. Then (/?, w) is algebraically and topological!}· isomorphic to a product Y\aeA(Ra, wa) where Ra is a complete Boolean algebra and wa a metrizable order continuous FN-topology on Ra. PROOF, (i) We first prove that for any a e R \ {0} there is an element b e R such that 0 < b ? a and w/, := w|[0./j] is metrizable. Let (иа)аел be afamily of FN-topologies on R with countable 0-neighbourhood base such that w = supaeA ua. Since w is Hausdorff, a ? N(ufi) for some ? e A. Let ?? = sup jV(h/j)· Then N(u^) = [0, jc/j] and b := a \ ?? > 0. Since the restriction of u? on [0, b] is Hausdorff, it coincides with w/, by Corollary 4.9. Therefore w/, is metrizable. (ii) It follows from (i) that R contains a disjoint set D such that sup D = e and wd is metrizable for any d e D. The assertion of Corollary 4.11 now follows from Corollary 4.10. D PROPOSITION 4.12. Let и be a Hausdorff FN-topology on R. Then R is a complete Boolean algebra and и is order continuous iff every increasing net converges in (/?, u). PROOF. =>·. If {xa) is an increasing net, then (xa) has by assumption a supremum ? in R, i.e., xa l ?. Hence xa —> ? (и) since и is order continuous. <=. Let (xa)aeA be an increasing net in R. By assumption, (xa)aeA has a limit ? in (R, u). It follows from Corollary 1.8 and Proposition 1.9(b) that ? = supxa. Therefore R is order complete. This argument also shows that xa \ ? implies xa —> ? (и), i.e., и is order continuous. ? 5. Extension of FN-topologies and of measures In this section, let S be a subring of R, G be a complete Hausdorff topological commutative group and ? : S —> G a measure. Let (| \Y)Yer be a system of seminorms generating the topology of G. We define the semivariation ||/х||у(йг) := sup{|/z(Ar)|: 5эа$й) for any a e R and ? e ?. 5.1. Extension ofa -order continuous FN-topologies and ?-additive measures The topological approach to measure extension theorems is based on the following simple fact, which follows from Corollary 1.7 and Proposition 1.10. PROPOSITION 5.1.1. Let и e FN(R) and ?-.S^- G a measure with ? «С и (i.e., ? <<C и 15). Then the closure ? = S" of S is a subring of R and ? has a unique continuous extension ? : (Г, и\т) —> G to ?. Moreover, ? is a measure.
720 ?. Weber Starting from a measure ?, we have to construct a suitable FN-topology и to apply Proposition 5.1.1. It is near at hand to look for an extension и of the ?-topology; this guarantees the continuity of ? with respect to u. Which other properties of и are needed to obtain a ?-additive extension of ? on a ?-ring, can be seen by the following proposition. PROPOSITION 5.1.2. Let R be ?-complete and и be an FN-topology on R generated by a system of ?-submeasures. Suppose that u\s is exhaustive. (a) Then the closure ? = S" of S is ? ?-subring of R (i.e., a subring of R closed with respect to countable suprema). (b) If ?. «u, then the unique u-continuous extension ? :T —>¦ G of ? is a ?-additive measure. PROOF. Let a e R and (a„)HeN be a sequence in Г with supa,, = a. We will show that a eT. Since Г is a subring of R, we may assume that (а„)„ек is increasing. Since и\т is exhaustive by Proposition 2.5, (a„)„ef; is a Cauchy sequence by Proposition 2.4. Now it follows from Proposition 3.2 that a„ —>¦ a (u) and therefore a eT. If ? : S —> G is a и-continuous measure, then its и-continuous extension ? : ? —>¦ G is a measure as stated in Proposition 5.1.1. Let (а„)„ек be an increasing sequence in Г and a = supa,,. As we have seen before, this implies a„ —>¦ a (u) and by the continuity of ? that ?(a„)->A (?). D LEMMA 5.1.3. Let R be ?-complete, 0 e А С R and ??:?^ [0, +??] a function with jjo@) = 0. Then ? : R -> [0, +оо] Лг/шгЛ fcy /j(-*) = inf ^ijo(a/i): an e A, .v ^supa,, I /1 = 1 /? ? ?-submeasure and ?? > /?|д. (Яггг inf0 := +oo.) Moreover, ? extends ?? iff ? ? supa,, implies ??(?) ^ Y^L\ 1o(an)far any sequence (а„)„ек in A and a e A. THEOREM 5.1.4. Let R be ?-complete. A measure ? : S —> G /?a.v a ?-additive measure extension to the ?-subring ?(?) of R generated by S iff ? is ?-additive and exhaustive. PROOF. Suppose that this (obviously necessary) condition holds. For any ? e ?, the semivariation ||? \\? is an exhaustive ?-submeasure on S and ??(?) :=inf ]? ||?||7(?„): an e S, ? ? supa,, I /i = l defines by Lemma 5.1.3 a ?-submeasure extending ||? || ?. Let и be the FN-topology on R generated by (??)уеГ and Г and ? defined as in Proposition 5.1.2. Then ? is a ?-additive measure extension of ? and its domain ? contains ?(?). ? In the proof of Theorem 5.1.4 we used Lemma 5.1.3 with A = S. If one takes for A a suitable bigger set, the FN-topology и becomes weaker and so the domain S" of the
FN-topologies and group-valued measures 721 measure extension ? becomes bigger. For example, let R be a Boolean ?-algebra, e e S and (d„)neN be a disjoint sequence in R and A = Su{.x\ sup dj\ ? eS, ? eN}; then one obtains as in the proof of Theorem 5.1.4 that every ?-additive exhaustive measure ? : S —>¦ G has a ?-additive measure extension to the ?-subring a(S U {ii„: ? e ?}) of /? generated by S and the disjoint sequence (dn)nei;. For a more general result (see Theorem 5.1.7 and Corollary 5.1.8) we use the following lemma. LEMMA 5.1.5. Let R be ?-complete, S a ?-subring of R, SO = {x e R: ? ? ? e S for every ? e S], W a well ordered subset of R containing 0 and A = {z\w: ? e So, we W). Let ?-.S ^ [0, +oo] foi> ? ?-submeasure, ??(?) := supa^xeSk(x) for a e A and ? defined as in Lemma 5.1.3. (a) Then ? is a ?-submeasure extending ??. (b) If ? is exhaustive, then {x e S: ? ^ ?) converges to a in (R, ?) for any a e A, in particular А С S'>. PROOF, (a) By Lemma 5.1.3 we have to prove that for a countable index set /, ? e S, aa e A (a e ?), ? ? supaa imply ?(?) ^ ]P 1o(oa)· «' ael aa can be written as aa = za\ »« with za e So and wa eW. With the aid of the well order of W one can define a well order on / such that (wa)aei becomes an increasing net. Since S is a ?-subring of R, the elements xa := (x л za) \ supp<a ?? belong to S. Moreover, xa ^ za and xa ^ ? \ sup г/j ^ sup(z/j \ ш/O \ sup ?? ^ sup(c/j \ w/0 ^ sup ?? \ wa, ?<? pel ?<? ?^? ?^? hencexa ^za\wa =aa. We now show that ? ?. supue/Xa. Suppose that ? := ? \ supa6/xa ? 0, Since у ^ supae/ za and / is well ordered, there is a smallest index ye/ with у л ?? ? 0. Therefore у /\?? =0for/3 < у mdyAzy ^ (x Azy)\supp<yzp = ??. Since улл-у =0, it follows that у л ?? = 0, a contradiction. Finally, ?(*) ^ ?ffe/ ?(*„) ^ ??6/ /??(??)· (b) follows from the following lemma observing that for a e A and jeSwe have a\x e A and therefore /j(a \jc) = /jo(a \x)· ^ LEMMA 5.1,6. Lei ? : S -> [0, +oo] be an exhaustive function, i.e., ?(?„) -> 0 (n -> oo) for every disjoint sequence (a,,) in S. Put ?* (?) := supa^.veS k(x)for a e R. Then for any ? e R and ? > 0 г/гггг w аи jc e S ??/'?/г л^ ? and ?*(? \ л) ^ ?.
722 ?. Weber PROOF. Let ? e R and ? > 0. Suppose that ?*(; \?)> ? for any ? e S with ? ^ z. Then for any ? e S with ? ^ ? there is an у e 5 with у ^ с \ ? and ?()') > ?. Therefore we can inductively find a sequence (y„) in S with y„ ^ с \ sup, <;i y, andX(y„)> ?, a contradiction to the exhaustivity of ?. ? THEOREM 5.1.7. Let S be a subring of ? ?-complete Boolean ring R, W a well ordered subset of R and ? : 5" —> G ? ?-additive exhaustive measure. Then ? has ? ?-additive measure extension p.:a(SUW)—>-Gto the ?-subring of R generated by SUW such that S is dense in a(SU W) with respect to the ?-topology. Proof. Since R is a ?-subring of a ?-complete Boolean algebra, we may assume that R has a maximal element e. Replacing W by W U {0} we may assume that W contains 0. Moreover, in view of Theorem 5.1.4 (and its proof), we may assume that 5" is a ?-subring of R. We now proceed as in the proof of Theorem 5.1.4. Let ??(?):=??{ ^||?||7(?„): ane ?. ? ^supa„ /?=? where A is defined as in Lemma 5.1.5. By Lemma 5.1.5, ?? isa?-submeasureon R which agrees on A with ||?||7. Let и be the FN-topology generated by (??)?^? and ? and ? defined as in Proposition 5.1.2. Then Д is ?-additive and ? a ?-subring of R. S is dense in ? with respect to the Д-topology since ? <K u. It remains to show that W с Т. From Lemma 5.1.5(b) follows that А с Г. Moreover, ceA and e \ w e A for w e W. Therefore W cT since Г is a ring. ? COROLLARY 5.1.8. Let R be ?-complete. D a disjoint subset of R and ? : S —>¦ Gaa-ad- ditive exhaustive measure. Then ? has a ?-additive measure extension ? : ?(? U D) —>¦ G to the ?-subring of R generated by S U D such that S is dense in a(SUD) with respect to the ?-topology. PROOF. Let /эа|->4еОЬеа bijection from / onto D where / is a well ordered set. If / has a maximal element ?, we put J = I U {? + 1}; otherwise we put J = I. Then W := {sup^<aify.· a e J] is a well ordered subset of R. Now apply Theorem 5.1.7 observing that da = 5ир^<а+, ?? \ sup^^ ?? and therefore o{S U D) с o{S U W). D Sometimes the following additional information is of interest. Remark 5.1.9, With the assumptions and notations of Theorem 5.1.1 we have: (a) |IAI|y(z)= llMlly(z) for any с e S and ? e ?. (b) The Д-topology induces on S the ?-topology, (c) The Д-topology agrees with к17- iff the ?-topology agrees with u\$.
FN-topologies and group-valued measures 723 PROOF, (a) Obviously ||Д||у(с) ^ \\?\\?(?). Let now с e 5" and ? e ? with ? ^ z- We show that \?(?)\? ^ \\?\\?(?). Let (л„)„е,л be a net in S converging to ? in (Г. и). Then (?,,??) converges to x, too. Therefore \?(?)\ =Нт|д(л„лс)| iC ||?||7(;). (b) follows from (a), (c) follows from (b) and the following proposition. D PROPOSITION 5,1.10. Let и e FN(tf) and S a dense subring of(R,u). Then ? ы> v\s defines an order isomorphism from the lattice FN(R. u) of all FN-topologies on R coarser than и onto the lattice FN(S, u\$). PROOF. Obviously, r:ww v\s defines a monotone map from FN(R. u) into FNiS, и Is)· r is surjective: let и e FN(m|s) and 0?„)„ед be a family of submeasures on S generating ?. Then the continuous extensions rja of ?? on (R.u), a e A. induce on R an FN-topology ? coarser than и with v\$ = v. The injectivity of r and the monotonicity of r~' is a consequence of the following statement: If ? e FN(R, и), then the closures V" in (R. u) of the 0-neighbourhoods in (S, v\s) form a 0-neighbourhood base in (R, v). In fact, if ? is a continuous submeasure on (R, u), ? > 0 and V = {x e S: ?(?) < ?], then {xeR: ?(?)<?} с V" с {? e R: ri(x)^s}, ? Let ? and ? be measures on R and u.ve FN( R). We write и J_ ? if the infimum of и and и is the trivial topology, ? J_ ? (and ? J_ v) means that и J_ ? where и is the ?-topology (and ? is the v-topology). If ? J_ v, we say that ? and ? are singular. From Theorem 5.1.1, Remark 5.1.9(b) and Proposition 5.1.10 immediately follows: Corollary 5.1.11. Let и e FN(fi) and S be a dense subring of (R, u). Denote by a(R, G, u) and a(S, G, u\$) the spaces of all G-valued continuous measures on (R, u) and (S, и Is), respectively. (a) Then ? н>- ?|5 defines a group isomorphism from a(R, G, u) onto a(S, G, u\$). (b) For ?, ? ea(R, G, u) and ? e FN(/?, u) we have: (i) ? « ? ??"?\5 « v\s, mIuiJmIsI v\s- (ii) ?«?(^?|5«?|5, ? ±? iff ?\$ ±v\s. Theorem 5.1.4 was first proved by Sion A969) by a suitable modification of the Caratheodory process. A topological proof of this theorem is contained in Drewnowski A972a), Herer A976), Weber A976). D. Maharam A987) observed in a footnote that the topological approach to obtain a ?-additive extension of a bounded real-valued measure from a ring to the generated ?-ring was originated by M.H. Stone. D. Maharam A987) presented a topological approach to extend also [0,+oo]-valued measures, which in general are not exhaustive. Here, also the following fact observed in Lembcke and Weber A992, Theorem 3.3.1) is of interest:
724 ?. Weber THEOREM 5.1.12. Let R be ?-complete. Then every ?-additive [0, +oo]-valuedmeasure on S admits a ?-additive [0, +oo]-valued measure extension to R iff every ?-additive [0, l]-valuedmeasure does so. This follows from the fact that every [0, +oo]-valued ?-additive measure is an (infinite) sum of [0, l]-valued ?-additive measures (see Lembcke and Weber A992, Corollary 3.1.17)). Theorem 5.1.7 is essentially contained in Weber A980). It is worthwhile to observe that the assumption in Theorem 5.1.7 that W is well ordered is essential. The result fails already for a countable totally ordered set W as observed in the postscript of Lipecki A980a, 1980b). Its Corollary 5.1.8 generalizes results of Bierlein A966), Ascherl and Lehn A977), Lipecki A980a, 1980b). In Bierlein A966), Lipecki A980a, 1980b), the disjoint system is countable. In Bierlein A966), Ascherl and Lehn A977), the set functions are probability measures. 5.2. Extension of exhaustive FN-topologies and exhaustive measures In contrast to the ?-additive case, any finitely additive vector-valued measure defined on an algebra ? of subsets of X has a measure extension to the whole power set F(X), This is an easy consequence of the Hahn-Banach theorem. The result remains true (but is harder to prove) in the group-valued case, see Carlson and Prikry A982). This is a purely algebraic result. In measure theory it is more of interest to extend exhaustive measures. Lipecki A974) proved that any exhaustive measure ?: ? -> G has an exhaustive measure extension Д :P(X) ->¦ G such that ? is dense in F(X) with respect to the Д-topology. The significance of the density statement is here in particular to transfer properties from ? to Д. Lipecki's extension theorem for measures is an immediate consequence of an analogous extension theorem for FN-topologies; actually these two extension theorems (Corollaries 5.2.6 and 5.2.7) are equivalent. We present in the following also Lipecki's more general extension theorem for tight measures, cf. Lipecki A987). Definition 5.2.1. Let К с R. An FN-topology и on 5" is called K-tight if for every ? & S and every neighbourhood U of ? in (S, u) there are elements к е К and ? e U with у ^ к ^ x, A measure ? : S -> G is called ?-tight iff the ?-topology is ?-tight, i.e., if for every ? e S and every 0-neighbourhood U in G there are elements к е К and у е S such that у < к ? ? and ?(?) e U for every ? e [0. ? \ y]. LEMMA 5.2.2. Let ? ; S —> [0, +oo] be a submeasure on S. (a) Then ?*(?) = inff,^.ve5 ?(?) defines a submeasure on R. (b) ?*(?) = $???^?€5?(?) defines a monotone function on R with the following property: ifa,be R and a Ax e S for any ? e S with ? ^ ? ? b, then ?*(? ? b) ^ ?*(?) + ?*?). (c) ?* < ?* and ru\s = ?= DISPROOF, (а), (с) and the monotonicity of ?* obviously hold. We prove the last statement of(b).
FN-topologies and group-valued measures 725 Leta,fo e R suchthataAJt e S for any .r e 5??[0, ? ? fo]. Then, for any .r e Sn[0,a vfo], we have x\a = x\(x ла) e 5??[0, fo], hence ?(?) ^ ^(.v ла) + /j(.v\a) < ?*(?) + /j*C>)· It follows 77* (? vfc)i?ij*(a) + ij*(fc). ? LEMMA 5.2.3. Let R be a Boolean algebra, S a subalgebra of R, ?: 5 -> [0, +oo] an exhaustive submeasure, к e R and ? = {(.? Л к) ? (у Л к'): х, у е S] the Boolean subalgebra generated by S andk. Put p(a) = ?*(? А к) + ?*(? /\k')for а е Г: (a) ?/??7? /0 is a submeasure on T and ? ^ p\s ^ 2?\ (b) {jc e S: jc ^ k'} converges to k' in (T. p)\ (c) S is dense in (T, p); (d) the continuous extension ?: (?, ?) —> [0, +эо] of ? is an exhaustive submeasure inducing the p-topology; (e) // К is a sublattice of R,k e К and ? is К-tight, then ? is К-tight, too. PROOF, (a) p is a submeasure since if and /j*|.w are submeasures, see Lemma 5.2.2. Let ? e S. Then p(x) = ?*(? л к) + ^*(л л к') ^ ?*(.?) + ?*(?) = 2?(?). We show now that ?(?) ^ p(x): Let ? > Oand у e 5 with .? А к < у < .? and /j(y) < f?*(.v л ?) + ?. Then *\у < .xAfc'andijC*)^ /j(y) + ? (? \ у) ^ ?* (? лк)+е + л*(х лк') = ?(?)+?. (b) Let ? > 0. By Lemma 5.1.6 there is ап.кеХ with .r ^ k' and /j*(^' \ л) ^ ?. Now observe that /j*(^' \ x) — p(k' \ x). (c) Sp is an algebra containing S and A:' as proved in (b). Therefore Sp contains ?, the algebra generated by S and k'. (d) By (a), ? is continuous on (S. p\s) and its continuous extension ? on (?, p) is a submeasure satisfying ? ^ ? ?;2?. Therefore ? and /О induce the same topology. The exhaustivity of ? follows from (c) and Proposition 2.5. (e) Let a e ? and ? > 0. We write a = (x\ л it) ? (x2 л it') with .v, e 5". By (b) and (d), there is an xq e S with Jto < k' and rj(k' \xq) < ?. By assumption, there are elements к, е ? and y, e 5 with у ? < k\ <,x\,y2^k2^ x2 л .то, /j(.ti \yi) ^ ? and ?((?2 ???) \y:) ^ ?. Then we have у := (yi л к) ? y2 e Г Д := (?? л к) ? к2 е К. у ^к ? a and ?j(a \ у) < Л(х\ \ У\) + 4@*2 л дсо) \ Уг) + Щк' \ .го) < 3?. ? THEOREM 5.2.4. Let К be a sublattice of R and и a K-tight exhaustive FN-topology on S. Then there is a K-tight exhaustive FN-topology ? on the ring ? generated by S U К such that v\s = и and S is dense in (T. v). PROOF. Since every Boolean ring can be embedded in a Boolean algebra, we may assume that R has a unit e. Let (??)?€? be a family of submeasures on S generating и. lie ^S, let SO := SU {?': ? e S} the Boolean subalgebra of R generated by 5". Then, for a e A, the restriction ?„ of (?„)* to SO is by Lemma 5.2.2(b) a submeasure which extends ??. Let mo be the FN-topology generated by (??)?€?. One easily sees that mo is exhaustive and- using Lemma 5.1.6- that S is dense in (So. мц) and мц is ?-tight. So we may assume that e e S. Consider the class of all families (va)a€A of submeasures with the following properties: (i) The va have a common domain D which is a subalgebra of ? containing S. (ii) va extends ?? for each a e A.
726 ?. Weber (iii) If v denotes the FN-topology generated by (vff )оед on D, then ? is ?-tight and S is dense in (D, v). We define a partial order in this family putting (va)a€A < (??)?(?? iff ?« extends va for every a e A. By Zorn's lemma, this class has a maximal element (/?«)<*,= д. Suppose that the domain D of ??, a e A, is different from ?. Then there is к e ? \ D. By the preceding lemma, for any a e ?, ?? has an extension rju on the algebra Д> generated by Z) and A; to a ?-tight exhaustive submeasure such that {.v e D: ? ^ k'} converges to k' in (Д>, ??„). Then {* e ?): ? < A;'} converges to k' with respect to the FN-topology ? generated by the family (rja)a€A. It follows that k' e Dv and finally that S is dense in (Д>. ?). Thus (т)а)аеА is not maximal, a contradiction. ? From Theorems 5.2.4 and 5.1.1 follows immediately: COROLLARY 5.2.5. Let К be a sublattice of R and ?: S -> G a K-tight exhaustive measure. Then ? has а К-tight exhaustive measure extension ?:? —* G on the ring ? generated by S U К such that S is dense in ? with respect to the ?-topology. Every FN-topology and every measure on S is К -tight for К = R. Therefore one obtains from Theorem 5.2.4 and Corollary 5.2.5 the following two corollaries. COROLLARY 5.2.6. For every exhaustive FN-topology on S there is an exhaustive FN- topology ? in R such that v\s = и and S is dense in (R.v), COROLLARY 5.2.7. Every exhaustive measure ?: S -> G has an exhaustive measure extension ?: R —* G such that S is dense in the ?-topology. For the relationship of Theorem 5.2.4 with results of Henry, Lembcke, Bachman and Sultan, and Dalgas we refer to the introduction of Lipecki A987). 6. The completion of topological Boolean rings and the system FNf(/?) of exhaustive FN-topologies Often ?-additive real-valued measures on a ?-algebra are easier to handle than finitely additive measures. For example, for ?-additive real-valued measures on a ?-algebra we have: A) Measures with the same null sets induce the same topology. B) Several decomposition theorems can be obtained by decompositions of the ?-algebra. C) The algebraic condition that a measure is atomless implies connectedness properties of the range. This is not any more true in the finitely additive case. On sight it may be surprising that in the study of ?-additive measures on a ?-algebra with values in a non-metrizable range space there occur similar difficulties as in the finitely additive (real-valued) case; in particular, A )-C) are not any more true. It turns out that A )-C) are satisfied for exhaustive
FN-topologies and group-valued measures 727 measures which induce a complete FN-topology. So not ?-additivity of a measure ? but completeness of the ?-topology is often the crucial condition (cf. Theorems 6.5 and 6.16). The idea is now, in order to study exhaustive measures on R, to consider first the continuous extensions ?, v,... on the uniform completion (R, u) of (R, u) with respect to a suitable FN-topology u. Results for ?. v.... then yield results for the restrictions ? = ???, ? = v|к This procedure is related to the Stone space argument. In contrast to the Stone space argument, here the measure ? associated to an exhaustive measure ? is not only ? -additive but completely additive and its domain is order complete, not only ? -complete. Let w be a Hausdorff FN-topology on R and R the uniform completion of R^ Then the ring operations ? and л of R have uniform continuous extensions on R. So R becomes a Boolean ring and the uniformity of R is induced by an FN-topology w. (R. w) is called the completion of (R, w). THEOREM 6.1. Let w be an exhaustive Hausdorff FN-topology on R and (R, w) the completion of(R, w). Then R is a complete Boolean algebra and w is order continuous. PROOF. By Proposition 4.12 we have only to show that every increasing net converges in (R, w). But this follows from Proposition 2.4 since w is exhaustive by Proposition 2.5. D THEOREM 6.2. Let wbea Hausdorff FN-topology on R. Then w is exhaustive and (R, w) is complete (as a uniform space) iff R is a complete Boolean algebra and w is order continuous. PROOF. =>· is contained in Theorem 6.1. <^=.Let(fl, w) be the completion of (R, w). We show R= R.Leta e R. Then [0,а]ПЛ is a closed ideal in (R, w), hence a principal ideal, i.e., [0, ?] ? R contains a maximal element b. Let c = a\b and ? be the FN-topology on R which induces on [0, c] the trivial topology and on [0, c'] the same topology as w. ? := v\r is order continuous since ? is weaker than w. ? is Hausdorff since [0, с] П R — {0}. Therefore ? = w by Corollary 4.9 and v= w by Proposition 5.1.10. It follows с — 0, hence a — b e R. ? COROLLARY 6.3. Let w e FN(R). Then w is exhaustive and (R. w) is complete iff every increasing net converges in (R, w). PROOF. This follows from Theorem 6.2 and Proposition 4.12, if w is Hausdorff. To conclude the proof we reduce the general case to the Hausdorff case passing to the quotient (R, w) = (R, w)/N(w) where R = R/N(w) and w is the quotient topology on R with respect to w. Obviously, (R, w) is complete and exhaustive iff (R, w) is so. It remains to show that every increasing net converges in (/?, w) if (R, w) has this property. Let (??)??? be an increasing net in R and xa e ??. «? A. Let ? be the upwards directed system of all finite subsets of A. Then (supa€f.xa)r€?- is an increasing net in (/?, w), hence convergent. It follows that (??)?€? is convergent. ? COROLLARY 6.4. Let w be an exhaustive FN-topology on R such that (R, w) is complete. Then (R, v) is complete for every ? e FN(R, w).
728 ?. Weber PROOF. Apply Corollary 6.3 observing that every increasing net converges in (R, v) if it converges in (/?, w). ? THEOREM 6.5. Let w be an exhaustive Hausdorff FN -topology on R and (R,w) the completion of(R, w). (a) Then ? н> v\ r defines an order isomorphism r : FN(/?, w) —> FN(/?, w). (b) vh>f\ sup N(v) defines an order isomorphism s : FN(/?, w) -> R. (c) FN(/?, w) й a complete Boolean algebra and sor a Boolean isomorphism from FN(fl, w) onto R. (d) Let ?, ? be continuous measures on (R, w) with values in a topological group, ? = Д|к, ? = v\r, ve FN(R, w) and ? = v\r. Then (i) ? <?u iff ?(?) э N(u), ? ±u iff there is an a e ?(?) with a' e ?(??). (ii) ? <K ? iff ?(?) D N(v), ? J_ ? iff there is an a e ?(?) with a' e N(v). PROOF, (a) is contained in Proposition 5.1.10. (b) follows from Theorem 4.8 observing that - as a consequence of Theorem 6.1 - any ? e FN(/?, w) is order continuous and therefore N(v) = [0, a] where a = sup N(v). (c) immediately follows from (a), (b) and Theorem 6.1. (d) follows from (a). Theorem 5.1.1 and Remark 5.1.9(b). ? We denote the system of all exhaustive FN-topologies on R by FNe(R). Then FNe(R) = FN(R, we) with the notation of Proposition 2.3. COROLLARY 6.6. FNe(R) is a complete Boolean algebra. We now examine the relationship between properties of и (е FN(/?)) and properties of FN(K, и). The following fact makes it understandable that the condition of exhaustivity plays a particular role. Remark 6.7. Let и e FN(R). Then и is exhaustive iff FN(/?, u) is a Boolean algebra. PROOF. =>· holds by Theorem 6.5(c). <=. Let (a„)„eN be a disjoint sequence in /?. For ? e N, put R„ = [0, ? ? ? · · · ? ?„] and S1,, = {* e /?: jc л у = 0 for every ? e /?„}. The sets U ? S^, where i/ is a 0-neighbourhood in (/?, и) and teN, form a 0-neighbourhood base for an FN-topology v e FN(/?, u). Let w be a complement of ? in FN(/?, u). It is easy to see, using that R„ is an ideal in R, that ??|?„ is a complement of и | ?;? in FN(/?„, m|r„). Since ?|?„ coincides with и \цп, we get that ??|?„ is the trivial topology, hence an e N(w) for any ? e N. Therefore (а„)„еи converges to 0 with respect to w. Obviously, (а„)„ек converges to 0 with respect to ? and therefore with respect to и since и = ? ? ц\ ? Dehnition 6.8. Let и e FN(/?). We say that (R,u) is chained if for every 0-neigh- bourhood U in (/?, и) and every ? e R there is a finite sequence л ? x„ in i/ with jc = sup^=, x,.
FN-topologies and group-valued measures 729 An equivalent condition is that for every ? .ye R and every O-neighbourhood U in (/?, u) there is a finite sequence .то,..., л„ in U with лц = ?, ?„ = у and X[Axi-\ e U (i = 1,..., и). Therefore, our terminology is in accordance with Hejcmann's definition of chained uniform spaces. Proposition 6.9. Let и eFN(/?). (a) The following conditions are equivalent: A) (R,u) is chained. B) Every continuous submeasure on (R. u) is real-valued. C) There is no ultrafilter topology on R coarser than u. D) (R, u) does not contain any open ideal different from R. (b) и is an ultrafilter topology iff the trivial topology is the only FN-topology on R strictly coarser than u. PROOF, (a) A) =>· B). Let ?: (R.u) -> [0. +oo] be a continuous submeasure and .v e R. Since (/ := (z e R: ?(?) ^ 1} is a O-neighbourhood in (/?, и), there are x\ x„ e U with ? ^ x\ ? · · · ? xn. Therefore ?(?) < ??=? ^(-"?') ^ " < +oo. B) =>· C), Suppose that ? is a maximal ideal in /? such that the corresponding ultrafilter topology is coarser than u. Then ?(?) = 0 (.? e ?) and /j(.v) = +00 (x e R\ M) defines a continuous(sub-)measureon (R.u). C) =>· D). Suppose that / is an open ideal in (R. u) and / ? R. Let ? be a maximal ideal in R containing /. Then ? is open and therefore the corresponding ultrafilter topology is coarser than и. D) =>· A). Let U be a O-neighbourhood in (/?, и) and / the set of elements ? e R such that there are x\,..., x„ e i/ with .v < .?? ? · · · ? л„. Then / is an open ideal in (/?, и), hence I = R, i.e., (/?, и) is chained. (b) =>·. Let и be a non-trivial FN-topology on R coarser than и and V a solid O-neighbourhood in (/?, v) with V ? V ? R. Then, for any a e V, we have [0,«] ? N(m) с V ? V ? R. Since N(u) is a maximal ideal, it follows [0. ?] ? N(m) = N(u), hence a e N(u). Therefore V = N(u) and ? = и. <=. Suppose that и is not an ultrafilter topology. Then N(u) is not a maximal ideal and therefore R \ N(u) contains two disjoint elements a and b. Then the sets [0. ?] ? ?/, where U is a O-neighbourhood in (R.u), form a neighbourhood base for a non-trivial FN- topology strictly coarser than и. ? Recall that an atom in a lattice with smallest element 0 is an element a e L such that there is no ? e L with 0 < ? < a. L is called atomless if L does not contain any atom. L is called atomic if for every г e i, \ {0) there is an atom a e L with a < z, and atomistic if every nonzero element of L is a join of atoms. Corollary 6.10. Let и eFN(R). (а) и is an ultrafilter topology iff и is an atom in FN(/?) iffFN(R. u) contains exactly two elements.
730 ?. Weber (b) и is the supremum ofultrafilter topologies iffFN(R. u) is atomistic iffFN(R, u) is atomic and и is exhaustive. (c) (/?, u) is chained iff FN(R, u) is atomless. PROOF, (a) and (c) are reformulations of Proposition 6.9 (b) and (a) A) о C). (b) If FN(R, u) is atomistic, then и is the supremum of atoms of FN(/?. и), i.e., the supremum ofultrafilter topologies. If и is the supremum of ultrafilter topologies, then и is exhaustive, FN( R, и) is a Boolean algebra by Theorem 6.5(c) and its maximal element is the supremum of atoms in FN(/?, u) (see (a)). Therefore FN(/?. u) is atomic. If и is exhaustive and FN(/?. u) atomic, then FN(/?. и) is an atomic Boolean algebra and therefore atomistic. ? We used that any Boolean ring is atomistic iff it is atomic. Any atomistic lattice with zero is atomic; the next proposition shows that FN(/?. u) can be atomic without being atomistic where и is the discrete topology. A Boolean algebra is called superatomic iff every subalgebra is atomic. It is easy to see that A is superatomic iff A does not contain a tree (i.e.. a subset order isomorphic to the system of intervals {[//2", (/' + l)/2"[: i.n eNU(O), /' < 2"}). The algebra generated by the finite subsets of a set X is an example of a superatomic algebra. For further information see Bhaskara Rao and Bhaskara Rao A983), Section 5.3. PROPOSITION 6.11. Let R be a Boolean algebra. (a) FN(/?) is atomic iff R is superatomic. (b) FN(/?) is atomistic iff R is finite. PROOF, (a) If FN(/?) is not atomic, then there is by Corollary 6Л0(с) a non-trivial FN- topology и on R such that (R, u) is chained. Then the quotient (R,u) = (R. u)/N(u) is chained, too. It is easy to see that therefore R contains a tree. Hence R contains a tree and therefore R is not superatomic. If R is not superatomic, then R contains a subalgebra A isomorphic to the algebra A generated by the intervals [//2", (/ + l)/2"[ (/, ? e N U {0}, / < 2"). Let ?: A -+ [0, 1] be the Lebesgue measure, яМ->Да Boolean isomorphism, ?: R -> [0, 1] a measure extension of ? ? ? and и the ?-topology. Then (/?, и) is chained. Therefore FN(/?) is not atomic, see Corollary 6.10(c). (b) If R is finite, then by Theorem 6.5(c) - which is for finite R obvious - FN(/?) is isomorphic to R and therefore atomistic. If FN( R) is atomistic, then the discrete topology is the supremum of ultrafilter topologies (see Corollary 6.10(b)) and therefore exhaustive. Therefore R does not contain an infinite disjoint subset. Hence R is finite. ? We will now characterize FN-topologies satisfying [Bl], i.e., which admit a countable 0-neighbourhood base.
FN-topologies and group-valued measures 731 PROPOSITION 6.12. Let и be an exhaustive Hausdorff FN-topology satisfying [B1 ]. Then R satisfies the countable chain condition (CCC for short), i.e., every disjoint subset of R is countable. PROOF. Let (i/„)neN be a O-neighbourhood base in (R, u) and D a disjoint subset of R. For any ? & ?, D \ U„ is finite since и is exhaustive. Therefore the union {j)l€i:(D \ Un) is countable. This union equals to D since и is Hausdorff. ? THEOREM 6.13. Let и e FJSL(K). Then и satisfies [Bl] iffFN(R, u) satisfies CCC. PROOF. «=. By Theorem 6.5(c), FN(/?, //) is a Boolean algebra. We now use the following fact: If Л is a Boolean algebra satisfying CCC and D a dense subset of A (i.e., for every a e A \ {0} there is a d e D with 0 < d < a). then every element of A is the supremum of a countable subset of D. Applying that for Л = FN(/?, u) and D = {w e A: u> satisfies [Bl]} we see that и is the supremum of a countable family of FN-topologies satisfying [Bl] and therefore itself satisfies [B1 ]. =>·. Passing to the quotient (R,u)/N(u),-we may assume that и is Hausdorff. Let (/?,«) be the completion of (R,u). Then и satisfies [Bl]. Therefore by Proposition 6.12, R satisfies CCC. Since R and FN(/?, u) are isomorphic by Theorem 6.5(c), FN(/?, u) satisfies CCC, too. D Corollary 6.14. Let ? e FNe(R). if ? satisfies [Bl], then even и е FN(/?, v) satisfies [Bl]. COROLLARY 6,15, If R isaa -complete Boolean ring satisfying CCC and и is ? ? -order continuous FN-topology on R, then и satisfies [Bl]. PROOF. CCC of R implies that every subset A of R contains a countable subset В with sup В = sup A, Therefore R is complete and и is order continuous, hence (R.u) is complete by Corollary 6,3, Therefore R and FN(R. u) are isomorphic by Theorem 6.5(c), So FN(/?, u) satisfies CCC and therefore и satisfies [Bl] by Theorem 6,13, ? THEOREM 6,16, Let и e FN<,(/?) and (R.u) be the completion of the quotient (R,u) = (R,u)/N(u), Then: (a) (R, u) is chained iff R is atomless, (b) и is the supremum of ultrafilter topologies iff R is atomic, (c) и satisfies [Bl] iff R satisfies CCC, PROOF. By Theorem 6.5(c), R and FN(^. u) are isomorphic. Moreover, there is a natural isomorphism between FN(R,u) and FN(R. u). Therefore R andFN(/?,w) are isomorphic. With this fact, (a), (b) and (c) follow from Corollary 6,10 (c)-(b) and Theorem 6,13, D Definition 6.17. Let ?; R -> G be a measure. If (R. ?-topology) is chained, we say that R is ?-chained.
732 ?. Weber This concept corresponds to strong continuity in the sense of Bhaskara Rao and Bhaskara Rao A983). From Theorem 6,16(a) and Corollary 3.7 follows: COROLLARY 6,18, Let R be ?-complete and ?: R -> G ? ?-additive measure with values in a pseudonormed group. Then R is ?-chained iff ? is atomless (i.e., R|N{?) is atomless). 7. More on the ?-topology The ?-topology of a measure ? ;/? -» G depends on the topology taken on G: If и e FN(/?), G = (/?, ?, u) and ?: R -> G is defined by ?(?) = ?, then the ?-topology coincides with и. In the next theorem there is given a condition under which the ?-topology is the same for two different group topologies on G, This theorem has several interesting consequences, THEOREM 7.1, Let(G. ?) be a complete topological group, ? a Hausdorff group topology on G coarser than ? and ?: R -> G a measure exhaustive with respect to ?. Then the ?-topology with respect to ? agrees with the ?-topology with respect to ?. PROOF. We apply Theorem 6.5 with w = we. With the notation of Theorem 6,5, let ?: (R, w) -> (G, ?) be the continuous extension of ?. Let ? and и be the ?-topology, respectively, with respect to ? and with respect to ?, and и = u\r, v = v\r. Then N(v) = ?(?) = N(u), hence ? = и by Theorem 6.5, i.e., the ?-topology with respect to ? agrees with the ?-topology with respect to ?, see Remark 5.1.9(c). ? Let ? be a set of measures on R with values in a topological group. ? is called uniformly exhaustive if for every disjoint sequence (xn)n€\; in R the sequences (?(?„))„€?? converge to 0 uniformly in ? e M. For и e FN(/?), ? is и-equicontinuous if for every net (xa)aeA converging to 0 in (/?, u) the nets (?(.??))??? converge to 0 uniformly in ? e M. COROLLARY 7.2. Let и e FN(/?), G be a Hausdorff topological group and ? be a uniformly exhaustive subset of a (R,G,u). Then ? is u-equicontinuous. PROOF. We may assume that G is complete; otherwise replace G by its completion. Let Гэо and ?? be, respectively, the topology of uniform convergence and of pointwise convergence on GM. Then the measure m: R -> (G'w, Гэс) defined by m(a) = (?(?))?€? is exhaustive and the m -topology with respect to ?? is coarser than и. By Theorem 7.1, the m -topology with respect to ?^ agrees with the m -topology with respect to ??. It follows that the w-topology with respect to ?^ is coarser than и, i.e., ? is и-equicontinuous. ? COROLLARY 7.3. Let ? be a locally convex linear space, ? its topology and ?: R —> (E, p) an exhaustive measure. Then the ?-topology with respect to p, the ?-topology with respect to the weak topology ?(?, ?") and the supremum of the x' о ?-topologies, ?' e ?", coincide.
FN-topologie.s and group-valued measures 733 PROOF. Let N = {0}p and ? : ? -> ?/? be the canonical map onto the Hausdorff quotient space E/N. Since ? and ? ? ? induce the same FN-topology, we may assume that ? is Hausdorff. Moreover, since the weak topology of the completion of (E, p) induces on ? the topology ?(?,?'), we may assume that (E,p) is complete. Therefore, by Theorem 7.1, the ?-topology with respect to ? agrees with the ?-topology with respect to ?(?, ?"). The last one agrees with the supremum of the ? о ?-topologies, ?' e ?". since ?(?, ?') is the supremum of the ??--topologies, x' e ?". where px- := \x'(x)\ (x e E) and, moreover, the ?-topology with respect to the px--topology agrees with the ?' о ?-topology. ? COROLLARY 7.4. Let ? and F be locally convex linear spaces and ?: R -> ? and ?: R —>¦ F be the exhaustive measures. (a) Then ? <<C ? iff ?' ??« ? for every ?' ? ?'. (b) ? J_ ? iff ?' ? ? J_ у' о ? for every ?' & ?' and у' е F'. (c) If R is a subring of a Boolean ring ? and К a sublattice of P, then ? is К-tight iff ?' ? ? is K-tightfor every x' e ?". (d) (i) R is ?-chained iff R is ?' ? ?-chainedfor every x' e ?". (ii) IfR is ?-complete and ? is ?-additive, then R is ?-chained iffx'o ? isatomless for every x' e ?". PROOF. Besides Corollary 7.3 we use in the proof of (b) that и J_ i; if u. ua, ? e ?Ne(R), и = supa€Aua and ua ± ? for every a e A. In (c) we use that the supremum of ?-tight FN-topologies is ^-tight. In (d) we use Corollary 6.18 and that (/?, u) is chained if ua eFN(/?), (R,ua) is chained for every ? e A and и = supa€Aua. ? COROLLARY 7.5 (Bartle-Dunford-Schwartz). Let (E, p) be a metrizable locally convex linear space and ?: R -> ? an exhaustive measure. Then there exists a real-valued measure ? : R —» [0, 1] equivalent to ?, i.e., ?«? and ?«?. PROOF. Let и be the ?-topology with respect to p. Since ? is metrizable, и has a countable 0-neighbourhood base. By Corollary 7.3, и is the supremum of the ?' ? ?-topologies, ?' e ?". It follows that for some sequence (x'„),i€!· in ?", и is the supremum of the x'n ? ?- topologies, ? e N. Now put ? = J2r^=\ ?n\x'„ ° Ml where ?„ are suitable small positive real numbers such that ? becomes [0, l]-valued. Then и agrees with the v-topology. D If ? is a measure with values in a seminormed group, then ? <<? |?|. We now examine when |?| <<C ? and therefore ? and |?| are equivalent. PROPOSITION 7.6. LetG be a seminormed group, ?: R -> G a measure and S a subring of R which is dense in R with respect to the ?-topology. Then the total variation (?| of ? extends the total variation of ?\$. PROOF. Let* e 5" and ? := ? |s. Obviously, \v\(x) ^ |?|(.?). To prove \?\{?) ^ | v|(.v), let ? > 0 and x\,..., x„ be disjoint elements of R with .?, < ? (i — 1,..., n). Let уч е S with
734 ?. Weber ||?||U, Ay,·) ^ ?/?2 and Zi := (?, \ sup/<( v7) /\.x. Then -\?(?? - ?(??)\ ^ ||?||(.?,?;,)= ||?||((.?, \sup*/) Ах\л((у7 \sup>'7) л*) i «? ?]||?||(.?7·?.?;-)<?— «? -. 7=1 hence ?''=? lMU-)l «??"=? |v(c#)| + 2? < |?|(.?) + 2?. ? THEOREM 7.7. Let G be a seminonned group and ?: R -> G an exhaustive measure. Then the following conditions are equivalent: A) ??|«?· B) \?\ is exhaustive. C) \?\ is bounded. PROOF. B) о C) is proved in Proposition 2.12. A) => B) is obvious. B) ^ A). Replacing G by the completion of G/N where N — {x e G: |*| = 0), we may assume that G is Hausdorff and complete (cf. the proof of Corollary 7.3). We now apply Theorem 6.5 with w = we. Since, with the notation of Theorem 6.5, ?(\?\) = ?(?), we obtain that |?| \R <^?. Since by Proposition 7.6 |?| = |?| \r, we have |?| «: ?. ? 8. Decomposition of exhaustive measures In this section let G be a complete Hausdorff topological commutative group. We will decompose a measure ? in the form ? = ? + ? where ? J_ v. The next proposition shows that here ? is ?-additive (or exhaustive or ...) iff both ? and ? are ?-additive (or exhaustive or ...). PROPOSITION 8.1. Let ?. ? be G-valued measures on R and ? = ? + v. (a) If ? J_ v, then the ?-topology is the supremum of the ?-topology and the v-topology. (b) If и e FN(/?), ? «: и and ? J_ и. then the ?-topology agrees with the infimum of и and the ?-topology. PROOF. Denote by u(?), и (?). u(v), respectively, the ?-topology, the ?-topology and the ?-topology. (a) Obviously, ? is continuous with respect to u(k) v«(ii) and therefore и(ц) is coarser than «(?) ? u(v). Therefore it remains to show that ?«? and ? <*C ? if ? J_ v. We show that ? <<C ?: Let U and V be 0-neighbourhoods in G with V + V — V с U and ? e R such that ?([0, .?]) С V. We show that ?(.?) e U. Since ? J_ ?, there are disjoint elements ? ?, ?? e R with .? = x\ ?.??, ?(.?|) e V and v(x2) e V (cf. Proposition 1.5). Now ?(?) = ?(?\) + ?(?2) = ?(?\) + ?.(?2) - v(.X2) e V + V -V cU.
FN-topologies and group-valued measures 735 (b)By (a), ?(?) = u(k)vu(v). Therefore, by the distributivity of FN(/?), we have u(?) Ли= (?/(?) л и) ? (?(?>) Л и) = «(?) since ?(?) л и = и(к) and u(v) л и = 0. ? We now present a general version of the Lebesgue decomposition theorem, which contains several decomposition theorems as special cases. In a more familiar formulation of Lebesgue's decomposition theorem, the FN-topol- ogy и in the next theorem is the v-topology for some measure v. THEOREM 8.2 (Lebesgue decomposition theorem). Let ?: R -> G be an exhaustive measure and и e FN(R). Then there exist unique G-valued measures ??, ?? on R such that ? = ?? +??, ?? «и and ?? J_ и. Moreover, ?,(/?) С ?(R)¦ PROOF. We apply Theorem 6.5 with w = we. Let ? be the infimum of и and the ?- topology. With the notation of Theorem 6.5, let ? e FN(/?. w) with v\r = ? and ? the continuous extension of ? on (/?, u>). see Theorem 5.1.1. Put ? = s(v), ?\(?) = ?(? ? ?) and Дг(л-) = ?(? ? ?') for .? e /?. Then ?? = ??|? and ?? = ??|« are measures with Ml +M2 = ?· Since ?(??) = [?,?'] = TVCu), we obtain that the ? ? -topology agrees with v, hence ?? <g u. Since ? e ?(??) and ?' e ?(??), we have ?? J_ v. This implies ?? ±u since the ?2-topology is weaker than ?-topology. By the definition of ?, we have ?j(R) С ?/(?) С Д(Л) С ?№- If ? = ?? + ?? is a further Lebesgue decomposition. 14 <K и and vi J_ и, then ? := Д2 — vi = V| — ?? С и and ? J_ и, hence the ?-topology is trivial. It follows that ? = 0 and ?, = ?,- (/' = 1,2). ? COROLLARY 8.3. Let ?-.R -* G be an exhaustive measure and и an FN-topology on R coarser than the ?-topology. Then there is a measure ?: R -> G such that и agrees with the v-topology. PROOF. Take ? = ? ? of the decomposition of Theorem 8.2. By Proposition 8.1 (b) и agrees with the v-topology. ? Remark 8.4. Let ?: R -> G be an exhaustive measure and и е FN(R). Then the following conditions are equivalent: A) ? J_ u. B) If ?: R -> G is a measure with ?«? and ? <<C и, then ? = 0. C) If Я is any Hausdorff topological commutative group and k:R->H& measure with ?«? and ?««, then ? = 0. PROOF. A) =>· C) =>· B) is obvious. B) =>· A). Let ? be the infimum of и and the ?-topology. By Corollary 8.3, ? is the ?-topology for some measure ?: R -> G. Condition B) implies that ? = 0, i.e., ? is the trivial topology. ?
736 ?. Weber A measure ? : R -> G is called purely finitely additive if there is no non-zero ? -additive measure ? on R with values in a Hausdorff topological group such that ? <<? ?. THEOREM 8.5 (Hewitt- Yosida decomposition). Let ?: R -> G be an exhaustive measure. Then there are unique G-valued measures ?\, ?? on R such that ? = ?\ + ?2, ?? « ?-additive and ?2 is purely finitely additive. Moreover, ?,(/?) С ?(R) and ?, « ? (? = 1,2). PROOF. Apply Theorem 8.2 and Remark 8.4 with и = un being the supremum of all ? -order continuous FN-topologies on R. Observe that a measure ? on R is ? -additive iff ? <<C u„ and ? is purely finitely additive iff ? J_ u„. ? Another choice of и in Theorem 8.2 yields another decomposition theorem: THEOREM 8.6. Let R be a subring of a Boolean ring ?, ? a sublattice of ? and ? : R —> G an exhaustive measure. Then there are unique G-valued measures ? ?, ?? such that ? = ? ? + ?2, ?? is K-tight and there is no non-zero K-tight measure ? : R —> G with ? «; ?2. Moreover, ?, (/?) с ?(?) <я«й/ ?/ «С ? (/' = 1. 2). PROOF. Apply Theorem 8.2 and Remark 8.4 with и being the supremum of all ?-tight FN-topologies on R. D The importance of the exhaustivity assumption in decomposition theorems becomes clear by the following remark. Remark 8.7. A measure ?: R -> G is exhaustive iff for every и е FN(/?) there are measures ?\,?2- R -> G such that ? = ?? + ?;>, ?? <sC и and ?? -L и. PROOF. =>· is proved in Theorem 8.2. «=. Let w be the ?-topology and и е FN(R. w). Let ??, ??: R -> G be measures with ? = ?? + ?2, ?? <sC и and ?? -L к. Denote by и, the ?,-topology (/' = 1, 2). It follows from Proposition 8.1(a) that u\Vut = w and u\ aui=0 and from Proposition 8.1(b) that u\ = u. Therefore u^ is a complement of и. It follows that FN(/?, w) is a complemented distributive lattice, hence a Boolean algebra. Therefore w is exhaustive by Remark 6.7. D For real-valued measures there is another method to obtain a general decomposition theorem, which is based on the Riesz decomposition theorem. The space ba(R) of all real- valued bounded measures on R is a Dedekind complete Riesz space. If ? is a band in ba{R) and ?-1 its disjoint complement, then any ? e ba(R) has a unique decomposition ? = ?\ + ?? with ?? e ? and ?? e ??. These decompositions are exactly the decompositions which can be obtained by Theorem 8.2. This follows from the next proposition. PROPOSITION 8.8. (a) For any и e FN(fl). ?(") ;= 1м е ?/?): ? « ") " ? band and BiuI- = {? e ??): ?-L и}. (b) For every band В in ba(R) there is an и e FN(/?) ??/с/г ?/??? ? = B(u).
FN-topologies and group-valued measures 737 PROOF, (a) (i) We first prove that B(u) is a band. Obviously B(u) is an ideal in ba(R). Let (??)??? be an increasing net in B(u) and ? e ba(R) with 0 < ?? | ?. We show that ? «Си. Let ? > 0, ? е /? with ?(?) > sup_e/^(;;) — ? and /3 e A with ?/jfa) > ?(?) —?. Then for у е /? we have ?(>) ^ ? if у ? ? = 0, and ?(?) — ?/?(?) ^ (? — ?/?)(?) ^ ? if у < ?. Therefore ?(*) = ?(* \?) + (?(.? ??) - ?/?(.? ??)) +?/?(.? ??)< 2? + ?^(^). It follows that {? e R: ??(?) ^ ?) С {.? е R: ?(?) ^ 3?). Therefore {? e ?: ?(.?) ?? 3?) is a O-neighbourhood in (R, и) for any ? > 0. Hence ? <sC и by Proposition 1.10. (?)???? e b?(/?)and? J_ и. We show that ? e б(и)-1-. Let ? e в(и) and ? — |?| ?|?|. Then vlu and v«u, hence, ? = 0. Let ? e б(м)". By Theorem 8.2 there are measures ??, ?: e foa(/?) with |?| = ?? + ?2, ?? « и, ?? J- и and ?,(?) С |?|(?) С [0.+??[, hence ?,- > 0 (/ = 1,2). ?? «Си means ?? е fi(w), and 0 ^ ?? < |?| е б(м)х implies ?? e ?(«)?· Hence ?? =0 and |?| = ?2 -L ?, hence ? J_ и. (b) Let и be the supremum of the ?-topologies, ? e B. Then ? с В(и). We now show that B(u) С S. Let ? e в(и). Then there are ?? e В and ?? e ?? with ? = ?? + ?2· Since ?2 -L ? for any ? e ?, we obtain ?? J_ и. On the other hand ?? <<C ? <SC к. Hence ?2 = 0 and ? = ? ? e В, ? Generalizing Theorem 8.2 we now give a decomposition of a measure into an infinite sum of measures. THEOREM 8.9. Let ?-.R -* G be an exhaustive measure and (ua)a€A be a family in FN(/?) such that ua J_ up for different indexes ?, ? e A. Then there is a unique family (??)??? of G-valued measures on R and a measure ?: R —» G such that 53„ед ??(?) is summable uniformly in ? e R, ? = ? + ???? Iх" and ? J_ ua, ?? <sC ua for any a e A. Moreover, the ?-topology is the supremum of the ??-topologies and the ?-topology, ?^) c]JXR) and ?„(?) С ?(/?) for a e A. PROOF (Sketch), We proceed as in the proof of Theorem 8,2, Let ?? be the infimum of ua and the ?-topology and aa be the corresponding elements in the completion R of (/?, u-v). Let ? be the continuous extension of ? on R, ??(?) = AaU ? aa) for ? e R and a e A and ? = ? — ??€? ??- As in Theorem 8,2 one proves that ? and (??)?€? have the desired properties, THEOREM 8.10 (Hammer-Sobczyk decomposition). Let ?: R -> G be an exhaustive measure. Then there is a family (??)?€? pairwise singular G-valued ultrafilter measures on R and a measure ? : R —» G with the following properties, A) Х]аед ??(?) is summable uniformly in ? e R, B) R is ?-chained, C) ? = ? + ]?^???. D) The ?-topology is the supremum of the ??-topologies and of the ?-topology, E) X(R) С ?(/?), ??(?) С ^(R)for a e А. PROOF. Apply Theorem 8,9 where (иа)а?Л is the family of all ultrafilter topologies on R and ua ? и? for different indexes ?, ? e A, Then R is ?-chained by Proposition 6,9,
738 ?. Weber since ? J_ ua for every a e A. Since ?? <? /<„. it follows from Corollary 6.10(a) that ?? is an ultrafilter measure or ?? = 0, Now replace (??)?€? by the family (??)?€?' where A' = {aeA: ????]. D The Hewitt-Yosida decomposition for group-valued measures and the Lebesgue decomposition Theorem 8,2 if и is generated by a single submeasure was proved by Drewnowski A973b). Drewnowski A973b, p, 47) posed the question whether Theorem 8,2 holds true for an arbitrary FN-topology u. The first proof of Theorem 8,2 was given by Traynor A976), The approach to decomposition theorems presented here is due to Weber A984a, 1984b). The Hammer-Sobczyk decomposition for group-valued measures was given in Weber A982a, 1982b), 9. Connectedness In this section we study connectedness of topological Boolean rings and the range of measures. It turns out that connectedness of the range of a measure ? : R —* G depends - in contrast to convexity - only on the ?-topology and not on the (geometric or topological) properties of the range space. THEOREM 9,1, Let и be a Hausdorff exhaustive FN-topology on R such that (R,u) is complete. (a) Then the following conditions are equivalent: A) R is atomless. B) (R, u) is connected. C) (R,u) is arcwise connected. (b) The following conditions are equivalent: A) R is atomic, B) (R, u) is totally disconnected. C) For some set A, (R.u) is algebraically and topological!)· isomorphic to the product {0, 1 }A endowed with the product topology with respect to the discrete topology on {0, 1}, PROOF, (a) B) =>· A), Let a e R \ Щ. Then [O.a] is connected as the image of (/?, u) under the continuous map R эн> л л ?. Therefore [O.a] cannot be finite. In particular, a cannot be an atom, A) =>· B), Suppose that (R.u) is not connected. Then R contains a clopen nonempty subset A, such that e ? A. Let С be a maximal chain in A. (c)cec considered as an increasing net is Cauchy by Proposition 2,4 and has therefore a limit s e A. Then s = sup С Since s' > 0 and R is atomless, [0. s'] contains a disjoint sequence (d„)„€^ of nonzero elements. Since и is exhaustive. dn -> 0 and therefore.? vd„ -> s in (/?, u). Since A is open, s ? dm e A for some m e N. Therefore CU(sv d,„) is a chain in A different from C, a contradiction to the maximality of С C) =>· B) is obvious.
FN-topologies and group-valued measures 739 B) =>· C), By Corollary 4,11 and Theorem 6.2 we may assume that и is metrizable. In this case the implication follows from the fact that any connected, locally connected, complete metric space is arcwise connected. (b) A) =>· C), By Theorem 6.2, и is order continuous and (L. ^) is complete. Since (/?, ^) is complete and atomic, R is algebraically isomorphic to the power set of a set A, So we may assume that R = {0, 1}A, The product topology on {0. \}A is order continuous and therefore coincides with и by the uniqueness statement of Corollary 4,9. C) =>· B) is obvious. B) =>¦ A). Suppose that R is not atomic. Then there exists an a e R \ {0} such that [?,?] is atomless. Then [?,?] is connected by (a) (A) =>· B)). Therefore (R.u) is not totally disconnected. ? The condition B) of Theorem 9.1(b) implies that (R. u) is compact. On the other hand, any compact Hausdorff topological ring without total zero divisors is by Anzai A943) totally disconnected. This shows that a further equivalent condition in Theorem 9.1(b) is that (/?, и) is compact. This is also contained in the following theorem which is a special case of Kaplansky's structure theorem for compact semi-simple rings, THEOREM 9.2 (Warner A993). Theorem 32.8). A Hausdorff topological Boolean ring is compact iff it is for some set A (algebraically and topologically) isomorphic to the product {0, 1}A. All the known proofs of Theorem 9.2 (=>·) are based on the deep result - already used in Anzai A943) - that any compact Hausdorff topological commutative group admits a non-trivial character. Only if the topology of the Boolean ring is generated by real-valued measures, there are also elementary simple proofs of Theorem 9,2 (=>·). THEOREM 9,3. Let ?;/? —» G be an exhaustive measure with values in a complete Hausdorff topological group, (a) If ? is atomless and (R. ?-topology) is complete, then ?(R) is connected. (b) If R is ?-chained, then ?(/?) is connected, PROOF. We may assume that the ?-topology is Hausdorff. Otherwise consider ?; R -> G on the quotient R = R|N(?) defined by ?(.?) = ?(?) for .? e л е R. (a) Under the assumptions of (a), R is by Theorem 9.1 connected with respect to the ?-topology. Therefore the continuous image ?(/?) is connected, (b) Let (/?, u) be the completion of (R. ?-topology) and ?: (R.u) -> G the continuous extension of ? on R. If R is ?-chained, then R is atomless by Theorem 6.16(a), i.e., ? is atomless. Therefore ?(/?) is connected by (a). Hence ?(R) = ji(R) is connected. D 10. Vitali-Hahn-Saks and Nikodym theorems We here present a proof of the Vitali-Hahn-Saks and Nikodym theorems based on Baire's category theorem. The idea of this proof goes back to Saks and is also used in Dunford
740 ?. Weber and Schwartz A957, III.7). With the aid of Drewnowski's Lemma 10.2 the finitely additive case can be reduced to the ?-additive case, cf. Drewnowski A973b). THEOREM 10.1. Let R be ?-complete, ?: R -> [0, +oo] ? ?-submeasure, (G, | |) a seminormed group and ?„ : R —> (G. | |). n e N. a pointwise convergent sequence of ?-continuous measures. Then {?„: n e N) is ?-equicontinuous. PROOF, Let ? > 0. The sets C,j := {x e R: |?,-(.?) - ?;Or)I ^ ?] and C„ = f|, 7>, cv are closed in (R, ?). R = (J^=i C„ since (?,,?.?)),,^ converges for any ? e R. Since (/?, /j) is complete by Theorem 3,6, by Baire's category theorem Cq has for some q e N an inner point a. Let S > 0 such that for any .v e /? with ?(?) ^? S we have |?,(.?)| ^ ? for /' = 1,..., q and ? ? ?, a \ ? e Сц, From the equality ?„ (jc) = ?? (jc) + (?„ (???)- ?9(? ? ?)) — (?„(? \ jc) — ?(,(? \ ?)) follows that |?„(.?)| ^ 3? for any n e N and ? e R with ?(?) ^ ?. This shows that (?,,),,^; is /j-equicontinuous. ? LEMMA 10,2 (Drewnowski's lemma). Let R be ?-complete, ?-.R^ [0,+oo] an exhaustive submeasure and (a„)/ieN a disjoint sequence in R, Then there is an infinite set I С N such that the restriction of ? on the ?-subring ?({??: n e /}) generated by (an)n€i is ?-order continuous. PROOF. We first prove that for any ? > 0 and any infinite subset Л с N there is an infinite subset ? of Л such that /j(sup,eiJ ?,) ^ ?: Let (An)nei; be a disjoint sequence of infinite subsets of A. Then the elements sup,^ a;, n e N, are disjoint. Since ? is exhaustive, f?(sup,eA/; ?,) < ? for some n e N. Using this fact, one can inductively find a decreasing sequence (/л)л6^ of infinite subsets of N such that /j(sup,e/; a,) ^ l/н and /'„ :=min/„ iIn+\. // e N. Let / = {/'«: ? e N). It is easy to see that ? is ?-additive on ?({?„: n e /}) = {sup ](zJ af. Ус/}. ? THEOREM 10.3. Let R be ?-complete, G a topological group and ?„ : R -> G. n e N. be a sequence of exhaustive measures converging pointwise to ? : R —> G. (a) Then {?„: n e N) is uniformly exhaustive: in particular, ? is exhaustive. (b) // for every n e ?, ?„ is continuous with respect to an FN-topology и on R, then {?„: n e N) U {?} is u-equicontinuous. (c) If?? is ?-additive for every n e N, then ? is ?-additive. PROOF, (a) Since it is enough to prove that (?„) is uniformly exhaustive with respect to any continuous seminorm on G, we may assume that G = (G. | |) is a seminormed group. Then the semivariation ||?„|| of ?„ is bounded by Corollary 2.7. Therefore there are numbers ?„ > 0 such that ? := ?^=? ?„||?„|| is a bounded function. It follows that ? is an exhaustive submeasure and ?„ is ^-continuous for every //. Suppose that (?„)„,=? is not uniformly exhaustive. Then there is an ? > 0, a disjoint sequence (a,,)n€i; in R and a subsequence (v„)„eN of (?„)„?? such that \v„(a„)\ ^ ? for every n e N. By Lemma 10.2, (a„)„eN has a subsequence (bn)„€\; such that the restriction of ? on S := o{{b„: n e N}) is ?-order continuous. Applying Theorem 10.1 to v„\s one obtains that (v„|5)„e^ is
FN-topologies and group- valued measures 741 /jls-equicontinuous. Since ??„) -> 0, it follows that sup,,, \v,„(b„)\ -> 0 (и -> +oo), a contradiction to | ?„(?„)| ^ ?. (b) follows from (a) and Corollary 7,2. (c) Let и be the supremum of the ?„-topologies, и e N. Then и is ?-order continuous if ?„, ? e ?, are ?-additive (see Proposition 4.2). By (b), ? is и-continuous, hence ?-addi- tive. ? PROPOSITION 10.4. Let ? be a set of measures on R with values in a seminormed group (G, | |), //sup{^(a„)|: ? e M, /ieN}< +<%> for every· disjoint sequence (а„)леК in R, then 5??{|?(?)|: ? e ?, ? e /?} < +эо. PROOF. Put ||U;J)/ieM|| :=supMeM |jr,,| for (л·,,) e GM and ? := {(л>)е GM: ||U;/)|| < +oo). Then ?(?) := (?(?))?€? defines an Я-valued measure on R. Since sup„ ||?(?„)|| < +00 for every disjoint sequence in R, we have supneK ||v(a)|| < +00 by Proposition 2.6, i.e., sup{|M(a)|: ? e M, a e /?} < +00. ? We now deduce - as in Drewnowski A974) - Nikodym's boundedness theorem from Theorem 10.3(a). THEOREM 10.5, Let R be ?-complete, ? a set of exhaustive measures on R with values in a seminormed group (G. | |) and sup eM |?(·?? < +oo for any ? e R. Then sup{^U)|: ? e ?, ? e R] < +00. PROOF. Suppose that sup{^(.v)|: ? e ?, ? e R] — +00. Then, by Proposition 10.4, there is a disjoint sequence (a„)„€i< in R and ?„ e ? such that |?„(?„)| ^ 11, ? e N. For ? e N, we define GK-valued measures on R by ?,,(?) = @,..,, 0, ??(?), 0,...), where ?„(?) occupies the «th place. We consider G1 with the seminorm defined by ||(jc„)|| := sup„(l//i)|;c„|. Then the measures ?„, не N. are exhaustive and ?,,(^) -> 0 for any ? e R. Therefore {?„: ? e ?} is uniformly exhaustive by Theorem 10.3(a), Hence II An (a«) II -> 0, a contradiction to ||A«(an)ll = (\ ? ?)\?„(??)\ > 1, ? Instead of the ?-additivity of /j it is in Theorem 10.1 enough to require that (R, ?) is a Baire space. It is natural to ask whether for an exhaustive submeasure ?: R -> [0, +00] on a ?-complete Boolean algebra (R, ?) is a Baire space (see Diestel and Uhl A977, p, 35) andLabudaA972,p. 455). The examples contained in Armstrong and Prikry A982), Arias de Reyna A983), Basile and Weber A986) show that this is not the case. References Ajupov, S.A,. Cilin, V,I,, Hadjiev, D. and Sarymsakov, T.A. A983). Ordered Groups, Usbek Academy of Sciences. Tashkent (in Russian). Aliprantis. CD. and Burkinshaw. O. A978). Locally Solid Riesz Spaces, Academic Press, New York. Anzai. H. A943), On compact topological rings. Proc. Imp. Acad. Tokyo 19. 613-615. Arias de Reyna, J. A983), Non-Baire measure spaces. Collect. Math. 34. 109-114.
742 ?. Weber Armstrong. Т.Е. and Prikry. К. A982). On the semimetric on a Boolean algebra induced by a finitely additive probability measure, Pacific J. Math. 99. 249-264. Ascherl. A. and Lehn. L. A977). Two principles for extending probability measures, Manuscripta Math. 21, 43- 50. Basile, A, A984), A Lebesgue-type decomposition theorem for topologies on rings of sets. Rend. Accad. Sci. Fis. Mat. Napoli 51 D), 61-65. Basile, A. and Weber, H. A986). Topological Boolean rings of first and second category. Separating points for a countable family of measures, Rad. Mat. 2. I 13-1 25. Bhaskara Rao, K.P.S. and Bhaskara Rao, M. A977), Topological properties of charge algebras. Rev. Roumaine Math. Pures Appl. 22. 363-375. Bhaskara Rao, K.P.S. and Bhaskara Rao, M. A983), Theory of Charges. A Study of Finitely Additive Measures, Pure Appl. Math., Vol. 109, Academic Press, New York. Bierlein, D, A966), Uber die Fortsetzung von Wahrscheinlichkeitsfeldem, Z. Wahrscheinlichkeitstheorie verw. Gebiete 1, 28^46. Bogdan, V.M. and Oberle, R. A. A978). Topological rings of sets and the theory of vector measures. Dissertationes Mathematicae 154,5-70. Brook, С A984). Decompositions of submeasures, Canad. J. Math. 36, 577-590. Carlson. T. and Prikry. K, A982). Ranges of signed measures. Period. Math. Hungar. 13. 15 1-155. Constantinescu, С. A976а), The range ofutomless group valued measures. Comment, Math. Helv. 51 B). 207- 213, Constantinescu, С A976b), Atoms of group valued measures. Comment. Math. Helv, 51 B), 19 1-205, de Lucia. P, A985). Funzioni finitamente additive a valori m un gnippo topologico. Quademi dell'Unione Matematica Italiana, Vol. 29, Pitagora Editrice, Bologna. Diestel, J. and Uhl. J.J. Jr. A977), Vector Measures. Math. Surveys. Vol. 15, Amer. Math, Soc. Providence, RI. Dobrakov, I, A974), On submeasures I. Dissertationes Mathematicae 112. 5-39. Dobrakov, I. and Farkova, J. A980). On submeasures II. Math. Slovaca 1. 65-81. Drewnowski, L. A972a), Topological rings of sets, continuous set functions, integration I. II. Ill, Bull. Acad. Polon. Sci. Ser, Sci. Math. Astronom. Phys. 20. 269-276. 277-286, 439-445. Drewnowski, L. A972b), Equivalence of Brooks-Mvett. Vitali-Halm-Saks and Nikodym theorems, Bull. Acad. Polon. Sci. Ser, Math. Astronom. Phys. 20, 725-731. Drewnowski, L, A973a). Uniform boundedness principle for finitely additive vector measures. Bull. Acad. Polon. Sci. Ser. Math. Astronom. Phys. 21, 115-118. Drewnowski. L. A973b), Decomposition of set functions, Studia Math. 48, 23^48. Drewnowski. L. A974), On control submeasures and measures. Studia Math. 50, 203-224. Drewnowski, L. A974-1975). On complete submeasures. Comment. Math. Prace Mat. 18. 177-186. Dunford, N. and Schwartz, J.T. A957), Linear Operators. Part I: General Theory, Interscience. New York. Frechet, M. A919-1920), Sur divers modes de convergence d'une suite de fauctions d'une variable. Bull, Calcutta Math. Soc, 11, 187-206. Hejcmann, J. A959), Boundedness in uniform spaces and topological groups, Czechoslovak J. Math. 9, 544-562. Herer, W, A976), On the extension of measure with values in a topological group. Comment. Math. Prace Mat. 19, 73-80. Hoffmann-J0rgensen, J. A97 I), Vector measures. Math, Scand, 28. 5-32. Khurana. S.S. A976), Extensions of exhaustive submeasures. Bull. Acad. Polon. Sci, Ser. Sci. Math. Astronom. Phys, 24, 213-216. Khurana, S.S, A979), Submeasures and decomposition of measures, J, Math. Anal. Appl, 70. I I I-1 13. Labuda. I. A972), Sur quelques generalisations des theoremes de Nikodym el de Vitali-Hahn-Saks, Bull, Acad. Polon. Sci. Ser. Sci. Math. Astronom. Phys. 20, 447-456. Labuda, I, A975), Sur les mesures exhaustives el certaines classes d'espaces vectoriels topologiques consideres par W. Orlicz et L, Schwartz. C. R. Acad. Sci. Paris Ser. A-B 280, Ai. A997-A999. Landers, D. A973), Connectedness properties of the range of vector and semimeasures, Manuscripta Math. 9, 105-112. Lembcke, J, and Weber, H, A992). Decomposition of group-valued and [0,oc)-valued measures on Boolean rings. Measure Theory (Oberwolfach, 1990). Rend, Circ. Mat. Palermo B) Suppl, No. 28, 87-1 16.
FN-topologies and group-valued measures 743 Lipecki, Z. A974), Extensions of additive set functions with values in a topological group. Bull. Acad. Polon. Sci. Ser. Sci. Math. Astronom. Phys, 22, 19-27. Lipecki, Z. A980a), A generalization of an extension theorem of Bierlein to group-valued measures. Bull. Acad. Polon. Sci. Ser. Sci. Math. 28, 441-445. Lipecki, Z. A980b), The Countable Chain Condition and Restrictions of Group-Valued Additive Set Functions, Bull. Acad. Polon. Sci. Ser. Sci. Math. 24, 363-365. Lipecki, Z. A987), Tight extensions of group-valued quasimeasures, Colloq. Math. 51, 213-219. Maharam, D. A987), From finite to countable additivity, Portugal Math. 44, 265-282. Nachbin, L. A965), Topology and Order, D. Van Nostrand Company, New York. Nikodym, O.M. A929), Sur les functions d'eseinples, Comptes Rendus du I Congres des Math, des Pays Slaves, Warsaw, 304-313. Nikodym, O.M. A930), Sur une generalisation des integrates de M.J. Radon, Fund. Math. 15, 131-179. Rickart, C.E. A943), Decomposition of additive set functions, Duke Math. J. 10, 653-665. Sion, M. A969), Outer measures with values m a topological group, Proc. London Math. Soc. 19, 89-106. Traynor, T. A972), A general Hewitt-Yosida decomposition. Canad. J. Math. 24, 1164-1169. Traynor, T. A973), S-bounded additive set functions, Vector and Operator Valued Measures and Applications (Proc. Sympos., Alta, Utah, 1972), Academic Press, New York, 355-365. Traynor, T. A976), The Lebesgue decomposition for group valued set functions. Trans. Amer. Math. Soc. 220, 307-319. Vladimirov, D.A. A972), Boolesche Algebren, Mathematische Lehrbucher und Monographien, B. 29, Akademie- Verlag, Berlin. (Translated from the Russian and edited by Gunther Eisenreich.) Warner, S. A993), Topological Rings, North-Holland Math. Studies. Vol. 178, North-Holland, Amsterdam. Weakley, C.B. A987), On the lattice of Frechet-Nikodym topologies, J. Math. Anal. Appl. 122, 385-392. Volkmer, H, and Weber, H. A983), Der Wertebereich atomloser Inhalte. Arch. Math. (Basel) 40 E). 464-474. Weber, H. A976), Fortsetzung von Massen mil Werten in unifonnen Halbgruppen. Arch. Math, 27, 412^423. Weber, H. A980), Ein Fortsetzungssatz fur gruppenwertige Masse, Arch. Math. 34, 157-159. Weber, H. A982a), Die atomare Struktur topologiscber Boolescher Ringe und s-beschriinkter Inhalte, Studia Math. 74, 57-81. Weber, H. A982b), Vergleich monotoner Ringtopologien und absolute Stetigkeit von lnhalten, Commentarii Mathematici Universitatis Sancti Paul! 31, 49-60. Weber, H. A984a), Topological Boolean rings. Decomposition of finitely additive set functions, Pacific J. Math. 110,471^495. Weber, H. A984b), Group- and vector-valued s-bounded contents. Measure Theory (Oberwolfach, 1983), Lecture Notes in Math., Vol. 1089, Springer, 181-198. Weber, H. A986), Compactness in spaces of group-valued contents, the Vitali-Hahn-Saks theorem and Nikodym's boundedness theorem, Rocky Mountain J. Math. 16, 253-275. Weber, H. A994), Minimal ring topologies, Adv. Math. 109, 140-15 I.
CHAPTER 17 On Products of Topological Measure Spaces Stratos Grekas Department of Mathematics, Section of Mathematical Analysis, University of Athens, Pimepistemiopolis, 15784 Athens, Greece E-mail: sgrekas@math.uoa.gr Contents Introduction 747 1. A central problem in topological measure theory 747 1.1. Review of terminology 748 1.2. On product measures 748 1.3. Fremlin's example and related results 751 2. On the 'product-like' structure of the Haar measure on a compact group 752 3. Topological liftings for product measures 755 3.1. Baire, Borel and Strong liftings 756 3.2. On the existence of Baire strong liftings 757 Acknowledgements 762 References 762 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 745
This page intentionally left blank
On products of topological measure spaces ?4? Introduction One of the most important achievements in Lebesgue's theory is related to the double integral, Euler, in 1780, had already used a reduction formula of a double integral to two single successive ones. Although Riemann's integral was limited, mainly, in continuous functions of two variables, there were no restrictions in Lebesgue integral; that is exactly Fubini's theorem A907), Lebesgue theory was completed by Radon's monograph, which is presented in Caratheodory A918), The author does not intent to refer to the history of the 'successive integration' notion or the 'product measure' notion; nevertheless, he notes that, from that time up to nowadays, every integration theory which was established, was also accompanied by some appropriate notions of product measure. In the duration of this procedure, a question arisen; which properties of each one of the-finite or infinite-factors can also be valid in a product measure (this means they are productive properties)? The present chapter is, mainly, concerned with this question in the framework of topological measure theory. The article is organized in three parts. In Section 2 the question whether the product of topological measure spaces is again a topological measure space is discussed. Section 3 studies, particularly, the Haar measure on a compact group and emphasizes its connections with a usual product measure on a product of compact metric spaces; its object is the far reaching conjecture of J,R, Choksi, that complete analogies of certain measure-theoretic properties of such a product space are valid for compact groups. Section 4 deals with topological (Baire, Borel, strong) liftings for a product measure; the discussion yields a unified measure-theoretic treatment of Baire strong liftings for a class of locally compact ?-compact groups and for a class of products of separable metric spaces. The choice of these three precise themes, instead of a - usually seen - extensive survey (on product measures) is not only due to my own interest, but also to the completeness of Fremlin B00?) (a very interesting work) which (as far as I know) will sufficiently overcover such a survey. The reader will notice that the main point of this chapter is the conjecture of J,R. Choksi; this conjecture led my occupation with this area. So, I am pleased to dedicate the chapter to Jal Choksi. 1. A central problem in topological measure theory The main problem in this section is whether the product of a family of topological (probability) spaces is a topological measure space. Although the (vague) description of the problem is quite simple, it includes some particular questions, like the following ones: • What kind of measures do we consider? • Is such a product well defined? • Is the (finite or infinite) product of such measure spaces of the same nature like the one of each of the factors? For convenience (and simplicity), only ?-additive probability measures on completely regular Hausdorff spaces are considered (Varadarajan, 1965; Gardner, 1975; Fremlin,
748 S. Grekas 200?). The reader will easily check which of the results hold for quasi-Radon measure space (Fremlin, 1982). For those who are interested in a higher level of generality, see Fremlin A984), Gardner and PfefferA984). 1.1. Review of terminology Let (?, ?,?) (simply denoted by (?,?)) be a probability measure space, where X is a completely regular space and ? a ?-algebra of subsets of X, The family of Baire, respectively Borel, sets in X is the ?-algebra ??o(X), respectively ??(X), generated by the zero, respectively closed, sets. A Baire measure ? on X is called completion regular if every open set in X is ?-measurable. Note that the terminology is due to Halmos A950), but the concept of completion regularity was discussed earlier by Kakutani; see Choksi A984), also Theorem 1.3, this section. A topological probability space is a probability measure space (?, ?) - or just ? - such that every open set in X is measurable. A topological probability space (?, ?) is called ?-additive, respectively completion regular, if for every family (i/,),e; of open sets in X, ? ( [J t/,¦ J = supj ? ? [J U,j: J С I countable J. respectively its restriction to the Baire sets is completion regular. For further information on ?-additive and completion regular measures, see Knowles A967), Kirk A969). Gardner A981) gives is a detailed exposition with an extensive bibliography. 1.2. On product measures Suppose that (?,?',?), (?. ?, ?) are (?-additive) probability spaces. One can form product measures in various ways. A natural product measure notion is the one of simple (completed) product measure ? = ? ? ?, that is, the completion of the usual product measure ? = ? ? ?, defined on the product ?-algebra ? <g> ?. This measure is identical with the ordinary (completed) product measure, derived by Caratheodory's method from the outer measure ?* (see, e.g., Fremlin A984. A6K)). The simple (completed) product ?,?/?, on Y\j€i Yi, for any family {(У,, ?,). /' e /} of probability spaces, can also be similarly defined; cf. Halmos A950). Since ?, ? are assumed to be ?-additive. ? does have a unique ?-additive extension ? <g> ? to the Borel sets in ? ? У (we will take it that ? <g> ? is to be the completion of its restriction to B(X ? ?)); cf. Shwartz A973), Fremlin B00?). Let now (X),e/ be a family of topological spaces and X = П/е/ %>¦ ^ 's clear that ??(X), respectively Bq(X) contains the product ?-algebra <g)j€l B(X,). respectively <8>,e/ Д)№), but in general the inclusion may be strict (cf, Gardner A981), Fremlin et al. A996) and references there). The following result was proved in a special case by Ressel A977), while it was proved by Fremlin A982) in full generality in the quasi-Radon measures wider field (see also Gryllakis and Koumoullis A990)).
On products of topological measure spaces 749 THEOREM 1.1. If ?, is ? ?-additive probability measure on X,-, /' e /, then there exists a unique ? -additive measure ? on X, extending the simple product ?,<=/?,. ? is called the ?-additive product of the family (м,),е/ and we write ? = ®,?/^' (if X, all are compact spaces, then ? is the Radon product of the ?,, see Bourbaki A965), FremlinA989)). The problem will be discussed in this sub-section is the below mentioned. PROBLEM 1.2. Underwhich (topological or measure-theoretic) conditions, the -finite or infinite -product of topological probability spaces is also a topological probability space (and consequently, when the product of completion regular spaces is completion regular)! The story of this problem started in 1943. when Kakutani A943a. 1943b) presented the following THEOREM 1.3. Let (?,,?,), / e /, be an uncountable family of strictly positive probability spaces, where the X, are all compact metric spaces. Then every Borel set in X = ???/ ^' 's ®ie/ ??-measnrable. The key point of the proof of Theorem 1,3 is the following assertion, which will be mainly discussed in Section 4, ASSERTION 1.4. Every open set in the product space has an open Baire cover of the same measure. In A958a), Maharam, using a very special well ordered martingale/derivation argument, proved the following significant result. THEOREM 1.5. There exists a strong, (completion) Baire lifting for the usual product measure on the Cantor space {0, \}r, ? arbitrary. Thus, the product measure on {0, l}r satisfies Assertion 2.4 and so. Theorem 1.5 provides a different proof of Theorem 1.3. Evidently, Theorem 1.5 indicates some connection between strong lifting property and completion regularity; however, as Choksi A984) says, this connection is somewhat elusive (detailed information is also provided in Fremlin B00?)). In A976), Fremlin "making ingenious use of an earlier result of Erdos and Oxtoby A955), showed that the product of the hyperstonian space of Lebesque measure on [0, 1] with itself is not completion regular" (see comments in Choksi A984)). On the other hand, Choksi and Fremlin A979), in the course of a study concerning completion regular measures, generalized Kakutani's theorem as follows: THEOREM 1.6. Let ?, ?,, / e /, be completion regular Radon probability measures on X, Yj, i e /, respectively, where X is any compact space and the У, are compact metric spaces. If the ?, are, all, strictly positive, then the Radon product measure ? <g> (<S>,e/ ?') is completion regular
750 S. Grekas In the same article, the authors posed the question whether the product of two completion regular measures, each on a product of compact metric spaces, is completion regular. The answer was an affirmative one (Gryllakis, 1988), Grekas and Gryllakis A991), using Fubini's theorem, obtain an extension of Theorem 1,6: THEOREM 1,7, Let X bea compact space, (Yj)jei a family of compact metric spaces and ?, у completion regular measures on X, Y\.€j Yj, respectively. Then, subject to Martin's axiom and the negation of the continuum hypothesis, ?® ? is completion regular. Gryllakis and Koumoullis A990) connected the notion of completion regularity and that of ?-additivity and showed the more general theorem. THEOREM 1,8. Let (X,),e/ be a family of separable metric spaces, X = fl,e; X; and ? a probability Baire, respectively Borel measure on X. (a) If ? is completion regular, then ? is ?-additive , (b) The following conditions are equivalent', (i) ? is completion regular: (ii) for every ?-additive probability Borel measure space (?, ?), (? ? ?, ? ? ?) is a topological measure space; (iii) for every ?-additive completion regular probability Baire, respectively Borel measure on a completely regular space Y, the ?-additive product ? <g> ? is completion regular. Remark 1,9. In case that ? is a (?-additive) Borel measure, then the three conditions are equivalent to Assertion 1,4, In the same paper, the authors verify the following interesting fact: a ?-additive product measure ®,e/ M/ on апУ product П/е/ %> ls completion regular, provided that each finite sub-product is completion regular. Thus, Theorem 1,3 of Kakutani could be, in some sense, extended to the general setting of arbitrary - not, necessarily, metrizable - topological spaces; for the precise statement, see Gryllakis and Koumoullis A990, Theorem 2,9), The research was extended to quasi-dyadic spaces. Following Fremlin and Grekas A995), a topological space is quasi-dyadic if it is the continuous image of a product of separable metric space. The next two referred results are the main ones of that paper, THEOREM 1.10, Let (?, ?) be a completion regular topological probability space. If X is quasi-dyadic, then ? is ?-additive, THEOREM 1.11. Let (?,?) and (Y,v) be topological probability spaces; suppose that ? is quasi-dyadic, (Y, v) is completion regular and that (?, ?) is ?-additive. Then, (? ? ?, ? ? у) is a topological measure space. The proof of both results, which apply many combinatorial arguments, exploits the simple fact that the inverse image (under the corresponding continuous surjection) of any
On products of topological measure spaces 751 Baire set in the quasi-dyadic space is effectively determined by a "small" (in fact countable) set of coordinates; cf. Fremlin B000). Remark 1.12, A) In Fremlin and Wajch A993) there are interesting examples which show that the class of compact quasi-dyadic spaces is not identical to the one of dyadic spaces. B) As far as we can see, Assertion 1.4 characterizes the completion regularity, provided that the underlying topological space is (homeomorphic to) a product of separable metric spaces. It is not clear, at least for the author, when same thing happens in the (general) case of quasi-dyadic spaces, C) Section 2,2 is, essentially, an investigation into the structure of product measures; that is what is discussed here and not the applications in other mathematics areas. However, Problem 1,2 is not of purely measure-theoretic interest; it is also closely related to important problems in Functional Analysis and Probability theory; for instance, it worths to refer the notions of "stability" and "R-stability" as they are described in Fremlin A983), Talargand 1984). In Talagrand A987a, 1987b) are also presented various interesting related results. 1.3. Fremlin's example and related results Let (S, y) be the hyper-stonian space of power Lebesgue measure ?" on [0, 1]". Fremlin has therefore shown that not all the open subsets of S ? S are (? ? v)-measurable. By Urysohn's embedding theorem, S can be identified with a compact subset of [0, l]y, ? = w(S) and ? is then a measure on [0, \]Y with suppv = S, It is, therefore, obvious that the measure ? = \v + ^?? has PROPERTY 1.13, The open sets in [0, l]y ? [0, 1]'' arenotall (? ? \x)-measurable. In spite of the fact that 5иррд = [0, l]y, there is insufficient information about the relation that ? and ?? have; we do not even know how the projections of ? on each topological factor [0, 1], behave. This lack of sufficient information led Gryllakis and Grekas A999), to use, expect of all the others, many of Fremlin's arguments and thus to construct a Radon probability measure ? on [0, 1]'', ? ^ с (:= the continuum), satisfying the following conditions: (ii) supp$ = [0.1F; (ii) ? satisfies Property 1.13; A3) for every F с ? finite, the projection of ? on [0, 1]^ is a measure equivalent to the F-dimensional Lebesgue measure ?^. Remark 1.14, As it has already been noticed, on [0, l]y, ? ^ c, we can construct - in various methods - measures with Property 1.13, The author does not know what the situation is for ? < с (under Martin's axiom). In the second part of Gryllakis and Grekas A999), the above construction is generalized, in a certain way, by replacing the space [0. \]Y with an arbitrary compact group G, The
752 S. Grekas main result of their paper is the following "compact group analogue" of the construction in[0, IF. THEOREM 1.15, Ifw(G) ^c, then there exists a Radon probability measure ? onG with the properties: (ii) supp? = G; (ii) ? satisfies Property 1.13; Оз) far every compact Lie factor group G/H, the projection of ? on G/H is - as a measure - equivalent with the normalized Haar measure on G/H. 2. On the 'product-like' structure of the Haar measure on a compact group The above mentioned observations show that the study of the product measure on a product of compact metric spaces has played a key role in the evolution of the topological measure theory. But, the development of the theory was connected with the investigation about the Haar measure, in the same time. Since 1940s there has been a systematic effort - as it is obvious by the bibliography - for searching after common properties for the Haar measure and such a product measure. Kakutani and Kodaira A944) had already proved, by an ingenious adaptation of the original proof of Theorem 1.3, that the Haar measure on a locally compact ?-compact group is completion regular (for information on the completion regularity of the Haar measure, the interested reader could consult Halmos A950). Mongomery and Zippin A955), Hewitt and Ross A963) and Choksi A984), a reference of particular interest). One can say, roughly speaking, that this result is the group analogue of Theorem 1.3 valid for product measures. This analogy does not derive only by the measure-theoretic properties of the Haar measure; it seems to be closely connected with the fact that a locally compact ?- compact group is the projective limit of (a particular family of) metrizable groups indexed by closed normal subgroups (the reader worths to pay attention at Choksi A984), p. 84). Indeed, in A973), Choksi, applying in the area of compact groups arguments valid for generalized cubes (cf. Maharam A958a), Choksi A972). Graf A982)) proved the following remarkable result. THEOREM 2.1. If ? is a Baire measure on a compact group G, then every automorphism of the measure algebra of ? can be induced by an invertible, completion Baire measurable transformation of G. The proof uses an interesting method of combining the classical Weil-Pontrjagin theorem with ideas of Choksi A972). Choksi and Simha A978) remarked that the proof of Theorem 2.1 "seems to depend on topological properties of homogeneity of the space". Confirming this vague assertion, these authors proved, among others, the following theorem. THEOREM 2.2. Let G be a locally compact ?-compact group, L a closed subgroup of G and ? a ?-finite Baire measure on (the homogeneous space) G/L. Then, every
On products of topological measure spaces 753 automorphism of the measure algebra of ? is induced by an invertible, completion Baire measurable point transformation ofG/L. It is worthwhile to note, here, that the purpose of this section is not to survey the whole work on measures on compact groups, but, mainly, to focus the reader's interest on the similarity that exists between Haar measure and product measure. In A979), Choksi and Fremlin wrote an article in which a large class of results in topological measure theory, usually proved for Polish spaces, was shown to hold in any uncountable product ???/ %i °f compact metric spaces. Two Radon probability measures on П/е/ ^i'> whose measure algebras are homogeneous of the same Maharam type, where shown to be (completion) Baire isomorphic. Also, the authors, combining their results with Maharam's famous theorem, arrived at some surprising theorems, some of whose proofs do not assume any set-theoretic hypotheses. The main of the latter are included in the statement (for a precise statement, see Choksi and Fremlin A979, Theorem 5)): PROPERTY 2.3. For a large, in fact cofinal, class of cardinals I, any two completion regular measures on П,е/ ^>· are completion Baire isomorphic. Choksi A984) asks "whether analogues of the results of Choksi and Fremlin - without additional set-theoretic assumptions - are valid for compact groups (and possibly even for homogeneous spaces)". This important conjecture was based on the one hand, on Kakutani and Kodaire theorem, and on the other on Theorems 2.1 and 2.2. Furthermore, this conjecture implies the necessity to find an intimate connection between Haar measure and product measure. Choban A994) had already proved that every compact group is Baire isomorphic to a product of compact metric spaces. In a subsequent paper, the author rediscovered the same result and proved that such a Baire isomorphism can be constructed so that it takes Haar measure to a product measure (Grekas, 1994). Thus, the Haar measure on a compact group plays a similar role to that of a product measure in the structure theory of measure algebras on a product of compact metric spaces (similar results can be proved for compact homogeneous spaces (Grekas, 1995)). Shortly before the previously mentioned, the author attempted, bearing in mind Choksi's question, to obtain "compact group analogues" of the results of Choksi and Fremlin A979). Grekas A992a) deals with the isomorphisms of measure algebras on compact groups and their inducing point maps. Modifying the proof of Theorem 2.1, the author first proves that two (Radon probability) measures on the group, whose measure algebras are homogeneous of the same type, are completion Baire isomorphic. The rest of the paper, as Choksi and Fremlin did, studies how many non-isomorphic completion regular measures can exist on the group. Using approximation by Lie groups, it is shown that the compact group analogue of Property 2.3 is valid (see Theorem 5.A and Remark 5.13C) there); note that to carry out the extension to compact groups, it is used, in a crucial way, the fact - incidentally established there - that the Haar measure on any uncountable compact group G is homogeneous of Maharam type w(G). The author states nextly the following theorem. THEOREM 2.4. IfG is a compact group with w(G) ^ ?, then the Maharam type of G is equal to the Maharam type of {0, 1}",G).
754 S. Grekas Although the statement was correct, there were some gaps in the proof; see also corrigendum (Grekas, 1994) (a detailed account of these problems is given in Godin B000)). Remark 2.5. Theorem 2.4 leads to an affirmative answer to Choksi's question, not only regarding the results free on additional set-theoretic assumptions but also regarding the remaining results of Choksi and Fremlin. A correct proof of Theorem 2.4 was done in Grekas and Mercourakis A998), where the measure-theoretic structure of topological groups is further more investigated. More precisely, Theorem 2.4 is an immediate consequence of the following structure theorem (Grekas and Mercourakis, 1998, Theorems 1.1 and 1.4): THEOREM 2.6. Let G be a compact group with w(G) = a ^ ?. There exist two families Gf, ??, ? < a, of compact metric groups (each having at least two points if a ^ ?+) and two continuous open surjections such that where ??, ?, ?$ denotes the (normalized) Haarmeasure on Gf, G, Щ, respectively. Roughly speaking, the measure ? is compressed between 0^ ?? and 0^ ?? so that the maps /, g are very close to being continuous epimorphisms (see Remarks 1.2 and Corollaries 1.3, 1.6 in the same paper). Theorem 2.6, whose proof is based on classical results of Pontrjagin-van Kampen and Mostert (see Price A997); also references given in Grekas and Mercourakis A998), extends and refines the well-known structure theorem of Kuzminov A959): every compact group is a dyadic space (Comfort et al., 1992) contains many interesting insights into the structure of topological groups). The remaining part of Grekas and Mercourakis A998) deals with the structure of the Jordan algebra of the Haar measure. As it is already mentioned, the Haar measure on a compact group G is Baire isomorphic to the Haar measure on some product X = П$<и.(С)^? °f comPact Lie groups. The authors, therefore, obtain a stronger result, in fact that the isomorphism can be chosen to be a "Jordan isomorphism" (see Definition 2.6 and Theorem 2.13); this, particularly, implies that the space of Riemann integrable functions on a compact group is isometric to that of Riemann integrable functions on some product of compact Lie groups (Corollary 2.15 there). Additionally, several applications of the above results are given. Among these, the authors obtain a direct proof of the existence of a strong - not necessarily invariant - lifting for the Haar measure on a compact group and a rather illuminating proof of
On products of topological measure spaces 755 the existence of uniformly distributed sequences on a compact separable group (Veech, 1971). Remarks and questions 2.7. A) Grekas A995) showed, using Furstenberg's A963) theorem that the phase space of any minimal distal flow is Baire isomorphic to a product of compact metric spaces; the isomorphism takes some invariant, completion regular probability measure to a product measure. According to the previously referred, the author believes that this isomorphism can be chosen to be also a Jordan isomorphism. B) It is easy to verify that if two Radon probability spaces have isomorphic Jordan algebras, then they have isomorphic measure algebras are hence the same Maharam type (Grekas and Mercourakis, 1998). The converse is not true, as it follows from Examples A) and B) in the same paper. In view of these examples, it is natural to ask whether there exists a strictly positive measure ? on X = {0, \}a ,? ^ ?>+, homogeneous of Maharam type a, such that its Jordan algebra is not isomorphic to the Jordan algebra of the Haar measure on X (of special interest is the case ?+ ^ a ^ c; notice that a similar question has been posed in Mercourakis A996)). C) Associate with any topological space X is the Boolean algebra ?(?) of Baire sets modulo sets of first category. Given an automorphism ? of ?(?) it may not be possible to find a point transformation of X that induces ? (Choksi et al., 1987). However, when X is a product of complete separable metric spaces, it is shown in Maharam A979) that any automorphism of ?(?) can be realized by an invertible, Baire measurable transformation of X. By combining Maharam's result with ideas of Grekas and Mercourakis A998), it is possible to prove the same result in the case when X is any compact group (it is a natural question whether category results are valid for compact groups). D) Methods which were developed in the proof of Theorem 2.6 can lead to some results concerning the "homeomorphic measure problem" for compact connected topological groups (Grekas and Mercourakis, 2002). For more information on this problem, see references in Prasad A981), Morris and Peck A983, 1984), Gryllakis A989). 3. Topological liftings for product measures Theorem 1.5 is, perhaps, the first published result which connects the meanings among them, lifting, topology and product measure. Clarifying elements about the history of this theorem can be found in Choksi et al. A987), cf. Remark 4.5 in Fremlin A989). This section emphasizes on topological (strong, Borel, Baire) liftings for product measures. As far as simple - not necessarily topological - liftings, the reader has to refer to A. lonescu Tulcea and С lonescu Tulcea A969), Fremlin B00?), Strauss et al. B002). The terminology and notation are the same with the ones of Section 2, but, in some cases, the considered measure spaces (which from the beginning are supposed to be localizable) are not necessarily bounded. All unexplained notions can be found in A. lonescu Tulcea and С lonescu Tulcea A969).
756 S. Grekas Let (?, ?) be a topological measure space and ? a Boolean subalgebra of the ?-algebra ? ? =??(?) of ?-measurable sets in X. A Boolean homomorphism ?: ? -> ?? is called a lifting for ? if ? ~ ? A and ? ~ В implies ? A = pB for ?, ? e ? (where ~ denotes equivalence modulo negligible sets). The lifting ? is called strong, respectively Borel, Baire if U с pU for every open set U e ?, respectively pA e B(X),Bo(X) for all A e ?. If ? is a lifting for ?/;, then we say that ? is a lifting for ? (if there is a strong lifting for ?, we also say that (?, ?) - or, just ? - has the strong lifting property SLP). The notion of (multiplicative) lifting /, linear lifting, etc., for (the algebra of bounded measurable functions) ?/°°(?,?), in the sense of lonescu Tulcea, can also be similarly defined (p and / are related, in a biunique way to each others, by фрл = 1(Фа), where ? denotes characteristic function). 3.1. Baire, Bore! and Strong liftings Problem 1.2 can get, in the liftings language, various forms, as, e.g.: (Qi) Is the SLP a productive property? (Note that, in this point, product means ?-ad- ditive product measure.) (Q2) Which topological measure spaces have the "product-strong lifting property" (PSLP)? (This means their product with any space having the SLP has again the SLP.) Qi is an open question, even if we consider two Radon measures on compact spaces (Kupka, 1983, Question 3.3). However, since the completion regularity of measures on products of compact metric spaces is a productive property, it is natural to ask: (Q3) If ? and ? are completion regular strictly positive measures on products of compact metric spaces (of weight > ?+) and if ?, у both have the SLP, then does ? ® ? has the SLP? As far as Q2 is concerned in the setting of compact spaces, there are in the literature very special examples of measure spaces (?. ?) having this property: • X is any product X = JT X, of compact metric spaces and ? = 0 ?, with supp ?, = X, for all/; • X is a compact group and ? the Haar measure. The first one (Grekas, 1987) is an (easy enough) extension of a noticable result, Theorem 5 in A. lonescu Tulcea and С lonescu Tulcea A969, Chapter VIII). A crucial step in its proof was the following "extension theorem" from the one factor to the product A. lonescu Tulcea and С lonescu Tulcea A969, Chapter VIII, Theorem 4); further generalizations of the lonescu Tulcea theorem are given in Fremlin B00?), Strauss et al. B002). THEOREM 3.1. Let Z = T ? X, where ?, ? are compact spaces and let ?, ? be (positive) Radon measures on T, X respectively, with supp ? = T, supp ? = X. Suppose that: (i|) X is metrizable; A2) r is a strong lifting for v. Then, there is a strong lifting ? for ? <g> ? such that p(f <g> g) = rf <g> g, for all f e M°°(T, v), g e C(X) (in particular, ? is an inverse lifting of r under the canonical projection, in the sense of Strauss et al. B002)).
On products of topological measure spaces 757 We will afterwards see that the extension procedure of a lifting from the one factor to the product has some additional interest. If one seeks to obtain similar results about Borel or Baire liftings, must bear in mind that the non-existence of Borel (linear) liftings (for many non-complete probability spaces (У, В, у)) is consistent with ZFC. This was established by an articles series, in the center of which is Shelah A983); there was proved that the usual Borel measure on [0, 1] consistently does not have a lifting. Various improvements of that result followed, see Burke and Just A991), Burke A993b), Burke and Shelah A992). In the latter are presented three examples of particular interest: A) ? = [0, 1]A, B = B(Y), ? is the usual product measure; B) ? = {0, 1}A, B= B(Y), ? is Haar measure; C) (?,?, ?) is any probability space, ? = [0. 1] ? ?.?= ?([0?])® ?. The following problem, which is related to this discussion, arises (see, e.g., Fremlin A993)): (F) Can there be a lifting r of ([0, 1]-, B([0, 1])-, mj) ( where mi is the restriction of Lebesgue planar measure) which is consistent, i.e., such that if ?, F e B([0, 1]), then r(E ? F) is of the form ?" ? F'l Notice that, by a theorem [CH] of Mokobodzki, any probability space (?,?,?), |ГК К2, admits a lifting (Fremlin, 1989). Remark 3.2. A) Consistent (not necessarily strong) liftings for complete probability spaces always exist. Talagrand A982, 1988) introduced them, proved their existence and has shown that these objects give a simple and efficient way to deal with measurability problems for a class of stochastic process. On the other hand, any consistent lifting for an hyperstonian space is not strong, see Strauss et al. B002), where an interesting variation of the notion of consistent lifting, the so called admissibly generated, is extensively discussed. B) Burke A995) and Fremlin B00?) introduced the notion of lifting r respecting coordinates, for a product (X = f],e/ ^,·, ? = Xie/M;) as following: for every УС/ there is a lifting rj for х,еуд, such that r is an inverse lifting of rj under the canonical projection. Concerning topological probability spaces, is interesting, of course, the r to be strong too; this is the case for any translation invariant lifting for the usual product measure on {0, 1}" (this fact, due to Fremlin B00?), is the main step for showing that if the (?,, ?,) are Maharam-homogeneous spaces, then there is a lifting respecting coordinates for ?. The - not so important and without any difficulty - observation, at this point, is that a translation invariant lifting r for the Haar measure on any locally compact group G "respects coordinates", in the following sense: for every closed normal subgroup ? of G, there is some lifting r# for the Haar measure on G/H such that r is an inverse lifting of гн under the canonical projection. 3.2. On the existence of Baire strong liftings Throughout this subsection, the framework is that of ?-additive product measures. The first part deals with - more or less - known positive results on the existence of Baire
758 S. Grekas strong liftings. In the second part, the discussion is about one more existence theorem. The approach developed, essentially based on Assertion 1.4, provides a unified lifting-theoretic treatment for a class of ? -compact groups and for a class of products of separable metric spaces. One of the reasons which make the existence of a Baire strong lifting interesting is that such a lifting assures 'good' measurable sections for the continuous functions onto the underlying topological space; cf. Talagrand A978), also final remarks in Losert A980). The latter concerns the existence and, mainly, the non-existence of measurable selections; Losert, following a transfinite induction argument, concludes in: THEOREM 3.3 (CH). Let X, be compact metric spaces, ?, probability measures on X,- with supp ?, = X, (i e I) and assume that \l\ ^ Ni. Then there exists a Baire strong lifting /ог(8>,-е/Л,-. Grekas and Gryllakis A992), using Losert's arguments and additionally some characteristics properties of completion regular measures, saw that the conclusion of Theorem 3.3 remains valid, if X, is a product of separable metric spaces, w(X,) ^ с and ?; completion regular (/' el). Note that, one of these properties is Assertion 1.4; ideas in their proof suggested to the author to introduce the following concept (Grekas, 1992b). Let (У, y) be a ?-additive probability space. Consider the condition: (O2) for every ?-additive probability space (?,?) and every open set U in ? ? ?, there are a separable metric space ? and a continuous open surjection ? : ? —> ? such that U ~ p_i pU, where p:XxY->XxZis defined by p(x,y) = (?,?(?)). (?, у) - or just у - is called an Ch-space, if it satisfies (??). Clearly, every Cb-space (satisfies Assertion 1.4 and) is completion regular; the converse is also true, concerning measures on products of separable metric spaces. Also, the Haar measure on any compact group satisfies {От) (Grekas, 1992b, Propositions 1.3 and 1.4). At this point, some comments are necessary. The structure theorem of Kakutani and Kodaira (Choksi, 1984, p. 84) made easy the proof of the latter. If, now, 'Haar measure on a compact group' is - in the language of Section 3 - the (compact group) analogue of 'product measure on a product of compact metric spaces', then Proposition 1.4 in Grekas A992b) is the analogue of Theorem 1.6 of Choksi and Fremlin. Another analogue (of Losert's theorem) about liftings, is the fact, remarked in Grekas and Gryllakis A994), that (subject to CH) the Haar measure on any compact group of weight ^ N2 admits a strong Baire lifting (this lifting, of course, cannot be translation invariant, see, e.g., Burke A993a)). It is obvious that the existence of a lifting 'compatible" with the structure of a product ?-algebra, can be examined in many ways. Concerning complete, e.g., measure spaces, the construction of a consistent lifting is always possible (Remarks 3.2). Although, concerning non-complete ?-algebras, the problem seems to be very fine and remains open (e.g., problem (F)). If now the requirements become weaker: the compatibility to refer to the one out of the two factors, that is 'the product lifting' (we wish to construct) extends a given lifting for the one factor, then (we saw) many positive results, although concerning non-complete spaces (especially measures on non metrizable topological spaces), the study seems to be more difficult.
On products of topological measure spaces 759 The investigation which follows, motivated by Losert's result, studies the construction of inverse Baire strong liftings for products. The relative results show that the class of (?2-spaces is a convenient setting for such an investigation. LEMMA 3.4 (CH). Let (У, v) be a completion regular probability space, with supp ? = ? and w(Y) ^ с If (?, ?) satisfies Assertion 1.4, then there is a Baire strong lifting for v. PROOF. Cf. Remark 3 in Grekas and Gryllakis A994). D Let now ? and ? be strictly positive measures on X and ? respectively, with w(X) ? ? ?, w(Y) ^ ? ?. The next lemma is based on techniques, essentially, due to Losert. LEMMA 3.5. If both ?, ? satisfy (СЬ) and if ? is a Baire strong lifting for ?, then there is an inverse Baire strong lifting under the canonical projection, for ? <g> v. PROOF. This is made in two steps. A) Let W = ? ? ?, ? = ? ? ? and ? be the subalgebra of By (W), consisting of those subsets of W which are of the form D ? ?, for some D с X. For A e B>.(W) define d(A) :-\J[rBr\U: ???, ?/openinW, k(BDU\A) = 0}. Since any open set U in W is ?-equivalent to p~' pU - where ? is as in (От) - it is easy to check that there are countably many choices for U and so d(A) is a Baire set. Furthermore, d has the properties (P) in Losert A980, p. 157). B) The second step is Lemma 3 in Losert A980). Therefore, exists a Baire strong lifting ? for ? such that d(A) с ? (А) с (d(Ac)Y for all A eB,,(W). ? is the required one. D Consider now a family (?,-,?,-), ? e /, of supporting СЬ-spaces, ? = Y\j€lMi, ? = 0/e/ ?/. The proof of the following result is similar to Grekas and Gryllakis A992, Lemma 4.6). LEMMA 3.6. Let {/„}„ек be an increasing sequence of subsets of ?, ? the subalgebra of BK{M) consisting of those subsets of ? which depend on I„ for some n. Assume that r is a strong Baire lifting for ?. For A e B,.(M) define d(A) := \J{rB f)U: В e ?, U open in ?, ?(? П U\A) = 0}. Then d has the properties (P) in Losert A980). THEOREM 3.7 (CH). Ifw(Mj) ^ c, i e /, and if a = |/| iC c+, then there exists a Baire strong lifting for (?,?). PROOF. There is nothing else to do, except almost to repeat arguments in Losert's paper. Here follows an outline. There is a directed set ? = {Fj·, j e J} of finite sub-products, of cardinal a such that projy lim Fj = M. Enumerating ? as [F^, ? <a} and taking Hq = Fq, Hy = Hs ? Fy if ? = ? + 1 (for some ? < a) and Hy = proj5<), lim Hs otherwise, we define, by transfinite induction, a family (??)?<? of sub-products of ?, e/??, such that ? =projylim Hy.
760 S. Grekas For ? < a let ?? be the sub-algebra of B,.(M) of those sets which are the form pr~' (D) with ? < ?, where pr^ : ? -> ? ? is the canonical projection. The lifting pY for i2y if defined by induction on ? such that ??/?? = ?? for /3 < у. If ? is a limit ordinal with uncountable cofinality, then the definition of pY follows by the fact that ?? = 1]?<? ? ?. In the other cases, the lifting py is defined with the help of Lemmata 3.5 and 3.6. D The question that, naturally, arises is how much is the Ch-spaces class wider than the one of completion regular probability measures on products of separable metric spaces? The СЬ-condition leads our thought to an already know topological spaces class: the class of inverse limits of separable metric spaces; although, such kind of spaces have been studied in depth, during the last 20 years' period of time by he Russian school of topology, a great part of this literature has not been translated in English. More terminology can be fixed: let ? = {Xa. a e A} be an inverse, or projective, system ('spectrum', in the Russian language) of topological spaces, where A is some directed set, ??? ¦' ?? -*¦ ??, с* < /3, are the bonding maps and pa : X -> Xa (a ? A) the (canonical) projections (we assume throughout that the projections are surjections). From the fact that a product space is the limit of the family of its countable sub-products, it follows that in this case (S) for every increasing sequence {a,,}„et; in A, there exists a e A such that Xa = lim Xan. This means that the spectrum is, roughly speaking, closed under countable sub-limits. We generalize, in some sense, this property, by saying that (any spectrum) ? is ?-closed if it satisfies (S) and the bonding maps are open. If ? is ?-closed, then a subset В of X = lim Xa {? 0) is said to be determined by countable many coordinates if there is a e A such that В = p~\ paB; then, for simplicity, we say that В depends on a (the terminology has been created just in needs of the text). The next auxiliary result is an easy application of Scepin A981, Proposition 1.8) (cf. Tkacenko A981, Lemma 12) or Grekas and Karolis A999, Theorem 2.1)). PROPERTY 3.8. If the limit space X of a ?-closed spectrum is ccc, then every Baire set in X is determined by countable many coordinates. The following examples of inverse limits of ?-closed spectra are of measure-theoretic interest: • Every product of separable metric spaces (the product space is the limit of its countable sub-products). • A locally compact, ?-compact group (the spectrum is that of metrizable subgroups described in Choksi A984, p. 84). • A (Baire subset of a) Dugundji space of weight ^ ?\ (the spectrum is, here, the corresponding "Haydon system"; see Uspenskii A988), cf. Shapiro A985)). Of course, a Dugundji space is dyadic and a ?-compact group is quasi-dyadic (Fremlin and Wajch, 1993). The next discussion, which is adapted from Engelking A997), shows that the limit of a ?-closed system of compact metric spaces need not be dyadic. Remark 3.9. For any compact (Hausdorff) space Z, 2Z denotes the space of its compact subsets with the Vietoris topology (Engelking, 1997, 2.7.20, 3.12.26). Consider
On products of topological measure spaces 761 the spaces D = {0, 1}, ? = D"'-, X = 2Z. Since ? = lim^ Z„, where {Z„} is the system of countable sub-produces of D"-\ we must have: X = lim^ Xa, where X„ = 2X". Then, it is immediate that {Xa} satisfies (S); moreover, its projections are open surjections. Note that the compact space X is not dyadic (Shapiro, 1976). On the other hand, since there are quasi-dyadic compact spaces which are not dyadic, it is natural to ask whether this space is quasi-dyadic. The following proposition shows that the completion regularity in ?-closed spectra setting is a productive property. The condition that the weight of the one factor is considered as "relatively low", simply means that the author could not find the way to prove it without this term. PROPOSITION 3.10. Let ? = {Ya,a e E} be a ?-closed spectrum of separable metric spaces and ? a completion regular probability measure on ? = lim^ Ya (ф И). // w(Y) ^ co+, then (Y, y) is an Oi-space. PROOF. It combines ideas in Fremlin and Grekas A995) with techniques used in Grekas A992c). Summary steps are sketched as follows: A) Construct (by induction on ? < ?\) a sub-system ?' = {??. ? < ?\} of ? with lim ZY = ?. If (?, ?) is a topological probability space and ? = ? <g> у, then it is not so hard the СЬ-condition to be seen as a consequence of the assertion. For any ?-measurable set A, there is ? < ?\ such that ?(? П (U ? ?^] psV)) = 0 for all couples of sets (t/, V) where U is open in X, V is open in У, ?(? ? (U ? ?)) > 0 and ?(? ? (U ? V)) = 0. B) Supposing, if possible, that the assertion is not valid for some closed and ?- supporting set A, we find (by transfinite induction) two families UY, VY, ? < ?\ where each VY is an open set in У, depending on ? + 1 and each UY is an open set in X, such that A f) (Uy ? VY) = fi and ?(? ? (UY ? ?~] ?? VY)) > 0, ? > ?,. A) Nevertheless, A) combined with techniques which have been developed in Grekas A992c) leads to contradiction. (Note that Grekas A992c) deals with compact groups; the arguments, however - which are there presented in full details - can also be used in any limit space like ?.) ? Remarks 3.11. A) The above mentioned lead to the following conclusions, about the product (?, ?) of family (?,, ?,), i e /, of supporting, completion regular probability spaces, where each Xi is the limit of some ?-closed spectrum of separable metric spaces and w(Xi) ???+: (C|) (?, ?) is completion regular; (Ci) [CH] If |/| ^ c + , then there is Baire strong lifting for ?. In the case when the X,- are locally compact, ?-compact groups, (C2) is the ?-compact group analogue of a result about product of separable metric spaces; see Grekas and Gryllakis A992, Remark 5.3). B) Concerning compact spaces, the things lead to known situations. If, for instance, in Proposition 3.10, ? is compact, then ? is Dugundji, hence dyadic and so
762 S. Grekas Theorem 1.11 yields a different and direct proof of the proposition. The same would happen if ? was, just, quasi-dyadic (it is not clear, for the author, if the limit of a ?-closed system of separable metric spaces is quasi-dyadic). Acknowledgements I would like to thank the editor of the Handbook E. Pap of his kind support and understanding; J.R. Choksi for precious information about homeomorphic measures; N.D. Macheras, K. Musiat, W. Strauss for sending a copy of their work and S. Mercourakis for his valuable suggestions. References Bourbaki, N. A965), Integration. Chapitres I-IV, Actualites Sci. Indust., Vol. 1175, Hermann, Paris. Burke, M.R. A993a), Liftings and the property of Baire in locally compact groups, Proc. Amer. Math. Soc. 117, 1075-1082. Burke, M.R. A993b), Liftings for Lebesgue measure, Israel J. Math.. Conference Proc., Vol. 6. Burke, M.R. A995), Consistent liftings. Preprint. Burke, M.R. and Just, W. A991), Liftings for Haar measure on @. 1}A', Israel J. Math. 73, 33^4. Caratheodory, С A918), Vorlesungen iiber reelle Funktionen. Teubner. Leipzig. Berlin. Choban, M.M. A984), Baire isomorphisms and Baire topologies. Solution of a problem of Comfort, Soviet Math. Dokl. 30, 780-784. Choksi, J.R. A972), Automorphisms of Baire measures on generalized cubes I, Z. Wahrsch. 22, 195-204. //, ibid. 23,97-102. Choksi, J.R. A973), Measurable transformations on compact groups, Trans. Amer. Math. Soc. 184, 101-124. Choksi, J.R. A984), Recent developments arising out of Kakutani's work on completion regularity of measures, Contemp. Math., Vol. 26, Amer. Math. Soc., Providence, RI, 81-94. Choksi, J.R., Eigen, S.J., Oxtoby, J.C. and Prasad, VS. A987), The work of Dorothy Maharam on measure theory, ergodic theory and category algebras, Contemp. Math., Vol. 94, Amer. Math. Soc., Providence, RI, 57-71. Choksi, J.R. and Fremlin, D.H. A979), Completion regular measures on product spaces, Math. Ann. 241, 113- 128. Choksi, J.R. and Simha, R.R. A978), Measurable transformations on homogeneous spaces. Studies in Probability and Ergodic Theory, Adv. in Math. Suppl. Stud., Vol. 2, 269-286. Comfort, W.W., Hofmann, K.H. and Remus, D. A992), Topological groups and semigroups. Recent Progress in General Topology, M. Husek, J. Van Mill, eds, Elsevier. Engelking, R. A997), General Topology, PWN, Warszava. Erdos, P. and Oxtoby, J.C. A955), Partitions of the plane into sets having positive measure in even- non-null measurable product set. Trans. Amer. Math. Soc. 79, 91-102. Fremlin, D.H. A976), Products of Radon measures: a counter example, Canad. Math. Bull. 19, 285-289. Fremlin, D.H. A982), Quasi-Radon measure spaces, Note of 2.6.82. Fremlin, D.H. A983), Stable sets of measurable functions. Note of 17 May. Fremlin, D.H. A984), Consequences of Martin's Axiom, Cambridge Univ. Press. Fremlin, D.H. A989), Measure algebras. Handbook of Boolean Algebra, J. Monk, ed., North-Holland, Amsterdam. Fremlin, D.H. A993), Problems, 9 March. Fremlin, D.H. B000), Sets determined by few coordinates. Preprint. Fremlin, D.H. B00?), Measure Theory, to appear. Fremlin, D.H. and Grekas, S. A995), Products of completion regular measures. Fund. Math. 147, 27-37.
On products of topological measure spaces 763 Fremlin, D.H., Johnson, R.A. and Wajch, E. A996), Countable network weight and multiplication of Borel sets, Proc. Amer. Math. Soc. 124, 2897-2903. Fremlin, D.H. and Wajch, E. A993), Quasi-dyadic spaces. Version of 25.10.93. Preprint. Furstenberg, H. A963), The structure of distal flows, Amer. J. Math. 85, 477-515. Gardner, R.J. A975), The regularity of Borel measures and Borel measure compactness, Proc. London Math. Soc. 30,95-113. Gardner, R.J. A981), 77k· regularity of Borel measures. Oberwolfach 1981. Lecture Notes in Math.. Vol. 945, Springer, Berlin. Gardner, R.J. and Pfeffer, W.F. A984), Borel measures. Handbook of Set-Theoretic Topology, K. Kunen and J.E. Vaughan, eds, North-Holland, Amsterdam. Godin, V. B000), Isomorphic measures and point realization of set tr/msformations on compact topological groups and compact transformation groups. These de projet. McGill University. Grekas, S. A987), Remarks on the strong lifting property for products. Israel J. Math. 58. 198-204. Grekas, S. A992a), Isomorphic measures on compact groups. Math. Proc. Cambridge Philos. Soc. 112. 349-360. Grekas, S. A992b), Onproducts of completion regular measures, J. Math. Anal. Appl. 171. 101-110. Grekas, S. A992c), Note of April. Grekas, S. A994), Structural properties of compact groups with measure-theoretic applications, Israel J. Math. 87, 89-95. Grekas, S. A995), Measure-theoretic problems in topological dynamics. J. Anal. Math. 65, 207-220. Grekas, S. and Gryllakis, C. A991), Completion regular measures on product spaces with application to the existence ofBaire strong liftings. Illinois J. Math. 35, 260-268. Grekas, S. and Gryllakis, C. A992), Measures on product spaces and the existence of strong Baire liftings, Monatsh. Math. 114, 63-76. Grekas, S. and Gryllakis, С A994), A remark on a theorem ofLosert, Monatsh. Math. 117, 95-102. Grekas, S. and Karolis, P. A999), Notes on projective limits of separable metric spaces. Preprint. Grekas, S. and Mercourakis, S. A998), On the measure theoretic structure of compact groups, Trans. Amer. Math. Soc. 350, 2779-2796. Grekas, S. and Mercourakis, S. B002), Homeomorphic measures on compact connected groups, in preparation. Gryllakis, C. A988), Products of completion regular measures. Proc. Amer. Math. Soc. 103, 563-568. Gryllakis, C. A989), Homeomorphic measures on products of intervals. Math. Ann. 284, 377-389. Gryllakis, С and Grekas, S. A999), Onproducts of Radon measures, Fund. Math. 159, 71-84. Gryllakis, C. and Koumoullis, G. A990), Completion regularity and ?-additivity of measures on product spaces. Compositio Math. 73, 329-344. Halmos, P. A950), Measure Theory, Van Nostrand. Products of completion regular measures. Proc. Amer. Math. Soc. 103, 563-568. Hewitt, E. and Ross, K. A963), Abstract Harmonic Analysis. Vol. I, Springer. Berlin. Ionescu Tulcea, A. and Ionescu Tulcea. С A969). Topics in the Theory of Liftings, Ergebn. Math. Grenzgeb. B. 48, Springer, Berlin. Kakutani, S. A943a), Notes on infinite product measure spaces I, Proc. Imper. Acad. Tokyo 19, 148-151. Kakutani, S. A943b), Notes on infinite product measure spaces II, Proc. Imper. Acad. Tokyo 19, 184-188. Kakutani, S. and Kodaira, ?. (?944), Uber das Haarsche Mass in der local bikompacten Gruppe, Proc. Imper. Acad. Tokyo 20,444-450. Kirk, R.B. A969), Locally compact, B-compact spaces. Indag. Math. 31, 333-344. Knowles, J.D. A967), Measures on topological spaces. Proc. London Math. Soc. 17, 139-156. Kupka, J. A983), Strong liftings with application to measurable cross sections in locally compact groups, Israel J. Math. 44, 243-261. Kuz'minov, V. A959), On a hypothesis of PS. Alexandroff in the theory of topological groups, Dokl. Akad. Nauk SSSR 125, 727-729 (in Russian). Losert, V. A980), A counterexample on measurable selections and strong lifting. Measure Theory, Oberwolfach 1980, Lecture Notes in Math., Vol. 794, Springer, Berlin. Macheras, N.D. and Strauss, W. A994), On strong liftings for projective limits. Fund. Math. 144, 209-229. Macheras, N.D. and Strauss, W. A996), On products of almost strong liftings, J. Austral. Math. Soc. Ser. A 60, 311-333.
764 S. Grekas Maharam, D. A942), On homogeneous measure algebras, Proc. Nat. Acad. Sci. USA 28, 108-111. Maharam, D. A958a), Automorphisms of products of measure spaces. Proc. Amer. Math. Soc. 9. 702-707. Maharam, D. A958b), On two theorems of Jessen, Proc. Amer. Math. Soc. 9, 995-999. Maharam, D. A958c), On a theorem of von Neumann, Proc. Amer. Math. Soc. 9, 987-994. Maharam, D. A979), Realizing automorphisms of category algebras of product spaces. Gen. Topol. Appl. 10, 161-174. Mercourakis, S. A996), Some remarks on countabh determined measures and uniform distribution of sequences, Monatsh. Math. 121,79-111. Montgomery, D. and Zippin, L. A955), Topological Transforation Groups, Interscience, New York. Morris, A.S. and Peck, C.V. A983), A note on homeomorphic measures on topological groups, Proc. Edinburgh Math. Soc. 26, 169-171. Morris, A.S. and Peck, C.V. A984), On the homeomorphic measure property, Colloq. Math. 49 A), 5-10. Prasad, V.S. A981), A sur\-ey on homeomorphic measures. Measure Theory, Oberwolfach 1981, Lecture Notes in Math., Vol. 945, Springer, Berlin, 150-154. Price, J.? A977), Lie Groups and Compact Groups, Cambridge Univ. Press. Ressel, P. A977), Some continuity and measurability results on spaces of measures. Math. Scand. 40, 69-78. Scepin, E.V. A981), Functors and uncountable powers of compacta, Uspekhy Mat. Nauk 36, 3-62 (in Russian). Shapiro, L.B. A976), The space of closed subsets of D^2 is not a dyadic bicompact, Soviet Math. Dokl. 17, 937-941. Shapiro, L.B. A985), On Baire isomorphisms of spaces of uncountable weight, Soviet Math. Dokl. 32, 113-117. Shelah, S. A983), Lifting problem of the measure algebra, Israel J. Math. 45, 90-96. Shwartz, L. A973), Radon Measures on Arbitrary Topological Spaces and Cylindrical Measures, Oxford Univ. Press. Strauss, W., Macheras, N.D.and Musial, K. B002), Liftings, Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 1131-1184. Talagrand, M. A978), Non existance de certaines sections et applications a la theorie du relevement, C. R. Acad. Sci. Paris Ser. I Math. 286, 1183-1185. Talagrand, M. A982), Closed convex hull of set of measurable functions, Riemann-measurable functions and measurability of translations, Ann. Fourier Grenoble 32, 36-69. Talagrand, M. A984), Pettis Integral and Measure Theory, Mem. Amer. Math. Soc., No. 307. Talagrand, M. A987a), The Glivenko-Cantelli problem, Ann. Prob. 15, 837-870. Talagrand, M. A987b), A 'regular oscillation' property of stable sets of measurable functions. Contemp. Math., Vol. 94, Amer. Math. Soc, Providence. RI. 309-313. Talagrand, M. A988). On liftings and the regularization of stochastic process. Probab. Theory Related Fields 78, 127-134. Tkacenko, M.G. A981). Some results on inverse spectra I. II. Comment. Math. Univ. Carolin. 22. 621-633. 819-841. Uspenskii, V. V. A988), Why compact groups are dyadic'!. General Topology and its Relations to Modem Analysis and Algebra, Z. Frolik, ed., Proc. Sixth Prague Topological Symp., 1986. Heldermann. Varadarajan, V.S. A965), Measures on topological spaces. Amer. Math. Soc. Transl. Ser, Vol. 48. 161-228. Veech, W.A. A971), Some questions of uniform distribution. Ann. of Math. 94 B). 125-138.
CHAPTER 18 Perfect Measures and Related Topics Doraiswamy Ramachandran Department of Mathematics and Statistics, California State University, 6000 J Street, Sacramento. CA 95819-6051, USA E-mail: chandra@csiis.edu Contents Introduction 767 0. Preliminaries 767 1. Compact measures 769 2. Perfect measures 771 3. Measures on product spaces 773 4. The marginal problem 775 5. Monge-Kantorovich duality spaces 776 6. Regular conditional probabilities 778 7. Independence and Blackwell spaces 779 8. Standard measure spaces 780 9. Mixtures of perfect measures 781 10. Disintegrations 781 References 783 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 765
This page intentionally left blank
Perfect measures and related topics 767 Introduction The fascinating class of perfect probability spaces was introduced by Gnedenko and Kolmogorov A954) "to achieve complete harmony between the abstract theory of measures and the theory of measures in metric spaces." The two volume monograph of Ramachandran A979) brought together various developments involving perfect measures, in a unified manner, showing that perfect spaces are technically the most pleasing class of probability spaces. In this article we summarize the classical results and outline the progress involving perfect measures since the late seventies. While the references contain an almost exhaustive list of relevant papers, we can only provide an overview of some of the major developments and key results. For proofs, additional details, and open problems not mentioned herein the reader should consult the references. 0. Preliminaries We follow the customary measure theoretic terminology and notation (see Billingsley A986), Halmos A950), Neveu A965)). All measures considered in this chapter are probabilities. The word function, when used without qualification, will always mean a real-valued function. Let С be a collection of subsets of a set X. By Cs. С'1, С" and C& we denote respectively the smallest class containing С and closed under finite unions, finite intersections, countable unions and countable intersections. By alg(C) and a(C) we denote respectively the algebra and the ?-algebra generated by C. a(C) will be called the ?-algebra generated by С and С will be called a generator of a(C). In case a ?-algebra A has a countable generator, A will be called a countably generated (henceforth abbreviated as e.g.) ?-algebra. A ?-algebra В С A will be called a sub ?-algebra of Л. А ?-algebra with a generator of cardinality к ^ Ki will be called an ? ?-generated ?-algebra. A measurable space is a pair (X. A) where X is a nonempty set and Л is a ?-algebra of subsets of X. If ? с X then ? ? ? = [А П ?: A e A} is a ?-algebra of subsets of ? and is called the trace of A on Y. If A is generated by С then ? ? ? is generated by С П ?. If ? is a measure on (X, A) (recall that P(X) — 1 for all measures considered) such that P*(Y) > 0, where P* denotes the outer measure induced by P, then Py defined on (?,???) by ??(??)?) = ?*(??\?)/?*(?) is a probability on (У, А П Y) and is called the trace of ? on (?. ? ? ?). If (X, A) is a measurable space then a nonempty subset A e A is called an atom of A (or Д-atom) if В с А, ВеА => ?=???? = ?. A is said to be atomic if every nonempty set in Л is a union of atoms. Clearly any two distinct atoms are disjoint and the atoms of an atomic ?-algebra form a partition of X. Any
768 D. Ramachandran e.g. ?-algebra is atomic. A ?-algebra A is called separable if it is e.g. and contains all singletons. If Л is separable and if У с X, then ? ? У is separable. A (X, A) is separable if Л is a separable ?-algebra. Let ? be a probability on (?, A). A set A e A is called a P-atom if P(A) > 0 and for every В е А, В с A either P(S) = 0 or P(B) = P(A). The measure ? is said to be nonatomic if there are no P-atoms. The measure ? is called discrete if there exists a sequence {A,,} of pairwise disjoint P-atoms such that ?^_| P(A„) = 1. A measure ? on (X, Л) where A contains all singletons is called continuous if P({.v}) = 0 for every ? e X. If A is separable then ? on (X, A) is nonatomic iff it is continuous. If Ao is an algebra of subsets of X then a set function ?? on (X, Ao) is called a charge if ?? is nonnegative, finitely additive and ??(?) = 1. The charge ?? is a measure if it is countably additive. Two measurable spaces (X,A) and (У, B) are said to be isomorphic if there is a 1-1 map / from X onto ? such that both / and /"' are measurable. Two probability spaces (X, A, P) and (Y,B,Q) are said to be isomorphic if there is an isomorphism / from (X,A) to (Y, B) such that Q = P/; they are said to be almost-isomorphic if there exists X\ eA,Y\ eB with P(X,) = Q(Y\) = 1 such that (X|, ? ? ?\, PXl) and (У|, В П У|, <2?) are isomorphic. We say that (Z,C, P) is a thick subspace of(Y,B, Q) and write (Z.C, P) с (У, ?, Q) if ? с ?,? П ? =C,Q*(Z) = I, and ? = Q*z If ? is a metric space then we denote by ?? the Borel ?-algebra of X, namely the ?-algebra generated by the open subsets of X. Any subset of the real line ?)? will be given the usual metric when considered as a metric space. The Borel ?-algebra of any separable metric space is separable. If X is a metric space then a measure ? on (?, ?), where ? D ??, is called outer regular if for every В eB P(B)=inf{P(U). i/open in X, Set/} and ? is called inner regular if for every В e В P(B)=sup{P(K). A: compact, К С В). Any measure on the Borel subsets of a metric space is outer regular. A Polish space is a complete, separable metric space. A measurable space (X, A) is called a standard Borel space if (X, A) is isomorphic to (У,В) where У is a Borel subset of a Polish space and ? is the Borel ?-algebra of У. If (X, Л) and (Y,B) are standard Borel spaces and if / is a 1-1 measurable map from X into У then the forward images of Borel sets under / are measurable; in particular, /(X) e В and / is an isomorphism of (X, A) onto the standard Borel space (/(?),? П /(X)) (see Kuratowski A966, Theorem 1, p. 489)). Let X be a Polish space and let В be the ?-algebra of the Borel subsets of X. A subset У of X is called analytic if it is a continuous image of the irrationals. The complement of an analytic subset of X is called a coanalytic set. If У is a an analytic (respectively coanalytic) subset of X then (У, В П У) as well as any measurable space isomorphic to
Perfect measures and related topics 769 it will be called an analytic space (respectively coanalytic space). For details concerning analytic and coanalytic sets we refer the reader to Kuratowski A966), Bhaskara Rao and B.Y Rao A981) and Cohn A980). For the definition and properties of the Lebesgue measure, to be denoted by ? on the real line and on the unit interval we refer the reader to Section 3 of Billingsley A986) or Section 5 of Halmos A950). The Axiom of Choice will be assumed throughout. We will specify whenever we assume the Continuum Hypothesis and denote it by (CH). 1. Compact measures Probability measures on the Borel subsets of a Polish space are well-behaved. They are, for instance, inner regular, i.e., approximable from inside by compact sets (see Neveu A965, p. 64)). But, in a general probability space, to start with we do not have a notion of compactness. In this section we discuss the notion of compact classes of sets and of compact measures, both due to Marczewski. We derive basic properties of compact measures. A collection С of subsets of a set X is called a compact class if for every sequence (Ci)cC the following condition holds: P| Ck? 0 for every ? > 1 => f|Ct/H. k=\ k=\ Recall that a sequence {A„, ? > 1} of subsets of X is said to possess the finite intersection property if, for every и > 1, ?"=? Ak ? 0- Thus a collection С of subsets of X is a compact class if every sequence of sets from С having the finite intersection property has nonempty intersection. The collection of compact subsets of any topological space is a compact class. Every subcollection of a compact class is compact. If С is a compact class then so are Cs and С. Let ?? be a charge on an algebra Ao of subsets of a set X. We say that a class S of subsets of X ^-approximates Aq if for every ? e Ao and for every ? > 0 there exists a set S? eS and a set Ee e Ao such that Ee С Se С ? and ??(? — ?[)<?. Obviously, a class ? С ^-approximates Л) iff for every ? e Ao and for every ? > 0 there exists a set FE e ? such that Fe С ? and ??(? - Fe) < ?. PROPOSITION 1.1. If ? is a measure on (X, A) where A is ? ?-algebra and if ? ?-??- proximates A then T& П A also ?-approximates A-
770 D. Ramachandran Marczewski A953) defined a charge ?? on (?,??) where Ao is an algebra to be compact if there exists a compact class С which ??-approximates Ao and showed that every compact charge is indeed a measure, that is, it is countably additive; hence such a compact charge ?? has a unique extension as a measure to ?(??)- The class of compact measures is sufficiently wide. With the usual compact sets forming an approximating class, we have CI. Every measure on a standard Borel space is a compact measure. PROPOSITION 1.2. Suppose ?? is a compact measure on (X. Ao) where Aq is an algebra. Then ?, the extension of ?? to ?(.4?) is a compact measure; further, there is a compact class contained in ?(??) which ^-approximates ?(.4?). In the above setting, Cs П ?(??) is a compact class ?-approximating ?(??)- C2. If ? is a compact measure on a ?-algebra A then there is a compact class К, С А which ?-approximates A. It is not known whether a similar property holds for compact measures on algebras: if ?? is a compact measure on an algebra Ao, can one find a compact class 1С С Ao which ??-approximates Ao"! PROPOSITION 1.3. If ? is a nonatomic measure on an algebra Ao, then every ? e Ao with ??(?) > 0 contains a set N of cardinality с such that ?*(?) = 0. Sierpinski A956, p. 81) has proved using (CH) that there is an uncountable subset D С [0,1] such that every subset of D which has Lebesgue measure zero is at most countable; ?? on (D, B\o, \ j П D) is therefore noncompact. For further results on Sierpinski sets and related results see Miller A984), Shortt A986), Jasiriski and Weiss A991). The following properties of compact measures are useful. For their proofs refer to Ramachandran A979): C3. Every discrete measure is compact. C4. If ? on (X, A) is compact and if ? e A with P(A) > 0 then ?? on (?. ? ? ?) is compact. C5. If ? on (X, A) is compact and Q on (X, A) is such that Q <?. ? then Q is compact. C6. ? on(X, A) is compact iff Pon(X. A) is compact where (X, A. P) is the completion ??(?,?,?). C7. Let (Z,C,P) с (Y,B.Q), i.e., (Z,C. P) is a thick subspace of (Y.B,Q). If ? is compact, then Q is compact. C8. If (X, A) and (Y, B) are two measurable spaces and /: X -> У is an A. ?-measurable map. If ? is a compact measure on (Y.A), then P/~' is compact on (Y.B).
Perfect measures and related topics 771 For a characterization of standard Borel spaces using the notion of compact classes see Bjornsson A980). We will specify additional properties of compact measures in the sequel. 2. Perfect measures Perfect measures are more general than compact measures and the two notions coincide on e.g. ?-algebras. Definition 2.1. A measure ? on (?. A) is perfect (equivalently, the probability space (?, A, P) is perfect) if, for every Д-measurable function / on X, and for every subset С of the real line for which /_1C e A there is a Borel set В с С such that P(f~l В) = P(/-'C). The following fact is useful in verifying perfectness of a measure. PI. A measure ? on (X, A) is perfect iff for every Д-measurable function / on X there is a Borel set Bf с f(X) such that P(/-'B/) = 1. Ryll-Nardzewski A953) defined ? on (X. A) to be quasi-compact if, for every sequence {An, ? ^ 1} с A and for every ? > o, there exists Ae e A with P(Af) > 1 - ? such that {An П Ae, ? ^ 1} is a compact class. Every compact measure is quasi-compact; further, quasi-compactness and perfectness are equivalent. Compactness, perfectness and quasi- compactness are all equivalent on countably generated ?-algebras. Ryll-Nardzewski's notion brings perfectness as an intrinsic property of the probability space (?,?,?) while the notion of perfectness as in Definition 2.1 and PI involves real-valued measurable functions on (X,A). The basic properties of perfect measures are: P2. Every compact measure is perfect. P2 shows that the class of perfect measures is rich. Consider the Lebesgue measure ? on ([0, l],?[o.ij).LetM c[0, 1] be such that ?*(?) = 1 and ?*(?) = 0. Since ?*(?) =0, taking / to be the identity function and using PI, we see that ?? is nonperfect (and hence noncompact) measure on (M, #[o, ij П M). Musiat A976) has constructed a perfect measure which is not compact with the underlying ?-algebra being ? ? -generated, thereby showing that the converse of P2 is false. P3. The restriction to any sub ?-algebra of a perfect measure is perfect. The corresponding result for compact measures is C9. The restriction to any sub ?-algebra of a compact measure is compact. This was established by Pachl A978) using the notion of disintegrations which we will discuss later. P4. A measure on (X, A) is perfect iff its restriction to every e.g. sub ?-algebra is perfect. P5. If ? on (X, A) is perfect then, for every ? e A with P(Y) > 0, ?? on (?, ? ? ?) is perfect.
772 D. Ramachandran P6. (?, A, P) is perfect iff (X, A, P) is perfect. P7. Every discrete measure is perfect. P8. Let (Z,C, P) с (?,В, Q), i.e., (?,С, ?) is a thick subspace of (Y,B, Q). If ? is perfect, then Q is perfect. P9. Let X с ?)?. Every measure on (X, #;)( П X) is perfect iff X is universally measurable. THEOREM 2.1. Lef (Х,Л) foe a measurable space. Then the following statements are equivalent: (a) For every sub ?-algebra С С A, every measure on (X, C) is perfect. (b) For every e.g. sub ?-algebra V С A every measure on (X, T>) is perfect. (c) The range of every real-valued ?-measurable function f is universally measurable. Kallianpur A959) introduced the notion of D-space as a separable space in which the range of every measurable function is a universally measurable set and showed (b) => (c) part of Theorem 2.1. Sazonov A962) independently showed (b) => (c) => (a). Blackwell A956) started the study of connections between perfectness and analytic subsets of the real line. Darst A969, 1971) constructed a D-space which is not analytic and a universally measurable space which is not a D-space. Additional results in this direction can be found in the above references and in Shortt A984). If X is a separable metric space, ?? its Borel ?-algebra, and ?? is the class of compact subsets of X. Then a measure ? on (X,A) where Ad ?? is perfect iff ? is inner regular, that is, C\ P-approximates A. If, further, X is a Polish space, then a measure on (X, A) where A D ?? is perfect iff ?? P-approximates A; according to Gnedenko and Kolmogorov this property confirms the reasonableness of the definition of a perfect measure. Pachl A979) has extended these properties to metric 2D-spaces (spaces no closed discrete subspace of which has a 2-measurable cardinal). THEOREM 2.2. Let ? be a measure on the Borel ?-algebra of a metric ID-space. Then the following are equivalent: (i) ? is inner regular, i.e., the collection of topological compact sets ?-approximates the Borel ?-algebra. (ii) ? is compact. (iii) ? is perfect. The above result can also be found in Koumoullis A981). For results on general (not necessarily finite) perfect measures on topological spaces, the reader is referred to Koumoullis A981) and the survey article of Gardner A982). The notion of weakly compact measures (see Erochin A961)) is shown to be equivalent to compact measures by Pachl A975). Mallory A975) defined a collection K, of subsets of X to be a monocompact class if for every decreasing sequence {K„, ? ^ 1} С /С such that ?,??? Kn = 0, there is a natural number N such that K,\ = 0. If ? is a measure on (X, A) and the monocompact class /C P-approximates A, then we say that ? is a monocompact
Perfect measures and related topics 11Ъ measure. The notion of a monocompact class is weaker than that of a compact class. Tops0e A979) showed that every monocompact measure is perfect. Recently, Dekiert (nee Remy) A993) has shown that for K|-generated ?-algebras compactness and monocompactness are equivalent. Since Musiat's example of a noncompact perfect measure is defined on ? ? - generated ?-algebra, it follows that there is a perfect measure which is not monocompact. Related open questions are raised in Dekiert A993). A measure ? on (X, A) is called purely noncompact (respectively purely nonperfect) if, for every С e A with P(C) > 0. Pc is noncompact (respectively nonperfect). Remy A988) has established the following results: (i) Every probability ? on (X, A) has a unique representation as ? = a \ P\ + <*? Pi + ?^??, where (a) a, ^ 0, a \ + ai + aj = 1; (b) P\ is compact; (c) Pi is perfect and purely noncompact, and (d) P?, is purely nonperfect. (ii) If {P„, n^ 1} is a sequence of measures on (X.A) and ? = ??= ? a» ?" wnere a„ ^ 0 for all ? > 1 with ??= ? ?» = ' <tnen ? is Perfect iff Pn is perfect for every и ^ 1. 'Perfect' can be replaced by 'compact', 'purely nonperfect,' and 'purely noncompact' in the assertion, (iii) Let.M be the set of all probabilities on (X.A) and let M,. Mp,Mpnc and Mpnp represent the set of all compact, perfect, purely noncompact and purely nonperfect measures. Then Mr С Мр С ? and Mpnp С MP„r С М. Each is a convex set and the set of extreme points of each is the set of 0-1 valued measures. Since every 0-1 valued measure is compact (by C3), Mpnp and M,,,u- do not contain any extreme point. As a consequence of (iii) above, if {P„. ? ^ 1} is a sequence of compact measures on (X, A), then there is a common compact class /С с A such that /C P„-approximates A for each ? ^ 1 (by taking /C to be a compact class P-approximating A for ? = ?^=\ 2" Pn)- Aniszczyk A986) has constructed a family {Pa. a e /} of compact measures on a measurable space (X, A) for which there is no such common compact class K,/^-approximating A for each a e /, answering a question raised by Pachl A979). 3. Measures on product spaces In this section we show that perfect measures provide a natural setting for the existence of product measures. Let {(?/, ?), ? e /} be a family of measurable spaces. Let Ao be the algebra of subsets of П/е/ %> generated by finite dimensional cylinders. Recall that ?(?))=®? i€l is the product ?-algebra. The classic example of Sparre-Anderson and Jessen A948) shows the existence of a charge ? on (["[,¦ e/ -??. Ao) such that each marginal P, = Pott, on (Xj,Ai), where ?,- is the canonical projection, is countably additive while ? is not countably additive. This phenomenon cannot occur if each marginal is perfect.
774 D. Ramachandran Theorem 3.1 (Marczewski A953);respectively,Ryll-Nardzewski A953)). Let {(?,, ?, i e /} be a family of measurable spaces and let ? be a charge on (П,е/ ?'?··??) such that every marginal Pi = ???^] on (Xj.Ai) is compact (respectively perfect). Then (i) ? is countably additive; and (ii) the unique extension of ? to ®i€! Д is compact (respectively perfect). Note that the Daniell-Kolmogorov consistency theorem for the existence of a product measure is a consequence of Theorem 3.1. In view of РЗ (С9) we have P10 (CIO). A measure on a product space is perfect (compact) iff each marginal is perfect (compact). By using a countable bilateral product of nonperfect spaces Nadkarni and Ramachandran A978) have answered a question of D. Maharam A978) concerning tail fields. Let (X, A) and (У, B) be two measurable spaces and let 11 = [A x В: A e А. В е В} be the semialgebra of measurable rectangles. Let ? be a charge on the semialgebra with ?-additive marginals ? ? ??* on (X, A) and ? ???] on (?, ?), respectively. The following theorem due to Marczewski and Ryll-Nardzewski A953) is important and has many useful applications. Due to its significance it should be included in texts on measure theory. THEOREM 3.2. A charge on the semialgebra of measurable rectangles of the pmduct of two measurable spaces is ?-additive if at least one of the marginals is perfect. An example of Ryll-Nardzewski in the same paper shows that the assumption of perfectness of at least one marginal is essential in Theorem 3.2 for a charge on the measurable rectangles to be countably additive. Ramachandran A996) has the following generalization of Theorem 3.2: THEOREM 3.3. Let {(X/, Д,), ;' e /} be a family of measurable spaces. If ? is a charge on the semialgebra TZ of measurable rectangles in ? e / %i w'tn countably additive marginals such that all but perhaps one of its marginals is perfect, then ? is countably additive. Let Qj be a measure on (X, C,), / = 1. 2. where each C, is an algebra. A measure Q on a/g({Ci,Ci}) is called a common extension of Q \, Qi if Q\c, = Q\ for/ = 1.2. A common extension Q is called a splicing if Q(C\ ПС:) = Q\(C\)Qi(Ci). For results on common extensions and splicings refer to Stroock A976). Kallianpur and Ramachandran A983), Schmidt and Waldschaks A991), Hackenbroch A992), Ramachandran A996). For projective limits of perfect measure spaces see Musiat A980). The following theorem of Pachl A979) gives two more characterizations of perfect measures. THEOREM 3.4. Let (X, A, P) be a probability space. Then the following are equivalent: (i) ? is perfect. (ii) If(Y,B, Q) is a probability space and ? is a charge on alg({A ? В: А е А. В е B\) with marginals ? and Q then ? is countably additive.
Perfect measures and related topics 775 (iii) If (Y, B, Q) is a probability space and ? is a measure on А® В with marginals ? and Q, then ?*(? ? F) = Q^F) for every F CY. An example due to Pachl A981) shows that condition of (ii) of Theorem 3.4 cannot be replaced by the following weaker condition: "Any charge ? on alg({A\ ? ??. A,- e A, i = 1,2}) with both marginals ? is countably additive." 4. The marginal problem The marginal problem of Strassen A965) can be formulated as: Let (X, A, P) and (У, B, Q) be two probability spaces. Given 5" с ? ? ? what are the conditions that would ensure the existence of a probability ? on (? ? ?, A <g> B) with marginals ? and Q such that ?*(S) = 1? If such a ? exists then we say that 5" is marginalizable. A general formulation of the problem and its importance in applications can be found in Hoffman-J0rgensen A987). Hansel and Troallic A978, 1986), Lembke A982), Kellerer A964a, 1964b) and Shortt A983) treat this problem under topological restrictions. Definition 4.1. Let S be a semialgebra of subsets of ?. A set S с ? is called S-en- closable if 5"* = (C\{E e a(S): ? D S}\ e S*s. In the context of a product space and the semialgebra TZ of measurable rectangles we shall simply use the term enclosable instead of TZ-enclosable. THEOREM 4.1. Let (X,- ,Ai,Pi), i = 1, 2, be two probability spaces where at least one of the measures P\ and P2 is perfect. A subset S С Х\ х Хг is marginalizable if (a) 5" и enclosable; and (b) {?? xX2)n5"C(X| xA2)nS^ Pi(Ai) <C P2(A2). Theorem 4.1 unifies and extends beyond the context of analytic and separable spaces the results contained in Theorem 1 of Shortt A983) and Proposition 3.8 of Kellerer A984). Recently Plebanek A989) has used the above approach together with a result concerning charges by Hansel and Troallic A986) to establish the following general solution of the marginal problem for two-dimensional products. Let (Xi,Ai, Pi), i = 1,2, be two probability spaces. For ? с ?? ? Хг define ?(?) = sup{P|(A|) + P2(A2): (?? ? А2)П? = 0}. THEOREM 4.2. Let (X,-, Ai, Pi), i = 1,2, be two probability spaces and let at least one of P\ and P2 be perfect. Let d ^ 0. If D e A\ <g> Аг is enclosable then the existence of a probability ? with marginals P\ and P2 such that P(D) > 1 — d is equivalent to the condition ?{?) ^ 1 +d.
776 D. Ramachandran In view of Definition 4.1, using Theorem 4.2, we can recast Theorem 4.1 as the following COROLLARY 4.1. Let I = {1,2} and let at least one of P\ and Рг be perfect. A subset S С ?? ? Хг is marginalizable if (a) 5" и enclosable; and (b) 4(S*) ^ 1. Let (Xi,Ai, Pi), i e /, be a family of probability spaces. Consider the product space (П,е/ ^?>?>?€/ -4')- Analogous to Theorem 4.1 we have THEOREM 4.3. A subset S С Пге/ %i IJ niarginalizable if the following conditions hold: (a) All but perhaps one of the Pi are perfect. (b) For each finite subset J С I and for every choice {gj, j e J] where gj on X j is Aj -measurable and bounded for each j e J jeJ jeJJxJ (c) 5" is enclosable. For additional interesting results in this direction see Plebanek A989), Ramachandran A996). 5. Monge-Kantorovich duality spaces The measure theoretic version of the classical transportation problem dating back to Monge A781) and developing into infinite-dimensional linear programming in the work of Kantorovich A940, 1942) concerns the validity of a duality theorem which is formulated below. General treatment of Monge-Kantorovich duality theorems can be found in Ruschendorf A981), Kellerer A984), Levin A984), Rachev and Ruschendorf A998), Ramachandran and Ruschendorf A995). They arise in the study of (i) probabilities with given marginals and given support, stochastic ordering (Strassen, 1965; Sudakov, 1975; Hoffmann-J0rgensen, 1987;Dall'Aglioetal., 1991;Ruschen- dorf et al., 1996; Benes and Stepan, 1997); (ii) probability metrics, central limit theorems and asymptotic analysis of algorithms (Rachev, 1991; Rachev and Ruschendorf, 1998); (iii) equlibria in assignment models in economics (Gretsky et al., 1992; Ramachandran and Ruschendorf, 1997); (iv) operator algebras (Arveson, 1974; Haydon and Shulman, 1996) and many others. Let M(P\, Рг) = {? on ?? <8> Аг: ? has marginals Pi and Рг) = ?1^,®^2(??, Рг). ?,¦: ?? ? Хг —> Xi denote the canonical projections for i = 1,2. The abbreviation ®g, is used for X]i=i gi ? ?,-. For a measurable space (X, A) the notation / e A indicates that / is a real-valued, bounded Л-measurable function on X.
Perfect measures and related topics 777 For h e ? ? <g> Аг, the marginal problem is concerned with S(ft) = sup| / ???: ? e ? (?,, ?2) = 5??^(?) UXixX2 J while the dual problem deals with /(ft) = inf V f ?,-dP/: A/er'^OandAicffiA, =/?^,(?). [i = \Jx< i I The transportation problem seeks the validity of the duality S(h) = I(h). (D) The main duality theorem of Kellerer A984) deals essentially with second countable or metrizable spaces X,¦, i = 1,2, with tight (or Radon) probabilities defined on the Borel sets in which case (D) is shown to hold for a suitably large class containing all the bounded, measurable functions. The following result of Ramachandran and Riischendorf A995) is the most general duality theorem of this type. THEOREM 5.1. If at least one of the underlying probability spaces is perfect then (D) holds for all h e A\ <g> Аг- The direct converse of Theorem 5.1 does not hold; that is, we can construct two nonperfect probability spaces in the product of which (D) holds for all h e A\ <g> Аг (see Example 1 in Ramachandran and Riischendorf A996). In order to study probability spaces which ensure the validity of the general duality theorem we introduce: Definition 5.1. A probability space (X\, A\, Pi) is called a duality space if, for every (X2, Аг, Рг), the duality (D) holds for all heA\® Аг- A natural strengthening of the notion of a duality space arises if we postulate additionally that for any sub ?-algebra Сг С Аг such that h e ? ? <g> Сг the optimal value .^д, ®_д2 (h) = ¦SUiS^CO (°r equivalently, /а,®.42(А) = АД|®С2(^))' i-e-. the optimal value remains the same with refinements on the second space as long as the measurability conditions on h are fulfilled. In other words the value of the transportation problem remains the same for only 'technically' different formulations of the problem. Ramachandran and Riischendorf A995) have shown that perfect spaces have this property which motivates the Definition 5.2. A probability space (X\, A\, P\) is called a strong duality space if (i) it is a duality space; and (ii) for every sub ?-algebra Сг С Аг and for every h e A\ <g> Сг the condition U^A2(h) = Ui®C2(h) (SD) holds.
778 D. Ramachandran Perfect spaces have the following extension property for measures (see Ramachandran A996, Theorem 9)) which arises naturally in the marginal problem. Definition 5.3. We say that (X\,A\,P\) has the extension property (respectively charge extension property) if for every (X2, Ai, Pi) and for every sub ?-algebra Ci С ??, if ? e M(P[, Pi\c2) then ? /~? e ??(?\, Pi) (respectively ? is a charge on A\ <8>Ci with marginals Pi and P\c2)< i-e> ? extends to a measure (respectively charge) ? on A\ <g> Ai with marginals Pi and P2. It is well known that the projection of a Borel set in the product of standard Borel spaces is an analytic set and therefore universally measurable (see Hoffmann-j0rgensen A970), Cohn A980)). This property is useful in descriptive set theory. There is no measure theoretic analogue for the projection of a measurable set in the product of two probability spaces. The following gives a suitable measure theoretic definition of the projection property. Definition 5.4. A probability space (X\,A\, P[) is said to have the projection property if for every (?? ? ??,?? ®Ai) and for every С e A\ ®Ai there exists ? ? = ? ? (С) е А\ with />i(Ai) = 1 such that щ{С ? (?? ? ?2)) еА22 ¦ Ramachandran and Riischendorf B000) have characterized strong duality spaces in THEOREM 5.2 (Equivalence Theorem). Let (X\,A\, P\) be a probability space. Thenthe following are equivalent: (a) (X1, A1, P\) is a strong duality space; (b) (X\,A\,P\)isperfect; (c) (X1, ? ?, Pi) has the extension property; (d) (X1, A\, P[) has the projection property. (e) (X1, ? ?, Pi) has the charge extension property. The equivalence of the notions of strong duality and perfectness enables us to obtain several new properties based on PI to P10 that strong duality spaces inherit from being perfect spaces. These properties are not easily established by direct arguments. 6. Regular conditional probabilities Let (X, A, P) be a probability space and let ? be a sub ?-algebra of A. A function ?(?. A) defined on (X, A) is called a regular conditional probability (r.c.p. for short) given В if ?(?, A) satisfies the following conditions: (CP1) ?(?, ¦) is a probability on A for each fixed ? e X; (CP2) ?(-, A) is ?-measurable for each A e A; and (CP3) ? (? ? ?) = fB ?(?, A)dP for every A e А, В е В. A r.c.p. ?(?, A) given В is proper at jto e X if (CP4) ?(?0. ?) = 1 whenever x0 e В еВ.
Perfect measures and related topics 779 Classic example of Dieudonne using a non-Lebesgue measurable subset ? с [0,1] shows that reps need not always exist. However, if ? on (?, ?, ?) is perfect and A is e.g. for any sub ?-algebra В С A, r.c.p. ?(?, A) given В such that each ?(?, ·) is proper at each ? ? ?. Blackwell and Ryll-Nardzewski A963) have shown that the exceptional set ?, in general, cannot be removed even in standard Borel spaces while giving conditions under which the set N can be removed. For related results see Ramachandran A979, Part I, Chapter 4). It is not known whether a probability space (X, A, P) in which there is a r.c.p. given В for every sub ?-algebra ? of Л is necessarily perfect. For sufficient conditions of a purely measure-theoretic nature for existence of r.c.p. see Pfanzagl A969), Musiat A972). For results concerning r.c.p. in Doob's sense see Ramachandran A979, 1981). For further generalizations of the notion of a r.c.p. and their connection to perfect measures, see Faden A985), Sokal A981). 7. Independence and Blackwell spaces There are two definitions of independence of two real random variables - one due to Steinhaus and the other due to Kolmogorov. An example due to Doob A948), Jessen A948) using a nonperfect probability space shows that Kolmogorov's definition is more restrictive; however, in perfect probability spaces the two notions are equivalent (see appendix by Doob in Gnedenko and Kolmogorov A954)). Recent work concerning the necessity of perfectness for the equivalence of the two notions of independence leads to the study of measures on Blackwell spaces (see Ramachandran A979), Shorn A987a, 1987b)). Let (X, A) be a measurable space where A is separable. (X, A) is said to be a Blackwell space if, for any separable sub ?-algebra A\ of A, we have A\ = A. (X, A) is said to be strongly Blackwell space if any two e.g. sub ?-algebras of A with the same atoms are identical. The two definitions of independence are equivalent in strongly Blackwell spaces. Ryll-Nardzewski has shown the existence of a non-Lebesgue measurable subset X* С [0, 1] with ?*(?*) = 1 such that (X*,B[0,\] ? ?*) is strongly Blackwell; this nonperfect space shows that perfectness is not necessary for the equivalence of the two definitions of independence. A probability space (X, A, P) is called an independence space if the two definitions of independence are equivalent in (?,?, ?). A measurable space (X, A) is called a universal independence space if for every probability ? on (X, A), the space (X, A, P) is an independence space. Strongly Blackwell spaces are universal independence spaces. Shortt and Bhaskara Rao A987) have shown, under (CH), the existence of a Blackwell space which is not a universal independence space answering a question of Ramachandran A975, P930) in the negative. A probability space is called almost surely (as.) Blackwell (respectively as. strongly Blackwell) if for some ? e A with P(Y) = 1, the space (Y, А П Y) is Blackwell (respectively strongly Blackwell). Ramachandran A979) has shown that separable perfect spaces are as. strongly Blackwell and that as. strongly Blackwell spaces are independence spaces. Shortt A987a, 1987b) has shown, under (CH), the existence of a nonperfect independence space which is not even as. Blackwell (answering P931 in Ramachandran A975) in the
780 D. Ramachandran negative). Note that concepts such as universally measurable space, strongly Blackwell space, universal independence space and Blackwell space are set-theoretic concepts while concepts such as perfect probability space, as. strongly Blackwell space, independence space and a.s. Blackwell space are measure-theoretic concepts. For implications involving the above mentioned, see papers by Shorn A987a, 1987b) and Shortt and Bhaskara Rao A987). Imposing set-theoretic conditions like the strong Blackwell property does not avoid measure theoretic pathologies such as being a non-independence space. Shortt A984/85) has shown the existence of a strongly Blackwell space in which there is no r.c.p. given a e.g. sub-?-algebra. These findings reinforce the fact that perfect probability spaces are the best candidates for providing a technically most pleasing model for probabilistic measure theory. 8. Standard measure spaces A probability space (?, ?, P) is called a standard measure space (separable measure space) if there exists X\ e A with P(Xi) = 1 such that (X\, А П X\) is a standard Borel space (separable space). Lebesgue spaces studied by Rohlin A949) are completion of standard measure spaces and standard measure spaces are separable measure spaces where ? is a perfect measure. Every standard measure space with a nonatomic measure is almost isomorphic to ([0,1],#[?,?],?). For detailed classical results about standard measure spaces we refer to Rohlin A949) and Ramachandran A979). We mention the following results: Let (X, A, P) be a standard measure space and let ? с Л be a e.g. sub ?-algebra. A set A e A is called a measurable partial selector for В if ? ? ? contains at most one point for every ?-atom B. THEOREM 8.1 (Rohlin). There is a decomposition ofX of the form X = Mo U M\ U ¦ ¦ ¦ U M„ U ¦ ¦ ¦ , where (i) M„ is a measurable partial selector for В for every ? ^ 1; (ii) M„ is a set of maximal measure among all measurable partial selectors for В which are subsets of (U?=i ?i)c for every ? ^ I; and (iii) Mq contains no partial selector for В of positive measure. A decomposition of X as in Theorem 8.1 is called a maximal decomposition. Since ? is perfect there is a r.c.p. ?(?, A) on XxA given В which is almost everywhere proper; it can be shown that ?(?, -)\м0 is nonatomic a.s. [??] andM(;c, Mn) ^ ?(?, ??+\) as. [??] for every ? > 1. The following theorem summarizes Rohlin's results on the existence of independent complements in standard measure spaces. THEOREM 8.2. Let (X, A, P) be a standard measure space. Let В С A be a e.g. sub- ?-algebra and let X = U/^=o Mn be a maximal decomposition. Then the following are equivalent: (i) В has an independent complement which is e.g. i.e., there exists a e.g. sub-?-algebra B* of A such that В and В* are independent with ?({?, ?*}) = A as. [P].
Perfect measures and related topics 781 (ii) ?(?, M„) = const, as. [??] for each ? ^ 1. (iii) Every maximal decomposition of X consists of sets ?-independent of В. (iv) There is a maximal decomposition ofX consisting of sets ?-independent of В. Corollary 8.1. Let (X, А, Р) be a standard measure space. Let В С Л be a e.g. sub ?-algebra such that [Pg]-almost all measures ?(?, ¦) are continuous where ?(?, A) is any version of the re.p. given B. Then В has an independent complement which is e.g. What is more, there is an almost isomorphism of(X, A, P) with the unit square equipped with the planar Lebesgue measure under which В can be identified with the ?-algebra of vertical cylinders. For another proof of Rohlin's theorem on independent complements see Ershov A977). For classification of e.g. sub ?-algebras of a standard measure space based on the structure of the corresponding reps see Ramachandran A979). 9. Mixtures of perfect measures Let (X, A) and (У, В) be two measurable spaces and let ?(?, ?) be a transition probability on ? ? ?. Let ? be a measure on (X, A). The measure ?? on В defined by ??(?)= ? ?(?,?)??, in В е В is called the ?-mixture of ?(?, ¦); ? is called the mixing measure, ?(?, ¦) are called mixand measures and ?? is called mixture measure. Rodine A966) initiated the study of the role of perfectness in the mixture problem and conjectured that a perfect mixture of perfect measures is perfect. Ramachandran A974, 1979a, 1979b) (see these papers for details) showed that the conjecture is false and that all the possible cases can arise in the mixture problem with respect to perfectness of measures and gave sufficient conditions for perfect mixtures of perfect measures to be perfect. Since perfectness and compactness are equivalent, all the possible cases involving compact measures can arise in the mixture problem. Ramachandran A979a) also showed that a perfect mixture (and hence a compact mixture) of discrete measures is perfect. The following question is still open: Is a compact mixture of discrete measures compact! 10. Disintegrations When working with ?-algebras which are not countably generated reps need not exist even when the space is perfect. Hence we consider the notion of disintegration (see Valadier A973), Edgar A975), Maharam A975), Pachl A978)): Definition 10.1. Let (X,A) and (У, B) be two measurable spaces and let R be a probability on(X xY,A®B). Let Q be the marginal of R on (У, В). Suppose that for each
782 D. Ramachandran у e У there is a ?-algebra Ay С AonX and a probability P{y, ¦) on (X, Ay) satisfying the conditions: (Dl) For each A e A there is NA e В with Q(NA) = 0 such that A e Ay for every у ?Na and the function P(-, A) on (У — Мд) is ? П (У — Мд)-measurable; and (D2) For every AeAandB eB f P(y,A)dQ(y) = R(AxB) Jb (in view of (Dl) this integral is well defined). The family {(Ay, P(y, )}уеЛ is then called a ^-disintegration of R. THEOREM 10.1 (Pachl A978)). A probability space (?,?, ?) is compact iff for every complete probability space (У, B, Q) and for every joint probability R on (? ? ?,?<8> ?) with marginals ? and Q there exists a Q-disintegration of R. Pachl's characterization of compactness was used to establish properties C7-C10. Ramachandran A979) introduced a more general notion of disintegration of a probability space. Definition 10.2. Let (Z,C, P) be a probability space and let В be a sub ?-algebra of A. Suppose that for each ? e ? there is a sub ?-algebra Cz of С and a probability P(z, ¦) on (Z, Cz) satisfying the following properties; (Dl') For each С eC there exists Nc e В with P(Nc) = 0 such that С eCz for every z?Nc and the function P(-, C) on (Z - Nc) is ? П (Z - Nc)-measurable; and (D2') For every В е В and С е С, I P(z,C)dP = P(Br\C) Jb (in view of (Dl') this integral is well-defined). The family {(Cz, (??, )}zez is then called a ?-disintegrationof (Z,C, P). The above definition generalizes the notion of a r.c.p. to ?-algebras that are not necessarily e.g. If (Z, C, P) С (У, В, Q) (as a thick subspace) then let C* = ?({?, ?}) and let P* on (У, C*) be defined by P*(C*) = P(C* ? ?), С* е С*. Definition 10.3. We say that a probability space (Z,C,P) is disintegrable if for every (У, B, Q) such that (Z, С, Р) С (У, В, Q) and for every sub ?-algebra B* of C* there is a ^-disintegration of (У, С*, />*). We have the following unified version of Pachl's results:
Perfect measures and related topics 783 THEOREM 10.2 (RamachandranA979)). For a probability space (Z,C, P) the following are equivalent: (i) (Z, С, Р) is compact. (ii) (Z, C, P) is disintegrable. (iii) For every compact probability space (Y,B, Q) such that (Z, С, ?) С (?, ?, Q) and for every sub-?-algebra B* ofC* there is a B*-disintegration of(Y, С*, Р*). (iv) For some compact probability space (Y,B, Q) such that (Z,C, P) С (?, ?, Q) there is а В-disintegration of(Y,C*, P*). Blackwell and Maitra A984) have introduced the notion of factorization; in Definition 10.1, if Ay = Л and P(y, ¦) is a charge on (X, A) for each у е Y, we say that {Р(У> )¦ У е У} is a <2-factorization of P. Analogous to Theorem 10.1, Maitra and Ramakrishnan A988) and have established the following characterization of perfectness. THEOREM 10.3 (Maitra and Ramakrishnan). Let (X, A, P) be a probability space where A is ? ?- gene rated. The following are equivalent: (i) ? is perfect. (ii) For any probability space (У, B, Q) and for any measure R on (? ? ?, ? <g> В) with marginals ? and Q, there is a Q-factorization of R. (iii) For any probability space (Y, B, Q) with В e.g. and for any measure R on (? ? Y,A®B) with the marginals ? and Q, there is a Q-factorization of R. (iv) For any probability Q on (91, By) and any measure R on (? ? Ш, А х Вщ) with marginals ? and Q, there is a Q-factorization of R. While (iv) => (i) is true for any ? -algebra A, it is not known whether the reverse implication holds if ? is not ? ?-generated. Adamski A986) has shown that for any e.g. ?-algebra A (i) ? (?): For any probability space (У, B, Q) and for any measure R on {? ? У, Л® В) with marginals P and Q, there is ? ? -additive Q-factorization ofR. For additional results in these directions refer to Adamski A986), Blackwell and Maitra A984), Pachl A978, 1979, 1981), Ramachandran A979), Remy A988). References Adamski, W. A986), Factorization of measures and perfection, Proc. Amer. Math. Soc. 97, 30-32. Aniszczyk, B. A986), A note on "Two classes of measures" by J.K. Pachl, Colloq. Math. 50, 231-232. Arveson, W. A974), Operator algebras and invariant subspaces, Ann. Math. 100, 433-533. Banach, S. A948), Onmeasures in independent fields, Studia Math. 10, 159-177. Benes, V. and Stepan, J. (eds) A997), Distributions with Fixed Marginals and Moment Problems, Kluwer, Dordrecht. Bhaskara Rao, K.P.S. and Rao, B.V. A981), Borel spaces, Dissertationes Math. (Rozprawy Mat.) 190, 1-63. Billingsley, P. A986), Probability and Measure, 2nd edn, Wiley, New York. Bjomsson, O.J. A980), A note on the characterization of standard Borel spaces, Math. Scand. 47, 135-136. Blackwell, D. A956), On a class of probability spaces, Proc. Third Berkeley Symp. Math. Stat. Probab., Vol. 2, 1-6.
784 D. Ramachandran Blackwell, D. and Maitra, A. A984), Factorization of probability measures and absolutely measurable sets, Proc. Amer. Math. Soc. 92, 251-254. Blackwell, D. and Ryll-Nardzewski, C. A963), Nonexistence of everywhere proper conditional distributions, Ann. Math. Stat. 34, 223-225. Cohn, D.L. A980), Measure Theory, Birkhauser, Boston. Dall'Aglio, G., Kotz, S. and Salinetti, G. (eds) A991), Advances in Probability Distributions with Given Marginals, Kluwer, Dordrecht. Darst, R.B. A969), Properties ofLusin sets with applications to probability, Proc. Amer. Math. Soc. 20, 348-350. Darst, R.B. A971), On universal measurability and perfect probability, Ann. Math. Stat. 42, 352-354. Dekiert, M. A993), Two results for monocompact measures, Manuscripta Math. 80, 339-346. Doob, J.L. A948), On a problem of Marczewski, Colloq. Math. 1, 216-217. Edgar, G.A. A975), Disintegration of measures and the vector valued Radon—Nikodym theorem, Duke Math J. 42, 447^450. Erochin, V.D. A961), Zametka к teorii mery, Uspekhi Mat. Nauk 16, 175-180. Ershov, M.P. A977), On Rohlin's theorem of the independent complement, Uspekhi Mat. Nauk 32, 187-188 (in Russian). Faden, A. A985), The existence of regular conditional probabilities: necessary and sufficient conditions, Ann. Probab. 13, 288-298. Gardner, R.J. A982), The regularity of Borel measures, Oberwolfach Measure Theory Conference Proceedings 1981, Lecture Notes in Math., Vol. 945, Springer, Berlin, 42-100. Gnedenko, B.V. and Kolmogorov, A.N. A954), Limit Distributions for Sums of Independent Random Variables, Addison-Wesley, Cambridge. Gretsky, N.E., Ostroy, J.M. and Zame, W.R. A992), The nonatomic assignment model, Econ. Theory 2, 103-127. Hackenbroch, W. A992), Conditionally multiplicative simultaneous extension of measures. Measure Theory Proceedings, Oberwolfach 1990, D. Kolzow, D. Maharam-Stone, S. Graf, eds, Supplemento ai Rendiconti del Circolo Matematico di Palermo, Series II 28, 49-58. Halmos, PR. A950), Measure Theory, Von Nostrand, New York. Hansel, G. and Troallic, J.P A978), Mesures marginales et theoreme de Ford-Fulkerson, Z. Wahrscheinlich- keitsth. verw. Geb. 43, 245-251. Hansel, G. and Troallic, J.P. A986), Surle probleme des marges, Probab. Theory Related Fields 71, 357-366. Haydon, R. and Shulman, V A996), On a measure-theoretic problem of Arveson, Proc. Amer. Math. Soc. 124, 497-503. Hoffmann-J0rgensen, J. A970), The Theory of Analytic Spaces, Various Publications Series, No. 10, Aarhus Universitet Matematisk Institut, Aarhus. Hoffmann-J0rgensen, J. A987), The general marginal problem. Functional Analysis II, S. Kurepa, H. Kraljenic and D. Butkovic, eds, Lecture Notes in Math., Vol. 1242, Springer, Berlin, 77-367. Jasinski, J. and Weiss, T. A991), Sierpinski sets and strong first category, Proc. Amer. Math. Soc. Ill, 235-238. Jessen, B. A948), On two notions of independent functions, Colloq. Math. 1, 214-215. Kallianpur, G. A959), A note on perfect probability, Ann. Math. Stat. 30, 169-172. Kallianpur, G. and Ramachandran, D. A983), On the splicing of measures, Ann. Probab. 11, 819-822. Kantorovich, L.V A940), A new method of solving some classes of extremal problems, Comptes Rendus (Doklady) Acad. Sci. USSR 28, 211-214. Kantorovich, L.V. A942), On the translocation of masses, Doklady Akad. Nauk SSSR 37, 199-201. Translated into English and reprinted in Manage. Sci. 5 A958), 1^4. Kellerer, H.G. A964a), Masstheoretische Marginalprobleme, Math. Ann. 153, 168-198. Kellerer, H.G. A964b), Verteilungsfunktionen mit gegebenen Marginalverteilungen, Z. Wahrscheinlichkeitsth. verw. Geb. 3, 247-270. Kellerer, H.G. A984), Duality theorems for marginal problems, Z. Wahrscheinlichkeitsth. verw. Geb. 67, 399- 432. Koumoullis, G. A981), Onperfect measures. Trans. Amer. Math. Soc. 264, 521-537. Kuratowski, K. A966), Topology I, Academic Press, New York. Lembcke, J. A982), On simultaneous preimage measures on Hausdorff spaces. Measure Theory Proceedings, Oberwolfach 1981, D. Kolzow, D. Maharam-Stone, eds, Lecture Notes in Math., Vol. 945, Springer, Berlin, 110-115.
Perfect measures and related topics 785 Levin, V.I. A984), The mass transfer problem in topological space and probability measures on the product of two spaces with given marginal measures, Soviet Math. Doklady 29, 638-643. Maharam, D. A971), Consistent extensions of linear junctionals and of probability measures, Proc. Sixth Berkeley Symp. on Math. Stat, and Probab., Vol. 2, 127-147. Maharam, D. A975), Strict disintegration of measures, Z. Wahrscheinlichkeitsth. verw. Geb. 32, 73-79. Maharam, D. A978), An example of tail fields, Oberwolfach Measure Theory Conference, 1977, Lecture Notes in Math., Vol. 695, Springer, Berlin, 151-152. Maitra, A. and Ramakrishnan, S. A988), Factorization of measures and normal conditional distributions, Proc. Amer. Math. Soc. 103, 1259-1267. Mallory, D. A975), Extension of set functions to measures, Canad. Math. Bull. 18, 547-553. Marczewski, E. (Szpilrajn) A938), The characteristic function of a sequence of sets and some of its applications. Fund. Math. 31, 207-223. Marczewski, E. A948), Ensembles independants et leurs applications ? la theorie de la mesure. Fund. Math. 35, 13-28. Marczewski, E. and Ryll-Nardzewski, C. A953), Remarks on compactness and non-direct products of measures. Fund. Math. 40, 165-170. Miller, A.W. A984), Special subsets of the real line. Handbook of Set-Theoretic Topology, North-Holland, Amsterdam, 201-233. Monge, G. A781), Memoire sur la theorie des deblais et ramblais, Mem. Math. Phys. Acad. Roy. Sci. Paris, 666-704. Musial, K. A972), Existence of proper conditional probabilities, Z. Wahrscheinlichkeitsth. verw. Geb. 22, 8-12. Musial, K. A976), Inheritness of compactness and perfectness of measures by thick subsets, Oberwolfach Measure Theory Conference 1975, Lecture Notes in Math., Vol. 541, Springer, Berlin, 31—42. Musial, K. A980), Projective limits of perfect measure spaces, Fund. Math. 110, 163-189. Nadkarni, M.G. and Ramachandran, D. A978), On a question of D. Maharam concerning tail fields, Sankhya, Series A 40, 313-318. Neveu, J. A965), Mathematical Foundations of the Calculus of Probability, Holden Day, London. Pachl, J. A975), Every weakly compact probability is compact. Bull. Acad. Polon. Sci. Ser. Sci. Math. Astronom. Phys. 23, 401^405. Pachl, J. A978), Disintegration and compact measures. Math. Scand. 43, 157-168. Pachl, J. A979), Two classes of measures, Colloq. Math. 42, 331-340. Pachl, J. A981), Correction to the paper "Two classes of measures", Colloq. Math. 45, 331-333. Pfanzagl, J. A969), On the existence of regular conditional probabilities, Z. Wahrscheinlichkeitsth. verw. Geb. 11, 244-256. Plebanek, G. A989), Measures on two dimensional products, Mathematika 36, 253-258. Rachev, ST. A991), Probability Metrics and the Stability of Stochastic Models, Wiley, New York. Rachev, S.T. and Ruschendorf, L. A998), Mass Transportation Problems. Part I: Theory; Part IT. Applications, Springer- Verlag, New York. Ramachandran, D. A974), Mixtures of perfect probability measures, Ann. Probab. 2, 495-500. Ramachandran, D. A975), On the two definitions of independence, Colloq. Math. 32, 227-231. Ramachandran, D. A979), Perfect Measures, Vols 1, 2, ISI - Macmillan Lecture Notes Series, Vols 5, 7, Macmillan, New Delhi. Ramachandran, D. A979a), Perfect mixtures of perfect measures, Ann. Probab. 7, 444—452. Ramachandran, D. A979b), Existence of independent complements in regular conditional probability spaces, Ann. Probab. 7, 433^*43. Ramachandran, D. A981), A note on regular conditional probabilities in Doob 's sense, Ann. Probab. 9, 907-908. Ramachandran, D. A996), Marginal problem in arbitrary product spaces. Distributions with Fixed Marginals and Related Topics, L. Ruschendorf et al., eds, IMS Lecture Notes - Monograph Series, Vol. 28, 260-272. Ramachandran, D. and Ruschendorf, L. A995), A General Duality Theorem for Marginal Problems, Probab. Theory Related Fields 101, 311-319. Ramachandran, D. and Ruschendorf, L. A996), Duality and perfect probability spaces, Proc. Amer. Math. Soc. 124, 2223-2228.
786 D. Ramachandran Ramachandran, D. and Ruschendorf, L. A997), Duality theorems for assignments with upper bounds. Distributions with Fixed Marginals and Moment Problems, V. Benes and J. Stepan, eds, Kluwer, Dordrecht, 283-290. Ramachandran, D. and Ruschendorf, L. B000), On the Monge—Kantorovich duality theorem. Theory Probab. Appl. 45, 403^409. Remy, M. A988), Disintegration and perfectness of measure spaces, Manuscripta Math. 62, 277-296. Rodine, R.H. A966), Perfect probability measures and regular conditional probabilities, Ann. Math. Statist. 37, 1273-1278. Rohlin, V.A. A949), On the fundamental ideas of measure theory. Mat. Sb. 25 F7), 107-150 (in Russian). Amer. Math. Soc. Transl. Series 1, Vol. 10, 1-54. Ruschendorf, L. A981), Sharpness of Frechet-bounds, Z. Wahrscheinlichkeitsfh. verw. Geb. 57, 293-302. Ruschendorf, L., Schweizer, B. and Taylor, M. (eds) A996), Distributions with Fixed Marginals and Related Topics, IMS Lecture Notes - Monograph Series, Vol. 28. Ryll-Nardzewski, С A953), On quasi-compact measures, Fund. Math. 40, 125-130. Sazonov, V A962), On perfect measures, Izv. Akad. Nauk SSSR Ser. Mat. 26, 391—414 (in Russian). Amer. Math. Soc. Transl. Series 2, Vol. 48, 229-254. Schmidt, K.D. and Waldschaks, G. A991), Common extensions of positive vector measures, Portugalia Math. 20, 155-164. Shortt, R.M. A983), Strassen's marginal problem in two or more dimensions, Z. Wahrscheinlichkeitsth. verw. Geb. 64, 313-325. Shortt, R.M. A984), Universally measurable spaces: an invariance theorem and diverse characterizations. Fund. Math. 121, 169-176. Shortt, R.M. A984/85), Products of Blackwell spaces and regular conditional probabilities. Real Anal. Exchange 10,31^41. Shortt, R.M. A986), Sets with no uncountable Blackwell subsets, Czechoslovak Math. J. 37 A12), 320-322. Shortt, R.M. A987a), Borel dense Blackwell spaces are strongly Blackwell, Colloq. Math. 53, 35—41. Shortt, R.M. A987b), Notions of independence for random variables, Probab. Math. Stat. 8, 81-88. Shortt, R.M. and Bhaskara Rao, K.P.S. A987), Generalized Lusin sets with the Blackwell property. Fund. Math. 127, 9-39. Sierpinski, W. A956), Hypothese du continu, 2nd edn, Chelsea, New York. Sokal, A.D. A981), Existence of compatible families of proper regular conditional probabilities, Z. Wahrscheinlichkeitsth. verw. Geb. 56, 537-548. Sparre-Anderson, E. and Jessen, B. A948), On the introduction of measures in infinite product sets, Danske Vid. Selbskab. Mat. Fys. Medd. 25 D), 1-7. Strassen, V A965), The existence of probability measures with given marginals, Ann. Math. Stat. 36, 423^439. Stroock, D. A976), Some comments on independent ?-algebras, Colloq. Math. 35, 7-13. Sudakov, V.N. A975), Measures on subsets of direct products, J. Soviet Math. 3, 825-839. Tops0e, F. A979), Approximating pavings and construction of measures, Colloq. Math 42, 377-385. Valadier, M. A973), Disintegration d'une mesure sur un product, C. R. Acad. Sci. Paris Ser. A 276, 33-35.
CHAPTER 19 Riesz Spaces and Ideals of Measurable Functions Martin Vath Department of Mathematics, University of Wiirzburg, Am Hubland, D-97074 Wiirzburg, Germany E-mail: vaeth@mathematik.uni-wuerzburg.de Contents 1. Preliminaries 789 2. Riesz spaces of measurable functions 790 2.1. Riesz spaces 790 2.2. Dedekind completeness and support 792 2.3. Order convergence and topology 796 2.4. Bibliographical remarks 797 3. Locally solid ideals of measurable functions 798 3.1. Ideals and bands in Riesz spaces 798 3.2. Ideals of measurable functions 800 3.3. Comparison of various sorts of convergence 803 3.4. Bibliographical remarks 805 4. Ideal spaces 805 4.1. Preideal spaces 805 4.2. Fatou property and perfectness 808 4.3. Completeness 810 4.4. Convergence theorems 811 4.5. Bibliographical remarks 813 5. Order duals 815 5.1. The associate space 815 5.2. Applications to ideal spaces on product measures 817 5.3. Order duals of a Riesz space 818 5.4. Duals of ideals of measurable functions 820 5.5. Bibliographical remarks 822 References 822 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 787
This page intentionally left blank
Riesz spaces and ideals of measurable functions 789 1. Preliminaries Throughout this chapter, (S, ?, ?) denotes a measure space with a positive measure ?. To simplify notation, we will assume that this measure space is complete, i.e., that subsets of null sets are again null sets. It will be convenient to consider functions with values in У where У is a Banach space, or ? = Ш, or where У is a Banach space with an additional element oo. By \y\, we denote the norm respectively absolute value of у e ?. Of course, we put | ± oo| := oo. We equip ? with the canonical uniform structure, i.e., the restriction to bounded sets is the uniform structure induced by the norm, and y„ ->· oo if and only if \y„\ —>¦ oo. Since this uniform structure has a countable base, the classical metrization theorems imply that it is induced by a metric d. If У is a Banach space, we may choose d(x, y) = \x — y\, and for У = R, we may choose d(x. y) = | arctan.v - arctan;y| (with the natural conventions for the case x, у = ±oo). Also if У is a Banach space with an additional element oo, an "explicit" formula for d can be given by means of the stereographic projection (see, e.g., Proposition 2.2.1 of M. Vath A997)): ж л уУA + Ы2)-уA + М2I2 + (И2-1у12J , , . d(x,y):= г (?, у ? ос), A + И2)A + Ы2) d(x, oo) :=d(oo, x) : = ^ (jc^oo), d(oo, oo) :=0. 1 + |jc|2 ^ Since we are not only interested in integrals, it will be convenient to enlarge the class of measurable functions slightly compared to the definition of Chapter 17: We call a function ?: S —>¦ ? measurable if the restriction to each finite set is ?-measurable in the sense of Chapter 17, i.e., if for each set ? e ? with ?(?) < oo the restriction x\e can be approximated almost everywhere (with respect to d) by a sequence of iT-step functions (for the terminology see Chapter 17); it may always be arranged that these approximating functions attain only finite values. For functions with ?-finite support the two definitions are equivalent, of course. Note that (Bochner) integrable functions have automatically ?-finite support (because {s: \x(s)\ ^ и-1} must have finite measure), and so this difference plays no role in the context of Chapter 17. If two measurable functions x,y:S-*Y coincide almost everywhere, we tacitly identify them unless something else is said. By ?ul((S, ?, ?), У), we denote the set of all (equivalence classes of) such functions. For brevity, we usually write just S in place of (S, ?, ?) and do not mention У in case У = Ш. In particular, Wl(S) is the same as m((S, ?, ?),?). Recall that if S is ?-finite, then there is a normalized measure v, i.e., (S, ?, v) is a measure space with the same null sets as (S, ?, ?) but v(S) = 1 (in particular, v(S) < oo). In this case, we consider on X с 9ЛE', У) the metric p(x,y):= inf (m + v(\s: d(x(s),y(s)) > m\)) 0<m<x
790 ?. Vath or f d(x(s),y(s)) p(x,y): = I dv(s). Jsd(x(s),y(s))+\ We say that a sequence xn e Wl(S, Y) converges in measure to ?: S ->· ? on a set ? e ? if for any ? > 0 the relation m*({s e ?: d(jt„0), ;c(i)) > ?}) -> 0 holds where ?* denotes the outer measure. PROPOSITION 1. Let S be ?-finite. For ?,,,? e 9K(S, ?) we /lave ? (?,,,*) -> 0 (p(x„, x) —> 0) if and only if xn -^ ? on each set of finite ?-measure. In particular, ? and ? induce the same topology on Wl(S, Y), and this topology does not depend on the particular choice of v. The following classical results carry over. THEOREM 1 (Egorov). Let S be ?-finite. If xn e VK(S, Y) converges almost everywhere to ? : S —>¦ ? (with respect to d), then ? e 9JiE", Y), and there is an increasing sequence E\ с Е2 с ··· with ?(??() < oo and ?E \ (jEk) =0 such that d(x„(s),x(s)) -+ 0 uniformly on each E^. In particular, p(x„,x) —> 0 (and p(x„, x) —> 0). THEOREM 2 (Riesz). If xn e Wl(S, Y) converges in measure to some x, then there is a subsequence which converges to ? almost everywhere (with respect to d). If S is ?-finite, the same conclusion holds if x„ —> ? in measure on each set of finite measure (i.e., if p(x„ ,x)^0or p(x„ ,*)_>()). We will mark all places explicitly where the axiom of choice is used in an uncountable form which goes beyond the so-called axiom of dependent choices. 2. Riesz spaces of measurable functions 2.1. Riesz spaces Definition 1. A partially ordered vector space is a real vector space X endowed with a partial order ^ with the following properties: A) ? ^ у implies ? + ? ^ у + ? for all ? e X. B) ? > 0 implies ?? > 0 for all ? > 0. The set of all leX with ? ^ 0 is the positive cone and is denoted by X+. Of course, ? ^ у if and only if у — ? e X+. The notion "positive cone" is explained by the fact that a subset С с X is a positive cone for some order on X which turns X into a partially ordered vector space if and only if С is a cone, i.e., A) С is convex with С + С с С, and B) СП(-С) = {0).
Riesz spaces and ideals of measurable functions 791 Definition 2. An element ? of a partially ordered set X is an upper bound of a nonempty subset ? с X if у ^ ? for all у e ?. A /ower footmii is defined analogously. If ?? has an upper/lower bound, we call it order bounded from above/below. We call ? order bounded if it is bounded from above and below. If ? is an upper bound for ? and simultaneously a lower bound for the set of all upper bounds of ?, then ? is called the supremum of ?, and is denoted by sup ?. The infimum inf ? is defined analogously. If supAi or infAi exists, then it is uniquely determined. Definition 3. A partially ordered space X is called a lattice, if ? ? у : = sup{*, у] and ? л у := inf{j:, у) exists for any Jt, у е X. A Riesz space is a partially ordered vector space which is simultaneously a lattice. A partially ordered vector space X is a Riesz space if and only if sup{;c, y) exists for any x,y e X. Indeed, then inf{;c, y) automatically exists and is the same as — sup{— x, —y). Definition 4. Let X be a Riesz space. Then the Riesz operations are defined by x+ := ? vO, x~ := (-x) лО, |jc | := (—?) ? ? for each ? e X. The lattice operations can be obtained by the Riesz operations: Proposition 2. We always have 0^л±^|л|, x =x+ — x~ and \x\=x+ + x~. Moreover, xv у = -(x + y + \x-y\) and хлу = -(х + у-\х-у\). The order of a Riesz space X is completely determined by each of the three Riesz operations. A linear subspace U с X with the inherited order is a Riesz space if it is invariant under one (and thus all) of the three Riesz operations. Definition 5. For a sequence x„ in a partially ordered space X, we use the notation x„ ? ? if ? ? ^ X2 ^ · · · and ? = inf{jt|, xi,...}. x„ | ? is defined analogously. For later usage, we already mention a related more general notion: Definition 6. A set ? с X is called directed upwards if for any x, у еМ there is some ? e ? with ? ^ ? and ? ^ y. If ?? is directed upwards and jc = sup ? exists, then we write ? 1 ?. ? ? ? is defined analogously. To compare the two definitions, the following notion is useful.
792 ?. Vath DEnMTlON 7. A partially ordered space X is called order separable if any ? с X which posses a supremum or infimum contains an (at most) countable subset which has the same supremum respectively infimum. PROPOSITION 3. Ifx„ | x, then {x„: n] | x. Conversely, if ? | ? and either X is order separable or ? is countable, then ? contains a sequence with xn | x. A corresponding result holds for xn l x. We are mainly interested in the lattice Wl(S, R). Clearly, 9JiE", R) is a lattice with respect to the order ? ? у «=> x(s) ^ y(s) for almost all s e S. Unless something else is said, we always consider this order. Note that we have xn | ? if and only if x„(s) | x{s) for almost all s e 5". The subspace W(S) = Wl(S, R) is a Riesz space whose order is determined by the Riesz operation \x\(s) = \x(s)\- In view of Proposition 2, we call a subset X с Wl(S) a Riesz space of measurable functions (with respect to our canonical order) if it is a subspace such that ? e X implies |*| eX where \x\(s):= \x(s)\. Example 1. If5" = [0, 1] (with the Lebesgue measure), then C([0, 1]) is a Riesz space of measurable functions in the above sense while C1 ([0, 1]) is not. Nevertheless, C1 ([0, 1]) is a Riesz space with respect to a different order (defined, e.g., by ? ^ у «=> *@) ^ y@) and x'(t) ? y'(t) for all t e [0, 1]). It is noteworthy that any Riesz space X of measurable functions (with the canonical order) is Archimedean, i.e., for any ? e X+ we have /i~'jc \. 0. This excludes many pathologies arising in the general theory of Riesz spaces. 2.2. Dedekind completeness and support We intend now to study certain order properties of Wl(S) in connection with the algebra of measurable sets: Let ^(S) = *B(S, ?, ?) denote the algebra of all measurable subsets of S where we identify two sets when they differ only by a null set. Then *B(S) becomes a lattice with respect to the order Z)<C? «=> D с ? UN for some null set N. A) Note that D„ | D in 55(S1) means that up to null sets the relations D\ с ZJ <? · ¦ · and D = \JD» hold. Dehnition 8. Given ? e 9JiE"), the support of ? (denoted by supp*) is defined by supp? := {s e 5": x(s) ? 0). Given some ? с Wl(S, ?, ?), we say that ? has a support if there is some ? & ? with the following properties:
Riesz spaces and ideals of measurable functions 793 A) For any ? e ? the set supp ? \ ? does not contain a set of positive measure. B) For each OC? with ?(?)) > 0 there is some ? e ? such that D П supp ? contains a set of positive measure. In this case, the set E, considered as an element of 25E), is called the support of ? (denoted by suppAi). PROPOSITION 4. ? has a support if and only if the set (De!8(S): D ^ supp ? for some ? eM) B) has a supremum in *B(S). In this case, suppAi is this supremum. In particular, suppAi is uniquely determined. Recall that the elements of 97tE) and 55E) are actually equivalence classes (because we decided to ignore null sets). Hence, the union U of the set B) is in general not well- defined. But even if we fix representative sets D, it may happen that U is non-measurable. Moreover, even if U is measurable, it need not be the support: it is only clear from the definition that if U is measurable it is an upper bound, but it need not necessarily be the least upper bound. Of course, if ? is countable then all these problems do not arise. However, for uncountable ? the set supp ? need not exist, in general. Example 2. Let N, So, S\ be pairwise disjoint sets with uncountable So, S\. Put S : = N U So U S\, and let ? be the system of all ? с 5" with the property that either ? or its complement Ec is countable. In the first case, we let ?{?) be the number of elements in E\N, and in the second case ?(?) = oo. Let ? be the set of all measurable functions which vanish on So- Then a set ? с S satisfies the two properties of Definition 8 if and only if ? = S\ U No for some No <? N. Since none of these sets belongs to ?, ? does not have a support. Example 2 shows additionally that if we would drop the requirement ? e ? in Definition 8, we had difficulties to speak of the uniqueness of a support (the set N considered in Example 2 is not a null set but only a so-called local null set, see Theorem 4 below). Dehmtion 9. We call a lattice X A) Dedekind complete if for each subset ? ? X which is order bounded from above (below) sup ? (inf ?) exists; B) ?-Dedekind complete if for each countable С с X which is order bounded from above (below) sup С (inf С) exists; C) super Dedekind complete if for each subset ? с X which is bounded from above (below) there is some countable С <^M such that sup С = sup ? (inf С = inf ?). Clearly, each super Dedekind complete space is order separable. _ Note that for ordered vector spaces (and similarly also for <8E) and 971E, K)), it suffices to verify the properties for the suprema, because inf ? = — sup(—M) (for ? с *8E), we have similarly inf ? = sup{?': ? e ?}'').
794 ?. Vath The spaces 5?E) and 37iE) are always ?-Dedekind complete (with sup С = {J С respectively supC(i) = {supjt(r): ? e C}). Note that in general this is not true for all spaces of measurable functions. For example, C([0, 1]) is not ?-Dedekind complete. Corollary 1. <8E) is Dedekind complete if and only if any set ? с 971E) has a support. The space 371E) is (super) Dedekind complete if and only ;/971E, R) is (super) Dedekind complete. PROOF. The first statement follows from Proposition 4 (for the converse conclusion consider for ? с «8E) the set {??: ? e ?} с 371E)). For the connection of_97iE) and 371E, R), consider the set {arctan ? ?: ? e ?] C37iE) in place of ? с 371E, R). D At first glance, one might suspect that the Dedekind completeness of 371E) implies the Dedekind completeness of 55E), because for any ? с 55E) the set {??: ? e ?} has a supremum which is evidently of the form ?^ for some ? с 5. However, ? need not necessarily be measurable: The relation ?? e 371E) just means that ? П F is measurable for each set F of finite measure. This difficulty does of course not arise in ?-finite measure spaces. In view of the difficulties discussed after Proposition 4, it may appear somewhat surprising that in ? -finite measure spaces there arise even no problems at all concerning Dedekind completeness: THEOREM 3. //5 is ?-finite, then 37iE,1) (and thus 971E)) and 55E) are super Dedekind complete. In particular, any ? -Dedekind complete subspace is super Dedekind complete (and order separable). PROOF. In view of the above remarks, it suffices to prove that 371E) is super Dedekind complete. Passing to a normalized measure, it is no loss of generality to assume ? E) < oo. Thus, let ? с 971E, IR) be given. By considering {arctanojc: ? e M) in place of M, if necessary, we may assume that ? has integrable upper and lower bounds. For countable А с ?, define a number m (A) := {5$???(?)??(?). The supremum s of all these numbers is finite, and there is a sequence A„ с ? with m(A„) —>¦ s. Putting С := {jA,,, we have m(C) = s. For any у e ? we have m(C U {>¦}) ^ m(C) < oo which implies у ? sup С. Hence, sup С is an upper bound of ?. In view of С с ??, it is the least upper bound. D Corollary 2. Let E, ?1, ?) be an arbitrary measure space, and X — Lp(S) @ < ? < oo). If ? с X is order bounded in X, then supAi and mi ? exist, and there is some countable С с X with sup ? = sup С and infM = inf C. PROOF. Let x\,X2 e L,,E) be upper and lower bounds for M, respectively. Since \xj\p is integrable, it has ?-finite support. Hence, the support of any ? e ? is contained in the ?-finiteset 5o = suppjf| UsuppjC2-Now apply Theorem 3 with 5o in place of 5. D It turns out that (super) Dedekind completeness of 55E) or 97tE) are intrinsic properties of the underlying measure space. Sometimes, measure spaces 5 for which 5SE) is Dedekind complete are called localizable.
Riesz spaces and ideals of measurable functions 795 The Dedekind completeness of ^(S) and Wl(S) is closely related with the finite measure property introduced in Chapter 17. The following result gives more insight into this property. Recall that a set N с S is called a local null set if N П F is a null set for any set F of finite measure. Clearly, each null set is a local null set. The converse holds precisely in spaces with the finite measure property: THEOREM 4. For a measure space (S, ?, ?) the following statements are equivalent. A) 5" has the finite measure property. B) Each set in Ъ{Б) is the supremum of its subsets of finite measure. C) Each local null set is a null set. D) Any ? e ? with ?(?) = ?? contains some D e ? with 0 < ?(?) < oo. In view of the last statement, the finite measure property is sometimes also called finite subset property. THEOREM 5. Let S have the finite measure property. If'BiS) is Dedekind complete, then also 9JiE") is Dedekind complete. The proof of Theorem 5 makes essential use of the axiom of choice. Dehnition 10. A measure space (S, ?, ?) has the direct sum property if there exists a family of pairwise disjoint sets Sa of finite measure with S = (J Sa such that each set ? с 5 of finite measure is (up to a null set) contained in a countable union (J Sa„ ¦ One may equivalently replace the assumption that the sets Sa have finite measure by the assumption that they are ?-finite. The direct sum property is a rather straightforward generalization of ?-finite sets. It is characterized by the fact that functions on S (with values in Banach spaces) are measurable if and only if they are measurable on each Sa (this is Proposition A. 2.1 of Vath A997)). THEOREM 6. Let S have the finite subset property. If S has also the direct sum property, then ^(S) (and thus also ffl(S)) is Dedekind complete. Also the proof of Theorem 6 relies on the axiom of choice. There is an important exception of an uncountable subset of 9Л E) where it makes sense to define a pointwise supremum. It is a rather deep result that this pointwise supremum is actually the order supremum. THEOREM 7 (Luxemburg-Gribanov). Let T, S be ?-finite measure spaces, and ? : ? ? S —> ? be measurable with respect to the product measure. Then for any ? с 9ЛE, ?) the function zm@'-= sup I \x(t,s)\\y(s)\ds yeMJS
796 ?. Vath belongs to 9JiE", Ш) and is the order supremum of the set Z- z(t) = ? \x(t, s)\\y(s)\ds for some у e ? \. In view of Theorem 3, the statement of Theorem 7 can be reformulated in the form that there is some countable С с ? with zm = zc almost everywhere: This is how the Luxemburg-Gribanov theorem is usually formulated. 2.3. Order convergence and topology We recall that there is a natural notion of convergence and a topology in Riesz spaces. DEnNlTlON 11. A sequence xn e X is called order convergent to ? if there is some sequence y„ \. 0 with \x„ — x\ ?. yn. A set А с X is called order closed if for any sequence x„ e A which order converges to some ? e X we have ? e A. The system of all complements of order closed sets determines the order topology of X. Clearly, each sequence is order convergent to at most one point. Moreover, the order topology is in fact a topology on X. We point out that we define order closed sets only with respect to sequences which is in accordance with the definitions by Luxemburg and Zaanen A971). Occasionally (e.g., by Aliprantis and Burkinshaw A977) or by Fremlin A974)) order closedness is defined instead with respect to more general directed sets which we will not discuss here. The difference between these two possible definitions is analogous to the difference between closed and sequentially closed sets in topological spaces. Although the latter appears more natural from the viewpoint of a general theory of Riesz spaces, it appears to the author that the above definition is more appropriate for subsets of Wl(S) (because sequences are more important here than nets). Surprisingly, it is not true that order convergence means the same as convergence with respect to the order topology (also not for the alternative definition of the order topology mentioned before). In fact, Wl(S) is the most important example for which these notions do not coincide, as we will see now. Proposition 5. A sequence xn e 9ft(S) is order convergent to ? e Wl(S) if and only if x„ —> ? almost everywhere. The order topology can be described in view of Theorems 1 and 2 as follows. PROPOSITION 6. Let S be ?-finite. Then the order topology on Wl(S) is induced by ? (and p). In particular, a sequence converges with respect to the order topology if and only if it converges in measure on each set of finite measure.
Riesz spaces and ideals of measurable functions 797 One could of course ask whether there exists also a topology on Wl(S) (possibly different from the order topology) such that convergence of a sequence with respect to this topology is the same as convergence of that sequence almost everywhere. The following observation implies that such a topology does usually not exist. Moreover, if any such topology exists, then the order topology has the same property (if S is ?-finite). PROPOSITION 7. Let S be ?-finite and such that convergence almost everywhere is not the same as convergence in measure on sets of finite measure (i.e., there is a sequence x„ e Wl(S) which does not converge almost everywhere but which on each set of finite measure converges in measure to some ? e ffl(S)). Then there is no topology on Wl(S) such that convergence of sequences with respect to this topology is the same as convergence almost everywhere. PROOF. Assume by contradiction that such a topology exists. But then x„ ->· ? with respect to this topology and thus almost everywhere. Indeed, otherwise we could find a neighborhood of ? which does not contain infinitely many x„k . Since a subsequence of xHi converges to ? almost everywhere and thus with respect to the assumed topology, this is a contradiction. ? Any ?-finite measure space which contains a set ? of positive measure with no atoms (i.e., any subset of ? of positive measure can be divided into two sets of positive measure) satisfies the assumptions of Proposition 7. 2.4. Bibliographical remarks Riesz spaces are a widely used concept in functional analysis. In particular, Banach lattices (which are simultaneously Riesz and Banach spaces with a Riesz norm; see the next sections) are well developed. A first systematic treatment of such spaces was initiated by Riesz A930), Kantorovich A936), and Freudenthal A936). An introductory textbook into this topic was written by Zaanen A997). More advanced monographs are by Luxemburg and Zaanen A971) and Zaanen A983) as well as by Aliprantis and Burkinshaw A976, 1985), Fremlin A974), Krasnoselskii et al. A989), Meyer-Nieberg A991), Nakano A950), and Schaefer A974), where partially the theory is considered from a somewhat different point of view. Closely related monographs are by Birkhoff A967), Kutateladze (as editor) A996), Semadeni A971), and Sikorski A964). Particular emphasis on the connection with measurable functions is put in the monographs by Fremlin A974), Luxemburg and Zaanen A971), and Zaanen A983). The Dedekind completeness of ???(?) and thus the existence of a support on ?-finite measure spaces (Theorem 3) is mathematical folklore but goes probably back to Kantorovich. Slightly different proofs can also be found in Paragraph 34, Theorem 1, of Zaanen A967) and in Example 23.3(iv) of Luxemburg and Zaanen A971). Theorems 4, 5, and 6 are Paragraph 34, Theorem 2, Exercise 35.1, and Paragraph 35, Theorem 1 of Zaanen A967). We note that the Dedekind completeness of *BE") for ?-finite measure spaces 5" is also an immediate consequence of the exhaustion theorem which is extremely useful
798 ?. Viith in measure and integration theory and which may be considered as a measure theoretic analogue of Zorn's lemma: THEOREM 8 (Exhaustion theorem). Let ? с Ъ{Б) be a system of measurable subsets of ? ?-finite set ? e *B(S) satisfying the following two conditions: A) ? is directed upwards, i. e. for any D \, Di eT there is some ? e ? with D \ U D2 <? ?. B) For any measurable ? С Г with ?(?) > 0 there is some D e ? with D С Е and ?(?))>0. Then there is a sequence of sets Tn e ? with Tn | T. PROOF. Considering ? as a measure space, we equip ? with a normalized measure v. There is some sequence E„ e ? with v(E„) —>¦ s := sup{v(E): ? e T}. By induction, we find T„ e ? with Tn+\ э Jn и En. Then T„ | Г. Otherwise, ? \\JT„ would contain some set D e ? with ?(?)) > 0 which would yield the contradiction v(D U E„) = v{D) + v(E„) > s for sufficiently large и. ? The Luxemburg-Gribanov theorem was originally used to obtain a measurability result for a mixed norm space by Gribanov A970) and Luxemburg A958). We will turn to this topic later. The current version can be found as Theorem 99.2 and Corollary 99.3 in Zaanen A983); it has been generalized recently by Vath B00?) for Z/w@'-=sup I \x(t,s)y(t,s)\ds, \€M JS where ? с Ш(Т ? 5", ?) is a subset of the set of all functions of the form y(t, s) = v(t)u(s) where и e C(S, Y) and ? e V with V being a so-called equimeasurable set of functions set of functions ? : S —>¦ [0, oo]. We do not go into details but just point out that if S is a locally compact metric space with a ?-finite Radon measure, then the choice V = C(S) is possible. 3. Locally solid ideals of measurable functions 3.1. Ideals and bands in Riesz spaces Dehnition 12. Let X be a Riesz space. A subset ? с X is called solid if the relations ? e ?, у е X, and \y\ ^ |jc| imply that у е М. A) A solid subspace of X is called an {order) ideal. B) An ideal / с X is called a ?-ideal if for any countable С с / for which * = sup С е X exists, we have ? e /. C) An ideal / с X is called a band, if for any ?? с / for which ? = sup ? e X exists, we have jc e /. Note that any ideal is a Riesz space by Proposition 2.
Riesz spaces and ideals of measurable functions 799 The notion "ideal" might arise some confusion, because it is also used as an algebraic notion for certain subsets of rings. However, in this algebraic notion the crucial property of ideals is that they are characterized as the null sets of ring homomorphisms (namely of the quotient maps). An analogous property is satisfied for order ideals. DEnMTlONl3. An linear map of Riesz spaces /: X \ -> Xi is called a A) Riesz homomorphism if |/(jc)| = f(\x\)\ B) ? -Riesz homomorphism ifsup /'(C) = /(supC) for any countable С ? X for which sup С exists; C) normal Riesz homomorphism if sup/(Ai) = f(supM) for any ? с X for which sup ? exists. Recalling Proposition 2, we see that Riesz homomorphisms preserve the linear structure and order as well as all lattice and Riesz operations (like finite suprema and infima). If X is a Riesz space and ? с X is an ideal, then the quotient space X/M can be endowed with a canonical order defined by [jc? ] ^ [??] whenever the equivalence classes [x·,] contain representatives y, with y\ ? yi- Propositions. Let X be Riesz space. A) If I с X is an ideal, then X/I is a Riesz space, and the quotient map X —> X/I is a Riesz homomorphism with null space I. Conversely, the null space of any Riesz homomorphism is an ideal. B) If a Riesz homomorphism is normal (respectively ? ?-Riesz homomorphism) then its null space is even a band (respectively ?-ideal). The converse holds if the Riesz homomorphism is surjective (in particular, for the quotient map X —> X/I). Ideals have very natural properties concerning Dedekind completeness: PROPOSITION 9. Let X be a (super) (?-) Dedekind complete Riesz space. Then also each ideal of X is (super) (?-) Dedekind complete. Bands have a particular importance in connection with the following definition. Dehnition 14. Let X be a Riesz space, and x, у e X. Then we call ? and у disjoint (and write X-Ly) if I*| Л \y\ — 0. For ? с X, the disjoint complement is defined by ? := [? e X: for any у e ? we have X-Ly}. The set M1- is always a band. PROPOSITION 10. Let X be Dedekind complete, and В с X be a band. Then В = (B^I- and X = ? ? ?-1. In particular, there is a projection Рв ¦ X -> В onto В with null space B^. This projection satisfies 0 <; PBx ? ? (? e X+).
800 ?. Viith 3.2. Ideals of measurable functions We are interested in subspaces X с 97tE) which are simultaneously ideals in 97tE), i.e. which have the property that ? eX, у e 371E), \y(s)\ <c \x(s)\ for almost alb => yeX. C) In view of Proposition 9, these spaces are always ?-Dedekind complete and, under reasonable mild assumptions on 5, they are even (super) Dedekind complete (recall Theorems 3 and 6). With ? as described at the beginning, the space 37tE, Y) is not a Riesz space, in general, because it is not ordered. Nevertheless, we also have a "Riesz operation", defined by \x\{s) :— \x{s)\ almost everywhere. Moreover, we can define a corresponding "order topology" which we call o-topology. A set ? с 37tE, ?) is closed in this topology if and only if x„ eM, у e37iE, У), d(x„(s), y(s)) ^0 for almost alb =>· у е М. The following analogue of Proposition 6 follows from Theorems 1 and 2: PROPOSITION 11. IfS is ?-finite, then the o-topology is induced by ? (and p). We also call a linear subspace X с 371E, Y) an ideal if C) holds (with 371E) replaced by 37tE, Y)). Note that, since we require that X be a linear subspace, the functions ? e X are by definition almost everywhere finite. For most practical purposes, ideals in the above sense come equipped with a topology which turns them into topological vector spaces and which is compatible with the order structure in the sense that it is locally solid, i.e. 0 has a neighborhood base consisting of solid sets. Of course, we call a subset ? of an ideal X с 37tE, Y) solid if ? e M, у e 371E, Y), \y(s)\ ? \x(s)\ for almost all s => yeM. The (locally solid) ideals of 97?E, ?) and of 97iE) are in a one-to-one correspondence with each other: Proposition 12. Let e eY with \e\ = 1. A) IfX с 97tE, Y) is an ideal (with a locally solid topology), then XK:={jte97iE): x()e e X} is an ideal in 97iE) (with a locally solid topology determined by the neighborhoods 0/0 SK:={;te97iE,yK): x(-)eeB\, where В runs through all neighborhoods ofO in X).
Riesz spaces and ideals of measurable functions 801 B) Conversely, if Xr с tyl(S) is an ideal (with a locally solid topology), then X :={xe 3Jt(S, Y): \x(-)\ eX} is an ideal (with a locally solid topology determined by the neighborhoods ofO B:={xeWl(S,Y): \х(-)\еВл], where Z?r runs through all neighborhoods ofO in Хк). In view of the correspondence of X and Xr, we call Xr the real form of X, but unless the danger of confusion arises, we will notationally not distinguish between them. Recall that a pseudo-norm on a linear space X is a map || · ||: X —> [0, oo) which satisfies the triangle inequality ||jc + y\\ ^ ||jc|| + \\y\\ and which is continuous for scalar multiplication in the sense that ||?.?|| —> 0 as ? —> 0. If X is a Riesz space, we call || · || a lattice pseudo-norm if additionally 1*1 «? ? => 11*11 <Ы1- Occasionally, lattice pseudo-norms are also called Riesz pseudo-norms in literature. Recall that by the classical metrization theorem of uniform spaces the topology of any topological vector space is determined by the family of continuous pseudonorms. It is not surprising that for locally solid topologies even lattice pseudonorms will do. THEOREM 9. Let X be a Riesz space or an ideal in VK(S, Y). Then each locally solid topology on X is determined by the family of continuous lattice pseudo-norms, i.e. for each point ? e X a neighborhood base is given by the system of all "solid balls " (i.e. balls with respect to some continuous lattice pseudo-norm) with center ?. PROOF. For Riesz spaces, a detailed proof can be found in Section 22C of Fremlin A974). The case of a Banach space ? reduces to the case ? = Ш in view of Proposition 12 (see also Feledziak and Nowak A997)). ? Corollary 3. Let X be a Riesz space or an ideal in Wl(S, Y) with a locally solid topology. Then 0 has a countable neighborhood base if and only if the topology is generated by a single continuous lattice pseudo-norm. PROOF. If 0 has a countable neighborhood base, we find a countable family of continuous lattice pseudo-norms || · |k whose balls form a base for the neighborhoods of 0. Then 00 II II ^ 1 + II* Ik k=\ " "* is the required pseudo-norm. D
802 ?. Vath In case ? = Ш, we can evidently also interpret the term "ideal" in some algebraic sense. Indeed, a linear subspace X is an ideal of Wl(S) if and only if it is invariant under multiplication by functions from LX(S). A similar characterization can be given for the locally solid topology. One might suspect that in case of a Banach space ? an analogous characterization would hold when LX(S) is replaced by Loc(S, C(Y)) where C{Y) denotes the set of bounded linear operators of ? with the operator norm. This is indeed true, although the proof is not straightforward, because the measurability of functions in LX(S, C(Y)) is a more restrictive requirement than one might expect at first glance: Example 3. For 5" = [0,1] (with the Lebesgue measure) and ? = L|([0, 1]) there are functions x,y e LodS, Y) with |jc| = |y| such that there is no measurable function ?: S -+ C(Y) with \z(s)\ ^ 1 and y(s) = z(s)x(s). Indeed, fori e [0, 1] define f 1 if/ iCs, and let x{s) := es and y{s) := e\. Assume by contradiction that a measurable function z-S -> C(Y) with \z(s)\ ? 1 and y(s) = z(s)x(s) would exist. Let / be the bounded linear map from C(Y) into the dual space Y*, defined by /(А)(и) := /0' (Au)(t)dt. Then I := / ? ?: S -> ?* is measurable. Since t(s) e Y* can, as any bounded linear functional on L ? ([0, 1 ]), be written in the form l(s)u= / ks(t)u(t)dt, Jo where ks e Loo([0, 1]) with l^loc = \i(s)\ ^ |/| \z(s)\ ^ 1, the equality l(s)es = 1 implies that we must have ks = —es almost everywhere (see, e.g., Theorem 1.39(c) of Rudin A987)). But this implies |?(j|) - e(s2)\ = \ku - kS2\x = 2 for s\ ^s2- Hence, €E0) is not separable for any uncountable SO <? S, and so I is not measurable, a contradiction. As we can already guess from Example 3, the dual space (more precisely, the so-called duality map) plays the essential role in this problem. For this reason, the following result is of much use. LEMMA 1. For any Banach space ? and any ? > 0 there is a continuous map F :Y —>¦ Y* such that F(y)- \y\2 and \F(y)\ ? \y\(\ + ?) for each у е Y. The proof of Lemma 1 makes essential use of the axiom of choice. COROLLARY4. Let ? be a Banach space. Then for any measurable functions ?, у: S ->· ? with \y(s)\ ^ \x(s)\ and any ? > 0 there IS SOYYIC ? ? Ьх E",C(Y)) with \z(s)\ ?\+???? y(s)=z(s)x(s).
Riesz spaces and ideals of measurable functions 803 PROOF. Choose F as in Lemma 1, and put ? CO := y(s)\x(s)\~2F (define ? CO :=0 for those s with x(s)=0). О Example 3 shows that Corollary 4 (and thus Lemma 1) is no longer true for ? = 0. Corollary 4 now implies the desired alternative characterization of ideals: PROPOSITION 13. Let X be a linear subspace ofWl(S, Y), and Kx denote the unit ball of L^S, C(Y)). A) X is an ideal in Wl(S, Y) ifandonly if KXX с X or, equivalently, LX(S, C(Y))X с X. B) Let X be also a topological vector space. Then X is an ideal in Wt(S, Y) with a locally solid topology if and only if for each neighborhood ?/cX ofO the inclusion KooU с U holds. PROOF. Only the sufficiency requires a proof. By Corollary 4, we have for any ?/cX and any ? > 0 that K^lf contains the solid set U?:={y eWl(S,Y): \y\ «? UKl+?) for some ? eU). For U = X, we obtain that X is an ideal, and if U is a neighborhood of 0 with U 2 ?^??, then U э ?? 5 A + ?)-1 U, and so U? is a solid neighborhood of 0 contained in U. D 3.3. Comparison of various sorts of convergence Probably the most important property of ideals in ?ul(S, Y) with a locally solid topology is that this topology is compatible with "convergence a.e." in the following sense: PROPOSITION 14. Let X be an ideal in 9JiE", Y) with a locally solid Hausdorff topology. Ifx„ —>· ? in X, then for almost all s e S there is a subsequence with x,4 (s) —> .x CO- PROOF. We prove that D„m := ??1»{·?: \x(s) ~ xk(s)\ > w~') is a nul1 set· ?*1611 Un.m Bum is a null set which implies the statement. We thus have to show that у : = m~ Vd„.„, vanishes almost everywhere. But since \y\ < \x — Xk\ and хи —> x,y is contained in any locally solid neighborhood of 0 and thus in any neighborhood of 0 which means у = 0, because the topology is Hausdorff by hypothesis. ? Corollary 5. Let X be as in Proposition 14. Suppose that x„ e X is such that for some (not necessarily finite) у : S —>· ? we have d(x„(s), y(s)) —* Ofor almost all s. Ifx„ —> ? in X, then y(s) = x(s)for almost all s (in particular, у is finite a.e.). Under a slightly more restrictive assumption than in Proposition 14, a much stronger conclusion can be obtained.
804 ?. Vath ???????? 15. Let X be an ideal in Wl(S, Y). Then we call a locally solid topology on X entire if for each xq e X \ {0} there is some ? e X \ {0} with \x(s)\ ^ |jco(^)| and some continuous pseudo-norm || · || on X with ||>>|| > 0 for each у e 97t(S, ?) \ {0} with |y(i)l^|jc(i)|. Clearly, each entire topology is Hausdorff (because ||jco|| ^ \\y\\ > 0). Conversely, the notion entire appears only as a slight generalization of the Hausdorff property which is more appropriate for comparisons with the order. We point out that metrizable topologies are always entire. In fact, if a locally convex topology on X is Hausdorff and 0 has a countable neighborhood base, then Corollary 3 implies that there is a continuous pseudo- norm || · || on X with ||jc|| =0 if and only if ? — 0. THEOREM 10. Let X be an ideal in Wl(S, Y) with a locally solid topology. Let S be ?-finite, and 9JiE", Y) be endowed with the topology of Proposition 1. Then the embedding of X into W(S, Y) is continuous if and only if the topology on X is entire. This is in particular the case if X is Hausdorff and 0 has a countable neighborhood base. Recall that the topology on UR(S, Y) considered in Theorem 10 is nothing else but the o-topology (since S is ? -finite). We remark that if, in addition to the hypothesis of Theorem 10, the space X is metrizable and complete (i.e., a so-called F-space, see, e.g., Rudin A990)), then Theorem 10 follows from Proposition 14. In fact, in this case one can apply the closed graph theorem: The embedding of X into ?ul(S, Y) has trivially a closed graph in view of Corollary 5 (and Riesz' Theorem 2). By considering the restriction of X to functions which vanish outside a given set E, we obtain a corresponding result for measure spaces which are not necessarily ?-finite. Corollary 6. Let X be an ideal in Wl(S, Y) with an entire locally solid topology (e.g., X is Hausdorff and 0 has a countable neighborhood base). Ifx„ —> ? in X, then on each set of finite measure the sequence x„ converges to ? in measure. Corollary 6 is a very surprising result, because our definition of ideals and locally solid topologies is apparently not at all related to the values of the measure: the measure ? only appears implicitly in the form of null sets. By Riesz' Theorem 2, Corollary 6 implies a remarkable strengthening of Proposition 14 for ?-finite measure spaces: Corollary 7. Let X be as in Corollary 6. If S is ?-finite and x„ -> ? in X, then a subsequence of x„ converges almost everywhere to x. The converse question (for which sequences xn does "convergence almost everywhere" imply convergence in X) will be discussed in Section 4.4. Although the topology in X is "compatible" with the o-topology (even if oo e Y), this "compatibility" has its limits. For example, the following property is not always satisfied (examples will be given later).
Riesz spaces and ideals of measurable functions 805 Dehnition 16. We say that a locally solid ideal X in 9JiE", Y) has the Fatou property if each point has a neighborhood base of sets which are closed in Wl(S, ? U {oo}) with respect to the o-topology. The name will become clear later. We point out that our definition slightly differs from the definition used sometimes in literature (e.g., by Aliprantis and Burkinshaw A977) or Fremlin A974)), mainly because we consider sequential closedness. On ?-finite measure spaces, this makes no difference, because Wl(S) is order separable (recall Theorem 3). 3.4. Bibliographical remarks Spaces of measurable functions are together with spaces of continuous functions the most natural examples of Riesz spaces. Using the axiom of choice, standard results state that any Riesz space X satisfying certain natural assumptions is Riesz isomorphic to (a subspace of) C(K) where К is the set of all maximal (order) ideals, equipped with a certain topology (analogously to the Gel'fand representation theorem for Banach algebras). Similar results establish a Riesz isomorphism to L^iS). For such representation theorems, we refer to Luxemburg and Zaanen A971), Semadeni A971), Schaefer A974), and Zaanen A983), In particular, one may also establish Riesz isomorphisms of ideals of Wl(S) to C(K). However, these homomorphisms do not immediately imply, e.g., Theorem 3. Moreover, ideals in Wl(S) are among the best understood classes of Riesz spaces, and so it makes not much sense to represent them in a less known form. The precise connection of Example 3 with the duality map is revealed in Theorem 2.1.1 of Vath A997). Lemma 1 is from Dirr and Vath B000). Proposition 14 is apparently new, although it was known already in the special case of preideal spaces (Lemma 2.2.1 in Vath A997))· The notion of entire topologies is related with the so-called support of the topology, see, e.g., Feledziak and Nowak A997). For the latter, see also Aliprantis and Burkinshaw A977, 1980a, 1980b). In these references, the reader will also find other conditions than metrizability which ensure that a Hausdorff topology is entire (on ? -finite measure spaces, the Fatou property is such a condition, for example, in view of the remarks following Definition 16 and by the results of Aliprantis and Burkinshaw A977)). Theorem 10 was proved for scalar functions by Wnuk A986); the case of vector functions follows from this straightforwardly in view of Proposition 12 (see Feledziak and Nowak A997) for details). In the context of preideal spaces, the important Corollary 6 was obtained first independently by Zabreiko A974) and by Luxemburg and Zaanen A963d). (The arguments of Luxemburg and Zaanen are completely different.) 4. Ideal spaces 4.1. Preideal spaces Henceforth, we will restrict our attention to locally solid ideals of Wl(S, Y) which are generated by a single (quasi-)norm. Recall that for a linear space X, a map || · ||: X —>¦
806 ?. Vath [0, ??) is called a quasi-norm if ||jc|| =0 implies ? = 0, ||?*|| = |?|||.?||, and if \\x + y\\·^ C{\\x\\ + \\y\\) for some constant С < oo. If X is an ideal of 97tE, Y), we call || · || a lattice (quasi-)norm if it has additionally the property ? eX, ye 971E), \y(s)\ ^ \x(s)\ for almost alb => ||y|| ^ \\x\\. D) It will be convenient to define \\x || := oo for any ? e 97tE, ?) which does not belong to X. With a slight misuse of notation, we nevertheless call this extended map || · || a (quasi-) norm. With this convention (which we assume throughout), the relation ||*|| < oo is a reformulation of the statement ? e X. If we interpret D) in this extended sense, any subspace X с 97?E, ?) with a lattice (quasi-)norm is automatically an ideal in 97?E, ?). Dehnition 17. A subspace X с 97tE, ?) with a lattice (quasi-)norm is called a (quasi- normed) preideal space. If X is also complete, it is called ideal space. PROPOSITION 15. Any (quasi-normed) preideal space X is a locally solid Hausdorff ideal in 971E, Y). Its real form is also a (quasi-normed) preideal space with the (quasi-)norm determined by III*IIIxr = IWI· Henceforth, we will notationally not distinguish between X and its real form. Analogously to Proposition 13, one obtains the following result. PROPOSITION 16. A (quasi-)normed linear subspace ? o/97tE, ?) is a (quasi-normed) preideal space if and only if its unit ball is invariant under multiplication of functions by the unit ball of Loo(S, C(Y)). Recall that Corollary 4 (which was used for the previous result) essentially depends on the axiom of choice. Let us give a number of examples which show that most spaces of measurable functions occurring in applications are actually ideal spaces (we will see later how the completeness is proved). Examples 4. A) The Lebesgue-Bochner spaces Lp(S, Y) @ < ? ? oo) are quasi-normed ideal spaces. They are normed for ? ^ 1. B) The sequence spaces lp(Y) @ < ? ^ oo) are obtained as the special case 5 = N with the counting measure. C) The space со of null sequences (considered as a subspace of lx) is an ideal space. D) The space cfin of all sequences which are eventually 0 (considered as a subspace of cq) is a preideal space but no ideal space.
Riesz spaces and ideals of measurable functions 807 E) Lebesgue-Bochner spaces weighted by some measurable function w : S —> @, oo] are ideal spaces. Here, the space consists of all measurable functions ? : S ->· ? for which y(s) := w(s)x(s) belongs to Lp(S, Y). F) Orlicz spaces are important ideal spaces. We briefly recall the definition. Let Ф:Ш —> [0, oo] be a Young function, i.e., convex, even with Ф@) = О, and not almost everywhere oo or 0. Then the Orlicz space Lcp{S, Y) consists of all measurable functions ? : S —> ? for which the Luxemburg norm is finite. For the choice Ф(г) = \t|'', one obtains the spaces Lp, and also L^ is a special case. G) One may also generalize the Orlicz spaces (and Lp spaces) by choosing a Young function ? which also depends on s. Under natural assumptions, one still obtains ideal spaces, see, e.g., Appell and Zabreiko A990). One may consider this dependence on s as a generalization of a weight function. (8) Lorentz and Marcinkiewicz spaces are ideal spaces. They belong to a particular class of preideal spaces whose norm depends only on the decreasing rearrangement of the function. In other words: The norm does not change if the function is replaced by an equimeasurable function. For this reason, these spaces are usually called rearrangement invariant, or sometimes also symmetric. One of the difficulties in the abstract theory of ideal spaces is that one does not know whether the spaces contain the constant functions, even if the underlying measure space is finite (think of weighted Lebesgue spaces or on the more involved implicit weighting of Orlicz spaces described above). For this reason, for example, one cannot prove that uniform convergence of a sequence implies convergences in the (quasi-)norm. The example of weighted Lebesgue spaces (or also, e.g., of ЬР(Щ) suggests that instead of a constant function one should consider some strictly positive function which belongs to the space (in the abstract notion of Riesz spaces such a function plays the role of a so-called weak unit). The following result ensures the existence of such a function for complete spaces, and a reasonable substitute for incomplete spaces. It is needed for the proof of almost all results concerning ideal spaces. THEOREM 11. Let X be a (quasi-normed) preideal space with ? ? -finite support. Then there are sets E„ | supp X with ??? e supp X. If additionally X is complete, there is a function ? e X with supp ? = supp X. PROOF. The first statement follows from the exhaustion Theorem 8. For the second statement, choose a sequence ?„ > 0 such that ? ^nC" \\??„ II converges (C is the constant of the quasi-norm). The partial sums of ? ???,, are then a Cauchy sequence of X, and so the sequence converges in norm to a desired element ? e X (apply Proposition 14). D The example Cf\„ shows that, in general, apreideal space X does not contain a function ? with suppjc = supp X.
808 ?. Vdth For ideal spaces, an important strengthening of Proposition 14 holds which is useful in connection with the dominated convergence theorem discussed later: THEOREM 12. Let x„ —>· ? in a {quasi-normed) ideal space X. Then there is some у е X and some subsequence with \x„k | ^ | у | such that x„K —> ? almost everywhere. Note that, in contrast to Corollary 7, it is not required for Theorem 12 that the underlying measure space be ?-finite. The sequence xn = n~]e„ in ct\„ (with e„ being the canonical «th unit vector) shows that, in general, Theorem 12 does not hold for preideal spaces. 4.2. Fatou property and perfectness Dehnition 18. Let 1 ^ a < oo, and X be a (quasi-)normed preideal space. Then X is called A) semi-perfect if the relations 0 < x„ | ? in ?ul(S, R) and sup ||*„|| < oo imply ? e X. B) a-almost perfect if the relations 0 ^ x„ | ? e X imply \\x\\ ? a lim || xn \\. C) ?-perfect if the relations 0 ^ x„ | ? in 9Jt(S\ R) imply \\x || ^ a lim \\x„ \\. In case a = 1, we do not mention a. In view of our convention that ||jc || < oo means ? e X, we have that X is perfect if and only if it is semi-perfect and almost perfect. Note that the above definition is actually only a requirement on the real form of X. The properties may be considered as abstract analogies of the monotone convergence theorem. Examples 5. A) The spaces Lp(S, Y) @ < ? ^ oo) as well as their weighted forms are perfect (by the monotone convergence theorem). B) Orlicz spaces are perfect. C) Neither cq nor ct\„ are semi-perfect. However, they are almost perfect. Further examples will follow, soon. We may reformulate the properties of Definition 18 in a way which does not make use of the real form of X. These equivalent formulations remind of the lemma of Fatou: PROPOSITION 17. Let X be a (quasi-)normedpreideal space and 1 ^ a < oo. A) X is semi-perfect if and only if for any sequence x„ e X with sup \\x„ || < oo and d(xn(s),x(s)) -+0 a.e. for some ? eWl(S, ? U {oo}) we have ? e X. B) X is a-almost perfect if and only if for any sequence x„ e X with \x„(s) — x(s)\ —> 0 a.e. for some ? & X we have ||*|| ^ orliminf ||*„||. E) n—>oo
Riesz spaces and ideals of measurable functions 809 C) X is a-perfect if and only if for any sequence x„ e X with d(x„(s),x(s)) —> 0 a.e. for some ? e Wl(S, ? U {oo}) we have E). PROOF. For necessity, consider xn := inft>„ |**|, and for sufficiency, consider Xn '.— Xfj€ witheey, \e\ = 1. " ? PROPOSITION 18. Let X be a (quasi-normed)preidealspace. A) X is perfect if and only if its unit ball is closed in W(S, ? U {oo}) with respect to the o-topology. B) X is almost perfect if and only if its unit ball is closed in X with respect to the o-topology. C) IfX has the Fatou property, then X is ?-perfect for some a. The converse holds if S is ?-finite. PROOF. All statements are consequences of Proposition 17. We prove the last one: X has the Fatou property if and only if its unit ball К is contained in some o-closed set ? с Wl(S, ? U {oo}) which in turn is contained in a ball of some radius a. This implies the necessity of ?-perfectness, but also the sufficiency if S is ?-finite. Indeed, if S is ? -finite, the o-topology is induced by a metric (namely p), and so the set ? of all o-limits of sequences of the unit ball is o-closed. ? Proposition 18 explains the name "Fatou property". The properties considered here are not independent of each other: THEOREM 13. Any semi-perfect (quasi-normed) preideal space is ?-perfect for some a. The following examples show that one cannot always choose a = 1, and that no other implications for the properties of Definition 18 hold, even in the case of ideal spaces. Examples 6. A) The space of sequences ? = (?„),, determined by ||*|| :=sup|?„| + Xlimsup |?„| ? ?—>oo (for fixed ? e [0, oo)) is an ?-perfect ideal space for a = 1 + ? but not for any smaller a. B) The space of all double sequences ? = (?,,.^,,? determined by ||*|| :=sup |?„.(t|+sup(fc limsup |?«.*|) п.к к V и-юс ' is an ideal space which fails to be ?-almost perfect for any a (and thus also fails to be semi-perfect).
810 ?. Vath C) The space of all integer-indexed sequences * = (?;-)* ez determined by II^H := |sup)teZ|^| +Xlimsup^3C |&| if linu·^-^· =0, I oo otherwise (for fixed ? e [0, oo)) is an ideal space which is ?-almost perfect for a — 1 + ? but for no smaller a and which fails to be semi-perfect. Definition 19. Given some (quasi-normed)preideal space X, the corresponding Lorentz space Xl consists of all ? e Wl(S, ? U {oo}) for which ||* Hi. :=inf{lim||*„||: 0 ^ *„ | |*|} F) is defined and finite. PROPOSITION 19. We always have X с XL and \\x\\L ^ ||.x||. Moreover, X = Xl (assets) if and only if X is semi-perfect, and II*Hz. ^<*_l ||*|| (* e X) if and only if X is almost ?-perfect. In particular, X = Xl with the same norms if and only if X is perfect. It is not immediately clear whether || · ||z. is a (quasi-)norm, namely whether ||*||i. = 0 implies that * = 0 (almost everywhere). However, for ?-finite measure spaces, this is a consequence of the following result which is obtained by combining Egorov's Theorem 1 with Theorem 11 and a fancy diagonal argument. THEOREM 14. If ? has ?-finite support, then the infimum in F) is actually a minimum. Moreover, for any sequence 0 ^ *„ | |*| the relation \\x„\\l t II* Hz. holds. Corollary 8. Let S be ?-finite. Then Xl is a perfect (quasi-normed) ideal space in which X is continuously embedded (with embedding constant at most 1). The quasi-norms are equivalent on X if and only if X is almost ?-perfect for some a. In other words, Xl is the "perfect hull" of X. The only statement of Corollary 8 which does not follow from the previous results is the completeness of Xl which however follows from Corollary 10 below. 4.3. Completeness We have seen that any preideal space has a perfect hull which is an ideal space. Since any preideal space has a completion, one might hope that this completion might be written in a canonical way as an ideal space. This is an especially natural conjecture if one recalls that the completion of the preideal space ct\„ is the (not perfect!) ideal space со- The following discouraging example shows that this is not true, in general.
Riesz spaces and ideals of measurable functions 811 Example 7. Let X be the preideal space of sequences ? = (?„)„ defined by the norm oo IW|:=532~"l&l+limsuPlS«l- n=l Then the sequence *„:=A,..., 1,0,0,...) // times is a Cauchy sequence, but in any preideal space containing X the only candidate for a limit is ? := A, 1,...) (Corollary 5). But we have x„, ? e X and ||jr„ — *|| -^O. Dehnition 20. A (quasi-normed) preideal space X has the Riesz-Fischer property if the relation 0 ^ x„ | ? in 9Jt(S\ R) for some Cauchy sequence x„ e X implies that ? e X. THEOREM 15. A (quasi-normed) preideal space X is complete if and only if it has the Riesz-Fischer property. Corollary 9. X is a (quasi-normed) ideal space if and only if its real form Xr is a (quasi-normed) ideal space. COROLLARY 10. Any (quasi-normed) perfect preideal space is a (quasi-normed) ideal space. 4.4. Convergence theorems Given D e ? and ? e 371E", Y), we define PDx e OTE, Y) by PDx(s):=<pD(s)x(s) = \ y ' 10 if s ? D. Dehnition 21. Let X be a (quasi-normed) preideal space. We say that a subset ? с Х has equicontinuous norm if the relations lim sup sup ||/??||=0, G) inf sup || ? ?' ? || = 0 (8) hold. The regular part X,- of X is defined as the subset of all ? e X for which \x) has equicontinuous norm. If X — X,, the space X is called regular. Roughly speaking, condition (8) means that the functions ? e ? become "uniformly small" (in norm) outside of large sets of finite measure. This condition is automatically
812 ?. Valh satisfied if ?(?) < oo. Similarly, condition G) means that the functions ? e ? become "uniformly small" on sets of small measure. This condition is automatically satisfied if the measure space consists only of atoms of a positive minimal measure (e.g., for the counting measure). Regularity may be considered as an abstract analogue of Lebesgue's dominated convergence theorem: PROPOSITION 20. If ? с ? has ?-finite support, then ? has equicontinuous norm if and only if Dn ; 0 in <8E) => sup ||PDiix\\ I 0. PROPOSITION 21. X, is always a closed ideal in X. In particular, ifX is a (quasi-normed) ideal space, then also X, is a (quasi-normed) ideal space. Examples 8. A) The spaces Lp(S) and /,, @ < ? < oo) are regular in view of the dominated convergence theorem. In contrast, ^oc([0, 1]) is not regular (because G) fails), /30 is not regular (because (8) fails), and LX(R) is not regular (because G) and (8) both fail). The regular part of l^ is cq. B) Orlicz spaces (assuming ?(?) < oo) are regular if and only if the generating Young function satisfies a so-called A^-condition, see, e.g., Rao and Ren A991). If the generating Young function is everywhere finite, their regular part is the closure of the family of iT-step functions. Orlicz spaces without /i^-condition are the most important class of spaces for which the regular part is strictly smaller than the space but not trivial. THEOREM 16 (Dominated convergence theorem). Let X be a (quasi-normed) preideal space, andx,,,x e X be such that xn —> ? almost everywhere or at least in measure on each set of finite measure. If there is some у e Xr with \xn\ ?. у and \x\^y, then \\xn — ? || —> 0. Proposition 20 implies that the functions in X, are even characterized by the fact that they have the property of Theorem 16: Corollary 11. Let a quasi-normed preideal space X have ? -finite support. Then ? e X is regular if and only if the relation \x \ ^ x„ I 0 in ffl(S) implies \\x„ \\ I 0. COROLLARY 12. Any regular quasi-normed preideal space X is a-almost perfect where a = С is the constant of the quasi-norm. PROOF. If 0 ^ xn | x, then Цл- - jc„|| 4, 0, and so ||jc|| ^ C(||jc - x„\\ + \\x„\\) -> Clim ||jr„||. ? THEOREM 17 (Vitali's convergence theorem I). Let X be a (quasi-normed) preideal space, and x„ e Xr. Then the following statements are equivalent:
Riesz spaces and ideals of measurable functions 813 A) x„ —> 0 in measure on each set of finite measure, and {x„: n] has equicontinuous norm. B) ||*n||-»0. In complete spaces, we have an analogous result also for convergence to nonzero elements (this is what is usually called Vitali's convergence theorem): THEOREM 18 (Vitali's convergence theorem II). Let X be a (quasi-normed) ideal space, and x„ e Xr and ? e Wl(S, Y) be almost everywhere finite. If S is ?-finite, the following statements are equivalent: A) x„ —> ? in measure on each set of finite measure, and {x„: n] has equicontinuous norm. B) ? e X and \\x„ - ? || -> 0. In general (i.e., if S is not necessarily ?-finite), the first of the following statements implies the second: A) xn —у ? almost everywhere, and {xn: n] has equicontinuous norm. B) ? eX and \\xn -x\\ -> 0. 4.5. Bibliographical remarks Unfortunately, there is no unique terminology in the literature concerning ideal spaces. Sometimes, they are called Kothe spaces (although, historically, G. Kothe was mainly interested in a special class of such spaces which were sequence spaces), sometimes they are called Banach function spaces, and sometimes more or less restrictive requirements are made for the definition. A systematic treatment was given in a celebrated series of papers by Luxemburg and Zaanen A963b, 1963c, 1963d, 1963e, 1963f, 1963g, 1963h, 1964a, 1964b, 1964c, 1964d, 1964e, 1964f), and by Luxemburg A965a, 1965b, 1965c, 1965d, 1965e, 1965f) (see also Luxemburg and Zaanen A963a)) where the case of scalar functions is covered almost completely, and simultaneously the related theory of Riesz spaces is developed in much depth. Some surveys (containing also new results) were written by Zaanen A967), Zabreiko A974), and the author A997), where only the latter covers also the vector-valued case. The vector-valued case was also considered by Bukhvalov A972, 1978), Macdonald A973a, 1973b, 1974, 1976), and by Feledziak and Nowak A997). The quasi-normed case was particularly studied by Povolotskii and Kalitvin A991) and Vath B000). Many special ideal spaces arise naturally in interpolation theory, see, e.g., Bergh and Lofstrom A976), Krasnoselskii et al. A958), Krein et al. A978), and Lindenstrauss and Tzafriri A979). In particular, Lorentz and Marcinkiewicz spaces play a special role there, see Lorentz A950, 1951, 1961), Marcinkiewicz A939), Milman A978), and O'Neil A963). Rearrangement invariant function spaces were studied by Luxemburg A967) and Chong and Rice A971). Introductory monographs to the theory of the mentioned Orlicz spaces are by Krasnoselskii and Rutickii A958) and Rao and Ren A991). Preideal spaces are closely related with so-called modular spaces which are generated by certain functionals (called modulars). In particular, in the case of an Orlicz space
814 ?. Vath Ъф, the corresponding modular is given by m(x) = fs<P(\x(s)\)ds. Such a modular induces a topology which is not necessarily the same as the topology we discuss here. For comparisons of these topologies (in case of Orlicz spaces), see, e.g., Nowak A984a, 1984b, 1993). Proposition 16 suggests the following definition: call a Banach subspace X с W(S, Y) ideal* if it has the property that its unit ball is closed with respect to multiplication by functions from the unit ball of Loo(S, R). In case ? — R, this is just a characterization of ideal spaces in view of Proposition 16 (because C{Y) = R). However, in general, this class of spaces is much larger. Unfortunately, this class lacks in general the important properties of Section 3.3 if У has infinite dimension, see Example 3.1.1 of Vath A997). For finite-dimensional Y, it still has analogous properties, but the proofs are rather cumbersome. By means of an example, consider a function ? :R" —>¦ [0, oo) which is convex and even (but not of the form ? (и) — Фп(|и|) for some Young function Фо). Then the set of all measurable ? : S -*¦ R" for which the norm |?|:={?>0:/5?(^),^?} is finite is an important example of an ideal* space which is not an ideal space, in general. Ideal* spaces and the above generalizations of Orlicz spaces were considered by Nguyen A987, 1991) and by Nguyen and Zabreiko A990). Theorem 13 builds on a result of Amemiya A953), see, e.g., Paragraph 65, Theorem 2, ofZaanen A967) or Theorem 3.2.1 of Vath A997). The result has been formulated there only for the normed case, but the proof of the quasi-normed case is similar: one mainly has to insert an appropriate power of the quasi-norm constant С in order to ensure the convergence of the series in the proof of Theorem 3.2.1 of Vath A997) (similarly, as we did in the proof of Theorem 11). Theorems 12 and 15 are proved simultaneously in Theorem 3.26 of Vath B000). The proofs for the claims in Example 6 is given in Examples 3.2.3, 3.2.4 and 3.2.5 of Vath A997). Theorem 14 follows from Lemma 3.2.9 and Corollary 3.2.6 of Vath A997) (see also Paragraph 66, Theorems 2 and 4 of Zaanen A967)): the result is claimed there only for the seminormed case, but the proof for a quasi-norm carries over unchanged. The question whether a preideal space can be completed to an ideal space is rather involved and has been discussed by Luxemburg and Zaanen A963d, 1964e). We mention only the following result. THEOREM 19. Let X be a preideal space with ?-finite support. Then X can be completed to an ideal space if and only if for any Cauchy sequence xn I 0 the relation \xn II X 0 holds. Theorems 16, 17, and 18 follow from Theorems 3.22, 3.18, and 3.20 of M. Vath B000) and from Corollary 6. For further refinements of Vitali's convergence theorem (e.g., if the limit function is not assumed to be finite a priori), we refer to Vath A997). Straightforward variants of the Lebesgue and the Vitali convergence theorem can be proved also in the more general context when X is only an ideal in W(S, Y) with a locally
Riesz spaces and ideals of measurable functions 815 solid topology: the results then are formulated in terms of a generating family of pseudo- norms. For details, see Feledziak and Nowak A997). 5. Order duals 5.1. The associate space In this section, we assume that У is a Banach space, and that Y* is the corresponding dual space. Dehnition 22. Let X с 97?E, Y) be a preideal (normed!) space with supp X = 5". Then we define the associate space X' с 97tE, Y*) by the lattice pseudo-norm IIУIIr := sup / |;y(s)||.x(j)|dj. \\x\K\JS PROPOSITION 22. X' is a perfect ideal space. For any у e 971E, ?*), ? e 971E, Y), we have the generalized Holder inequality / \y(s)x(s)\ds ^ / |yE)||^E)|di ^ 1М1НИ1· The name Holder inequality is indeed justified: Example 9. Let 5 have the finite measure property. Then X = LP(S, Y) A ^ ? ^ oo) satisfies suppX = 5, and we have X' = Lp>(S, Y*) with \/p+ \/p' = 1. We could have equivalently used the pairing of Y* and ? for the definition of the associate space: THEOREM 20. For any у e97iE, ?*) we have IIУIIx' = sup / y(s)x(s)ds. ll-rKI JS Theorem 20 appears rather natural. Nevertheless, its proof involves some surprising technical difficulties if 5 fails to be ?-finite. If X is complete, we have a characterization of the elements of X' which is simpler to verify for applications: THEOREM 21. Let X be an ideal space. Then a function у e 97tE, Y*) satisfies \\y\\x> < oo if and only if у ? e L\(S)for any ? e X.
816 ?. Vath We have not excluded so far that X' is the trivial space. In fact, we cannot do this, in general. Example 10. Let S be such that each set which is not a null set has infinite measure. Then, for X = LooCS"), we have X' = {0}. It is a rather deep result that for ? -finite measure spaces the problem of Example 10 cannot happen. In this case, the associate space is never trivial and actually has full support. It follows then easily that this result holds also under a weaker assumption on the measure space: THEOREM 22. Let S have the finite measure property. Then supp X' = S. The proof of Theorem 22 is based on the separation theorem in the Hubert space Li(E) for an appropriate set ?CX. We consider now the second associate space X" := (X1)'. For simplicity, we put now ? =R. PROPOSITION 23. X is continuously embedded into X" with embedding constant 1, i.e., \\x\\x" ^ \\x\\for any ? e X. It is a natural question to ask whether X" is strictly larger than X. By a refinement of the proof of Theorem 22, one can obtain the following result which (for ? -measure spaces) answers this question completely: THEOREM 23 (Lorentz, Luxemburg). Let ? e X be such that supp ? is ?-finite. Then \\x\\x" = \\x\\l, where \\x \\ ? is given by F). Theorem 22 is actually a simple consequence of Theorem 23 and Corollary 8. COROLLARY 1 3. Let the preideal space X satisfy supp X = S and assume that any ? eX has ?-finite support. Then X" = X l with the same norms. Together with the properties of the Lorentz space, we obtain: COROLLARY 14. Let S bea -finite, and Xbea preideal space with supp X = S. A) X is semi-perfect if and only if X = X" as sets. B) X is a-almost perfect if and only if \\ ¦ \\?" is an equivalent norm on X with a~l\\x\\<\\x\\x»^\\x\\ (xeX). In particular, X is perfect if and only if X" = X with the same norms. Historically, this property is the reason for our terminology "perfect", because G. Kothe used this term for this property.
Riesz spaces and ideals of measurable functions 817 5.2. Applications to ideal spaces on product measures In this section, let Sj (i = 1,2) be ? -finite measure spaces, У, be Banach spaces, and ?(Y\, Yl) be the corresponding set of bounded linear operators. Moreover, let X,- с ЗЛE,-, Yi) be preideal spaces. The following result is a motivation to define a certain class of preideal spaces on the product space S = S\ ? Si. PROPOSITION 24. For any ? e Wl(S, C(Y\, Yi)) the function y(si) : = \\x(si, ·)||?< is measurable. If in addition \\y\\x-, < 00, then the integral operator Az(s2):= x(s2,s\)z(s\)ds (si e Si) Js, acts from X\ into X2 with \\A\\ ^ ||}'||??. PROOF. The measurability of у is a consequence of the Luxemburg-Gribanov Theorem 7. The remaining statements follow from the generalized Holder inequality. ? ????????? 23. Let [X| -> X2] denote the set of all ? e 9JiE", Y) for which the quantity 1И1[х,—х2] := ||-S2 ь> Iл-E7, ·)|? ??, (9) is defined and finite. Then [X| —> Xi] is called the waerf «orw .vpace of X| and Xi. Proposition 24 states that in case ? = C(Y\, Yi) the number ||jc ||(?'_?,] is a bound for the norm of the integral operator Ae?(X|,XT)with kernel ?. It is not clear whether [X| —> Xi] is a preideal space, in general; moreover, it is not even clear whether [X| —>· Xi] is a linear spaces. The difficulty is that the function у (si) := ||* E2, )llxi may fail to be measurable for measurable x. It is not easy to find such an example, but by nonstandard methods it is possible to obtain some, see Luxemburg A963). On the other hand, the Luxemburg-Gribanov Theorem 7 implies that if X| is an associate space, then у is always measurable. If X1 is almost perfect, then the embedding of X| into X" is norm preserving by Corollary 14. Hence, we obtain: THEOREM 24 (Luxemburg-Gribanov). If X\ is almost perfect, then [X\ -> Xi] is a preideal space. For completeness, let us collect some further properties of mixed norm spaces. THEOREM 25. Suppose that X := [X, -> Xi] is a preideal space (e.g., X1 is regular or almost perfect). Then we have: A) suppX=suppX| xsuppXi. B) If X\ and X2 are semi-perfect, then X is a semi-perfect ideal space.
818 ?. Vath C) If X? is a\-almost perfect and Xi is ai-almost perfect, then X is a\a2-almost perfect. D) If X\ and X2 are regular, then X is regular. E) [X^ —>¦ X!,] is defined and the same as X' (with the same norm). 5.3. Order duals of a Riesz space We return now to the general theory of Riesz spaces (and consider only ? = R). We consider in the following linear operators A : X —>¦ ? for a Riesz space X and a Dedekind complete Riesz space Z. Actually, we are only interested in the case ? = R, but the results hold for more general Riesz spaces as well (provided ? is Dedekind complete). Dehnition 24. The linear operator A : X -> ? is called A) positive if Ax > 0 for ? > 0; B) regular if A = A+ — A~ for two positive operators A± :X—>Y; C) order bounded if A maps order bounded sets into order bounded sets. The space of all order bounded operators A: X —>¦ ? is denoted by ?/,(X, Z). In case ? = R, the space X~ := Cb(X, R) is called the order dual of X. Any positive operator and thus any regular operator is order bounded. The space Сь(Х, ?) is partially ordered by the relation ? ?; ? «=> В — A is positive. THEOREM 26 (Jordan decomposition). The space ?/,(X, Z) is a Dedekind complete Riesz space which consists only of regular operators. The corresponding Riesz operations are given by A+x = sup Ay — sup Ay, 0<v<jr+ 0<v<.r" A~ ? = inf Ay — inf Ay, 0<v<.r- 0<v<.r + |A|*= sup Ay— sup Ay = sup \Ay\ — sup \Ay\. |?|<?·+ |.??.? 1уКл+ I.vK-?- If X is equipped with a lattice norm, let X* denote the usual dual space, i.e., X* : = ?(X,E). THEOREM 27. Let ?, ? be equipped with lattice norms. Then we have: A) X* с X~, a/jii X* is even an ideal in X~. The norm ofX* is a corresponding lattice norm. B) IfX is complete, then Cb(X, Z) с ?(X, Z); in particular X* = X~.
Riesz spaces and ideals of measurable functions 819 We note that for ? ? R the set ?(X, Z) is in general not contained in Cj,(X, Z). In case X = ? = ?.2([0, 1]), for example, there exist integral operators A e ?(X, Z) which fail to be regular (and thus fail to be order bounded). The first example of such an integral operator was given by Mytiagin, see Example 4.3 in Krasnoselskii et al. A958). Theorem 27 implies that in order to study the dual space X*, it is sufficient (and even better) to study the order dual X~. DEHNITION 25. An operator A e (X, Z) is ?-order continuous if x„ 4 0 implies inf{| Ajc„|: n} = 0. Similarly, A is called order continuous if ? 4 0 implies inf{| Ajt |: ? e M}=0. The set of all ?-order continuous operators A e ?/>(X, Z) is denoted by CC(X, Z), and the set of all order continuous operators is denoted by ?„ (X, Z). In case ? = R, the elements of X~ := ?((X,R) are also called integrals, and the elements of X~ := ?,,(X, R) are called normal integrals. The term "integral" will be explained later. Proposition 3 immediately implies: PROPOSITION 25. We have ?„(X, Z) с ?(.(Х, Z) with equality ifX is order separable. The reader should be warned that order separability is not necessary for ?„(X, Z) — ?C(X, Z) (even if ? — R). The question whether X~ and X~ differ is rather delicate, see, e.g., the discussion in Paragraph 87 of Zaanen A983). The following result gives a different characterization of the set of (a-)order continuous operators. THEOREM 28. ?C(X, Z) and ?H(X, Z) are bands in ?/>(X, Z). In particular, we havefor any operator A e ?^(?, ?): A) A is ? -order continuous if and only if\A\ has this property. This is the case if and only if xn 4 0 implies \A\x„ 4 0. B) A is order continuous if and only if\A\ has this property. This is the case if and only if ? 4,0 implies |A|M 4,0. Since ?C(X, Z) and ?,,(X, Z) are bands in Cb(X, Z), Proposition 10 implies that the corresponding disjoint complements ?tJ(X, Z) := ?( (X, Z)L respectively ?,«(X, Z) := ?„(X, ZI- are bands, and Cb(X, Z) = ?C(X, ?) ? ?„(X, Z) = C„(X, ?) ? C„S(X, Z). A0) The operators in ?C.?(X, Z) are called singular, and the operators in C„.,(X, Z) are called normal singular. To describe the corresponding projection, it suffices to consider positive operators, because any A e Сь(Х, ?) is the difference of two positive operators. Moreover, since ? = x+ — x~ for any ? e X, it suffices, by linearity, to describe the projection for positive values. In this sense, the following result gives an "explicit" formula for the corresponding projection.
820 ?. Vath THEOREM 29. Let A e Сь(Х, ?) be positive, and Ac and A„ be the corresponding component in ?C(X, Z) and C„(X, Z), respectively. Then Acx =inf{supA;c„: 0^ x„ | x\ {? ^ 0), /I A,,jc = infjsup A(M): X+ 2 ?? | jc } (jc > 0). If A e?fc(X,Z), then thcnull ideal is defined as the set {jc: |A||jt| =0}.lf A e ?,(X, Z) or A e ?n(X, Z), then the null ideal is a ?-ideal respectively a band. The converse does not hold in general, even in case of functionals (Z = R). But slightly less does hold. THEOREM 30. For any A e?(.(X, Z) (A e?„(X,Z)) the null ideal [x: \A\\x\ =0} is a ?-ideal (respectively a band). Conversely, if A e ?/>(X, Z) /s smc/i f/ш? for any positive operator В e ?/>(X, Z) vv/fft 0 ^ В ^ |A| f/ie null ideal {x: B\x\ = 0} /s ? ?-ideal (respectively a band), then A e CC(X, Z) (respectively A e ?„(X, Z)). Corollary 15. We have CC(X, Z) = Сь(Х, ?) if and only if for any positive operator В е Сь(Х, Z) the null ideal is ? ?-ideal. Similarly, ?„(X, Z) = ?/,(X, Z) //a/id ои/у // for any positive operator В е Сь(Х, Z) the null ideal is a band. In case ? = R, we denote the band of singular and normal singular functionals by X~ := ?„(X, Z) and X~s := ?„.S(X, Z), respectively. If X is equipped with the lattice norm, we have in view of Theorem 27 and A0) X* = (X* П X~) ? (X* ПХ~) = (X* П X~ ) ? (X* П Х~ ). A1) Proposition 10 immediately implies: PROPOSITION 26. The spaces occurring /и A1) are all closed subspaces of X*. and the corresponding projections are positive and have norm 1 (if they are not trivial). 5.4. Duals of ideals of measurable functions For the rest of this chapter, let S be a ?-finite measure space, ? = R, and X be an ideal in Wl(S). To simplify notation, we will assume throughout that supp X = S which is actually no loss of generality (consider the measure space S := supp X instead of S, if necessary). Since X is an ideal in ffl(S), and Vft(S) is super Dedekind complete by Theorem 3, it follows that also X is super Dedekind complete and in particular order separable. Proposition 25 thus implies: PROPOSITION 27. For any Dedekind-complete Riesz space ? we have the equality ?,(X,Z) = ?„(X, Z).
Riesz spaces and ideals of measurable functions 821 In particular, all integrals on X are normal. The following result characterizes the integrals and is the reason for the term "integral": Similarly to the classical calculation of the dual space of Lp, the proof of this result is essentially based on the Radon-Nikodym theorem. THEOREM 31. Afunctional f e X~~ belongs to X~ = X~ if and only if there is some у e 9JiE") such that у ¦ ? is integrablefor each ? e X and f{x)= ? y(s)x(s)ds (xeX). A2) Moreover, у is (almost everywhere) uniquely determined by this property, and f±(x)= ? y±(s)x(s)ds (xeX), A3) \f\(x) = f\y(s)\x(s)ds (xeX). A4) The formulas A3) and A4) are straightforward consequences of the corresponding formulas in Theorem 26. COROLLARY 16. Let X be also a preideal space. Then the bounded integrals f e X~ ? ?* = ?~ ? ?* are in a canonical one-to-one correspondence with the elements у ofthe associate space X' which is given by A2). Moreover, \\f\\ = \\\f\\\ = \\y\\x'· Corollary 11 implies that if X is a regular preideal space, then any positive functional / e X* is ?-continuous, and thus any / e X* is an integral. Using the Hahn-Banach theorem, it can be proved that this trivially sufficient condition is actually necessary. An analogous result holds for the corresponding trivially sufficient condition for reflexivity. THEOREM 32. Let X be also a preideal space. A) An element ? e X belongs to the regular part X,· if and only if f(x) = 0 for any singular f e X*. In particular, X is regular if and only if any f e X* is an integral. B) X is reflexive if and only if X and X' both are regular and X is perfect. Theorem 32 is the natural generalization of the classical result that (in the canonical way) Lp([0, 1])* = Lp>([0, 1]) with l/p+l/p' =\ if and only if 1 ^ ? < oo and that Lp([0, 1]) is reflexive if and only if 1 < ? < oo. It is a curiosity that, without the axiom of choice, the necessity of the conditions in Theorem 32 completely fails: If we do not assume the axiom of choice (only the weaker so-called axiom of dependent choices), we cannot exclude that X* = X' holds for any preideal space (regular or not), and consequently any perfect ideal space is reflexive, see VathA998). The connection of the regular part X,· of X with singular functionals is emphasized by the following result.
822 ?. Vath THEOREM 33. Let X be also a preideal space and suppose that supp Xr = S (= supp X). Then afunctional f e X* is singular if and only if f vanishes on Xr. Moreover, X* = X' in the sense of A2). Note that, conversely, if X is a preideal space for which suppX, is strictly smaller than S, then any nonzero function у e X' with supp у П suppXr = 0 (such a function exists by Theorem 22) determines a nontrivial integral / e X* via A2) which vanishes on Xr (hence, also ?* ? ?'). 5.5. Bibliographical remarks The omitted proofs of Section 5.1 can, e.g., be found in Section 3.4 of Vath A997). Slightly different proofs for the most important Theorems 22 and 23 are in Paragraph 71, Theorems 4(a) and 2, of Zaanen A967). For an immediate proof of Theorem 22 (without applying Theorem 23), see Zabreiko A974). Theorem 24 was the original motivation for the Luxemburg-Gribanov theorem, and this is what was actually proved by Luxemburg A958) (for perfect spaces). For the proof of the various statements in Theorem 25, see Sections 4.1 and 4.3 of Vath A997). For some applications, it is useful to consider also the case where the space ? ? in (9) may depend on 52- If this dependence is not "too bad" (i.e. continuous or at least measurable in a certain sense), many results carry over by the earlier mentioned generalization of the Luxemburg- Gribanov theorem by Vath B00?). However, the reader should be warned that even if each of the spaces ? ? (S2) and also Xi have full support, it may happen that the corresponding "mixed norm space" is trivial. Such an example, based on the continuum hypothesis, is given in Example 4.3.3. of Vath A997). The theory of positive operators is one of the earliest results in the theory of Riesz spaces. Correspondingly, Theorem 26 already occurs in the papers of Riesz A930), Kantorovich A936), and Freudenthal A936), at least for functionals. For the proof of Theorems 27 and 28, see Zaanen A983), Theorems 85.6 and 83.12, Lemma 84.1 and Theorem 84.2. Theorems 29 and 30 are reformulations of Theorems 22.9 and 22.3 of Zaanen A997), and the main part of Theorem 31 can be found as Theorem 86.3 of Zaanen A983). The proofs of Theorems 32 and 33 can be found in Paragraphs 72 and 73 of Zaanen A967). For ideal spaces X с Wl(S, Y) when У is a Riesz spaces, some of the results in Section 5.4 have been generalized by Mullins A974, 1976). References Aliprantis, CD. and Burkinshaw, O. A976), Locally Solid Riesz Spaces, Academic Press, New York. Aliprantis, CD. and Burkinshaw, O. A977), On universally complete Riesz spaces. Pacific J. Math. 71, 1-12. Aliprantis, CD. and Burkinshaw, O. A980a), Minimal topologies and Lp-spaces, Illinois J. Math. 24, 164-172. Aliprantis, CD. and Burkinshaw, O. A980b), On the structure of locally solid topologies, Canad. Math. Bull. 23, 185-191. Aliprantis, CD. and Burkinshaw, O. A985), Positive Operators, Academic Press, New York. Amemiya, I. A953), A generalization of Riesz-Fischer's theorem, J. Math. Soc. Japan 5, 353-354.
Riesz spaces and ideals of measurable functions 823 Appell, J, and Zabreiko, PR A990), Nonlinear Superposition Operators. Cambridge Univ. Press, Cambridge, Bergh, J, and Lofstrom, J. A976), Interpolation Spaces -An Introduction. Springer, Berlin, Birkhoff, G, A967), Lattice Theory, 3rd edn, Amer. Math, Soc. Coll, Publ,, Providence, R.I, Bukhvalov, A,V. A972), Vector-valued function spaces and tensor products (in Russian). Siberian Math, J, 13 F), 1229-1238. Bukhvalov, A.V. A978), Geometric properties of Banach spaces of measurable vector valued functions. Dokl, Akad, Nauk SSSR 239 F), 1279-1282. Chong, K,M, and Rice, N,M. A971), Equimeasurable rearrangements and functions. Queen's Papers in Pure and Applied Math., No, 28, Queen's University, Kingston Ontario, Canada. Dirr, G, and Vath, M, B000), Continuity ofnear-dualin maps and characterizations of ideal spaces of measurable functions. Recent Trends in Nonlinear Analysis, Appell, J., ed.. Festschrift Dedicated to Alfonso Vignoli on the Occassion of His Sixtieth Birthday, Birkhauser, 139-148. Feledziak, K, and Nowak, M. A997), Locally solid topologies on vector valued function spaces. Collect. Math. 48 D-6), 487-511. Fremlin, D,H, A974), Topological Riesz Spaces and Measure Theory. Cambridge Univ. Press, Cambridge. Freudenthal, H, A936), Teilweise geordnete Moduln. Proc. Netherl. Acad, Sci, 39, 641-651. Gribanov, Yu,I. A970), On the measurability of a function, lzv. Vyssh. Uchebn, Zaved. Mat. 3, 22-26 (in Russian), Kantorovich, L,V, A936), Concerning the general theory of operations in partially ordered spaces, Dokl, Akad, Nauk SSSR 1,271-274, KrasnoselskiT, M,A,, Lifshits, Je,A, and Sobolev, A.V. A989), Positive Linear Systems. Heldermann, Berlin, KrasnoselskiT, M.A, and Rutickil, Ya.B. A958), Convex Functions and Orlicz Spaces. Fizmatgiz, Moscow (in Russian), English translation! Noordhoff, Groningen, 1961. KrasnoselskiT, M,A,, Zabreiko, PR, Pustylnik, E.I, and Sobolevskil, RE. A958), Integral Operators in Spaces of Summable Functions, Nauka, Moscow (in Russian). English translation: Noordhoff, Leyden, 1976. KreTn, S.G,, Petunin, Ju,I, and Semenov, E.M. A978), Interpolation of Linear Operators. Nauka, Moscow (in Russian), English translation: Providence, RI, Amer. Math. Soc., 1982. Kutateladze, S,S, (ed,) A996), Vector Lattices and Integral Operators, Kluwer, Dordrecht. Lindenstrauss, J, and Tzafriri, L, A979), Classical Banach Spaces II: Function Spaces. Springer, Berlin. Lorentz, G.G, A950), Some new functional spaces. Ann. Math. B) 51 A), 37-55. Lorentz, G.G. A951), On the theory of spaces ?. Pacific J. Math. 1, 411^429. Lorentz, G.G. A961), Relations between function spaces. Proc. Amer. Math. Soc. 12 A), 127-132. Luxemburg, W,A,J. A958), On the measurabilirv of a function which occurs in a paper by A.C Zaanen, Proc. Netherl. Acad. Sci. (A) 61, 259-265 (Indag. Math. 20, 259-265). Luxemburg, W,A,J, A963), Addendum to On the measurability of a function which occurs in a paper by A.C. Zaanen. Proc, Netherl. Acad, Sci, (A) 66, 587-590 (Indag. Math. 25 A963), 587-590). Luxemburg, W.A.J, A965a), Notes on Banach function spaces XIV A. Proc. Netherl, Acad. Sci. (A) 68, 229-239 (Indag. Math, 27 A965), 229-239). Luxemburg, W,A,J. A965b), Notes on Banach function spaces XIV в. Proc. Netherl. Acad, Sci. (A) 68, 240-248 (Indag, Math, 27 A965), 240-248). Luxemburg, W.A.J, A965c), Notes on Banach function spaces XV \. Proc. Netherl. Acad. Sci. (A) 68, 415^429 (Indag, Math, 27 A965), 415^429). Luxemburg, W,A,J, A965d), Notes on Banach function spaces XVB, Proc, Netherl. Acad. Sci. (A) 68, 430-446 (Indag. Math. 27 A965), 430-446). Luxemburg, W.AJ. A965e), Notes on Banach function spaces XVIA. Proc. Netherl. Acad. Sci. (A) 68 A), 646- 657 (Indag. Math. 27 D) A965), 646-657). Luxemburg, W.A.J. A965f), Notes on Banach function spaces XVIB. Proc. Netherl. Acad. Sci. (A) 68 A), 658- 667 (Indag. Math. 27 D) A965), 658-667). Luxemburg, W.A.J. A967), Rearrangement invariant Banach function spaces. Queen's Papers in Pure and Appl. Math., No. 10, 83-144. Luxemburg, W.A.J. and Zaanen, A.C. A963a), Compactness of integral operators in Banach function spaces. Math. Ann, 149, 150-180. Luxemburg, WA.J, and Zaanen, A.C. A963b), Notes on Banach function spaces I. Proc. Netherl. Acad. Sci. (A) 66, 135-147 (Indag. Math. 25 A963), 135-147).
824 ?. Vath Luxemburg, W.A.J, and Zaanen, A.C. A963c), Notes on Banach function spaces II, Proc. Netherl. Acad. Sci. (A) 66, 148-153 (Indag. Math, 25 A963), 148-153). Luxemburg, W.A.J, and Zaanen, A.C. A963d), Notes on Banach function spaces III, Proc, Netherl. Acad. Sci. (A) 66, 239-250 (Indag. Math. 25 A963), 239-250). Luxemburg, W.A.J, and Zaanen, A.C. A963e), Notes on Banach function spaces IV, Proc. Netherl. Acad. Sci. (A) 66, 251-263 (Indag. Math. 25 A963), 251-263). Luxemburg, W.A.J, and Zaanen, A.C. A963f), Notes on Banach function spaces V, Proc. Netherl. Acad. Sci. (A) 66, 496-504 (Indag. Math. 25 A963), 496-504). Luxemburg, W.A.J, and Zaanen, A.C. A963g), Notes on Banach function spaces VI, Proc. Netherl. Acad. Sci. (A) 66, 655-668 (Indag. Math. 25 A963), 655-668). Luxemburg, WA,J. and Zaanen, A.C. A963h), Notes on Banach function spaces VII, Proc. Netherl. Acad. Sci. (A) 66, 669-681 (Indag. Math. 25 A963), 669-681). Luxemburg, W.A.J, and Zaanen, A.C. A964a), Notes on Banach function spaces VIII, Proc. Netherl. Acad. Sci. (A) 67, 104-119 (Indag. Math. 26A964), 104-119). Luxemburg, W.A.J, and Zaanen, A.C. A964b), Notes on Banach function spaces IX, Proc. Netherl. Acad. Sci. (A) 67, 360-376 (Indag. Math. 26 A964), 360-376). Luxemburg, W.A.J, and Zaanen, A.C. A964c), Notes on Banach function spaces X, Proc. Netherl. Acad. Sci. (A) 67, 493-506 (Indag. Math. 26 A964), 493-506). Luxemburg, W.A.J, and Zaanen, A.C. A964d), Notes on Banach function spaces XI, Proc. Netherl. Acad, Sci. (A) 67, 507-518 (Indag. Math. 26 A964), 507-518). Luxemburg, WA.J. and Zaanen, A.C. A964e), Notes on Banach function spaces XII, Proc. Netherl. Acad. Sci. (A) 67, 519-529 (Indag. Math. 26 A964), 519-529). Luxemburg, W.A.J, and Zaanen, A.C. A964f), Notes on Banach function spaces XIII, Proc. Netherl. Acad. Sci. (A) 67, 530-543 (Indag. Math. 26 A964), 530-543). Luxemburg, W.A.J, and Zaanen, A.C. A971), Riesz Spaces, Vol. I, North-Holland, Amsterdam. Macdonald, A.L. A973a), Vector valued Kothe function spaces I, Illinois J. Math. 17, 533-545. Macdonald, A.L. A973b), Vector valued Kothe function spaces II, Illinois J. Math. 17, 546-557. Macdonald, A,L. A974), Vector valued Kothe function spaces III, Illinois J. Math. 18, 136-146. Macdonald, A.L. A976), ? weak theory of vector valued Kothe function spaces, Illinois J. Math. 20, 410-424. Marcinkiewicz, J. A939), Sur I 'interpolation d'ope'rateurs, C. R. Acad. Sci. Paris Ser. I Math. 208, 1272-1273. Meyer-Nieberg, P. A991), Banach Lattices, Springer, Berlin. Milman, M.M. A978), Some new function spaces and their tensor products. Bull. Austral. Math. Soc. 19, 147- 149. Mullins, C.W A974), Linear junctionals on vector valued Kothe spaces, Proc. Madras Conference Functional Analysis 1973 (Berlin), Garnir, H.G., Unni, K.R. and Williamson, J.H,, eds. Lecture Notes in Math., Vol. 399, Springer, Berlin, 380-381. Mullins, C.W. A976), Order vector valued Kothe spaces, J. London Math. Soc. B) 13 A), 34-40. Nakano, H. A950), Modulared Semi-Ordered Linear Spaces, Maruzen, Nihonbashi, Tokyo. Nguyen, H.T. A987), The superposition operator in Orlicz spaces of vector valued functions, Dokl. Akad. Nauk BSSR 31, 197-200 (in Russian). Nguyen, H.T. A991), The theory of semimodules of infra-semiunits in ideal spaces of vector-valued functions, and its applications to integral operators, Dokl. Akad. Nauk SSSR 317, 1303-1307 (in Russian), English translation: Soviet Math, Dokl. 43A991), 615-619. Nguyen, H.T, and Zabrelko, P.P. A990), Cones of vector functions in Orlicz spaces of vector functions (in Russian), VescT Akad. Navuk BSSR Ser. FIz.-Mat. Navuk, No. 3, 30-34. Nowak, M. A984a), On the finest of all linear topologies on Orlicz spaces for which ?-modular convergence implies convergence in these topologies. Bull. Acad. Polon. Sci. Ser. Sci. Math. 32, 439^t45. Nowak, M, A984b), On the modular topology on Orlicz spaces, Bull. Acad. Polon. Sci, Ser. Sci. Math. 36,41-50. Nowak, M, A993), Order continuous seminorms and weak compactness in Orlicz spaces. Collect. Math. 44, 217-236, O'Neil, R. A963), Convolution operators arid L(p,q) spaces, Duke Math. J. 30, 129-142. Povolotskil, A.I. and Kalitvin, A.S. A991), Nonlinear Partial Integral Operators, Leningrad, 1991 (in Russian). Rao, M,M. and Ren, Z,D. A991), Theory of Orlicz Spaces, M. Dekker, New York.
Riesz spaces and ideals of measurable functions 825 Riesz, F. A930), Sur la decomposition des operations fonctionnelles line'aires, Atti Congr. Intemaz. Mat. Bologna, 1928, Vol. 3, 143-148, Rudin, W. A987), Real and Complex Analysis, 3rd edn, McGraw-Hill, Singapore. Rudin, W. A990), Functional Analysis, 14th edn, McGraw-Hill, New York. Schaefer, H.H. A974), Banach Lattices and Positive Operators, Springer, Berlin. Semadeni, Z. A971), Banach Spaces of Continuous Functions, Vol. I, Polish Scientific Publ., Warszawa. Sikorski, R. A964), Boolean Algebras, Springer. Vath, M. A997), Ideal Spaces, Lecture Notes in Math., Vol. 1664, Springer. Vath, M. A998), The dual space of Lx is L\, Indag. Math. 9 D), 619-625. Vath, M. B000), Volterra and Integral Equations of Vector Functions, Marcel Dekker, New York. Vath, M. B00?), Some measurability results and applications to spaces with mixed family-norm (submitted). Wnuk, W. A986), On a continuous embedding into a space of measurable functions. Bull. Acad. Polon. Sci. Ser. Sci. Math. 34, 413^416. Zaanen, A.C. A967), Integration, North-Holland, Amsterdam. Zaanen, A.C. A983), Riesz Spaces, Vol. II, North-Holland, Amsterdam. Zaanen, A.C. A997), Introduction to Operator Theory in Riesz Spaces, Springer, Berlin. Zabreiko, P.P. A974), Ideal spaces of functions I, Vestnik Jaroslav. Univ. 8, 12-52 (in Russian).
CHAPTER 20 Measures on Quantum Structures Anatolij Dvurecenskij* Mathematical Institute, Slovak Academy of Sciences, Stefa'nikova 49, SK-814 73 Bratislava, Slovakia E-mail: dvurecen@mat.savba.sk Contents 1. Introduction 829 2. States on quantum logics 831 2.1. Orthomodular lattices and orthomodular posets 831 2.2. Orthoalgebras and effect algebras 833 2.3. Decompositions of states 836 3. Gleason's measures 840 3.1. Introduction 840 3.2. Generalizations of Gleason's theorem 842 3.3. Gleason's theorem and completeness criteria of inner product spaces 848 4. States on MV-algebras 853 5. Measures on BCK-algebras 856 5.1. Commutative BCK-algebras with the relative cancellation property 856 5.2. Measures on commutative BCK-algebras with the relative cancellation property 858 6. States on pseudo MV-algebras 860 6.1. Pseudo MV-algebras 860 6.2. States and pseudo MV-algebras 863 6.3. Existence of states 865 7. Conclusion 866 References 866 *The work has been supported by the grant 2/7193/20 SAV, Bratislava, Slovakia. HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 827
This page intentionally left blank
Measures on quantum structures 829 1. Introduction D. Hubert, in his famous program presented at the Second International Mathematical Congress in Paris, 1900, formulated many open mathematical problems which were stimulating for the development of mathematics and were a fruitful source of very deep and fundamental ideas. During the whole 20th century, mathematicians and specialists in other fields have been solving problems which can be traced back to Hilbert's program, and today there are many basic results stimulated by this program. It is sure that even at the beginning of the third millennium, mathematicians will still have much to do. One of his most interesting ideas, lying between mathematics and physics, is his sixth problem: to find a few physical axioms which, similar to the axioms of geometry, can describe a theory for a class of physical events that is as large as possible. We try to present some ideas inspired by Hilbert's sixth problem and give some partial results which may contribute to its solution, mainly we will concentrate at measure- theoretical point of view of solving such a problem. In the Thirties the situation in both physics and mathematics was very interesting. A.N. Kolmogorov published his fundamental work [48] Grundbegriffe der Wahrschein- lichkeitsrechnung in which he, for the first time, axiomatized modern probability theory. From the mathematical point of view, in Kolmogorov's model, the set L of experimentally verifiable events forms a Boolean ? -algebra and, by the Loomis-Sikorski theorem, roughly speaking can be represented by a ? -algebra S of subsets of some non-void set ?. The Kolmogorov model is very important, but it does not describe the situation that arises in quantum mechanics. A physical system is investigated through a measurement process. If a measurement reaches the microlevel, non-negligible indeterminacies appear that must be respected in the model. Therefore, in many cases, the use of classical probability theory is not sufficient, as follows, for example, from the Heisenberg uncertainty principle [43] which asserts that the position ? and the momentum ? of an elementary particle cannot be measured simultaneously with arbitrarily prescribed accuracy. If ?,„ ? and ?,?? denote the inaccuracies of the measurement of the momentum ? and position ? in a state m, then (AlnPJ-(AmxJ^-h2>0, A.1) where ft = h/?? and h is Planck's constant. Similar relation holds for the energy-life time (AlnEJ.(AmtJ^-h2>0, It is interesting to notify that similar uncertainty relations is possible to find also in other scientific areas, for example social sciences, psychology, the activity of the human brain, automata. In psychology it is known that when we examine a patient through questions, the patient's psychological stress and the accuracy of his answers are roughly speaking connected via A.1),
830 A. Dvurecenskij J. von Neumann [71 ] made the first fundamental step towards a rigorous mathematical apparatus, as well as an interpretational logic of the new mechanics on the basis of a complex separable Hubert space. In 1936, G. Birkhoff and J. von Neumann published their paper [6] The Logic of Quantum Mechanics. In this historic paper the authors showed that the set of assertions of quantum mechanics has a different algebraic property from a Boolean algebra, and they suggested the structure of a projective geometry. The physical system is described through a measuring process which is described by a state which is a probability measure (in many case countably additive) on the physical system. From this, we have that the system L(H) of all closed subspace of a real, complex, or quaternion Hubert space is of particular interest. G. Mackey [53] in his investigation of the mathematical foundations of quantum mechanics posed the following problem: describe the set of all states on the quantum logic L{H) for a separable real or complex Hilbert space. A.M. Gleason [37] published the answer to Mackey's problem in 1957, and this result deeply influenced the progress in the Hilbert space quantum mechanics. In addition, Gleason's theorem influenced also mathematical research and the theorem was extended to many cases and structures. A turbulent development of quantum theories started in the Sixties, and there appeared many important results which were reflected in numerous monographs. There appeared structures like orthomodular posets and orthoalgebras. Today all these structures which were inspired by attempts of describing mathematical foundations of quantum mechanics are called quantum structures, and in the beginning of the Nineties the International Quantum Structure Association was founded which organizes Biannual Congresses on which mathematicians, physicists, logicians, and philosophers meet. In the last decade, Slovak and Italian schools contributed with an important new notion, D-posets (difference posets), or equivalently, weak orthoalgebras. Later the American school presented effect algebras. All these equivalent algebraic partial structures combine algebraic as well as fuzzy set ideas. It was shown that the space of effect operators of a Hilbert space and also very important MV-algebras can be formalized within this frame. In 1958, C.C. Chang [8] presented a new axiomatic structure, MV-algebras, which generalizes the two-valued reasoning, taking only values 0 and 1, to infinitely-valued one taking values from the whole real interval [0, 1]. Later in 1986, D. Mundici [59] showed that there is a natural equivalence among MV-algebras and Abelian unital lattice-ordered groups with strong unit as the interval [0, и], where и is a strong unit. Today MV-algebras present an important substructure of quantum structures while, for example, a compatible part (i.e., "classical one") of lattice-ordered effect algebras forms an MV-subalgebra. Another generalization of Boolean rings are BCK-algebras which were introduced by Imai and Iseki [44] inspired by prepositional calculus and set theory, and they have been intensively studied by many authors. Also for the case of commutative BCK-algebras with the relative cancellation property there exists an ?-group representation, [25]. Recently, G. Georgescu and A. Iorgulescu [35] introduced pseudo MV-algebras which are a non-commutative generalization of MV-algebras which are a generalization of a two-valued reasoning. A non-commutative reasoning can be found, e.g., in psychological processes: in clinical medicine on behalf of transplantation of human organs, an experiment
Measures on quantum structures 831 was performed in which the same two questions have been posed to two groups of interviewed people: A) Do you agree to dedicate your organs for medical transplantation after your dead? B) Do you agree to accept organs of a donor in the case of your need? If the order of questions was changed in the second group, the number of positive answers was more higher than in the first group. For these pseudo MV-algebras, the author [21] presented a representation via intervals [0, и] where и is a strong unit in an ?-group not necessarily Abelian. From the measure-theoretical point of view all above structures present also important algebraic structures where it is possible to built up measure theory as well as probability theory and which will be done below. We will see how nice mathematical constructions it was necessary to present to show for example that there are even finite quantum logics having no state, which in the Kolmogorov model is a trivial fact that there is plenty of probability measures. Or we hope to be fascinated with the mathematical beauty of Gleason's theorem to which a monograph was dedicated [16]. 2. States on quantum logics 2.1. Orthomodular lattices and orthomodular posets States on quantum structures are the basic way through which we are able to derive an information from the physical system under measurement. States are roughly speaking probability measures and they describe the physical system. For classical Kolmogorov model, we have plenty of states which are ? -additive probability measures, e.g., the system of Dirac measures {<5?: ? e ?] determines completely the order on the corresponding ?- algebra S of subsets of ?. If we take Boolean algebras, they have plenty of finitely additive states, but there exists an example, [68], of a Boolean ?-algebra having only plenty of finitely additive states but no ? -additive state. We recall that an orthomodular poset (OMP, for short) is a poset L with two elements 0 ^ 1 and a unary operation -1, called an orthocomplementation such that, for all a,b e L, we have (i) ?±± = ?; (ii) b1- ^ a1- whenever a ^ b\ (iii) a v a1- = 1; (iv) a v b & L whenever a ^ b^\ (v) b = a v (b л a^) whenever a ^ b (orthomodular law). We write a ±b whenever ? ? b-1, and we say that a and b are orthogonal or mutually exclusive. If an OMP is a lattice, we call it an orthomodular lattice (OML for short); if it is a ?-complete lattice, we called it a quantum logic; if \/,? ? ?\ exists in L for any sequence of mutually orthogonal elements {a,} of L, L is said to be an ortho-?-complete logic. For example, if S is a ?-algebra of subsets of ? ? 0, then putting 0 := 0, 1 := ?, A1- := ? \ A, and ^ := c, we see that S is a quantum logic. The most important example of quantum logics is the quantum logic L(H) of a real, complex or quaternion Hubert space H, where L(H) consists of all closed subspaces ?
832 A. Dvurecenskij of H, and ?1- denotes its orthogonal complement; the partial order is the set-theoretical inclusion. This example is of basic importance for the so-called Hubert space quantum mechanics. An equivalent structure is the system P(H) of all orthogonal projectors on H. We say that two elements a and b of an OMP L are compatible if there exist three mutually orthogonal elements a\,b\,c'\TiL such that a = a \ ? с and b = b\ ? с. We recall that an OMP L is a Boolean algebra iff all elements of L are compatible. This property shows that a system of mutually compatible elements generates a Boolean subalgebra of L. Moreover, this property determines areas where we can use classical mechanics, or a classical Kolmogorov model. Anyway, every our structure has a locally classical part, and any maximal Boolean subalgebra is said to be a block. It is not hard to show that every OMP can be covered by the set of all its blocks, but some of them can not interact with others. We recall that a state on a OMP L is a mapping m : L —>¦ [0, 1] such that w(l) = 1, and m(a ? b) =m{a) + m{b) whenever a _L b. If m is ?-additiveon L, i.e., m(\J',· a,·) = J2i w (a,) for any sequence of mutually orthogonal elements {a,} for which V, ai is defined in L, we called it a ?-additive state. In a similar way we define a completely additive state. The problem of existence of states on quantum logics, such as orthomodular lattices or orthomodular posets, showed that the situation with existence of states on quantum logic is a complicated problem, and it generated new ways of constructions of finite orthomodular lattices and posets, respectively. The main contribution is due to Greechie [39,40]. Posets are pictured via Hasse diagrams, which in the case of many elements became very blind. A more elegant and simple method was suggested by R. Greechie. His method allows to construct new quantum structures and it is a base for many interesting examples. This method of construction of quantum logics uses the pasting of Boolean algebras, based on the familiar assertion that a finite Boolean algebra is completely determined by its atoms. He supposed that a quantum logic has a special form: for any two blocks A and В of L only one of the conditions is satisfied: (i) A = B; (ii) АПб = {0, 1); (iii) А П В = {0, 1, ?, ?-1}, where a is an atom in A and B. Such logics are said to be Greechie logics. The Greechie diagram consists of the set of points and lines. Points represent atoms in blocks, and atoms of a given block are connected by a smooth line. An atom which belongs to more blocks lies on more smooth lines which are "crossing" at the point corresponding to this atom. We notice that a more general method of logic construction using pasting of so-called astroids has been suggested by M. Dichtl [11], and a method of pasting of OMPs is used in [61] by M. Navara. For example, the Boolean algebra 23 which has three atoms a,b,c has the Greechie diagram consisting of a line with three atoms while the Hasse diagram has eight points and twelve lines. The propositional logic of the model consisting from pasting three Boolean algebras 23, where each two blocks have only one atom in common, has the Greechie diagram given by Figure 1 and the corresponding Hasse diagram by Figure 2 (this is an orthoalgebra as we will see below). The following Greechie diagram with Figure 3 gives an OMP which has no state.
Measures on quantum structures 833 / Fig. I. 0 Fig. 2. II II 1. II Fig. 3. 2.2. Orthoalgebras and effect algebras D.J. Foulis and C.H. Randall [33] introduced orthoalgebras which are more general than OMPs. We recall that an orthoalgebra is a set L with two particular elements 0, 1 @ ? 1), and with a partial binary operation ?: L ? L —>· L such that for all a, b, с е L we have: (OAi) if ? © b e L, then b®a e L and ? ? fo — b ? ? (commutativity); (OAii) if fo ? с e L and ? ? (b ? с) e L, then ? ? fo e L and (? ? fo) ? с е L, and ? ? (fo ? с) = (а ф fo) ? с (associativity);
834 A. Dvurecenskij (OAiii) for any a e L there is a unique be L such that ? © b is defined, and a ®b—\ (orthocomplementation); (OAiv) if ? ?? is defined, then a = 0 (consistency). If the assumptions of (OAii) are satisfied, we write ? ? b ? с for the element (? ? b) ? с = ? ? (fr ? с) in L. We introduce a partial order ^ on L via a ^ fo iff а ф с — fo for some с е L, and we write a _L fo iff ? ? fo exists in L. A iiaie is a mapping m : L —> [0, \] such that w(l) = 1 and m(a ®b) = m(a) +m(b). For example, if we put ? ? b := ? ? b whenever a _L b in an OMP, then we see that any OMP is an orthoalgebra. Figure 2 gives an example of orthoalgebra which is not OMP We recall that quantum mechanics is not a unique source of inspiration for quantum logics theory. We now present one more-less realistic example of an experiment whose propositional structure satisfies only axioms of orthoalgebras. EXAMPLE 2.1 {Firefly in a three-chamber box). Consider a firefly in a three-chamber box pictured in Figure 4. The firefly is free to roam among the three chambers and to light up to will. The sides of the box are windows with vertical lines down their centers. We make three experiments, corresponding to the three windows А, В and C. For each experiment E, we record 1е,ге,пе if we see, respectively, a light to the left, right, of the center line or no light. It is clear that we can identify гд = /с =¦' e, re = lв ='¦ с, rg — /д —: ?, but now we do not identify / :=ид, b :=пв, d := nC- The propositional logic of this model has the Greechie diagram given by Figure 2. For orthoalgebras we can reconstruct the original ideas of G. Boole [3] who said that for our reasoning we need to know only disjunction of mutually exclusive elements and not of all pairs of elements. This is also the requirement for the definition of states. In the beginning of the Nineties, two my former students, F. Корка and F. Chovanec [49] introduced a new structure, difference poset, D-poset for short. This structure requires difference of comparable elements as a primary notion. An equivalent partial structure is an orthoalgebra introduced by D. Foulis and M.K. Bennett [32], see also [36] with addition as a primary notion. An effect algebra is a non-empty set L with two particular elements 0, 1 @^1), and with a partial binary operation ©: L ? L —>¦ L such that, for all a, b, с е L, we have /«/V / \ / \ I , \ Fig. 4.
Measures on quantum structures 835 (EAi) if ? © b e L, then b®a e L and ? ? b = b © a (commutativity); (EAii) if b ? с e L and ? ? (fo ? с) e L, then ? ? fo e L and (? © b) ? с е L, and ? ? (b © с) = (? ? b) ? с (associativity); (EAiii) for any a e L there is a unique b e L such that ? ? b is defined, and ? ? b = 1 (orthocomplementation); (EAiv) if 1 ? ? is defined, then a = 0 (zero-one law). Let a and fo be two elements of an effect algebra L. We say that (i) a is orthogonal to b and write a _L fo iff ? © fo is defined in L; (ii) ? is less than or equal to b and write a ^ b iff there exists an element с е L such that ale and а ? с = b (in this case we also write fo ^ a); (iii) b is the orthocomplement of a iff b is a (unique) element of L such that b La and ???)=? and it is written as ?~4 We recall that every orthoalgebra is an effect algebra, and an effect algebra L is an orthoalgebra iff a _L a implies a = 0. A basic example of effect algebras is the system E(H) of all Hermitian operators on a Hubert space ? which are between the zero operator and the identity. E(H) contains as a proper subject the system of all orthogonal projectors, and E(H) is not a lattice. We recall that E(H) is a basic tool for the so-called Hubert space quantum mechanics. All above defined algebraic structures are nowadays called quantum structures. The notion of a state is the same as that for orthoalgebras. A system S on L (where L is any quantum structures) of states is order-determining iff m(a) ^ m(b) for all m e <S entails a ? b. The existence of states, e.g., of a system of order-determining ones, enables to represent any quantum structure as a system of fuzzy sets. For example [23]: THEOREM 2.2. Let a system of fuzzy sets L с [0, l]x, ? ? 0, satisfy the following conditions: (i) ixeL; (ii) \? — f e L whenever f e L; (iii) f +g e L whenever f + g ? \?. Then L = (L; ?,??, 1?,), where f®g is defined ifand only if f + g ^ \x,(f,g e L) and we put f (&g — f +gt is an effect algebra. In addition, the system S = {mx: ? e X], where mx: L —>· [0, 1] is defined via mx(f) := f(x), f e L, is an order-determining system of states. Conversely, let L = (L; ?, 0, 1) be an arbitrary effect algebra with an order-determining system of states S. Then L is isomorphic with the system of fuzzy sets L с [0, 1] , where L = {a e [0, l]5: a(m) :—m(a), m eS, a e L], and L satisfies the conditions (i)-(iii). Among effect algebras we can meet a very interesting subfamily of effect algebras, called interval effect algebras, i.e., an effect algebra of the form T(G, u) (where (G, u) is an Abelian unital po-group with a strong unit u), where ? ? b is defined in T(G, u) iff a + b e T(G, u) and in such the case ? ? b := a + b (it is clear that T(G, u) is always an effect algebra); we recall that a positive element и е G is a strong unit (or order-unit) if, given g e G, there exists an integer ? ^ 1 such that g ^ им. For example, the system of all effect operators, E(H) on a Hubert space Я is an interval effect algebra, while
836 A. Dvurecenskij ?(?) = ?(?(?), I), where B{H) is the po-group of all Hermitian operators on a Hubert space H, and / is the unit operator. 2.3. Decompositions of states In what follows we will study some questions concerning the decomposition theorem for semigroup-valued measures. Let (S; 0, +, ^s) be an ordered semigroup, that is, S is a commutative semigroup written additively, with the neutral element 0, and partial ordering ^5 such that if ? ^5 v, then ?+? ^sv+? for any ? e S. Let S+ denote the set of all positive elements of S, that is, S+ - {? e 5": 0 ^s Mb We say that 5" is (i) positive if S — S+\ (ii) naturally ordered if, for ?, ? e S+, we have ? ^s ? iff ? + ? = ? for some ? e S+. It is clear that an ordered semigroup S is positive and naturally ordered iff M^sv <s> 3? e S" with ? + ? = v. B.1) Indeed, if S is positive and naturally ordered, then 0 ^s ? for any ? e S. If now ?+? = v, ????? = ?+0 ^S ?+? = ?, so that ? ^s v. If B.1) holds, then 0 + ? -?, which implies 0 ^s ? for any be S. A cone of S is a subset С of S+ such that (i) if ??, ?? e С, then ?? + ?? & С, and (ii) 0 e С. The cone С is ?-closed if, for any bounded chain С in C, the join \JS С := V5{M·' деС) exists in 5" and is an element of С For more details on ordered semigroups see [34,5]. We say that an element ? e S+ is singular with respect to the cone С if ? ^s ? f°r some ? e С implies ? = 0. We denote by C: the set of all elements of S+ which are singular with respect to C. Now we present the following important result [24]: THEOREM 2.3 (Decomposition theorem). Let (S; 0, +, ^s) be a commutative, naturally ordered semigroup and let С a v-closed cone in S. Then, for any ? e .?>+, there exist two elements ? e С and ? e C~ such that ? = ?+?. B-2) Suppose that L is an effect algebra. We say that a finite sequence F = {a \,..., an} in L is (^-orthogonal if ? ? ? · ¦ ¦ ? ?„ exists in L. In this case we say that F has a ®-sum, ?/'= ? ai. defined via n ??,-=?? ?---??„. B·3) ?=? It is clear that two elements a and b of L are orthogonal, i.e., a _L b, iff {a, b) is ©-orthogonal.
Measures on quantum structures 837 An arbitrary system G = {a, },e/ of not necessarily different elements of L is ?-orthogonal iff, for every finite subset F of /, the system {a,-},-ef is ?-orthogonal. If G = {a/he/ is ?-orthogonal, so is any {a,-},<=,/ for any J с /. An ?-orthogonal system G = {a,-},¦<=/ of L has a ф-яиш in L, written as ф/е/ a,-, iff in L there exists the join /e/ F ieF where F runs over all finite subsets in /. In this case, we also write фС:= ф,е/ сч¦ It is evident that if G = {a\,..., a„) is ?-orthogonal, then the ©-sums defined by B.3) and B.4) coincide. We recall that if G = {?,},€/, ?, = a for any i e / and card / = oo, then ? G exists in L iff ? =0. Indeed, letao = ®G, thenao = ai(, ® Ф,е/\|/(,]а' =? ®?° which gives ? =0. In addition, if G is ?-orthogonal, then it does not entail, in general, that a = 0. Really, for example, let L be the set of all non-zero integers with the usual addition as ©. Then any system G = {a, he/, at = a for any / e / is ?-orthogonal, but a is not necessarily equal to 0. Throughout the rest of this subsection, by (L; 0, ©) and (V;0, +, ^y) we mean a effect algebra with with a zero 0 and a binary operation ? and an ordered semigroup, respectively. Define the following natural ordering ^„ on VL: ?\ ^„ ?? iff Ml (?) ^ M2(a) for any a e L. Example 2.4. Let [0, К] с [0, oo], К > 0, endowed with the usual order ^. Define a binary operation +v: [0, ?] ? [0, К] —>¦ [0, К] via s +v t = s ? / := max(i, /), s, t e [0, AT]. Then ([0, AT]; 0, +v, ^) is a positive, naturally ordered semigroup which is not cancellative. We say that an element ? e VL is a finitely additive measure if (i) ?@) = 0; (ii) ? (? ? b) = ? (?) + ??) whenever ? ? b is defined in L; if ? : L —> V+ satisfies (i) and (ii), we call ? ? positive finitely additive measure, and then ? ^ fo implies ?(?) <y ?(^). We denote by a(L, V) the system of all finitely additive measures from VL. If we define the addition of two measures ?, ? e a(L, V) pointwise, i.e., (? + ?)(?) := ?(?) + ? (?), ? e L, and if o(a) := 0 for all a e L, then (a(L, V); o, +, ^n) is an ordered semigroup with the positive cone a (L, V)+, the set of all positive finitely additive measures on L with values in V. Let / be an arbitrary non-void set, respectively. We denote by T(V) the set of all finite non-empty subsets of /. A mapping ? e V^ such that M@)=0, ?????= V ]??(?,)
838 A. Dvurecenskij holds whenever ф,е/ ?, is an element of L is said to be a positive completely additive measure or a positive ?-additive measure if the former equality holds for an arbitrary or countable index set /. We denote by ca(L, V)+ and aa(L, V)+ the sets of all positive completely additive or ? -additive measures, respectively, from V+. It is clear that ca(L, V)+ cCTa(L, V)+ ca(L, V)+ Ca(L, V). It is worth recalling that a mapping ? e vl· with ?@) = 0 is a ? -additive measure iff \n = l / n=l i'=l whenever ?,^, ?„ exists in L. We say that a mapping ? e a(L, V)+ is (i) continuous from below if ? ? ^ ?? $5 ¦ ¦ ¦, Vol ? ??/ = ?> ?, ??, ??,... e ?-, imply {?(??I? ?(?); (ii) continuous from above at the least element 0 if {? (?,,)} 4- ? @) whenever ? ? ^??^ PROPOSITION 2.5. A finitely additive measure ?&?A, V)+ " ?-additive if and only if ? ;s continuous from below. ? ?-additive measure is not necessarily continuous from above at 0 as we can see in the following example. Example 2.6. Let ? be an uncountable set and let L be the system of all subsets A such that either A or ? \ A is countable. Then (L, 0, ?), where A®S = AUBiffAnS = 0, is a ? -effect algebra. Take any non-trivial ordered semigroup, for example ([0, K]; 0, +v, О, К > 1, from Example 2.4, and define, for any A e L, JO, cardCA^No, [ 1, otherwise. Then ? is a ? -additive measure being not continuous from above at 0. We can apply Decomposition Theorem to the semigroup S = a(L, V) to obtain Yosida- Hewitt-type or Lebesgue-type decomposition theorems for positive finitely additive measures defined on effect algebras with values in a relatively resituated semigroups. THEOREM 2.7. Let the semigroup (a(L, V)+; o, +, ^„) of all positive finitely additive measures from an effect algebra L into a Dedekind upwards complete, relatively resituated, naturally ordered semigroup V be naturally ordered with respect to ^„. Then every positive finitely additive measure ? ea(L, V)+ can be expressed as a sum ? = ?+?, B-5)
Measures on quantum structures 839 where ? is a positive completely additive measure from VL, and ? is a finitely additive measure such that if ? ^,, ?, ? e ca(L, V)+, then ? = о. PROOF. Define 5" :~a(L, V) and the set С := ca(L, V)+. We show that С is a v-closed cone in a(L, V)+; and consequently, С is a cone. Further, let {у,-} be a chain in a(L, V)+ with a bound ? ea(L, V)+ with respect to the natural ordering ^„, and define У0(с) = \/ Ydc), ceL. i Since 0 ^v Yi (c) ? ? у (с), the Dedekind upwards completeness of V implies that yo(c) is defined correctly on L. Moreover, yo is finitely additive. Indeed, let ? ? b be defined in L. Then [yi(a)} and {y/(fr)} are equidirected, and ?,(?) f yo(a), ?, ?) ? ю(&)- We conclude that уа(а ®b) = Yi (? ? fo)t = (y/ (a) + У, (&)) ? = Yi (a)T + У/ (fo)T = У0(а)+У0(^)· Therefore, Y0 = \JSYi, i where V is taken in 5, and conversely, if y^ = \/f у,-, then yo = yo- Now we claim to show that yo e ca(L, V)+ if {y,} is a bounded chain in C. Let a = (§)j<zjaj exist in L. Then from the monotonicity of yo we have yo(af) ^v yo(o), where «f := 0 -ef ay for any F e J'(i). Assume that yo(af) ^ ? -* for some ? e V and for any F e JTG). Then y, (aF) ? у x for any / and any F e J"G). The complete additivity of y, entails that y,(a) ^y ? for any /, so that ??(?) ^v x¦ This gives Y0(a)= V ?^? consequently ?? e ca(L, V)+, and ca(L, V)+ is a v-closed cone of a(L, V)+. Now we can apply Theorem 2.3 to obtain the assertion in question. D Now we present two Lebesgue-type decompositions of measures on effect algebras. Let W be another lattice naturally ordered semigroup, and let ? e a(L, V)+, ? e a(L, W)+ be given. We say that (i) ? is ?-continuous, and we write ? <ЗС? ?, if for every ? >y 0, ее V+, there is ? >w 0, <5 e W+, such that every a e L with ?(?) <w <5 implies ?(?) <y ?; (ii) ? is dominated by ?, and we write ? <?C ?, if ?(?) — 0 implies ? (?) = 0; (iii) ? is ?-singular if, whenever yeafi,, V)+, у <5?? ? and у ^„ ?, then у = о.
840 A. Dvureienskij THEOREM 2.8. Let the semigroup (a(L, V)+; o, +, ^„) of all positive finitely additive measures from an effect algebra L into a Dedekind upwards complete, relatively resituated, naturally ordered semigroup V with the property that, for any и e V, 0 <y и, there exist u\, И2 e V, 0 <y mi, mi with u\ + uj = u, be cancellative and naturally ordered with respect to ^„. In addition, if {?,} \. ? for {?,} С a(L, V)+, then (y,(l)} 4- 0· Let W be a lattice ordered semigroup with the property that if ? \, ?? e W, 0 <w ? ?, ?2, then 0 <w ?\ ?^2· Let ? ea(L, V)+, andke a(L, W)+. Then ? can be expressed in the form ? = ? + ?, where ?, ? e a(L, V)+, ? <iCf ?, and ? is ?-singular. THEOREM 2.9. Let the conditions of Theorem 2.7 be satisfied. Then, for any pair of finitely additive measures ?,???^, V)+, there exist two elements ? and ? in a (L, V)+ such that ? = ? + ?, ? «?, and ? ? ? = ?, where the meet л is taken ina(L, V)+. Finally we note that in contrast to the classical Yosida-Hewitt decomposition theorem, for effect algebras we do not have guaranteed the uniqueness of the decomposition. The uniqueness is guaranteed, for example, for L = L(H) or L = E(S), where E(S) is the system of all splitting subspaces of an inner product space S; see below Aarnes-type theorems Theorem 3.7 (for L(H)) and Theorem 3.31. 3. Gleason's measures In the present section, we describe the state space of the most important quantum logic L(H) of a real, complex or quaternion Hubert space H\ for more details on Gleason's theorem see, e.g., [16]. 3.1. Introduction Recall that a state on L(H) is a mapping m : L(H) -> [0, 1] such that @0 \ ЭС 0M,¦ J =]Гт(М,-) (here by 0,e7- M, we shall mean the join of a family of mutually orthogonal subspaces {Mt: t еГ) of L(H)). It is clear that if ? is a unit vector in H, then mx : L(H) -> [0, 1] defined via mx(M)=\\PMx\\2, MeL(H),
Measures on quantum structures 841 where Рм is the orthoprojector from ? onto M, is a state on L(H) for any H\ it is said to be a vector state. More generally, if ? is a positive, Hermitian trace operator1 on H, with trT = 1, then шг, where mT(M) = tr(TPM), MeL(H). C.1) defines a state on L(H). Clearly there is a very large class of states on the quantum logic L (H) defined by C.1). In the case that ? is two-dimensional, it is possible to show that C.1) does not describe all states. Example 3.1. Let #2 be a two-dimensional, real or complex Hilbert space. If II if ? = H2, 0 ifM = {0}, 0, or 1 if dim ? = 1, where in the latter case we suppose m (M) + m(M±) = 1, then m is a two-valued state on L(H2). Fix a one-dimensional subspace Mo. If we define a state m via Example 3.1 such that one-dimensional subspaces ?,?1- ? Mq attain the value either 0 or 1 in m, and for w(Mo) — 1/2, then those states are not expressible via C.1). A.M. Gleason [37] published the answer to Mackey's problem in 1957: THEOREM 3.2 (Gleason theorem). If ? is a separable, real or complex Hilbert space, dim ? ? 2, then for any state m on L{H) there exists a unique positive, Hermitian trace operator ? on ? with trT = 1, such that m(M) = tr(TPM), MeL(H). C.2) Gleason's theorem can be reformulated into an equivalent form: for any state m on L(H) of a separable Hilbert space H, dim ? ? 2, there exists an orthonormal system of vectors {?-,} and a system of positive numbers (?,} such that J2i ^< = 1 > and w(M)=^%m.V;(M), MeL(H). C.3) Gleason's theorem belongs to the fundamental results of quantum logic theory. The heart of Gleason's result lies in the care of a real, three-dimensional Hilbert space. The original proof for this particular case is very complicated, and, consequently, many attempts to simplify it have been made, which continue to the present day. 1 We recall that an operator ? on a Hilbert space ? is a trace operator if ?f | (Tx,, xj ) | < 00 for any ONB [xj ) of H; the expression trT := J^ (Txj, Xj) is said to be a trace of ?.
842 A. Dvurecenskij For example, the following anecdote, [16], is told concerning J.S. Bell, the author of the well known Bell inequalities: When J. Bell became familiar with the Gleason result, he said that either he would find a more elementary proof or he would leave from this area. Fortunately, he found a relatively simple proof of the partial result that there are no two-valued states on L(H), dim ? ? 2. An elementary proof of Gleason's theorem with the additional assumption that m assigns the value 1 on some one-dimensional subspace has been presented by С Piron [63]. S. Kochen and E. Specker [47] have found a proof for a special case restricted to a finite number of closed subspaces. The most elementary proof has been found by the triumvirate R. Cooke, M. Keane, and W. Moran, [10], which was first presented at a conference in Cologne, Germany, June 13-16, 1984. Their proof is accessible to undergraduates who have completed a first course in real analysis. When we regard the quantum logic of a Hubert space as the quantum logic of all orthoprojectors of a von Neumann algebra A on H, we see that Gleason's theorem describes states on the quantum logic of the factor type I (more precisely, for I„, ? = 1, 3,4,..., oo). The Gleason method is not, however, applicable to factors type II and III. The problem of linearity is the following: if ? is a finitely additive measure on the quantum logicof a von Neumann algebra A, L^{P, H), i.e., the system of all orthogonal projectors in A, is it possible to extend it to a bounded linear functional on ?? Conversely, any continuous linear functional on A restricted to L^(P, H) gives a finitely additive measure on Lj\(P, H). For states on L(H) this problem is evident due to Gleason's theorem and the linearity of trace. The linearity problem is important for non-commutative integration theory and non- commutative probability theory. The first result generalizing Gleason's theorem is due to J. Gunson [42], and others are due to J.F Aarnes [1,2]. Then there appeared the paper of A.A. Lodkin [50], which asserted the correct result but with an given incomplete proof. Important contributions were made by M.S. Matvejchuk [56,57], A. Paszkiewicz [62]. The complete proof was presented by E. Christensen [9] and F.J. Yeadon [72,73]; for a lucid exposition see S. Maeda [55]. For ?-finite measures this problem is studied in [58]. For complex-valued measures the solution is given by L.J. Bunce and J.D.M. Wright [7]. 3.2. Generalizations of Gleason's theorem In the present part, we show how Gleason's theorem was generalized to signed states and also to nonseparable Hubert spaces as well as to unbounded measures. We recall that the three-dimensional version of Gleason's theorem is most important for applications of Gleason's theorem. Let Я be a real or complex Hubert space. Let S{H) be the unit sphere in H. We say that a mapping /: S(H) -> R is a frame function on ? if there is a constant W, called the weight of /, such that for any ONB {jc, } of ? we have J2f(xi) = w. i It is clear that if / is a frame function on H, then
Measures on quantum structures 843 (i) /(a;c)=/(jc)foranyaeC, \a\ = \,x eS(H); (ii) f\S(M), where ? is a closed subspace of H, is a frame function on M. PROPOSITION 3.3. Let m be a completely additive signed measure on L{H). Then the mapping f,„ defined by f„(x)=m(sp(x)), xeS(H), is a frame function on ? with the weight m (H). A frame function / on ? is (i) bounded if sup{|/(jc)|: ? eS(H)} < oo; (ii) semibounded'iiinf{/(jc): ? e S(H)} > —oo; (iii) regular if there is a Hermitian operator ? on ? such that f(x) = (Tx,x), xeS(H). We recall that if / is regular, then Г satisfying the former equation is a unique trace operator on H. PROPOSITION 3.4. For any integer ? ^ 2, there exists an unbounded frame function (in fact infinitely many) on an ?-dimensional Hilbert space H. Equivalently, there exist unbounded signed measures on L(H). To see that, it is sufficient to take a discontinuous additive functional ?: R —> К of Hamel, and let ? ? к I be a Hermitian operator on H, where к is a real, non-zero constant and / is the identity operator on H. Define a frame function f(x) = ф((Тх,х)), х е S(H). Then it is unbounded. We recall that a signed measure on L(H) is any mapping m : L(H) —> M. which is ?-additive. A completely additive state or a signed measure is additive on all joins of any system of mutually orthogonal elements of L(H). Any finitely additive measure on L(H) is said to be a charge. A charge is Jordan if it is difference of two positive charges. If m is a bounded signed measure, then we have the first generalization of Gleason's theorem by A.N. Sherstnev [67] and the author [14]: THEOREM 3.5. Let m a bounded signed measure on L(H) of a separable Hilbert space H, dim ? ^ 3. Then there exists a unique Hermitian trace operator ? on ? such that m(M) = \i(TPM), MeL(H). As we have seen, on any L(H) of a finite-dimensional Hilbert space ? there exists an unbounded frame function. Therefore, the result of Dorofeev and Sherstnev [12] saying, in particular, that any signed measure on L(H) is bounded whenever dim ? = oo, was very surprising. It needs the following notion.
844 A. Dvurecenskij A mapping /: S(S) —> ? is said to be a frame type function on ? if (i) for any orthonormal system (ONS, for short) {jc,} in S {/(.*,)} is summable; and (ii) for any finite-dimensional subspace К of S, f\S(K) is a frame function on K. We recall that condition (i) is equivalent to the requirement that {/(.*,)} is summable for any sequence of orthonormal vectors {jc,}. It is clear that if / is a frame type function on ? with given S, then if К is a closed subspace of ? such that S П К is dense in ?, then ?? '¦= f\S(S П К) is a frame type function on K. THEOREM 3.6 (Dorofeev-Sherstnev). Any frame type function on a Hilbert space H, dim ? = oo, /s bounded. Now we are able to present a variation of the Yosida-Hewitt decomposition theorem for signed measures on L(H) of arbitrary dimension. THEOREM 3.7 (Aarnes theorem). Any Jordan charge m on L(H), dim ? ? 2, is uniquely decomposed as a sum m —m\ + mi, where m\ as a Jordan completely additive signed measure and mj is a Jordan charge vanishing on any finite-dimensional subspace of H. We recall that according to S. Ulam, a cardinal number / is said to be non-measurable if there exists no probability measure ? on the system of all subsets of a set A whose cardinality is / such that v{{a}) = 0 for any a e A. In the opposite case / is said to be measurable. For example, any finite cardinal, No, К ?, ??, ?? are non-measurable cardinals, and also с (the cardinal of the continuum) (under the continuum hypothesis) is also a non-measurable cardinal. If J ? I and / is a non-measurable cardinal, so is J. The existence of non-measurable cardinals is a well known problem in set theory and measure theory. The class of all non-measurable cardinals is huge, and it possesses, roughly speaking, all cardinals which usually appear in mathematical problems. M. Eilers and E. Horst [31] andT. Drisch [13] proved that Gleason's theorem also holds for a non-separable Hilbert space whose dimension is a non-measurable cardinal ? 2. The author [15] extended this result to the following one. THEOREM 3.8. Any finite (?-additive) measure m on L(H) is a Gleason measure if and only if the dimension of a Hilbert space ? is a non-measurable cardinal ? 2. For completely additive states and signed measures we have the following generalization of the result of Maeda [54], [16]. THEOREM 3.9. Let m be a completely additive signed measure on L(H), dim Я = oo. Then there exists a unique Hermitian trace operator ? on ? such that т(М) = 1т(ТРм), MeL(H). Now we present a generalization of Gleason's theorem for measures which can also attain improper values. For example, m (M) := dim ?, ? e L(H), is such one. We show that even for this case we can find a special formula which for finite measures converts into
Measures on quantum structures 845 the Gleason formula. The first result in this direction was obtained by G.D. Lugovaja and A.N. Sherstnev [52]. First we extend the notion of frame function. A mapping /: S(H) —>· RU {-00} U {+00} is said to be г. frame function if (i) f{X) = f(Xx), ? e C, ? e S(H); (ii) there is a constant W (can be ±00), called the weight of / such that ?,- f(xi) — W for any ONB {*,¦} in tf. From the definition there follows that / can attain at most one of the improper values ±00. A frame function / has afiniteness property if | ???? /(-*<)! < °° f°re some ^NS {*,¦: i e 1} entails that f\S(G) is a frame function on G = 0ie/sp(.Xj) with a finite weight. It is clear that any frame function with a finite weight has the finiteness property. A frame function / is regular if there exists a symmetric bilinear form t with D(t) = {x e ?: ? ? {0}, I/(л:/H*||)I < 00} U {0} such that f(x) = t(x,x), ? e S(H) ? D{t). Let ? be a cardinal number. We shall say that a frame function / is ?-finite if there exists an ONB {xa: a e A) in ? such that A = (J/e/ ^ь where А, П Ay = 0 for / ? j, and ? ???? f(xi)\ < °° f°r апУ j e ^. anci the cardinal of / is n. For example, if ? = H,„ we shall say that / is ?-finite. Without loss of generality we shall assume that m : L{H) —> ]—00, 00]. First we present the basic result for L(E3) which has the same importance as that for finite states, [52]: LEMMA 3.10 (Lugovaja-Sherstnev). Let m : L(E3)^· ]-oo,oo] be a finitely additive charge with m (??) — oo, and let there be a two-dimensional subspace Q of a finite measure. Then if ? is a one-dimensional subspace of finite measure, then ? <zQ. Using this basic result, it is possible to prove the following result, [16]: THEOREM 3.11. Let ? be the quantum logic of a real or complex Hilbert space H, dim Я ? 2. Let ? be a cardinal number and let m be an ?-finite measure. The following assertions are equivalent: A) There exists a (unique) positive, symmetric bilinear form t with the domain D(t) dense in ? such that m(M) = tT(toPM) iftoPMeTT(H), 00 otherwise. B) m has a support, i.e., such an element M,„ that m (N) = 0 if and only if N _L Mm. C) m is a completely additive measure. PROPOSITION 3.12. Any positive self-adjoint operator ? defines by C.4) a Gleason measure, where t is defined by t(x, y) = (?1/2*, T^/2y) with D(t) = D(T]/ ). Now we show that the situation is more complicated when we use bilinear forms which are not determined by positive a self-adjoint operator.
846 A. Dvurecenskij Example 3.13. Let {e„\ be an ONB for an infinite-dimensional Hilbert space ? which is a part of a Hamel basis {e,},€/ for ? consisting of unit vectors from #.Fix an element e,0, /'o e /, which does not belong to the ONB {e,,}, and define a linear operator ? : ? —>¦ ? and a linear transformation /: ? —>¦ С via T ( ? "'' e' ) = ?'? e'o and / ( ?!?'e/ ) = ?'°' ? ^ i where <*,-„ is the scalar corresponding to e,-0 in the decomposition ? = ?? ai еь x e #' w'tn respect to the given Hamel basis. Then \\T\\ = oo = ||/||, and let t(f, g) - (Г/, Tg), then t defines via C.4) only finitely additive measure attaining infinite values. G.D. Lugovaja [51] showed that on a separable Hilbert space any singular positive symmetric bilinear form does not define any ?-finite Gleason measure by C.4). For that reason the following important result has been proved by B. Simon [69]: THEOREM 3.14. Let t be a densely defined positive symmetric bilinear form on a Hilbert space H. Then there exist two positive symmetric bilinear forms tr and t5 with D(tr) = D(t) = D(ts) such that t = tr+ts, where tr is the largest closable bilinear form less than t. The following result of Lugovaja gives only finitely additive measures: THEOREM 3.15. Let t be a singular, positive symmetric bilinear form with the dense domain D(t) in H. Then the mapping m : L(H) -> [0, oo] defined by C.4) is not a measure on L(H) of an infinite-dimensional Hilbert space H. The affirmative answer is also due to Lugovaja: THEOREM 3.16. A positive symmetric bilinear form t with a dense domain D(t) defines through C.4) a Gleason measure on L(H)for every infinite-dimensional Hilbert space ? if and only if for any ? e L(H), (to PM)r eTr(tf) implies to Рм еТт(Н), where (t о Рм )r is the regular part of the closure t о Рм- Now we shall study the question of which kind of functions is defined by C.4). It will be shown that it is precisely the set of all P(#)-regular finitely additive measures on L(H) which can also attain even infinite values. We recall that a finite charge m on L(H) is
Measures on quantum structures 847 P(H)-regular if given ? > 0 and any ? e L(H) there is a finite-dimensional subspace N of ? such that \m(M nW)! <?. For finite measures we have the following characterization: THEOREM 3.17. Let m be a finite Jordan charge on L(H), dim ? ? 2. Then m is P(H)- regular if and only if there is a Hermitian trace operator ? on ? such that C.1) holds. For infinite measures we have this characterization: THEOREM 3.18. Let m be ? ?-finite, ?'(H)-regular, finitely additive measure on L(H), dim ? ? 2 of a separable Hilbert space H. Then there exists a unique bilinear form t with a dense domain such that C.4) holds. Conversely, for any positive bilinear form t on ? such that D(t) is dense in H, the mapping mt: L(H) —>¦ [0, oo] given by mt(M) ¦¦ H(toPM) iftoPMeTT(H), oo otherwise, is ? ?-finite ?(?)- regular finitely additive measure. Finally, we show some convergence properties of finite signed states on L(H). We recall that a Gleason measure we understand any signed measure m on L (H) such that C.1) holds for some Hermitian trace operator ? on H. THEOREM 3.19 (Nikodym theorem). Let {m„} be a sequence of finite signed measures on L(H) of a Hilbert space H. If there is a finite limit m(M) = lim„ m„ (M) for any ? e L(H), then m is a finite signed measure on L(H), and {m,,} is uniformly ? -additive with respect to n. THEOREM 3.20. The space of all finite completely additive signed measures on L(H) is sequentially weakly complete for any Hilbert space H. In addition, the space of all finite Gleason signed measures is sequentially complete. We say that a system of finite charges {v,: / e /) on L(H) is uniformly absolutely continuous with respect to a finite, finitely additive measure m if given ? > 0 there is a S > 0 such that m (?) < ? implies | v,- (M) | < ? for any / e /. THEOREM 3.21 (Vitali-Hahn-Saks). Let [v„] be a sequence of semi bounded, completely additive signed measures on the quantum logic L(H), dim Я ф 2, such that every v„ is absolutely continuous with respect to a completely additive measure m. If there is a finite limit v(M) = lim„ v„(M), ? e L(H), then {v,,} is uniformly absolutely continuous with respect to m.
848 A. Dvurecenskij For uniform convergence of bounded signed measures we have the following statement combining this convergence with the convergence of trace operators, [45]: THEOREM 3.22. A sequence of Gleason signed measures m„() = tr(r„P(.)) converges uniformly to 0 if and only if ????(?„) = 0, II where ? is the trace norm of trace operators on H. 3.3. Gleason's theorem and completeness criteria of inner product spaces In this section, we show how Gleason's theorem can be used even for characterization of completeness inner product spaces (= Hubert spaces). Let S be a real or complex inner product with an inner product (·, ¦)· We recall that for any MCSwe put M1- = [x e S: (x,y) = 0 Vy e ?}, and by ? we mean the completion ofM. Denote by F(S) = [M с 5": М = М±±}, i.e., the set of all orthogonally closed subspaces of S, and E(S)= [M C5: M + M^ = S], i.e., the set of all splitting subspaces of S. It is easy to see that E(S) с F(S). Moreover, 5" is complete iff E(S) — F(S), which is a consequence of the Amemiya-Araki theorem. We present now two simple completeness criteria, [17]: THEOREM 3.23. A real or complex inner product space S is complete iff there is a unit vector ? e S and a positive number W > 0 such that, for any MONS {.v,} in S, 0< W ^J2\(x,Xi)\2 i holds. THEOREM 3.24. A real or complex inner product space S is complete iff there is a unit vector ? e S and a positive number W > 0 such that, for any MONS [xi} in S, 0< W^^|(jc,jc,)|2 holds.
Measures on quantum structures 849 Slight generalizations of the above facts are the following ones. A mapping /: S(S) :— {x e S: ||jr|| = 1} -> E+ is said to be a frame function iff there is a constant W (called the weight of /) such that i holds for any MONS {*,¦} in 5". A mapping / :S(S) —>¦ E+ is said to be a weak frame function iff (i) there is a positive constant W such that 0< W ^Y^f(xt)< oo i holds for any MONS {*,¦ } in S; and (ii) f\S(M) is a frame function for any finite-dimensional subspace ? of S. From (ii) we have that f(Xx) = f{x) for any |?| = 1. It is evident that any frame function is a weak frame function, and the converse holds, too, as we shall see below, which will prove the completeness. We recall that if ? is a unit vector in S or in S, then f:z ь* ? (?,*) ?2, ? e S(S), is sometimes (iff S is complete) a special type of a frame function or a weak frame function. THEOREM 3.25. An inner product space S is complete iff there is at least one weak frame function f on S(S). COROLLARY 3.26. Any weak frame function is a frame function. A mapping / : S(S) := [x e S: \\x || = 1} —>¦ ? is said to be a signed frame function iff there is a constant W (called the weight of /) such that i holds for any MONS {*,¦} in 5". A mapping / :S(S) —>¦ ? is said to be a weak signed frame function iff (i) for any MONS {*,} is S, {/(*/)}/ is summable; (ii) there is a positive constant W such that o< ?^??-?-*')! < oo i holds for any MONS {*,-} in S; and (iii) f\S(M) is a signed frame function for any finite-dimensional subspace ? of S. From (iii) we have that f(Xx) = f{x) for any |?| = 1. It is evident that if / is a signed frame function with a non-zero weight, then / is a weak signed frame function.
850 A. Dvurecenskij THEOREM 3.27. An inner product space S is complete if and only if there exists at least one weak signed frame function f on S(S). What is important for us is that applying the Gleason theorem for bounded signed measures on finite-dimensional Hilbert spaces we have that there is a bounded bilinear from; on S ? S such thatt(x,x) — f(x), ? e S(S). We have seen above that it has been shown that S is complete iff there is at least one non-zero signed measure frame function on S(S). Using the property f(x) = m(sp(x)), ? e S(S), we have the following measure-theoretical characterization of the completeness of inner product spaces. A completely additive signed measure (state) on E(S) or F(S) is any completely additive function m on E(S) or F(S) which preserves all existing joins of mutually orthogonal elements of ?E') or F(S), respectively. THEOREM 3.28. A real or complex inner product space S is complete if and only if on E(S) or F(S) there exists at least one nonzero completely additive measure. For the reader's convenience, we collect some completeness criteria of inner product spaces. For other explanation of new notions, see, e.g., [16]; we recall only that: A) D(S) is the system of all Foulis-Randall subspaces of 5", i.e., of all subspaces ? of 5" for which there is an ONS {и,} in ? such that М={и^^, which is a complete orthoposet. Any ? of D(S) possesses at least one local complement M'', i.e., such an element ?' e D(S) for which M' _L ? and ? ? ?' = S. B) R(S) is the set of all subspaces ? of S such that ? - [u^1- for any MONS {и,} in ?, which is a poset. C) V(S) is the set of all subspaces ? of 5" such that ? = {ui}^1- and M1- = {v^1- for all MONSs {и,} and {uy} in ? and M^, respectively, which is an orthocomple- mented poset. D) C(S) is the set of all subspaces of S of finite or cofinite dimension, which is an orthocomplemented, modular lattice. E) P(S) is the set of all finite-dimensional subspaces of S, which is a lattice. F) W(S) is the set of all closed subspaces of S. THEOREM 3.29. Let S be a real or complex inner product space. The following criteria are equivalent. A) S is complete. B) E(S)=W(S). C) F(S) = W(S). D) For any proper closed subspaces ? of S, ?1- ? {0}. E) If f is a continuous linear functional on S, there exists у e S such that f(x) — (x,y)forallx e S. F) For any non-zero continuous linear functional f on S, (Кет f) ? {0}. G) For any continuous linear functional f on S, Ker/ e E(S). (8) F(S) is orthomodular (9) E(S) = F(S).
Measures on quantum structures 851 A0) E(S) is a complete lattice. A1) E(S) is ? ? -lattice. A2) E(S) is a ?-orthoposet (= quantum logic). A3) ?E") possesses the join of any sequence of mutually orthogonal one-dimensional sub spaces ofS. A4) ? (S) has the subsequential interpolation property. A5) ?(S) has the strong subsequential completeness property. A6) Any modular pair in W(S) is dual-modular A7) D(S) isanOML. A8) R(S)=F(S). A9) D(S) = E(S). B0) К (S), if К e {C, E,V, R, D, F, W], possesses at least one non-zero, completely additive signed measure. B1) 5" possesses at least one non-zero frame function. B2) Every MONS in S is an ON В in S. B3) There exists a unit vector у e S such that у = ?V:(y, jc, )jc, for any MONS [xj } in S. B4) F(S) (W(S)) possesses at least one Jordan P(S)-regular, non-zero charge. B5) К (S), where К e [E,V, R} and dimension of S is a countable cardinal, possesses at least one non-trivial, strongly ?(S)-regular, finitely additive measure. B6) К (S), where К e [D, F, W}, possesses at least one state having a support. B7) K(S), if К e {D, F, W] and the dimension of S is a non-measurable cardinal, possesses at least one non-trivial signed measure. B8) К (S), where К e {F, W}, possesses at least one signed measure non-vanishing on P(S). B9) К (S), where К e [F, W} and S is Dacey. possesses at least one signed measure non-vanishing on P(S). C0) К (S), where К e {F, W], possesses at least one finitely additive state with a finite- dimensional support. C1) K(S), where К e {F, W], possesses at least one finitely additive state with a P(S)- regular support. C2) Any P(S)-regular state on E(S) has a support in E(S). C3) For any sequence {jc,} of orthonormal vectors in S and all positive numbers {?/}, with J2i ?; = 1, the state J2i ?/ mXi has a support in E(S). C4) For any sequence {jc,} of orthonormal vectors in S, the state 5Z,- /ял, /2' has a support in E(S). C5) There is a strong system ? of states on E(S), dim S ? 2, such that any counter on Con(.M) is ? -expectational. C6) Any P(S)-regularfinitely additive state on E(S) has the ?-Jauch-Piron property. C7) Any ?(S)-regular finitely additive state on E(S) has the ?-weak Jauch-Piron property. C8) Any mx, ? eS(S), has the ? -weak Jauch-Piron property. C9) Sis Dacey. D0) D(S) is an OMP D1) (S(S), T{S)), where T(S) is the setofallMONSs in S, is an algebraic test space, [27].
852 A. Dvure(enskij D2) There is a unit vector ? e S (x e S) and a constant W such that 0 < W ? ?,? I С*,*/) I2 holds for any MONS {x,-} in S. D3) There is a weak frame function on S(S). D4) There is a weak signed frame function on S(S). Finally we present another generalization of Gleason's theorem to describe regular finitely additive measures on E(S). We note that on E(S) there exist plenty of finitely additive states for a non-complete inner product space S, but no completely additive. For example, let ? be a unit vector in S, then mx(M)=\\xM\\2, MeE(S), where ? = хм +хм±,хм e ?, хм^ е М^, is a finitely additive state on E(S). Moreover, the system of all such states describes the order on E(S). We say that a charge m on E{S) (F(S)) is ?(S)-regular if given ? e E(S) (F(S)) and given ? > 0 there is a finite-dimensional subspace N of ? such that \m(M nW)! <?. It is clear that ? (S) -regularity is equivalent to the statement that given Me E(S) we find a non-decreasing sequence [M„} of finite-dimensional subspaces of ? such that m(M) = \im m(M„). PROPOSITION 3.30. Let ? e ??EJ be a non-zero Hermitian operator, then тт defined via mT(M) = \i(TPg), MeE(S), C.5) is a P (S)-regular Jordan charge on E(S). The following result generalizes Aarnes' theorem and we give a Yosida-Hewitt-type decomposition for positive measures; here the role of Gleason measures will be played by ? (S) -regular charges. THEOREM 3.31. Any Jordan charge m on E(S), dim S ? 2, can be uniquely decomposed as a sum m —m\ + mj, where m\ is a P(S)-regular charge, and mi is a Jordan charge vanishing on all finite-dimensional subspaces ofS. THEOREM 3.32. A Jordan charge m on E(S), dim 5" ? 2, is P(S)-regular if and only if there is a Hermitian operator ? e TrE") such that т(М) = 1т(ТРй), MeE(S). By Tr(H) we denote the system of all Hermitian trace operators on H.
Measures on quantum structures 853 As wee see, formula C.5) describes all measures on E{S) which can be described by Gleason's formula, i.e., using a Hermitian trace operator on ? — S. We recall, that E(S) can be embedded into L(S) via ? ь* ?, ? e E(S). Problem 3.33. Finally we note that the analogue of Nikodym's theorem for the space of all ?(S)-regular Jordan charges on E(S), dim S ? 2, seems to be open. Similarly, the existence of at least one finitely additive state on F(S) for an incomplete S seems to be open, too. 4. States on MV-algebras MV-algebras are many-valued analogues of a two-valued logic, and they have were introduced by Chang [8]. The importance of MV-algebras for quantum structures follows from an important property that they model a classical situation with unsharp measurements. We recall that they form a counterpart to Boolean algebras in orthomodular poset models. We recall that according to Mundici [59], they can be characterized as follows. An MW-algebra is a non-empty set ? with two special elements 0 and 1 @^1), with a binary operation ?: ? ? ? —>¦ ? and with a unary operation *: ?? —>¦ ?? such that, for all a,b,ce ??, we have (MVi) ? © b = b © a (commutativity); (MVii) (? ? b) ? с — ? ? (b ? с) (associativity); (?Viii) ? ? 0 = a; (MViv) ? ? 1 = 1; (MVv) (a*)* = a; (MVvi) a ©a*= 1; (MVvii) 0*= 1; (MVviii) (a*®b)*®b= (a®b*)*®a. We recall that the above axioms are not independent, e.g., (MVvii) follows from (MVvi) and (MViii) for a = 0. We define binary operations ©, v, л as follows, for all a, b e M: aQb:=(a*®b*)*, a v b:= (a* ®b)* ? b, а Л b := (a* v b*)*. For a, b e M, we define a ^ b o· a = a l\b, then ^ is a partial order on M, and (?; ?, ?,?, 1) is a distributive lattice with the least and greatest elements 0 and 1, respectively. We recall that ? ?? biff b®a* = 1. Let (G; +, 0, ^) be an Abelian ?-group with a strong unit и, i.e., given ? e G, there is an integer/г > 1 such that —nu^v^nu. Define r(G,u)={geG: O^g^u] D.1)
854 A. Dvurecenskij and set for all g\ ,g2,g? T(G, u) g\®g2-{g\+gl)/\u, g\Og2 = (g\+gl-u)vO, g* = u-g. Then (f(G, и); ?, ?,* , 0, и) is an MV-algebra. The famous Mundici result [59] says that given an MV-algebra ? there is an Abelian ?-group G with a strong unit и such that ? is isomorphic with some T(G, u). In addition, ? defines a categorical equivalence between the category of unital ?-groups and the category of MV-algebras. The MV-algebra of the real interval [0, 1], i.e., (T(R, 1)), is of particular interest. We recall that the categorical equivalence of MV-algebras with unital ?-groups implies that if ? is an order- and +-preserving mapping from ? = T(G, u) into a po-group H, then there exists a unique group homomorphism f:G —>¦ ? such that ? {?) = ? (a), ae M. Given an MV-algebra M, we can introduce a partial binary operation + via a + b is defined iff a ^ b*, and in this case we put a +b:=a ®b. It is easy to see that a+0 = 0 + a = afor any a e M, and + is commutative, i.e., if a + b is defined in M, so is b+a,anda +b = b+a\ + is associative, i.e., if (a +b), (a+b)+c are defined in ? so are defined b + с and a + (b + c), and (a + b) + с = a + (b + c). Identifying the MV-algebra ? with r{G,u) via D.1), we can see that our partial operation + coincides with the group addition +. A non-void subset / of ? is said to be an ideal of ? if (i) x,ye I imply ? ? у е /; (ii) ? e /, у ^ ? imply у е I. A proper ideal A of ? is said to be maximal if there is no proper ideal of ? containing A as a proper subset. Let ? (?) denote the set of all maximal ideals of ?. Then ? (?) ? 0. Denote by Rad(M) := [\\A: A e M(M)}, and we call Rad(Ai) the radical of M. An MV-algebra ? is said to be semisimple if Rad(M) = {0}. Since if ? is an MV-algebra, then ? with the partial addition and 0 and 1 is an effect algebra, we define a state on an MV-algebra in the same way as that for effect algebras. A state on an MV-algebra ? is a mapping m :M —>¦ [0, 1] such that w(l) = 1, and m(a + b) = m(a) +m(b) whenever a + b is defined in M. Denote by S(M) the set of all states on M, then S(M) ? 0. Let (G, u) be an Abelian unital ?-group. A state on (G, u) is any mapping s : G -> К such that (i) i(«)=l; (ii) s(gi +g2) = i(gi)+i(g2), gi,g2eG; and (iii) s (g) > 0 for each g e G+. Due to the above mentioned categorical equivalence, we have that there exists a one-to- one equivalence between the states on T(G, u) and (G, и); this correspondence is given
Measures on quantum structures 855 by the property that if s is a state on (G, и), then m := s\T(G, u) is a state on T(G, и), and conversely, each state m on T(G, u) can be uniquely extended to a state s on (G, и). Moreover, extremal states on T(G, и) correspond to extremal states on (G, и), and vice versa. For example, the MV-algebra T(R, 1) possesses a unique state m, namely m{t) = t, re [0,1]. We write that a net {ma} of states on ? converges weakly to a state m iff m„(a) —>· m(a) for any a e M. Then 5(??) is a convex compact Hausdorff topological space. Let Ext(Ai) denote the system of all extremal states on M. A state m is said to be state- morphism \im{a®b) — min{m(a) +m(b), 1). THEOREM 4.1. Let ? be an MV-algebra. The following statements are equivalent: (i) m is an extremal state on M. (ii) m is a state-morphism on M. (iii) m(x Ay) = min{m(jt), m(y)}, x, у е М. The set of all extremal states is in a one-to-one correspondence with maximal ideals of ?. This correspondence is given by m «*· K(m):= [aeM: m(a) = 0}, m e Ext(<S(M)). D.2) THEOREM 4.2. Let ? be an MV-algebra. Then S(M) is a non-empty compact convex Hausdorff space with respect to the weak topology, and the space of all state-morphisms on X is a non-void compact Hausdorff space. Any state is a weak limit of convex linear combinations of the set of extremal points ofM. For every a e ?, we put M(a):= {I eM(M): a ?1]. Then(i)M(O)=0, ? (а) с ? ?) whenever ? ? b, ? (а лЬ) = ? (?) ? ? ?), a, be ?, ? (? ? b) = ? (a) U ? ?), a,b e ?, and {? (a): a e M] is the base of the so-called hull-kern topology Т_м on M(M). PROPOSITION 4.3. Let ? be an MV-algebra. Then the hull-kern topology defines a compact Hausdorff topology such that the closed subspaces ofM(M) are exactly of the form C = C(J):= {I eM(M): I^j], where J is an ideal of M. Similarly, every open set О is of the form 0= 0(J):=[I eM(M): /2 7}.
856 A. Dvurecenskij THEOREM 4.4. Let ? be an MV-algebra. Then the correspondence D.2) defines a homeomorphism of the space Ext(M) of all extremal states on ? and the space ?4 (M) of all maximal ideals of M. Now we show how order-determining systems of states on ? V-algebras determine they character. THEOREM 4.5. Let ? be an MV-algebra. The following statements are equivalent: (i) ? is semisimple. (ii) ? possesses an order-determining system of states. (iii) ? possesses an order-determining system of extremal states. (iv) ? is isomorphic to a system of fuzzy sets on a non-void set. (?) ? is isomorphic to a system of continuous fuzzy sets on a non-void compact Hausdorffset. For other aspects of states on MV-algebras we recommend also the paper of B. Riecan and D. Mundici [66] in this Handbook. 5. Measures on BCK-algebras BCK-algebras, introduced by Imai and Iseki [44], constitute a generalization of Boolean algebras, arising from the many valued logic of Lukasiewicz in the same manner as Boolean algebras arise from two-valued logic. In particular, bounded commutative BCK- algebras are equivalent to MV-algebras. 5.1. Commutative BCK-algebras with the relative cancellation property We recall that a BCK-algebra is an algebra (X; *,0) of type B,0) such that, for all x, y, ? e X, we have (i) ((x * y) * (x * z)) * (z * y) — 0; (ii) {x * (x * y)) * у = 0; (iii) ? * ? = 0; (iv) ? * у = 0 and у * ? = 0 imply ? = )·; (?) 0*jt = 0. In every BCK-algebra X = (X; *, 0) we can define a partial order ^ via ? ^ у iff ? * у = 0. X is said to be bounded if there exists the greatest element 1 in X. A BCK-algebra (X; *, 0) is said to be commutative if ? * (x *y) = у * (у * ?), x,yeX, and in this case, хлу=х*(х*у).
Measures on quantum structures 857 Example 5.1. Let X = {0, 1,2,3,4} and * operation be given by the table. Then (X; *, 0) is a bounded non-commutative BCK-algebra. * 0 1 2 3 4 0 1 0 0 1 0 2 1 3 3 4 4 2 3 4 0 0 0 0 0 0 0 1 0 3 0 0 4 4 0 In [25], an important class of commutative BCK-algebras was studied: we recall that a commutative BCK-algebra (X; *,0) has the relative cancellation property if, for x,y,a e X, such that ? > а, у ^ a and x*a = y*av/c have ? = у. Such BCK-algebras form a variety which is a proper subvariety of the variety of commutative BCK-algebras. In this case we can introduce a partial binary operation + on X as follows a + b is defined in X and equals с iff с > a and b = c*a. For the basic properties of + we recall only that + is commutative, associative and with a neutral element 0. For example, ([0, oof; *r, 0), where s *r t = max{0,i — t], s,te [0, oo), is an example of a commutative BCK-algebra with the relative cancellation property. Suppose that (G; +, ^,0) is a lattice-ordered group with positive cone G+ = [g e G: g > 0). Then (G+; *g,0) is a commutative BCK-algebra with the relative cancellation property, where *g is defined via и *g ? '¦— (u — t>) ? 0, u, ? e G+. More generally, if Go is a non-void subset of G+ such that u, ? e Go imply и *g v e Go, then (Go;*g,0) is a commutative BCK-subalgebra of (G+;*g,0) having the relative cancellation property. In addition, the derived partial sum in X = Go coincides with the sum in G, provided that this is again in Go- The latter example is in some sense the archetype of a commutative BCK-algebra with the relative cancellation property, owing to the following basic representation theorem for commutative BCK-algebras, proved in [25]: THEOREM 5.2. Let (X; *,0) be a commutative BCK-algebra with the relative cancellation property. Then there exists a lattice-ordered group (G; +, ^, 0) with positive cone G+ and an injective BCK-homomorphism h:X—> G+ such that the pair (G, h) is the universal group for X: that is, for any partial ordered Abelian group (K; +, ^,0) andanyorder- and -^-preserving mapping g.X —> К there is a homomorphism of Abelian ordered groups g' :G —> К such that g—g'oh.
858 A. Dvurecenskij 5.2. Measures on commutative BCK-algebras with the relative cancellation property A mapping m : X —>· [0, oo) such that, for all x, у е X. (i) m(x * y) = m(x) — m(y) whenever у ?. ? is said to be a measure; (ii) if 1 e X and m is a measure with m A) = 1, then m is said to be a state; (iii) m(.x * y) = max{0, m(x) — m(y)} is said to be a measure-morphism; (iv) if 1 e X and m is a measure-morphism with m{\) = 1, w is said to be a state- morphism. An /dea/ of a BCK-algebra (X;*,0) is a non-empty set / of X such that, for all x,yeX, (i) 0e/; (ii) ? * у e I and у e I imply jc e /. A proper ideal of X which is not included as proper subset in any proper ideal of X is said to be maximal; we denote by M(X) the set of all maximal ideals on X. In contrast to bounded commutative BCK-algebras (= MV-algebras), it can happen that X possesses no maximal ideal. Similarly, there exists a commutative BCK-algebra which has only measure identically equals to 0, [28]: Example 5.3. Let Z+ = {0, 1,2,...} and let ? = (Z+)K. Set К = {/ e H: f has a countable support}. Then К is an ideal of the upwards directed commutative BCK-algebra H. Put X := H/K, then X is a Dedekind ?-complete3 with M(X) — 0, and any measure on X is identical 0. IfM(X)^0, the set Rad(X):=f^|{M: MeM(X)} is called the radical of X. If Rad(X) = {0}, X is said to be semisimple. If m is a measure on X, then Ker(w) := {x e X: m(x)=0} is an ideal of X. In view of the categorical equivalence of commutative BCK-algebras with the relative cancellation property, any measure m on X can be uniquely extended to a measure on the corresponding group G; we recall that a measure on an Abelian ?-group G is any mapping s : G —> ? which preserves + and the positivity on X. PROPOSITION 5.4. Let m ? апатг be two non-zero measure-morphisms on a commutative BCK-algebra (X;*,0) with the relative cancellation property. Suppose Ker(wi) — Ker(w2). If there is an element xq e X such that m\(xo) = mi(xo) > 0, then m\ = mi- Otherwise, there exists a constant с such that m\ = стг- It means, e.g., that the corresponding ^-groups is so.
Measures on quantum structures 859 THEOREM 5.5. Let (X; *, 0) be a non-trivial commutative BCK-algebra with the relative cancellation property. Then there is a one-to-one correspondence between the set of all measures-morphisms on X and the set of maximal ideals ?4(?) given by m -o· Ker(m). The correspondence is unique up to a constant с In particular, if(X; *, 0) is a bounded commutative BCK-algebra, then ? (?) ? 0, and there is a one-to-one correspondence between ?4(?) and the set of all state-morphisms onX. We can say more for BCK-algebras with a quasi strong unit. We recall that an element и of X is a quasi strong unit of X if the ideal of X generated by и is equal to X. It is possible to show that и is a quasi strong unit for X iff и is a strong unit in the corresponding ?-group G representing u. Therefore, X possesses a measure m such that m(u) = 1. Denote by SU(X) the space of all measures m on X such that m(u) — 1. The following result extends Theorem 4.2 to commutative BCK-algebras which need not be bounded. THEOREM 5.6. Let и be a quasi strong unit of a non-trivial commutative BCK-algebra (X; *, 0) with the relative cancellation property. Then S„(X) is a non-empty compact convex Hausdorff space, and the space of all measure-morphismsfrom <S„ (X) is a non-void compact Hausdorff space. Any measure from <S„ (X) is a weak limit point of the convex hull of the set of extremal points ofSu (X). In addition, the following statements are equivalent: (i) m is an extremal measure from <S„ (X). (ii) m is a measure-morphism from S„(X). (iii) m(x л у) = m'm{m(x),m(y)}, ?, у е X, m(u) = 1. The notion of an order-determining system is similar as that in previous chapters. We recall that X is said to be A rchimedean iff, for a, b e ?,?? ^ b for any ? > 1 implies ? = 0. This notion is equivalent with the Archimedeanicity of the corresponding representation ?-group. For example, let X be a system of nonnegative functions on ? ?? which is closed with respect to the requirement /, g e X then / * g e X, where (/ * g)(co) = /(?) *? #(?), ? e ?, is an important example of BCK-algebras. THEOREM 5.7. Let и be a quasi strong unit of a non-trivial commutative BCK-algebra (X; *, 0) with the relative cancellation property. The following statements are equivalent: (i) X is semisimple. (ii) X is Archimedean. (iii) X has an order-determining system of measure-morphismsfrom S„(X). (iv) X has an order-determining system of measures from <S„ (X). (?) ? is isomorphic to some commutative BCK-algebra of functions on some ? ? 0. A nonzero element a e X is said to be an atom if b ^ a implies b e [0,a]. We denote by A(X) the set of all atoms of X. X is said to be atomic if given a nonzero ? e X, there exists an atom a e X such that a ^ x. Given an element xeX.we define ja(x) =sup{n >0: /?? is defined in X, na^ x}. E.1)
860 A. Dvurecenskij THEOREM 5.8. Let a be an atom of an Archimedean commutative BCK-algebra (X; *, 0) with the relative cancellation property. Then ja defined via E.1) is a non-trivial measure- morphism on X, and there exists a unique maximal ideal Ma ofX such that a ? Main addition, X is atomic if and only if{ja ¦ a e A(X)} is order-determining. In this case, X is semisimple, and X is a subdirectproduct of\\a X/Ma. We can say more, if X is roughly speaking the direct product of BCK-algebras of the form {0, a,2a, 3a,..., Ka], where К is a finite integer or A" — oo. Such BCK-algebras are the following: we say that an atomic commutative BCK-algebra is atomically bounded if any system {iaa: a e A(X)}, where ;'„ is a non-negative integer such that iaa is defined in X, has an upper bound in X. For example, any bounded atomic commutative BCK-algebra is atomically bounded, [26]. THEOREM 5.9. Let (X; *, 0) be a Dedekind complete and atomically bounded commutative BCK-algebra with the relative cancellation property. Then: (i) A complete ideal ? of X is a maximal ideal if and only if ? — Ma for some a e A(X), where Ma is defined in Theorem 5.8. (ii) Every ja is a completely additive measure-morphism, and {ja: a e A(X)} is order- determining. (iii) A non-trivial completely additive measure m on X is a completely additive measure-morphism if and only ifm — сja for some atom a and a positive constant с 6. States on pseudo MV-algebras Recently, G. Georgescu and A. lorgulescu [35] introduced pseudo MV-algebras which are a non-commutative generalization of MV-algebras introduced in 1958 by C.C. Chang [8] and which are an extension of a two-valued reasoning. A non-commutative generalization of reasoning can be found, e.g., in psychological processes. J. Rachunek [65] introduced non-commutative MV-algebras, which in fact are equivalent with pseudo MV-algebras. Such a noncommutative reasoning we can meet for example in psychological or social processes. We present new results of the author concerning states on pseudo MV-algebras. 6.1. Pseudo MV-algebras Definition 6.1. A pseudo MV-algebra is an algebra (??;?,~,~,0, 1) of type B, 1, 1, 0,0) such that the following axioms hold for all x, y, ? e ? with an additional binary operation © defined via (??) ? ® (y ® z) = (x <?> y) <?> z; (A2) ??0 = 0??=?;
Measures on quantum structures 861 (A3) x® 1 = \®x= 1; (A4) 1~ = 0; Г =0; (A5) (гшу-г = (гел_; (A6) x®x~Oy-y®y~~Ox=xOy-®y = yOx-®x; (A7) д:0(Г9у) = ие y~) ? у; (A8) (дг)~=дг. We shall assume that 0 ? 1. If we define ? ^ у iff jc~ ? у = 1, then ^ is a partial order such that ? is a distributive lattice with ;tvy=jt©jt~Oy. For basic properties of pseudo MV-algebras see [35]. For example, if и is a strong unit of a (not necessary Abelian) ?-group G, Г(Си):=[0,и] and ? ®y -=(x +y) ли, х~:=и-х, x~ := — ? + u, ? © у :— (? — и + у) ? ?, then (T(G, и); ф,_ ,~, 0, и) is a pseudo MV-algebra [35]. Example 6.2. Let G = (? ? ? ? ?; +, @,0,0), ^) be the Scrimger 2-group, i.e., (? ? + m2,m\ + k2,ri\ +П2), ifniisodd, (k\,m\,n\)-\-{k2,m2,nj) : — , ' (it? + k2,m\ + m2,n{ +п2), if n2 is even. Then 0 — @,0,0) is the neutral element, and (—m,—k,—n), if ? is odd, (k,m,n) (—k,—m,—n), it ? is even. and G is a non-Abelian ?-group with the positive cone G+ = ? ? ? ? Z+0 U Z+ ? ?+ ? {0}, or equivalently, (k[,m\,n\) ? (k2,m2,n2) iff (i) «? < n2, or (ii) «? = n2, k\ ^ b, m ? ij Ш2. Then (k\,m\,n\) ? (k2,m2,n2) (it|,b,M|), if «1 > «2, (&i ? it2,wi ? Ш2,И| vna), if«i=«2, (k2,m2,n2), if «l < «2,
862 A. Dvurecenskij and и — A, 1, 1) is a strong unit for G. Consequently, the corresponding pseudo MV-alge- bra has the form r(G,u) = Z+ xZ+xjOjUZ^i xZ^i ? {1}. with (k,m,0)~ = (\ -k,\-m,\), (k, m,0)~ = A - m, 1 - k, 1), (k,m, 1)" = A -m, 1 -?,?), (?,/я, 1)~ = A - ?, 1 -w,0), and {к\,т\,0)® (k2,m2,0) = (к\ +к2,т\ + m2,0), (?i,mi,0)©(?2,w2, 1) = ((mi +Ь) л l,C«2+*i)A 1, 1), (kum\, l)®(k2,m2,0) = ((?| +Ь)л1,(ш|+ш2)л1,1), (*i,i»i|,l)e(*2,W2,l) = (l,l,l)· We recall that every finite pseudo MV-algebra is an MV-algebra. Now we show that the pseudo MV-algebras of type T(G, u), where (G, u) is a unital ?-group, are prototypical examples of pseudo MV-algebras in view the following principal representation of pseudo MV-algebras proved in [21, Theorem 3.9] which generalizes the famous result of Mundici for MV-algebras. THEOREM 6.3. For any pseudo MV-algebras ?, there exists a unique {up to isomorphism) unital l-group G with a strong unit и such that ? = Г(С«). The functor Г defines a categorical equivalence of the category of pseudo MV-algebras with the category of unital l-groups. In addition, if h is a given isomorphism of ? with T(G, u), (G, h) is the universal group for M: that is, for any partially ordered group (K; +, ^, 0) and any order- and -^-preserving mapping g:M—>-K there is a homomorphism of partially ordered groups g' :G —> К such that g = g' о h. We say that an ideal of a pseudo MV-algebra ? is a subset I of ? such that (i) 0e/; (ii) if x, у e /, then ? ? у е / and у ? ? е /; and (iii) if ? e /, у e M, and у ^ x, then у е /. An ideal / of ? is said to be (i) normal if ? ? / = / ? ? for any x e ?;4 (ii) prime if ? А у e / entails ? e / or у e /; (iii) maximal if / is a proper ideal of ? and it is not included into any proper ideal of ? different of/. We define ? ® / := {? ® ?: ? e /); similarly we define / ?-?.
Measures on quantum structures 863 6.2. States and pseudo MV-algebras In the present section, we introduce states and state-morphisms on pseudo MV-algebras, and we show their close connections with normal and normal maximal ideals on pseudo MV-algebras, A state on a pseudo MV-algebra ? is a mapping m : ? —>· [0, 1 ] such that (i) w(l) = 1; and (ii) m (a + b) = m (a) + m(b) whenever ? + b is defined in M. If ? is a commutative pseudo MV-algebra (i.e., an MV-algebra), then ? possesses at least one state. Take now ? = T(G, u) from Example 6.2, and define a mapping m on ? by m((k,m,0)) = 0 and m((k,m, 1)) = 1. Then m is a unique state on this pseudo MV- algebra ? which is not an MV-algebra. The basic properties of states are as follows. PROPOSITION 6.4. Letm be a state on a pseudo MV-algebra M. Then, for all a,b e M, we have (i) m@) = 0. (ii) If a ^ b, then m(a) ^ m(b) and m(bQa~) = m(b) — m{a) = m(a~~ ? b). (iii) m(a~) = 1 — m(a) = m(a~). (iv) m{a~)=m{a) = m{a^). (v) m(a ? b) +m(a A b) = m{a) +m{b). (vi) m(a ? b) + m(b Qa)=m(a) + m(b). (vii) The set Ker(w) := [a e M: m(a) = 0} is a normal ideal of M. (viii) a/Ker(w) = fo/Ker(m) if and only if m (a) = m(a A b) = m(b). (ix) There is a unique state m on Ai/Ker(w) such that m([a]) = m(a), [a] ? ?I Ker(w), where [a] is the coset in M/ Ker(w) determined by a e M. (x) m(a ®b) =m{b®a). A mapping m from a pseudo MV-algebra ? into the standard MV-algebra [0, 1] such that, for all a, foe M, (i) m(a®b)=m(a)®mm(b), (ii) m(a~) = w(a~) = 1 — m(a), and (iii) w(l) — 1 is said to be a state-morphism. THEOREM 6.5. Letm be a state on a pseudo MV-algebra M. Then m is a state-morphism if and only if Ker(w) is a maximal ideal of M. Let m\ and mi be two measure-morphisms on a pseudo MV-algebra ? such that Ker(wi) = Ker(w2). Then m\ = m^.
864 A. Dvurecenskij Conversely, let I be a normal and maximal ideal of a pseudo MV-algebra M. Then there is a unique state-morphism m on ? such that Ker(w) = /. PROPOSITION 6.6. Let m be a state on a pseudo MV-algebra M. Then the following statements are equivalent: (i) m is an extremal state on M. (ii) m is a state-morphism on M. (iii) m(x Лу) = m'm{m(x),m(y)}, х, у е М. Denote by S(M) the set of all states of a pseudo MV-algebra ?. Then S(M) is a convex set, and let ExtE(M)) be the set of all its extremal states. A net {ma} of states on ? converges weakly to a state m iff ma{a) —>¦ m(a) for every a e M. We recall that due to the categorical equivalence, there exists a one-to-one correspondence between states on T(G, u) and states on (G, u). THEOREM 6.7. Let ? be a pseudo MV-algebra. IfS(M) ? 0, then S(M) is a compact convex Hausdorff space with respect to the weak topology, and the space of all state- morphisms on ? is a non-void compact Hausdorff space. Any state is a weak limit of convex linear combinations of the set of extremal points ofS(M). For every a e ?, we put M(a):= {I eAfM(M): a ?/}. Then (i) M@) = 0, ? (а) с ? ?) whenever я ^ b, ? (a Ab) = ? (?) ? M{b),a,b e M, M(a ? b) = ? (a) U ? ?), a,be M, and {M(a): a e M) is the base of the so-called hull-kern topology TtfM on ???4(?). Then the hull-kern topology defines a Hausdorff topology such that the closed subspaces of ???(?) are exactly of the form C = C(J):={I еЛГМ(М): /ЗУ}, where J is an ideal of ?. Similarly, every open set О is of the form О = O(J) := {I еЛГМ(М): I 2 ·/}· The following statement shows that extremal states and normal maximal ideals are homeomorphical. THEOREM 6.8. The mapping ? : ExtE(M)) -+ ???(?) defined by 0(/и):=Кег(/и), т е Ext(<S(M)), is a homeomorphism.
Measures on quantum structures 865 6.3. Existence of states As we have seen in Section 4, every MV-algebra possesses at least one state. For pseudo MV-algebras, the situation can be more complicated, because as it was shown in [22], there exists a pseudo MV-algebra having no state on it. Therefore we will now concentrated to the question when a pseudo MV-algebra has at least one state. Example 6.9. Let G be the group of all matrices of the form where ? and a are rational (real) numbers such that ? > 0; the group-operation is the usual multiplication of matrices. We denote A — (?, a). Then A = (\/?,-?/?), and A,0) is the neutral element. We define G+ :— {(?, a); where (i) ? > 1, or (ii) ? — 1 and a > 0). Then G with the positive cone G+ is a linearly ordered ?-group with a strong unit U = B,0).DefineM = r(G,tf).ThenM = M|UM2UM3,whereM| ={(?,«): 1 <? <2), Мг = {B, ?): ? ^0), and ?/3 = {(?,?): ? ^ 0). My is a unique normal and maximal ideal of M, and there is a unique state-morphism m, namely ??((?, a)) = log2(?), (?, a) e M. THEOREM 6.10. Every linearly ordered pseudo MV-algebra ? possesses a unique state. We say that a pseudo MV-algebra ? is representable if it can be represented as a subdirect product of linearly ordered pseudo MV-algebras. THEOREM 6.11. Every representable pseudo MV-algebra ? possesses at least one state. Let g be a nonzero element of a pseudo MV-algebra M. According to [35], we say that a value of g is an ideal V of ? such that g ? V, and V is maximal with respect to this property. As it was shown in [35, Theorem 2.15] such an ideal always exists in ? and is a prime ideal. We denote by r(g) the set of all values of g. For any value V of an element g we have that there is a unique least ideal V* properly containing V; it is called a cover of V. ? is said to be (i) finite valued if every non-zero element g e ? has finitely many values; (ii) normal-valued if, for any g > 0, any values V e r(g) is normal in its cover V*. THEOREM 6.12. Each normal-valued pseudo MV-algebra ? possesses at least one state. In particular, //|?A)| = ?, then ? possesses exactly ? state-morphisms. Finally we recall only that every MV-algebra is representable, every representable pseudo MV-algebra is normal-valued, and the converse implications do not hold, in general. Moreover, due to categorical equivalence, we have also built up theory of states on unital ?-groups not necessarily Abelian.
866 A. Dvurecenskij 7. Conclusion As we have seen, the sixth Hilbert's problem inspired experts to develop a very nice theory of quantum structures which today can live independently on the original quantum mechanics motivation. Nowadays we can find situations very far from quantum mechanics, where measured data do no fulfill axioms of the Kolmogorov axiomatical model of probability theory. Theory of quantum structures is a typical example of interdisciplinary scientific area where mathematicians, physicists, logicians can exchange their ideas. Today there exists an extensive list of monographs dedicated to this theory, we mentioned only [4,14,28,41, 46,64,70] and others. In this chapter we showed how deep is theory of states and measure on these structures. Moreover, we have shown their very close connections with classical mathematical objects as ?-groups and po-groups, respectively. Pseudo MV-algebras are probably the first algebraic structures with noncommutative addition for which measure theory is developed. Other new mathematical structures are pseudo effect algebras introduced recently by the author and ? Vetterlein [29,30] which also need measure theory. Finally we can say that on the theoretical analysis of any measured object, the relationship between the apparatus of quantum mechanics and quantum structures is similar to that between statistical physics and probability theory. Recent results show that the effort of mathematicians and physicists in building up probability theory and mathematical statistics was successful. Hence, there is a real hope that the investigation of quantum structures can bring new and useful results for both mathematics and quantum mechanics. References [1] J.F. Aarnes, Physical states on a C*-algebra. Acta Math. 122 A969), 161-172. [2] J.F. Aarnes, Quasi-states on C*-algebras. Trans. Amer. Math. Soc. 149 A970), 601-625. [3] G. Boole, An Investigation of the Laws of Thought, Macmillan A854). Reprinted by Dover, New York A967). [4] E.G. Beltrametti and G. Cassinelli, The Logic of Quantum Mechanics, Addison-Wesley, Reading A981). [5] G. Birkhoff, Lattice Theory, Amer. Math. Soc. Colloq. Publ., Vol. 25, Providence, RI A967). [6] G. Birkhoff and J. von Neumann, The logic of quantum mechanics, Ann. Math. 37 A936), 823-834. [7] L.J. Bunce and J.D. Wright, The Mackey-Gleason problem. Bull. Amer. Math. Soc. 26 A992), 288-293. [8] C.C. Chang, Algebraic analysis of many-valued logics. Trans. Amer. Math. Soc. 88 A958), 467^490. [9] E. Christensen, Measures on projections and physical states, Comm. Math. Phys. 86 A982), 529-538. [10] R. Cooke, M. Keane and W. Moran, An elementary proof of Gleason's theorem. Math. Proc. Cambridge Phil. Soc. 98A985), 117-128. [11] M. Dichtl, Astroids and pasting. Algebra Univer. 18 A984), 380-385. [12] S.V. Dorofeev and A.N. Sherstnev, Frame-type functions and their applications, Izv. Vuzov Matematika, No. 4 A990), 23-29 (in Russian). [13] T. Drisch, Generalization of Gleason's theorem, Intemat. J. Theoret. Phys. 18 A979), 239-243. [14] A. Dvurecenskij, Signed states on a logic. Math. Slovaca 28 A978), 33^40. [15] A. Dvurecenskij, Generalizations of Maeda's theorem, Intemat. J. Theoret. Phys. 25 A986), 1117-1124. [16] A. Dvurecenskij, Gleason's Theorem and Its Applications, Kluwer, Dordrecht, and Ister Science Press, Bratislava A993).
Measures on quantum structures 867 [17] A. Dvurecenskij, Frame functions and completeness of inner product spaces. Ann. Inst. H. Poincare Phys. Theor. 62A995), 429^438. [18] A. Dvurecenskij, Gleason's theorem in applications. Quantum Measures and Spaces, G. Kalmbach, ed., Kluwer, Dordrecht A998), 39-50. [19] A. Dvurecenskij, Measures on commutative BCK-algebras, Atti Sem. Mat. Fis. Univ. Modena B000), to appear. [20] A. Dvurecenskij, Loomis-Sikorski theorem for ?-complete MV-algebras and l-groups, J. Austral. Math. Soc. Ser. A 68 B000), 261-277. [21] A. Dvurecenskij, Pseudo MV-algebras are intervals in ?-groups, J. Austral. Math. Soc. 72 B002), to appear. [22] A. Dvurecenskij, States on pseudo MV-algebras, Studia Logica 68 B000), 301-327. [23] A. Dvurecenskij, Fuzzy set representation of some quantum structures. Fuzzy Sets and Systems 101 A999), 67-78. [24] A. Dvurecenskij, P. de Lucia and E. Pap, On a decomposition theorem and its applications. Math. Japonica 44A996), 145-164. [25] A. Dvurecenskij and M.G. Graziano, Commutative BCK-algebras and lattice ordered groups. Math. Japonica 49 A999), 159-174. [26] A. Dvurecenskij and M.G. Graziano, Dedekind complete commutative BCK-algebras. Order 17 B000), 23-41. [27] A. Dvurecenskij and S. Pulmannova, Test spaces, Dacey spaces, and completeness of inner product spaces, Lett. Math. Phys. 22 A994), 299-306. [28] A. Dvurecenskij and S. Pulmannova, New Trends in Quantum Structures, Kluwer, Dordrecht, and Ister Science, Bratislava B002). [29] A. Dvurecenskij and T. Vetterlein, Pseudo-effect algebras. I. Basic properties. Intemat. J. Theoret. Phys. 40 B001), 685-701. [30] A. Dvurecenskij and T. Vetterlein, Pseudo-effect algebras. II. Group representation, Intemat. J. Theoret. Phys. 40B001), 703-726. [31] M. Eilers and E. Horst, The theorem of Gleason for nonseparable Hilbert space, Intemat. J. Theoret. Phys. 13A975), 419^424. [32] D.J. Foulis and M.K, Bennett, Effect algebras and unsharp quantum logics. Found. Phys. 24 A994), 1325- 1346. [33] D.J. Foulis and C.H. Randall, Operational statistics. I. Basic concepts, J. Math. Phys. 13 A972), 1667- 1675. [34] L. Fuchs, Partially Ordered Algebraic Systems, Pergamon Press, Oxford A963). [35] G. Georgescu and A. Iorgulescu, Pseudo-MV algebras, Multi-Valued Logic B000), in print. [36] R. Giuntini and H. Greuling, Toward a formal language for unsharp properties. Found. Phys. 19 A989), 931-945. [37] A.M. Gleason, Measures on the closed subspaces of a Hilbert space, J. Math. Mech. 6 A957), 885-893. [38] K.R. Goodearl, Partially Ordered Abelian Groups with Interpolation, Math. Surveys Monogr., Vol. 20, Amer. Math. Soc., Providence, RI A986). [39] R. Greechie, On the structure of orthomodular lattice satisfying the chain condition, J. Comb. Theor. 4 A968), 210-218. [40] R. Greechie, Orthomodular lattices admitting no states, J. Comb. Theor. 10 A971), 119-132. [41] S. Gudder, Stochastic Methods in Quantum Mechanics, Elsevier, Amsterdam A979). [42] J. Gunson, Physical states on quantum logics, Ann. Inst. H. Poincare, Phys. Theor. 17 A972), 295-311. [43] W. Heisenberg, The Physical Principles of Quantum Theory, Dover, New York A930). [44] Y. Imai and K. Iseki, On axiom systems of prepositional calculi, Proc. Japan Acad. 42 A966), 19-22. [45] R. Jajte, On convergence of Gleason measures. Bull. Acad. Polon. Sci., Ser. Math. Astr. Phys. 20 A972), 211-214. [46] G. Kalmbach, Orthomodular Lattices, Academic Press, London A983). [47] S. Kochen and E.R. Specker, The problem of hidden variables in quantum mechanics, J. Math. Mech. 17 A967), 59-67. [48] A.N. Kolmogorov, Grundbegriffe der Wahrscheinlichkeitsrechnung, Berlin A933). [49] F. Корка and F. Chovanec, D-posets, Math. Slovaca 44 A994), 21-34.
868 A. Dvurecenskij [50] ?.?. Lodkin, Every measure on the projectors of a W*-algebra can be extended to a state, Funct. Anal. Appl. 8 A974). Translated from Funkc. Anal. I Ego Priloz. 8 A974), 54-58. [51] G.D. Lugovaja, Bilinear forms defining measures on projectors. Izv. Vuzov Matematika, No. 2 A983), 88-88 (in Russian). [52] G.D. Lugovaja and A.N. Sherstnev, On the Gleason theorem for unbounded measures, Izv. Vuzov Matematika, No. 2 A980), 30-32 (in Russian). [53] G.W. Mackey, Mathematical Foundations of Quantum Mechanics, Benjamin, New York A963). [54] S. Maeda, Lattice Theory and Quantum Logic, Maki-Shoten, Tokyo A980) (in Japanese). [55] S. Maeda, Probability measures on projectors in von Neumann algebras. Rev. Math. Phys. 1 A990), 235- 290. [56] S.M. Matvejchuk, Finite measures in an approximative-finite factor, Izv. Vuzov Matematika, No. 5 A976), 79-85 (in Russian). [57] S.M. Matvejchuk, Extension of measures in approximative factors, Izv. Vuzov Matematika, No. 2 A977), 84-90 (in Russian). [58] S.M. Matvejchuk, An extension of unbounded measure to weight, Izv. Vuzov Matematika, No. 4 A987), 47-50 (in Russian). [59] D. Mundici, Interpretation ofAF C*-algebras in Lukasiewicz sentential calculus, J. Funct. Anal. 65 A986), 15-63. [60] D. Mundici, Averaging the truth-value in Lukasiewicz logic, Studia Logica 55 A995), 113-127. [61] M. Navara, Two descriptions of state spaces of orthomodular structures, Intemat. J. Theoret. Phys. 38 A999), 3163-3178. [62] A. Paszkiewicz, Measures on projections of von Neumann algebras, J. Funct. Anal. 62 A985), 87-117. [63] C. Piron, Foundations of Quantum Physics, Addi son-Wesley, Reading, MA A976). [64] P. Ptak and S. Pulmannova, Orthomodular Structures as Quantum Logics, VEDA and Kluwer, Bratislava and Dordrecht A991). [65] J. Rachunek, A non-commutative generalization of MV-algebras, Czechoslovak Math. J. 52 B002), 255- 273. [66] B. Riecan and D. Mundici, Probability on MV-algebras, Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam B002), 869-909. [67] A.N. Sherstnev, On a notion of a charge in noncommutative scheme of measure theory, Veroj. Metody Kibem., Kazan, No. 10-11 A974), 68-72 (in Russian). [68] R. Sikorski, Boolean Algebras, Springer, Berlin A964). [69] B. Simon, A canonical decomposition for quadratic forms with application to monotone convergence theorems, J. Funct. Anal. 28 A978), 377-385. [70] VS. Varadarajan, Geometry of Quantum Theory, Vol. 1, Van Nostrand, Princeton, NJ A968). [71] J. von Neumann, Mathematische Grundlagen der Quantenmechanik, Springer, Berlin A932). [72] F.J. Yeadon, Measures on projections in W*-algebras of type ll\. Bull. London Math. Soc. 15 A983), 139-145. [73] F.J. Yeadon, Finitely additive measures on projections in finite W* -algebras. Bull. London Math. Soc. 16 A984), 145-150.
CHAPTER 21 Probability on MV-Algebras Beloslav Riecan Mathematical Institute, Slovak Academy of Sciences. Stefanikova 49. SK-814 73 Bratislava, Slovakia E-mail: riecan@mat.savba.sk Daniele Mundici Department of Computer Science, University of Milan, Via Comelico 30-41, 20135 Milan, Italy E-mail: mundici@mailserwr.uiiimi.it Contents 1. Background on ?V-algebras 871 Prologue: Operator algebras and MV-algebras 871 1.1. MV-algebras: basic notions 872 1.2. Representation of semisimple MV-algebras 873 1.3. MV-algebras and ^-groups 874 1.4. Tribes and ?-complete MV-algebras 876 1.5. The Loomis-Sikorski theorem for MV-algebras 877 1.6. Bibliographical remarks 878 2. States and observables 879 2.1. States and observables in MV-algebras 879 2.2. Independence 883 2.3. Kolmogorov's construction 884 2.4. Functions of several observables 885 2.5. The MV-algebraic central limit theorem 886 2.6. MV-algebraic laws of large numbers 888 2.7. Bibliographical remarks 889 3. MV-algebras with product 890 3.1. Product and joint observables 891 3.2. Conditional expectation 893 3.3. Upper and lower limits 895 3.4. Individual ergodic theorem 897 3.5. Bibliographical remarks 898 4. Finitely additive measures 899 4.1. Basics 900 4.2. Entropy of dynamical systems 901 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V All rights reserved 869
870 В. Riecan and D. Mundici 4.3. Entropy of full tribes 903 4.4. Bibliographical remarks 904 5. Open problems 905 References 906
Probability on MV-algebras 871 1. Background on MV-algebras Our aim is to survey the MV-algebraic generalization of notions and results of boolean algebraic probability theory stemming from the work of Caratheodory and von Neumann. The former author was motivated by the program of founding probability theory without mentioning points in the event space ?; the second author was driven by the hope that an algebra of observables might be built on the lattice structure of the quantum mechanical counterpart of ?. MV-algebras (to be defined in 1.2 below) are a non-commutative generalization of boolean algebras, in a sense that will be made more precise in Theorem 1.1, using the language of C*-algebras. Readers only interested in MV-algebraic probability theory can safely skip the following prologue, and proceed to the next section. Space constraints will not allow us to include many proofs; however, at the end of each section the reader will find adequate bibliographical references for the proofs of all stated results. Prologue: Operator algebras and MV-algebras In the approach of Birkhoff and von Neumann [3], properties of a quantum system S are identified with self-adjoint idempotent elements (i.e., projections) in the algebra B(H) of continuous linear operators on the Hilbert space ? of S. Equivalently, properties are elements of the family С-ц of all closed linear subspaces of ?. Observables of S are then built from the projections. The second main important notion is that of a state of S. The following definition captures the intuitive ideal that a "physical state" should be a sort of continuous averaging process for these propositions. A state is a map m : С-ц -> [0, 1] such that m(H) = 1 and whenever ? ?, ??, A3,.., с Cn are pairwise orthogonal closed linear subspaces of ?, then m(\J^L, A,) = Y^LX w(A,). Here, V/^i ?? denotes the closed linear subspace of ? generated by the set-theoretic union U A,, Measure theory for lattice structures associated to Hilbert space projections will be the object of other chapters in this Handbook. By contrast, no Hilbert space ?. can be canonically assigned to a system with infinitely many degrees of freedom, such as those naturally occurring in quantum statistical mechanics and in quantum field theory. Any such system S typically possesses many physically inequivalent irreducible Hilbert space representations, paralleling its macroscopically different phases, or preparations. This happens in particular when S arises as the thermodynamic limit of finite spin-like systems. The C*-algebraic counterpart of S is then an AF C*-algebra, i.e., the norm-closure A of the union of an ascending sequence ?\ с f2 ? ¦ · ¦ of finite-dimensional C*-algebras, all with the same unit [1,4]. While all physical information about S is now coded into A, rather than in a Hilbert space ?, the role of the projections of A is no less important than in the case of B(H). To see this, one first defines two projections p,q e A to be equivalent if there exists an element ? e A such that vv* = ? and v*v — q, where v* is the adjoint of v. We denote by [p] the equivalence class of p, and by L(A) the set of equivalence classes of projections of A, The Murray-von Neumann order over L (A) is defined by stipulating that
872 В. Riecan and D. Mundici [p] ^ [?] if ? is equivalent to a projection r such that rq = r, Elliott's partial addition, denoted +, is the partial operation on L(A) given by adding two projections whenever they are orthogonal. Then + is associative, commutative, monotone, and satisfies the following residuation property; (*) For every projection ? eA, among all classes [q] such that [p] + [q] = [1д] there is a smallest one,1 denoted -*[p], namely the class [1_д — ?]. Theorem 1.1. (i) There is at most one extension of Elliott's partial addition to an associative, commutative, monotone operation © over the whole L(A) having the above residuation property (*). Such extension © exists if, and only if L(A) is a lattice. (ii) Let >C(A) — (L(A), [0], ->, ?). Then the map А ь* ?,(?) is a one-one correspondence between isomorphism classes of AF C*-algebras whose L(A) is lattice- ordered, and isomorphism classes of countable MV-algebras, (iii) In particular, up to isomorphism, the map А ь* ?,(?) is a one-one correspondence between commutative AF C* -algebras and countable boolean algebras. The inverse correspondence is given by the map clop(X) ь* C(X), the latter denoting the C*- algebra of complex-valued continuous functions over an arbitrary separable totally disconnected compact Hausdorff space X, Here clop(X) denotes the boolean algebra of closed-and-open subsets ofX, 1.1. MV-algebras: basic notions The above theorem gives a precise formulation to the intuition that (countable) MV-algebras are a non-commutative generalization of (countable) boolean algebras. Accordingly, the MV-algebraic probability theory developed in this survey can be thought of as a non- commutative generalization of classical boolean algebraic probability theory. Definition 1.2. An MV-algebra ? = (?, 0, 1, -л ?, ?) is a system where ? is associative-commutative with neutral element 0, and, in addition, -? = 1, -? = 0, ? ? 1 = 1, x Oy = -¦(-¦ ? ®-\y) and у ® -*(у ® -*х) =х 0-4* ? ~\у), for all ?, у е ?, Since both the constant 1 and the operation ? are definable from the remaining operations 0, ->, ?, there no danger of confusion between the present definition, and the statement of Theorem l.l(ii). MV-algebras are the models of the equational theory of magnitudes with a distinguished archimedean unit, in a sense that will be made precise by Theorem 1.7 below. They stand to the infinite-valued calculus of Lukasiewicz as boolean algebras stand to the classical two-valued calculus. An example of MV-algebra is the real unit interval [0, 1] equipped with the operations -a— 1-х, ? ®y = min(l,;t +y), ? ? у = max@, x + у - 1). A) ?? fact, ~·[?] is the only class [q] satisfying [p] + [q] = [I.4].
Probability on MV-algebras 873 This MV-algebra has the same role for MV-algebras as the two-element algebra {0, 1} has for boolean algebras. Indeed, Chang's completeness theorem states that if an equation is satisfied by [0, 1] then the equation is satisfied by all MV-algebras. It follows that every MV-algebra is a homomorphic image of a subalgebra of a product of copies of [0, 1 ]. In every MV-algebra ? = {?, 0, 1, ->, ?, ?), the binary relation ^ given by a ^ b iff ? ? -<b = 0, is a partial order. Furthermore, the derived operations ? and л given by avb = -"-(--? ? b) © b and a /\b = --(--? ? -*b) make ? into a distributive lattice with least element 0 and greatest element 1; this is called the underlying lattice of ?. Boolean algebras coincide with those MV-algebras such that ? ? ? = ?. When the latter equation holds, then the ? operation coincides with v, and © coincides with л. DEFINITION 1.3, An MV-algebra ? is said to be ?-complete if its underlying lattice is ?-complete, i.e., every non-empty countable subset of ? has a supremum in ?. Every finite MV-algebra ? is ?-complete - indeed, ? is complete, in the sense that every non-empty subset of ? has a supremum in M. Further examples of ?-complete MV-algebras are given by ?-complete boolean algebras, and in particular, ?-algebras of sets. More examples will be given in Section 1.4. In every ?-complete MV-algebra ? we have the distributivity law b A (b\ ? Ьг ? · · ·) = (b л b\) ? (b л bi) ? · · ·, B) 1.2. Representation of semisimple MV-algebras An MV-algebra ? is said to be semisimple if for each non-zero element ? e ? there is a homomorphism ?:?->[0,\] with ?(?) ? 0. Ideals are kernels of homomorphisms. Unless otherwise specified, every ideal / of any MV-algebra ? considered in this paper shall be assumed to be proper, i.e., / ? ?. An ideal / is said to be maximal if there is no ideal of ? strictly containing / (other than the improper ideal M). Thus, maximal ideals are automatically proper. We let M(M) denote the set of all maximal ideals of M. Since / is maximal iff it is maximal for the property of not containing the unit element 1, it is not hard to see that M(M) ? 0. PROPOSITION 1,4. For any MV-algebra ? and maximal ideal I of M, there is a unique isomorphism i/ of the quotient MV-algebra M/I onto a subalgebra o/[0, 1]. Let ? l: ? —>¦ M/I be the quotient map. Then the map I ь* M/I ь* ?/(?/I) is a one-to-one correspondence between the set of maximal ideals of ? and the set of homomorphisms of ? into [0, 1]. The inverse of this map sends any such homomorphism into its kernel. For every ideal / of M, let 0/ = {K e M{M) \ К 2 /}. Then the collection of all sets of the form ? ? is a compact Hausdorff topology on ? (?), called the spectral topology. By abuse of notation, we shall let M{M) also denote the space of maximal ideals of ? equipped with the spectral topology. Also known as the hull-kernel topology.
874 В. Riecan and D. Mundici For any compact Hausdorff space X let us agree to denote by C(X) the MV-algebra of all continuous [0, l]-valued functions on X with pointwise operations as in A), Following tradition, let [0, l]x denote the MV-algebra of all [0, l]-valued functions on X. Recalling Proposition 1,4, for every MV-algebra ? let the map a e ? ь* а* е [0, l]x be defined by stipulating that, for each / e M(M), ?*(/) = (?/?0,)(?)=?,(?//)?[?, 1], C) where о denotes composition of functions. The following theorem gives a concrete representation of semisimple MV-algebras: THEOREM 1.5. For every semisimple MV-algebra M, the above map а ь* a* is an isomorphism of ? onto a separating subalgebra M* of C(M(M)), in the sense that whenever I, J are distinct maximal ideals of ? there are elements a*, b* e M* with а*A)фЬ*У). COROLLARY 1.6. An MV-algebra ? is semisimple iff ? is isomorphic to a separating MV-algebra o/[0, \]-valued continuous functions defined over some compact Hausdorff space, iff ? is isomorphic to an MV-algebra of[0, \\-valued functions defined over some set. 1.3. MV-algebras and i-groups By an ?-group we shall mean a lattice-ordered Abelian group. For any ?-group G, an element и e G is said to be a strong unit of G if for all g e G there is an integer ? > 1 such that nu ^ g. By a morphism ?: (G, u) -> (G', и') we mean a group homomorphism 0: G —>¦ G' that also preserves the lattice structure and satisfies the condition ф(и) = и'. THEOREM 1.7. For any ?-group G with a strong unit и let T(G, u) be the unit interval [0,H] = (/ieG|0^/i^H), equipped with the operations ->g = u-g, g®h = u A(g+h), and g ? h = 0 ? (g +h - u). Then; (i) the structure ? = ([0, и], 0, и, ->, ?, ?) = T(G, u) is an MV-algebra; (ii) letting, for any morphism ?; (G, u) -> (G\ и'), Г (ф) be the restriction of ? to [0, u], then ? is a categorical equivalence (i.e., a full, faithful, dense functor) from ?-groups with strong unit to MV-algebras; (iii) the lattice operations on ? agree with those ofG; (iv) the map J ь* J ? [0, и] is an isomorphism between the lattice of l-ideals of G {equipped with set-theoretic inclusion) and the lattice of ideals of ? (also equipped with inclusion). All infinite suprema and infima are preserved by this correspondence. There is a natural isomorphism between r(G/J, u/J) and Г(С,»)/GП[0,«]).
Probability on MV-algebras 875 Thus in particular, up to isomorphism, every MV-algebra ? can be identified with the unit interval of ? unique ?-group G with strong unit u, in symbols, M = r(G,u), D) We say that G is the I-group (with strong unit u) corresponding to ?. COROLLARY 1,8. Let ? bean MV-algebra, (i) ? is ?-complete iff its corresponding ?-group G is Dedekind ?-complete, in the sense that every bounded sequence of elements in G has a supremum in G. (ii) If ? is ?-complete then ? is semisimple, and the set of maximal ideals of ? equipped with the spectral topology is a basically disconnected compact Hausdorff space. Recall that a compact Hausdorff space X is basically disconnected if whenever ? c. X is an open FCT subset, then the closure of ? is open. This is equivalent to saying that X is homeomorphic to the maximal ideal space M{B) of a ?-complete boolean algebra B, A subset of X is meager (also called, a set of first category) if it is the countable union of subsets of X whose closure has empty interior. Notation. Throughout this paper we let N = {1,2,,..} denote the set of non-zero natural numbers. Suppose aij is a double sequence of real numbers > 0 such that, for each fixed / the sequence a,j tends to 0 as j tends to infinity. Then for each ? > 0 and every / there is an index j* = ??(?) such that aiy < ? for all j > j*. It follows that Vai'.<M') ^ ?> whence °= ? Va'X"· феЦм i DEFINITION 1,9, An ?-group G is said to be weakly ?-distributive if it is Dedekind ?-complete, and whenever 0 ^ aiy e G is a bounded double sequence and for each / e N, a\j > aij+\ and Д aiy = 0, then °= ? Va<>«> tj>eNH i An MV-algebra ? is said to be weakly ?-distributive if it is ?-complete, and whenever bij e ? is a double sequence and for each / e N, bij > foiy+i and Ду b/j = 0, then °= Л Vbw)< From Theorem l,7(iii) we get
876 В. Riecan and D. Mundici PROPOSITION 1,10, Let ? be an MV-algebra and G its corresponding l-group with strong unit u. Then ? is weakly ?-distributive iff so is G, 1.4. Tribes and ?-complete MV-algebras Examples of ?-complete MV-algebras are given by tribes, the latter being a generalization of ?-algebras of sets: Definition 1.11. By a tribe over a non-empty set ? we mean an MV-algebra ? of [0, l]-valued functions on ? such that for each sequence f\, /2,... e ? the pointwise supremum / = sup, /, also belongs to ?. While ? is a ?-complete MV-algebra, not every ?-complete MV-algebra is isomorphic to a tribe (already a ?-complete boolean algebra need not be isomorphic to a ?-algebra of sets). The totality of ?-complete MV-algebras shall be characterized in Theorem 1.16 below. Let ? be a non-empty set and S a ?-algebra of subsets of ?. Let ? be the set of all 5-measurable functions from ? to [0, 1]. Then ? is a tribe, and, trivially, all [0, l]-valued constant functions over ? are members of ? (for short, ? is a fall tribe).3 Conversely we have: THEOREM 1.12. Let ? be a tribe over a non-empty set ?. Let S be the family of subsets of ? whose characteristic function belongs to T. We then have: (i) S is a ?-algebra of subsets of ? and each function f in ? is S-measurable; (ii) // ? is full then ? coincides with the set of all S-measurable functions f : ? —>¦ [0, 1]. Notation. For any ? с ? we let ?? : ? -> {0, 1} denote the characteristic function,4 of Y. Following tradition, a sequence a 1, ai, ...of sets, is said to be monotone if a \ Ce2C • · ·. The notation a„ / a stands for "? ?, ai,... is a monotone sequence of sets whose union is a". Similarly, for real numbers x\, xi,... the notation xn / x stands for "x\ ^xi ^ · · · and x is their supremum". Finally, given elements bi,bi,... in an MV-algebra M, the notation bn / b stands for "b\ ? bi ^ · · · and b is their supremum, with respect to the underlying order of ?". In the light of Theorem 1.12, the following is a generalization of the notion of probability measure on a ?-algebra of sets. Definition 1.13. Given a tribe T, by a state on ? we mean a map m:T^>- [0, 1] satisfying the following conditions: (i) m(l) = 1; Also known as a generated tribe [5]. Or indicator.
Probability on MV-algebras 877 (ii) if a, b,c e ? and a = b + c, then m (a) =m(b) + m(c)\ (iii) for all a,a\,a2,... e J7, if a„ /a, then m(a„) / т(а). The following is an integral representation theorem for tribes with a state: THEOREM 1.14. Let (?, J7) be a (not necessarily full) tribe with a state m.LetSbe the ?-algebra of all those subsets ? of ? whose characteristic function ?? belongs to T. Let the map ? : S -*¦ [0, 1 ] be given by P(Y) = m (??). Then ? is a probability measure on S, and for every f e ? we have m(f)= f fdP. E) As a trivial converse, for any probability space (?, S, P) let ? be the full tribe of all <S-measurable [0, l]-valued functions over ?. Let the state mp :T —> [0, 1] be given by mp(f) = fn /dP. Then (?, J7) is a full tribe with a state mp. By applying Theorem 1.14 to the triplet (?, ?,mp) we recover the probability space (?, S, P). 1.5. The Loomis—Sikorski theorem for MV-algebras Definition 1.15. Given two ?-complete MV-algebras F and M, a homomorphism ?: F —>¦ ? is said to be a ? -homomorphism if ? preserves all countable suprema, in symbols, ч(Уа«-)=\/(^а'-» for all sequences a \, ai,... e F. In particular, when F is a tribe ? over a set ?, and ? a ?-complete MV-algebra, a homomorphism ?:?—>-? is said to be a ? -homomorphism if for each sequence /?. /2, /3, · · · e T, letting / = sup, /, be their pointwise supremum, r\(f) coincides with the supremum V; l(fi) m M. The following result generalizes the classical Loomis-Sikorski theorem: THEOREM 1.16. Let ? be a ?-complete MV-algebra. Let ? = M(M) be the set of maximal ideals of ? with the spectral topology. Then there exists a tribe ? over ? and a ? -homomorphism ? of ? onto M, satisfying the following conditions: For each a e ? there is precisely one continuous function a* e ? such that ?(?*) = a. A function in ? has the same image as a* iff it differs from a* on a meager subset of ?. The above tribe ? precisely consists of all functions / : ? -> [0, 1 ] which differ only on a meager set from some (necessarily unique) function arising from the representation of ? as a separating algebra of continuous functions over ? (Theorem 1.5). In general, ? need
878 В. Riecan and D. Mundici not be full. To ensure fullness one must assume ? to satisfy an additional condition, as follows: PROPOSITION 1.17. Let ? be a ?-complete MV-algebra and G its corresponding l-group with strong unit u. Then the following conditions are equivalent: (i) ? has no finite quotients; (ii) for every maximal ideal J of M, the quotient MV-algebra M/J is infinite; (iii) ? is divisible (in the sense that G is a divisible group); (iv) for each integer ? ^ 1, there is w e ? such that nw = 1 = u inG, or equivalently, w ? w ? · · · ? w = -*w; ? — I times (v) the MV-algebra Q ? [0, 1] is embeddable into M. For ?-complete divisible MV-algebras the Loomis-Sikorski theorem has the following formulation: THEOREM 1.18. Let ? be a ?-complete divisible MV-algebra. (i) Let ? = M(M) be the compact Hausdorff space of maximal ideals ofM. Then the map а ь* a* of Theorem 1.5 is an isomorphism between ? and the MV-algebra ?(?) of all continuous [0, \~\-valued functions over ?. (ii) Let ? be the set of all functions f: ? —> [0, 1] which are equal to a continuous function g* e ?(?) except for a meager set. Then g* is uniquely determined by f, ? is a full tribe over ?, and the map f ь* g* ь* g is ? ? -homomorphism of ? onto M. The proof depends on the following LEMMA 1.19. Adopt the above notation. Let ? be a ?-complete divisible MV-algebra. Then the map а ь* a* is an isomorphism of ? onto ?(?) that preserves all countable suprema, in the sense that for any sequence a\ e ? the element (\J a,-)* is the supremum of the functions a* : ? —> [0, 1] in ?(?), (which need not coincide with the pointwise supremum of these functions). 1.6. Bibliographical remarks See, e.g., [21] for the Hubert space representation of finite systems, and for the C*-alge- braic representation of systems with infinitely many degrees of freedom, including quantum spin systems and their AF C*-algebras. Elliott's classification theory originated in [20]. See [27] for a comprehensive account, using ??-theory and partially ordered groups with the Riesz interpolation property, of which Abelian ?-groups are a special important case. Theorem 1.1 was proved in [63] using the ? functor and the main results of [57]. The monograph [13] gives a comprehensive and self-contained account of MV-algebras. The latter were introduced by C.C. Chang in [7,8] to give an algebraic proof of the
Probability on MV-algebras 879 completeness of the infinite-valued calculus of Lukasiewicz [45,46], [83, Chapter IV], [13,Chapter 4].See[9]forfurther historical remarks. All results of Sections 1.1 and 1.2 are due to Chang. Self-contained proofs are given in [13, Chapters 1 and 3]. The distributive law B) is a well known fact (the proof given in [13, Lemma 6.6.4] for complete MV- algebras can be trivially adapted to the present case). We refer to [26] and [2] for background on ?-groups. The equivalence between MV- algebras and Abelian ?-groups with strong unit was first obtained in [57]. An elementary presentation can be found in [13, Chapter 7]. The ? functor has an ubiquitous role in MV-algebraic probability theory, allowing the application to MV-algebras of various results originally proved for ?-groups. Further, the very definition of certain basic notions is naturally given by resorting to the ?-group with strong unit corresponding to M. This is the case, e.g., of partitions of unity, to be defined in Section 4.2. The purely MV-algebraic definition (which can be obtained from [57, Definition 3.2, Proposition 3.3]) would not be as useful and immediately clear as the group-theoretical definition given here. The same applies to the definitions of state, observable, product given in the next sections. As explained in [57], composition of ? with Grothendieck's Ко functor provides the link between countable MV-algebras and AF C*-algebras whose Murray-von Neumann order is a lattice. Proposition 1.10 is due to Jakubik [33]. The weak ?-distributivity of the underlying ?-group structure of any Riesz space V is a necessary and sufficient condition for every V-valued measure to be extendable from a subalgebra of a set to its enveloping ?-algebra (see [79, Chapter 3]). For a proof of Theorem 1.12(i) see [39,5, Proposition 3.2]. For Theorem 1.12(h) see [5, Proposition 3.3]. For Theorem 1.14 see [4, Theorem 2.6(c)] and [5]. Proofs can also be found in [79, 8.1]. A characterization of tribes is given in [41, Theorem 4.3]. Piasecki's concept of P-measure [67], [79, pp. 140-141] was a pioneering anticipation of the notion of state on a tribe. The MV-algebraic Loomis-Sikorski theorem was independently proved in [17,61], building on [29] via the ? functor. An analysis of divisible MV-algebras can be found in [19]. See [79, Chapters 3 and 9] for more information on weak ?-distributivity. 2. States and observables Caratheodory's approach to probability theory is based on the idea of disregarding sets of zero measure, and considering instead ?-complete boolean algebras equipped with a distinguished strictly positive probability measure. Our aim is to generalize this approach from ?-complete boolean algebras to ?-complete MV-algebras. 2.1. States and observables in MV-algebras The following is a generalization of Definition 1.13: Definition 2.1. Let ? be a ?-complete MV-algebra. A state is a map m : ? -+ [0, 1] satisfying the following conditions for all a, b, c, a„,b„ e M:
880 В. Riecan and D. Mundici (i) w(l)=l; (ii) whenever b,c e ? and с 0 b = 0 it follows that m(b ? с) =m(b) + m(c); (iii) if a„ /a, then m(a„) / m(a). We say that ш is faithful^ if ш(;с) ^0 whenever O^jc gM. The assumption с © /? = 0 in condition (ii) amounts to asking b ? с = b + с, where + is addition in the ?-group with strong unit corresponding to M, as given in Theorem 1.7 and D). Putting together Theorems 1.16 and 1,14 we have the following integral representation theorem, where о denotes composition of functions: PROPOSITION 2.2. Let ? be a ?-complete MV-algebra equipped with a state m. Let ? = M{M) be the space of maximal ideals of ? with the spectral topology. Let the tribe (?,?) and the ? -homomorphism ?: ?' —»· ? be as in Theorem 1.16. Then (?, J7) is a tribe and moijij a state on it. Let (??, <S, P) be as given by Theorem 1.14 starting from the triplet (?, ?, m ? ?). Let the map * : ? —>· ? send each a e ? into the unique continuous function a* e ? such that ?(?*) =a. Then for all a e M, m(a)= / a*UP. '? Generalizing Caratheodory's approach to boolean probability theory, we shall be mainly concerned with a situation where the state m is faithful, as follows: DEFINITION 2.3. A probability MV-algebra is a pair (M, m) where ? is a ?-complete MV-algebra and m is a faithful state on ?. We say that (??, m) is divisible if so is ?. We say that (M, m) is weakly ?-distributive if so is ?. Example, Let (?, J7) be a tribe with a state m. Let J be the set of all / e ? such that m(f) = 0, Then J is an ideal of the MV-algebra ?, and J is closed under countable suprema. The quotient MV-algebra ? = T/J is ?-complete,6 and the quotient map ?: ? —> ? is a ?-homomorphism. Two functions f,g e ? have the same image under ? iff m(\f - g\) = 0. Let the map m : ? -> [0, 1] be defined by m(d(f)) =m(f) for each element 9{f) e M. Then the map is well defined and is a faithful state on M, and (M, m) is a probability MV-algebra. A variant of Proposition 2.2 can be used to show that this construction yields the most general possible example of a probability MV-algebra. A particular case of this example is given by the real unit interval [0, 1 ] equipped with the state m(x) =x. As another particular case, let ? be a non-empty, finite or denumerably infinite, set of elements e\, ei equipped with a corresponding sequence p\, ??, ¦ ¦ ¦ of real numbers pi > 0 such that ]T; p,¦ = 1. Let ? be the set of all sequences /:?->· [0, 1] equipped Also called, strictly positive. Indeed, since ?IJ has a faithful state, it cannot have uncountably many non-zero pairwise disjoint elements. Thus, arguing as in the boolean case, we see that ?IJ is a complete MV-algebra.
Probability on MV-algebras 881 with termwise MV-operations. Let m : ? -> [0, 1] be defined by m(f) = J2i fiPi- Tnen m is a faithful state on the complete MV-algebra ?. A first property of probability MV-algebras is given by the following PROPOSITION 2.4. Any probability MV-algebra (M, m) is weakly ?-distributive. PROOF. Let bjj e ? be a double sequence having the property that, for each / e N, Ьц ^ bij+\ and /\jbij = 0, with the intent of proving 0 = /\фец\\ Vi^/'.tfx/')· Assume (absurdum hypothesis) 0 < b e ? is a lower bound for each element V, ^/.^аь where 0 ranges over NN. From the assumed faithfulness of w it follows that m(b) > 0. Since for each fixed /, b л />,y tends to 0 as j tends to infinity, then so does m(b л />,y). There is j* = ?(?) such that m{b л />,>(')) ^ m(b)/2'+]. By absurdum hypothesis, /? ^ V, ^?.?(?)· Then from the distributivity law B), we get the contradiction m (/>) = mO л \Ji />,>(')) = m(\/i(bAb,^(,)))^J2im(bAb,^0)) <m(b). D As usual, we let R denote the set of real numbers. In the point-free version of probability, the algebraic counterpart of a random variable is not given by a real-valued function over the sample space ? but, vice versa, by a function from the ? -algebra B(R) of Borel sets in R into the ?-boolean algebra of events. The explicit construction of sums and products of such random variables in terms of boolean operations was successfully carried out by Caratheodory. In order to generalize his results to MV-algebras we prepare DEFINITION 2.5. Let ? be a ?-complete MV-algebra. An ?-dimensional observable of ? is a map ? :B(R") —>¦ ? satisfying the following conditions: (i) jc(R")= 1; (ii) whenever А, В e B(W) and А П В = 0, then ? (?) ??(?) = 0 and x(A U B) = x{A)®x{B)\ (Hi) for all A, A,, A2,... e ?(R"), if A„ / A, then x(A„) / x( A). When ? = 1 we simply say that ? is an observable. Thus, condition (ii) above states that, whenever ? ? ? = 0 then x(AUB) =x(A) + x(B) in the ?-group with strong unit corresponding to ? via Theorem 1.7. Remark. In the particular case when ? is a ?-complete boolean algebra, an я-dimen- sional observable ? :B(W) —> ? is a ?-homomorphism into ? of the ?-complete boolean algebra ??(R") of all Borel sets in R". In general, this need not be true of ?-complete MV- algebras. Examples. l.Let (?,?, ?) be a probability space, ? :? —>¦ R a random variable, and J" the full tribe of all [0, 1 ]-valued 5-measurable functions on ?. Let the map ? : ??(R) —>¦ ? be defined by x{A) = x^-^A), where /? is the characteristic function of ? с ?. Then ? is an observable. 2. Let ? = [0, 1 ] with the usual MV-algebraic operations. Let ?: R ->· [0, oof be a (Borel measurable) density function, in the sense that f^° (p(t)dt = 1. Define ? : В(Ш) ->·
882 В. Riecan and D. Mundici ? by the formula ? (A) = f ? ??, where ? is Lebesgue measure. Then ? is an observable of M. 3. Again assume ? = [0,1]· Let v\,...,vm be a finite set of real numbers. Let (p\,...,pm) be an w-tuple of real numbers > 0 such that p\ + ··¦ + pm = 1· Let у: В(Ш) -> ? be defined by y(A) = ??-ел A· Then У is an observable of M. 4. Given a ?-complete ?V-algebra M, let G be the ?-group with strong unit и = 1 corresponding to M, as given by Theorem 1.7. Then, for any observable ? of M, the map ? :В(Ш) —> [0, и] с G also defines a group-valued measure, i.e., a positive, additive, continuous map from В(Ш) to G. Conversely, every group-valued measure у: ? (?.) —* G such that y(R) = и is an observable of M. The above construction yields the most general possible observable ? in ?. For instance, the observable of Example 1 arises as a group- valued measure ? : ? (?.) ->· G, where G is the ?-group of all measurable functions on ? with the constant function 1 as the strong unit. To construct the observable of Examples 2 and 3, one can similarly let (G, u) = (R, 1). Having thus described the most general construction of an observable, let us now give a down-to-earth property of observables, when combined with states: PROPOSITION 2.6. Let ? be ? ? -complete MV-algebra with an ?-dimensional observable ? : В(Ш") -> ? and a state m : ? -> [0, 1 ]. Define the composite map mx : B(W) -> [0, 1 ] by the stipulation that, for all X e B(R"), mAX) = (mox)(X)=m(x(X)). F) Then mx is a probability measure on B(M"). The proof is immediate. The above map mx has the same role as the probability distribution of a random variable in the classical Kolmogorov theory. One can for instance investigate conditions ensuring the existence of the moments of observables in probability MV-algebras, in the following sense: Definition 2.7. Let (M,m) be a probability MV-algebra. Let x:M -> B(R) be an observable of M. Then ? is said to be integrable in (M,m), and we write ? e L]m, if the expectation E(x)= I tdmx(t) G) exists. We say that ? is square integrable, in symbols, ? e L2m, if the dispersion (variance) ?2(?)= \\t-E(x)fdmx(t)= I t2 dm АО - ? (xJ (8) Jm J& exists. Also called the mean.
Probability on MV-algebras 883 2.2. Independence One of the most important notions in probability theory is independence. Let ? and ? be two random variables in a probability space (?, «S, P). We then have • a random vector ?: ? ->· R2 given by ? (?) = (? (?), /j(o>)) for each we ?, • a ?-homomorphism h: B(R2) ->· 5 given by А ь* Т~' (A), • a probability distribution Pt:B(R2) -> [0, 1] determined by the stipulation /MA) = Р(Г-'(А)) = Р(Й(А)). Two random variables ? and /j are said to be independent if for all Borel sets А, В с R, ? (? "'(?) ?/j (S)) = /'(?"'(?)) -P(/j_l(S)), where · denotes ordinary multiplication. Since ? ~' (?) ? rf' (?) = ?~' (? ? ?), the last identity can be written PT(A ? ?) = Pf (A) · ??(?) = (?? x P„)(A х ?), where ?? ? ?? is the product measure of the measures />? = Ро?-' and ?? = ?°?-\ (9) in agreement with the notation of F). Since ?? ? ?? coincides with ?? on rectangles, these two measures automatically coincide over all of ??(R2). It follows that Pj = ?? ? ?,. Since ?? = ? ? h, we can say that ? and /j are independent if and only if there exists a ?- homomorphism h:??(R2) -> 5 such that Poh = ?? ? ??. The generalization to n random variables is straightforward. We are thus led to the following Definition 2.8. Let (M,m) be a probability MV-algebra. We say that the observables x\,...,x„ are independent (with respect to m) if there exists an и-dimensional observable h:B(R")^ ? such that m(h(C\ x ¦ ¦ ¦ x C„)) = w(jc,(C|)) m{xn(Cn)) = mXl{C\) mx„(C„) forallC|,...,C„ eB(R). The map h is called the joint observable of x\,.. .,xn (with respect to m) in (M,m). While h is not uniquely determined, any two mappings h \ and /22, satisfying the conditions in the above definition automatically satisfy moh\ =mohi- Indeed, moh\ and m о /г 2 must coincide on all sets of the form C\ ? Ci ? ¦ ¦¦ ? C„, whenever C\,..., C„ e ??(R). Since m о h \ and m о hi are probability measures on ??(R") they must coincide over allof?(R"). In the particular case when ? is full tribe, ? comes equipped with pointwise multiplication of functions, by Theorem 1.12(ii). This additional structure allows one to drop the assumption that ? is equipped with a (faithful) state, and still yields a natural definition of joint observable. To this purpose one first proves the following key fact:
884 В. Riecan and D. ? undid LEMMA 2.9. Let (?, T) be a full tribe and x\,...,x„: B(R) -> ? observables. Then there exists an observable h : B(Rn) —> ? such that h{C\ x-xCn)=x](C])-x„(C„) forallC\,...,C„ eB(R). PROOF. For notational simplicity assume ? = 2. Fix a point ? e ?. For each / = 1,2 let ??., :В(Ш) -* [О, 1] be defined by ??.,(?) = (jt;(A))(a>). Each ??? is a probability measure. Let ??:?(??) ->· [0, 1] be the product measure ?? = ??,\ ? ??.2- Let the map h :B(R2) -* [0, \]? be defined by (fc(C))(w) = v„(C) for each С е ?(?2). For all ?, ? e #(R) and ? e ? we can write (й(А ? ?))(?) = (??.| ? ??.2)(? ? ?) = ??.? (?)??.2(?) = {?\{?)){?) ¦ (?2(?))(?) = {?? (?) ·?2(?))(?). The function h(C): ? -+ [0, 1] is measurable whenever С = ? ? ? and ?, ? e #(R), because /?(C) is then the product of two measurable functions. Further, the family of those Borel sets D for which h{D) is measurable is a ?-subalgebra of B(M.~), whence it must coincide with B(R2). In conclusion, h(C) e JF, for all С e B(R2) and /г is a two- dimensional observable such that, forall A, ? e ?(?),/г(А ? ?) =?\(?)?2(?). The proof is complete. ? By a slight abuse of terminology, one can naturally say that h is the joint observable of x\,...,x„ in the full tribe (?,?) - independently of any state m. When (?,?) happens to be equipped with a (not necessarily faithful) state m, we have the following equivalent reformulation of independence: PROPOSITION 2.10. Letx\,...,x„: B(R) -> ?be observables in a full tribe (?,?) with a state m. Then ? \,..., xn are independent {with respect to m) if and only if m(x\(C\)---xn (C,,)) =m(x\(C\))---m (x„(C„)) forall Ci,...,C„ eB(R). The proof is an immediate consequence of Definition 2.8, together with Theorem 1.12(H) and Lemma 2.9. 2.3. Kolmogorov 's construction A main idea of MV-algebraic probability theory is to represent every sequence of observables in a probability MV-algebra (M,m) by a sequence of random variables. The construction is straightforward in the particular case when ? \, x2, ¦ ¦. is an independent sequence of observables, in the sense that every finite subsequence x\,x2,.. .,xn is
Probability on MV-algebras 885 independent (with trivial adaptations for ? = 1). As a matter of fact, let wv,, mXl,... be their associated probability distributions mXi = m о jc,. One then constructs the infinite product measure ? of the mXj as follows: First Step. Let RN denote the set of all sequences of real numbers. Let С be the collection of all subsets С of RN that can be written in the form (П denoting here Cartesian product) C = f]Ci, A°) where C,· e B(R) for each / e N and the set {/ | С, ф R} is finite. (Note this expression is unique, unless С = 0.) Then elements of С are called the cylinders of RN. Second Step. Let a(C) be the ?-algebra generated by the family of all cylinders of RN. The probability measure ?: a(C) -+ [0, 1] is uniquely determined by the stipulation (? denoting here multiplication) Р(С) = \\тХ1(С{), (?) whenever the cylinder С еа(С) has the above form A0). DEFINITION 2.11. The above triplet (RN, a(C), P) is called the Kolmogorovprobability space of the observables jc,- in (M,m). For each ? e N, the canonical projection function („:RN^R given by 1„(U\,U2,...) = U„ is called the nth coordinate random variable of (RN, a(C), P). The function 7„ : RN -> R" given by 7„(И|,И2, ...) = (« I, «2, ¦¦·,"«) is called the «th coordinate random vector. 2.4. Functions of several observables Let /: R -+ R be a Borel measurable function and ? : ? (?.) -+ ? an observable. Then one can naturally define the observable f(x): B(R) -+ ? by stipulating that for each A e B(R.) (f(x))(A)=x(f-\A)). Using in a self-explanatory way the symbol о for composition of functions, we can write f{x)=X0f-\.
886 В. Riecan and D. ? undid More generally, given independent observables x\,...,x„: В(Ш) ->· ? in a probability MV-algebra (M, w), together with a Borel measurable function g: R" ->· R, the observable gC*i, ...,*„) is defined by the formula where h : В(Ш") —> ?? is the joint observable of ? \,..., x„. As an example, addition ? + у: ? (?.) ->· ? of independent observables л and у in a probability MV-algebra (M, w) is defined by stipulating that, for all X e ??(R) U+y)(X) = /J(plus-|(X)), where the function plus: R2 —> R is the natural addition operation, plus(w, v) = u + v. This definition is a counterpart of the classical identities for arbitrary random variables ? and ?, (? + ?)~\?) = (plus($, ?))~1(?) = (plus о Г) (X) = Г~' (plus (?)). Similarly, product is defined by (x ¦ y)(X) = h(times"'(X)), where the function times -. R2 —> R is multiplication. Further, let x\,..., x„ be independent observables in a probability MV-algebra (M,m) and let h : B(W) -^ МЫ their joint observable. Let the map average: R" -+ R be defined by 1 " average^ ?,..., i„) = - У^'с for all real numbers t\,..., tn. One can now define the arithmetical mean - X]' = | X\ of ? ?,..., x„ by writing 1 " - ) x,¦ = h о average-'. n '—' 2.5. The MV-algebraic central limit theorem THEOREM 2.12 (Central limit theorem). Given a probability MV-algebra (M,m), let ? ?, JC2, ... be an independent sequence of square integrable observables having the same probability distribution mXi = mXl = ¦ - ·. Let further a = E(x\) = E(xi) — ¦ ¦ ¦ be their common expectation, and assume that, for some 0 < ? e R, their common dispersion is given by ?2. Then for all t e R, (x\+ Vxn-na . \ 1 [' _.jL lim ml — (]-oo, t[) = —7= I e ? du. n^oo \ ajn ') slln J-oc
Probability on MV-algebras 887 PROOF. For each ? = 1, 2, 3,... let the Borel function gn : R" w\ + h wn — na be given by ?,,(»!,..., №,,): ? Un Let further (a) h„ :??(R") —>¦ ? be the joint observable of x\,X2, - ¦ -,xn\ (b) the observable y„:B(R) ->· ? be given by the stipulation y„ = /?„ о g~l = g„(x\,...,x„y, (c) the random variable tj„:Rn ->· R be given by /j„ = g„(i\,i2, ¦ --,'«) = ?>i °ii, where Г,, is the nth coordinate random vector in the Kolmogorov probability space (RN,a(C),P). By definition of joint observable, we then have the identities m о h„ = mx ? ¦ ¦ ¦ ? m , r-l ? ? ? = m_r, ? ¦¦¦ ? mXn =mohn Poin =Pln=mXn=moxn, ? Р°лп ¦m о y„. It follows that x\-\ \-x„ -na , . m| — (]-oo,i[) = w(g,,Ui,...,j:,1)(]-oo,i[)) :m(hn(g„ l(]-oo,i[))) = (w.v, ? ¦¦¦ xw.v„)(gn '(]-oo,i[)) -'(!¦¦ M| + ? у/? \) ¦+i„(u)-na \\ ??? *'])¦ The coordinate random variables i\,ii,... in (RN,a(C), P) have their common distribution Ph = Pn = - · · coinciding with the common distribution mXl =тхг = ¦ ¦ - of the observables x\,X2, Thus in particular, all i„ are square integrable, with non-zero common dispersion ?2 and expectation a. A moment's reflection recalling A1) shows that i ?, B,... is an independent sequence; as a matter of fact, Р(G1(А|)П.-.П(,71(Ап))=тл-,(А|)--.ш.с„(А,1) = Р1,(А|).--Р(„(А„) = Р((Г1(А|)).--Р((„-'(А,1)). By the classical central limit theorem, we have ? !(«) + ¦ lim ? ?—»oo ({.. .¦ + hl(u)-na < ]\ 1 f e-4du ? y/n J/ y/?? J-oc as required to complete the proof. D
888 В. Riecan and D. Mundici 2.6. MV-algebraic laws of large numbers Suppose ?\, 7j2, · · · to be a sequence of random variables in a probability space (?, S, P): (i) The sequence is said to converge in distribution to a function F :R ->· [0, 1], if for all t e R, lim P(/?,7l(]-oo,i[)) = F@. n—>oo x v " (ii) The sequence is said to converge to 0 /и measure P, if for each real ? > 0, (iii) The sequence is said to converge ?-almost everywhere to 0, if OO OO ОС p ПиГК'0-?/??/?) =1, \?=?*=??=?· / i.e., lim lim lim P( ?/,,;'(]-l/p,l/p[) ) = 1. \)i=k I The above classical notions have the following MV-algebraic counterparts: Definition 2.13. Let y\, yi,... be a sequence of observables in a probability MV-algebra (M ,m). (i) The sequence is said to converge in distribution to a function F :R —* [0, 1] if for each t e E, lim m(y„(]-oo,t[)) = F(t). (ii) The sequence is said to converge in measure to 0 if for each 0 < ? e R, lim m(y„(]-e,?[)) = 1. (iii) We say that the sequence converges to 0 almost everywhere (а.е.), if /k+i \ lim lim lim m Д yn(]-\/p, \/p[) = 1. /;^oci;^oc/^oc \' ? / \n=k PROPOSITION 2.14. Let x\,x2,..- be an independent sequence of observables in a probability MV-algebra (M,m), with h„:B(M") —>¦ ? being the joint observable of
Probability on MV-algebras 889 ? ?, X2, . .., x„. Let ? be the infinite product measure of the m V| as given by Kolmogorov 's construction A1). For each ? = 1, 2, 3, ... let g„ : R" —> R be an arbitrary Borel function. Let further the observable y„ : B(R) —>¦ ? be given by y„ =h„ogn] = g„(x\,..., x„), and the random variable ?„ : RN —> R be defined by ?„ = g„ G„), where T„ is the nth coordinate random vector, it follows that ? ? ?~' = m о у„ and (i) the sequence y\, yi,... converges in distribution to a function F if and only if so does the sequence ? ?, ??, ...; (ii) y\, yi,... converges to ? in measure m if and only if ? ?, /j:, ... converges to 0 in measure P\ (iii) if ?\,??, ¦ ¦ ¦ converges P-almost everywhere to 0, then yi, уз, ... converges m- almost everywhere to 0. The following results are MV-algebraic generalizations of two fundamental theorems in classical probability theory: THEOREM 2.15 (Weak law of large numbers). Given a probability MV-algebra (M,m), let x\,X2,... be an independent sequence of integrable observables having the same probability distribution mXl = wv, = ¦¦·. Let a = E(x\) = fU:) = ¦¦¦· Then the sequence of observables a (n = 1, 2, ...) n converges in measure to 0. THEOREM 2.16 (Strong law of large numbers). Given a probability MV-algebra (M,m), let x\,X2, ... be an independent sequence of square integrable observables such that ??^? ?2(??)/?2 < oo. Then the sequence of observables x\ - E(x\) + x2- E(x2)-\ \-x„-E(x„) (n = 1,2, 3, ...) n converges m-almost everywhere to 0. 2.7. Bibliographical remarks Caratheodory's point-free probability theory is expounded, e.g., in [6]. Early papers containing results about states and observables in MV-algebras are [10,69]. See [68] for a more recent paper. A more general class of states without the ?-additivity assumption was investigated in [59,60]. Also see [15]. They shall be discussed in Section 4 below. Probability MV-algebras generalize the classical notion of "probability algebra", or "?-complete boolean algebra with a normalized positive measure" (see [23], [24, 322A] and [31, p. 64]). The proof of Proposition 2.4 is new here, and is a natural generalization of its well known counterpart for ?-complete boolean algebras equipped with a faithful state, [24, 325 I-K].
890 В. Rlelan and D. Mundici When ? is a ?-complete boolean algebra, our definition of observable boils down to Sikorski's notion of real homomorphism [82, p. 152ff], and also coincides with Varadarajan's definition of observable [84, p. 14ff]. Up to restriction to unit intervals, both states and observables are instances of a unique notion: that of a normalized positive ?-additive group homomorphism between Dedekind ?-complete ?-groups with strong unit [29]. Lemma 2.9 is taken from [79, Theorem 8.3.2]. The idea of using the Kolmogorov construction in MV-algebras for a representation of observables by random variables was first presented in [71]. Further information can be found in [79, 8.4]. While the construction is used in this chapter only for sequences of independent observables, the same method can be applied, without assuming independence, to any sequence of observables having joint observables. This is the case, e.g., of full tribes with a state (by Lemma 2.9). By Theorem 3.6 below, this is also the case of all weakly ?-distributive MV-algebras with product (to be defined in 3.1 below). Our present proof of the MV-algebraic central limit theorem (Theorem 2.12) is a particular case of [79, Theorem 9.2.6]. The proofs of the identity ? ? ?~] = m о у„, as well as of conditions (i) and (ii) in Proposition 2.14 are straightforward. Condition (iii) follows from the inequality (k+i \ /k+i ?^?/?,?/?[)<™ ???-1/?,?/?? n=k I \n=k (see [79, Proposition 9.3.3]). A proof of Theorem 2.15 can be obtained from [37] and, in a more general context, from [75]. Theorem 2.16 is proved in [79, 9.3.4]. 3. MV-algebras with product As shown by Lemma 2.9, given a full tribe (?,?), for any two observables x, y:B(R.) —>· ? one can always construct a two-dimensional observable h : В(Ш~) —> ? such that h(C xD)=x(C)y(D). A2) While ? need not be equipped with any state m, in the above formula a crucial role is played by pointwise multiplication of the two [0, l]-valued functions ? (С) and y(D). With a view of giving a purely algebraic generalization of this construction, in this chapter we shall consider semisimple MV-algebras ? enriched by an additional "product" operation. Since ? is canonically isomorphic to an algebra of [0, l]-valued functions, we shall take care of relating the abstract product operation of ? with the natural pointwise multiplication of functions. We shall freely denote by 1 e ? the strong unit и of the ?-group corresponding to ?.
Probability on MV-algebras 891 3.1. Product and joint observables Definition 3.1. An MV-algebra with product is a pair (M, ¦). where ? is an MV-algebra and ¦ is a commutative and associative binary operation on ? satisfying the following conditions, for all a, b, с e ?: (i) 1 ¦ a = a; (ii) a-(bQ-'c) = (a-b)Q -*(a - c). Since the operation aQd = aO-*d coincides with truncated subtraction (a — d) ? 0 in the ?-group G with unit и = 1 corresponding to ?, then condition (ii) amounts to requiring the distributivity of ¦ over (truncated) subtraction. Examples. Every boolean algebra becomes an MV-algebra with product, by letting the product operation coincide with the infimum operation. Another example is given by the unit interval [0, 1] equipped with multiplication. More generally, every MV-algebra of [0, l]-valued functions (closed under pointwise multiplication and) equipped with pointwise multiplication is an example of an MV-algebra with product. As we shall see in Theorem 3.4 below, this is the most general possible example of a semisimple MV-algebra with product. The following are almost immediate consequences of the definition: PROPOSITION 3.2. in every MV-algebra with product (M, ¦) we have: (i) 0 ¦ a = 0; (ii) (monotony) ifa^b then с ¦ a < с ¦ b; (iii) аОЬ^а-Ь^алЬ; (iv) ifaQb = 0 then с ¦ (? ? b) = (c - ?) ? (с ¦ b) and by (ii), (c ¦ ?) ? (с ¦ b) = 0; (?) с ¦ (?? b) = (c - ?) ? (с ¦ b)\ (vi) с (а лЬ) = (с -а) л (с ¦ b)\ (vii) the congruences of(M, ¦) coincide with the congruences of the MV-algebra M. PROPOSITION 3.3. Let (M, ¦) be an MV-algebra with product. Fix an element a & ? and consider the map ?? : ? —>· ? given by ? ь* а ¦ ?. Then ?? is an MV-homomorphism of ? into the interval MV-algebra ([0, ?], -?,, ?„, Q0) of all elements of ? between 0 and a, with a being the unit o/[0, a], and the latter interval being equipped with the operations --„c = a ? с = a ?->c, b фн с = а Л (b ? с), Let now ? be a subalgebra of [0, 1]. Assume the operation *:M ? ? —>¦ ? makes (?, •) into an MV-algebra with product. Fix a e ? and, to avoid trivialities, assume ? ? 0. Then the above map ?? is an MV-homomorphism of ? into the interval MV- algebra [0, a]. By Theorem 1.7(ii), letting (G, a) be the subgroup of ? corresponding to [0, a], and (Я, 1) be the subgroup of R corresponding to M, there is an order preserving
892 В. Riecan and D. Mundici group homomorphism ??:? ->· G sending 1 into a, such that ?? = ?(??). From the general theory of ordered subgroups of R it follows that ?„ is multiplication by a. Thus the * operation coincides with ordinary multiplication, and ? is closed under such operation. Since the congruences of (M, ¦) coincide with the congruences of M, recalling Theorem 1.5 we have THEOREM 3.4. Let ? be a semisimple MV-algebra. Then there is at most one operation ¦ making (M, ¦) into an MV-algebra with product, if ? does admit such operation ¦ then, upon representing ? via the isomorphism а ь* a* onto a separating MV-algebra M* of[0, \\-v alued functions over the maximal ideal space ofM, it follows that M* is closed underpointwise multiplication. Further, map а ь* a* also preserves the product operation. By Theorem 1.12(H), every full tribe can be made into an MV-algebra with product. On the other hand, boolean tribes provide examples of tribes that can be equipped with a product, while not being full. More examples are given by the following variant of Theorem 1.12(H). A proof can be obtained using Theorems 1.18and3.4: PROPOSITION 3.5. For any ?-complete divisible MV-algebra ? there is precisely one operation ¦ such that (M, ¦) is an MV-algebra with product. As expected, the existence of a product operation simplifies the construction of joint observables. The proof of the following theorem relies on Theorem 1.7, together with classical results on extensions of group-valued measures: THEOREM 3.6. Suppose (M, ¦) is a weakly ?-distributive MV-algebra with product. Then for any two observables x,y:B(R) —> ? there exists an observable hx> ;B(R') —*¦ ? such that hxy(CxD)=x(C)-y(D) for all C,De B(R). Equipping ? both with a product and with a faithful state, we are led to consider the following DEFINITION 3.7. A probability MV-algebra with product (M,m, ¦) is a probability MV- algebra (M, m) equipped with a binary operation ¦ on ? making (M, ¦) into an MV-algebra with product8. Examples. 1. If (M,m) is a probability algebra, and ? is boolean, then we obtain a probability MV-algebra with product (M,m, ¦) by letting ? ¦ у = ? Л у = ? ? у for all ?, у е ?. 8Since M is complete (see Footnote 6), by a celebrated theorem of Stone and von Neumann, ? can be decomposed into a product ? ? ?, where A (resp., B) is the MV-algebra of all {0, l)-valued (resp., [0, l]-valued) continuous functions over an extremally disconnected space X (resp., Y).
Probability on MV-algebras 893 2. Let (?,?,?), be a probability space, and (?,?,??) be the full tribe of all «S-measurable functions f :? ->· [0, 1], with m(f) = jQfdP. As in the example following Definition 2.3, let У с f be the ideal of all / e ? with m(f) = 0. Let ? = T/ J. Then ?? is ?-complete. The quotient map ? :\T —> ? is а ?-homomorphism. The map m : ? ->· [0, 1] defined by m(9(f)) = m(f) is a faithful state on M. The probability MV-algebra (M,m) can now be enriched with a binary operation ¦ arising via ? from pointwise multiplication of two functions in ?. A straightforward verification shows that (M, m, ¦) is a probability MV-algebra with product. Since by Proposition 2.4, weak ?-distributivity is automatically ensured in all probability MV-algebras, Theorem 3.6 yields the following characterization of independence: PROPOSITION 3.8. Let (M,m, ¦) be a probability MV-algebra with product. Then two observables x,y: В(Ш) —>· ? are independent if and only if m(x(C) ¦ y(D))=m(x(C))m(y(D)) forallC,DeB(R). 3.2. Conditional expectation In order to define the conditional expectation ?(*|.?) of observables ? and у in a probability MV-algebra with product (M,m, ¦), we shall proceed by analogy with the classical case, making use of the joint observable hxy. Traditionally, given a probability space (?, S, P), by the conditional expectation of two random variables ? and ? we understand a Borel function ?(?\?): R -> R such that, for all В е B(R), ? ?(?\?)???= f ???. JB JirHB) The latter integral intuitively gives the expected value of ? once the value of ? is known to lie in B. This suggests that, given observables ? and у in a probability MV- algebra with product (M,m, ¦), we should define E(x\y) as a Borel function satisfying fB ?(;c|>')dwv = fy(B)xdm, and this leaves us with the problem of defining faxdm, for any fixed element a e ?. Again proceeding by analogy with the classical case, for each integrable random variable ? : ? —> R and С е S we can write [???=[ tXcdP, ? whence for each Borel set В we have , Cnf'(i) ifO^B, (^Xc)-|(S)=l (Cnr'(fi))U(i2\C) ifOeS.
894 В. Riecan and D. Mundici PROPOSITION 3.9. Let (M,m, ¦) be a probability MV-algebra with product. Let a e ? and ? be an observable of M. (i) Let the map xa : B(R) -> ? be defined by Xa(D) = < \?-(?(?)) + -? ifOeB. Then xa is an observable, and is integrable whenever ? is integrable. (ii) Define the indefinite integral of ? over a, in symbols f ? dm, by the stipulation J' ? dm = I a JR tdmXa(t). Then faxdm = j^xadm = E(xa). For every observable у of ? let the map к: B(R) -> ? be defined by k{B)= / xdm = E(xx(B))- Jy(B) it follows that к is a (possibly non-normalized) ?-additive measure, and к <К my. Definition 3.10. Let x, у be observables, with ? integrable, in a probability MV-algebra with product (M,m, ¦). Then the conditional expectation (of ? given y) is the Borel measurable function ? (? \ у): R —> R such that E(x\y)dmy= xdm = E(xy(B)), JB JviB) forallBeS(R). The existence of E(x\y) :R —>¦ R follows from the Radon-Nikodym theorem. THEOREM 3.11. Let x, у be observables, with ? integrable, in a probability MV-algebra with product (M, m, ¦). Lethxy be the joint observable of ?, у as given by Theorem 3.6 and Proposition 2.4. In the probability space (R2, ??(R2), m о hxy) let the random variables ? and ? be defined by ?(и, ?) = и and /j(m, v) = v. Then Pn = my and E{x\y) = ?(?\?), my-almost everywhere. One can now prove the following variant of the martingale convergence theorem. Here the sequence y\,y2, ¦¦ ¦ of conditioning observables is generated by a fixed observable у together with a sequence g\, gi,... of Borel measurable functions.
Probability on MV-algebras 895 THEOREM 3.12. Let (M,m, ¦) be a probability MV-algebra with product. Let g\,gi, ... be Borel measurable functions g„ : Ш —> Ш such that DC g (В(Щ) с g^, (B(R)) fl„d |J g-' (?(?)) = B(R). /?=? G/ve/г /iow да observable у : ??(R) —> ?? /ef ms write y„ = у о g~' a/id assume my <K w v„ (и = 1,2,...). Then for every integrable observable ? : ? (?.) —>¦ ? the sequence of Borel measurable functions E(x\y„) converges to E(x\y), m ^-almost everywhere. 3.3. Upper and lower limits As shown by Theorems 2.15 and 2.16, frequencies A /?) ]?"_, Xi converge to mean values. When such limits are not known a priori, one can still profitably use liminfs and limsups, as follows. Let ?i, ?2. - ¦ ¦ be a sequence of random variables in a probability space (?, S, P). Then for every ? e ? we have ОС ОС ЗС lim sup?„ (?) < t iff ? e (J (J f] ?~ ?=\ k = \ n=k This motivates the following: Definition 3.13. Given a sequence x\,xi,... of observables in a probability MV-algebra (M, m), we write ? = limsup,,^^ xn if ? : В(Ш) -+ ? is an observable having the following property: for every t e R, /¦V"i i-V^ '4V~- ^Aj v^* v^* i(]-oo,/[)=VVAx« /,= l k = \n=k Note that if another observable у satisfies the above condition then my = m;x. Similarly, we write lim inf,,^^ x„ = ? if ? is an observable satisfying the condition DC DC DC *(]-oo,f[) = \/Л\Л» p=\k=\n=k for alii el. PROPOSITION 3.14. Let x\,X2, ... be a sequence of observables in a probability MV- algebra (M,m). Suppose that both observables ? and * exist. Then for every feR, x(]-oo,t[) ^x(]-oo,t[). —00, t 1 ? -oo,? ? -00, t - PL
896 В. Riecan and D. Mundici Definition 3.15. A sequence ?\,??,... ofobservables in a probability MV-algebra (M, m) is said to converge m-almost everywhere to an observable ?, if both observables ? and * exist, and for each ieR, m(x(]-oo,t[)) =m(x(]-oo,t[)) = m(.x(]-oo, t[)). Remark. As shown by the following proposition, in the case of probability MV-alge- bras with product, one recovers Definition 2.13(iii) by letting ? coincide with the zero observable Ом. The latter is given by 0M(A)= { [0 ifO^A. PROPOSITION 3.16. A sequence ofobservables in a probability MV-algebra with product (M,m, ¦) converges m-almost everywhere to the zero observable Ом if and only if it converges m-almost everywhere to 0. The following result is the key tool for the proof of the /.''-completeness Theorem 3.19 and the Individual Ergodic Theorem 3.21 below. Note that we are not assuming our sequence of observables to be independent. All we need to define g(x\,..., x„) is the existence of joint observables, and this is ensured by Proposition 2.4 and Theorem 3.6. THEOREM 3.17. Let ? \, xj,... be a sequence of observables in a probability MV-algebra with product (Ai,w,·). Let и,'2,··· be the coordinate random variables in the Kolmogorov probability space (RN, a(C), P). Let g\,gi,-.- be a sequence of Borel measurable functions, where g„ : R" —> R. If the sequence g\ (? \), g2(i ?, ?),... converges ?-almost everywhere, then the sequence ofobservables g\(x\),g2(x\,xi), ¦ ¦ ¦ converges m-almost everywhere to an observable. Further, we have the identity P(^u eRN | Jim^gnGn(u)) < ij) = m(limsupg„(*?,.. .,x„)(]-oo,t[fj for every ieR. The following definition generalizes the notion of integrable and square integrable observable, Definition 2.7: DEFINITION 3.18. Fix a real number ? ^ 1. Given a probability MV-algebra (M,m, ¦) with product, an observable ? : ? (?.) ->· ? is said to be in L','„ if the integral / Js. \t\pdmx@ exists.
Probability on MV-algebras 897 The map ?'. ъ\п ? LJjj —>· К is defined by stipulating that, for any ?, у 6 l^m·* where the joint observable hxy: M.2 —>¦ ? is as in Theorem 3.6, and the function g: R2 ->· R is given by g(u, v) = \u — v\. THEOREM 3.19. lf(M,m,)isa probability MV-algebra with product then (L','„, p) is a complete pseudometric space. The proof follows from Proposition 2.4, and Theorems 3.17, 3.6, together with the classical Lp-completeness theorem. 3.4. Individual ergodic theorem The individual ergodic theorem is traditionally formulated for dynamical systems (? ,<S, ?, ?), where (?,?, ?) is a probability space and ?: ? —>¦ ? is a measure-preserving map, i.e., ? satisfies the following condition: whenever ? eS then Г-|(У) e<S and P(T'](Y)) = P(Y). A3) The theorem states that whenever /: ? —>¦ ? is an integrable function, then for P-almost all ? e ? the limit jfi-l /»= lim -YfoT'(w) n—>oc ? '—' ?=0 exists. Further, ?(/*) = E(f). To give an MV-algebraic generalization to (M,m, ¦), we shall replace the random variable / by an observable ? : B(R) -> ?. We shall also replace the transformation ? : ? —> ? by a mapping ? : ? —> ?? as follows: Definition 3.20. Let (M,m,-) be a probability MV-algebra with product. Then a mapping ?:?? —> ?? is said to be an m-preserving transformation, if the following conditions are satisfied: (i) г(и) = и; (ii) if a + b sC u, then ? (? + b) = ? (a) + z{b) for all a, b e ?? (with reference to the ?-group with unit и = 1 corresponding to ?); (iii) if ?„ /?, then ?(?„) /* ? (?); (iv) w(r(ai) ¦ ?{?2) ¦ ¦ -x{ak)) = m{a\ ¦ ai ¦ ¦ a*) for all a\, ..., a„ e M. The above condition (ii) can be equivalently rephrased as follows: whenever a,b e ? satisfy ? ? b = 0 then ? (?) ? ?(/>) =0 and ? (? ?/?) = ?(?)? ? ?). The following result is an MV-algebraic extension of the individual ergodic theorem; its proof follows from Proposition 2.4, and Theorems 3.17,3.6, together with the classical individual ergodic theorem.
898 В. Riecan and D. Mundici THEOREM 3.21. Let{M,m,)bea probability MV-algebra with product. Let ? : ? (?.) -> ? be an integrable observable. Let ? : ? —>¦ ? be an m-preserving transformation. Then there exists an integrable observable x* such that E(x*) = E(x) and lim ^=^ -=x* /7—»00 Ц m -almost everywhere. 3.5. Bibliographical remarks In defining MV-algebras with product we have followed Montagna's approach [56, Definition 2.8]. The assumption that there exists a product in a semisimple MV-algebra ? is then equivalent to saying that ? is multiplicative in the sense of [61, Section 5]. To see this one notes that a commutative bimorphism ([61, Section 2]) ? of A x A into A such that ?(\,?) = ? equips the MV-algebra A with a product operation. Conversely, in any MV-algebra ? with product ·, for each fixed a e ? the map ??:? \-+ a ¦ ? is a homomorphism of ? into the interval MV-algebra [0, a], and ?\ is the identity homomorphism over ?. MV-algebras with product are an interesting object of study in se, owing to their relationship with real closed fields. (See, e.g., [56].) For a proof of Proposition 3.2 see [56, Lemmas 2.9 and 2.11]. Proposition 3.3 is now a consequence of [61, Proposition 2.3]. Theorem 3.4, as well as the analysis of the possible products in subalgebras of [0, 1] are due to Ioana Leustean (private communication to D.M.). The result on subgroups of R mentioned before the statement of Theorem 3.4 can be found, e.g., in [26, Proposition 2, p. 46]. Proposition 3.5 was first proved in [19]. As already noted, any tribe which is closed under pointwise multiplication is automatically a ?-complete MV-algebra with product. The structure of all such tribes was characterized in [52] (when the universe is finite or denumerably infinite) and [53] (in the general case). One can also find in the literature a weaker notion of product, as follows: Definition 3.22 ([14,19,34,72]). A generalized product in an MV-algebra ? is an associative and commutative binary operation * satisfying the following conditions, for all a,b,c,an,b„ e M: (I) и*и = и. (II) If ? ? b = 0 then (c * a) Q (c * b) = 0 and с * (? ? b) = (с * а) ф (с * b). (Ill) If a„ \ 0 and b„ \ 0, then an *bn\0. With reference to the addition operation in the ?-group with unit и corresponding to M, the premise in condition (ii) can be equivalently stated as follows: a + b < u. The conclusion can be stated as a distributive law, as follows: for every с е М, с * (a + b) = c*a + c + b.
Probability on MV-algebras 899 As noted by Montagna in [56, p. 98], the ¦ operation in MV-algebras with product satisfies all conditions of Definition 3.22. Suppose now condition (I) in the definition is naturally strengthened as follows: (?) u*x = ? for all ? e M. Then the resulting operation * is monotone, and condition (III) is automatically satisfied, together with the inequality ? * у < ? л у.9 As already noted, every MV-algebra with product satisfies conditions (Г) and (II). Conversely, Jakubik (private communication to B.R.) proved that any associative commutative operation satisfying conditions (Г) and (II) automatically satisfies Montagna's conditions in Definition 3.1. As a first step, one derives from (Г) and (II) the distributivity of * over the ?V-algebraic ? and л operations. Then, a variant of [61, Propositions 2.2] yields distributivity of * over the ? operation. Many results about independent and joint observables were originally proved for the larger class of MV-algebras equipped with the above generalized product. In the papers [22] and [86] one can find proofs of the extension theorem used in the proof of Theorem 3.6. The proof of this latter theorem was first given in [72, Theorem 2] using [79, Theorems 3.4.2 and 9.5.3]. The special case of full tribes had been considered in [54]. The proof of Proposition 3.9 can be found in [35, Section 2]. For its tribe-theoretic counterpart see [74, Proposition 1 ]. Theorems 3.11 and 3.12 are proved in [35, Theorems 1 and 2] respectively. For a proof of the tribe-theoretic version of Proposition 3.14 see [79, Proposition 8.6.4]. The present algebraic version is similarly proved, as a straightforward consequence of the definitions. A proof of Proposition 3.16 can be found in [78, Theorem 3]. (The tribe- theoretic variant was proved in [79, Proposition 8.6.6].) Also Theorem 3.17 follows from its tribe-theoretic counterpart [79, Theorem 8.6.9]. For a proof of the present MV-algebraic version see [78, Theorem 2]. Under the assumption of weak ?-distributivity, the Lp-completeness Theorem 3.19 was proved in [76] (while the special case of full tribes with a distinguished state had been considered in [73]). The proof of Theorem 3.19 follows from the proof of [76], together with Proposition 2.4. Under the same redundant assumption of weak ?-distributivity, together with an additional boundedness assumption, the Individual Ergodic Theorem 3.21 was first proved in [36, Theorem 4], for probability MV-algebras with product. Actually, the boundedness hypothesis can be dropped using Theorem 3.17 together with the proof of [79, Theorem 8.7.2] dealing with the special case of full tribes. 4. Finitely additive measures In this section we shall relax the ?-completeness assumption in the definition of state, and apply the resulting finitely additive measures to various topics in probability theory, such as partitions and entropy of dynamical systems. We shall also relate finitely additive measures in MV-algebras to tracial states in their corresponding AF C*-algebras. Unless otherwise specified, throughout this chapter, ? is a not necessarily ?-complete MV-algebra. In the literature, such operation is sometimes called a "strong product".
900 В. Riecan and D. Mundici 4.1. Basics DEFINITION 4.1. A (normalized) finitely additive measure*0 on an MV-algebra ? is a function ? : ? ->· [0, 1] с R satisfying the following conditions: A) M(D=1, B) for all a,be ? with ? ? b = 0 we have ?(?) + ?(/>) = ?(? ? />). As above, we say that ? is faithful if ?(?)^0 whenever 0 ? a e M. We say that ? is invariant if for every automorphism ? of ? and every aeMwe have ? (?) = ? (? (?)). PROPOSITION 4.2. Lef ? foe a finitely additive measure on an MV-algebra M. Then we have: (i) ? /s a valuation: ?(?) + ??) = ?(? ? b) + ?(? Ob) for all a, b e M; (ii) ? /s monotone: ifa^b then ?(?) ? ??)\ (iii) if ? is faithful then ? is strictly monotone: a < b implies ?(?) < ??)\ (iv) ? is also a valuation with respect to the underlying lattice order of M; stated otherwise, for all a, b e M, we have ? (?) + ??) = ?(? ? b) + ?(? л b)\ (?) ? is subadditive, in the sense that ? (a V b) ^ ? (? ? b) ^ ? (?) +??)· PROPOSITION 4.3. The function ?: ? ? ?? -> [0, 1 ] g/ve/г by ?(?, fo) = ?(\? - b\) = ?((? ? ^b) ®(bQ -a)) /? a pseudometric on M. This is a metric iff ? is faithful. PROPOSITION 4.4. Every countable semisimple MV-algebra has a faithful finitely additive measure. Conversely, if a (possibly uncountable) MV-algebra ? has a faithful finitely additive measure then ? is semisimple. The finitely additive measures of ? form a convex set, which inherits the topology of the product space [0, 1 ]M of all [0, 1 ]-valued functions defined over ?. THEOREM 4.5. Let ? be an MV-algebra. Then the set of extremal finitely additive measures of ? equipped with the topology inherited by restriction from the product space [0, \]M is a compact Hausdorff space which is homeomorphic to the space M(M) of maximal ideals of ? with the spectral topology. Every finitely additive measure of ? is in the closure of the convex hull of the set of extremal finitely additive measures of M. Closing a circle of ideas originating in our prologue, we now discuss the relationship between finitely additive measures in MV-algebras, and "states" on C*-algebras. To this purpose we prepare: Dehnition 4.6. Let ? be a C*-algebra. 'Called "state" in [59.60].
Probability on MV-algebras 901 (i) A tracial state on Л is a normalized positive linear (complex-valued) functional s satisfying s(aa*) = s(a*a) for all a e A. We say that s is faithful if 0 ? a* a e A implies 0^s(a*a). (ii) A [0, l]-valued map m defined on the projections of A is called a measure on the projections of A if m(\j\) = 1 and m(p) + m(q) =m{p + q) whenever pq = 0. We say that m is faithful if ? ? 0 implies m{p) ? 0. We say that m is invariant if m(p) =m(a(p)) for every projection ? and every automorphism a of A THEOREM 4.7. Le? Л foe да AF C*-algebra whose Murray-von Neumann order is a lattice. Let ? be the corresponding MV-algebra as given by Theorem 1.1 (ii). Then we have: (i) Invariant finitely additive measures of ? are in one-one correspondence with invariant measures on the projections of A. Faithful invariant finitely additive measures correspond to faithful invariant measures. (ii) Finitely additive measures of ? are in one-one correspondence with tracial states of A. Faithful finitely additive measures of ? correspond to faithful tracial states of A. 4.2. Entropy of dynamical systems Given a probability space and a partition whose /th component has probability p,, one defines the entropy of the partition by - ? Pi log Pi. This is naturally extended to partitions of unity in MV-algebras, as defined below. The simplest way to construct joint refinements of partitions of unity is to assume that ? is a (not necessarily ?-complete) semisimple MV-algebra with product. Dehnition 4.8. A partition of unity in a semisimple MV-algebra with product (M, ·) is an и-tuple A = (??,..., a„) of elements of ? such that a\ ? \-a„=\, where + is addition in the lattice-ordered group with strong unit corresponding of M. If В = (b\,..., Ь^) is another partition of unity in M, then the product joint refinement ? ? ? is the (nk)-tup\c defined by ? ? ? = (a,¦ ¦ bj | / = 1,..., n; j = 1,... ,k). It is easy to see that ? ? ? is again a partition of unity. Given partitions of unity ? ?,..., A,„ in M, we shall denote by V!'=i A tneir product joint refinement.
902 В. Riecan and D. Mundici Dehnition 4.9. Let (M, ·) be a semisimple MV-algebra with product. A generalized dynamical system is a quadruple (??,-,?,?/), where the maps ? : ? —>¦ [0, 1] and U :M —>· ?? satisfy the following conditions, for all ?, b e ??: (i) ? is a finitely additive measure on M; (ii) if a + b ^ и, in the ?-group with strong unit и = 1 corresponding to M, then I/(a+fc) = I/(a) + I/(fc); (iii) t/(w) = и; (iv) ?(?/(?)) = ?(?); (?) U(a-b) = U(a)-U(b). Condition (ii) can be equivalently rephrased as follows: whenever ? ? b = 0 then U(?) ? t/(fo) = 0 and U(a ®b) = U(a) ? U(b). As usual, we define the entropy function ?:[0, 1 ] —> [0, 1 ] by @ if jc =0, <p(jc) = | I — .xlog.x if jc > 0. Dehnition 4.10. Let A = (a\,.. .,a„) and В = (b\,. ..,bk) be two partitions of unity in a semisimple MV-algebra with product (M, ·). Assume ? to be equipped with a finitely additive measure ?. Then we define the entropy of A by ? ?{?) = ??{?(?{)) A4) ?' = ? and the conditional entropy ?(?\?) = ±^??])?(>^-?\ A5) where all summands with ?(?>^) = 0 are omitted. Dehnition 4.11. Given a partition of unity A = (??,..., a„), we write i/°(A) = A, and inductively, ?/1(?) = (?/(?,),...,?/(?„)), ?/2(?) = (?/(?/(?,)),...,?/(?/(?„))), .... Note that i/(A) is a partition of unity. Upon letting a„ = Я(\/"~о U'(A)), one can prove the subadditive law an+m ^ a„ + a,„ for all и, т е N. This implies the existence of the limit h(A,U)= lim -HI V i/'(A) |. A6) \i=0 /
Probability on MV-algebras 903 THEOREM 4.12. Adopt the above notation and assumptions. Suppose that the semisimple MV-algebra with product (M, ·) is equipped with a finitely additive measure ?. Then for any two partitions of unity A and С in ? we have the inequality h(A,U) ^ h(C, U) + НЩС). 4.3. Entropy of full tribes Let (?,?, ?) be a probability space and, as in A3) above, assume ?:? —>¦ ? to be a measure preserving transformation, in the sense that for all ? e S, T~\E) e S and P{E) = P(T~[ (?)). Let (?, ?) be the full tribe of all 5-measurable functions from ? to [0, 1]. Equip (?, Л with the state m : ?-> [0, 1] given by m{f) = fn fdP. A partition of unity A = (f\,..., /„) in (?,?) is said to be boolean if all functions fi are {0, 1 (-valued. We let V denote the set of all boolean partitions of unity in (?,?). Further, we let U denote the family of all partitions of unity in (?, J7). Upon defining the map U: ? -> ? by U(f) = f о Г for each / e T, we obtain a generalized dynamical system (T,m, U) in the sense of Definition 4.9. Recalling A4)- A6), let us consider the quantity sup{h(A,U)\AeV]. A7) This is known as the Kolmogorov-Sinai entropy of (J7, m, U). Suppose now that the boolean partition A* e V is a generator, in the sense that the ?- algebra generated by the set IJ^o W (^*) coincides with S. Then the Kolmogorov-Sinai theorem establishes the identity sup{h(A,U)\AeV} = h(A*,U). A8) One can prove the following tribe-theoretic generalization of the Kolmogorov-Sinai theorem: THEOREM 4.13. Adopt the above notation, and again assume the boolean partition A* to be a generator in the generalized dynamical system {T, m,U). Then for every partition of unity A = (g\,... ,gk) eW we have the inequality h(A, U) ? h(A*, U)+ [ Y<p(gi)dP. A9) To obtain from this result the classical Kolmogorov-Sinai theorem, it is enough to note that, in the particular case when A = (g\,... ,gk) is a boolean partition of unity, then <p(gj) = 0, whence from A9) we get A8). Generalizing A7), we can now try to introduce a "tribe-theoretic" notion of entropy, by considering the quantity hu=sup{h(A,U)\ AeU},
904 В. Riecan and D. Mundici and more generally, for each subset Q of T, let hu.g = sup{h(A, U)\A = {gi,...,gk)eU, gi e Q]. This definition has the following drawback: PROPOSITION 4.14. If Q с ? contains all constant functions /: ? -> [0, 1], then hu.g = oo. Thus in particular, h\j = oo. Hudetz suggested the following alternative notions: DEnNlTlON 4.15. Let A = (/?, ·.., ?) be a partition of unity in a full tribe (?,?) equipped with a (not necessarily faithful) state m and a map U as above. Then we let 1 = 1 \i = l / and hb(A,U)=Yim ??,1\/?'(?)\. Further, for each subset Q of ? we let the quantity hv g be defined by hbug=sup{hb(A,U)\A = (f^...,fk)eU, f eQ]. Using Theorem 4.13 we then obtain the following tribe-theoretic version of the Kolmogorov-Sinai theorem, with respect to Hudetz entropy: THEOREM 4.16. Let (?,?,?) be a probability space equipped with a measure preserving transformation T. Let ? be the full tribe of all functions f : ? —>¦ [0, 1] which are measurable with respect to S. Let U :T'-> ? be given by U(f) = f oT. Suppose the boolean partition A* = (g*,..., g?) is a generator ofS. Also assume Q to be a subset of ? such that g* e Q for each i = l,...,k. Then hbug = hb(A*, U) = h(A*, U). 4.4. Bibliographical remarb Finitely additive measures on MV-algebras, called "states" by the author, were first investigated in [59] as averaging procedures for the truth-value of propositions in the infinite-valued calculus of Lukasiewicz. These measures correspond, via the ? functor, to normalized positive real-valued homomorphisms (called "states" the literature) of ?-groups with strong unit (see, e.g., [29]). They also find an interesting application in the probabilistic approach to the Ulam-Renyi game of Twenty Questions with lies [28].
Probability on MV-algebras 905 The correspondence between finitely additive measures on MV-algebras and AF C*-algebraic tracial states was established in [60]. Proposition 4.3 immediately follows from Proposition 4.2(i) recalling the basic properties of Chang's distance function [7,13, Proposition 1.2.5]. Proofs of all the other results of Section 4.1 can be found in [59,60]. In this latter paper the reader can find further references to measures on projections. Theorem 4.12 was first proved by [64, Proposition 8] building on [65], for MV-algebras equipped with the generalized product of Definition 3.22, and without assuming semisimplicity (compare with [79, Theorem 10.2.14]). For a proof of Theorem 4.13, Proposition 4.14 and Theorem 4.16, see [79, Theorem 10.3.4], [79, Proposition 10.3.7] and [79, 10.3.16], respectively. Entropy as an invariant for dynamical systems was introduced in [42]. The tribe-theoretic generalization was given in [80] and was further developed in [16,48,55]. See [79, p. 250] for more information. In [66] the Kolmogorov-Sinai theorem is extended to to dynamical systems based on general T-norms. In his recent paper [1] Ban, building on previous work of [ 16], presents an axiomatic approach virtually encompassing the most general dynamical systems. It turns out that Ban's theory can be reduced to two known cases: the one first considered in [49], and actually collapsing to the classical boolean case (as shown in [51], [79, 10.5]), and the one presented here in Section 4.3 (as shown in [79, 10.4]). According to the notion of partition of unity presented here, (Definition 4.8) a component a, may be zero; further, a component may occur several times. Thus a partition of unity is not a set, in general. Further, partitions of unity do not generalize the classical notion of boolean partitions, because all components of a boolean partition are non-empty. In his definition [60, Section 5], [62] of MV-algebraic partition on ?, the author assumes a multiset of components, thus incorporating the integer multiplicity of each a,; components are further assumed to be linearly independent (with respect to the Z-module structure of the ?-group corresponding to M). In this way, zero components are automatically excluded, as in the classical definition of boolean partition. The joint refinability of MV-algebraic partitions in any MV-algebra is a deep result of Marra [50]. 5. Open problems A) For every probability space (?, S, P), element A eS and ?-subalgebra ? of S, the conditional probability of A with respect to ? is a T-measurable function P(A\T)::? -> R such that, for all С eT, J P(A\T)dP = P(ADC). Generalize this to the case of a probability MV-algebra (M,m,) with product, together with an element a & ? and a ?-subalgebra ? of M. B) Prove a strong version of the martingale convergence theorem leading to a proof of the Kolmogorov-Sinai theorem on generators in general MV-algebras (compare with [79, 10.3.3]). C) Extend Maharam's classification [47] to divisible probability MV-algebras, and to probability MV-algebras with product.
906 В. Riecan and D. Mundici D) Given a ?-complete MV-algebra with product (?, ¦), consider the following related MV-algebras: • its boolean skeleton B(M), i.e., the ?-subalgebra of ? given by the elements a e ? such that ? ? a = a; • its boolean quotient Q(M) which is obtained by canonically representing ? as a separating MV-algebra of functions over M{M), and then restricting any such function / to the set of all points corresponding to those maximal ideals J such that M/J is finite (whence, M/J = {0, 1}); • the quotient D(M), which similarly arises by restricting / to the set of all maximal ideals J such that M/J is infinite (whence M/J = [0, 1]). Under which conditions can ? be uniquely recovered (up to isomorphism) from the above three algebras? For tribes closed under multiplication, see the analysis in [52,53]. E) Given two ?-complete MV-algebras ? and ?, let us agree to define their ?-tensor product by the following procedure: • let R = ? ® ? be their semisimple tensor product as in [61]. In the particular case when ? and N are boolean, one obtains their "free product"; • in the light of [61, Theorem 4.3], identify R with a separating MV-algebra of [0, 1]-valued continuous functions over the maximal ideal space M(R) = M{M) xM(N); • let Я+ be the smallest ?-complete MV-algebra of [0, l]-valued functions over M(R) containing R. Warning: already in the boolean case, the maximal ideal space of M(Rf) need not coincide with M{R): indeed, the product of two basically disconnected spaces need not be basically disconnected. Assuming now ? and N to be probability MV-algebras, generalize the classical theory of "stochastically independent" ?-subalgebras А с ? and В с N as defined in Fremlin's treatise [24, 325L]. F) Extend to probability MV-algebras with product the classical boolean construction of free products of probability algebras [24, 325J]. G) Given partitions of unity A and В in an MV-algebra with a finitely additive measure ? (without assuming ? to have a product) investigate alternative notions of "joint refinement" ? ? ? of ? and В, replace the product preservation condition (v) in Definition 4.9 by an appropriate "joint u-refinement" preservation property. Then develop the Kolmogorov-Sinai theory for the corresponding dynamical systems. References [1] A.I. Ban, Entropy oj fuzzy ? dynamical systems, J. Fuzzy Math. 6 A998), 351-362. [2] A. Bigard, K. Keimel and S. Wolfenstein. Groupes et Anneaux Reticules. Lecture Notes in Math.. Vol. 608, Springer, Berlin A971). [3] G. Birkhoff and J. Von Neumann. The logic of quantum mechanics, Ann. Math. 37 A936), 823-843. [4] D. Butnariu, Values and cores of fuzzy games with infinitely many players, J. Game Theory 16 A987), 43-68. [5] D. Butnariu, E.P. Klement, Triangular Norm-Based Measures and Games with Fuzzy Coalitions. Kluwer. Dordrecht A993).
Probability on MV-algebras 907 [61 C. Caratheodory. Mass und Integral und ihre Algebraisierung. Birkauser, Boston A956). English translation: Algebraic Theory of Measure and Integration. Chelsea. New York A963). [7] C.C. Chang, Algebraic analysis of many-valued logics. Trans. Amer. Math. Soc. 88 A958), 467^490. [8] C.C. Chang, A new proof of the completeness of the Lukasiewicz axioms. Trans. Amer. Math. Soc. 93 A959), 74-90. [9] C.C. Chang, The writing of the MV-algebras, Studia Logica 61 A998), 3-6. (Special issue on Many-Valued Logics, D. Mundici, ed.) [10] F. Chovanec, States and observables on MValgebras, Tatra Mt. Math. Publ. 3 A993), 55-65. [11] F. Chovanec andE. Rybarikova, Ideals and filters in D-posets, Intemat. J. Theoret. Phys. 37 A998), 17-19. [12] R. Cignoli and D. Mundici, An invitation to Chang's MV-algebras, Advances in Algebra and Model-Theory, M. Droste and R. Gobel, eds, Gordon and Breach, Readings, UK A998), 171-197. [13] R. Cignoli, I.M.L. D'Ottaviano and D. Mundici, Algebraic Foundations of Many-Valued Reasoning, Trends in Logic, Studia Logica Library, Vol. 7, Kluwer, Dordrecht B000). [ 14] A. Di Nola and A. Dvurecenskij, Product MV-algebras, Multi-Valued Logic 6 A-2) B000). 193-215. [15] A. Di Nola, G. Georgescu and A. Lettieri, Extending probabilities to states on MV-algebras, Coll. Logicum, Ann. Kurt-Godel-Soc. 3 A999), 31-50. [16] D. Dumitrescu, Fuzzy measures and the entropy of fuzz\ partitions, J. Math. Anal. Appl. 176 A993), 359- 373. [17] A. Dvurecenskij, Loomis-Sikorski theorem for ?-complete MV-algebras and l-groups. J. Australian Math. Soc. Series A 67 A999), 1-17. [18] A. Dvurecenskij and S. Pulmannova, New Trends in Quantum Structures, Kluwer, Dordrecht B000). [19] A. Dvurecenskij and B. Riecan, Weakly divisible MV-algebras and product. J. Math. Anal. Appl. 234 A999), 208-222. [20] G.A. Elliott, On the classification of inductive limits of sequences of semisimple finite-dimensional algebras. J. Algebra 38 A976), 29^44. [21] G.G. Emch, Mathematical and Conceptual Foundations of 20th Century Physics, North-Holland, Amsterdam A984). [22] D.H. Fremlin, A direct proof of the Matthes-Wright integral extension theorem, J. London Math. Soc. 11 A975), 276-284. [23] D.H. Fremlin, Measure algebras. Handbook of Boolean Algebras, Vol. 3, J.D. Monk, ed.. North-Holland, Amsterdam A989). [24] D.H. Fremlin, Measure Theory, Available from the author's site in the University of Sussex A995). [25] R. Fric, On observables. Int. J. Theor. Phys. 39 B000), 673-682. [26] L. Fuchs, Partially Ordered Algebraic Systems, Pergamon Press, NY A963). [27] K. Goodearl, Notes on Real and Complex C*-algebras, Shiva Mathematics Series, Vol. 5, Birkhauser, Boston A982). [28] B. Gerla, Conditioning a state by a Lukasiewicz event: a probabilistic approach to Ulam game, Theoret. Comput. Sci. 230 B000), 149-166. [29] K.R. Goodearl, Partially Ordered Abelian Groups with Interpolation, Amer. Math. Soc., Providence, RI A986). [30] S. Gudder. Fuzzy probability theory. Demonstrate Math. 31 A998), 235-254. [31] PR. Halmos, Lectures on Boolean Algebras, Springer, New York A974). [32] T. Hudetz, Quantum topological entropy: first step of a pedestrian approach. Quantum Probability and Related Topics, VIII. L. Accardi et al., eds. World Scientific, Singapore A993), 237-261. [33] J. Jakublk, On complete MV-algebras, Czechoslovak Math. J. 45 A995). 473^480. [34] J. Jakublk, On the product MV-algebras with product, Czechoslovak Math. J., to appear. [35] M. Jureckova, On the conditional expectation on probability MV algebras with product. Soft Comput. 5 B001), 381-385. [36] M. Jureckova, A note on the individual ergodic theorem on product MV-algebras, Intemat. J. Theoret. Phys. 39 B000), 753-760. [37] M. Jureckova and B. Riecan, Weak law of large numbers for weak observables in MV-algebras. Tatra Mt. Math. Publ. 12A997). 221-228. [38] A. Kacian, Probability on t-rings, PhD thesis. University of Bratislava. Slovak Academy of Sciences A999).
908 В. Riecan and D. Mundici E.P. Klement, Construction of fuzzy ?-algebras using triangular norms, J. Math. Anal. Appl. 85 A982), 543-565. E.P. Klement, R. Mesiar and E. Pap, Triangular norms. Trends in Logic, Studia Logica Library, Vol. 8, Kluwer, Dordrecht B000). E.P. Klement and M. Navara, A characterization of tribes with respect to Lukasiewicz t-norm, Czechoslovak Math. J. 47A997), 689-700. A.N. Kolmogorov. A new metric invariant of transient dynamical systems and automorphisms in Lebesgue spaces, Dokl. Akad. Nauk SSSR 119 A958), 861-864 (in Russian). F. Корка, Boolean D-posets as the factor spaces, Intemat. J. Theoret. Phys. 37 A998), 93-101. K. Kuriyama, Entropy of a finite partition ofafuzz\ sets, J. Math. Anal. Appl. 94 A983). 38—43. J. Lukasiewicz, Ologice trojwarkosciowej [On three-valued logic]. Ruch Filozoficzny 6 A920). 170-171. J. Lukasiewicz and A. Tarski. Untersuchungen iiber den Aussagenkalkiil, Comptes Rendus des seances de la Societe des Sciences et des Lettres de Varsovie, Classe III. 23 A930), 30-50. D. Maharam. An algebraic characterization of measure algebras. Ann. Math. 48 A947). 154—167. P. Malicky and B. Riecan. On the entropy of dynamical systems, Proc. Conf. Ergodic Theory and Related Topics, II. H. Michel, ed„ Teubner. Leipzig A986). 135-138. D. Markechova, The entropy of fuzzy dynamical systems and generators. Fuzzy Sets and Systems 48 A992), 351-363. V. Marra, Every abelian i-group is ultrasimplicial, J. Algebra 225 B000), 872-884. R. Mesiar. The Bayes principle and the entropy on fuzzy probability spaces. Int. J. General Systems 20 A991), 67-72. R. Mesiar, Fundamental triangular norm based tribes and measures, J. Math. Anal. Appl. 177 A993), 633-640. R. Mesiar and M. Navara. Ts-tribes and Ts-measures. J. Math. Anal. Appl. 201 A996), 91-102. R. Mesiar and B. Riecan, On the joint observable in some quantum structures. Tatra Mt. Math. Publ. 3 A993). 183-190. R. Mesiar and J. Rybarik. Entropy of fuzzy partitions—a general model. Fuzzy Sets and Systems 99 A998), 73-79. F. Montagna, An algebraic approach to prepositional fuzzy logic, J. Logic. Lang. Inf. 9 B000). 91-124. (Special issue on Logics of Uncertainty. D. Mundici. ed.) D. Mundici, Interpretation of AF C*-algebras in Lukasiewicz sentential calculus. J. Funct. Anal. 65 A986), 15-63. D. Mundici, Logic of infinite quantum systems, Internat. J. Theoret. Phys. 32 A993), 1941-1955. D. Mundici, Averaging the truth value in Lukasiewicz sentential logic, Studia Logica 55 A995), 113-127. (Special issue in honor of Helena Rasiowa.) D. Mundici, Uncertainty measures in MV-algebras and states of AF C*-algebras, Notas Soc. Mat. Chile 15 A996). 42-54. (Special issue in memoriam Rolando Chuaqui.) D. Mundici, Tensor products and the Loomis-Sikorski theorem for MV-algebras, Adv. Appl. Math. 22 A999), 227-248. D. Mundici, Ulam game, the logic of MaxSat. and many-valued partitions. Fuzzy Sets. Logic and Reasoning about Knowledge. D. Dubois et al., eds, Kluwer. Dordrecht A999). 121-137. D. Mundici and G. Panti. Extending addition in Elliott's local semigroup. J. Funct. Anal. 171 A993). 461- 472. J. Petrovicova, On the entropy of dynamical systems in product MV-algebras, Fuzzy Sets and Systems, 121 B) B001). 347-351. J. Petrovicova, On the entropy of partitions in product MV-algebras, Soft Comput. 4 B000). 41^44. J. Petrovicova and B. Riecan, On the entropy of T-dynamical systems, Busefal, D. Dubois, ed., 80 A999), 9-13. K. Piasecki. Extension of fuzzy P-measure by usual measure. J. Fuzzy Math. 7 A987), 117-124. S. Pulmannova, A note on observables on MV-algebras, Soft Comput. 4 B000). 45^48. B. Riecan, Fuzzy connectives and quantum models. Cybernetics and System Research '92, Vol. 1, R. Trappl, ed.. World Scientific, Singapore A992), 335-338. [70] B. Riecan, Upper and lower limits of observables in D-posets of fuzzy sets. Math. Slovaca 46 A996). 419- 431.
Probability on MV-algebras 909 [71] B. Riecan, On the almost everywhere convergence of obsenables in some algebraic structures. Atti Sem. Mat. Fis. Univ. Modena44 A996). 95-104. [72] B. Riecan. On the product MV-algebras, Tatra Mt. Math. Publ. 16 A999). 143-149. [73] B. Riecan. On the W space of obsenables. Fuzzy Sets and Systems 105 A999), 299-306. [74] B. Riecan, On the conditional expectation of obsenables in MV-algebras of fuzzy sets. Fuzzy Sets and Systems 102 A999), 445^450. [75] B. Riecan. Probability theory on some ordered structures, Atti Sem. Mat. Fis. Univ. Modena 47 A999), 305-315. [76] B. Riecan, On the W space of obsenables on product MV-algebras, Intemat. J. Theoret. Phys. 39 B000), 847-854. [77] B. Riecan. On the probability theory on MV-algebras, Soft Comput. 4 B000). 49-57. [78] B. Riecan, Almost everywhere convergence in probability MV-algebras with product, Soft Comput 5 B001), 396-399. [79] B. Riecan and T. Neubrunn. Integral, Measure, and Ordering, Kluwer, Dordrecht A997). [80] E. Ruspini. A new approach to clustering. Inform, and Control 15 A969), 22-32. [81] J. Rybarik. The entropy of partitions on MV-algebras, Intemat. J. Theoret. Phys. 39 C) B000), 885-891. [82] R. Sikorski, Boolean Algebras, Springer. Berlin A960). [83] A. Tarski. Logic. Semantics. Metamathematics. Clarendon Press, Oxford A956), Chapter IV. Reprinted 1983, Hackett, Indianapolis. [84] V Varadarajan, Geometry of Quantum Theory, Vol. 1, Van Nostrand Reinhold, Princeton, NJ A968). [85] M. Vrabelova. On the conditional probability in product MV-algebras, Soft Comput. 4 B000), 58-61. [86] J.D.M. Wright, The measure extension problem for vector lattices, Ann. Inst. Fourier (Grenoble) 21 A971), 65-85.
CHAPTER 22 Measures on Clans and on MV-Algebras Giuseppina Barbieri and Hans Weber Universita di Udine, Dipartimemo di Matematka e Infonnatica via delle Scienze 206 33100 Udine. Italy E-mail: weber<g>dimi. uniud. it Contents Introduction 913 1. MV-algebras and clans 914 1.1. Basic properties of MV-algebras 914 1.2. The centre of an MV-algebra 916 1.3. Representation of MV-algebras 918 1.4. Loomis-Sikorski theorem 919 1.5. Decomposition of complete MV-algebras 919 2. Submeasures on MV-algebras 921 3. Real-valued measures on MV-algebras 923 3.1. The space ba(L) 923 3.2. Representation of real-valued measures 927 4. Uniform MV-algebras 927 4.1. The lattice CUA(L) 927 4.2. Decomposition of order continuous uniform MV-algebras 930 4.3. The lattice CUA(L. w) 933 4.4. The uniformity generated by a measure 934 5. Measures on MV-algebras with values in a locally convex space 936 5.1. Representation of measures 936 5.2. Decomposition theorems 938 5.3. Hammer-Sobczyk's decomposition theorem and Lyapunov's theorem 939 5.4. Vitali-Hahn-Saks-Nikodym theorem and Nikodym boundedness theorem 941 5.5. Extension of measures 943 References 944 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V All rights reserved 911
This page intentionally left blank
Measures on clans and on MV-algebras 913 Introduction The main aim of this chapter is to present in a unified approach the results about a certain type of fuzzy measures, namely about Tx-valuations on clans of fuzzy sets. A clan ? of fuzzy sets is a family of [0, 1 ]-valued functions defined on a set ? such that 1 e ? and (/ - g) ? 0 e ? whenever /, g e T. Identifying sets with their characteristic functions (i.e., {0, 1 (-valued functions), any algebra of sets is a clan of fuzzy sets. A ???-valuation on a clan С of fuzzy sets is a real-valued function such that ?(/ + g) = m(/) + M(g) if /, g e ? and / + g < 1. Any finitely additive measure on an algebra of sets is a ???-valuation. Tx-valuations were introduced by Klement A982) and intensively studied by Butnariu and Klement A993) in their book "Triangular norm-based measures and games with fuzzy coalitions". There they treat the real-valued or R"-valued case. Here we present T^-valuations with values in a locally convex linear space in order to include also vector-valued measures on algebras of sets. Any ??-valuation is a modular function, see Proposition 1.1.9. This fact allows us to apply some topological techniques elaborated by Weber A996,2002) in the study of modular functions. Indeed, we develop a topological approach to study Гзс-valuation inspired by the method of Weber A984a, 1984b) using FN-topologies in the study of measures on Boolean algebras. To realize this approach we need, as a domain of fuzzy measures, instead of a clan of fuzzy sets a more general structure, which is equationally defined and therefore closed under quotients and uniform completions. A natural generalization of clan of fuzzy sets is an MV-algebra. DEnNlTlON 0.1. An MV-algebra (L. +,'; 0, 1) is a commutative semigroup (L,+) with 0, 1 and a unary operation ': L —>¦ L which satisfies the following axioms: (LI) jt + 1 = 1, (L2) {x')'=x, (L3) 0'= 1, (L4) (x' + y)' + y = (x+ y')' + ? for every x.yeL. Actually, we don't use the previous axioms for an MV-algebra; but, after defining ? ? у iff ?' + у = 1 and у ?? := (? + у')' whenever x ^ у we use the following axioms (PI) x^x+y; (P2) x^y^z+x=*yAx<:z; (P3) у = у Ax +x whenever ? ? у; (P4) ?" =?. Incidentally, the first three axioms define a A-?-semigroup, a structure defined in Weber A997). MV-algebras were introduced by Chang in 1958 in the theory of multi-valued logic. "So because of my own shortcomings in following proofs in Polish notation during a seminar held by Rosser on the Completeness of Lukasiezwicz axioms for infinite valued propositional logic, MV-algebras came to be." (Chang, 1998) "It occurred to me that the approach used in the proof of completeness of the two valued propositional logic via the Boolean (Lindebaum) algebra of
914 G. Barbieri and H. Weber equivalence classes of formulas and the Boolean maximal ideal theorem might be another way to do what Rosser did, but avoiding syntactical manipulations of formulas in Polish notation. This was the beginning of MV-algebras" (ibidem). Chang found a proof of the completeness of the 1%3-valued logic (developed by Lukasiewicz and Tarski and by Rose and Rosser). Since then MV-algebras have found increasing interest. In 1995 Mundici introduced finitely additive measures (called states) on MV-algebras with the intent of capturing the notion of 'average degree of truth' of a proposition. Here we carry on some measure theory on MV-algebras. This mainly follows Barbieri and Weber A998), Barbieri A999), Barbieri et al. B001). The topics we are interested in are representation of measures, decomposition of measures, extension of measures and Vitali-Hahn-Saks and Nikodym theorems. For a treatment of measures on MV-algebras from a probabilistic point of view we refer to Riecan and Mundici B002). From now on, L stands for an MV-algebra. 1. MV-algebras and clans 1.1. Basic properties of MV-algebras We always consider on the MV-algebra L the relation ^ defined by ? ^ у iff ? + у = 1. It can be shown that ^ is a partial order on L and (L, ^) is a distributive lattice (see Mangani A973), where the lattice operations are given by ? ? у = (? + у')' + ? and x/\y = (x'v у')'). A basic example of an MV-algebra is an order interval of a commutative ?-group: PROPOSITION 1.1.1. If (?, ?, ^) is a commutative i-group and e is a positive element of H, then the interval {x e H: 0 ^ ? ^ e} becomes an MV-algebra with respect to the operations ? + у := (? ? у) ? e and x' := e — ?; we denote this MV-algebra by Lq(H, e). Chang A958b) and Mundici A986) have proved that any MV-algebra is of this type. THEOREM 1.1.2. Let L be an MV-algebra. Then there exists a commutative l-group ? with strong unit e (i.e., Wg e ? Bn e N with ne ^ \g\) such that L = Lo{H,e). L is complete iff ? is Dedekind complete and L is ? -complete iff ? is Dedekind ? -complete. The completeness statement in Theorem 1.1.2 was observed by Jakubik A995). COROLLARY 1.1.3. If(L, +,';0, 1) is an MV-algebra and ее L, then Le:=[x eL: x^e] is an MV-algebra with operation ? defined by ? ? у := (? + у) л е and complementation * defined by ?* := ?' ? е.
Measures on clans and on MV-algebras 915 In the introduction we have already defined у Ax if x, у e L and ? ^ y. We extend ? on Lx L by ? Ay = (xvy)A(x Ay). With the notations as in Theorem 1.1.2, if L = Lo(H,e), then xAy = \x — y\. Every MV-algebra satisfies an important property known as the Riesz decomposition property. Proposition 1.1.4 (Riesz decomposition property). If x,x\,xi e L with ? ^ x\ + хг, then there are f, e L(i = 1, 2) such that f, ^ jc, and ? =t\ +t2,t\ ^ t'r Proposition 1.1.5. (a) x' = \Ax, ? +x' = 1. (b) xAy = Oiffx = y. (c) xAy^(xAz) + (zAy). (d) [xA(xAy)] + y = xvy, (x Ay) + (x vy)=x +y. (e) (y + x)Az^(yAz) + (xA z). (f) If о denotes one of the operations +, ?, ?, ?, then (? ? z)A(y ? ?) ^ xAy. (g) If(xa)aeA I x and (yfl)fleB I }'¦ Then xa + у ? \. ? +y. (h) Let a„,s„,s e L such thats,, = supi>nai ands,, Is. Then a \ As ^ ??=\ ?/???,,+|. One can deduce these rules from the axioms (P1)-(P3) given in the introduction (see Weber A997)). Proposition also 1.1.5 follows from Theorem 1.1.2 and the known rules in ?-groups. Proposition 1.1.6. (a) If x, у e L, then (x + y)Ay = ? л у'. (b) If(x„) is a sequence in L with J2"=i x\ -^ x'n+l (n e N), then s„ := ?'?=\ ?' defines an increasing sequence and s„Asn-\ = x„(n e N). (c) If (s„)newu{0] is an increasing sequence, then x„ := s„As,,-i defines a sequence with snAs0 = ?"=| ?, ^ x'n+i (n e N). We call a subset N of L an idea! if x, у e ?, ? e L, ? ^ ? + у imply ? e ?. ? ?-ideal is an ideal closed under countable suprema. The following proposition shows that there is a natural bijection between congruence relations and ideals in L. PROPOSITION 1.1.7. Let ~ be a congruence relation with respect to + and ' (and therefore with respect to ?, ?, ?) on L. (a) Then N = [x e L: ? ~ 0} is an ideal and ? ~ у iff xAy e N. Conversely, any ideal N in L by x~y iff xAyeN defines a congruence relation with respect to +,', ?, ?, ?. (b) The quotient L/N := L/ ~ with the induced operations is an MV-algebra.
916 G. Barbieri and H. Weber DEFINITION 1.1.8. A vector-valued map ?: L —>¦ ? is called a measure if ?(>0 = ?(???) + ?(?) whenever ?, у е Р and x ? у. PROPOSITION 1.1.9. Let ?: L^- ? be a vector-valued map. (a) Then ? is a measure iff ?(? + у) = ?(?) + ?(}') whenever ?, у e L and x ? у'. (b) If ? is a measure, then ? is a modular function, i.e. ?(? А у) + ?(? ? у) = ?(?) + ?-iy) whenever ?, у е L. PROOF, (a) (=>) If x,y e L and ? ^ y', then (jc + y)A)· = x. Hence, ?(? + у) = ?((? + у) Ay) + ?(>0 = ?(?) + ?(}'). (<=I? x,y e L and ? ? у, then )>Ajc ^ x'. Hence, ?()·) = ?((? ??) + ?) = ?(???) + ?(?). (b)Let ?, у & L. Since xA(x Ay) ? у', we obtain using (a) ?(?) + ?(?) = ?(? Лу) + ?(??(? л ?))+?{?) = ?(? А y)+?((xA(x ? }¦)) + >') = Д(* ? ?) + ?(? ? )·)· ? PROPOSITION 1.1.10. Let ? : L -> ? be a vector-valued measure. (a) Then ?(?) := [? e L: ?(?) = Ofor every ? e [0, ?]} is an ideal in L. (b) Let ~ be the congruence relation induced by ?(?) according to Proposition 1.1.7. Then a ~ b iff ?(?) = ?(}')^ ?, у е [а A b, a v b]. (c) For an ideal N С ?(?), define ? by ?(?) := ?(?), ? e a e L/N. Г/геи ? /s ? measure on L/N and ?(?) = {0} iff ? = ?(?). PROOF, (a) Let х\,хг e ?(?) and у ^ jc? + X2- By the Riesz decomposition theorem there are y, e L with y, ^ jc, (/ = 1,2) and у = y\ + yi, y\ ^ y'-,. Therefore ?^) = ?(?\) + ?(?2) = ?. (b) (=>) If ? ~ b and jc e [а л b, ? ? b]. Then г := (? ? b)Ax ^ ??/> e ?(?), hence, ?(?) = 0 and ?(? w b) = ?(?) + ?(?) = ?(?). («=) Let ? ^ aAb. Then ? + (а л fo) e [а л fo, ? ? b], thus ?(? л fo) = ?(? + (а л ?)) = ?(?) +?(? ? fo) by Proposition 1.1.9, so ?(?) =0. It follows that aAb^O and a ~ fo. (c) follows from (b). ? 1.2. Г/ге centre of an MV-algebra The centre of L here has a basic role in decomposition theorems and in the representation of measures. The centre of a lattice R with 0 and 1 is the set of elements e e R such that for some element f e R the map R э ? н> (jc л e, ? л /) е [0, е] ? [0, /] defines a lattice isomorphism. In this case / is the unique (!) complement of e. If R is distributive then C(R) is the biggest Boolean subalgebraof (R, л, v,0, 1). Here we give some characterizations of the centre. PROPOSITION 1.2.1. C(L) = {x e L: ? ax' =0} = {x e L: x' is the unique complement of x] = [x e L: ? has a complement in (L, ^)) = {x e L: 2x = x). PROOF, (i) By Proposition 1.1.6(a) jc ajc' = 0 holds true iff 2jc = x. (ii) We will show that C(L) с {jc e L: jc л jc' = 0).
Measures on clans and on MV-algebras 917 Let ? e C{L) and ? the complement of x. Then by Proposition 1.1.5 ?' = 1?* = (? ? ?)?(? ? 0) ^ ? АО = ?, hence ? лх' ?? ??=0. (iii)Letjc e L and ? ??' = 0, then jc' is a complement of jc: in fact, by Proposition 1.1.5, we have ? ? ?' = ? + ?' = 1. The rest follows from the distributivity of (L, л, ?). ? Corollary 1.2.2. If L is ?-complete, then a := sup„eti na e С(L) for every a e L. PROOF. Thanks to Proposition 1.2.1 it suffices to prove that 2<5 = a. Indeed 25 = sup„eNna +sup„6N/ia =sup„ w6N[(n +m)a] = supneNna =a. D PROPOSITION 1.2.3. С\L) is regular embedded in L. PROOF. It is sufficient to prove that if C(L) э xa \, ? in L, then ? e C(L). By Propositions 1.1.5 and 1.2.1 we have xa = 2xa I 2x, therefore 2x = x, and so by Proposition 1.2.1 xeC(L). О PROPOSITION 1.2.4. Letae C{L). Then ?(?) := (jc ? ?, jc л ?') defines an MV-algebra isomorphism between L and [0, a] x [0, a']. PROOF. Since a e C(L), ?:? н> (jc л а,х л ?') defines a lattice isomorphism from L onto [0, ?] ? [0,a']. What is left to show is that, with notation as in Corollary 1,1.3, ?(? +y) = ?(?) ? ф(у) and ф(х') = ф(х)* whenever jc , ? e L, i.e., that for all x,y e L and с е C(L) we have (i) (x + у) л с = (? л с) + (}> л с); (п) jc' лс = (jc ас)*. (i) We have (* лс) + (у лс) = (jc +у) л (jc +с) л (у +с) а(с+с) and (? +с) л (у +с) л (с + с) = с if с + с = с. (ii) (jc л с)* = (jc л с)' л с = (jc' ? с') л с = (jc' л с) ? (с' л с) = х' л с. D The following proposition, which immediately follows from Proposition 1.2.4, makes it understandable that the centre C(L) plays an important role for decomposition theorems of measures. Corollary 1.2.5. Let a e C(L), ? : L —>· ? a measure with values in a linear space. Then ?(,(?) := ?(? л jc) and ??'(?) := ?(?' ? jc) define two measures on L and ? = ??+??·- We conclude the section with a remarkable example. Example 1.2.6. Let ? be a nonempty set, ? an ?-subgroup of Жп such that ?? e ? and, with the notation of Proposition \.\.\, L = Lq(H, ??)- Then L is a Гэс-clan of fuzzy sets. The centre C(L) is the space of {0, 1 (-valued functions contained in H, called the space of crisp sets, ? := {А С ?: ?? e ?] is an algebra of subsets and ДэЛихл defines a Boolean isomorphism between Л and C(L).
918 G. Barbieri and H. Weber 1.3. Representation of MV-algebras A ???-tribe is a ???-clan such that (?„6? /,() л ?? e ? whenever /„ e T. Its centre C(T) is a ?-algebra. Important is here Artstein's observation that any function of a ???-tribe ? is measurable with respect to C(T) (see Butnariu and Klement A993, 3.2)). In Proposition 1.3.1 we give a simpler proof of this fact. For an algebra A of subsets of ? we denote by S(A) = spanfxA: A e A} the linear space generated by the characteristic functions хд, А еД and by CX(A) the closure of S(A) in the space /oo(i2) of all bounded functions with respect to the sup norm || ||oo. Proposition 1.3.1 will be a model for a representation of an MV-algebra. PROPOSITION 1.3.1. Let ? be a Toe-tribe of fuzzy sets on ? and A = C(T) the ?-algebra of crisp sets in T. Then ? с CX(A). PROOF. Let / e ? and ALn :={???: f(x) <i/n] for ? e N and / eNU(O). Then ???? = sup,n6N(w(/ - nf)+ л ??) e ?, hence, A,.„ e A, g„ := ?,"=??'('/") (Хл/+1.„ - Xa,„) e 5(Л) and 0 < / - g„ < \/n. ? Now we will represent an MV-algebra. This representation theorem is obtained as a corollary of a well-known representation theorem for ?-groups. Its proof can be found in Vulikh A967, Section V, Theorem V.3.1). We recall that the set of all components of an element e of the positive cone of an ?-group G coincides with the centre of the MV-algebra {x e G: 0 ^ ? ^ e] (cf. Proposition 1.2.1). Bernau A963) proved that if G is a Dedekind ? -complete ?-group with strong unit e, then there exists a set ?, an algebra of sets Ao and a group and lattice isomorphism f:G^ Go С Loo(Ao) with f(C) С Ao, where С is the set of components of e, and moreover ?(?) = ?^. Now we prove that Aq can be chosen to be isomorphic to С THEOREM 1.3.2. Let G be a Dedekind ?-complete l-group with strong unit e, С the Boolean algebra of components of e. Then there exists A an algebra of subsets of a set ? and a group and lattice isomorphism ?: G —> Go С СХ(А) in such a way G is regular embedded in Coo(A) and ф(С) = (хд: A e A] (i.e., f{C) = A if we identify sets with characteristic functions). PROOF. By the theorem of Bernau A963) mentioned above we have to show that if A = f(C), then Go С L^A). Let / e Go, ? e N. We shall have established the theorem if we prove that there exists s e S(A) such that ||/ — s\\x ? \/?. As / e/oo(i2) there exists A: e N such that ||/||зс ^k/n. We will show that for / e ? there exists A, e A such that {f<Ln)CA-C{f^\ Let g := (??? - nf)+ л ?? and h := (??? - nf)~ л ?? = (nf - ???)+ л ??. Then g, h e [0, ??] = {/ e Go: 0 ^ / ^ ??). With the same notations as in Corollary 1.2.2
Measures on clans and on MV-algebras 919 ~g and h e C([0, ???)· By Example 1.2.6 g~, h are {0, 1 (-valued functions, so there exist ?,, ?, e Л with g~ = ??? and /? = ???. Moreover, as ~g ^ g and, analogously, h ^ h, we have {g > 0} С A, (*) and {h > 0} с S, (**); as g л h = 0, we get J л /г = 0 and consequently А, П ?, = 0 (***). We have {/ < i/n} с А,: If /(л) < i/n, then g(;t) > 0, therefore by (*) x e A,. We have А, с {/ ? i/n}: Let /(jc) > ;'/n, then /j(jc) > 0, therefore by (**) x e S, and consequently by (***) x$Aj. Puti:=Ef=^('/")(XA,+, - хл;). Then ? e SD) and ||/-i Hoc ?? 1/n. ? THEOREM 1.3.3. Lei L fee ? -complete. Then there exist an algebra A of subsets of a set ? and a regular embedding ? : L -> Coc(A) such that *P(C(L)) = (хд: А е A} and ?(? +y) = (?(?) + ? (у)) л ?? and ? (?1) = ??- ? (x) far ?, у е L. PROOF. From Theorem 1.1.2 L is the interval [0, e] of a Dedekind ?-complete ?-group (G, ?, ^) with strong unit e, where the operations of L can be defined by the ones of G in the following way: х+у = (х(Ву)ле and x' = e - x. Now put A and ? as in Theorem 1.3.2 and define ? := ?\? the restriction of ? to L. ? 1.4. Loomis-Sikorski theorem The Loomis-Sikorski theorem for Boolean algebras says that every ?-complete Boolean algebra is isomorphic to a ?-complete Boolean algebra of point sets modulo a ?-ideal in that algebra. A. Dvurecenskij B000) and D. Mundici A999) have independently given a generalization of this theorem for MV-algebras. Here we give an alternatively and easy proof of this result based on the representation Theorem 1.3.3. THEOREM 1.4.1. Let L be ?-complete. Then there exists a tribe ? of fuzzy sets and a ?-order continuous epimorphism <p:T —>¦ L onto L (with respect to the operations of an MV-algebra). Proof. We use the same notation as in Theorem 1.3.2. By the classical Loomis-Sikorski theorem there is a ?-algebra Aq and a ?-order continuous epimorphism ? : Ao —>¦ A such that N = ?'\??) is a ?-ideal. By 45B of Fremlin A974) ? extends ? : Cx(Ao) -> ?оо(Л) and jt~' @) = ???(?0. By 14L(b) of Fremlin A974) ? is ?-order continuous. Now put ? := ? (\lr(L)). Then ? is a tribe and there is a ?-homomorphism from ? into L (with respect to the operations of an MV-algebra). ? 1.5. Decomposition of complete MV-algebras Any complete Boolean algebra С is isomorphic to the product of an atomless Boolean algebra and {0, 1}A where A denotes the set of atoms of C. We use this decomposition
920 G. Barbieri and H. Weber for the centre of L to obtain a decomposition of L (see Theorem 1.5.4). This result will be applied to obtain a topological decomposition of uniform MV-algebras which itself is a basic tool to prove the decomposition Theorem 5.3.1 for measures and properties of the range of measures (see Theorem 5.3.2). We recall that an atom of a lattice Lo with zero 0, is an element a e Lo\{0) such that if ? e Lq and 0 ^ ? ? a, then either ? = 0 or ? = a. Lq is atomic if for every ? e Lq\{0] there exists an atom a of Lo with ? ? ?; Lq is atomless if Lo doesn't contain atoms. PROPOSITION 1.5.1. Let L be ?-complete and a be an atom of L. Then a is an atom of C(L). (Notation as in Corollary 1.2.2.) PROOF. Let c\, сг be disjoint elements of C(L) with c\Vc2=a. We will show that с ? =0 orc2 = 0. Since ? = (а лс\) ?(? лс2) and a is an atom, we have ? лс| = 0ora лег =0. Suppose that а л сг = 0. Then a ^ c\, hence, a ^c\ =c\ ^ a. It follows that a = c\ and therefore сг = 0. ? With notations as in Proposition 1.1.1 let Сх := Lq(R,1) the unit real interval considered as an MV-algebra and C„+\ := {//и: / = 0,..., ?] the MV-subalgebra of Cx which consists of ? + 1 elements for n e N. PROPOSITION 1.5.2. Let L be ?-complete and irreducible (i.e., C(L) = {0, 1}). Then L is (as an MV-algebra) isomorphic to C„ for some neNU{oo},«^ 1. PROOF. We use the same notations as in Theorem 1.3.3. Since C(L) contains exactly two elements, we have A = {0, ?). Hence, S(A) = {???: a e Щ = Cx(-A)< so ??(?) is isomorphic to R. Therefore we may assume that L is an MV-subalgebra of ??. If L \ {0} has a minimal element a, then a = \/n for some ? e N and therefore L = C,l+\. If L \ {0} has not a minimal element, then L is dense in jCoc and therefore L — jCoc» since L is ?-complete. ? We need a generalization of Proposition 1.2.4. PROPOSITION 1.5.3. Let L be complete and D a set of pairwise disjoint elements of C(L) with supD = 1. Then ?:? н> (? ? d)(/eo defines an isomorphism between L and PROOF. By Proposition 1.2.4 ? н> ? л d defines for any d e D an homomorphism between MV-algebras. Therefore ? is an MV-algebra homomorphism. Since sup D = 1 and therefore ? = ? л sup D = sup(/eD ? Ad, ? is injective. ? is surjective since for any (Xd)ddD e ??/6?[°' <*] we have 0(sUP(/6D-*rf) = (xd)deD- ° THEOREM 1.5.4. Let L be complete. Denote by AX(L) the set of all atoms a ofС(L) for which [0, a] is infinite and by Af(L) the set of atoms a ofC(L)for which [0, a] is finite. Put aoo :=sup^oo(L), a/ :=sup.4f(L) and с := (ax ??/)'·
Measures on clans and on MV-algebras 921 (a) Then L is isomorphic to [?, ? ? ? [0, ax] ? [0, с]. (b) [0, af] is isomorphic to a product Y\aeA ?,,„ where A = Af(L). (c) [0, ax] is isomorphic to the power C^ where В = Ax (L). (d) The centre of [0, c] is atomless. (e) L is atomless iff a) — 0 iff A/ (L) = 0. (f) L is atomic iff ?? = с = 0. PROOF. aoc,a/,c e C(L) by Proposition 1.2.3. (a), (b), (c) now follow from Propositions 1.5.3 and 1.5.2. (d) follows from the fact that С([0,a]) = C(L) ? [?,?] for any a e C(L). (e) and (f) follow from (a), (b), (c) and Proposition 1.5.1. ? 2. Submeasures on MV-algebras Let ?-.S —>¦ [0, +00] be defined on an MV-subalgebra of L. ? is called subadditive if ?(? + у) !? ?(?) + ? (у), ? is called ? -subadditive if ?,?,, e S (и e ?) and ? = SUP„ ?"=?-*, =·' ??=\?'? ??1?^ П(х) < ??=\ n(x,i)- Л is а (?-) submeasure if 77 is monotone, (?-) subadditive and ?@) = 0. Example 2.1. Let (E, p) be a seminormed linear space, S an MV-subalgebra of L and ?: S —> ? a measure. (a) Then the semivariation \\?\\?: S^> [0, +00] defined by ||?||,>(·*¦)= &??{?(?(?)): t e [0, x]} is a submeasure. (b) If ? is ?-order continuous (i.e., for every monotone sequence (x„) in S with order limit jceSwe have ????(.?„) = ?(?)), then ||?||,, is a ?-submeasure. PROOF, (a) Let x, jc 1, x2 e 5 with jc ^ л 1 + *2 and t e [0, v]. By the Riesz decomposition property there are f, e 5 with f, ^ jc, A = 1,2) and t = t\ + л», ? ^ '?- Therefore ?(?) = ?(?,) +?(?2) and ?(?(?)) ? ?(?(?\)) + ?(?(?2)) ? ||?|1,,(-*?) + ?|???/>(*:>)· It follows that ||?||/;(?) ? \\?\\?(?])+ \\?\\,,(?2). (b) Let jc, jc„ e S(n eN)and;c = ?^1, x„.Lett e S, t <;c.Then ||?||,,(? л^, *,¦) ^ ?!=? II?II/.>(-*,') by (a). Moreover, t л ?!=? -*'' t '· Therefore ?(?@) = ???^???,?? л]Гх, J J ^ Дгт^ ]? ||?||/,№) = ]? ||?||,,(-?„)· it follows ?^??^-?^?^? Им им*,,)· D It follows from Example 2.1 that any positive real-valued (?-order continuous) measure on L is a submeasure (?-submeasure). The next proposition shows that a ?-submeasure can be extended from a ?-subalgebra of L to L.
922 G. Barbieri and H. Weber PROPOSITION 2.2. Let L be ?-complete and ? : S —>· [0, +oo] be ? ?-submeasure on an MV-subalgebra of L. Then /?*(*) = inf defines ? ? -submeasure on L extending ?. PROOF, ?* extends ? since ? is a ?-submeasure. Let x,xn e L with jc ^ ??=\ xn- We show that ?*{?) ^ ?,^? /7*(лг„). We may assume that ?*(??) < +oo for each n e N. Let ? > 0 and x,Lm e S with л:,, ^ ?^=\ Xn.m and ?~=? l(xn.m) < /j*(jc„) + ?/2" (и e N). Then дс <; ??„,= \ xn.m and therefore /?*(*) ^ ?»°°..=?4^.-.)^?»"?4*?+?· ? Let /j: L —>¦ [0, +oo] be a submeasure. It follows from Proposition 1.1.5 thatdtl(x, y) := ?(???) defines a pseudometric on L and d,((;t oj.yo:)^ d,((.x, y) (?, ?, ? e L) where о denotes one of the operations +, ?, л, v. Therefore the operations +, ?, л, ? are uniformly continuous; in particular, ? н> ?' is uniformly continuous since ?' = 1?*. DEFINITION 2.3. A submeasure ? is called ?-order continuous (order continuous) if every decreasing sequence (net) with order limit 0 converges in (L, ??) to 0. A submeasure ? is called exhaustive if every monotone sequence in (L, ??) is Cauchy. The following proposition shows that any ?-order continuous submeasure is a ?-submeasure. PROPOSITION 2.4. Let ?: L -> [0, +oo] be a submeasure. Then the following two condition are equivalent A) ? is a ?-submeasure. B) Ifa„ I 0 and (a„)„6pj /s a Cauchy sequence, then a„ —> 0 in (L, ??). PROOF. A) => B) Let (/>„) be a subsequence of (a„) with ^b„Ab„+\) ^ 2~" for и e N. Since by Proposition 1.1.6(c) bnAbm+\ = ??=,? biAbi+\ for m ^ и and therefore bn = ?^„ bi ?^?+?, we obtain + i пФп) ^ilMi'i+i) < ?2~' = 2_" Therefore /j(/>„) —>¦ 0. Since (/>„)„6? is a subsequence of (a„),,6N and (a„)„6pj is a Cauchy sequence, it follows that also (?„)„6? converges to 0 in (L, dn). B) => A) Let (a„)„6N e L,s„ = ?-'=l a/(" e N) and s - sup„6Ni„. We shall show that r](s) ^ ??=\ nian). We may assume that ??=\ ^(?,?) < +°°· Then (???»)«6? is a Cauchy sequence since ?((sAs,l)A(sAs,n)) ? ??=? l(fli) ^ 0 for m > ? ^ +oo.
Measures on clans and on MV-algebras 923 MoreoversAsn i 0. Hence, ?^??,,) -> 0 by B). Since II DC ?=? ?=? and /?(???„) —>¦ 0, we obtain /j(s) ^ ?,?? ???)· ^ THEOREM 2.5. Lei L be a-complete and ? : L —>¦ [0, +oo] fee ? ?-submeasure. (a) Г/геи (L,dn) is complete. (b) If(an)nef$ converges to a in (L, ??), then (?„)„6? /?as ? subsequence (fe,i)„6N such that ? (a A lim sup fe„) = 0. PROOF. Let (а„)„6щ be a Cauchy sequence in (L,d,j) and (fe,,)„6N a subsequence of (a„)„6N with /j(fe„Afe„+|) ^ 2'" for и е N. Let fe = limsupfe,,. Since bAb„ ^ ?~„/>,?/7, + |, we obtain ij(*A*„) < ?,?„»?(*??*?+?) < 2""+'· Therefore (fe„) converges to fe. It follows that also (a„)«6N converges to b. Moreover, if (а„)„ещ converges to ? in (L,dn), then /j(aAfe) = 0. ? If ?: L -> [0, +00] is a submeasure, then N(/j) := [a e L: /j(a) = 0} is an ideal in L. THEOREM 2.6. Let L be ?-complete, ? : L -> [0, +oo] a ?-submeasure and ?: L —> [0, +oo] ? ?-order continuous submeasure on L. Then ?(?) С ? (?) ??[?(?„) —>¦ 0 implies ?(?„) —>¦ 0/or any sequence (a„)„6pj /и L. PROOF. («=) is obvious. (=>) Suppose that L contains a sequence (?„)„6^ such that ?(?„) —> 0 but (?(?„)),?6? doesn't converge to 0. Then there is a subsequence (fe„)„6N of (йп)мем and an ? > Osuch thatX(fe„) ^ ? for every ? e N. By Theorem 2.5(b), (fe,i)„6N has a subsequence (c„)„6pj such that /j(limsupc„) = 0. Let s = lim sup c„ and s„ = sup,^,( c,. Then lim„^+00 ?(?„) = ?(?) = 0, a contradiction to k(s„) ^ ?(?„) ^ ? (n e N). D THEOREM 2.7. Lei L fee ?-complete and ? and ? be ?-submeasures on L. If for every ? > 0 there is an element ?? e L such that ?(??) < ? and ?(?^) < ?, then there is an element a e C(L) such that ?(?') = ?(?) = 0 PROOF. Let ? > 0. For every n e N we can choose x„ e L such that ?(?„) ^ ?/2" and ?(*;,) <; ?/2". Put ? = ??\x"- Then ?(?) ^ ??=\ ?(?,·) <? and ?(·?') < ?«> -»¦ °· hence, ?(?') = 0. Therefore for every n e N there exists )¦„ e L such that ?()„) = 0 and ?(?'„) <: l/л. Put у := ?~=? >'«· Then ??0 < ?? М}'„) = 0 and /j(y') ^ /?(}',',) -»· 0, hence, jjO") = 0· By Corollary 1.2.2, a := sup„6N ny e C(L). Then /j(a') ^ ?(?') = 0 and ?(?) =0 since ?(?) is aa-ideal. ? 3. Real-valued measures on MV-algebras 3.1. The space ba(L) Let us denote by ba{L) the space of all bounded real-valued measures on L.
924 G. Barbieri and H. Weber Obviously, ba(L) is a partially ordered linear space where the partial order and the operations on ba{L) are defined pointwise. The basic tool in this section is the following result. THEOREM 3.1.1. ba{L) is a Dedekind complete Riesz space and (i) if ?,? e ba{L) and ? e L, then (? ? v)(x) = sup^(;t|) 4- v(xi): x\, xi e L\ x\ ? x'y, x\ + xi =¦*}', (ii) if (??)??? is an increasing pointwise bounded net in ba(L), then (ду)у6г has a supremum in ba{L) and (sup еГ ??)(?) = sup{My(.x): ? e ?] for ? e L. PROOF. To simplify the calculation we assume that L is an order interval [0,e] of an ?-group (G, ?, ^) and ? + у = (? ? у) л е. First observe that for ? \, x2 e L, the condition ? \ ^ x'2 and x\ + x2= ? means exactly X\ ®X2 = X- Let ?,? e ba(L), ? e L and ?(?) be defined by the right-hand side of (i). Then ? is well-defined and bounded. We shall check that ? is a measure. Let x, у e L with ? ^ y'. If x\,xi\ y\, y2 e L such that x\ ? xi=x and y\ ? y2 = y, then by additivity of ? and v, we have ?(?\) + v(x2) + ?(}'?) + ?(}'?) = ? С* ? + Л) + i>(*2 + уг) ^ ?(* 4- у). Hence, ?(?) + k(y) ? ?(? + у). Now let w\,w2 e L such that w\ ? W2 = ? ? у. Then by the Riesz decomposition property there exist z,, tj e L (i =1,2) such that w\ = z\ ? t\, w2 = ?? ? t2 and z\ ? ii = x, t\ ? ь = }'¦ Therefore ?(?? ?) 4- ?(??2) = ? (г ?) + ?(? ?) + ?(?2) 4- ?(?2) = ? (? ?) + ?(^2) + ?(?\) + v(t2). Since ? ? ? ?2 = ? and ?| ? ti = у, we have ?{??\) 4- у(иъ) ^ Цх) + ?)·)- So ? (л 4- }') ^ Цх) + Цу). Obviously, ? is the supremum of ? and v, since it is an upper bound and any upper bound of ? and ? is necessarily greater than ?. By the rules valid in any partially ordered linear space we get ? л ? = -[(—?) ? (—?)] in the sense that if one side exists then the other side exists and they are equal. To prove the Dedekind completeness of ba(L) it suffices to show that any increasing pointwise bounded net (?7)/6? has a supremum in ba(L). ТЫ ?(?) :=supyer ??(?) for every ? e L. We shall prove ? a measure. We have ?(? +y) = $\}???(? + у) = sup (??(?) + ??(у)) = sup ??(?) + sup ?7(H уеГ уеГ уеГ уеГ = ?(*) + ??0 whenever ?, у e L and ? ^ y'. Since the pointwise supremum ? of (?7)/6? is a measure, it is the supremum of (?7)/6? in ba(L). ? PROPOSITION 3.1.2. For ? e ba{L) denote ?+ := ? ? 0, ?" := (-?) ? 0 and |?| := ? ? (—?). Г/геи (а) //?, ? e ba(L), then (? ? i>)(;t) = inf^(jt|) 4-v(jt2): ??,?? e ?<; ¦*! ^ x->'> ?\ + -*2 =-*}·
Measures on clans and on MV-algebras 925 In particular \?\ ? |?| =0 (in symbols ? J_ ?) iff for every ? > 0 there exists ? e L such that \?\(?) ^ ?, |v|(jc') ^ ?. (b) ?+(?) = sup{M(y): Lb у ? ?}. (c) M~(-x) = -inf{M(H: L3y^x}. (d) \?\(?) = sup{M(jf|) - ?(?2): ?\, ?? e L; .V| ^ x'2; x\ +x2 = x}- (e) ||?|| ^ \?\ ^2\\?\\ where \\?\\(?) := $??[\?(}·)\: Еэ у ? ?). Proof, (a) By the rules valid in any Riesz space we have (???)(?) = -[(-?)?(-?)](?) = — sup{— ?(?\) — v(xi): x\,x2 e L\ x\ ^ x'2\ x\ + xi = x\ = ???{?(?\) + v(x2): x\, x2 & L\ x\ ^x'2\ x\ +x2 = x]. (b) and (d) follow from Theorem 3.1.1 (i). (c) follows from (a) and (e) from (d). D The Jordan decomposition theorem valid in any Riesz space yields the following decomposition theorem for measures. It generalizes a result contained in Butnariu and KlementA993). THEOREM 3.1.3 (Jordan decomposition). Let ? e ba(L). Then ?+, ?~ are the unique measures in ba(L) such that ? = ?+ — ?~ and ?+ ??~ =0. Moreover, |?| =?+ + ?~¦ The Hahn decomposition theorem for measures on MV-algebras can be derived from the Jordan decomposition theorem. THEOREM 3.1.4 (Hahn decomposition). Let L be ?-complete and ?-.L -> ? ? ?- order continuous measure. Then there exists a e C(L) such that ?(?) ^ 0 for ? ^ a and ?(?) ^ Ofor ? ^ a'. In particular, ?(?) = ????(?,) and ?(?') = 5???(?,). PROOF. Apply Theorem 2.7 for ? = ?+ and ? = ?". For ? < a we have ?(?) = ?(? л а) = ?+(? л а) - ?"(? л ?) = -?"(? ? ?) ^ 0. Analogously ?(?) ^ 0 for ?^?'. ? Other decompositions of measures can be obtained as band decompositions applying the Riesz decomposition theorem to ba(L). THEOREM 3.1.5 (Riesz decomposition). Let ? be a Dedekind complete Riesz space, F a band in ? (i.e., an order closed solid Riesz subspace). Then ? is equal to the direct sum ofF and F1- = {x e E: \x\ л |/| = Ofor every f e F]. We use the Riesz decomposition theorem to transfer the Hewitt-Yosida decomposition theorem and the Lebesgue decomposition theorem in the setting of measures on MV- algebras. In any Dedekind complete Riesz space ? the band generated by a subset A of ? is A-1-1-. We now describe the band generated by an element ? e ba(L).
926 G. Barbieri and H. Weber NOTATION 3.1.6. For ?, ? eba(L), we write ? <^? iff \?\{?„) -> 0 implies ?(?„)-> 0; equivalently, ? <g ? iff for every ? > 0 there exists <5 > 0 such that \?\(?) ^ <5 implies \?(?)\??. LEMMA 3.1.7. Let ?, ? eba(L); ?;«? ?/?^? J_ ? /wp/j ? = 0. PROOF. Since ? J_ ?, for every ? e N there exists jc„ e L such that |?|(^„) ^ \/n and ???-?,',) <: 1J п. Therefore \?\{?'?) -> 0 and |?| (-*,,) -> 0. As ? « ? we get \v\(x'n) -> 0. Now by additivity of |v|, we have M(l) = \v\(x„) + \v\(x'n) ^ 0, hence, |v|(l) =0 and consequently ? = 0. ? PROPOSITION 3.1.8. For ? e ba(L), В := {? e ba(L): ? «С ?] is the band generated by ?. PROOF. Obviously, В is an ideal. We will prove that ? is a band, in other words we will prove that if В э va ^ 0, va \ v, then ? e B. Let (jc„) be a sequence in L such that \?\(?,?) —>¦ 0. We show v(x„) —>· 0. Let ? > 0. Then u(l) ^ vp(\) + ? for some index ?. Since vp{x„) -> 0, there exists /o e N such that for every ? ? (jc, ) ^ ? ?' ^ /о; then for /' ^ /o we have ? (?,¦) = (? - ? ?) (*,¦) + Vfi(Xj) ? (? - ??)(\) + vp(xj) ^ ? + ?. Soi>(x„)^0. Of course ? e ?, so {?}-1"", the band generated by ?, is contained in B. Now suppose ? e S, we shall prove ие(д)и. By the Riesz decomposition theorem, ? = v\ + v2, where v\ e {?}-1 and V2 e {?}  As | V| | ^ \v\, we have v\ « и « ? and v\ J_ ?. By Lemma 3.1.7 v\ =0 and ? = ?>2 e Corollary 3.1.9. Lei ? eba(L) and В be a band in ba(L). Then ? e B1- iff ? e B, ?«? imply ? = 0. We now are ready to prove the Lebesgue decomposition theorem. THEOREM 3.1.10 (Lebesgue decomposition). Let ?, ? e ba(L). Then there are unique measures ?, eba(L) (i = 1, 2) smc/? ?/??? ? =?\ + ??, ?\ <&? and ?? -L ?. PROOF. Apply the Riesz decomposition theorem for F = {?}-1. ? Analogously, applying the Riesz decomposition theorem for F = ca(L), where ca(L) is the set of all real-valued ?-order continuous measures on L, which is a band in ba(L), we get the Hewitt-Yosida decomposition theorem. We need the following definition. DEFINITION 3.1.11. We say that ? eba(L) is purely ???-?-order continuous if the zero measure is the only ? e ca{L) such that ? <? ?.
Measures on clans and on MV-algebras 927 THEOREM 3.1.12 (Hewitt-Yosida decomposition). Let ? e ba(L). Then there are unique measures ?, e ba(L) (/ = 1, 2) such that ? = ?\ + ??, ?? e ca(L) шЛ ?? риге/у пои ?-order continuous. Now we leave this algebraical approach and we shall develop a topological approach in order to study measures on MV-algebras. 3.2. Representation of real-valued measures In this section we show that Butnariu's integral representation theorem (see Butnariu and Klement A993, 6.2)) is an easy consequence of the Hahn decomposition theorem. Theorem 3.2.1. Let ? be а Тх-тЬе of fuzzy sets on ? and A = C(T) the ?-algebra of crisp sets in T. Then the restriction ?: ? н> ?| д defines a linear isome try from the Banach space (ca(T), || ||oo) onto the Banach space (ca(A), \\ \\x)- For ? e ca(T) and f e ? we have ?(/) = / /??, where ? = ?\?. PROOF. Obviously, ? is linear and, by Theorem 3.1.4, norm preserving, in particular injective. The surjectivity of ? follows from Proposition 1.3.1: Let ? e ca(A). Define v(f) := f f dv for / e CX(A). ? is ?-order continuous by Lebesgue's convergence theorem. By Proposition 1.3.1, ? = ?|? is well-defined and ?(?) = ?. D 4. Uniform MV-algebras 4.1. The lattice CIAA(L) We denote by CUA{L) the system of all uniformities on L which makes + and ' (and therefore ?, л, v) uniformly continuous. If и e CUA(L), we call (L, u) a uniform MV- algebra. If ? : L -> [0, +oo] is a submeasure, then (L,dn) is a uniform MV-algebra, see Section 3. If и e CUA(L), then the system F = и@) of 0-neighbourhood in (L,u) has the following property VFeF3GeFV;ceLVy,zeG, xsiy + z => x e F. (*) We denote by TNA{L) the set of all filters F on L with property (*). If F and G are taken according to (*), then s(G) := {x e L: By e G ? ^ }'} is a solid subset of L with G cs(G)c F. So, equivalently, we can define TNA(L) as the set of filters F on L such that F has a basis of solid sets and F + F e F for each F e?. The symbol TJ\fA(L) is to remind one of FN-topologies defined on Boolean algebras. The next theorem shows in particular that any и е CUA(L) is uniquely determined by its 0-neighbourhood system.
928 G. Barbieri and H. Weber THEOREM 4.1.1. (а) ^:ми и@) defines an order isomorphism from CUA(L) onto TNA(L), where и@) denotes the ^-neighbourhoodsystem in (L, u). (b) For и e CUA(L) the sets FA := {(x, y) e L2: ? Ay e F), F e и@), form a basis ofu. PROOF, (i) Obviously ? is well-defined, i.e., и@) е FNA{L) for и е CUA(L) and ? is monotone. (ii) We show that ? (?) с V(i>) implies и с и for и, и e CUA(L). Let ?/ e и. We may assume that ?/ is symmetric. Choose ?/' e и with U' + U' cU and symmetric V, W e и with V@) с U'@) and WvlVAWc V. We show that W С ?/ ? U. Let (.?, у) е W and г = ? ? y. Then (zAjc,0) = ((у,*) v (x,x))A(x,x) e (W ? 1V)AW С V, hence ??* e V@)c i/'@)and (z,Jt) = (?:,?:) + (???:,0) e U' + U' С ?/. Analogously we have (z, y) e i/. It follows that (x, y) eU oU. (iii) V is surjective. Let F e TAfA(L). Then the sets FA, F e F, form a basis of filter u. One easily sees using Proposition 1.1.5 that и is a uniformity and +, ?, л, ? are uniformly continuous. Hence, ? e CUA(L). Obviously ?(?) = F. ? COROLLARY 4.1.2, Any homomorphism between MV-algebras continuous at 0 is uniformly continuous. Proposition 4.1.3. TNA(L) is a distributive complete lattice. If?\, F2 e TAiA{L), then the sets F\ ? F2 and Ft + F2 (F/ e F,) form a basis of?\ ? F2 and ?\ ? F2, respectively. PROOF. Let F„, a e A, be a family in TNA(L). Then the system of finite intersection of sets of UaeA ?a is a basis of a filter F e TAfA(L). Therefore F is the supremum of the family (?a)aeA in TNA(L). Since TAfA(L) contains a smallest element, namely {0, L), it follows that TMA{L) is a complete lattice. We now prove the statement about F| л F2. Let F, e F,-. Choose G, e F, such that for every ? e L and for every y,, zi e G,, the inequality ? ^ y,- + Zi implies j: e F,. Using the decomposition property Proposition 1.1,4 one sees that if y, ? e G\ +G2 and Lsx^y+z then ? e F\ + F2. Therefore the filter F generated by the sets F\ + F> (F, e F,) belongs to TNA(L). Since every filter of TNA{L) coarser than F| and Ft is coarser than F, we obtain F = F, aF2. It easily follows from the description of F| ? Ft and F| л Ft that TAfA(L) is distributive. ? COROLLARY 4.1.4. ?UA(L) is a distributive complete lattice. We have already observed that any submeasure on L generates a uniformity of CUA(L). We now show that, vice versa, any uniformity of CUA(L) is generated by a family of submeasures. LEMMA 4.1.5. Let (S, +) be a semigroup with 0 and let С := (?/„)n€N be a family of subsets of S such that U\ := S, 0 e U„ and U„+\ + U,l+\ + Ull+\ С U„ for every ? e N.
Measures on clans and on MV-algebras 929 Let g : S -> R defined by g(x) = 0 ifxef^iU.-.neN}, g(x) = 2~" ifxeU„\U„+i. Let ? : S -> R defined by ? ?(?) = inf ]Pg(Zi): z\ +zi-\ l·Zp 11 = 1 Then ? is subadditive, ? ^ 0 and the system {x ? S: ?(?) < ?}, ? > 0, generates the same filter as (i/„)„6N- In the case of S being a group, the proof can be found in Warner A989); without modifications it works also in this case. THEOREM 4.1.6. Every uniformity и ? CUA(L) is generated by a system (??)??? of submeasures defined on L, i.e., the sets {(x, y) ? L2: ??(???) < ?} (a ? ?, ? > 0)form a subbasis for u. PROOF. Since for any и ? OAA(L) there is a family (ua)aeA of uniformities of OAA{L) with countable basis such that и = supaeAua, we may assume that already и has a countable basis. Then there is a sequence U\, U2, ¦ ¦ ¦ of solid O-neighbourhood in (L, u) with U,i+\ + U,i+\ + ?/,,+ ? с U„ (n e N) and U\ = L. Define ? as in Lemma 4.1.5. Then ? is subadditive and ?@) = 0. Moreover, it follows from the Riesz decomposition property that ? is monotone. Therefore ? is a submeasure and [x e L: ?(?) < ?], ? > 0, is a basis for the O-neighbourhood system of (L, u), i.e., ? generates и see Theorem 4.1.1. D PROPOSITION 4.1.7. Let и ? CUA(L) be Hausdorff and (xa)aeA be an increasing (decreasing) net converging to ? in (L, u). Then ? = supxa (x = infxa). PROOF. We consider only the case that (ха)аел is increasing. Then ((xa ? х)Ах)аел is an increasing net converging to (? ? ?) Ax = 0. Therefore (xa ? ?) Ax = 0, hence, xa ^ ? for any a ? A; i.e., ? is an upper bound of (ха)аел- Let now у be an arbitrary upper bound of (Xa)aeA- Then ? = \\mxa = Yimxa л у =х л у, hence, ? ^ у. ? We now introduce some important properties for uniformities of CUA(L). DEFINITION 4.1.8. A uniformity и е CllA(L) is called ?-order continuous (order continuous) if every decreasing sequence (net) with order limit 0 converges in (L, u) to 0. A uniformity и is called exhaustive if every monotone sequence in (L, u) is Cauchy. It is easy to see that a uniformity и of CUA(L) is order continuous (?-order continuous) iff order convergence implies topological convergence in (L, u) for any net (sequence) in L; and that и is exhaustive iff every monotone net in (L, u) is Cauchy.
930 G. Barbieri and H. Weber The following proposition shows that, for L being a Boolean algebra, и е CUA(L) is exhaustive iff every disjoint sequence converges to 0 in (L, u). PROPOSITION 4.1.9. Let и е ?UA(L). и is exhaustive iff every sequence (x„)„m in L with the property ??=\ ?; ^ x'n+i(n € N) converges to 0 in (L, u). PROOF. (=>) If (хп)пец is a sequence in L with s„ := ?'?=\ Xi ^ x'n+i (n e N), then *„+? =?„+|??„ by Proposition 1.1.6. Therefore (jc„)„6n converges to 0 since the increasing sequence Cs„)«sN is Cauchy. («=) Suppose that (a„)„6N is an increasing sequence in L which is not Cauchy with respect to u. Then (a„)„6pj has a subsequence (fe,i)rt6N such that (Ь„+\АЬ„)„ещ does not converge to 0 in (L, u). Let bo = 0 and jc„ = b„Abn-\. Since ?"_| -*; ^ -*,',_,_, by Proposition 1.1.6, the sequence converges to 0 by assumption, a contradiction. If (a„)„6pj is a decreasing sequence, then (a;'()„6pj is increasing, therefore Cauchy in (L,m) as we have already proved. It follows that (а„)„6щ is Cauchy since a„Aam = a'nAa'm. О The property formulated in Proposition 4.1.9 is equivalent to Гзс-disjointness in Butnariu and Klement A993, p. 37). 4.2. Decomposition of order continuous uniform MV-algebras Let L\, L2 be uniform MV-algebras. A topological isomorphism ?: L\ —> L2 is an isomorphism such that ? and ?'] are continuous. By Corollary 4.1.2, ? is then also a uniform isomorphism, i.e., ? and ?'' are uniformly continuous. PROPOSITION 4.2.1. Let L be complete, D a disjoint subset of С\L) with sup D = 1 and и an order continuous uniformity of ?UA(L). Let Ud = и|[о.</] be the relative topology induced by и on [0, d]. Then ?:?\-^>- (? A d)ljeo defines a topological isomorphism from (L,u)onto(Y\deD([0,d],ud). PROOF. By Proposition 1.5.3 ? is an isomorphism, ? is continuous since ? н> х л d is continuous for any d e D. We show that ф'] is continuous at 0 (and therefore continuous by Corollary 4.1.2). Let U and Uq be solid 0-neighbourhood in (L, u) with Uq v Uq С ?/. Let J" be the system of all finite subsets of D directed by inclusion and sf '¦= sup(Z) \ F) for F e ?. Since sf 4- 0 and и is order continuous, jf0 e ?/o f°r some Fo e .F. Let и = | Fq\ be the cardinality of Fo and V a 0-neighbourhood in (L, и) such that ? ? ? · · · ? ?„ e Uq if zi,...,z„e V.Then W := {(^) e П</6о[°' rfl· x<i ? V for d e F0) is a 0-neighbourhood in П([0'^]' "</)· For any (д-^) e W we have 0~'((¦*</)) = suprf6D^/ ??/. ? PROPOSITION 4.2.2. For и e CUA{L) the intersection N(u) of all 0-neighbourhood in (L, u) is a closed ideal in L. If L is complete and и order continuous, then N(u) = [0, a] for some a e C(L).
Measures on clans and on MV-algebras 931 PROOF. Obviously, N(u) is an ideal. N(u) is closed since any point of a uniform space has a neighbourhood basis of closed sets. To prove the second statement, consider N(u) as an upwards directed set. Since и is order continuous this directed set converges to a := sup N(u). Since N(u) is closed, we obtain a e N(u), hence N(u) = [0, a]. Since 2a e N(u) and therefore 2a = a, we have a e C(L) by Proposition 1.2.1. D Тн EOR EM 4.2.3. Let L be complete. (a) Then CUA(L) contains at most one order continuous Hausdorff uniformity. (b) If и e CUA(L) and и is Hausdorff and order continuous, then (L, u) is (topol- ogically and algebraically) isomorphic to a product ?\??/\(^?, m) of metrizable MV-algebras. PROOF. Let ? be the finest order continuous uniformity of CUA(L), и be an arbitrary Hausdorff order continuous uniformity of CUA(L) and Л be a set of submeasures generating u. For ? e ? there is by Proposition 4.2.2 an aA e C(L) with ?(?) = [0, ??]. Since и is Hausdorff, we have ????€? aA = 0 and 5ирЯбЛ ?· = 1. Let (<?0л.6л be a disjoint family in C(L) with d\ < ?'? for any ? e Л. Let u,_ be the uniformity generated by ? on Lx := [0, dx]. For any order continuous submeasure ? on L we have N(/?k,.) Э {0} = ?(?-\?*?)- Therefore, by Theorem 2.6, the d,,-uniformity is coarser than их. Since this holds for any order continuous submeasure ? on L, we obtain that v\lx С их. Obviously v\Lk D u\l-k Dux. Therefore ? |^? = u\l} = их. It follows with Proposition 4.2.1 that и = ? and (L, u) is isomorphic to ????^?- их). П THEOREM 4.2.4. Let L be complete. Let u,v e CUA{L) order continuous. Then N(v) С N(u) iff и С и. PROOF. (<=) Is obvious. (=>) By Proposition 4.2.2 ? = supN(w) e C(L). The uniformities ?|[?,?'] and и|[о.й'] are order continuous and Hausdorff and therefore equal by Theorem 4.2.3(a). ?|[?.?] is the trivial uniformity and therefore coarser than u|[o.«]. It follows from Proposition 4.2.1 that и с v. ? Theorem 4.2.5. LetueCUA(L). (a) If L is complete as a lattice and и is order continuous, then и is exhaustive and (L, u) is complete as a uniform space. (b) If и is exhaustive and (L, u) is complete as a uniform space and Hausdorff, then L is complete as a lattice and и is order continuous. PROOF, (a) Obviously, order continuity implies exhaustivity. Let a = supN(u). Then (L,u) is isomorphic to ([0, а], и|[0.л]) х ([0> a'L и|[0.л'))· и induces on [0, a] the trivial uniformity and on [0, a'] a Hausdorff uniformity. Therefore, by Theorem 4.2.3(b), ([0, а'], и\[0м']) is isomorphic to a product Плел^' и^ of metrizable MV-algebras. Since и is order continuous and L is complete, uA is order continuous and Lx is complete (as a lattice) and therefore (Lx, их) is complete uniform space by Theorem 2.5. Since a product of complete uniform spaces is complete, it follows that (L, u) is complete.
932 G. Barbieri and H. Weber (b) Let (xa) be an increasing net. Then (xa) is a Cauchy net in (L,u) since и is exhaustive and therefore has a limit ? in (L, u) since (L, и) is complete. By Proposition 4.1.7 ? = sup jc„ . It follows that L is a complete lattice and и is order continuous. D THEOREM 4.2.6. Let и be an exhaustive Hausdorff uniformit}' of CUA(L) such that (L,u) is complete. (a) Then the isomorphisms of Theorem 1.5.4 (a)-(c) are also topological isomorphisms. Here [0, af], [0, ax], [0, c] are endowed with the uniformity induced by u; ?„ and Coo with the real Euclidean uniformity and the products have the product uniformity. (b) [0, ay] is totally disconnected and compact, [0, ax] is arcwise connected and compact, [0, c] is arcwise connected and not compact ifc^O. PROOF, (a) follows from Proposition 4.2.1 and the uniqueness of the Hausdorff order continuous uniformity of CUA(C„), neNU {oo} \ {1}. (b) [0, aj] is topologically isomorphic to the totally disconnected and compact space TiaeA^-'ia- [0>азс] is topologically isomorphic to the arcwise connected and compact space ??,. Suppose с ? 0. The centre С of [0, c] is an atomless Boolean algebra, and closed in (L, u) since С = {? e [0, с]: ? =2?], hence, complete (as a uniform space). Therefore there is by Volkmer and Weber A983, Satz 1.4) a continuous map y.I^-C defined on the real closed unit interval / with y@) = 0 and ? (I) = с. For any a e [0, c], a(t) := y(t) A a defines then a continuous map <*:/—>· [0, c] with <*@) =0and a(\) = a. Therefore [0, c] is arcwise connected. Since С is connected, С cannot be compact (see Kaplansky A947)). Therefore [0, c] is not compact. ? Under the assumptions of Theorem 4.2.6, L is compact or totally disconnected or connected, respectively, iff с = 0 or с = ax — 0 or at = 0. This leads to the following characterization of compact, totally disconnected or connected MV-algebras. COROLLARY 4.2.7. Let и be an exhaustive Hausdorff uniformity of CUA(L) such that (L,u) is complete. (a) Then L is atomic iff (L,u) is totally disconnected iff (L,u) is isomorphic and topologically isomorphic to a product Y[aeA C„a for a e A and na e N \ {1}. In particular if (L, u) is totally disconnected, then (L, u) is compact. (b) L is atomless iff {L, u) is arcwise connected iff (L, u) is connected. COROLLARY 4.2.8. Let ? a Hausdorff topology definedon L which makes continuous the operations + and' ofL. Then (L, ?) is compact iff(L, ?) is isomorphic and topologically isomorphic to a product Cf^ ? Па6д?„ц for a e A and na e N. Proof. (=>) Since ? is compact, ? is induced by a uniformity u, and + and ' are uniformly continuous with respect to u. Therefore и е CUA(L). Moreover, и is exhaustive by Weber A991/1993, 6.1) and complete. Therefore и satisfies the assumptions of Theorem 4.2.6, and so L is isomorphic to [0, c] ? ?^ ? П„6д?„а, and с = 0 since и is compact. D
Measures on clans and on MV-algebras 933 4.3. The lattice CUA(L, w) We will study uniformities as well as measures on L passing to a suitable uniform completion of L. This is motivated by the fact that the used uniform completion is easier to handle since it is complete as a lattice and its uniformity is order continuous. PROPOSITION 4.3.1. Let w e CUA(L) be Hausdorff. Then (L,w) is a dense MV- subalgebra of a Hausdorff complete uniform MV-algebra (L, w). Proof. Let (L,w) be the uniform completion of (L,w). Then the addition and the complementation ' of L have a continuous extension on L. So (L, w) becomes a uniform MV-algebra. ? (L, w) of Proposition 4.3.1 is called the completion of (L, w). PROPOSITION 4.3.2. Let L be a dense subspace of a uniform MV-algebra (L, w). Then w is exhaustive iff w\t is exhaustive. PROOF. (=>) is obvious. («=) By Theorem 4.1.6, it is enough to prove that any continuous submeasure ?^on (L, w) is exhaustive if ?\? is exhaustive. Let (?,?),?6? be an increasing sequence in L, ? > 0 and b„ e L with /?(?„?/?„) ^ ?1~". Then c„ := sup,-^„ b\ defines an increasing sequence in L and is therefore a Cauchy sequence with respect to d,t. Moreover, /j(c„Aa„) = /j(sup,<„ bjA supl<:„ ?,) ^ ??'=? ?????,) ? ?. It follows that (an) is a Cauchy sequence with respect to dn. ? From Proposition 4.3.2 and Theorem 4.2.5 follows: COROLLARY 4.3.3. Let w e ?UA{L) be Hausdorff and exhaustive and (L, w) be the completion of(L, w). Then (L, $5) is a complete lattice and w is order continuous. For w e CUA(L) we put CUA(L, w) := {u e CUA(L): и с w]. The following proposition can easily be verified using Theorem 4.1.1. PROPOSITION 4.3.4. Let w e CUA{L). Denote by a the equivalence class of L : = L/N(w) containing a and let U := {a: a e U}for U с L. Then, for и е ?UA(L, w), the sets U(U e u@))form the 0-neighbourhood basis of a uniformityие CUA(L). ?:«и« defines an order isomorphism from CUA(L, w) onto CUA(L, w). THEOREM 4.3.5. Let w be an exhaustive uniformity ofCUA(L), (L, w) the completion of {L,w) := (L,w)/N(w). Then r:u н> u\i defines an order isomorphism from ?UA(L, w) onto CUA(L, w). s :u \-^> supN(u) and s о г~' о q define a dual order isomorphism from CUA(L, w) onto C(L) and from CUA(L, w) onto C(L), respectively. PROOF. The statement about r follows from the fact that for и е CUA(L, w) the set of all continuous submeasures ?: (L, w) —> [0, +oo] such that the restriction ?\? is continuous with respect to и generates the unique uniformity и e ?UA(L, w) with r(u) = u.
934 G. Barbieri and H. Weber It follows with Proposition 4.2.2 and Theorem 4.2.4 that s is a dual isomorphism. With Proposition 4.3.4 one obtains that ior'o^ isa dual order isomorphism. D Remark 4.3.6. Let w e ?UA(L) be exhaustive and (L, w) be the uniform completion of (L, w). Then L is complete as a lattice, C(L) is a complete Boolean algebra and OAA{L, w) is the lattice of all order continuous uniformities of CUA(L). PROOF. The first two statements follow by Theorem 4.2.5 and Proposition 1.2.3. For the last statement, let и be order continuous and a = supN(u). If ? e CUA(L, w) such that s(v) = a, then we have, since N(u) = N(v), by Theorem 4.2.4 и = v. D Remark 4.3.7. By the notation of Theorem 4.3.5, let ф(и) be the FN-topology induced by и on C(L). Then и н> ф(и) is an order isomorphism from CUA{L, w) onto the space of all order continuous FN-topologies on C(L). Corollary 4.3.8. If w is an exhaustive lattice uniformity ofCUA(L), then CUA(L, w) is a complete Boolean algebra. COROLLARY 4.3.9. By the notation of Theorem 4.3.5 «н>й:= с(Г> " an or^er isomorphism between ?UA(L, w) and ?UA(C(L), w|C(?,)· 4.4. The uniformity generated by a measure In this section let ? : L -> ? be a measure with values in a locally convex linear space. This measure raises a uniformity in a natural way. Definition 4.4.1. With the notation as in Example 2.1. Let pa, a e A, be a family of seminorms generating E. The uniformity generated by the submeasures (||?||/,11)?6? is called ?-uniformity. The ?-uniformity can also be described without using seminorms on E. In fact the sets U* := {x e L: ?([0,.?]) С U], where U is a O-neighbourhood in ?, form a filter base of TNA(L). The ?-uniformity is the uniformity which corresponds to this filter according to Theorem 4.1.1. This description of the ?-uniformity immediately yields that ?(?) = ? (?-uniformity). PROPOSITION 4.4.2. For и е CUA(L) the following conditions are equivalent: A) ? : (L, u) —> ? is continuous at 0. B) ? : (L, u) —> ? is uniformly continuous. C) ?-uniformity is weaker than u. PROOF. Denote by ? the ?-uniformity. Obviously, B) is equivalent to C), i.e., to и с и; and A) is equivalent to u@) С i/@).But и@) С и@) iff и С и by Theorem 4.1.1. D
Measures on clans and on MV-algebras 935 By Proposition 4.4.2, the ?-uniformity is the weakest uniformity of ?UA(L) which makes ? uniformly continuous. In Barbieri and Weber A998, Section 3.1) it is observed that the ?-uniformity is also the weakest lattice uniformity that makes ? uniformly continuous. Therefore Definition 4.4.1 is in accordance with the definition of the ?-uniformity given by Weber A996). DEFINITION 4.4.3. (a) ? is order continuous if xa I 0 implies ?(??) -> 0 for each net (xa) in L. (b) ? is exhaustive iff {?{?„))„?? is a Cauchy sequence for each monotone sequence (*„) in L. PROPOSITION 4.4.4. (a) ? is exhaustive (?-order continuous, order continuous) iff the ?-uniformity is so. (b) ? is exhaustive iff for every sequence (x,,),ieN in L with J2"= \ ¦*/ ^ x'll+i (n e N) the sequence (?(?,?))??? converges to 0. PROOF, (a) («=) Follows from the uniform continuity of ?. (b) (=>) Let (jc„)„6n be a sequence in L with s„ := ?"=? ¦*/ ^ x'„+\(n e ^)· Then ?(??+\) = ?(?„+|) — ?(?„) —>¦ 0 since (?(?,())„6? is a Cauchy sequence. Suppose that ?(?„) —>¦ 0 whenever ?"=] Xi ^ x'n+\(n e N). We show that the ?-uniformity is exhaustive using Proposition 4.1.9. Suppose that there is a sequence (>?)„6? in L such that ?"_| >? ^ 3-',',+|(" e ^) anc' (yn)neN doesn't converge to 0 with respect to the ?-uniformity. Then there is a sequence (¦*n);ieN such that x„ ? y,,(n e N) and (мСхя))леМ does not converge to 0, a contradiction since ?"=\ *? < ?"=? У· < ?,+? < <?+? · The fact that the ?-uniformity is (a)-order continuous if ? is (a)-order continuous can be found in Weber A996, 3.5). ? PROPOSITION 4.4.5. Let ?-.L —>· ? be continuous with respect to a uniformity w e CUA(L) and let и be the ?-uniformity. (a) If ? is defined on L := L/N(w) by ?(?) = ? (?) for a e a e L, then ti defined as in Proposition 4.3.4 is the ?-uniformity. (b) If (L, w) is a dense subspace of a uniform MV-algebra (L, w) and ? = ?|?. is the restriction of a continuous measure ?: (L, w) —> E, then и is the restriction of the ?-uniformity. PROOF. We may assume that ? is a Banach space. (a) follows from the fact that ||?||(?) = ||?||(<3) for a e a e L, where ||?|| denotes the semivariation of ? (cf. Example 2.1). (b) It is enough to show that ||?||(?) = ||?||(?) for ? e L. Obviously, ||?||(?) ^ II? II (a): let jc e L andjc ^ a. Let (xa) be a net in L converging to ? in (L, w). ThenJcaAa converges to jc, too. Therefore \?(?)\ = \\?\\?(?? ла)\ ?. ||?||(?). It follows ||?||(?) $5 ||?||(?). D
936 G. Barbieri and H. Weber NOTATION 4.4.6. Let u, ? e CUA(L) and ?, ? measures defined on L. We write и ±v iff и Л и is the trivial uniformity. If и = ?-uniformity, we write also ? J_ ? instead of и J_ u; if moreover ? = ?-uniformity, we write ? J_ ? instead of и 1 ?. ? « и means that ?-uniformity is weaker than v, or equivalently by Theorem 4.1.1, ? is continuous with respect to v. If ? = ?-uniformity, we write ?«?; instead of ? <<C ? and we say that ? is v-continuous. 5. Measures on MV-algebras with values in a locally convex space In this section, let (?, ?) be a complete Hausdorff locally convex linear space. 5.1. Representation of measures In Theorem 5.1.3 we give an isomorphism between the space of all ?-valued exhaustive measures on an MV-algebra and the space of all ?-valued order continuous measures on a complete Boolean algebra. This isomorphism allows us to transfer results known for measures on Boolean algebras to the case of measures on MV-algebras. PROPOSITION 5.1.1. Let Abe an algebra of sets. (a) Then ? н>- ??, T?(f) = J f ?? defines a linear isomorphism between ba(A, E), the space of all ?-valued bounded finitely additive measures defined on A and the space L((Coo(A), || Hoc), E) of all ?-valued linear continuous maps defined on Coo{A). (b) A measure ? eba (A, E) is (?-) order continuous iff ?? : Cx (A) —>¦ ? is (?-) order continuous. PROOF, We only prove (b) in the order continuous case. Identifying sets with their characteristic functions, A becomes a regularly embedded sublattice of Coo(A) (cf. Theorem 1,3,3 and Proposition 1.2.3) and ? is the restriction of ?? to A. This immediately yields the proof of («=), (=>) Since ? can be embedded in a product of Banach spaces, we may assume that ? is a Banach space. Let ? e ba(A, E) be order continuous. Then the semivariation ||?|| of ? is by Proposition 4.4.4 order continuous, too. Let (fa)aeA be a net in CX(A) with fa I 0. Then, for any a e A, there is a sequence (/г„.„)„6м in S(A) with han \. fa and ||/г„,„ — /?||?? < l/n. With the order defined on ? := ? ? ? componentwise we have /г/3 ; 0. Let ? > 0. Then {hp ^ ?) ; 0, hence ||?||({/?/3 ^ ?}) -> 0. Therefore (hp) converges in measure to 0. By the dominated convergence theorem, ^??? —> 0. It follows that / /„ ?? -> 0. ? THEOREM 5.1.2. Let L be ?-complete. Then ?:?? Mlc(L) defines a linear isomorphism from the space ca(L, E) of all ?-order continuous ?-valued measures on L onto ca(C(L), ?), ? is order continuous iff ?\c^L) is order continuous. Moreover, ???(?,) = со?@\L)) for ? e ca(L, E).
Measures on clans and on MV-algebras 937 PROOF. Since C(L) is regular embedded in L by Proposition 1.2.3, the restriction r (?) := /J-lc(L) belongs to ca(C(L), E) for any ? e ca{C{L), E). Obviously r is linear. r is injective. Let ? e ca(L, E) with ?(?) = 0. Then for any continuous linear functional x' on ? we have ?' ? ?|(;(?.) = 0 and therefore .?' ? ? = 0 by Theorem 3.1.4. It follows that ? = 0. r is surjective. By Theorem 1.3.3 we may assume that L is a Гзс-clan of fuzzy sets regular embedded in CX(A) for some algebra A of sets and that A = C(L). If ? e ca(C(L), E), then the integral operator Tv defined in Proposition 5.1.1 is ?- order continuous by Proposition 5.1.1. Therefore ? := Tv\l e ca(L, E) and ?(?) = v. Proposition 5.1.1 (b) shows that ? is order continuous iff ? is order continuous. To prove the last statement, observe that for ? e ca(L, E) and ? = ?(?) we have со ?(?) = [f f dv: f e S(A)} (cf. Diestel and Uhl, 1979, proof of Lemma IX. 1.3, p. 263). Therefore ?(?) с ?(?,) с {f f dv: f e CX(A)} С {j f dv: f e S(A)} = ???(?). Hence, ???{?) = ???(?). ? THEOREM 5.1.3. Let w be an exhaustive uniformity of CUA(L) and (L,w) be the completion of(L, w)/N(w). (a) Then for any continuous measure ? : (L, w) —> ? the function ?: L —>¦ ? defined by ?(?) = ?(?), ? e x e L, has a unique continuous extension ?: (L, w) —> E. Denote by ?: C(L) -+ ? the restriction of ? to the centre C(L) of L. Then ?(?,) = fi(L) ???~???{?) = cop.(C(L)). (b) ди/i defines an isomorphism from the linear space of all continuous measures ? : (L, w) —> ? onto the linear space of all ?-valued order continuous measures on C{L). (c) If F is a complete locally convex Hausdorff linear space and ? : (L, w) —> ? and ?: (L, w) —> F are continuous measures, we have ?«?; iff ? «? and ? J_ ? iff ? J_ у. Moreover, if ? : ? -^> F is a continuous linear map and ? = ? ? ?, then ? = ? ? ?, ? = ? ? ? and ? = ? ? ?. PROOF, (a) ? has a unique continuous extension ? to (L, w) since ? is uniformly continuous with respect to w. Then the range of ? is dense in the range of ?, therefore ?(?,) = ?(?,) = fi(L) and со?(?,) = cofi(L). Now observe that, by Corollary 4.3.3, L is a complete lattice and w is order continuous, therefore ? is order continuous and cofL(C(L)) = ~???(?) by Theorem 5.1.2. (b) Obviously, ? н> ? (? н> ?) defines a linear 15отофЬ15т between the space of all w-continuous measures on L (of all ю-continuous measures on L) and w-continuous measures on L, respectively. To show that ? t->- ? defines a linear isomorphism from the space of all w-continuous measures on L and the space of all order continuous measures defined on C{L) use Theorem 5.1.2 and the fact that a measure on L is order continuous iff it is ю-continuous. If ?: L —>¦ ? is an order continuous measure, then the ?-uniformity is order continuous by Proposition 4.4.4 and therefore by Remark 4.3.6 weaker than w, i.e., ? is ю-continuous, see Proposition 4.4.2. As already observed in the proof of (a), any ю-continuous measure is order continuous since w is order continuous. (c) The first statement follows from Corollary 4.3.9 and from the fact that if и is the ?-uniformity, then й is the ?-uniformity. To show the last statement, we have to show that
938 G. Barbieri and H. Weber the correspondent elements of C(L) with respect to the isomorphisms of Theorem 4.3.5 are equal. Thanks to Proposition 4.4.5 it is sufficient to prove that for a e C(L) we have ?(?) = 0 for every ? e L with ? < a iff ?(?) = 0 for every ? e C{L) with ? < a. To prove («=), we put v{z) := ?(? ? ?) for ? e L. From ? = 0, we have ? = 0 and hence ? = 0, i.e., ?(*) = 0 for every ? e L with jc < a. The second statement of (c) is obvious. ? 5.2. Decomposition theorems With the aid of Theorem 5.1.3 we now transfer a general decomposition theorem for measures from the Boolean case to the case of MV-algebras. THEOREM 5.2.1. Let ? be exhaustive and и е CUA(L). Then there are unique measures ? ?, ??: L —>· ? such that ? = ? ? + ??, ? \ <? и and ?? J_ и. Moreover, the ?-uniformity is the supremum of the ? \ -uniformity and the ??-uniformity. PROOF. Existence. Let w be the finest exhaustive uniformity of ?UA(L), i.e., the supremum of all exhaustive uniformities of CUA(L). Replacing и by и л w, we may assume that и is coarser than w. We use the notation of Theorem 5.1.3. By Weber B002, 9.2) ? has a decomposition of the form ? = ? ? + ?2, where ? ? and ?2 are ?-valued //-continuous measures on C(L)X| « и, Аг 1 и and ? is the supremum of the ? ?-uniformity and the X2-uniformity. By Theorem 5.1.3 there are w-continuous measures ?, : L —>· ? with ?,- = ?,· decomposing ? according to Theorem 5.2.1. Uniqueness. If ? = ? ? + ?? = ?\ + ?>2 are two measures decompositions with ? ? <g и, v\ <K и and ?2 -L и, ^2 -L и, then ? := ? ? - v\ = ?>2 - M2 J- " and ? <? и, hence, ? = 0. D As in the Boolean case, different choices of и yield different decomposition theorems. If и is the yo-uniformity for some measure p, then Theorem 5.2.1 becomes the Lebesgue decomposition theorem. If we take for и the supremum of all ?-order continuous uniformities of CUA(L), then Theorem 5.2.1 yields the Hewitt-Yosida decomposition theorem. THEOREM 5.2.2 (Hewitt-Yosida decomposition). If ? is exhaustive, then there are unique measures ? \, ?2: L —>¦ ? such that ? ? is ?-order continuous and ?? is purely поп ? -order continuous. Here ? is called (in accordance with Definition 3.1.11) purely non-?-order continuous if the zero measure is the only ?-order continuous measure v: L —>¦ ? with ? <5? ?. In this case there is also no non-zero ?-order continuous and ?-continuous measure with an arbitrary range space. This follows from Remark 5.2.3. If ?: L -> ? is an exhaustive measure and и е CUA(L) is coarser than the ?-uniformity, then there is a measure ? : L -> ? such that и is the ?-uniformity.
Measures on clans and on MV-algebras 939 PROOF. Let ? = ?\ + ?? be the decomposition according to Theorem 5.2.1, ? the ?- uniformity and u,- the ?,-uniformity. Then и л v\ =v\ and и л V2 = 0, hence и = и ? ? = и л (и ? ? V2) = (и л ?\) ? (и л V2) = v\, i.e., и is the ? ? -uniformity. ? 5.3. Hammer-Sobczyk's decomposition theorem and Lyapunov's theorem THEOREM 5.3.1. Let (L, <) be complete, ? order continuous and ?(?) = {0}. Then there is a ?-continuous measure v. L —>· ? and there are ?-continuous positive measures pa : L —>· R a/id ?/,: L —>· R a/id elements ya, Zb ^ ? (a e A, b e B) with the following properties: A) (/0а(л:)>'а)а6д and (ai,{x)zb)b<iB ore summable uniformly in ? e L; ? = ? + ? +? where p(x) := J2aeA Ра(х)Уа anda(x) := Y^heBob{x)Zb\ B) pa{L) = C„Jor SOW6 H(i eN\(l), ob(L) — Coo (a e A, b e B); p(L) is compact; a(L) is convex and compact; cov(L) = cov(C(L)), v(L) is arcwise connected; C) the restriction v\c(L) is an atomless measure; D) ? =0 iff L is atomless. PROOF. We use the notation as in Theorem 1.5.4. Put A = A/(L) and В = A<x(L). For d e A U ?, the interval [0, d] is an irreducible complete ?V-algebra. Therefore there is by Proposition 1.5.2 an isomorphism kj: [0, d] -> ?„(/ where nj e ? \ {1} for d e A and nj = oo for d e B. We put v(x) = ?(? л с), ра(х) = Ki(x ? ?), ?/>(.?) = ?/,(^ л b), ya = ?(?), ?/, = ??) ??? a e A, b e В and jc e L. Let d e AU B. Since ?(* л d) = ??·(? ??)?(?) for any ? e C(L), the equality holds by Theorem 5.1.2 even for any ? e L. Therefore ?(? л ?) = А,(-х);уа and ?(* л />) = a/,(;c)z/, for ? e L, ? e A and /? e S. We now verify the properties from A) to D). A) Let || ? II,, be the semivariation with respect to a fixed continuous seminorm ? on E. For any finite subset F of A and ? e L we have ?(? ла/)-^ pa(x)ya = ?(? ? af) - ]??(.? ? ?) = ?(.? ? (?/ \ sup F)), aeF aeF therefore ?(?(? л af) - ??<=?- Ра(х)Уа) < II?II/Да/ \ sup F). Since ||?||,, is order continuous, the right-hand side converges to 0. Therefore {ра{х)Уа)ачА is summable uniformly in ? e L and p(x) := ???? Ра(х)Уа = ?(? л af). In the same way one sees that (ob(x)zb)beB is summable uniformly in ? e L and ? (*) = ?(^ л ???). Therefore v(x) + р(х) + ? (?) = ?(? Ac) + ?(? Aaf) + ?(? ??,?) = ?{?). a(L) is the image of the compact convex set C^ under the continuous affine map Ub)beB ^ iLbeB^Zb and therefore it is compact and convex. Similarly we obtain that p{L) is compact. Since [0, c] is arcwise connected by Theorem 4.2.6, v(L) = ?([0, с]) is arcwise connected. By Theorem 5.1.2 we have cov(L) = cov(C(L)). C), D) and the last statement of B) hold obviously. ? The last result allows us to transfer results about the range of measures from the Boolean case to the case of MV-algebras. If in Theorem 5.3.1 v(C(L)) is compact and convex,
940 G. Barbieri and H. Weber then v(L) = v(C(L)) and therefore ?(?,) is compact as a sum of three compact sets (see Theorem 5.3.1B)); if, moreover L is atomless, then ?(?,) is convex. This shows that the classical Lyapunov theorem for measures on Boolean algebras yields the following version of Lyapunov theorem for measures on MV-algebras. THEOREM 5.3.2. Let L be ?-complete, ? e N and ?: i, -> R" a ?-order continuous measure. (a) Then ?(?,) is compact. (b) If ? is atomless, then ?(?,) is convex. For the proof, we can apply Theorem 5.3.1 to the quotient measure ?: L/?^?) —>¦ К"· With a quite other method Butnariu and Klement A993) have proved Theorem 5.3.2(b) for ?-order continuous measures on tribes. A finitely additive version of Theorem 5.3.2(b) is contained in Barbieri B001b). Another consequence of Theorem 5.3.1 is the decomposition Theorem 5.3.5 for exhaustive measures on MV-algebras, which generalizes the Hammer-Sobczyk decomposition theorem for real-valued bounded measures on Boolean algebras (see Bhaskara Rao and Bhaskara Rao A983)). We recall the concept of chained uniform space. A uniform space (X,u) is called chained if for x, у e X and U e и there are jco, x„ e X with xq = x, x„ = у and (*,-_ \,Xi)eUfori=l,...,n (see Hejcmann A959, p. 547) and Bourbaki A966, p. 204)). PROPOSITION 5.3.3. (a) Every connected space is chained. (b) // ? is a dense subspace of a uniform space X, then X is chained iff ? is chained. The following proposition shows in particular that a charge ? on a Boolean algebra A is strongly continuous in the sense of Bhaskara Rao and Bhaskara Rao A983) iff A is chained with respect to the ?-uniformity. PROPOSITION 5.3.4. Letu e CUA(L). Then (L, u) is chained iff every O-neighbourhood U in (L,u) contains a finite number of elements x\,..., x„ eU such that ??'=? ?? = 1· PROOF. (=>) Let U be a solid O-neighbourhood in (L,u) and ao,...,a„ with ao = 0, an= 1 and ?(·??,-_? eU (i = \,...,n). Put st := sup{ao, · · · .a/} for /' > 0 and *,- := SjAsj-\ for/ ^ l.Then ]T"=| ¦*; =s„ = \ and jc, = (j,-_i ??,-)?(?/_? ??,-_?) ^ ?(-??,-_?, hence, *; eU (/ = 1,..., n). (<=) Let U be a solid O-neighbourhood in (L, u) and x\,...,x„ e L with ]Г"=| л-,· = 1 and x,- e U (i = ?,.,.,?). Let a,b e L. Put jc0 = 0, a, = a ? ?'^?^' and b' = b л ?'^?^' (' = 0, ...,и). Then ?,-??,-? ^ *,-, hence, ?,??,-? e i/ and similarly b;Abj-\ e i/. Therefore for (zo, · · · ,Z2«) := (a;i. ...,a\,0,b\, ...b„) we have zo = a, Z2„ =b and ?,??,-1 e U (i = 1, ...,2n). ? THEOREM 5.3.5 (Hammer-Sobczyk decomposition), //? ? exhaustive, then there is ? ?- continuous measure ?: L —> ? and there are ?-continuous positive measures pa : L —>¦ ? andelements ya e ?(a e Л) лмс/г that
Measures on clans and on MV-algebras 941 A) (ра(х)Уа)аеА " summable uniformly in ? e L; ? = ? + ? where ? := ?? ел РаУа', B) L is chainedwith respect to the ?-uniformity; C) pa(L) = C„a for some na e ? \ {1} (a e A); ^(L) и relatively compact; k(L) is connected. PROOF. Let w be the ?-uniformity, (L, w) the completion of (L, w) = (L, w)/N(w) and ?: L —>¦ ? the continuous extension of ?, where ? is defined as in Proposition 4.4.5. Then (L, $5) is complete and ? is order continuous. Let ? = ? + ? + ? and ? = ???? РаУа the decomposition of ? according to Theorem 5.3.1 and ? = ? + ?. Define ?: L —>¦ ? by v(x) = ?>(?) if ? e L and ? e ? e L, and analogously ?, ?? and ?. Then obviously A) holds. We will prove B): let и be the ?-uniformity, и the ?-uniformity and и = и|?. By the notation of Theorem 1.5.4, let ax = supAoc(L),a f = sup A/ (L) and с = (?? ??/)'. In the proof of Theorem 5.3.1 it is mentioned that v(x) = ?(? л с) and ?(?) = ?(? ла^с) for ? e L. Therefore ?(?) = ?(? л (с ? ах)), ? е L. Since the interval [0, с ? ???] is, by Theorem 4.2.6, a connected subspace of (L, w) and ? is ю-continuous, L is connected with respect to u. By Proposition 5.3.3, (L, u) and its dense subspace (L, w) are chained. Since (L,u)/N(u) = (L,??), by Proposition 4.4.5, (L, и) is chained, too. ^ ^ We will prove C): pa(L), p(L) and ?(?,) are dense subsets, respectively, of pa(L), p(L) and^X(L). By Theorem 53.1B), pa(L) = ?„,, for some nu e ? \ {1} and therefore finite, p(L) is compact and ?(?,) is connected as a sum of the connected set v(L) and cr(L). Therefore pa(L) = pa(L) = ?„o, p(L) is relatively compact and ?(?,) is connected. D 5.4. Vitali-Hahn-Saks-Nikodym theorem andNikodym boundedness theorem We here transfer Vitali-Hahn-Saks-Nikodym convergence theorem and Nikodym boundedness theorem from the Boolean case to the case of measures on MV-algebras. With another method P. de Lucia and E. Pap have given theorems of this type in a more general setting. For details, we refer to de Lucia and Pap B002). Definition 5.4.1. A sequence (м,я)тем of measures is called uniformly exhaustive if for each monotone sequence (a,,)neN in L the sequence (?,„ (a„))n6pj is Cauchy uniformly with respect to m e N. THEOREM 5.4.2. Let ? : L —>· (?, ?) be an exhaustive measure and ? a Hausdorff linear topology on ? weaker than ?. Then ?-uniformity with respect to ? coincides with ?- uniformity with respect to ?. Proof. Let ?:(?, ?)-> (?,?) the identity map. We apply Theorem 5.1.3 with w being the ?-uniformity with respect to ?. Put ? := ? ? ?. Then by Theorem 5.1.3(c), ? = ? ? ?. As ? (?) = ? (?), we have ? <<C ? and ? <<C ? and by Theorem 5.1.3(c) ? <? ? and ? <? ?, i.e., the ?-uniformity with respect to ? coincides with the ?-uniformity with respect toa. ?
942 G. Barbieri and H. Weber Proposition 5.4.3. Letwe CUA(L) ????,,: (L, w) -> (?, ?), ? e ?, be a pointwise bounded, uniformly exhaustive sequence of continuous measures. Then ?,,.??,») —> (?, ?), ? eN, ? equicontinuous. PROOF. We denote by 1~?(?) the space of all ?-valued bounded sequences, by ?-» the topology of uniform convergence on lx(E) and by ?? the topology of pointwise convergence on loo(E). Then ? := (?„)„6? is a measure on L with values in /«;(?). Denote by Wcc the ?-uniformity with respect to ?? and by w,, the ?-uniformity with respect to ??. As ?: L —>· A?(?), ??) is exhaustive, we obtain by Theorem 5.4.2 Юоо = wp. Since ?„ : (L, w) —>¦ (?, ?) is, for every ? e N, continuous, we have wp С w. It follows Woo С w, i.e., (?„)?6? is equicontinuous with respect to w. ? PROPOSITION 5.4.4. Let (?„)„6? ^e ? pointwise bounded sequence of exhaustive measures defined on L with values in E. By the same assumptions and notations as in Theorem 5.1.3, // {?„: ? e ?) is uniformly exhaustive, then (??)„6? is uniformly exhaustive. PROOF. Let loo(E) and ?^ be defined as in the proof of Proposition 5.4.3. Suppose {?„: ? e ?} uniformly exhaustive. Then ? := (/x„)neN: C(L) -+ (?N, ??) is exhaustive. Since (?„)«6? is pointwise bounded by assumptions, ? attains values in loc(E). Moreover, by Proposition 5.4.3 {?„: ? e ?) is equicontinuous with respect to w, i.e., ? : (C(L), w) —>¦ loc(E) is continuous; here w = w\C{ir Let ? : (L, w) -+ lx(E) be the measure which corresponds to ? according to Theorem 5.1.3. Applying Theorem 5.1.3(c) to the projections (x„),!6n ?-» xn from ??(?) into ? we obtain ? = (?„)„6?· Since ?: (L, w) —> loo(E) is continuous and hence exhaustive, (?„) is uniformly exhaustive. D THEOREM 5.4.5. Let L be ?-complete and ?„ : L -> E,n e N, a sequence ofa-order continuous measures, which converges pointwise to ? : L —> E. (a) Г/геи (?„)?6? " uniformly exhaustive and ? is ? ?-order continuous measure. (b) ///or every и e ?, ?„ /j continuous with respect to ? ? ?UA(L), then {?„: ? e ?) U {?} /? equicontinuous with respect to v. PROOF. Let w the uniformity generated by {?„: ? e ?). As ? is the subspace of a product of Banach spaces, we may assume ? is a Banach space. Then (L, w) is complete by Theorem 2.5. Passing to the quotient (L, w) := (L, w)/ N(w), we can suppose w Hausdorff. So (L, w) = (L, w) = (L, w). By classical Vitali-Hahn-Saks-Nikodym theorem the restrictions An:=Mnlcu), neN, are uniformly exhaustive. Hence ?„, ? e ?, are uniformly exhaustive by Proposition 5.4.4 and therefore {?„: ? e ?) U {?} is uniformly exhaustive. Hence, by Proposition 5.4.3, {?„: ? e ?} U {?} is equicontinuous with respect to ? if ?„ <«C ? for any ? e N. Applying this fact for ? = w we obtain ? ? -order continuous. ?
Measures on clans and on MV-algebras 943 THEOREM 5.4.6. Let L be ?-complete and ? a pointwise bounded set of ?-order continuous measures with values in E. Then ? is uniformly bounded. PROOF. As in the proof of Theorem 5.4.5 we may assume ? a Banach space. Since a subset A of ? is bounded iff all countable subsets of A are bounded, we may assume ? countable, i.e., ? = {?„: ? еЩ. Let w the uniformity generated by {?„: ? e ?). Then (L,w) is complete by Theorem 2.5. Passing to the quotient (L,w) := (L,w)/N(w), we may assume w Hausdorff. So (L, w) = (L, w) = (L, w). By classical Nikodym boundedness theorem, {?„: ? e ?) is uniformly bounded, i.e., for some positive real number r we have Д„(С(?)) с {? e ?: \\x\\ < r} for every ? e N. By Theorem 5.1.3 ?,,(?,) с cojln(C(L)) С {? e ?: \\x\\ ^ r]. Hence, {?„: ? e ?) is uniformly bounded. D 5.5. Extension of measures One of the basic results of classical measure theory is the extension theorem for measures from an algebra to the generated ? -algebra. We will generalize this theorem for measures on MV-algebras. An extension theorem in the real-valued case you can find in Jureckova A995). THEOREM 5.5.1. Let L be ?-complete, S an MV-subalgebra of L and ?:? -> ? ? measure. Then ? has a unique ?-order continuous measure extension to the ?-complete MV-subalgebra ?(?) of L iff ? is ?-order continuous and exhaustive. {Here, sup and inf are always taken in L.) PROOF. (=>) is obvious. (<=) Existence. Let ? be a system of seminorms generating the topology of E. Then, for ? e P, the semivariation ||?||? of ? with respect to ? is an exhaustive ?-submeasure on S by Example 2.1 and Proposition 4.4.4. Therefore /j„(a):=inf ]Р||МЫ*/.): х„ eS", a s^J2x" /1=1 /1 = 1 defines a ?-submeasure on L extending ||?||,, for ? e ? by Proposition 2.2. Let и be the uniformity generated on L by {??)?<??, ? the closure of S in (L, u) and ? the continuous extension of ? on ?. Obviously, ? is a measure. We now show that ? is ?-complete (hence ?(?) с ?) and ? is ?-order continuous. Let (a„) be an increasing sequence in ? and a = supa,,. Since и\т is exhaustive by Proposition 4.3.2, (a„) is a Cauchy sequence; therefore (aAan) is Cauchy, moreover ???„ | 0, hence (aAa„) converges to 0 in (L, u) by Proposition 2.5, i.e., (a„) converges to a. It follows that a e ? = ? and ?(?„) -> ? (?) by the continuity of ?. Uniqueness. For / = 1, 2 let ?>, : ?(?) —> ? be a ?-order continuous measure extension of ?, ? := ?| — V2, and и the v-uniformity. Since и is ?-order continuous, the closure S oi S
944 G. Barbieri and H. Weber in (a(S), u) is a ?-complete subalgebraof a(S), hence S = a(S). Since ? is continuous with respect to и and ? vanishes on S, we get ? = 0, i.e., ?>? = ?>?· ? References Barbieri, G. A999), Misure su A-l-semigruppi e su MV-algebre. Ph.D. thesis. Naples. Barbieri, G. B001a), A note on fuzzy measures. J. Electrical Engrg. 52. 67-70. Barbieri, G. B001b), Lyapunov's theorem for measures on D-posets, Intemat. J.Theoret. Phys. (submitted). Barbieri, G., Lepellere, M.A. and Weber, H. B001), The Hahn decomposition theorem for fuzzy measures and applications, Fuzzy Sets and Systems 118 C), 519-528. Barbieri, G. and Weber, H. A998), A topological approach to the study of fuzzy measures. Functional Analysis and Economic Theory, Abramovich, Avgerinos. Yannelis, eds, Springer, Berlin, 17^46. Basile, A. and Traynor, T. A991), Monotonely Cauchy locally solid topologies. Order 7,407^416. Bhaskara Rao, K.P.S. and Bhaskara Rao, M. A983), Theory of Charges, Academic Press. San Diego. Bemau, J. A963), Unique representation of Archimedean lattice groups and normal Archimedean lattice rings, Proc. London Math. Soc.3A5),599-631. Bourbaki, N. A966), General Topology, Part I. Hermann, Paris. Butnariu, D. A983), Decomposition and range for additive fuzzy measures, Fuzzy Sets and Systems 10, 135-155. Butnariu, D. A985), Non-atomic fuzzy measures and games, Fuzzy Sets and Systems 10, 39-52. Butnariu, D. A987), Values and cores for fuzzy games with infinitely many players, Intemat. J. Game Theory 16, 43-68. Butnariu, D. and Klement, E.P. A993), Triangular Norm Based Measures and Games with Fuzzy Coalitions, Kluwer, Dordrecht. Chang, C.C. A958a), Algebraic analysis of many valued logics. Trans. Amer. Math. Soc. 88, 467^490. Chang, C.C. A958b), A new proof of the completeness of Lukasiewicz axioms. Trans. Amer. Math. Soc. 93,74-80. Chang, C.C. A998), The writing of the MV-algebras, many-valued logics, Studia Logica 61 A), 3-6. Cignoli, R., D'Ottaviano, I.M.L. and Mundici, D. B000), Algebraic Foundations of Many-Valued Reasoning, Trends in Logic, Studia Logica Library, Vol. 7, Kluwer, Dordrecht, de Lucia, P. and Pap, E. A995), Nikodym convergence theorem for uniform space valued functions defined on D-posets, Math. Slovaca 45 D), 367-376. de Lucia, P. and Pap, E. B002), Comergence theorems for set functions, Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 125-178. Diestel, J. and Uhl, J.J. Jr A979), Vector Measures, Amer. Math. Soc.. Providence, RI. Di Nola, ?., Georgescu, G. and Lettieri, A. A999), Extending Probabilities to States of MV-algebras, Collegium Logicum, Annales of the Kurt-Godel-Society, Vol. 3. 31-50. Drewnowski, L. A972), Topological rings of sets, continuous set functions, integration I, II, III, Bull. Acad. Polon. Sci. Math. Astr. Phys. 20, 269-286, 439-445. Dunford, N. and Schwartz, J.T. A957), Linear Operators. Parti, Interscience, New York. Dvurecenskij, A. B000), On Loomis-Sikorski's theorem for MV-algebras and BCK-al gebras, Contribut. General Algebra, Vol. 12 (Vienna, 1999). Heyn, Klagenfurt, 165-180. Fleischer, I. and Traynor, T. A980), Equivalence of group-valued measures on an abstract lattice. Bull. Acad. Polon. Sci. 28, 549-556. Fremlin, D.H. A974), Topological Riesz Spaces and Measure Theory, Cambridge Univ. Press, Cambridge. Jakubik, J. A995), On complete MV-algebras. Czechoslovak J. Math. 45, 473^480. Jureckova, M. A995), The measure extension theorem on MV?-algebras. Fuzzy Sets '94 (Liptovsky Jan, 1994), Tatra Mt. Math. Publ. 6, 55-61. Hejcmann, J. A959), Boundedness in uniform spaces and topological groups, Czechoslovak J. Math. 9, 544-562. Kaplansky, I. A947), Topological rings, Amer. J. Math. 69, 153-183. Klement, E.P. A982), Characterization offuzz\ measures constructed by means of triangular norms, J. Math. Anal. Appl. 86 B), 345-358. Lacava, F. A979), Sulla struttura delle C-algebre, Accademia Nazionale dei Lincei Estratto dai Rendiconti della Classe di Scienze fisiche matematiche e naturali, serie VIII LXVII, 275-281.
Measures on clans and on MV-algebras 945 Mangani, P. A973), Su certe algebre connesse con logiche a piii valori. Boll. Un. Mat. Itai. 8, 68-78. Mundici, D. A986), Interpretation of AF C*-algebras in sentential calculus, J. Funct. Ana). 65, 15-63. Mundici, D. A995), Averaging the truth-value in Lukasiewicz logic, Studia Logica 55 A), 113-127. Mundici, D. A999), Tensor products and the Loomis-Sikorski theorem for MV-algebras, Adv. Appl. Math. 22, 227-248. Riecan, B. and Mundici, D. B002), Probability on MV-algebras, Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 869-909. Rose, A. and Rosser, J.B. A958), Fragments of many-valued statement calculi. Trans. Amer. Math. Soc. 87, 1-53. Schmidt, K.D. A989), Jordan Decomposition of Generalized Vector Measures, Pitman Research Notes in Mathematics Series, Vol. 214, Longman, Essex. Traynor, T. A976), The Lebesgue decomposition for group-valued set functions. Trans. Amer. Math. Soc. 220, 307-319. Volkmer, H. and Weber, H. A983), Der Wertebereich atomloser Inhalte, Arch. Math. (Basel) 40 E), 464-474. Vulilch, B.Z. A967), Introduction to the Theory of Partially Ordered Spaces, Wolthers-Nordhoff, Groningen. Warner, S. A989), Topological Fields, North-Holland, Amsterdam. Weber, H. A984a), Topological Boolean Rings. Decomposition of finitely additive set functions. Pacific J. Math. 110 B), 471-495. Weber, H. A984b), Group- and vector-valued s-bounded contents. Measure Theory (Oberwolfach, 1983), Lecture Notes in Math., Vol. 1089, Springer, Berlin, 81-198. Weber, H. A991/1993), Uniform lattices I: A generalization of topological Riesz spaces and topological Boolean rings; Uniform lattices. II: Order continuity and exhaustivity, Ann. Mat. Рига Appl. 160, 347-370; 165, 133- 158. Weber, H. A996), On modular functions, Funct. Approx. 24, 35-52. Weber, H. A997), An abstraction of clan of fuzzy sets, Ricerche Mat. 46 B), 457^472. Weber, H. B002), FN-topologies and group-valued measures. Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 703-743. Zadeh, L.A. A965), Fuzzy sets. Inform. Control 8, 338-353.
CHAPTER 23 Triangular Norm-Based Measures Dan Butnariu* Department of Mathematics, University of Haifa, 31905 Haifa, Israel E-mail: dbutnaru @ math2. haifa.ac. il Erich Peter Klement Department of Algebra, Stochastics and Knowledge-Based Mathematical Systems, Johannes Kepler University, Linz, Austria E-mail: ep.klement@jku.at Contents 1. Introduction 949 2. Triangular norms, fuzzy subsets 949 3. ?-tribes 956 4. 7"-measures and their representation by Markov kernels 958 5. Integra) representation of 7"l -measures 964 6. Integra) representation of monotone 7\-measures 970 7. Decomposition of monotone 7\-measures 973 8. Jordan decomposition of bounded 7"L-measures 983 9. Jordan decomposition of finite ^-measures 988 10. Absolute continuity of 7"l-measures 991 11. Vector 7"l-measures with Darboux property 992 12. Nonatomic 7"l-measures 997 13. A Liapounoff type theorem for 7"l-measures 1004 14. A Liapounoff type theorem for 7? -measures 1006 References 1007 The first author gratefully acknowledges the support of the Israel Science Foundation founded by the Israeli Academy of Sciences and Humanities. HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. AU rights reserved 947
This page intentionally left blank
Triangular nonv-based measures 949 1. Introduction Triangular norm-based measures, or simply ?-measures, are special real valuations defined on Г-tribes, the latter being classes of fuzzy sets (generalizing the concept of a ?-algebra) based on a triangular norm ?. They attracted the interest of scientists because of their applications in fields like mathematical statistics (see [28,29]), pattern recognition [80,75], possibility and plausibility theory [81,40] and many-valued logics [31,32,69]. Butnariu and Klement [17, Chapter IV] have used ?-measures to identify classes of games for which Aumann-Shapley values in the spirit of [2] exist. Among the basic tools employed in this respect are integral representations of ?-measures. This survey first presents the necessary preliminaries about triangular norms and fuzzy sets, including a concepts of disjointness, emphasizing the special role of Frank t-norms ?? [30]. Next, Г-tribes are introduced, and their most prominent properties are listed. Three different representations for ?-measures (for measures with respect to the minimum Гм and to the Lukasiewicz t-norm Tl, and for general T^-measures) as integral with respect to a suitable Markov kernel follow, as well as a decomposition of monotone ?? -measures. The next group of sections concentrates on measures with respect to the Lukasiewicz t-norm Tl and some of their most important properties: Jordan decomposition, absolute continuity, Darboux property, nonatomicity. The two last sections provide a Liapounoff type theorem concerning the compactness and convexity of the range of vector measures, first for T^-measures (this turned out to be one of the basic tools to show the existence of an Aumann-Shapley value on the space of games with fuzzy coalitions spanned by positive integer powers of monotone Tl-measures [17, Chapters IV-VI]) and then for general Гд-measures, as recently proved by Barbieri and Weber [4]. 2. Triangular norms, fuzzy subsets Fuzzy sets, as introduced by Zadeh [79], are a many-valued generalization of Cantorian sets. For a Cantorian set A the possible truth values of the statement 'x is an element of A' are 0 and 1 (false and true), whereas for a fuzzy set A the truth values of the statement 'дг is an element of A' can be any number in the interval [0, 1]. We shall describe fuzzy sets in more detail below - first we want to provide the tools for extending operations such as intersection and union of sets. In order to do so properly, we consider suitable functions Г: [0, 1]" —>¦ [0, 1] and S:[0, l]2 —>¦ [0, 1], whose restrictions to {0, l}2 coincide with conjunction and disjunction, respectively, and which satisfy such general properties like associativity, commutativity and monotonicity. Such functions are known as triangular norms and triangular conorms, respectively; they proved to be useful in the theory of probabilistic metric spaces, as initiated by Menger [56] and extensively studied by Schweizer and Sklar [73]. In such spaces the distance between two points is measured by probability distribution functions rather than by real numbers, and the triangular norms allow the triangle inequality to be properly generalized.
950 D. Butnariu and E.P. Klement In continuum physics Menger [59,58,57] interpreted triangular norms as rules for generating new probabilistically determined objects from existing ones in the psychophysical continuum space. In accordance with one of the main ideas underlying the foundation of modern game theory in Neumann and Morgenstern [66], we shall use triangular norms as models for interaction rules of economic agents in a way which is similar to the modelling of the interaction of physical particles. Triangular norms are also closely related to the still unsolved general associativity functional equation [21]. In the setting of many-valued logics with truth values in the whole interval [0, 1] (generalizing the Boolean two-valued logic whose set of truth values equals {0, 1}), triangular norms are natural models for the interpretation of the conjunction (naturally extending the classical conjunction л of the Boolean logic) [31,32,55,50]. Dehnition 2.1. (i) A function T:[0, l]2 —>¦ [0, 1] is called triangular norm{t-norm for short) if it satisfies the following conditions for all ij,ze [0, 1]: (Tl) T{x,\) = x, {boundary condition) (T2) T(x,y) ???(?,?) whenever y^z, (monotonicity) (T3) T(x, y) = T(y, x), (commutativity) (T4) T(x, T(y,z)) = T(T(x, y),z). (associativity) (ii) If ? is a triangular norm, then the corresponding triangular conorm (t-conorm) is the function 5": [0, 1 ]2 -> [0, 1 ] defined by (TCO) S(x,y)=l-T(l-x,\-y). It is obvious that, given a t-norm T, the corresponding t-conorm 5 of ? satisfies conditions (T2), (T3) and (T4) and, dually to (Tl), (SI) S(x, 0) = x. (boundary condition) A function 5": [0, l]2 -> [0, 1] fulfilling (SI) and (T2)-(T4) is called a triangular conorm (t-conorm). It is easily seen that given a t-conorm 5, then ?: [0, l]2 —>¦ [0, 1] defined by T(x,y)=l-S(l-x,l-y). is a t-norm whose corresponding t-conorm is exactly the t-conorm 5 we started with. Example 2.2. Some of the most important pairs of t-norms and corresponding t-conorms are the minimum Гм and the maximum 5м, the product Tp and the probabilistic sum 5p,
Triangular norm-based measures 951 the Lukasiewicz t-norm 7l and the Lukasiewicz t-conorm 5l, and the drastic product Tq and the drastic sum SO given by, respectively: Tu(.x,y) = \ nin(x, y), Tp(x,y)=x ¦ y, TL(x, у) = тах@, x +y - 1), Ых,у) = ? ify=l, У if x = 1, 0 otherwise, Sm(*,>') = i ъах(х,у). SP(x, y) = ? + у - ? ¦ у, SL(x,y) = min(l,x + y), So(x,y) = ? if у = 0, у ifx=0, 1 otherwis From the boundary condition (Tl) and the monotonicity (T2) it readily follows that Тц is the smallest and Гм is the largest t-norm, i.e., for each t-norm Г we have To < ? < Гм- A t-norm is a rather special semigroup operation on the unit interval [0, 1 ]. It is therefore possible to consider additional algebraic properties such as the Archimedean property and strictness. A t-norm ? is called Archimedeanii for each (x, y) e ]0, 1[2 there is an ? e N such » that x{T < y, where xyT ' is the и-th power of ? with respect to ? defined inductively by m = 1 and >+n ¦7D"'·*) A t-norm ? is called stricni it is continuous and if it satisfies T(x, y) < T{x,z) whenever ? e]0, 1] and y,z e [0, 1] with у <z- Obviously, each strict t-norm is Archimedean. The Lukasiewicz t-norm 7l is continuous and Archimedean but not strict, the drastic product Tq is Archimedean but not continuous (and hence not strict). The minimum Гм is continuous but not Archimedean (and hence not strict), and the product T? is strict (and therefore Archimedean). A very special role is played in this survey by the family of Frank t-norms (Г^)лб(О.эс] (they were called fundamental t-norms in [17]). The starting point for the investigation of Frank t-norms was the following problem in connection with the associativity of duals of copulas [65,74] in the framework of distribution functions and probabilistic metric spaces [73]: characterize all continuous (or, equivalently, nondecreasing) associative functions F: [0, l]2 —>· [0, 1], which satisfy for each ie[0,l] the boundary conditions F@, x) = F(x,0)and F(x, 1)= F(l,x) = x, such that the function G : [0, l]2^ [0, l]givenby G(x,y) = x F(x,y) is also associative. Frank [30] has shown that then F has to be an ordinal sum [20,54,72] of members of the following family of t-norms. In particular this means that each pair G\, S\) of mutually corresponding Frank t-norms and t-conorms satisfies the functional equation T(x,y) + S(x,y) = x +y. A)
952 D. Butnariu and ?.P. Klement Example 2.3. (i) The family of Frank t-norms Gх)х6[о.зс| is given by. Ых,у) = Тм(х,у) Tp(x,y) TL(x,y) log.fl (?-?-1)·(?-?-1) if ? = 1, if ? = oo, ifXe]0,oo[\{l] ?- 1 (ii) The family of Frank t-conorms E;0ле[0.зс| is defined as follows: Sx(x,y) = Sm(x,)>) Sp(x,y) Sh(x,y) l-logx(l (? I-л 1)·(? l-v 1) 1 ifX = 0, if ? = 1, if ? = oo, ifXe]0,oo[\{l] All Frank t-norms are continuous, and a Frank t-norm ?? is strict if and only if ? e ]0, oof. Recall that Tx (= 7l) is Archimedean but not strict, and that Го (= Гм) is neither Archimedean nor strict. The family (Г>.)л6[о.зс| is a continuous family in the sense that for each ?? e [0, oo] we have lim ?? ??0. A nontrivial result is that the family of Frank t-norms (??)?6??.??] is strictly decreasing with respect to the parameter ?, i.e., for ?|, ?? e [0, oo] we have ?·?? < ??? if and only if ? ? > k2- A first (and rather complicated) proof of this monotonicity of the family of Frank t-norms was given by Butnariu and Klement [17, Proposition 1.12], a significantly shorter proof based on the additive generators of the Frank t-norms was published in [47, Example 3.2]. Many more details about Frank t-norms are contained in [30] and in Section 4.4 of Klement et al. [48]. Turning back to an arbitrary t-norm ?, the associativity (T4) allows us to extend this binary operation in a unique way to an /i-ary operation in the usual way by induction, defining for each и-tuple (дг|, дгг,..., дг„) e [0, 1]": ?\, ?2, ···,*«)· The fact that each t-norm ? is bounded from above by Гм makes it possible to extend it to a (countably) infinitary operation, putting for each sequence (*;);'6N of elements of [0, 1]: OO II ? Xj = lim ? Xj.
Triangular norm-based measures 953 Note that the limit on the right-hand side always exists since the sequence (? ? \i=\ /,,6N is nonincreasing and bounded from below. Coming back to sets, a well-known fact about Cantorian sets (for which we shall use the term crisp set to distinguish them from the fuzzy sets to be defined below) is that, given a universe of discourse X, each subset A of X can be identified with its characteristic function A:X —>¦ {0, 1), assigning the value 1 to all elements of X which belong to A, and the value 0 to all remaining elements of X. It was exactly this concept of a characteristic function which was generalized by Zadeh [79] who replaced the two-element set {0, 1} by the unit interval [0, 1], therefore considering a continuum of truth values rather than the two classical truth values true and false (which correspond to 1 and 0, respectively) only. Therefore, given a (crisp) universe of discourse X, a fuzzy subset A of X (or, briefly, ? fuzzy set A) is characterized by its membership function A:X->[0, 1], where for ? e X the value A{x) is interpreted as the degree of membership of ? in the fuzzy set A. The class of all fuzzy subsets of X will be denoted [0, \]x. In a logical setting, the degree of membership A(x) can also be seen as the truth value of the statement 'x is element of A'. Since the membership function of a fuzzy subset of X generalizes the characteristic function of a crisp subset of X, each crisp subset of X is just a special case of a fuzzy subset of X, and its characteristic function is a special membership function (assuming the values 0 and 1 only). Subsequently, the universe X and the empty set 0 are identified with the constant functions mapping each ? e X to 1 and 0, respectively. Given a fuzzy subset A of a universe X we can associate with it several crisp subsets of X, among them the kernel ker(A) and the support supp(A) defined by, respectively, ker(A) = {x eX: A(x) = l}, B) supp(A) = {xeX: A(x)>0}. C) Let ? be a ?-norm and S its corresponding r-conorm. In order to define the intersection and the union of two fuzzy subsets A and В of a universe X, we extend the binary operations ? and S on [0, 1] to [0, \]x pointwise, i.e., (ATS)U) = T(A(x), B(x)), (ASB)(x) = S(A(x), B(x)). Finite and countable intersection and union are introduced in a straightforward way. Note that the duality of ? and S implies the validity of the De Morgan laws, i.e., for a sequence (A„),i6N of fuzzy subsets of X we have
954 D. Butnariu and E.P. Klement @0 \ OO / OO \ ОС ? a„ =SCa,„ С Sa„) = TCa„, n=\ / n=\ \n=\ / n=\ where the complement CA of a fuzzy subset A of X is defined by CA(x) = 1 - A(x). Restricted to crisp sets, these operations coincide with intersection, union and complement, respectively, regardless which t-norm and t-conorm are considered. The class [0, 1 ]x of all fuzzy subsets of X, together with either one of the operations ? and S, forms a partially ordered commutative semigroup having 0 as smallest element and X as largest element. However, [0, l]x provided with the operations T, S and the complement С is not a Boolean algebra. In general, ? and S are not distributive with respect to each other, and aTCa may be different from 0 and A S С A may be different from X. In order to classify ?-measures and to distinguish between ?-measures and other kinds of valuations which can be defined on ?-tribes (see, for instance, [71]), we also need a concept of disjointness. There are different approaches to disjointness (mostly in the context of fuzzy partitions) [22,25-27,37-39,42,51,63,70,76]. Our concept is based on Butnariu [11] (see also [17]). Dehnition 2.4. Let X be a nonempty set, Г a t-norm and S its corresponding t-conorm. (i) A finite family (? ?, ??,..., A„) of fuzzy sets in [0, 1 ]x is said to be ?-disjoint if for each к e {1,...,«} (TD) (S АЛТA* = 0. (ii) A countable family (?„)?6? <? [?, \]x is called ?-disjoint if, for each ? e N, the finite family (? ?,..., A„) is ?-disjoint. (iii) A sequence (A„)„6n ? [0, l]x is called a T-partition of a fuzzy set A if it is ?-disjoint and if OO (TP) A= S A„. Each subfamily (A„)n6/ of an at most countable ?-disjoint family (An)nej is also ?-disjoint: it suffices to show this for finite index sets J and /?. Indeed, for each / e / we get ( S ?„)??,<;( S а„)та,=0. vne/\li] ' ??|?] ' Also, the definition of the ?-disjointness does not depend on the order in which the fuzzy sets (A„)n6pj are numbered, i.e., if ? is a permutation of N and the countable family (An)n6pj is ?-disjoint, then the permuted family (A,t(„))„6n is ?-disjoint too. Different t-norms may lead to different concepts of disjointness. However, for some classes of t-norms the corresponding disjointness concepts do not depend on the choice of the t-norm in that class.
Triangular norm-based measures 955 Example 2.5. Let (А„)„ещ be a countable family of fuzzy subsets of X. (i) If all A„ are crisp subsets of X, then for each t-norm ? the ?-disjointness is equivalent to the pairwise disjointness of the crisp sets, (ii) For each t-norm ?, the ?-disjointness implies pairwise ?-disjointness, but the converse is not generally true: if we take the family of constant fuzzy sets (A |, A2, A3) given by А|(дг)= A2(x) = Аз(дг) = 0.5, then A\, A2, A3 are pairwise TL-disjoint, but the family (A\, A2, A3) is not TL-disjoint. (iii) For ? e [0, oof we get: (А„)пещ is ??-disjoint if and only if each elements e X is "contained" in at most one Ak (i.e., if and only if Ак(х) > 0 for at most one к е N). (iv) The ??-disjointness of (A„)„6pj means that for each ? e X exactly one of the following conditions holds: (a) there is exactly one к e N such that А* (дг) = 1, and A„ (x) = 0 for all ? ? к; (b) there are at most two indices k, I e N such that {А*(дг), А/(х)} С ]0, 1[, and we have A„(x) = 0 for ? <?{k,l]. The following results (see Lemma 4.4 and Corollary 4.5 in [17]) emphasize the crucial role of the functional equation A) with respect to disjointness. LEMMA 2.6. Let ? be a t-norm and S be its corresponding t-conorm such that the functional equation A) holds, and let (A1, A2- ¦¦, A„) be a finite family of fuzzy subsets of X. Then for each ? ^ 2 the following are equivalent: (i) the family (A\, A2,..., A„) is ?-disjoint; (ii) for all к e {2,..., n] we have (Sf=i' A,) ? Ak = 0; (iii) for all к e [2,.. .,n) we have Si=\ A, = ?/=? ^ь (iv) for each subset I с {1,2,...,«} containing at least ? - 1 elements we have biel Ai = ?/6/ A-i- Corollary 2.7. Let ? be a t-norm and S its corresponding t-conorm such that the functional equation (I) holds, and let ( A„)„6n be a countable family of fuzzy subsets ofX. Then the following are equivalent: (i) the family (А„)„ещ is T-disjoint; (ii) for all k^lwe have (S*=|' A/) T Ak = 0; (iii) for all k^lwe have Si-\ A, = ?/=? ^'; (iv) for each finite subset I с N we have Si6/ A, = ?]i6/ A,-. As an immediate consequence we have that an at most countable family of fuzzy sets (A„)„eJ is rL-disjoint if and only if J2nej A„ ^ 1. From Lemma 2.6 and Corollary 2.7 we know that if Г and its corresponding t-conorm 5" satisfy A) and if (A„)neJ is ?-disjoint, then necessarily J2„eJ A" ^ '· However, the converse is not generally true (see Example 2.5 (iii)). Also, the requirement that Г and S satisfy A) cannot be dropped in Lemma 2.6 and in Corollary 2.7. For instance, for the t-norm To and its corresponding t-conorm So the conditions (i), (ii) and (iii) are no longer equivalent.
956 D. Butnariu and E.P. Klement 3. ?-tribes We now study certain subclasses of [0, \]x, the class of all fuzzy subsets of a (crisp) universe of discourse X, which are closed under intersections and unions based on t- norms and t-conorms, respectively. In many situations it is also desirable to have also a kind of ?-completeness. The following concept of ?-tribes was presented in [17], it basically was introduced by Klement [46] for arbitrary t-norms ?, in [44] for ? = Гм and by Butnariu [11] for ? = TL (see also [23]). For all lattice-theoretical notions we usually refer to Birkhoff [8], in the context of algebras and ?-algebras of crisp sets to Halmos [35] and Bauer [5]. Dehnition 3.1. Let ? be a t-norm. (i) A subclass С of [0, l]x is called a T-clan on X if the following properties are satisfied: (TCI) 0eC; (TC2) for all A e С we have CA e C; (TC3) for all A, fie С we have AT ? eC. (ii) A ?-clan ? с [0, l]x is called a T-tribe if DC (TT) for all sequences (A„)„6n m ? we have ? A„ e T. Trivially, each algebra of crisp subsets of X [35,5] is a ?-clan with respect to an arbitrary t-norm ?. Moreover, a subclass Д of ? (X), i.e., a class of crisp subsets of X, is a ? -algebra if and only if it is a ?-tribe with respect to an arbitrary t-norm ?. Each Гм-clan is a De Morgan algebra [41], and each Гь-clan is a semi-simple MV- algebra as introduced by Chang [18] (compare also [6,7,24,41]). Also, each Гм-tribe is a ?-complete De Morgan algebra, and each Tl-tribe is both a ?-complete semi-simple MV-algebra and a clan in the sense of Wyler [78]. Obviously, not each ?-clan is a ?-tribe (a rather simple counterexample is given by the TL-clan {A: for all ? e X: A(x) e Q) which is not a TL-tribe). Example 3.2. (i) Define for each ? e N and к e {1,2 ? - 1} the fuzzy subset A*/„ of X by Ak/n(x) = k/n. Then the class T„ = {0, A\/n, Аг/„ ?(„_?,/„,?} is both a Гм-clan and а Гь-с1ап, but it is never a ?-clan when Г is a strict t-norm. (ii) Given a Borel measurable t-norm Г and a ? -algebra ? of crisp subsets of X, then the class Лл of all fuzzy subsets of X with Д-measurable membership functions is a ?-tribe. Note that Ал was called a generated (T-)tribe in [17,44,46], and that it was used in [80] as the class of fuzzy events, (iii) Given a ?-tribe T, the set Tv of all crisp sets in ? is a ?-algebra, and hence a ?-tribe.
Triangular norm-based measures 957 (iv) The class ? consisting of all fuzzy subsets of X, whose membership functions are either constant or have all their values in the interval [1/3,2/3], is both a Го-tribe and а Гм-tribe, but no ?-tribe whenever Г is a continuous Archimedean t-norm [46]. (v) If in (iv) we additionally require that all the membership functions of the elements of ? be continuous, then ? is a Го-tribe, but no longer a Гм-tribe [46]. (vi) Consider a crisp subset ? of X. Given a ? -algebra A of crisp subsets of X, the class Ay of all fuzzy subsets of X, whose membership functions are Д-measurable and assume on ? only the values 0 and 1, is a ?-tribe with respect to an arbitrary Borel measurable t-norm ?. Note however that, up to the case ? = X, Ay does not contain any fuzzy subset of X with a nontrivial constant membership function (compare Theorem 3.3(iii)). The tribe Ay was called a semigenerated (T-)tribe by Mesiar [60]. The following result for tribes based on Frank t-norms (see Propositions 2.7 and 3.3 and Theorem 3.2 in [17]) demonstrates the special role of the Frank t-norms in this context. The crucial argument is, of course, that the Frank t-norms and their dual t-conorms are solutions of the Frank functional equation A). THEOREM 3.3. For all ? e ]0, oo] the following assertions hold: (i) Each ??-tribe is a Tj^-tribe (and, hence, ? ?-complete De Morgan algebra) as well as a T^-tribe {and, hence, ? ?-complete semi-simple MV-algebra). (ii) For each ??-tribe ? we have ? с (??)?, i.e., each element in ? has a Tv- measurable membership function. (iii) A ??-tribe ? is generated if and only if it contains all fuzzy subsets of X which have a constant membership function. The interesting problem to characterize all subclasses of [0, \]x which are ?-tribes with respect to an arbitrary Borel measurable t-norm ? was solved by Navara [64] (compare also Mesiar [60,61] and Mesiar and Navara [62]). The result turns out to be a generalization of the fact that each class of crisp subsets of X is a ? -algebra if and only if it is a ?-tribe with respect to an arbitrary t-norm ?, and of Theorem 3.3(i) (for the definition of supp(A) and ker(A) see C) and B)). THEOREM 3.4. Let X be a nonempty set and ? a subclass of [0, 1 ]x. Then the following are equivalent: (i) ? is a T-tribe for all Borel measurable t-norms ?; (ii) there is ? ? e ]0, oof such that ? is a ??-tribe; (iii) ? is a weakly generated tribe, i.e., there is ? ? -ideal V ofTv such that T= (??(??)?: supp(A) \ker(A) e V). This means that a ??-tribe for ? e]0, oof on X is not only a ? -complete lattice (in particular, it is closed under Гм) and a ?-complete semi-simple MV-algebra (in particular, it is closed under Tl), but that it is also closed under each Borel measurable
958 D. Butnariu and E.P. Klement t-norm. Observe that this is not true for 7l-tribes: for example, the Гь-clan mentioned in Example 3.2(i) is a TL-tribe, but no ?-tribe if ? is a strict t-norm (see also Example 3.6 below). We already mentioned that each TL-tribe is a ?-complete semi-simple MV-algebra. In fact, each semi-simple MV-algebra is isomorphic to some Гь-с1ап (see Belluce [6,7], a complete characterization of all MV-algebras, involving nonstandard real numbers, can be found in Di Nola [24]). Another characterization of ^-tribes was given by Klement and Navara [49]. Both results are summarized below. THEOREM 3.5. Let ? be a subclass o/[0, \}x. Then the following are equivalent: (i) ? is a T^-tribe; (ii) ? is ? ?-complete semi-simple MV-algebra; (iii) there is ? ? -algebra ? of crisp subsets of X and a sequence (?n)neN of ?-filters in ?, with ?m с ?n whenever ? is a divisor of m, such that Т={АеЛл: {? e X: п-А(х)еЩ e?„forallneN}. Combining all these results, it is possible to give, as a simple example, a rather complete picture of all possible tribes with respect to Frank t-norms, to arbitrary strict t-norms, and to the weakest t-norm To, respectively. Example 3.6. Let {xo} be a singleton. Then the only generated tribe on {x0}, let us call it %o, is just isomorphic to the unit interval [0, 1]. There are two semigenerated tribes, namely, %o and 7j (which is isomorphic to {0, 1}). Since {xo) is finite, these are also the only weakly generated tribes. Due to Theorem 3.4 each ??-tribe with ? e ]0, oof equals either Too or T\, the same being true for each strict t-norm. The only 7l-tribes on {xq} are Та and T„ with ? e N (as mentioned in Example 3.2(i)). Each Гм-tribe on {xq} is isomorphic to some closed subset of [0, 1] which contains 0 and which is symmetric with respect to the midpoint 0.5, and each Го-tribe on {xq} is isomorphic to some arbitrary subset of [0, 1] which contains 0 and which is symmetric with respect to 0.5. 4. ?-measures and their representation by Markov kernels Dehnition 4.1. Let ? be a t-norm, S its corresponding t-conorm, let ? be a ?-clan and let m: ? —> [—oo, oo] be a function which assumes at most one of the values — oo and oo. (i) The function m is called a T-valuation on Tif it satisfies the following conditions: (TM1) m@) = O. (TM2) m(ATB) + m(ASB) = m(A)+m(B). (ii) The function m is said to be finitely ?-additive if it satisfies (TM1) and (TM3) m(ASS) = m(A) + m(S) whenever ??? = 0.
Triangular norm-based measures 959 (iii) The function m is said to be monotone if (TM4) m(A)<m(B) whenever ? ? ?. Remark 4.2. (i) If m:T —>¦ [-00, 00] is a ?-valuation on the ?-clan T, then m is also finitely Г-additive, the converse not being generally true: if, for instance, ? consists of all the constant functions in [0, \]x and if ? e [0, oof, then because of the absence of any nontrivial ??-disjoint elements in the 7\-clan T, each function m:T —>· [-00,00] which satisfies (TM1) is finitely TA-additive without being necessarily a ??-valuation. This shows that ?-valuations are particular additive functions in the sense of Schmidt [71, p. 558] and, consequently, if they are finite, they can be represented as differences of monotone Г-additive functions (compare [71, Theorem 2.2]). However, it is not possible to conclude directly that the ?-valuations have Jordan decompositions, (ii) If ? is a Гь-clan and m is a finite T^-additive function on T, then m is also a Tl-valuation, (iii) If ? is an algebra of crisp subsets of X and if ?* is some t-norm, then the finite r*-additive functions are ?-valuations for each t-norm ?. ?????????4.3. (i) Let ? be a ?-clan. A function m: ? —> [—00, 00] is called a T-measure if it is a ?-valuation and if it is continuous from the left in the following sense: (TM5) lim m(A,,) = m(A) whenever n—»зо (A„)„6n с ?, (?„)„6? / A and A e Т. (ii) If a ?-measure m on ? assumes only values in [0,1 ]and if m(X) = 1, we shall call m г. probability ?-measure. (iii) Let ? be a Г-tribe. The function m: ? —> [—oo, 00] is said to be countably ?-additive if it satisfies (TM1) and if for each sequence (A„)„6n с ? we have @0 \ °° S A„ I = ]Рт(А„) whenever (А„)„6к is Г-disjoint. It is clear that countably Г-additive functions are always finitely Г-additive. Moreover, Г-measures on Г-tribes are countably Г-additive since if we take a Г-disjoint sequence (Ai)neN in the Г-tribe ? and define Bn = S"=i An, then (S„)„6n is a sequence in ? which converges nondecreasingly to S ^=1 A„ and, therefore, m( S ?=??( lim B„) = lim m(Bn) = lim Ут(А„) = ) m(A„). \ „_| / VH^DC ' ;l-»30 H-»30 '—' '—'
960 D. Butnariu and E.P. Klement Remark 4.4. In general, for arbitrary t-norms ? and ?-tribes T, the countable ?- additivity does not imply the continuity from the left condition (TM5), and thus countably ?-additive functions are not necessarily ?-measures. For example, if ? e [0, oof and ? = [0, l]x, then for each fixed xoe X the function m: ? -> [-oo, oo] defined by [ 0 otherwise, is countably ??-additive, but it is not a Ti-measure since it does not satisfy (TM5). However, it will be shown later (see Proposition 8.1) that countably 7l-additive functions are 7l-measures, that is, the countable TL-additivity and the continuity from the left are equivalent in this case. The same is true when ? is an arbitrary t-norm and ? consists of crisp subsets of X only. If ? is a ?-algebra of crisp subsets of X, then each (?-additive) measure on ? is a Г-measure with respect to any t-norm ?. In this case, the countable ?-additivity and the continuity from the left are equivalent too. The following result will turn out to be useful when we represent ?? -measures by Markov kernels. LEMMA 4.5. If ? is a t-norm and if ? is both a T-clan and a Тм-clan, then each T- valuation on ? is also a Тм-valuation on T. PROOF. For А, В e ? we get m(ATMS) +m(ASMS) = m((ATMS)T(ASMS))+m((ATMS)S(ASMS)) = m(A ? ?) + m(A S ?) = m(A) + m(fi), showing that m is indeed a Гм-valuation. ? Remark 4.6. (i) Lemma 4.5 shows that if m is a Г-measure on ? (? being both а Г- and а Гм- clan), then m is also a Гм-measure. (ii) Taking into account Theorem 3.3(i), each m is a ??-measure on a ??-tribe ? with ? e ]0, oo] is also a Гм-measure. (iii) The converse of Lemma 4.5 does not generally hold: for instance, let ? be the family of all Borel measurable fuzzy sets on X = [0, 1]. Then the function m:T —» [—oo, oo] defined by m(A)= / A+A(jt))d* J\A>0} is a Гм-valuation (even a Гм-measure) but not a T^-valuation.
Triangular norm-based measures 961 The Гм-measures play a fundamental role in the following exposition. In order to give an integral representation for them we need the notion of Markov kernels (for more details see, e.g., Bauer [5]). Dehnition 4.7. Let (X, A) be a measurable space, В be the family of all Borel subsets of [0, 1], and B\ = В П [0, 1[. A function К : ? ? ?\ -> R is called an ?-Markov kernel if it satisfies the following conditions: (i) for each ? eX, the function K(x,):B\ -> R defined by K(x,-)(B) = K(x,B) is a probability measure on B\; (ii) for each ? e B\, the function K(-, B) :X -> R defined by ?(·, B){x) = K(x, B) is Д-measurable. From Example 3.2(ii) we know that if (X,A) is a measurable space, then the family Ал of all Д-measurable functions from X to [0, 1] is a Гм-tribe. The next result, from Klement [43], shows that Гм-measures on Ал can be represented as integrals of Markov kernels in the following way. THEOREM 4.8. Let A be ? ?-algebra of crisp subsets of X and Ал the family of all ?-measurable functions in [0, \]x. Ifm is a finite monotone ?^/?-measure on Ал, then there exists a unique measure m on A, namely, the restriction ofm to A, and an m-almost everywhere uniquely determined ?-Markov kernel К : ? ? ?\ —> ?. such that for all A e A m(A) = / K(x, [0, A(x)[)dm(x). D) PROOF. The proof is carried out in several steps. First, observe that if D) holds for some measure m on A, then m has to be necessarily the restriction of m to A since for ? e A equality D) gives m i(M)= / K(x,[0, l[)dm(jt) = m(M). E) Without loss of generality we may restrict our proof to the case when m(X) = 1. CLAIM 1. For each rational number a e [0, 1 ], the function m„ : A -> [0, 1 ], defined by ma(M) = m(a-M), F) is a nonnegative finite measure on A, and it satisfies m„ ^ m. Obviously, m„@) — 0 and m„ is finite and monotone because of the corresponding properties of m. To prove ?-additivity of m„, let (A„)„6pj be a sequence of mutually disjoint sets in A. Then m« ( [J A„ ) = mi sup I a · ? [J A, \ I: ? e N
962 D. Butnariu and E.P. Klement = sup mi a · I [J A, sup ^т(а.Д,·) ОС ]Гт(а· А„). neN ? eN n=l The last equality follows from the monotonicity of m. Therefore, Claim 1 is proved. Claim 1 implies that each measure m„ defined by F) is m-absolutely continuous. Hence, for each a e [0,1] П Q there exists a Radon-Nikodym derivative fa = dm« dm Since the Radon-Nikodym derivative of an m-absolutely continuous measure is m-a.e. unique, it follows that for all a e ]0, 1 [ /o = 0, m-a.e., /? = 1, m-a.e., fa =sup{ft: /3eQn[0,a[}, m-a.e. G) (8) (9) Therefore, changing (if necessary) the values of the functions fa on an m-null set, we may assume that G) holds everywhere on X, and we still have ma(M)= / /„dm. Jm For an arbitrary (not necessarily rational) a e [0, 1], define ga = sup{fp: /3eQn[0,<*[}. A0) A1) Since ga is the supremum of a countable family of Д-measurable functions, it is also A- measurable, and for each ? e A we again have Ша(М) = I ga Jm dm A2) because of G) and the Monotone Convergence Theorem. Furthermore, for each ? e X the function hx : [0,1] -> R defined by hx(a)=ga(x) A3)
Triangular norm-based measures 963 is a probability distribution function which determines a unique probability measure qv on ([0,1[,#|) which, whenever0 ^ a < ? ^ 1, satisfies 4*([?,0[)=?0)-??). Claim 2. The function ? : ? ? ?, -> R <te/wi<;d fey K(*,B) = qu(B) A4) и an ?-Markov kernel, i.e., K(x,-) is a probability measure on B\ for each ? e X and K(-, B) is A-measurable for each В &B\. The first statement of the claim is trivial. The second one will be proved now. Consider the set V = {С е B\: K(-,C) is Д-measurable}. This is a Dynkin system (see, e.g., [5]) on [0, 1 [. Indeed, [0,1 [ e V since for each ? e X K(*,[0, l[)=q,([0, l[) = l. Also, if C, D e V and С с D, then for each jc e X K(x,D\C) = q.r@) - q.v(Q = K(x, D) - *(*, С), and this shows that K(x, D\ C) is Д-measurable too. Finally, if (C,,)„6n is a sequence of pairwise disjoint sets in V, then we have for each ? e X (OO \ / OO \ OO 00 *. U c")=чЛ ^. U c")=Sit(Cn)=? ^^' c"}' n = l / V n = l / ;?=? ;?=? and, subsequently, ?(-,U,^i C„) is also measurable. Thus, ? is a Dynkin system on [0, 1 [. Now, observe that for all ?, ? e [0, 1 ] with a < ? and for each ? e X v/c have and this implies that ?(·,[?,?[) is Д-measurable, i.e., [a,fi[e V. Thus the Dynkin system V contains all intervals [?, ?[ с [0, 1[ which means V = B\, and so Claim 2 is proved. Now we have to show that the Л-Markov kernel К defined in A4) indeed satisfies D). To this end, let A be an Д-measurable step function A = ]T"=| <*; · A. where all a,- e [0,1] and the sets A \, A2, · · ·, A„ form a partition of X. Then ш(А) = У]т(а,-А,) = У" / gaf dm = Y] / K(*,[0,a/[)dm(*) = ?]? ^(•i,[0,a,[)A,(;c)dm(jc)= f К(дг, [0, A(x)[)dm(x)-
964 D. Butnariu and E.P. Klement If A is an Д-measurable fuzzy set and (?„)„6? 1S a nondecreasing sequence of A- measurable step functions converging pointwise to A, then D) holds because of m(A) = sup{m(A„): ? e N} = sup|/ K(x,[0, A„(x)[)dm(x): n e N = / K(x,[0,A(x)[)dm(x). Jx Finally, we have to show the uniqueness of K. Since for Д-measurable functions «,v:X->Rwe can conclude и = ? m-a.e. if for all ? e ? we have / и dm = / ? dm, Jm Jm it follows that К is m-a.e. unique in the sense that any others-Markov kernel satisfying D) may differ from К on an m-null subset of X only. ? Taking into account Theorem 3.3(i), Proposition 4.5 and Remark 4.6(ii) we can restate Theorem 4.8 in the following form for measures based on Frank t-norms. COROLLARY 4.9. IfTi is a Frank t-norm with ? e [0, oo], ? is a generated ??-tribe and m is a finite monotone ??-measure on T, then there exists a unique measure m on Tv, namely, the restriction o/m to Tv, and an m-a.e. uniquely determined Tv-Markov kernel К : ? ? B\ -> ? such that the representation D) holds for all A e T. 5. Integral representation of 7l-measures Theorem 4.8 and Corollary 4.9 show that monotone finite measures based on Frank t-norms ?? and defined on generated ??-tribes can be represented as integrals of Markov kernels. It is clear that this holds for ? = T^ too. However, in this particular case the result is true even if the underlying TL-tribe ? is not generated. This is based on the following properties proved by Butnariu [14], which are specific for TL-measures. PROPOSITION 5.1. If ? is a T^-tribe and ifm is a finite T^-measure on T, then we have: (i) m и monotone if and only ifm is nonnegative; (ii) г/m is nonnegative, then m is continuous in the sense that lim m(A,,) = m(A) whenever (А„)„ец С.Т and lim A„ = A. A5) n—*oo ii—>oc
Triangular norm-based measures 965 PROOF. In (i) the "only if" part is trivial. Assume that m is nonnegative, А, В e ? and A sC B. Then В = A SL(B TL С A) and A JL(B TL С A) = 0. Thus m(B) = m(ASL(STLCA))+m(ATL(BTLCA))=m(A) + m(STLCA), i.e., m(B)^m(A). In order to prove (ii), let (A„)„6pj be a nonincreasing sequence in ? converging pointwise to A. Then lim m(A„) = m(X)- lim m(CA„) = m(X) - m(CA) = m(A) because of the continuity from the left of m and since for all С е T,C and CC are TL-disjoint and satisfy CS|_Cc = X. Now, if (A„)„6n is a (not necessarily monotone) sequence in ? converging to A, then ОО/СЮ \ ОС / ОС \ Tm I Sm A„ I = A = Sm I Tm A„ I. k = \ \ n=k ) /t = l V n=-k / and ? is a Го-tribe (compare Theorem 3.3(i)). Therefore, we have mi Tm Ak\ ^ m(A„) ^ ml Sm Ak \ and, taking here the limit ? -> oo, we obtain m(A) = lim,,^^ m(A„). ? The following representation of TL-measures is essentially Theorem 2.6(c) of Butnariu [15]. We present it here with an alternative proof (see Butnariu and Klement [17, Theorem 6.2]) THEOREM 5.2. If ? is a T^-tribe and m is a finite nonnegative T^-measure on it, then there exists a unique measure m on Tv, namely, the restriction ofm to Tv, such that for each A in ? m(A)= / A(jc)dmU). A6) PROOF. It is clear that E.2) implies that m has to be the restriction of m to Tv. Claim 1. // A e ?, ?, ? e [0, 1] and a < ?, then the set Aa.p = {xeX \a< A(x) ^ ?} A7) belongs to Tv, and the product A ¦ ??_? = AT|_ ??.? e T, and ?-??(??.?)^ m(A· Aa_p). A8)
966 D. Butnariu and E.P. Klement The first assertion follows from Theorem 3.3(H). The second results from the fact that for each ? e Tv we have A ¦ ? = A T|_ ? e T. Now, in order to prove A8), it is sufficient to show that this inequality holds for all A in ? and for all ?, ? e [0, 1], such that a < ? and a has a finite binary representation k 2' i = l with ieN and a,- e {0, 1} for each i e {1,..., к]. Suppose that A8) holds whenever ? has a finite binary representation. For all 0 < a < ? ^ 1 we can find a sequence (a„)„6pj which is nonnegative, nonincreasing and converges to a, and such that each an has a finite binary representation as above. Using the continuity of m (see Proposition 5.1 (ii)) we get a ¦ m(Aa a) = lim a„ · m(A„„ «) < lim m(A · A„ «) = m(A · ??_?), since (A„n,/j) gN \ A„,/j. Now assume that a has the form as above. If a = 0 there is nothing to prove. Suppose a > 0. In this situation we proceed by induction upon the number к in the binary representation of a. If к = 1, then a = 0 or a = \. In the first case A8) clearly holds. In the second case we have ??_? = (A · A„./j)Sl(A · A„,/j), and this implies m(Aa./j) = m(A„./0 = m(A · ??_?) + m(A · ??.?) - m[(A · A„./j)TL(A · ??_?)] ^ 2 · m(A · ??.?), which is exactly A8) with a = \. Suppose that A8) holds for all A e ? and for all ?, ? in [0, 1] such that a < ? and a has a binary representation with at most к binary digits. Put k + \ 2' with a, e {0,1} for each ? e {1,..., m + 1} and suppose that a*+| ^ 0. Then a = 8/2 where k + \ ;=i Case 1.1. 5 < 1 and ? < \. Then Aa.p = (AS|_ А),5.2/з, and using the inductive assumption for the set (ASlA)^,2/j (this is possible since the sum in A9) has at most к nonzero terms in our case) we get S · m(Aa,/j) ^ m[(AS|_ A) · ??,?]. Observing that
Triangular norm-based measures 967 (A SL A) · A„,/j = (A · A„,/j) SL(A · A„./j), we obtain 5 · m(Aa.p) sC m[(A SL A) · ??,?] ^m[(A· A„./j)SL(A· A„./j)] = 2 · m(A · A„./j) - m[(A · A„./j)TL(A · Aff./j)] < 2 · m(A · ???) by the monotonicity and additivity of m (see Proposition 5.1(i)). This implies A8) in this case. Case 1.2. S < 1 and ? = +. Consider a sequence (y„)neN in ]<*, ?[ which converges increasingly to ?. The sets A„, Yn satisfy the conditions of Case 1.1. Thus we have for each a-m(Aa.Yn) ^m(A-Aa.Y„). B0) The continuity of m (compare Proposition 5.1(H)) and the convergence (Aa-Yil)„e^ / Sm,^i AcYn imply ?·??( Sm Aa.y„ J ^m n=\ A ¦ ( ом Aa,y„ ;?=? B1) where ( U A«-r„ ) U {* e X: А(дс) = ?} = ???. Since the sets in the latter union are disjoint, we may write a ¦ m(Aa.p) = a ¦ ml [J Aa,Y„ J +a ¦ m({A = ?)) ? A-^Sm AaA + <*-m({A = /*}). С m B2) Since for ? = J the fuzzy sets B\ and Bz defined by Bi(x) = B2(x) = 0 ???(?)??, ? ???(?) = ?, are Tl-disjoint and are contained in T, we have m(S| S|_S2) = 2 · m(/3 · {A = /3}) (compare Remark 4.2(i)) and S| Sl St = ? = /3}. Hence, m {?-{? = ?))=?- m({A = /3}) > <* · m({A = ?\).
968 D. Butnariu and E.P. Klement Combining this with B2) and using the TL-additivity of m we deduce ? -?\(??_?) ^ m m A-(Sm AaA + ??(?-[? = ?}) А-({А = Р)и(?лАаЛ\ = m(A· ??.?). This proves A8) in this case. Case 1.3. S < 1 and/3 > ±. Then A · ??.? = {A-Aa.|/2)SL(A · A|/2./j), where the fuzzy sets on the right hand side are 7l -disjoint. Hence, according to Remark 4.2(i) we have m(A · ??_?) = m(A · AaA/2) + m(A · ?\/2.?). The first term on the right-hand side satisfies the conditions of Case 1.2 and the second one meets the inductive assumption. Hence, m(A· А„./з) >а-т(А|/2./0+ ? · ??(?\/2.?) > <* ¦ [т(А„.|/2) + ??(?\/2.?)] = a ¦ т(А„./з) showing that A8) holds in this case too. Case 1.4. S > 1. In this case a can be written as a = (? + l)/2 where ??'+? гг. ?? -?- e [0, 1]. ?=? We also have A„,/j = [ATL A]^2p with ? = ? - 1/2. Using the inductive assumption for the set [A T|_ A] - we obtain 1 ? ? ? a ¦ т(А„./з) = - · m(A„.^) + - · m(Aa.p) = --m(Aa.fl)+?--m([ATLA]?2p) ^-•m(Aa./j) + --m([ATLA]?2p) = - · &(??.?) + ? · т(Ас.р ¦ [ATL A]e 2p). B3)
Triangular norm-based measures 969 Observe that Aa.p · (ATLA) = DJlCC with D = ??.? and С = ??.? ¦ (CaSlCA). According to Remark 4.2(i), it follows that whenever E, F e ? and ? ^ F then m(?TLCF)=m(?)-m(F). B4) Since we clearly have D > C, B3) combined with B4) gives a ¦ m(Aa.fl) < 3 · m(Aa.p) + 5 · [m(O) - m(C)] = m(A„./)) - \ ¦ m(C). B5) Now, taking into account <5 > 1, we deduce C = (CA-A0./i)Sl(CA-A0./») and (CA-Ao./j)Tl(CA-Ao./j)=0. Thus m(C) = 2 · m(CA · A„./j) by the TL-additivity of m. Since CA · ??? = ??.? TL C(A · A„./j), we get m(C) = 2 · [т(А„./з) - m(A · A„./j)] because of B4). Substituting this into B5) we obtain A8), and Claim 1 is completely proved. Claim 2. If A e T, 0 < a < ? ^ 1 and Ao.fl = {xeX\ot <?(?)<?) B6) then ??,? e Tv, A · Aa^ e ? a/id m(A-Aa.p)^fi-m(Aa_p). B7) The first two assertions are obvious. We only have to prove B7). To this end, observe that А0./, = СЛ>.0. B8) where a' = 1 - a and /3' = 1 - ?. Then m[C(A · ??,?)] = m(X) - m(A · A„./j) = т(А„./з) + т(СА„./з) - m(A · ??_?) = m(CA · A„./j) + m(CA„./j) = m(CA · СА/з-,?') + m(CA„./j) >/3'· m(X) + т(САм) = т(Х)-/3-т(А„^), where the forth equality is a consequence of B8), and the inequality follows from Claim 1. This implies B7), and Claim 2 is proved.
970 D, Butnariu and E.R Klement In order to complete the proof, choose A e T. For a given ? e N denote {xeX: A(jt) = 0} if i = 0, "¦' ' A(,--i)/2»,,72» iff e {1,2, ...,2я}, _ ??)/2-.(,4-?)/2- if/e{0, l,...,2"-l}, {jceX: A(;t) = 1} iff = 2". From Claims 1 and 2 it follows that the step functions 2" . 2" . i=l ?'=? are all Tv-measurable. It is clear that (s„) / A and (t„) \ A. Taking into account A8) and B7) we deduce у ? 1 ~ / s„ dm = 22 —^- ¦ m(G„.i) < 2jm(A · G,,.,) = ? 2" ; — ? .—. :m(A) i = l " i=0 and 111 "}/t ? ?„ dm = J2 ^ ¦ MH„j) > j^miA ¦ HnJ) = m(A). ? = ? ?=0 Taking the limit ? -» oo in the last two inequalities we obtain A6), completing the proof of the theorem. ? Comparing Theorem 5.2 with Corollary 4.9 it is easily seen that the 7l-measures on a generated ?? -tribe with ?? a Frank t-norm are exactly those ?? -measures for which the corresponding Markov kernel K, in the representation D), for each ? e X can be written in the form ?(?,[?,?[)=?-??. This suggests that the concept of Lebesgue integral could be axiomatically defined as a 7l -measure. 6. Integral representation of monotone ?? -measures Theorem 5.2 not only drops the condition that ? be generated in Corollary 4.9 in the case ? = 7l, but also specifies the form of the Markov kernel occurring in the representation D) in this particular case. For Frank t-norms ?? with ? e ]0, oof and generated 7\-tribes it is also possible to specify the form of the Markov kernel in D). To be precise, we get the following result of Klement [45].
Triangular norm-based measures 971 THEOREM 6.1. If T\ is a Frank t-norm with ? e ]0, oof, ? is a generated ??-tribe and m is a monotone finite ??-measure on T, then there exists a unique measure m on Tv, namely, the restriction of m to Tv, and an m-a.e. uniquely determined Tv-measurable function f: X -> [0, 1] such that for all A e ? m(A)= / [/ + (!_/). a] dm. B9) |A>0) '=/" [/ + (l-/)-A]dm ./|A>0) PROOF. First, observe that if B9) holds for some measure m on Tv and for some Tv- measurable function / on X, then, necessarily, m has to be the restriction of m to Tv. Now let us show that the required function / exists. To this end, consider the Tv-Markov kernel К corresponding to m in the representation D). For each a in [0, 1] define the function fa : X -> [0, 1 ] by fa(x)=K(x,[0,a[). C0) Each fa is Tv-measurable since К is a Tv-Markov kernel. We claim that there exists a function /: X -> [0, 1] such that for all a,be]0,\[ with a + b ^ 1, fa+b = fa + fb- f m-a.e. C1) In order to prove that, consider the sequences (a„)„6pj and (Ьп)„ещ defined as follows: a\ =a, b\ = b, a„+\ = S\(a„,b„), bn = Tx(an,b„). Since ? is generated, these sequences belong to ? (compare Theorem 3.3(iii)). Exactly as in the proof of Theorem 3.3(i) we show that the sequence (??)?6? IS nondecreasing, (bn)neN is nonincreasing and converges to zero, and for each ? e N we have a„ + b„ = a + b. Define the Tv -measurable function / = inf{//,„:«eN}. C2) For each ? in Tv, the fuzzy sets A„ = an · ? and Bn = b„ ¦ ? are contained in ? (compare Theorem 3.3(i)), and (An)neN / A| Sl B\ and (S„)„sN \ 0. Also, for each ? e N, we get A„ + S„ = (a + b) ¦ M. Induction upon ? yields that m(A„) + m{B„) = m(a ¦ M) + m(b ¦ M) for all ? e N. Consequently, we have / fa+b dm = m[(a + b)-M]= m(sup{ A„: ? e N}) = sup{m(A„): ? e N} = m(a ¦ M) + m(b ¦ M) - inf{m(B„): пеЩ = ? {fa + ?) dm - inf{m(B„): ? e N}
972 D. Butnariu and E.P. Klement by C0), D) and the continuity from the left of m. From C0), D) and the Monotone Convergence Theorem we also have f{m(S„): ? e N} = infl ? fbn dm: ? e N = ( inf{//,„:«eN}dm= f /dm. Jm Jm Hence, / fa+bdm = / (fa + fb- /)dm Jm Jm for each ? in Tv. This implies C1), and our claim is proved. Equation C1) is a Gaussian functional equation (see Aczel [1]) with the border conditions f\ = 1 and /o = /· The unique solution of such an equation has to be /, = / + A - /) · a m-a.e. for all a in ]0, 1 [. Taking into account C0) this gives K(x,[0,a[)=f + (\-f)a m-a.e. for a e [0, 1]. Inserting this into D), we get B9) and the proof is complete. ? Theorem 6.1 is proven for 7\ -measures on generated 7\-tribes when ? e]0, oof. Combining this result with Theorem 5.2 we get the following corollary. COROLLARY 6.2. IfTi is a Frank t-norm with ? e ]0, oo], ? a generated ??-tribe and m a monotone finite ??-measure on T, then there exists a unique finite measure m on Tv, namely, the restriction o/m to Tv, and an m-a.e. uniquely determined Ty-measurable function f : X -> [0, 1], such that for all AeT the equality B9) holds. It is obvious (compare Theorems 3.3(i) and 5.2) that Гь-measures on 7\-tribes with ? e ]0, oof are also 7\-measures (even if the ??-tribe is not generated). However, not each ??-measure is necessarily a TL-measure, even on generated ??-tribes. The following shows that a ??-measure on a generated 7\-tribe is a TL-measure (and hence an integral) only if it is continuous from the right at 0 (and then, clearly, at each A e T). COROLLARY 6.3. Let ? e ]0, oo[, ?? be the corresponding Frank t-norm and ? be ? ??- tribe. A monotone finite ??-measure топ ? is a ?^-measure on ? if and only if it satisfies the following condition: lim m(A,,)=0 whenever (Ап)пец С.Т and (A„)ne^\&. C3) > oo PROOF. From Proposition 5.1 we already know that TL-measures are continuous and, therefore, satisfy C3). Conversely, assume that m is a ??-measure on the generated 7\-tribe ? satisfying C3). Then m can be represented in the form B9) and for each ? in Tv the
Triangular norm-based measures 973 sequence (M„)neN = (^ · M)„6n is contained in T. Using the Monotone Convergence Theorem we deduce that for each ? e Tv 0= lim f Г/ + A-/).Л dm = ? /dm. "^¦^JmL n\ J M This implies that / = 0 m-a.e., and rewriting B9) in this case we obtain for A e ? m(A) = / Adm = / A dm J\A>0\ JX l[A>0) JX which immediately implies that m is a 7^ -measure. ? 7. Decomposition of monotone 7\-measures Theorem 6.1 shows that for ? e]0, oof finite monotone Гл-measures m which are defined on generated tribes ? differ from 7l-measures (i.e., from integrals according to Theorem 5.2) only by functions of the form At—> / /dm, which are themselves monotone finite 7\ -measures on T. The question we deal with in this section is how much a ??-measure defined on a nongenerated 7\-tribe differs from a 7l- measure (i.e., from an integral). The results are drawn mostly from Butnariu and Klement [16], some important completions of the proofs there are due to Mesiar [60] and Mesiar and Navara [62] (see also Section 8 in Butnariu and Klement [17]). PROPOSITION 7.1. Let ? e ]0, oo], ? be a ?-,.-tribe and m: ? -> ? a finite monotone ??- measure on T. Then there exists a unique pair (m^, m-A) of functions from ? to [0, oof such that: (i) mL is a T^-measure on T; (ii) niA и а ??-measure on T\ (iii) m = mL + m;u (iv) mL is the greatest ??^-measure in the sense that we have mL > ? whenever ?: ? —> [0, oof и a T^-measure such that m — ? is monotone; (v) There exists a unique measure m on Tv, namely, the restriction ofm to Tv, and an m-a.e. unique Tv -measurable function f: X -> [0, 1] such that for all A e ? mL(A)= / (l-/)Adm C4) Jx and for all ? e Tv ???(?)= /" /dm. C5)
974 D. Butnariu and E.P. Klement PROOF. Let M(m) be the set of all TL-measures p:T -» [0, oof, such that m - ? is monotone (and, therefore, a Ti -measure due to Remark 4.6(H) and Corollary 6.3). The set M(m) is nonempty since it contains the zero measure. M{m) is naturally equipped with the partial order ? ^ q if and only if p(A) sC q( A) for all AeT. C6) CLAIM 1. There exists a maxima! element in _M(m). If (Pj)jej is a chain in M(m), then the function ?: ? -н>- [0, oo] defined by p(A) = sup{py(A): j e j} C7) is a ^-valuation. Indeed, p@) = 0 clearly holds and for all А, В e ? we have p(ASLfi) + p(ATLfi) =sup{p,(ASLfi): j e J} +sup{p,(ATLB): jej] = sup{p7(ASLS)+py(ATLS): jeJ} = sup{py(A) + py(S): jeJ} = ?(?)+?(?) where the second and the fourth equality follow from the monotonicity of the chain (Pj(C))jej for each С e T. It is clear that ? is monotone. Hence, 0 ^ p(A) ^ m(A) ^ m(X) for all A&T, implying that ? is also finite. If (??)«6? is a nondecreasing sequence in T, then lim p(A„) = sup{p(A„): ? e Nl = sup{sup{py(A„): ;' e J}: ? e Nl = sup{sup{py(A„)·· ? e N}: j eJ}= supl^liir^p^A,,): j e J) = p( lim A„), showing that ? is continuous from the left. Hence, ? is a finite nonnegative TL-measure on T. Moreover, m — ? is also monotone since for all A, В e ? with ? ? ? it follows from pj e M(m) that (m - p)(A) = inf{(m - py)(A): j e J} < inf{(m - py)(S): jeJ] = (m-p)(S), thus showing that ? e M(m). In other words, each chain in M(m) has an upper bound in M(m) and, by Zorn's Lemma, M(m) contains a maximal element denoted mL. The 7l-measure mL is also a 7\-measure, and so is the difference ??? =m-mL. C8)
Triangular norm-based measures 975 Because of niL e M(m) we know that m^ is monotone and finite, and the pair (niL, m^) satisfies properties (i)-(iii). The following claim [60,62] shows that M(m) is a directed set, thus immediately implying that m^ (and subsequently ш;) are uniquely determined. Claim 2. For all qi, Ц2 e M(m) there exists a q e .M(m) smc/i ?/??? qi ^ q and q2 ^ q. Since qi, q2 are ^-measures, we may write for i e {1, 2} q,(A) = / Adq,, Jx where the finite ?-additive measure q, is the restriction of q, to Tv . Then qo = qi + q2 is also a finite-additive measure on Tv, and q, is absolutely continuous with respect to q for i e {1,2}. Then the Radon-Nikodym Theorem provides Tv-measurable functions h ?, /12: X -> R such that -l·" q,(L)= / /i,-dqo for all L e Tv and ? e {1,2}. Define the Tv-measurable function /i:X -> R by /i = max(/i ?, /12) and the finite ?-additive measure q on Tv by q(L)= / ftdqn. Then the function q: ? -+ [0, oof given by q(A)= /"Adq is a finite monotone TL-measure such that q, < q for i e {1,2}. The monotonicity of m - q can be seen as follows. Consider the crisp sets ? — {h = h \} and Q = [h > h \}. Obviously P, Q e Tv and ? U Q = X and ? ? ? = 0. Note that for any fuzzy set С е [0, \]x and for any crisp subset L с X we always have CT/ L = С · L. Therefore, using the fact that for fuzzy sets D, ? e ? with Z) < ? and ? < Q we always have q(D) = qi (D) and q(?) = q2(?), we obtain for all А, В e ? with A < S (m - q)(A) = (m - q)(ATA P) + (m - ?)(??? ?) = (??-4|)(????) + (??-42)(????) <(??-4|)(????)+(??-?2)(????) = (m - q)(STA P) + (m - q)(ST/ ?) = (m-q)(S), showing that m - q is monotone. This completes the proof of Claim 2 which together with Claim 1 implies that (mL, ш;) satisfies (iv) too.
976 D. Butnariu and E.P. /Clement In order to prove (v), we take m for the restriction of m to Tv. We have to find a Tv- measurable function /: X -> [0, 1] such that C4) and C5) hold. To this end, observe that ? is a TL-tribe by Theorem 3.3(i). Hence, according to Theorem 5.2, the ^-measure mL can be written as mL(A)= f AdmL C9) Jx for each A e ? where riiL is the restriction of mL to Tv. From mL ^ m it follows that riiL is absolutely continuous with respect to m. Because of riiL < m there is a Radon-Nikodym derivative d(mL)v dm which maps X into [0, 1]. Putting / = 1 - driii/dm, then / is also a Tv-measurable function with values in [0, 1], and C9) implies C4). Now, taking into account C8), we can write for all crisp sets L e Tv mA(L) = m(L)-mL(L) = m(L)- /*(l-/)dm= I /dm, and therefore, C5) also holds. Formulae C4), C5) and C8) imply that no other choice for m (and, subsequently, for /) is possible. ? Remark 7.2. (i) The component m^ of a TA-measure m, which exists due to Proposition 7.1, is a pure ??-measure in the sense that it has a zero TL-component: (т^)ь = О. In fact, assuming the contrary contradicts the maximality of mL in Proposition 7.1. (ii) If the ??-tribe ? is generated, then C5) somehow holds for each A e ? (and not only for crisp sets in the ? -algebra Tv). To be precise, we always have J\A>C ?»?(?)= / /dm. '|A>0) This follows from B9) in combination with C4) and C5), and the m-a.e. uniqueness of / in B9). Let ? e ]0, oof and ? be a T^-tribe. If ? is a ?-additive measure on the ?-algebra Tv and if g, h : X ->¦ [0, oo] are two Tv-measurable functions, then the function m: ? -> [0, oo] defined by m(A)= / (g+A-A)dp D0) J[A>0\ is a monotone 7\-measure on T. Indeed, monotonicity is obvious as is m@) = 0; continuity from the left follows from the Monotone Convergence Theorem, taking into
Triangular norm-based measures 977 account that for a nondecreasing sequence (?„)„6? <? ? which converges pointwise to A we have /1 \J[A„>0} = {A>0}. The 7\-additivity is shown as follows: ??(????)+??(?3??) [g+h(AJxB)]dp+ [ [g+h(ASAB)]dp ,?>0)?|?>0) J[A>0)U[B>0) = ? [2-g+A.((ATxB) + (ASiB))]dp J[A>0)n\B>0} + [ [g + h(AS/iB)]dp+ ? [g+h(ASAB)]dp J\A>0}n\B=0) J\A=0)n\B>0) = / [2-g+h-(A + B)]dp+ I (g + h-A)dp J\A>0}n\B>0) J[A>0)n[B=0) + / (g+h-B)dp J[A=0}D\B>0} = / (g + h-A)dp+ ? (g + h-B)dp J[A>0} J\B>0} = m(A) + m(B). Note that the measure ? involved in the representation of m via D0) does not necessarily coincide with the restriction of m to Tv. Dehnition 7.3. ? ??-measure m on ? which can be represented in the form D0) for some nonnegative ? -additive measure ? on Tv and for some ??-measurable functions g,h:X^f [0, oo] is said to be generated (by p. g and h). Theorem 6.1 shows that if ? is generated, then all finite monotone 7\-measures on ? are generated. From Theorem 5.2 we already know that Tb-measures on ? are always generated, even if ? is not generated. Thus, it is natural to ask whether, in general, ??- measures on T^-tribes are automatically generated. In order to answer this question we introduce the following notion. Dehnition 7.4. ? ?? -measure m on a ??-tribe ? is called monotonically irreducible if it is monotone and if the zero measure on ? is the only generated 7>. -measure q on ? such that m — q is monotone. It is obvious that a T\-measure m is generated if and only if it can be extended to a ?? -measure on the generated 7\-tribe (??)?: if m is generated, then it is always defined ¦/
978 D. Butnariu and E.P. Klement on the whole generated tribe (??)?; the converse follows from Theorem 6.1. On the other hand, monotonically irreducible ?? -measures, with the exception of the zero measure, are not generated and, therefore, cannot be extended to (??)?. THEOREM 7.5. Let ? e]0, oo], ? be a ??-tribe and m a finite monotone ??-measure on T. Then m can be uniquely decomposed into the sum of a monotonically irreducible and a generated ??-measure, i.e., there exist a unique monotonically irreducible ??-measure monT, a measure m on Tv (namely, the restriction ofm to Tv) and two m-a.e. uniquely determined Tv -measurable functions g,h:X—> [0, 1], such that for all A e Tv m(A)=m(A)+ / (g + h-A) dm. D1) PROOF. If ? = oo, then the result follows from Theorem 5.2 taking for g and/i the constant functions g(x) = 0 and h(x) = 1, respectively, and for m the zero measure. Assume, therefore, that ? e ]0, oof. In this case we proceed in several steps. CLAIM 1. If ? is a finite monotone ??-measure on T, then there exists a finite monotone ??-measure ? on the generated ??-tribe (??)? which is maximal in the sense that the difference ? — q is also monotone on (??)? whenever q is a ??-measure on (??)? such that ? — q is monotone on T. In order to verify this, let JV(p) be the set of all monotone 7\-measures q on (??)? such that ? — q is monotone on T. This set is partially ordered by the dominance relation defined by q^>r if and only if q — г is monotone on (Tv) . D2) Let (qj)jej be a chain in jV(p), and for each A e (??)? define q(A) = sup{q,-(A): j ej}. Exactly as in the proof of Proposition 7.1, it follows that q is a monotone finite TA -measure on (??)?. For A,B eT with A ^ В we have p(A) - q(A) = inf{p(A) - qy (A): ; e J} < inf{p(S) - qy(S): j e J} = p(B)-q(B), i.e., ? - q is monotone on T. For each к e J and for А, В e (??)? with A < B, the fact that (qj)jej is a chain induces (q - q*)(A) = sup{(q; - qk)(A): je^ sup{(qy- - q*)(B): jej} = (q-q*)(B),
Triangular norm-based measures 979 which means that q — q* is monotone for each к e J, i.e., q ^> q*. Hence, each chain in jV(p) has an upper bound in JV(p) and, by Zorn's Lemma, this implies that Л/"(р), equipped with the dominance relation ^>, contains a maximal element, completing the proof of Claim 1. The following claim shows that ??(?) is also a directed set, implying that the maximal element is indeed the unique greatest element. Claim 2. For all q\, q2 e Л/"(р) there exists a q e Л/"(р) such that q » qi and q » q2- Let qi, q2 be the restrictions of qi and q2 to Tv, respectively, let (P, Q) be a Hahn decomposition of the ?-additive signed measure ?| - q?, and define q: (??)? -> [0, oo] by 4(?) = 4,(????)+42(????). Obviously, q@) = 0 and q is continuous from the left. To see that q is also a revaluation, take into account that ? and Q are crisp subsets of X, and then we get for all A, Be (??)?: q(ATAS)+q(ASAB) = 4,(???????)+42(???????) + qi((AS,S)T,P)+q2((AS,S)T,e) = 4?((????)??(????))+4?((????K?(????)) + q2((AT, ?)??(??? Q)) +q2((AT, Q) S,(STA Q)) = 4?(????)+4,(????)+42(????)+42(????) = q(A) + q(fi). The ??-measure q is monotone since for fuzzy sets А, В e ? with A ^ В we have q(A) = (| (??? ?) + 42(??? Q) <? (| (??? ?) + qi^T;, ?) = q(S). The monotone 7\-measure q is an element of A/"(q) since for fuzzy sets А, В е (??)? with A < В we get (p-q)(A) = p(A)-q(A) = ?(??? ?) +?(??? ?) - (| (??? ?) - 42(??? Q) = (?- 4,)(??? ?) + (? - ?2)(??? ?) ^(?-4,)(????) + (?-42)(????) = ?(?)-4(?) = (p-q)(B),
980 D, Butnariu and E.P. Klement where the inequality is a consequence of q ?, q2 e Л/"(р). То complete the proof of Claim 2, it remains to show that q J?> qi andq^> q2,i.e., q —qi andq — q2 are monotone on (??)?. For each A e (??)? we get D-4?)(?)=4?(????)+42(????)-4?(????)-4?(????) = D2-4,)(????)>0, showing that q - q ? is nonnegative. This also implies that the restriction of (q - q ?)v to the ?-algebra of all crisp sets of the form L П Q with L e Tv is nonnegative and, therefore, monotone. Applying Theorem 6.1 to the 7\-measure qi — qi which is monotone on the ??-tribe of all fuzzy subsets of Q contained in (??)?, we get for all А, В e (??)? with A ^ В and for a suitable ??-measurable function k:X->[0.l]: (q-qi)(A) = (q2-qi)(ATxj2)= [ {k + A - k) · A) d(q2 - q, )v J\A7;.Q>0] = [ (* + (l-*)-A)d(q2-qi)v J[A>0)nQ ?? [ (? + A -A:)-B)d(q2-qi)v J[A>0}nQ + 1 (* + (l-*)-S)d(q2-qi)v J\A=o\n[B>o\nQ = [ (A: + (l -Ar)-B)d(q2-qi)v J\B>0)nQ = D2-4\){BTkQ) = (q-qi)(B). Monotonicity of q — q2 is shown by complete analogy, interchanging the roles of q ? and q2 as well as of ? and Q, thus completing the proof of Claim 2. Now let in be the greatest ?? -measure (with respect to the dominance relation ») in jV(m) resulting from Claims 1 and 2. Since in is defined on (??)?, Theorem 6.1 yields the representation m(A)= [ [/-(l-/).A]d(in)v D3) where (in)v is the restriction of in to ((??)?)? = Tv and /: X -> [0,1] is a suitable Tv- measurable function. Remark 7.2(H) tells us that the unique decomposition ((in)^, (т)л) of in (compare with Proposition 7.1) is given by (in)L(A)= /"(l-/).Ad(m)v, D4) Ух
Triangular norm-based measures 981 (mMA)= [ (l-/)d(in)v. D5) J[A>0\ Let (ть, шл) be the unique decomposition of m resulting from Proposition 7.1, let rii be the restriction of m to Tv, and let f: X -> [0, 1] be the Tv-measurable function such that C4) and C5) hold. Claim 3. (in)L = mL. From in e ??(?\) it follows that the function г: ? -н>- R defined by г = m - in is a monotone ??-measure. Using m - (m)L = г + (??)?, the 7\-measure m - (m)L is monotone too. Since niL satisfies Proposition 7.1(iv), we get mL ^ (m)L. D6) Conversely, note that m - ть is a monotone ??-measure on T, which, together with the fact that in is the greatest element in Л/"(т), implies that in ^> m^, from which we deduce that in - niL is monotone, showing that niL e .M(in). Since (in)L is the greatest element of .M(in), we get (in)b ^ niL and, together with D6), (т)ь = ть- CLAIM 4. The monotone ??-measure m: ? —> [0, oof defined by m = m - in D7) is monotonically irreducible. In the case in = m the claim is proved by definition. Suppose that in ? m, i.e., m is different from the zero measure. Then Claim 3 yields m = m-in = mL+m/- (in)L - (??)? = ??? - (??)? = ??? - (???)~ + (???)~ - (??)?. If we are able to show that (???? = (??)?, D8) then the proof is complete because of the maximality of {тхУ in jV(mx). In order to prove D8), note that (тя)ь is identically zero on (??)? because of m = mL + (m/)L + (ша)а and the maximality of mL in Л/"(т). Similarly, ((???)~)?, is the zero measure on (??)?. Observe that m - (mx)~ = mL + (??? - (???)~), the ?? -measure on the right-hand side being monotone on T. Since in is the greatest element in JV(m), we have in » (m\)~. From the monotonicity of m = m^ - (??)? and
982 D. Butnariu and E.P. Klement the maximality of (ша)~ in ?{?\?), we deduce that (m/)~ - (in)/ is monotone and, subsequently, (т;)~ ^ (??)?. Assume that {m\)~ ? (in)/, and define the monotone T^- measure m: (??)? -> [0, oof by m = (in)L + (mi)". Then we have m - in = (m)L + (???? - m = (m)L + (??)? - (m)x + (mA)~ - m = (???? - (??)?, D9) the latter being monotone, showing that m ^> in. On the other hand, Claim 3 yields m-m = mA - (шаГ, which implies that m — m is monotone too, i.e., m e Af(m). Therefore, m = in which, together with D9), contradicts the assumption (тд)" ф (in)/, thus completing the proof of Claim 4. Now D7) shows that m = m + in, where m is monotonically irreducible and in is generated by definition. It remains to show that D1) holds. From Claim 1 we know that (m)v < m and, therefore, (m)v is absolutely continuous with respect to m. Thus we may assume that the Radon-Nikodym derivative d(in)v dm has values in [0,1] only. Taking into account D5) we obtain for each A e (??)? (m)k(A)=[ ld(m)v=f /.^-dm. J[a>0] J[a>o\ dm Writing d(in)v g=l- —r^- dm and taking into account Claim 3 and C4), we have for all A e (??)? in(A) = (m)L(A) + (in)A(A) = mL(A) + (??)?(?) = f (l-/)-Adm+ /" gdm= [ [g + (l-/)-A]dm. J[A>0} J[A>0} J[A>0} Putting h = 1 - / gives D1). Since in is unique and /, / and the Radon-Nikodym derivative d(in)v dm are m-a.e. uniquely determined, so are g and h. ?
Triangular norm-based measures 983 A combination of the results of Theorems 4.8 and 7.5 leads to the following corollary. COROLLARY 7.6. If ? e ]0, oo], ? is a ??-tribe and m is a finite monotone ??-measure on T, then there exists a unique finite nonnegative ?-additive measure ? on Tv, a p- a.e. uniquely determined Tv -Markov kernel К : ? ? ?\ —> Ш and a unique monotonically irreducible ??-measure топ ? such that for all A e ? m(A)=ih(A)+ / K(jt,[0,A(jt)[)dp(jt). •/? From the results in Mesiar [60] and Mesiar and Navara [62] it follows that for ? e ]0, oo] each ??-measure m, which is defined on a 7\-tribe on a countable set, is a generated measure, i.e., m equals the zero measure. 8. Jordan decomposition of bounded T^-measures It was shown in Theorem 7.5 that monotone Frank triangular norm-based measures can be decomposed into a sum consisting of a generated measure and a monotonically irreducible measure. Moreover, if the tribe on which the Frank triangular norm-based measure is defined is a generated one, then the monotonically irreducible component is necessarily zero (compare Theorem 6.1). It is an interesting open question whether the monotonically irreducible component of a Frank triangular norm-based monotone measure on a nongenerated tribe is always identically zero. We shall prove that in the particular case of the ^-measures, this happens even if the TL-measure is not monotone. The fact that ^-measures are Jordan decomposable can be deduced directly from a result in Schmidt [71]. However, we present the proof due to Butnariu [14] because some technical details of this proof will be used later. It is natural to ask in which sense ^-measures differ from other Frank triangular norm- based measures. In order to answer this question, we recall that ?-measures are always T- countably additive functions but, in general, ?-countable additivity does not characterize ?-measures (see Remark 4.4). However, 7l-countably additive functions are necessarily 7l-measures. To be precise, we have the following result. PROPOSITION 8.1. Ifm is a finite T^-countably additive function on a T^-tribe T, then m и a T^-measure on T. PROOF. Observe that (ATLC(ATLS))TLB = 0, (ATlC(ATlS))SlS = ASlS. Therefore, using the TL-additivity of m we get m( A S|_ B) = m(A TL C(A T|_ B)) + m(S). But A = (ATLC(ATL S))Sl(ATl S), and the fuzzy sets ATLC(ATL B) and ATL В are 7l -disjoint. Hence, т(А) = т(АТ|_С(АТ|_б))+т(АТ|_б),
984 D. Butnariu and E.P. Klement and this implies m(ASLS) = m(A) - m(ATL B) +m(B), showing that m is a ^-valuation on T. Finally, we show that TL-countably additive functions are left continuous. Let (A„)neN be a nondecreasing sequence in ? which converges to A. Define Aq = 0 and, for each ? e N, B„ = A„ TL С A„_ ?. Then for all ? e N (Sl S,)tlZ?„+i = A„Tl(A„+|TlCA„) =0, showing that the sequence (В„)пец is Tl-disjoint. Since for each ? e N we have we A„ obtain A = = Sl k = oo = Sl ,Bk, 1 B,u and, consequently, lim m n—>oo (A„) = lim m( Sl Si I = Hm Fm(Bt) = Vm(fi„) v*-' / k = \ n=\ = тЫ^В,Л=т{А). Hence, m is a ^-measure. ? Proposition 8.1 allows us to approach Tl-measures on T^-tribes in a way similar to the treatment of (?-additive) measures on ?-algebras of subsets of X. Dehnition 8.2. Let ? be a TL-tribe, m: ? -> [-oo, oo] a function such that m@) = 0, and AeT. (i) The function m+: ? -> [0, oo] defined by m+(A) = sup{m(B): BeT(A)} E0) is called upper variation of m, where T(A) denotes the class of all fuzzy subsets of A contained in T, i.e., T(A) = {B: BeT,B^A]. (ii) The function m~ : ? -> [0, oo] defined by m~(A) = sup{-m(B): В eT(A)} E1) is called lower variation of m.
Triangular norm-based measures 985 (iii) The function m* : ? -» [0, oo] defined by m* = m+ + m is called total variation of m. Obviously, these functions are all monotone, and they satisfy m+ = (-mr, пГ = (-т)+, m+@) =m_@) = m*@) =0. E2) To simplify notation, from now on we shall use the following shortcut: for fuzzy sets А, В e ? we shall write AQB = AlLZB. Obviously, each 7l-tribe ? is closed under this operation. Proposition 8.3. (i) Ifm is finitely T^-additive, then m+, m~ and m* are finitely T^-additive too. (ii) Ifm is a T^-measure, then m+, m~ and m* are monotone ?^-measures. PROOF. When proving (i), it is sufficient to show that m+ is 7l-additive (compare with E2)). Let ? and F be TL-disjoint fuzzy sets in T. For each В е T(E SL F) the fuzzy sets С = ? ?|_ ? and D = ? ? С are elements of T, and they form a TL-partition of B. Since D e T(E) and С е T(F), we have ??(?) = m(C) + m(D) < m+(F) + m+(F), which implies m+(F SL F) sC m+(F) + m+(F). For each ? > 0 there exist fuzzy sets A e T(E) and В eT(F) such that m+(F)<m(A)+^, m+(F) < m(S) + |. From A Tl S ^ F Tl F we deduce m+(F) + m+(F) ^ m(A) + m(B) + ? = m(A SL ?) + ? < m+(F SL F) + ?. Taking the limit i^Owe obtain m+(F) + m+(F) < m+(F Sl F), and this completes the proof of (i). In order to show the validity of (ii), assume that m is a TL-measure. According to Proposition 8.1 and equations E2), it is sufficient to prove that m+ is TL-countably additive. Let (А*)*ем be a TL-disjoint sequence of fuzzy sets in T. Then the sequence (S„)„sN defined by B„ = Sl Ak
986 D. Butnariu and E.P. Klement is nondecreasing and, since m+ is 7l -additive, it follows that for each ? e N ? т+(б„) = ]Гт+(А*). Thus we have lim m+(Bn) = Ym+(Ak). n—>oo ·—> k=\ Since for each ? e N oo Bn ^ Sl ^* k = \ and since m+ is monotone, we obtain @0 4°° S_LA*b]Tm+(A*). E3) m _„ .. , \ I— I / * = l For each agt(sIa* we have (OO \ / ОС \ ОС Sl аЛ = ATM ( Sm S» ) = Sm (ATM B„) = lim (ATM B„). k = \ / V n=\ / »=l "^°° The sequence (АТм В„)пещ is nondecreasing and m is left-continuous since it is a 7l- measure. Hence we have m(A) = m( lim (АТмб,,)) = "т т(АТмй„) ^ lim m+(B„) \n—>oov '/ и—>oc и—>oo because of ATM B„ e T{B„), which implies oo т(А)^]Гт+(А*). Therefore, we obtain equality even in E3). and the proposition is proved. ?
Triangular norm-based measures 987 Proposition 8.4. (i) A finitely 7l-additive function m.T —> [—00, 00] is bounded if and only if m* is bounded. (ii) If the finitely T^-additive function m: ? —> [—oo, 00] is bounded, then we have m = m+-m~, E4) i.e., the pair (m+, m_) is a Jordan decomposition ofm. PROOF. We first show (i). If m* is bounded, then for each AeT |m(A)| sC sup{m*(B): В e T] < 00, i.e., m is bounded. Conversely, if m is bounded, then max(m+(A),m~(A)) sC sup{|m(B)|: В eT(A)} sCsup{|m(B)|: В e T} < 00 for all A eT. Hence, m+ and m~ are both bounded, and, consequently, so is m*. In order to prove (ii), assume that m is bounded. Then m+, m~ and m* are bounded too, and for A e ? and ? e ? (A) we have m(M) = m(A) - m(A QM). Since ? ? ? e T(A), it follows that m+(A) > m(A ? ??) and m~(A) > -m(A ? ??). Hence, m(A) - m+(A) ^ m(A) - m(A ? ??) ^ m(A) + m~(A) and, consequently, m(A) - m+(A) < m(M) sC m(A) + m~(A). From E0) and E1) we deduce m~(A) sC m+(A) - m(A), m+(A) < m(A) + m~(A), and this completes the proof. ? A straightforward combination of Proposition 8.4, Theorem 5.2 and Theorem 3.3(H) yields the following corollary. Corollary 8.5. Let m be a bounded T^-measure on a T^-tribe T. Then we have: (i) m и continuous in the sense of A5); (ii) for all AeT, m(A) = f A(;t)d((m+)v - (m-)v), E5) Jx where (m+)v and (m~)v are the restrictions ofm+ and m- to Tv, respectively;
988 D. Butnariu and E.P. Klement (iii) (m+)v and (m )v are the upper and lower variation o/m, the restriction ofm to Tv, and, therefore, m = (m+)v-(m-)v. E6) 9. Jordan decomposition of finite 7l-measures In Section 8 we have shown that bounded ^-measures on T^-tribes have Jordan decompositions and, therefore, are continuous (in the sense of A5)) and can be represented by integrals (see E5)). The aim of this section is to show that finite 7l -measures on 7l- tribes are necessarily bounded and, consequently, have Jordan decompositions by finite TL-measures, a result which is due to Butnariu [14]. LEMMA 9.1. Let m be a finite T^-measure on a T^-tribe T. The following are equivalent: (i) m и bounded; (ii) m+(X) < oo and m~(X) < oo. PROOF. The implication (i) =>· (ii) is trivial. Assume that (ii) holds. The monotonicity of m+ and m~ implies that they are both bounded. Thus m* and, consequently, m are bounded (see Proposition 8.4). ? THEOREM 9.2. Ifm is a finite T^-measure on a T^-tribe T, then there exist nonnegative finite T^-measures m+ and m~ on ? such that m = m+ —m~, E7) i.e., (m+, m~) is a Jordan decomposition ofm. PROOF. Let m be a finite TL-measure on T. According to Proposition 8.4 and Lemma 9.1, it suffices to show that m+(X) < oo and m~(X) < oo. In order to show m+(X) < oo, a fuzzy set ? & ? such that m+(M) = oo will be called unbounded. CLAIM. If ? is unbounded, then there exists a nonincreasing sequence (Ап)„ещ of unbounded fuzzy sets in T(M) such that m(A„) ^ ? for all ? e N. Let ? be unbounded and assume that there exists an no e N such that m(A) < no for all unbounded fuzzy sets A e T(M). Since ? is unbounded, there exists an M\ e T(M) such that m(M\) ^ no, which means that M\ is not unbounded, i.e., sup{m(A): ???(??,)} <oo. E8) Choose an arbitrary В e T(M) and denote B\ = ? ?? ?\, ?? = ? ? Si. It is clear that ?, TL B2 = 0, Si SL B2 = S, Si e T(M|) and B2 e T(M ? M\). Therefore, m(S) = m(B|)+m(B2)and m+(M ? M\) > m(B2) = m(B) - m(S,) > m(S) - m+(Ai,).
Triangular norm-based measures 989 Since this is true for all В e T(M), since ? is unbounded and since E8) holds, we have m+(M ? ? ?) ^ m+(M) - m" (?,) = oo, i.e., ? ? ? ? is unbounded. Then, by assumption, m(M Q M\) < щ, and there exists a fuzzy set C, e ?(?? ? ?,) such that m(C|) ^ 1. Put M2 = ?, SL C|. Then ?, TL С ? ^ Ai|TL(MeM|) = 0and m(M2)=m(M|) + m(C|)^ii0+1. E9) Using the assumption again yields that Mi is not unbounded. A similar reasoning as above shows that ? ? ??2 is unbounded too. Hence, m(Ai ? Mi) < no and m(Ci) ^ 1 for some C2 e ?(?? ? M2). Denote M3 = Мг^Ст. Since Ai2TL C2 < ??2?,_(?? ? ?2) = 0 it follows from E9) that we have т(ЛГ?) = m(M2) + m(C2) ^ no + 2, i.e., My, is not unbounded. In this way we obtain fuzzy sets M\, Mi, My,,... in T(M), none of which are unbounded, and C\, C2, C3,... such that C„ e T(M ? M„) and m(C„) > 1 for all и е N. Moreover, we can show that for all и e N ? ]Гс, <?-??|. F0) i = \ Indeed, if ? = 1 this is true because of the definition of C\. If F0) holds for all к е {1, 2,...,«}, then we have for each ? e X: C„+| (x) < max@, ? (л) - ?„+\(?)) = Щх)-М„+\(х) = М(х)-(Мп(х) + Сп(х)) = М(х) - (??„_? (л) + Ся_| (.?) + С„(дс)) = М(дс)-М|(дс)-5]С1-(дс). i-l Hence, F0) is proved by induction. An immediate consequence of F0) is that the sequence (Cn)neN is Tl-disjoint which, together with m(C„) ^ 1, yields (oo \ °° m( Sl C„ ) = 2^m(C„) = oo. n=\ Since m is finite, this contradicts the assumption that m(A) < no for all unbounded fuzzy sets A e T(M). Therefore, there exists an unbounded fuzzy set ? ? e T(M) such that
990 D. Butnariu and E.P. Klement m(A ?) ^ 1. Since A | is unbounded, there exists an unbounded fuzzy set A 2 e T( ? ?) such that m(A2) ^ 2, and so on, showing that the claim holds. To complete the proof of the theorem, assume that X is unbounded and choose the nonincreasing sequence (A„)neN of unbounded fuzzy sets in ? which exists according to the claim. For each ? e N denote n-l S„=Sl(A*©A*+i). k = \ Clearly, since (A„)n6N is nonincreasing, the sequence (?* ? A^+\ )km is 7L-disjoint, and therefore, m(B„) = ]Tm(A* ? Ак+\) = m(A,) - m(A„). k=\ The sequence (В„)пещ is nondecreasing which leads to m( lim B„) = lim m(B„) = m(A\)— lim m(A„) = -oo, \il—>OC / И—>ЭС И—>3C where the latter equality follows from m(A„) ^ и and m(A|) < oo. This contradicts the finiteness of m. Therefore, the assumption that X is unbounded is false and, consequently, m+(X) <oo. The inequality m~ (X) < oo is shown in complete analogy, m is bounded by Lemma9.1, and the Jordan decomposition exists because of Proposition 8.4. ? Since each bounded TL-measure is automatically finite, in the proof of Theorem 9.2 we have implicitly proven the following result. Corollary 9.3. A T^-measure топа T^-tribe ? is bounded if and only if it is finite. Combining Corollary 8.4 and Theorem 9.2 we realize that finite TL-measures on TL-tribes are exactly (Lebesgue) integrals. COROLLARY 9.4. Let m be a finite T^-measure on a T^-tribe T. Then we have (i) m и continuous in the sense of A5); (ii) Ifim+У and (m~)v are the restrictions o/m+ and m~ to Tv, respectively, then for all AeT m(A) = [ A(;t)d((m+)v - (m-)v); F1) Jx (iii) (m+)v and (m~)v are the upper and the lower variation, respectively, ofm, the restriction ofm to Tv, and m = (m+)v-(m-)v; F2)
Triangular norm-based measures 991 (iv) If(P, Q) is a Hahn decomposition of X with respect to the ?-additive measure m on Tv, then for all AeT m(<2) = -??-(?) ^ m(A) < m+(P) = m(P). F3) 10. Absolute continuity of ^-measures TL-measures on not necessarily generated 7l -tribes will play an essential role in the following exposition. Some of the important properties of such Tb-measures are the absolute continuity and the singularity of 7l -measures. These will be studied in this section. DEnNlTlON 10.1. Let m and ? be TL-measures on a TL-tribe T. The measure ? is called m-absolutely continuous (briefly ? <<c m) if for each A e ? (AC) n(A) = 0 whenever m*(A) = 0. Absolute continuity is a reflexive and transitive relation. Moreover, if ? ? <*C m and П2 <K m, then also ? ? ± 112 « m. There are several characterizations of the absolute continuity which are rather easy to prove (see Lemma 11.2 of Butnariu and Klement [17]). LEMMA 10.2. Let m and ? be T^-measures on a T^-tribe T. The following are equivalent: (i) ? « m; (ii) for all A e T: n(A) = 0 whenever m*(A) = 0; (iii) for all A e T: n+(A) = 0 = n~ (A) whenever m*(A) = 0; (iv) for all A e T: n*(A) = 0 whenever m*(A) = 0; (v) n* «cm*; (vi) for each ? > 0, there exists a <5F > 0 such that for each AeT n*(A) < ? whenever m*(A) < <$F; (vii) for each ? > 0, there exists an ?? > 0 such that for each A e ? |n(A)| < ? whenever m*(A) < ??\ (viii) ? <? m; (ix) (?)* «(m)*. A property which is dual to absolute continuity is that of singularity. Some characterizations of the singularity which are dual to the characterizations of the absolute continuity and which are listed below (see Lemma 11.4 of Butnariu and Klement [17]).
992 D. Butnariu and E.P. Klement Dehnition 10.3. Let m and ? be FL-measures on a FL-tribe T. The FL-measure ? is said to be m-singular (briefly ? _L m) if there exists a fuzzy set A e ? such that n*(A) = m*(CA)=0. LEMMA 10.4. Let m and ? be T^-measures on a T^-tribe T. The following are equivalent: (i) ? _L m; (ii) m _L n; (iii) there exists a fuzzy subset A of X such that AeT and m*(A) = n*(CA) = 0; (iv) n+ _L m and n" _L m; (v) n* _L m*; (vi) ? _L m; (vii) (n)*_L(m)*. The following properties of the absolute continuity and the singularity of finite T^- measures will be used later. Proposition 10.5. (i) If п\,П2 and m are finite T^measures on a ??^-tribe ? such that ? ? _L m and П2 -L m, then n\ ± 112 _L m. (ii) Ifm and ? are finite T^-measures on a T^-tribe ? such that ? <K m and ? _L m, then ? equals the zero measure. PROOF. The proof of (i) is trivial. In order to show (ii), let m and ? be TL-measures on ? such that ? <^ m and ? _L m. Since ? <K m, it follows that for each к eN there is a &\/k > 0 such that n*(F) < l/k whenever F e ? and m*(F) < 5|/*· Since nlm, there is a set A e ? such that n*(CA) = m*(A) = 0. Clearly, n*(A) = 0 since m*(A) < 5|/* for all k e N. If ? e T, then n*(F) = n*(FTLA)+n*(FTLCA)=0 because of F Tl A eT(A), FTlCA e T(CA) and the monotonicity of n*. D 11. Vector /^-measures with Darboux property In Section 9 we proved that finite Tl-measures on FL-tribes have bounded range (see Corollary 9.3(iv)). The question is now whether or under which conditions the range of a finite Гь-measure on a Гь-tribe (generated or not) is exactly an interval. This problem is studied here in a more general context, i.e., we ask whether or under which conditions a finite vector valued rL-measure on a rL-tribe has a convex and compact range. For measures defined on a ? -algebra of crisp subsets of X (which are particular FL-measures) this problem was approached in Poprugenko [67], and it was solved in Liapounoff [52]. In Halmos [33,34], Blackwell [9], Chernoff [19], and Lindenstrauss [53], Liapounoff's
Triangular norm-based measures 993 result was reproved and/or extended. In view of Corollary 9.4, in Dvoretzki, Wald and Wolfowitz[29] it was shown that the range of a vector ^-measure on a generated TL-tribe is necessarily convex and compact. We intend to generalize this result by showing that nonatomic ^-measures on TL-tribes (generated or not) have convex and compact range. To this end, we first study the class of ^-measures with Darboux property. In this section ? denotes a Гь-tribe on X. In what follows, all ^-measures are assumed to be defined on T. An ?-vector T^-measure on ? is a function m = (Ш|, m2,..., m„) from ? to R" such that each m, is a finite TL-measure. If m is an ?-vector TL-measure, we denote the range Ran(m) of m by Ran(m) = {m(W): W eT}. Clearly, 0 e Ran(m) and Ran(m) is bounded in R" (compare Corollary 9.3). DEHNITION 11.1. An ?-vector ^-measure m = (Ш|, Ш2,..., m„) has the Darboux property if for each pair (E, F) of fuzzy sets in ? with ? ^ F there exists a family (Wr)re[0.\] in ? such that for all r e [0, 1] ? <; wr <: F, F4) m(Wr) = (l-r)-m(E) + r-m(F). F5) This is just an extension of the notion introduced in Poprugenko [67]. A typical example of a measure having the Darboux property is the Lebesgue measure on the Borel subsets of [0, 1 ]. By Corollary 9.4, finite TL-measures on generated tribes have the Darboux property. PROPOSITION 1 1.2. Let m = (mi, m2,..., m„) be an ?-vector T^-measure on T. Then m has the Darboux property if and only ifm is semiconvex in the sense that for each F eT there exists a fuzzy set W eT{F) such that m(W) = t.m(F). PROOF. Assume that m has the Darboux property. Then for each F eT and for ? = 0 there exists a family (Wr)r6lo.ij с ? such that F4) and F5) hold. In particular, W = W\/2 e T(F) and m(W) = j ¦ m(F), and the semiconvexity of m follows. Conversely, let m be semiconvex, consider E, F eT such that ? ^ F and denote Wq = Vq = ? and W\ = F. Then there exists a fuzzy set U\ e T(W\ ? Vn) such that m(U\) = \ ¦ m(W\ ? V0). Denoting V| = Vo S|_ U\ and using the TL-disjointness of Vo and U\, we obtain m(V|) = m(Vo) + m(i/i) = ? · (m(W|) + m(W0)). By the semiconvexity again, there is a fuzzy set Ui e T(W\ ? V|) such that m(i/2) = { ¦ m(W| ? V|). Denote V2 = V, SLU2, then we get m(V2) = m(V,) + m(i/2) = \ ¦ m(W|) + g · m(Wo). Continuing in this way we can generate sequences (U„)„e^ and
994 D. Butnariu and E.P. Klement (Vn)neN in ? such that (t/„)neN is TL-disjoint, (V„)iieN is nondecreasing, and for each ? e N we have m(V„) = ^ ¦ m(W0) + ( 1 - — ) · m(W,), 04) m(t/„) = --m(W|©V,,_|)=^-(m(W|)-m(V,,_i)). Choose an arbitrary r e ]0, 1 [ and let oo ? Pi 2' i = \ be its (unique) nonterminal binary representation (i.e., the binary representation with Pi e {0, 1} and pi ? 0 for infinitely many i e N). For each к eN define r*=Ef' Zri=WoSL(SLPi-u\ Clearly, Zn eT(W,)and m(Zrt) = m(W0) + ??·- т(^") = r* · m(w\) + (!-»·*)· m(Wo)· i = l The sequence (Zrt)igN is nondecreasing. Hence, the limit Wr = Нт*-»-»^. exists, belongs to T( W\) and satisfies m(Wr)= lim m(Zn) = r ¦ m(W\) + A - r) -m(W0). Obviously for each r e [0, 1] we have W0 ^ Wr ^ W\, and thus the proposition is proved. ? PROPOSITION 11.3. An ?-vector T^-measure m= (mi,m2, ...,m„) on ? has the Darboux property if and only if for each pair (E, F) of fuzzy sets in ? with ? ^ F there exists a family of fuzzy sets (Wr)r6[0.i] Я Т such that F4), F5) and Wr ? Ws whenever 0 ^ r < s ? 1 F6) are satisfied.
Triangular norm-based measures 995 PROOF. Sufficiency trivially holds. Assume that m has the Darboux property and start with W0 = ? and W\ = F. Then there exists Z|/2 e T(W\ ? Wo) such that m(Z|/2) = 2 · m(W| ? Wo) (see Proposition 11.2). For W\p_ = Wo S|_ ?\? we have m(W,) = i-m(W|) + i-m(W0). Using again the semiconvexity of m, we obtain a fuzzy set ? ?/4 e T(W\/2 ? Wo) and a fuzzy set Z3/4 e T(W| ? W|/i) such that m(Z|/4) = ? -111(^1/2© W), m(W3/4)=2-m(W| ©W|/2). Denote W|/4 = Wo Sl ?1/4 and W3/4= W1/2SLZ3/4. It is clear that WoiJ W|/4<W|/2<W3/4< W| and m(W|/4) = I · m(W,) + I · m(Wo), m(W3/4) = | · m(W,) + J · m(W0). In this way we obtain for each ? e N a finite family (Wk/2")ken 2 2"-?? °^ fuzzv sets m ? such that Wo ?? W\pn ij W2/2" < · · · < WB--|)/2" < W, and m(Wk/2n) = ^ · m(W,) + ? - ?? . m(w0) for all A: e {0, 1,..., 2"}. Each r e ]0, 1 [ can be written as r = lim„^oo rk with rk as above. The fuzzy set Wr = Hindoo Wn is contained in ? and satisfies F5). A simple computation shows that the family (Wr)r6[o, 1] also fulfills F4) and F6). ? COROLLARY 11.4. An ?-vector TL-measure m = (Ш|,..., m„) on ? has the Darboux pmperty if and only if for each ? e ? there exists a family of fuzzy sets (Wr)r6[0.1] in ? such that the condition F6) holds and for each r e [0, 1 ] m(Wr) = r-m(?). F7) Remark 11.5. Halmos [34] called и-vector measures m (defined on a ?-algebra A of crisp subsets of X) convex measures if the following property holds: For each ? e A the set /C(m, E) of all (А П ?)-measurable functions g:E-> [0, 1 [, such that m([x e E: g(x) < r}) = r · m(?) for all r e [0, 1 ], is convex.
996 D. Butnariu and E.P. Klement Here Ad ? denotes the restriction of the ?-algebra A to F, i.e., АПЕ = {АПЕ: АеА}. Combining Lemma 4 from [34] and Corollary 11.4, it follows that each convex measure has the Darboux property. Since the convexity of an и-vector measure m implies its semiconvexity (compare [34, Lemma 2]), an и-vector measure m (defined on a ?-algebra of crisp subsets of X) is convex if and only if it has the Darboux property. Now we are able to state the main result of this section, namely, a convexity theorem of Liapounoff type for vector 7l-measures with Darboux property: THEOREM 11.6. If m = (m\,..., m„) is an ?-vector T^-measure on ? with Darboux property, then its range Ran(m) is convex. PROOF. It suffices to prove that for each pair (?, F) of fuzzy sets in ? and for all r e [0, 1] there exists a fuzzy set G e ? such that m(G) = r-m(E) + (l-r)-m(F). F8) First assume that FTLF = 0. Then there exist Er e T(F) and F|_r e T(F) with m(Fr) = r ¦ m(F) and m(F|_r) = A - r) · m(F) (compare Corollary 11.4). Denote G = Er S|_ F| _r. Since Er T\_ F\ _,· = 0, we have proven F8) in this case. Consider the case that ? and F are not 7l -disjoint. Then, however, A = ? ? (FTL F),B = F ? (FTl F) and С = FTl F are TL -disjoint. Hence, there exist Ar e T(A)and B\-r e ?(?) such that ArTLS|-r = 0 and m(ArSLS|_r)=r-m(A) + (l -r)m(B). Denote G = (Ar SL В, _r) SL С. Then m(G) = r ¦ m(A) + A - r) · m(B) + m(C) F9) since Ar, ?|_? and С are 7l-disjoint. From m(A) = m(F) — m(C) and m(B) = m(F) - m(C) it follows that F8) holds in this case too (see F9)). ? In the special case of the finite 7l-measures Theorem 11.6 can be restated as follows: COROLLARY 11.7. Ifm is a finite T^-measure on ? having the Darboux property and if (P, Q) is a Hahn decomposition of X with respect to the ?-additive measure m on Tv, then for each ? e ? (m(W): W e T(F)} = [m(Q ¦ F), m(F · ?)]. In particular, we have Ran(m) = [m(<2), m(P)].
Triangular norm-based measures 997 PROOF. For ? e ? and W e T(?) we get m(? · Q) = -m"(?) < -m-(W) < m(W) ^ m+(W) < m+(?) = m(P · ?). If r e [m(? · Q), m(? · P)] then there is some s e [0, 1] such that _lsm(EP) ifr^O, ?"|?·??(? Q) ifr<0. Hence, in both cases there exists a fuzzy set W e T(?) such that m(W) = r (see Corollary 11.4). ? Remark 11.8. Taking into account Corollary 11.7, a finite 7l-measure m on ? has the Darboux property if and only if for each ? e ? the set {m(W): W e T(?)} is convex, i.e., if and only if this set is an interval. Generally speaking, if m = (mi,..., m„) is an и-vector 7l -measure on ? with ? ^ 2 and if for each ? e ? the set Ran?(m) = {m(W): W e T(?)} is convex, then m has the Darboux property (compare Proposition 11.2). However, we do not know whether the converse also holds. Clearly, for ? = X the set Ran?(m) is convex by virtue of Theorem 11.6. But the proof of Theorem 11.6 can not be applied for ? ? ? since, in general, we do not have AS|_ В e T(?) for each pair (?, ?) in T(?). Such an implication, however, is substantially involved in the proof. 12. Nonatomic /^-measures In the preceding section we have shown that the range Ran(m) of an и-vector 7l-measure m = (mi, ??2,..., m„) with Darboux property is convex. Under which conditions for the components m, of m can we deduce the Darboux property for m? Note that if m has the Darboux property, each of its components Ш|, mi,..., m„ necessarily has the Darboux property. In this section we show that the converse is also true. This is one of the results of Butnariu [12,13], and it involves ideas contained in the proof of the Liapounoff theorem given in Halmos [34]. Let ? be a 7l-tribe. All 7l-measures considered in this section are assumed to be defined on T. If m = (mi, ni2,..., m„) is an и-vector 7l-measure, then define the finite 7l -measure m': ? -> R by m' = mf + ni2 + h m*. Dehnition 12.1. Let m = (mi, Ш2,..., m„) be an и-vector 7l-measure on T. (i) A 7l-measure ? on ? is called m-absolutely continuous if it is m'-absolutely continuous, i.e., if n<K m'.
998 D. Butnariu and ?.P. Klement (ii) A 7l -measure ? on ? is called atomic if there exists a fuzzy set A e ? with n(A) ^ 0 such that for each В e T(A) we have either ?(?) = 0 or ?(?) = n(A). (iii) A T^-measure ? on ? is called nonatomic if it is not atomic, i.e., if for each AeT with n(A) ? 0 there exists а В e T(A) such that ?(?) ? {?, ?(?)}. (iv) The и-vector 7l -measure m is called nonatomic if each of its components Щ|, ??2,..., m„ is nonatomic. THEOREM 12.2. Let m = (mi,m2, ...,m„) be an n-vector T^-measure on T. The following are equivalent: (i) m has the Darboux property; (ii) for each i e {1, 2, ..., n] the component m/ has the Darboux property, (iii) m is nonatomic. PROOF. The implications (i) =>· (ii) and (ii) =>· (iii) are trivial. It remains to show that (iii) =>· (ii) and (ii) =>· (i). The proof of these implications will be done in the following sequence of lemmas. ? Lemma 12.3. Assume that E, F e ? with El\_ F = 0, and let W=(W|,W2,...,W„) be a nonnegative ?-vector T^-measure on ? with Darboux property and ? a finite T^-measure on ? such that n«w. Then there exists a family of fuzzy sets (Hr)re[0.\] in ? such that for each r e [0, 1] w(Hr) = r -w(?) + (l -r)-w(F), G0) H0 = F, Я,=?. G1) Moreover, for the function /„ : [0, 1] -> R given by fa{r) = n(#r) we have /n is continuous. G2) PROOF. If w(?) = 0 or w(F) = 0, then such a family (#r)r6[o. i] exists by Corollary 11.4. Assume that w(?) ? 0 and w(F) ? 0. Since w has the Darboux property, by Proposition 11.3 there are two families (Wr)re[o. i] and (Zr)r6[o. и in ? such that for r, s e [0, 1] with r < s we have Wo = Zo = 0, W,=?, Z,=F, G3) Wr < Ws, Zr^Zs, G4) w(Wr) = r -w(F), w(Zr) = r-w(F). G5) Since FTLF = 0 it follows that WrlLT\-r = 0 for all r e [0,1]· Denote Hr = Wr SL Z,_r. Then G0) and G1) are clearly satisfied. Choose s,re[0,l] with s > r. Then l/n(r) - /n(i)l = ln(Z|_r © Z|_.5) - n(Ws ? Wr)|
Triangular norm-based measures 999 ^ \n(Ws ? Wr)\ + |n(Z,_r ? Z\-s)\. G6) For a given ? > 0 there exists a ?? > 0 such that for each A e ? we have , . ? |n(A)| < - whenever w (A) < <5?, G7) because of n<Z? w1. Since w is nonnegative, we get W(WS ? Wr) = W(WS) - w'(Wr) = (s- r)-w'(E) and W(Zi-r ez\-s) = W(Z\-r)-W{Zi-s) = (j - r) · W(F) because of G4) and G5). Denote . (, ?? ?? \ ?? = nun 1, — , — . V 'w'(F) W(F)J If 0 < s - r < ?? then we deduce (compare with G7)) |n(Ws ? Wr)\ < ?/2 and |n(Z|_r ? Z\-s)\ < ?/2. Taking into account G6) it follows that |/„(r) - fn(s)\ < ? whenever 0 < s — r < ?(?). Hence, /„ is continuous. ? LEMMA 12.4. Ifw = (wi, wi,..., w„) is a nonnegative ?-vector T^-measure on ? which has the Darboux property, then for each finite T^-measure ? with ? <? w the (n + \)-vector T^-measure w' = (wi, W2,..., w„, n) has the Darboux property. PROOF. It is sufficient to show that w' is semiconvex (see Proposition 11.2). Suppose that A eT satisfies w(A) ? 0. Then there exists an ? e T(A) with w(?) = 5 · w(A). If n(?) = 2 ¦ n(A), we are done. Assume that n(?) ? \ ¦ n(A) and denote F = ? ? ?. Clearly, j ¦ n(A) is contained in the interval with the boundaries n(?) and n(F). Since ? and F satisfy Lemma 12.3, there exists a family (#r)r6[0.1] ш Т sucn mat G0), G1) and G2) hold. Hence, there is an ro e [0, 1] such that n(//ro) = /n(r0) = i-n(A) because of /n@) = n(F), /n(l) = n(?) and the continuity of /„. For ? - НГA we deduce thatn(W) = J n(A) and yv(H) = r0 ¦ w(?) + A - r0) · w(F) = i · w(A), i.e., w'(W)= ^ -w(A). D
1000 D. Butnariu and ?. P. Klement LEMMA 12.5. Ifm and ? are finite T^-measures on T, if ? <g m and ifm is nonnegative and has the Darboux property, then ? also has the Darboux property. Moreover, m ± ? are finite T^-measures with Darboux property. PROOF. Indeed, if in Lemma 12.4 we take ? = 1 and W| = m, then the 2-vector measure (m, n) has the Darboux property, and the result follows. ? LEMMA 12.6. Ifm = (m|,mi,...,m„) is a nonnegative ? -vector T^-measure on ? with m, <g m,_ | for all i e {2, 3,..., n) and ifm\ has the Darboux property, then m itself has the Darboux property. PROOF. For ? = 1 the result is trivial. Assume that the result holds for each к < n. In order to show its validity for к = ? consider the (n - l)-vector T^-measure w = (mi, ??2,..., m„_ ?) which has the Darboux property. Since m„ «; m„_| « mi + m2 ? h m,,-i = v/, it follows that m„ «; w. Hence, (w, m„) = (mi, mi,..., m„) has the Darboux property by Lemma 12.4. ? LEMMA 12.7. If ? is a finite nonatomic nonnegative T^-measure on T, then for each AeT with n(A) ? 0 there exists a sequence (А/0/teN,, in T such that for all к e N (i) А = А0>А\>А2>--->Ак, (ii) 0<?(?*)<;1·?(?*_|), (iii) Hindoo Ak =0. PROOF. Put So = A and assume that we have already obtained fuzzy sets Bq ^ B\ ^ B2 > ¦ ¦ ¦ > Bk in ? such that 0 < ?(?,) < A/2') · n(A) and ?(?,) < { ¦ ?(?,-?) for i e {1,2, ...,k]. Then there exists В e T(Bk) with 0 < ?(?) < n(Bk) because of the nonatomicity of n. Define ?* + ? by _ \B ifn(B)<;-n(Bt), Bk + \ = \ [ Bk ? ? otherwise. Clearly 0<?(?*+?)<?·?(?*) and ?* + ? < Bk. Moreover, we have ?(?(+?)^·?(?). The sequence (Bk)ke^ с ? obtained in this way is nonincreasing and its limit W = Hindoo Bk belongs to ? (see Theorem 3.3(i)). Since n(Bk) < ^ · n(A) for all к eN,
Triangular norm-based measures 1001 it follows that n(W) = 0 because of the continuity of n. If we define Ak = Bk ? W for к e N and Ao = A, then the sequence (Ак)кец0 satisfies the properties (i)—(???). ? LEMMA 12.8. If ? is a nonnegative nonatomic finite 7l measure on T, then ? has the Darboux property. PROOF. We prove that ? is semiconvex. For each A eT with n(A) ? 0, there exists a sequence (A*)*eN0 - ^ sucn tnat (i)-(i») ш Lemma 12.7 are satisfied. Denote Wk = Ak-\ ? Ak for each к e N. From (i) it follows that (W,t)/ieN is a ^-disjoint sequence in T(A). For ieN and re@,1) define the fuzzy set @ ifr=0. Ifre[0, l]and ЭС k = \ its unique nonterminal binary representation, denote t*W = SLWk.ll. k=\ Clearly, t *W eT(A) and ОС n(t*W) = J2tk -n(Wk). k = \ We claim that the function g : [0, 1 ] -> R defined by g(t) = n(i * W) is continuous. Indeed, if 0 ^ s < t !? 1, then there exists a smallest jeN such that f, = 1 and sy- = 0. Hence, ос эс g(t)-g(s) = n(Wj)+ ? (tk-sk)-n(Wk)>n(Wj)- ? n(Wk). k=j+\ k=j + \ Since the latter sum equals n(Aj), it follows that g(t) - g(s) ^ n(Aj-\) - 2 · n(Ay) ^ 0 because of Lemma 12.7(ii). Now observe that ОС g(t) - g(s) ? ?("< - J*> · n(A^-i) < 2 · (i - i) · n(A), k=\ and the continuity of g follows. The function g satisfies g@) = 0 and g(l)=n(?lwk\=n(A).
1002 D. Bulnariu and E.P. Klement Hence, there exists a t° e [0, 1] such that g(t°) = \ · g(l), i.e., ?(?) = j · n(A) and В e T(A) for В = t° * W, and ? is semiconvex. ? LEMMA 12.9. If П\,П2,...,п„ are nonnegative finite T^-measures on ? having the Darboux property, then ? = ? ? + П2 + h n„ has the Darboux property. PROOF. We prove that ? is nonatomic. Assume that AeT and n( ?) ? 0. Then ?, (?) ? 0 for at least one index i. Since n, has the Darboux property, there exists a fuzzy set В e ? {A) such that ?,-(?) = \ · ?;(?) < ?, (A). Hence, 0 < ?(?) < n(A) since all n, are monotone (see also Proposition 5.1). ? Lemma 12.10. //m= (mi,m2, ...,m„) is a nonnegative ?-vector T^-measure on ? and if each of the T^-measures Ш|, m2,..., m„ has the Darboux property, then m itself has the Darboux property. PROOF. For i e {1,2,..., n} denote m^ = Y?k=i mu. Each 111° has the Darboux property by Lemma 12.9. Observe that m° «; m^, whenever i e {2,3,...,и}. Thus m° = (m", 1П2,..., m,^) has the Darboux property by Lemma 12.4. Let / = (/1,..., /„) be the linear operator /: E" -> R" defined by ? f,(y\,y2,.-.,yn) = J2yk- It is clear that / is one-to-one and onto (its associated determinant is different from zero). Therefore, /~' exists and is linear. From m° = / о m it follows that m = /~' о m° and, subsequently, that m is semiconvex. ? Now we are ready to show that, in Theorem 12.2, property (ii) implies indeed property (i). LEMMA 12.11. If m = (mi, 1112,..., m„) is an ?-vector T^-measure on ? and if each component Ш|, m2,..., m„ has the Darboux property, then m itself has the Darboux property. PROOF. For each index i e {1,2,..., n) there exists a Hahn decomposition (/>,-, Q;) of X with respect to m, (the restriction of m, to Tv). Denote E® = Pi and EJ = Q;. For a = (??,?2,...,?„) e {0, 1}" denote ? (?) = ?"=? ET ¦ Clearly, ? (a) e T. Define the finite Гь-measures m", m",..., mjj by mii(A) = (-l)a'-m,(ATL?(a)). Each m? is nonnegative since ? Qi if a,- = 1, ATL?(a)< I Pi ifai=0.
Triangular norm-based measures 1003 We can even show that each m" has the Darboux property: if A e ? and m"(A) ? 0, then there exists а В е T(A) such that та(В) = (-\У' ¦mi(BJLE(a)) = \-(-\f· ¦ml(ATLE(a)) = \-mc;(A), since m, is semiconvex and E(a) is a subset of X. Hence, m" = (m", m",..., m°) is a nonnegative и-vector ^-measure with Darboux property (compare Lemma 12.10). Since [E(a) | a e {0, 1}"} is a partition of X, we have m= ]T (-l)fl-m°, G8) «6@.1)" where the и-vector Tb-measure (—1)" · m" is given by (-1У · mfl = ((-1 )<" · m'l, (-1)"- ¦ m$,...,(-1)"" · m?). By the Darboux property of each m", for each AeT and for each a e {0, 1}", there exists an A(a) e T(A) such that ma(A(a)) = \ ¦ m"(A). Denote B(a) = A(a)TL ?(a) and B= ? ?(?). я6{0.1)" Then because of G8) and the TL-disjointness of the family (S(a))«6jo. i)" we have m(B)= ? m(B(a))= ? ("D" т"(А(а)) аб@.|)" дб{0.1)" = \· ? (-l)"-ma(A) = ^-m(A). яб@.1)" This shows that m is semiconvex. ? The following lemma verifies implication (iii) =>¦ (ii) of Theorem 12.2, and this will complete the proof of this theorem. LEMMA 12.12. Ifm = (???,???, ...,m„) is a nonatomic ?-vector TL-measure on ? then each component Ш|, ni2,..., m„ has the Darboux property. PROOF. If m is nonatomic, each of its components m, is finite and nonatomic. If each in; is nonnegative, then the result follows from Lemma 12.8. Assume that m/ has also some negative values. If we can show that m^ and m~ are also nonatomic, then they have the Darboux property (compare Lemma 12.8) and, consequently, the 2-vector T^- measure (m+,m~) (compare Lemma 12.10) and the TL-measure m,- = m+ - m~ also have the Darboux property due to m, = /(m+, m~) and to the fact that the linear function
1004 D. Bulnariu and E.P. Klement f :R2 -> R defined by f{y\,yi) = }'\ — yi preserves semiconvexity. If A e ? with m+(A)^0, then m,(P,TLA)=m+(A)^0 for each Hahn decomposition (/>,-, <2,) of X with respect to m, (compare Corollary 9.4). Since m, is nonatomic, there exists a fuzzy set В еТ(Р, T|_ A) such that 0 ? m, (?) ? m/ (?, TL A) = m+(A). As a consequence, m+(S) = iii,(P,7lS) = m,(Z?) ? {0, m+(A)}, i.e., m,+ is nonatomic. The nonatomicity of m~ can be seen analogously. ? Theorem 12.2 together with Corollary 9.3 allows Theorem 11.6 to be restated as follows: COROLLARY 12.13. // m = (m|,...,m„) is a finite nonatomic ?-vector T^-measure on T, then its range Ran(m) is convex and bounded. In the case ofn= 1, moreover, Ran(m) = [m(<2),m(P)], (P, Q) being a Hahn decomposition of X with respect to the ?-additive measure m. 13. A Liapounoff type theorem for /^-measures In Section 12 we have shown that the range Ran(m) of a nonatomic TL-measure m on a T^- tribe is convex and bounded (see Corollary 12.13). In this section we show that Ran(m) is also compact. The idea of this proof is extracted from the proof of the Liapounoff theorem by Halmos [34]. Throughout this section ? will denote a T^-tribe, and all Гь-measures are assumed to be defined on T. THEOREM 13.1. //m = (mi,..., m„) is a nonatomic ?-vector T^-measure on T, then its range Ran(m) is convex and compact. PROOF. Convexity and boundedness of Ran(m) follow from Corollary 12.13. Without loss of generality, we may assume that the affine dimension of Ran(m) equals n. Otherwise, all the following arguments can be carried over from R" to the linear hull of Ran(m). It remains to show that Ran(m) is closed. To this end, it is sufficient to show that ? ? cl(Ran(m)) с Ran(m), G9) where cl stands for the topological closure, for each supporting hyperplane ? of Ran(m) (see Rockafellar [68, Theorem 11.6.2 and Corollary 11.7.1]). Let ? be such a supporting hyperplane. Then there exists an r el and a linear function /:R" -> R such that ? = [? e R": f(x) = r} and /(m(?)) > r for all ? e T. The proof is carried out in several steps.
Triangular norm-based measures 1005 CLAIM 1. For each crisp set AeTv the function mA : ? -> R defined by mA(?) = m(CA?)-m(A?) (80) is a nonatomic n-vector ?^-measure and Ran(mA) = Ran(m) — m(A). Indeed, a simple computation shows that mA is a /i-vector TL-measure. For ? e ? we have mA(E) = m((CA · ?)SL(A · C?)) - m(A), i.e., Ran(mA) с Ran(m) - m(A). Conversely, since A e Tv, we get m(?) - m(A) = mA((CA · ?)SL(A · C?)), i.e., Ran(m) — m(A) с Ran(mA). To show the nonatomicity of mA, observe that for ? e ? there exist TL-disjoint fuzzy sets F| e T(CaTl ?) and F2 e T(ATL ?) such that m(F|)= ^m(CATL?), m(F2) = i, m(ATL?), since m is nonatomic (see Theorem 12.2 and Proposition 11.2). Therefore, шд (Fi SL F2) = mA(F,) + mA(F2) = m(F,) - m(F2) = !, ¦ mA(?), showing that mA is nonatomic. Claim 2. 77?еге ejcww aset Qe Tv ii/c/г ?/??? ??(?) e ? ? Ran(m). Define the finite ^-measure u:T->Rbyu = /om. Observe that r = inf{u(?): ?еГ}, and let (P, Q) be a Hahn decomposition of X with respect to the ?-additive measure u. By Corollary 9.4 (iv), we have u(Q) = r, and this proves Claim 2. Now we are in the position to show G9). Observe that / ? ???(?) > 0 for all ? e T, and define uq = f ? ???. Let L be the linear subspace L = {x еШ": f{x) = 0}, i.e., L = H - m(<2). By Claim 1, G9) is equivalent to Lncl(Ran(me)) cRan(me). (81) Denote w = m'Q (see Definition 71). Clearly, each coordinate of mg is w-absolute continuous. We prove (81) by induction upon the number ? of components of the vector rL-measurem. If и = 1 then (81) follows from Corollary 12.13. Assuming that (81) is true for all A; < n, we want to prove it for к = ?. Then, by Corollary 60(H) and Halmos work [36,
1006 D. Butnariu and ?. P. /Clement p. 125], there exists a crisp set В e Tv such that ???(?) = 0 and the и-vector TL-measure wb defined by wb(E) = w(B ¦ E) is ???-absolutely continuous. Define the и-vector T^- measureswi and W2 by W|(?) = ???(?-?) and \??(?) =iiiq(CB E),respectively.Since for each ? e ? we have /oW|(?) = /ome(B-?) = uQ(B ¦ E) = 0, we also get Ran(wi) с L. Hence, the affine dimension of Ran(wi) is less than or equal to ? — 1, implying that Ran(W|) is compact due to the nonatomicity of wi and the inductive assumption. Let ? be a point in L П cl(Ran(mg)). Then there exists a sequence (E/)/eN Я: Т with lim/^oo ???(?7) = ?, and therefore, 0 = f(x) = f( lim mQ(Ei)) = lim ???(?,). By the ???-absolute continuity of wi we get lim/^эс wi(?/) = 0. Note that ??? = W| + wi which implies lim mn(?;)= lim W|(?/)=^, or, in other words, ? e cl(Ran(W|)) =Ran(wi) с Ran(m?). Hence, (81) holds. D Theorem 13.1 obviously extends the Liapounoff Theorem [52]. Observing that 7l- measures on generated tribes are necessarily nonatomic (see Corollary 9.4 (ii)), Theorem 13.1 contains the theorems of Dvoretzki et al. [29] stating that the range of a vector integral is convex and compact as a special case. 14. A Liapounoff type theorem for ??-measures Most recently, Barbieri and Weber [4] have proved a Liapounoff Theorem for general ?? -measures. In order to state these results, some preparatory steps are necessary [4]. PROPOSITION 14.1. Let ? be a TM-c!an and m: ? -> R be a function satisfying (TM 1) and (TM2) (with respect to Гм). Then the function dm:T -> [0, oo] defined by dm(A, B) = sup{\m(C) - m(D)\: C,DeT, ATM В с С с ?) с ASM 6} « ? pseudo-metric on the T^-clan T. Definition 14.2. Let ? be a TM-clan and m:T -> R be a function satisfying (TM1) and (TM2) (with respect to Гм). Then ? is said to be m-chained if, for all А, В e ? and for each ? > 0, there exist Co, C|,..., C„ in ? with A = Co <? С ? с · · · с С„ = S and dm(Ci-1, d) < ? for all i e {1,2,..., n).
Triangular norm-based measures 1007 We only mention that, in general, whenever ? is m-chained then m is nonatomic, but not conversely. However, a TL-measure m on a Гд-tribe ? (and, therefore, its restriction m to the ?-algebra of crisp sets Tv) is nonatomic if and only if ? is m-chained or, equivalently, if Tv is m-chained. Also note that, in the case of a measure m on a ?-algebra ? of crisp subsets of X, the pseudo-metric space (A, dm) is always complete. The following result is also due to [4]. THEOREM 14.3. LetT bea ??-tribe andm.T'-> Ш" a nonatomic n-vector ??-measure on ? for some ? e ]0, oof. Then we have: (i) if ? is m-chained, then Ran(m) is convex; (ii) if the space (T, dm) is complete, then Ran(m) is compact. Theorem 14.3 does no longer hold if we drop the additional assumptions in (i) and (ii), as the following counterexamples show [4]. Example 14.4. (i) Consider the (rA-)tribe [0, l](v,I of all fuzzy subsets of the singleton {*o}. Then the function m: [0, l](l,I -> [0,2] defined by ,лч @ ifA(*0) = 0, m(A) = 1 [ A(xo) + 1 otherwise, is a nonatomic TA-measure for each ? e ]0, oof whose range, however, is not convex, (ii) IfX:??—» [0, 1] denotes the usual Lebesgue measure on the ?-algebra В of all Borel subsets of [0, 1 ], then the function m: Вл -> [0, 1 ] given by (compare C)) m(A)=X(supp(A))- / Adk ./{0.1] is a rA-measure for each ? e ]0, oof and Вл is m-chained, but Ran(m) is not compact. Most results of this section can be generalized: the values of the measures in question may be not only numbers in the (extended) real line, but in an arbitrary normed commutative group (see Weber [77] and Barbieri and Weber [3,4]). References [ 1 ] J. Aczel, Lectures on Functional Equations and their Applications, Academic Press, New York A966). [2] R.J. Aumann and L.S. Shapley, Values of Non-Atomic Games. Princeton University Press, Princeton A974). [31 G. Barbieri and H. Weber, A topological approach to the study of fuzzy measures. Functional Analysis and Economic Theory, Y. Abramovich. E. Avgerinos and N.C. Yannelis. eds. Springer. Berlin A998). 17^46. [4] G. Barbieri and H. Weber, A representation theorem and a Lyapunov theorem for 7", -measures: The solution of two problems of Butnariu and Klement. J. Math. Anal. Appl. 244 B000). 408^424. [5] H. Bauer, Probability Theory and Elements of Measure Theory. Academic Press, London A981).
1008 D. Butnariu and ?.P. /Clement [6] L.P. Belluce, Semi-simple algebras of infinite valued logic and bold fuzzy set theory, Canad. J. Math. 38 A986), 1356-1379. [7] L.P. Belluce, Semi-simple and complete MV-algebras, Algebra Universalis 29 A992). 1-9. [8] G. Birkhoff, Lattice Theory, Amer. Math. Soc.. Providence. RI A973). [9] D. Blackwell, The range of certain vector integrals, Proc. Amer. Math. Soc. 2A951). 390-395. [10] L. Borkowski (ed.), J. Lukasiewicz: Selected Works, Studies in Logic and Foundations of Mathematics, North-Holland, Amsterdam A970). [11] D. Butnariu, Additive fuzzy measures and integrals. I, J. Math. Anal. Appl. 93 A983). 436-452. [12] D. Butnariu, Decomposition and range for additive fuzzy measures. Fuzzy Sets and Systems 10 A983), 135-155. [13] D. Butnariu, Non-atomic fuzzx measures and games. Fuzzy Sets and Systems 17 A985), 39-52. [14] D. Butnariu, Fuzzy measurability and integrability, J. Math. Anal. Appl. 117 A986), 385^410. [15] D. Butnariu, Additive fuzzy measures and integrals. Ill, J. Math. Anal. Appl. 125 A987), 288-303. [16] D. Butnariu and E.R Klement, Triangular norm-based measures and their Markov kernel representation, J. Math. Anal. Appl. 162 A991), 111 -143. [17] D. Butnariu and E.P. Klement, Triangular Norm-Based Measures and Games with Fuzzy Coalitions, Kluwer, Dordrecht A993). [18] C.C. Chang, Algebraic analysis of many valued logics. Trans. Amer. Math. Soc. 88 A958), 467^490. [19] H. Chemoff, An extension of a result of Liapounoff on the range of a vector measure, Proc. Amer. Math. Soc. 2A951), 720-726. [20] A.H. Clifford, Naturally totally ordered commutative semigroups, Amer. J. Math. 76 A954), 631-646. [21] A.C. Climescu, Sur Vequation fonctionelle de I'as.sociativite. Bull. Ecole Polytechn. Iassy 1 A946), 1-16. [22] B. De Baets and R. Mesiar, Г-partitions. Fuzzy Sets and Systems 97 A998), 211-223. [23] M. de Glas, Representation of Lukasiewicz' many-valued algebras. The atomic case, Fuzzy Sets and Systems 14 A984), 175-185. [24] A. Di Nola, Representation and reticulation by quotients of MV algebras, Ricerche Mat. 40 A991), 291- 297. [25] D. Dubois, M. Grabisch and H. Prade, Gradual rules and the approximation of functions. Proceedings 2nd International Conference on Fuzzy Logic and Neural Networks, Iizuka A992), 629-632. [26] D. Dumitrescu, Fuzzy partitions with the connectives T^, Sy_, Fuzzy Sets and Systems 47A992), 193-195. [27] D. Dumitrescu, Fuzzy measures and the entropy of fuzzy partitions. J. Math. Anal. Appl. 176 A993), 359- 373. [28] A. Dvoretzky, A. Wald and J. Wolfowitz, Elimination of randomization in certain problems of statistics and game theory, Proc. Nat. Acad. Sci. USA 36 A950), 256-259. [29] A. Dvoretzky, A. Wald and J. Wolfowitz, Relations among certain ranges of vector measures. Pacific J. Math. 1A951), 59-74. [30] M.J. Frank, On the simultaneous associativity of F(x.y) and x+y- F(x, v), Aequationes Math. 19A979), 194-226. [31] S.Gottwald, A Treatise on Many-Valued Logic, Research Studies Press, Baldock B001). [32] P. Hajek, Metamathematics of Fuzz\ Logic, Kluwer, Dordrecht A998). [33] PR. Halmos, On the set of values of a finite measure. Bull. Amer. Math. Soc. 53 A947), 138-141. [34] PR. Halmos, The range of a vector measure. Bull. Amer. Math. Soc. 54 A948), 416-421. [35] PR. Halmos, Measure Theory, Van Nostrand Reinhold, New York A950). [36] PR. Halmos, Measure Theory, Springer, Berlin A974). [37] U. Hohle, Fuzzy equalities and indistinguishability. Proceedings EUFIT '93, Aachen, Vol. 1 A993), 358- 363. [38] U. Hohle, On the fundamentals of fuzzy set theory, J. Math. Anal. Appl. 201 A996), 786-826. [39] U. Hohle, Many-valued equalities, singletons and fuzzy partitions. Soft Computing 2 A998), 134-140. [40] U. Hohle and E.P. Klement, Plausibility measures - a general framework for possibility and fuzzy probability measures, Aspects of Vagueness, H.J. Skala, S. Termini, and E. Trillas, eds, Reidel, Dordrecht A983), 31-50. [41] U. Hohle and S. Weber, Uncertainty measures, realizations and entropies. Random Sets: Theory and Applications, J. Goutaias, R.P.S. Mahler, and H.T. Nguyen, eds. Springer, Heidelberg A997), 259-295.
Triangular norm-based measures 1009 F. Klawonn and R. Kruse, Equality relations as a basis for fuzzy control, Fuzzy Sets and Systems 54 A993), 147-156. E.P. Klement, Characterization of finite fiizzv measures using Markoff-kemels, J. Math. Anal. Appl. 75 A980), 330-339. E.P. Klement, Fuzzy ? -algebras and fuzz\ measurable functions. Fuzzy Sets and Systems 4 A980), 83-93. E.P. Klement, Characterization of fuzzy measures constructed by means of triangular norms. J. Math. Anal. Appl. 86A982), 345-358. E.P. Klement, Construction of fuzzy ?-algebras using triangular norms, J. Math. Anal. Appl. 85 A982), 543-565. E.P. Klement, R. Mesiar and E. Pap, A characterization of the ordering of continuous t-norms. Fuzzy Sets and Systems 86 A997), 189-195. E.P. Klement, R. Mesiar and E. Pap, Triangular Norms, Kluwer, Dordrecht B000). E.P. Klement and M. Navara, A characterization of tribes with respect to the Lukasiewicz t-norm, Czechoslovak Math. J. 47 A22) A997), 689-700. E.P. Klement and M. Navara, A sunev on different triangular norm-based fuzzy logics. Fuzzy Sets and Systems 101 A999), 241-251. R. Kruse, J. Gebhardt and F. Klawonn, Foundations of Fuzzy Systems, Wiley. Chichester A994). A.A. Liapounoff, Sur les functions-vecteurs completement additives, Bull. Acad. Sci. URSS Ser. Math. 4 A940), 465^478. J. Lindenstrauss, A short proof of Liapounoff's convexity theorem, J. Math. Mech. 15 A966), 971-973. CM. Ling, Representation of associative functions, Publ. Math. Debrecen 12 A965), 189-212. J. Lukasiewicz, О logice trowartosciowej, Ruch Filozoficzny 5 A920), 170-171. (English translation contained in [10].) K. Menger, Statistical metrics, Proc. Nat. Acad. Sci. USA 8 A942), 535-537. K. Menger, Ensembles flous et functions aleatoires. C. R. Acad. Sci. Paris Ser. A 232 A951), 2001-2003. K. Menger, Probabilistic geometry, Proc. Nat. Acad. Sci. USA37 A951), 226-229. K. Menger, Probabilistic theories of relations, Proc. Nat. Acad. Sci. USA 37 A951), 178-180. R. Mesiar, Fundamental triangular norm based tribes and measures, J. Math. Anal. Appl. 177 A993), 633-640. R. Mesiar, On the structure ofTs-tribes, Tatra Mt. Math. Publ. 3 A993). 167-172. R. Mesiar and M. Navara, Ts -tribes and Ts -measures, J. Math. Anal. Appl. 201 A996), 91-102. R. Mesiar and J. Rybarik, Pan-operations structure. Fuzzy Sets and Systems 74 A995). 365-369. M. Navara, A characterization of triangular norm based tribes, Tatra Mt. Math. Publ. 3 A993). 161-166. R.B. Nelsen, An Introduction to Copulas. Lecture Notes in Statistics, Vol. 139, Springer, New York A999). J. von Neumann and O. Morgenstem, Theory of Games and Economic Behavior. Princeton University Press, Princeton A944). G. Poprugenko, Sur la proprie'te de Darboux des functions continues d'ensemble, Fundam. Math. 12A928), 254-268. R.T. Rockafellar, Convex Analysis. Princeton University Press, Princeton A970). A. Rose and J.B. Rosser, Fragments of many-valued statement calculi. Trans. Amer. Math. Soc. 87 A958), 1-53. E. Ruspini, A new approach to clustering. Inform, and Control 15 A969). 22-32. K.D. Schmidt, A general Jordan decomposition. Arch. Math. 38 A982). 556-564. B. Schweizer and A. Sklar, Associative functions and abstract semigroups, Publ. Math. Debrecen 10A963), 69-81. B. Schweizer and A. Sklar. Probabilistic Metric Spaces, North-Holland, New York A983). A. Sklar, Functions de repartition a n dimensions et leurs marges, Publ. Inst. Statist. Univ. Paris 8 A959), 229-231. M. Sugeno, Theory of fuzzy integrals and its applications, Ph.D. thesis, Tokyo Institute of Technology A974). H. Thiele and N. Schmechel, On the mutual definability of fuzzy equivalence relations and fuzzy partitions. Proceedings of the International Joint Conference of the Fourth IEEE International Conference on Fuzzy Systems and the Second International Fuzzy Engineering Symposium, Yokohama, Vol. Ill, IEEE Press A995), 1383-1390.
1010 D. Butnariu and ?. P. Klement [77] H. Weber, On modular functions, Funct. Approx. Comment. Math. 24 A996), 35-52. [78] O. Wyler, Clans, Compositio Math. 17 A966), 172-189. [79] L. A. Zadeh, Fuzzy sets. Inform, and Control 8 A965), 338-353. [80] L.A. Zadeh, Pmbability measures of fuzzy events, J. Math. Anal. Appl. 23 A968), 421^427. [81] L.A. Zadeh, Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets and Systems 1 A978), 3-28.
CHAPTER 24 Geometric Measure Theory: Selected Concepts, Results and Problems Miroslav Chlebik Department of Mathematics, Comenius University. Mlynska dolina, 842 48 Bratislava, Slovakia E-mail: chlebik@fntph.uniba.sk Contents Introduction 1013 1. Preliminaries from measure theory 1013 2. Structure theory for integral dimensional sets Ю17 3. Densities of measures and rectifiability 1020 3.1. Preiss's theorem 1020 3.2. Related results and problems 1022 3.3. Besicovitch j-problem 1023 4. Sets of finite perimeter 1025 5. Measure-theoretic calculus of variations 1027 5.1. Normal and integral currents 1028 5.2. Closure and compactness theorem 1033 5.3. Regularity of minimal surfaces 1034 References 1035 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1011
This page intentionally left blank
Geometric measure theon:· Selected concepts, results and problems 1013 Introduction One of the main themes of geometric measure theory is the detailed study of geometric properties of general sets and Borel measures in R". The first decades after Caratheodory's fundamental paper in 1914 were spent studying how subsets of R" behave with respect to m-dimensional measures, mainly Hausdorff-like. Geometric measure theory originates in the pioneering work done by Besicovitch on the structure of the subsets of the plane having finite one-dimensional Hausdorff measure. Besicovitch's work was extended by Federer, Marstrand, and later by Mattila and Preiss, to m -dimensional subsets and measures in R", whose rectifiability, density, and projection properties were analyzed. During the last fifty years the results and methods developed in the study of geometry of sets and measures by many researchers have fused with modern techniques from functional and harmonic analysis, and the calculus of variations. This resulted in producing new and very efficient tools: the theory of generalized surfaces and varifolds, the theory of real flat chains, normal and integral currents of Federer and Fleming, the (partial) regularity theory of solutions of measure-theoretic variational problems (De Giorgi, Fleming, Reifenberg). From the recent trends one should mention results in fractal geometry (Falconer), works on quantitative notions of rectifiability and their applications to singular integral operators (Jones, David, Semmes). The very new is the progress in the results from geometric measure theory in metric spaces (Ambrosio, Kirchheim, Cheeger). This chapter gives an exposition of selected concepts, results and problems from geometric measure theory. For more comprehensive treatment we refer to the list of references. Nevertheless we hope that the choice of topics presented here can show the character of this modern piece of mathematics. General references. The most standard reference for the geometric measure theory is Federer's extensive book [19]. Other monographies covering much of the topic are due to Rogers [37], Simon [39], Giusti [22], Hardt and Simon [23], Morgan [32], Ziemer [44], Evans and Gariepy [13], Mattila [30]. The mathematical theory of fractals is covered by books of Falconer [14,15]. A good account of open problems in geometric measure theory is the list edited by Brothers [8]. Manila's book [30] discusses many open problems and provides an excellent list of references. A nice treatment on the topic is an expository paper by Federer [20]. 1. Preliminaries from measure theory We shall work mainly in Euclidean and closely related spaces, but most of measure theory treated here is valid in much more general setting. To introduce the basic notations, let X be a metric space with a metric d. The closed and open balls with center a e X and radius r > 0 are denoted by B(a,r) = {xeX: d(a,x) <; r}, U(a,r)=\x e X: d(a,x)<r].
1014 ?. Chlebt'k In measure theory usually a "measure" means a non-negative countably additive set function defined on a ?-algebra of subsets of X. In our exposition we use the notion of "measure" as a convenient abbreviation for "outer measure", the notion derived from Caratheodory's one. A set function ?: {А: А С X} -> [0, +oo] is a measure on a set X if ?@) = 0 and ? (A) sC ??\ H(Ai) whenever А с U?i A,¦ С X. A set А С X is ?-measurable if ?(?) = ?(???) + ?(?\?) whenever Г С X. It is rather convenient to have measures defined for all subsets of X. On the other hand, a measure ? gives a countably additive set function when restricted to the ?-algebra of ?-measurable sets. With each measure ? on X and each subset В of X we associate the measure ? L В on X, the restriction of ? to ?, defined by (???)(?) = ?(????) for А С X. A measure ? on a metric space X is called a Borel measure if all Borel sets are ?- measurable. It is Borel regular if it is a Borel measure and if for every А с X there is a Borel set В С X such that А С ? and ?(?) = ?(?). It is well known that a measure ? on a metric space X is a Borel measure if and only if ?(? U ?) = ?(?) + ?(?) whenever ?, ? С X with dist(A, ?) > 0. Further, ? is a Radon measure if it is a Borel measure and A) ?{?) < oo for compact sets К с Х, B) ?(?/) = sup^(?): ? cU,K compact} for open sets U С X, C) ?(?) = inf^(A): А С U, U open} for all А С X. In R", a measure ? is a Radon measure if and only if it is Borel regular and finite on bounded sets. More generally this holds for any ?-compact metric space. For any two positive integers m < ? there are many distinct m -dimensional Borel regular measures on Ш" that are invariant under the Euclidean group and coincide with the surface w-measure, when restricted to "nice" m-dimensional subset. Some of such measures are based on the integral geometry approach, another ones on coverings or packings. The following very general construction can yield, in particular, many m-dimensional measures. Caratheodory's construction. Let (X, d) be a metric space, ?' a family of subsets of X and ? a non-negative function on У. For 0 < S ? oo we construct the size S approximating measure <ps as follows: <ps($) = 0 and whenever Ъф А с ?, ?&(?) is the infimum of the set of numbers (possibly +oo)
Geometric measure theory: Selected concepts, results and problems 1015 corresponding to all countable (possibly finite) families & with У С {Е: Ее ^ and diam(?) ^ <$} and ACU^. The fact that ?& ^ ?? whenever 0 < ? ^ ? ^ oo implies the existence of ?(?) = Unburn (A) = supi>0^(A). The measure ? is called the result of Carathiodory's construction from ? on ?. One finds that ??, can be highly non-additive but, on the other hand, ? is always a Borel measure. If all members of & are Borel sets, ? is a Borel regular measure. Hausdorff measures. Using Hausdorff measures is the most natural way to measure lower dimensional objects in higher dimensional space. For every non-negative real number m we define a(m) = ?A /2)'"/ Г(т/2 + 1). For a fixed metric space (X, d) we apply Caratheodory's construction with ,9 = {A: Ъф AcX} and ? (A) = a(wJ"'"(diam A)'". The resulting measure ? is called the m -dimensional Hausdorff measure on X and denoted by Ж'". One obtain the same measure by letting & be the family of all nonempty closed subsets of X, or the family of all non-empty open subsets of X. It follows that Жт is Borel regular. In case X is a normed vector space, the same measure results when the members of & are further restricted to be convex. For a fixed subset A of (X, d) only the restriction of a metric d to the set A is relevant to determine Жт (A), regardless of the ambient space. In fact, Ж™ (А) = Ж"' (A) whenever AcX and X isometrically embeds in Y. The integral dimensional Hausdorff measures are of a special importance. One finds that Ж° is the counting measure over X, Ж°(А) = the number (possibly +oo) of elements of A. We recall (see for instance [24, Lemma 6(i)]) that if X is an m-dimensional vector space and ?@, 1) is its unit ball, then Ж"'(В@, 1)) is a dimensional constant independent of the norm of X, and equal, in particular, to a(m). For m = 1, Ж' has a concrete interpretation as a generalized length measure. More generally, if m is an integer, 1 ^ m < n, and ? is a sufficiently regular (e.g., С' smooth) m -dimensional surface in R", then the restriction Жт\—М coincides with the surface measure on M. For m = n, Ж" coincides in R" with the Lebesgue measure Jif". The proof is rather difficult and relies on the isodiametric inequality ?"'(A) sC 2""a(«)(diam(A))" whenever 0 ф А с R", see [19, 2.10.33].
1016 ?. Chlebik Invariant measures. The orthogonal group 0{n). Let 0(n,m) be the space of all orthogonal injections (i.e., inner product preserving linear maps) of R"' into R". Moreover, 0{n) = 0(n, n) be the orthogonal group of R", with composition as a group operation. From the general theory of invariant measures recall the well-known existence result of a Haar measure on a compact topological group to ensure that there exists a unique (left and right) Haar measure 9„ on 0(n) normalized with ?[0(??)] = 1. The Grassmannian G(n,m). For any positive integers m < и the Grassmannian G(n, m) consists of all m-dimensional linear subspaces of R". The action (g, V) ь* gV (g e 0(n) and V e G(n,m)) is transitive and induces a unique normalized invariant measure y„.„, on G(n,m). The special cases m = 1 or m = n — 1 are much easier to handle, as the measures ?„.\ and ?„.„-\ can be easily represented using the surface measure on the unit sphereS". Other measures that agree with the surface m -measure when restricted regular torn- dimensional sets are the integralgeometric measures ^'" A ^ t ^ oo). Integralgeometric measures. Let us start with Crofton's classical method for finding the length of a plane curve A: for each line L count the number of points in the intersection А П L and integrate this varying number over all lines L in R2. Doing this in R" with affine (n — m)-dimensional subspaces of R" we can define the m-dimensional integralgeometric measure (with parameter 1) on R" as the Borel regular measure such that S['\A)= f f Ж0(АГ)Р-\у))<МГ"(у)а-уп,п(У)/Рап,т) JGU'.m) JV for every Borel set А с R", where Pv : R" -> V is the standard orthogonal projection on V e G(n,m), and is the normalizing constant. In the above formula for J"\A) we integrate the varying number of points of intersection of A with arbitrary (и — m )-dimensional affine subspaces Py ' (y) corresponding to all pairs (V, y) with V e G(n, m) and ye V. It turns out that J?"' can also be defined via Caratheodory's construction. For this purpose we can choose У to be the family of all Borel subsets of R" and we let ? be the function defined on Borel sets by ?(?)= ? JPm(Pv(B))dY„.m(V)/fii(n,m). JG(n.m) More generally, to get a one-parameter family ,_?"' A ^ t ^ oo) of m-dimensional integralgeometric measures, we let ?, be the function defined on Borel sets by b(B)=\\V»jr,(PviB))\\L,lYHM)/fr(,i,m), where fit(n, m) is a properly chosen normalization constant.
Geometric measure theory: Selected concepts, results and problems 1017 One can observe that А с R" is ./'/"-null if and only if A is contained in a Borel set В with Ж'"(Ру(В)) = 0 for y„.„, almost all V e G(n, m). Densities. Given a measure ? on a metric space (X, d) and ? e X, the иррег and /ower m-dimensional densities of ? at ? are respectively defined by в^,Ху.= Ш«*<±1», ^V,^:=lim^^. /-40 a(w)r'" 7?0 a(w)r"' In the case ?*'"(?, ?) — ?"'(?, ?), this common value is denoted by ?'"(?,?) and termed the m-dimensional density of ? at ?. If ? = Жт^А for a set А С X, we use simpler notation ?*'"(?,?), ??(?,?) and ?"'(?,?). In this chapter we do not treat the case of non-integral m. From now on, m is a positive integer. The case m = 0 could be trivially covered with some care. 2. Structure theory for integral dimensional sets In this section we present parts of the structure theory: a characterization of rectifiable sets in terms of their projection and tangential properties. In his pioneering papers [5-7] Besicovitch investigated ways of differentiating between regular and irregular 1-sets in the plane. It is easy to see that for almost all straight lines (through the origin) the orthogonal projection of a rectifiable (or regular) 1-set is a set of positive length. Besicovitch [7] studied the projection properties of irregular 1-sets and showed that the orthogonal projections onto lines of a purely unrectifiable (or irregular) 1 -set in the plane would have zero length for almost every line. Concerning tangential properties, a rectifiable 1 -set has an approximate tangent at almost all its points. In contrast, at almost all points of a purely unrectifiable 1 -set no approximate tangent exist. Federer [19] generalized results of Besicovitch to m-dimensional subsets of R*. His work has extended considerably our understanding of the structure of (integral dimensional) sets in Euclidean space and clearly illustrates the profound dichotomy which exists between rectifiable and purely unrectifiable sets. A good account of the Besicovitch- Federer structure theory, variants of projection theorems and results on approximate tangent planes, may be found in the books by Federer [19] and Mattila [30]. Let m be a positive integer and ? be a measure on R". A set А с R" is called (?, m) rectifiable if there are Lipschitz maps f : R'" -> R", / = 1,2...., such that ? A\|Jy;(R''') =0. i = \ A set А С R" is called purely (?, m) unrectifiable if ? (А П /(R'")) = 0 for every Lipschitz map /: R'" -> R". In case ? = Жт we will use terms w-rectifiable and purely m-unrectifiable set instead.
1018 ?. Chlebik This notion of rectifiability very naturally generalizes to a measure-theoretic setting the classical approach to first order smoothness. Indeed, one can prove (see Federer [19, 3.2.29]) that А с R" is w-rectifiable if and only if there are w-dimensional C1 submanifolds ?,, M2, ¦. ¦, of R" such that Ж'"(A \ [jfi, M,-) = 0. Let С с R" with Ж'" (С) < оо be given. It is easy to see that the set С can be decomposed into w-rectifiable and purely w-unrectifiable part, and that such decomposition is unique up to Жт null-sets. Indeed, one can show that among the sets of the form U^i fi№m), corresponding to all sequences {./}}?, of Lipschitz maps /, :E'" -> R", / = 1,2,..., there is one, say B, which maximize Ж'"(С ? ?). Clearly ? is a Borel set (?-compact), С ? В is w-rectifiable and С \ В is purely w- unrectifiable. To describe a notion of the (?, w) approximate tangent cone Tan'" (?, -*) at * e R", we will use the open cones ?(?,?,?) = jyel": \\t(y - x) - v\\ < ? for some? > 0} with the vertex x, whenever ? e R" and ? > 0. We define the (?, w) approximate tangent vectors at the point ? as the elements of the closed cone with vertex 0 Тап'и^,.х) = р){иеК"·. ?*'"^?(?,?,?),?) > 0]. The following deep result was proved first for w = 1 and ? = 2 by Besicovitch, Morse and Randolph, then for arbitrary dimensions by Federer. STRUCTURE THEOREM. Let m < ? be positive integers, ? be a finite Borel regular measure on M." such that 0<?*'"(?,?) <?? for ? almost all ? eR", and A = {x e R": there is ? e G(n, m - n) such that Tan'"(?, x) П ? С {0}}. Then: A) A is ? (?, w) rectifiable Borel set. B) For ? almost all ? e A, Tan"V,jt) e G(«,w) ????*'"(?,?) = ?"(?,?)- C) Wie/iever ? С ?, ?(?) > 0 implies Жт(Ру(В)) > 0 for ?„.,„ almost all V e G(n, m). Consequently, whenever В С А, У"(В) = 0 implies ?{?) = 0. D) R" \ A is purely (?, w) unrectifiable. E) JT"(/V(R" \ A)) = 0/or y„.„, ?/woii a// V e G(n, w), /геисе ^"'(R" \ A) = 0. As a corollary we obtain that the following conditions are equivalent: (i) R" is ^,w) rectifiable. (ii) Tan'"(?,*) e G(n, m) for ? almost all ? e R". (iii) Whenever В с R", </,'" (?) = 0 implies ?(?) = 0. In particular, for a Borel set С С R" with Ж'"{С) < oo, and ? =Ж'"\- С, the structure theorem implies that С П A is w-rectifiable and С \ A is purely w-unrectifiable.
Geometric measure theory: Selected concepts, results and problems 1019 An important consequence (see also Federer [19, 3.2.26]) is the following: Жт ^ У"' and whenever С с R" is such that Ж'" (C) < oo. then Ж'п (С) = J™ (С) if and only if С is w-rectifiable, and, У" (C) = 0 if and only if С is purely w-rectifiable. This is exactly the reason, why we find the Hausdorff measure Ж'" and the integralgeo- metric measure У"' the two most important w-dimensional measures. The dichotomy between rectifiable and purely unrectifiable sets concerning their projection properties can be summarized as follows: THEOREM. Let Ac R" be an Ж'"-measurable set with Ж"\А) < oo. A) A is m-rectifiable if and only if Jff"'(Pv(B)) > Ofor ?,,.,,, almost all V e G(n, w) whenever В С A is an Ж'" -measurable set with Ж"\В) > 0. B) A is purely m-unrectifiable if and only if Jf'"(Pv(A)) = Ofor ?„,,„ almost all V eG(n,m). To describe the dichotomy between rectifiable and purely unrectifiable sets concerning their tangential properties we need some notation and the definition of the approximate tangent planes. If V e G{n, w), ? e R" and j e @,1) we denote Xix, V,s) = {yeR": dist(}--;c, V) < s \\y - x\\\. This is a kind of an open cone around the affine in -plane (V + x). Let А С ?", ? е ?" and V е G in, m). We say that V is an approximate tangent m -plane for A at ? ???* (A, x) > 0 and for all j e @,1) \imr-'nJf'"(AC\Bix,r)\Xix, V,s))=0. THEOREM. Let Ac R" be an Ж'"-measurable set with Ж'" (A) < oo. A) A is m-rectifiable if and only if for Ж'" almost all ? e A an approximate tangent m-planefor A at ? exists. B) A is purely m-unrectifiable if and only if for Ж'" almost all ? e A no approximate tangent m-planefor A at ? exists. It is very useful to find weaker sufficient conditions for a set to be w-rectifiable. One can conclude that A is w-rectifiable from the existence for Ж'" almost every дг е A of so called "rotating approximate tangent w-plane" for A at x. THEOREM. Let А с R" be an Ж'"-measurable set with Ж"\А) < oo. Then A is m- rectifiable if and only if for Ж'" almost all ? e A the following holds: whenever ? > 0 there are ro > 0 and ? > 0 such that for any r e @, rn) there is V e Gin, m) satisfying Ж'п (AC\ Bix ,r)\{y &R": dist(j - x, V) < ?}) <?"\
1020 ?. Chlebik and Ж'п(АГ)В(у,г)) >krm for every ye (V + ?) ? B(x, r). In this theorem an "approximate tangent w-plane" V for A at ? in a ball B(x,r) is allowed depend on radius r. Therefore it is called a "rotating approximate tangent w- plane". The theorem is due to Marstrand [27] and Mattila [29] (see also [30, Chapter 16]). It was the first powerful method to prove, for m > 1, w-rectifiability, and a simpler precursor to the Preiss's method of tangent measures, discussed partially in the next section. 3. Densities of measures and rectifiability In this section we will work with general Radon measures on R" and investigate the relation between their w-densities and w-rectifiability. First we have to introduce the notion of rectifiability of measures. A Radon measure ? on R" is said to be m-rectifiable measure if ? <<C Hm and there exists an w-rectifiable Borel set ? such that ?(?" \ E) = 0. The conjecture that Radon measures on E" with positive finite w-density are w- rectifiable was one of central problems in geometric measure theory for a long time. Special cases were established over a period of fifty years by Besicovitch [6], Morse and Randolph [33], Moore [31], Marstrand [28] and Mattila [29]. This conjecture was positively resolved by Preiss [34]. 3.1. Preiss's theorem Preiss's theorem says that if the behaviour of a Radon measure on small balls settles down in the positive and finite density then the measure is just given by a locally H'" summable function which can be non-zero only on countably many С' submanifolds; the measure of any Borel set A just being the integral of this function over A. We shall now state his theorem. THEOREM. Let ? be a Radon measure on W with the property ?(?(?,?)) 0 < hm < oo r|0 Г'" for ? almost all ? e R". Then ? is m-rectifiable. For Hausdorff measures restricted to sets this Theorem gives the following: COROLLARY. Let A be a subset ofR" with 7i"\A) < oo and such that the density ?'" (A, x) exists for almost all ? e A. Then A is m-rectifiable.
Geometric measure theory: Selected concepts, results and problems 1021 The proof of the theorem of Preiss is very complicated and we shall only discuss some ideas behind it. The very important notion introduced and effectively used in his proof was that of tangent measure. Given a Radon measure ? on R" and a point ? el", the set ???(?,?) of tangent measures to ? at ? consists of the non-zero measures obtained from weak limits of scaled push-forwards of ? by sequences of expansive homotheties around x. More precisely, ? e Тап(д, jc) if and only if ? is a non-zero Radon measure on R" and there exist sequences (с,-) and (r,) of positive numbers, such that r, 4- 0, and ?,??.??#? -> ? weakly as / -> oo. Here TXJ is the map given by TXJ(y) = (y — x)/r and the image ???#? of ? under Txr is defined by ??.*?(?) = ?(??+?), А С М". It should not be too hard to believe that the assumption that ? has positive finite w -density almost everywhere will result in the fact that for ? almost all ? e R" every tangent measure ? e Тап(д, х) is m-uniform in the following sense: There exists a positive number с = c(x) such that v(B(y,r)) = cr'" for all у e sptv and all r > 0. Anyone would think that every w-uniform measure on R" is just the one given by a constant multiple of W" restricted to an w-plane. Such a measure will be called w-flat. If this was true this would imply that for ? almost every ? e spt ? inside small enough balls B(x, r) the measure ? would be concentrated close to an w-plane through ?; ? would have at ? so called "rotating approximate tangent w-plane". Due to methods developed earlier by Marstrand [27] and Mattila [29] and mentioned in the previous section, this would imply rectifiability. But this was found to be true for w < 3 only! For w ^ 3 let K = {xeR'"+]: *f=jcf+*?+*;}. Then TV" L? is an w-uniform measure. Moreover, Kowalski and Preiss [25] proved that it is effectively (up to rotations, translations and multiplication by a constant) the only possible example of a non-trivial w-uniform measure in R'"+l. The structure of w-uniform measures in higher codimensions is open. What was important in the proof of Preiss's theorem was a partial classification of w-uniform measures into two distinct types, those "flat at infinity" such as the w-flat examples, and those "curved at infinity" like Ti.'"\—K. This classification of possible limiting behaviour of w-uniform measures resulted eventually in the proof that at ? almost every ? e R" every tangent measure ? e Тап(д, х) is flat that is enough to conclude the proof.
1022 ?. Chlebik Firstly, pushing methods developed by Marstrand [28] to the tangent measure setting one concludes that at ? almost every ? e R" there exists tangent measure ? e Тап(д, х) that is flat. Secondly, if Тап(д, х) contains tangent measures of both types, flat and curved, then a proper choice of blow-up sequence would generate a tangent measure which is neither flat nor curved; which contradicts with the partial classification of m-uniform measures. This is how the proof works. 3.2. Related results and problems For the special case of the Hausdorff m -measure on R" restricted to sets of locally finite measure, and with the m -density equal to 1, some results had been proved earlier. THEOREM. If А с R" is a set with the property that the density ?'" (A, x) exists and equals 1 for Jif" almost all ? e A, then A is m-rectifiable. The converse of this implication is well known and not difficult. The statement of the theorem was proved for m = 1 and ? = 2 by Besicovitch [5], for m = 1 and all ? by Moore [31], for m = 2 and ? — 3 by Marstrand [27], and for general m, ? by Mattila [29]. The case m = 1 is simpler an a nowadays proof can be found in the book by Falconer [14]. Marstrand's method (extended by Mattila) by which this theorem was proved for any m is based on a relatively simple geometric observation. It uses a certain rotundity property of the Euclidean ball and the fact that the m -density of A is equal to 1 to establish a very strong kind of approximate local symmetry of the set. This additional information about A allows to prove its m-rectifiability. The author was able to prove (essentially using the method of Marstrand and Mattila) that the theorem above holds in any real Hubert space. On the other hand, it is an open question (except when m = 1) if this theorem generalizes to all metric spaces. Question. Given a metric space (X,d) with Jif""(X) < oo, Ж'" being the m-dimen- sional Hausdorff measure on (X,d). Suppose X has the property that hm = 1 r|0 a(m)r'" for Ж"" almost all points xeX. Does it follow that X is m-rectifiable (in the sense that Ж'" almost all of X is contained in countably many Lipschitz images of subsets of R'")? The positive answer would be the ultimate (and very surprising) rectifiability result for metric spaces. It would also provide the characterization of m-rectifiable spaces, as the converse implication holds in any metric space by the theorem of Kirchheim [24]. For case m = 1 the positive answer has been given by Preiss and Tiser [35].
Geometric measure theory: Selected concepts, results and problems 1023 As any separable metric space is isometric to a subset of lx, the first interesting open part of this problem is the case of metric space /^ and m = 2. One can ask as well if the theorem of Preiss could be generalized to the setting of metric spaces. But the result that 2-rectifiability follows from the existence of finite and non-zero 2-density fails even in a separable Hilbert space for a restriction of Ж2 to a properly chosen subset. We may consider an easier metric space example (but it could be embedded isometrically into /2), namely (X, d) being the real line with metric d(x, y) = \x - y\!/ . In this space we have for any ball B{x, r) Jf2(B(x,r))=l-a{2)r2, hence, 2-density of the measure Ж2 exists everywhere and equals constantly j. On the other hand, it can be shown easily that the image of any Lipschitz map from a subset of E2 to (X,d) will have zero Ж2 measure, hence the measure Ж2 on (X,d) is purely 2-unrectifiable. But in finite dimensions there is a quite natural conjecture generalizing Preiss's theorem. Conjecture. Suppose X is a finite dimensional normed vector space and ? is a Radon measure on X with the property that ?(?(?,?)) 0 < lim < 00 no r'" for ? almost all ? e X, then ? is w-rectifiable. The first result that proves w-rectifiability from an initial condition about m-densities for m > 1 and outside inner product spaces is the recent theorem of Lorent [26] stating that locally 2-uniform measures in /^ are 2-rectifiable. We state his theorem that supports the conjecture above. THEOREM. Suppose ? is a Radon measure on l^ with the following property: There exist some S > 0, с > 0 such that for all ? e 5??? and all r e ]0, <5[ ?{?(?,?)) = cr~. Then spt ? is 2-rectifiable. 3.3. Besicovitch ^-problem Let us recall that a subset A of a metric space (X,d) is called w-set if 0 < Ж'"(А) < oo. Such a set A is said to be purely w-unrectifiable if Ж"'(? ? ?) = 0 for every m- rectifiable set B.
1024 ?. Chlebik For an w-rectifiable w-set A in (X, d) we have ?'"(?,?)= 1 for Ж'" almost all jc e A. This is basically due to Besicovitch for subsets of R", and for subsets of metric spaces it is due to Kirchheim [24]. At least in Euclidean space a purely w-unrectifiable w-set А с R" have remarkably different behaviour, namely, for Жт almost all jc e A ??(?,?) ? ???(?")?*'"(?,?) (^ t„,(R")), where r,„(R") < 1 is some constant. This is due to Preiss [34], but the one of its consequences, that it is possible to characterize w-rectifiability in R" by means of the lower w-density ?™, follows also from the earlier results of Besicovitch [5], Martstrand [27] and Mattila [29]. Let ?,„ (X), for a metric space (X, d), be the smallest non-negative number such that, if A is a purely w-unrectifiable w-set in X, then ?*(?,?) ? ???(?) for Ж'" almost all ? e A. As any w-set A satisfies ?*'"(A, x) <: 1 for Ж"' almost all ? e A (see Federer [19, 2.10.18C)]),clearly ?,,,(?) sC 1. If a metric space (X, d) is such that ?,,,(?) < 1, one can characterize w-rectifiability in X by means of the lower w-density ?"'. It is proved by Besicovitch [5] that a,(R2)iC 1- 1(?2576, and in [6] he showed that ?\ (R2) ^ |. He also gave an example of a purely 1-unrectifiable 1-set A inR2 for which ?± (A, x) = \ for Ж' almost all jc e A. In this way he proved that ?? (R2) > J and conjectured that ?\ (R2) = j. This conjecture is still open. In less formal way it reads as follows: Conjecture. Suppose А с R2 is such that 0 < Ж' (A) < oo and ??(?,?)>- for Jf1 almost all jc e A. Then A is 1-rectifiable.
Geometric measure theory: Selected concepts, results and problems 1025 Preiss and Tiser [35] slightly improved the Besicovitch's upper bound on ?\. Their proof works for subsets of general metric spaces. Namely, for any metric space X, ?, (?) ? B + л/46)/12 % 0.732. On the other hand, metric spaces with ?\ (X) > \ exist. The recent results on the problem are due to Farag [16]. He proved that ^ is the correct upper bound on ?} (A, ·) for purely 1-unrectifiable 1-set A in a Hubert space, under the additional assumption, that the set A satisfies a measure-theoretic flatness condition. For higher dimensional densities (m ^ 2) we only know that sm := sup(a,„(X): ? is a real Hubert space} < 1 for every m. 4. Sets of finite perimeter Research on the problem of finding the most general and geometrically natural form of the Gauss-Green divergence theorem has contributed greatly to the development of geometric measure theory. For a certain time, the theory of sets for which Gauss-Green theorem holds was developed independently using various approaches. Later it became clear that they are all equivalent. The notion of set of finite perimeter was introduced by R. Caccioppoli and developed by De Giorgi [9,10]. Their approach was based on approximation (with respect to the metric of convergence in measure) of a given set by a sequence of sets with smooth (or, which is equivalent, polyhedral) boundary and with equibounded perimeters. Such limiting sets possess many good properties of sets with smooth boundary. In particular, a version of Gauss-Green theorem holds for them. On the other hand, Federer [17] introduced the notion of measure-theoretic exterior normal to a set and established the Gauss-Green theorem for every open set G С К" with JF"~\dG) < oo. Some additional observations of Federer [18], based on results of De Giorgi, and a contribution of Volpert [41], lead to an optimal version of Gauss-Green theorem valid for sets of (locally) finite perimeter. A good account on the sets of finite perimeter can be found in Federer [19], Giusti [22], Ziemer [44], and Evans and Gariepy [13]. Later it became clear that the notion of set of finite perimeter is the simplest particular case of normal и-dimensional currents in R", which are discussed in the next section. Nevertheless, here we give an independent exposition for sets of finite perimeter, since in this case no highly developed apparatus, as in the general theory of currents, is required. We say that a set А с К" has finite perimeter in an open set ?/cl" if А П i/ is measurable and if there exist a finite Borel measure ? on U and a Borel function И : U -> S"~' U {0} С К" such that the following generalized Gauss-Green formula holds f div<pdx= ? ?????? A JU
1026 ?. Chlebik for every smooth (or, equivalently, Lipschitzian) 1 -vectorfield <p:U —» R" with compact support in U. Hence, the vector measure (-???) is the derivative of ?? (denoted by ???) in the sense of distributions on U, while ? = \???\ is its total variation. The perimeter Pu(A) of A in U is defined by |?? A\(U), and we use the notation P(A) in the case i/ = R". Otherwise, if A is not of finite perimeter in i/, we write Pu(A) = oo. We further say that A is of locally finite perimeter in U if Pv(A) < oo for every open set V <ё U. By Riesz theorem, whenever ? ? U is measurable P,;(A) = sup| J ??????: 0 e C('(t/, R"), \\?\\ ? 1 Essential boundary. An inequality Ри (A) iC Jf""' (9 А П i/) always holds. A much deeper (and ultimate) result is due toFederer [19,4.5.6,4.5.11]: Pu(A) = Жп~' (deA DU) = J"~\beA П U) for all А С Е". Here 9fA stands for the essential boundary (called also the measure-theoretic boundary) of A, defined by deA = {xeR": ?*"(?, x) > 0 and <9*"(R" \ A,x) > 0}. Measure-theoretic exterior normal. Assuming А с ?" and * e R" we say that и e S"-1 is a measure-theoretic exterior normal of A at л: if 9"({ye A: (y-x)ou >0},jc)=0, and в"({уе (R"\A): (y-x)ou <0},jc)=0. There exists at most one exterior normal of A at x. We denote n(A, x) = и if A has the exterior normal и at x, n(A, x) = 0 if A has no exterior normal at x. Reduced boundary. Given А С R", the set Э*А = {хеШ": n(A,x) e S"_l} of points at which the measure-theoretic exterior normal of A exists, is called the reduced boundary of A. According to results of De Giorgi, Federer and Volpert, if А с R" is a set of locally finite perimeter in an open set U с R", then the sets Z)XA(Z?(*,r)) ^gl,_,l jc e t/: lim -. e э ] rW\DXA\(B(x,r)) J 3,AU[/, d*ADU and
Geometric measure theory: Selected concepts, results and problems 1027 are up to Jf" null-sets the same, (n - l)-rectifiable, |?>xA|Li/ = Jf" L(9,,An i/), and lim— „ =-n(A,x) forJf" almost all ? e de? П ?/. In particular, we obtain the following: GAUSS-GREEN THEOREM. If Ac K" is a set of locally finite perimeter in an open set ?/cK", then / ?\\???= / 0(jc)on(A,;c)iiJf","l(jc) 7a Ju for every Lipschitzian 1 -vectorfield <p:U->M." with compact support in U. Lower semicontinuity, approximation and compactness. Unlike various (n — 1 ^dimensional measures of the boundary, the functional А н> P(A) is lower semicontinuous with respect to local convergence in measure (i.e., L'oc convergence of the characteristic functions). For any set A with P(A) < oo there exists a sequence of sets An with smooth (or polyhedral) boundary locally converging in measure to A and such that ? (A) = lim,,^oo P(An). Any sequence of sets with equibounded perimeters admits subsequences locally converging in measure. Isoperimetric inequalities. If А С ?" (? > 2), then Р(А)^па(пУ/"[тт{^"\А),^"'(Шп \А)}]'^\ The constant na(n)^n is the optimal isoperimetric constant. A local counterpart is the relativite isoperimetric inequality in a ball Puu.r)(A)>fi(n)[mm{J?"(U(x,r)nA),2"l(U(x,r)\A)}]'^i, where ?(?) is a positive constant. 5. Measure-theoretic calculus of variations The methods and results of geometric measure theory have been successfully applied in the modern calculus of variations. The Plateau problem of finding the minimal surface, i.e., of finding the surface of least area among those bounded by a given curve, was one of the most fundamental variational problems. In the thirties a solution was given to the problem of Plateau in R3 by results of Douglas [12] and Rado [36]. The possibility to
1028 ?. Chlebik generalize their methods even for minimal hypersurfaces in higher dimension was found very limited. The difficulty to prove, by parametric methods, existence of solution for area minimization problem in spaces of dimension higher than 2 was the main motivation for the development of intrinsic theories for this kind of variational problems. After the work of De Giorgi [9,10] on sets with finite perimeter, Federer and Fleming [21] introduced a series of new techniques in the calculus of variations, which leads directly to the existence of (weak, but geometrically well acceptable) solutions for a wide class of variational problems. 5.1. Normal and integral currents The approach of Federer and Fleming modeled w -dimensional geometric objects in R" as currents, i.e., continuous linear functional on differential w-forms. The general concept of current was first introduced by G. De Rham for use in the theory of harmonic forms. His work was closely related to the development of distribution theory by L. Schwartz. Independently, L.C. Young introduced a notion of generalized surface, defined as a continuous linear functional on the space of continuous parametric integrands. Perhaps the central result of the theory of rectifiable currents developed by Federer and Fleming is one known as the Closure Theorem. It says that certain useful classes of integral currents (currents which, together with their boundaries, correspond to integration over rectifiable sets against integer valued density functions), form a weakly closed subspace of the much larger space of normal currents. This closure theorem allows one to solve many minimization variational problems in a weak sense, while still obtaining solutions which correspond to rectifiable geometric objects. This is accomplished by passing to the (weak) limit of (properly chosen subsequence of) minimizing sequence of integral currents. Almgren [ 1 ] was able to extend many of results of Federer and Fleming to the class of size bounded rectifiable currents. We recall here some basic facts from the theory of normal currents and we refer for more information to Federer and Fleming [21], Federer [19], Simon [39], Hardt and Simon [23] and Morgan [32]. A logical prerequisite is the knowledge of differential forms and operations with m-vectors and w-covectors of R". We refer to the book by Federer [19] for a very comprehensive treatment. We denote by $m (U) (m being a non-negative integer) the vector space of all infinitely differentiable w-forms with compact support in an open set U of R". Each of the subspaces 9™{U) = 9'"{U)n{4>: spt0c/n. К с U compact, is relatively topologized by the seminorms ?'? (?) = sup{ || DJ<p{x) 1: 0 sC j'^ i and ? e К} corresponding to all non-negative integers i. (Here || · || stands for the comass norm on covectors.) Their union $'" (U) is endowed with the largest topology making the inclusion
Geometric measure theory: Selected concepts, results and problems 1029 maps from all the spaces 3>'^{U) continuous. Members of the dual space %,(U) = 3>m{U)*, i.e., continuous linear functionals on &>'"(U), are called m-dimensional currents in U. If ? e Q)m (U) and V с U is an open set, the mass of ? in V is defined by Mv(T) = sup{TD>): фе®'"(У), spt<? с V and \\?(?)\\ ? 1 in V]. We write briefly ?(?) instead of Ми(Т). A current Г e $>m{U) with finite mass, М(Г) < oo, extends naturally as a continuous linear functional on the space of all compactly supported continuous m-forms endowed with the sup norm. Consequently, from the Riesz representation theorem we deduce the existence of a Radon measure \\T\\ on U, and of a || ГЦ-measurable function ? :U -> ?„,(?") satisfying ||f(;c)|| = 1 for \\T\\ almost all xeU, such that Т(ф) = j(f(x),4>(x))d\\T\\(x) forany0e^"'(i/), where (·, ¦> denotes the dual pairing for /\m(R") and /\'"(W), the spaces of w-vectors and w-covectors of Ш", on which we use dual norms, the mass and the comass. More generally, the representation of Г е %„(U) in this form is possible if we only assume ??(?) < oo for all open sets Vei/. Any such current ? is said to be representable by integration. The last formula and the Lebesgue theorem allow us to extend ? to all m -forms with ЦГЦ-summable coefficients (in particular, when the mass of ? is finite, to m-forms whose coefficients are bounded Borel functions). For such ? we can define for any ЦГЦ- measurable subset A of U (and, in particular, for any Borel set А С U) the restriction of ? to A as a new current 7?_ A e 3>m{U) by ?\-?(?)= f[f,4>)d\\T\\, 4>e&"(U). More generally, if /: U -> ? is any locally ЦГЦ-integrable function (in particular, any locally bounded Borel function) we define T\-fD>)= f[f,<l>)fd\\T\\, 4>e@m(U). For any ? e $>m{U) the support of T, spt ?, is the smallest relatively closed subset С of U such that ? (?) = 0 for all ? e S>m (U) with spt<? с U \ С The boundary operator 3:&m+i(U)^&m(U) is defined to be dual to the operator of exterior derivative d:9>m{U)->$>m+\u)
1030 ?. Chlebik by means of Stokes theorem. That means that for given ? e @>m+\ (U), dT is defined by дТ(ф) = Т(аф), фе$'"(и). A sequence of currents {7^} С S>„,{U) is said to converge weakly in U to ? if it converges in the sense of distributions, i.e., Тк(ф) -> ? (?), for every ? e $m(U). This weak convergence will be denoted by Tk —>¦ ? in U. It is easy to prove that the mass is lower semicontinuous with respect to the weak convergence. By the standard Banach-Alaoglu theorem we deduce that from a sequence of currents {7jt} с $>m(U), with equibounded masses My(Tk) on each open set V <s U, we can extract a subsequence convergent weakly to a current ? e @,„(U) representable by integration. Normal currents. We call a current ? e S>„,{U) locally normal if and only if ? is representable by integration and either m = 0 or 9? is representable by integration. ? is normal if it is locally normal and has compact support. The set N,„(i/) = {T e ®m{U): ? is normal} is the union of the sets ?,„(??) = {? e $>,„ (U): ? is normal and spt Г с К} corresponding to all compact sets К CU. For Г е @m(U) we define ?(?) = ?(?) + ?(9?) incasem>0, ?(?) = ?(?) incasem=0. For example, the и-dimensional locally normal currents in R" are the currents E" L/ corresponding to all functions / of locally finite variation in R". Here E" e S>„(R") is the current defined by Е"(ф)= 1[е\/\ег/\---Аеп,ф{х))а^"\х), 0e^"(R"), e\,e2,...,e„ being the standard orthonormal basis in R" and e\ л ет А · · · л е„ the n- vectorfield orienting R". The current E" L A corresponding to .if"-measurable set А с R" is locally normal if and only if A is a set of locally finite perimeter. Real flat norm. Whenever Г is an w-dimensional current in U, and К с U is compact, the FK-seminorm of ? (possibly +oo) is given by FK(T) = inf{M(T -dS) + M(S): 5e^m+i(t/)with sptScK}. The slightly weaker norm F, called the real flat norm, is defined by F (?) = inf[FK (T): К с U is compact}.
Geometric measure theory: Selected concepts, results and problems 1031 The corresponding metrics using distances Fk(T\ - ??) (respectively F(T| - ??)) between T\ and T2 are defined in an obvious way. Rectifiable currents. For a generic current ? representable by integration there is no way of defining "tangent space". Even if spt ? is a smooth w -manifold, the w-vector ? associated to ? has not to be related in any way to tangent space to spt T. Federer and Fleming [21] introduced the subclass of rectifiable currents. This class has good closure properties, and its elements enjoy, in an approximate sense, the differential properties of smooth manifolds. We recall that a subset S с К" is said to be m -rectifiable if, except for а Ж'"-пи\\ set SO, it can be covered by the countable union of C1 smooth w-dimensional manifolds Sj, 7 = 1,2,.... For Ж'" almost all ? e S the approximate tangent space Tan ,5 ofS at ? is defined as the tangent space to Sj at x. (One can show that, up to an Jf""-null subset of S, the definition of Tan, S1 does not depend on the choice of smooth Sj's in the covering S С U^=o $j ·) A current Г е 3>m(U) is said to be an integer multiplicity locally rectifiable current, briefly a locally rectifiable current, if it can be expressed as ? (?) = /^(х),ф(х))в(х)аЖ'"(х), ф е ®"\U), where the set S is an w-rectifiable Ж'"-measurable subset of U, the multiplicity function ?-.S -> Z+ is an Ж'"^^ locally summable positive integer valued function (i.e., fvnS \9\аЖ'" < oo for all open sets V <s U), and ? is an orientation on S. This means that ? is an Ж'" -measurable w-vectorfield on 5" which for Ж'" almost every ? e 5" is associated with Tanks' (i.e., ?(?) can be expressed in the form ?? л · · · л ?,„, where ??,..., ?,„ form an orthonormal basis for Tan, S). In this case we write ? in the form t(S, ?, ?). If, moreover, spt ? is a compact subset of U, a current ? = t(S, ?, ?) is called rectifiable. For example, if S is an m-dimensional C'-submanifold of U with Ж'"(У П S) < 00 for all open sets V <s U, and if ? is a tangent w-vectorfield on S orienting S, so that ? is continuous and ||?|| = 1, then the classical operator integrating differential w-forms over the oriented manifold S, is a locally rectifiable current. Let^,„(i/) be the space of all w-dimensional rectifiable currents in U. (See Federer [19, 4.1.24 and 4.1.28] for a detailed definition and several alternate descriptions of rectifiable currents.) For any compact К с U we further set &m(K) = {Te&m(U): sptTcK}, and when w > 0, 1т(К) = {ТеЯ,п(К): 9Ге^,„_|(К)}, l„(U) = {Te&m(U): ЭГе^-???/)}. The elements of I,„ (U) are w-dimensional integral currents in U.
1032 ?. Chlebik The union of the groups ,?m(K) = \R + dS: R e &m(K) and S e^m+i(K)} corresponding to all compact sets К с U is the group J?m(U) of m-dimensional integral flat chains in U. We largely use the notations and conventions established by Federer [19] in his book, and listed there on pages 669-671. But it should be noted that spaces t%m(K), lm(K), and ^m(K) do not generally coincide with the corresponding spaces &т.к, 1т.к an(J &т.к defined there except when К is a compact Lipschitz neighbourhood retract. Our exposition is, in this respect, more closely related to the approach of Solomon [40]. Recall that К с К" is a Lipschitz neighbourhood retract in R" if there exists a Lipschitzian map which retracts a neighborhood of К in Ш" onto К. Integral flat metric. There is an integral analogue &ц of the real flat seminorm Fa: (K cU being a fixed compact set), defined by ^(r) = inf{M(*) + M(S): T = R + dS, Re&n(K), Se&m+t(K)}. The corresponding metric using distances ^к{Т\ - ??) between T\ and ?? induces so called integral flat metric topology. Deformation theory. One main object of study of general rectifiable or special (for example, area minimizing) m-currents is to obtain strong and useful representations or approximations of those currents by simple deformations of polyhedral chain of the m- skeleton of the standard cubical subdivision of Ш". The deformation theory which first appears in Federer and Fleming [21, 5.5] (see also Federer [19, 4.2.1-4.2.9]) became a basic technique in the theory of normal currents. Using multiple-valued function theory Almgren [1] was able to develop and apply the deformation theory to the size bounded rectifiable currents. A direct consequence of the deformation theorem of Federer and Fleming is the following isoperimetric inequality. ISOPERIMETRIC THEOREM. Suppose 0 < m < n, К С Ш" is convex and compact, and S e I,„(K) with dS = 0. Then there is a T e I,„+i (K) with dT = S and M(T) iC c(n, m)M(S)'J^1. It has been improved by Almgren [2], who proved that the optimal value of the isoperimetric constant does not depend on и. Decomposition of integral currents. A current ? e I,n(IR") is called indecomposable if and only if there exists no Se I»,(K") except 0 (the null w-current) and ? with ?(?) = ?E)+?(?-5).
Geometric measure theorx: Selected concepts, results and problems 1033 It may be shown that an integral current ? e I,„ (R") can always be written as a countable sum of indecomposable currents Tt e I,„(E") such that ОС ЗС T = J2Ti and ?(?) = ? ?(?,). i=\ i=l The structure of 1-dimensional integral currents is relatively simple: every ? e 11 (?") is a sum of finitely many oriented simple arcs and countably many oriented simple closed curves, with finite total length. However, in case 1 < m < ti, even indecomposable currents from I,„(R") can be topologically very complicated; spt ? need not be w-rectifiable. An important conclusion of the deformation theorem is the following: Total boundedness lemma. Suppose 0 ^ с < oo and К с К" is a compact Lipschitz neighbourhood retract. Then A) N,„(/0 П {?: ЩТ) iC с} is ?K-compact, B) \m{K) П {?: ЩТ) iC с} is &K-totally bounded. Since N is lower semicontinuous, the Fa:-completeness of the set in A) is easy, so the only issue in either conclusion is total boundedness. The deformation theorem implies that given any ? > 0, a current as in A) (respectively B)) is within ? of one of finitely many polyhedral chains in the appropriate flat metric. 5.2. Closure and compactness theorem The following theorem and its corollaries convey the most essential information about closure and compactness properties of integral and normal currents, and about the structure of integral flat chains. CLOSURE THEOREM. If К С К" is compact and m ^ 0 is an integer, then lm(K) is F- closed in Nm(K). Corollaries. (l)&m+](K)?Щ,+](К) = Im+i(К). B) If ? e &m(K) andM(T) < oo, then ? e 3?,„{K). Combining the closure theorem with conclusion 2 of the total boundedness lemma we obtain the following: Compactness theorem for integral currents. If К с Ш" is a compact Lipschitz neighbourhood retract, and с < oo, then {T e lm(K). ЩТ) ^ с} is ^K-compact in&m(K). The closure and compactness theorems were proved by Federer and Fleming [21 ]. Their proof relies on the measure-theoretic structure theory developed by Federer and discussed
1034 ?. Chlebik in Section 2. As its proof is quite difficult it has long been an obstacle to those seeking an understanding of the closure theorem. Later other proofs have been given, independent of the structure theory, by Solomon [40], Almgren [1], White [42,43], and Ambrosio and Kirchheim [3]. The compactness theorem for integral currents leads directly to the existence of solutions for a wide class of variational problems. In particular it allowed to establish the existence theorem for the (measure-theoretic) Plateau problem min{M(r): Telln(K), dT = S}, whenever К с К" is convex and compact and S e I,„_ ? (?) with dS = 0. 5.3. Regularity of minima! surfaces Using the theory of integral currents we have obtained the existence of a solution ? e I,n(IR") of the m-dimensional problem of Plateau. As it is a kind of weak solution, the present research directs itself to the problem of regularity of m -dimensional "surface" sptT \spt dT. The first significant partial results on the problem of smoothness of area minimizing currents have been obtained by De Giorgi, Reifenberg and Almgren: (i) the regular points form a dense relatively open subset of sptT \spt9T, (ii) in case m = ? — 1 the set of all singular points is Жт null. We will discuss in some details hypersurfaces (i.e., w-surfaces in R",+l) only. In this case we can use the formalism of De Giorgi [11]. A good account on this approach is the bookofGiusti[22]. In this formalism, an w-surface in R'n+I is a boundary of a set ? of locally finite perimeter. The main idea of De Giorgi is the following. For ? e dE define an approximate normal vector vr(x) = DXE(B(x,r))/\DXE\(B(x,r)). It is possible to prove that if for some ro > 0 the vector vro(x) has length close enough to 1, then H1TV40 IK(Jt)|| = 1· This is a crucial part of the proof from which it is easy to show that dE is regular (even analytic) in a neighbourhood of x. Once the generic regularity has been established, one can ask whether the singular set can exist at all. It is natural to study behaviour of dE near a point ? e dE blowing up the situation by sequences of expansive homotheties Txs.{ri I 0) around ? (as in Section 3, Tx,r(y) = (y~ x)/r). Clearly, all the homothetic images of dE are minimal surfaces and subsequence will converge in a measure to a set C, which is itself minimal. Moreover, С is a cone (with vertex at 0), roughly speaking a tangent cone to 9 ? at ?. One can conclude that dE is regular near ? if and only if С is a hyperplane. So the existence of singularities of minimal hypersurfaces is reduced to the existence of singular minimal cones. Almgren proved the non-existence of singular minimal cones in ? , and
Geometric measure theory: Selected concepts, results and problems 1035 Simons [38] extended this result up to R7 concluding that m-dimensional minimal surfaces in R'n+1 are regular, for m ^ 6. This result is the best possible since the cone C = {*eR8: ^2+JC2+JC2+JC2=JC2+JC2+JC2+JC2j is a singular minimal cone. This was shown by Bombieri, De Giorgi and Giusti [4]. References [1] F.J. Almgren Jr., Deformations and multiple valued functions. Geometric Measure Theory and Calculus of Variations, Proc. Sympos. Pure Math., Vol. 44, Amer. Math. Soc., Providence, RI A986), 29-130. [2] F.J. Almgren Jr., Optimal isope rime trie inequalities, Indiana Univ. Math. J. 35 A986), 451-547. [3] L. Ambrosio and B. Kirchheim, Currents in metric spaces. Acta Mathematica 185 B000), 1-80. [4] E. Bombieri, E. De Giorgi, and E. Giusti, Minimal cones and the Bernstein problem, Inv. Math. 7 A969), 243-268. [5] A.S. Besicovitch, On the fundamental geometric pmperties of linearly measurable plane sets of points (I), Math. Ann. 98 A928), 422-484. [6] A.S. Besicovitch, On the fundamental geometric properties of linearly measurable plane sets of points (II), Math. Ann. 115 A938), 296-329. [7] A.S. Besicovitch, On the fundamental geometric properties of linearly measurable plane sets of points (HI), Math. Ann. 116 A939), 349-354. [8] J.E. Brothers, Some open problems in geometric measure theory and its applications, Proc. Sympos. Pure Math., Vol. 44, Amer. Math. Soc., Providence, RI A986), 441-464. [91 E. De Giorgi, Su una teoria generate della misura (r — 1) -dimensionale in uno spazio ad r dimensioni, Ann. Mat. Рига Appl. 36 D) A954), 191-213. [10] E. De Giorgi, Nuovi teoremi relativi alle misure (r — \)-dimensionali in uno spazio ad r dimensioni, Ricerche Mat. 4A955), 95-113. [11] E. De Giorgi, Fmntiere orientate di misura minima, Sem. Mat. Scuola Norm. Sup. Pisa, Editrice Tecnico Scientifica, Pisa A961). [12] J. Douglas, Solution of the problem of Plateau, Trans. Amer. Math. Soc. 33 A931), 263-321. [13] L.C. Evans and R.F. Gariepy, Lecture Notes on Measure Theory and Fine Properties of Functions, Stud. Adv. Math., CRC Press A992). [14] K.J. Falconer, The Geometry of Fractal Sets, Cambridge University Press, Cambridge A985). [15] K.J. Falconer, Fractal Geometry: Mathematical Foundations and Applications, Wiley A990). [16] H.M. Farag, Unrectifiable I-sets with moderate essential flatness satisfy Besicovitch's 4 -conjecture. Adv. Math. 149B000), 179-186. [17] H. Federer, The Gauss-Green theorem. Trans. Amer. Math. Soc. 58 A945), 44-76. [18] H. Federer, A note on the Gauss-Green theorem, Proc. Amer. Math. Soc. 9 A958), 447-451. [19] H. Federer, Geometric Measure Theory, Springer, New York A969). [20] H. Federer, Colloquium lectures on geometric measure theory, Bull. Amer. Math. Soc. 84 A978), 291-338. [21] H. Federer and W.H. Fleming, Normal and integral currents, Ann. of Math. 72 A960), 458-520. [22] E. Giusti, Minimal Surfaces and Functions of Bounded Variation, Birkhauser A984). [23] R. Hardt and L. Simon, Seminar on Geometric Measure Theory, Birkhauser, Boston A986). [24] B. Kirchheim, Rectifiable metric spaces: local structure and regularity of the Hausdorff measure, Proc. Amer. Math. Soc. 121 A994), 113-123. [25] O. Kowalski and D. Preiss, Besicovitch-type pmperties of measures and submanifolds, J. Reine Angew. Math. 379A987), 115-151. [26] A. Lorent, Rectifiability of measures with locally uniform cube densities, Proc. London Math. Soc. B002), in print. [27] J.M. Marstrand, Hausdorff two dimensional measure in 3-space, Proc. London Math. Soc. 11 C) A961), 91-108.
1036 ?. Chlebik [28] J.?. Marstrand, The (<p,s) regular subsets of ? space. Trans. Amer. Math. Soc. 113 A964), 369-392. [29] P. Mattila, ?ausdorff m-regular and rectifiable sets in ?-space, Trans. Amer. Math. Soc. 205 A975), 263- 274. [30] P. Mattila, Geometry of Sets and Measures in Euclidean Spaces, Cambridge Stud. Adv. Math. A995). [31] E.F. Moore, Density ratios and(?, I) rectifiability in ?-space. Trans. Amer. Math. Soc. 69( 1950), 324-334. [32] F. Morgan, Geometric Measure Theory -A Beginner's Guide, Academic Press A988). [33] A.P Morse and J.F. Randolph, The ? rectifiable subsets of the plane. Trans. Amer. Math. Soc. 55 A944), 236-305. [34] D. Preiss, Geometry of measures in K": Distribution, rectifiability, and densities, Ann. of Math. 125 A987), 537-643. [35] D. Preiss and J. Tiser, On Besicovitch \ -problem, J. London Math. Soc. 45 A992), 279-287. [36] T. Rado, On the Problem of Plateau, Springer, Berlin A933). [37] С A. Rogers, Hausdorff Measures, Cambridge University Press A970). [38] J. Simon, Minimal varieties in Riemannian manifolds, Ann. of Math. 88 A968), 62-105. [39] L. Simon, Lectures on Geometric Measure Theory, Proc. Centre for Math. Anal., Vol. 3, Australian Nat. Univ., Canberra A983). [40] B. Solomon, Л new proof of the closure theorem for integral currents, Indiana Univ. Math. J. 33 A984), 393-418. [41] A.J. Volpert, Prostranstva BV i kvazilinejnyje uravnenija. Mat. Sb. 73 A967), 255-302. [42] B. White, A new proof of the compactness theoremfor integral currents. Comment. Math. Helv. 64 A989), 207-220. [43] B. White, Rectifiability of flat chains, Ann. of Math. 150 A999), 165-184. [44] WP. Ziemer, Weakly Differentiable Functions, Springer, Berlin A989).
CHAPTER 25 Fractal Measures Kenneth J. Falconer Mathematical Institute. University of St Andrews, Fife KYI6 9SS, Scotland. UK E-mail: kjf@st-andrews.ac.uk Contents 1. Introduction 1039 2. Hausdorff and packing measures and dimensions 1039 2.1. Hausdorff measures 1039 2.2. Packing measures 1041 2.3. Calculation of Hausdorff and packing measures and dimensions 1042 2.4. Geometric measure theory 1044 2.5. Geometry of Hausdorff measures and dimensions 1046 2.6. Subsets of finite measure 1047 3. Measures with a fractal structure 1047 3.1. Local dimensions 1047 3.2. Dimension decomposition 1048 3.3. Multifractal measures 1049 3.4. Geometry of multifractal measures 1051 References 1051 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1037
This page intentionally left blank
Fractal measures 1039 1. Introduction A fractal is a highly irregular set with a fine structure, that is with irregular detail at arbitrary small scales. Often a fractal has some sort of self-similarity or self-affinity, perhaps in a statistical or approximate sense. Many fractals of interest have a simple recursive definition, and perhaps a 'natural' appearance. Fractals are too irregular to be analysed using the traditional calculus or geometry of smooth objects, and so were largely ignored by mathematicians until well into the 20th century. Hausdorff measures provided the first major tool for studying such sets, but for many years attracted only limited attention (largely by Besicovitch and his co-workers), with individual fractals studied as pathological curiosities. The area received enormous impetus in the late 1970s with the publication of Mandelbrot's essays [55,56] which presented fractals as the rule, rather than the exception, in both mathematics and nature. At the same time, computer technology had reached a level that made it possible to produce good pictures of many and varied fractals. These developments led to a renaissance of interest in Hausdorff measures and dimensions and in their applications to fractal analysis. The first part of this chapter gives brief account of Hausdorff measures and their variants and extensions as the natural tools for studying the geometry of fractals. The definition of Hausdorff measure immediately leads to the notion of Hausdorff dimension, and many properties are then conveniently expressed in terms of dimensions rather than measures. In a sense, Hausdorff measures are measures that are distributed as uniformly as possible across fractal sets. However, it is possible to construct measures that themselves have highly irregular distributions across their support; indeed such measures arise naturally in areas such as dynamical systems, number theory and turbulence. Measures with such a fine structure are often thought of as 'fractal measures' or perhaps 'multifractal measures', a term that reflects a single measure giving rise to a spectrum of fractal sets determined by local densities. The second part of the chapter considers these fractal measures and their analysis, which generally involves measures and dimensions of Hausdorff type. There is now an enormous literature on fractal measures, and there is only space here for a few selected references. Texts which give detailed treatments of fractal measures at various levels of sophistication and which give many further references include [16,18,21, 32,62,83]. 2. Hausdorff and packing measures and dimensions This section gives a rapid treatment of Hausdorff and related measures. More details may be found in the texts mentioned above. 2.1. Hausdorff measures In his famous paper of 1919, Hausdorff [37] showed that a general construction of Caratheodory could be used to construct 's-dimensional measures', where s is not necessarily an integer. In particular, he showed that the middle-third Cantor set has positive
1040 K.J. Falconer and finite log2/log3-dimensional measure. Ever since, such Hausdorff measures have been a principal tool for studying fractal sets. Let (X,d) be a metric space (often и-dimensional Euclidean space R" with the usual metric). The diameter of a non-empty set U is diami/ = sup(d(;t, у): х, yet/}. For ? > 0 we term a finite or countable family of sets {i/,},^, a S-cover of F с X if diam i/, ^ 5 for all / and F с (J^, i/,. For s ^ 0 we write ? * 1 ?^(F) = inf{^(diaint/,)s: {t/,} is a<S-coverof F }. A) Clearly, as S decreases, the allowable <5-covers for the infimum is reduced, so HSS(F) is non-decreasing as S I 0. Thus the limit as S I 0 exists, and we define the s-dimensional Hausdorff (outer) measure of F as H\F) =lim HUF). B) It is easily verified that 7is is a metric outer measure, that is H*(EUF) = HS(E) + HS(F) if ? and F are separated by a positive distance, and consequently that the Borel and Souslin sets are W -measurable for s > 0. Moreover, W is Borel regular, that is for all F с X there exists a Borel set F э F with Ws(?) = HS(F), see [18,62]. Hausdorff measures on Ш" extend the classical Lebesgue measures of length, area, volume, etc. For example, if F is a subset of R or is a curve in R" for ? ^ 2, we have W1 (F) = length (F) and if F is a subset of R2 or a smooth surface in R" for ? > 3, we have H2(F)= -area(F), etc. Hausdorff measures have many nice properties. For example, if F с R" and /: F -> R" is a Lipschitz mapping, that is if |/(*)-/00|<Ф-;у| for*,>-eF, C) where о 0, then H>(f(F))^csHs(F). D)
Fractal measures 1041 In particular, this leads to the congruence property, that if /: R" -> R" is a congruence or isometry then W{f{F)) = HS(F) and the scaling property; that if /: R" -> R" is a similarity transformation with scaling ratio c, then Hs(f{F))=csHs(F). E) This generalizes the familiar scaling properties of length, area and volume, and is indicative of the strong geometrical nature of Hausdorff measures. A simple exercise using A) shows that if 0 ^ s < t then H's(F)<:ni(F)S"\ On letting ? i 0 it follows that if HS{F) < oo, then H'(F) = 0 for all t > s. This fact allows us to define the Hausdorff dimension of F с X as dimw F = inf{s: Hs{F) < oo} = supji: Hs(F)>0}. F) A number of important properties of Hausdorff dimension are immediate from the measure properties of W. For example, DC dimw yj F,- = sup dim^ F,-. 0) /=| 1ф'<зс If F с R" and /: F -> R" is Lipschitz it follows from D) that dimw /(F) <; dimw F (8) with equality if / is a bi-Lipschitz mapping (with /"' : /(F) -> F also Lipschitz) such as a (non-singular) affine transformation. A consequence of this is that if F is a smooth m-dimensional submanifold of R" then dim/y F = m. Sometimes more delicate Hausdorff measures are useful. A gauge function h : [0, oof -> [0, oof is a strictly increasing function with /г@) = О and continuous on the right. The Hausdorff measure construction goes through in just the same way on replacing (diam t/,-)s by h (diam i/,-) in A) to give a generalized Hausdorff measure Hh. This can be useful for analyzing sets F for which there is no real number s with 0 < HS(F) < oo. For example, a Brownian path F in R", ? > 3, has (with probability 1) Hausdorff dimension 2, with H2(F)=0, bm0<Hh(F) <ooif/2@ = f2loglogf-'. 2.2. Packing measures Packing measures are in many ways dual to Hausdorff measures (which might be termed 'covering measures'), and, rather surprisingly, were not introduced until the early 1980s [90,91,94].
1042 K.J. Falconer Let (X, d) be a metric space. Given FCX.a (finite or countable) collection of disjoint balls {?,·} is called a S-packing of F if each ball has centre in F and radius at most S. For OOwe write ^(radius Bif: {B,} is a ^packing of F \. (9) i=\ I The limit ^(F) = lim^(F) A0) exists, but Vq(F) need not be countably subadditive. To overcome this we set EW):fiUF' A1) i=\ i=l J which is a Borel measure, and which is termed s-dimensional packing (outer) measure onX. An alternative approach is to work with the diameter based s-dimensional packing measure. This is defined in a similar way, except that 'radius B,¦' is replaced by 'diamfi;' in (9). The radius and diameter based packing measures are equivalent on E", but may be very different for certain nonlinear metric spaces. Packing dimension is then defined in the obvious way: dimP F = M{s: Vs(F)<oo} =sup{s: Vs(F)>0}. Basic properties of packing dimensions are easily derived from measure properties, for example the analogs of G) and (8) hold for dim/>. Not surprisingly, packing and Hausdorff measures are related by an inequality. Some basic estimates together with the Vitali covering theorem establish that for all subsets F of a metric space X and s ^ 0 HS(F)^VS(F) and consequently dim/y F ^ d'imp(F). See, for example, [ 16,21,62] for further details of packing measures and dimensions. 2.3. Calculation of Hausdorff and packing measures and dimensions Calculation of Hausdorff and packing measures and dimensions of a set F is often non- trivial; the exact value of the Hausdorff measure of many simple fractals is still unknown.
Fractal measures 1043 Often, a direct covering estimate can give good upper bounds for Hausdorff measures. Lower bounds may require indirect methods, and a frequent approach is to define a natural measure supported by F and to examine its properties. In particular, if F с R" supports a measure ? such that 0 < m(F) < oo, and 0 < с < oo: if limsupM(B(;c, r))/rs <: с for all ? e F then W?(F) ^ m(F)/c, A2) r—0 if ?\?\$???(?(?^))/?* ^c foralljceF then WS(F) ^ 2V(F)/c, A3) r—>oo if ???\??(?(?, r))lrs ^ с foralljceF then V\F) > 2'V(F)/c, A4) if liminfM(B(jc,r))/rs>c foralljceF then -p?(F) ^ 2>(F)/c A5) (see [13,26]). These properties illustrate the duality between Hausdorff and packing measures. To illustrate the basic approach, let F be the middle-third Cantor set, obtained from the unit interval by repeatedly removing the middle-third of intervals, so at the kth step of the construction we are left with a (closed) set Fk consisting of 2k intervals of lengths 3~k, with F = Пь=о^· Tnen takmg tne intervals of Fk as а 3~к-со\ст of F gives W3_k(F) ? 2k3~sk, so fti°g2/iog3(F) <; { and dimH F <; iog2/log3. On the other hand, if ? is the natural measure supported by F which assigns each interval of Fk a measure 2~k, a simple estimate gives ?(?(?, r)) <; 2rl0-2/|0=3 so that Ti}^2' |0?3(F) > | anddimw F ^ log 2/log 3 by A2). This type of calculation extends to a large class of 'self-similar' sets. For simplicity, let /|,..., fm : M." -> Ш" be a family of (strict) contractions; such a family is called an iterated function system (IFS). It may be shown [18,43] that there exists a unique nonempty compact FCR", known as the attractor or invariant set of the IFS, satisfying m f = \Jmf). A6) i = l If the contractions f are similarities, that is if \f(x)- /?(>'I = ^?\?- y\ for all xjel", the set F is termed self-similar. The type of calculation indicated for the middle-third Cantor set extends to show that dimw F = dim/> F = s with 0 < W{F) < oo, and with computable lower and upper bounds for HS(F), where s is the solution of m ?'? = 1- A7) i=l provided that the /, satisfy the open set condition. This condition requires that the components {//(F)}^., are 'reasonably separated' in that there exists a (non-empty) open set U such that {/Hi/)}?!, are disjoint and (Jit ? /i(^) ? i/. This condition holds for many familiar fractals, for example the von Koch curve and the Sierpinski triangle.
1044 K.J. Falconer The theory of generalized capacities developed alongside Hausdorff measures, and there are strong links between them. In particular, A2) may be used to show that, for F с ?", dim/y F is the supremum value of s for which there exists a positive measure ? supported by F with ? \?-?\-???{?)??{?)<??. A8) This provides a further, now standard, technique for getting lower bounds for dimensions, and is central in the study of the geometry of measures and dimensions. Hausdorff and packing measures and dimensions have been calculated or estimated for a wide range of fractals. Some of the main classes that have been considered are listed below, with selected references. • Self-similar sets (the /} in A6) are similarities) [18,43]. • Graph directed self-similar sets [3,66]. • Self-affine sets (the /, in A6) are affine maps) [20,22,68,76]. • Self-conformal sets (the ft are conformal maps) [6,85]. • Self-similar sets with infinitely many maps (/} are similarities with m = oo in A6)) [58,63,64]. • Self-conformal sets with infinitely many maps (/, are conformal maps with m = oo in A6)) [64]. • Random self-similar sets (ft are random similarities) [19,34,35,48,54]. • Sub/super self-similar sets ('=' is replaced by 'c' or "э' in A6)) [25]. • Sets defined by continued fractions [38,64]. • Sets defined by Diophantine approximation properties [4,15]. • Graphs and surfaces of self-affine functions [60,65]. • Paths and graphs of Brownian motion and its generalisations [1,46,92]. • Julia sets (invariant sets of analytic mappings) [85]. • Attractors or repellers of dynamical systems [23,49,52,67,78,88,93]. 2.4. Geometric measure theory One of the early uses of Hausdorff measures was the study the geometric structure of subsets of E", a programme pioneered by Besicovitch through the middle part of the 20th century. An early realisation [5], see [18,62], was that a compact subset F of E2 with 0 < ?) (F) < oo can be decomposed into a 'regular' or 'curvelike' part and an 'irregular' or 'fractal' part which can be distinguished either measure-theoretically, in terms of the existence of local densities, or geometrically, in terms of rectifiability or the existence of tangents. Higher dimensional results of this type require sophisticated machinery. A set F с E" is called m-rectifiable if it is made up of countably many pieces which are Lipschitz images of E", that is if there exist countable many Lipschitz maps /,: E'" -> E" such that Wf\Q/;(e'"))=o.
Fractal measures 1045 A regular Borel measure ? on Ш" is m-rectiflable if there is an m-rectifiable set F with ?(?." \ F) = 0 and ? is absolutely continuous with respect to H'". The fundamental result relating densities and rectifiability was proved in full generality by Preiss in 1987 [81]. Fuller details are given by Chlebik [10] (Chapter 24 of this Handbook), but briefly, if ? is a positive finite Borel measure on E" and the density \??\?^?(?(?(?, r))/rm) exists and is positive and finite for ?-almost all x, then m is an integer and ? is m-rectifiable. A consequence of this is that a Borel set Fwith H'"(F) < oo is m-rectifiable (with m an integer) if and only if the density . H'"(B(x,r)nF) lim r^o Br)'" exists for W"-almost all ? e F, in which case this density equals 1 almost everywhere. Moreover, there is a number 0 < c{m) < 1 such that if Hm(B(x,r)(lF) liminf > c(m) /•^0 Br)'" forW'-almostalljt e F then F is m-rectifiable. It has long been conjectured that c(l) = y, but the best value to date is -^B + V46) ^ 0.732 [82]. Packing measures also provide characterizations of rectifiability. For example, if F is a Borel subset of R" with 0 < V"\F) < oo, then V'"(F) = 7i"l(F) if and only if m is an integer and the restriction of V" to F is m-rectifiable [62,86]. The modern approach to such geometric questions uses tangent measures, which are limits of normalized enlargements of a given measure but which behave in a more regular and tractable way than the original measure. More precisely, if a e M." and r > 0, let Ta.r denote the transformation on measures defined by (??,-?)(?) = ?(? ? + a) for every set A. A non-zero measure visa tangent measure of ? at ? if there are positive sequences r, -> 0 and c, such that c,Ta ;/? -> ? in the vague topology. Tangent measures have been used to great effect in relating local features of a measure to properties such as integral dimensionality and rectifiability. For example, the proof of Preiss' theorem uses tangent measures to reduce density properties to more manageable questions on the structure of m-uniform measures, that is measures ? with v(B(x, r)) = const r'n for all ? e spt ? and r > 0. Tangent measures have also been used to show that integral dimensionality and rectifiability follow from even weaker density conditions, such as conditions relating to average densities of the form 1 fTW(FnB(x,e-')) liminf— I dt, 7-^oc ? J0 Bе-'У see [69].
1046 K.J. Falconer 2.5. Geometry of Hausdorff measures and dimensions Good geometrical properties are a key feature of Hausdorff measures and dimensions, properties which in many way generalize those of classical length and area on curves and surfaces. For example, conczmingproducts of sets ? с R" and F cRffl, there are numbers q > 0 (depending on n, m, s, t) such that c\Hs(E)H\F) ^ HS+'{E ? F) <: c2Hs(E)V'(F) ? ciVs+'(E ? F) <; c47>i(EO>'(F), A9) so, in particular, dim/y ? + dim/y F ^ dim/y (? ? F) ^ dim/y F + dim/> F ^ dim/>(F ? F) ^ dirnp F + dim/> F. B0) More generally, there are integral inequalities of the form Hs(Gx)dH'(x) iC cHs+\G), B1) / for G a Borel subset of R"+"' where G.v is the section {y e R": (jc, y) e G) for jc e R", see for example [21,40,59,62]. For V a w-dimensional subspace of R", let projy : R" -> V be orthogonal projection. Since projy is a Lipschitz mapping, for all F с R" and all subspaces V, we have Hs(pro)v F) ^ W(F), and dim/^projy F) ^ min{w,dimw F). In fact this dimension inequality is nearly an equality: for a Borel set F с R" and for almost all w-dimensional subspaces V of R" (in the sense of the natural invariant measure on subspaces), dim/y (projy F) = min{w, dim^ F). Proofs of this type of identity nowadays use the potential theoretic approach A8), see [21, 50,62]. When F is a Borel set with dimw F = w and 0 < H'"(F) < oo the projection properties onto an w-dimensional subset V are delicate. If F contains a rectifiable subset of positive measure then ?.'"(???)? F) > 0 for almost all V, otherwise ?'"(???)? F) = 0 for almost all V [18,62,81]. Packing dimensions behave less well under orthogonal projection. For example d'imp ? ^ ?'??\?(???')? F) ^ min{w, dim/> F] 1 +(l/w- \/n)dimPE for almost all w-dimensional subspaces V, with these inequalities best possible. Nevertheless, dimp projy F is constant for almost all V, see [27,28].
Fractal measures 1047 These are also natural results on the dimensions of intersections. If ?, F are Borel subsets of R" then dimw(?n(? + Jt))^max{0,dimw(? ? F) - n\ for almost all ? e R", where F + x denotes F translated by ?. This follows using B1) with G = ? ? ?, and is the best possible upper bound. The inequality in the other direction, dimw(?na(F)) > dimw(?) + dimw(?) - ? holds for a set of transformations ? e G of positive measure in a group of transformations G in the following cases: (a) G is the group of similarities; (b) G is the group of congruences and either ? is a rectifiable set or dimw ? > \(n+\). The proof uses potential theoretic methods [47,61,62]. Intersection properties involving packing measures and dimensions are complicated [44]. 2.6. Subsets of finite measure For a measure to be of much use for studying a set F, the measure of F needs to be positive and finite, or at least ?-finite. However, for many F, there is no value of s with 0 < W(F) < oo nor even any gauge function h such that 0 < Hk{F) < oo. One approach then available is to seek a (topologically reasonable) ? с ? with positive finite measure. Much technical work has been motivated by this aim. The basic result is that if ? с R" is a Borel or Souslin set (that is a continuous image of a Borel set) with Hh{F) > 0, then there exists a compact ? с ? with 0 < Hh (?) < oo. Early proofs of this [83,14] involve setting up a 'net measure' on ? equivalent to the Hh and using the ultrametric structure to pick out the subset ?. Recently, a completely different approach has been introduced [39,62] using weighted Hausdorff measures to enable functional analytic techniques to be employed. Analogous results are available for packing measures, see [45]. 3. Measures with a fractal structure We now turn to measures that are of such widely varying intensity that they may be thought of as having a fine structure, that is as fractal measures in their own right. Indeed, a single measure may give rise to many fractal sets. 3.1. Local dimensions Given a Borel regular measure ? in R" with 0 < m(R") < oo, it is natural to define the (lower) local dimension or local Holder exponent of ? at ? by dimioc ?(?) = liminf r^O logr B2)
1048 K.J. Falconer Thus, dimioc ?(?) = a if a is the greatest number such that, for all ? > 0, ?(?(?, г)) ^ const ra~? for all sufficiently small r. Local dimensions are intimately related to Hausdorff and packing dimensions. For example, dim/y F = supjs: there exists ? with 0 < ?(?)< oo and dimioc ?(-*) ^ s for ? almost all * e F}, see [26]. We say that ? has ejcaci (lower) dimension s if dim|0C ?(?) = s for ?-almost all x. Surprisingly many measures have an exact dimension, for example, if ? is an invariant ergodic measure of a Lipschitz mapping /: X -» X for X a closed subset of R", then ? is exact dimensional [31]. For certain 'rich' measures ?, the sets Fa = \x: dimioc ?(*) = <*} B3) may be 'large' for a range of a. There are two natural ways to quantify Fa: in terms of the measure ? itself, leading to the 'dimension decomposition' into components of different exact dimensions, and in terms of Hausdorff dimension, leading to the 'multifractal spectrum'. 3.2. Dimension decomposition The dimension decomposition aims to split a measure up into exact dimensional components, as far as this is possible. For ? a Borel measure on R" with 0 < ?(?") < oo, and with Fa given by B3), the set S={ae[0,n]: ?(?„)>0} is finite or countable. For each aeSwe define a Borel measure ?" by Mff(A)=M(AnF„) for А С R"; the measure ?" may be shown to be of exact dimension a for all a e S. We also define md(A) = m[a\(JfA, then ?° is a d/j^se measure, in that ?°(??) = 0 for all a. Then ? has a decomposition into a countable number of exact dimensional components and a diffuse part: ? = ??"+?°, aeS see [ 12,26,84] for proofs of this.
Fractal measures 1049 3.3. Multifractal measures An alternative, perhaps richer, approach is to look at the dimension of the sets at which a measure takes a given local density. This approach stems from Mandelbrot's idea [54] which was taken up in the physics literature [33,36]. Let ? be a finite Borel regular measure on R". It is convenient here to set ? & R . hm =a , r—o log r J so that now Fa is the set of points ? at which the local dimension exists in a strong sense and equals a, so that, roughly, ?(?(?, r)) ~ ra for small r. If ? is Lebesgue measure restricted to [0, 1 ] then F\ =[0,1] and Fa = 0 for ? ? 1, a consequence of the uniformity of Lebesgue measure. However, if ? is highly irregular, the set Fa can be substantial, with dim/y Fa > 0, for a range of a. Such а ? is thought of as a multifractal measure, with its (fine) multifractal spectrum defined as /(a)=dimw Fa for the range of ? for which Fa ? 0. Thus the multifractal spectrum of a measure attempts to quantify the intensities of the singularities of the measure. There is an enormous literature on multifractal analysis; general accounts may be found in [16,21,26,57,71,74,78]. Calculation of multifractal spectra is rarely easy; however there are several important classes of measure for which the multifractal spectrum may be obtained as the Legendre transformation of a straightforward 'auxiliary function'. This is best illustrated by the standard example of a self-similar or multinomial measure. Let /?,..., /„,: R" -> R" be an iterated function system of contractions satisfying the open set condition, where f has scaling ratio c, < 1, and let F be the non-empty compact attractor of this IFS, see A6). Let p\,..., p,„ be 'probabilities', so that p, > 0 for all /' and Y%'-\ Pi = 1 · Then F supports a measure ? defined by the requirement that ? (У/, ° /ь ° · · · ° fh (F>) = Ph Ph ¦ ¦ ¦ Pik for all к and all 1 $5 ?'?, b,. ¦ ¦, Ц· ^m. The measure ? satisfies III ?(?) = ????(/?\?)). B4) ;=? We now assume that ? is self-similar, that is that the f are similarities, which we also assume satisfy the open set condition. We define the auxiliary function ?: R -> R by 1П / = l Fa =
1050 K.J. Falconer It is easy to see that ? is smooth, decreasing and strictly convex (provided that log pi/ log r, is not constant for all /, a straightforward case we do not consider here). The multifractal spectrum of ? turns out to be the Legendre transform of /3, that is /(<*)= inf {P(q)+aq}. B5) This may be established by defining an 'auxiliary measure' ? on F by i>(/,-, о Д ? · ¦ · о fik(F)) = (pi,pi2 ¦¦¦piK)q(ritrb ¦¦¦??? and examining the local properties of ? using A2)-A5). The multifractal spectrum f(a) is defined and concave for amjn ^ a ^ «max, where amin = mini ^(^m log ?; / logr/ and amax = max|^,„logp,71ogr;, with /(a) attaining a maximum of dim/y F. There are many measures for which the multifractal spectrum is the Legendre transform of an appropriately defined auxiliary function. Some of the main cases that have been examined are listed below. • Self-similar measures (/, are similarities in the IFS B4) defining the measure) [2,9, 71]. • Graph directed self-similar measures [17]. • Self-conformal maps (f are conformal maps in B4)) [8,79]. • Self-similar measures for infinite IFS (/) are similarities in B4) with m = oo) [58,63]. • Vector-valued self-similar measures [29]. • Quasi self-similar measures [75]. • Self-affine measures (/; are affine maps in B4)) [24,51,73], • Random self-similar measures (/, are random similarities, p, are random probabilities in B4)) [2,24,48,54,70]. • Random graph-directed self-similar measures [70]. • Equilibrium measures of hyperbolic dynamical systems [78,89]. • Invariant measures for complex maps [11,53]. • Occupation measures of Brownian motion and generalizations [41,77,87]. A very general approach to multifractal analysis utilizes measures of Hausdorff type, see [7,71,78,80]. Given a finite regular measure ? on E" and q, ? e ? we may define auxiliary measures Hq^ on E" using the following steps: HfP(E) = infjJ3 ???,/'^???,/: ? с Ц ?, where ?, are balls centered in ? with diamfi, ^ S , 7?·?(?)=????7^·?(?), u a^o ?.?·?(?) sup ??0·?(?'). B6) E'CE (Using covers by balls centered in ? avoids difficulty when q < 0, and the final step is needed to ensure monotonicity.)
Fractal measures 1051 Analogously to the definition of Hausdorff dimension, we define for q e R /3(<7) = sup{/3: №р(тл) = 0}. B7) For all measures ? there is an inequality with the Legendre transform of ?: f(a) = dimw Ea ^ inf \fi(q) + qa\, and there are many measures for which equality holds, including self-similar and self- conformal measures, see [8,70]. Generalized packing measures V4^ may also defined along these lines [71]. 3.4. Geometry of multifractal measures It is natural to seek identities relating the multifractal spectra of measures to those of their products, projections, intersections, etc., analogous to the geometric properties of dimensions of sets mentioned in Section 2.5. Such properties are elusive, and though some results are known, many open questions remain. For products of measures there are good analogous of A9)—B1). Let ?, ? be finite Borel measures on R" and R"\ respectively, and let ?\^ and V^ be the generalized (q, /3)-Hausdorff and packing measures respectively, constructed from ?, see B6). Then for?CR"andFcR'», cinlJip(E)ny(F) ? Hl^y(E xF)<: c2W/{E)Vy{F), leading to inequalities between the fi(q) and f(a) functions for the measures involved [72]. Results on projections of measures are more limited in scope, see [30,42,75]. For slices of measures (that is intersections of measures with w-planes) and more general intersections of measures, see [74] for a survey of the (often rather technical) results that have been established. References [1] R.J. Adler. The Geometry of Random Fields, Wiley A981). [2] M. Arbeiter and M. Patzschke, Random self-similar multifractals. Math. Nachr. 181 A996), 5^42. [3] T. Bedford, Dimension and dynamics of fractal recurrent sets, J. London Math. Soc. B) 33 A986), 89-100. [4] V.I. Bemik and M.M. Dodson. Metric Diophantine Approximation on Manifolds, Cambridge University Press A999). [5] A.S. Besicovitch. On the fundamental geometric properties of linearly measurable plane sets of points II, Math. Ann. 115 A938), 296-329. [6] R. Bowen, Equilibrium States and the Ergodic Theory ofAnosov Diffeomorphisms, Lecture Notes in Math., Vol. 470, Springer, Berlin A975). [7] G. Brown, G. Michon and J. Peyriere, On the multifractal analysis of measures, J. Stat. Phys. 66 A992), 775-790.
1052 K.J. Falconer [8] J. Cole, The multifractal geometry of graph directed self-conformal measures A999), to appear. [9] R. Cawley and R.D. Mauldin, Multifractal decomposition of Moran fractals. Adv. Math. 92 A992), 196— 236. [10] M. Chlebik, Geometric measure theory: Selected concepts, results and problems. Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam B002), 1011-1036. [11] P. Collet, R. Dobbertin and P. Moussa, Multifractal analysis of nearly circular Julia sets and the multifractal formalism. Ann. Inst. H. Poincare Phys. Theor. 56A992), 91-122. [12] CD. Cutler, The Hausdorff dimension distribution of finite measures in Euclidean space. Canad. J. Math. 38A986), 1459-1484. [13] CD. Cutler, Strong and weak duality principles for fractal dimension in Euclidean space. Math. Proc. Cambridge Philos. Soc. 118 A995), 393-410. [14] R.O. Davies and С A. Rogers, The problem of subsets of finite positive measure. Bull. London Math. Soc. 1 A969), 47-54. [15] M.M. Dodson, B.P. Rynne and J.A.G. Vickers, Diophantine approximation and a lower bound for Hausdorff dimension. Mathematika 37 A990), 59-73. [16] G.A. Edgar, Integral, Probability and Fractal Measures. Springer, Berlin A998). [17] G.A. Edgar and R.D. Mauldin, Multifractal decompositions of digraph recursive fractals, Proc. London Math. Soc. C) 65 A992), 604-628. [18] K.J. Falconer, The Geometry of Fractal Sets, Cambridge University Press A985). [19] K.J. Falconer, Random fractals. Math. Proc. Cambridge Philos. Soc. 100 A988), 559-582. [20] K.J. Falconer, The Hausdorff dimension of self-affine fractals. Math. Proc. Cambridge Philos. Soc. 103 A988), 339-350. [21] K.J. Falconer, Fractal Geometry - Mathematical Foundations and Applications. Wiley A990). [22] K.J. Falconer, The dimension of self-affine fractals 11, Math. Proc. Cambridge Philos. Soc. HI A992), 169-179. [23] K.J. Falconer, Bounded distortion and dimension for non-conformal repellers. Math. Proc. Cambridge Philos. Soc. 115 A994), 315-334. [24] K.J. Falconer, The multifractal spectrum of statistically self-similar measures, J. Theor. Probab 7 A994), 681-702. [25] K.J. Falconer, Sub-self-similar sets. Trans. Amer. Math. Soc. 347 A995), 3121-3129. [26] K.J. Falconer, Techniques in Fractal Geometry. Wiley A997). [27] K.J. Falconer and J.D. Howroyd, Projection theorems for box and packing dimensions. Math. Proc. Cambridge Philos. Soc. 119 A996), 287-295. [28] K.J. Falconer and J.D. Howroyd, Packing dimensions of projections and dimension profiles, Math. Proc. Cambridge Philos. Soc. 121 A997), 269-286. [29] K.J. Falconer and T.C. O'Neil, Vector-valued multifractal measures. Proc. Roy. Soc. London A 452 A996), 1-26. [30] K.J. Falconer and T.C. O'Neil, Convolutions and the geometry of multifractal measures. Math. Nachr. 204 A999), 61-82. [31] A.H. Fan, On ergodicity and unidimensionality, Kyushu J. Math. 48 A994), 249-255. [32] H. Federer, Geometric Measure Theory, Springer, Berlin A969). [33] U. Frisch and G. Parisi, Fully developed turbulence and intermittancy. Turbulence and Predictability of Geophysical Flows and Climate Dynamics, M. Ghil, R. Benzi and G. Parissi, eds, North-Holland, Amsterdam A988), 84-88. [34] S. Graf, Statistically self-similar fractals, Probab. Theory Related Fields 74 A987), 357-394. [35] S. Graf, R.D. Mauldin and S.C Williams, The exact Hausdorff dimension in random recursive constructions. Mem. Amer. Math. Soc. 71 C81) A988). [36] T.C. Halsey, M.H. Jensen, L.P. Kadanoff, I. Procaccia and B.J. Shraiman, Fractal measures and their singularities: the characterization of strange sets, Phys. Rev. A 33 A986), 1141-1151. [37] F. Hausdorff, Dimension und ausseres Mass. Math. Ann. 79 A919), 157-179. [38] D. Hensley, A polynomial time algorithm for the Hausdorff dimension of continued fraction Cantor sets, J. Number Theory 58 A996), 9-45. [39] J.D. Howroyd, On dimension and on the existence of sets of finite positive Hausdorff measure, Proc. London Math. Soc. 70 C) A995), 581-604.
Fractal measures 1053 J.D. Howroyd, On Hausdorff and packing dimension of product spaces. Math. Proc. Cambridge Philos. Soc. 119 A996), 715-727. X. Hu and S.J. Taylor, The multifractal structure of stable occupation measure. Stochastic Process. Appl. 66A997), 283-299. B.R. Hunt and V.Y. Kaloshin, How projections affect the dimension of fractal measures, Nonlinearity 10 A997), 1031-1046. J.E. Hutchinson, Fractals and self-similarity, Indiana Univ. Math. J. 30A981), 713-747. M. Jarvenpaa, Hausdorff and packing dimensions, intersection measures, and similarities, Ann. Acad. Sci. Fenn. Math. 24 A999), 165-186. H. Joyce and D. Preiss, On the existence of subsets of finite positive packing measure, Mathematika 42 A995), 15-24. J.-P. Kahane, Some Random Series of Functions, Cambridge University Press A985). J.-P. Kahane, Sur la dimension des intersections, Aspects of Mathematics and its Applications, North- Holland Math. Library, Vol. 42, North-Holland, Amsterdam A986), 419-430. J.-P. Kahane and J. Peyriere, Sur certaines martingales de Benoit Mandelbrot, Adv. Math. 22 A976), 131- 145. A. Katok and B. Hasselblatt, Introduction to the Modem Theory of Dynamical Systems, Cambridge University Press A995). R. Kaufman, On the Hausdorff dimension of projections, Mathematika 15 A968), 153-155. J. King, The singularity spectrum for general Sierpinski carpets. Adv. Math. 116 A995), 1-11. O. Ladyzhenskaya, Attractors for Semigroups and Evolution Equations, Cambridge University Press A991). A.O. Lopes, The dimension spectrum of the maximal measure, SIAM J. Math. Anal. 20 A989), 1243-1254. B.B. Mandelbrot, Intermittent turbulence in self-similar cascades: divergence of high moments and dimension of the carrier, J. Fluid Mech. 62 A974), 331-358. B.B. Mandelbrot, Les Objects Fractals: Forme, Hasard et Dimension, Flammarion A975). B.B. Mandelbrot, The Fractal Geometry of Nature, W.H. Freeman A982). B.B. Mandelbrot, Multifractals and \/f Noise, Springer A999). B.B. Mandelbrot and R. Reidi, Multifractal formalism for infinite multinomial measures. Adv. Appl. Math. 16A995), 132-150. J.M. Marstrand, The dimension of Cartesian product sets, Proc. London Math. Soc. 50 A954), 198-206. PR. Massopust, Fractal Functions, Fractal Surfaces and Wavelets, Academic Press A994). P. Mattila, Hausdorff dimensions and capacities of intersections of sets in ?-space. Acta Math. 152 A984), 77-105. P. Mattila, Geometry of Sets and Measures in Euclidean Spaces. Cambridge University Press A995). R.D. Mauldin and M. Urbanski, Dimensions and measures in infinite iterated function systems, Proc. London Math. Soc. 73 C) A996), 105-154. R.D. Mauldin and M. Urbaiiski, Confornwl iterated function systems with applications to the geometry of continued fractions. Trans. Amer. Math. Soc. 351 A999), 4995-5025. R.D. Mauldin and S.C. Williams, On the Hausdorff dimension of some graphs. Trans. Amer. Math. Soc. 298A988), 793-803. R.D. Mauldin and S.C. Williams, Hausdorff dimension in graph directed constructions. Trans. Amer. Math. Soc. 309A988), 811-829. H. McCluskey and A. Manning, Hausdorff dimension for horseshoes, Ergodic Theory Dynam. Systems 3 A983), 251-260. C.T. McMullen, The Hausdorff dimension of general Sierpinski carpets, Nagoya Math. J. 96 A984), 1-9. P. Morters and D. Preiss, Tangent measure distributions of fractal measures, Math. Ann. 312 A998), 53-93. L. Olsen, Random Geometrically Graph Directed Self-similar Multifractals, Longman A994). L. Olsen, A multifractal formalism. Adv. Math. 116 A995), 82-196. L. Olsen, Multifractal dimensions of product measures. Math. Proc. Cambridge Philos. Soc. 120 A996), 709-734. L. Olsen, Self-affine multifractal Sierpinski sponges in R';. Pacific J. Math. 183 A998), 143-199. L. Olsen, Multifractal geometry, Progr. Probab, Vol. 46, Birkhauser B000), 3-37. T.C. O'Neil, The multifractal spectrum of quasi-self-similar measures, J. Math. Anal. 211 A997), 233-257.
1054 K.J. Falconer [76] ?. Peres and B. Solomyak, Problems on self-similar sets and self-affine sets: an update. Progr. Probab., Vol. 46, Birkhauser B000), 95-106. [77] E.A. Perkins and S.J. Taylor, The multifractal structure of super-Brownian motion, Ann. Inst. H. Poincare Probab. Statist. 34 A998), 97-138. [78] Y.B. Pesin, Dimension Theory in Dynamical Systems, University of Chicago Press A997). [79] Y.B. Pesin and H. Weiss, A multifractal analysis of equilibrium measures for conformal expanding maps and Markov Moran geometric constructions, J. Stat. Phys. 86 A997), 233-275. [80] J. Peyriere, in: Proceedings of the NATO ASI on Probabilistic and Stochastic Methods in Analysis with Applications, NATO ASI Series, Vol. 372, Kluwerf 1992), 175-186. [81] D. Preiss, Geometry of measures in R": distribution, rectifiability and densities, Ann, Math. 125 A987), 537-643. [82] D. Preiss and J. Tiser, On Besicovitch xs \ -problem, J. London Math. Soc. 45 B) A992), 279-287. [83] C.A. Rogers, Hausdorff Measures, 2nd edn, Cambridge University Press A998). [84] C.A. Rogers and S.J. Taylor, Functions continuous and singular with respect to a Hausdorff measure, Mathematika8A962), 1-31. [85] D. Ruelle, Repellers for real analytic maps, Ergodic Theory Dynam. Systems 2 A982), 99-107. [86] X. Saint Raymond and С Tricot, Packing regularity of sets in ?-space. Math. Proc. Cambridge Philos. Soc. 103A988), 133-145. [87] N.-R. Shieh and S.J. Taylor, Logarithmic multifractal spectrum of stable occupation measure. Stochastic Process 75 A998), 249-261. [88] K. Simon, Hausdorff dimension for noninvertible maps, Ergodic Theory Dynam. Systems 13 A993), 199— 212. [89] D. Simpelaere, Dimension spectrum of Axiom A diffeomorphisms I, II, J. Stat. Phys. 76 A994), 1329-1358, 1359-1375. [90] D. Sullivan, Entropy, Hausdorff measures old and new, and limit sets of geometrically finite Kleinian groups. Acta Math. 153 A984), 259-277. [91] S.J. Taylor and C. Tricot, Packing measure and its evaluation for a Brownian path. Trans. Amer. Math. Soc. 288 A985), 679-699. [92] S.J. Taylor, The measure theory of random fractals. Math. Proc. Cambridge Philos. Soc. 100 A986), 383- 406. [93] R. Temam, Infinite-Dimensional Dynamical Systems in Mechanics and Physics, Springer, Berlin A988). [94] С Tricot, Two definitions of fractional dimension. Math. Proc. Cambridge Philos. Soc. 91 A982), 54-74.
CHAPTER 26 Positive and Complex Radon Measures in Locally Compact Hausdorff Spaces T.V. Panchapagesan* Departamento de Matemdticas, Facultad de Ciencias, Universidad de los Andes, Merida, Venezuela E-mail: panchapa@ciens.ula.ve Contents Introduction 1057 1. Preliminaries 1057 2. Regular extensions of (positive) measures 1059 3. Complex Radon measures and their properties 1062 4. Regular extensions of positive and complex measures 1068 5. Bounded complex Radon measures 1071 6. Characterizations of complex Radon measures 1073 7. Isomorphic representations of /CG")*,/CG". R)*,/CG")*, and /CG\R)*, 1077 8. Applications 1081 9. Generalization to Radon vector measures 1086 References 1090 *Supported by the C.D.C.H.T. project C-845-B-97 of the Universidad de los Andes, Merida, Venezuela. HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1055
This page intentionally left blank
Positive and complex Radon measures 1057 Introduction If ? is a right continuous complex function of finite variation in R" (i.e., ? is of bounded variation in each bounded interval of R"), then ? induces a complex Lebesgue-Stieltjes measure ??? on R", whose domain is a S -ring ? ? containing all the compact subsets of R". If ft = МФ П ?(R"), then it is well known (see McShane A944) for the real case and Panchapagesan A991a) for the complex case) that ? ? (respectively ???) is the Lebesgue completion of ft (respectively ш*|тг) with respect to ?(???\??, ft) (respectively toft). Conversely, let ? be a complex measure on a <5-ring V containing the compact subsets of R" and let ft = V ? ?(R") = {E e B(M."): |?|*(?) < oo), where |?| = ?(?, V) and |?|* is the outer measure induced by |?|. If ? and V are the Lebesgue-completions of ?\? and of ft, respectively, with respect to ft and ?\??, then there exists a right continuous complex function ? of finite variation in R" such that ? ? = V and ? = ???. This result is essentially the same as Theorem 54.2 of McShane A944), where McShane considers ? to be real. Since the positive and complex Lebesgue-Stieltjes measures on R" are precisely the positive and complex Radon measures on R", respectively, in the sense of Bourbaki A965a), the question arises whether one can generalize the above results to Radon measures on a locally compact Hausdorff space ?. This question is answered in the affirmative in Panchapagesan A992, 1993). The object of the present chapter is to give a detailed exposition of these results. Also a section is included to discuss some results obtained in Panchapagesan A998a, 1998b) as applications of the isometric isomorphism theorem, i.e., of Theorem 7.3. Finally, the last section is devoted to state some results which generalize those in Sections 3,6 and 7 to Radon vector measures of Thomas A970). By positive (respectively complex) measures we mean countably additive non negative valued (respectively complex valued) set functions defined on a ring of sets. Unlike Bourbaki A965a), the continuous linear functionals on K,(T) (see Section 1) are not called real or complex measures. 1. Preliminaries In this section we fix the notation and terminology. Also we give some definitions and results from the literature. ? denotes a locally compact Hausdorff space. С is the family of all compact subsets of T. CC(T) = {f:T -> C, / continuous, supp/ e C] is a normed space with the supremum norm || ¦ Ц7- where ||/||r = supteT\f(t)\. For each С e C, CC(T,C) — {/ e CAT): supp/ с С}. For С е С, let IC-CAT,C) ^ CAT) be the canonical injection. Let ? be the inductive limit locally convex topology on CAT) induced by the family {CAT, C), Ic)ceC where each CAT, C) is provided with the topology ?„ of uniform convergence. Then we denote (CAT), ?) by K,(T). Clearly, ? is a Hausdorff topology. fC(T, R) = {/ e K,(T), f real} and fC+(T) = {/ e K,(T): f > 0). K,(T, R) is endowed with the relative topology induced by ?. ?,(?)* (respectively K,(T, R)*)
1058 Т. V. Panchapagesan is the topological dual of the locally convex Hausdorff space K,(T) (respectively /CG\R)). A linear functional ? belongs to K,(T)* if and only if, for each С е С, there exists a constant Kc such that |0(/)| sC Kc\\f\\? for all / e CC(T,C). Given ? e IC(T)*, we define the conjugate ? of ? by 0(f) = 0(f) for / e /С(Г). 0 е K,(T)* is said to be real if 0 = ?. Then it can be shown that 0 e K,(T)* is real if and only if 0(/) is real for all / e K,(T, R). ±@ + 0) and ^ @ - ?) for 0 e /С(Г)* are continuous real linear functionals and are called the real part of 0 and the imaginary part of 0, respectively; are denoted by Re 0 and Im 0, respectively. A linear functional 0 on /C (T) is said to be positive if ? (/) ^ 0 for / e /С+(Г). Then it can be shown that all positive linear functionals on K,(T) belong to K,(T)*. Given ? & ?,(?)*, there exists a unique positive linear functional |0|, called the absolute value of ?, on K,(T) given by |0|(/)= sup |0(g)| A.1) |«|5?/. «e/C(D for/e/C+(r). /С(Г, R)* is a boundedly complete vector lattice under the partial ordering ?\ ^ 9j given by 0|(/) ^ 6»2(/) for all / e /С+(Г). For 0 e /С(Г, R)*, 0+ = max@, 0) and ?~ = -min@,O) and ? has the canonical decomposition ? = ?+ - ?~. ? e ?,(?, R)* has a unique extension to K,(T) as a continuous linear functional and we denote it also by ? and \0(f)\<W\{\f\), fefC(T), where \?\ is given by A.1). When ? e ?;(?)* is real, then \в\\/С(т.Ш) = Щка.Ш))+ + @/CG\R))-- Given 0 e /С(Г)*, by an abuse of notation we denote the unique extensions of ((Refl)|x;G-.K))+ and ((Re0)|/c(r.K))_ by (Re^)+ and (Re^)", respectively. Similarly, are defined (Im0)+ and (Im0)_. Thus, ? = {(Re0)+ - (Re0)-} + /{(Im0)+ - (Im0)-}. For the above definitions and results the reader is referred to Bourbaki A965a, Chapter III, § 1) and Panchapagesan A991 a, Chapter 6, §41). Co (respectively U) is the family of all compact G& (respectively open) subsets of T. For a non void class ? of subsets of ?, ? (?) (respectively ? (?)) denotes the <5-ring (respectively ?-ring) generated by S. B(T) = ?(??) (respectively BC(T) = a{C), BQ(T) = a(CQ)) is the family of all Borel (respectively ?-Borel, Baire) subsets, of T. A subset ? of ? belongs to BC{T) if and only if it is Borel and ?-bounded (in the sense, it is contained in a countable union of compact sets) and hence the nomenclature for the members of BC(T). Given a real measure ? on a ? -ring V, ? has the Jordan decomposition ? = ?+ — ?- with ?+ and ?~ being finite positive measures so that ? is of finite variation on V. Moreover, if ? = ? - ?, ?, ? positive measures, then ?+ ^ ? and ?~ ^ ? (see p. 50 of Dinculeanu A967)).
Positive and complex Radon measures 1059 The proof of Theorem III.5.17 of Dunford and Schwartz A958) can be adapted to prove the following proposition which gives the concept of Lebesgue completion of a <5-ring. PROPOSITION 1.1. Let ? be a complex measure on a S-ring V. Let V* = {E U ?: ? e V, N CM eV with v(v, V)(M) = 0). For such ? U N eV*, let v(E U N) = v(E). Then V* is a S-ring, V* D V, ? is well defined, v\p = ? and ? is a complex measure on V*. ? (respectively V*) is said to be the Lebesgue completion of ? (respectively V) with respect to V (respectively v). By a measure (on a ring) we mean a positive measure. From Chapter IV of Bourbaki A965a) we have the following theorem. The reader may also refer to Hewitt and Stromberg A965), Naimark A959), Panchapagesan A991 a, 1991 b) and Rudin A967). THEOREM 1.2. Let ? be a positive linear functional on CC(T). Let M+(T) = {f:T -> [0, oo], / lower semi-continuous], T+(T) = {f:T -> [0,oo]} and C+(T) = {/ e CAT): f > 0). Let ?*(?)= inf sup{0(VO: t^g. феС+(Т)} for ? С Т. Then: (i) ?? is an outer measure on V(T). (ii) Let ??* =(?сГ: ? is ?%-measurable]. Then ??* is ? ? -algebra and contains B(T). We denote ?^\?t, by ??. (Hi) д*(С) <оо, СеС. (iv) ?*?(?) = ???{?*?(?): EcUeU},EcT. (?) ?*(?) = sup^tC): С С Е, С е С] if ??*(?) is ?-finite and E e ??* or if Ее U. (vi) Given ? e ??* with ?*?(?) ?-finite, then there exist А, В е B(T) such that А С ? С ?, ? ?-compact and ?%(?\?) = 0. Consequently, ?^(?) = ?^(?). (vii) Let ?%* = ?*?\?{?? Then ?(?) = 1?<1?**, feCAT). Moreover, if ? is a measure on B(T) satisfying (iii)-(v) above and if 9(f) = fTfdv, f e CC(T), then ? = ?^0?. ?^0? is called the Borel-Radon measure induced by ?. 2. Regular extensions of (positive) measures We introduce several notions of regularity for measures defined on certain <5-rings or ?- rings of subsets of ? and study the existence and uniqueness of regular extensions. These results play a key role in the next section.
1060 Т. V. Panchapagesan ???????? 2.1. A measure ? on Bq(T) (respectively on BC(T), on B{T)) is a Baire (respectively ?-Borel, Borel) measure if ?@ < oo for С e Co (respectively С е C). Dehnition 2.2. Let Tl be a ring of subsets of ? with 5(Co) С U or 5(C) С U. A measure ? defined on 7? is said to be 7^-regular if (i) ?@ <oo, CeCilft; (ii) ?(?) = ?{?(?/): ? С ?/ e U ? 7г} for ? е 7г; and (iii) ?(?) = sup^(C): С С ?, С е С ? 7г} for ? е 7г. A complex measure ? on 7^ is said to be 7?-regular if, given ? e 1Z and ? > 0, there exist С eTZnC and U e Ti f)U such that С С ? с ?/ and \v(G)\ < ? for all G e 7г with GcU\C Dehnition 2.3. A Borel measure ? on ?(?) is said to be Radon-regular if (i) ?(?) = ???{?(?): ? С U е Щ for ? е ?(?); and (ii) ?(?/) = 5??{?@: С С ?/, С е С) for U e W. Using Proposition 17, Paragraph 15 of Dinculeanu A967), it is easy to prove the first three statements of the following proposition while the fourth is due to Theorem 10.30 of Hewitt and Stromberg A965). Proposition 2.4. (i) A Baire measure is Bo(T)-regular. (ii) A measure ?? on S(Cq) is S(Co)-regular. (iii) A complex measure щ on S(Cq) is S(Co)-regular. (iv) If ? is a Radon-regular measure on B(T) and ? e B{T) with ?(?) ?-finite, then ?(?) = $\}?{?(?): С С ?, С еС}. The following proposition is the same as Proposition 11 in Dinculeanu A967, § 14). PROPOSITION 2.5. If С С U, С e С and U e U, then there exist Co e Co and Щ e U П <5(Co) such that С С U0 с Со С U. Using Propositions 2.4(iv) and 2.5 and the fact that each ? e BC(T) is ?-bounded, one can easily prove the following LEMMA 2.6. Let ? be a Radon-regular measure on B(T). Then: (i) ifv = ?\^?· Men v is S(C)-regular; (ii) if ? = ?|??G·), then ? is Bc (?)-regular. The first part of the following lemma is due to Theorem 56.E of Halmos A950). For С e Co, we can construct a sequence (/„) с C+(T) such that /„ \ ?? and by the Lebesgue dominated convergence theorem ?@ = lim„ fT f„ ?? for a Baire measure ?. Using this observation and Proposition 2.4(i) we have the following
Positive and complex Radon measures 1061 LEMMA 2.7. (i) Let ?? and ?? be Bc(T)-regular (respectively (ii) Baire) measures. If f ???\ = ? /??2, feCAT) B.1) Jt Jt then ?? = ??- THEOREM 2.8. Let ?? be a Baire measure on T. Then: (i) ?? has a unique extension ?<- to BC(T) such that ?(- is a Bc(T)-regular measure; (ii) ?? has a unique extension ? to B(T) such that ? is a Radon-regular measure; (iii) If?(· and ? are as in (i) and (ii), then ??- = ?\?,(?)- PROOF, (i) is due to Theorem 54.D of Halmos A950). For proving (ii), let 0(f) = ffdw, /еСДГ). B.2) Then by Theorem 1.2 there exists the Borel-Radon measure ?°??? = ?(say) induced by ? and 9(f) is also given by B.2) with ?? replaced by ?. If ? = ?\?(^?), then by Lemma 2.7(H) ? = ?? and, hence, ? extends ??. The uniqueness of ? follows from the uniqueness part of (i), from Lemma 2.6(ii) and from (iv) and (v) of Theorem 1.2. D Taking 9(f) = fT f ??, f e CC(T), where ? is a ??<(r)-regular measure and arguing as in the proof of Theorem 2.8 applying Lemmas 2.6(ii) and 2.7(i) we obtain the following: THEOREM 2.9. Every Bc(T)-regular measure has a unique extension ? to B(T) as a Radon-regular measure. THEOREM 2.10. Let ? be a S(C)-regular measure. Then: (i) the unique extension ? of ? to BC(T) as a measure is Bc(T)-regular; (ii) ? admits a unique extension ? to B(T) as a Radon-regular measure and ?\?? (j) = ?, where ? is as in (i). PROOF, (i) As ? is finite on 8(C), ? admits a unique extension ? to a(C) = BC{T) as a measure. Clearly, ?@ < oo for С e С If ? e BV(T) with ?(?) — oo, then obviously Definition 2.2(ii) holds for ?(?) with TZ = BC(T). Let ?(?) < oo and let ? > 0. As ? is ?-bounded, there exists (C,,)^ С С such that ? С UГ с« ¦ By hypothesis, there exists U„ e U ? 8(C) such that ? ? C„ С U„ and ?(?/„) - ?(? ? C„) < ?/B"), ? e N. Let U = U?° U„. Then ? cU eUn BC(T) and ?(t/) - ?(?) < ?. Thus ?(?) satisfies Definition 2.2(H) with П = BC(T). With C„ as above, let E„ = \J" ? ? С*. Then (?„)f С 5(C) and ?„ / ?. Then ?(?) = sup,, ?(?„) = sup,, sup^(C): С С ?„, С eC}^ sup{M(C): С С ?, С е С) < Д(?), since ? is <S(C)-regular. Hence, ? is ?(·(?)- regular. (ii) Taking ? as in (i) and applying Theorem 2.9 to Д, we obtain a Radon-regular extension ? of ? and, hence, of ?, to B(T). Moreover, ? is unique by the uniqueness part of (i) and that of Theorem 2.9. ?
1062 ?, V. Panchapagesan If ? is a complex valued additive set function on a ring of sets TZ, then it is well known that sup |v(F)|<u(v, ¦?)(?)< 4 sup |v(F)|. B.3) FcE.FeTZ FcE. Fell Using B.3) one can prove the following PROPOSITION 2.11. Let TZ be a S-ring of subsets ofX with 8(C) cTZor 5 (Co) С И and let v, vj, i = 1,2, be complex measures on TZ. Then: (i) If v\ and V2 are TZ-regular, then a\v\ + a2v2 is also TZ-regular, where a\ and a2 are scalars. (ii) ? is TZ-regular if and only ifv(v, TZ) is TZ-regular. (iii) Ifv(v,TZ) is TZ-regular, then Rev, Im v, (Rei>)+, (Rev)", (Imv)+ and (Imv)" are TZ-regular. (iv) If\v\ (E)\ ^ \v2(E)\for all ? еЯ and ifvi is TZ-regular, then v\ is TZ-regular. 3. Complex Radon measures and their properties With each ? e fC(T)* we associate a unique complex measure ?? defined canonically on a <5-ring Me containing 8(C) and study the properties of ?? and Mq. The following result is the same as Proposition 15 in Bourbaki A965a, Chapter IV, §1). PROPOSITION 3.1. For the positive linear junctionals ?\ and 02 on CC(T) we have: @ ?*??+?2=?*??+??. (ii) //0| ^??,????*?? <?*,. For a positive linear functional ? on CC(T), let us recall from Theorem 1.2 that ?? = ?*\? t and ?^0? = ?*?\?(?)- ??°? is tne Borel-Radon measure induced by ?. THEOREM 3.2. A measure ? on B(T) is Borel-Radon in the sense that there exists a positive linear functional ? on CC{T) such that ? = ?^?? if and only if it is Radon-regular. PROOF. Clearly, the condition is necessary by Theorem 1.2. Conversely, let 9(f) = fT f ??, f e CC(T). Then ? is a positive linear functional on CC(T) and by the uniqueness part of Theorem 1.2(vii), the condition is also sufficient. ? From Theorem 1.2 we have the following: PROPOSITION 3.3. Let ? eK,(T)* and let ?\ =Re# ande2 = lm0. Then: (i) If ?? = [A CX: ??+(?) < ??, ??-(?) < oo, j = 1,2), then Me is a 8-ring and contains 8(C). (ii) For ? eMo, let ??(?) = {(??+ -??-) + '?{??+ -??-)}(?). Then ?? is a complex measure on Mq.
Positive and complex Radon measures 1063 (iii) If ? is real, then ?? is real. (iv) If ? is positive, then ?? is positive and finite. Definition 3.4. Let ? e K,{T)*. The complex measure ?? in Proposition 3.3 is called the complex Radon measure induced by ? and Me is called the domain of ??. Notation 3.5. Let ? e K,{T)* and let ?? be the complex Radon measure induced by ?. We denote ?(??, ??) by \??|. If ? is a real measure on a <5-ring ??? and 1Z\ is another <5-ring contained in 7^2, then (v,n\,n2)+(E) = %\^\v(F): FcE, F еП\} and (?,?],?2?(?) = -mf{v(F): ? С ?. F eTZ\} for ? e IZi. If ? is a real or complex measure on 7^2, then ?{?,?\,??2){?) In ? 1 J]| v(Ei)\: (?,)',' С 7г,, ?, П ?7¦ = 0. i ? j, {J ?, С ? for ? e 7г2. THEOREM 3.6. ?*>? # e /С(Г)*, # rea/. Then the following assertions hold: (i) ?? is Mg-regular. (ii) ?^ is of finite variation on Me- (iii) ?^ ^ ?^+ a/id ?~ ^ ?^- /и ??^, where ?? = ?^ - ?^ " f^e Jordan decomposition of ?? in the ?-ring Me. (iv) \??\, ?^ and ?~ are Мд-regular. (v) ?-tWo = (??\?(?)+· Мн U@ = (ЫаюГ ??^^(?«??(?·5(^))= Im«IIs<C)· (vi) lM«IU(C). М<»Ъ(С)> M^li(C) and ??\?(? are S(C)-regular (vii) (a) (???,{(?.??)+(?) = ^(?),?????. (b) (??, 5(C), ??»?(?) = ?~(?), ? e ??«. (c) и(д„,а(С),Л*„)=|Ы(Е). ЕеМв. (viii) G/ve/г ? е Мв, there exist A, B eK = ?(?) ? Mff яис/г ?/??? А С ? С В and ??+(?\?) = ??-(?\?) = 0. Then ??~(?) = ??+(?), ??-(?) = ?„-(?) and ?? (respectively Me) is the Lebesgue completion of ??\?? {respectively of TV) with respect to ?? {respectively to ?? \ц). PROOF. By Theorem 1.2, ??+ and ??- are Me-regular and hence ?? and \??\ are also ???-regular by Proposition 2.11. Since Me is a <5-ring and ?? is a real measure, (ii) holds. As ?? = ??+ - ??-, (iii) holds. As ?? + and ??~ are ??^-regular, (iv) holds by the inequalities in (iii) and by Proposition 2.1 l(iv). Thus (i)—(iv) hold.
1064 T.V. Panchapagesan (v) Let F e 8(C) and ? > 0. By (iv), given F e ?#, there exists Cf eC such that CfcF and \^f,\(F\CF) < ?. Thus, (VeWo)+(E) = sup{Mff(F): Fc?. F e 5(C)} > sup^«@): FcF, FeMff} > sup{Mff(F)-?: FcF, F e ??«} = ?+(?) - ?. Hence, (??\?@))+ ^ M^U(C)- Since the reverse inequality is obvious, this proves the first equality. Similarly, the remaining two equalities can be proved. (vi) Clearly, it suffices to show that |Мб> ИгсС) is <5 (С)-regular. Let ? e 8(C) and ? > 0. By (iv) there exist U e U ? ?? and С е С such that С С Е с i/_and \??\(U\C) < ?. As F е С, by Proposition 2.5 there exists i/()eWn 8(C) such that F С t/(). If V = U ? ?/(), then Fc V eWn<5(C)and |ms|(V\C) < ?. (vii) (a) Let F e M#. By (iv) and (v) we have = sup{M+(C): С СЕ, С eC}= sup sup^F): F С С, F e 5(C)} Ccfc". CeC ^sup{Mfl(F): FCF, F e<5(C)} iC sup{M„(F): FcF, F e ??} = ?+(?). Similarly, (b) is proved by using the ?? -regularity of ?~. (vii) (c) For F e ??, given ? > 0, there exists a partition {Ej}" of F in ?? such that " 1 ]P|Mfl(?i)| > lMff|(F)- -?. By (i), for each / there exists Cj e С such that С, с F, and |m«(G)| < ?/(In) for GeMfl with G С Ei\d. Then ?',' 1д»(С/)| ^ ?'? 1д»(Е/I - ?/2 > \Чв\{Е) - ?. Thus ?(??, 5(C), My)(E) > |Mfl|(F). Since the reverse inequality is obvious, (c) holds, (viii) This is immediate from (iii) and from Theorem 1.2(vi). ? THEOREM 3.7. Let0e K,(T)*, ? real and let ?+ and ?? be as in Theorem 3.6. Then: @ ME~ls(C) an^ Mfl la(C) a^w/f unique extensions ?^ ami ?,7> respectively, to B(T) as Radon-regular measures and ?^~(?) = ?^(?), JLtff~(F) = ?# (?) /or F e ??(?)?????. (ii) If ? = \??\\?(?> then ? has a unique extension ? to B(T) as a Radon-regular measure and 0(F) = \??\(?)/?? ? e BC(T) ПМ«. (iii) There exists a unique positive linear functional ? on CC(T) such that ? = ?^??, where ? is as (ii). (iv) \?\ ^ ?, where ? is as in (iii). (?) ??+ = ?+ and ??~ = ?~ in ??. (vi) Мв = Мщ and \??\ = ?\?\.
Positive and complex Radon measures 1065 PROOF, (i) The first part is due to Theorems 3.6(vi) and 2.10(ii). By Lemma 2.6(ii) and by Theorem 3.6(iv) we have ? + (?) = sup{ji+(C): СсЕ, С eC}= sup^+(C): ССЕ, С еС) = ?+(?) for ? e BC(T) П ??. Similarly, the other result holds, (ii) The proof is similar to that of (i). (iii) This is due to Theorem 3.2. (iv)Let / e C<(X) with suppf = C. Then \0(f)\ = I f fd^e\B[C])\ < ? \f\dv{M\Bio.B(C)) \Jc I Jc < j \f\dvfa\slC). 8(C)) < f \?\?\?„\ = f \f\dv = f\f\dj$* = ,l,{\f\) and, hence, by the definition \?\ (see A.1)), (iv) holds. (v) By (iv) and by Proposition 3.1, ?*^ ^ ?*? and ц.щ = ?«+ + M«- in ? (?) П ??. Consequently, by (ii) and (iii) and by Theorem 3.6(iii) we have Ш(Е) = (?+ +?")(?) < (??++ ??-)(?) = ?„,(?) < ?^'?(?) = \??\(?) for ? еВс(Т) П ??. Hence, ?[?\(?) = \??\(?) for ? e ?,.(?) П ??. Now let ? eMe. Then by Theorems 3.6(viii) and 1.2( vi) there exist А, В е ?((?)?)?? with A ?-compact such that А С ? С ? and ??+(?\?) = ??-(?\?) = 0. Hence, by Proposition 3.1(i) we have pm(B\A) = 0. Consequently, ? = A U (E\A) e Мщ. Thus Aifl с Мщ. Moreover, as A e BC(T) П M«, by Theorem 3.6(iii) we have ?\?\{?) = ????(?) = \??\(?) = ?+(?) + ?-(?) = ?+(?)+??(?) = \??\(?) and hence М|«||мн = |?#|. Therefore, by Theorem 3.6(iii), we have |?«| = ?^ + ?^ ^ ?«+ + M^- = M|fl| = |М«1 in Mff. This shows that ??+ = ?+ and ??- = ?„ in M«. Hence (v) holds. (vi) In the proof of (v) we showed that Мн С Мщ. If ? e Мщ, then by Theorem 1.2(vi) there exists А, В еВ(Т)Г\Мщ such that А С ? С S, ?|«|(?\?) = 0 and ?|«|(?) < oo. Then by Theorem 3.6(iii) and Proposition 3.1 (ii), it follows that A and E\A belong to ?? and hence ? e M«. Thus (vi) holds. ? Now let us consider an arbitrary ? e K,(T)*. THEOREM 3.8. Let ? e Х](Г)*. Г/геи the following assertions hold: (i) ?^ /? ??-regular. (ii) ?^ ? of finite variation in M#.
1066 Т. V. Panchapagesan (iii) \??\ is Mo-regular. (iv) u(Mfl|a(C),5(C)) = |Mfl||a(C). (v) |Мй11г(С) is &(C)-regularand hence ??\&^) is S(C)-regular. (vi) и(дй,5(С), Mfl) = |Mfl|. (vii) G/ven ? e Mfl, ?/геге exist A,B eTL = B(T) ? Mff яис/г ?/??? ? С ? С В and \??\(?\?) = 0. Consequently, ?$(?) = ?$(?) and ?? (respectively ??) is the Lebesgue completion of ??\?? (respectively ??) with respect to TL (respectively ??\?)- (viii) If ? = \??\\$(?· then ? has a unique Radon-regular extension ? to B(T) and HE) = \??\(?)? ? e B,(T) П ??. (ix) There exists a positive linear functional ? on CC(T) such that 0 = ?^??, where ? is as in (viii). Moreover, \?\ ^ ?. PROOF. Let ?\ = Re# and ?2 = Im#. As ?? = ??? ? ??2 and U\ ? U2 e W ? ?? for Uj еИП ??[ for / = 1,2, (i) holds by Theorem 3.6(i). (ii) holds as ?? is a 5-ring. By (i) and Proposition 2.11, (iii) holds. The proof of Theorem 3.6(vii)(c) holds here verbatim to prove (vi). (iv) is immediate from (vi). The proof of (v) is similar to that of Theorem 3.6(vi) but for the fact that we have to invoke (iii) of the present theorem instead of (iv) of the said theorem. Thus (i)—(vi) hold. (vii) Let ? e ??. Since ??+(?) < oo and ??-(?) < oo for j = 1,2, by Theorem 1.2(vi) there exist A.fie BC(T) П Me such that А С ? С В, А ?-compact, and ??+(?\?) = ??7(?\?) = 0 for j = 1,2. Thus \??\(?\?) = 0. Also by (vi), |/ie|(F) = ?(?? 1тг, K)(F) for F e K. Thus (vii) holds. (viii) The proof is similar to that of Theorem 3.7(ii). (ix) The first part is due to Theorem 3.2. As in the proof of Theorem 3.7(iv), for / e CC(T) with supp / = C, we have \0(f)\ = \S ?????€)+? f /???,\???\^ ? \?\??(??\?(.0,?@) \Jc Jc I Jc ^J \f\dv = J \/\??^ = ?{\?\) and, hence, by A.1) we have \?\ < ?. ? Theorem 3.9. Let ? e fC(T)*. Then: (i) |де(С)|^Д|в|(С)./ЬгСеС. (ii) For A e Me, \??\(?) = 0 if and only if ?\?\(?) = 0. (iii) ??? = ?|?| and \??\ = ?\?\. PROOF. Let ?\ and вг be as in the proof of Theorem 3.8. Let С eC. Then by Theorem 1.2, ??+1Me and M#-1Ms are ??# -regular for ; = 1,2. Therefore, given ? e N, there exists t/„ e W П ??„ such that С С ?/„, ??+(?„\0 < \/? and ??-(?„\0 < l/и for ;' = 1,2. Let W„ = ?? Uk and W = ПГ W„. Then W e M« as ?? is a <S-ring. Moreover, С С W and
Positive and complex Radon measures 1067 ??+(?\0 = ??? (W\C) = 0 for ; = 1,2. By Urysohn's lemma there exists /„ e C+(T) with xc ^ /„ ^ xw„- If g„ = ?'? fi· tnen gn \ Xc a.e. with respect to ??+ and ?^- for ;' = 1,2. If С ? is the support of g \, then by the Lebesgue dominated convergence theorem we have lim f gn ??^ = ?+(? and lim [ g„ ??*°.? = ??- (С) for ; = 1, 2. " Jt °J ? " JT ? ? Then as gn -> xc Mjflj-a-e-, by the same theorem we have |мй(С)| = lim / &???? = lim0(g,,) < lim |0|(g„) = lim / ?„<*?^? Iй Ус, " " " Jt = Ml«l(C). Hence, (i) holds. (ii)Let |де|(Л) = О. By Proposition 3.1 we have Mjflj(A) < (m|S,| +M|«2l)(A) = u(M«,, Мй,)(А)+и(м«:.Мй:)(А) = sup{u(Mff,,Mff,)(C): CCA, С еС) + sup{u(Mfl:,Mff2)(C): CCA, CeC} = sup{u(MSl|i(C),a(C))(C): CCA, CeC} + sup{u(MS2|i(C),a(C))(C): CCA. С e С} ^ 2|м«|(А) = О by (iv) and (v) of Theorem 3.6 applied to ?\ and ?7 and by Theorem 3.7(vi) and by (iii) and (iv) of Theorem 3.8. Conversely, let M|«|(A) = 0. Then ?\? |(A) = 0 for j = 1,2 and, hence, by Theorem 3.7(vi) we have iMfll(A) ^ ?(???,???)(?) + ?(??2,??:)(?) = ?\?,\(?) + ?\#2\(?) = 0. Hence, (ii) holds. (iii) Let ? еМв. Then, by Theorem 3.8(vii), there exist А, В e B(T) ? ?? with А с ? С S such that \??\(?\?) = 0. Then by (ii) ?]?\(?\?) = 0 and, hence, м^|(?\А) = 0. Therefore, E\A e ??|#| and hence ? e ??^| since ??+(?) < oo and ??-(?) < oo for ; = 1,2. Thus ?? с ??]#|. Conversely, let ? e M|fl|. Then by Theorem 1.2(vi) there exist А, В е ?(?) ? Aifl such that А с ? С S and ???|E\?) = 0. Thus ?\?\(?) = ?\?\(?)- Then by Proposition 3.1, ??+(?\?) = ??-{?\?) = 0 for ; = 1,2. Consequently, E\A,E e ?? and thus ?? = ??\. Let ? e ?? and let ? > 0. By Theorem 3.8(vi) there exists {f,}',1 С 8(C) with ?, П ?y = 0 for 1 ^ j and (J? Ei С ? such that ?? |м«(Е,-I > Ш(Е) - ?/2. As
1068 Т. V. Panchapagesan Mflla(C) is <5(C)-regular by Theorem 3.8(v), there exists С, е С such that С, С ?, and \??(??) - ?^@| < ?/B?) for/ = 1,2,..., п. Then by (i) we have /1 /1 5^М1«|(с,-)>^|мй(С/)| I I 1 - ? > |?«|(?) -?. Hence, ?\?\(?) > |?^|(?). By (viii) and (ix) of Theorem 3.8, ?\?\(?) ^ ?°™(?) = \??\(?) for ? e B( (?) ? Mff and, hence, ???? = |?»| ???,(?)??„. C.1) Finally, if ? e Me, then by the proof of Theorem 3.8(vii) there exist А, В e ?(?) ? Mff with Л С ? С S, A ?-compact, \??\(?\?) = 0 and \??\(?) = |?«|(?). Then by (ii) and C.1) we have \??\(?)= \??\(?) = ?]??(?) = ????(?) + ?1??(?\?) = ???\(?). Hence, (iii) holds. ? 4. Regular extensions of positive and complex measures Using the various notions of regularity given in Section 2 for positive and complex measures defined on a <5-ring 7? containing 8(C) or ? (Co), we study the regular extensions of positive and complex measures defined on ? (Co). The results of this section play a key role in the sequel. THEOREM 4.1. Let ?? be a finite (positive) measure on <5(Co). Then there exist unique extensions ?, ? and ? of ?? to <5(C), BC(T) and B(T), respectively, such that ? is <5(C)- regular, ? is Bc(T)-regular and ? is Radon-regular as measures. Moreover, ? = v\S{C) — w\HC) and ? = w\BATy PROOF. Let ?? also denote its unique extension to Bq(T) as measure. Then the existence and uniqueness of the extensions ? and ? are due to Theorem 2.8. By Lemma 2.6, ? = v\&(C) an(J ?\?(? are <5(C)-regular extensions of ?? and hence by Theorems 2.8(i) and 2.10(i), ? = v\S(C) = ?|«?- ВУ Lemma 2.6(ii) and by Theorems 2.8(i) and 2.10(i), ? = ?\???). ? LEMMA 4.2. Let ?\ and ?? be finite (positive) measures on 8(Co) (respectively 8(C)- regular measures on 5(C)). Then there exist unique extensions ?\, ?>2 and ац of ?\, ?? and ?? + ??, respectively, to 5(C) (respectively B(T)) as 8(C)-regular (respectively Radon-regular) measures and ??, = ?>? + an. ? > ^2\??(?:)\
Positive and complex Radon measures 1069 PROOF. Let ?3 = ? ? + ?? and let ?\, ?'-,, ?\ be the unique extensions of ? ?, ?? and ?^, respectively, to Bo(T) (respectively ?, (?)) as measures. Clearly, ?', = ?, + ?'-,. Let 0j(f) = fjfdp'j, f e CC(T), for ; = 1, 2, 3. Then 03 = 0| + 0?. By Proposition 3.1, M^ = M^°r + м?'г on B(T). Let ?, = ?^ if ?, is 5(C)-regular and Wj = ??><0 if M/ is defined on 5(Co). Then by Theorem 1.2, ?^? is Radon-regular and by Lemma 2.6(i), Ме°Ъ(С) is <5(C)-regular. From the proofs of Theorems 2.8 and 2.9 it follows that Wj is a unique extension of ? у with the desired properties of regularity, for j = 1, 2, 3. D Lemma 4.3. (i) If ? is a S(C)-regular real measure, let ц> = HatC»)- ^en vo = V+I^(C(,) an^ »0 = v'licC,,)- (ii) Ifv\ and vi are S(C)-regular complex measures on 8(C) and if v\ \^c{)) — V-\&(C»)· then v\ = vi. PROOF, (i) Clearly, v? iC v+\S{Ci)) and v~ ^ v_|s<c„)- Let ?, = i^latC,,) and ?2 = v~\s(C())- By Lemma 4.2 there exist unique <5(C)-regular extensions ?>(|, ?^, ?\ and ?? of v?, Vq , a)| and oil, respectively, such that ?\ + v^ = 2л + ^ on 5(C). By hypothesis and by Proposition 2.11, v+ = lu\ and v~ = аи so that ? = a>T — ?? = i>(| — vj". Hence, ?+ ^ ?^ and ?~ ^ vj" and thus (i) holds. (ii) Clearly, it suffices to prove the result for v\ and vi real. Let ?? = v\ \^co) = v2ls(C0)- By Theorem 4.1 there exist unique <5(C)-regular extensions ?(| and ?~, of ?(| and ?^, respectively, to 8(C). Now, by hypothesis and (i), v~[\&(C{)) = vtWc») = Mo so tnat ЬУ the uniqueness part of Theorem 4.1 we conclude that vt = ?(| for j = 1,2. Similarly, vj = ?^ for j = 1,2. Thus ?>? = vi. ? THEOREM 4.4. Let ??, Ц) fee complex measures on 8(Cq), ?? bemg of bounded variation on 8(Co). Г/геи: (i) ?? ?<« ? unique extension ? to 8(C) as a 8(C)-regular complex measure, ? is real (respectively positive) if vq is so. (ii) The unique extension ?? of ?? to Bo(T) as a complex measure is B()(T)-regular. (iii) ?? has a unique extension ? to BC(T) (respectively ? to B(T)) as a Bc(T)-regular (respectively B(T)-regular) complex measure, ? and ? are real (respectively positive) if ?? is so. (iv) ? = ?|?(.(?) and ?5 = ?|?()(?) = ??„G> (?) LetM =sup{u(W,,<5(C0))(?): ? e <S(C0)}. Then sup{u(/j, TZ)(E): ЕеП} = М for ? = ??\ ? or ? and К = B0(T), ВС(Т) or B(T), respectively. PROOF, (i) Let ? ? = Re ?>? and v2 = Im ц>. By Theorem 4.1 there exist unique^ (C)-regular extensions i>+ and vj of i>+ and v~, j = 1, 2 and let ? = (?* - vj") + /(?-? - ?-?)- By
1070 Т. V. Panchapagesan Proposition 2.11, ? is <5(C)-regular, extends ?>? and is unique by Lemma 4.3. The rest of (i) is obvious. Let |??? = ?(??, ? (Co))- By Theorem 17.26 of Panchapagesan A991a) there exists a unique extension ?? of ?? to Bq(T) as a complex measure of bounded variation. Moreover, |??| = ?(??, ??(?)) extends |?0| to Bq(T) and sup{|/75|(?): EeB0(T)} = M. D.1) If ?\ = Re?o and ?? = ?????, then ?5 = (?+ -?~[) + '?(?? - ??) on ?()(?), where /? + and /j" are the unique extensions of ?+ and ?~ to Bq(T) as measures for у = 1,2. (ii) By Proposition 2.4(i), ^t and /j- are ??o(T)-regular for ; = 1, 2 and hence ?? is SoG')-regular. _ _ (iii) By Theorem 2.8(H) there exist unique extensions a>; and yy of ^t and /j~, respectively, to ?(?) as Radon-regular measures. For С е С, by Proposition 2.5 there exists Co e C0 with С С Co so that w;(C) ^ ?, (Co) = /?y+(Co) ^ ?? by D.1) and similarly, yy(C) ^ M. Consequently, wj(X) ?? ?? and Yj(X) ^ ? for ; = 1, 2. Then by Propositions 2.4(iv) and 2.11, ? = (?\ — ?\) + /(?? - yi) is i3(T)-regular and extends ?? and ??- Moreover, (Reo))|^(C0) = ?\ and (ImoOlaiC,,) = >?2; ^?? and Ima> are B(T)- regular. Let a and ? be also i3(T)-regular scalar extensions of ?\ and /ji, respectively. Then, as Aта)|Во(Г) = 0, we have fTfd(lma)+ = fT f d(\ma)~, for / e CC(T) and hence by Lemma 2.7(H), (Ima)+ = (Ima)" so that a is real. Similarly, ? is real. Clearly, ?+ +a~\S(c0) = ^7 +а+Ыс»)- Since a+ and a" are the unique Radon-regular extensions of their respective restrictions to <5(Co), by Lemma 4.2 we have ?\ + a~ = ?\ + a+ and hence a = ?\ — ?\ = Re ?. Similarly, ? = Ima> and hence ? is unique. Taking ? = ?|?,G> by Lemma 2.6(H) we observe that ? is a Bt (T)-regular extension of ??. As ?|^(? is <5(C)-regular by Lemma 2.6(i), the uniqueness of ? follows from Lemma 4.3(H). From the above proof it is clear that ? and ? are real (respectively positive) if ?? is so. (iv) This follows from the uniqueness of ? and ? and from their definition. (v) By applying the above extensions of |??| to BC(T) and B(T) we deduce the results from D.1) from Proposition 2.5 and from Definition 2.2(iii). The proof of the following theorem is similar to that of Theorem 4.4 and, hence, omitted. ? THEOREM 4.5. Let ? be a S(C)-regular complex measure of bounded variation with sup{u(M, S(C))(E): ? e S(C)} = M. Then: (i) The unique extension ? of ? to BC(T) as a complex measure is В'С(Т)-regularand is real (respectively positive) if ? is so. (?) ? has a unique extension ? to B(T) as a B(T)-regular complex measure and ? is real (respectively positive) if ? is so. Moreover, ? = ?|?, G> (iii) sup{v(?, BC(T))(E): ? e BC(T)} = sup{v(cu, B(T))(E): ? e B(T)} = ?. COROLLARY 4.6. Every complex measure ?? on Bq(T) has a unique extension ? to BC(T) (respectively ? to B(T)) as a Bc(T)-regular (respectively B(T)-regular) complex
Positive and complex Radon measures 1071 measure and ? (respectively ?) is real if ?? is real and ? (respectively ?) is positive if ?? is positive. Moreover, sup{v(n,U)(E): ?eft}=sup{u(Mo,?o(O)(?): ? e B0(T)} < oo where ? = ? or ? and ?? = BC(T) or B(T), respectively. 5. Bounded complex Radon measures In this section we give several characterizations of a bounded linear functional ? e fC(T)*. Definition 5.1. For ? e K,(T)*, let ||<9|| = sup{|<9(/)|: / e K,(T), \\f\\T ? 1}. ? e K,(T)* is said to be bounded if ||#|| < oo. Note that ? e K,(T)* is bounded if and only if ? e (CC(T), \\ ¦ \\T)* = (Со(Г), || ¦ ||r)* where (Со(Г), || ¦ Ц7-) is the Banach space of all continuous complex functions vanishing at infinity in T. Notation 5.2. The linear subspace of all bounded ? e K,(T)* (respectively ? e fC(T, R)*) is denoted by К,(Щ (respectively K,(T, Щ). Definition 5.3. A complex Radon measure ?? on ? is said to be bounded if sup{|/z»(E)|: EeS(C)}<oo. Definition 5.4. For ? elC(T)* we define \\??\\ = sup{v^e\S{C].8(C))(E): Ее 8(C)}. Lemma 5.5. Let0efC(T)* and let E e 5(C()). Then u(Mflla(C„), <HCo))(?) = ?(??,??)(?) = ?\?\(?). In particular, ?(?? |5(C|)), &(Co))(E) = ?(??\?(?< <5(C))(?). PROOF. Let ? = м^ад,) and let |v| = v(v, 5(C())). By Theorems 3.2 and 4.1 there exist a unique Radon-regular extension |i>| of |i>| to B(T) and a positive linear functional ? on Cc-CT) such that ? = ?^0?. Let / e CC(T) with supp / с С e Co- Since / is integrable with respect to every Baire measure (see p. 241 of Halmos A950)) and since ?()(?) ? С = B()(C), as in the proof of Theorem 3.8(ix) we have |0(/)| = I fc /?(?«|?|)<?)? < fT \?\?(?^') = iMI/l) and, hence, by A.1), \?\ sC ?. On the other hand, for ? e <S(C0), by Theorem 3.9(iii) we have \v\(E) ^ ?(??, MH)(E) = ?\?\(?), so that д^ог|г(С„) ^Ml«lkc0)- If ? = (??^? - м^огIг(С(,)' then by Lemma 4.2 there exists a unique Radon-regular extension
1072 T.V. Panchapage.san ? of ? to B(T) such that ? + ?0^ = ?^? on B(T). Thus ?^?? «? ?${. Therefore, ? = \?\ and then by Theorem 3.9(iii) we have \v\{E) = ?^!(?) = ?\?\(?) = ?(??,??)(?) for ? e S(Co). The last part is due to Theorem 3.8(iv). ? THEOREM 5.6. Let ?? be a complex Radon measure on T. Then the following assertions are equivalent: (i) ?? is bounded. (ii) ? is bounded. (iii) TZcMe, where Ti = B0(T) or BC(T) or B(T). (iv) Mq is ? ?-algebra in T. (v) sup{|Mfl(E)|: ? eTZ] < oo, where TZ= Bu(T) or BC(T) or B(T) or MH- (vi) sup{u(Mff |тг, П)(Е): ? e ft} < oo, vWiere ?? = ?? or B(T) or BC(T) or B0(T) or 8(C) or 8(C()). (vii) sup{u(/iS|i(Co„a(C(,))(C): CeQ < эо- (viii) ||?«|| < oo. Moreover, \\?\\ = \\??\\ for ? e K,(T)*. The functional ? is bounded if and only if Me = Мц* and when ? is bounded. \\??\\ is given by the supremum in (vi) with 11 being anyone of the the S-rings MH, B(T), BC(T), B0(T). 8(C) or ? ((X)) and also by the supremum in (vii). In particular, \\??\\ = ?{??\??),?(?))(?). PROOF, ((i) =>· (ii)) Let feCc(T), with ||/||r ^ 1 and let supp/ = C. Then \0(f)\ = I f /?(?„\???)\ < f \f\dvfaH\BlC),B(C)) \Jc I Jc <f(M»li(C),i@)(C)<4sup{|/i«(?)|: ?e5(C)} by B.3) and, hence, ? is bounded if ?? is bounded. ((ii) =f. (vii)) Let С e C(). By Proposition 2.5 there exists UQ eU Г) 5 (Co) such that С С ?/0. Let / e С+(Г) with Xc ^ / ^ Xt;,,- Then ||/||r = 1 and by (ii) we have v{fieWC),&(C))(C)< f |/??(?«|?(?,5@) J и» <f\f\d^) = \e\(f)^№\ E.1) since ?(??\&??,?@)(?) = ?\?\(?) for ? e 5(C) by Theorems 3.8 and 3.9 and since ||0|| = PHI by Corollary 1 on p. 58 of Bourbaki A965a). If M() is the supremum in (vii), thenbyE.1),Mo<||0||. Since E.1) holds also for С еС, by (iv)and (v) of Theorem 3.8 we have IIm«IKIH|. E.2)
Positive and complex Radon measures 1073 Let ? = и(мй|а(С),5(С)) and let ? ? = $??{?(??\??· ??)(?): ? e К], where К is one of the <5-rings in (vi). By Theorem 3.8(iv), ? = ?(??,??)\&<,?- For ? e Я(С). by Proposition 2.5 there exists С e Co with ? С С and, hence, by Lemma 5.5 we have ?(?) ? Ш(С) = ?(??, Мв)(С) = u(/i«|i(Co„ S(C0))(C) whence it follows that MslC) = Mq. By Corollary 4.6, MBo[T] - ??[?] and by Theorem 3.8(vii), MM„ = ??(?)- Since ?(??, ??) is ??^-regular by Theorem 3.8, we have Mm,, = M^C)- The proof of the equivalence of the remaining assertions is easy and, hence, omitted. From the proof of ((i)^(ii)) and from E.2) it follows that \\?\\ = ||?«|| forfl e ?,(?)* (even thought ? is not bounded). If ? is bounded, by (iv), ? e MH and by Theorem 3.9, Me = M\H\ = ??> . The condition is also sufficient since Мц* is a ?-algebra in T. ? 6. Characterizations of complex Radon measures Using the results of earlier sections we characterize the complex Radon measures in terms of complex measures defined on S(Co) and in terms of those defined on 5(C) which are further <5(C)-regular. Also we give another characterization which generalizes Theorem 54.2 of McShane A944) to complex Radon measures. See introduction. Lemma 6.1. Let ? е/С(Г)*, 0, = Re0 and fa. = Im0. Let ? = ?(?«|5,?,5@). Then: (i) ? is S(C)-regular and if ? is the Radon-regular extension of ? to B(T), then ? — ubor (n) B(T) f)M„ = {Ee B(T): v(E) < oo}. (iii) Me = {E С ?: there exist А, В e B(T) ? Me such that А С ? С В, and НВ\А) = 0]. _ _ (iv) If ? у = ??] \&ic) and ?+ and ?~ are the unique Radon-regular extensions of ?+ and ?~, respectively, to B(T) for j = 1,2, then ?«(?) = (?/" - ?~[)(?) + ;(?+ - M7)(?)/or ? е В(Т) П ??. (?) If ? = {? e ?(?): 0(E) < oo), i/?e/j ?« ? ?/ге Lebesgue completion of ??\? with respect to ??. In short, ?? and ?? are uniquely determined by ?«|^(?- PROOF, (i) By (iv) and (v) of Theorem 3.8, ? is <5(C)-regular. From the proof of Theorem 3.9(iii) and from the Radon regularity of 0 and ?^4 it follows that 0 = ?^?. (ii) Let TZ be as in (v). Then by (i) and by Theorem 3.9(iii), (ii) holds. (iii) By Theorem 3.8(iv), v= \??\\&{?- ВУ the Me -regularity of \??\ and by the Radon- regularity of 0 we have \??\{?) = v(E) for ? e B(T) ? ??- Then by Theorem 3.8(vii), (iii) holds. (iv) By (i) and (v) of Theorem 3.7, by the MH~-regularity of ??~ and by the MH— I J J regularity of ??-, we have ?| = ??~ and ?" = ??- in B(T) П Me, for j = 1, 2. Since
1074 Т. V. Panchapagesan Mg = Me. П Me-,, M+(?) = sup{Mt(C): С СЕ, С е С\ = sup^+(C): С С Е, С еС} = ??+(?) for ? е В(Т) П ??? and for ; = 1,2. Similarly, ??(?) = ?„-(?) for such ? and for j = 1,2. Hence, (iv) holds. (v) By (iv) and (vi) of Theorem 3.8, ?(??\?,?)(?) = ?(??,??)(?) for ? e К and consequently, by (i) and Theorem 3.9, ?(??\??,??)(?) = 0(E) for ? ell. Now (v) is immediate from (iii). ? If ? = ?\ - 02, where ?\ and ?? are positive linear functionals on CC(T), then ?+ +?? = ?~+?\ and, hence, by Proposition 3.1 we have ?^? + ?^,? = ?^ + ?^ on ? (?). Hence, we have the following LEMMA 6.2. If 9\ and ?? are positive linear functionals on CC{T), then ?(?, _#2) = ??? - ??2 on 8(C). LEMMA 6.3. Let ? be a 8(C)-regular real measure on 8(C). Let |?| = ?(?^(.0)). ?/\?\ is the unique Radon-regular extension of\?\ to B(T), let Tl= [E e B(T): |?|(?) < oo}. Then: (i) There exists a unique ? e K,(T)*, ? real, such that ?^? = ?- Moreover, ?+ = ??+Wc) and?~ = ?-н-Ыс)· (ii) \?\ = ?\?\\&(?· (in) \?\ = ?$. (iv) ?? = ?(?)???. _ (?) Me is the Lebesgue completion ofTZ with respect to \?\\??- (vi) If?+ and ?" are the unique Radon-regular extensions of?+ and ?~, respectively, to B(T), then ??(?) = ?+(?) - ?-(?) for EjeTZ. _ Consequently, given F e ??, ??(?) = ??(?) = (?+ - ?~)(?) where А С F С В, А, В е7г with |?|(?\?) = 0. PROOF. Since ? is 3(C)-regular, by Proposition 2.11 the measures |?], ?^, andjjT are «5(C)-regular. Then by Theorem 2.10(ii) the Radon-regular extensions |?|,?+ and ?" exist uniquely on B(T). (i) By Theorem^.2 there exist positive linear functionals ?\ and ?? on Cc (X) such that ?+ = дЬог and ?- = дЬог Let ? _ ?] __ ??_ Then by Lemma 6.2, ??\??? = ?- Then ЬУ Theorems 3.6(v)and 3.7(v) we have ?+ = ?^+?^? an(J M~ = M«- Uo- To prove the uniqueness of ?, if possible, let ? e /С(Г)* be such that ? = ??|^?· Let ?? = Rew and ?? = Imw. Since ??,|5,? = 0, ??7\??? = ??-|«<0 Then ЬУ the
Positive and complex Radon measures 1075 uniqueness part of Theorem 2.10(H), we have ? °I = ? 1 so that ?2+(/> = f fdi^) = f fdi^) = ?2_(/) f0r / e Cc(T). Thus W2 = 0. Hence, ? is real and ? = ??\&?? = (??- -MwOko = (?«- -Mfl-)ko· Therefore, (??+ +Me-)b@ = (М^+ + ??-)??«7) and consequently, by Proposition 3.1 we have ?\°++?~}\?^) = M^++ft,-)l5(C)- Thus, by the uniqueness part of Theorem 2.10(ii) we conclude that ?!™' , = ?^ _, so that (?+ + 0~)(f) = (?+ + w~)(f) for f e CC(T). Hence, ? = ? and thus ? is unique. Since |?| = ?(??\&?), 8(C)) by (i), the assertions (ii) and (iii) hold by Lemma 6.1(i), whereas (iv) follows from Lemma 6. l(ii). Similarly, (v) is immediate from Lemma 6. l(iii). Finally, (vi) and the last part follow from (iv) and (v) of Lemma 6.1. ? As a consequence of the above lemmas we give the following theorem, which among other things, characterizes a complex Radon measure in terms of its restriction to 8(C). THEOREM 6.4. (i) A complex measure ? on 8(C) is the restriction of a complex Radon measure ?# if and only if ? is 8(C)-regular In such case, ? is unique and is called the functional determined by ?. (ii) Let ? be a 8(C)-regular complex measure on 8(C) and let \?\ = ?(?, 8(C)). Let \?\ be the unique Radon-regular extension ?[\?\ to B(T) and let 1Z = {E e B(T): \?\(?) < oo}. Let ?? = Re ?, ?2 = ????, and ?+ and ?~ j be the Radon-regular extensions of ?| and ?^, respectively, to B(T) for j = 1,2. Moreover, by (i), let ? be the functional determined by ?. Then: (a) TZ = B(T)f)M„. _ (b) ? ? is the Lebesgue completion oflZ with respect to \?\\??- ^ (c) Given ? e ??, there exist А, В ell with А с ? с В such that |?|(?\?) = 0. Moreover, ??(?) = (?+ - ?^)(?)+?(?? - ?!)(?). Thus ?? is the Lebesgue completion of ?? |тг with respect to TZ. (d) |?| = Д|0||з(С) and thus \?\ determines \?\. (e) ? is real if and only if ? is real; ? is positive if and only if ? is positive. When ? is real, ?+ and ?~ determine ?+ ????~, respectively. PROOF, (i) The condition is necessary by Theorem 3.8(v). Conversely, let ? be 8(C)- regular, with ?? = Re ? and ?? = ????. By hypothesis and by Proposition 2.11,?? and ?? are <5(C)-regular. Consequently, by Lemma 6.3 there exists 0,- e fC(T)*, 0,- real, such that My = M0/ls(C) for; = 1,2. Let0 = 0| +???. Then ? eK,(T)* and ? = ?^?^?- Clearly, ? is unique by the uniqueness part of Lemma 6.3(i). (ii)(a) As ? = ??\&?), the result follows from Lemma 6. l(ii). (b) This is the same as Lemma 6. l(iii).
1076 Т. V. Panchapagesaii (c) This follows from (iv) and (v) of Lemma 6.1. (d) By Theorems 3.8(iv) and 3.9(iii), |?| = ?(??, Me)\S{C] = MioikC) and< hence, (d) holds. (e) This is immediate from Lemma 6.3(i). ? The following result generalizes Theorem 54.2 of McShane A944) to complex Radon measures. The hypothesis that B(T) ? V = [? e ? (?): \v\(E) < oo} is the same as B(T) П V = {? e B(T): \v\*{E) < oo} when ? = R" and in this case, the hypothesis of P-regularity of ? in Theorem 6.5 is redundant. THEOREM 6.5. Let ? be a 8-ring containing 8(C) and let ? be a V-regular complex measure on V. Let ? = ?^? and let ? = ?(?· ? (С)). Then \v\ is S(C)-regular. Suppose ? = ?(?)?)? = {? eB(T): M(?)<oo}, where \v\ is the unique Radon-regular extension of \v\ to B(T). If ? and V are the Lebesgue completions of ?\? and 1Z, respectively, with respect to ?? and ?\??, then there exists a unique ? e fC(T)* such that ? = ?? and V = Me- Moreover, ? is real {respectively ? is positive) if ? is real {respectively if ? is positive). PROOF. Let \?\ = ?{?, V). By hypothesis and by Proposition 2.11, \?\ is D-regular and as 8(C) С Р, the argument given in the proof of Theorem 3.6(vi) holds here to show that ЫкС) is «5(C)-regular. Since ? is D-regular, as in Theorem 3.6(vii)(c) we have ?{?,&@,?)(?)=\?\(?) for ? e V. Consequently, |i>| = ????^? and, hence, |?| and ? are <5(C)-regular. Then M(?)=sup{M(C): CCE, С eC} = sup{|M|(C): СсЕ, СеС} = |д|(?) for ? e П and hence М1тг = 1м11тг- Since 8(C) С К С V and since ?(?, 8(C), V)(E) = \?\(?) for ? e ?, it follows that ?(?\?· K)(E) = \?\(?) for ? e TL. Hence, ?(?\?. Щ = Since ? is 5(C)-regular, by Theorem 6.4(i) there exists a unique ? e K,(T)* such that v = М«1г(С) and by (ii)(e) of the said theorem, the functional ? is real (respectively positive) if ? is real (respectively positive). Moreover, by (ii)(d) of the said theorem, ? = M|fl|ls(C)- Then by hypothesis, by (a) and (b) of Theorem 6.4(ii) and by the fact that М1тг= 1м11тг = ?(?\??,11), we conclude that V = MH- Let ?? = Re ?, ?? = ????, ?\ = Re ? and v2 = Imv. Since ?? and ?? are P-regular, by following an argument similar to that in the proof of Theorem 3.6(v) we note that М/Ъ(С) = VJ and ?~\^? = vj for j = 1,2. Moreover, as 1/xllstC) is ^(C)-regular, by Proposition 2.11 ?, and ?+ and v~ are 5(C)-regular for j = 1,2. Let vj~ and vj be the unique Radon-regular extensions of v+ and vj to B(T) for j = 1,2. Then by
Positive and complex Radon measures 1077 Proposition 2.4(iv), by the hypothesis that ? is P-regular and by Proposition 2.11, we have it(?)= sup{vt(C): СсЕ, С eC}= sup{?j(C)¦. Cc?, €&0) = ?^{?) for ? e ft and for ; = 1,2. Similarly, v~ (?) = mJ(?) for ? e ft and for j = 1.2. Consequently, given ? e V = ??, by Theorem 6.4(ii)(c) there exist A, S_e ftjvith А с ? С S and M(B\A) = |?|(?\?) = 0. Then ??(?) = (vj - vf )(A) + /(?2+ - vJ)(A) = (?+ -?~)(?) + ?(?? - ?7)(?) = ?(?) = ?(?) since by hypothesis ? is the Lebesgue completion of ?\? with respect to ft. Hence, ?? = ?. ? In the following theorem we characterize a complex Radon measure in terms of its restriction to 8(Cq). THEOREM 6.6. (i) A complex measure ? on ? (Co) is the restriction of a complex Radon measure ?? and such ? e fC(T)* is unique. We say that ? is the functional determined by vifv = Mfl|s(c0); ? is real {respectively positive) if ? is real {respectively positive). (ii) If ? is a complex measure on S(Co) and ? is the unique extension of ? to 8(C) as a S(C)-regular complex measure, then ? and ? determine the same functional ? e K,(T)*. That is, v = ??\&@?)) and ? = ?.?\&?? In the following, let ?, ? and ? be as in (ii). Let v\ = Re ?, ?: = Im ?, ?\ = Re ? and ?? — ????. Then: (iii) The unique S(C)-regular extensions v+ and v~ of v^ and vj, respectively, to S(C) are given by v^ = ?+ and v~ = ?~, j = 1, 2. (iv) // ? is real and \v\ = v(v, 8(Си)), then v+ = ?+?^,,), v~ = M~li(C„) and \v\ = ?(?, 8(C))\s[C„)- Consequently, v+, v~ and \v\ determine ?+, ?~ and \?\, respectively. (?) If ? is complex, then \v\ = ?(?. 8(C))\S{c„) =M|«|l5(C0) sothat\v\ determines Щ. PROOF. By Theorem 4.4(i) there exists a unique 8(C)-regular complex measure ? on 8(C) such that ?\???) = ? and such ? is real (respectively positive) if ? is real (respectively positive). Then by Theorem 6.4(i) there exists a unique ? e K,(T)* such that ? = ??\&(? and hence, ? = ??\&@»)- This functional is real (respectively positive) if v is real (respectively ? is positive) by Theorems 4.4(i) and 6.4(ii)(e). Since ? determines ? uniquely, it follows that ? determines ? uniquely. Thus we have proved (i) and (ii). (iii) By Lemma 4.3(i), vj = m||5(C|)) and vj = ?]\&(?0) and as ?j and ?~ are 8(C)- regular, by the uniqueness part of Theorem 4.1 we conclude that vj = ?j and vj = ?~ for; =1,2. (iv) Let ? be real. Then by Theorem 4.4(i), ? is real and Lemma 4.3(i) implies v+ = ?+?^,,) and v~ = ?" |г(С()). Consequently, by Theorem 6.4(ii)(e) the result holds. (v) This is immediate from Lemma 5.5. ? 7. Isomorphic representations of fC(T)*, K,(T. R)% K(T)*7 and K,(T, Щ Making use of the results of the last section we show that fC(T)* is isomorphic to the complex vector space of all complex measures on 8(Co) and to that of all <5(C)-regular
1078 Т. V. Panchapagesan complex measures on 8(C). The same isomorphism, when restricted to /CG\ R)*, is order preserving and maps K,(T, R)* onto the real vector space of all real measures on 8(Cq) and that of all <5(C)-regular real measures on 8(C). Using the results of Section 5 we also show that IC(T)l (respectively /С(Г, R)?) (see Definition 5.1) is isometrically isomorphic to the Banach space of all bounded complex (respectively real) measures on 8(Cq) and to that of all bounded 8 (C)-regular complex (respectively real) measures on 8(C). Finally, the vector space of all complex valued additive set functions of finite (respectively bounded) variation on a ring of sets is shown to be isomorphic (respectively isometrically isomorphic) to K,(T)* (respectively to K,(T)*) for a suitably chosen totally disconnected locally compact Hausdorff space T. Before giving the relevant theorems we fix the notation of various spaces of real and complex measures. Notation 7.1. In the following vector spaces, the operations of addition and scalar multiplication are defined setwise. Mo(T) (respectively MAT)) denotes the vector space of all complex (respectively <5(C)-regular complex) measures on 8 (Co) (respectively on 8(C)). M(T) (respectively MAT)) is the vector space of all ??(r)-regularjrespectively i3c(r)-regular) complex measures on B(T) (respectively on BC(T)) and MAT) is the vector space of all complex measures on BAT)- Mq(T)i, = {? e MAT): ? bounded) and Mc(T)b = {? e MAT): ? bounded). The real vector spaces MrQ(T), M[.(T), M(T, R), MAT,R), M0(T,R), Mr0(T)h, M[(T)h are the subspaces of the corresponding real measures in MAT), MAT) etc., respectively. THEOREM 7.2. Let ? :MAT) -> K,(T)* be given by ?(?) = ? if ?- ?.<*\&(? and ?0:???)-> K(T)* be given by ?0(?) = ? if ? = ?? |г(С()). Then: (i) ? and ?? are well defined and are linear isomorphisms onto fC(T)*. (ii) If ?(?) = ? (respectively ??(?) = ?), then ?(?(?,?@))) = \?\ (respectively Фо(и(М(Со))) = |0|). (iii) Let Ф(г) = Ф\м[\Т) and Ф^'1 = ??)\?\[?)· Then ??? (respectively ?^'}) is an order preserving linear isomorphism of M[(T) (respectively of M'0(T)) onto /CG\R)*, where ?, < ?2 if ?, (?) iC ?2(?) for ? e 8(C) (respectively for ? e 8(C0)). In particular, if ?"Vy = 0,-, ?7 e M[.(T), for 7 = 1,2, then ???(?\ ??2) = ?\ ??? where ??,????)= sup {?^,(?) + ?«,(?\?)} FcE. F€S{C) ????(?)(?\ л ??) = ?\ ???, where ??]????)= inf \?? (F) + ????\?)\ FcE. Fe,5(CI ' ' for ? e 8(C). A similar result holds if ?\ and ?2 belong to MrQ(T). (iv) MAT) (respectively MAT)) is the dual of)C(T) and 0{f) = f fdivLehiO). /е/С(Г), G.1)
Positive and complex Radon measures 1079 respectively 0(f) = j fd^e\b{Ct))\ /еЩ), G.2) where ?(?) = ? (respectively ??(?) = ?). (?) ?[.(?) (respectively M'Q(T)) is the dual offC(T. R) and an expression similar to G.1) (respectively G.2)) holds ?/???(?) = ? (respectively ?^\?) = ?). PROOF, (i) By Theorems 6.4(i) and 6.6(i), ? and ?0 are well defined. If ?(?\) = ?(?2) = ?, then ?? = ??\S(c) = ?3 and, hence, ? and similarly, ?? are injective. Making use of Proposition 3.1 and Theorem 6.4(ii)(e) it can be shown that ? and ?? are linear. ? is an onto mapping by Theorem 3.8(v), while ?? is evidently an onto mapping. (ii) This follows from Theorem 6.4(ii)(d) for ? and from Theorem 6.6(v) for ??. (iii) Ф(г) is order preserving by Theorem 6.4(ii)(e) and Proposition 3.1 and Ф0'' is order preserving by Theorem 6.6(i) and Proposition 3.1. The rest of (iii) is an immediate consequence of the order preserving property of these isomorphisms Ф"'1 and Ф() . (iv) As in the proof of Theorem 3.7(iv) we have 0(f) = / ??(??]\?[€)) + ? / /?(?(,,\???)= / fd(jie\StC)) for / e CC(T) with supp/ = С where ?\ = Re# and ?? = 1тв. Thus G.1) holds. Since ? is an onto isomorphism, the result holds for MC(T). Similarly, the result holds for Mq(T) and ??. (?) The proof is similar to that of (iv). ? Theorem 7.3. Let S =J3c(T)Jjespectively B(T), B0(T), 8(C), S(C0)) and let M(S) = MC(T) (respectively M(T), M0(T), Mc(T)b, M„(T)b). For ? e M(S), let \\?\\ = sup{u^, ?)(?): ? eS]. Then: (i) The mapping Ф^гЛК^) -> K,(T)l given by ?^(?) = ? if ? = ??\? 's weH defined, and is an isomorphism onto ?.(?)^ and ||?5(?)|| = ||?|| for ? e M(S) so that ?5 is an isometric isomorphism. (ii) Each one of the spaces (M(S). || · ||) is the dualof(Cc(T), || · ||r) (or equivalently, of(Co(T), || · ||7-)) and consequently, (M(S), || · ||) are Banach spaces. (iii) Results similar to (i) and (ii) hold if K,(T)l and M(S) are replaced by K,(T. R)*? and M'(S), respectively, where M! (S) = {? e M(S): ? real]. PROOF, (i) Let ??,?? e M(S) and let ?, ? e C. Clearly, ??? + ??? e M(S) and (?? ? + ??2)\????) =?·?\ \Slc()) + ? ¦ М21г(С„)· Thus- by the uniqueness part of the various assertions in Theorem 4.4 we conclude that Mo(T)i, is the image under a linear onto isomorphism Г5: M(S) -> Mo(T)i, given by rs(M) = MU(C„), ?^?(?).
1080 T.V. Panchapagesaii Let ??(?) = (?? ° ^s)(m) for ? e A1(<S). where ?? is as in Theorem 7.2. Clearly, ?5 is a linear isomorphism of M(S) onto its image in fC(X)*. If ?^??) = ?, then ??(???(?0)) = ? and by hypothesis, sup{^(?)|: ? e <5(Co)} < 00. Consequently, by the equivalence of (ii) and (vi) of Theorem 5.6, ? is bounded and hence <I>s(M(S)) С fC(T)fy Conversely, if ? e K,(T)*} then by Theorem 5.6 ?? is bounded in ?? and ?? D B(T). Consequently, ??\s belongs to M(S) and <Ps(Ph\s) = ?, so that <t>s(M(S)) = K,(T)*r By the last part of Theorem 5.6, \\<Psiv)\\ = HmII for ? e M(S). (ii) This is immediate from (i) and from the fact that K.(T)*} is the dual of (CC(T), || · ||r)· (iii) The proof is similar to the earlier parts. ? The following theorem is based on the Stone representation theorem for Boolean rings and Theorems 7.2 and 7.3. THEOREM 7.4. Let ? be a non void set and let TZ be a ring of subsets of ?. Let M-jz (respectively (M-ji)b) be the vector space of all complex valued finitely additive set functions of finite (respectively bounded) variation on TZ and let \\?\\ = sup{u(M, "?)(?): ? eTZ] for ? e (Mn)b- Let M'n {respectively (Мхп)ь) be the subspace of real valued set functions in M-r (respectively (M-jz)b)· Then there exists a totally disconnected locally compact Hausdorff space ? such that M-jz is isomorphic to K,(T)*; Mrn is order isomorphic to К,(Т.Ж)* and (Mn)b (respectively (M'n)b) is isometrically isomorphic (respectively isometrically order isomorphic) to K,(T)*7 (respectively to К.(Т,Ш)*7). When TZ is an algebra in ?, the space ? can further be assumed to be compact. PROOF. By the Stone representation theorem for Boolean rings (see Dinculeanu A967, § 18)), there exists a totally disconnected locally compact Hausdorff space ? such that TZ is ring-isomorphic to the ring TZ of all compact-open subsets of T. Let ? be such an isomorphism from TZ onto TZ. Let С e Co of T. Then by Proposition 2.5 there exist U„ e U П 5(Co) such that С = П^° U„. Since the members of TZ form a base for the topology of Г, each U„ is of the form Un = (J · ? (A,^), Anj eTZ. A.s С is compact, there exist A„y;, / = 1. 2,... ,k„, in TZ such that С с U; = i ф^А>ч,)- If An = Ufl, Anji. then С = ПГ ф(А») and' hence' С e ?(??). Since TZ С Co, it follows that S(C0) = S(TZ). For ? e M-jz, let ?(?)(?) = ?(?-?(?)) for ? e TZ. Sincej?~'@) = 0 and since each countable disjoint union {En}f in TZ with (J ? E„ = ? eTZ has E„ = 0 for all but a finite number of и, it follows that ? = ? (?) is a complex measure on TZ and hence admits a unique extension 0 to S(TZ) — <5(Cq) as a complex measure. Conversely, given a complex measure ? on <5(Co), let ?(?~ '(?)) = v(E) for ? eTZ. Clearly, ? is well defined on 7? and is a complex valued finitely additive set function on TZ. Since i>|^ is of finite variation, ? e М-ц. Moreover, ? (?) = v\^. Also, the mapping ?: M-r -»· Mq(T) given by ?(?) = {?(?)} is linear and bijective, where Mq(T) is as in Notation 7.1 and {?(?)} is the unique countably additive extension of ? (?) to S(Co) = S(TZ). Consequently, by Theorem 7.2(iv) M-r is isomorphic to K,(T)*. The other results follow on similar lines. ?
Positive and complex Radon measures 1081 8. Applications In this section we discuss some results obtained in Panchapagesan A998a, 1998b) as applications of Theorem 7.3. Also we include a generalization of Theorem 4.4 to locally convex Hausdorff space-valued (briefly, IcHs-valued) Baire measures, whose proof as given in Panchapagesan A998b) is motivated by the proof of the former theorem. In the sequel, ? (?) denotes the dual of (Со(Г). || · ||r) and is identified with the space of all bounded complex Radon measures on Г - i.e., with the space K(T)*r In virtue of Theorem 7.3 it is also identified with the Banach space of all (bounded) ??G>regular complex measures ? on B(T) with ||?|| = ?(?. ?(?))(?). Let К be a metrizable compact Hausdorff space. Proposition 8 of Dieudonne A951) states that a sequence {?,,}^ in ? (?) is weakly convergent if and only if {?„(?/)} j* is convergent in С for each open set U in K. Later, for a bounded subset A of ? (?), Grothendieck A953, Theorem 2) gave several characterizations for A to be relatively weakly compact. One of them is that, for each disjoint sequence (i/,)^ of open sets in ?, lim, ?(?/,) = 0 uniformly in ? in A. As a corollary of this characterization, he obtained the following generalization of Dieudonne's result to (non-metrizable) locally compact Hausdorff spaces T. THEOREM 8.1 (Grothendieck, 1953). A sequence {?,,}^ '" ? (?) is weakly convergent in M(T) if and only if hm,, ?„(?/) e С for each open set U in T. The hypothesis that lim„ ?„(?/) e С for each open set U in ? implies that the set {?,,} j* is bounded in M(T). The above generalization of Grothendieck cannot be considered optimum for the following reason. In a metrizable compact space K, each open set is a Baire set and hence an optimum generalization of Dieudonne's result would be the following: A sequence {?„ }^ in M{T) is weakly convergent in M(T) if and only if lim„ ?„ (U) e С for each open Baire set U in T. Such a generalization with the additional hypothesis of boundedness of {?,,}^ is obtained in Corollary 1 of Panchapagesan A998a). For this, Theorem 7.3 is used to give a Baire version of Theorem 2 of Grothendieck A953). In fact, the following result is obtained. Theorem 8.2 (Theorems 1 and 2 of Panchapagesan A998a) - Baire and ?-Borel versions of Theorem 2 of Grothendieck A953) and of Lemma VI.2.13 of Diestel and Uhl A977)). Let ? be a locally compact Hausdorff space and let M(T) be the dual о/(Со(Г), || ¦ Цг)· Let A be a bounded set in M(T). Then the following .statements are equivalent: (i) A is relatively weakly compact. (ii) For each disjoint sequence {i/,} of open Baire (respectively (ii)' ?-Borel) sets in T, 1????(?/,) = 0 uniformly in ? e A.
1082 T.V. Panchapagesan (iii) For each disjoint sequence {i/,}^ of open Baire (respectively (iii)' ?-Borel) sets inT, limu(M|7e,ft)(t/,) = 0 I uniformly in ? e A where H— Bq(T) (respectively BC(T)). (iv) (a) Given ? > 0 and an open Baire (respectively (a)' ?-Borel) set U in T, there exists a compact Gs С (respectively a compact C) such that С С Е and supu(M|7e,ft)(t/\C)<f ?€? where ??= Bq(T) (respectively BC(T)). (b) For each ? > 0, there exists a compact Gs С (respectively (b)' compact C) in ? such that sup sup ?(?\??,??)(?) < ? ??? E€Tl.EcT\C where 1Z= Bq(T) (respectively BC(T)). (v) Given ? e B$(T) (respectively (\')\ Given ? e BC(T)) and ? > 0, there exists a compact Gs С (respectively a compact C) in ? such that С С Е and suPMeA ?(?\??, Tl)(E\C) < ? where K= B0(T) (respectively BC(T)). In other words, ?\??(?) (respectively ?\?^?)) is uniformly Baire (respectively ?-Borel) inner regular. (vi) Given a sequence E„ \ 0 in Bq(T) (respectively (vi'): in BC(T)) and an ? > 0, there exists no such that sup ?(?\??, П)(Е„) <? ?€? for ? ^ no, where ?? = Bq(T) (respectively BC(T)). In other words, ?\?{^?) (respectively ?\???)) is uniformly countably additive. (vii) Given ? e Bo(T) (respectively ? e BC(T)) and ? > 0, there exist a compact Gs С and an open Baire set U in ? (respectively a compact С and an open ?-Borel set U in T) such that С С Е С U and supu(Ml7e,ft)(t/\C)<e ?€? where ?? = Bo(T) (respectively BC(T)). In other words, ?\?{){?) (respectively ?\?,.(?)) is uniformly Baire (respectively ? -Borel) regular. Using the equivalence of (i) and (ii) of Theorem 8.2 and following an argument similar to that on p. 150 of Grothendieck A953) we have the following optimum generalization of Dieudonne's proposition.
Positive and complex Radon measures 1083 COROLLARY 8.3 (Corollary 1 of Panchapagesan A998a) - Generalization of Proposition 8 ofDieudonneA951)). A bounded sequence {щ}^ in M(T) is weakly convergent in M(T) if and only //lim, ?,(U) e С for each open Baire set U in T. The above result is really an optimum generalization of the said result of Dieudonne and is more natural and stronger than Theorem 8.1. Moreover, the boundedness of {?,}]'0 can also be dropped in the above corollary. See Remark 9.18 below. Using the strict Dunford-Pettis property (briefly, SDP-property) of Со(Г) and other deep results, Grothendieck A953) characterized weakly compact operators on C(K) whose range is contained in a complete IcHs. In fact, he obtained the following THEOREM 8.4(Theorem6ofGrothendieckA953)). Let К be a compact Hausdorff space and let X be a complete IcHs. Let u:C(K) ^> X be a continuous linear mapping. Then the following statements are equivalent: (i) и is weakly compact. (ii) For each closed set F in К, и**(ху) e X, where u** is the biadjoint of и and the dual X* ofX is endowed with the strong topology ?{?*, ?). (iii) For each closed Gs С in К, н**(хс) e X- (iv) For each increasing sequence (f,,)^ С C(K), with 0 < /„ < 1, (uf,)^0 converges weakly in X. Then Remark 2 on p. 161 of Grothendieck A953) says that without any difficulty the techniques developed in earlier sections can be used to show that the enunciation of the above theorem is textually valid for Со(Г), with ? locally compact and Hausdorff. Later, Edwards A965) gave the details of the proof of the said remark, using the techniques of Grothendieck A953). But, unfortunately his proof is incorrect and as observed in Panchapagesan B000) Grothendieck's techniques hold if and only if ? is further ?-compact. On the other hand, Bartle-Dunford-Schwartz A955) developed a theory of integration of scalar functions with respect to a countably additive Banach space-valued vector measure and using it, they gave an alternative method to study weakly compact operators и on C(K), К a compact Hausdorff space, when the range of и is contained in a Banach space, and obtained an integral representation of the operators u. When и are weakly compact, we call this the Bartle-Dunford-Schwartz representation theorem for u. They also deduced the SDP-property of C(K) from the integral representation of и. The Bartle-Dunford-Schwartz representation theorem is generalized in Panchapagesan A998b) as follows. Theorem 8.5 (Theorem 1 of Panchapagesan A998b) - Generalized Bartle-Dunford- Schwartz representation theorem). Let X be an IcHs and let u:Co(T) —> X be a continuous linear mapping. Then there exists an X**-valued vector measure (i.e., a vector valued additive set function) m on B(T) with the following properties: (i) x*m e M(T) for each x* e X*. and consequently m :B(T) -> X** is countably additive in the ?(?**, X*)-topology.
1084 Т. V. Panchapage san (ii) The mapping x* —* x*m of x* into M(T) is weak*-weak* continuous. (Hi) x*uf= fTfdx*m for each f e CQ(T) and x* e X*. (iv) {m{E): ? e B(T)} is ze-bounded in X**, when X** is endowed with the locally convex topology ze of uniform convergence in equicontinuous subsets of X*. Conversely, let m : B(T) -> X** be a vector measure satisfying (i) and (ii). Then there exists a unique continuous linear map и: Co(T) -> X such that (iii) holds. Moreover, m(E) = u**(xE) for ? e B(T) and m satisfies (iv). Finally, the vector measure m satisfying (i)-(iii) is uniquely determined by the continuous linear map и and is called the representing measure of и and m has ze -bounded range in X**. When X is quasicomplete, uf = fT f dm, for / e Со(Г), where the integral is defined in the sense of Definition 3 of Panchapagesan A998b). When X is quasicomplete and и is weakly compact, then uf = fT f dm for / e Co(T), the vector measure m has range in X, and is countably additive and Borel-regular (in the sense of Definition 5 of A998b)). Remark 8.6. Clearly, the above theorem generalizes the Riesz representation theorem for positive linear forms on Cq(T) to quasicomplete IcHs-valued weakly compact operators on C0(T). Using Theorems 8.2 and 8.5, 35 characterizations are given in Panchapagesan A998b) for a quasicomplete IcHs-valued continuous linear map и on Со(Г) to be weakly compact. These characterizations include those in Remark 2 of Grothendieck A953). In particular, we have the following theorem, which subsumes the said remark. THEOREM 8.7 (Theorem 3 of Panchapagesan A998b)). Let X be a quasicomplete IcHs. Let и :Co(T) -> X be a continuous linear mapping. Then the following statements are equivalent: (i) и is weakly compact. (ii) u**(xu) e Xfor all open sets U in T. (iii) h**(xf) e X for all closed sets F in T. (iv) u**(xu) & Xfor all open sets U belonging to BC(T). (v) u**(xu) e Xfor all open Baire set U in T. (vi) u**(xu) e X for all open sets U in ? which area-compact. (vii) m**(xf) e X for all closed sets F in ? which are Gs- (viii) u**(xu) e X for all open sets U in ? which are a countable union of closed sets inT. (ix) For each increasing sequence (f,)^ С Cq(T), with 0 < /„ < 1, (uf,) converges weakly in X. In Sections 2 and 4 we used the Riesz representation theorem for a positive linear functional ? on CC(T), to prove the existence and uniqueness of ??(r)-regular(respectively ??f(r)-regular) extension of a complex measure on Bq(T) (see Theorem 4.4). In virtue of Remark 8.6 the following question arises:
Positive and complex Radon measures 1085 As in the scalar case, can the integral representation of a weakly compact operator be used to generalize Theorem 4.4 to X-valued countably additive vector measures ?? on Bq(T), where X is a quasicomplete IcHs? The question is answered in the affirmative in Theorem 10 of Panchapagesan A998b). First the concept of 7?-regularity is generalized to vector measures, where TZ is a <5-ring of sets in T, containing С or C0 (see Definition 5 of Panchapagesan A998b)). When 1Z is Bq(T) (respectively BC(T), B{T)) we use the terminology of Baire regularity (respectively ?-Borel regularity, Borel regularity). Then using the integral representation theorem (Theorem 8.5) and other theorems in Panchapagesan A998b), the following generalization of Theorem 4.4 is proved. Theorem 8.8 (Theorem 10 of Panchapagesan A998b)). Let ? be a locally compact Hausdorff space and let X be an IcHs. Then each X-valued countably additive vector measure ?? on Bq(T) is Baire-regular. If X is further quasicomplete, then there exists a unique X-valued countably additive Borel-regular (respectively ?-Borel regular) vector measure ? on B(T) (respectively ?, on B,(T)) such that ?|??)(?) = Mo (respectively Мс-1в(,(Г) = Mo)· Moreover, ?? = ?\? (?). Remark 8.9. The above result has already been proved by different methods. For example, see Dinculeanu and Kluvanek A967). Kluvanek A967). Sion A969), Dinculeanu and Lewis A970), etc. However, recently a new proof, which is simpler than all the known proofs, has been given in Dobrakov and Panchapagesan B002). Let us recall from the definition on p. 634 of Edwards A965) that a Banach space ? is said to have the SDP-property if, for each weakly compact operator и : ? —> ?, X а quasicomplete IcHs, и transforms each weak Cauchy sequence in ? into a convergent sequence in X. As remarked earlier, Grothendieck A953) showed that Cq(T) has SDP-property. Using Theorems 8.5 and 8.7, an alternative proof is given in Panchapagesan A998b) (see Theorem 11 there). Pelczynski A960) showed that every continuous linear mapping u:C(K) —> ? is weakly compact if ? is a Banach space with со <? E. This result was improved in Pelczynski A962) where he proved that every unconditionally convergent operator u:C(K) -> E,E an arbitrary Banach space, is weakly compact. (A continuous linear map u: ? -> F, E, F, Banach spaces, is said to be unconditionally convergent if, for every sequence (x,,)^ of vectors in ? with Y^ |**(л-„)| < oo for each x* e X*, ?^° u(x„) is unconditionally convergent in F.) Using the technique of reduction to metrizable compact space and Theorem 8.4, Thomas A970) extended the former result of Pelczynski to a continuous linear map и :Со(Г) -> X, X a quasicomplete IcHs with со <t X- Using Theorems 8.5 and 8.7 and dispensing with the technique of reduction to metrizable compact space, both the results of Pelczynski are generalized to continuous linear mappings on Co(T) with range in a quasicomplete IcHs. See Theorems 12 and 13 of Panchapagesan A998b). For an alternative simple proof see Remark 9.18 below.
1086 Т. V. Panchapagesan 9. Generalization to Radon vector measures Thomas A970) developed a theory of Radon vector measures analogous to the theory of essentially integrable scalar functions given in Chapter V of Bourbaki A965b). In this section we briefly discuss the results announced in Panchapagesan A995/96) along with others which generalize some of the results in Section 7 to these measures. A more detailed complete exposition of these results including others will be published elsewhere. (See Remark 9.18 below.) Definition 9.1. Let X be a quasicomplete IcHs and let Г be the family of all continuous seminorms on X. A linear mapping и : ?,(?) -> X is called an X-valued Radon operator (or simply, a Radon operator when there is no ambiguity about the range space X) if, given a compact set С in ? and ? e ?, there exists a (finite) constant Mc.p > 0 such that р(иф)^Мс.р\\Ф\\т for all ? e fC(T) with supp<? с С. (Though Thomas A970) developed his theory for K,{T, E), it is equally valid for fC(T) also.) Let T+(T) and M + (T) be as in Theorem 1.2. As ? is fixed, we shall denote them by T+ and M + , respectively. Following Thomas A970) we give the following definitions. Definition 9.2. Let и : ?,(?) -^Ibea Radon operator. Let / e T+ and let ? e ?. If / e M+, we define "*,(/)= sup р(иф). \фК/.фе)С(Т) If / has compact support, then we define «·(/)= inf u'(g). Finally, if / is arbitrary in T+, we define "*(/) = sup{i/*(/2): 0< h ^ /, supp/г compact}. For ? С Г, we define и* (Е) = и* (??·). For / e K,(T), it is easy to show that и* (|/|) < oo for all ? e Г. This fact plays a key role in the following: Definition 9.3. Let u:K,(T) ->· X be a Radon operator. Let T°(u) = [f:T -> C, "" (l/l) < oo for each pef). Then /С(Г) с Т°. T° is endowed with the locally convex topology induced by the seminorms {и* ? e ?}. The closure of K,(T) in J"° for this topology is denoted by C\(u) and all members of C\ (и) are said to be и-integrable.
Positive and complex Radon measures 1087 THEOREM 9.4. C\{u) is a vector space containing K,(T). C\(u) = {/ e T°(u): given ? > 0 and ? e ?, there exists ? e K,(T) such thatu'(\f — ?\) < ?]. Since р{иф) ^ и* A01) for ? e ?,(?) and ? e ?, it follows that и is continuous in K,(T) also for the topology of C\ (u). Thus и admits a unique continuous linear extension й to the whole of C\ (и) with values in X, the IcHs completion of X. However, as X is quasicomplete, by Theorem 1.35 of Thomas A970) uf е X, for / eC\(u). Definition 9.5. Given a Radon operator и : ?,(?) -н» X, for each / e ?| (и), the value of the integral of / in ? is denoted by ii (/). Remark 9.6. An X-valued Radon operator is referred to as Radon measure with values in X in Thomas A970). For / e C\ (и), ?(/) is denoted by fT f du in Thomas A970). Definition 9.7. Let и : ?,(?) -+ X be a Radon operator. We define Mu = {E cT: ?? e ?|(и)} andM„(?') = ii(x?). THEOREM 9.8. For a Radon operator и on K.(T), Mu is a ring of sets and ?? is countably additive on Mu. Definition 9.9. In virtue of Theorem 9.8, the countably additive vector measure ?;, is called the Radon vector measure induced by и and Mu is called the domain of ???. (Compare Definition 3.4.) Definition 9.10. A Radon operator и :K,{T) -> X is said to be bounded if, for each реГ, there exists a (finite) constant Mp such that р{иф)^Мр-\\ф\\т forah>e/C(r). Definition 9.11. A bounded Radon operator u:K,{T) -> X is said to be weakly compact if its unique continuous linear extension to Cq(T) is a weakly compact operator on (Со(Г), || · ||r). A Radon operator и : ?,(?) -+ X is said to be prolongable (in the sense of Thomas A970)) if its restriction и\)сШ) is a weakly compact Radon operator for each relatively compact open set U in T. Using Remark 2 on p. 161 of Grothendieck A953), Thomas A970) proved the following theorem, on which rests the major part of his theory. Theorem 9.12 (Theorems 2.2 and 2.2 bis, Thomas A970)). Let u:K,{T) -> X be a bounded Radon operator. Then the following conditions are equivalent: (i) Every bounded Borel measurable function belongs to C\ (u). (ii) For every open set ? in T, there exists a vector ?? e X such that ?*{??) = fT Xwd{x*u) for each x* e X*. That is, the weak integral {in the sense of Thomas A970)) of ?? belongs to X for each open set ? in T. (iii) и is weakly compact on (Cq(T). \\ · \\т).
1088 Т. V. Panchapagesan Thomas A970) argues that ((ii) =>· (iii)) by the results on p. 160 of Grothendieck A953). Really, as Г is locally compact, he means Remark 2 on p. 161 of the said paper. Unfortunately, till the publication of Panchapagesan A998b) the said remark remained unproved as noted in the paragraphs following Theorem 8.4 above. The condition (ii) of Theorem 9.12 says that ?*?? = (??**(??))(?*) for each x* e X*, where и** is the biadjoint of и :Co(T) -> X. Consequently, ?**(??) = ?? e X. Hence, by Theorem 8.7 (which subsumes Remark 2 of Grothendieck A953)) и is weakly compact and, hence, (iii) of Theorem 9.12 holds. Consequently, the theory of Radon vector measures developed in Thomas A970) remains valid. The following result generalizes a part of Theorem 5.6 to weakly compact Radon operators. THEOREM 9.13. Let u: K,(T) -> X be a Radon operator. Then the following statements are equivalent: (i) и is weakly compact. (ii) Mu is ? ? -algebra in ? and С С М„. (iii) KcMu, where П = В{Т) or ?,.(?) orB0(T). (iv) Every bounded u-measurable scalar function (in the sense of Definition 1.18 of Thomas A970)) belongs to C\(u) and и is prolongable. (v) Every bounded IZ-measurable scalar function belongs to C\(u), where 1Z = B(T) or BAT) or B0(T). (vi) For each open set U e 1Z there exists a vector хц e X such that x*(xu) = fT Xu d(jielt) for each x* e X*, where ??* = x*u e K,(J)*h and П = B(T) or BC(T) or Bo(T). If и is a weakly compact Radon operator on fC(T) and ifmu is the representing measure of its continuous linear extension to (Со(Г), || · \\т) (see Theorem 8.5), then mu = /j.u\b(T)- Moreover, for each ? e Mu, ??<(?)=\\?\???(??H CtC where С is directed by the partial ordering C\ ^ C2 ifC\ С С2 for С \, C: eC. The following theorem characterizes prolongable Radon operators. THEOREM 9.14. Let u: K,(T) -> X be a Radon operator. Then the following statements are equivalent: (i) и is prolongable. (ii) г (Со) с ми. (iii) 8(C) С Mu. (iv) Mu is a ?-ring containing all relatively compact open sets in T. (v) Mu is a S-ring containing С (vi) ?,? is a ?-ring containing Co- (vii) Every bounded scalar IZ-measurable function with compact support belongs to C\(u), where П = B(T) orBC(T) orBq(T).
Positive and complex Radon measures 1089 Remark 9.15. In the light of the above theorem and Proposition 3.3, the members of K,(T)* are precisely the scalar valued prolongable Radon operators on K,(T). Similarly, in virtue of Theorems 5.6 and 9.13, the members of fC(T)*} are precisely the scalar valued weakly compact Radon operators on K,(T). Definition 9.16. Let и :K,(T) -> X be a Radon operator with the associated Radon vector measure ?„. Then ?„ is called a weakly compact (respectively prolongable) Radon vector measure if и is weakly compact (respectively prolongable). Let ? be a <5-ring containing C() and let ?: V -> X be countably additive. Based on two new concepts of Lebesgue-Radon completion and localized Lebesgue-Radon completion of V (respectively of ?) with respect to ? (respectively to D), we have the following theorem which generalizes Theorems 6.4 and 6.6 to prolongable Radon vector measures. THEOREM 9.17. Let ?:8(C) -> X be countably additive. Then ? is the restriction of an X-valued prolongable (respectively weakly compact) Radon vector measure ?? if and only if ? is 8(C)-regular (see Definition 5 of Panchapagesan A998b)) (respectively and if ? has relatively weakly compact range). In that case, и is unique and и is called the prolongable (respectively weakly compact) Radon operator determined by ?. Moreover, Mu is the localized Lebesgue-Radon (respectively Lebesgue-Radon) completion of 8(C) with respect to ? and ?„ is the localized Lebesgue-Radon (respectively Lebesgue-Radon) completion of ? with respect to 8(C). Remark 9.18. An alternative simple proof of Theorems 12 and 13 of Panchapagesan A998b) is given in Panchapagesan B002). The proposed generalizations in Theorems 9.13, 9.14 and 9.17 are closely related to the study of the Bartle-Dunford-Schwartz integral of scalar functions with respect to a Banach space-valued and more generally, with respect to a quasicomplete IcHs-valued countably additive measure defined on a ?-ring or on a <5-ring of sets. Such a study is carried out in author's forthcoming papers [PI] (The Bartle- Dunford-Schwartz integral, I. Basic properties); [P2] (The Bartle-Dunford-Schwartz integral, II. ?/7-spaces, 1 ^ ? < oo); and [P3] (The Bartle-Dunford-Schwartz integral, III. Integration with respect to IcHs-valued measures) which have been communicated for publication. A measure-theoretic treatment of many of the results of Thomas A970) for weakly compact and prolongable Radon operators on K,(T) with the improvement of replacing open sets by open Baire sets is given in [P4] (The Bartle-Dunford-Schwartz integral, IV. Applications to integration in locally compact Hausdorff spaces - under preparation), where the hypothesis that {?,·}^ is bounded in Corollary 8.3 is shown to be redundant by proving that, for each open set U in ?, there exists an open Baire set V С U such that ?, (V) = ?,(?/) for all;'. [P5] (The Bartle-Dunford-Schwartz integral, V. Complements to the Thomas theory of vectorial radon integration - under preparation) obtains some of the results of Thomas A970) by simpler proofs, improves many of them by replacing open sets by open Baire sets, gives a correct proof of Theorem 1.35 of Thomas A970) whose original proof is incorrect and obtains Theorems 9.13,9.14 and 9.15. Moreover, it is shown in [P5] that, for a prolongable (respectively weakly compact) Radon operator и on K,(T) with values in a quasicomplete IcHs, C\ (u) given in Thomas A970)
1090 Т. V. Panchapagesan is essentially the same as C\ (?„) given in [P3], where ?„ is the representing measure of и as given in [P4] and for / e C\(u), Ct(f) = fT f du (see Remark 9.6)= fT f d\xu (the Bartle-Dunford-Schwartz integral of / with respect to ?„ as given in [P3]). We would also like to emphasize that [P4] and [P5] are not only based on [PI, P2, P3] but also on Panchapagesan A998a, 1998b) References Bartle, R.G., Dunford, N. and Schwartz J.T. A955), Weak compactness and vector measures. Canad. J. Math. 7, 289-305. Bourbaki, N. A965a), Integration, Chapitres I-IV, Hermann, Paris. Bourbaki, N. A965b), Integration, Chapitre V, Hermann, Paris. Diestel, J. and Uhl, J.J. A977), Vector Measures, Amer. Math. Soc., Providence, RI. Dieudonne, J. A951), Sur la convergence des suites de mesures de Radon, Anais. Acad. Bras. Ciencias 23, 21-38. Dinculeanu, N. A967), Vector Measures, Pergamon Press, New York. Dinculeanu, N. and Kluvanek, 1. A967), On vector measures, Proc. London Math. Soc. 17, 505-512. Dinculeanu, N. and Lewis, P.W. A970), Regularity ofBaire measures. Proc. Amer. Math. Soc. 26, 92-94. Dobrakov, I. and Panchapagesan, T.V. B002), A simple proof of the Borel extension theorem and weak compactness of operators, Czechoslovak Math. J., in print. Dunford, N. and Schwartz, J.T. A958), Linear Operators, Part I. General Theory, lnterscience. New York. Edwards, R.E. A965), Functional Analysis, Theory and Applications, Holt, Rinehart and Winston, New York. Grothendieck, A. A953), Sur les applications lineares faiblement compactes d'espaces du type C( A"), Canad. J. Math. 5, 129-173. Halmos, PR. A950), Measure Theory, Van Nostrand, New York. Hewitt, E. and Stromberg, K. A965), Real and Abstract Analysis. Springer, New York. Kluvanek, I. A967), Characterization of Fourier-Stieltjes transforms of vector and operator valued measures, Czechoslovak Math. J. 17, 261-277. McShane, E.J. A944), Integration, Princeton University Press, Princeton. Naimark, M.A. A959), Normed Rings, Noordhoff, Groningen. Panchapagesan, T.V. A991a), Medida e Integracion, Pane I, Vols. 1 and 2, Universidad de los Andes, Facultad de Ciencias, Merida, Venezuela (Spanish). Panchapagesan, T.V. A991b), Integral de Radon en espacios localmente compactos у de Hausdorff, Cuarta Escuela Venozolana de Mat., Universidad de los Andes, Facultad de Ciencias, Merida, Venezuela (Spanish). Panchapagesan, T.V. A992), On complex Radon Measures I, Czechoslovak Math. J. 42, 599-612. Panchapagesan, T.V. A993), On complex Radon measures II, Czechoslovak Math. J. 43, 65-82. Panchapagesan, T.V. A995/96), On Radon vector measures. Real Analysis Exchange 21, 75-76. Panchapagesan, T.V. A998a), Baire and ? -Borel characterizations of weakly compact sets in M( T), Trans. Amer. Math. Soc. 350, 4839-4847. Panchapagesan, T.V. A998b), Characterizations of weakly compact operators on Cq(T), Trans. Amer. Math. Soc. 350, 4849-4867. Panchapagesan, T.V B000), On the limitations of the Grothendieck techniques. Rev. Real Acad. Cienc. Exact. Fis. Natur., Madrid 94, 437^440. Panchapagesan, T.V. B002), Weak compactness of unconditionally convergent operators on Cq(T), Math. Slovaca, in print. Petczynski, A. A960), Projections in certain Banach spaces, Studia Math. 19, 209-228. Petczynski, A. A962), Banach spaces on which every unconditionally converging operator is weakly compact. Bull. Acad. Polon Sci. Ser. Sci. Math. Astronom. Phys. 10, 641-648. Rudin, W. A967), Real and Complex Analysis, McGraw-Hill, New York. Sion, M. A969), Outer measures with values in topological groups, Proc. London Math. Soc. 19, 89-106. Thomas, E. A970), L'integration par rapport a line mesure de Radon vectorielle, Ann. Inst. Fourier (Grenoble) 20,55-191.
CHAPTER 27 Measures on Algebraic-Topological Structures Piotr Zakrzewski Institute of Mathematics, University of Warsaw, ul. Banacha 2. 02-097 Warsaw. Poland E-mail: piotrzak@mimuw.edu.pl Contents Introduction 1093 1. Invariant measures on arbitrary G-spaces '094 2. Nonmeasurable sets for invariant measures ' ?2 3. Extensions of invariant measures · 107 4. Invariant measures on Polish groups 1109 5. Invariant measures on Polish G-spaces " И 6. Isometrically invariant measures on Euclidean spaces 3 References 1127 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1091
This page intentionally left blank
Measures on algebraic-topohgic/il structures 1093 Introduction This article is entirely devoted to countably additive measures invariant under groups of transformations. Our general framework is a G-space, i.e., a pair (X. G), where G is a group acting on a set X. If, moreover, ? is a ? -algebra of subsets of X, we say that ? is G-invariant if gA e ? whenever g e G and A e ?. Then we also say that G acts by measurable transformations on the measurable space (?, ?). We say that a (?-additive, non-zero) measure ? : ?1 —» [0, oo] is G-invariant if /J.(gA) = ?(?) for every g e G and Ael, i.e., if for each g e G the transformation (g,-):X->Xis measure-preserving. Invariant measures are present in many parts of mathematics including harmonic analysis, ergodic theory and topological dynamics. A typical example of such a measure is the Lebesgue measure: one of its key properties is that it assigns equal values to congruent sets, so that the measure of a set does not depend on its position in the space. The Lebesgue measure is a particular example of a Haar measure on a locally compact topological group G: the only, up to a constant factor, regular Borel measure, positive on nonempty open sets, finite on compact sets and invariant under the left shifts from G. Haar measure plays an important role in abstract harmonic analysis and group representation theory. In ergodic theory one focuses on the case where a single measure-preserving transformation ? of a probability measure space (?, ?. ?) is given, so that X becomes a Z-space via the map (к, x) ь* Tkx for к e ? and ? e X. Invariant measures for more general group actions are also of considerable interest. But in this article we leave aside these aspects of the theory of invariant measures that are specific for harmonic analysis or ergodic theory. Instead, given a group G acting by measurable transformations on a measurable space (?, ?), we focus on the fundamental problems concerning the interplay between algebraic, combinatorial and topological properties of the action and the existence and properties of G-invariant measures defined on ?. In particular, we are interested in the way in which in variance of a measure influences the size and the structure of the underlying ?-algebra, and the existence and properties of nonmeasurable sets. General necessary and sufficient conditions for the existence of finite and ?-finite invariant measures on arbitrary G-spaces are discussed in Section 1. We omit these conditions obtained for the special case of Z-actions which do not have meaningful extensions to more general classes of actions. Such conditions properly belong to classical ergodic theory and are discussed in many standard textbooks on the subject (see, e.g., Friedman, A970)). We concentrate on results related to a characterization that was obtained by Hopf A932) for the case of Z-spaces and was later generalized by Kawada A944), Hajian and Ito A969) and others to arbitrary G-spaces. It is formulated in terms of the notion of countable equidecomposability, first investigated by Banach and Tarski A924). We admit after Hopf that such results are not fit for immediate applications but nevertheless they throw a light on the intrinsic nature of G-spaces admitting invariant measures. Section 2 is devoted to invariant measures defined for all subsets of a given set X. Results are parallel to the work of Tarski A938) on the existence of finitely additive invariant measures. Conditions imposed on a G-space X, which refute the existence of a G-invariant
1094 P. Zakrzewski ?-finite measure defined on V(X), at the same time guarantee that for an arbitrary G- invariant ?-finite measure ?: ? -> [0, oo] on X, we have ? ? V{X), i.e., nonmeasurable sets exist. These investigations are related to the well-known general measure problem, whether there exists a ?-additive ?-finite measure defined on the power set of X and vanishing on all singleton sets. Thus the answers, as is well-known, depend on set-theoretic assumptions. In Section 3 we deal with extensions of invariant measures, the matter closely connected to the invariant version of the general measure problem. Given a G-invariant measure ?: ? -> [0, oo] on a G-space X we look for another G-invariant measure ?: ?' -> [0, oo] such that ? ?. ?' and ?\ ? = ?. This reduces to finding a suitable ???-?-measurable set that can be added to the domain of the measure ?. In Section 4 topology enters into the picture, as we discuss the existence of Borel measures on Polish (i.e., separable, completely metrizable) groups. In particular, standard facts about Haar measure in locally compact groups are summarized. For the sake of conceptual coherence and making the interplay between measure and topology more transparent, we restrict ourselves to the case of second countable groups. The discussion of invariant properties of Borel measures on Polish spaces is continued in Section 5, where continuous or, more generally, Borel actions of Polish groups are considered. Some parts of this section belong to descriptive dynamics, a branch of descriptive set theory that has been recently attracting increasing interest. Our exposition is based on the widely distributed set of lecture notes by A.S. Kechris A994). In the last section we come back to extensions of invariant measure, this time concentrating on isometrically invariant measures in Euclidean spaces R". We present a survey of results connected with Sierpinski's problem, formulated in Szpilrajn A935), concerning the existence of a maximal isometrically invariant extension of Lebesgue measure. 1. Invariant measures on arbitrary G-spaces We say that a group (G, ·, e) acts on a set X, if there is a map (g,x) ь* gx of G ? ? into X, called an action of G on X, such that for every ? e X: (i) ex = x; (ii) for all gi, g2 e G, g](g2x)= (g\ ¦ gi)x. We then say that (X, G), or just X, is a G-space. It follows that for each g e G the function ? ь* gx is a permutation of X. If G itself consists of permutations of X, then we tacitly assume that the action is given by gx = g(x) and say that G is a group of permutations of X. Given a G-space X a (G-)orbit is the set Gx = {gx: g e G). A subset А с X is (G-) invariant if it is the union of orbits of its elements or, equivalently, it is closed under the action of G. We say that the action is transitive, if X is the only orbit. The action is free, if gx ? ? for any g ? e and ? e X. If G is a subgroup of a group (X, ·), then G acts on X by left shifts, i.e., gx = g ¦ ? for g & G, ? e X. Of course, this action is free and it is transitive if and only if G = X; G-orbits coincide with the left-cosets of G in X.
Measures on algebraic-topological structures 1095 If ? is a ?-algebra of subsets of a set X, then we say that (?, ?) is a measurable (or Borel) space. The elements of ? are called measurable sets. By a measure on X we mean a ?-additive (unless explicitly stated otherwise), non-zero measure defined on a ?-algebra ? of subsets of X. We always assume that {x} e ? for any ? e X. A measure m : ? -> [0, oo] on X is diffused if it vanishes on all singleton sets, i.e., m({jt})=0forall;t el Given sets А, В e ? we write A c,„ ?, if m(A \ ?) = 0; if A c,„ ? and ? c,„ A, then we write A =„, B. Suppose a group G acts by measurable transformations on a measure space (?, ?, m). The action of G is m-free if m*({.r e X: gx = x}) = 0 for any g e G \ {e}, where m* stands for the outer measure of m. A subset А С X is (G-)almost invariant (with respect to w) if m(gA ? A) = 0 for every g e G, where ? denotes the symmetric difference. It is easy to check that the collection ?* = {A & ?: A is almost invariant} is a ?-algebra of subsets of X contained in ?. The measure m is (G-)quasi-invariant if m(A) = 0 implies m(gA) = 0, and (G-) invariant if w(A) = m(gA), for every A e ? and g e G. It is (G)-ergodic, if w(A) = 0 or m(X \ A) = 0 for any A e 2?*. Ergodicity of m is equivalent to the fact that the associated measure algebra ?* /m is the two-element algebra. Note that if the measure m is quasi-invariant, then the ?-algebra ?* is invariant. If m is, moreover, ?-finite, then every A e ? is contained in a minimal almost invariant set A* e ?*, where minimal means that A* c,„ ?, whenever AC, В and В е ?* (see, e.g., Ramachandran and Misiurewicz A982)). If the group G is countable, then simply A* = \Jg€GgA;'mgcncTal A* = \Jn€Ng„A for certain gn e G, и eN={0, 1,...}. The problem of existence of invariant measures on an arbitrary G-space have been approached from two different but closely related points of view. (A) In classical ergodic theory the problem was formulated as follows (see Halmos A956, p. 81)): Given a group G acting by measurable transformations on a ?-finite measure space (?, ?, m), find necessary and sufficient conditions for the existence of a G- invariant, probability or ?-finite measure ? defined on ? and such that ??«?, i.e., m is absolutely continuous with respect to ?. It is usually assumed that the measure m is quasi-invariant (in this case elements of G are sometimes called nonsingular transformations of the measure space (?, ?, m)). It is then required that ? ~ m, i.e., the measures ? and m have the same null sets. (B) Influenced by Tarski's work on finitely additive invariant measures (see Wagon A986) or Laczkovich B002) in this Handbook) one searches for "purely combinatorial" conditions for the existence of an arbitrary probability or ? -finite G-invariant measure, when a group G acts by measurable transformations on a measurable space (?, ?), with no measure defined on the ?-algebra ? given in advance. The two approaches lead to closely related results. The reason is that if all G-orbits are uncountable, then any ?-finite invariant measure defined on the ?-algebra ? has to be diffused but even in relatively simple cases, when the ?-algebra is, say, countably
1096 P. Zakrzewski generated, it may happen that it carries no ?-finite, diffused measure at all, even without the additional requirement of G-invariance. So, natural necessary conditions for the existence of an invariant measure have the form: "? carries a measure m satisfying some additional properties." In order to formulate the desired conditions, we need to recall the notion of equivalence by countable decompositions, introduced by Banach and Tarski A924). Our terminology is consistent with that from Wagon's monograph A986). Given a group G acting by measurable transformations on a measurable space (?, ?) we say that sets А, В & ? are countably (G-)equidecomposable (in ?), in symbols А ~эо В in ?, if there is a partition of A into countably many sets An e ?, ? e N, and elements gn e G such that the sets gn An form a partition of B. A set A e ? is called countably (G-)paradoxical (in ?), if A contains disjoint subsets A\, ?? e ? with A ~x A\ ~x At in ?. A set A & ? is called G-negligible (in ?) if X contains pairwise disjoint subsets A/, s ?, ? ? ?, with each A„ ~3C A in ?. Let Nq consist of all G-negligible elements of ?. It can be proved (see, e.g., Chuaqui A977)) that the family No is closed under countable unions and taking subsets in ?, i.e., it constitutes a ?-deal in ?. In fact, it is not difficult to prove that the ? -ideal Nc is generated by countably paradoxical sets in ?, i.e., it is the smallest ?-ideal in ? containing all measurable subsets of X that are countably paradoxical in ?. Additionally suppose that m is a measure defined on ?. A set ? e ? is called (G-) bounded (with respect to m), if ? ~эс A in ?1 implies m(Z \ A) = 0 for every A e ?, Ac. Z, i.e., ? is not countably G-equidecomposable in ? with any measure-theoretically proper measurable subset of itself. It is not difficult to prove that ? is bounded if and only if the measure m vanishes on all measurable subsets ? of ? with the property that ? contains pairwise disjoint sets E„ e ?, ? e N, with each En ~x ? in ? (see, e.g., Kawada A944)). In particular, we have the following: PROPOSITION 1.1. Suppose a group G acts by measurable transformations on a measure space (?, ?^). The following are equivalent: (i) The set X is G-bounded with respect to m. (ii) The measure m vanishes on all G-negligible sets from ?. (iii) The measure m vanishes on all measurable sets that are countably G-paradoxical in ?. The following fundamental theorem on the existence of invariant probability measures was first proved by E. Hopf A932) for the case of Z-actions and then generalized by Y. Kawada A944) to arbitrary group actions. THEOREM 1.2. Suppose a group G acts by measurable transformations on ? ?-finite measure space (X, X\m) and assume that the measure m is quasi-invariant. Then the following are equivalent: (i) There exists an invariant probability measure ? on ? such that ? ~ m. (ii) The set X is bounded with respect to m.
Measures on algebraic-topohgical structures 1097 Obviously, any invariant probability measure on ? vanishes on sets that are countably paradoxical in ?, so the existence of such a measure ? with ? ~ m immediately implies that X is bounded with respect to m. The nontrivial part of the Hopf-Kawada theorem asserts that the converse is true. Given sets А, В e ? we write A «; В if there isCel with С ~эс A in ?, С c„, В and for every Q e ?*, if Q c„, ?* and m@) > 0, then m((B \ С) П 0 > 0, i.e., С ? ? is a measure-theoretically proper subset of ? ? (? Given sets ?,?,??? and « e N we write A = nB + C if A =„, ? ? U · · ¦ U A„ U С where ? ? A„, С are pairwise disjoint sets in ? with each Ai ~oc ? in ?. The essence of proof (which uses, as Kawada admits, some ideas taken from J. von Neumann's 1936-37 classical lecture notes on "Continuous Geometry" - see von Neumann A998)) are the following two lemmas. Lemma 1.3. For every sets А, В e ? of positive measure there is a partition (up to a measure zero set) A* =„ (J Qn of A* into pairwise disjoint sets Q„ e ?*, ? e N, such that for each ? e N m(Qn) > 0 and AnQ„=n(BD Q„) + R„, where R„ « ? ? Q„. Moreover, the partition is unique, up to a measure zero set. We say that a set В e ? is a G-atom if m(B) > 0 and В is minimal with respect to <<C in the sense that A <<C В implies m(A) = 0, for any A e ?. Aset Q e ?* is called discrete if Q =m ?* for a certain G-atom B. LEMMA 1.4. There are disjoint sets D,C e ?* such that X =„, DUC and the following conditions are satisfied: (i) Ifm(D) > 0, then D is discrete. (ii) If m(C) > 0, ?/геи there is a decreasing sequence of sets C„ e ? with Co = C, C,* =m С and C„ = 2C„+| for every n e N. Moreover, such sets С and D are unique, up to measure zero sets. Taking the lemmas above for granted, the proof is completed as follows. Assume, without loss a generality, that m(X) = 1. Let D and С be sets from Lemma 1.4. We shall define measures ?</, ?< on measurable subsets of D and C, respectively (if m(D) = 0 or m(C) — 0, then we let ??? s 0 or ?( ? 0, respectively).
1098 P. Zakrzewski So assume that m(D) > 0 and let D= B* for a certain G-atom ?. Take A e ?, A c,„ D. By Lemma 1.3 and the fact that ? is a G-atom, there exists a partition of D into sets Qn e ?* such that m(Q0) = 0 and Qn = n(B П Qn) for each ? > 0. Fix и > 0 and additionally assume that A cw Q„. Then, using again Lemma 1.3, we have АПРк = к(ВПРк) for к = I,...,n, where A* =,„ UILi ?k is the appropriate partition of A* into sets A e ?1*. Set For an arbitrary A & ? with A cm Z) define 00 ??/(?) = ^?^",(???„)· /f=l Now assume that m(C) > 0 and take ???,? c,n C. Fix и e N. Note that С = 2"C„. Using Lemma 1.3 we find a partition of A* into sets Qkn) e IT*, k = 0,...,2", such that А П ?'"> = л(С* П ?'"') + R™ where ^'" « QkH). Then let /-w=E^'»(ei,,)) * = l and define Mc(A)= Hm /„(A). n—>oc It can be checked that this definition is correct. Finally, for an arbitrary A e ? define ?(?) = ??(? П D) + ??(? П С). For details see Kawada A944). Kawada A944) noticed that this proof reveals the following uniqueness property of invariant measures.
Measures on algebraic-topological structures 1099 THEOREM 1.5. Suppose a group G acts by measurable transformations on a measurable space (?, ?). Then any invariant probability measure ? on ? is determined by its values on ?*, the ? -algebra of measurable sets, almost invariant with respect to ?. Moreover, if m is a probability quasi-invariant measure on ? such that X is bounded with respect to m, then there exists a unique invariant probability measure ? on ? such that ? ~ m and ?(?) = m(A)for any A e ?*. In particular, ifm is ergodic, then there exists a unique invariant probability measure ? on ? with ? ~ m. Shorter proofs of the Hopf-Kawada theorem were later found by Ramachandran and Misiurewicz A982), and Ramachandran A989). The simplification was achieved by using the semigroup of (countable equidecomposability) types (see Section 5 of Laczkovich B002) with the obvious changes). Ramachandran's proof was based on a theorem of Tarski (see Tarski A938, Theorem 1.55)) concerning measures on semigroups. In fact, this theorem yields the existence of a finitely additive invariant measure equivalent to a given quasi invariant measure m. The latter, however, suffices, due to the following result of A.P. Calderon A955) (see also Friedman A970, Theorem 3.13)). THEOREM 1.6. Suppose a group G acts by measurable transformations on ? ?-finite measure space (?, ?, m) and assume that the measure m is quasi-invariant. If ? is a finite, finitely additive, invariant measure defined on ? with ? ~ m, then the function ? defined by ?(?) = infl ]T P(A„): A„ e ?, А с (J A„ forAe ?, is a finite, ?-additive, invariant measure on ? with ? ~ m. To formulate another necessary and sufficient condition for the existence of an invariant probability measure we need the notion of a weakly wandering set. Suppose a group G acts by measurable transformations on a measurable space (?, ?). A set A e ? is called weakly wandering (under G) if there exist elements g„ e G, ? e N, such that gn А П g„,A = 0 for ? ? т. Clearly, every weakly wandering set is G-negligible. The following strengthening of the Hopf-Kawada theorem was first proved by Hajian and Kakutani A964) for Z-actions and then generalized by Hajian and Ito A969) to arbitrary actions (partial generalizations were obtained by several authors - see the references in Hajian and Ito A969)). THEOREM 1.7. Suppose a group G acts by measurable transformations on ? ?-finite measure space (X, E,m) and assume that m is quasi-invariant. Then the following are equivalent: (i) There exists an invariant probability measure ? on ? such that ? ~ m. (ii) The measure m vanishes on all weakly wandering sets in ?. Under the hypotheses of Theorem 1.7 condition (ii) above is equivalent to each of the following three assertions:
1100 P. Zakrzewski (iii) m(A) > 0 implies \nii,€Gm(gA) > 0, for every A e ? - see Hajian and Kakutani A964). (iv) For every ? > 0 there is ? > 0 such that m(A) < ? implies m(gA) < ?, for every Ae ?-seeSakaiA978). (v) For every ? > 0 there is ? > 0 such that m(A) < ? and ? ~^ ? imply m(B) < ?, for every А, В e ? - see Sakai A978). Hajian and Ito used functional analytic methods in their proof, treating the existence of an invariant measure as the problem of finding a fixed point under certain group of linear operators on an appropriate Banach space. More precisely, let us assume that the measure m is a probability measure and consider the following action of the group G on the Hubert space L2(X, ?, m): Ug(f) = f(g-]x)[w!,(x)]w2 forge G, /eL2(X,?,m), where wg is the Radon-Nikodym derivative with respect to m of the measure mg~' (A) = m(g~] A) for ??? Any common fixed point, i.e., a function /о е L2(X, ?, m) with Ug(fo) — /o f°r all g e G, gives a desired invariant probability measure ? ~ m by the formula ?(?)= / fl{x)dm{x) for A e ?. Yet another proof of the Hopf-Kawada theorem was given by R. Chuaqui A977). Chuaqui (quoted in Wagon A986, Question 9.13)) conjectured that the the existence of a ?- finite measure m on ? which vanishes on all G-negligible sets is sufficient for the existence of a G-invariant probability measure on ?, even without the additional requirement that m is quasi-invariant. By a result of Halmos A947), this was known to be true for Z-actions. Chuaqui's conjecture was confirmed in Zakrzewski A993a). More exactly, we have the following result which also generalizes the Hajian-Ito theorem. THEOREM 1.8. Suppose a group G acts by measurable transformations on ? ?-finite measure space (?, ?, m). Then the following are equivalent: (i) There exists an invariant probability measure ? on ? such that т«д. (ii) The measure m vanishes on all G-negligible sets in ?. (iii) The measure m vanishes on all weakly wandering sets in ?. Let us now turn to the conditions for the existence of invariant ?-finite measures. If a group G acts by measurable transformations on a measure space (?, ?, m), a set ? e ? is called (G-)a-bounded (with respect to m), if it is a countable union of bounded sets. In Kawada A944), quoted above, the following theorem is proved. THEOREM 1.9. Suppose a group G acts by measurable transformations on a ?-finite measure space (?, ?, m) and assume that m is quasi-invariant. Then the following are equivalent:
Measures on algebraic-topological structures 1101 (i) There exists an invariant ? -finite measure ? on ? such that ? ~ m. (ii) The set X is ?-bounded with respect to m. (iii) The re is a bounded set A G ? with A* =w, X. This result, for the special case of Z-actions, was later rediscovered by Halmos A947) who also noticed that in this case the assumption that the measure m is G-quasi-invariant may be omitted if we want the resulting invariant measure only to dominate m in the sense of absolute continuity. The following theorem from Zakrzewski A993b) generalizes the results of Kawada and Halmos to arbitrary groups of measurable transformations of any ?-finite measure space (?, ?, m). The new element that arises here is the necessity of adding to the condition that X is ?-bounded another property, which guarantees the existence of a quasi-invariant measure dominating m in the sense of absolute continuity (and is, therefore, superfluous if m itself is assumed to be quasi-invariant). Let Ic consist of all A e ? with m(gA) = 0 for every g e G. It is clear that /c is a ?-ideal in ?. We say that a ?-ideal / in ? satisfies the countable chain condition in ?, if there is no uncountable collection of pairwise disjoint sets in ? \ I. Clearly, m being ?-finite, the ?-ideal of m-measure zero sets satisfies the countable chain condition in ?. THEOREM 1.10. Suppose a group G acts by measurable transformations on ? ?-finite measure space (?, ?, m). Then the following are equivalent: (i) There exists an invariant ?-finite measure ? on ? such that т«д. (ii) The set X is ?-bounded with respect to m and the ?-ideal Ic satisfies the countable chain condition in ?. Moreover, the following are equivalent: (i) The ?-ideal Ic satisfies the countable chain condition in ?. (ii) There exists a quasi-invariant, ?-finite measure ? on ? such that Ic = {A e ?: ?(?) = 0}. (iii) There exists a quasi-invariant, ?-finite measure ? on ? such that т«д. In fact, the proof of the nontrivial part of Theorem 1.8, asserting the existence of an invariant probability measure ? on ?, m «?, boils down to showing that ifm vanishes on all weakly wandering sets in ?, then the ?-ideal Ic satisfies the countable chain condition in ?. In the special case when the group G is countable, the ?-ideal /c always satisfies the countable chain condition in ?. But in fact in this case even an invariant ?-finite measure always exists, the measure defined by ?(?) = \A П Gx\ for A e ?, i.e., the counting measure concentrated on the orbit Gx of an arbitrary point ? e X, being a trivial example. However, if we look for diffused invariant measures, then it follows that a quasi-invariant ?-finite diffused measure defined on ? exists, provided that the ? -algebra ? carries ? ? - finite diffused measure at all. As is well known, the latter may not be true even if ? is countably generated. On the other hand, even in the case of a Z-action, the existence of a quasi-invariant ?-finite diffused measure does not imply that such an invariant measure exists. An example is given in Zakrzewski A993b). It is based on the result of D.S. Ornstein
1102 P. Zakrzewski A960) that there exists a Ъ-action by measurable transformations of the Lebesgue measure space ([0, 1], C, ?) such that the Lebesgue measure ? on the ?-algebra С of Lebesgue measurable subsets of [0, 1] is quasi-invariant and ergodic but there is no invariant, ?- finite measure ? on С such that ? ~ ?. Other conditions for the existence of an invariant ?-finite measure, equivalent with a given quasi-invariant one, were obtained in Arnold A968) (see also Zakrzewski A993b, Theorem 3.3), Rosenblatt A974) - it concerns the case when the group G is finitely generated - and Rosenblatt A980) - it concerns the existence of an invariant measure which is strongly equivalent with the given quasi-invariant measure in the sense that the measures have the same sets of measure zero and the same sets of infinite measure). Theorems 1.2 and 1.9 are in fact special cases of a more general theorem from Kawada A944) on the existence of a strictly positive measure on a Boolean algebra A, invariant with respect to a certain group of automorphisms of A (see Kawada A944, Satz 4)). Of course, this is closely related to the problem of finding an invariant measure ? with ? ~ m, if a measure m, quasi-invariant under an action by measurable transformations on a measurable space (?, ?) of a group G is given. Indeed, the action of G on X canonically induces the action of G on the associated measure algebra A= ?/m. This Boolean-algebraic point of view is also present in Vladimirov A965), Chuaqui A977). Let us note that the search for invariant measures on Boolean algebras is connected to the study of groups of automorphisms and measure-preserving automorphisms of measure algebras (see Fremlin B000) for an account of the subject and further references). A remarkable feature of existence proofs from Kawada A944) is that they also establish certain uniqueness properties of invariant measures. Let us mention the following generalization of Theorem 1.5: Suppose a group G acts by measurable transformations on a measurable space (?, ?). Let ? be an invariant ? -finite measure on ? and suppose that A e ? is a bounded set with ?* =? ?. Then ? is determined by its values on sets of the form AC\Q, where Q e ?*. In particular, we have the following corollary established earlier in von Neumann A940, Lemma 3.3.4). PROPOSITION 1.11. Suppose a group G acts by measurable transformations on a measurable space (?, ?). If ? is an invariant ? -finite ergodic measure on ?, then ? is unique, up to a constant factor, among invariant ? -finite measures ? on ? with ?«?. An account of results connected to the Hopf-Kawada theorem on one hand and a classification of G-spaces with a quasi-invariant measure, studied by H.A. Dye A959, 1963) and W. Krieger A969a, 1969b), on the other, may be found in Dang-Ngoc-Nghiem A973). 2. Nonmeasurable sets for invariant measures We say that a measure m is universal on a set X, if it is defined on the ? -algebra V(X) of all subsets of X, i.e., if m-nonmeasurable sets do not exist.
Measures on algebraic-topological structures 1103 A particular case of the problem discussed in the preceding section concerns the existence of invariant universal measures on G-spaces. As is well known, given a set X, there might be fundamental set-theoretic obstacles preventing the existence of any ?-finite diffused universal measure on X (see, e.g., D.H. Fremlin's article A993) for a thorough discussion of this point). On the other hand, in many cases, the reason why a particular measure cannot be defined on V(X) is precisely its G-invariance, regardless of any new set-theoretic assumptions which might possibly be added to the usual Zermelo-Fraenkel set theory with the axiom of choice (ZFC). Let us call a set X large, if there exists a ?-finite or, equivalently, probability diffused universal measure on X (in the commonly used terminology this means that there exists a real-valued-measurable cardinal к ^ \X\ - see, e.g., Fremlin A993, 1C-D)); otherwise call X small. The prototype of arguments employing invariance properties of measures to show the existence of nonmeasurable sets within ZFC is the Vitali construction. By a selector or a transversal of a collection С of nonempty, pairwise disjoint subsets of a set X, we mean a subset ? с X which intersects every element of С in exactly one point. The classical Vitali subset ofR, constructed by G. Vitali A905), is a selector of the family of all cosets of the rationals Q, treated as a subgroup of the additive group R. It is easy to note that a Vitali subset ofR is nonmeasurable with respect to any translation- invariant measure on R which assumes a finite non-zero value on [0, 1]. In particular, no translation-invariant probability measure on R is universal. In fact, Vitali's argument works equally well for, say, left action of a countable subgroup of an arbitrary uncountable group G. Let Я be a subgroup of G and call a Vitali set of ? а selector of the collection of all left-cosets of ? in G. Now, if ? is countably infinite and S is a Vitali set for H, then S is nonmeasurable with respect to any finite invariant measure on G. This follows directly from the fact that G= [J hS, i.e., G can be partitioned into countably many translates of S. The latter shows also that G is countably paradoxical (in arbitrary pieces). (It is in fact not difficult to prove that in the general case of an arbitrary G -space X, a subset ? of X is countably paradoxical (in arbitrary pieces) if and only if there exists a countable subgroup ? of G such that the intersection with ? of the ?-orbit of every point of ? is infinite - see, e.g., Zakrzewski A991a, Lemma 2.2).) Recall that according to a theorem of Tarski A938), given an action of a group G on a set X, a finitely additive probability invariant measure on X exists if and only if X is not paradoxical, i.e., X does not contain disjoint subsets X\, ??, each finitely equidecomposable with X - see Laczkovich B002) for more details. The exact analog of Tarski's theorem for countably additive measures is not true in general, since there may be no ?-additive probability universal measure on X vanishing on all countably paradoxical sets for purely set-theoretic reasons like, say, X being small, which have nothing to do with the obvious obstacle that X is countably paradoxical. So the following special case of Theorem 1.8, proved in Zakrzewski A991a), seems to be the
1104 P. Zakrzewski optimal analog of Tarski's theorem in the countably additive case for arbitrary G-spaces (see, however, Theorem 5.1). THEOREM 2.1. For an arbitrary G-space (X, G) the following are equivalent: (i) There exists a universal invariant probability measure on X. (ii) There exists a universal probability measure on X vanishing on all countably paradoxical subsets of X. Let us now turn to ?-finite measures. A.B. Khararazishvili A975) and, independently, P. Erdos and R.D. Mauldin A976) clarified the situation for the case when a group acts on itself by (say) left shifts (see also С Ryll-Nardzewski and R. Telgarski A978) for a generalization of this result). THEOREM 2.2. Let G be an uncountable group acting on itself by left shifts. Then no ?-finite invariant measure on G is universal. PROOF. Take a subgroup ? of G of cardinality ?? and let S с G be a Vitali set of H. Suppose that ? is a ?-finite invariant universal measure on X. Then we cannot have ?(S) > 0, since this implies that the collection [hS: h e H] consists of uncountably many sets of positive measure. But ?(?) = 0 is also impossible, since then G = [Jh€H hS has measure zero (the union of any family of ? ?-many null sets with respect to a universal ?-finite diffused measure is a null set - see Fremlin A993, ID)). ? The above argument proves also the nonexistence on G of a universal ?-finite measure, quasi-invariant with respect to its uncountable subgroup H. On the other hand, for an arbitrary countable group ? acting on a large set X, a universal ? -finite invariant diffused measure always exists. (Proof. Assume that m is a universal probability diffused measure on X. Fix a selector S of the collection of all Я-orbits such that m(S) > 0. Define the measure ? by ?(?)= ]? km({x eS: \АГ)Нх\=к\) forACX, (*) /teNU|Dc) where oo · 0 = 0 and oo · t = oo, if t ? 0.) The case of an arbitrary G-space X which admits a universal probability measure m is covered by Theorem 1.10. However, it turns out that now the situation is simpler: the existence of a quasi-invariant ?-finite universal measure on X implies the existence of an equivalent invariant one. This is a consequence of the following structural theorem for universal ?-finite quasi-invariant measures established in Zakrzewski A991 a) (for a shorter proof, due to D.H. Fremlin, see Zakrzewski A997), Fremlin A993, U)). THEOREM 2.3. Suppose G is a group of permutations of a set X. If m is a universal quasi-invariant ?-finite measure on X, then there exists a countable subgroup ? of G such that m({x e X: gx ? Hx}) = 0 for every geG.
Measures on algebraic-topological structures 1105 Now, under the notation of Theorem 2.3, if S is a selector of the collection of all H- orbits, then formula (*) above defines a G-invariant universal ?-finite measure ? with ? ~ m. Combining this with Theorem 1.10 we have the following theorem (for more details see Zakrzewski A991a, 1997)). THEOREM 2.4. For an arbitrary G-space (X, G) the following are equivalent: (i) There exists a universal invariant ?-finite measure on X. (ii) There exists a universal quasi-invariant ?-finite measure on X. (iii) There exists a universal ?-finite measure m on X such that there is no uncountable collection ofpairwise disjoint sets outside the ?-ideal Iq = {AC X: m(gA) = 0 for every g e G). In particular, Theorem 2.3 implies that // a group G acting on a set X contains an uncountable subgroup G' whose action on X is free, then no universal ?-finite quasi- invariant measure on X exists. This was earlier proved by Khararazishvili A983) (see also Khararazishvili A996)) in the following stronger form: under the above assumptions, if ? : ? —> [0, oo] is ? ?-finite quasi-invariant measure on X, then every set ? e ? with ?(?) > 0 contains a nonmeasurable set. But in fact, Theorem 2.4 easily implies that, in general, for an arbitrary G-space X, if there is no universal quasi-invariant ?-finite measure on X, then any ?-finite quasi-invariant measure on X has the property that every measurable subset of X of positive measure contains a nonmeasurable set. (Proof. If ?: ? -> [0, oo] is a ?-finite quasi-invariant measure on X and ? e ? is such that V(Y) с ?, then the universal measure m on X defined by m(A) = ?(? ? ?), for А с X, satisfies condition (iii) above.) More examples of natural G-spaces admitting no universal ?-finite invariant measures can be found in Penconek and Zakrzewski A994), Zakrzewski A997). While the results above concern the possibility of simultaneously making all subsets of a given G-space X measurable with respect to an invariant measure, there still remains the more modest problem, whether a particular subset of X can be "made measurable" by an invariant measure. This question, aimed at getting a deeper insight in the properties of nonmeasurable sets, is motivated by the following facts: • Every Vitali subset ofR is nonmeasurable with respect to any translation-invariant extension of the Lebesgue measure. • There exists a Vitali subset of R which is measurable with respect to a certain ?-finite diffused translation-invariant measure defined on ? ? -algebra containing all Lebesgue measurable subsets ofR (this is due to Khararazishvili A983, 1991, 1996)). Along these lines the following theorem of Khararazishvili A980) generalizes Theorem 2.2 in the case of Abelian groups. THEOREM 2.5. Let G be an uncountable Abelian group acting on itself by left shifts. Then there exists a subset A of G nonmeasurable with respect to any ?-finite invariant measure on G. It seems open whether the same holds for arbitrary groups. Let us call a subset A of an arbitrary G-space X absolutely nonmeasurable if it does not belong to the domain of any ?-finite invariant measure on X. Khararazishvili A983, 1991, 1996) gave more examples
1106 P. Zakrzewski of absolutely nonmeasurable sets in certain natural G-spaces but the problem, posed in Khararazishvili A996), to find necessary and sufficient conditions for the existence of an absolutely nonmeasurable subset of X, is still open. We may also look at sets nonmeasurable with respect to extensions of a given invariant measure. Answering a question of A. Pelc A986), S. Solecki A993a) obtained the following general result, particular instances of which were earlier proved by Khararazishvili A978) (the case of a translation invariant extension of the Lebesgue measure on E" and Pelc A986) (the case of a left invariant extension of a (left) Haar measure on a locally compact, ?-compact topological group; see also Hewitt and Ross A963, 16.13)). THEOREM 2.6. Suppose G is an uncountable group acting on a set X and ? is ? ?-finite invariant measure on X. If the action of G is ?-free, then each set of positive measure contains a subset nonmeasurable with respect to any invariant extension of ?. The above holds, in particular, for every uncountable group G acting on itself by left shifts; this gives another strengthening of Theorem 2.2. Let us emphasize a remarkable role played by Vitali sets in the constructions of sets nonmeasurable with respect to invariant measures. If we define a Vitali set of a subgroup ? of a group G acting on a set X as a selector of the collection of all Я-orbits, then it is certainly true to say that playing with Vitali sets is the basic technique leading to the results above. Measurability properties of Vitali sets with respect to invariant measures were investigated by Solecki A993b). He showed that under the hypotheses of Theorem 2.6 one can always find a countable subgroup of G whose Vitali sets behave just like the original Vitali sets of the rationals on the real line. More precisely, there exists a countable subgroup ? of G such that each Vitali set of ? is nonmeasurable with respect to any invariant extension of ?. On the other hand, under certain assumptions a subgroup of full cardinality with properties like that cannot be constructed: If the cardinality of the group G has uncountable cofinality, G acts freely on a set X and ? is ? ?-finite invariant ergodic measure on X, then there exists an invariant extension ? of ? such that each subgroup ? of G with \H\ = \G\ has a ?-measurable Vitali set. For the special case of additive subgroups of ? the latter was strengthened by A. Nowik A996/97) who proved that there exists a translation invariant extension of the Lebesgue measure on ? which measures at least one Vitali set of each uncountable subgroup o/E. Recently, Solecki B001) showed the same in the context of invariant extensions of the Haar measure on any locally compact, second countable Abelian group. Vitali sets of subgroups of ? were also studied by J. Cichon, A.B. Khararazishvili and B. Weglorz A993). They proved, in particular, that // ? is an uncountable analytic (i.e., a continuous image of a Borel set) subgroup o/E then there exist a Vitali set of ? which is Lebesgue measurable and all Vitali sets of ? are measurable with respect to a certain isometrically invariant extension of the Lebesgue measure on E. More information on various set-theoretic aspects of the theory of invariant measures can be found in Khararazishvili A998). We complete this section with a result which shows that the assumption that invariant measures under consideration are ?-finite is crucial for all results concerning the existence of nonmeasurable sets.
Measures on algebraic-topological structures 1107 We say that a measure m defined on a ? -algebra ? of subsets of a set X is semi-finite or semiregular if every set of positive measure contains a set of positive finite measure. Zakrzewski A987) and earlier in the Abelian case Pelc A982), solving a problem posed by V. Kannan and S.R. Raju A980), proved the following theorem. THEOREM 2.7. If G is an arbitrary large group, then there exists a universal semi-finite invariant measure on G. 3. Extensions of invariant measures We have seen in Section 2 that given a ?-finite invariant measure ?: ? -> [0, +oo] on a G-space X satisfying some additional conditions, there are nonmeasurable sets which cannot be "made measurable" with respect to any invariant extension of ?. A complementary question is whether there is a nonmeasurable set which can be "made measurable" by an invariant extension of ?. More precisely, whether a proper invariant extension of ?, i.e., an invariant measure ? defined on an invariant ?-algebra ?' D ? and such that ?| ? = ?, exists at all. This problem will be referred to as the invariant measure extension problem. A particularly important instance of it, concerning isometrically invariant extensions of the Lebesgue measure, whose investigations were initiated by E. Szpilrajn (Marczewski) A935), will be discussed in Section 6. The obvious necessary condition for the existence of a proper extension of a given invariant ?-finite measure ?:?-> [0, oo] on a G-space X is that it is nonuniversal, i.e., ? ? V(X). Moreover, without loss of generality we will always assume that the measure ? is diffused. (For let С = {x e ?: ?({?}) > 0); С is a countable subset of X and ? = ?, + ?? where ?\, ?2: ? -> [0, +??] are defined by ?? (A) = ?(? П С) and ?2(?) = ?(? \ C). If ?@ > 0, then ?? is a universal measure on X and it follows that the measure ? has an invariant extension if and only if so does the measure ?2, which is clearly diffused.) In general, there are the following basic ways of obtaining an invariant extension ?:?'-> [0, +Oo]of ?: (I) Find a set ? <? ? such that for any sequence (g„: ? e N) of elements of G, the union UneN Sn В has inner measure zero. Let I consist of all subsets of the unions of this form. Then define: ?'= {? ? ?: A e ? and N el}. ?(???) = ?(?) (see Szpilrajn (Marczewski) A935)). (II) Find an almost invariant set ? ^ ?, then define: ?'= {(A, nZ)U(A2\Z): A\,A2eE\, ?((?| П ?) U (A2 \ ?)) = i · [?*(?, П ?) + ?*(?2 \ ?) + ?*(?|??) + ?*(?2\?)],
1108 P. Zakrzewski where ?* and ?* stand for the inner and outer measure for ?, respectively - see Los and Marczewski A949). Recall that a set X is small, if there is no ?-finite diffused universal measure on X, i.e., there is no real-valued-measurable cardinal ^ \X\; otherwise X is large. A. Hulanicki A962) used method II to prove the following general result. THEOREM 3.1. Let G be a group of permutations of a set X such that \G\ ^ | X \ and X is small. If ? is an invariant ? -finite diffused measure which is uniform, i.e., every subset of X of cardinality smaller than \ X \ has measure zero, then ? has a proper invariant extension. Pelc A986) applied method I to solve the invariant measure extension problem for the case of an Abelian group acting on itself as follows. THEOREM 3.2. Let G be an uncountable Abelian group acting on itself by left shifts. Then every ?-finite invariant measure on G has a proper invariant extension. Pelc conjectured that the same is true for any uncountable group G. A refinement of Hulanicki's argument that lead to Theorem 3.1 above shows that it is indeed the case when G is small. So one cannot falsify Pelc's conjecture on the basis of ZFC alone, the nonexistence of large sets being relatively consistent with ZFC (see, e.g., Fremlin A993, Ue). It is also known that the conjecture is true for some other algebraically or topologically defined classes of groups (see Zakrzewski A996)) but the general question is still open. Thus the invariant measure extension problem for uncountable groups of permutations is not solved even for the case of the left-shift action of a group on itself. On the other hand, subsequent investigations started in Pelc A986) and then continued in Krawczyk and Zakrzewski A991), brought a satisfactory solution for countable permutation groups. So let ?: ? —> [0, +oo] be a ?-finite diffused measure on X, invariant with respect to a countable group G of permutations of X. A Hulanicki-style argument, based on method II above, shows that if X is small, then ? always has a proper invariant extension. On the other hand, if X is large, then ? can be universal (see the remarks following Theorem 2.2). So assume that ? is defined on a proper ?-algebra ? of subsets of X. We may additionally assume that ? is complete and every invariant subset of X is measurable, since otherwise we can immediately extend ? using methods I or II, respectively. The following result from Krawczyk and Zakrzewski A991) reveals the structure of such measures. We say that a partition ? of a subset ? of X is a G-partition if A e ? implies gA e ? for every g e G. We call ? a minimal G-partition of ? if there is no G-partition ?' ? ? of ? with the property that every element of ? is the union of finitely many elements of P'. It turns out that the structure of ? can be completely described in terms of G-partitions of G-orbits. THEOREM 3.3. Assume that G is a countable group of permutations of a set X and ? : ? —» [0, oo] is an invariant ?-finite complete diffused measure defined on X such that every invariant subset А с X is in the ?-algebra ?. Let S be a selector of G-orbits.
Measures on algebraic-lopological structures 1109 Then there exists a function s ь* Ps, determined uniquely up to a measure zero set, which assigns to each s e S a countable G-partition Ps of the orbit Gs, with the following property: (i) The ? -algebra ? consists of all sets Ac. X such that for ?-almost every s e S, the set ADGs is the union of some pieces of the partition Ps. Moreover: (ii) For any A e ?, ?(?)= ]? к ¦ ?(? C\\J{Gs: s eSandAHGs ePf}). JteNU[3c) where ? is a selector of the family {Ps: s e 5") and Ps consists of sets which are unions of к elements of the partition Ps (oo 0 = 0 and oo-t = oo if t ? 0). (iii) The measure ? does not have a proper invariant extension if and only if ?-almost every partition Ps is minimal. Now the point is that properties of G -partitions of the orbit Gx of a point ? e X are reflected by algebraic properties of G and the stabilizer of x, i.e., the group Gx — {g e G: gx = g]. This is due to the fact that every G-partition ? of Gx has the form ? = {gHx: g e G), where ? is a subgroup of G containing G,- Moreover, such a P is minimal if and only if every proper subgroup of ? containing G.( has infinite index in ? (see, e.g., Kargapolov and Merzljakov A979)). In view of this, we have the following corollary. THEOREM 3.4. Suppose G is a countable group of permutations of a set X. (i) If every nontrivial subgroup ? ofG has a proper subgroup of finite index in H, then every ?-finite nonuniversal invariant measure ? on X such that the action of G is ?-free has a proper invariant extension. (ii) If there exists a nontrivial subgroup ? of G, every proper subgmup of which has infinite index in H, X is large and the action of G on X is free, then there exists a ? -finite nonuniversal invariant measure on X without proper invariant extension. The additive group Q of the rationals is an example of a group every proper subgroup of which has an infinite index. It follows that ifQ acts freely on a large set X, then there exists a nonuniversal invariant ?-finite (or even a probability) measure on X without proper invariant extension. Further applications to measures on Euclidean spaces R", invariant with respect to groups of isometries of W, will be discussed in Section 6. 4. Invariant measures on Polish groups In the previous sections we considered discrete G-spaces and discussed properties of measures which were related only to the algebraic structure of underlying spaces. Now we are going to let topology enter the scene. In order to make the interplay between
1110 P. Zakrzewski measure and topology more transparent we decided to stay within the realm of standard Borel spaces. Let В(У) or ?(?) denote the ?-algebra of Borel subsets of a topological space (?, ?), i.e., the ?-algebra of subsets of ? generated by the open sets of Y. A measurable space (?, ?) is a standard Borel space if either X is countable and ? = V(X) or it is isomorphic to (R, B(R)), i.e., if there is a bijection / between X and R such that A e ? if and only if f[A] e B(R), for A^X. Equivalently, ? = ?(?) for a certain topology ? on X which is Polish, i.e., admits a compatible separable complete metric (see Kechris A995, 12.5)). By a Borel measure on a standard Borel space (?, ?) we mean a measure defined on ?. A Borel measure on a Polish space X is a measure defined on the ?-algebra B(X). Note that if m is a ?-finite Borel measure on a standard Borel space (?, ?), then m is diffused if and only if it is nonatomic, i.e., any set of positive measure splits into two sets, each having positive measure. A Polish group is a topological group whose topology is Polish (a survey of some basic facts from the theory of Polish groups may be found in the book of Becker and Kechris A996)). Examples of Polish groups include (for more examples see Becker and Kechris A996,1.3)): (i) Hausdorff second countable locally compact groups: countable groups (with the discrete topology), Lie groups, the Cantor group Z^; (ii) (X, +), where X is a separable Banach space; (iii) The group of homeomorphisms H(X) of a compact metric space (X,d), where the group operation is composition, with the topology it inherits as a Gg subset of C(X, X), the Polish space of all continuous functions / : X -> X with the sup metric du(f, g) = supx€Xd(f(x), g(x)), for /. g e C(X, X); (iv) The group of isometries lso(X,d) or just Iso(X) of a complete separable metric space (X,d) with the topology of pointwise convergence. It can be proved that if the space (X,d) is compact, then lso(X,d) is a compact subgroup of H(X). By a result of J. van Dantzig and B.L. van der Waerden A928), if the space (X,d) is locally compact and connected, then the group \so(X,d) is locally compact (for recent extensions of the latter result to more general classes of spaces see Clemens et al. B000)). The problem of existence of a ?-finite invariant Borel measure on a Polish locally compact group was solved by Haar A933), as is well known. The uniqueness part of the following result is due to von Neumann A936). THEOREM 4.1. Let G be a Polish locally compact group acting on itself by left shifts. Then there exists a unique, up to a multiplicative positive constant, ?-finite invariant Borel measure on G. This measure is called the (left) Haar measure on G and we will denote it by ??- Similarly, there is a right Haar measure ?'? defined by ?'?(?) = ??(?-*) forSeB(G). In order to realize the essential difference of approach compared with the existence arguments from Section 1, let us briefly sketch a more or less standard proof of existence
Measures on algebraic-topological structures 1111 of Haar measure (the "covering ratios" proof - for details see Parthasarathy A977, Proposition 54.2) or Mycielski A974)). Proof. Fix a compatible metric d in G and let Be denote the open neighborhood of the identity of radius ?. Without loss of generality we assume that ? ? is compact. For any compact set К с G and any ? ^ 1 let ?(?,?) = min{|C|: С is a finite covering of К with left shifts of Be}. and then put ?(?)= lim _ ' ' , n^T E(B\, 1/n) where ? is a nonprincipal ultrafilter of subsets of N. Finally, for every open set U с G, let ?(?/) = sup{A(tf): К С f/, K compact} and for any В е B(G) define ?(?) = inf(?(?/): В с U, U open}. ? From this proof it may be seen that Haar measure is finite on compact sets. In particular, we have the following proposition. PROPOSITION 4.2. If G be a Polish locally compact group, then the following are equivalent: (i) Haar measure on G is finite. (ii) The group G is compact. For a compact group G it is customary to assume that y.c(G) = 1. The uniqueness of Haar measure does not imply that the left and right Haar measures on G coincide (even up to a multiplicative constant), since a (left) Haar measure is in general not right-invariant. On the other hand, left Haar measure is always right-quasi- invariant, i.e., quasi-invariant with respect to right shifts. This follows from the following more general fact, which is also the first step towards von Neumann's proof of uniqueness of Haar measure. PROPOSITION 4.3. Let mi and m,- be arbitrary ?-finite measures on a Polish locally compact group G. If mi is left quasi-invariant and m, is right quasi-invariant, then m/ ~ mr. Consequently, m/ ~ mr ~ \xq.
1112 P. Zakrzewski PROOF. Take an arbitrary set В е B(G). By Fubini's theorem and the appropriate quasi- invariance of the measures under consideration we have mi(B) = 0 <s> mi(g~] ?) = 0 for w/-almost all g e G <?> mr(Bh~]) = 0 for mr-almost all h e G &mr(B) = 0. ? It is known that any Polish group admits a left-invariant compatible metric (the Birkhoff- Kakutani theorem - for a proof see, e.g., Berberian A974, p. 28)). In view of a result of S. Banach A937), if d is a left-invariant compatible metric on a Polish locally compact group G, then (left) Haar measure on G is invariant under all isometries of the metric space (G, d) (for various generalizations of this result see Segal A949), Segal and Kunze A978, Corollary 7.5.1), Bandt A983)). If (left) Haar measure on G is right-invariant, G is called unimodular. Since right shifts are isometries of the space (G,d) if and only if the metric d is right-invariant, it follows that every Polish locally compact topological group admitting a compatible (two-sided) invariant metric is unimodular. Examples of such groups include Abelian groups and compact groups. The next result reveals another important property of Haar measure and is the second step towards the proof of its uniqueness. PROPOSITION 4.4. A Haar measure on a locally compact Polish group is ergodic. PROOF. Let A e B(G) be ?? -almost-invariant. It means that хд = хг1Д,дс-а.е., where Хд is the characteristic function of A in G. By Fubini's theorem there is ho e G such that хд(/?о) = хя-|д(Ло) for дс-а.е. g e G. Equivalently, either gho e A for ??-а.с. g e G or gho <? A for ??-а.с. g e G. By the (right) quasi-invariance of ?? it follows that ??(? \ A) = 0 or ??(?) — 0, respectively. ? Now the uniqueness of Haar measure follows at once from Proposition 1.11. A consequence of uniqueness is that unimodular groups may also be characterized as follows: G is unimodular if and only if Haar measure on G is inversion-invariant, i.e., invariant under the transformation g ь* g_l. (Proof. If ?? is right-invariant, then ?'0 is left-invariant. Then, by uniqueness, there is a positive constant с such that ??(?~]) = ?'?(?) - с ¦ ??(?) for any В e B(G). Now it suffices to take as В any symmetric set, i.e., such that В = B~', of finite positive measure (for example the closure of a symmetric neighborhood of the identity of a sufficiently small diameter) to conclude that с = 1. The other implication is easy.) Another consequence of uniqueness is the existence of a continuous homomorphism A of G into the multiplicative group of positive real numbers satisfying the following property: ??(?8) = ?(8)??(?) for all g e G, A e B(G). The function ? is called the modular function of the group G. Clearly, G is unimodular if and only if A(g) = 1 for all g e G. From this it becomes obvious why compact groups are
Measures on algebraie-topological structures 1113 unimodular. (Proof. If G is compact, then 1 = ??(?) = ??^?) = Mg)^c(G) = A(g) for any g & G.) It is also easy to see that the function ? is the Radon-Nikodym derivative ???/??'?. It should be underlined that the basics of Haar measure theory, surveyed above, were appropriately generalized by A. Weil A940) to arbitrary locally compact groups. Also, Weil's proof of uniqueness is entirely different from that of von Neumann (other uniqueness arguments were given by H. Cartan A940) and D.A. Raikov A942)). A complete account of Haar measure in general locally compact groups may be found in many standard textbooks (for example, see Hewitt and Ross A963), Nachbin A965), HalmosA974)). Now let us turn to Polish groups which are not locally compact. S. Ulam (quoted by J.C. Oxtoby A946)) proved that if G is a Polish, non-locally-compact gmup and m is any left quasi-invariant Borel measure assuming at least one positive finite value, then every open subset of G contains uncountable many pairwise disjoint left shifts of a compact set of positive measure. Combining this with the theorem on existence of Haar measure we immediately obtain a characterization of locally compact groups in terms of invariant measures. THEOREM 4.5. Assume that G is a Polish group. Then the following are equivalent: (i) G admits a (left) quasi-invariant Borel ? -finite measure. (ii) G is locally compact. This characterization can be considerably strengthened. A standard Borel group is a group G together with a ?-algebra ? such that (G. ?) is a standard Borel space and the mapping (x,у) ь* xy~f is a Borel function from G ? G into G. If G is a standard Borel group, then there is, of course, a Polish topology ? such that ? = ?(?) but it is not necessarily true that there is one for which (G. ?) is a topological group. On the other hand, if such a topology exists, then it is unique (see, e.g., Kechris A995, 12.24)). A. Weil A940) (a weaker result for a broader class of groups) and G.W Mackey A957) proved that // G is a standard Borel group and there exists a quasi-invariant Borel ?- finite measure on G, then there is a topology ? giving its Borel structure and making G a Polish locally compact group. The idea of a proof is to use a quasi-invariant Borel probability measure ? on G to embed G into the unitary group U(H) of the Hubert space ? = L2(G, B(G), ?) (see, e.g., Parthasarathy A977, §56)). The Mackey-Weil theorem shows that Polish locally compact groups can be characterized in the following way. THEOREM 4.6. Assume that G is a standard Borel group. Then the following are equivalent: (i) G admits a quasi-invariant Borel ? -finite measure. (ii) G carries a Polish topology which gives its Borel structure and makes it a locally- compact group. Though, as we have just seen, in a non-locally-compact Polish group G, for any ? -finite Borel measure on G, the ?-ideal in B(G) of measure zero subsets of G is neither left nor
1114 P. Zakrzewski right invariant, it turns out that every Polish group G carries a certain natural (two-sided) invariant ?-ideal in B(G). The following concept was introduced by J.PR. Christensen A972) for the Abelian case and then extended by F. Tops0e and J. Hoffmann-J0rgensen A980), and J. Mycielski to all Polish groups. We say that a Borel (or, more generally, a universally measurable, i.e., measurable with respect to the completion of every ?-finite Borel measure on G) subset A of G is Haar null, if there exists a Borel probability measure m on G such that m(gAh) = 0 for all g.heG. It is not difficult to prove that in Polish locally compact groups Haar null Borel sets coincide with the Borel sets of Haar measure zero. In an arbitrary Polish group G, Haar null Borel sets form a ?-ideal in B(G) which is (two-sided) invariant, i.e., closed under left and right shifts (the closure under countable unions requires proof- see Christensen A972)). Another similarity of Haar null sets with the Haar measure zero sets, at least for Abelian groups, is exhibited by the following result of Christensen A972) (for the case of a (not necessarily Abelian) locally compact group this is the well known Steinhaus property which is due to H. Steinhaus A920) in the case of G = Ш and to Weil A940) in general). If G is an Abelian Polish group and A is a Borel (or universally measurable) set which is not Haar null, then the set AA~' contains an open neighborhood of the identity of G. It seems to be open, whether the same is true for a broader class of Polish groups. Motivated by another property of the Haar measure zero sets in locally compact groups Christensen asked, whether every collection of pairwise disjoint, universally measurable sets which are not Haar null is countable. R. Dougherty A994) obtained negative answer for many particular non-locally compact groups. Finally, Solecki A996) proved that // G is a Polish non-locally-compact group admitting a (two-sided) invariant metric, then there exists an uncountable collection of closed, pairwise disjoint sets which are not Haar null. This leads to another characterization of the local compactness of a Polish group (admitting an invariant metric) in measure theoretic terms. THEOREM 4.7. Assume that G is a Polish group, admitting an invariant metric. Then the following are equivalent: (i) The ? -ideal of Haar null Borel subsets of G satisfies the countable chain condition in B(G). (ii) G is locally compact. It again seems to be open, whether the same is true for a broader class of Polish groups. Let us close this section by pointing out an important role played in the results above by the assumption that invariant measures under consideration are ?-finite. Namely, Oxtoby A946) proved the following theorem. THEOREM 4.8. Every Polish group carries a Borel semi-finite invariant measure. 5. Invariant measures on Polish G-spaces A G-space X is called a standard Borel G-space if G is a standard Borel group, X is a standard Borel space and the action of G on X is a Borel function from G ? X into X.
Measures on algebraic-lopological structures 1115 If G is a Polish group, X is a Polish space and the action is continuous, then X is called a Polish G-space. By a fundamental theorem of Becker and Kechris A996, Theorem 5.2.1) // G is a Polish group, then any standard Borel G-space (?, ?) is Borel isomorphic to a Polish G-space, i.e., ? = ?(?) for a certain Polish topology ? on X which makes the action of G on X a continuous function from G ? ? into X. Polish G-spaces have long been studied in several areas of mathematics including ergodic theory, operator algebras and group representation theory. Of particular interest are actions of Polish locally compact groups which cover the case of the group of integers acting on X via the map (к, x) ь* Tkx, where Г is a homeomorphism of X, flaws, i.e., R-actions and actions of Lie groups. Other examples of Polish G-spaces include: (i) The action of a Polish group G on itself by (say) left shifts. (ii) The canonical action of a Polish group G on the quotient space G/H of all left- cosets of a closed subgroup ? of G (it is a Polish space with the quotient topology consisting of all U с G/H such that n~' [U] is open in G, where ?(?) = ? ? is the canonical projection) given by (g, xH) ь* (gx)H, for g.x eG. (iii) The action of the group of homeomorphisms G = H(X) on a compact metrizable space X given by (g, x) ь* g(x) for g e H(X) and ? e X. (iv) The action of the group of isometries G = Iso(X) on a Polish space X given by (g,x)i-+ g(x) for g elso(X) and ? e X. An excellent account of the topic (and more examples) is given in Becker and Kechris A996). The case when G is locally compact is the most important one as far as invariant measures are concerned. There is, however, a general result valid for Borel actions of arbitrary Polish groups, which in this case strengthens the existence theorems from Section 1. Recall that Tarski's theorem asserts that the existence of a finitely additive probability universal invariant measure on a G-space X follows already from the assumption that the set X is not paradoxical. Becker and Kechris A996, Theorem 4.2.1) and earlier M.G. Nadkarni A990) in the case G is countable, proved the following exact analog of Tarski's theorem for countably additive measures on Polish G-spaces. THEOREM 5.1. Let G be a Polish group and X a standard Borel G-space. Then the following are equivalent: (i) There exists an invariant Borel probability measure on X. (ii) The space X is not countably paradoxical in Borel pieces. Note that by the result of Becker and Kechris, quoted at the beginning of this section, we may assume that X is actually a Polish G-space. Then the crux of the proof of the nontrivial implication ((ii) => (i)) lies in the case G is countable. For assume that we have already proved the theorem for the countable case. Let Go be a countable dense subgroup of G and assume that X is not countably G-paradoxical in B(X). Then X is not countably Go-paradoxical and by the countable case there exists a Go-invariant probability Borel measure ? on X. It turns out that ? is also G-invariant, due to the following general fact which may be established by a straightforward continuity argument. In a Polish G-space any Borel probability measure invariant under a countable dense subgroup ofG is G-invariant (it should be noted that the latter is not true for ? -finite measures).
1116 P. Zakrzewski The countable case was proved by Nadkarni for Z-actions but, as noted by Becker and Kechris, the argument works as well for any countable group with some easy modifications in the proof. The following outline is extracted from Becker and Kechris A996, Section 4). Let us assume that G is a countable group of homeomorphisms of a Polish space X and that ? ? Nc, the ?-ideal in B(X) consisting of all G-negligible Borel subsets of X, i.e., sets A e B(X) such that X contains pairwise disjoint subsets A„ e B(X), ? e N, with each A„ ~oc A in B(X). This is equivalent to the assertion that X is not countably paradoxical in B(X). We assume, moreover, that all orbits are infinite, since otherwise the result holds trivially. Denote by B' (X) the ?-algebra of invariant Borel sets and recall that since the group G is countable, A* = {Jg€G gA is the minimal invariant set containing a Borel set А с X. The idea of the proof is similar to that of the proof of the Hopf-Kawada theorem (Theorem 1.2), outlined in Section 1. Given sets А, В e B(X) we write A < В if there is С е B(X) with С ~oc A in B(X), С с ? and for every ? e В, С П Gx is a proper subset of В П Gx. Given sets A, B,C e B(X) and ? e N we write A = nB®C if A = ? ? U · · · U An U С where ? ?,..., ?„, С are pairwise disjoint Borel sets with each A, ~XB inB(X). The following two lemmas are analogs of Lemmas 1.3 and 1.4. LEMMA 5.2. For every Borel sets А, В с X there is a partition (up to a set from Nc) a*=\Jq„ of A* into pairwise disjoint Borel sets Qn e ?' (?), ? e N, such that for each ? e N А П Qn = n(B П Q„) ? tf„, wfcere tf„ < ? ? ?>„. Moreover, the partition is unique, up to a set from Nc- For every Borel sets ?, ? с X and .? e X, let ?, if л: е Q„, [A/B](x)=ln . 1 0, otherwise, in the notation of the preceding lemma. LEMMA 5.3. There is a decreasing sequence ofBorel sets Cn with Co = X< C,* = X and [C,,/C,,+ |](.x) ^ 2 modulo ?q, for every ? e N. Taking the lemmas above for granted, the proof is carried out as follows.
Measures on algebraic-topological structures 1117 Fix a sequence (C„) from Lemma 5.3 and for any A e B(X) and ? e X let [А/С„](дО m(A,;t) = hm . It can be proved that this definition is correct for all ? outside a set from Nc and the function m(A, x) has the following properties: (i) 0 ^ m(A, ?) ? 1 and m(X, x) = 1; (ii) m(A, x) — 0 modulo Nc if and only if A e Nc; (iii) m(gA,x) = m(A, gJt) = w(A, x) modulo Nc; (iv) m(|J,ieN A„,;t) = HT=am(An<x) modulo Nc, whenever A„, ? e ?, arepairwise disjoint Borel sets; (v) if Q is an invariant Borel set and ? e Q, then m(A, x) = m(A П Q,x) modulo Nq. Now fix an invariant countable open base U for the topology of X and let ?(?) = infl ]? ?(?„): A„eW. А с (J A„ for A e B(X), ^;ieN /? eh' ' where ? :U ^> [0, oo] is defined by T(A) = m(A,xQ), for a suitably chosen *о е X, so that the function ? is an invariant probability Borel measure on X (the point is that checking this requires only countably many conditions of the form (iii) and (iv) above plus w@, .vq) = 0, hence the set ? of exceptional points is in Nc and, since X & No, we may choose *? e X \ E). For details see Nadkarni A990), Becker and Kechris A996). As an immediate consequence of Theorems 4.1, 4.5 and 5.1, and Proposition 4.2 we have the following characterization of compact Polish groups. THEOREM 5.4. If G is a Polish group, then the following are equivalent: (i) G is not countably paradoxical in Borel pieces. (ii) G is compact. Let us now consider actions of Polish locally compact groups. Classically, of particular importance is the case when a locally compact group G acts continuously on a compact metric space X, i.e., when X is a compact Polish G-space. N.N. Bogoliouboff and N.M. Kryloff A937) proved that every compact Polish Ъ-space admits an invariant Borel probability measure. This was later extended by S.V. Fomin A950) to actions by solvable groups. More generally, we say that a Polish locally compact group G is topologically amenable if there exists a finitely additive invariant Borel probability measure on G. Thus every compact group is topologically amenable, the left Haar measure being a witness of
1118 P. Zakrzewski its amenability. Other examples include Abelian and solvable groups (see Wagon A986, p. 162)). (According to a theorem of A.L.T. Paterson A986) topologically amenable groups are characterized among Polish locally compact groups as exactly these groups which are not paradoxical in Borel pieces - this should be compared with Theorem 5.1 above.) The following general theorem (see, e.g., Greenleaf A969) or Plante A983)) establishes a necessary condition for the existence of invariant Borel probability measures on compact Polish G-spaces. THEOREM 5.5. Let G be a Polish locally compact topologically amenable group. Then every compact Polish G-space X admits an invariant Borel probability measure. A functional analytic proof of Theorem 5.5 is based on the fact that a locally compact topologically amenable group G satisfies the following version of the Markov-Kakutani fixed-point theorem: if G acts continuously on a compact convex subset К of a locally convex topological space ? so that all the transformations ? ь* gx, g e G, are affine (i.e., g(ax + A — a)y) = agx + A — a)gy for x,y e К and 0 ^ a ^ 1), then they have a common fixed point (see, e.g., Wagon A986, p. 163)). For an arbitrary Polish space X, let P(X) be the space of all probability Borel measures on X, equipped with the topology generated by the maps m ь* f f dm, where/ varies over the set of all bounded continuous real-valued functions on X. Then P(X) is a Polish space. If, moreover, X is compact, then so is the space P(X) and, by the Riesz Representation Theorem, it may be viewed as a convex, compact subspace of the dual space C(X, K)* equipped with its weak*-topology (see Kechris A995, Section 17E). Now the proof runs as follows: let ? = C(X, Ш)*, К = P(X) and consider the action of G on К defined by g? = \xg~', where \xg~' (A) = ?(g~' A) for A e B(X). Since this action is continuous, the Markov-Kakutani theorem, quoted above, gives a common fixed point ? e P(X), which is an invariant probability Borel measure on X. (This should be compared with the proof of Theorem 1.7 of Hajian and Ito, outlined in Section 1.) Since the group Iso(X) of all isometries of a compact metric space (X, d) is compact, hence topologically amenable, and X is a compact Polish Iso(X)-space, we have the following corollary. PROPOSITION 5.6. IfX is a compact metric space, then there exists an isometry-invariant Borel probability measure on X. Returning to general considerations, suppose that G is a Polish locally compact group and X is an arbitrary Polish G-space. Denote by INVc the set of all invariant probability Borel measures on X. Then INVc is a convex subset of the Polish space P(X). Let EINVc denote the set of all ergodic measures ? e INVc · It follows from Theorem 5.1 that Nq is the collection of all Borel sets A^ X such that ?(?) = Ofor every ? e INVG. (Proof. Without loss of generality assume that the group G is countable. If A ? Nc, then A* <? Nq and by Theorem 5.1 there is a measure ? e INVc with ?(?*)= 1.) A thorough study of INVG was undertaken by R.H. Farrel A962) and VS. Varadarajan A963). Their main results may be summarized as follows.
Measures on algebraic-lopological structures 1119 THEOREM 5.7. Let G be Polish locally compact group and X a Polish G-space. (i) Any measure ? e INVc is determined by its values on B' (X). Moreover, //INVc ? 0, then any probability measure defined on B'(X) which vanishes on every set AeB1 (X) П No has the unique extension to a certain measure ? e INVc · (ii) If there exists an invariant pmbabilit}· Borel measure on X, then there also exists an ergodic one. (iii) The sets INVc and EINVG are Borel in the space P(X). (iv) (A uniform ergodic decomposition theorem) If INVc ? 0, then there is a Borel mapping ?, ? : ? ь* ?? from X onto EINVc such that • the mapping ? is constant on G-orbits, i.e., figx = Д, for ? e X, g e G; • for every ? e EINVG, ifXv = {x e ?: ?? = ?), then ?(?,) = 1; • for any ? e INVc, ?(?)= / ??{?)??(?) for all A eB(X). (*) Jx Moreover, the decomposition map ? is uniquely determined modulo No- Some remarks are in order concerning the above result. • Assertion (i) is related to Theorem 1.5. The point is that when G is locally compact, then for any Borel subset A of a Polish G-space X there exists a set A e B' (X) with ?(? ? A) = Ofor any ? e INVG such that A e B*(X) (where B*(X) is the ?-algebra of all Borel subsets of X which are almost invariant with respect to ?). It follows that the values of any measure ? e INVc on B*(X) are determined by its values on B'(X). This also implies that a measure ? e INVG is ergodic if and only if?(?) = 0 or 1 for any A eB'(X). • In the "moreover" part of (i) the assumption that INVc ? 0 may be omitted. Indeed, in view of Theorem 5.1, this already follows from the existence of any probability measure defined on B' (X) and vanishing on each ? &?'(?)??? (since this implies XiNG). • Assertion (ii) is related to the fact that when G is locally compact, then EINVc is the set of all extreme points of the convex set INVc (this is known to be false for arbitrary Polish groups). • In assertion (iv) we obtain a partition of X into invariant Borel sets Xv with the property that (by (*)) ? e EINVc is the only invariant Borel probability measure on the standard Borel G-space X,., i.e., Xv is uniquely ergodic. Note that this does not imply that Xv is a single G-orbit. But it is so if, for instance, the group G is compact (see remarks following Theorem 5.8 below); in this case the partition {Xv: ? e EINVc} consists of all G-orbits and if Xv = Gxo, then the measure ? is defined by v(A) = Mc({?eG: gx0 e A)) forAeB(X), where ? с is the Haar measure on G.
1120 P. Zakrzewski Some generalizations of the Farrel-Varadarajan's results to quasi-invariant measures were obtained by Y.I. Kifer and S.A. Pirogov A972), K. Schmidt A977), A. Ditzen A992) and G. Greschoning and K. Schmidt B000). So far in this section we have been looking at invariant probability measures only. The case of ?-finite measures is less clear in general but it is best understood when the group G is locally compact. Given a Polish G-space X let us say that X is (G-)homogeneous if the action of G is transitive, i.e., if X coincides with the only G-orbit. Recall that ? is the modular function of the group G and G.< = {g e G: gx = x} is the stabilizer of a point ? & X. Note that a transitive action of G on a Polish space X can be identified in a natural way with the canonical action of G on the quotient space G/Gx; thus all homogeneous G-spaces are essentially of the form G/H, where Я is a closed subgroup of G. The study of invariant Borel measures on homogeneous G-spaces was undertaken by A. Weil A940). The main results may be summarized as follows. THEOREM 5.8. Suppose that G is a Polish locally compact group and X is a Polish G-homogeneous G-space. Fix a point xq e X and denote by ? the modular function of the group GX0. (i) There exists a quasi-invariant probability Borel measure on X and any two such measures are equivalent. (ii) There exists an invariant ?-finite Borel measure on X if andonly ifS = A\GXo, the restriction of ? to G ,<·„· Such a measure, if it exists, is unique, up to a constant factor, and ergodic. PROOF (Sketch). In order to prove (i), it suffices to take a probability Borel measure ? ~ ?? on G and then define v(A) = ?({# e G: gx0 e A}) for A e B(X). The equivalence of any two quasi-invariant ?-finite measures and the uniqueness of an invariant ?-finite measure can be proved by a Fubini-type argument similar to that used for the case of the left action of G on itself - see Propositions 4.3 and 4.4. So the main part of the proof is to show that under the assumption that A\G.f0 = <5, there is a non-negative Borel function ?: X —>¦ R such that the measure ? defined by ?(?)= / p(x)dv(x) forAeB(X). JA is G-invariant. For details see, e.g., Parthasarathy A977, Proposition 55.6). ? In particular, a unique invariant Borel probability measure on a Polish homogeneous G-space X exists, if the group G is a compact. On the other hand, X may possess such a measure without G being compact - conditions for its existence were studied by S.P Wang A976) and T.S. Wu and J.S. Yang A986). If G is a unimodular locally compact Polish group but ? is its closed subgroup which is not unimodular, then the quotient G-space G/H admits no invariant ?-finite Borel measure (an example is given in Parthasarathy A977, Exercise 55.8)).
Measures on algebraic-topological structures 1121 A special case of a homogeneous G-space is a homogeneous Polish metric space (X,d), i.e., an Iso(X)-homogeneous Polish Iso(X)-space. Recall that the group Iso(X) is Polish but not necessarily locally compact, even if X is so. However, Banach A937) proved that // X is a homogeneous Polish locally compact metric space, then there exists an isometrically invariant Borel ? -finite measure ? on X. The existence of Borel measures on metric spaces with various invariance properties was also studied by L.L. Loomis A945), J. Mycielski A974, 1977), С Bandt A983), С Bandt and G. Baraki A986) and S. Gao and A. S. Kechris B001, Theorem 6.10). The more general problem of existence of a Haar-like Borel measure on a locally compact topological space X, invariant with respect to a group G of homeomorphisms of X, have also been considered by several authors. The following general result is due to R.C. SteinlageA975). THEOREM 5.9. Let X be a Polish locally compact space and G a group of homeomorphisms of X onto itself Suppose that, for every disjoint pair of compact subsets of X, there is a nonempty open set U С X such that no image ofU under any element ofG intersects both of the compact sets. Then there exists an invariant ?-finite Borel measure ? on X. In particular, the condition holds ifG is a group of isometries of a Polish locally compact metric space (X, d). In this case, if at least one G-orbit is dense in X, then ? is positive on open sets, finite on compact sets and is a unique, up to a constant factor, invariant Borel measure on X with these properties. As a matter of fact, Steinlage proved the theorem in a more general framework of arbitrary locally compact Hausdorff spaces - it strengthened earlier results of Banach A937), Segal A949), Segal and Kunze A978, Theorem 7.2), Y. Mibu A958) and J. Poncet A954) (see also J.W. Roberts A974/75) where a similar result for compact spaces is proved). Even if a given Polish G-space is not homogeneous, we can apply Theorem 5.8 to the induced action of G on an arbitrary G-orbit (which is an invariant Borel subset of X - see Becker and Kechris A996, 2.3.3)). In particular, it follows that if G is a Polish locally compact group, then every Polish G-space X admits a quasi-invariant Borel probability measure. But in fact the latter can also be proved by another method which gives some new information (see, e.g., Kechris A994, Proposition 3.30)). PROPOSITION 5.10. Let G be a Polish locally compact group, X a Polish G-space and m a probability Borel measure on X. Then there exists a quasi-invariant probability Borel measure m on X with the following properties: (i) m'(A) = m(A)foranyAeBi(X); (ii) m' <g ? for any quasi-invariant probability Borel measure ? on X with m«n; (iii) m is ergodic if and only if m is ergodic. PROOF. Let ? с be a (left) Haar measure on G and take a probability Borel measure ? = ?? on G. Put m'(A)= J m(g-,A)d?(g) forAeB(X). It is easy to see that this works. D
1122 P. Zakrzewski Let us finally briefly touch the case when G is a countable group and X is a standard Borel G-space. In this case the existence of an invariant (or quasi-invariant) probability (or ?-finite) Borel measure on X is really a property of the orbit equivalence relation Eg, whose equivalence classes are the G-orbits. Let us say that an equivalence relation ? on a set X is countable if every equivalence class is countable. The equivalence relation ? с is Borel (as a subset of ? ? ?) and countable. Conversely, J. Feldman and C.C. Moore A977) proved that if ? is a countable Borel equivalence relation on a standard Borel space X, then there is a countable group G and a Borel action of G on X such that ? = Eq. Now, given such an E, a ?-finite Borel measure ? on X is called ?-invariant (E-quasi-invariant, respectively) if for any Borel set А с X and a Borel 1-1 map f:A-+ X with f(x)Ex for all ? e A, we have ?(f[A]) = ?(?) (?(/[?]) = 0 <s> ?(?) = 0, respectively). It is easy to check that this is equivalent to the assertion that ? is G-invariant (G-quasi-invariant, respectively) for any countable group G and a Borel action of G on X with ? = Eq. As we have already seen (Theorem 5.1), Nadkarni A990) proved that the existence of an ?4nvariant probability Borel measure on X is equivalent to the fact that X is not countably paradoxical in Borel pieces with respect to any Borel action of a countable group inducing E. Let us say that ? is compressible if there is a Borel map /: X —>¦ X such that for each ? e X, f[[x]E] is a proper subset of the equivalence class [x]e of x. Now, Nadkarni's theorem may be stated in the following way, with no appeal to any action that induces E. THEOREM 5.11. Suppose ? is a countable Borel equivalence relation on a standard Borel space X. Then the following are equivalent: (i) There exists an ?-invariant Borel probability measure on X. (ii) ? is not compressible. Hence the existence of an f-invariant Borel probability measure on X tells us something about the structure of E. On the other hand, a ? -finite ?-invariant Borel measure on X trivially exists for any countable Borel equivalence relation ? and it may even be proved that there always exists a diffused one, provided that ? has uncountably many equivalence classes. Proof. By a theorem of J. Silver A980), since ? is a Borel equivalence relation on a standard Borel space X with uncountably many equivalence classes, there is a nonempty perfect set S с X which meets every equivalence class in at most one point. Let m be a Borel probability diffused measure on S and suppose G = {g„: ? e N}, go = id*, is an arbitrary countable group of Borel automorphisms of X with Ec — E. Let S„ = g„S and define an f-invariant ?-finite Borel measure ? on X as follows: ?(A) = YJm(sng;l AnU,\\JSk forAeB(X). However, the existence of an ?-quasi-invariant ?-finite diffused Borel measure which is additionally E-ergodic, again expresses an important property of the equivalence relation E.
Measures on algebraic-topological structures 1123 We say that a ?-finite Borel measure ? on X is E-ergodic if ?(?) = 0 or ?(? \ A) = 0 for any invariant A e B(X), where a subset А с X is called invariant if it is the union of all equivalence classes of its elements. This is equivalent to asserting that ? is G-ergodic for any countable group G of Borel automorphisms of X with ? = Eg ¦ It follows from Proposition 5.10 that if there exists a diffused E-ergodic probability Borel measure on X, then there also exists an ?-quasi-invariant one. A Borel equivalence relation ? on a standard Borel space X is called smooth if there is a standard Borel space ? and a Borel map f.X^Y with xEy <s> f(x) = f(y) for any x, у e X. It might be proved that// ? is countable, then ? is smooth if and only if ? has a Borel transversal (see Kechris A995, 18.D)). The following theorem is closely related to the so-called Glimm-Effros dichotomy - see Becker and Kechris A996, Section 3.4). THEOREM 5.12. Suppose ? is a countable Borel equivalence relation on a standard Borel space X. Then the following are equivalent: (i) There exists an ?-quasi-invariant, ergodic, diffused Borel probability measure onX. (ii) ? is not smooth. It is not difficult to prove that if a countable aperiodic (i.e., with all equivalence classes infinite) Borel equivalence relation ? is smooth, then it is compressible. But if ? is not compressible, then by Theorems 5.11 and 5.7(ii), there even exists an ?-invariant, ergodic, diffused Borel probability measure on X. An example of a nonsmooth compressible ? is the Vitali equivalence relation on R (i.e., xEy «==> ? — у е Q). 6. Isometncally invariant measures on Euclidean spaces The study of isometncally invariant measures on the Euclidean space W is particularly appealing, since it is directly connected to the search for a "good" notion of a measure that would generalize the notions of the length of an interval, the area of a region and the volume of a solid. Though the Lebesgue measure ?„ : ?„ -> [0, oo] has been widely accepted as the ultimate solution to the problem of finding a satisfactory way to "measure" complicated subsets of E", we have to face the fact that some Lebesgue nonmeasurable sets still exist (at least if we accept the axiom of choice - see Wagon A986, Chapter 13) for a discussion of this point). One way to cope with this disadvantage is to consider isometrically invariant extensions of ?„. Let G be a subgroup of Iso(R") - the group of all isometries of R". Recall from Section 3 that given an invariant ?-finite measure ? defined on a ?-algebra ? of subsets of M." there are two basic ways of obtaining an invariant extension Д : ?' —>¦ [0, +00] of ?. The first is to find a nonmeasurable set such that the G-invariant ?-ideal I it generates in V(R"), consists of sets of inner ?-measure zero. The extension ? is then defined on the ?-algebra generated by ? UI so that ?(?) = 0 for any N e I (see (Szpilrajn, 1935)). The second boils down to finding a ?-almost invariant set A ? ? and then defining ? on the ?-algebra generated by ? and A (see Los and Marczewski A949)).
1124 P. Zakrzewski The existence of a proper isometrically invariant extension of ?„ was proved by Szpilrajn (Marczewski) A935) using the first method. Szpilrajn constructed a generalized Bernstein set ? с R" with the property that any countable union of isometric copies of Z, is neither contained in nor disjoint from an arbitrary nonempty perfect subset ? of R". In the same paper Szpilrajn stated the problem posed by Sierpinski, whether every isometrically invariant extension of Lebesgue measure in R" has a proper isometrically invariant extension. It is well known that if the continuum is large, then there exists an extension of ?„ to a universal measure, i.e., a measure defined on V(R") (see, e.g., Fremlin A993, 1D)). However, no such extension can be isometrically invariant. Indeed, the classical Vitali subset of R" is nonmeasurable with respect to any translation-invariant extension of ?«. (In fact, by a theorem of Khararazishvili A978), every set of positive Lebesgue measure contains a subset nonmeasurable with respect to any translation invariant extension of ? ?.) Moreover, by Theorem 2.2, no universal, translation invariant, ?-finite measure on R" exists at all. Thus no isometrically invariant extension of ?„ is universal and Sierpinski's problem amounts to asking if a maximal one exists. The negative answer, i.e., the theorem that no maximal isometrically invariant measure on R" exists was first obtained independently by S.S. Phakadze A958) and A. Hulanicki A962) under the additional assumption that the continuum is small (this is closely related to Theorem 3.1). Then Khararazishvili A980) proved it without any additional set-theoretical assumptions for the one-dimensional case. Finally, K. Ciesielski and A. Pelc A985) got the negative answer in any dimension, showing even more than was required. THEOREM 6.1. Suppose G is any subgroup o/Iso(R") containing all translations. Then every G-invariant ?-finite measure on R" has a proper G-invariant extension. The Ciesielski and Pelc's proof is also based on the first method of extending invariant measures, described above. More precisely, it is showed that the space R" is a countable union of G-absolutely negligible sets, where a set N с R" is called G-absolutely negligible if the G-invariant ?-ideal, it generates in VCR"), consists of sets of inner measure zero with respect to every G-invariant ?-finite measure on R". Now, given a ?-finite G- invariant measure ?: ? —>¦ [0, oo] on R", if R" = IJisN ^' -wnere eacri ? 's G-absolutely negligible, then N,0 ? ? for at least one /'о е N. The idea of partitioning R" into countably many G-absolutely negligible sets is due to Khararazashvili A983) who used it for the case when G is the group of all translations of R". A more detailed account of the long history of Sierpinski's problem can be found in Ciesielski A989, 1990). In view of Theorem 6.1, Ciesielski and Pelc A985) stated the problem of characterizing those subgroups G of Iso(R") for which the conclusion of their result remains true. Using ideas of Hulanicki A962) they noticed that if the continuum is small, then no restrictions on G are needed. (For the reasons, explained at the beginning of Section 3, we will assume that all measures, we want to extend, are diffused.) Ciesielski A990) proved, in particular, that it is enough to require that either the action of G is transitive (this was earlier proved by Khararazishvili under the additional assumption of the Continuum Hypothesis) or G contains uncountably many translations. In fact, in the same paper Ciesielski studied a more general question: given a subgroup G of
Measures on algebraic-topological structures 1125 Iso(R") and a ?-finite G-invariant diffused measure ? on R" does there exist a proper G- invariant extension of ?? He proved that it is so, provided that G contains an uncountable subset ? with the property that every two different elements of ? coincide exactly on a set of inner measure ? zero. Finally, Zakrzewski A990) noticed how to apply the key technical tool from Ciesielski's paper to get the following strengthening. THEOREM 6.2. Suppose G is a subgmup o/Iso(R") and let ? be a ?-finite G-invariant diffused measure on R". If ? does not vanish on the set of points with uncountable G-orbits, then ? has a proper G-invariant extension. As a corollary we have the following solution to Ciesielski-Pelc's problem. THEOREM 6.3. Let G be a subgroup o/Iso(R"). The following are equivalent: (i) Every G-invariant ?-finite diffused measure on R" has a proper G-invariant extension. (ii) Either the continuum is small or exactly one of the following holds: • all G-orbits are uncountable: • there exists a common fixed point of all elements of G and all other G-orbits are uncountable. Theorem 6.3 reduces the problem of finding necessary and sufficient conditions for extendabilityofagiven G-invariant ?-finite diffused measure on R" to the case in which G is countable. This case was fully examined by Krawczyk and Zakrzewski A991) as a particular instance of the general problem, whether a given ?-finite measure, invariant with respect to a countable group of permutations of a set X, has a proper invariant extension. The solution to the latter was presented in Section 3 (see Theorems 3.3 and 3.4). Combining it with Theorem 6.3 leads, in particular, to the following characterization. THEOREM 6.4. Let G be a subgroup o/Iso(R"). The following are equivalent: (i) Every nonuniversal G-invariant ?-finite diffused measure on R" has a proper G-invariant extension. (ii) The set of all points ? e R" such that the G-orbit Gx is countable and there is a nontrivial minimal G-partition ofGx is small. On the other hand, Krawczyk and Zakrzewski A991) obtained the following theorem, related to the original Sierpinski's problem. THEOREM 6.5. Suppose G is any subgroup of Iso(R"). Then every nonuniversal G-invariant ?-finite extension of the Lebesgue measure in R" has a proper G-invariant extension. Note that when we consider arbitrary groups of isometries, we have to assume that a G-invariant extension of ?„, we want to extend, is nonuniversal. Indeed, as has already been mentioned, if the continuum is large, then ?„ can be extended to a universal measure. But as a matter of fact such an extension is rather poorly invariant with respect to isometries, as another result from Krawczyk and Zakrzewski A991) shows.
1126 P. Zakrzewski PROPOSITION 6.6. Assume that the continuum is large and let G be a subgroup of Iso(E"). Then the following are equivalent: (i) There is an extension of the Lebesgue measure to a G-invariant universal measure onR". (ii) The group G is discrete. Here "discrete" refers to the topology inherited from Iso(E") and is equivalent to the fact that each G-orbit consists of isolated points. For example, we can imagine an extension of the Lebesgue measure to a universal measure invariant with respect to all translations by the integers but this definitely is a very weak property. To emphasize the fact that the non-discreteness of a group G implies that certain extensions of ?„, even by countably many new sets, cannot be G-invariant, let us quote an observation from Zakrzewski A991b): IfG is a non-discrete subgroup o/Iso(E"), then every open set А С ?" is countably G-equidecomposable with R" (in arbitrary pieces). Until now we discussed the basic question, whether a given invariant measure can be properly extended in the invariant way. But there are also some interesting results concerning particular properties of such extensions. For example, if we move from measure spaces to the respective measure algebras, then we see that extending a measure ?:?—> [0, +oo] to ?: ?' —>· [0, +oo] does not necessarily mean extending the associated measure algebra. More precisely, it may happen that WYeZ'BXeE ?(???) = 0. (*) This means that the canonical embedding e: ?/? —>¦ ?'/? of measure algebras defined by e([A]M) = [?]? is an isomorphism. Let us call such an extension weak, otherwise call it strong. Clearly, the first of the two methods of extension, recalled at the beginning of this section, produces a weak extension, while the second gives a strong one. It was already realized by Szpilrajn A935) that there exists a strong isometrically invariant extension of the Lebesgue measure ?„ on R" . Every Iso(E")-invariant weak extension of ?„ is Iso(E")-ergodic. Khararazishvili A983) proved that there exists a strong Iso(E" yinvariant extension ?/?„ which is Iso(E")- ergodic. Another property of a weak extension is that it does not change the character of the measure space (?", ?,?), i.e., the smallest cardinality к of a collection {Aa: a < к) с ? such that for any set A e ? and every ? > 0 there is a < к with ?(? ? ??) < ?. For the Lebesgue measure the character is Ко and the question, whether there exists a translation invariant extension of the Lebesgue measure on ? of uncountable character, was already stated by Szpilrajn A935). The affirmative answer was given by K. Kodaira and S. Kakutani A950). In fact, Kakutani and Oxtoby A950) constructed a translation invariant extension of ? ? of the maximal possible character (see also Hewitt and Ross A963, 16.3), where a generalization to the case of Haar measure on a compact metric group is proved). THEOREM 6.7. There exists a translation invariant extension of the Lebesgue measure on R of the character 22N|.
Measures on algebraic-lopological structures 1127 Using the same technique Khararazishvili A978) proved that there is an Iso(R")- invariant extension ? : ?' —>¦ [0,+oo] of the Lebesgue measure on R" of the character 2? such that no restriction of ? to an lso(R")-almost invariant (with respect to ?) set ? e ?' is ergodic. After realizing that the methods by which Sierpinski's problem was solved lead to weak extensions, Ciesielski A990/91) posed the problem, whether every isometrically invariant ?-finite measure on R" has a strong isometrically invariant extension. The affirmative answer is given in Zakrzewski A995/96), where it is also proved that ifG is any subgroup o/Iso(R"), then every complete ?-finite G-invariant measure on R" which has a proper G-invariant extension, has a strong G-invariant extension as well. We conclude this section with another result from Ciesielski and Pelc A985): If the continuum is large, then there exists a universal semi-finite isometrically invariant measure onR". References Arnold, K. A968), On ?-finite invariant measures, Z. Wahrscheinlichkeitsth. verw. Geb. 9, 85-97. Banach, S. A937), On Haar's measure. Appendix to Theory of the Integral by S. Saks, 2nd rev. edn, Monografie Matematyczne, Vol. 7, Warszawa-Lwow. Banach, S. and Tarski A. A924), Sur la decomposition des ensembles de poits en parties respectivement congruents. Fund. Math. 6, 244-277. Bandt, С A983), Metric invariance ofHaar measure, Proc. Amer. Math. Soc. 87, 65-69. Bandt, C. and Baraki, G. A986), Metrically invariant measures on locally homogeneous spaces and hyperspaces. Pacific J. Math. 121, 13-28. Becker, H. and Kechris, A.S. A996), The Descriptive Set Theory of Polish Group Actions, London Math. Soc. Lecture Note Series, Vol. 232. Berberian, S.K. A974), Lectures in Functional Analysis and Operator Theory, Springer-Verlag, New York. Bogoliouboff, N.N. and Kryloff, N.M. A937), La theorie generate de la mesure dans son application a I'etude des systems dynamiques de la mecanique non-lineare, Ann. of Math. 38, 65-113. Calderon, A.P. A955), Sur les mesures invariantes, С R. Acad. Sci. Paris 240, 1960-1962. Cartan, H. A940), Sur la mesure de Haar, С R. Acad. Sci. Paris 211, 759-762. Cichoii, J., Khararazishvili, A.B. and W?glorz, B. A993), On sets ofVitali's type, Proc. Amer. Math. Soc. 118, 1243-1250. Christensen, J.PR. A972), On sets ofHaar measure zero in abelian Polish groups, Israel J. Math. 13, 255-260. Chuaqui, R.B. A977), Measures invariant under a group of transformations. Pacific J. Math. 68, 313-329. Ciesielski, K. and Pelc, A. A985), Extensions of invariant measures on Euclidean spaces. Fund. Math. 125. 1-10. Ciesielski, K. A989), How good is Lebesgue measure?. Math. Intelligencer 11, 54-58. Ciesielski, K. A990), Algebraically invariant extensions of ?-finite measures on Euclidean spaces. Trans. Amer. Math. Soc. 318, 261-273. Ciesielski, K. A990/91), Query 5, Real Anal. Exchange 16 A), 374. Clemens, J.D., Gao, S. and Kechris, A.S. B000), Polish metric spaces: their classification and isometry groups. Preprint. Dang-Ngoc-Nghiem A973), On the classification of dynamical systems, Ann. Inst. H. Poincare 9 D), 397^425. Ditzen, A. A992). Ergodic measures for Borel group actions. Preprint. Dougherty, R. A994), Examples of non-shy sets. Fund. Math. 144, 73-88. Dye, H.A. A959), On groups of measure preserving transformations 1, Amer. J. Math. 81, 119-159. Dye, H.A. A963), On groups of measure preserving transformations 11, Amer. J. Math. 85, 551-576. Erdos, P. and Mauldin, R.D. A976), The nonexistence of certain invariant measures. Proc. Amer. Math. Soc. 59, 321-322. Farell. R.H. A962), Representation of invariant measures, Illinois J. Math. 6, 447^467.
1128 P. Zakrzewski Feldman, J. and Moore, C.C. A977). Ergodic equivalence relations and von Neumann algebras. I, Trans. Amer. Math. Soc. 234, 289-324. Fomin, S.V. A950), On measures invariant under certain groups of transformations. Izv. Akad. Nauk SSSR 14. 261-274 (in Russian). Fremlin, D.H. A993), Real-valued-measurable cardinals. Set Theory of the Reals, H. Judah ed., Israel Math. Conf. Proa, Vol. 6, 151-304. Fremlin, D.H. B000), Measure Theory. Vol. 3, Internet: ftp essex.ac.uk/pub/measuretheory. Friedman, N.A. A970), Introduction to Ergodic Theorx. Van Nostrand-Reinhold. New York. Gao, S. and Kechris, A.S. B001), On the classification of Polish metric spaces up to isometry. Mem. Amer. Math. Soc., to appear. Greenleaf, F-P. A969), Amenable actions of locally compact groups. J. Funct. Anal. 4, 295-315. Greschonig, G. and Schmidt, K. B000), Ergodic decomposition of quasi-invariant probability measures (Dedicated to the memory of Anzelm Iwanik), Colloq. Math. 84/85 B), 495-514. Haar, A. A933), Der Massbegrif in der Theorie der kontinuierlichen Gruppen, Ann. of Math. 34. 147-169. Hajian, A. and It6, Y. A969), Weakly wandering sets and invariant measures for a group of transformations. J. Math. Mech. 18, 1203-1216. Hajian, A. and Kakutani, S. A964), Weakly wandering sets and invariant measures. Trans. Amer, Math. Soc. 110. 136-151. Halmos, PR. A947), Invariant measures, Ann. of Math. 48, 735-754. Halmos, PR. A956), Lectures on Ergodic Theory. Publ. Math. Soc. Japan. Halmos, PR. A974), Measure Theory. Springer. Berlin. Hewitt, E. and Ross. K.A. A963). Abstract Harmonic Analysis, Vol. I. Grundl. Math. Wiss., B. 115. Springer, Berlin. Hopf, E. A932), Theory of measures and invariant integrals. Trans. Amer. Math. Soc. 34, 373-393. Hulanicki, A. A962), Invariant extensions of the Lebesgue measure. Fund. Math. 51. 111-115. Kakutani, S. and Oxtoby, J.C. A950), Construction of a non-separable extension of the Lebesgue measure space. Ann. of Math. 52 B), 580-590. Kannan. V and Raju, S.R. A980). The nonexistence of invariant universal measures on semigroups, Proc. Amer. Math. Soc. 78, 482^484. Kargapolov, M.I. and Merzljakov, Ju.I. A979), Fundamentals of the Theory of Groups. Graduate Texts in Math.. Springer-Verlag, Berlin. Kawada, ? A944), Uber die Existenz der invarianten lntegrale. Japan. J. Math. 19, 81-95. Kechris, A.S. A994), Lectures on definable group actions and equivalence relations. Preprint. Kechris, A.S. A995), Classical Descriptive Set Theory. Graduate Texts in Math., Vol. 156, Springer-Verlag, Berlin. Khararazishvili, A.B. A975), On certain types of invariant measures. Dokl. Akad. Nauk SSSR 222, 538-540 (in Russian). Khararazishvili, A.B. A978), Some Questions of Set Theory and Measure Theory. Izd. Tbiliss. Gos. Univ., Tbilisi (in Russian). Khararazishvili, A.B. A980), Absolutely nonmeasurable sets in Abelian groups, Soobshch. Akad. Nauk Gruzin. SSR 97, 537-540. Khararazishvili, A.B. A983), Invariant Extensions of Lebesgue Measure. Izd. Tbiliss. Gos. Univ., Tbilisi (in Russian). Khararazishvili, A.B. A991), Vitali's Theorem and its Generalizations. Izd. Tbiliss. Gos. Univ.. Tbilisi (in Russian). Khararazishvili, A.B. A996), Selected Topics of Point Set Theory. Lodz University, Lodz. Khararazishvili, A.B. A998), Transformation Groups and Invariant Measures. Set-Theoretic Aspects, World Scientific, River Edge, NJ. Kifer, Y.I. and Pirogov, S.A. A972), The decomposition of quasi-invariant measures into ergodic measures, Usp. Mat. Nauk 27 E), 239-240. Kodaira, K. and Kakutani, S. A950), A non-separable translation invariant extension of the Lebesgue measure space, Ann. of Math. 52 B), 574-579. Krawczyk, A. and Zakrzewski, P. A991), Extensions of measures invariant under countable groups of transformations, Trans. Amer. Math. Soc. 326, 211-226.
Measures on algebraic-topological structures 1129 Krieger, W. A969a), On non-singular transformations of a measure space, I. Z. Wahrscheinlichkeitsth. verw. Geb, 11, 83-97. Krieger, W, A969b), On non-singular transformations of a measure space, II. Z. Wahrscheinlichkeitsth. verw. Geb. 11,98-119. Laczkovich, M. B002), Paradoxes in measure theory. Handbook of Measure Theory, E. Pap. ed., Elsevier, Amsterdam, 83-123. Loomis, L.L. A945), Abstract congruence and the uniqueness ofHaar measure, Ann. of Math. 46 B), 348-355. Los, J. and Marczewski, E. A949). Extensions of measure. Fund. Math. 36, 267-276. Mackey, G.W. A957), Borel structures in groups and their duals. Trans. Amer. Soc. 85, 134-165. Mibu, Y. A958), On measures invariant under given homeomorphism group of a uniform space. Math. Soc. Japan. 10, 405^429. Mycielski, J. A974), Remarks on invariant measures in metric spaces, Colloq. Math. 32, 105-112. Mycielski, J. A977), A conjecture of Ulam on the invariance of measure in Hubert's cube, Studia Math. 60, 1-10. Nachbin, L. A965), The Haar Integral, Van Nostrand, Princeton, NJ. Nadkarni, M.G. A990), On the existence of a finite invariant measure, Proc. Indian Acad. Sci. (Math. Sci.) 100, 203-220. Nowik, A. A996/97), On a measure which measures at least one selector for every uncountable subgroup. Real Anal. Exchange 22 B), 814-817. Omstein, D.S. A960), On invariant measures. Bull. Amer. Math. Soc. 66, 297-300. Oxtoby, J.C. A946), Invariant measures on groups which are not locally compact. Trans. Amer. Math. Soc. 60, 215-237. Parthasarathy, K.R. A977), Introduction to Probability and Measure, Macmillan India, Madras. Paterson, A.L.T A986), Nonamenability and Borel paradoxical decompositions for locally compact groups, Proc. Amer. Math. Soc. 96, 89-90. Pelc, A. A986), Invariant measures and ideals on discrete groups, Dissertationaes Math., Vol. 255, PWN, Warszawa. Pelc, A. A982), Semiregular invariant measures on Abelian groups, Proc. Amer. Math. Soc. 86, 423^426. Penconek, M. and Zakrzewski, P. A994), The existence of nonmeasurable sets for invariant measures, Proc. Amer. Math. Soc. 121, 579-584. Phakadze, S.S. A958), The theory of Lebesgue measure, Trudy Tbiliss. Mat. Inst. 25 (in Russian). Plante, J.E. A983), Solvable groups acting on the line. Trans Amer. Math. Soc. 278. 401^414. Poncet, J. A954), Une classes d'espaces homogenespossedant line mesure invariante, C. R. Acad. Sci. Paris 238, 553-554. Raikov, D.A. A942), A new proof of uniqueness ofHaar's measure, Dokl, Akad. Nauk SSSR 34, 211-212. Ramachandran, D. A989), ? new proof of Hopf's theorem on invariant measures. Contemp. Math. 94, 263-271. Ramachandran, D. and Misiurewicz, M. A982), Hopf's theorem on invariant measures for a group of transformations, Studia Math. 74, 183-189. Roberts, J.W. A974/75), Invariant measures in compact Hausdorff spaces. Indiana Univ, Math. J. 24, 691-718. Rosenblatt, J.M. A974), Equivalent invariant measures. Israel J. Math. 17. 261-270. Rosenblatt, J.M.( 1980), Strongly equivalent invariant measures. Math. Proc. Cambridge Philos. Soc. 88, 33^43. Ryll-Nardzewski, С and Telgarsky, R. A978), The nonexistence of universal invariant measures, Proc. Amer. Math. Soc. 69, 240-242. Sakai, K. A978), Remarks on the existence of finite invariant measures for groups of measurable transformations, Proc. Japan Acad. Ser. A Math. Sci. 54, 285-287. Schmidt, K. A977), Lectures on Cocycles of Ergodic Transformation Groups, Lecture Notes in Math., Vol. 1, MacMillan India, New Delhi. Segal, I.E. A949), Invariant measures on locally compact spaces, J. Indian Math. Soc. 13, 105-130. Segal, I.E. and Kunze, R.A. A978), Integrals and Operators, Springer-Verlag, Berlin. Silver, J. A980), Counting the number of equivalence classes of Borel and coanalytic equivalence relations, Ann. Math. Logic 18, 1-28. Solecki, S. A993a), On sets nonmeasurable with respect to invariant measures, Proc. Amer. Math. Soc. 119, 115-124. Solecki, S. A993b), Measurabilitv properties of sets ofVitali's type, Proc. Amer. Math. Soc. 119, 897-902.
изо P. Zakrzewski Solecki, S. A996), On Haar null sets. Fund. Math. 149 C), 205-210. Solecki, S. B001), Translation invariant ideals, Israel J. Math., to appear. Steinhaus, H. A920), Sur les distances des points des ensembles de mesure positive. Fund. Math. 1, 93-104. Steinlage, R.C. A975), On Haar measure in locally compact 7i spaces, Amer. J. Math. 97, 291-307. Szpilrajn, E. A935), Sur /'extension de la mesure lebesguienne. Fund. Math. 25, 551-558. Tarski, A. A938), Algebraische Fassungdes Massproblems, Fund. Math. 31, 47-66. Tops0e, F. and Hoffmann-J0rgensen, J. A980), Analytic sets and their applications. Analytic Sets, Academic Press, San Diego, pp. 317^401. van Dantzig, J. and van der Waerden, B.L. A928), Uber metrisch homogene Raume, Abh. Math. Sem. Univ. Hamburg 6, 374-376. Varadarajan, VS. A963), Groups of automorphisms of Borel spaces. Trans. Amer. Math. Soc. 109, 191-220. von Neumann, J. A936), The uniqueness ofHaar's measure, Rec. Math. [Mat. Sb.] 1 D3). 721-734. von Neumann, J. A940), On rings of operators III, Ann. of Math. 41.94-161. von Neumann, J. A998), Continuous Geometry. Princeton Landmarks Math., Princeton Paperbacks, Princeton University Press, Princeton, NJ. Vitali, G. A905), Sul problema delta mesura dei gruppi di punti di una retta. Bologna. Vladimirov, D.A. A965), Invariant measures on Boolean algebras. Mat. Sb. (N.S.) 67 A09), 440-460 (in Russian). Wagon, S. A986), The Banach-Tarski Paradox, Cambridge Univ. Press, Cambridge, UK. Wang, S.P. A976), On density properties of certain subgroups of locally compact groups, Duke Math. J. 43, 561-578. Weil, A. A940), [.'integration dans les groupes topologicues et ses applications, Actualites Sci. Indust., Publ. Math. Inst. Strasbourg, Hermann, Paris. Wu, T.S. and Yand, J.S. A986), Homogeneous space with finite invariant measure, Tamkang J. Math. 17, 153- 159. Zakrzewski, P. A987), The existence of universal invariant semiregular measures on groups, Proc. Amer. Math. Soc. 99, 507-508. Zakrzewski, P. A988), On universal semiregular invariant measures, J. Symb. Logic 53, 1170-1176. Zakrzewski, P. A989), The existence of universal invariant measures on large sets. Fund. Math. 133, 113-124. Zakrzewski, P. A990), Extensions of isometrically invariant measures on Euclidean spaces, Proc. Amer. Math. Soc. 110,325-331. Zakrzewski, P. A991a), Paradoxical decompositions and invariant measures, Proc. Amer. Math. Soc. Ill, 533- 539. Zakrzewski, P. A991b), When do equidecomposable sets have equal measures?, Proc. Amer. Math. Soc. 113, 831-837. Zakrzewski, P. A993a), The existence of invariant probability measures for a group of transformations, Israel J. Math. 83, 343-352. Zakrzewski, P. A993b), The existence of invariant ?-finite measures for a group of transformations, Israel J. Math. 83, 275-287. Zakrzewski, P. A994), When do sets admit congruent partitions. Quart. J. Math. Oxford 45 B), 255-265. Zakrzewski, P. A995/96), Extending isometrically invariant measures on R" -A solution to Ciesielski's query. Real Anal. Exchange 21 B), 582-589. Zakrzewski, P. A996), Extending invariant measures on topological groups. The Proceedings of the Tenth Summer Conference on Topology and Applications, Ann. New York Acad. Sci., Vol. 788, pp. 218-222. Zakrzewski, P. A997), The uniqueness of Haar measure and set theory, CoWoq. Math. 74, 109-121.
CHAPTER 28 Liftings Werner Strauss Mathematisches Institut A, Universitat Stuttgart, Postfach 80 11 40. D-70511 Stuttgart, Germany- E-mail: strauss@mathematik.uni-stuttgart.de Nikolaos D. Macheras Department of Statistics. University of Piraeus, 80 Karaoli and Dimitriou str, 185 34 Piraeus, Greece E-mail: macheras@unipi.gr Kazimierz Musial Institute of Mathematics, Wroclaw University, PI. Gruiidwaldzki 2/4, 50-384 Wroclaw, Poland E-mail: musial@math.uni. wroc.pl Contents Introduction 1133 1. Terminology 1133 2. Existence of liftings and densities 1135 3. Liftings for functions 1140 4. Liftings on topological spaces 1147 5. Liftings on topological groups 1154 6. Permanence of liftings 1157 6.1. Liftings in products 1157 6.2. Strong liftings in products 1161 6.3. Liftings on projective limits 1163 6.4. Various Fubini products 1166 6.5. Applications of Fubini products to stochastic processes 1168 7. Liftings for abstract valued functions 0 8. Liftings and densities with respect to ideals of sets 1173 9. Beyond ?°°(,u) 1174 10. Further applications 1175 References 1178 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1131
This page intentionally left blank
Liftings 1133 Introduction The classical monograph of A. and С Ionescu Tulcea A969a) provides a systematic exposition of almost all results about liftings known at that time, in particular the basic existence theorems of J. von Neumann and D. Maharam were given with a new and more direct proof. The significance and power of the existence of liftings was illustrated there by important applications to mathematical analysis, e.g., to the point realization for automorphisms of spaces of measurable functions, to disintegration of measures, representation of integral operators which is equivalent to the differentiation of vector valued measures, and to the separable modification of stochastic processes. By a proper use of liftings they succeeded to put many classical results in their final form. Practically at the same time Kolzow A968) proved the equivalence of the existence of liftings, Vitali differentiation, and Dunford Pettis theorem for locally convex spaces. Nevertheless many interesting problems were left and formed the starting point for further developments in this field, such as the existence proof for densities on arbitrary finite measure spaces of Graf and von Weizsacker A975), the negative solution of the so-called 'strong lifting problem' by Losert A979), the discussion of the existence of (strong) Borel liftings by Mokobodzki A975), Fremlin A977) and that of the non-existence of translation invariant Borel liftings for Haar measures by Johnson A980), Talagrand A982), Kupka and Prikry A983), Losert A983), and Burke A993), the notion of lifting compactness studied by Bellow A980) as well as Edgar and Talagrand A980), existence results for strong Baire liftings by Grekas and Gryllakis A991), the application of forcing methods for the non-existence of certain types of liftings (an example which demonstrates that lifting theory provides challenging problems for other areas of mathematics) by Shelah A983), Burke and Just A991) and Burke and Shelah A992), and the discussion of permanence results mainly in products of probability spaces starting with a paper of Talagrand A989) and subsequently developed by Burke A995), Fremlin B00?), and the authors to mention probably the most important ones. Because of the large number of contributions, in this article we can only give an overview of these developments with short indications of methods and proofs. Concerning the abounding number of applications we had to restrict ourselves drastically to either the most spectacular ones or to the most recent ones, giving only references for all the others. As far as we could single out, we have tried to incorporate any paper dealing with liftings in the list of references at the end of this article. Sometimes we found it difficult to give full credit to authors, since many results in that field are circulated unpublished, but on the other hand they have become folklore. 1. Terminology For a measure space {?, ?, ?) we denote by ? /? its measure algebra (a Boolean algebra under its canonical Boolean operations), and r : ? -> ?/? is the canonical map (a Boolean homomorphism). We assume throughout that ? is nontrivial, i.e., ?{?) > 0. _ К stands for one of the fields R of real numbers or С of complex numbers, R for the extended real line, and N = {1, 2, 3,...} stands for the set of natural numbers.
1134 W. Strauss et a]. ?^(?) is the space of all K-valued ? -measurable maps on ?. ?^ (?) is a K-algebra underpointwise addition and multiplication together with multiplication by scalars from K. ?°(?) := ??(?) and ?^(?), the space of all R-valued ? -measurable functions are lattices under pointwise order. The subspace ?^(?) of ?^(?) consisting of all strictly bounded / e ?^(?) (i.e., Il/ll := supwei2 |/(?)| < oo), is a Banach algebra and ?3€(?) := ?f (?) is a Banach lattice. A set N e ? with ?(?) = 0 is called a ?-null set and the ?-ideal of all ?-null sets is denoted by ?? and ?/ is the ideal of all sets A e ? with ? (A) < oo. For А, В е Г we write A = ? a.e. (?), or only ? ? ? for short if there arise no doubts about the measure, if ???, the symmetric difference of A and B, is a ?-null set, and we write f = g a.e. (?), or / = g for short, if {/ ? g} e 2?0 for f,ge ?^(?) U ?|(?)· The equivalence class of all functions in ?^(?) U ?^(?) or of all sets in ?, that are ?-a.e. equal to / or to A will be denoted by /* or by A*, respectively. Equivalent functions or sets will never be identified. The (Caratheodory) completion of (?,?,?) is written (?,?,?). The ?-algebra generated by a family ? of subsets of ? is denoted by ?(?). A measure space is locally determined if ? = {A<^ ?: ?? ? & ? for A e ??) and (?, ?, ?) (or just ?) is semi-finite, i.e., ?(?) = sup^(fi): A э В e ? f) for any A e ?. A measure space (?, ?, ?) is localizable if it is semi-finite and, for any S c. ? there exists an ? e ? such that (i) ? \ Я e ?0 for any ? e ?, (ii) if G e ? and E\G e ?0 for every ? e ? then ? \G e ??. It will be convenient to call such a set Я an essential supremum of ? in ? (see Fremlin A980, A6B)). If (?, T, v) is a measure space and /: ? -> (9 is a measurable function such that ?(?) = ?(/~'(?)) for all ? e ? then ? is called the range of ? via / and we write ? = /(?). A pretopological measure space is a quadruple (?,?, ?, ?) such that (?, ?) is a topological space and (?, ?, ?) is a measure space. This notion is introduced only in order to avoid repetitions of the form (?, ?, ?, ?) such that (?, ?) is a topological space and (?, ?, ?) is a measure space. A topological measure space is a quadruple (?,?,?,?) such that (?,?) is a Hausdorff topological space and (?,?,?) is a measure space with ? с ?. Let (?, ?, ?1, ?) be a topological measure space. The measure ? is ?-additive if for any increasing family (G,)ie/ of open subsets of i? we have G, ) =5???@. For any Hausdorff topological space (?, ?) we denote by ?(?) its Borel ?-algebra, i.e., the ?-algebra generated by T. The Baire ?-algebra of (?, ?), i.e., the ?-algebra generated by the system of all cozero subsets of ?, is written ??(?). We call (?, ?, ?, ?) a Baire or Borel measure space, respectively if (?, ?, ?) is the completion of a finite measure space (?, ?, v), where ? = ??(?) or A = ?(?), respectively. Let (?, ?) beaHausdorff topological space. A measure on ??(?) is called completion regular, if for any В е ?(?) there exist A\, Аг &??(?) such that A\ с ? с A3 and ?(?3 \ ??) =0. A measure ? on
Liftings 1135 ?(?) is completion regular if its restriction to ??{?) is completion regular. A topological measure space (?, ?, ?, ?) is called a category measure space if ? is the system of all sets with the Baire property with respect to ? and ?? is equal to the system of all sets of the 1st category in ?. We use the notion of a quasi-Radon measure space in the sense of FremlinA974,72A). For an arbitrary probability space (?, ?, ?) we call (R, ?, ??, v) its associated hyper- stonian space if R is the Stone space of the measure algebra of (?, ?, ?), ? the topology generated by [s(a): a e ?/?], where s(a) с R is the corresponding closed-open set of ? according to the Stone duality, ?? denotes the ?-algebra of all subsets of R with the Baire property (namely those sets А с R such that ???/ is a first category set for some open subset U of R), and ? = ??? :TZ-> R where ?:??-* ?/? is the canonical epimorphism and ? : ? /? -> R is unambiguously defined by ?(?) := ?(?) if a = A' for АеГ. Throughout we assume the validity of the Axiom of Choice. 2. Existence of liftings and densities A lifting for a given measure space (?, ?, ?) is a Boolean homomorphism ?: ? —> ? with the additional properties P(A) = A (LI) and p(A) = p(B) if ? ? ?, (L2) i.e., besides (LI) and (L2) ? satisfies more explicitly the equations p@) = 0 and ?(?) = ?, (Bl) р(ЛПЯ)=р(Л)Пр(Я), (В2) and /)(AUS) = p(A)Up(B), (B3) if A, Be ?. It follows p(Ac) = [p(A)]c forAeT. (B4) Conversely (B2) and (B4) imply (B3). We denote by ?(?) the space of all liftings for (?, ?,?). There is another way of looking at liftings. For ? e ? (?) we can define unambiguously a Boolean homomorphism ?': ?/? —> ? by means of ?·(?'):=?(?) ifAeT
1136 W. Strauss et al. with the property r ? ?* = idr/?, the identical map of ?/?. For this reason it is perhaps more appropriate to call p* lifting (J. von Neumann's original definition), since this reveals its algebraic character more precisely. But in applications the first given definition seems to be more in common use. It is also clear that the measure ? enters only through the ?- ideal ?? of its null sets into the lifting. Therefore measures on ? producing the same null set ideal produce the same liftings, i.e., liftings depend in fact on the triple (?, ?, ??), where ?? is an ideal in ? and there are generalizations of the notion of lifting along these lines already in the paper of von Neumann and Stone A935). In 1931 A. Haar raised the problem of the existence of a lifting for the Lebesgue measure space on Rd, d e N. J. von Neumann gave a positive solution based on the classical Lebesgue density. Clearly the same problem can be raised for arbitrary measure spaces and there too, as we will see below, densities provide a useful step in the construction of liftings. A lower density for the measure space (?, ?, ?) is a map <5 : ? -> ? with the properties (LI), (L2), (Bl), and (B2) and for the notion of upper density we have to replace (B2) by (B3) therein. If we define ?1'(A) := (8(AC))C for arbitrary maps ?: ? -> ? then the operation ? -> <5C' is a bijection from the space ?{?) of all lower densities onto the space ? (?) of all upper densities obeying the law (<$')' = <5 for all ? e ?(?) U ? (?), where <$'' = ? if and only if ? e ? (?). For this reason we consider only lower densities and call them "densities" for short. The best known example of a (lower) density (being no lifting) is the Lebesgue density D, defined by means of DjA) :=LeR'; Urn ^n^U))! [ s\0 ?(?8(?)) J for Lebesgue measurable sets А с R'', ? the Lebesgue measure on R'', and B$(x) the ball of center ? and radius ? > 0. Lebesgue's celebrated density theorem is just (LI) while the other axioms of a lower density follow more or less by technicalities (see, e.g., Oxtoby A971, Theorems 3.20 and 3.21)). Starting from the Lebesgue density J. von Neumann A931) constructed liftings for the Lebesgue measure on R1' by a process which has been generalized (see Graf and von Weizsacker A976), Traynor A974)) to arbitrary densities on measure spaces (?, ?,?) and, at the same time, was made more transparent in the following way. For ? e ?{?) and ? & ? define a filterbase ?(?):={???: ? e ?(?)} and apply the axiom of choice to find an ultrafilter ??(?) finer than ?{?). Then put ? (?) := {???: A eU(w)} if A e ?. It follows <$(A) с р(А) с <$' (A) for A e ? and this implies for complete measure spaces (?, ?, ?) that p(A) e ? and ? satisfies (LI), while all other properties of a lifting are immediate by construction. THEOREM 2.1. If the measure space (?, ?, ?) is complete, then for any ? e ?(?) there exists ape ?(?) such that ? (А) с р(А) с ?' (A) for all A e ?.
Liftings 1137 By an application of Theorem 2.1 to the Lebesgue density the first existence result of von Neumann A931) is now immediate. COROLLARY 2.2. There exists a lifting for the Lebesgue measure space on Ш , d e N. The construction above leading to Theorem 2.1 makes it obvious that the liftings of the corollary rely on a more or less arbitrary choice of an ultrafilter in a non-constructive way with the consequence that any trace of uniqueness or "naturalness" is hopelessly lost. Though starting from the, in a certain sense "natural" Lebesgue density, we cannot single out some sort of "canonical" lifting. Taking any lifting ? for the Lebesgue measure on R obtained by this process, we cannot without further information answer such a simple question as, e.g., 0 e p(]—oo, 0]) or 0 e p(]0, oo[). On the other hand it can be easily seen that the axiom of choice is necessary for producing a lifting for the Lebesgue measure space (see, e.g., Burke A993)). For general measure spaces not even a "natural" density is at hand as it was for the Lebesgue measure space. But for finite (even incomplete) measure spaces (?, ?,?) with ?(?) > 0 a density can be constructed by transfinite induction using the following two extension lemmata for densities. Clearly it is sufficient to consider probability spaces for simplicity. LEMMA 2.3. Let (?, ?, ?) be a probability space, ? a ?-subalgebra of ? with ?? с ?, A e ?, and denote by ? the ?-subalgebra of ? generated by ? U {A}. Then for every ? e ?(? | ?) there exists ? ? & ?(? \ ?) extending ?. For ? e ?(? \ ?) may be chosen ?<??(? \?). First note ? = {(D П A) U (E П A'): D, ? e ?] and choose elements B.C e /? such that B=essinf{De/j: ACZ)} and С = essinf{D e ?: А' с D}. Then г((?>ПА)и(?ПАг)) := [А П <5((Z) П S) U (? ? Bc))] U [Ac ? ?((? ? С) U (D ? С'))], if D, ? ? ? is a solution given by Graf and von Weizsacker A976). LEMMA 2.4. Let (?, ?,?) be a probability space, (??)„?? an increasing sequence of ?-subalgebras of ? with ?? с ?\, and let ?? be the ?-subalgebra of ? generated by U,,eN *?и· Рог each n e N let be given ?„ ??(?\??) with <5„ | ?? = <5„, for all m < n. Then exists ? ?? e ?(? \ ??) satisfying ?? | ?„ = ?„ for all n e N. If ?,,(/) denotes a version of the conditional expectation of / e ?°°(?) with respect to the ? -subalgebra ? of ? we may define (following Graf and von Weizsacker A976)) ОС ОС ОС Soo(A) = ? U ? *т(\ЕчАхл) > 1 - 1/*}) if A e ??. к=\п~\ т=п
1138 W. Strauss et al. From Doob's martingale convergence theorem (see, e.g., Fremlin A989, Chapter 22, 1.6)) follows <$oo(A) = A a.e. (?). A measure space (?, ?, ?) is called strictly localizable if there exists a family (A,-),-e/ in ? of pairwise disjoint sets A,- e ? with ? (A,) < oo for all /' e / such that {^??=?,? = {????: ? ? A,- e ? for all / e/}, /e/ and ?(?) =^д(АП A,) for A e ?. /e/ Such a family (A,-),-e/ is called a decomposition oi ?. We will derive existence theorems from the following strictly more general extension theorem. THEOREM 2.5. Let be given a strictly localizable measure space (?, ?, ?), a ?-subalgebra ? of ? with ?? с ?, and a density ? e ?(? \ ?). Then there exists a density 5 e ?(?) with ? | ? = ?. This and the next theorem are in fact theorems about probability spaces since the generalization to strictly localizable spaces is obvious and purely technical. We give a short indication of the proof for a probability space (?, ?,?) for later reference. Anyproofuses induction in one form or another. We can apply induction taking the following steps. (A) Choose the smallest cardinal d with the property that there exists a collection ? с ? of cardinality d such that the ? -algebra generated by ? U ? is dense in ? for the pseudometric generated by ?. Let ? = {Ma)a<K be indexed by ordinals less than к, where к is the first ordinal of cardinality d. For a ^ к denote by ?? the ?-subalgebra of ? generated by ? U {Mp: ? < a], where we may assume Ma ? ?? for a < к. Inductively we can construct a family (&а)а^к of densities Sa e ?(? | ??) with ?? | ?? = Sa for a ^ ? < ?. (B) The induction starts with So = ?. (C) The step from a to a + 1 is covered by Lemma 2.3. (D) For a limit ordinal ? ? ? of an uncountable cofinality put Sa = U/3<a &?· (?) For a limit ordinal ? ?. ? of countable cofinality apply Lemma 2.4. (F) Finally put <$:= SK. Densities constructed in this way will become important later on in Section 6. We therefore turn their proof in a definition and call any density ? for a probability space {?, ?,?) admissible if it is constructed inductively taking the steps (A) to (F) above starting from the ?-subalgebra ? =?(?0); ??{?) denotes the family of all admissible densities for ?. Clearly ??(?) ? 0. The last theorem and Theorem 2.1 imply the next result. THEOREM 2.6. Let be given a strictly localizable complete measure space (?, ?, ?), a ?-subalgebra ? of ? with ?? с ?, and a lifting ? e ?(? | ?). Then there exists a lifting ~p e ?(?) with ~?\? = p.
Liftings 1139 We call any ? e ?(?) admissibly generated if there exists a density S e ??{?) such that <5(A) с p(A) for all A e ?; ???(?) is the (for complete probability spaces clearly non-empty) class of all admissibly generated liftings for ?. If in the last two theorems ? is the ?-subalgebraof ? generated by ?? with the obvious lifting on it, we get the following existence theorems of Graf and von Weizsacker A976) for densities and of von Neumann A931) and Maharam A958) for liftings, respectively. DENSITY THEOREM. For every nontrivial strictly localizable measure space there exists a density. LIFTING THEOREM. For every nontrivial strictly localizable complete measure space there exists a lifting. Radon measure spaces are strictly localizable since they have a concassage (see, e.g., Schwartz A973)) and hence the lifting theorem applies. COROLLARY. Each nontrivial Radon measure space has a lifting. Around 1942 J. von Neumann gave an oral proof to S. Kakutani and D. Maharam for the lifting theorem, but "the proof was unfortunately forgotten beyond hope of reconstruction" according to Maharam A958). In 1958 D. Maharam gave a different proof based on her structure theorem for measure algebras, reducing the general case to the product space {0, 1}?, where к is any infinite ordinal, hence only needing a special instance of the martingale convergence theorem. The proof indicated above using general martingale convergence theorem in connection with induction was given by A. and C. Ionescu Tulcea A961) and is today, in one form or another, standard in literature. It is an open problem whether (even for probability spaces) the lifting theorem remains true without the assumption of completeness. The most interesting case is when measure is defined on the ?-algebra of Borel sets. A lifting ? e ?(?) is called a Borel lifting for a topological measure space (?, ?, ?,?) if ? (A) e ?(?) for all A e ?, and a similar definition applies for Baire liftings. According to Shelah A983) it is consistent with ZFC that there exists no Borel lifting for Lebesgue measure on [0, 1] . On the other hand already von Neumann and Stone A935) proved the existence of a lifting for the Borel measure space on [0. 1] under the assumption of the continuum hypothesis. This has been later generalized to the following result of Mokobodzki A975) andFremlin A977). THEOREM 2.7 (CH). Subject to the continuum hypothesis any ?-finite measure space with a measure algebra of cardinality less or equal to ?? has a lifting. The assumption of the strict localizability implies that the basic measure space is locally determined. Within the class of all complete locally determined (c.l.d. for short) measure spaces, the strict localizability is in fact a necessary condition for the existence of a lifting. To see it we need only to consider for a measure space weaker types of decompositions, so called decompositions (ND) and (D), respectively, i.e., families (A, )ie/ in ? f such that ? (A) = ?,??? M(A n Ai) for a11 A e ?/ as well as ?(?, ? Ay) = 0 and ?, ? A, = 0,
1140 W. Strauss et al. respectively for /' ? j in / (see Ellis and Snow A963)). Due to an application of the axiom of choice decompositions (ND) exist for any measure space and a lifting converts a decomposition (ND) into a decomposition (D). If (A, ),e/ is such a decomposition (D) it follows supi6/ A* = ?' and by Fremlin A978, Theorem 2), the measure space (?, ?, ?) is c.l.d. It is obvious that this argument remains true for a much weaker type of "lifting", the so-called orthogonal lifting, i.e., for a map ? from ? into itself with the properties (LI), (L2), as well as (?) ?(?) ? ?(?) = 0 for all А, В e ? with ? ? ? = 0. Another such weaker notion has been considered by Kolzow A968), the monotonous lifting, i.e., a map ? from ? into itself with the properties (LI), (L2), and (?) ?(?) с ?(?) if А с ? for ?, ? e ?. Any density is a monotonous lifting, and from the existence of a monotonous lifting ? follows that of an orthogonal lifting ? by taking ?(?) := ?(?) П ?'(?) if A & ?, see Gapaillard A973) and Strauss A971). Bichteler A972) considers pre-densities, i.e., maps ? from ? into itself with the properties (LI), (L2), (Bl), and ?(??)?· · ·??(?*) =0 if ? ? П ¦ · ¦ П А^ = 0 for ? ?,..., Ak e ? and shows that the existence of a pre-density implies the existence of a density in complete measure spaces. For this reason we get the somewhat surprising result that within the class of all c.l.d. measure spaces the existence of a lifting is equivalent to the existence of considerably weaker types of set functions. THEOREM 2.8. For a c.l.d. measure space (?, ?, ?) the following conditions are equivalent. (i) There exists a lifting for (?, ?, ?). (ii) There exists a density for (?, ?, ?). (iii) There exists a pre-density for (?, ?, ?). (iv) There exists a monotonous lifting for (?, ?, ?). (?) There exists an orthogonal lifting for (?, ?, ?). (vi) The measure space (?, ?, ?) is strictly localizable. Any measure space has a c.l.d. version (see, e.g., Fremlin A978)) with the same measure algebra if the measure space is localizable. At this point we should mention that localizable c.l.d. measure spaces are precisely spaces with the Radon-Nikodym property, equivalently the Riesz property (see Segal A951)) and that by Fremlin A978) there exist c.l.d. localizable, non strictly localizable measure spaces, i.e., within the class of all c.l.d. measure spaces the class of measure spaces with lifting is strictly smaller than the class of all measure spaces with the Radon-Nikodym property. According to Halmos A950, Section 31 (9), p. 131) there exist measure spaces without decomposition (D). For such spaces even an orthogonal lifting cannot exist. 3. Liftings for functions Given a measure space (?, ?,?) a (function-)lifting for C^(?) is a K-algebra homo- morphism ?: ?^(?) -> ?^(?) (i.e., a K-linear, multiplicative map) with the additional properties v(f) = f, 00 <P(f) = <P(g) if/sgfor/,ge??0i). A2)
Liftings 1141 and <p(l) = l. (n) We denote by Л^(д) the space of all liftings for ?^(?). Any lifting ? for ?3€(?) is a lattice homomorphism, i.e., <p(\f\) = \<p(f)\ and ?(/±) = ?(/)± for / e ?3€(?) (see A. and С Ionescu Tulcea A969a)). The original question of A. Haar was about the existence of a lifting ? for ?^(?), ? the Lebesgue measure on R'1, d e N satisfying <p(f) = <p(f) for / e ?j^(?). This problem can be easily reduced to the existence of liftings for sets in the following way. Given ? e Л (?) put for any simple function / = ?"=? aiXAi with ?,??,?,??(/ = 1 ?), И i=\ This defines unambiguously a K-algebra endomorphism of the algebra ??.(?) of all K-simple functions over ? of norm < 1 having a unique extension to an algebra endomorphism of ?^ (?) with px(f) = p~*-(f), since ??(?) is dense in ?^(?) by the Lebesgue ladder theorem. Conversely for any ? e ?^(?) with <p(f) = <p(f) for / e ?jj? (?) we can define ape ?(?) by means of p(A) := {?(??) = 1} for A e 27. PROPOSITION 3.1. 77?e map ? e Л (?) -> ?00 e ???(?) « ? bijection which is completely determined by the equation ?°°(??) = xpiA)for A e 27. A different proof of Proposition 3.1 was given by von Neumann A931) in his first paper on liftings based on the formula p°(/)("):=inf{reQ: wep([f<r))} ifp-.?^ ?, /??^(?). and ? e ?. Then p° and p°° can be identified on ?3€(?) since ?°\?0€(?) e ?°°(?) and р°(хл) = Xp(A)- As a corollary we get the next result. THEOREM 3.2. For any c.l.d. measure space (?, ?, ?) the following conditions are all equivalent. (i) The measure space (?, ?, ?) is strictly localizable. (ii) The measure space (?, 27, ?) /гая ? lifting. (iii) 77геге ешм ? /////ng <р for ?? (?) iMC^ ^?? <?(/) = <P(/)/or ?" / е ^к (м)- J. von Neumann noticed already in his first paper from 1931 that for the Lebesgue measure on Ш the last theorem no longer holds true if the algebra ?^ (?) is replaced by the algebra ? (?).
1142 W. Strauss et al. As for liftings of sets there exist weakenings of the notion of lifting for functions which arise naturally in de Possel derivation as well as in Fourier analysis in case of one dimensional Lebesgue measure. Such a type is the linear lifting ? for ?°°(?), i.e., a map ? : ?°°(?) -> ?°°(?) which is a positive linear map with the additional properties A1), A2) and (n) (hence ?(?) = a for all a e Ш). Clearly ?°°(?) с (?(?), where we write G^) for the class of all linear liftings and indeed ??(?) := ?^(?). Note that a linear lifting is already a lifting if it is a lattice homomorphism of ?3€(?) into itself. In fact for / := хд - XA<, if A e ? follows ?(/+) л ?(/~) = 0. Since ?(/+) = 1 - ?(/~), the inequality 0 < ?(/~) < 1 implies 0 < ?(/+) < 1, a contradiction. Hence ?(/) e {0,1}?. By a classical result of Lebesgue A910) we have f(x)= lim —i- for a.a. ? e Rd k^X ?(??(?)) for any sequence 0 < r^ -> 0 (к -> oo), and any Lebesgue integrable /, if ? is the Lebesgue measure on Rd, d e N. For r^ := 1 /ifc, ? e Rrf and / e ?°°(?) writing Uk(f,x) = —' ?(?„(*)) we get |«*(/,Jt)| < II/IIoo for all к e N and all ? eW1. Therefore, if we choose a free ultrafilter U on N, then there exists \jj{(f){x):= Mm Uk{f,x) forall/e?эc(?)andл: eRd keli and ?\ e 0(?), since by Lebesgue's result ?/?(/) = / a.e. (?), hence ?? e ?°°(?) by completeness of ? while all the other properties of a linear lifting follow for ?\ by definition. This result can be generalized to more general measure spaces with suitable derivation bases. For d = 1, ? the Lebesgue measure on ]—?,?] similar results are obtained by Cesaro and Abel summability for the Fourier series of a Lebesgue integrable function / on ]—?, ?]. In fact, if 1 Г -? is the nth Cesaro-mean, 1 /sin- K„(x)=-l
Liftings 1143 the Fejer kernel, and fr@):=h f f^P,@-t)dp(t), 0<r<l, -?<?<:? is the harmonic of /, where ?,·(?) := A — r2)/(l — 2rcos# + r2) denotes the Poisson kernel of /, then lim,,^^ ?„(?) = f(x) by a theorem of Lebesgue A905) (see Zygmund A968, 3.9, p. 90)) and limr^i fr(x) = f(x) for a.a. ? е]-тг,?] by Fatou A906) (see Hoffmann A965)). If /„ := /,_1/(I then again \?„(?)\, \f,,(x)\ ^ ll/lloc for ? e N, ? e]-?r, ?] for all / e ?°°(?) and if the ultrafilter ?/ is chosen as above, then by means of i/2(f)(x):=liman(x), fo(f)(x) :=\im f„(x), хе]-л,л], neU neU can be defined 1Д2, тДз е ?(?). There are two procedures (both due to A. and С lonescu Tulcea A969a)) for converting a linear lifting ? e (?(?) into a lifting in ?00(?) provided the basic measure space is complete. First note that by means of ±(?):={?(??)=?}, ^(?):={??(??)>0} for A e ? we can define ? e #(?), -^ e У (?) with ~? = (?I' (see A969a, p. 36)), so a solution is given by choosing ape ?(?) with ?(?) с /э(А) с i/r(A) for A e Г, according to Theorem 2.1. Then apply Proposition 3.1 of this section. But according to A. and С lonescu Tulcea A969a, Chapter III, Section 2, Theorem 1), there is no need for resorting to liftings for sets due to the following result paralleling Theorem 2.1. THEOREM 3.3. Let be given a complete measure space (?, ?,?). For any ? e <?(м) the set ?? := {и e <7(?): ??(?) ? ?(??) ? ??{?), A e ?} is a non-empty, convex and compact subset of the locally convex space ?.^(?)??. An element ? e 0? is extremal in ?}? if and only if ? is a lifting for ?°°(?). The existence of an extremal element in ??? follows now from the Krein-Milman theorem. In general measure spaces there are no "natural" linear liftings at hand, a situation very similar to that for densities. But again there is an inductive construction which parallels that one for densities in Section 2. LEMMA 3.4. If (?, ?,?), ?, A and ? are as in Lemma 23 of Section 2 then for each ? e <?(? | ?) there exists ??<? <?(? \?) extending ?. For ? e ?0€(? | ?) may be chosen ?<???(? \?).
1144 W. Strauss et al. First note that ?°°(? | ?) = {fXA + gXA<: fge ?^(? | ?)}, and if В, С е ? are defined as in Lemma 2.3 of Section 2 then put iA(/Xa +gXA') :=iA(/xb+^XB')Xa + VK/xo +gXc)XA· for f,ge ?°°(? | ?), a formula completely analogous to the corresponding formula for densities in Lemma 2.3 of Section 2 (see Graf and von Weizsacker A976, p. 156)). LEMMA 3.5. If (?, ?,?) is a complete probability space, (?„),??? and ? эс are as m Lemma 2.4 of Section 2 then for all ?,, e §{? \ ?„) with ?,, \ ?„, = ?,,, ifm ^ n e N there exists ? тДоо e ?(? | ??) satisfying ?? \ ?„ = ?„ for и е N. As in the above examples choose again a free ultrafilter WonN and put ????(?):=?™??(?„„(/))(?) for / e ??(? ? ??)< ? e ?. (Here limne^ could be replaced by any Banach limit, see Dunford and Schwartz A958, Chapter II, 4.22-23).) The existence of the limit is guaranteed by |?«(?,,,(/))(?)| ^ ||/|| oo for и e N and ? e ?. Now Doob's martingale convergence theorem implies VOo(/) = / a.e. (? | ??). Since (?,?,?) is assumed to be complete this implies ???(?) e ??(? I 'Jdc)· All other properties of a linear lifting are immediate by the definition of ?? as a limit. Using Lemmata 3.4 and 3.5 for linear liftings in the same way as for densities in Section 2 in an inductive proof taking exactly the same steps (A) to (F) exhibited after Theorem 2.5 we get extension theorems for linear liftings and in the same way as for densities the (non-empty) class AC?(?) of admissible linear liftings. But note that we need completeness of the basic measure space. THEOREM 3.6. If (?, ?,?) is a strictly localizable, complete measure space and ? a ?-subalgebra of ? with ?? с ? then for any ? e 0(? \ ?) there exists ? ? e Жц) with ?\? = ?. If we take here ? as the ? -algebra generated by ?? then the existence of a linear lifting follows. It should be however noted that according to Burke and Shelah A992) it is consistent with ZFC, the Zermelo-Fraenkel set theory including the axiom of choice, that ?°°(?) admits no linear lifting for many non-complete probability spaces including Borel measure space on [0, 1]. For this reason there is no need for stating an existence result for linear liftings on strictly localizable complete measure spaces, since there the better result of the existence of liftings is available. In the context of extensions naturally arises the problem of mapping of liftings and densities. THEOREM 3.7. Let (?,?,?) and (?,?,?) be measure spaces together with a measurable map /: ? -> ? such that ? = /(?) is the image measure of ? under f. If (?, ?,?) is strictly localizable then for every ? &?(?) there exists a S e #(?) such that a(/-'(/»)) = /-'(a*)) (C)
Liftings 1145 for all В е Г. If in addition (?, ?, ?) is complete then for given ? e ?(?) we may choose ? e ?(?) satisfying equation (C) and in that case we have Sx(hof) = ix(h)of (Cx) for all h eC°°(v). In fact, if ? is the ?-subalgebra of ? generated by ?? and {f~[(B): В еТ},а density <$o for ? | ? is defined by means of 5<)(A) := /~](?(?)) if A e ? and A = f~l(B) for some ? e ?. By Theorem 2.5 we can extend <$() to a density <$ on ?. If ? is a lifting then <$o is too and we can apply Theorem 2.6 to extend it to a lifting on ? because (?, ?, ?) is assumed to be complete. Since we have applied extension theorems for getting ? from ? it it obvious that ? is not uniquely determined by ? via the equation (C). We call any ? e #(?) satisfying the equation (C) an inverse density of ? and write /~' (?) for the class of all inverse densities for ?. On the other hand for a surjective map / any ? e i?(v) satisfying (C) is uniquely determined by <$ e #(?) since then ?(Z?) = /E(/_l (Я))) for S e ?. For this reason we call ? ihe direct density of ? and write ? = /(<5). In the same way we can define the inverse ///f//ig and the direct lifiing. But note that for given ? e ?(?) in general no direct lifting exists. Clearly Uretfd') /-'(f) is the class of all densities for ? having a direct density, and similarly for liftings, but no inner characterization for the elements in this class is known and as yet very poor partial results can be given, e.g., if ? := [В с ?: f~] (?)} and / is injective then for any ? e ? (?) there exists a direct lifting, see Macheras and Strauss A992, Lemma 2.2). We refer to Kupka A983) for his 'projection' Theorem 2.7 which gives a positive result for the 'projection' of strong liftings in the presence of a disintegration. Another way of projecting from products onto its factors is discussed by Macheras and Strauss B000). For linear liftings similar results as for liftings can be obtained by an application of the extension Theorem 3.6. THEOREM 3.8. If (?, ?, ?) is a complete, strictly localizable measure space and the map f is surjective then for each ? e Q(v) exists ? ? e ?}(?) such that *(hof) = (<p(h))of (C30) for all h e ?°°(v). Again ? can be chosen as a lifting if ? is a lifting. Starting from the last theorem the inverse and direct linear lifting can be defined in analogy to the inverse and direct lifting. Note that the equation (C00) implies ? e f~](<p) and ? e /_l(^) if ? is the lower density defined by ?(?):= {?(?,*,) = ? and ? is the upper density defined by the formula ?(?) := {?(??) > 0} for A e ?. The above indicated construction of linear liftings by means of de Possel derivation has some sort of converse. As a preparation we need the following result in which (i), (ii), (iii) were given by Maharam A958), Theorem 4 for lifted sets. It is remarkable in itself since in general uncountable unions of measurable sets are no longer measurable, but it has interesting consequences in addition.
1146 W. Strauss et al. THEOREM 3.9. Let be given a c.l.d. measure space (?, ?, ?) with a density ? e #(?). Then for every non-empty collection BQ ? with В с S(B)for В € В we have (i) U?e?; (") UB^UBeBS(B)?8(UB); (iii) \J BeB B'= (UB)' (V denotes the upper bound of В in ? /?); (iv) ?0 := LUexv ?(?) ^? ????? = ? a.e. (?); (?) $\??)????{?) = ?({]?) if ? is directed upwards; (vi) (?, ?, ?) is localizable. There is a corresponding version for liftings of functions given by A. and C. lonescu Tulcea A969a). J. Gapaillard asserts condition (i) of the last theorem for monotonous liftings with a proof being convincing at least for finite measure spaces. As a first consequence of Theorem 3.9 we get Vitali derivation bases (see Kolzow A968, Section 12) for definition) in c.l.d. measure spaces with lifting by means of the next result of Kolzow A968). THEOREM 3.10. For given c.l.d. measure space (?, ?,?) with a lifting ? e ?(?) put R :=LUer,- P(A) and define gp(w) := [A eEf. ? e ACp(A)| and a p (?) := {g: g is a cofinal subset ofgp(cu)}for ? e R. Then (??(?))?(?/? is a strong Vitali derivation basis. If conversely a weak Vitali derivation basis a = (?(?))?(?/?, Rc e ?? is given in a c.l.d. measure space (?, ?,?) we define the lowerdePosselderivative with respect to a Da(f)(w)= inf liminfiAJ-f gea(u>) Aeg ?(?) for all locally integrable functions, i.e., all ? -measurable functions such that //д is ?- integrable for all A e ?/, and ? e ?. Put A,(A) := [D_ci(xa) = 1} for A e ?. Then Д, is a density for ? which can be converted into a lifting by Theorem 2.1, i.e., we have the converse of Theorem 3.10 which is also due to Kolzow A968), see also С. lonescu Tulcea A971) and Sion A973). COROLLARY 3.11. For each c.l.d. measure space (?, ?, ?) with ?(?) > 0 there exists a lifting if and only if the measure space has a weak {strong) Vitali derivation basis. In the situation of Theorem 3.10 the upper and lower de Possel derivatives with respect to ap are given by _ f. f ?? f.f ?? D(f)(w) = limsupJAJ , D(/)(^)=liminfi^f— Аеар1ш) IJ-(A) Aea„Uu) ? (A) for all locally integrable functions, i.e., for all ? -measurable functions such that /?? is ?- integrable for all A e ?/ and all ? e R := LUer P(A). If in addition R = ? then they satisfy D(f) = Z)(/) = p(f) for all / e ^(?), i.e., the lifting appears as a de Possel
Liftings 1147 derivation. The assumption LUer /°(^) = & ls trivially satisfied for every probability space and can be achieved for any ? -finite measure space. A linear lifting ? for ??(?) is a positive linear map from ?\?) into ?''(?) satisfying the basic properties A1) and A2) of liftings, where Ср(ц) is the Banach space of all finitely real-valued functions f e ?°(?) with ||/||p := /\/\?? < oo, for 1 ^ ? < oo. A. and С Ionescu Tulcea A969a, Chapter IV, Section 4, Theorem 6) noticed that there can't exist a linear lifting for ??(?), 1 ^ ? < oo, if there exists a non-negligible measurable set A which is diffuse, i.e., whose class A' e ?/? does not contain any atom (a similar conclusion holds true in case ? = 0 under an obvious definition of the linear lifting for ?°(?)). With a similar proof one can see that under the same assumption there can't exist a lifting ? e ?(?) satisfying the additional condition ????^? ?«"> = ?^=? P(An) if A.„ e ?, ? e N, in fact only the properties (LI) and (L2) of a lifting are needed for this conclusion. All these results are on the basis of ZFC, the Zermelo-Fraenkel set theory including the axiom of choice. In Solovay's model of Zermelo-Fraenkel set theory (where the axiom of choice fails) the above mentioned proof for the non-existence of a linear lifting for С (?) in case 1 ^ ? < oo carries over to the case ? = oo since (/^(?))' = L1 (?) (a result of Christensen A974)), hence <?(?) = 0 for Lebesgue measure ?, saying again that some sort of non-constructive tool like the axiom of choice is needed for an existence proof of a lifting even for Lebesgue measure (see Graf and von Weizsacker A976)). 4. Liftings on topological spaces Throughout this section {?,?,?,?) is a pretopological measure space with (?,T) being Hausdorff. Theorem 3.9 gives raise to two classical "lifting topologies" of A. and С Ionescu Tulcea A969a), which according to A. and С Ionescu Tulcea A964b, p. 445) are partially tracing back to J.C. Oxtoby and which convert lifted functions into functions continuous with respect to the lifting topology. For given measure space (?,?,?) and S e ?(?) the collection T>s := {8(A): Aeljisa basis for a topology tg on ?. We don't assert ts <? ? at the moment, but it follows from Theorem 3.9 that tg П ? с ?^ := {A e ?: AC5(A)}. The next theorem gives a collection of the basic properties of the density- and lifting topologies ts, ?$, tp, and zp, taken from A. and С Ionescu Tulcea A969a) and A. Ionescu Tulcea A967a). Before we state it, we need a definition. If (?, ?, ?, ?) is a pretopological measure space, a density ? e ?(?) is called ?-strong, if G с 8(G) for all G e ? ? ?, and topologies ? satisfying such a condition are called compatible with S by A. and С Ionescu Tulcea A969a). It is well known that the Lebesgue density D on the Lebesgue measure space on Rd is ? ''-strong for the euclidean topology Zd on E'7, and so is any lifting ? for the Lebesgue measure satisfying D(A) с p(A) for all A e ? (such liftings exist by Theorem 2.1). THEOREM 4.1. Forgiven c.l.d. measure spaces (?, ?,?) with S e ?(?) and ? e ?(?) we have the following results. (i) ft с Ti с ?.
1148 W. Strauss et al. (ii) Tg П ?? = 0 and for all A e ? there exists a G e z$ with A AG & ?^. (iii) A subset К с ? is of the 1st category with respect to the topology ?& if and only if К is closed and nowhere dense or equivalently if К е ??- (iv) The topologies tp and ?? are extremally disconnected and tp с ??. (?) The topology tp {hence ??) is Hausdorff if the set [p(A): A e ?) separates the points of ?. (vi) ??(?,??) = Cf(i2, ??) = {p°(/): / e ?°(?)? and p°(f) is the unique continuous function with respect to tp (respectively, ??) in f. (vii) Ch&jp) = Cb(n,Tp) = {px(f): feCx^)}. (viii) Iftp is Hausdorff then the Stone space of the measure algebra ? /?? is the Stone Cech compactification of^,tp). If ? e {tp, ??) we can add the following conditions: (ix) We have clT(A) = p(A)for all AeT and ? = ?(?, ??). (?) ? is the unique T-strong lifting for the topological measure space {?, ?, ?,?). (xi) The topological measure space (?, ?, ?, ?) is a quasi-Radon measure space. Here (i) follows from Theorem 3.9 and (ii) and (iii) are from A. Ionescu Tulcea A967a). A proof for (iv) to (viii) is available in A. and C. Ionescu Tulcea A969a, Chapter V, Section 3), as well as the equation ?^(?, tp) = 0%(?, ??) in (?). If S is the Lebesgue density (see Section 2) of the Lebesgue measure space (?, ?, ?) on R and ? is a lifting with <5(A) с p(A) for A e ? then ?? is completely regular and tp = ? ? by A. and С Ionescu Tulcea A969a, Chapter V, Section 4, Theorem 2). But there exists a po e ?(?) such that ??) is finer than the euclidean topology on R and fA) ? ??). For any ? & ?(?) with ?? finer than the euclidean topology on R the topology ?? is not normal and every ? ? -compact К с R is finite by A. Ionescu Tulcea A967a). It is now easy to see that generally the topological measure space (?,?, ?, ?) from (vii) will not be a Radon measure space. Take for instance as (?, ?, ?) the Lebesgue measure space with ? := zp for a lifting ? e ?(?) for which zp is finer than the euclidean topology Ed on R. Since by the above remark any zp-compact subset К of R is finite the inner regularity ?(?) = sup^(A"): К с А, К rp-compact} with respect to rp-compact subsets К of R can't be true if ?(?) > 0. If (?,?,?) and (?,?,?) are measure spaces, ? e #(?), and ? e ?(?) then a ?-?-measurable map f: ? -> ? is ts-t(-continuous if and only if /~'(f(S)) <? 5(/~'(?(#))) for all S e T. If moreover ? e ?(?), and ? e A(v) these conditions are equivalent with f~[^(B))=6(f-[ (?)) for all В e ? as well as with <(/?) о / = 5(й о /) for all/г ??°°(?). PROPOSITION 4.2. If (?,?, ?, ?) a/id (<9,5, ?, ?) are topological measure spaces, the map f is a T-S-continuous surjection, and f(8) existsforS e ?(?), then f (8) isT-strong provided ? is ?-strong. But note that under the assumptions of Proposition 4.2 for a strong lifting ? for ? an inverse lifting ie/_1(i;) needs not to be strong, see the example before Theorem 6.20. As we state below, the last theorem gives in fact a topological characterization for c.l.d. measure spaces having a lifting and there are again characterizations by weaker types of
Liftings 1149 "liftings" in an abounding number, hence we can mention only the most spectacular ones. We call a map ? from ?^{?) into itself satisfying the properties A1), A2) a monotonous lifting if in addition <p(f) ^ q>{g) for/, g e ?°°(?) and / ^ g, it is called a bounded linear lifting if ? is a linear map with \\?\\ : = sup{||<p(/)||/||/||oc: 0 < ||/||эс < oo} < oo, where 11/11 is the strict supremum and ||/||oc the essential supremum of a function / e ?^°(?), it is called г. function lower respectively upper density if <p(f A g) = (p(f) Л (p(g) and 4>{fvg) =(p(f) v<p(g), respectively for/, g eCx^). THEOREM 4.3. For given c.l.d. measure spaces (?. ?,?) the following conditions are all equivalent with the existence of a lifting for ?°° (?). (i) There exists a topology ? с ? such that the topological measure space (?,?, ?,?) is a category measure space. (ii) There exists a topology ? с ? such that ? ? ?? = 0 and a set К с, ? is of first category if and only if it is closed and nowhere dense in ?. (iii) There exists a topology ? с ? such that card(/* П Q,(i2)) = 1 for any f e ?°°(?), where C/,(i2) denotes the space of all ? -continuous, bounded real-valued functions on ?. (iv) There exists a monotonous lifting for ?°°(?). (?) There exists a function lower respectively upper density for ?°°(?). (vi) There exists a linear lifting for ?°° (?), (vii) There exists a bounded linear lifting ? for ?°°(?) of norm \\?\\ < 3. For (i) and (ii) compare Graf A973), for the sophisticated equivalence proof for (vii) see Erben A983), where an example is given that the bound 3 cannot be improved. We call ? e ?(?) (and ? e 0(?)) almost ?-strong, if there exists a set N e ?? such that for all G e ? ? ? follows G \ N с <S(G) (respectively 1^(/) \ Nc = f \ Nc for all / e Сь№) П ?°°(?)). 5 and тД are called T-strong in case N = 0. It is obvious that S e ??(?) is T-strong if and only if ? с ?^. If ?d is the euclidean topology of Wl then it is well known that the Lebesgue density D and any lifting ? for the Lebesgue measure with D(A) с p(A) for A e ? are ?f/-strong, and so are the linear liftings ?/м, 1Д2 from Section 3 obtained by L. Fejer's theorem (see Hoffmann A965, pages 20 and 33)). Any hyperstonian space has a uniquely determined strong lifting, which is given by choosing the unique continuous function from each equivalence class of /^(?). For any complete measure space (?, ?,?) any ? e ?(?) is tp-strong as well as zp-strong by the definition of tp and zp. PROPOSITION 4.4. If the topology ? is ?, 1 then for each ? e ?(?) the following conditions are equivalent. (i) ? is almost T-strong. (ii) /0°° is almost T-strong. (iii) There exists a N e ?0 such that p(F) CFU N for all closed Fel. (iv) There exists a N e ?0 such that p°(f) \ Nc = f \ Nc for all f e C(i2Kl ?°(?), where C(i2) denotes the space of all continuous functions from ? into E. (v) There exists a N e ?0 such that poc(f)\ Nc = f \ Nc for all f e С 1^H^ (?), where Сь№) denotes the space of all bounded continuous functions fют ? into R.
1150 W. Strauss et al. Here we can choose the same set N e ?? in (i) to (iv), where in particular N = 0 for T-stmng ? and remember that p, p°°, and p° are in biunique correspondence by means of the equations ?°°{??)=?° (??) = ??(?) for A e ?. The implication (i) =>· (iv) is quickly achieved by observing p° = po if po(f) = sup{r e Q: ? e p({f > r})} since then po(/) I ^ > f I Nc for / e C(i2) and p0(-f) = —po(f) hence po(f) \ Nc = f \ Nc if if ? is almost T-strong with universal set N e ??- The implication (iv) =>· (ii) and the equivalence of (i) and (ii) are obvious. Moreover (ii) =>· (i) works for almost T-strong ? e <?(?), since for a ?-,1 topology ? follows xc = sup{h e Q,(i2): /i^xG)if GeT hence xc\yv ^ ?(??) I W therefore G\N c^OforG e ? ? ? if ?_ is defined by i^(A) = [?(??) = 1} for A e ?, where for yo e Л (?) follows ? = p. So we have in addition the following result. PROPOSITION 4.5. If the topology ? is ?3 ? and the measure space (?, ?, ?) is complete then the existence of a T-almost strong density for ? is equivalent with the existence of a ?-almost strong {linear) lifting. Here we may replace "almost strong" by "strong'". If a topological measure space admits a T-strong density its measure has to be of full support, i.e., 5ирр(д) = ?, since then for all G e ? follows G = 0 from G e ??- The notion of the almost strong lifting allows us to cover in full generality the cases with 5ирр(д) ? ?, for which the notion of strong lifting is inappropriate. PROPOSITION 4.6. If (?, ?,?) is a complete measure space with supp(M) = ?, then fmm the existence of a T-almost strong lifting for ? follows the existence of a T-stmng lifting. This is easily achieved for ? e ?(?) with G\N с p(G) for G e ? ? ?, ? e ?? by choosing an ultrafilter U(a>) finer than the filter basis {G e ?: ? e G) for ? e N and putting p(A) := (p(A) П Nc) U {? e ?: AeU(co)} for A e ?. The space (?, ?, ?, ?) (or just ?) has the almost stmng lifting property (ASLP for short), if there exists an almost T-strong lifting for ?. Since for general spaces (?, ?, ?, ?) there is no "natural" candidate for an almost strong lifting, one possible issue is to check whether arbitrary liftings are almost strong. This leads to the stronger notion of universal stmng lifting property (USLP for short), which says that ? (?) ? 0 and every ? e ? (?) is almost T-strong. Results sufficient for applications (polish spaces) rely on the next result which is a generalization of a result from Maher A978), see alsoFremlin B00?), and Macherasand Strauss A996a). Before we state it we need a modification of the purely topological notion of network (see Gruenhage A984)). A family ? с ? is called a measurable network for the pretopological measure space (?,?, ?,?) if for each G e ? there exists a subfamily $cf such that G = UQ\ we denote by ?????(?) the least cardinal of a measurable network for (?,?, ?,?).
Liftings 1151 LEMMA 4.7. If (?,?, ?,?) is a pretopological measure space with a measurable network ? then ? e ?(?) is almost ?-strong if there exists N e ?0 such that G \ N с ? (G) for all G ef. The last lemma remains true if we replace liftings by monotonous liftings. It follows that a complete topological measure space (?.?, ?,?) has the USLP if there exists a cardinal К with mnw^) < К and if for any family A<^ ? with card(^) < К and ?(?) = 0 for A e ? follows UAe ? and ?A?4) = 0. Indeed for given lifting ? e ?(?) apply Lemma 4.7 for N := \^jFejr(F \ p(F)) if ? is a measurable network of cardinality mnw^). For К = K| this gives the next result of Fremlin B00?, 453F). COROLLARY 4.8. A complete strictly localizable topological measure space (?.?, ?,?) with a countable measurable network has the USLP. In particular a complete strictly localizable topological measure spaces (?,?, ?,?) possesses the USLP if their topology ? is second countable. This is an improvement of a result of Graf A975), see Maher A978). In particular, polish as well as locally compact metrizable measure spaces have the USLP (see A. and С Ionescu Tulcea A969a, Chapter VII, Theorem 8)). The next result is from Macheras, Musial and Strauss B00?a). PROPOSITION 4.9. Let (?,?, ?, ?) and (?, S, ?, ?) be Borel probability spaces and f:: ? -> ?? measure preserving map. Suppose that ? admits a strong lifting ? which has an almost strong inverse image lifting in ?(?). Then there exists a strong inverse lifting ? e ?(?) of ? under f if and only if f~\x(B)) ? G ? 0 implies ?(/~' (?) ? G) > Ofor all В eS and all G e T. To sketch the proof of the above proposition, let ? e ?(?) be an almost strong inverse lifting of ? under /. Then there exists a null set N e ?? such that G с V(G) U N for each G eT. For each ? e N, let ?(?):={?: Ае/_|(Г), ???(?)}, S(w):={f-\G): GeS, f(w)eG]. and ?(?) := {А П G: A e ?(?), G e ? (?)} and ?(?) с ? be the filter defined by Jr(w):={E e ?: 3F eH(co) with Fc ? a.e. (?)}.
1152 W. Strauss et al. Define a density ? e #(?) by means of p(E) := [?(?) ?) Nc]u p*(E) for each ? e ?, where p*(E) :={weN: Ее ? (?)). It can be shown that ? e ?(?) П /_1 (?) and that ? is strong. By Theorem 2.1 there exists ape ?(?) such that p(E) с p(?) for each ? e Г. It follows that ? is a strong inverse lifting of ? under /. A measure ? on a totally ordered space (?,?) is a measure defined on the Borel ?- algebra ?(?) generated by the order topology, ? is the completion of ?(?) under ?, where we denote the extension of ? to ? again by ?. Then the quadruple (?, ^, ?, ?) is called a totally ordered measure space. For such spaces Sapounakis A983) proved the existence of strong liftings. THEOREM 4.10. Let (?,-?,?, ?) be a totally ordered measure space such that its measure is of full support. Then there exists a strong lifting for ?. For Baire and Borel measure spaces necessary conditions for the ASLP are given by Babiker and Strauss A980a) as well as by K.P. Dalgas. PROPOSITION 4.11. If (?, ?, ?, ?) is a Baire measure space with finite measure and a completely regular topology ? then the ASLP implies that the measure ? is ?-additive and completion regular. For any topological measure space (?, ?, ?, ?) with the ASLP the measure ? is necessarily ?-additive. It follows, e.g., that the Wiener measure restricted to the completion of the Baire ?- algebra on R l°·'' does not have the ASLP. But the Wiener measure considered on the completion of the Borel ? -algebra has the USLP (see, e.g., Macheras and Strauss A996b)). Clearly the last proposition raises the problem whether the necessary conditions for the ASLP given there are sufficient? By Babiker and Knowles A978), there exists a Baire measure space (?,?, ?, ?) with compact ?, finite and non-atomic ? of full support, that is ?-additive but not completion regular. This space is an example of a compact Baire measure space without the ASLP. Fremlin A979) has given an example of a Radon measure space on a compact set with completion regular ? of full support but without the ASLP, improving a result of Losert A979) which lacked completion regularity. The completion regularity is of interest here because in important known cases where the ASLP holds true, the completion regularity is also fulfilled. Mokobodzki A975) and Fremlin A977) gave a converse for Proposition 4.11. It was noticed by Dalgas that this result is the main step towards a result on strong Borel liftings improving the result of Mokobodzki and Fremlin. In the subsequent theorem of Dalgas A99?) с stands for the cardinal of the set of reals. Different approaches have been given by Musial A973) and Lloyd A974). THEOREM 4.12 (CH). Let be given a topological measure space (?,?,?,?) with finite measure ? and ? possessing a basis of the cardinality less or equal to с Then,
Liftings 1153 (?, ?, ?, ?) has a strong Borel lifting if and only if ? с ??, supp(M) = ?, and ? is ?-additive (?? is the completion ???(?) with respect to ?\?{?)). The theorem remains true for ?-finite ? if ? is moderated or all finite Borel measures on (?,?, ?, ?) are ?-additive. The proof of the last result relies on the following theorem of Fremlin A977). THEOREM 4.13. Let be given a topological measure space (?,?, ?,?) with ? с ?? satisfying the following conditions. (i) For any A e ? and any Q с ? with ?(? П G) = 0 for all G ? Q follows ?(????)=0. (ii) If к is the cardinal of the measure algebra ? /? then the union of fewer than к sets of measure zero is measurable and of measure zero. (iii) 5ирр(д) = ?. Then there exists a strong Borel lifting for (?,?. ?, ?). Burke A993a, Proposition 3.6) gives an elementary proof that the existence of a Borel lifting for the Borel-Lebesgue measure space on [0, 1 ] implies already the existence of a strong Borel lifting for this measure space. It follows from the example of Losert A979) that the Theorem 4.12 is no longer valid without the cardinality restriction. Dalgas gives an example that for ?-finite measures spaces additional hypotheses must be imposed for a corresponding characterization. Mauldin A978) gives related results, following the original results of von Neumann A931). The first one states that subject to the continuum hypothesis, any Borel measure space (?,?, ?, ?) has a Borel lifting if (?,?(?)) is Borel isomorphic to a universally measurable subset of the unit interval [0, 1] of the reals, thus generalizing a result of A. and С Ionescu Tulcea A969a, p. 182). The second one applies Martin's axiom to construct liftings ranging in the c-algebra ??(?) of the topological space (?, ?), i.e., in the the smallest algebra containing the Borel ?-algebra ?(?) and being closed under unions of less than с sets. If the continuum hypothesis does not hold true the c-algebra may be much larger than the Borel ? -algebra. This result is one of the very few ones in lifting theory applying Martin's axiom. For more results on Borel liftings we refer to Section 5 on translation invariant liftings. Assuming the continuum hypothesis, first existence results for strong Baire liftings have been given by Losert A980), see also Talagrand A978a), for a restricted class of measures, i.e., for any product of less than or equal to Hi Radon probability measures of full support, each on a compact metric space. Grekas and Gryllakis A992) improved this result as well as a result of their own from 1991 by the following theorem. THEOREM 4.14 (CH). Subject to the continuum hypothesis every product of less than or equal to К 2 many completion regular probability measures, each supported on a product of less than or equal to ? ? many compact metric spaces admits a strong Baire lifting. The proof is based on a result for measures on product spaces satisfying a certain condition which reduces in case of compact metric factor spaces to completion regularity. The next theorem from Babiker and Strauss A980a) gives necessary and sufficient conditions for the USLP.
1154 W. Strauss et al. THEOREM 4.15. Let (?,?) be a locally metrizable space. Then a Borel measure space (?, ?, ?, ?) with a finite measure ? has the USLP if and only if ? is ?-additive, and a Baire measure space (?, ?, ?, ?) with finite measure ? and completely regular topology ? has the USLP if and only if ? is ?-additive and completion regular. This theorem implies that under the mild set theoretic assumption of the non-existence of measurable cardinals every metrizable space with finite Borel or Baire measure has the USLP. COROLLARY 4.16. Suppose that every closed discrete subspace of the metric space (?, ?) has non-measurable cardinality. Then every Baire (respectively Borel) measure space (? ,?,?,?) with a finite measure ? has the USLP. For existence of strong liftings on products and projective limits, respectively we refer to the Sections 6.2 and 6.3 below. The interest in strong liftings comes from the following theorem of A. and С Ionescu Tulcea A969a) on the existence of strict disintegrations (see the same book for terminology). THEOREM 4.17. A finite Radon measure space (?,?, ?, ?) over a compact set ? with 5ирр(д) = ? has the ASLP if and only if for each Radon measure space (X, S, ?, v) over a compact set X and each continuous surjection ?: ? —* X with ? = ?(?) there exists a strict disintegration of ? with respect to p. The notion of a strong lifting has been generalized to the so-called W-lifting, for which we refer to Levin A975) and for existence to Babiker and Strauss A980a, 1980c). Bichteler A970, 1971) shows that the problem of the existence of strong liftings for Radon measures on locally compact Hausdorff spaces can be reduced to the problem of the existence of strong liftings for Radon measures on products of unit intervals. Bichteler A972) shows that the set of all signed Radon measures ? on a locally compact Hausdorff space X such that \?\ admits an almost strong lifting is a band in the Dedekind complete lattice of all Radon measures on X, see also Bichteler A973) as well as С Ionescu Tulcea and Maher A971). For weakenings of the notion of strong lifting such as the so-called almost strong pre- density, the idempotent lifting, and the almost ?-continuous lifting, respectively we refer to Bichteler A972), Georgiou A974), Grekas A989), and Rinkewitz A997), respectively. 5. Liftings on topological groups Throughout this section a topological group X carrying Haar measures together with its ?-algebra ? of Haar measurable sets (the common domain of all its left and right Haar measures) and a left Haar measure ? on ? are given. Then (?, ?, ?) is called a Haar measure space over X by Fremlin B00?). (The case of a right Haar measure can be reduced to that of the left one.) The map ? e X -> jc_1 e X is the inversion operation. For s e X,
Liftings 1155 A<^X, and / : X -» К we consider sA := {sx: ? e A], the /e/f s-translate of A as well as y(s)f, the /e/r s-translate of / defined by means of (y(s)f)(x):=f(s-lx) for all * e X. A density 5 e #(?) and a linear lifting ? e <?(?), respectively is called left-translation invariant if S(jA) = jS(A) for all A e ? and all s e X and \l/(y(s)f) = y(s)(i/(f)) for all / e ?^(?) and all s e X, respectively. If again (see Section 3) p°° and p° are the liftings for functions in ?{^(?) and ?|?(?) uniquely generated by a lifting ? e ? (?), then ? is left-translation invariant if and only if ?00 and p° are such. In the above definition we have fixed explicitly a Haar measure ? on X for easier reference. But the definition of a left-translation invariant density and (linear) lifting is completely independent of this choice, since all left (and right) Haar measures produce the same domain ? and the same null sets ??. For this reason we can speak of Haar densities and Haar (linear) liftings. In Abelian groups we simply speak on translation invariant densities and (linear) liftings. For the next proposition compare A. and С Ionescu TulceaA967, Section 3, Proposition 1) and Fremlin B00?,448C). PROPOSITION 5.1. Any left-translation invariant linear lifting is strong. The same holds true for translation invariant densities. The Lebesgue density D defined in Section 2 for the Lebesgue measure space on Rd, d e N, is translation invariant and similarly the examples 1Д2, ?3 quoted in Section 3 provide translation invariant linear liftings on the circle group. A. and С Ionescu Tulcea A967, p. 90), show that in contrast to the uniqueness of the Haar measure, Haar densities are by no means uniquely determined even in case of the Lebesgue measure on the real line. By the next two theorems (which are the analogues of the Theorems 2.1 and 3.3, respectively) translation invariant densities and linear liftings can be converted into translation invariant liftings. THEOREM 5.2. For any left-translation invariant density 8 for ? there exists a left- translation invariant lifting ? for ? with 8(A) с p(A)for all A e ?. In fact by Theorem 2.1 we may first choose are ?(?) with 8(A) с ?(?) for all A e ? and then take p(A) := {x e X: ее ?-1 A)} for all A e ?, where e denotes the identity of X.
1156 W. Strauss et ah THEOREM 5.3. For any left-translation invariant linear lifting ? for ? there exists an extremal element ? e ?}? (see Theorem 33 for the definition) which is a left-translation invariant lifting for ?. Compare A. and C. Ionescu Tulcea A967, Section 6, Corollary 3) for the last theorem. The construction of left-translation invariant densities and linear liftings goes through the induction steps (A) to (E) lined out for Theorem 2.5. But these steps become now more complicated since the construction has to run through translation invariant ?-subalgebras. In particular, in the step from an ordinal a to its successor ordinal a + 1 the extra difficulties are a consequence of the fact that translation invariant ? -subalgebrafor a + 1 is much larger than that one generated by the ?-subalgebra for a and the new element which enters. A. and С Ionescu Tulcea A967) had overcome these difficulties by exploiting special structural properties of locally compact groups, in particular of Lie groups, getting the next fundamental result. THEOREM 5.4. For each Haar measure space (?, ?, ?) on a locally compact group X there exists a left-translation invariant lifting. The last step from a locally compact group to a group X carrying Haar measures which we are going to state now has been done by Fremlin B00?) using the fact that for a Haar measure space (?, ?, ?) on X there exists a Haar measure space (?, ?, v) on a locally compact group ? and a continuous map /: X -> ? such that ? = f^) and for all ? e ? there exists a set F e ? with /_1 (F) с ? and ? \ /_1 (F) e ?0. COROLLARY 5.5. For any topological group carrying a Haar measure there exists a left- translation invariant lifting adequate for all its Haar measures. For the special group X = {0, 1}' for a non-empty index set / with its "usual Haar measure" the proof of the existence of a Haar density from Fremlin B00?, 345C), can be modified to give a proof of the existence of an admissible translation invariant density and a translation invariant admissible linear lifting being in particular a strong admissible density and strong linear lifting, respectively. By Theorem 5.2 there exists an admissibly generated translation invariant lifting, hence by Proposition 5.1 an admissibly generated strong lifting. Concerning translation invariant Borel liftings for Haar measure spaces Johnson A980) proved that in ZFC no translation invariant lifting for the Haar measure on the circle group R/Z can be a Borel lifting. This result was extended by Talagrand A982) to non discrete compact Abelian groups. While R.A. Johnson used results from topological dynamics on the circle group M. Talagrand's proof is simpler since it is based on the well known construction of the Cantor set. Generalizing Johnson's procedure Kupka and Prikry A983) succeeded to extend M. Talagrand's result to non-discrete (possibly non-Abelian) locally compact groups replacing also Borel liftings by more general liftings ? with Baire property, i.e., p(A) is a set with Baire property for any A e ? (note that every Borel set has this property). M. Talagrand's simpler method was developed for the non-Abelian case first by Losert A983) and subsequently by Burke A993) who succeeded in the next stated theorem to improve the result of J. Kupka and K. Prikry.
Liftings 1157 THEOREM 5.6. In each non-discrete locally compact group X there exists a Borel set ? such that for an arbitrary left-translation invariant lifting ? for the Haar measure space (?, ?,?), theset p(E) is not universally measurable and does not have the Baire property. Here a set А С X is called universally measurable if A is ?-measurable for every Radon measure ? on X. Fremlin A989) gives a list of open problems. Questions 5.7. A) Is there a cardinal к such that the Haar measure on {0, 1}* has no Baire lifting? This problem is connected with the next unsolved one about the existence of product liftings (see Section 6 for the definition) in incomplete probability spaces. B) Do there exist product liftings for the Borel measure space on [0, l]2? C) Does {0, l}*1 possess a Borel lifting? By Mokobodzki's Theorem 2.7, subject to CH, the spaces {0, 1}^' for /=0,1,2 have Baire liftings. According to Burke and Shelah A992) it is consistently true with ZFC that there is no Borel lifting on {0, \)K for any к. D) Is the existence of a Borel lifting for {0, 1}'" consistent with 2K° > N3? The problem of existence of translation invariant liftings has been generalized to the problem of the existence of Q-invariant (linear) liftings for a given set Q of bi-measurable maps s: ? -> ? over a measure space (?, ?, ?) by A. and С Ionescu Tulcea A969a, p. 182) and Maher A974, p. 69). Besides the positive solution for the group of left translations in topological groups carrying Haar measures from above (see Corollary 5.5) another positive solution was given by A. Ionescu Tulcea A965) for the existence of a Q- invariant linear lifting if (?, ?, ?) is strictly localizable and Q is a countable amenable group. On the other hand von Weizsacker A977) proved on the basis of a lemma on automorphisms for complete Boolean algebras criteria for the non-existence of ^-invariant liftings. In particular he gives a maximality argument for the set of left translations in the situation of Theorem 5.4, more precisely he showed in von Weizsacker A976), Corollary A.3. on the basis of his fixed-point result the next theorem. THEOREM 5.8. IfX isa connected locally compact group with left Haar measure then for every set G of continuous bi-measurable and null-set preserving bijections on X which is strictly larger than the set of all left translations there exists no lifting commuting with G. 6. Permanence of liftings 6.1. Liftings in products We denote by (f]i6/ ??, <g)i6/ 27,-, (g),6/ ?/) or by <g)i6/ (?,, 27,-, ?,) or by (?,, ?,, ?,) the product probability space of the probability spaces (??,, 27,-, ?,) (/ e /) and (??6/ ^" <8>i6/ ??. <8>,6/ Mi) or <8>,6/ №. ??' ?/) denotes its (Caratheodory) completion. For each
1158 W. Strauss et at. 0 ? J с / we denote by (Qj, ?],??) the product measure space (?)iej(&i, IT,-, ?,-). For any 0 ^ J с / the canonical projection of i2/ onto i2y is denoted by p./, where Pi : = p(,¦} if / e / and the ?-algebra ? у' (Ej) с X1/ is written ITy. For a probability space (<9, T, v) and a non-empty set / we write (?1, ?1 ,?1) for the product probability space 0/Ej(C/, ??,??) with all its factors (?,-, 2?,-, ?,-) equal to (<9, 7, v) for /' e /. If / is a function defined on П/е/ ^ ап(^ (?<? · ¦ ¦¦ '?'«) e ??'=?^ are fixe(J. then /(?,-, ?,„) is the function on ?/6/\|?, ?„] ?? obtained from / by fixing (?,-,,..., ?,-,,). In a similar way the sets E(Wi ? ) being sections of a set ? С П/6/ ? are defined. In case of the product of two spaces, we shall be using the notation /?, fe and ??, ?? rather. During the last fifteen years a good deal of research in lifting theory concentrated on the problem of the existence of liftings compatible with the product structure of probability spaces leading to different types of compatibility of increasing complication, starting from TalagrandA989). Throughout what follows let be given a family ((??, 2?,-, ?,))/6/ of probability spaces with index set / ? 0 and a probability space (?, ?, ?) with ?=??,?^??,?\?/ = ?/. If 0 ? J с /, then С) := {f о pj: f e ?^(?^)} is the set of functions, determined by coordinates in J. Moreover, 0 A< := ? J1 (? A<) for A< e ?;, / e 7 ieV \e./ ' and <S>fi ¦= (? /') °PJ for /' e ^(W), ' e У ie./ \e./ ' are the cylinder sets and functions, respectively. We call S e #(?) a product-density for ? if for all / e / exists 5,- e #(?,) such that 5 @ A,A = 0 5,- (A,-) for all A, e 27,-, i eF^I, F finite. (P) The 5,-, / e / are uniquely determined by <5 via (P) but conversely S is only uniquely determined on the cylinder sets by the family (<5,);6/ via (P). We therefore write S e 0i6/ <5, and call <5, the /th marginal of 5. ???(?) is the class of all product-densities, and ??(?) := ??,,(?) ? ?(?) is the class of all product-liftings for ?. Clearly the marginals of any ? e ?,,(?) are in ? (?,) for / e /. In the same way we can define the class 0,,(?) of all product linear liftings and the class ?^(?) : = 0?(?) ? ?30(?) of all product (function-) liftings, where indeed ? e ?,,(?) if and only if px e ?^(?) and in that case the marginals satisfy the equation (pi)°° = (p30)/ if / e /. The product-lifting was first investigated in A. and С Ionescu Tulcea A969a, Chapter VIII). Then Talagrand A982) introduced a consistent lifting. A lifting ? for a complete probability space (?, ?, v) is said to be consistent if for each ? e N there exists a product lifting p" e A@ v) with all its /th
Liftings 1159 marginals equal to ? for / = 1,..., n. Consistent densities can be defined in the same way. The existence problem for consistent liftings has been definitely solved by Talagrand A989). THEOREM 6.1. For each complete probability space (?. ?. v) there exists a consistent lifting ? e A(v). It is however not true that each lifting is consistent. Talagrand A988) assuming (CH) constructed such a Borel lifting ? on [0, 1] with respect to Lebesgue measure that its product p2 defined on Borel rectangles by p2(A ? ?) = ?(?) ? ?(?) cannot be extended to a lifting on Lebesgue measurable subsets of the unit square (in particular it cannot be also extended to a Borel lifting). Question 6.2 (cf. Burke A993)). Can one produce in ZFC a lifting for Lebesgue measure on [0, 1] such that p2 does not extend to a lifting on [0. I]2? Question 6.3 (cf. Burke A993)). Is it a theorem of ZFC that there is no Borel lifting ? on [0, 1] such that p2 extends to a Borel lifting on [0. I]2? Question 6.4 (A.H. Stone). Is there consistently a Borel lifting on [0, 1] of bounded Borel class? The existence of a product lifting in case of a product of arbitrary complete probability spaces was solved by Macheras and Strauss A996d). For an improvement of this result see Theorem 6.11 below. S & ?(?) and ? e 0(?) will be called respecting coordinates for ? if S(Ej) с E*j and i/(C*j) с С* for all 0 ? J с /. The notion traces back to works of Burke A995) and Fremlin B00?). For finite index set / all the above concepts make sense for ?-finite measure spaces (?,-, IT,-, ?,-), /' e /, instead of probability spaces, since in that case there exists a unique product measure. But again it is only a matter of technique to carry over the results. So we are going to consider probability spaces only. The first essential result concerning liftings respecting coordinates is due to Burke A995). It has been obtained by an application of a theorem of Erd6s-Rado. THEOREM 6.5. For any finite family ((?,-, ?,, ?,)),"=1 of complete probability spaces there exists a lifting for (^)J = | ?, respecting coordinates. Fremlin B00?, 345G), attempting to prove an infinite version of Theorem 6.5 took first into account densities respecting coordinates. Applying D. Maharam's theorem on the structure of measure algebras (cf. Fremlin A989,3.9)) Fremlin B00?) has got the following result. THEOREM 6.6. For any family ((?,. Ej, ?,)),6/ of complete probability spaces there exists a separately additive density S for ? / respecting coordinates.
1160 W. Strauss et al. Separate additivity above (D.H. Fremlin called it the (*) property) means that 8(AUB)=8(A)US(B) for any A e ?) and В е ?*? (*) for all disjoint J, К с /. It seems that the proof procedure applied above is restricted to complete probability spaces and gives no information about the marginals. Probably without the completeness assumptions one cannot obtain the separate additivity above. The admissible densities have convenient properties from the product point of view and for this reason they are the natural candidates for marginals in the existence results below. The next result of Macheras, Musial and Strauss A999) gives not only a solution of the existence problem for densities respecting coordinates and arbitrary probability spaces but at the same time it produces a unifying approach to all former existence results for existence of product densities and liftings as well as to those respecting coordinates. The conclusion of this result is on one side weaker since it can not be shown that the density which is respecting coordinates is separately additive in addition but on the other hand its proof is quite elementary (in particular D. Maharam's theorem is not applied there) and the product probability needs no completion. THEOREM 6.7. Let ((?,-, 2?,-,?,-))?6/ be a family of probability spaces. If io e / is fixed, then for each <5,0 e #(?,-0) and for arbitrary- <5, e ?#(?,) with i e / \ {io} there exists ? ? e ??(?) such that S respects coordinates and ? e <S>,6/ &· ^n Particular the theorem holds true if all the densities <$,, i e /, are identical and admissible. As a consequence, it follows that each admissible density is a consistent density. Since ??(?,¦) ? 0 if /' e / the above result is a generalization of Macheras and Strauss A995). The proof of Theorem 6.7 is based in principle on the inductive steps exhibited before Theorem 2.6, but this time the induction is more complicated (see Macheras, Musial and Strauss A999) for details). Applying Theorem 6.6 Fremlin B00?) proved the following nice result. THEOREM 6.8. For any family ((?,-, 2T;,ji;))ie/ of complete Maharam homogeneous probability spaces there exists a lifting for <S),-6/Mi respecting coordinates. The proof of Theorem 6.8 has been reduced to the next Theorem 6.9 of Fremlin B00?) which is of an even more special nature by using the fact that the measure algebra of any Maharam homogeneous probability space is isomorphic to some {0, 1}'' with its usual measure v,- for i el. THEOREM 6.9. For any set I any translation invariant lifting for the usual measure ? on {0, 1}' respects coordinates. Then Macheras, Musial and Strauss B00?c) applying Theorem 6.22 to the conclusion of Theorem 6.8 were able to generalize Theorem 6.8 to the following form:
Liftings 1161 THEOREM 6.10. For an arbitrary family ((?,-, ?1,, ?,)),6/ of complete Maharam homogeneous pmbability spaces and arbitrary finite collection (((У/, Tj, Vj))jej of complete probability spaces there exists a lifting for «S>,6/ M/) § «8> /е./ vi) respecting coordinates. In case when / = 0 one gets Burke's theorem with a completely new proof. It is as yet an open problem whether for arbitrary families of complete probability spaces there exist liftings respecting coordinates. Only partial results are known. The next result seem to be the most far reaching for general complete probability spaces at the moment. (See Macheras, Musial and Strauss A999) as well as Fremlin B00?) for this result.) THEOREM 6.1 1. Let к be an ordinal and ({??, ??, ??))?<? be a family of complete probability spaces. Then for each po e ?(??) and each collection {pa e AG ?{??): 0 < a < к) there exists a lifting ? e ®a<K Pa respecting coordinates of each initial segment ofK. Considering linear liftings a general existence theorem was given by Macheras, Musial and Strauss B000). The proof is based on the extension Lemmata 3.4 and 3.5. THEOREM 6.12. ((?,-, ?{,?,)),-6/ be a family of complete probability spaces with product pmbability space {?,?,?). If iq e / is fixed, then for each ?,0 e <?(?,0) and for arbitrary ?, e ?<?(?,) with ie/\ {/<)} there exists ? ? e 0(?) such that ? respects coordinates and ? e <S>,6/ ?'· The above result gives also an alternative proof of Theorem 6.6. It seems however that the extreme point method of Theorem 3.3 does not work while trying to convert a linear lifting respecting coordinates found in Theorem 6.12 into a lifting respecting coordinates. 6.2. Stmng liftings in products Though weaker than liftings respecting coordinates, product liftings are useful for transporting strong liftings from the factors onto the product as well as for attacking a problem posed by Kupka A983), whether the product of two topological (especially Radon) probability spaces has the ASLP if the factors have this property. In particular it is natural to ask, whether the product of hyperstonian spaces has the ASLP. It is known from Macheras and Strauss A996c) that the only possible candidate for a strong lifting on the ordinary completed product (? ? ?, ? ? ?, ? §> ?, ? §> ?) of a hyperstonian probability space (?,?, ?, ?) with itself is necessarily a product lifting of the canonical strong liftings in the factors. If (?, ?, ?, v) is the hyperstonian space of, e.g., the Lebesgue probability space on [0, 1 ], it follows from the next theorem that there exists no strong lifting for ? <g> ? since ? ? ? g ? §> ?, the latter a result of Fremlin A976). But let us remark that subject to the continuum-hypothesis the Radon product of (?, ?, ?, v) with itself has the ASLP by Theorem 4.12 (see Macheras and Strauss A996c, Section 3, Remark 5) for details). It remains an open problem, whether the Radon product of two Radon probability spaces
1162 W. Strauss et al. with the ASLP has the ASLP, in particular it is unknown, whether the Radon product of two hyperstonian measure spaces has the ASLP. Let us remark that the existence result for product liftings, Theorem 6.11, can be stated in an equivalent form in terms of products of lifting topologies which implies Theorem 6.13, a permanence result for strong liftings on products. For the proofs of the next three theorems see Macheras and Strauss A996d) as well as A996c). If ((i2,-,7/)),-6/ is a family of topological spaces we write f],6/ % f°r the product topology on f],6/ ?- THEOREM 6.13. Let be given a family ((?,, 7/, ?,, ?,)),6/ (/ a non-empty set) of complete topological probability spaces, pj e Л (?,) for each i e I,_Jet (?, ?,^) be a complete probability space such that ? = ?]/6/ ?,, ®2Г,- с ?, д|0F/2Г,- = <S>/6/Mi. and let ? e ?(?) satisfy ? e ®,6/ Pi- Then we have that ?,6/ % <? ? and ? is strong with respect to ?]|6/ % if and only if all p, are strong for i e /. If (?, ?, H, v) is again the hyperstonian space of the Lebesgue probability space on [0, 1 ] then it follows from Theorem 6.13 the surprising fact that the unique strong lifting ? for ? is neither consistent nor admissible and any consistent lifting ? for ? is not strong. In particular, ? has the ASLP but not the USLP and the lifting topologies ta and ?? satisfy ta ? t„, ?? ? ?? g ? <g> ?. Under a mild set-theoretic assumption, i.e., assuming the non-existence of measurable cardinals, a purely topological result of P.C. Curtis, M. Hendricksen, and J.R. Isbell (see Gillman A960, p. 53)) says that a product of two topological spaces is extremally disconnected if and only if one factor is extremally disconnected and the other one is discrete. This implies the following result whose assumptions are satisfied, e.g., by the Lebesgue measure space on [0, 1]. It answers the question, whether the product of lifting topologies is a lifting topology to the negative. THEOREM 6.14. If we assume the non-existence of measurable cardinals then for two complete probability spaces (?(, ??, ?,) and pi e ? (?,) with non-discrete lifting topologies tPl, ??? (/ = 1,2) the pmduct topologies tP] ? tp,, ??] ? ??2 are not extremally disconnected. #(?\ ? ?2, ?, ?) is a complete probability space such that ?\ <g> ?2 с ?, ?\?\ ®?2=?\ <8> ??, tP] ? tP2, ??] ? ??2 с ?, then in particular we have for each ? e ?(?), and if ? is stmng with respect to tP] ? tP2 respectively ??] ? ?? (for example if ? e p\ <g) p2) then tPl ? tp-, С tn and ??] хтР2Стп, but not equality. The next result is an analogue of Theorem 6.11 for strong liftings. THEOREM 6.15. Let к be an ordinal, ((??, %, ??, ??))?<? any family of complete topological probability spaces with completed product space (?,?,?) and pmduct topology T. Suppose that all measures ?? (? < ?) have full support and that ?? has
Liftings 1163 the ASLP while all other measures ?? for 0 < a < к have the USLP. Then for any stmng lifting po & ?(??) there exist strong liftings pa e A(\ia)for 0 < a < к and a strong lifting ? e ?(?) such that ? e ®?<? ??, ? <? ? and ???? ?\ ??\ = ((?)??(?)? ?\ ?? ?<^?<? ' ^?<? ' ?^?<? for all A e &)?<??? and all a < к, that is ? is a strong lifting respecting individual coordinates and initial segments of coordinates. In particular, if ?? = B^a)for all a < к then ? = ?(?) and if all the measures ?? (a < к) are completion regular then ? is also completion regular. The last theorem together with Corollary 4.8 imply the next classical result of Maharam A958) and Kakutani A943). COROLLARY 6.16. For every ordinal к there exists a strong lifting for the usual measure ? on {0, 1}?. Another consequence of the last theorem together with Corollary 4.8 is the next one given by Fremlin B00?, 453H). COROLLARY 6.17. Let ((??,??, ??,??))??? be a family of complete topological probability spaces with completed product measure space (?, ?, ?) and product topology T. Suppose that all measures ?? (a e /) have full support and that every Ta has a countable measurable network. Then ? is ? ?-additive topological measure and has a T-strong lifting. 6.3. Liftings on projective limits Throughout what follows (??, ??,??, fap, I) denotes a projective system of probability spaces which will be assumed complete if liftings are involved. If / = к, where к is an infinite cardinal, we say that the projective system (??, ??,??, /??,?) of probability spaces is continuous, if for every limit ordinal ? < к we have that (??, ??, ?^, {fa}a<^) is the projective limit of the projective system (??, ??, ??, /??, ?}. For projective systems of topological probability spaces a similar definition can be posed. As for products and inductive limits suitable notions of compatibility of projective systems and liftings or densities are crucial. A family (pa)aei °f densities pa e ?{??) is called self-consistent, if ??(/??(?))=/-??(??(?)) (?) for all A e ?? and all ?, ? e /with a ^ ?. For liftings pa e ?(??),? e / this is equivalent to the ???-??? -continuity of the maps fap as well as with the equations P?(hofaP) = (P?(h))ofaP (pOC)
1164 W. Strauss et al. for all h e ?°°(??) and all ?, ? e / with ? ? ? by Section 3. The latter equation can be taken as a definition of a self-consistent family of linear liftings for the above projective system. If (?, ?,?, (fa)aei) IS the projective limit of the above projective system a density ? e ?(?) is called ? projective limit of the self-consistent family (pa)aei if p{fal(A))=f-](pa(A)) for all A e ? and all a e /. Again for liftings (where we take the completed projective limit) this is equivalent to the ???-??-continuity of the maps fa as well as with the validity of the equations px(b/„) = (p„xW)o/„ for all h e ?°°(?„) and all a e / and this equation can be taken as the definition of the projective limit of linear liftings. In symbols we write ? e proja6/ lim pa for densities and (linear) liftings. We assume throughout that all canonical projections fa of a projective limit (?, ?, ?, (fa)aei) of probability spaces are surjections. Suppose we have a projective system (?, ?,?, (fa)a<K)- In general, it is not obvious whether there exists at all a self-consistent family of densities or liftings associated with the system. By Macheras, Musial and Strauss B00?a) there is an answer to the positive in the next result. THEOREM 6.18. Let к be an infinite ordinal and (??, ??, ??, fap> к) a continuous projective system of probability spaces with pmjective limit (?, ?,?, (/?)?<?)· Then there exist a self-consistent family {??)?<? of densities Sa e ?(??) and a density ? e #(?) such that S is a pmjective limit of the system (Sa)a<lc. If all probability spaces of the pmjective system are complete and the pmjective limit is taken completed, then we may in the above replace the word "density" by "linear lifting" and "lifting", respectively, throughout. We will give examples below showing the existence of self-consistent families over non well ordered index set without projective limit. Necessary and sufficient conditions for the existence of projective limit liftings are given by Macheras and Strauss A994). THEOREM 6.19. Let be given a pmjective system (??, ??, ??, /??. ?) of complete probability spaces with completed projective limit (?, ?,?, (fa)aei), ? self-consistent family (pa)aei of strong liftings pa e ?(??) and a lifting ? e ?(?). Then the following conditions are all equivalent: (i) the pmjective limit topology ? of {???}aei 's contained in ? and ? is stmng w.r.t. ?; (ii) the pmjective limit topology ? of(tPa)aei is contained in ? and ? is strong w.r.t. T; (iii) ? is a pmjective limit of (pa )ae/; (iv) ? с ?p\ (?) ? с tp.
Liflings 1165 For projective systems of (complete) topological probability spaces we may ask for the existence of self-consistent families of strong liftings and densities and of strong projective liftings. In general self-consistent families of strong liftings do not have any projective limit lifting as shown in the next example. Let X := {0, 1}', where / is an index set with card(/) = N2, and let ? be the probability measure on Bo(X) constructed by Fremlin A979). Denote by T(I) the system of all finite subsets of /. For any a e JF(/) put Xa := {0, 1}" and ?? : = fa(fJ-), where /„ is the canonical projection from X onto Xa, and denote by Ta the discrete topology on Xa. For any ?, ? e ? (I) with a ^ ? denote by fap the canonical projection from Xp onto Xa. If by ? is denoted the projective limit topology of the family (Ta)aeyr{l), then (?,??(?),?, (f^aeFU)) 1S tne projective limit of the system {Xa,Bo(Xa), ??, /??,?(?)). For any a e T(I) the map pa from B0(Xa) onto itself defined by pa(A) = A for any A eBo(Xa) is a strong lifting in ?(??) and the family (pa)a€^(i) is self-consistent. Since ? does not admit a strong lifting, in particular there cannot exist any lifting ? e ?(?), which is a projective limit of (pff)ff6^-(/), because in such a case we should have G = p(G) for any element G of the family G-={fal(Ga): Gae%,aeT(I)}. But since Q is a basis for the topology T, it follows from Lemma 4.7 that ? should be strong; this yields a contradiction. The same example shows also that an inverse lifting of a strong one under a continuous and measure preserving map is not in general strong. In fact, for each a e T{I) there exists by Theorem 3.7 a pa e ?(?) which is an inverse lifting of pa under /„. But according to Fremlin A979) pa cannot be strong. But for well ordered index set there is always a positive solution due to Macheras, Musial and Strauss B00?a). THEOREM 6.20. Let к bean infinite ordinal and (Xa,Ta, ??, ??, /??, ?) be a projective system of complete topological probability spaces. Suppose that (?, ?, ?, (/?)?<?) " 'he projective limit of the above projective system and {??)?<? is a self-consistent family of strong densities ?? e ?{??). Then for the projective limit topology ? of (Ta)a<K follows ТсГ and there exists a strong density ? e ?( ? ) such that ? is a projective limit of the system (??)?<?. ^ ^ _ In particular, if for every a < к we have ?? = B(Xa) then ? = B(X) and, if in addition ? = Bq(X) and all the measures ?? are completion regular, then ? is also completion regular. The same is true if we replace "densities" by "liftings" throughout. The last result raises the problem of the existence of a self-consistent family of strong liftings for projective systems of complete topological probability spaces. Even for well ordered index set the answer is to the negative in general as witnessed by the following example. Take Fremlin's Radon probability measure ? on X := {0, l}^2 which has no strong lifting and is supported by X (cf. Fremlin A979)). If к is the smallest ordinal of
1166 W. Strauss et al. cardinality N2, then (?, ?(?), ?, (fa)a<K) is the projective limit of the projective system {Xa,B(Xa), ??, /??,?) of probability spaces, where Xa := {0, ?}", /?? (respectively /„) is the canonical projection from ?? onto Xa (respectively from X onto Xa), and ?? is the image measure /?(?) on B(Xa) for a ^ ? < к. Assume that there exists a self-consistent family (pa}a<K of strong liftings pa e ?(??). Then by Theorem 6.20 there exists a strong lifting ? e Л (?); this yields a contradiction. On the other hand we know two classes of spaces with projective systems admitting a self-consistent family of strong liftings even for arbitrary index set. These are the hyperstonian spaces with measure of full support and the extremally disconnected Baire spaces where each set of the first category is closed, endowed with a category measure (cf. Macheras and Strauss A994, Remark 2.2(iii) (b) and (c)), which includes also the definitions of the above notions). 6.4. Various Fubini products The most difficult type of lifting in products arises if we ask for lifted functions with measurable respectively lifted sections or even for Fubini-like formulas. There is still a comparatively satisfactory answer for densities in the next theorem. All results in this subsection are taken from the paper Musial, Strauss and Macheras B001) if not otherwise indicated. THEOREM 6.21. Let (?,?,?) be an arbitrary probability space. If ? e ??(?) then for each (?, ?, ?) and each S e #(?) there exists ? e ?{? <g) v) with the following properties: (i) ? ?<5<8>?; (ii) [?{?)]? = ?{[?(?)]?)^?????? and ? e ?® ?; (iii) [?(?)]? is ?-measurable for all ? e ? and ? e ? ®T. Admitting the completeness of both measure spaces one can find ? e ?(? <g> v) satisfying the properties (ii), (iii), and the following two properties: (iv) ?(? ? S) с S(A) ? ? (?) for all A e ? and В еТ: (?) ?([?(?)]? U [<р(Ес)]ш) = 1 /or ?// ? е ? and ? е ? % ?. The condition (iii) cannot be improved essentially. More precisely, if S e ?(?), ? e ??(?) and ? e ?(? <g> ?) are such that (ii) is satisfied and [?(?)]? = ?([?(?)]?) for all ? e ? and ? e ? § ? then either ? or ? is purely atomic. For liftings a corresponding result looks as follows: THEOREM 6.22. Let (?, ?, ?) and (?, ?, v) be complete probability spaces. For each ? e AGA{v) and each ? e ?(?), there exists ? e ?(? ® v) such that the following conditions are satisfied: (a) ? e ? <8>?; (b) [?(?)]? = ff([7r(E)L)/or ?????? and ? &?&?.
Liftings 1167 Equivalently, n~epx®a~ and [???)]? = ??([*??)]?) for each f e ?°°(? <§> ?) and each ? e ?. The above result is in a sense the best possible, as it follows from the next theorem. THEOREM 6.23. Let (?, ?, ?) be a complete non-atomic and perfect probability space and let (<9, T, v) be a complete non-atomic probability space. There exist no liftings ? e ?(?) and ? e ?(? ® ?) satisfying the following two conditions: (j) there exists ? e ? such that for each ? e ?®? [<p(E)f e ?- (jj) for each ? e ? ® ? there exists a set Ne e ?? such that [?(?)]? = ?([?(?)]?) for each ? ???. In particular, no ? e ?(?) is an admissible density. So we are left with the following problem for given complete probability spaces (?,?,?.)???(?,?,?). Question 6.24. (i) Does there exist a lifting ?? e ?°°(? § ?) such that [<?^(/)]? e ?°°(?) and [??(/)]? eCoc(?)fo?аnfeC^x(?®v)c?ndfo?гLn(?,?)e? ? ?? (ii) Can we choose in addition ?? e ?^(? § ?) in (i) for some marginal liftings? In case both these complete probability spaces coincide with the Lebesgue probability space over [0, 1] subject to the continuum hypothesis we get an answer to the positive for Question 6.24(i) just by choosing a Borel lifting according to Theorem 4.12 for the 2-dimensional Lebesgue probability on [0, l]2. But what is the position without the continuum hypothesis? In case of linear liftings Macheras, Musial and Strauss B00?d) proved the following: THEOREM 6.25. Let (?, ?, v) be a complete separable probability space and (?, ?, ?) a complete probability space. Then for each ? e <?(?) and each ? e AQ(v) there exists a ? e 0(? <8> v) such that (i) (p(g ® h) = p(g) <g> r(h) for all g e ?зс(м) and h ??^(?); (ii) for each f e ?°°(? ® v) and each we ? M/)L = r(H/)]J·
1168 W. Strauss et al. It remains an open question whether the separability assumption is necessary. However again there is no hope for a linear product lifting with all sections being lifting invariant. The subsequent theorem of Macheras, Musial and Strauss B00?d) is a direct consequence of the appropriate part of Theorem 6.21. THEOREM 6.26. Let (?,?,?) be a complete probability space and (?,?,?) be an arbitrary complete probability' space. Assume that there are ? e 0(?), ? e Q(v) and ? e 0(? <8> v) such that the inequalities И/)]ш@) ="([*>(/)] J@) and [<p(f)]\o>) = p([<p(f)f)(o) hold true for all (?, ?) and all f e ?^ (? <§> ?). Then at least one of the measures is purely atomic. Question 6.27. Does there exist ? e 0(? ? ?), ? ? ^?) and ? e Q(v) such that ? ep®a, [?°°(/)]? e Cx(v) and [<px(f)f e ?^(?) for all / e ?^(? ® v) and for all (?, ?) &? ? <9? Question 6.28. Does there exist a ? e <?(? § ?) and a ? e ^(v) such that for all / e ?°°(? § ?) exists tf/ e ?0 with ?([<?(/)]?) = [?(/)]? e ??(?) for all ? ? tf/ and [<p(f)f e ?°°(?) for all ? e 6>? For general complete probability spaces these problems are open as far as we know. The next two results are related to the topic of this section since they also concern some measurability properties of liftings from a different point of view. They are due to Christensen A974). THEOREM 6.29 (CH). Let ? be a separable ?-algebra on ? (i.e., generated by countably many sets and containing all points) and let ? be a probability on ?. Let ? be the weak*-topology on the space Loc(?). Then there exists ? e 0(?) such that for each probability measure ? defined on the product ?-algebra ??(?,^(?)) <g> ? the function if, ?) -> p(f)(w) is measurable with respect to the completion of v. In particular each functional f* —> p(f)(a>) is universally measurable on ?,^(?). THEOREM 6.30. We keep the notation of Theorem 6.29. Assume that ? is non-atomic and let Ba (?) be the ? -algebra of sets possessing the Baire property with respect to the weak* - topology of ?.°°(?). If ? e Л (?), then for almost every ? e ? there exists a measure ?? on ??(?) such that the functional f -> p(f)(a>) is not measurable with respect to the completion of ??. 6.5. Applications of Fubini products to stochastic processes Theorem 6.22 has applications to functions of two variables and stochastic processes. All results in this subsection are taken from Musial, Macheras and Strauss B001). Let
Liftings 1169 (?,?,?), (?,?, ?) be complete probability spaces and let {Хв}неп be an arbitrary real-valued stochastic process on (?,?,?). If {??}??(-) is another stochastic process then it is called to be equivalent to {Хн}неп if for each ? e ? we have XH = ? ? a.e. (?) (the exceptional set may depend on ?) according to Talagrand A987). [Хв}не(-> is said to be measurable if the map (?, ?) -> ?«(?) is measurable with respect to the product ?-algebra ? ® Т. {Хн}не(~> is bounded if the family {XH: ? e ?] is bounded in ?,°°(?). There are several papers concerning the existence of measurable (or separable) processes that are equivalent to a given process (cf. Cohn A978), Talagrand A987, 1988)). Sometimes a measurable process equivalent to a bounded {Хн}ве(-) can be defined by setting ?? = р(Хв), where ? e ?(?) and the initial process X or (?, T, v) satisfy some additional conditions. In particular, the lifting ? in Cohn A978) is assumed to be strong and ? is taken to be an interval. In general however, a strong lifting might not exist on an investigated topological measure space. With the help of Theorem 6.22 we get, - by a different method-, the following two results stated in Cohn A978) under additional topological assumptions. THEOREM 6.31. For each ? & AGA(v) and each bounded measurable stochastic process {XeJflew on a space (?,?,?) there is a collection of measurable functions {УеЬем °n (?, ?, ?) satisfying the following conditions: (i) ?.(?) = ?(?.(?)) for each ???. (ii) There is ? ? e To such that for every- ? ? ? ? we have XH = Ye a.e. (?) and {??\???? is a measurable stochastic process on (?,?,?). (iii) There is ?? e ?? such that for every ? ? ?? ? .(?) = ? .(?) a.e. (?). (iv) If ? is a separable metric space and (Хн)н$М\ '·* continuous in probability, then (??)???? is separable. Furthermore, every countable dense subset of&\ ? ? is a separating set for (Yh)h$mx- In the terminology of Cohn A978) the process {J«}«6w is called ?-canonical. Another application of Theorem 6.22 is the next result. THEOREM 6.32. For each ? e ???(?) and each bounded measurable stochastic process {??)??? on a space (?,?.?) there is a measurable process {Yh}h€(-> on (?, ?, ?) that is equivalent to {Хн}не(-> and satisfies the following conditions: (i) ??=?(??) for each ?; (ii) There exists a set ?? e ?? such that for each ? <? ?? we have ?(?) = ???) a.e. (?). Notice that in Theorem 6.32 we had to use properties of ? (X) in order to assure the plane measurability of Y, where ? is given by Theorem 6.22. Direct defining of the process ? by setting ?? = p(Xe) for quite an arbitrary lifting might destroy the plane measurability properties of the process X (cf. Cohn A978)).
1170 W. Strauss et al. One of the important problems in the theory of functions of two variables is pointing out of conditions guaranteeing the plane measurability of a separately measurable function. A notion of a stable set investigated by D.H. Fremlin and M. Talagrand (see Talagrand A984)) turned out to be very fruitful in this field. In particular, Talagrand A984, 10-2-1) proved that if ? is a compact set, ? is a Radon measure on ?, /: ? ? ? ^ R is measurable as a function of the first variable and continuous as a function of the second variable, then / is measurable, provided the family {fH: ? e ?] is a stable set. Applying a result of Fremlin A983) on stable sets together with Theorem 6.22, we get a similar result for functions that have lifting invariant sections. It may be considered as a strengthening of M. Talagrand's method from Talagrand A988) of modification of a stochastic process with the help of consistent liftings. THEOREM 6.33. Let (?,?,?), (?, ?, ?) be complete probability spaces and, and {Хв)ве&) be an arbitrary real-valued bounded stochastic process on (?,?,?) with measurable paths (i.e., all functions ?? are ?-measurable). If the set {X«: ? e ?] is stable then for each ? e ???(?) the process {YH}HeH given for each ? by Yh '¦= ?(??) ls ? <g> ? -measurable and equivalent to {Хн}не(->- COROLLARY 6.34. Let (?, ?,?), (?, ?, ?) be complete probability spaces and let /:йх e^Rteo separately measurable function such that the set {fH: ? e ?] is stable. If there exists ? e ???(?) such that for each ? e ? the equality p(fH) — fH holds true, then f is ? ® ?-measurable. 7. Liftings for abstract valued functions Throughout let be given complete probability spaces (?. ?, ?) and a completely regular Hausdorff space T. For any lifting ? e ?(?) a Baire-measurable function f-.?^? induces a Borel-measurable map pj (/) from ? into ??, the Stone-Cech compactification of T, defined via the formula hoPT(f) = p(hof) {0TheCh(T). According to Bellow A980) the map / is called lifting compact if for every ? e ?(?) we have pr(f)(M) e ? for ?-a.a. ? e ?. This implies h о pr(f) = h ° f a.e. (?) for every h e C/,(T), where the null-set involved depends on h e C/,(T). If for lifting compact / the stronger equivalence pr(f) = f ?-a.e. holds true for any ? e ?(?), then / is called strongly lifting compact by Babiker et al. A986), and a completely regular Hausdorff topological space ? is (strongly) lifting compact by definition, if for any complete probability space (?, ?,?) every Baire-measurable map f-? -> ? is (strongly) lifting compact. Lifting compactness of completely regular Hausdorff spaces is a property lying strictly in between strong measure compactness and measure compactness by Bellow A980, 6.1) as well as Edgar and Talagrand A980, Theorem 1). By Bellow A980) lifting compactness has good stability properties.
Liftings 1171 Theorem 7.1. (i) An arbitrary subspace of a compact metrizable space is lifting compact. (ii) Every Baire set in a lifting compact space is lifting compact. (iii) Any continuous bijection between completely regular spaces with Baire measurable inverse transforms a lifting compact space into a lifting compact one. (iv) A countable product of lifting compact spaces is lifting compact. The results (ii) to (iv) are analogs of results proved by Moran A969) for (strongly) measure compact spaces. A Banach space ? under its weak topology is lifting compact if and only if every ?-valued scalarly measurable function is scalarly equivalent to a Bochner measurable function by Bellow A980, Section 6, Remark 2). Every subspace of a compact metric space is strongly lifting compact. Moreover, this holds true for every strongly lifting compact space. The next three theorems are taken from Babiker et al. A986). Theorem 7.2. (i) An arbitrary subset of a strongly lifting compact space is strongly lifting compact. (ii) Countable unions of strongly lifting compact Baire subsets of a completely regular Hausdorff space are strongly lifting compact. (iii) A countable product of strongly lifting compact spaces is strongly lifting compact. (iv) The image of a strongly lifting compact space under a continuous surjection with Baire measurable section is strongly lifting compact. (v) A measure compact space is strongly lifting compact if every point has a strongly measure compact neighborhood. The next result states a relation between strong lifting compactness and the USLP. Its proof relies on Theorem 3.7. THEOREM 7.3. If the map f-.?—* ? is strongly lifting compact and ? is the image measure of ? under f on the ?-algebra В := [В с ?: f~' (?) e ?], then the topological measure space (T, T, B, v) has the USLP, if ? denotes the completely regular Hausdorff topology of T. From the last theorem together with the existence of a compact Radon probability space without strong lifting (see Section 4) follows that neither lifting compactness nor strong measure compactness imply strong lifting compactness, nor does strong lifting compactness imply strong measure compactness, as witnessed by the standard Lebesgue non-measurable subset of [0, 1]. For general metric spaces strong lifting compactness is equivalent with measure compactness, which means in that case that every closed discrete subspace has non-measurable cardinal. It is an open problem, whether the converse of Theorem 7.3 is true, compare Macheras and Strauss A992). The next result gives among other equivalent conditions an answer to the positive for (E, weak), a metrizable locally convex spaces ? under its weak topology. An essential tool for its proof is A. Tortrat's Theorem 8, from Tortrat A975). THEOREM 7.4. For a metrizable locally convex space ? the following conditions are all equivalent.
1172 W. Strauss et al. (i) (E, weak) is strongly lifting compact. (ii) Every Baire probability measure space based on (E, weak) has the USLP. (iii) Every Baire probability measure space based on (E, weak) has the ASLP. (iv) (?, weak) is completion regular and measure compact. (v) Every Baire probability measure ? on (E, weak) is supported by a ?-measurable closed linear subspace of ? which is separable with respect to the metric of E. (vi) (?, weak) is measure compact and every Borel subset of (E, metric) is measurable with respect to any Baire probability measure on (E, weak). (vii) Every scalarly measurable function from a complete probability space into ? agrees a.e. with a Bochner measurable function. (viii) (E, weak) is measure compact and {0} is a Baire subset of ? with respect to (?, weak). (ix) (E, weak) is measure compact and there exists a sequence in ?", the topological conjugate of E, which separates the points of E. (?) (?, weak) is measure compact and there is a continuous linear injection from (E, metric) into RN. (xi) (?, weak) is measure compact and submetrizable. If the locally convex space ? is normable, we may add the following condition. (xii) (E,weak) is measure compact and there is a continuous linear injection from (E,norm) ?????°°(?). It can be seen from the last theorem that within the class of all metrizable locally convex spaces strongly lifting compact spaces can be characterized in a purely topological way. In Strauss A992) the strong lifting compactness of conjugate Banach spaces under their weak* topology has been discussed. In this class the equivalence of the condition (vii) in the last theorem with strong lifting compactness as well as with the conditions corresponding to conditions (ii) respectively (iii) of the last theorem breaks down. There are similar characterizations for strongly lifting compact functions in Babiker et al. A986). The conjugate ??([0, 1]) of C/,([0, 1]) is a non-separable Banach space which is submetrizable under its weak topology. The mild set-theoretic assumption that the continuum is measure compact implies that (??([0, 1]), weak) is measure compact. Therefore by the last theorem ?([0, 1 ]) is an example of a non-separable strongly lifting compact space. In connection with the definition at the beginning of this section one should mention the following construction of a lifting of Banach space valued functions, which is commonly used in the context of differentiation of vector measures and integration of vector functions. If X is a Banach space and X' is the space of continuous functionals on X then for given function /: ? -> X', satisfying (x, f) e L^ip), where by definition (*, /)(?) : = (f(co))(x) for every ? e ? and for every ? e X, we can for each ? e ?(?) define a function ?(/):? -> X' by setting (x,p(f)(w)) := p((x,f})(a>) where ? e X for all ? & ?. Von Weizsacker A978) proved that p(f) is measurable when X' is equipped with its weak* -topology. This function is an essential tool in investigating several aspects of integration. More details and abandon references can be found in Musial B002). In fact we have the following result equivalent to the existence of a lifting (compare A. and С Ionescu Tulcea A962, 1969a) and Kolzow A968)):
Liftings 1173 THEOREM 7.5. For given c.l.d. measure spaces (?, ?,?) the following conditions are all equivalent: (i) There exists a lifting on ?°°(?); (ii) For any normed space X and any bounded linear operator и from L1 (?) into X' (the topological dual space of X) there exists a map f from ? into X' with (x,f) е?°°(м)/ога//л:еХ and\\f\\ ? N1 such that J{x, f)gd^ = (x,u(g')) for all ? e X and all g e ?' (?). 8. Liftings and densities with respect to ideals of sets The notion of measure-theoretical lifting and density is a particular case of more general notions of lifting and density with respect to an ideal of sets. One may call them respectively by ^-lifting and j7-density. Given a measurable space (?, ?) and an ideal J С ? one says that a Boolean homomorphism ?: ? -> ? is an J7-Iifting if p(A) = A for all A e ? and p(A) = p(B) whenever A = В in the sense of J. Similarly j7-density is introduced. As in the case of measure spaces there is also an equivalent formulation for the Boolean algebra ? /J. In fact the original paper of von Neumann and Stone A935) has been written in the language of ideals. In spite of this, the theory of J'-liftings is much less developed than the theory of measure liftings. Here are some known facts. Maharam A977) noticed that the following result follows easily from Graf A973): THEOREM 8.1. If ? is a Baire space (i.e., no non-empty subset of ? isofthe first category in itself or, equivalently, in ?), and ? is the ? -algebra generated by Borel subsets and the ?-ideal С of sets of the first category, then there is a strong lifting on ?/C. For some time there has been an open question whether each Boolean algebra of the form ?IJ which additionally satisfies the countable chain condition has a lifting. Shelah A998) answered it negatively. THEOREM 8.2. There is a cardinal к (к = 2*°++ suffices) a ?-ideal J С V(k) and a ?-algebra ? С V(k) containing J and such that ?/J satisfies the countable chain condition but ?/J has no lifting. The next theorem of von Neumann and Stone A935) is a classical tool for converting a density into a lifting in the situation of this section. Let Ka be a fixed infinite cardinal. Let us remember that a lattice V is called (conditionally) ?-complete for some cardinal к, if every subset W of V having cardinality ^ к (with lower and upper bounds in V) has an infimum and a supremum in V. THEOREM 8.3. Let J be an ideal in a Boolean algebra В which is conditionally к- complete for all к < Ha. Ifcard(B/J) ? Ha, then for any J-density ? there exists a J-lifting ? with <p(B)^p(B)for all В е В.
1174 W. Strauss el ai Von Weizsacker A976, Theorem B.3) shows that the condition card(??/j7) ^ K„ in the last theorem cannot be weakened to card(B/J) ^ ??+?, even if one is interested only in the existence of linear liftings. THEOREM 8.4. Assume sup ? < ?? for all families ? of cardinals with card(T) < Ka and ? < N„/or all ? e ?. Then there exist set algebras Bi, ideals Ji in Bi, Ji-densities ?? for / = 1,2 such that the following conditions hold true. (i) Ji is к-complete for all к < Ka/or i = 1,2. (ii) B\/J\ and Bi/Ji are isomorphic with card(??,/J/) = ??+? for / = 1,2. (iii) There exists no linear lifting p\ :?^(?|) -> C^(B\) satisfying р\(Хв) > ??^?) for all В еВ{. (iv) There exists a linear lifting pi: ?^(??) -> C^(Bi) satisfying рг(Хв) ^ XK(fl)/or ?// ? e Й2, but there is no Ji-lifiing ~p~, for Bi satisfying ?? (В) с ~р2(В) for all BeB2- At last we formulate a few facts concerning liftings on the set of natural numbers. It is well known that if J is the ideal of all finite subsets of N, then there is no J7-Iifting on V(N). To formulate the next result denote by #A the cardinality of А с N. Then, define a density measure ? : V(N) -> [0, 1] by setting for each А с N #(АП{1,2,...,»}) ?(?) := Lim , where Lim is a Banach limit on l^. With this concept, we have the following result of D. Maharam and P. Erd6s (see MaharamA976)): THEOREM 8.5. Let ? be a densit}- measure on N and let J := {А С ?: ?(?) = 0). Then there is no J-lifting on V(H). 9. Beyond Сж(?) As we have mentioned at the end of Section 3 it has been proved already by A. and С Ionescu Tulcea A969a) that there is no lifting on ?''(?) if 1 < ? < oo and ? is non- atomic. It turns out however that each lifting can be extended far beyond ?^(?), as the following result of Monakov-Rogozkin A974) shows THEOREM 9.1. Given ? e ?(?) there exists an ideal K,(p) in the lattice ?'^(?) with the following properties: (i) Cx^)C>C(p). ^ (ii) There is a lifting ? on K,(p) which is an extension of p. (iii) IC(p) is the largest ideal in ?^(?) possessing the above two properties. (iv) The extension of ? onto K,(p) is unique.
Liftings 1175 More precisely, / e K,(p) if and only if f/g e ?^(?) for a function g e ?^(?) such that g ^ 1 and /0(l/g) = 1/g. One can also say that K,(p) = ?(?(?, T)) where ? identifies functions ?-equivalent, ? is one of the lifting topologies and C(i2, T) is the space of T-continuous real-valued functions. Monakov-Rogozkin A974) proves also that if ? e 0(?) \ ?(?) then a counterpart of fC(p) for yo does not exist. 10. Further applications Lifting theory has so many applications in mathematical analysis that it is impossible to give a detailed account for all within this article since this would afford too much additional terminology. But we want to give references now for some of the applications which could not be mentioned as yet. Jirina A959) proved the existence of disintegrations for measures under separability assumptions. The most common procedure taken now is to eliminate separability by an application of liftings, see С lonescu Tulcea A965b), A. and С lonescu Tulcea A969a), Hoffmann-Jorgensen A971), Pellaumail A972), Chatterji A973), Valadier A973), Saint- Pierre A975), Pachl A978), Heller A983), Babiker and Strauss A982), Choksi and Duncan (unpublished), and Rinkewitz A997). In A. and С lonescu Tulcea A969a) and Babiker and Strauss A982), the application of a lifting comes indirectly in via the Dunford Pettis theorem. Heller A983) determined the class of all Baire measures which can be disintegrated with the help of liftings. Corresponding results are given by Rinkewitz A997). We are going to recall two particular applications of liftings to disintegration. Let (?,?) be a measurable space and let ?: ? -> ? be a measurable map (i.e., p~] (T) с ?). ? : ? ? ? -> [0, 1] is a regular conditional probability if (cp 1) Pg is a probability measure on ? for all ? e (9; (cp2) ? -> ??(?) is ? := ? (?)-measurable for all A e ?; (cp3) ? (А П p'[(B)) = fB ??(?)??(?) for allA e ? and all В е Т. The next result was given by Hoffmann-J0rgensen A971). THEOREM 10.1. Let ? be a Hausdorff space, ? is a regular probability on ?(?) (i.e., ? is inner regular with respect to compact sets), (?,?) is a measurable space, ?:?—*? is a measurable function and ? = ?(?). Then there exists a regular conditional probability ? : ?(?) ? ? -> [0, 1] such that for every ? e ? the measure PH is regular. PROOF (Sketch). We assume for the simplicity that ? is compact. If / e C(i2) then we define a ?-continuous finite signed measure vt on ? by setting for every В e ? ?/(?)= ? ???. Jp~hB) Let ?(?, f) be a Radon-Nikodym derivative of vf with respect to v. One easily sees that ?(?, f) is bounded in L°°(v) by ||/||эс· Take an arbitrary ? e ?(?) and set for every
1176 W. Strauss el al. /?C(C) P(-,f)-=p(p(-,f))- It follows from the properties of 17 that ~?(?, ·) is for each ? a positive continuous linear functional on С (i2). Moreover, for every ? and / e C(i2) we have ?@,??) = 1 and |^,/)|«;||/||зс. Hence, for every ? there is a regular probability P# on ?(?) such that for all / e C(i2) P@,/) = f fdPe. The rest of the proof is a matter of calculations. ? The second application of liftings we are going to present here is an application to disintegration of compact measures. We follow the terminology of Pachl A978). Definition 10.2. Let (?, ?) be a measurable space and let F>, T, v) be a probability space. Let к be a probability on ? ® ? such that ?(? ? S) = v(B) for every ? e T. Suppose that for every ? e ? there is a ? -algebra ?? on ? and a probability /># on 2^ such that the following two conditions are satisfied: (a) for each Ae ? there exists N e ??(?) such that A e ?? for all ? e ? \ ? and the function ? \ N э ? -> ??(?) is ?|(<9 \ ^V)-measurable; (b) if A e Г and В е Г then /c(A ? ?)= ??{?)??{?). Jb The collection {(.?^, Ро)}о6б> is called a v-disintegration of к. Here is the main result of Pachl A978). THEOREM 10.3. Let (?,?,?) and (?,?,?) be probability spaces and let к be a probability on ? <g> ? with marginals ? and v. Assume that ? is complete and ? is approximated by an ?-compact lattice К С ? which is closed under countable intersections. Then there is ? ?-disintegration {(??, Рв)}вев °fK suc^ ?^?? ?? D К, and К, approximates Pg for every ?. Basic idea OF the proof. Let ? be a lifting on L^iv). By the Radon-Nikodym theorem, for each A e ? there exists a ?-measurable function hA such that for every В е Г, we have ?(? ? B)= / hAdv. JB
Liftings 1177 For each ? e ? define a function ?# on K, by ??(?) := р(Ик(в)). It needs some work to show that for each ? there exists a probability ?? ???(?,) such that ?? ^ ?? on /C and its completion Pfi gives a ?-algebra ?? satisfying the requirements of the theorem (see Pachl A978) for details). D It should be mentioned here that using the above result J.K. Pachl proved that a restriction of a compact measure (in the sense of Marczewski A953)) to a sub-a-algebra is also a compact measure, thus answering a question of E. Marczewski. For applications to the realization of homomorphisms see A. and С Ionescu Tulcea A969a, Chapter X), Graf A980a), Babiker and Graf A983), Vesterstr0m and Wils A969), Fremlin A989, 4.12 and 4.14), and Fremlin B00?). Strongly connected with the above are applications of liftings to the existence of measurable selections and sections discussed by Edgar A976), Talagrand A978), Losert A980), Graf A980, 1982), and Kupka A983). The results are too technical to be presented here so we mention only two of them, taken from Graf A980a). THEOREM 10.4. Let ? фЪ be a Hausdorff space. Let ? be a non-trivial complete strictly localizable measure which is locally determined and semifinite. Then, let ? : ?(?) —> ?/? be a Boolean ?-homomorphism such that ? ? ? is a Radon measure on ?. Then there is a E-B@)-measurable map f : ? -> ? with f'[(B)e ? (?) for all В e ?(?), i.e., ? is induced by f. Proof (Sketch). If ? e ?(?) then ? ? ? :?(?) -> ? is a homomorphism. Let ?,(?) be the collection of all compact subsets of ?. Then, for each ? let ?,? := {К e ?,(?): ? e ? ? ?(?)}. One can easily see that П/С<„ ? 0. In fact for almost all ? the set ?\?,? is a singleton. Consequently we can define /: ? -> ? by taking /(?) e ?\??. f satisfies the conclusion of the theorem. ? The next result is related to the extension of measures, ? ? 0 is Hausdorff, (?, ?, ?) is a finite measure space and ?: ? -> ? is a measurable map. Denote by ? (?, ?) the collection of all positive measures ? on ?(?) such that ? = p{v). THEOREM 10.5. If ? e ? (?, ?) is Radon, then ? is an extreme point of ?{?, ?) if and only if there exists ? ?-?(?) measurable weak section f : ? —* ? for ? with ? = /(?) (i.e., ?(??/-'/>(?))=0/???//?? ?). The separable modifications of stochastic processes and the domination of measures are treated by A. and С Ionescu Tulcea A969b; 1969a, Chapter VII, Section 7 as well as Chapter IX, Section 7). Here again the lifting eliminates restrictive hypotheses (metrizability, for example) needed previously. For separable measurable modifications of empirical processes and regularizations of stochastic processes see in addition Talagrand A987, 1988), respectively. Schreiber et al. A971), use liftings for the construction of probability measures corresponding to stochastic processes. This construction applies algebraic models for probability spaces introduced before by Dinculeanu and Foias A968). In these models liftings apply as well.
1178 W. Strauss et al. A. and С Ionescu Tulcea (cf. A969a)) applied lifting in order to describe the space of functionals on the space of Bochner integrable functions. Further applications of that type can be found in Dinculeanu A967) and several papers concerning vectorial integration. For applications of the consistent lifting to the Pettis integral see Talagrand A984). С Ionescu Tulcea A965b) applies the strong lifting property and linear liftings for the decomposition of measures into its ergodic parts and in A965) he uses the translation invariant lifting (see Theorem 5.4) to solve a problem on almost stable sets in locally compact groups posed by A.B. Simon. Marechal A969) gives a definition of a measurable field of Hubert spaces over a Hausdorff space based on the almost strong lifting property and shows that isomorphisms between measurable fields can be induced by pointwise isomorphisms. She gives decompositions of operators on Hubert space by means of liftings in Marechal A968, 1969). Von Weizsacker A978) applies liftings to the regularization of functions with Radon image measures and arbitrary completely regular range. Applications to subdifferentials and convex functions are given by Levin A975). Graf A995) gives an application of the lifting to self-similar measures. A similar procedure was taken by Schief in an unpublished note. Another application of the lifting theory is the existence and uniqueness of preimages of a given measure (see, e.g., Edgar A976) as well Graf A980a, 1982)). Also Lipecki A998) used liftings for the extension of measures. References Babiker, A.G. A981), Lifting properties and uniform regularity of Lebesgue measures on topological spaces, Mathematika 28, 198-205. Babiker, A.G. and Graf, S. A983), Homomorphism compact spaces, Canad. J. Math. 35, 476-558. Babiker, A.G. and Knowles, J.D. A978), An example concerning completion regular measures, images of measurable sets and measurable selections, Mathematika 25, 120-126. Babiker, A.G. and Strauss, W. A980a), Measure spaces in which every lifting is an almost ?-lifting, Arab J. Math. 1, 11-21. Babiker, A.G. and Strauss, W. A980b), Almost strong liftings and ?-additivity. Measure Theory Proc. Oberwolfach 1979, D. Kolzow, ed., Lecture Notes in Math., Vol. 754, Springer, Berlin, 221-226. Babiker, A.G. and Strauss, W. A980c), Measure spaces in which every lifting is an almost strong H-lifting, Measure Theory Proc. Oberwolfach 1979, D. Kolzow, ed.. Lecture Notes in Math., Vol. 754, Springer, Berlin, 228-232. Babiker, A.G. and Strauss, W. A982), The pseudostrict topology on function spaces. Rend, lstit. Mat. Univ. Trieste 14, 99-105. Babiker, A.G., Heller, G. and Strauss, W. A984), On a lifting invariance problem, Measure Theory Proc. Oberwolfach 1983, D. Kolzow and D. Maharam-Stone, eds, Lecture Notes in Math., Vol. 1089, Springer, Berlin, 79-85. Babiker, A.G., Heller, G. and Strauss, W. A986), On strong lifting compactness, with applications to topological vector spaces, J. Austral. Math. Soc. Ser. A 41, 211-223. Bellow, A. A980), Lifting compact spaces. Measure Theory Oberwolfach 1979, Proceedings, Lecture Notes in Math., Vol. 794, Springer, Berlin, 233-253. Bhaskara Rao, K.P.S. and Bhaskara Rao, M. A983), Theory of Charges. Academic Press. Bichteler, K. A970), A reduction of the strong lifting problem. Invent. Math. 11, 159-162. Bichteler, K. A971), An existence theorem for strong liftings, J. Math. Anal. Appl. 33, 20-22.
Liftings 1179 Bichteler, К. A972), On the strong lifting property, Illinois J. Math. 16, 370-380. Bichteler, K. A973), A weak existence theorem and weak permanence properties for strong liftings, Manuscripta Math. 8, 1-10. Bliedtner, J. and Loeb, P.A. B000), The optimal differentiation basis and liftings of L*-, Trans. Amer. Math. Soc. 352,4693^4710. Burke, M.R. A993), Liftings and the property of Baire in locally compact groups. Proc. Amer. Math. Soc. 117, 1075-1082. Burke, M.R. A993a), Liftings for Lebesgue measure, Israel Math. Conf. Proc. 6, 119-150. Burke, M.R. A995), Consistent liftings, Unpublished notes of 1995/01/23. Burke, M.R. and Just, W. A991), Liftings for Haar measure on @, 1 }k. Israel J. Math. 73, 33^44. Burke, M.R. and Shelah, S. A992), Linear liftings for non-complete probability spaces, Israel J. Math. 79, 289- 296. Carlson, Т., Frankiewicz, R. and Zbierski, P. A994). Borel liftings of the measure algebra and the failure of the continuum hypothesis, Proc. Amer. Math. Soc. 120, 1247-1250. Carothers, D.C. A990), Order continuous Borel liftings, Rocky Mountain J. Math. 20, 51-57. Carothers, D.C. A992), Liftings into countably complete Banach lattices, Houston J. Math. 18. 467^472. Chatterji, S.D. A973). Disintegration of measures and lifting. Vector and Operator Valued Measures and Applications, Academic Press. Choksi, J.R. and Duncan, R. (unpublished). Disintegration of measures and measure-valued conditional expectation, unpublished and undated notes. Christensen. J.PR. A974). Topology and Borel Structure. Math. Studies. Vol. 10, North-Holland. Amsterdam. Cohn, D.L. A978), Liftings and the construction of stochastic processes. Trans. Amer. Math. Soc. 246. 429^438. Dalgas, K.P A99?), On the existence of strong Borel liftings. Preprint. Dieudonne. J. A948), Sur le theoreme de Lebesgue-Nikodym (III). Ann. Univ. Grenoble 25. 25-53. Dieudonne, J. A951), Sur le theoreme de Lebesgue-Nikodym (IV). J. Indian Math. Soc. 15. 77-86. Dinculeanu, N. A967), Vector Measures, Pergamon Press and VEB Deutscher Verlag der Wissenschaften. Dinculeanu, N. and Foia§, С A968), Algebraic models for measures, Illinois J. Math. 12, 340-351. Donoghue, W.F. A965), On the lifting property, Proc. Amer. Math. Soc. 16, 913-914. Dunford, N. and Schwartz, J.T. A958), Linear Operators Part I, lnterscience, New York. Edgar, G.A. A976), Measurable weak sections, Illinois J. Math. 20, 630-646. Edgar, G.A. and Talagrand, M. A980), Liftings of functions with values in a completely regular space, Proc. Amer. Math. Soc. 78, 345-349. Eifrig, B. A972), Ein nicht-standard Beweis fiir die Existenz eines Liftings, Arch. Math. 23, 425^427. Eifrig, B. A972a), Ein nicht-standard Beweis fiir die Existenz eines starken Liftings in C^ @. 1 ], Contributions to Non-Standard-Analysis, W.A.J. Luxemburg and A. Robinson, eds, North-Holland, Amsterdam, 81-83. Eifrig, B. A975). Ein Nicht-Standard-Beweis fiir die Existenz, eines Liftings, Measure Theory, Proc. Conf., Oberwolfach, 1975, Lecture Notes in Math., Vol. 541, Springer. Berlin, 133-135. Ellis, H.W. and Snow. D.0.A963). On (L1)* for general measure spaces, Canad. Math. Bull. 6,211-230. Erben, W. A983), Topologische und masstheoretische Liftings, Ph.D. thesis. University of Stuttgart. Farah, 1. A998), Completely additive liftings. Bull. Symbolic Logic 4, 37-54. Fatou, P. A906), Series trigonometriques et series de Taylor. Acta Math. 30, 335^400. Fillmore, PA. A966), On topology induced by measure. Proc. Amer. Math. Soc. 17, 854-857. Fremlin, D.H. A974), Topological Riesz Spaces and Measure Theory, Cambridge University Press, Cambridge. Fremlin, D.H. A976), Products of Radon measures: A counter-example, Canad. Math. Bull. 19, 285-289. Fremlin, D.H. A977), On two theorems of Mokobodzki, Note of 1977. Fremlin, D.H. A978), Decomposable measure spaces, Z. Wahrscheinlichkeitsth. verw. Geb. 45, 159-167. Fremlin, D.H. A979), Losert's example. Note of 18/9/79. University of Essex, Mathematics Department. Fremlin, D.H. A980), Consequences of Martin's Axiom, Cambridge University Press, Cambridge. Fremlin, D.H. A983), Stable sets of measurable functions. Note of 17 May 1983. Fremlin, D.H. A989), Measure algebras. Handbook of Boolean Algebras, J.D. Monk and R. Bonnet, eds, Elsevier. Fremlin, D.H. B000a), Measure Theory, Vol. 1, The Irreducible Minimum, published by Torres Fremlin. Fremlin, D.H. B00?), Measure Theory, Vols. 2-5.
1180 W. Strauss et al. Fuglede, B. A971), The quasi topology associated with a countably subadditive set function, Ann. Inst. Fourier (Grenoble) 21, 123-169. Gapaillard, J. A970), Sur un theoreme de Maharam, С R. Acad. Sci. Paris 271A, 39^41. Gapaillard, J. A973), Relevements monotones. Arch. Math. 24, 169-178. Gapaillard, J. A975), Relevements sur une algebre d ensembles. Measure Theory Proc. Oberwolfach 1975, Lecture Notes in Math., Vol. 541, 137-153, Gardner, R.J. A975), The regularity of Borel measures and Borel measure compactness, Proc. London Math. Soc. 30,95-113. Georgiou, P. A973), Liftings and disintegration. Bull. Soc. Math. Grece (N.S.) 14, part A. 56-74. Georgiou, P. A974), A semigroup structure in the space of liftings, Math. Ann. 208, 195-202. Georgiou, P. A980), On "idempotent" liftings. Measure Theory Oberwolfach 1979, Proceedings, Lecture Notes in Math., Vol. 794, Springer, Berlin, 254-260. Gillman, L. A960), A P-space and an extremally disconnected space whose product is not an F-space, Arch. Math. Basel 11, 53-55. Goldman, A. A977), Mesures cylindriques. mesures vectorielles et questions de concentration cylindrique. Pacific! Math. 69, 385^413. Graf, S. A973), Schnitte Boolescher Korrespondenzen und ihre Dualisierungen, Ph.D. thesis, University of Erlangen-Niimberg. Graf, S. A974), On a disintegration theorem of Dorothv Maharam, Note of 1974. Graf, S. A975), On the existence of strong liftings in second countable topological spaces. Pacific J. Math. 58, 419^426. Graf, S. A980), Measurable weak selections. Measure Theory, Proc. Oberwolfach 1979, D. Koelzow, ed., Lecture Notes in Math., Vol. 794, Springer, Berlin, 117-140. Graf, S. A980a), Induced ?-homomorphisms and a parametrization of measurable sections via extremal preimage measures, Math. Ann. 247, 67-80. Graf, S. A982), Selected results on measurable selections, Rend. Circ. Mat. Palermo B) Suppl. 2, 87-122. Graf, S. A995), On Bandt's tangential distribution for self-similar measure, Monatsh. Math. 120, 223-246. Graf, S. and von Weizsacker, H. A976), On the existence of lower densities in noncomplete measure spaces, Measure Theory Proc. Oberwolfach 1975, Lecture Notes in Math., Vol. 541, 155-158, Grekas, S. A985), On the existence of idempotent liftings. Bull. Soc. Math. Grece (N.S.) 26, part A, 47-52. Grekas, S. A987), On the strong lifting property for products, Bull. Soc. Math. Grece (N.S.) 28. part A, 63-70. Grekas, S. A989), On the existence of idempotent liftings, Proc. Amer. Math. Soc. 107, 367-371. Grekas, S. and Gryllakis, C. A991), Completion regular measures on product spaces with application to the existence of Baire strong liftings, Illinois J. Math. 35. 260-268. Grekas, S. and Gryllakis, С A992), Measures on product spaces and the existence of strong Baire liftings. Monatsh. Math. 114, 63-76. Gruenhage, G. A984), Generalized metric spaces. Handbook of Set-Theoretic Topology, K. Kunen and J.E. Vaughan, eds, Elsevier, Amsterdam. Halmos, P. A950), Measure Theory. Van Nostrand, Princeton. Hansel, G. A972), Theoreme de relevement et measures bivalentes, Ann. Inst. Poincare 8,395—401. Hebert, D.J. A973), A general theorem for decomposition of linear random processes, Proc. Amer. Math. Soc. 38, 331-336. Heller, G. A983), Zur Desintegration topologischer Masse, Ph.D. thesis. University of Stuttgart. Hoffmann, K. A965), Banach Spaces of Analytic Functions, 2nd edn, Prentice Hall, Englewood Cliffs, NJ. Hoffmann-j0rgensen, J. A971), Existence of conditional probabilities. Math. Scand. 28, 257-264. Ionescu Tulcea, A. A965), On the lifting property (V0, Ann. Math. Stat. 36, 819-828. Ionescu Tulcea, A. A966a), Sur le relevement fort et la desintegration des mesures, С R. Acad. Sci. Paris 262A, 617-618. Ionescu Tulcea, A. A966b), Sur la domination et la desintegration des mesures. C. R. Acad. Sci. Paris 262A, 1142-1445. Ionescu Tulcea, A. A967a), Liftings compatible with topologies. Bull. Soc. Math. Grece 8, 116-126. Ionescu Tulcea, A. A967b), On the lifting property. Proceedings Symposium in Analysis. Queen's University, Kingston Ontario (June, 1967).
Liftings 1181 Ionescu Tulcea, A. A973), On pointwise convergence, compactness, and equicontinuity in the lifting topology (/), Z. Wahrscheinlichkeitsth. verw. Geb. 26, 197-205. Ionescu Tulcea, A. A974), On pointwise convergence, compactness, and equicontinuity II, Adv. Math. 12, 171- 177. Ionescu Tulcea, A. and Ionescu Tulcea, С A961), On the lifting property (/), J· Math. Anal. Appl. 3, 537-546. Ionescu Tulcea, A. and Ionescu Tulcea, С A962), On the lifting property (//), representation of linear operators on spaces LPE, 1 < ? < oo, J. Math. Mech. 11, 773-769. Ionescu Tulcea, A. and Ionescu Tulcea, С A963), Problems and remarks concerning the lifting property, Technical report, US Army Research Office, Durham, NC. September 1963. Ionescu Tulcea, A. and Ionescu Tulcea, С. A964а), On the lifting property (III). Bull. Amer. Math. Soc. 70, 193-197. Ionescu Tulcea, A. and Ionescu Tulcea, С A964b), On the lifting property (IV). Disintegration of measures, Ann. Inst. Fourier (Grenoble) 14, 445^472. Ionescu Tulcea, A. and Ionescu Tulcea, С A967), On the existence of a lifting commuting with the left translations of an arbitrary locally compact group, Proc. of the Fifth Berkeley Symposium on Mathematics, Statistics, and Probability, Berkeley, California, 1967, Vol. II, 63-67. Ionescu Tulcea, A. and Ionescu Tulcea, C. A969a), Topics in the Theory of Lifting, Springer, Berlin. Ionescu Tulcea, A. and Ionescu Tulcea, С A969b), Liftings for abstract valued functions and separable stochastic processes, Z. Wahrscheinlichkeitsth. verw. Geb. 13, 114-118. Ionescu Tulcea, C. A955), Deux theoremes concernant certains espaces de champs de vecteurs. Bull. Sci. Math. 79, 106-111. Ionescu Tulcea, С A965a), On the lifting property and disintegration of measures. Bull. Amer. Math. Soc. 71, 829-842. (Invited address presented before the Chicage Meeting of the AMS on April 9, 1965.) Ionescu Tulcea, С A965b), Remarks on the lifting property and the desintegration of measures. Technical Report, US Army Research Offices, Durham, NC. Ionescu Tulcea, С A966), Surcertains endomorphismes de L?(Z. ?), С. R. Acad. Sci. Paris 261A, 4961^4963. Ionescu Tulcea, С A971), On liftings and derivation bases. J. Math. Anal. Appl. 35, 449^466. Ionescu Tulcea, С A972), Liftings commuting with translations. Proceedings of the Sixth Berkeley Symposium of Math. Stat, and Probability, Vol. 2, Univ. of California Press, Berkeley. 97-100. Ionescu Tulcea, С and Maher, R.J. A971), A note on almost strong liftings. Ann. Inst. Fourier (Grenoble) 21, 35^41. Jirina, M- A959), On regular conditional probabilities. Czechoslovak Math. J. 9, 445^450. Johnson, R.A. A978), Existence of strong liftings commuting with a compact group of transformations. Pacific J. Math. 76, 69-81. Johnson, R.A. A979), Existence of strong liftings commuting with a compact group of transformations II, Pacific J. Math. 82, 457^461. Johnson, R.A. A980), Strong liftings which are not Borel liftings. Proc. Amer. Math. Soc. 80, 234-236. Kakutani, S. A943), Notes on infinite product measures, II, Proc. Imperial Acad. Tokyo 19, 184-188. Kolzow, D. A968), Differentiation von Massen, Lecture Notes in Math., Vol. 65, Springer, Berlin. Kupka, J. A983), Strong liftings with applications to measurable cross sections in locally compact groups, Israel J. Math. 44, 243-261. Kupka, J. and Prikry, K. A983), Translation invariant Borel liftings hardly ever exist. Indiana Univ. Math. J. 32, 717-731. Kupka, J. and Prikry, K. A984), The measurabilitv of uncountable unions. Amer. Math. Monthly 91 B), 85-97. Lahiri, B.K. and Chakrabarti, S. A991), Density topology, Math. Student 59, 89-108. Lebesgue, H. A905), Recherches sur la convergence des series de Fourier. Math. Ann. 61, 251-287. Lebesgue, H. A910), Sur I' integration des functions discontinues, Ann. Sci. Ecole Norm. Sup. 27, 361^450. Levin, V. A975), Convex integral functions and the theory of lifting, Russian Math. Surveys 30, 119-184 (Uspekhi Mat. Nauk 30). Lipecki, Z. A998), Quasi-measures with finitely or countably many extreme extensions, Manuscripta Math. 97, 469-481. Lloyd, S.P. A974), Two lifting theorems. Proc. Amer. Math. Soc. 42, 128-134. Losert, VL. A979), A measure space without the strong lifting property. Math. Ann. 239, 119-128.
1182 W. Strauss et al. Losert, V.L. A980), A counterexample on measurable selections and strong lifling. Measure Theory Proc. Oberwolfach 1979, D. Kolzow, ed.. Lecture Notes in Math., Vol. 794, Springer, Berlin, 153-159. Losert, V.L. A983), Some remarks on invariant liftings. Measure Theory Proc. Oberwolfach 1981, Lecture Notes in Math., Vol. 1080, Springer, Berlin. Macheras, N.D. A984), Permanenzprobleme fiir induktive Limiten, Dissertation, Universitat Erlangen-Niimberg. Macheras, N.D. A989), On inductive limits of measure spaces and projective limits of LI'-spaces, Mathematika 36, 116-130. Macheras, N.D. A998), On the Lx-spaces of inductive limits of measure spaces, Atti Sem. Mat. Fis. Univ. Modena 44, 513-523. Macheras, N.D., Musiat, K. and Strauss, W. A999), On products of admissible liftings and densities, Z. Anal. Anwendungen 18, 651-667. Macheras, N.D., Musiat, K. and Strauss, W. B000), Linear liftings respecting coordinates. Adv. Math. 153, 403- 416. Macheras, N.D., Musial, K. and Strauss, W. B00?a), On strong liftings on projective limits. Preprint. Macheras, N.D., Musial, K. and Strauss, W. B00?b). A filter approach to product densities and liftings. Manuscript. Macheras, N.D., Musial, K. and Strauss, W. B00?c), On the existence of liftings respecting coordinates. Preprint. Macheras, N.D., Musiat, K. and Strauss, W. B00?d), Linear Fubini liftings. Preprint. Macheras, N.D. and Strauss, W. A992), On various strong lifting properties for topological measure spaces. Rend. Circ. Mat. Palermo B) Suppl. 28, 149-162. Macheras, N.D. and Strauss, W. A993), On the permanence of almost strong liftings, J. Math. Anal. Appl. 174, 566-572. Macheras, N.D. and Strauss, W. A994), On strong liftings for projective limits. Fund. Math. 144, 209-229. Macheras, N.D. and Strauss, W. A995), Products of lower densities, Z. Anal. Anwendungen 14, 25-32. Macheras, N.D. and Strauss, W. A996a), On self-consistent families of almost strong liftings in projective systems, Atti Sem. Mat. Fis. Univ. Modena 44, 105-111. Macheras, N.D. and Strauss, W. A996b), Products and projective limits of almost strong liftings, Atti Sem. Mat. Fis. Univ. Modena 44, 119-133. Macheras, N.D. and Strauss, W. A996c), On products of almost strong liftings, J. Austral. Math. Soc. Ser. A 60, 311-333. Macheras, N.D. and Strauss, W. A996d), The product lifting for arbitrary products of complete probability spaces, Atti Sem. Mat. Fis. Univ. Modena 44, 485^496. Macheras, N.D. and Strauss, W. A999a), On Products of Hvperstonian spaces, Atti Sem. Mat. Fis. Univ. Modena 47, 369-382. Macheras, N.D. and Strauss, W. A999b), On consistent lower densities, Atti Sem. Mat. Fis. Univ. Modena 47. Macheras, N.D. and Strauss, W. B000), Mazur-Orlicz theorem in lifting theon; J. Math. Anal. Appl. 241, 122— 133. Macheras, N.D. and Strauss, W. B00?), On strong product liftings. Preprint. Maharam, D. A958), On a theorem of von Neumann, Proc. Amer. Math. Soc. 9, 987-994. Maharam, D. A976), Finitely additive measures on the integers, Sankhya, Ser. A 38, 44-59. Maharam, D. A977), Category. Boolean algebras and measures. Lecture Notes in Math., Vol. 609, Springer, Berlin, 1243-135. Maher, R.J. A974), Strong liftings and Borel liftings. Adv. Math. 13, 55-72. Maher, R.J. A978), Strong liftings on topological measured spaces. Studies in probability and ergodic theory. Adv. Math. 2, 155-166. Maitland Wright, J.D. A969), A lifting theorem for Boolean ?-algebras. Math. ? 112, 326-334. Marczewski, E. A953), On compact measures. Fund. Math. 40, 113-124. Marechal, O. A968), Operateurs decomposable dans le champs mesurables d'espaces de Hilbert, С R. Acad. Sci. Paris 266A, 710-713. Marechal, O. A968a), Decomposition des ope'rateures dans le champs mesurables d'espaces de Hilbert, C. R. Acad. Sci. Paris 267A, 636-639. Marechal, O. A969), Champs mesurables d'espaces Hilbertiens, Bull. Sci. Math. 93, 113-143. Mauldin, R.D. A978), Some effects of set-theoretical assumptions in measure theory. Adv. Math. 27, 45-62. McMinn, T.J. A975), Commuting and topological densities and liftings. Trans. Amer. Math. Soc. 211, 1-22.
Liftings 1183 Mokobodzki, G. A975), Relevement borelien compatible avec une classe d ensembles ne'gligables. Application a la desintegration des mesures, Seminaire des probabilites IX, 1974/5, Lecture Notes in Math., Vol. 465, Springer, Berlin, 437^*42. Monakov-Rogozkin, A. A974), Spaces admitting a lifting. Works in Mathematics and Physics, Tallin, 5-14 (in Russian). Monakov-Rogozkin, A. A978), The structure of spaces of measurable functions that admit lifting. Acta Comment. Univ. Tartu 448, 12-20 (in Russian). Monakov-Rogozkin, A. A981), Spaces which do not have the linear lifting property. Acta Comment. Univ. Tartu 504, 10-16 (in Russian). Monakov-Rogozkin, A. A991), A description of measure spaces with liftings. Acta Comment. Univ. Tartu 928, 73-88. Moran, W. A969), Measures and mappings on topological spaces, Proc. London Math. Soc. 19, 493-508. Musial, K. A973), Existence of Borel liftings, Colloq. Math. 27. 315-317. Musiat, K. A980), Projective limits of perfect measures. Fund. Math. CX, 163-189. Musial, K. B000), Lifting and some of its applications to the theory of Pettis integral. Rend. Istit. Mat. Univ. Trieste 31, Suppl.l, 1-35. Musial, K. B002), Pettis integral. Handbook on Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 531-586. Musial, K. and Macheras, N.D. B000), Liftings of Pettis integrable functions. Hiroshima Math. J. 30, 205-213. Musiat, K., Strauss, W. and Macheras, N.D. B001), Product liftings and densities with lifting and density invariant sections. Fund. Math, von Neumann, J. A931), Algebraische Reprasentanten der Funkionen "bis auf eine Menge von Masze Nulf, Crelle's J. Math. 165, 109-115. von Neumann, J. and Stone, M.H. A935), 77k· determination of representative elements in the residual classes of a Boolean algebra. Fund. Math. 25, 353-378. Oxtoby, J.C. A971), Measure and Category, Springer, Berlin. Pachl, J.K. A978), Disintegration and compact measures. Math. Scand. 43, 157-168. Pellaumail, J. A970), Sur la derivation d'une mesure vectorielle. Bull. Soc. Math. France 98, 305-318. Pellaumail, J. A972), Application de /'existence d'un relevement a un theoreme sur la desintegration des mesures, Ann. Inst. H. Poincare 8, 211-215. Rinkewitz, W. A997), Zur Existenz. Desintegration und Choquet-Darstellung von Urbildmassen - Ein Zugang iiber Stonesche Darstellungsraume, Ph.D. thesis. Ludwig-Maximilians-Universitat, Miinchen. Rodriguez-Salinas, B. A978), ?-spaces of Suslin and Lusin, The strong •'lifting" property. Rev. R. Acad. Cienc. Exact. Fis. Natur. Madrid 72, 541-557 (Spanish). Rodriguez-Salinas, B. A978a), Radon measures of type ? and measures with ''lifting". Rev. R. Acad. Cienc. Exact. Fis. Natur. Madrid 72, 605-610 (Spanish). Rodriguez-Salinas, B. A979), Suslin and Luzin semi ?-spaces. Strong lifting property. Rev. R. Acad. Cienc. Exact. Fis. Natur. Madrid 73, 33^40 (Spanish). Ross, D.A. A990), Lifting theorems in nonstandard measure theory, Proc. Amer. Math. Soc. 109, 809-822. Ryan, R. A962), The lifting property and direct sums, MRC, Technical Report No. 328, US Army, University of Wisconsin. Ryan, R. A964), Representative sets and direct sums, Proc. Amer. Math. Soc. 15, 387-390. Sapounakis, A. A983), The existence of strong liftings for totally ordered measure spaces. Pacific J. Math. 106, 145-151. Saint-Pierre, J. A975), Desintegration dune mesure поп borne. Ann. Inst. H. Poincare 8. 275-286. Schief, A. A99?), SOSC and OSC are equivalent. Unpublished and undated notes, Ludwig-Maximilians- University, Munich. Schreiber, B.M., Sun, T.C. and Bharucha-Reid. A.T A971), Algebraic models for probability measures associated with stochastic processes. Trans. Amer. Math. Soc. 158, 93-105. Schwartz, L. A973), Radon measures on arbitrary topological spaces and cylindrical measures, Oxford Univ. Press. Segal, I.E. A951), Equivalences of measure spaces, Amer. J. Math. 73. 275-313. Shelah, S. A983), Lifting problem of the measure algebra, Israel J. Math. 45, 90-96. Shelah, S. A989), Baire irresolvable spaces and lifting for layered ideal. Topology Appl. 33, 217-221. Shelah, S. A998), Lifting problem with the full ideal, J. Appl. Anal. 4, 1-17.
1184 W. Strauss et al. Sion, M. A973), A Theory of Semigroup Valued Measures, Lecture Notes in Math., Vol. 355, Springer, Berlin. Strauss, W. A971), Funktionalanalytische Fassungdes Satzes von Radon-Nikodvm I, J. Reine Angew. Math. 249, 92-132. Strauss, W. A974), Die Obstruktion zur strengen Lokalisierbarkeit eines Maszraumes, Manuscripta Math. 12, 1-10. Strauss, W. A975), Retraction numbers, liftings and the decomposability of a measure space. Bull. Acad. Polon. Sci. Ser. Math. Astronom. Phys. 23, 27-33. Strauss, W. A992), On strong lifting compactness for the weak* topology. Rocky Mountain J. Math. 22, 1057— 1081. Talagrand, M. A978), Non existence de certaines sections et applications ? la theorie du relevement, C. R. Acad. Sci. Paris 286A, 1183-1185. Talagrand, M. A978a), En general il n'existe pas de relevement line'aire borelien fort, C. R. Acad. Sci. Paris 287A, 633-634 Talagrand, M. A981), Non existence de relevement pour certaines mesures finiement additives et retracte de /8N, Math. Ann. 256, 63-66. Talagrand, M. A982), La pathologic des relevements invariants, Proc. Amer. Math. Soc. 84, 379-382. Talagrand, M. A984), Pettis Integral and Measure Theory, Mem. Amer. Math. Soc., No. 307. Talagrand, M. A987), Measurability problems for empirical processes, Ann. Probab. 15, 204—212. Talagrand, M. A988), On Liftings and the Regularization of Stochastic Processes, Probab. Theory Related Fields 78, 127-134. Talagrand, M. A989), Closed convex hull of set of measurable functions. Riemann-measurable functions and measurability of translations, Ann. Inst. Fourier (Grenoble) 32, 39-69. Tortrat, A. A975), Prolongements ?-reguliers, applications aux probabilite's Gaussiennes, Symposia Mathemat- ica21 (convegno sulle Misure su Gruppe e su spazi Vettoriali ), INDAM, Rome 1975, 117-138. Traynor, T. A974), An elementary proof of lifting theorem. Pacific J. Math. 53, 267-272. Valadier, M. A973), Desintegration d'une mesure sur un produit, С R. Acad. Sci. Paris 276A, 33-35. Vesterstr0m, J. and Wils, J. A969), On point realizations of L^-endomorphisms, Math. Scand. 25, 178-180. Volcic, A. A982), Liftings and Daniel! integral. Measure Theory Oberwolfach 1981, Proceedings, Lecture Notes in Math., Vol. 945, Springer, Berlin, 180-186. von Weizsacker, H. A976), Some negative results in the theory of liftings. Measure Theory, Proc. Conf., Oberwolfach, 1975, Lecture Notes in Math., Vol. 541, Springer, Berlin, 159-172. von Weizsacker, H. A977), Eine notwendige Bedingung fur die Existenz invarianter mass-theoretischer Liftings, Arch. Math. 28,91-97. von Weizsacker, H. A978), Strong measurability. liftings and the Choquet-Edgar theorem. Vector Space Measures and Applications II, Proc. Conf. Univ. Dublin, 1977, Lecture Notes in Math., Vol. 645, Springer, Berlin, von Weizsacker, H. A982), The nonexistence of "liftings" for arithmetic density, Proc. Amer. Math. Soc. 86, 692-693. Zygmund, A. A968), Fourier Series, Vol. I, 2nd edn, Cambridge Univ. Press, Cambridge.
CHAPTER 29 Ergodic Theory Frank Blume Department of Mathematics, John Brown University, Siloam Springs. AR 72761, USA E-mail: fblume @acc.jbu. edu Contents Introduction 1187 1. Basic examples 1189 2. Ergodic theorems 1192 3. Ergodicity 1196 4. Recurrence 1200 5. Mixing 1203 6. More about convergence in ergodic theory 1210 7. Entropy and information 1214 8. Constructions in ergodic theory 1226 References 1231 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1185
This page intentionally left blank
Ergodic theory 1187 Introduction Historically, ergodic theory emerged from the study of dynamical systems which in turn originated in Newton's formulation of the laws of motion and the universal law of gravitation. In his theory Newton described a dynamical system consisting of ? point masses by a set of differential equations that completely determined the motion of the point masses given certain initial conditions, However, this approach to the study of dynamical systems was beset with grave difficulties, because even in the case of a three-body system an explicit solution to the equations of motion was impossible to obtain. Facing this dilemma, Henri Poincare initiated at the end of the 19th century a qualitative analysis of dynamical systems which shifted the focus away from differential equations to global geometric properties of the phase space and transformations inherent in the system. His famous recurrence theorem (Theorem 4.1) which establishes the arbitrarily close future return of a system to states that existed in the past can be regarded as the prototype of an ergodic-theoretic argument (see Furstenberg A981)). The modern formulation of ergodic theory dates back to the theorems of von Neumann and Birkhoff (Theorems 2.2, 2.4 and 3.2) concerning the equality of time and space means of physical quantities in dynamical systems. This question, known as the ergodic hypothesis, was first introduced by Boltzmann in connection with the theoretical foundations of statistical mechanics. For intuitive reasons, we will frequently make references to physical interpretations of ergodic-theoretic results (see Petersen A996)), but it also needs to be mentioned, that following the work of von Neumann and Birkhoff, ergodic theory soon developed a life of its own and lost to a certain extent its ties to physics and in particular to statistical mechanics. On the one hand the assumption of ergodicity (Definition 3.1) used to demonstrate the equality of time and space means was difficult to verify even for simple systems such as ideal gases, and on the other hand the discovery of rich connections to other fields such as probability (Section 1), information theory (Section 7) and number theory (Example 3.4 and Section 4) generated interest in ergodic theory as an independent area of research. The theory of dynamical systems is divided into three major branches: differentiable dynamics - the study of diffeomorphisms on differentiable manifolds, topological dynamics - the study of homeomorphisms on topological spaces and ergodic theory which is best described as the study of statistical properties of measure-preserving systems. Here we say that (?, ?, ?, ?) is a measure-preserving system, if (?, ?, ?) is a complete probability space and ?: X -> X is a measure-preserving transformation in the sense that ? satisfies the following conditions: (a) T is bijective, (b) ТА, T-'AeSforall A e Sand (c) ?(?~[?) = ?(?) for all AeB. The assumption of bijectivity is not always included in the definition of measure-preserving transformations, but it is convenient and does not represent a major restriction. For a given measure-preserving transformation, we may define a group action of the integers on X via the map (n, x) н> T"x where ? e ? and ? e X. It is of considerable interest to consider more general group or semigroup actions on X, and in particular the case of measure-preserving flows, that is actions of R on X, is very important, but for the
1188 F. Blu me sake of notational and conceptual coherence we will restrict ourselves in this exposition to the main case that the group acting on X is Z. In modeling dynamical systems via measure-preserving transformations on a probability space, we think of each point ? in the state space X as representing perfect knowledge about the system at the level of abstraction that is considered relevant. For example if we consider a system to be completely described by the positions and momenta of its parts, then ? represents perfect knowledge of these two properties - position and momentum. Measure theory naturally enters into the picture as we observe that for practical and also principal reasons (uncertainty in quantum mechanics), perfect knowledge of a system's state is unattainable so that instead of identifying single points ? e X we may only assume knowledge of a set of states A e В of which ? is a member (see Petersen A996)). Given this measure-theoretic approach, it is natural to represent measurable, that is observable, physical quantities by measurable functions /: X -> К (or C). In order to explore the evolution of the system over (discrete) time, we examine the values of a measurable quantity / along the orbit of a state ? e X under the action of T, that is we examine the sequence [f(Tkx)}^=Q. In such a mathematical model the ergodic hypothesis, as introduced by Boltzmann, is equivalent to the assertion that the time mean exists and is equal to the space mean fx f ?? for almost all ? e X with respect to ? (Theorem 3.2). In other words, the claim is here that the space mean of an observable quantity can be derived almost surely from discrete measurements along the time evolution of a single state ? e X. This idea of deriving global information from a single state is very important and will resurface in Section 7 in Theorems 7.23 and 7.31 (see also the concluding remark in Section 6). Besides the quantitative problem of establishing the existence of longterm averages (in the mean or pointwise along orbits under the action of T) we may also study on a more qualitative level the recurrence behavior of sets. The earliest result in this context was the above mentioned theorem by Poincare which, in its simplest form, states that for any set A & В with ? (A) > 0 do not only there exists an ? e N such that ?(? П T~n A) > 0. Recurrence statements of this type do not only have profound philosophical implications for our understanding of dynamical systems, but also provide an important link to the field of combinatorial number theory. Several major results are listed in Section 4. Another important problem in ergodic theory is the classification problem. We say that two measure-preserving systems (?, ?, ?. ?) and (?, С, ?>, S) are (metrically) isomorphic if there are sets Xo С X and YqGY and a map ?: X \ Xo -> ? \Yo such that (a) ? is bijective, (b) ? and ?~] are measurable, (c) м(Х0) = у(У0)=0, (d) ??(?) = ??(?) for all ? e X \ X0 and (e) ?(?~[ A) = v(A) for all measurable sets А с ? \ Yo- The problem of distinguishing measure-preserving systems at the level of metric isomorphisms remained essentially unsolved until Kolmogorov A958) introduced entropy as an
Ergodic theory 1189 isomorphism invariant and thereby created an entirely new field of research. The entropy of a system is a single number (invariant under isomorphisms) that in a sense measures the average degree of randomness in the orbits of points under the action of ? (Section 7). Prior to the work of Kolmogorov, measure-preserving systems could only be classified at the level of spectral properties, which can be defined in sole reference to the induced unitary operator UT:L2(X,B,v)-> ???,?,?), / н> / о Г, and do not take fully into account the action of ? on individual points. We say that two measure-preserving systems (?,?, ?. ?) and (?, С, ?, S) are spectrally isomorphic if there exists a unitary operator V : L2(X,B, ?) -> L2(Y,C, v) such that VUT = UsV. Examples of dynamical properties that serve as spectral invariants are ergodicity (Section 3) and mixing properties (Section 5). These invariants are completely characterized by Uj and are intuitively best described as quantitative recurrence properties. For example, a measure- preserving system (?, ?, ?, ?) is strongly mixing if and only if for all A e В we have Hindoo ?(? П Т~"А) = ?(?J (Theorem 5.4). We see that here the general qualitative statement of the Poincare recurrence theorem is replaced by the stronger quantitative requirement that A and T~" A are approximately independent for all sufficiently large n. Finally, we also need to mention the representation problem as an important question in ergodic theory. We may for example ask which measure-preserving transformations can be represented by (that is, are isomorphic to) diffeomorphisms on differentiable manifolds or homeomorphisms on topological spaces. One such result is stated in Section 3 (Theorem 3.10), but for a more detailed discussion of this and many other important subjects the reader is referred to standard expositions of ergodic theory and dynamical systems such as Petersen A983), Cornfeld et al. A982), Friedman A970), Halmos and von Neumann A958), Katok and Hasseblatt A995) and Rudolph A990). The very well researched account in Petersen A983) served as the main reference for the standard results and examples of ergodic theory in the present exposition. 1. Basic examples In this section we list a number of the most important examples of measure-preserving systems. Several of these highlight the close relationship between ergodic theory and probability. Stationary random processes. Assume that С is a ? -algebra of subsets of a given set Y. For ? := Yz we denote by Bo the ?-algebra of subsets of ? generated by cylinder sets of the form [y = {y,>}„ez e ?: y,u eB\,..., y„k e Bk}, where к e ?, ? ?,..., щ е ? and B\, ?* е С. For a probability measure ? on Bo, we denote by В the completion of Bo with respect to ?. The triple (?,?, ?) is said to be a
1190 F. Blume {discrete time) random process with state space (Y, C). If the random process (?, ?, ?) is stationary in the sense that for all m e ? and all cylinder sets as above we have ?({?<??: у,,, еВи...,у,ч е Вк}) = м({у е ?: y„l+„, e B\,..., Ущ+т е ?*}), then the left shift ? :? ->¦ ? defined via the equation ipy)n -=Уи+\ is easily seen to be measure-preserving with respect to ? (because ? preserves the measure of cylinder sets). Therefore, (?,?, ?,?) is a measure-preserving system. Conversely, every measure-preserving system gives rise to stationary random processes via finite or countable partitions. To see this, let us assume that (?, ?, ?. ?) is a measure-preserving system and or = {Av: у e Y] is a finite or countable measurable partition of X, in the sense that the index set ? is finite or countable, the sets in a are pairwise disjoint and measurable and Uvey Ay = X. For ? := Yz we define a map ?: X -> ? via the equivalence (фх)„ = у <S> ?"? e Av. Denoting by Co the ? -algebra generated by cylinder sets of the form {y = {yn}nez e ?: y„,=z\,---,y,,k = Zk} with zi, ¦ ¦ ¦, ?* e У, the map 0 is clearly measurable with respect to Co and В (i.e., 0~'Co С В), and the equation v(A) := ?@-1 A) for A e Co defines a probability measure on Co- If С is the completion of Co with respect to v, then the fact that Г is measure-preserving implies that (?, C, v) is a stationary random process. Bernoulli shifts. A Bernoulli shift is a special case of a measure-preserving system generated by a stationary random process. Here the state space (Y, C) consists of a finite set ? = {1,..., m), and the ?-algebra С consists simply of all subsets of Y. The set ? is frequently also referred to as a finite alphabet. In order to define a measure ? on the ?-algebra Bo generated by the cylinder sets in ? = ?", we assume that p\,..., p,„ are strictly positive real numbers such that ?'"=, ?,¦ = 1. Then for any elements г ?,..., ц е ? we define и{{у={Уп}пе%?Я: y„t =i\ y„k = /*}) = p,, ¦ ¦ ¦ Av Extending ? to a probability measure on the completion В of Bo with respect to ?, the system (?, ?, ?, ?) is measure-preserving where, as above, ? is the left shift on ?. Markov shifts. Markov shifts form another class of measure-preserving systems generated by stationary random processes. The finite state space (Y, C) and the ?-algebra Bo of
Ergodic theory 1191 subsets of ? are given as in the case of Bernoulli shifts. To define the measure ?, we assume that A = (fl,y) is an m ? m stochastic matrix (i.e., a,y ^ 0 for all i, j, and the sum of the components in each row of A is equal to one). Then according to the Perron-Frobenius theorem (see Varga A962)) there exists a row probability vector ? = {p\,..., p,„) (i.e., Pi ^ 0 for all / and ??=, ?, = 1) such that pA = p. Given the vector p, we define ?({???: уП[ = i\,...,y„k = ik}) = р^щ^2- ¦ ¦ aik_{ik. Extending ? again to a probability measure on the completion В of Bo, it follows that (?, ?, ?, ?) is a measure-preserving system. Gaussian systems. The importance of Gaussian measure-preserving systems is due to the fact that Gaussian random processes are frequently encountered in probability theory and also in statistical mechanics and other branches of the physical sciences. We set ? := Rz and denote by С the ?-algebra of Borel subsets of R so that the state space is (R,C). A measure ? on the ?-algebra Bo generated by the cylinder sets in ? is said to be a Gauss measure (see Shirayayev A984)) if for any integers n\ < ¦ ¦ ¦ < nk there is a row vector (m|,..., mk) e K* and a symmetric positive-definite к ? к matrix В = {b\j) with inverse B~' = A = (ajj) such that for any Borel set ? с R* we have ?({?&?: (уП1,...,у„к)еЕ}) VdetA / eXP\ 2 ? a'j(x' ~ m'^xJ 2? Extending ? to a probability measure on the completion В of Bo, the Gaussian random process (?, ?, ?) is known to be stationary (see Cornfeld et al. A982)) if for all /, j eZ the projections ?,¦: ? -> R and ?;¦: ? -> R onto the /th and ;'th factors of ? respectively (i.e., ni(y) := yi and тгДу) := у,) satisfy the following conditions: (a) fn ?,- ??? = /? ???^? ='. w and (b) fn (тг, — m)(jij — m)d[i depends only on the difference i — j. In this case (?,?, ?,?) is a Gaussian measure-preserving system, and the vector and matrix components ш, and bjj are given by the equations w, = m and biy = /? (?,,, —m) ? (Knj -™)??. Rotations on the circle and on compact Abelian groups. For a e R we define a map 7^ : [0, 1 [ -> [0, 1 [ via the equation Га;с := ? + a mod 1 = ? + ? — [? + ?],
1192 F. Blume where we denote by [t] as usual the greatest integer function, that is [t] = max{n e Z: ? ^ t]. Identifying [0, 1[ with the unit circle К in the complex plane, Ta can also be represented by the equation ? е2л'х '= g27r'<-v+0?' Since Ta obviously preserves Lebesgue measure ? on the ?-algebra В of Lebesgue measurable subsets of [0, 1 [, we see that ([0, 1 [, ?, ?, ??) is a measure-preserving system. In the more general situation that G is a compact Abelian group and g e G, the map Tg:G -> G, Гял· := gjt is measure-preserving with respect to Haar measure on G, because Haar measure is translation invariant. Translations on the torus. As a generalization of rotations on the circle and as another special case of a rotation on a compact Abelian group, we define a map Ta : [0, l[m -> [0, l['n on the m-dimensional torus via the equation Tax := (jc? +ot\ mod 1,.. .,x„, +a„, mod 1), where a = (a\,... ,am) e R'". Then Ta is a measure-preserving transformation with respect to Lebesgue product measure on [0, 1 ['". Finite interval exchanges. Assume that I\,..., /,„ are pairwise disjoint half open subintervals of [0, 1 [ whose union is [0, 1 [. A map ?: [0, 1 [ -> [0, 1 [ is said to be a finite interval exchange transformation if there are real numbers a\,...,a„, such that (a) Tx = ? + аь for all ? e h and (b) TIjHTIj =0foralliV;. Given these conditions, it is clear that ? is bijective and measure-preserving with respect to Lebesgue measure on [0, 1 [. We also note that a rotation Ta on the circle is a special case of a finite interval exchange with m = 2, I\ = [0, 1 - [<*][, h = [1 - [a], 1[, a\ = [a] and «2 = [a] — 1. 2. Ergodic theorems In this section we give an overview of the principal convergence theorems of ergodic theory with particular emphasis on von Neumann's mean ergodic theorem and Birkhoff's pointwise ergodic theorem. The proof of von Neumann's theorem follows the exposition in Parry A981). THEOREM 2.1 (Mean ergodic theorem in Hilbert space). Assume that ? is a Hilbert space, U : ? -> Ha unitary operator and let ? := {/ e H. Uf = /}. If ? : ? -> ? is the projection of ? onto M, then lim n—>oc I" Y,Ukf-Pf ? k=0 = 0.
Ergodic theory 1193 PROOF. Let N := {Uf - / | / e #}. If g A. N then (i//, g) = (/, g) for all / e #. This shows that (t/-'g - g) J. / for all / e #. Therefore, i/"'g = g = Ug and ~N С М. Since t// = / implies (/, Ug - g) = 0 for all g e Я, we also have ? с N . Hence H = M®~N. Furthermore, if g:= f- Pf, then g e N and therefore, j-t ?"?? Ukg -* 0 andiE"lii/V^JP/. ? THEOREM 2.2 (Mean ergodic theorem (von Neumann, 1932a)). Let (?,?,?, ?) be a measure-preserving system and assume that f 6 L-(X, ?, ?). Then we have lim 11—»OC /i-l ^foTk-E(f\M) k=0 = 0, where ? = {A e ? \ ? ? = ? а.е.), and E{f\M) is the conditional expectation of f with respect to Л4. PROOF. The mean ergodic theorem is an immediate consequence of Theorem 2.1 applied to ? := L2(X, ?, ?) and the induced operator Uj, where Urf = f°T. D Remark 2.3. The mean ergodic theorem in Hubert space is also valid for linear contractions, that is linear operators U: ? -> ? for which \\Uf\\ ^ ||/|| for all / e ? (see Petersen A983)). THEOREM 2.4 (Ergodic theorem (Birkhoff, 1931)). preserving system and let f e ? (?, ?, ?). Then Let (?,?,?,?) be a measure- /i-l 1 lim -Yf(Tkx) = E(f\M)(x) a.e. k=0 and lim и—>oo /i-l -У>оГ*-?(/|.М) k=0 = 0. The question of whether Birkhoff's pointwise approach or von Neumann's mean convergence approach is more adequate for modeling physical reality has been the subject of considerable philosophical debate. At first sight it appears that taking pointwise averages along the orbit of a single state is a true reflection of scientific practice because each value f(Tkx) represents the outcome of a specific measurement. On the other hand, according to quantum theory each measurement changes the state of the system, so that pointwise convergence statements may not only be unnecessary, but actually impossible to match in the laboratory. Even if quantum effects are not taken into account, the measuring process itself (such as the reading of a needle) may represent an average due to the inertia of the
1194 F. Blume measuring device so that von Neumann's mean convergence approach may indeed be the more plausible one. (For further discussion along these lines see Petersen A996).) Of central importance for the proof of the ergodic theorem is the so-called maximal function: I" /*(*):= sup-?/(Г*дг) %=0 The following theorem is due to Wiener A939) and Yosida and Kakutani A939). THEOREM 2.5 (Maximal ergodic theorem). If (?,?,?,?) is a measure-preserving system and f e L\X,B, ?), then f fa-?^?. J[f*>0] COROLLARY 2.6. For all a eR we have fd?>a?({f*>a}). L PROOF. Apply Theorem 2.5 tog := f-a. ? To see how the maximal function is instrumental in the proof of the ergodic theorem, we will now show how pointwise convergence of the ergodic averages follows from the maximal ergodic theorem. We follow the exposition in Petersen A983) where the reader may also find a complete proof of the ergodic theorem. Proof of pointwise convergence in Theorem 2.4. For ?, ? e ? with a < ?, we define ? 1 ""' 1 ""' ) ???:= \xe X: liminf- У" f(Tkx) < ? <? < limsup- Y^ f(Tkx) \. I "^°° " to "— л ыо J It is sufficient to show that ?(???) = 0, because then the union over all rational a and ? has measure zero as well, so that the limit exists a.e. Since ??.? is a ?-invariant subset of {/* > ?], we may apply Corollary 2.6 to the restriction of ? to ??.? to conclude that f f??> ??(??.?). J ??.? Applying essentially the same argument to — / it is not difficult to see that fd?^a?(Ea.?), JEa,
Ergodic theory 1195 and therefore. ??(??.?)!? /?????(??.?). J??.? Since a < ?, this is possible only if ?(??.?) = 0. ? Just as von Neumann's mean ergodic theorem, the maximal ergodic theorem can be derived from a more general operator version: A map U: L1 (?, ?, ?) -> L1 (?, ?, ?) is called positive contraction if (a) ||?//||? ? \\f\U for all f ? L^X.B, ?), (b) ?/6?.'(?.?.?) / > ° a-e- => i// > 0 a.e. Theorem 2.7 (Maximal ergodic theorem for operators (Hopf, 1954)). IfU is ? positive contraction on L1 (?, ?, ?) and /ei,1 (?, ?, ?), then ? ???^?, Jlf*>0] where f*(x) := sup„6N I Щ !/*/(*)· A very simple proof of this theorem was given by Garsia A965). Using the maximal inequality for operators, it is possible to prove the following very general convergence theorem due to Chacon and Ornstein A960). THEOREM 2.8 (Chacon-Ornstein theorem). If {?,?,?) is ? ?-finite measure space, 0 ^ /, g e V (?, ?, ?) and U: L' (?, ?, ?) -> L' (?, ?, ?) is a positive contraction, then ?-1 /?-1 ?"*//?"** *=0 *=0 converges to a finite limit a.e. on {x e ?: ??=0 Uk g{x) > 0}. Another important strengthening of the ergodic theorem is due to Kingman A968). We say that a sequence {F„ }„6pj of measurable real-valued functions on X is subadditive with respect to a measure-preserving transformation ? if Fm+n(x) iC F„(x) + F,„(T"x) a.e. for all n, m e N. THEOREM 2.9 (Subadditive ergodic theorem). Assume that (?, ?,?, ?) is a measure- preserving system. If{ F„ }„6n is a subadditive sequence ofintegrable real-valued functions on X such that inf - [ F„ ? eN П Jx ?? > — oo,
1196 F. Blume then there exists a T-invariantfunction F e V (?, ?, ?) such that F„(x) lim = F(x) a.e. a—>oo ? and lim 77^00 ? = 0. For more about subadditive ergodic theory see Kingman A973). Wiener considered the problem of finding a criterion for the ergodic averages 7, ?2"=? f ° Tk t0 ^ dominated by an integrable function. To formulate his result, we define log+W := 0 if;te@, l[, log(jc) if ? e [1, 00 [. Theorem 2.10 (Dominated ergodic theorem (Wiener, 1939)). Let (?,?,?,?) be a measure-preserving system. Iff \f\ log+ |/| ?? < oo, then sup 716N 71-1 k=0 eLl(X,B,p). Remark 2.11. Petersen A979) extended this result to ergodic measure-preserving flows and showed that the condition / |/|log+|/|??? < oo is not only sufficient, but also necessary if / ^ 0. 3. Ergodicity Historically, the notion of ergodicity was introduced by Ludwig Boltzmann in connection with the foundations of statistical mechanics. Here it was desirable that the time mean of a physical quantity is equal to its space mean (see Theorem 3.2(d)). Later, the role of ergodicity for the description of systems in statistical mechanics such as ideal gases proved to be less significant than the presence of a large number of particles (atoms or molecules). However, from a purely mathematical point of view the importance of the concept of ergodicity cannot be overestimated, because it serves as a central assumption in the formulation of a wide variety of theorems and plays an important role as a spectral invariant. Definition 3.1. A measure-preserving system (?, ?, ?, ?) is said to be ergodic if all measurable ?-invariant subsets of X have measure 0 or 1. THEOREM 3.2 (Equivalent characterizations of ergodicity). We assume that (?, ?, ?, ?) is a measure-preserving system. Then the following statements are equivalent:
Ergodic theory 1197 (a) (?, ?, ?, ?) is ergodic. (b) Every measurable ? -invariant function (i.e. f = f ? ? a. e.) is constant a.e. (c) The induced transformation Ut '¦ L2(X, ?, ?) -> L2(X, ?, ?) has a one-dimensional eigenspace corresponding to the eigenvalue 1. (d) For each f ??,'(?,?,?) we have 1 ""' Г lim -Yf(Tkx)= / ??? a.e. jt=0 J ? (e) For all А, В е В we have 1 ""' lim -??(?~????\?)=?(?)?{?). и—» ос п *—* к=0 (f) For ?// f,geL2(X, ?, ?) we have 1 ""' lim -Y\№f,g)=(f,D(i,g). *=0 (g) For a// / e L2(X, ?, ?) with (f,l)=0we have 1 ""' lim -?(?/?/,/) = 0. PROOF of (a) =>· (b) =>· (d) =>· (a). If (?,?,?, ?) is ergodic and / is ?-invariant, then Er := {x e X: f(x) > r} is ?-invariant for all rel. Thus ?(??) = 0 or 1. This proves that / is constant a.e., because otherwise there would exist anreR with 0 < ?(??) < 1. The implication (b) =>· (d) follows from Theorem 2.4 by observing that E(f\M) is ?- invariant and fx f ?? = fx ?(/\?)??. Finally, to prove ((d) =>· (a)), we assume that ? e Bis ?-invariant. Then ? ?"?? XE(Tkx) = xe for all ? e N and therefore, d) implies that xe is constant a.e. This proves that ?(?) = 0 or 1. ? For a complete proof of Theorem 3.2 see Petersen A983) and Parry A981). Remark 3.3. In analogy to Theorem 3.2(d), the limit in the Chacon-Ornstein theorem can be shown to be a constant if we impose on a positive contraction the additional assumption of conservative ergodicity. If a positive contraction U:L](X,B,?) -> L1 (?, ?, ?) is conservative ergodic in the sense that for all 0 ^ / e L1 (?, ?, ?) with ?({/ > 0}) > 0 we have oo ^Ukf(x) = oo a.e..
1198 F. Blume then the limiting function in the Chacon-Ornstein theorem has the constant value Examples 3.4. (a) Bernoulli shifts. For Bernoulli shifts condition (e) of Theorem 3.2 is easily verified for cylinder sets, and using standard approximation arguments it can be shown to hold for all measurable sets A and B. Thus, Bernoulli shifts are ergodic. (b) Markov shifts. A stochastic matrix A is said to be irreducible, if for every pair of indices (/, j) there exists an ? e N such that the element at position (/,j) in A" is strictly positive. If (?, ?, ?, ?) is a Markov shift on a finite alphabet, then (?, ?, ?, ?) is ergodic if and only if the stochastic matrix associated with A is irreducible. For a proof see Petersen A983). (c) Rank one transformations. Assume that ?: [0, 1] -> [0, 1] is a measure-preserving transformation with respect to Lebesgue measure ? on [0, 1]. We say that ? has rank one if for every ? > 0 and for every NeN there exist pairwise disjoint intervals I\,..., /„ for some ? > N such that ?(/| U ¦ ¦ ¦ U /„) > 1 - ? and ?G7*?/*+|) = 0 for 1 ^ к ^ ? - 1. Using the Lebesgue density theorem (see Rudin A987)) it is not difficult to see that a rank one system is ergodic. For more information concerning rank one systems see Ferenczi A990). (d) Irrational rotations on the circle. If Tax := ? + amodl is an irrational rotation on [0, 1[ (i.e., ? e R \ Q), then the measure-preserving system ([0, 1[, ?, ?, ??) (where ? denotes Lebesgue measure) is ergodic. To prove this statement we assume that / e L2([0, 1[) is ?-invariant with Fourier expansion f(x) = Y^L-^ а„е2л'"х. Then we have 27Г//1Л ? ane2*""=f(x) = f(Tx)=f(x+a)= ? а„е2™юе and therefore, a„ = а„е2л'"а for all ? e Z. This shows that a„ = 0 for all ? ? 0, because the irrationality of ? implies that ?2?'"? ? 1 for ? ? 0. Hence / = ?? a.e., and with / = ? a for some ?-invariant set A e B, the result follows directly from Definition 3.1. Using the ergodicity of Ta it is not difficult to prove Hermann Weyl's A910) famous equidistribution theorem. If / is a subinterval of [0, 1 [, then 1'·?? . ,. . card({0^ к ^n - 1: ka mod 1 e /}) л^оо ? ?—' п-»зс п Weyl A916) improved this result using Fourier analysis to show that the equation above is still valid if ?/ is replaced by a Riemann integrable function. Due to this early work of Weyl and more recently, the work of Bourgain( 1988a, 1988b, 1989), a very active field of research has developed at the interface of ergodic theory and Fourier analysis. A few results are listed in Section 6, but for more information on this subject and further references the reader is referred to Petersen and Salama A995). (e) Translations of the torus. If a = (a \,..., <*,„) e E'" and Ta is a translation on the torus [0, 1 ['" (see Section 1), then Ta is ergodic if and only if a is rationally independent in the sense that VkeZ'"k -a e ? ? ?: = 0 (for a proof see Petersen A983)).
Ergodic theory 1199 Another important question concerns the existence and uniqueness of ergodic measures for homeomorphisms on compact metric spaces. Dehnition 3.5. We say that a system (?,?) is a cascade if X is a compact metric space and ?: X -> X is a homeomorphism. (X, T) is said to be uniquely ergodic if there is exactly one ?-invariant Borel probability measure on X. THEOREM 3.6. If ? is the unique ?-invariant Borel probability measure of a uniquely ergodic cascade (X, T), then (?, ?, ?, ?) is ergodic. PROOF. For any set A e В we define Мл(б) := ?(?? ?)/?(?). If there were a T- invariant set A e В with 0 < ?(?) < 1, then ?^ and ??\? would be distinct ?-invariant measures on X (because ?^(?) = 1 and мх\л(А) = 0) contradicting the assumption of unique ergodicity (see Katok and Hasselblatt A995)). ? THEOREM 3.7. (X, T) is uniquely ergodic if and only if for every continuous function f: X —> X the averages 1 ""' converge uniformly to a constant function on X. In case that (?, ?) ? uniquely ergodic, with invariant measure ?, the constant is equal to fx f ??. Remark 3.8. According to the Kryloff-Bogoliouboff Theorem A937) any continuous map on a compact metric space X has an invariant Borel probability measure. The proof of this result is based on the Riesz representation theorem (see Rudin A987)). Details are provided in Katok and Hasselblatt A995) where the reader will also find a proof of Theorem 3.7. As explained in Furstenberg A981), Theorem 3.7 can be applied to obtain the following interesting generalization of the equidistribution theorem (Example 3.4(d)): THEOREM 3.9. If p(x) is a real polynomial with at least one coefficient other than the constant term irrational, then for every subinterval I o/[0, 1 [ we have card({0 ^k^n-\: p{k) mod 1 e /}) lim =?(/). и—>зс п This result was first proved by Weyl A916). Dynamical proofs were given by Furstenberg A960), Hahn A965) and Postnikov A966). We conclude this section with a very general representation theorem which asserts that every ergodic measure-preserving transformation on a Lebesgue space is isomorphic to a minimal, uniquely ergodic homeomorphism on a compact metric space. Here we say that a homeomorphism ?: X -> X on a compact metric space X is minimal if for every
1200 F. Blume x e X the orbit {T"x: ? e Z) is dense in X. This result was first proved for weakly mixing transformations (see Section 5) by Jewett A970) and then extended to ergodic transformations by Krieger A972) using the Krieger generator theorem (Theorem 7.16). THEOREM 3.10. IfT.X -> X is an ergodic measure-preserving transformation on a Lebesgue probability space (?, ?, ?), then there is a minimal homeomorphism S on the Cantor set С such that (S, C) is uniquely ergodic and isomorphic to (?, ?, ?, ?). 4. Recurrence One of the earliest results in the study of measure-preserving systems is due to Poincare A899). In contrast to the ergodic theorems of von Neumann and Birkhoff the Poincare recurrence theorem gives us only a qualitative rather than quantitative description of the longterm behavior of a measure-preserving system. THEOREM 4.1 (Poincare recurrence theorem). Assume that (?, ?, ?, ?) is a measure- preserving system. Then for every A € В we have (ЭС ОС \ А П p| [Jr-"A = ?(?). ,n=0"='« / In other words, almost all points of A are infinitely recurrent with respect to A. PROOF. Following the exposition in Parry A981), we note that the sets ЭС Am := (J T-"A n—m form a decreasing sequence in the sense that A„, D Am+\ for all m e No. Since Г-'" Aq = A,„ for all m e N, it follows that for all m\,mi ^ 0 we have ?(?„,,??,„2) = 0 and therefore, ?(??? ?,^=? ?>?) = 0- The result follows by observing that А С Ао- ? A stronger, quantitative statement concerning the recurrence of sets has been established by Khintchine A934). For this we need the following definition: Dehnition 4.2. A set ? с ? is said to be relatively dense if there exists an integer К such that ? П {;',...,;' + ?] ? 0 for all ; e ?. The proof of Khintchine's recurrence Theorem uses the Hilbert space theory of L2(X, ?, ?) and the mean ergodic theorem (see Parry A981)).
Ergodic theory 1201 THEOREM 4.3. //(?, ?, ?, ?) is a measure-preserving system, then for every A e В and every ? > 0 the set ?? := [? e ?: ?(? ? ?~" A) > ?(?J - ?} ? relatively dense. We will now turn our attention to theorems concerning multiple recurrence, which is a subject that has found very interesting applications in combinatorial number theory. We begin by stating two famous number-theoretic results that have been shown to be consequences of a multiple recurrence theorem by Furstenberg and Katznelson. THEOREM 4.4 (Van der Waerden's theorem A927)). If {C\,..., C,} is a partition of the positive integers, then there is a j e {1, r] such that Cj contains arbitrarily long arithmetic progressions, i.e. for every к &N there exist integers a, n e N with [a,a + n,...,a + (k~ \)n] CC,. Van der Waerden's theorem is a corollary of a more general result obtained by Szemeredi A975). To formulate this theorem we need the following definition: A set S С If is said to have positive upper Banach density if for some sequence of products П„ : = [a„. ?, bn. ?] ? ¦ · ¦ ? [a„,r, b„.r] С If with \im„-+x{b„.j - a„j) = oo (for all ;') we have card^nn,,) lim > 0. н-юо card(n„) For Szemeredi's theorem we only need to consider sets of positive upper Banach density in Z, but for later reference in Theorem 4.8 we gave here already the general definition for the r-dimensional case. THEOREM 4.5 (Szemeredi's theorem A975)). Any set of positive integers with positive upper Banach density contains arbitrarily long arithmetic progressions. THEOREM 4.6 (Furstenberg and Katznelson A978)). Let Т\,Ъ, ¦ ..,Tm be commuting measure-preserving transformations (i.e. T, oTj = TjoTi) on a probability space (X.B, ?) and let A e В with ? (A) > 0. Then 1 " lim sup - ^?(?~?? ? ¦ ¦ · ? T~"A) > 0. »—>эс П In particular, there exists an ? e N such that ?G7"??---??„7"?) > 0. As a corollary we obtain the following multiple recurrence theorem:
1202 F. Blume Theorem 4.7 (Ergodic Szemeredi theorem (Furstenberg, 1977)). Let (?, ?, ?, ?) be a measure-preserving system, m e N and A e В with ? (A) > 0. Then there exists an ? e N such that ?(? П Т~"А П · ¦ ¦ П Т-('"~])"А) > 0. It is important to point out that Furstenberg's proof of the ergodic Szemeredi theorem (which preceded the more general Theorem 4.6) is based on ergodic theory—it does not use combinatorial number theory. Using the ergodic Szemeredi theorem, Furstenberg was thus able to provide purely ergodic-theoretic proofs of the Theorems of van der Waerden and Szemeredi. Furthermore, using Theorem 4.6, Furstenberg and Katznelson A978) obtained the following multidimensional extension of Szemeredi's theorem: Theorem 4.8 (Multidimensional Szemeredi theorem). We assume that S С 27 has positive upper Banach density. If F С 27 is a finite set, then there is an integer ? e N and a vector a e 27 such that a + nF С S. An interesting polynomial version of Theorem 4.5 with its corresponding generalization of Szemeredi's theorem (Theorem 4.10) has been obtained by Bergelson and Leibman A996): THEOREM 4.9. If T\, T2,..., Tm are commuting measure-preserving transformations on a probability space (?,?,?) and p\(n),..., p„,(n) are polynomials with rational coefficients taking integer values on the integers such that p,@) = 0 for i = 1,..., m, then for all A e В with ? (A) > 0, we have limsup - ??(?-?,?)? П · · · П Г„7''"'(А)А) > 0. ?^ сю П k=\ THEOREM 4.10. Assume that S С 27 is a set of positive upper Banach density and let P\ (n),..., p,„(n) be polynomials with rational coefficients taking integer values on the integers such that pi @) = 0 for i = 1,..., m, then for any vectors b \,..., b„, e 27 there exist an integer ? and a vector a e 27 such that a + pi {n)b[ e S for all i = 1,..., w. For further improvements and refinements of Theorems 4.3 and 4.4 the reader is referred to Furstenberg and Katznelson A985,1986, 1991), Bergelson et al. A996) and in particular also Furstenberg A981) which gives an excellent account of the connections between ergodic theory, topological dynamics and combinatorial number theory. Finally, we wish to make a remark on the philosophical significance of Theorem 4.9. As a trivial consequence of the Poincare recurrence theorem, we see that for every measure-preserving system (?,?,?, ?) and every A & В there is an ? e N such that ?(? П Т~"А) > 0. Interpreting this statement as a prediction of arbitrarily close future return of a system to states that existed in the past, it is interesting and surprising to notice that such arbitrarily close returns also happen along polynomial sequences, because if p(n)
Ergodic theory 1203 is a polynomial with the properties stated in Theorem 4.9, then for any A e В there is an ? e N such that ?(? ? Т~р{п) А) > 0 (apply Theorem 4.9 to Т\=Ъ = ?, p\ (?) = ? and P2(n) = p(n) + n). So in a certain sense the world looks the same in polynomial time as in standard linear time. 5. Mixing Definition 5.1. A measure-preserving system (?, ?, ?, ?) is said to be weakly mixing if 1 ""' lim -???(?"*???)-?(?)?(?)|=0 forallA,fie?, *=0 and we say that (?, ?, ?, ?) is strongly mixing if lim ?(?~"? ?) ?) = ?(?)?(?) for all A,B eB. Given these definitions and Theorem 3.2, it is easy to see that strong mixing =>· weak mixing =>· ergodicity. Furthermore, strong mixing, weak mixing and ergodicity are isomorphism invariants. THEOREM 5.2 (Equivalent characterizations of weak mixing). For any measure-preserving system (?, ?, ?, ?) the following statements are equivalent: (a) (?,?, ?, ?) is weakly mixing. (b) Ш^оо^'Ц^иУ^) ~ {f,\){\,g)\=0forall f,geL\X,B^). (c) For all f e L2(X, ?, ?) with (/, 1) = 0 we have 1 ""' lim -?(^/./>=?· A=0 (d) There is a set ? С N of density zero (i.e., lim,,^^ card(Ai П {1,...,«})/« = 0) such that lim ?(?-'???) =?(?)?(?) for all A, Be В. (e) ? ? ? is weakly mixing on ? ? X with respect to the product measure ? ? ? (see also Section 8 for a definition of products). (f) ? ? S is ergodic on ? ? ? for each ergodic measure-preserving system (Y, C, v, S). (g) ? ? ? is ergodic. (h) All measurable eigenfunctions of Ut are constant a.e.
1204 F. Blume (i) (Krengel, 1972) For all A e В there exists a strictly increasing sequence {«a}„6n С N such that the intersection of the ?-algebras Cm (m ^ 0) generated by the sets {T~""'+k А: к eN] is trivial (in that it contains only sets of measure zero or one). We wish to give a partial proof of Theorem 5.2 following the exposition in Parry A981) and Petersen A983). For this we need the following analytic fact (see Walters A975)): LEMMA 5.3. If {а„}„ец is a bounded sequence of non-negative integers then we have ???,,^?? - J2"kZ0ak = 0 if and only if there is a set ? С N of density zero such that ???,?^??. п<?м an = 0. Proof of (а) <s> (d) and (a) => (e) =>· (g) => (h) =>· (a) in Theorem 5.2. The equivalence (а) о (d) is a trivial consequence of Lemma 5.3. To prove (a) =>· (e) we assume that (?, ?, ?, ?) is weakly mixing. For ? ? ? to be weakly mixing on ? ? ? it is sufficient to show (using standard approximation arguments) that 1 ""' lim - V \? ? ?((? ? T)~kA, ? ?? ? ?, ? ??) ?—>эс ? '?-~' ~ — ? ? ?(?\ ? ??)? ? ?(?\ ? ??)\ =0 for all ? ?, Аг, В], ?2 е ?. According to (d) there are sets ? ?, Mi с ? of density zero, such that (lim ?(?~"???]??)=?(??)?(??) for/e {1,2}. Hence lim ? ? ?((? ? ?)~"?? ? ?^ П ?, ? ??) = ? ? ?(?\ ? ??)? ? ?(?\ ? ??). Applying the equivalence (a) o (d) to ? ? ?, we conclude that (a) implies (e). The implication (e) => (g) is trivial. To see that (g) implies (h), we assume that Ut has a nonconstant eigenfunction / e L2(X, ?, ?) corresponding to the eigenvalue ? e C. Since Ut is unitary, we have |?| = 1. Thus, for g(x, y) := f(x)f(y) we obtain UTxTg(x,y) = Ut/(x)Ut f(y) = g(x,y) and therefore, g is a nonconstant ? ? ?-invariant function so that ? ? ? is not ergodic according to Theorem 3.2(b). Finally, we will provide a brief outline for the proof of (h) =>· (a) that illustrates the use of the spectral theorem. We denote by V the closed linear span of the eigenfunctions of Ut and for / e V1- and g e L (?, ?, ?) we define a measure ? on the spectrum o(Ut) of Ut via the equation v(A) := {E(A)f g), where ? is the projector-valued measure generating Ut according to
Ergodic theory 1205 the spectral theorem. Then it follows that " *=0 " k=0UalUT> I ""' Г kklkdv{k)dv^) k=QJ'r(t>T)xat,UT) =1 a(Uj)y,a(Uj) n(\ — ??) ( ?-^??(?)??{?). A) The last step is permissible, because 1 — ?? = 0 if and only if {?, ?) is a point on the diagonal ? of a(Ur) ? ?(???) and /i has measure zero, because using the spectral theorem, it is not difficult to show that ?([?}) = 0 for all ? e ?(???)- Since the integrand in A) converges to zero a.e., the bounded convergence theorem and Lemma 5.3 imply that Hindoo + ?*=? \(Ujf,g)\ = 0. If now / is an arbitrary function in L2{X,B, ?), then (h) implies that / - (/, 1) e V1 and therefore, , n— I | «- I 0= Hm -??(?/?/-</,??|= lim -???-?-?/??1·*)!- *=0 it=0 Replacing / and g by хл and ?^, it follows that ? is weakly mixing. D For a complete proof of Theorem 5.2 the reader is referred to Petersen A983), Parry A981) and Krengel A972) (an interesting improvement of Krengel's characterization of weak mixing in (i) has been given by Bergelson A993)). THEOREM 5.4 (Equivalent characterizations of strong mixing). For a measure-preserving system (?, ?, ?, ?) the following statements are equivalent: (a) (?,?, ?,?) is strongly mixing. (b) For all f,ge L2(X, ?, ?) we have lim {UT f,g) = (f, I)(I,g). 17—>00 (c) (Renyi, 1958) For each A e В we have lim ?(?""?? ?)=?(?J. II—>00 V ' (d) (Blum and Hanson, 1960) For all ? e [l,oo[, all strictly increasing sequences {ki}ieN С ? and all f e U7(X, ?, ?) we have lim II—>3C №'-f. ??? :0.
1206 F. Blume (e) (Ornstein, 1972) (?, ?, ?, ?) is completely ergodic (i.e., (?, ?,?, ?") is ergodic for all ? e Z) and there is а с e R such that for all А, В е В we have limsupM(r_"A ? ?) ? ??(?)?(?). ?—>?? Examples 5.5. (a) Bernoulli shifts. For Bernoulli shifts it is easy to see that lim ?(?~"????)=?(?)?(?) for all cylinder sets A and B. Standard approximation arguments show that this equation actually holds for all measurable sets A and B. Thus, Bernoulli shifts are strongly mixing. (b) Markov shifts. An m ? m stochastic matrix A is said to be aperiodic, if there is no i e {1,..., m) and no integer к > 1 such that the condition a" > 0 (where a" is the /th element on the diagonal of A") implies that к divides n. If (?,?, ?, ?) is a Markov shift on a finite alphabet, then (?, ?, ?, ?) is strongly mixing if and only if the stochastic matrix associated with A is irreducible and aperiodic (see Petersen A983)). (c) Finite interval exchanges. If ?: [0, 1[ -> [0, 1[ is a finite interval exchange transformation and ? is Lebesgue measure on [0, 1 [, then ([0, 1 [, ?, ?, ?) is not strongly mixing (see Cornfeld et al. A982)), but possibly weakly mixing (see Choe A993)). (d) Lebesgue spectrum. We say that a measure-preserving system (?,?,?,?) has Lebesgue spectrum of multiplicity N, if there is a set ? of cardinality N (possibly infinite) and a set of functions {f\j: ? e ?, jeZ) which, together with the constant function 1, form an orthonormal basis for L2(X, ?, ?) such that Urfx.j = fx.j+\ for all ? e ? and j e Z. It can be shown that a measure-preserving system is strongly mixing if it has Lebesgue spectrum (see Petersen A983)). (e) A weakly mixing system that is not strongly mixing. We have already mentioned that strong mixing implies weak mixing, but it is actually not easy to see that the reverse implication is false. We will now give a brief outline of a construction due to Chacon A969) which not only yields a weakly mixing system that is not strongly mixing, but also illustrates well how measure-preserving transformations can be constructed using cutting- and-stacking procedures (see also Section 8). We begin by assuming that we are given a finite tower (or stack) ? =(I\,..., I„) of disjoint intervals I\ ,...,/„ of equal length. The intervals I\,..., /„ are also referred to as the levels of ?. We define a measure-preserving transformation и—I n 7|:U''-U'i ? = | ?=2 such that the restriction of T\ to Д for к e {1,..., ? — 1} is an affine map onto h+\ ¦ We then 'cut' the tower ? vertically into three substacks of equal width and stack up these three subtowers with a spacer (that is an interval equal in length to the width of the three subtowers) inserted between the second and the third subtower. The resulting tower Г2 consists of Ъп + 1 levels (see the diagram below). We proceed to define ?? in analogy to T\ in such a way that the restriction of Ti to one of the first Ъп levels in Г2 is an affine
Ergodic theory 1207 map onto the next higher level. Now we repeat this cutting-and-stacking procedure with ?? in place of ? to produce a tower гз of height 3Cи + 1) + 1 and continuing this way we obtain a sequence of towers {?„} where each tower is associated with a transformation T„. The pointwise limit ? of the transformations T„ is a measure-preserving transformation (with respect to Lebesgue measure) on the union X of the levels of the towers ?„. Since X has clearly finite measure, we have constructed a measure-preserving system (?, ?, ?, ?), where we denote by ? Lebesgue measure divided by the Lebesgue measure of X. 79 ? To see that ? is not strongly mixing, we choose a set A e В such that ? (A) < \,. Since the levels of the towers ?„ generate B, it follows that for large ? most of the levels of ?„ are almost entirely contained in A or in X \ A. Denoting the length of ?„ by /„ and observing that the first /„ levels in ?„+| are mapped onto the second /„ levels, it follows that ?{?'" А П A) > ^?(?) > ? (AJ contradicting strong mixing. To see that ? is weakly mixing, we assume that / is an eigenfunction of Ut corresponding to the eigenvalue ?. Then for large ? the function / is almost constant on almost all levels of ?„. If / were actually constant on each level of ?„ then the construction of ?,,+ ? from ?„, as illustrated above, would imply that ?'" = ?'" + ? and therefore ? = 1. Since ? has rank one we know that ? is ergodic (see Example 3.4(c)), and it would follow that / is constant a.e. on X. Thus according to Theorem 5.2(h) we would be able to conclude that ? is weakly mixing. For a detailed discussion that makes these arguments precise see Petersen A983). It is interesting to notice that in a certain sense almost every measure-preserving system is weakly mixing but not strongly mixing. To formulate this result, we assume that (?,?,?) is a Lebesgue space. We say that two measure-preserving transformations on X are equivalent if they differ only on a set of measure zero, and we define the weak topology on the set of equivalence classes of measure-preserving transformations on X via the statement lim T„ = T ? ??6? lim ?(?,,????) = 0. ?—>oo /?—>эс
1208 F. Blume It can be shown that with respect to the weak topology as defined above, the set of weakly mixing transformations on X is residual (Halmos, 1944), while the set of strongly mixing transformations is of first category (Rokhlin, 1948). Along these lines Glasner and King A998) established a very general zero-one law that classifies sets of transformations defined via certain isomorphism invariants as either residual or meagre. We now turn our attention to the subject of higher orders of Mixing. Our first theorem in this context says that a weakly mixing measure-preserving system is weakly mixing of all orders. For a proof see Furstenberg A981). THEOREM 5.6. Assume that (?,?,?,?) is a weakly mixing measure-preserving system. IfAo, A\,...,Aj e В and p\ < ¦ ¦ ¦ < pj e N then for all ? > 0 the set {keN: |?(?0??-^?| ? ¦ ¦ · ? T^'J Ay) - ?(?0)?(?|) ¦ ¦ -M(Ay-)| >?} has density zero (see Theorem 5.2(d)). Given the statement of Theorem 5.6, it is obvious that for all ? > 0 we have 1 ""' limsup- ^|?(?0? ?-^'?, ? ¦¦ ¦ ? ?'^?? "^°° n k=o -?(?0)?(?|)···?(?7)| ?? ?. Therefore, using Lemma 5.3, we obtain the following corollary: COROLLARY 5.7. Assume that (?,?,?,?) is a weakly mixing measure-preserving system. IfAo, A |,..., Aj e В and p\ < ¦ ¦ ¦ < p; e N then 1 "_l lim - ???(????-^?| ? · ¦ · ? T~kpJ А Л - ?(?0)?(?,) ¦ ¦ -?(?,-)| = 0, and there is a set ? С N of density zero, such that lim ?(?0??-"',|?|?-??-"^?.)=?(??)?(?|)-?(?7). A result similar to Theorem 5.6 can also be obtained for mildly mixing systems (see Furstenberg A981)): DEnNlTlON 5.8. Assume that (?, ?, ?, ?) is a measure-preserving system and let / e L2(X, ?, ?). We say that / is rigid if there is a sequence {/ц }„6n С N such that lim ||/-t/"l/IU=0.
Ergodic theory 1209 Definition 5.9. A measure-preserving system (?,?,?, ?) is said to be mildly mixing if there are no nonconstant rigid functions in L2(X, ?, ?). Note: Using Theorems 5.2(h) and 5.4(b) it is not difficult to show that strong mixing =>· mild mixing =>· weak mixing. THEOREM 5.10. Assume that (?, ?, ?. ?) is a mildly mixing measure-preserving system. IfAo, ? ?,..., Aj eB and p\ < ¦ ¦ ¦ < pse N then for all ? > 0 the set [k e ?: \?(?0 П Г-**" А, П ¦ ¦ ¦ П T'ki''Aj) - ?(?,)?(?2) · ¦ ¦ ?(?*)| < ?} is relatively dense (Definition 4.2). For stronger versions of this statement see Furstenberg A981). Now we will discuss higher order strong mixing. A measure-preserving system (?, ?, ?,?) is said to be strongly mixing of order m, if for all Ao, A,„ e В we have lim ?(?0 П ?""' ? ? ? · ¦ · ? ?-("?+ ¦+"'")Am) Л| И,,,-» 00 ' = ?(?0)?(?|)-?(?„,). Kalikow A984) showed that strong mixing implies strong mixing of order three (and by the same argument strong mixing of all orders) for rank one transformations, and Ryzhikov A993) extended this result to finite rank transformations. Furthermore, Host A991) demonstrated that strong mixing implies strong mixing of all orders for systems with singular spectral measure and Thouvenot showed that if it is true that two-fold mixing implies three-fold mixing for zero entropy systems (see Section 7), then this implication is true for all systems. However, in general the problem of whether strong mixing implies higher order strong mixing is still open. An important class of measure-preserving transformations that are strongly mixing of all orders is formed by the so-called К-automorphisms. Definition 5.11. A measure-preserving transformation Г on a probability space (?, ?, ?) is said to be а К-automorphism ('K' stands for 'Kolmogorov'), if for all m e N and all sets Aq, ..., A,„ e В we have lim sup |?(?0? ?)-?(??)?(?)| =0, "^°°B6C,f(/l, A,„) where C™(A \,..., A,„) is the smallest ? -algebra containing the sets TkAi ioxk^n and ie(l,...,m). Given this definition it is easy to show that every /^-automorphism is strongly mixing of all orders, but the reverse conclusion does not hold (see Rudolph A990) and Cornfeld etal. A982)).
1210 F. Blume Theorem 5.12 (Equivalent characterization of ^-automorphisms). IfT:X-*Xisa measure-preserving transformation on a probability space (?, ?, ?), then ? is a K-automorphism if and only if there is a sub-? -algebra Л С В such that (a) Г'ЛсА (b) U^-oo T"A generates B, and (c) nL-oo Т"Л " the trivial ?-algebra {0, X). In Section 7 (Example 7.18(c)) we will give other characterizations of К -automorphisms that establish an important link to the theory of entropy. For a proof of Theorem 5.12 see Cornfeld et al. A982). Furthermore, it can be shown that every ?-automorphism has countable Lebesgue spectrum (see Parry A981) or Petersen A983)). Finally, we wish to discuss briefly the concept of topological mixing for cascades (Definition 3.5) and its relation to measure-theoretic mixing. Dehnition 5.13. A cascade (X, T) is said to be: (a) topological^ ergodic if all proper closed Г-invariant subsets of X are nowhere dense inX; (b) topological^ weakly mixing if for all nonempty open sets ?, ? С X the set {n e N: T"A П ? ? 0} contains arbitrarily long blocks of consecutive integers; (c) topologically strongly mixing if for all nonempty open sets ?, ? С X there exists an N eN such that T"A ? ? ? 0 for all ? > ?. If (?, ?) is a cascade and ? a ?-invariant Borel probability measure with support X, then it is easily verified that measure-theoretic ergodicity, weak mixing and strong mixing (Definitions 3.1 and 5.1) imply topological ergodicity, weak mixing and strong mixing respectively. However, the converses of these implications are not true, even if (X, T) is uniquely ergodic (see Kolmogorov A953) and Petersen A970)). 6. More about convergence in ergodic theory In this section we list several non-standard ergodic theorems. We begin with a discussion of averages along subsequences. THEOREM 6.1 (Bourgain, 1989). Let (?,?,?, ?) be a measure-preserving system and assume that p(n) is a polynomial such that p(n) e Ifor all ? e Z. Then for any q > 1 and any f e L4(X, ?, ?) the averages l-Zf(TP*>x) converge pointwise a.e.
Ergodic theory 1211 THEOREM 6.2 (Furstenberg and Weiss, 1996). If (?,?,?,?) is a measure-preserving system and f,g? L°°(X, ?, ?), then the averages are convergent in L2(X, ?, ?). THEOREM 6.3. Assume that (?, ?, ?, ?) is a measure-preserving system and let /, g,h e L°°(X, ?, ?) and a,b,c e N. Then the averages 1 ""' -~TuaT"fUbT"gU^h ? '—' k=0 are convergent in L2(X, ?, ?). For a proof and further discussion of this result see Conze and Lesigne A984, 1988) and Furstenberg and Katznelson A990). A very general result for convergence along polynomial sequences in weakly mixing systems was obtained by Bergelson A987). THEOREM 6.4. Assume that p\ (и),..., p,„(n) are polynomials with integer coefficients such that each pj is nonconstant and each pi — pj with i ? j is nonconstant as well. If (?,?, ?,?) is a weakly mixing measure-preserving system and f\,...,f„ e L°°(X,B,v),then = 0. In general, for a sequence A = {?„bisN С N and a given measure-preserving system we may examine the convergence behavior of the averages M,(A,f)(x):= J ? /(?^)' where A(t) := {к е A: 1 ^ к ^ t] for all t > 1. Using Fourier analytical methods developed by Bourgain, it can for example be shown (see Rosenblatt and Wierdl A995)) that if A = {?„}„?? is the (strictly increasing) sequence of prime numbers, then for every measure-preserving system (?. ?,?,?) and every / e L°°{X,B. ?) the limit lim,^oo M, (A, /) exists in L2(X, ?, ?). This approach is also easily extended to randomly generated sequences. To see this, we assume that (?,C. P) and {УлЬегг is an i-i-d. sequence of 01-valued random variables on ? with P(Y„ = 1) = ? and P(Y„ = 0) = 1 - p. For each wefiwe define a sequence ?? via the equivalence ? e ?? о· У» (?) = 1. Then lim 'f^U^k)fr..U?'»(k)fm- f /,??..· [ /„?? ,„ Jx Jx
1212 F. Blume for almost every ? e ? the sequence ?? is ergodic in the sense that for any measure- preserving system (?, ?, ?, ?) and any / e ?,°°(?, ?, ?) the limit linv^oc ?,(??, f) exists in L2(X,B, ?) and is equal to fx f ??. A more general result concerning pointwise convergence along random subsequences is due to Lacey A994): THEOREM 6.5. For an ergodic measure-preserving system (?, С, P, S) andan integrable function ? : ? —> Ъ we set k-\ Tk(w):=J2s(siv) for all к е N. i=0 If fnSdP ф0, then there is a set ?? e С with ?(??) = 1 such that for every ? e ??, every measure-preserving system (?, ?, ?, ?) and every integrable function f on X the averages *=l converge a.e. with respect to ?. Intuitively, Theorem 6.5 addresses the question of the influence of time fluctuations on experimentally collected data. For example, we may ask what happens if instead of making a measurement every 10 hours, we make small random errors around the designated target times. In Theorem 6.5 the possibility of such errors is taken into account via the sequence of measuring times Tk(w) generated by the transformation S and the function S. As stated in Theorem 6.5, the averages of measurements taken along such perturbed time sequences will almost surely converge so that in a sense we have a wide latitude for error. For further discussion of time fluctuations see Petersen A996). In the context of random ergodic theorems we also wish to mention the original random ergodic theorem by Ulam and von Neumann A945): Theorem 6.6 (Random ergodic theorem). Let (?,?,?) and (?,?, ?) be probability spaces and assume that for every ? e ? we are given a measure-preserving transformation ??: X —>¦ X such that for every measurable function f on X the map (?, ?) ь* f(T0)x) is measurable on ? ? ? with respect to the product measure ? ? P. Then for all f e L] (?,?,?) and almost all sequences {?„}„6? in the product measure space ??^^?, С, ?) the averages 1 " k=\ converge a.e. with respect to ?. Note: The proof of this theorem is based on a so-called skew product construction which is defined in Section 8.
Ergodic theory 1213 An important strengthening of Birkhoff's pointwise ergodic theorem was obtained by Wiener and Wintner A941). THEOREM 6.7 (Wiener-Wintner theorem). //(?, ?, ?, ?) is a measure-preserving system and f & L1 (?, ?, ?), then there is a set Xq e В with ?(??) = 1 such that ?? lim -У/(Ткх)е'к *=0 exists for all ? e К and all ? e Xo. The significance of this theorem lies in the fact that convergence of the ergodic averages occurs on a set Xo of full measure that is common to all ? еШ. For a single ? еШ, the existence of the ergodic averages on a set of full measure is a trivial consequence of the ergodic theorem, but it is not at all obvious that such a set exists for all ? e К simultaneously, because ? is not countable. As a generalization of the Wiener-Wintner theorem, Bourgain, Furstenberg, Katznelson and Ornstein obtained the following result (see Bourgain A989)): THEOREM 6.8 (Return-times theorem). If (?,?,?,?) is a measure-preserving system and f e L°°(X, ?, ?), then there is a set Xq&B with ?(??) = 1 such that I" lim -^f(Tkx)g(Sky) k=0 exists for all ? e Xo, every measure-preserving system (Y, C, v, S), every g e L1 (Y,C, v) and a.e. у e ? with respect to v. There is an interesting connection between the Wiener-Wintner theorem and the ergodic Hilbert transform (Theorem 6.10) via a double maximal inequality for the helical transform. For a measure-preserving system (?, ?, ?, ?) and a function /: X -> С we define the helical transform as H**f(x):= sup neN.flelf A f(Tkx)eike - f(T-kx)e-,kt> k = \ THEOREM 6.9 (Campbell and Petersen, 1989). If'(?,?,?,?) is a measure-preserving system and f e L2(X,B, ?), then for all ? > 0 we have II fll2 ?{???: ?** f(x) > ?} ^ ^.
1214 F. Blume THEOREM 6.10 (Ergodic Hubert transform (Cotlar, 1955)). We assume that (?, ?, ?, ?) is a measure-preserving system. If f e L2(X, ?, ?), then the sums " f(Tkx)-f(T~kx) converge pointwise a.e. as ? —> oo. The theorem by Campbell and Petersen is a profound result which is closely related to Carleson's theorem concerning the pointwise convergence of Fourier series (see also Assani and Petersen A992) and Assani A993)). Furthermore, efforts have been made to use the helical transform to describe the full spectral measure of the induced unitary operator corresponding to a measure-preserving transformation via individual sample paths, that is via orbits of points under the action of T. For a more detailed discussion of this subject the reader is referred to Petersen A996). 7. Entropy and information One of the central problems in ergodic theory is the classification problem: How can we decide whether two given measure-preserving systems are isomorphic or not? One option is to distinguish systems via dynamical properties such as ergodicity or mixing. For example, a Bernoulli shift is not isomorphic to a finite interval exchange, because the former is strongly mixing while the latter is not (see Examples 5.5). Another option is to distinguish measure-preserving systems according to the spectrum of the induced unitary operator. For the special case of discrete spectra, a very satisfactory answer to the classification problem is given by the following theorem (see von Neumann A932b) and Halmos and von Neumann A942)): THEOREM 7.1 (Discrete spectrum classification theorem). We assume that S and ? are measure-preserving transformations on a non-atomic Lebesgue probability space (?, ?, ?) such that the induced operators U$ and Ut both have discrete spectrum (that is the eigenfunctions of Us and Ut span L2(X,B, ?) respectively). Then the measure- preserving systems (?,?,?, S) and (?,?,?,?) are isomorphic if and only if the set of eigenvalues of Us is equal to the set of eigenvalues ofUj. The case of transformations with continuous spectrum remained intractable until Kolmogorov A958) introduced entropy as a new isomorphism invariant to show that there are non-isomorphic measure-preserving systems with identical continuous spectra. We will now begin our discussion of entropy by defining the information function for finite (or countable) partitions. For a finite (or countable) index set / we say that a = {A,¦: i e /} is ? finite (or countable) measurable partition of a probability space (?, ?, ?) if (a) A, e В for all г el, (b) А, П Aj¦ = 0 for all i, j e / with i ? j, and (c) m(U,6,A,)=1- ?
Ergodic theory 1215 If a = {A,-: i e /} and ? = {?}: j e J] are two finite (or countable) partitions, then the refinement of a and ? is defined as ??? = {?,?)?] |/e/, j ej]. For a given point ? in a probability space (?, ?, ?) and a given finite (or countable) partition a = {A,: / e /} we can gain some information about the location of ? in X by identifying the set A e a of which ? is a member, and the smaller the measure of A, the larger is the gain in information. Thus it is reasonable to define the information function associated with a by an equation of the form /„(jc) = J^Aea ??(?)?(?(?)), where ? : [0, 1] -> R is a positive decreasing function. We say that two finite (or countable) partitions a and ? are independent if ?(? ? ?) = ?(?)?(?) for all A e ? and all S e /3. It is natural to require that for independent partitions a and /3 the information function ???? associated with the refinement of a and ? is equal to the sum of the information functions Ia and ??. This implies that ? should satisfy the equation ?(?(?)?(?)) = ?(?(?)) + ?(?(?)). Imposing the additional requirement that ? is continuous, we are led to define ? as a negative logarithmic function, and by convention, we choose 2 as the base of the logarithm. Dehnition 7.2. If ? is afinite (or countable) partition of the probability space (?,?,?), then the information function associated with a is /ff(jc):=-^XA(jc)log2M(A) for all ? e X Aea and the entropy of a is H{a):= ? ??{?)??. Jx Remark 7.3. If ? is afinite (or countable) partition and i0 ifjt=0, J(x)- |-;clog2(jc) if* e @,1], then Definition 7.2 implies that H(a)=Yjf^{A)). Aea (Note: If ? is not finite, then it is possible that ? (a) = oo.)
1216 F. Blume In order to work efficiently with entropy and information, it is necessary to introduce also the concepts of conditional entropy and conditional information. For a probability space (?, ?, ?) and a sub-a-algebra ? С В we denote by ?(?\?) the conditional expectation of xa with respect to T, i.e., ?{?\?) := ?(?? \?) for all AeB. Definition 7.4. For a finite (or countable) partition ? of a probability space (?, ?, ?) and a sub-a-algebra ? С В we define the conditional information function with respect to ? and .Fas Ia\r{x) ·= - J] ХлОО log2 ?(?\?){?) for all jc e X, and the conditional entropy of a with respect to F is Я(а|Л:= f ??^(?)??. Jx Remark 7.5. If ? is a finite (or countable) partition, then the conditional information function and the conditional entropy with respect to the ?-algebra ?{?) generated by the sets in/3 are denoted by ??\? and ?(?\?), and the conditional expectation of ? a is denoted by ?{?\?). It is easy to see that ?(?\?)(?)=???(?? /рч tfl ?(?) and therefore, ??\?(?) = ?(?\?) = ¦¦ ??^?(.)??& Лба v fie/3 fie/3 Furthermore, if ? is trivial so that ?{?) = {0, X) is trivial, then ??\? = Ia and ?{?\?) = ? (a). A few basic, easily verifiable properties of entropy and information are listed in the following proposition: PROPOSITION 7.6. Assume that a and ? are finite (or countable) partitions of a probability space (?, ?, ?) and let ? and Q be sub-?-algebras ofB. (a) ????\? = Ia\T + IfiiavF· where we denote by ay ? the ?-algebra generated by ? and the sets in a. (b) H(av ?\?) = ?(?\?) + ?(?\?? ?). (c) If ? С д, then Н(а\П ? H(a\Q).
Ergodic theory 1217 (d) If ? is a refinement of a (i.e., ? ? ? = ?), then ?(?\?) ^ ?(?\?). (e) ?(???\?)^?(?\?) + ?(?\?). (f) If ? (?), ? (?) < oo, then a and ? are independent if and only if ? (? ? ?) = ?(?) + ?(?). For a sequence {.7>}„6n of sub-a-algebras, we denote by V(?L| ?n tne ?-algebra generated by the sets in the union UhLi ·?>· The next theorem is a consequence of the martingale convergence theorem. THEOREM 7.7. Assume that (?, ?, ?) is a probability space and {Tn}„eN is an increasing sequence of sub-?-algebras of В (i.e., T\ С ?? С · · ¦)¦ Ua 's a finite partition ofX, then lim H(a\Tn) = H\a\\l T„ \ /i = l / If (?, ?, ?, ?) is a measure-preserving system and ? is a finite (or countable) partition of X, then we denote by a"n the refinements of a under T, that is n < := T~ma ? · · · ? T~"a = \/ T~ka к =tn is the partition consisting of all the possible intersections of sets of the form T~k A with A e a and m ^ к ^ и. Furthermore, we denote by a^ and a"^ the ?-algebras generated by the sets in the unions \JfL,„ T~ka and U"=-oc T~ka respectively. THEOREM 7.8. If (?,?,?,?) is a measure-preserving system and a is a finite (or countable) partition ofX with ? (a) < oo, then n—*oo ? exists and is finite. PROOF. Following the exposition in Petersen A983), we begin by showing that the sequence {#„}„6pj with H„ := Н(а'^^) is subadditive. For и, m e N we have Hn+m = H(al~] ? ?-?'-') ^ H(a''-i) + H(T~"a'^) = H„ + Hm. Therefore, Hj„ ^ jH„ and Hj ^ jH\, so that {#,7;'},6? is bounded and L : = liminf/_3c HjH < oo. Now let ? > 0 and choose a large j with Hj/j < L + ?. Then for ? > j and к := [n/j] we have L-^iktZfiini^ + Mj <!i±liL+eh ? kj kj к and this clearly shows that L = lim„^oc H„/n. ?
1218 F. Blume We are now ready to define the entropy of a measure-preserving transformation: Definition 7.9. Assume that (?, ?, ?, ?) is a measure-preserving system. (a) If ? is a finite (or countable) partition of X, then the entropy of ? with respect to a is h(a,T):= lim ° . n—>oo n (b) The entropy of ? is defined as the supremum of h(a, T) over all finite partitions of X, i.e., h(T):= sup h(a,T). [a\a finite] Remark 7.10. It is easy to see that if (?,?,?,?) and (Y,C,v,S) are isomorphic measure-preserving systems, then h(T) = h{S). In other words, entropy is an isomorphism invariant. THEOREM 7.11. Assume that (?, ?, ?, ?) is a measure-preserving system. (a) If a is a finite partition ofX, then h(a,T)= lim ?(a\a"\ = ?(a\al°). (b) If a and ? are finite (or countable) partitions ofX with ? (?), ? (?) < oo, then htf,T)^h(a.T) + H(P\a). (c) For allkeZ we have h(Tk) = \k\h(T). Now we will discuss an alternative approach to the definition of entropy for ergodic transformations via counting of names (see Rudolph A990)). For this it is useful to consider a finite partition a = {A\,..., A,,,} of a probability space (?,?,?) as a function from X into a finite set {a\,...,am\ of symbols or 'names'. More precisely, if a : X -> {??,..., a,„} is a measurable function, then the sets A, = a~' {a,} are the elements of the associated partition. If Г is a measure-preserving transformation on X, then for all ? e N and all ? e X the T, a, ?-name of ? is a„(x) := (a(x), a(Tx),..., ?(?,,_ '*)). Given this definition, it is easy to see that <-'={<*,;'{*}·. *е{а|,....а,„}"}.
Ergodic theory 1219 In other words, the names a„ (x) provide a coding for the sets in a'^ . For ? > 0 we say that a collection of names S С {? ?,.. .,am}" is ?-good if the measure of the union Uses'*/?'!5) has measure greater than 1 — ?, and we define N(T, ?, ?, ?) := min{cardE'): S is ?-good}. In words: ?(?,?,?,?) is the smallest number of sets in a'^'] that cover a set greater than 1 — ? in measure. Dehnition 7.12. Assume that (?,?,?,?) is a measure-preserving system and let a: X -> {a\,.. .,am] be a finite partition of X. Then ut-r ч log2 NG\ or, ?, ?) h(T, ?,?,?) := —f and ? h(T, ?, ?) := lim inf h(T, ?, ?, ?). ?—>эс THEOREM 7.13. If (X, В, ?, ?) is an ergodic measure-preserving system, then lim sup/г (?, ?, ?, ??) ^ ????^?,?,?,??) ?—>?? ?—» ?? for all О < ?? < ?2 < 1 and a// ^im'fe partitions a of X. Furthermore, the limit lim?^oh(T, ?, ?) exists and h{T)= sup lim h(?, ?, ?). [a\a finite] ?—>0 (For a proof see Rudolph A990).) Entropy as an isomorphism invariant is of course only useful, if it can be easily calculated. For the computability of entropy, the concept of generating partitions is of crucial importance: Dehnition 7.14. If (?, ?, ?, ?) is a measure-preserving system, then a finite partition ? of X is said to be a generator with respect to ? if orf^ = B. The following theorem is due to Kolmogorov A958) and Sinai A959). THEOREM 7.15 (Kolmogorov-Sinai theorem). //(?, ?, ?, ?) is a measure-preserving system and a is a generator with respect to T, then h(T) = h(a,T). (For a proof of this theorem see Parry A981).) For finite entropy ergodic measure- preserving systems on Lebesgue probability spaces generators with respect to ? do always exist (see Theorem 7.16). So in principle, entropy can be calculated via the Kolmogorov- Sinai theorem for a very large class of measure-preserving systems.
1220 F. Blume THEOREM 7.16 (Krieger generator theorem A970)). // ? is an ergodic measure- preserving transformation on a Lebesgue probability space (?, ?, ?) such that h(T) < oo, then there exists a finite partition aofX which is a generator with respect to T. If it is not possible to explicitly determine a generator, then the following theorem can still be useful in calculating the entropy of a measure-preserving transformation (see RokhlinA967)). THEOREM 7.17. //(?, ?,?,?) is a measure-preserving system and {a,,} is an increasing generating sequence of finite partitions of X (i.e., a„+\ ? a„ = an+\ for all ? e N and \I^L\an = B), then h(T)= lim h(a„,T). n—»oc EXAMPLES 7.18. (a) Bernoulli shifts. Assume that (?,?,?,?) is a Bernoulli shift on the finite alphabet {1,..., m] with weights p\ pm. If a is the time-zero partition of ?, i.e., a = {A\,..., Am] with A, := [x e ?: xq = i], then ?"*?, = {? e ?: Xk = i}. Therefore, a^ contains all cylinder sets and it follows that a^x = B. Since the partitions ?, ?~'?,?~2?,... are pairwise independent. Proposition 7.6(f) and the Kolmogorov- Sinai theorem imply that #(?"~'? ? " h(a) = h(a,a)= lim V ° ; = lim - У^Н(а~ка) = ? (a) /I—>ЭС Ц II—>ЭС П *—' к=0 in i=\ In words: The entropy of a Bernoulli shift is equal to the entropy of the time-zero partition. (b) Markov shifts. Assume that (?, ?, ?, ?) is a Markov shift determined by an m ? m stochastic matrix A and a corresponding probability vector p. Using similar techniques as in a), it can be shown (see Petersen A983)) that m h(p) = ~ X) PiOij log2 ajj. i.j = \ (c) К -automorphisms. If (?, ?, ?, ?) is a measure-preserving system, then ? is a K- automorphism if and only if ? has completely positive entropy, that is if and only if h(a,T) >0 for all non-trivial finite partitions ? of X. Furthermore, it can be shown that ? is a A"- automorphism if and only if for any finite partition a we have oo f>r = {0,x). /7=0
Ergo die theory 1221 For proofs of these results see Rokhlin and Sinai A961) and Cornfeldet al. A982). (d) Finite interval exchanges. If ?: [0, 1[ -> [0, 1[ is a finite interval exchange transformation, then h(T) = 0. This result follows from the more general fact that every measure-preserving transformation of finite rank has entropy zero (see Ferenczi A990)). (e) Discrete spectrum. If (?, ?, ?, ?) is a measure-preserving system such that Ut has discrete spectrum, then h(T) = 0 (for a proof see Petersen A983) or Parry A981)). (f) Product transformations. Let (X\,B\,?\.?\) and (Xi.Bj, ??, ?>) be measure- preserving systems and let ? := ?| ? Ъ be the product transformation on the product space (?? ? Xi,B\ <8>??2,М| XM2)-If<*i and ?? are finite partitions of ? ? and X2 respectively, then ? = <*? x<*2:={A| ? Ai: A| ea\, Ai e ??} is a finite partition of ? ? ? ??. It is easy to see that ?^-1 = (??)'?_? ? (??)^-1 and ^KH) = %i)S"')+^(KH)· Thus, /?(?, ?) = h{ot\, ?\) + h(ai, ?>). Choosing increasing generating sequences {ot\„} and {<*2.п} and setting a„ :=а\ш„ х ??.„, it is not difficult to show that {a,,} is an increasing generating sequence in the product space and therefore, Theorem 7.17 implies that h(T\ хГ2) = й(Г|)+й(Г2). A very important new area of research in the theory of entropy developed when Ornstein A970a, 1970c) showed that two Bernoulli shifts are isomorphic if they have the same entropy. In other words, for Bernoulli shifts entropy is a complete isomorphism invariant. We will summarize here briefly the main results of Ornstein's theory. Dehnition 7.19. Let ? > 0 and assume that a and ? are finite partitions of a probability space (?, ?, ?). We say that a and ? are ?-independent, if there is a family Q С ? such that the following conditions are satisfied: (a) M(UB6gS)> 1-е; (b) Елеа ЫА\В) -?(?)\ < ? for all B eQ. Dehnition 7.20. Assume that (?,?,?,?) is a measure-preserving system. A finite partition ? of X is said to be weakly Bernoulli if for every ? > 0 there exists an m e N such that for all ? e N the partitions aJJ and <*!,"'_„ are ?-independent. THEOREM 7.21 (Friedman and Ornstein, 1970). Assume that (?,?,?,?) is an er- godic measure-preserving system with a weakly Bernoulli generator. Then the system (?,?,?,?) is isomorphic to any Bernoulli shift of the same entropy. Remark 7.22. If (?, ?, ?, ?) is a Bernoulli shift, then the time zero partition a of ? is a generator such that org and <*!,"_„ are independent and in particular also ?-independent for all m, ? e N. Thus, Bernoulli shifts are weakly Bernoulli, and therefore, Bernoulli shifts
1222 F. Blume of equal entropy are isomorphic. It can also be shown that the time zero partition of a strongly mixing Markov shift (see Example 5.5(b)) is weakly Bernoulli so that a strongly mixing Markov shift is isomorphic to any Bernoulli shift of the same entropy (see Petersen A983)). Ornstein A970b) also proved a strengthening of Theorem 7.21 by replacing the assumption of 'weakly Bernoulli' successively by the weaker assumptions of 'very weakly Bernoulli' and 'finitely determined'. An interesting new proof of Theorem 7.21 (and its extension to the very weakly Bernoulli case) constructing an isomorphism explicitly (rather than just establishing the existence of an isomorphism as in the original proof by Ornstein) was given by Keane and Smorodinsky A979a) (see also Keane and Smorodinsky A977, 1979b, 1979c)). Given that the entropy of a partition a is defined as the integral of the information function /„, it is interesting to examine the convergence behavior of Ia„-\. If (?, ?,?,?) is a measure-preserving system, then / ,,-iU) can be regarded as a measure for the amount of information encoded in the и-name a„(x). The next theorem says that in the ergodic case this information converges to the entropy h(T) a.e. We encounter here again the important theme of deriving global information about a measure-preserving system from a single sample path. More precisely, the information encoded in the orbit of a point ? under the action of ? is almost surely equal to the entropy of T. THEOREM 7.23 (Shannon-McMillan-Breiman theorem). // (?, ?, ?, ?) is an ergodic measure-preserving system and a is a countable partition ofX with ? (a) < oo, then /„-.(*) lim —2 =h{T) a.e. n^oc ? For the original proof of this theorem see Shannon A948), McMillan A953) and Breiman A957). A proof using the Martingale convergence theorem is given in Petersen A983) and for a proof in the spirit of Definition 7.12 and Theorem 7.13 see Rudolph A990) or Shields A996). We will now discuss briefly some facts related to zero entropy systems. If (?, ?, ?, ?) is a measure-preserving system and ? is a finite (or countable) partition of X, then the tail field of a is -? j=-oo A proof of the following theorem is given in Rudolph A990) and Smorodinsky A971). THEOREM 7.24. Assume that (?, ?, ?, ?) is a measure-preserving system. (a) If a is a finite partition ofX, then h(a, T) = 0 if and only if а С ??-?,. (b) If a and ? are finite partitions ofX such that а С ??, then h(T, a) = 0. Given that there are numerous interesting examples of transformations with entropy zero (in fact, the set of entropy zero transformations is residual with respect to the weak
Ergodic theory 1223 topology), we may ask whether the concept of entropy can be successfully employed to study such systems. The most natural approach is here to replace the rate и by a 4-' slower rate a„ and to examine the behavior of the sequence {?(?^ ')/an}n6pj instead of {#(orQ-l)/n}„6pj. Unfortunately, it can be shown (see Blume A997)) that for any aperiodic measure-preserving system (that is for any system for which the set of periodic points has measure zero) and any positive monotone increasing sequence {a„}„6N with Hindoo ?,,/и = 0 the supremum over all finite partitions of the limit inferior of the sequence {#(??~')/?„}„6? is always 00,so that a meaningful invariant directly analogous to h(T) cannot be obtained in this way. However, if instead of taking the supremum, we look for lower bounds of entropy convergence rates, then we have the following nontrivial result: THEOREM 7.25 (Blume, 1998). Assume that ? is a completely ergodic (i.e., Tk is ergodic for all к e Z) measure-preserving transformation on a Lebesgue probability space (?,?,?). (a) There exists a positive concave function g:[l,oo[—» Ш (depending on T) with /1 g(x)/x dx = 00 such that for all nontrivial finite partitions a of X we have lim sup ———- > 1. «^00 g(\og2n) Note: A slightly weaker, but more general version of this statement for aperiodic systems was given by Blume A997). (b) There exists a positive monotone increasing sequence {ап}„ещ (depending on T) with lim„^oo an = 00, such that for all nontrivial finite partitions a of X we have liminf v ° ; > 1. «->¦ oo a„ Another possibility for generalizing the concept of entropy is to consider refinements of partitions along subsequences. This leads us to the notion of sequence entropy: Dehnition 7.26. Assume that (?,?,?,?) is a measure-preserving system and let A = [кп}пещ be a sequence of integers. Then, for all countable partitions ? of X we set hA(a, r):=limsup vv'~' >-, n—>зс П and the sequence entropy of ? is defined as hA(T):= sup hA(a,T). [a\H(a)<oo] Dekking A980) successfully used sequence entropy as an isomorphism invariant to distinguish between an ergodic transformation and its inverse. The following theorem lists a number of interesting properties of sequence entropy (see Kushnirenko A967), Newton A970) and Saleski A977)):
1224 F. Blume THEOREM 7.27. Assume that (?, ?, ?, ?) is a measure-preserving system. (a) If {a„}„6pj is an increasing generating sequence of finite partitions ofX, then for all sequences А С N we have йд(Г) = lim йд(а„, Г). и—>oo (b) If A = {к„]„ем is relatively dense (see Definition 4.2), then Ид(Т) = h(T)\imsup—. I!-» ОС П (c) (X, ?, ?, ?) is strongly mixing if and only if supAcB йл(Г, а) = H(a) for all positive increasing sequences of integers В and all a with ? (a) < oo. (d) (?, ?, ?, ?) is weakly mixing if and only if supA йл(Г, a) = ? (a) for all a with H(a) < oo. We will now turn to the concept of topological entropy and the variational principle. We assume that ? is a homeomorphism on a compact metric space X, so that (X,T) is a cascade. A collection U of nonempty subsets of X is said to be an open cover of X if each set in U is open and U(y6W^ = %¦ An open cover V is a refinement of U if every element of V is a subset of an element in U. In this case we write V > U. By N(U) we denote the minimal cardinality of subcovers of U (which is finite, because X is compact), that is N{U) :=min{card(W): WcWis an open cover of X}, and we define the entropy oiU as H(U) := log2 N(U). The refinement of two open covers U and V is WvV:=(№: W = U ? V ? 0 for some i/eWandVeV) As in the case of partitions of a probability space, we denote by U"n the refinements of an open cover under ?, i.e., ? U'^x := T~mU ? · · · ? T~"U = \J T~kU. k—m Using the easily verifiable inequality H(UvV)^ H(U) + H(V), it can be shown that the limit h(U,T):= lim v ° ; exists and that h(U, ?) ?, H(U).
Ergodic theory 1225 Dehnition 7.28. If (X, T) is a cascade, then the topological entropy of ? is defined to be htop(T):= sup h(U,T). и An alternative but equivalent definition of topological entropy that makes explicit use of the metric structure of X was given by Bowen A971). By contrast, the definition of topological entropy in Definition 7.28, due to Adler et al. A965), applies also to the case that X is a compact Hausdorff space. In analogy to Theorem 7.17 the following result is useful in computing topological entropy: THEOREM 7.29. If {Ып}п(,ц is an increasing generating sequence of open covers (i.e., U„ ^ U„+\ and for every open cover V there exists an ? e N such that V $5 U„), then htop(T) = lim h(U„,T). The variational principle as stated below establishes a very important link between topological entropy and measure-theoretic entropy (see Goodwyn A969, 1972), Dinaburg A970) and Goodman A971)). It allows us to deduce many properties of topological entropy from the corresponding properties of measure-theoretic entropy. THEOREM 7.30 (Variational principle). Assume that (X, T) is a cascade. For a T- invariant Borel probability measure ? on X (see Remark 3.8), we denote by h?(T) the measure-theoretic entropy of the measure-preserving transformation ? with respect to ?. Then hiop(U) = sup{/?M(r): ? is a T-invariant Borel probability measure on X}. The last subject in this section is a discussion of the relation between entropy and Kolmogorov complexity. The definition of the algorithmic complexity of a finite 01-name, that is a finite sequence in the 'letters' 0 and 1, is based on the concept of Turing machines (for a definition of Turing machines see Jones A973) or White A991)). For a finite alphabet A := {1,..., m] we define A* : = U,?L| л" ¦ With the appropriate coding (see White A991)) a Turing machine ? can be regarded as a function that outputs elements s e A* on an input r e {0, 1}* via a certain algorithm. The complexity K\j(s) of a name s e A* with respect to ? is defined to be the minimal length of all 01-names r e {0, 1}* which produce s when used as input for M. If there is no such input r, then Km(s) := oo. More precisely, if we denote by l(r) the length of a 01-name s e {0, 1 }*, then KM(s)=\m{n{l{r)- геГ'М1 ifM-'{i}^0, loo ifM-'{5}=0. For any ? e Аг we denote by ?"? the name w(m)w(m + 1)· ·?(?). If (X, T) is a cascade and U = {U\,..., Um] is a finite open cover of X, then for ? e X we define <pu(x) := {? e Az: T"x e ?/?(,„ for all neZ]
1226 F. Blume and we set Km (?'? ) sup-?"?(jc,6/, T) := limsup min —^—-, п-юо и>ефц(х) П in{-KM(x,U,T):=hminf min ?^° '. п-юс 0;ефи(х) П Since any universal Turing machine N is asymptotically optimal in the sense that for every Turing machine ? there exists a constant См such that K^(s) ^ Km(s) + См for all s e Л* (see White A991)), it follows that for any pair of universal Turing machines N\ and N2 we have sup-A";v,(*,?/, T) = sup-KN^(x,U,T) and inf-AJv,(-*,?</, T) = inf-A"yv2 (x ,14, T). This observation allows us to define sup-K(x,U,T) :=sup-Kn(x,U,T) and mf-K(x,U, T) := sup-KN(x,U, T), where N is an arbitrary universal Turing machine. Now we set sup-K(x,T) :=sup[sup~K(x,U,T): 14 is an open cover of X} and inf-A"(;c, T) := sup{inf-A"(;c, 14, T): 14 is an open cover of X}. The following theorem shows that the algorithmic complexity of trajectories in ergodic measure-preserving systems is almost surely equal to the entropy of the system, so that once again we are able to derive global information about a system from a single sample path. THEOREM 7.31 (Brudno A987), White A991, 1993)). If (?, ?) is a cascade and ? is a ?-invariant Borel probability measure on X, then sup K(x,T) = inf K(x,T) = hu(T) a.e. 8. Constructions in ergodic theory Products. If (?,?,?,?) and (Y,C,v,S) are measure-preserving systems, then the product map ? ? S defined via the equation ? xS(x,y):=(Tx,Sy) is measure-preserving on the product space (? ? ?, ? <%> С, ? ? ?).
Ergodic theory 1227 Skew products. Assume that (?, ?, ?, ?) is a measure-preserving system and {(Sx, Y, C, ?): ? e X} is a family of measure-preserving systems such that the map (x,y)h+ Sxy is measurable from ? ? У to У (with respect to the product measure ? ? ? on ? ? ?). Then the map г:ХхУ^ХхУ, (x,y)^(Tx,Sxy), is a measure-preserving transformation on (? ? ?, В <g> С, ? ? ?). Note: The ergodicity of skew products has been discussed by Anzai A951) for the special case that ? = G is a topological group with Haar measure v, and Sx is right-multiplication with an element f(x) e G. A formula for the entropy of skew products was found by Abramov and Rokhlin A962): For a finite partition ? of ?, ? e N and ? e X we define ?'1(?) := S-'?? S-'S-'J ? ··· ? S-*S^ ¦ ¦ ¦ S.,^, oo ?\°(?)·-=\/?"(?) and M0,S):= f Я(^|^(дс))^. ./? With these definitions it can be shown that h(r) = h(T) + sup{hT(fi, S): ? is a finite partition of ?}. Cutting-and-stacking and infinite interval exchanges. We already encountered one example of a cutting-and-stacking procedure in Example 5.5(e), but we will now discuss how in fact almost every measure-preserving system can be constructed in a similar way. To give an exact definition of a cutting-and-stacking procedure (see Arnoux et al. A985)) is somewhat cumbersome and we will therefore be content to give an informal description. At any stage in a cutting-and-stacking procedure, we are given finitely many towers of finitely many intervals (in the special case of Example 5.5(e) the number of towers was equal to one at every stage in the process). The intervals in a given tower have equal length, but the length of the intervals may vary between different towers. We proceed to cut each of these towers into finitely many subtowers (the number of subtowers may be different for different towers), add possibly a finite number of spacers (see Example 5.5(e)) on top of each subtower and stack up subtowers of equal width to obtain a new collection of finitely many towers. The measure-preserving transformation corresponding to such a cutting-and-stacking process maps each level in a given tower onto the next higher level. Furthermore, a cutting and stacking process can always be arranged in such a way that the union over all stages in the process of all the levels in the towers is equal to [0, 1[. It can be shown that every aperiodic measure-preserving system (i.e., the measure of periodic points under ? is equal to zero) is isomorphic to a measure-preserving transformation on [0, 1 [ (with respect to Lebesgue measure) obtained from a cutting-and-stacking process as described above. Moreover, it is not difficult to
1228 F.BInme see that every measure-preserving transformation obtained from a cutting-and-stacking process can be represented as an infinite interval exchange. Thus we obtain the following theorem: THEOREM 8.1 (Arnoux et al., 1985). Any aperiodic measure-preserving transformation on a probability space is isomorphic to an interval exchange transformation ? on [0, 1 [, that is a map ? : [0, 1 [ -> [0, 1 [ which satisfies the following properties: (a) There exists a strictly increasing sequence {t„}„ещ С [0, 1 [ and a sequence {а,,} С К such that to = 0, lim,,^^ t„ = 1 and T(x) =x + a„ for all ? e /„ := [?„_ ?, t„ [. (b) Г(/„)с[0, \[forallneN. (c) ? is bijective. Adic transformations. An interesting combinatorial approach for representing cutting- and-stacking procedures (that is aperiodic measure-preserving transformations) has been introduced by Vershik A989, 1991). The so-called adic transformations are defined on the set X of all infinite paths in a graph which has the following characteristics: (a) The graph consists of infinitely many levels with a finite number of vertices in each level. (b) The vertices in each level are connected only to vertices in the levels directly above and below. (c) In each level the vertices are ordered from left to right. A ?-algebra on X is naturally generated by the cylinder sets of infinite paths. Furthermore, we introduce a partial order on X such that two paths x = (x\,x2,X7,,...) and у = (у\,у2,уз,..-) are comparable if there is an ? e N such that ? к = у к for all к > п. For any pair of comparable paths ? and у we say that ? < у if xn < y„ for the largest integer ? for which xn ? yn. For a given nonmaximal path ? we define ? ? to the smallest path у which is larger than x. In order for ? to represent a cutting and stacking process, it is necessary to remove from X a countable set of minimal and maximal elements (for details see also Vershik and Livshits A992)). To illustrate how a cutting- and-stacking procedure can be represented by an adic transformation, we consider an example: Cutting-and-stacking procedure: ~J ~1 t о t
Ergodic theory 1229 Equivalent adic transformation: The measure of a cylinder set С = {x e X \ x\ = a\,...,x„ = an} is here 4r· Furthermore, the 2" cylinder sets of the form {x e ? | ?\ = a\,..., x„ = a,,} represent the 2" levels obtained in the «th step of the cutting-and-stacking procedure. Rokhlin towers. The usefulness of the so-called Rokhlin tower decomposition for the efficient formulation of proofs and as a basis for various constructions in ergodic theory depends on the following theorem: THEOREM 8.2 (Rokhlin lemma (Rokhlin, 1948)). If (?, ?, ?, ?) is an aperiodic measure- preserving system, then for any ? > 0 and any ? e N there is a set A e В such that the sets А, ТА,..., Г"-1 A are pairwise disjoint and the measure of their union is greater than 1-?. Remark 8.3. The Rokhlin Lemma is a consequence of the more general Backward Vitali Lemma (see Rudolph A990)) which is useful for example in the proof of the pointwise ergodic theorem and in the proof of Theorem 7.13. Induced transformations. If (?,?, ?, ?) is a measure-preserving system and A e В with ? (A) > 0, then according to the Poincare Recurrence Theorem, for almost all ? e A there exists an ? e N such that T"x e A. Thus, идС*) := ???{? e ?: ?"? e A] is well defined for almost all ? e A and this allows us to define a map Та on A (up to a set of measure zero) via the equation ТАх:=Т"л(х)х. It is easy to see that with дд := ?/?(?) and Ba ¦= {В П А \ В е В} the system (А, Вл, мл, Та) is measure-preserving. The entropy of an induced transformation is given by the formula h(T) h(TA)= -^ (Abramov, 1959). ?(?) For a given measure-preserving system, it is possible to topologize the space ?4? '¦= {Та: A e ?, ?{?) > 0} of all induced transformations via the metric d(TA,Te) : = ?(???). Del Junco and Rudolph A996) showed that for an ergodic measure-preserving
1230 F. Blume system (?,?,?, ?) the set of all weakly mixing induced transformations is residual in Mt and if (?, ?, ?, ?) has positive entropy, then the set of induced A'-automorphisms is residual in Mt as well. Factors. If (?,?,?,?) and (Y, C, v, S) are measure-preserving systems, then a map 0 : X —» У is called a homomorphism or factor map if (a) ф-^СсВ, (b) фТ = Бф а.е., and (c) ?@-'?) = v(A)forall AeC. If ? is a factor map, then (Y,C, v, S) is said to be a factor of (?, ?, ?, ?). Factor maps play a prominent role in the theory of joinings (see below) and are therefore also useful in Ornstein theory, because as shown in Rudolph A990) joinings can be used to prove Ornstein's isomorphism theorem. Joinings and disjointness. If X = (?,?,?,?) and ? = (Y,C,v,S) are measure- preserving systems, then a ? ? S'-invariant complete probability measure ? on ? ? У is said to be a joining of X and ? if for all A e В and В е С we have (a) A x В is measurable with respect to ?, and (b) ?(? ? ?) = ?(?) and ?(? ? ?) = ?(?). To discuss the connection between joinings and factor maps, let us assume that X and ? are factors of a measure-preserving system (?,?,?,??) with factor maps ?: ? -> X and ? : ? -> У. If we restrict ? to the smallest ?-algebra containing ?~] ?? ?~]0, then the countably additive set function ?(? ? ?):=?(?-^??)?-^?) has a unique extension to a joining of X and ? (see Rudolph A990)). On the convex space J(X, ?) of all joinings of X and ? (convexity means here that pk\ + A - ?)?2 e J(X, ?) for all ?,, ?2 e J(X, ?) and all pe[0,l])a metric d can be defined (see Rudolph A990)) such that (a) (J(X, Y), d) is compact, and (b) for any sequence {?,,} с J(X, ?) and any ? e J(X, Y) we have lim <*(?„,?) = 0 n—>oo if and only if \??\„^??„(? ? ?)=?(? ? ?) for all A e В and В е С. THEOREM 8.4. IfX = (?, ?, ?, ?) and ? = (?, С, ?, 5") are ergodic measure-preserving systems, then ? is an ergodic joining of X and ? if and only if ? is an extreme point of J(X,Y) (see Rudolph A990)). The space J(X, Y) is never empty, because we always have ? ? ? e J (?, ?). If ? ? ? is the only joining of X and ?, then we say that X and ? are disjoint. The concept of disjointness allows for the following interesting characterizations of ergodicity, weak mixing and the A"-property:
Ergodic theory 1231 THEOREM 8.5. Assume that X = (?, ?, ?, ?) is a measure-preserving system. (a) X is ergodic if and only if X is disjoint from any identity map on a Lebesgue space. (b) X is weakly mixing if and only if X is disjoint from all isometries on compact metric spaces. (c) ? is а К-automorphism if and only if X is disjoint from any ergodic system (Y, C, v, 5") for which h(S) = 0. For the case of self-joinings, that is joinings of a system with itself, the space J(X, X) contains always (in addition to the product measure ? ? ?) the diagonal joining ?0(? ? ?):=?(???) and the off-diagonal joinings ?„(?? ?):=?(???"?). (Note: The study of self-joinings was initiated by Rudolph A979).) We say that X has two-fold minimal self-joinings if У (X, X) is the convex hull of {? ? ?) U {?„}„6?. THEOREM 8.6. If ? = (?,?,?,?) is completely ergodic (i.e., T" is ergodic for all ? e Z) and has two-fold minimal self-joinings then ? commutes only with its powers T" and X has no nontrivial factors. In generalizing the concept of self-joinings we may also consider multiple self-joinings and in particular off-diagonal joinings supported on graphs of the form {(T"]x,..., T"kx) \ ? & X]. Partitioning the index set {1,..., k] into disjoint subsets, each set in the partition determines itself an off-diagonal measure, and the directproduct of the resulting measures is called a product of off-diagonal joinings. We say that X has к-fold minimal self-joinings if all k-fo\d multiple self-joinings of X are in the convex hull of the products of off- diagonal joinings. Del Junco et al. A980) showed that the Chacon transformation as defined in Example 5.5(e) has minimal self-joinings of all orders. Given the statement of Theorem 8.6, this implies that the Chacon transformation has no nontrivial factors. Furthermore, Glasner et al. A992) showed that any system with 3-fold minimal self joinings has minimal self-joinings of all orders. For proofs of the results in this section, applications of joinings to Ornstein theory and further references, the reader is referred to Rudolph A990) and Thouvenot A995). References Abramov. L.M. A959). Entropy of induced automorphisms. Dokl. Akad. Nauk SSSR 128. 647-50. Abramov, L.M. and Rokhlin, V.A. A962), The entropy of a skew product of measure-preserving transformations. Vestnik Leningrad Univ., No. 17, 5-13. Amer. Math. Soc. Transl. Ser. 2. Vol. 48 A965), 225-65. Adler, R.L., Kohnheim, A.G. and McAndrew. M.H. A965). Topological entropy. Trans. Amer. Math. Soc. 114, 309-19. Anzai, H. A951), Ergodic skew product transformations on the torus. Osaka Math. J. 3. 83-99.
1232 F. Blume Arnoux, P., Omstein, D.S. and Weiss. B. A985). Cutting and stacking, interval exchanges and geometric models, Israel J. Math. 50, 160-168. Assani, I. A993), The helical transform and the a.e. convergence of Fourier series, Illinois J. Math. 37, 123-146. Assani, I. and Petersen, K. A992), The helical transform as a connection between ergodic theory and harmonic analysis. Trans. Amer. Math. Soc. 331, 131-142. Bergelson, V. A987), Polynomial ergodic theorems, Ergodic Theory Dynam. Systems 7. 337-349. Bergelson, V., Furstenberg, H. and McCutcheon, R. A996). IP-sets and polynomial recurrence, Ergodic Theory Dynam. Systems 16, 963-974. Bergelson, V., Komfeld, I. and Mityagin, B. A993), Unitary I}1-actions with continuous spectrum. Proc. Amer. Math. Soc. 119, 1127-1134. Bergelson. V. and Leibman, A. A996). Polynomial extensions of van der Waerden's and Szemeredi's theorems. J. Amer. Math. Soc. 9. 725-753. Birkhoff. G.D. A931). Proof of the ergodic theorem, Proc. Nat. Acad. Sci. USA 17. 656-660. Blum. J.R. and Hanson, D.L. A960). On the mean ergodic theorem for subsequences. Bull. Amer. Math. Soc. 66. 308-311. Blume. F. A997), Possible rates of entropy convergence. Ergodic Theory Dynam. Systems. 17. 45-70. Blume, F. A998), Minimal Rates of entropy convergence for completely ergodic systems. Israel J. Math. 108, 1-12. Bourgain. J. A988a). On the maximal ergodic theorem for certain subsets of the integers. Israel J. Math. 61. 39-72. Bourgain, J. A988b). On the pointwise ergodic theorem on L1' for arithmetic sets. Israel J. Math. 61. 73-84. Bourgain. J. A989). Pointwise ergodic theorems for arithmetic sets (Appendix: The return time theorem). Publ. Math. IHES 69. 5^45. Bowen, R. A971), Entropy for group endomorphisms and homogeneous spaces. Trans. Amer. Math. Soc. 153, 401^414. Breiman, L. A957). The individual ergodic theorem of information theory, Ann. Math. Stat. 28. 809-11. Correction, ibid. 31 A960). 809-810. Brudno. A. A. A987), Entropy and the complexity of the trajectories of a dynamical system. Trans. Moscow Math. Soc. 2, 127-151. Campbell, J.T. and Petersen. K. A989). The spectral measure and Hilbert transform of a measure-preserving transformation. Trans. Amer. Math. Soc. 313. 121-130. Chacon. R.V. A969). Weakly mixing transformations which are not strongly mixing, Proc. Amer. Math. Soc. 22, 559-562. Chacon, R.V. and Omstein. D.S. A960). A general ergodic theorem, Illinois J. Math. 4. 153-160. Choe, G.H. A993), Weakly mixing interval exchange transformations. Math. Japon. 38 D). 727-734. Conze, J.-P. and Lesigne, E. A984), Theoremes ergodique pour des mesures diagonales, Bull. Soc. Math. France 112, 143-175. Conze, J.-P. and Lesigne, E. A988), Sur un theoreme ergodique pour des mesures diagonales. С R. Acad. Sci. Paris, Ser. 1306, 491^493. Comfeld, I.P., Fomin, S.V. and Sinai. Y.G. A982), Ergodic Theory. Springer, New York. Cotlar. M. A955). Л unified theory of Hilbert transforms and ergodic theorems. Rev. Mat. Cuyana 1. 105-167. Dekking F.M. A980), Some examples of sequence entropy as an isomorphism invariant. Trans. Amer. Math. Soc. 259A). 167-183. Dinaburg. E.I. A970). The relation between topological entropy and metric entropy. Dokl. Akad. Nauk SSSR 190. 19-22. Soviet Math. Dokl. 11. 13-16. Ferenczi. S. A990), Systemes de rangfini. These de doctorat. Universite d'Aix-Marseille. Friedman. N.A. A970). Introduction to Ergodic Theory. Van Nostrand Reinhold Studies. No. 29. New York. Friedman, N.A. and Omstein. D.S. A970). On isomorphisms of weak Bernoulli transformations. Adv. Math. 5. 365-394. Furstenberg. H. A960). Stationary Processes and Prediction Theory. Ann. Math. Studies, Vol. 44. Princeton University Press, Princeton. NJ. Furstenberg. H. A977), Ergodic behavior of diagonal measures and a theorem of Szemere'di on arithmetic progressions. J. Anal. Math. 31. 204-256.
Ergodic theory 1233 Furstenberg. H. A981). Recurrence in Ergodic Theory and Combinatorial Number Theory, Princeton University Press. Princeton, NJ. Furstenberg. H. and Katznelson. Y. A978). An ergodic Szemeredi theorem for commuting transformations, J. Anal. Math. 34. 275-291. Furstenberg. H. and Katznelson. Y. A985). An ergodic Szemeredi theorem for IP-systems and combinatorial theory, J. Anal. Math. 45. 117-168. Furstenberg. H. and Katznelson, ? A986). ??,-sets, Szemeredi's theorem and Ramsey theory. Bull. Amer. Math. Soc. 14 B), 275-278. Furstenberg. H. and Katznelson. ? A990), Nonconventional ergodic averages. The Legacy of John von Neumann. Proc. Sympos. Pure Math.. Vol. 50. Amer. Math. Soc.. Providence. Rl. 43-56. Furstenberg. H. and Katznelson, ? A991), ? density version of the Hales-Jewett theorem, J. Anal. Math. 57 B), 64-119. Furstenberg, H. and Weiss, B. A996). A mean ergodic theorem for jj ?^=? f(T"x)g(T"~ ?). Convergence in Ergodic Theory and Probability, Ohio State Univ. Math. Res. Inst. Publ.. Vol. 5. de Gruyter, Berlin, 193-227. Garsia, A.M. A965), A simple proof of E. Hopf's maximal ergodic theorem, J. Math. Mech. 14. 381-382. Glasner, E., Host, B. and Rudolph, D.J. A992). Simple systems and their higher order self-joinings. Israel J. Math. 78, 131-142. Glasner, E. and King, J.L. A998), A zero-one law for dynamical properties. Contemp. Math. 215, 231-242. Goodman, T.N.T. A971), Relating topological entmpy and measure entmpy. Bull. London Math. Soc. 3, 176— 180. Goodwyn, L.W. A969), Topological entmpy bounds measure-theoretic entmpy, Proc. Amer. Math. Soc. 23. 679- 688. Goodwyn, L.W. A972), Comparing topological entmps with measure-theoretic entmpy, Amer. J. Math. 94, 366- 388. Hahn, F.J. A965), Skew pmduct transformations and the algebras generated by exp(p(»)). Illinois J. Math. 9, 178-189. Halmos, PR. A944), In general a measure-preserving transformation is mixing, Ann. Math. 45, 786-792. Halmos, PR. and von Neumann, J. A942), Operator methods in classical mechanics II, Ann. Math. 43, 332-350. Halmos, PR. and von Neumann, J. A958), Lectures on Ergodic Theory, Chelsea, New York. Hopf, E. A954), The general temporally discrete Markov pmcess, J. Rat. Mech. Anal. 3, 13-45. Host, B. A991), Mixing of all orders and pairwise independent joinings of systems with singular spectrum, Israel J. Math. 76, 289-298. Jewett, R.I. A970), The prevalence of uniquely ergodic systems, J. Math. Mech. 19, 717-729. Jones, N.D. A973), Computability Theory: An Intmduction, Academic Press, del Junco, ?., Rahe, A.M. and Swanson, L. A980), Chacon's automorphism has minimal self-joinings, J. Anal. Math. 37, 276-284. del Junco, A. and Rudolph, D. A996). Residual behavior of induced maps. Israel J. Math. 93, 387-398. Kalikov, S.A. A984), Twofold mixing implies threefold mixing for rank one transformations, Ergodic Theory Dynam. Systems 4, 237-259. Katok, A. and Hasselblatt, B. A995), Intmduction to the Modern Theory of Dynamical Systems. Cambridge University Press, Cambridge. Keane, M. and Smorodinsky. M. A977). A class offinitary codes, Israel J. Math. 26, 352-371. Keane, M. and Smorodinsky, M. A979a), Bernoulli schemes of the same entmpy arefinitarily isomorphic, Ann. Math. 109, 397^406. Keane, M. and Smorodinsky, M. A979b). Finitary isomorphism of irreducible Markov shifts, Israel J. Math. 34, 281-286. Keane, M. and Smorodinsky, M. A979c), The finitary isomorphism theorem for Markov shifts. Bull. Amer. Math. Soc. 1, 436-438. Khintchine, A.I. A934), Eine Verscharfung des Poincareschen "Wiederkehrsatzes". Сотр. Math. 1, 177-179. Kingman, J.F.C. A968), The ergodic theory of subadditive stochastic pmcesses, J. R. Stat. Soc. Ser. В 30. 499- 510. Kingman, J.F.C. A973), Subadditive ergodic theory, Ann. Probab. 1, 883-909. Kolmogorov, A.N. A953), On dynamical systems with an integral invariant on the torus, Dokl. Akad. Nauk SSSR 93, 763-766.
1234 F. Blume Kolmogorov, A.N. A958), New metric invariants of transitive dynamical systems and automorphisms ofLebesgue spaces, Dokl. Akad. Nauk SSSR 119, 861-864. Krengel, U. A972), Weakly wandering vectors and weakly independent partitions, Trans. Amer. Math. Soc. 164, 199-206. Krieger, W. A970), On entropy and generators of measure-preserving transformations. Trans. Amer. Math. Soc. 149, 453^464. Krieger, W. A972), On unique ergodicity, Proc. Sixth Berkeley Symp. A970), Vol. 1, University of California Press, Berkeley and Los Angeles, 327-346. Kryloff, N. and Bogoliouboff, N. A937), La theorie generate de la mesure dans son application a I'etude des systemes dynamiques de la mecanique поп Imeaire, Ann. Math. 38 A), 65-113. Kushnirenko, A.G. A967), On metric invariants of entropy type. Uspekhi Mat. Nauk 22. 57-65, Russian Math. Surveys 22 A967), 53-61. Lacey, M., Petersen, K., Rudolph, D. and Wierdl, M. A994), Random ergodic theorems with universally representative sequences, Ann. Inst. H. Poincare Probab. Statist. 30 C), 353-395. McMillan, B. A953), The basic theorems of information theory, Ann. Math. Stat. 24, 196-219. von Neumann, J. A932a), Proof of the quasi-ergodic hypothesis, Proc. Nat. Acad. Sci. USA 18, 70-82. von Neumann, J. A932b), Zur Operatorenmethode in der klassischen Mechanik, Ann. Math. 33, 587-642. Newton, D. A970), On sequence entropy 1. Math. Systems Theory 4 B), 119-125. Ornstein, D.S. A970a), Bernoulli shifts with the same entropy are isomorphic. Adv. Math. 4, 337-352. Ornstein, D.S. A970b), Imbedding Bernoulli shifts inflows. Contributions to Ergodic Theory and Probability, Lecture Notes in Math., Vol. 160. Springer. New York, 178-218. Ornstein, D.S. A970c). Two Bernoulli shifts with infinite entropy are isomorphic, Adv. Math. 5, 339-348. Ornstein, D.S. A972). On the root problem in ergodic theory, Proc. Sixth Berkeley Symposium A970), University of California Press, Berkeley and Los Angeles. 345-56. Parry, W. A981), Topics in Ergodic Theory, Cambridge University Press, Cambridge. Petersen, K.E. A970), A topologically strongh mixing symbolic minimal set, Trans. Amer. Math. Soc. 148, 603- 612. Petersen, K.E. A979), The converse of the dominated ergodic theorem. J. Math. Anal. Appl. 67, 431^436. Petersen, K.E. A983), Ergodic Theory, Cambridge University Press, Cambridge. Petersen, K.E. A996), Ergodic theory and the basis of science, Synthese 108, 171-183. Petersen, K.E. and Salama, I. A995), Ergodic Theory and its Connections with Harmonic Analysis, Cambridge University Press, Cambridge. Poincare, H. A899), Les methodes nouvelles de la mecanique celeste, I A892), II A893) and III A899), Gauthier- Villars, Paris and Dover, New York, 1957. Postnikov, A.G. A966), Ergodic Problems in the Theory of Congruences and of Diophantine Approximations, Tr. Mat. Instit. Steklova 82, 3-112. Proc. Steklov Institute of Math. 82, Amer. Math. Soc., Providence, RI, 1967. Renyi, A. A958), On mixing sequences of sets. Acta Math. Acad. Sci. Hungar. 9, 215-228. Rokhlin, V.A. A948), A 'general' measure-preserving transformation is not mixing, Dokl. Akad. Nauk SSSR 60, 349-351. Rokhlin, V.A. A967), Lectures on the theory of entropy of transformations with invariant measure, Uspekhi Mat. Nauk 22, 3-56, Russ. Math. Surveys 22, 1-52. Rokhlin, V.A. and Sinai, Ya.G. A961), Construction and properties of invariant measurable partitions, Dokl. Akad. Nauk SSSR 141, 1038-1041, Sov. Math. 2A966), 1611-1614. Rosenblatt, J. and Wierdl, M. A995), Pointwise ergodic theorems via harmonic analysis, Ergodic Theory and its Connections with Harmonic Analysis, London Math. Soc. Lecture Note Ser., Vol. 205, Cambridge University Press, Cambridge, 3-151. Rudin, W. A987), Real & Complex Analysis, McGraw-Hill, New York. Rudolph, DJ. A979), An example of a measure-preserving map with minimal self-joinings and applications, J. Anal. Math. 35,97-122. Rudolph, D.J. A990), Fundamentals of Measurable Dynamics, Oxford University Press, New York. Ryzhikov, V.V. A993), Joinings and multiple mixing of finite rank actions, Funktsional. Anal, i Prilozhen. 27 B), 63-78. Saleski, A. A977), Sequence entropy and mixing, J, Math. Anal. Appl. 60, 58-66. Shannon, C.E. A948), A mathematical theory of communication, Bell System Tech. J. 27, 379^423, 623-656.
Ergodic theory 1235 Shields, P.C. A996), The Ergodic Theory of Discrete Sample Paths, Graduate Studies in Math., Vol. 13, Amer. Math. Soc. Shiryayev, A.N. A984), Probability, Springer, New York. Sinai, YG. A959), Flows with finite entropy, Dokl. Akad. Nauk SSSR 125, 1200-1202. Smorodinsky, M. A971), Ergodic Theory; Entropy, Lecture Notes in Math., Vol. 214, Springer, New York. Szemeredi, E. A975), On sets of integers containing no к elements in arithmetic progression. Acta Anth. 27, 199-245. Thouvenot, J.-P. A995), Some properties and applications of joinings in ergodic theory, Ergodic Theory and its Connections with Harmonic Analysis, Cambridge University Press, Cambridge. Ulam, S.M. and von Neumann, J. A945), Random ergodic theorems. Bull. Amer. Math. Soc. 51, 660. Varga, R.S. A962), Matrix Iterative Analysis, Prentice-Hall, Englewood Cliffs, NJ. Vershik A. A989), A new model for ergodic transformations, Banach Center Publ., Vol. 23, 381-384. Vershik A. A991), The Fibadic expansion of real numbers and adic transformation. Report no. 4, 1991/92, Inst. Mittag-Lefler. Vershik, A.M. and Livshits, A.N. A992), Adic models of ergodic transformations, spectral theory, substitutions and related topics, Adv. Soviet Math. 9, 185-204. van der Waerden, B.L. A927), Beweis einer Baudet'schen Vermutung, Nieuw. Arch. Wisk. 15, 212-216. Walters, P. A975), Ergodic Theory - Introductory Lectures, Lecture Notes in Math., Vol. 458, Springer, Berlin. Weyl, H. A910), Uber die Gibbs'sche Erscheinung und verwandte Konvergenzphanomene, Rend. Circ. Mat. Palermo 330, 337^407. Weyl, H. A916), Uber die Gleichverteilung von Zahlen mod. eins. Math. Ann. 77, 313-352. White, H.S. A991), On the Algorithmic Complexity of the Trajectories of Points in Dynamical Systems, Doctoral Dissertation, University of North Carolina at Chapel Hill. White, H.S. A993), Algorithmic complexity of points in dynamical systems, Ergodic Theory Dynam. Systems 13, 807-830. Wiener, N. A939), The ergodic theorem, Duke Math. J. 5, 1-18. Wiener, N. and Wintner, A. A941), Harmonic analysis and ergodic theory, Amer. J. Math. 63, 415^426. Yosida, K. and Kakutani, S. A939), Birkhoff's ergodic theorem and the maximal ergodic theorem, Proc. Japan Acad. 15, 165-168.
CHAPTER 30 Generalized Derivatives* Endre Pap Institute of Mathematics, University of Novi Sad, Trg D. Obradovica 4, 21000 Novi Sad, Yugoslavia E-mail: pape@eunet.yu, pap@im.ns.ac.yu Arpad Takaci Institute of Mathematics, University of Novi Sad, Trg D. Obradovica 4,21000 Novi Sad, Yugoslavia E-mail: takacsi@eunet.yu, takaci@im.ns.ac.yu Contents Introduction 1239 1. Sobolev spaces and derivatives 1239 1.1. Introduction 1239 1.2. Sobolev spaces 1242 1.3. Imbedding theorems and traces of functions '244 2. Distributional derivatives 1246 2.1. Introduction 1246 2.2. Distribution spaces 1248 2.3. Fundamental solutions '251 3. The Mikusinski operational calculus '254 3.1. Introduction 1254 3.2. Operators 1254 References 1259 *The authors want to thank for the partial financial support of the Project in the Fields of Basic Research "Mathematical models of nonlinearity, uncertainty and decision" A866) supported by the Ministry of Science, Technology and Development of Serbia. HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1237
This page intentionally left blank
Generalized derivatives 1239 Introduction Many problems with partial differential equations (PDEs) appearing in applications cannot have classical solutions, and this fact forced research throughout the twentieth century on new concepts of the solutions and their differentiability. At the same time, the classical spaces of functions became insufficient and inadequate for such new, "generalized" solutions. These new theories developed in the thirties of the last century, mainly in the works of S.L. Sobolev and K. Friedrichs, and one can say that they have been intensively developing ever since. In this chapter we shall expose three types of generalized derivatives, namely that of Sobolev type, distributional one, and the one appearing in the Mikusinski operational calculus. Of course, some elements from the theory of Sobolev and distribution spaces, as well as the construction of the Mikusinski operator field, will also be given. Characteristic applications to PDEs will be demonstrated. 1. Sobolev spaces and derivatives 1.1. Introduction The notion of generalized derivative, closely connected to the notion of generalized solution of differential equations, was introduced in the thirties by Sobolev A935, 1936), Friedrichs A934), though some ideas in that direction appeared much earlier, e.g., in B. Levi's paper from 1906, In order to give an idea how to introduce a generalized derivative of a (say) locally integrable function, let us start with a partial integration. If / and ? are continuously differentiable complex valued functions on E, the latter with compact support, i.e., ?{?) = 0 outside some closed bounded interval [—?, ?], then it holds f f'(x)^(x~)dx = ? Г(хЩх~)ах = /(?)??\"_?- ? f{x)V<S)dx JR J-a J-a = - f f{x)V<7)dx. A) Recall that the support of a continuous function ?:?->?, where ? с К" is a region, denoted by supp<p, is the closed set defined by supp<p = {x ? ?: ?(?) ^0}. In view of the usual scalar product (·, ·) of complex valued functions from i--(R), given by (/?, /2) := f /? (x)Mx)dx, /?, /2 e L2(R), JR the left- and the right-hand side of A) can be written as (/'.*>) = -(/,*>'). B)
1240 ?. Pap and A. Takaci Now if / belongs to the Hubert space L2(R), then it might not have a derivative (in the usual sense) at each point ? e R. However, there might exist a function g e L~(R), such that for each ? e C0' (R) (the vector space of continuously differentiable functions with compact support) it holds (*.*>) = -(/,*>')· C) Then g e L2(R) from C) is the generalized first derivative (in the Sobolev sense) of /eL2(R). Example 1.1. The function /, given by, 0 for ? < 0, ? for ? ^ 0, has no (classical) derivative at the point ? — 0. However, the Heaviside function #, given by ? 0 for ? < 0, A for jc > 0, is the generalized derivative on R of the function /. In order to show this, note that for an arbitrary but fixed ? e Cq (R), there exists an R > 0 such that ?(?) = 0 for ? ^ R, implying ?'(?) = 0 for every ? ^ R. Therefore /¦+OC rR „ /¦« (/>')=/ f(x)<p'(x)dx= ??'(?)??=??(?)\0- \-?(?)?? J-oc J0 JO (R = - / ?{?)??. Jo On the other hand, we have /+oc ^ К ?{?)?{?)?? = ? ?(?)??. -oo ^0 The preceding two equalities imply that for every ? e C, 1 (/(jt)i!>'W + i/W*>W)^ = 0, which means that the generalized derivative of the function / exists on R, and is equal to H. Further on, ? will always denote a region, i.e., a connected open set in R", and a will denote an и-tuple (ot\, or,,) e NQ!; by definition, we put |a| := a\ +aj-\ +<*„.
Generalized derivatives 1241 Recall that C*(i2), к e No, is the vector space of functions ?: ? -> С such that ?(?), 0 ^ |?| ^ ?, is a continuous function on ?, while ?^(?) is a proper subset of &{?), whose elements ? have their supports contained in a compact set К с ? (depending on ?). Analogously to B), the ath generalized derivative (in the Sobolev sense) of / e L2(i2), for a = (??,..., an) e NQ! and ? a region in R", is defined as follows. ????????? 1.2. The function f(a) e L2(i2) is the ath generalized derivative (of order |or|) on the region ? of a function / e L2(i2) if for every function ? e C(, (?) it holds f /WD°^j^ = (-l)^ f /«"(*M^. Together with the notation /@,) for generalized derivative, one can use also the usual notations for classical derivatives, such as Da f or Э^/ДЭ^ ¦ ¦ ¦ dxna"), provided one distinguishes the two derivatives. Of course, if / e С1 (?), then the generalized derivative /(o?) and the corresponding classical one, Da f, are equal. Whenever the ath generalized derivative fia) of a function / e L2(i2) exists, then it is unique and is independent of the order of differentiation. It also holds that if all first order generalized derivatives of a function / e L2(i2) exist and are zero, then this function is equal to a constant almost everywhere. Note that not every function from L-(i2) has a generalized derivative in the sense of Definition 1.2. For instance, the function f{x\, xi) = sgn*| does not have the first order generalized derivative /"*' for a = A,0), i.e., the generalized partial derivative in jc ?, on the unit disc 6@, l) = (j: = (xi,j:i)el2; |*| < 1}. Next we give another "counterexample". Example 1.3. The function /, defined on the unit disc ?@, 1) by Ia for ? ? + jc2 < 1, ?-> > 0, b for JC]" + JC, < 1, ?2 < 0, has first order generalized derivatives in jci both on the upper and on the lower open semidisc (i.e., for X2 > 0 and for xi < 0), but for ? ? b it does not have a generalized derivative, with respect to jc2, on the whole unit disc. It is a well known fact that whenever a function / e L2(i2) has a classical derivative Da f on ?, ? = (??,..., a„), then it certainly has every lower order derivative D^ f, where ? = (?\,..., ??) and /S/ ^ aj, 1 ^ j ^ ?. However, the following example shows that things are different with the generalized derivative in the sense of Definition 1.2, giving us an essential difference between the classical and generalized derivatives. Example 1.4. The function /: ?@, 1) -> ?, defined by /Cx) = sgn.xi + sgn*2,
1242 ?. Pap and A. Takaci does not have first order generalized derivatives df/dx\ and df/dx2 on 6@, 1). However, it does have the second order mixed generalized derivative 9 f /{dx\dxi) on 6@, 1); in fact, we prove next that the last generalized derivative on ?@, 1) exists and is equal to 0. To that end, we have for every ? e Cq(S@, 1)) \ /(¦*)„ „ dx JB@.\) дХ\дХ2 dx = I sgn*i dx + I sgn*->-—-—dx JB@,l) дх\дх2 JB@,\) дх\дх2 JB@.l)n|.r|.r,<0| дХ[дХ2 7?@.?)?|.?|.?|>0| дх\дХ2 -[ ^dx+( ^dx J В@.\)П[.х\х2<0\ дх\дХ2 JB@. |)П|.г|.г><0| дх\дХ2 = 0 = / 0-?(?)??. ?@.|) A necessary and sufficient condition for a function / e L2]a, b[ to have a generalized derivative is that it is absolutely continuous on ]a, b[ and, moreover, that f'eL ]a, b[. The last supposition of the existence of the derivative /' does not imply the absolute continuity of the function /. For example, the function /, given by ijt2sin;t-2 for0<;t<l, 10 for ? = 0, has a derivative on the interval [0, 1], but /'? L']0, 1[ and so /' i L2]0, 1[. Note that the fact that a function / has a generalized derivative /' e L' ]0, 1 [, still does not always imply the absolute continuity of /. In the upper analysis, instead of L2(Q), one can take the (larger) space L|-oc(i2), whose elements are square integrable on every compact set contained in ?. 1.2. Sobolev spaces As before, ? will always denote a region in Ш", i.e., a connected open set in Ш". Dehnition 1.5. Let ? be a non-negative integer. The Sobolev space Wk№) consists of all functions / e L2(i2) having a generalized derivative /@° e L2(Q) for every a, 0<:\a\<:k. Note that for к = 0 we simply have \?°(?) = L2(Q).
Generalized derivatives 1243 The scalar product on ??<(?), defined by (f\g)w4v)--= ? (/""? = ? / f'w'g^Wdx, induces the norm 1/2 jJfa\x)\Jdx] \/\\*4?) = (? Jjfia)w\2dx) which turns W*(i2) into a separable Hubert space. Next we define another class of spaces; historically, it was an alternative (though equivalent!) approach to Sobolev spaces. Dehnition 1.6. Let A: be a non-negative integer. The space ??<(?) is the completion of '(?)· the space C*(i2) in the norm || · || wk,ny In other words, ??<(?) is the set of all limit points ofCauchy sequences in C*(i2) in the norm || · ?1??*(?)· Surprisingly enough, only in 1964 it was proved by Meyers and Serrin, see Adams A975), ??<(?) = ??<(?). This important result can be stated also as follows. THEOREM 1.7. If ? is a bounded region, then C*(i2) = W*(i2), ik=0, 1,2,..., where the closure ?/??<(?) is taken with respect to the norm \\ ¦ || Wk,?). Slightly changing Definition 1.6 by replacing the space C*(i2) with its subspace C?°(i2), whose elements are infinitely differentiable functions on ? having compact supports contained in ?, we come to the following definition. Dehnition 1.8. Let ? be a non-negative integer. The Sobolev space Wk№) is defined by ??(?) = ^(?) (* = 0,1,2,...), where the closure of C^°(i2) is taken with respect to the norm || · II iv*tr?)·
1244 ?. Pap and A. Takaci One can prove that the space Wk(Rn) is equal to Wk(R"). In general, if R" is replaced with an arbitrary region, then this is not always true. It turns out that the segment property of a region is a sufficient condition for the equality Wk(Q) = Wk(Q). Recall that a region ? has the segment property if for every ? from the boundary of ? there exists a neighborhood Ux of ? and (another) element yv e ? such that for every ? e ? П Ux the element ? + tyx belongs to ?. Intuitively, if a continuously differentiable function / on ? equals zero on the boundary of a bounded region ?, then it should belong to the space W1 (?). In fact, this is true if the region ? is a starshaped locally quadratic bounded region. Recall, a bounded region ? is starshaped if there exists jco e ? such that for every ? > 1 the set {л:: xq + (x — xo)/k e ?) is a subset of ?. An important generalization of the Hubert space Wk (?) is the Banach space ??<·?(?), for ? ^ 1, obtained by replacing the space L2(i2) and its usual norm in Definition 1.5 by Lp(i2) and its norm. If, moreover, ? = [?] + q, for ? e N and q e ]0, 1[, then \??<?(?) is the so-called Sobolev-Slobodeckij space. Using the Fourier transformation, one can extend the space ??<(?) for negative values of k. For such generalizations, and its application to PDEs, one can consult, e.g., Adams A975), Nikol'skii A975), Triebel A978), Hormander A983-85), Wloka A987). 1.3. Imbedding theorems and traces of functions When seeking the solutions of PDEs, one often firstly finds generalized solutions; of course, whenever possible it is desirable to replace them by classical solutions, i.e., to have a certain regularity of solutions. The statements giving relations between Sobolev spaces and (more classical) spaces of continuously differentiable functions are usually called imbedding theorems. In fact, from the very first theorems of this type, obtained by S.L. Sobolev himself in the thirties, it became clear that they are well suited for the study of PDEs and applications to problems in mathematical physics. THEOREM 1.9 (The Sobolev lemma). Let ? С К" be a bounded region with the boundary 9? belonging to the class Ck. If к > s + и/2, s e No, and f e W*(i2), then there is a function g eCs(i?) such that f = g almost everywhere. One of the consequences of Theorem 1.9 is that, if ? = 1, then Wk+i ]a, b[ с Ck[a,b] for every к е No, and, moreover, the imbedding is continuous. The mutual relation between Sobolev spaces of different order is given in the following, Rellich's, theorem, see Pap et al. A997). THEOREM 1.10 (The Rellich lemma). // ? is a bounded region of R" and m > к (m,k e No), then the imbedding map of the space ?'"(?) into the space ??<(?) is a compact operator.
Generalized derivatives 1245 In general, compact mappings, in particular compact imbeddings, have numerous important applications in analysis, e.g., in the analysis of the spectra of linear elliptic partial differential operators over bounded domains. Let / e W*(#) and let ? be a subregion of a region ?'. Then the extension of the function / on ?', given by f(x) = 0 for ? e ?' \ ?, belongs to WkW). Many PDEs arising in applications can not be divided from appropriate boundary conditions, e.g., the Dirichlet problem. If a solution / of such a PDE is a sufficiently smooth function on a region ? with a sufficiently smooth boundary 8?, there are no problems about assigning values of / on 9?. However, if / belongs to a Sobolev space, then / is, in general, defined only almost everywhere. In particular, the boundary 9? is of measure zero, meaning that the value of / at the boundary is not uniquely determined. This implies that one has to precise what is the value of a generalized solution of PDE on the boundary - this important and not at all easy question leads to the notion of the trace of a function. Clearly, the trace of a continuous function /, defined on ?, on a surface S с ? should be a continuous function on S from C(i2), which coincides almost everywhere with /. The general case has to be handled with care, as will be exposed next. Let ? с Ш" (? ^ 2) be a bounded region and S с ? a locally quadratic (n — 1)- dimensional surface with the property that for every xq e S there exist regions U(xq) and V and С' -diffeomorphism ? from U onto V such that some subset of ? П U is mapped on some и-dimensional parallelepiped ? с V, and S П U is mapped on a side or union of sides of P. Then there exist constants С ? > 0 and Ci > 0 such that for every function / e С' (?) and ?, 0 < ? < 1, we have [ \f\2dS<:^- f \f(x)\2dx+C2sT f \Dif(x)\2dx. D) Js ? ?? ^\'? The preceding inequality D) can be written also in the following form {Jslfl2dS) ^Cnfllw'^ for some constant C3 > 0. Now if we define the operator Г : С' (?) -> L2(S) by Г(/) = /1эя, then one can show that there exists a unique linear and bounded extension ?: W' -> L (S) of T. The obtained value of T(f) (a function from L2(S)) is the trace of the function / e W1 (?) on the surface S, which will be denoted by /Is· One can interpret the trace of a function / e W1 (?) also in the following way. From the fact that С' (?_)_is dense in the space ?\?), it follows that there exists a sequence (//)./€N from ?](?) such that Нт^зс \\fj - f\\wi(n) =0. Then by D) \\fk~ fm\\C-(S) ^C\\\fk -/??????'(?), k'm eN·
1246 ?. Pap and A. Takaci Hence (//)y6N is a Cauchy sequence in the complete space L~(S), which gives the existence of an element /5 e L2(S) satisfying lim \\fj-fs\\L2lS)=0. Now the obtained function /5 is the trace of / e W' (?) on the surface S. 2. Distributional derivatives 2.1. Introduction The approach that led to to the derivative in the Sobolev sense was exposed in the previous section. Next, we shall see further generalization of the derivative, namely in the distributional sense. For a motivation, we shall start with two problems. Example 2.1. The problem of measurement of a physical value at a point xq; in fact, one prefers measuring its average value in a sufficiently small neighborhood of xq- For an example, let S be the density of a point located at the origin, with initial mass m = 1. If one spreads this mass throughout a central ball ?@, r) in R3 with a radius equal to r > 0, then its average density becomes fr(x) = \x\\ < r, 4лт3' 0, ?лгИ >r. Clearly for all r > 0 it holds fB(Q r) f,-(x)dx = 1. However, what about the density for r = 0? Denoting it by S — S(x), then it is natural to ask I. S(x)dx=\. E) On the other hand, this density <$(-*) is equal to +00, ? = 0, 5(jc) = , @, jc^O, which contradicts E), and, in particular, it holds fr(x) ¦/*· S(x), asr^0 + . Still, for every continuous function ? on R3 it holds / fr(x)<p(x)dx^><p@)= &{?)?(?)??, r -> 0+, F) Jb@.i) Jmy
Generalized derivatives 1247 meaning that fr -> S as r -> 0+ holds in a "weak" sense. In fact, the function ? has been replaced by afunctional denoted again by ?: <5:C(K3)^C, defined by ?(?):=?@). G) Actually, this functional is exactly the Dirac's delta function, introduced by Paul Dirac in the late twenties of the last century. Example 2.2. Let the hyperbolic second order PDE дхду be given, where и is the unknown and / a given continuous function on R2. If we multiply (8) by ? e C2(R2), then after two integrations by parts, we get / f(x,y)<p(x,y)dxdy = / —-<p(x,y)dxdy JR2 Jr2 охду f JR d-<p(x,y) u(x,y)—r~z dxdy. : дхду This means that (8) holds in the "weak" sense, provided that the linear functional on C2(K2), given by <рь* / и(х,у)—— dxdy, (9) УК2 дхду equals to the functional ?"-* / f(x,y)<p(x,y)dxdy. JRi Thus the functional (9) is a weak solution of Equation (8), but it can also be taken as a weak second order mixed derivative of the solution u. Let us note that the two functionals, namely ? from G) and the one from (9) are not functions in the classical sense, but rather some new elements - distributions. In his famous book from 1951, L. Schwartz, using the somewhat earlier developed theory of locally convex spaces, gave the mathematical foundation of the theory of distributions. Other classical references on the distribution theory and PDEs are, e.g., Gelfand and Shilov A958), Vladimirov A976), Hormander( 1983-85).
1248 ?. Pap and A. Takaci 2.2. Distribution spaces If ? is an open set in R", then an infinitely differentiable function ?: ? -> С is in the set Co°(i2) if it has a compact support. The space ?>(?) is the set C^(i2) endowed with the convergence defined as follows: A sequence (<p/)y-6pj/rom ?>{?) converges to a function ? {from ?>(?)) if there exists a compact set К С ? such that for every j eN it holds supply С К, and, moreover, for every multiindex a e Щ the sequence (da/dxa)(pj uniformly converges to (da/3??)?, as j -> oo. Dehnition 2.3. Adistribution ? on ? is a linear continuous functional on ?>(?), where the continuity of ? means that for every sequence (<py)y6pj which converges to zero in ?>(?) it holds lim T(<pj) = 0. The set of distributions on all ? will be denoted by ?>'(?)\ clearly it is a vector space. The value ? (?) of a distribution ? e ?'(?) at a function ? e ?>(?) is also denoted by (?,?). The space of locally integrable functions on ?, L|OC(i2) is a proper subspace of the space of distributions ?>'(?). Namely, if / is a locally integrable functions on ?, then the functional Tf on ?>(?), defined by the relation (Tf,<p):=jf(x)<p(x)dx, ?€Р(Й), A0) is, in fact, a distribution from ?>'(?). The mapping i-|OC(K) -> V, given by / ь> Г/, is injective, i.e., if /, g e i-|'oc(IR), then Tf = Г? if and only if / a= g. The distributions of the type Tf, i.e., defined with a function / e i,,1 (R) by A0), are called regular. A typical example of a non-regular (or: singular) distribution is the delta distribution given by G), and also its generalization, the functional Sa, a e ?, given by (??,?)=?(?), ?&?{?). A1) A characterization of distributions is given in the following theorem. THEOREM 2.4. A necessary and sufficient condition for the continuity of a linear functional ? on ?>(?) is that for every compact set К С ? there exist numbers С > 0 and m e No such that \(T,<p)\^C-pK,m(<p), ???(?), A2>
Generalized derivatives 1249 where the norms рк.т (·) are given by in ??,??)= ? sup|D°V(*)|. A3) м=оле* In general, the number m in A2) depends on the compact set K. If, however, m can be chosen independently of K, then ? from A2) is said to be a distribution of finite order; the smallest m with this property is then called the order of ? in ?. It is easy to see that every regular distribution, i.e., the distribution Tf defined by an / e /^(.?), see A0), is of order 0. The characterization of distributions of order 0 gives the following theorem; in short, it states that such a distribution is a (Radon) measure, and vice versa. THEOREM 2.5. A distribution ? is of order 0 on E" if and only if it is a continuous linear functional on the set C(R"). The delta distribution Sa, a e R, is clearly of order 0, which, in view of Theorem 2.5, means that it can be observed as a measure on a Borel subset ? of the open set ?: ( il, if ? Sa(E):= / Sa(dx)=\ J ? 10, if ? We saw in Definition 1.2, how the generalized derivative in Sobolev sense of a function from L2(i2) was defined. Next, in a similar manner, one introduces the distributional derivative of a distribution. Let a = (? ?, <*2, · · ·, <*„) ф @,0, 0) be a multiindex from Nq and / a function from Cl0,|(K")· Then, for every ? e ЩШ), it holds (compare A)) f Daf{x)-<p(x)dx = (-\)^ [ f(x).EP<p(x)dx. JR" JW< Since both / and its partial derivative Da f define unique regular distributions Tf and ???), respectively, the last equality can be written as №'·'>-<-?'·???^ Thus for an arbitrary element ? e ?>'(?) the ath distributional derivative of T, T(a\ is defined by (r<°\v):=(-l)^r,??), ?*?{?). A4) The obtained functional ?@?) is also a distribution from ?>'(?). It is important to note the fact that the space ?>(?) is closed under differentiation of arbitrary order. This implies an
1250 ?. Pap and A. Takaci essential property of the space ?>'(?), namely that every distribution has a distributional derivative of arbitrary order, and, moreover, that the order of differentiation is irrelevant. Perhaps this is one of the main reasons why distributions turned out to be so useful in the theory of PDEs. The distributional derivative is a linear operation, a continuous mapping from ?>'(?) into itself, and, moreover, the Leibniz formula holds. Let us see few examples. Example 2.6. The kth derivative of the delta distribution is given by (a(*\v) = (-!)<*) ?-v@), „eP. Example 2.7. We have /•+3C (?', ?) = -[?(?), ??(?)\ = - / H(x)D(p(x)d) Jo f+OC - / ??(?)??=?@) Jo Thus (?,?), ?&?\ ?' = ?. A5) where ? is the delta distribution given in G) (with R3 replaced by E). Next, if the continuous function / is given by \x, if-OO, ??, if jc <c 0, then /' = H, compare with Example 1.1, which together with A5) implies /" = ?. Moreover, in view of Example 2.6, it holds /(*+2)=i(*)i keNo. A6) The derivative almost everywhere of H, DH, is the locally integrable function 0, since the point ? = 0, where ? has no derivative, has measure 0. Thus from A5) it follows ?' = ? ф 0 = DH, meaning that the two derivatives: the distributional and the classical one, are not always the same. The relation between these two derivatives in a rather general case is given in the following theorem, see, e.g., Vladimirov A976). THEOREM 2.8. Let f be a continuously dijferentiable function on the set R\J, where J is a discrete set J = {x\, X2,. ¦.} (i.e. J has no finite accumulation point). At every xj,
Generalized derivatives 1251 j e J, f is assumed to have a discontinuity of the first kind, i.e., both the right- and the left-hand side limits f(Xj ± 0) exist (but are not equal). Put sj = f(xj + 0) — f(xj — 0). Then it holds {Tf)' = TDf+Y^SjSX), where (Tj)' is the distributional derivative of the regular distribution Tf, while To/ is the regular distribution defined by the locally integrable function Df. In Example 2.7, we saw that the S distribution can be obtained as a finite order derivative of a continuous function. Actually, the following theorem gives an essential property of distributions. THEOREM 2.9. Let a distribution ? e ?>'(?) and an open set ?' such that ?' С ?, be given. Then there exist an r e No and a continuous function f on ?', with support in ?', such that (T,<p)=[f(r\<p), „??(?'). In short, every distribution is locally a finite order (distributional) derivative of a continuous function. Note that, in general, the order of the derivative, r, depends on ?'. 2.3. Fundamental solutions One of the main results on linear PDE of order m P(D)u = f, A7) where P(x,D)= ? aa(x)Da, aa are given functions in some region ?, and / is a given function, in the context of distributions, is the representation of the solution of A7) in the form of convolution of / and a special solution, ?, of Equation A7) with right-hand side equal to the delta distribution. DEHNITION 2.10. A distribution ? is the fundamental solution of a differential operator P(x, D) from A7) if it satisfies the equation P(x,D)E = S.
1252 ?. Pap and A. Takaci Example 2.11. The function /: R2 -> R given for (x, y) = x +<)' ЬУ f(x+iy)=-.—L- A8) ? x+iy is the fundamental solution for the Cauchy-Riemann operator -1/9 9 9 = - — + / — 2\9jc 9y Note that / is a locally integrable function on R2, since \f(z)\ = - ¦— for ? =x+iy, ? \?\ and thus it defines a distribution on R2. Let us prove that the function / from A8) is the fundamental solution of the Cauchy-Riemann operator, i.e., that we have ?/,?) = ?@). A9) We start from the equality \ J-?? V' ?? 2? Jr\dx ^ dy)x+iy Changing the variables ? — rcost, у = rsini, for 0 < r < oo, 0 ^ t ^ 2?, and taking <Pi(r, t) = <p(rcosf, rsini), we obtain ^---1Сш-т^- Therefore we have C/,*>>=-^- f (?(??, t) - <pdO, t))dt + ^- f (<р(г,Ъг)-<р\(г,0))аг. 2? Jo ?? J0 By the compactness of the support of the function ?, and hence also of ?\, and the periodicity of ?\, we obtain A9). One of the great successes of the distribution theory is the following theorem, proved by Malgrange A955-56) and Ehrenpreiss A954) in the mid fifties (see Pap et al. A997)). THEOREM 2.12. Every linear partial differential operator with constant coefficients P(D) has a fundamental solution E, i.e., there exists a solution of the equation P(D)E = S.
Generalized derivatives 1253 The existence of the fundamental solution ? from Theorem 2.12 enabled Nirenberg A955) to obtain the following result. Theorem 2.13. Let P(D)= ? ?"°? B°) be a linear partial differential operator with constant coefficients. Then for any given function f from C^° there exists a solution и in Cx of the equation P(u) = f The solution и is given by the convolution u = E*f, B1) where ? is the fundamental solution of the partial differential operator P(D) from Theorem 2.12. In all three main equations of mathematical physics, the fundamental solutions are known. Example 2.14. The function ?:RxR^ R given by 1 ? 1/2 ift>\x\, E(x,t)=-H(t-\x\)=\' ' B2) 2 [0 if t < \x\, is the fundamental solution of the one-dimensional wave equation. In other words, it holds д2Е д2Е _ W~^x2=&' The fundamental solution of the two-dimensional wave equation is the function Ег{х, t) = 2?^?2-\?\2 where for ? — (x\,X2) e K2, we put \x\2 = x^ +x2. Example 2.15. The locally integrable function ? given by E(x)= = for и ^ 3, ??B-?)\?\"-2
1254 ?. Pap and A. Takaci where ?„ = 2??"/2/ Г(и/2), is the fundamental solution for the Laplace operator 92 92 ? = —= +^. Example 2.16. The function ж a H(t) ( m2\ E(x, t) = , exp — , is the fundamental solution for the heat equation, i.e., it holds Эй -> — ~azAu=S(x,t). at Further generalizations one can find in Colombeau A985), and in Pap B002), Chapter 34 of this Handbook. 3. The Mikusinski operational calculus 3.1. Introduction Simultaneously with the introduction and the fast development of the theory of distributions, in the early fifties the Polish mathematician J. Mikusinski with his collaborators gave the foundations and systematic presentation of his operational calculus in a book first published in 1959, see Mikusinski A983). Essentially, he introduced a field ? of convolution quotients, much in the way the field Q of rational numbers is obtained from the ring ? of integers. The elements of M, called (Mikusinski) operators, can be taken both as a generalization of complex numbers, but also as a kind of generalized functions. Due to its deep connection to other mathematical theories (in particular, with that of generalized functions and the Laplace transformation), convenient algebraic properties, but also its relative simplicity, the Mikusinski operator calculus had impact on numerous authors both in theoretical and applied mathematics. Perhaps, the field ? is best suited for (both ordinary and partial) differential equations with initial conditions. The continuation of Mikusinski's book A983), written by J. Mikusinski and his collaborator T. Boehme A987), presented the advanced topics and results obtained in a period of thirty years till the mid eighties, in particular on the problems connected with the topological and convergence structure of the Mikusinski operator field ?. 3.2. Operators Next we present the main notions from the Mikusinski operational calculus; the complete exposition is given in the mentioned monographs by Mikusinski A983) and by Mikusinski and Boehme A987).
Generalized derivatives 1255 Throughout this section, we denote by C[0, oof (respectively ?) the additive group of continuous (respectively locally integrable) functions on the interval [0, oof. Let / and g be two continuous functions on the interval [0, oof, i.e., /, g e C[0, oof. The convolution of the functions / and g is the continuous function /*gon [0, oof, defined by the integral (f*g)(t)=[f(T)g(t-T)dz, f>0. B3) Jo It is easy to check that C[0, oof is a commutative ring, when equipped with the multiplication given by the convolution; in other words, the convolution * in C[0, oof is a commutative and associative binary operation which distributes over the addition of functions. From now on, we shall write simply fg for the continuous function on [0, oof from B3). (One must distinguish the convolution product fg = f*g from the multiplicative product (/ · g)(t) — f(t) ¦ g(t), t > 0.) It is important to note that C[0, oof has no unit element for the convolution, since there is no continuous function /' = i(t) such that (/ * /)(f) = /(f) for each / e C[0, oof. For the construction of the Mikusinski operator calculus, essential is the following Titchmarsh theorem. THEOREM 3.1. If the functions f and g of class C[0, oof are not identically equal to zero, then neither is their convolution f * g identically equal to гею. In other words, C[0, oof has no divisors of zero, which implies that its quotient field can be defined. This field shall be denoted by ? and called, by its inventor, the Mikusinski operator field. Its elements, the (Mikusinski) operators, are quotients of the form f/g, /eC[0,oo[, 0#seC[0,oo[. The last division is observed in the sense of convolution from B3), meaning that two operators /i/g, and f2/g2 B4) are equal iff (/| *g2)(t) = (h *gi)@ for each t e [0, oof. The sum and the product of the operators f\ jg\ and frjgi from B4) are defined by /l h _ f\g2+f2g\ f\ fl __ f\ fl g\ 82 g\g2 ' g\ gl g\gl' One easily sees that the right-hand sides of the upper two equalities are invariant under the replacement, say, of the quotient f\ /g\ with the quotient /з/#з, as long as (f\ * #з)@ = (/з *g\)U) for each t^0. In view of the obvious analogy with the arithmetic of fractions (since the usual addition and multiplication of operators can be treated in the same way as with real or complex numbers), it is no surprise that the field ? has very good algebraic properties.
1256 ?. Pap and A. Takaci Let us remark that the set ?, equipped with the usual addition and the multiplication considered in the sense of convolution, is also a ring without divisors of zero provided that the usual equality of functions is replaced with the equality almost everywhere. Clearly, С is a subring of ?; however, the set of the convolution quotients from С coincides with the field ? obtained from С Each continuous function / = f{t). t ^ 0, defines a unique operator (fg)/g, where g is an arbitrary function from C[0, oof which is not identically equal to zero. It is convenient simply to denote this operator by /; we then write / = {/@1 and say that / is defined by the continuous function /. The subset of M, consisting of the operators representing continuous functions, is denoted by M(; clearly C[0, oof can be bijectively mapped onto Mc. Note that the identity operator / = ///, / e C[0, oo[ and / # 0, does not belong to Melt is convenient to write simply 1 for the identity operator / in M, which allows the equality 1 / — / for any operator f &M- However, one must distinguish it from another important element from M, namely the integral operator ? = {{}, thus defined by the function identically equal to 1 on [0, oof. Clearly I e Mc\ its name comes from the equality [Jo J The inverse operator to I, 1 /? is the differential operator denoted by s, which, however, does not belong to M.c (otherwise, the identity operator would also be in M(·). Note that s is not exactly the usual derivation operator, namely for a function / with a continuous derivative /' on [0, oof it holds sf={f'(t)}+f@). B5) Since any two operators can be multiplied, the expression sf in B5) has sense in ? even if / is a continuous (or just a locally integrable) but not necessarily differentiable function on R. In this way, one can observe the operator sf as the generalized derivative of/. If ? e N, then the «th power of ?, the operator ?", is defined by the continuous function t"~] /(n - 1)!, t >0. Clearly, it can be observed as the operator of repeated integration; its inverse, the operator s", is the nth power of the differential operator s. If / is an и-times continuously differentiable function on [0, oof, then the following generalization of B5) holds: {/(,,)W} = snf - f@)s'-] /'"""(?)/. Moreover, for ? e R and a > 0 it holds ?a = {ta~]/?(?)}, and these operators are exactly the well known Riemann-Liouville (convolution) operators of fractional integration. Thus the operator sa, the inverse of the operator la, is the operator of fractional differentiation, provided that a > 0.
Generalized derivatives 1257 The step function HT(t), ? > 0, is defined by 0, t<r, A, t > ?. Then the operator e-5T:=i{WT@}, ?>0, B6) is the i/ii// operator (or: translation operator) and the last term is justified by the following. If / is a locally integrable function, then «-"{/('>} = { 0, 0^?<?, f(t-?), ? ?. Let us remark that the identity operator 1, the powers of the operator s", ? e N, and the shift operator e~ST, ? > 0, correspond to the delta distribution, its «th derivative and the delta distribution supported by the single point ?, respectively. In general, the connection between the field ?4 and the space of distributions is complicated; there are operators (e.g., arising from the heat equation) that are not distributions, and vice versa. In particular, any distribution with unbounded support both to the left and to the right cannot be identified with any operator. In applications, of great importance are the polynomials in operator s given by P„(s) = a„s"+a„_is"-' + · · · +a\s +a(), where a„ ? 0, ?„_?,..., ao are numerical constants. Operations on these polynomials are treated in the same way as in ordinary algebra. A rational operator (in s) is a quotient of two polynomials in s: a„s" +?„_|?"_? ? \-a\s+a0 Rn.m(s) fomJm +fo„I_|S-1 +---+b]S+b0' where a„ ? 0, ?„_?,..., ao, bm ? 0, fo,„_|, bo, are complex numbers and m, ? e N. If these numerical constants are real, then the rational operator Rn.m(s) can be decomposed into a linear combination of the powers of the following simple fractions 1 1 s-D s-A' (s-BJ+C2' (s-EJ+F2' for appropriate real numbers A, B,..., F, and, if ? ^ m, of a polynomial of order ? — m in s. A rational operator /?„.,„ (s) represents a continuous function only if the power of the denominator, w, is greater then the power of the numerator, n, i.e., if m > n. If, however, m = и, then the rational operator /?„.,„(s) can be written as Rn.m(s)=^I + Rt, bm
1258 ?. Pap and A. Takaci where Rc is an operator representing a continuous function. The solving and analysis of regularity of solutions of linear differential equations with appropriate initial value conditions is rather simple, since one works with polynomials and rational operators in s as in arithmetic. There is a close connection between a set of (one sided) Laplace transformable functions and a subset of the field ? of Mikusinski operators, established and analyzed by Ditkin and Prudnikov A975). We give only some examples in order to represent this connection; they show that the Mikusinski calculus can be taken as a generalization of the classical right-sided Laplace transformation. To that end, let us note first that the functions r, , neN, ? ?, B7) s s-l (s-a)" (s-aJ+b2 where s stands for the complex variable, are the Laplace transforms (in appropriate half planes of the complex plane) of the functions м-\ eat ? 1, e', , -ea'sinbt, B8) (n-1)! b respectively. However, if in B7) we treat s as the differential operator from Л4, then these operators are exactly represented by the functions from B8) respectively - the inverse Laplace transforms of the functions from B7). Of course, there are continuous functions on [0, oof, e.g., Bi — l)e'~, which are not Laplace transformable, since the integral / e-stBt-l)e'~dt JO does not converge for any seC, Thus the following problem x'(t)-x(t) = 2t + Bt-l)e'2, Jt@) = 0, B9) can not be solved using the Laplace transform. The differential equation in the field ? s ? — ? = 2 + /, where / = {Bt — \)e'~] corresponds to the problem B9). It, however, has a solution in the field ?: 2 + / + ^—=\2е'+ f е'-т{2т-\)ет'ат\ ={е' + еГ'}. s - 1 [ Jo J An operational function и (х) is a function that maps a set of real numbers into the set of operators (see Mikusinski A983, Part 3, Chapter I)). An operational function u(x) is said to be continuous in a finite open interval У с К if it can be represented in the form u(x) = q{u(x,t)\, C0)
Generalized derivatives 1259 for some nonzero operator q from M, and some continuous function (in the usual sense) u(x, t) on the domain {{x, t), ? e J, t ^ 0}. Moreover, an operational function u{x) is said to have a continuous first derivative u'{x) on an open interval У с К if it is given by C0) and the function u(x,t) has a continuous partial derivative j^u(x, t) in the upper mentioned domain. Then, by definition, it holds u'(x) = q\—u(x,t)\. The derivatives of higher order of an operational function are also defined in this manner. Let as note that the properties of continuous operator derivatives are similar to the usual properties of the derivatives of numerical functions. In the field M, one mostly uses the type I convergence (shortly: convergence). By definition, a sequence of operators (an)„6pj converges to an operator a iff there exists an operator b ? 0, such that (ban)neu is a sequence of continuous functions on f 0, oo [ which converges uniformly on every finite interval to the continuous function ba. Unfortunately, the type I convergence is not topological, i.e., there is no topology in ? such that the type I convergent sequences are exactly those that are convergent in this topology. For that reason, another convergence was introduced by J. Mikusinski, namely type I' convergence. A sequence of operators (a„ )пещ converges type Г to an operator a iff each subsequence of (a„)„6pj has a subsequence which type I converges to a. The type Г convergence is the smallest extension of type I convergence which is topological. One can obtain the field of Mikusinski operators also if one starts from the ring С of locally integrable functions on [0, oof. The space С is usually endowed with the topology defined by the family of seminorms || · || ?, ? > 0, where ||/||7·= ? \f@\dt, T>0. Jo The subspace of С consisting of all the functions / such that || · || 7- > 0, for any ? > 0, is denoted by ?0· The algebra of all operators of the form f/g, f e C, g e Co, is denoted by .Mo· The convergence in С is by definition the convergence in all seminorms || · || 7-, ? > 0. Thus the sequence (xn)neN, of the elements from M, converges type / to ? e M, if there exist representations x„ = f„/g, ? = f/g, /«, f,g e ?, g # 0, such that /„ -> / in C. The problems connected with the convergence, topology and approximations in ? were analyzed, besides Mikusinski's books A983) and (with Boehme) A987), e.g., in Burzyk A983), Pap and Takaci A990), Takaci and Takaci A997). References Adams, R.A. A975), Sobolev Spaces, Academic Press. Burzyk, J. A983), On the convergence in the Mikusinski operational calculus, Studia Math. 75, 313-333. Colombeau, J.F. A985) Elementary introduction to new generalized functions. North-Holland Math. Stud., Vol. 113, North-Holland, Amsterdam.
1260 ?. Pap and A. Takaci Ditkin, V.A. and Prudnikov, A.P. A975), Operational Calculus, Visa Skola, Moscow (in Russian). Ehrenpreiss, L. A954), Solutions of some problems of division, I, Amer. J. Math. 76, 883-903. Friedrichs, K. A934), Spektralzerlegung halbbeschrankter Operatoren und Anwendung zu Spektraizerlegung von Differenzialoperatoren, Math. Ann. 109, H. 4-5, 465^487. Gelfand, I.M. and Shilov G.E. A958), Generalized Functions I. II, III Fizmatgiz (in Russian). Hormander, L. A983-85), The Analysis of Linear Partial Differential Operators I-IV, Springer, Berlin. Levi, B. A906), Sul principio di Dirichlet, Rend. Circ. Mat. Palermo 22, 293-359. Mikusinski, J. A983), Operational Calculus, Vol. I, PWN - Polish Scientific Publishers, Warszawa and Pergamon Press, Oxford. Mikusinski, J. and Boehme, Т.К. A987), Operational Calculus, Vol. II. PWN - Polish Scientific Publishers, Warszawa and Pergamon Press, Oxford. Malgrange, B. A955-56), Existence et approximation des solutions des equations ma derivees partielles et des equations de convolution, Ann. Inst. Fourier (Grenoble) 6, 271-355. Nikol'skii, S.M. A975), Approxinmtion of Functions of Several Variables and Imbedding Theorems, Springer, Berlin. Nirenberg, L. A955), Remarks on strongly elliptic partial differential equations, Comm. Pure Appl. Math. 8, 648-674. Pap, E. B002), Pseudo-additive measures and their applications. Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 1403-1468. Pap, E., Takaci, A, and Takaci, Dj. A997), Partial Differential Equations through Examples and Exercises, Kluwer Texts Math. Sci., Vol. 18, Kluwer, Dordrecht. Pap, E. and Takaci, Dj. A990), On linear differential equations in spaces of locally integrable functions and of Mikusinski operators. Math. Nachr. 148, 197-208. Schwartz, L. A950-51), Theorie des Distributions 1,11, Hermann. Paris. Sobolev, S.L. A935), Le probleme de Cauchy dans I'espace des fonctionnelles. Dokl. Akad. Nauk SSSR 3 G), 291-294. Sobolev, S.L. A936), Methode nouvelle ? resoudre le probleme de Cauchy pour les equations lineaires hyperboliques normales. Mat. Sb. No. 1, 39-72. Takaci, Dj. and Takaci, A. A997), Numerical methods for operator differential equations, PAMM Monograph Booklets, Technical University of Budapest. Thomson, B.S. B002), Differentiation, Handbook of Measure Theory. E. Pap, ed., Elsevier, Amsterdam, 179— 247. Triebel, H. A978), Interpolation Theory, Function Spaces, Differential Operators. North-Holland Math. Library, Vol. 18, North-Holland, Amsterdam. Vladimirov, VS. A976), Equations of Mathematical Physics, Nauka. Moscow. Wloka, J. A987), Partial Differential Equations, Cambridge University Press, Cambridge.
CHAPTER 31 Real Valued Measurability, Some Set-Theoretic Aspects* Aleksandar Jovanovic School of Mathematics, Universiry of Belgrade, Stiidentski trg 16. 11000 Belgarde, Yugoslavia Entail: aljosha@infosky.net Contents Introduction 1263 0. Notation 1264 1. RVM - first equiconsistency 1268 2. Cardinal monotony 1274 3. Nonregular ultrafilter 1277 4. Rudin-Keisler order 1279 5. Measures and normal Moore spaces 1281 6. Counting remarks 1283 7. Some complementary results 1286 8. Set-theoretic measure theory 1288 References 1290 *This work was partly supported by project "Mathematical models of nonlineanty, uncertainty and decision" A866) financed by Ministry of Science, Technology and Development of Serbia. HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1261
This page intentionally left blank
Real valued measurability, some set-theoretic aspects 1263 Introduction Besides mostly used (totally atomic) probability measures on finite elementary event spaces which easily posses nice properties of totality and maximal additivity, nontrivial measures in the case of infinite sets usually tend towards one of the two ancient roots: arithmetics and geometry. The former-counting measures, accumulating measures, tend to measure the whole and bigger by measuring elementary and smaller. The later, initiating in dichotomy: zero measure dots versus length measuring intervals, with a desire to measure everything (totality) tend to satisfy a natural demand (well established much earlier in integral calculations) that the addition and subtraction of a lot of points does not change the result, and that the measure of union of disjoint intervals equals their lengths sum (additivity), well established much earlier in summation of infinite series. In working mathematics, the expressions beginning like: "let ? be ... probability on space X..." are common and widely needed. The axiom of choice AC, besides supplying all sorts of measures, was essential in Vitali's construction of Lebesgue nonmeasurable set, which crashed the totality of Lebesgue measure. And having an atotal geometric probability means that there are (quite big) events without probability at all (while obviously their small parts have probability zero). These issues attracted serious attention early in the century. An early solution to this kind of discomfort was discovered by Banach A923) who proved that there are total extensions of Lebesgue measure in R1 and R2 which are finitely additive and isometry invariant. Thus, infinite additivity is broken on some (quite big) events. On the countable infinite event spaces situation is simplified by the fact that measure additivity is limited by the space cardinality, which for the countable spaces means finite additions only. As shown in Solovay A970), weakening of AC to dependent choice DC leaves room to kill Vitali sets and allow Lebesgue measure to be total. On the other hand reducing axiom of choice to DC might leave us with no measure at all. Such model of ZF + DC is given in Pincus and Solovay A977), thus leading to a nice Universe of Mathematics with no probability at all except finite probabilities. Variety of related notions and properties were studied in fine details in twentieth century. This study revealed that the measure problem is tightly related to the well ordering principles and "cardinal geometry of universe of mathematics" - displacement function- solution of the continuum problem, compactness properties of the infinitary languages and their combinatorial equivalents, with specific positions in the hierarchy of the strong axioms of infinity, elementary embeddings of models of set theory. It's solutions are closely related to other important problems in other areas of mathematics. Curiously, measure problem became a kind of periodic: it surfaces as a big issue every ten to twenty years, either by it self or involved in some other context. The content of this chapter intended to present some of the major components in this dance of properties. Somewhat complementary exposition is given in Laczkovich B002) in this Handbook.
1264 A. Jovanovic 0. Notation Axioms for set theory emerged between hundred and eighty years ago and are nicely stabilized since then. The axioms were probably selected as fundamental, simple, practical and sufficiency general for the need - development of all Mathematics in uniform way and set theory represents a kind of "fundamental geometry" of the Universe in which Mathematics happens. Set theory is the first order theory with e as the only nonlogical symbol and following axioms. Empty set: C.x) ("x is empty") C;t)(Vy)(->y e x). Extensionality: (V*)(Vy)(Vz)[(z e ? ++ ? e у) -> ? = у)]. Foundation: (V;t)[Cy)(y e ?) -> Cy)(y e ? ? (Vz)--(c е Jt л ; е у))]· Pair set: (У*)(Уу)(Зг)(Уи)(и ег^к=ху« = у). Union set: (V;c)Cy)(Vz)[z e у «-> (Зи)(г еилке *)]. Power set: (Vjt)Cy)(Vz)[z ey« (Уи)(и e:^«e *)]. Replacement: If <p(;t, y) is a formula with ? and у free, not involving ? then (V;t)(Vy)(Vz)M;t, у) л ?(?, ?) -* у = г) -»· (VM)Cu)(Vy)[y ev^ Cjc)(jc e и л <pU, у)]. Infinity: Cjc) ("? is ?"). (We denote the set of natural numbers, the first initial ordinal and its cardinality by ?.) AC/DC Axiom of choice: "every set has a choice function". Dependent choice: Let ? ? 0 and R a binary relation on x. Assume that (Vye*)Cze*)(y/?z). Then there is a map h: ? -> jc such that for all и е ? й(и)Кй(и + 1). First 8 axioms constitute Zermelo Fraenkel set theory ZF. jc is an ordinal, Ord(jc) iff ? is e transitive and well ordered by e. Thus Ord is a unary predicate, but it is nicely organized with e as well (well ordered) so we prefer to think about this predicate as one object like sets, that is somewhat bigger than usual sets (with | Ord | = oo) and call such objects proper classes. Thus instead of ???{?) we write ? e Ord. Ord is related to the sets similarly to the proportion: ? and the finite sets of natural numbers. In arithmetics finite objects are treated, no one mentioning infinity but all these objects dwell in ? = ?. Expanding to set theory was initiated by the basic need of Mathematics - to investigate infinity. If somehow Cantor diagonal argument did not exist, then we would have integral infinity, in one piece. However, maybe then mathematics and especially set theory would not be so exciting. In either case we (would) have Cantor-Russell paradox or Russell paradox (with Cantor's diagonal proof) showing that infinity always leaks out:
Real valued measuiability, some set-theoretic aspects 1265 "the world is bigger than the objects in it". The universe of all sets is usually denoted by V. Thus, ? e V -?· ? = ? or V = {? \ ? = ?]. On the other hand we have cumulative hierarchy (Von Neumann): VO = 0, V„+i = P(Va) (all subsets of Va), Va = ? V/j when ? is a limit ordinal, and ?<? V= (J Va. aeOrd It is nice that V = V, i.e., that V of cumulative hierarchy is equal to the universe of all sets. V is transitive (e transitive). Thus, everything takes place in (predicate, "big thing", proper class, universe) V. Instead of collecting all subsets Godel's hierarchy of constructible sets Lis defined using definable subsets only: a subset X с Л is definable in A (with parameters) if there is a formula <?(??>?,..., u„) of the language of set theory and elements ? ? atl & A such that X = \a e A | (A, e) \=?[??\ ...?,,]}. Thus, La+\ = {x I x <? La л ? is definable in La), La = ? Lfi for limit a, and ?<? aeOrd V = L stays for "all sets are constructible". Godel A938) proved equiconsistency of ZF and ZFL (ZF + V = L) and proved that AC and GCH are theorems of ZFL, whence AC and GCH are consistent with ZF. If у = {? ? ?[?,?\,.. .,<*„]} for ? ?,.. .,<*„ e Ord, we say that у is ordinal definable and write OD(y) or у e OD. If transitive closure of ? TC(jc) с OD we say that ? is hereditarily ordinal definable and write HOD(jc) or ? e HOD. Similarly ODR means definable from ordinals and reals and HODR for hereditarily ODR. All combinations of consistency results equalizing some (none) elements of the chain L с HOD с V is established by Levy and McAloon together with the relationship between OD and HOD. A first order structure (M,E) for the language {e} is a model of set theory if the axioms of set theory hold in (M,E). The model is transitive if ? is transitive. If ZF is consistent then its models are infinite. By Skolem-Lowenheim theorem, if consistent ZF has a countable model. Let 3?? and У1, be structures of a first order language С and for
1266 A. Jovanovic f: ? -* N we say that it is an elementary embedding, in symbols /: 9Л < 9? iff for every formula ?(?\ ...xn)oiC and all ? ? a„ e ? mt=<p[al...a„] iff 4l\=<p[fai...fa„]. Transitive e models of ZFC containing all ordinals are called inner models. The nontrivial elementary embeddings (different from identity) of V are important in set theory and model theory. If j is such an embedding, then j(a) ^ a and there is a first ordinal moved by j - its critical point. Vopenka PRINCIPLE. If A: is a class of models of language C, then there are ОТ, У1 e К such that m<<Jl. Generic extensions (full details are well presented in, e.g., Jech A971a, 1971b, 1978) or Kunen A980)). For a first order formula ? and a complete Boolean algebra В е 9Jt, where 9Я is a model of set theory, the Boolean valued satisfaction \\<р\\в е В is defined by induction on formula complexity, so that \\?\\ = 1 when ? is a logical axiom. Usually the same is for axioms of ZF. Let h be an iW-complete homomorphism of ? to 2 (corresponding to a generic ultrafilter G). Composition of the Boolean valued satisfaction with h represents the standard satisfaction, leading to the minimal extension Wl[h](= 9Jt[G]) of the initial model Ш of ZF, containing h (and G) which satisfies/violates certain property, depending on the selection of В and h. Compactness. An important property of the first order logic is given in the COMPACTNESS THEOREM. A set of sentences ? has a model if and only if every finite subset of ? has a model. Its generalizations to infinitary languages attracted some attention as well. Infinitary language Ck has к variables, and besides usual ways of formula formation has infinitary conjunctions and quantifications: if Г is a set of formulas of Ck and \Г\ < к then лГ is a formula of Ck; if ? e Ck and X is a set of variables, | X | < k, then (VX)<p is a formula of Ck- Some generalizations of compactness: Definition. A cardinal к > ? is weakly compact iff for every set ? of sentences of Ck with \?\ < к, ? has a model if every subset of ? of cardinality < к has a model. A cardinal к > ? is strongly compact iff for every set ? of sentences of Ck, ? has a model if every subset of ? of cardinality < к has a model. A cardinal к > ? is medium compact if the following is true. Let ??, ? < к, be sentences of Ck and let U/j<a ?? nas a model for each a < k. Then U/3<* ?? has a model. None of the increased compactness can be proved to exist from original axioms of set theory. They are introduced with axioms that guarantee the existence of a weakly compact cardinal or a strongly compact cardinal or a medium compact cardinal.
Real valued measurability, some set-theoretic aspects 1267 Ultrafilters - binary measures. By ultrafilter theorem, AC supplies plenty of nonprincipal ultrafilters over each cardinal. List some definitions. Let ? be a [0, 1] valued finitely additive measure on P{k) (this includes ultrafilters). Cardinal norm or simply norm is defined as ||?||=????{?|?(*)>0}. Measure additivity is given with add (?) = min||jc| ? ? с dom(M) л (Vy e ?)(?(?) = 0) ? ?([_Jjc) > ?!. A cardinal к is RVM (real valued measurable) if it has a measure of additivity k. A measure ? on P(k) is (?, ?) regular iff there is ? с dom(M), |?| = ? and if X с ? and |X| > a then ?(??) = 0. ? is regular if it is (?, A:)-regular. There is plenty of regular ultrafilters. Nonpricipal ultrafilters whose all elements are of the same cardinality are called uniform. There are plenty of uniform ultrafilters on each cardinal. The existence of nonregular ultrafilters, known as a problem of Gillman and Keisler and related problems of powers of ultrapowers were of major importance in the development of theory of ultrafilters with variety of solutions, dependable on set theory axioms contributed by Prikry, Jensen, Magidor, Woodin, Laver. But additivity of an ultrafilter can be either ? or a measurable cardinal. Similarly for nonbinary (total) real valued measures, their additivity is either ? or a real valued measurable cardinal. Thus: a cardinal к is measurable iff there is a nonprincipal ultrafilter ? on к whose additivity is k. Scott proved that the existence of measurable cardinals implies that there are nonconstructible sets, i.e., V ? L. If к is measurable then there is a normal measure ? over A:: for / e kk if f{x) < ? ??(?) (?({? | f(x) < ?}) — 1) then there is a constant b such that f(x) = b AEi?) (?({.? | fix) = b)) = 1). Weakening normality we get weakly normal ultrafilter: ultrafilter ? over к is weakly normal iff for / e kk if fix) < ? AEi?) then there is a constant b such that fix) < b ??(?). Strengthening normality we get compact cardinal: к such that for every ? > к there is an ultrafilter ?? over ?*(?) of additivity к (?(?) = [? с ? | |jc| < к}). Coincidentally, compact are just strongly compact, measurable are just medium compact and cardinals with weakly normal ultrafilters are tied to the weakly compact cardinals. Mention the supercompact cardinals. For ? < ? let Q = {x e Pa-(?) | ? e x]. Let ? be a binary measure over Pk (?) of additivity к, containing all Q. Let fix) e ? for all ? in ? (?), ? is normal over ?-(?) if for some a < ?, ?({* e ? (?) | fix) = <*}) = 1. Then к is supercompact iff for all ? there is a normal measure on Pk U). All compactness properties are properties of high concentration and of the increasing strength and consistency strength. Detailed treatment one can find in Drake A974), Chang and Keisler A973), Jech A978). Mention that if j : V -> V is a nontrivial elementary embedding then the first ordinal moved by j is a measurable cardinal. Strong compactness and super compactness are
1268 A. }ovano\4c expressed as further closure properties of nontrivial j 's. If Vopenka principle holds then there are many supercompact cardinals (see Jech A978), Solovay et al. A978)). 1. RVM - first equiconsistency In early thirties Ulam A930) set the field with must-had-been surprising discoveries collected in the following two theorems. THEOREM 1.1. if there is ? ?-additive nontrivial measure on S, then either there exists a two valued measure on S and \S\^ the least strongly inaccessible cardinal, or there exists an atomless measure on 2K|) and 2K() ^ the least weakly inaccessible cardinal. THEOREM 1.2. If there is a real valued measurable cardinal < 2K(I then there is a total extension of Lebesgue measure. If к is real valued measurable then к is weakly- inaccessible. The additivity of a {total) measure is real valued measurable cardinal. One can see instantly that CH simplifies the situation: if CH is true then there is no real valued measurable cardinal and no ? additive total extension of Lebesgue measure. Thus the first consistency result of Godel provides a model (L) without total ? additive extensions of Lebesgue measure. Scott A961) proved inconsistency of existence of any total nontrivial ? additive binary measure with Godel's axiom of constructibility V = L. Thus in L there are no total ? additive measures. The minimal universe L(?) containing a binary ? additive measure ? must be larger than L. Opposite to disagreement of CH and total ? additive measures on R, Silver proved consistency of GCH with the existence of a binary ? additive total measure. In 1966 Solovay proved this remarkable theorem. THEOREM (Solovay A971)). The following theories are equiconsistent. A) ZFC + "there is a total ? additive extension of Lebesgue measure". B) ZFC + "there is a measurable cardinal". That the consistency of B) implies the consistency of A) is proved with Cohen method of forcing with nicely chosen Boolean algebra and impressively nice harmony of measures. Let 9Л be a model of B) and let к be a measurable cardinal. The proof- building a model - generic extension 9Jt[G] of ground model ШТ, has two components: (a) make 2H° ^ к in the generic extension; (b) keep к real valued measurable. Then, the rest follows from theorems of Ulam. All details of the proof which is partly shortened here, can be found in Jech A978) or Solovay A971). Let ? ^ к be such that ??|> = ?. The simple measure space (S,-, ?[, ?') for i e / = ? ? ? given with: S,= {0, 1), Tx¦ = P(St), ?'@) = 0, ?'({0}) = ?'({?) = з, м'({0, 1}) = 1.
Real valued measurabilirv, some set-theoretic aspects 1269 Let (S,F, ?) be a product measure space for / e /; S = П;е/ ?, ? be the least ?- complete field of subsets of S containing the sets {t e '{0, 1} 11(/') = 0} for /' e /. Let В be the measure algebra T/lhe, ideal of sets of ?-measure 0. В satisfies countable chain condition c.c.c. and is a complete Boolean algebra. Define a ?-additive measure ?? on В with M2([-*]) = ?(?)- Let G be an 9Л-generic ultrafilter on B. Then in 9Jt[G] the cardinals are preserved because В satisfies c.c.c. In Wl[G], 2K" = ?. First, BK<»)OTlGl < (|B|K»)OT = ?. On the other hand in 9Jt[G] there are ? subsets of ?. For (a, n) e /, let UaM e ? be such that ?/?.? = {?€/{0, 1} |/(?,?)= ?}. Let «„,„ = [?/„,„] (e ?). For a < ? let x„ be the ?-valued subset of ? such that \\h eXa\\= ua„ (? < ?). Let xa be the G-interpretation of x«. It can be checked that ? ? ? implies xa ? ??, confirming that in 9Ji[G], 2^" = ? > к. Proof that к remains real valued measurable in 9Jt[G]. So, let к be a measurable with a binary measure ?\ in 9Л and let ?, ?, ?2, ? be as above. We shall define a ?-valued name ?3 and show that for any generic ultrafilter G, ?3 the G-interpretation of ?? is a к -additive measure on k. Thus we have a nice interference of measures ??, ?, ? 2 and G resulting in ?3 with desired properties. Let a e ? \ {0} and let A e 9JtB be a ?-valued name such that a ll· А с ?. For each ? < A; let ,.. . ?2(?· ||?? ?||) /a (A, a) = — M2(a) ?? is A;-additive, hence /?(?,?) = ???{?\) for some real r. Let ??(?) = r. It has the following properties (i) all· A] =A2 implies ?„(?|) = ??(?2), (ii) all· А] С A2 implies ??(?|)<?„(?2), (iii) for X С к in Ш ? ? (?) = 1 implies ?? (?) = 1, ?? (?) = 0 implies ??(?) = 0, (iv) let A^, ? < ? < к be such that a ll· Af с ? and ?7^/7 implies ? ll· A^ П A,, = 0. Let A be such that a ll· A = Uf <y ^?· Then ?-additivity of ?? implies Ma(A) = ^??,^??), (?) ? ^ b implies ??(?) ^ ?/,(?). Letr be a real number from [0, 1] and let {an \n < ?] be a partition of ? e B. If ?«„(?) < r for all и, then for almost all ?, ?2(?„ · ||? e A||) < r · ??(?„), and thus for almost all a, ??(? ¦ ||? e A||) < r ¦ ?2(?). Hence ??(?) < r. As a consequence we have if for every nonzero b ^ a there is a nonzero с ^ b such that ?<-(?) < r then ??(?) < r. (*)
1270 A. Jovanovic Similar holds for ^, >, ^. If b ll· А с к, define ?^(?)= inf ?«(?). ?? satisfies (i), (ii), (iii) but it is not additive and instead of (iv), we only have М?(А)>]ГМ*(А?) for ? < к. Let G be a generic ultrafilter. In 9Jt[G] we define дз: P(k) -» [0, 1] with M3(A) = supM*(A), b€G where A is a name for A. Let ? 3 be the canonical name for ??,, defined in fflB using the canonical G. дз does not depend on the name A for A, А] сд2 implies ??,(?\) ^ дз(Л2)· ForXe9^3(X) = 1 ifM,(X) = 1 and ?3(?) = 0 if ?, (?) = 0. The proof that ??, is к additive. Start with the meaning of ??. For a real r e Wl, 0 < r ^ 1, we claim that ??(?)^? iff fclh/?3(A)>r. If ??(?) ^ r then for every generic G for which be G, мз(Д) > г, hence foil· ?., (A) >r. Opposite if b ll· ? 3(A) > r then foil· V? < г ЗЛ e G ?^(?)^? that is V<7 <r Vc<fo3diCc ?*|(A)^q. Let 4 < r; we claim that ??(?) ^ 4. If ? ^ b then Vc < ? 3d ^ с such that ?} (A) > 4 and hence ?^?) > ^. Thus ??(?) ^ 4. Since this holds for all q < r, we have ??(?) > r. Proof that ??, is finitely additive. Let A, A| and At be such that every condition forces that A is the disjoint union of A| and A2. If ? and гг are real numbers and b ll· (?3(?|) > f\ and /?з(Аг) ^ r2) then (by first claim) b ll· ?3(A) > ? + r2; hence ?3(?) > ?3(?|) +?3(?2)· Conversely, assume that ?3(?) > ?3(?|) + ??(?2). There are reals r\, гг е 9Л and kG such that fo ll· /? 3 (? ?) < f\, ?3(??)<?2 and ? 3(A) ^ ? + ?. Since b ll· ? з(А|) < r?, there is for each с ^ fo some ii < с such that ?</(?|) < r\; hence ?;,(?|) < ? and ?b{Aг) < ?, and so ?,*(?) ^ ?/,(?) < r\ + r2. Contradiction.
Real valued measurability, some set-theoretic aspects 1271 Now since дз is finitely additive it suffices to show that ?<? ?<? for any family {?? | ? < ?] of less than к subsets of A:. So let у < к and let Af, ? < у and A be such that ||A = \J^<Y Af || = 1 and let ?3(?)> ]Гмз(А?). Then there is г е 9Л and beG such that b\l· 2_]?3(??) < f and дз(А)>г. Let ? be an arbitrary finite set and let Ag = Uf e? ^i ¦ Since M3(A?)iC ^дз(А^) = 1 we have b lh дз(А^) < r. By (*) we get ?/,(??-) < r. Since Mfc(Af) < r for all finite ? с у, it follows that ?/,(?) < r, whence ?,*(?) ^ r, a contradiction. This completes the proof that in 9Jt[G] дз is (a nontrivial) A:-additive measure on k. Proof that consistency of ZFC + "there is a total extension of Lebesgue measure" implies the consistency of ZFC + "there is a measurable cardinal". Certain combinatorial properties of measures and their zero measure ideals are essential: saturation of ideal and normality. For a Boolean algebra В we say that it is ? saturated if whenever {b$: ? < ?] is a pairwise disjoint family of elements then b$() = 0 for some ?? < ?. Let sat(S) be the least ? such that ? is ? saturated. For a nontrivial ideal / С Р(к) we say that it is ? saturated if В = P(k)/I is ? saturated and denote sat(/) = sat(B). Useful information on sat is given in the following propositions. PROPOSITION (Tarski). Let В a Boolean algebra, let sat(S) ^ K0. Then sat(B) > K0 and sat(Z?) is regular.
1272 A. Jovanovic PROPOSITION (Ulam). Let к be a real valued measurable with nontrivial measure ? on k. Let I be the ideal of sets of ? measure zero. Then I ist<\ saturated. With respect to an ideal / on к we say that А с к is of measure zero if A e /. Otherwise we say that A is of positive measure. Let / be а к -complete ideal on к > ?. The following definition of normal ideal is tightly related to normal (ultra) filters. Incompressible function corresponds to the minimal unbounded function in the case of normal ultrafilter. We say that / is normal if whenever Kt has positive measure, and /: В -> A: is such that /(?) < ?, ? e В, then there is ?' с ? of positive measure, and a ? < к such that f(t) = i, ???'. THEOREM 1.3. Let к be an uncountable cardinal, I a k-saturated nontrivial ideal on P(k). Then there is a normal nontrivial ideal, J on P{k), with satG)$sat(/). If I is a measure ideal, so is J. PROOF. Begin with some definitions. Let /: A -> k. We say that / is almost bounded if there is a ? < к such that {? e A | /(?) > ?) has measure zero. We say that / is nowhere bounded if for each ? < к, {? e A | F(?) ^ ?) has measure zero. Finally, / is incompressible if A) /is nowhere bounded. B) If ? с л has positive measure, g: В -> к and g(?) < f(%) for all ? e B, then g is almost bounded. These concepts are only important if A has positive measure. If A has measure zero, every map /': A -> к is simultaneously almost bounded, nowhere bounded, and incompressible. LEMMA 1.1. Let f': A -> к be nowhere bounded. Then we can write A as the disjoint union of sets В and С such that A) f \ В is incompressible. B) There is a g:C -> к with g(?) < /(?). ? e C, such that g is nowhere bounded. LEMMA 1.2. There exists an incompressible function f:k —> k. We can now prove Theorem 1.3. Let h:k -> к be incompressible. Define a subset J с ? (?) by J = {А с jt | /г (Л) е/}. It is straightforward to check that У is a A:-complete ideal, and that к <? J. Since h is nowhere bounded, h~[ ({?}) has /-measure zero. Hence {?} has ./-measure zero. An easy check shows that satG) 5i sat(/).
Real valued measurability, some set-theoretic aspects 1273 We check that У is normal. Suppose A has positive У-measure, and g : A -> к is such that g(?) < ? for ? e A. Put В = h~[(A). Then ? has positive /-measure (since A has positive У-measure). Let f :B —> kbe, the composition Then if ? e ?, /(у) = g(h(y)) < h(y). But /г is incompressible. It follows that there is a ? < к such that {y e В \ f(y) ^ ?) has positive /-measure. Since / is к -complete and ? < к, there is a ?' ^ ? such that D = {y e В \ f(y) = ?'} has positive /-measure. Let ? = {? e A | g(y) = ?'}. Then ? e D+> g(h(y)) = ?' <-> h(y) e E. Thus D = h~](E). It follows that ? has positive У-measure. That is, given g, A as above, there is an ? с А of positive У-measure on which g is constant. Thus У is normal. Finally, let к be a real-valued measurable cardinal. Let ? be a nontrivial A:-additive real- valued measure on к. Let / be the ideal of sets of measure zero of ?. Let h: к —> к be /-incompressible. Define v. P(k) -> [0, 1] by ?(?) = ?(/2-'(?)). (+) Let У be as above. Then ? is easily checked to be A:-additive and nontrivial, and У is the ideal of sets of ?-measure zero. Thus У is a measure ideal. The proof of Theorem 1.3 is complete. ? We now assume that к is as above, but that У is a nontrivial ^-saturated normal ideal. LEMMA 1.3. Let A have positive measure. Let h : A -> к be such that /?(?) < ?, for all ? e A. Then h is almost bounded. The inner model construction in the form in which it is used in the proof. Let A be a set. We are going to construct a class, L[A], with the following properties: A) L[A] is an e-model of ZFC. B) L[A] is transitive and contains all ordinals. C) А П L[A] is in L[A]. D) L[A] is the smallest transitive class satisfying (l)-C). Suppose that ? is a set. Let e, be the e relation on x. Let Ал be А П x. Let ? be a first order language with one two place predicate, e, and one one place predicate A. We interpret С in the relational system 2lv = (x; е.,, Ал) in the obvious way. A set В с ? is 2l.r-definable if there are elements yi, v„ of ? (possibly ? = 0) and a formula ? (у, у\,..., y„) of С such that В = {уех\Ъх\=ф(у,У1 у»)}. We put D(x, A) = {? с ? \ ? is 21 v-definable). We now define L„[A] for ? e OR by induction on a: Lo[A] = Q, a = ? + 1:LJA] = D{Lp[Al A), a is a limit ordinal: L„[A] = U/j<ff Lfl[A].
1274 A. Jovanovic There is a canonical bijection Fa : OR ^ L[A]. Fa is definable in set-theory. I.e. there is a set-theoretical formula \j/{x,y,z) which expresses "y is an ordinal and Fa(y) = z". Let ? be the conjunction of a large finite number of axioms of ZFC. _Let N be a transitive model of ?. Let ? be a set. Suppose that ? ?? ? e N. Put A = ADN. Then for ? an ordinal of N, La[A] = La[A] = L%[A] and FA(a) = Fj(a) = F^(a). Let <*o be the least ordinal not in N. Then <*0 = OR^, and i-fA]^ = Lao[A] = LaQ[A]. Now let к be an uncountable cardinal. Let У be a normal nontrivial ?-saturated ideal in P(k). Let_L[7] be defined as before. Put J = L[J] П J. It is straightforward to check that in L[J], У is a normal nontrivial ?-saturated ideal in P(k). THEOREM 1.4. Suppose that J, k, ? are as above and that ? < к. Then in L[J], к is measurable. The proof of this theorem uses the adaptation of argument in Silver's theorem: THEOREM (Silver A971)). If к is measurable with normal measure ? and it's measure zero ideal J then in L[J], GCH holds. Using method of this theorem, Solovay proved that "? < ? -> 2* < ?" is true in L[J]. This was sufficient in showing that к is measurable in L[J] with the use of the following result of Tarski A930). THEOREM 1.5. Let I be a nontrivial ideal in P(k). Suppose I is ? saturated for some ? < к. Suppose further that ? < ? -> 2K < к. Then к is measurable, and the quotient algebra В = P(k)/I is totally atomic. This completes the proof of Solovay's theorem on equiconsistency of total extension of Lebesgue measure and measurable cardinals. 2. Cardinal monotony Condition of cardinal monotony which is satisfied for e.g. finite probabilites with uniform distribution: \x\^\y\ => mW^W
Real valued measurability, some set-theoretic aspects 1275 is trivial in the presence of cardinal bisection, e.g., for Lebesgue measure, if CH holds, ? additivity prevents that a set of size less than continuum has a positive measure. The same is true for any measure over continuum with CH. In the case of the total ? additive measures observe first their indices. The theorem of Ulam does not leave too much room: if a cardinal is RVM then it has to be ^ 2? or measurable, seemingly there are no RV measures in the interval B?, 1st measurable). This restriction seems to eliminate the possibility of other RV large cardinals, the RV counter parts of large cardinals (RV Strongly compact, RV super compact), obtained from large cardinals by substitution of binary with RV measures in their defining properties (JovanovicA980, 1981)). There is a small leak however: Ulam's theorem states in fact that for every measure ? add(M)?B<", 1st measurable), with no restrictions on cardinal measure norm ||?||; besides the obvious add(M) < ||?|| ^ |index (?)| what variations are allowed here? Strait forward we get the following questions. Is there any RV measure ? such that add^) < ||?||? Is there any RV measure ? over continuum such that add^) < ||?||? The same for extensions of Lebesgue measure. Search for cardinal invariants in Solovay's model results in the following. For the measure ?? which makes к RVM and ??, constructed from ?? which extends Lebesgue measure to all sets of reals we have add^3) = add^ \) — k, 11дз|1 = 11Д||| = *, |??<1(??)|=? and |ind(/Z3)| = 2?. In the construction of the model one can choose to have either к < 2? or к — 2?. The first case gives sets of reals of positive ? з measure but of cardinality smaller than 2? (unusual), thus answering the starting question of cardinal monotony. To answer the last questions the procedure is similar as in Solovay's construction with stronger hypothesis. Let к be ? strongly compact with ? > к, in the ground model 9Jt. Let U be an uniform к complete ultrafilter over ?. Let (x, ?, ??) be product measure space ???{0,1} and let В be the corresponding measure algebra JF/the ideal of sets of measure 0. The algebra В and 9Л-complete homomorphism h are used to obtain a Cohen type model which preserves cardinals since В has countable chain condition and in which 2? — ? or 2? > ? as desired. From the ultrafilter U a uniform real valued measure ? is constructed with the following properties ???(?) = ?, ???=?, add^) = к < ?.
1276 A. Jovanovic RV-large cardinals. Introduce other real valued large cardinals on the example of real valued compact cardinals. We say that к is real valued strongly compact or real valued compact cardinal iff for every ? such that cfk > к (other ? are obviously excluded) there is a measure ?: ?(?) -> [0, 1 ] such that \\? || = ? and add^) = k. Obviously if к is strongly compact cardinal then к is real valued compact. The analog of Ulam's theorem can be proved similarly as the original theorem, thus PROPOSITION 2.1. If к is real valued compact then either к ^ 2? or к is strongly compact. THEOREM 2.1. Consistency of the theory ZFC + "there is a strongly compact cardinal" implies that the theory ZFC -f "there is a real valued compact cardinal к ^ 2? " is consistent. PROOF. First we note that in Solovay's method the measure ?? which makes к measurable in the ground model is independent of measure ?? on measure algebra used to force 2? = ? > к. On the other hand it is the additivity of ?? what counts not merely its index. So if ?? is ?: additive binary measure over <5 such that ||?|| — ? and if ?? is as before we can apply Solovay's method and with sequence of measures as in Solovay's proof, we obtain in 9Jt[G] a measure ? which maps ?{?) into [0, 1] which is к additive and properly real valued or non atomic and uniform, i.e., ||?|| = ?. All the construction took place in 9Jt[G]. It can be checked that ? and ? above are independent. Thus, let к be strongly compact cardinal in the ground model 3??. Let К be the class of binary measures witnessing compactness of k: K = {?s\c?.?^k, ??:?(?)^{0,1], \\?*\\=&, add(Ma) = *}. Do the forcing as before. In 9Jt[G] let us form a class A" of real valued measures: for ?$ e К as noted before make ?'& which is non atomic and such that add(/4)=add^,5) and \\?'?\\ = ||??||. By known facts it is clear that ?' = {?'&\????}^<?[0]. A" witnesses that к is real valued compact cardinal < 2?. How about the other consistency implication? The model L[K], the inner model constructed from a class of real valued measures К is not a solution, its worst problem being appearance of a lot of new cardinals. Thus we can only state the following conjecture: consistency of ZFC + "there is a real valued compact cardinal ^ 2'"" implies consistency
Real valued measurability, some set-theoretic aspects 1277 of the theory ZFC + "there is a strongly compact cardinal". Similarly for other RV large cardinals (e.g., weakly compact, supercompact,...). ? Real valued measurable cardinals, with Solovay's discovery pulled to the continuum of real numbers or it's part, the (consistency) power of binary measurable cardinals. The fruitful investigation of saturated ideals, done by Jech, Prikry, Magidor and others (see Jech A978)) was a kind of main generalization of Solovay's arguments. In the times when full zoology starting with measurable cardinals up to the  = 1" was well mapped, it looked quite odd that the real valued measurables are so powerful and so lonely, which pushed the search for the RV large. Kunen did similar thing with PMEA which is discussed in Section 5. Prikry A975) defines real valued supercompact mentioning that it's consistency is derived from super compact by the method of Solovay. All the three sources, quite similar and at the similar time - seventies, use Solovay's forcing for the downward relative consistency and upward constructibility relative classes of measures for the partial back consistency. The applications like in Section 5 made these kind of axioms interesting and useful in problem solving and classifications. 3. Nonregular ultrafilter Let us now mention the first example of a nonregular countably incomplete ultrafilter due to Jack Silver in the following theorem THEOREM 3.1. If к is a real valued measurable then there is an ultrafilter over к which is not (?, ?) regular for all ? < к such that c/? > ?. PROOF. Let ? be a A: additive measure over k. Let D be any ultrafilter extending the filter {x <? к ? ?(?) = 1). Thus ? e D implies ?(?) > 0. The following remark is useful: for any sets xa, a e ?, there is a countable set С с ? such that (+) aeX c/€C Proof of (+). Let У a — ?? ? У а disjoint and {ja )'<* = {Jaxa> s0 Ц UXa)=? (Uya)= Л >*&<*)¦ aeX Choose С so that ^2^(Уа) = ^2^(уа). Ct?C
1278 A. Jovanovic Let c/? > ?, xa e D, a < ?; put There is some ? such that ?, ?' ^ ? implies ? ? = ? a (otherwise, since c/? > ?, we would have ? ? descending chain of reals). For each ? > ?, let С ? с ? - /3, С/з countable, be such that ?( U Xa) = ?? (which = b- Use (+))· Since c/? > ?, there is set ? с ? with \Z\ = ? such that /3 < /3' in ? implies that every element of С ?', is greater than any element of С ?. For each ? e Z, let w = U ¦*«¦ Since ?( U ¦*<* ~^) =0 one has: ???????) = 0, for /3, /3' e ?; also ??'/з) = ?? > 0. So N/3eZ hence ?(?^) = ^>0' /3eZ But ?^= ? ?^'<«>· /JeZ f-.?^? ??? (V/3)/(/3)eC,) So for some /: ? -> ? such that for all /3 e Z, /(/3) e С/з we have P| xfw фЪ. aeZ Thus we have ? distinct xa whose intersection is nonempty.
Real valued measurability, some set-theoretic aspects 1279 Using more sophisticated arguments, Magidor proved consistency of existence of nonregular ultrafilters over <«2 and а>з relative consistency of huge cardinals. His nonregular ultrafilter F over 013 has cardinality jump ? ? ^ 013, while (obviously) I I 013 F _ 2?? F Nonregular ultrafilters are hard to obtain, jumping ultrafilters even harder. D 4. Rudin-Keisler order For /: ? -> ? and ultrafilters D and ? (over ? or its subset), E^RKD iff E={y\f-l(y)eD\, E^RKD iff ?sCRKD and D sCRK E. Complexity classification of nonprincipal ultrafilters due to Rudin and Keisler is related to combinatorial properties of ultrafilters, hence the structure of Rudin-Keisler order (RK order) of types of ultrafilters depends on the axioms of set theory (see Comfort and Negrepontis A974)). The existence of minimal elements is characterized by normality properties which are connected with a sort of completeness degree of ultrafilters. If there is a minimal unbounded function / for D it generates a <rk minimal ultrafilter ? in the set of all uniform ultrafilters over k. The minimal unbounded function for ? is the identity function (hence ? is weakly normal). Thus we can say that D above contains the information on a minimal element which is type unique, below it in ^rk for set of uniform ultrafilters over k. If the function / above is such that every function g:k —* k, such that {/ I g(i) < /(/)} e D, is equal to a constant modulo D, then the ultrafilter ? is normal and it is the minimal element in ^rk of all (nonprincipal) ultrafilters over k. Type equivalence in ^rk preserves norms, thus the set of all <rk equivalence types over some index can be divided into layers of ultrafilter norms which are cardinals from ? up to the cardinality of a given index /, position of uniform ultrafilters over /. What information can one level contain about the others? The existence of normal ultrafilters implies that there are no types below them, and weakly normal ultrafilters are minimal within the top layer. If there are minimal elements at lower levels those are type equivalent to some weakly normal ultrafilter within the same layer. A generalization of weak normality could be useful in the above consideration. Let ? be a cardinal ^ k. An ultrafilter D over к is ?-weakly normal iff there is a minimal unbounded function /: к -> a modulo D, i.e., iff for every g <d f there is a constant ? < a such that g <dY- Weak normality trace of ultrafilter D over к is then defined as WNtr(D) = {a \ D is ?-weakly normal). Some elementary facts we give in the following lemma.
1280 A. Jovanovic Lemma 4.1. A) If D is "only" normal or "only" weakly normal over k, then WNti(D) = {k}; if D is ordinary ultrafilter then WN tr(D) = 0; B) if a e WNtr(D) then in ^rk there is a minimal element below D in the layer of ultrafilters of norm a. As a consequence we have that ultrafilters with |WNtr| > 1 are essentially multiply minimal in ^rk- The existence of ?-weakly normal ultrafilters for a smaller than the cardinality of the index are interesting. The first examples come from properties like an instance of strong compactness. Namely if к is ?-strongly compact then there is an к -complete uniform ultrafilter ? over ? which is (k, X)-regular. It is simple to check that WNtr(?) = {a < ? | cf. a ^ k], which can be scaled down using the following theorem. THEOREM 4.1. Weakly normal trace is preserved by c.c.c. extensions. Consequently, starting from an instance of strong compactness, (ultrafilter E) tuning a little Solovay forcing and Silver's construction, one can get an ultrafilter F over ? < 2?, with WNtr equal to WNtr(?). RK extension to RV measures. Let fi(k) denote the space of all ultrafilters over k. Let M(k) be the space of all [0, 1] total measures on k. Clearly fi(k) с М(к). We can naturally extend ^rk from fi(k) to ? (к) using (+) from Solovay equiconsistency proof: for v, ? e M(k) put i'^rksM iff ? (?) = ?(/?_1(?)) for some h e kk. Similarly to RK put %RKs for ? <RKs ? and ? sSrks v. Put rRKs^) = {v\ ? %RKs M)· In this way we obtain a combined classification for binary and real valued measures. List some facts for RKS. Lemma 4.2. (i) For every ? e ? (к), | rRKs (?) I ^2*; (ii) \M(k)/^RKS\=22k- (iii) ? e P(k) implies ?^?(?) = trks^). thus ultrafilters and measures do not mix. (iv) P(k)/^RK с M(k)/^RKS; in fact P(k)/^RK is an initial part in <rks· (?) ? ^rks ? implies that add(i>) < add^) and \\v\\ ^ ||?||, (vi) normal, weakly normal and ?-weakly normal ultrafilters keep their positions in ^rks; normal measures are minimal among measures. It would be interesting to see if other normality properties (weakly, a -weakly) can nicely be translated to measures and learn more about possible complexity of ^rks·
Real valued measurability, some set-theoretic aspects 1281 5. Measures and normal Moore spaces Moore spaces are regular spaces, which have a development. The problem if every normal Moore space is metrizable, known as Normal Moore Space Conjecture (NMSC) was unresolved for decades, attracting exclusive attention. Treatment of number of related and restricted properties preceded its solution by Nyikos and Fleissner. A lot of work contributed to the fact that this problem is nicely studied and investigated in a variety of details (Tall A984)). Essential in problem solution was use of the following strong property. PMEA. For every k, the product measure on 2k can be extended to a total 2H° additive measure. Using Solovay forcing Kunen proved (Fleissner A984)) that consistency of ZFC +  strongly compact cardinal" implies consistency of ZFC + PMEA. First half of the normal Moore space problem was solved by Nyikos A980): THEOREM (PMEA). Let X be a space of character < с Every normalized collection of subsets of X is well separated. PROOF. Let X be a space of character < с and let [Ca I a < ?) be a normalized family of subsets of X. Thus for each Л с ?, there exist disjoint open subsets UA and VA of X containing \^j[Ca I a e A] and \J{Ca \ ? ? A] respectively. Let ? be a c-additive measure, extending the usual product measure on 2A, such that ?(?) exists for all А с 2?. We identify ? with the set of all ordinals whose cardinal is < ?. For each a e ?, let Ba = {fe2x\f(a)=l]. NotethatMB^= 1 and ?(?? - ??) = 1/4 for all ?, ? e ?, ? ? ?. Each / e 2? is the characteristic function of some subset A/ of ?: Af = {a e ? | /(a) = 1}· For a given a and a given ? e Ca, let {UY(p)\y < kp] be a base of open sets for the neighborhoods of p, with kp < с Let MP, Y] = {/ e 2? | Uy(p) с UAf or UY(p) С VAj}. For a fixed p, \J{A[p, ?]\ ? < kp] = 2?. This is because, for any given / e 2?, either Ua{ or VAf is an open set containing Ca, hence there is а у such that UY(p) is contained in the one which contains Ca. Because ? is c-additive and {A[p, ?] | ? < kp] is a collection of fewer than с subsets that fill up 2?, the transfinite sequence \jA[p, ? <? o^s^kp
1282 A. Jovanovic converges monotonically to 1. By the principle of Archimedean order, there exists a finite set of indices {?\ (?),..., ?„?(?)} such that d\jA[p,Yi(p)]\>T/*. Let ?? be such that tip (P)(P)· 1 = 1 Then ?(?[?,?,,])> 7/8. If we do this for all a and ? e Ca, we then have ?(?[?,??]??[?,?4])>3/4 for all p, q. Suppose peCa,qeCfi,a^p. Because ?(?-?) = 1/4 there exists / e A[p, Yp] П A[q, ?4] П (Ba - Bp). From /(a) = 1, /(/3) = 0 it follows that ? e UAr q ? VAf- And from / e A[p, y,,] it then follows that UYp(p) CUAf. Similarly, UY<l(q) CVAr Therefore, Uyp(p)nUY4(q) = e>. For each a, let Ua=\J{Uyp(p)\peCa}. Each Ua is an open set containing Ca and by the preceding paragraph, ????? = ? ????^?. Hence {Ca I a < ?} is well separated. ? COROLLARY (PMEA). Every normal Moore space is metrizable. After theorem of Nyikos the obvious question remained: could PMEA be replaced with some weaker property in the theorem of Nyikos? More precisely: what is the consistency strength of NMSC? This enigma was resolved by Fleissner who proved the following theorem (Fleissner A984)). THEOREM 5.1. Consistency o/ZFC + NMSC implies consistency o/ZFC +  a measurable cardinal".
Real valued measurability, some set-theoretic aspects 1283 The complete characterization of NMSC consistency strength meets similar difficulties like RV strongly compact. Mitchell made it more precise by proving consistency of hypermeasurable from NMSC. Together with original work of Solovay the above leads to a sort of more old style mathematical and less set-theoretic conclusion. ZFC +  total ? additive measure extending Lebesgue measure" is equiconsistent with ZFC + "a large portion of NMSC". Formulation "a large portion of NMSC" should mean that there must be some rather exotic nonmetrizable example of NMS consistent with ZFC +  total ? additive extension of Lebesgue measure". In this way "existence of total ? additive extension of LM" is closely related to "metrizability of all NMS". It is interesting that these properties remain related in weakened form. Concerning LM extensions first mention that Banach and Kuratowski proved: CH implies that LM can not be extended by even countable many sets. The following result of Carlson and Prikry solves the middle case with respect to measure extensions. Theorem 5.2. (a) If set theory is consistent, it is consistent to assume in addition that 2 " is anything reasonable and that given any < 2K(I sets of reals LM may be extended to measure all of them. (b) If the consistency of к weakly compact is assumed in addition then LM may be extended to measure 2K° sets. Beside PMEA people are using some weaker forms. CMEA (c Measure Extension Axiom). Given с subsets of c2 there is a c-additive measure extending the usual product measure and defined on those sets. WMEA (Weak Measure Extension Axiom). For any к < с, for any collection of к subsets of k2, the usual product measure on k2 can be extended to a A;-additive measure measuring these sets. Consistency of WMEA is proved from the consistency of ZFC, consistency of cMEA from weak compactness. THEOREM (Tall A984)). PMEA (cMEA) (WMEA) implies normal Moore spaces (of weight < c) (of weight < c) are metrizable. 6. Counting remarks What is so nice about a countable set? The fact that we can count it! Set counting is a sort of simplest measurement = determining the size of a set. Since Cantor discovered that infinity is not unique and that there is a vast array of sets (Va, a e Ord), each more numerous than its predecessors, one basic need remains: try to maintain size counting as
1284 A. Jovaiwvic simple as needed. This is usually accomplished with the axiom of choice (AC) implying that all sets are bijective to (initial) ordinals (alephs). In fact well orderability of arbitrary set means that it can be counted step by step, one element at each step, through transfinitely many steps if necessary, but this procedure "ends" certainly, in the similar way the counting of finite or countable sets is done. Thus 2 " = Nff+/(ff). The index function / is nondecreasing and satisfies cofinality condition cf(Kff+/-(ff)) > Cf(Kc). (From Cantor Theorem) f(a) ^ l,aeOrd.(CHo)/@)= 1 and (GCH <s>) f(a) = 1, a & Ord hold in L. /@) = 2 (for example) in Cohen's original model. Easton proved that for regular cardinals / can take arbitrary values (obviously respecting cofinality and nondecreasing conditions). So, for regular Ha, f(a) can be: 1,2, 3,..., ? + 1,..., ?\,... it can be ? + 1,..., ??+\,.... Generally f(a) ^ 2Ku. When /(?) = 2?? we could say that it is reasonably unbounded at a and then 2Ku = N2n„ . Quite surprisingly, Silver A974) proved that for singular cardinals displacement / depends on its values on predecessors: THEOREM 6.1. If к is a singular cardinal of uncountable cofinality then: 2"=?+??(?:) implies 2k = k+. (Here AE(k) stays for "stationary in k'\) Thus, if GCH holds AE below к it is true at к as well. More generally, the way f is bounded AE bellow к will determine the way it is bounded at k. Regular cardinals satisfying equation K„ = a are weakly inaccessible. Regular к is strongly inaccessible if ? < к implies 2Л < к. GCH implies that weakly inaccessible cardinals (wine) and strongly inaccessible cardinals (sine) coincide. Existence of inaccessible cardinal can not be proved in ZFC. Even, nicer, in ZFC + 1 sine" the consistency of ZFC is proved, hence by Godel theorem ZFC + 1 sine" is stronger theory. If к is the 1st sine, it is followed by 2nd, 3rd, ..., kth, ... then by hyper inac, hyper hyper, etc., quite exhaustive sequence of properties one stronger than the predecessors and with the stronger consistency strength. The inaccessibility properties have been iterated to postulate bigger, stronger and consistency-stronger cardinals, all the way up through the ordinal line. Nice journey through their world is given in Drake A974), Jech A978). Ulam A930) proved that measurable cardinal is sine. The results of Hanf, Scott, Keisler and Tarski in 1960s established that inaccessible bearing weakly compact are very large, but that measurable cardinals are even much larger. In fact if к is measurable with normal measure ? then {a < к | a e sine) has measure 1. The same with all of inaccessible properties, the same for weakly compact: {a < к \ a is WC) has measure 1. Similar concentration properties show strongly compact, supercompact, extendible cardinals, Vopenka principle and huge cardinals, scratching the ceiling of cardinal skies,
Real valued measurability, some set-theoretic aspects 1285 i.e., where there is no more room for iteration of cardinal properties, since they start producing inconsistency. Thus, position in the hierarchy of large cardinals, practically the coordinate on the Ord line corresponds to the consistency strength of a given property. In fact the Ord line serves as simplest space (line) where relative position of sets reflects their relative consistency strength. It might seem that there are uselessly many strong cardinal properties, that those things cannot exist or that the cost/benefit ratio is weakly promising when using them in "mathematics". However, if they serve in solving harder exercises or problems, or even more, e.g., allow us to characterize properties or problem solutions, then no doubt we will make no faster progress if we ignore them when they are involved!? Equalization or weighing the properties on the Ord scale tightly connects sets of mathematical properties with syntactic-language properties, with elementary embeddings of models of set theory, with combinatorial properties of binary measures or their ultrapowers incompressible (normalizing) elements. Strong axioms of infinity are nicely mapped in Drake A974), Jech A978), usually divided in small large, medium large (weakly compact) and large large cardinals (> measurable). Consistency of GCH with existence of measurable cardinal was proved by Silver (?,[?] ? GCH). In fact similar theorem to the Silver's for singular cardinals holds for measurable cardinals (the first restriction of Eastons freedom): THEOREM 6.2. If к is measurable (with normal measure ?) then 2V = v+ AE(k) implies 2k = k+ (f(v)=\ AE implies f(k)=\). Thus if GCH holds AE below к then it is true at к as well. More generally, the way f is bounded AE below к will determine the way it is bounded at k: f(k)Kot(j\(f(a),e)\ ? ' Kunen proved that 2k > k+ for measurable к is stronger property than measurability, i.e., allowing models with many measurable cardinals. That the size of continuum with total measure is big proved by Ulam A930): THEOREM 6.3. [f к is a real valued measurable (< 2K(I) then к is weakly inaccessible. Solovay proved that if к is real valued measurable then it is the kth inaccessible and Mahlo cardinal. Consequently if с is real valued measurable then: (a) с is cth cardinal; (b) 2K° = Кп+дО) = K/@) = N2tv or /(°) = 2*">thus / is unbounded at 0. In other words if 2K° is real valued measurable then 2K(I makes extraordinary jump in Ord. How peculiar it is, probably the best illustration is provided by the following theorem of Prikry.
1286 A. Jovanovic THEOREM 6.4. //2K° is real valued measurable then 2K = 2*° for all infinite ? < 2*4 Hence, the continuum displacement function /, first makes enormous jump at 0, then it is constant for all indexes smaller than its value at 0. 2«"=Ka+f{ah /@) = 2K° and 2K| = 2*2 = · · · = 2Ka = · · · = 2K» for a < 2*°, thus /@) = /A) = ··· = /(?) = ··· = 2?? fora<2K°, quite nice diversity from that / = 1 at GCH to this exceptional - strongly hyperbolic geometry of the universe in the presence of the total measure on reals. Thus geometrically we could say that the total measure on reals strongly (locally) curves the universe of mathematics. Here we list some examples of continuum displacement behavior influenced by nonregular ultrafilter. Let D be a jumping ultrafilter over K|5 and K^4 = K|5. If ? ???? ^ K>5 then 2*'5 < Кзо. If 2K° = ??+? then there is no jumping ultrafilter over N15. For displacement / in 2K" = На+ца), in circumstances similar to above for Ha, we have /(?) ^ a, thus / is well bounded. Remark. The measurement | |: V -> Cards is in fact a nice universal, totaly additive measure. Its restriction to Va+\ (a e Ord) ranges within the well ordered N^+,(a) and measures all subsets of Va. For regular | Va |, it is essentialy | Va \ = Д,-additive: X с Va, such that |X| = | Va\, is X = U/j<y УА an^ ? <^a tnen for some ? \??\ — |X|. 7. Some complementary results Reducing axiom of choice AC eliminates Vi tali's examples of Lebesgue nonmeasurable sets. Back in 1964 Solovay proved the following theorem, published in Solovay A970). THEOREM 7.1. Suppose that there is a transitive model o/ZFC + "There is a strongly inaccessible cardinal" (ZFC + /). Then there is a transitive model of ZF in which the following propositions are valid. A) The principle of dependent choice (= DC). B) Every set of reals is Lebesgue measurable (LM). C) Every set of reals has the property ofBaire. D) Every uncountable set of reals contains a perfect subset (P). E) Let {Ax I x e R) be an indexed family of nonempty sets of reals with index set the reals. Then there are Borelfunctions, h\, hi mapping К into R such that (a) {x I h\ (jc) ^ Ax] has Lebesgue measure zero. (b) {x I h2(x) ? ??] is of first category.
Real valued measurabllity, some set-theoretic aspects 1287 Mycielski and Swierczkowski A964) proved that if ZF + DC + ? has a transitive model, then ZFC + "There is a strongly inaccessible cardinal" has a transitive model. This shows that the inaccessibility is indeed needed in this theorem's assumption. Concerning ZF + DC + LM B), Solovay anticipated that the inaccessibility is not needed. The Theorem 7.1 is related and complemented with the following two theorems. THEOREM 7.2. Suppose that ZFC + / has a transitive model. Then so does the theory ZFC + GCH plus analogs of B) to E) of Theorem 7.1. Analogs are obtained by substitution of "set of reals " with "set of reals definable from a countable sequence of ordinals ". Thus the analog of B): B') Every set of reals definable from a countable sequence of ordinals is Lebesgue measurable. THEOREM 7.3. Assume the hypotheses of Theorem 7.1. Then there is a transitive model of ZFC in which 2H° = tt> and the analogs of B)-E) of Theorem 7.1 referred to in Theorem 7.2 are valid. Instead of 2K° = Щ one can have a similar theorem with 2s" = anything reasonable. With stronger assumption: there is a transitive model of ZFC + "there is a strongly inaccessible weakly compact cardinal", Solovay made a model of Theorem 7.3 in which Martin's axiom is true. Theorem 7.2 is proved in the model of Azriel Levy obtained in the following way. Let 9Л be the initial model: 9Л N ZFC + "There is a strongly inaccessible cardinal". Let ? be strongly inaccessible in 3??. For an ordinal ? define Px to be the set of functions / whose domain is a finite subset of ? ? ?, range (/) с ? and f({a,n)) < a if {a,n) edom(f). Order ?? with с. If ? e ЭК then (?*K* = ??. Let G be an OT-generic filter on ?? and let <Jt = 9K[G]. With very reach contrapunctus in cofinal set of lemmas developed on the techniques of Levy, Solovay proves the fuge. LEMMA 7.1. Let U be a set of reals in VI which is ЧЖ-R-definable. Then 9t N U is Lebesgue measurable. The other properties listed in Theorem 7.2 are proved to be true in 9? as well. Further, let 9?? = (HOD)''1. Solovay proves that reals and sequences of ordinals in УХ and 9?| are the same. DC holds in 9?| and if A e "Я\ then in 9?, A is iW-R-definable. In 9?? every set of reals has the property of Baire and every uncountable set of reals contains a perfect subset. Once again reconsidering Lebesgue measure (and its extensions) the closely related are the properties and their intensities: Axiom of choice, additivity, translation invariance and measure domain (i.e., total/partial on R).
1288 A. Jovanovic Some more subtle property combinations treated in Pincus and Solovay A977), led to the theorem of Pincus and Solovay. THEOREM 7.4. It is consistent with ZFC that there is an I* such that: (a) /* is a finitely additive translation invariant extension of Lebesgue measure to a measure on [0, 1]. (b) No ultrafilter on the set of reals is definable from ordinals, reals and I*. In the same paper it is shown that generally it is harder to obtain nonprincipal ultrafilters than nonprincipal measures in ZFC, ZF + DC. The following very important theorems are proved there. THEOREM 7.5. It is consistent with ZFC that no nonprincipal ultrafilter on ? is ODR but there exists a definable (in fact projective) nonprincipal measure on ?. THEOREM 7.6. It is consistent with ZF + DC + Hahn-Banach theorem that every ultra- filter on any set is principal. The last theorem, nicely complements the theorem of Luxemburg A969) which proved that in ZF Hahn-Banach theorem is equivalent to the existence of a finitely additive real- valued measure on every Boolean algebra. The model for Theorems 7.3 and 7.4 is the same, 9Л = L(r) where r is a random element of2<4>. 9Л is similar to the models of Solovay A971). Continue with the model У1 of sets hereditarily definable from ordinals, reals and F (the enumeration of orbits of countable restrictions of r). The measures used to prove these theorems are of the kind introduced in Solovay A971) but on different indexes. 8. Set-theoretic measure theory The research of so far mentioned problems expanded to the set theoretic measure theory. It is not easy to list all mathematicians with major contributions. The gigantic paper of Fremlin A993) integrated impressive collection of results since Solovay set the base, including comprehensive references and historic remarks. There has been general diffusion of set theory into measure theory with exhaustive discussion of fine details. The following two theorems from Gitik and Shelah A989) represented a major advance in measure algebra complexity description. THEOREM 8.1. Let к bean atomlessly measurable cardinal with witnessing probability v. Then the Maharam type of(k, Pk, v) is at least min(k{+w\ 2k). THEOREM 8.2. Let (X, PX, ?) be an atomless probability space with к = add(M). Then the Maharam type of (X, PX, ?) is at least min(J((+"'), 2k), and in particular is greater than k.
Real valued measurability, some set-theoretic aspects 1289 COROLLARY 8.1. With the assumptions of Theorem 8.2. Let (Z,3, ?,?) be a Radon probability space ofMaharam type ? ^ min(A:(+";), 2k). Then there is an inverse-measure- preserving function f:x—*Z. Corollary 8.2. With the assumptions of preceding corollary, there is an extension of ? to a k-additive measure on PZ. Fremlin further developed measure and integral calculus in the presence of real valued measurable cardinals. He gave a number of theorems with detailed characterizations involving all aspects of real valued measurability. The classification of indescribability hierarchy, combinatorics and partition properties related to the real valued measurability is given. A number of forcing absoluteness, and the Scott type reflection arguments known for binary measures are presented. Let us mention the following consequences. (a) Let к be an atomlessly-measurable cardinal and / a set of cardinal less than k, ? a second order formula, Ci,..., Q relations on / and ?\, ?,„ e /. Then for any random real p.o. set P, (/,Ci,..., ?,„)>=*> iff ll-p(/.C| \m)t<p. (b) Let Ci,..., C„ be relations and ??, ..., ?,? ordinals, all with definitions which are absolute for random real forcing. Let ? be a second order formula such that ZFC I- "for every atomlessly measurable cardinal к > max, <?„, ??, ^?\,...?„;?\,...,???)?=?". Then for every atomlessly-measurable cardinal к > max,^„, ?,, [a \a < k, (a; C\, ...,?,„)?=?} belongs to the RVM filter of k. Some examples of nice measure theoretic implications are given in the following theorems. THEOREM 8.3. Let к be a real valued measurable cardinal and ? a normal witnessing probability on k. Let (?, ?) be a Radon probability space and f: ? ? к -> a bounded function. Then J(j' ?(?,????)\?(??)^ J (j'/(?,?)?(??)\?(??). THEOREM 8.4. Let к be an atomlessly measurable cardinal, let Xq Xm-\ be Radon probability spaces ofMaharam type less than к and let f: f]KM *i -+ R be a bounded
1290 A. Jovanovic function. Let ? :m —» m be a permutation such that the two repeated integrals f(xo,...,xm-.l)dxm-i J... \dxQ, Г= / ( '"( / /(·?°'···'·?'"?)^·?^('"-1))··· )dx<r@) /(-(/¦ both exist. Then I = /'. THEOREM 8.5. Let (x, p) be a metric space. (a) X is Bore! measure-complete iff there is no real valued measurable cardinals. (b) If X is complete (as metric space) then it is Radon iff there is no real valued measurable cardinal ^ d(X). Further, impact of real valued measurable cardinals to the properties of p.o. sets is presented. Finally, Fremlin A993) presents discussion of quasi-measurable cardinals. References Banach, S. A923), Surle probleme de la measure. Fund. Math. 4. 7-33. Bing, R.H. A951), Metrization of topological spaces. Canad. J. Math. 3, 175-186. Bing, R.H. A965), A translation of the normal Moore space conjecture. Proc. Amer. Math. Soc. 16, 612-619. Chang, C.C. A967), Descendingly incomplete ultrafilters. Trans. Amer. Math. Soc. 126, 108-118. Chang, C.C. and Keisler, H.J. A973), Model Theory. North-Holland, Amsterdam. Cohen, P.J. A964), The independence of the continuum hypothesis I, II, Proc. Nat. Acad. Sci. USA 50, A963) 1143-1148,51A964) 105-110. Cohen, P.J. A965), Independence results in set theory. The Theory of Models, J.W. Addison, L. Henkin and A. Tarski, eds, North-Holland, Amsterdam, 39-54. Comfort, W.W. and Negrepontis, S. A974), The Theory of Ultrafilters, Springer, Berlin. Devlin, K.J. A973), Aspects of Constructibility, Lecture Notes in Math., Vol. 354, Springer, Berlin. Dickman, M. A. A974), Large Infinitary Languages - Model Theory, North-Holland, Amsterdam. Drake, F.R. A974), Set Theory, North-Holland, Amsterdam. Easton, W.B. A964), Powers of regular cardinals, Ph.D. Dissertation, Princeton University A964); Ann. Math. Logic 1A970), 139-178. Fodor, G. A966), On stationary sets and regressive functions. Acta Sci. Math. (Szeged) 27, 105-110. Fremlin, D.H. A993), Real-valued-measurable cardinals. Israel Math. Conf. Proc., Vol. 6, 151-304. Fleissner, W.G. A974), Normal Moore spaces in the constructible universe, Proc. Amer. Math. Soc. 46, 294-298. Fleissner, W.G. A975), When is Jones' space normal?. Proc. Amer. Math. Soc. 50, 375-378. Fleissner, W.G. A976), A normal collectionwise Hausdorff not collectionwise normal space. Gen. Topology Appl. 6, 57-64. Fleissner, W.G. A982a), Normal nonmetrizable Moore space from continuum hypothesis or nonexistence of inner models with measurable cardinals. Proc. Nat. Acad. Sci. USA 79, 1371-1372. Fleissner, W.G. A982b), If all normal Moore spaces are metrizable, then there is an inner model with a measurable cardinal. Trans. Amer. Math. Soc. 273, 365-373. Fleissner, W.G. A984), The normal Moore space conjecture and large cardinals. Handbook of Set-Theoretic Topology, K. Kunen and J.E. Vaughan, eds, North-Holland, Amsterdam, 733-760. Fleissner, W.G., Hansell, W. and Junnila, H.J.K. A982), PMEA implies Proposition P, Topology Appl. 13, 255- 262.
Real valued measurability, some set-theoretic aspects 1291 Godel, K. A938), The consistency of the axiom of choice and of the generalized continuum hypothesis. Proc. Nat. Acad. Sci. USA 24, 556-557. Godel, K. A939), Consistency-proof for the generalized continuum hypothesis. Proc. Nat. Acad. Sci. USA 25, 220-224. Godel, K. A940), The Consistency of the Continuum Hypothesis, Ann. Math. Studies, Vol. 3, Princeton Univ. Press, Princeton, NJ; printing 1966. Gitik, K. and Shelah, S. A989), Forcing with ideals and simple forcing notions, Israel J. Math. 68, 129-160. Hanf, W.P. A960), Models of languages with infinitely long expressions. Intern. Cong. Logic, Methodology and Philosophy of Science, Stanford University, Abstracts of contributed papers (mimeographed). Hanf, W.P. A964), lncompactness in languages with infinitely long expressions. Fund. Math. 53, 309-324. Hanf, W.P. and Scott, D.S. A961), Classifying inaccessible cardinals. Notices Amer. Math. Soc. 8, 455. Jech, T.J. A967), Non-provability of Souslin 's hypothesis. Comment. Math. Univ. Carolin. 8, 291-305. Jech, T.J. A968), ?, can be measurable. Israel J. Math. 6. 363-367. Jech, Т., Magidor, M., Mitchell, W. and Prikry, K. (unpublished). Precipitous ideals. Jech, T.J. A971a), Trees, J. Symb. Logic 36, 1-14. Jech, T.J. A971b), Lectures in Set Theory with Particular Emphasis on the Method of Forcing. Lecture Notes in Math., Vol. 217, Springer, Berlin. Jech, T.J. A978), Set Theory, Addison-Wesley, New York 1978. Jensen, R.B. A972), The fine structure of the constructible hierarchy, Ann. Math. Logic 4, 229-308. Jones, F.B. A937), Concerning normal and completely normal spaces. Bull. Amer. Math. Soc. 43, 671-677. Jones, F.B. A966), Remarks on the normal Moore space metrization problem. Ann. Math. Stud. 60, 115-120. Jovanovic, A. A980), Uniform measures over continuum. Teoria della misura e sue applicazioni, atti del convegno, Trieste, 127-129. Jovanovic, A. A981), On real valued measures. Open Days in Model Theory and Set Theory, Proc. of a Conference held at Jadwisin, Poland, W. Guzicki, W. Marek, A. Pelc and С Rauszer, eds, 145-152. Jovanovic, A. and Mijajlovic, Z. A993), On weak normality and selectivity ofultrafilters. Bull. Greek Math. Soc. 35, 3-9. Keisler, H.J. A971), Model Theory for Infinitan Logic, North-Holland, Amsterdam. Keisler, H.J. and Tarski, A. A964), From accessible to inaccessible cardinals. Fund. Math. 53, 225-308; corrections: ibid. 57, 119. Ketonen, J. A971), Everything you wanted to know about ultrafilters. but were afraid to ask. Dissertation, Univ. of Wisconsin, Madison, WI. Ketonen, J. A972a), On non-regular ultrafilters, J. Symb. Logic, 37, 71-74. Ketonen, J. A972b), Strong compactness and other cardinal sins, Ann. Math. Logic 5, 47-76. Ketonen, J. A973), Ultrafilters over measurable cardinals. Fund. Math. 77, 257-269. Ketonen, J. A974), On the existence of p-points. Ketonen, J. A976), Non-regular ultrafilters and large cardinals. Trans. Amer. Math. Soc. 2241. Ketonen, J. A976), Open problems in the theory of ultrafilters. Kunen, K. A968), Inaccessibility properties of cardinals, Ph.D. Dissertation, Stanford Univ., Stanford. Kunen, K. A970), Some applications of iterated ultrapowers in set theory, Ann. Math. Logic 1, 179-227. Kunen, K. A971), On the GCH at measurable cardinals. Logic Colloquium '69, R.O. Gandy and C.E.M. Yates, eds, North-Holland, Amsterdam, 107-110. Kunen, K. A980), Set Theory, An Introduction to Independence Proofs, North-Holland. Amsterdam. Kunen, K. A984), Random and Cohen reals. Handbook of Set-Theoretic Topology, K. Kunen and J.E. Vaughan, eds, North-Holland, Amsterdam, 887-911. Kunen, K. and Prikry, K.L. A971), On descendingly incomplete ultrafilters. J. Symb. Logic 36, 650-652. Kuratowski, K. and Mostowski, A. A968), Set Theory. North-Holland, Amsterdam. Kurepa, D. A935), Ensembles ordonnes et ramifies, Publ. Math. Univ. Belgrade 4, 1-138. Laczkovich, M. B002), Paradoxes in measure theory. Handbook of Measure Theory, E. Pap ed., Elsevier, Amsterdam, 83-123. Levy, A. A960), A generalization ofGodel's notion of constructibiliry, J. Symb. Logic 25, 147-155. Luxemburg, W.A.J. A969), Reduced products of the real number system. Applications of Model Theory to Algebra, Analysis and Probability, Holt Reinbout and Winston, New York.
1292 A. Jovaitovic Magidor, M. A974a), How large is the first strongly compact cardinal? Magidor, M. A974b), Combinatorial characterization of supercompact cardinals. Proc. Amer. Math. Soc. 42, 279-285. Magidor, M. A976), How large is the first strongly compact cardinal?. Ann. Math. Logic 10, 33-57. Magidor, M. A977), On the existence of nonregular ultrafilters. Technical report. Menas, Т.К. A973a), On strong compactness and supercompactness. Dissertation. Univ. of California, Berkeley. Menas, Т.К. A973b), On strong compactness and supercompactness. Thesis, University of California, Berkeley. Mostowski, A. A948), On the principle of dependent choices. Fund. Math. 35. 127-130. Mycielski. J. and Swierczkowski. S. A964). On the Lebesgne measurabiliry and the axiom of determinateness. Fund. Math. 54, 67-71. McAloon, K. A971), Consistency results about ordinal definability, Ann. Math. Logic 2, 449^467. Nyikos, PJ. A980), A provisional solution to the normal Moore space problem, Proc. Amer. Math. Soc. 78, 429^35. Prikry, K.L. A970), Changing measurable into accessible cardinals. Dissertationes Math. (= Rozprawy Mat.) 68. Prikry, K.L. A971), On a problem of Gillman and Keisler. Ann. Math. Logic 2. 179-188. Prikry, K.L. A975), Ideals and powers of cardinals. Bull. Amer. Math. Soc. 81, 907-909. Pincus, D. A974), The strength of the Hahn-Banach Theorem. Victoria Symposium on Nonstandard Analysis. Lecture Notes in Math., Vol. 369, 203-248. Pincus, D. and Solovay, R. A977), Definability of measures and ultrafilters. J. Symb. Logic 42 B), 179-190. Przymusinski, T. A975), A note on collectionwise normality and product spaces. Collect. Math. 33, 65-70. Przymusinski, T. and Tall, F.D. A974), The undecidability of the existence of a non-separable normal Moore space satisfying the countable chain condition. Fund. Math. 85. 291-297. Rowbottom, F. A964), Doctoral dissertation. University of Wisconsin. Sacks, G.E. A969), Measure-theoretic uniformity in recursion theory and set theory. Trans. Amer. Math. Soc. 142,381^420. Scott, D.S. A961), Measurable cardinals and constructible sets. Bull. Acad. Polon. Sci. Ser. Sci. Math. Astron. Phys. 7, 145-149. Shelah, S, A972), Every two elementarily equivalent models have isomorphic ultrapowers, Israel J. Math. Shoenfield. J.R. A967), Mathematical Logic, Addison-Wesley. Reading. MA. Silver, J.H. A966), Some applications of model theory in set theory, Ph.D. Dissertation. Univ. of California, Berkeley. Silver, J.H. A971), The consistency of the GCH with the existence of a measurable cardinal. Axiomatic Set Theory, D.S. Scott, ed., Proc. Symp. Pure Math., Vol. 13, Part 1, Amer. Math. Soc., Providence, RI, 391-396. Silver, J.H. A974), On the singular cardinals problem, Proc. of Intern. Congress of Mathematicians, Vancouver 1974, 265-268. Solovay, R.M. A970), A model of set theory in which every set of reals is Lebesgue measurable. Ann. Math. 92, 1-56. Solovay, R.M. A971), Real-valued measurable cardinals. Axiomatic Set Theory, D.S. Scott, ed., Proc. Symp. Pure Math., Vol. 13, Part 1, Amer. Math. Soc.. Providence. RI. 397^428. Solovay, R.M. A972), Strongly compact cardinals and the G.C.H., Proc. 1971 Tarski Symp. Solovay, R.M., Reinhardt, W.N. and Kanamori, A. A978), Strong axioms of infinity and elementary embeddings. Ann. Math. Logic 13, 73-116. Sierpiiiski, W. A938), Functions additives non-completement additives et functions non-measurables. Fund. Math. 30, 96-99. Tarski, A. A930), Une contribution a la theorie de la mesure. Fund. Math. 15, 42-50. Tarski, A. A945), ldeale in vollstandige Mengenkorpen 11. Fund. Math. 33, 51-65. Tall, F.D. A969), Set-theoretic consistency results and topological theorems concerning the normal Moore space conjecture and related problems. Thesis, University of Wisconsin, Madison, 1969; Dissert. Math. 148, 1-53. Tall, F.D. A974a), On the existence of normal metacompact Moore spaces which are not metrizable. Canad. J. Math. 26, 1-6. Tall, F.D. A974b), ?-points in ?? - ?, normal nonmetrizable Moore spaces and other problems of Hausdorff, TOPO-72, General Topology and its Applications. Proc. Univ. of Pittsburgh Topology Conf., 1972. R.A. Lo et al., eds. Springer, New York, 501-512.
Real valued measurability, some set-theoretic aspects 1293 Tall, F.D. A979), The normal Moore space problem. Topological Structures II, P.C. Baayen and J. van Mill, eds. Math. Centre, Amsterdam, 243-261. Tall, F.D. A982), Collectionwise normality without large cardinals, Proc. Amer. Math. Soc. 85, 100-102. Tall, F.D. A984), Normality versus collectionwise normalin. Handbook of Set-Theoretic Topology, K. Kunen and J.E. Vaughan, eds, North-Holland, Amsterdam, 685-732. Vopenka, P. and Hrbacek, K. A966), On strongly measurable cardinals. Bull. Acad. Polon. Sci. Ser. Sci. Math. Astron. Phys. 14, 587-591. Ulam, S. A930), Zur Masstheorie in der allgemeinen Mengenlehre, Fund. Math. 16, 140-150.
CHAPTER 32 Nonstandard Analysis and Measure Theory Peter A. Loeb Department of Mathematics, University of Illinois, 1409 West Green Street, Urbana, IL6I80I, USA E-mail: loeb@math.uiuc.edu Contents 1. Introduction 1297 2. Extending the real number system , 1297 3. Calculus 1300 4. Further principles 1303 5. Set-theoretic measure theory 1303 6. A lattice approach to measure theory 1305 7. Internal functionals on continuous functions 1312 8. Lebesgue measure 1314 9. Representing measures in potential theory 1314 10. Poisson process 1316 11. Anderson's Brownian motion . . , 1317 12. The martingale convergence theorem , 1319 13. On an infinite number of independent random variables , 1322 14. Exact law of large numbers and independence 1326 References . . . , 1327 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1295
This page intentionally left blank
Nonstandard anahsis and measure theory 1297 1. Introduction Infinitesimal numbers have been used by mathematicians for thousands of years as a means of expressing intuitive mathematical ideas. Such numbers were always considered to be a useful fiction since for a positive number to be infinitesimal, it would have to be greater than 0 and yet smaller than any positive number you might write using a decimal expansion. This fiction has always had its critics. In 1734, Bishop Berkeley called the infinitesimals of Newton's calculus foundations "ghosts of departed quantities." He was questioning the intellectual consistency of atheists who believed in infinitesimals. It was not until 1960, that the "fiction" of infinitesimals became a reality with the work of Abraham Robinson [28], Now, once Robinson's discovery is understood, one can use it to obtain new results in many areas of current mathematical research. The key to these applications is found in the strong intuitive feeling we have for the behavior of infinite mathematical structures based on an understanding of related finite structures. Using Robinson's work, we can transform finite constructions into infinite ones without losing the initial combinatorial properties. Here is an example: Brownian motion is the random motion of microscopic particles suspended in a liquid or gas. An important part of mathematical probability is the construction of mathematical models for this motion. An intuitive model for Brownian motion works with a particle performing a random walk with steps of infinitesimal length. For this model, one divides time into infinitesimal intervals; in each time interval, the particle moves in a straight line over an infinitesimal distance. At the end of each time interval, the particle chooses at random a new direction for its motion. Until recently, probabilistdid not know how to make a mathematically correct version of this construction, but nevertheless, they thought of Brownian motion in these terms. For them, it was a useful fiction. Now, as we shall see, it is a correct model for Brownian motion. 2. Extending the real number system Before saying more about this aspect of probability theory, as well as measure theory in general, we first discuss Robinson's extension of the real number system. A more detailed discussion can be found in the first two chapters of [12,24]. Just as the real numbers can be constructed from Cauchy sequences of rational numbers, a number system with infinitely small and infinitely large numbers can be constructed from sequences of reals. Sequences tending to 0 represent infinitesimal numbers, and sequences tending to +oo represent infinitely large numbers. Of course, one never works with equivalence classes of Cauchy sequences of rational numbers when doing real analysis. Similarly, the construction using sequences to extend the real numbers is for the most part irrelevant in applications. Since it is important to convey how one actually works with infinitesimals, we concentrate on what is relevant for applications. Any particular project in which mathematics is applied employs a universe built from a set X of individuals; perhaps X is the set of real numbers or the set of real numbers together with points in some topological space or some Banach space. These individuals may in fact be sets, but this is ignored in all subsequent considerations. The superstructure
1298 PA. Loeb V(X) based on X consists of all of the set theoretic objects one can form from X in a finite number of steps. All subsets of X are in V(X)\ also the set of all functions from X into the set of all subsets of X is an element of V(X), etc. For now, we assume that we are working with the set of real numbers R and the superstructure V(R); the general case is essentially the same. Each Borel subset of R is an element of V(R); the collection of Borel subsets of R is an element of V(R); every Borel measure on R17 is an element of V(R); etc. The point is, V (R) contains everything one needs to consider in doing ordinary real analysis. One augments R with additional points when working with some larger structure such as a topological space. What do we really know about V(R)? With the few exceptions of folklore and some theorems hidden in desks, our knowledge is contained in the books of any superb library. If we assume a common notation, that knowledge can be exhibited by a list of theorems. We think of V (R) as "reality", but our only hold on that reality is a collection of theorems. In this chapter, we need to make a distinction between reality and the strings of symbols that we use to describe reality. Let us start with the unrealistic assumption that every object in V(R) has a name. Of course, there is a difference between a name and an object. The symbol 5 is different from the number it names; for example, the Roman numeral V names the same object. The symbol sin is not a function; it is a symbol we use to name a function. Using the names of objects in V (R) together with quantifiers, brackets and other logical symbols, we form theorems about V(R). We assume that all possible theorems about V(R) are gathered together in a collection T. This is an idealized collection that contains every statement we actually know to be true about V(R). Robinson showed that the set R (or in general X) can be imbedded in a larger set *R (or *X) so that the following properties hold: A) Every name of an object in V (R) is associated as the name of a similarly constructed object in V(*R). (For example, the name "sin" names a function from *R to *R.) Names of elements of R name the same element when R is imbedded in *R. B) (Transfer Principle) The scope of the universal quantifier V can be limited so that when a theorem in ? is interpreted in V(*R) using the new meaning of the names that appear in that theorem, the theorem is again true. C) *R contains positive infinitesimal numbers, i.e., numbers that are larger than 0 but smaller than 1 /n for any ordinary natural number ?. The reciprocal of such elements are infinite or "unlimited" in that they are greater than ? for any ordinary natural number ?. To understand the Transfer Principle, we need to understand that "all" is a primitive notion. There is no way we can specify what we mean even by "all subsets of the natural numbers"; in terms of the language we actually use, we can only describe a countable number of such sets. It is possible, therefore, to cheat, and in a consistent way limit the meaning of "all" so that the Transfer Principle holds. Although *R is not unique, we will assume that we have fixed such a set so that all three properties are satisfied. Now we have two "realities" described by the theorems in T. Working just with T, we cannot distinguish between V(R) and V(*R). For this reason, intuition about the real number system can be transferred to the larger number system. At the same time, we can take note of the differences between the two systems and obtain
Nonstandard analysis and measure theory 1299 new information not codified in T. This means that we have two modes of viewing and working with V(*R). On the one hand, we see it as the usual system in which we do real analysis, and on the other hand, it is a larger number system. To say more, we need some terminology. Each object A in V(R) is called standard. The object with the same name and same formal properties in V(*R) is denoted by *A; *A is called the nonstandard extension of A. We do not use a star in denoting the elements of R when imbedded in *R. For example, 5 is the same number regardless of whether it is in R or*R. An object in the superstructure V(*R) is called internal if it is an element of the nonstandard extension of a standard object. This means, for example, that each element of *R is internal, but otherwise, this is not a helpful definition. The following facts are helpful. First, the nonstandard extension of a standard object is internal. Second, any object that can be described just in terms of objects already known to be internal is itself internal. We call such a description "an internal description." Every element of an internal set is internal. Noninternal objects are called external. In interpreting the sentences of ? in V(*R), we interpret "for all" as meaning "for all internal." For example, we interpret "for all subsets of R" in V (*R) as meaning for all internal subsets of *R. Let N denote the set of natural numbers. We call *N the nonstandard natural numbers. Where are the new elements? Using ? to denote "or", a theorem of ? states that V/ieN, /isC3 =>· n = lvn=2vn = 3. We interpret this sentence as saying V« e *N, ? <c 3 =>· ? = 1 ? ? = 2 ? ? = 3. This means there are no new elements in *N below 3. With similar reasoning, we see that it is necessary to go through all of the standard elements before we come to any new members of *N. How do we know there are any new elements? A theorem of ? states that VreR, 3/ieN with |r-«K 1. Since there are elements of *R greater than any ordinary natural number, there must also be elements of *N greater than any ordinary natural number. We call the usual natural numbers limited, and we call the new elements of *N unlimited. The set of unlimited elements of *N is denoted here by *NX\ it is an external set. To see that this is so, we note that if *N;x were internal, then the theorem of ? that says that every nonempty subset of N has a first element would force the existence of a first unlimited element m. In this case, N would contain a last element, namely m — 1, but this is impossible. It now follows that the set N of limited natural numbers is also an external subset of *N; if it were internal, we could describe *N;x with the internal description "*Noo := *N \ N". It is convenient that N is external, since the formal name for N is already used for *N. We call *R the nonstandard real numbers. By transfer of the properties of R, *R is an ordered field extension of R. What does *R look like? First, there are elements greater
1300 P.A. Loeb than any ordinary natural number; we call these the positive unlimited elements of *R. Multiplying each of these by — 1, we have the negative unlimited elements of *R. Any other element of *R is limited; this means that it is bounded in absolute value by some ordinary natural number. Among the limited elements are the infinitesimal elements. A number ? in *R is infinitesimal if |?| < 1 /? for each ordinary natural number n. Zero is the only real number that is infinitesimal. When two numbers a and ? differ by an infinitesimal, we write a ~ ?. Of course, a ~ 0 means that a is infinitesimal. If ? is limited in *R, it determines a Dedekind cut in R: (seR:Op).N R-" s < p]. This Dedekind cut, determines a unique real number r such that p — r. Thus, every limited element ? e *R is infinitely close to a unique real number r called the standard part of p. We write r = °porr = stp. We will write °p = +oo if ? is positive and unlimited, and °p = — oo if ? is negative and unlimited. Every real valued function / with domain А с R, has an extension */ mapping * A into *R. All rules for / apply to */. For example, V/oe*R, \?\ = ?J. It follows that for each a e A, *f(a) = f(a). That is, */ is a true extension of /. We will omit the star when */ is applied to points in A. We next give some examples due to Robinson of how the calculus works using these extended functions. 3. Calculus A function /defined on А с R is continuous at a point r e A, if and only if for each ? e *A with ? — r we have *f(p) — f(r). That is. if you change from r by an infinitesimal, you change the value of the function by an infinitesimal. If. for example, f(x) = .v, and ? ? ~ 0, then for each real number ?, (? + AxJ = x2 +2-x ¦ ?? + (??J ~ ?2. To see why this criterion for continuity holds, we assume that / is continuous in the usual sense and fix r e A. Fix ? > 0 and a corresponding ? > 0 in R so that the following is a theorem in T: Vie A, \s-r\<& => |/(i)-/(r)]<e. We interpret this theorem in V(*R) in terms of *A and */. If ? — r, then of course |p — r| < <5, and so \*f(p) - f(r)\ < ?. Since ? is an arbitrary positive real number, it follows that *f(p) — f(r). To see why the converse implication holds, we use a principle called "Downward Transfer." It says that if a sentence using only the names of standard objects is true for V(*R), then it must be true for V(R) since its negation cannot be in T. Here, we assume that for each ? e * A with ? ~ r we have *f(p) — f(r), and we fix an ? > 0 in R. Now the
Nonstandard analysis and measure theory 1301 following sentence is true when the symbols R, A, and / are interpreted as *R, * A and */, respectively. 3<S > 0 in R such that Vs e A, |s - r| <S =>· |/(i) -/(r)| < ?. The existence requirement is satisfied by any positive <5 ~ 0. It follows that what we have written is a theorem in T. A similar proof shows that a function / is uniformly continuous on a set А с R iff for each pair a, /3 in * A, a ~ /3 =>· */(<*) - */(?). Here is an important theorem due to Robinson; it generalizes to arbitrary topological spaces. THEOREM 3.1 (Robinson). A set А С R is compact if and only if for each ? e * A, f/?ere /j ana e A with ? — a. PROOF. Assume A is compact but 3p e*A not in the monad of any standard point of A. Then Va e A, |p - a\ is a limited, non-infinitesimal number, so 3<5„ > 0 in R such that \p — a\ ^ <5„. Since A is compact, there is a finite set {??,..., an] с A and numbers ? ? <5„ in R+ with <5, = <$<,, so that using л for "and", the following sentence holds forR: (Va-) [x e А л \x -a\\ ^?\ л · · · л \х -дя-|| ^5я-1 =>¦ ^ -?«? < ^«]· By transfer, |p - a„ \ < Sn, which contradicts the choice of <5„ — 5a„. It follows that if A is compact, then each ? e *A is infinitely close to some element of A. Conversely, if A is not compact, then it is easy to see that there is a sequences (On) of open sets forming a covering of A with no finite subcovering. For each /; e N, let ? (?) be an element of A not in the first ? sets Ot. Then we have (Vn)(Vit) [??????????^?(?)(???]. By transfer, for /j e *?^, ? = *?(?) i*Ok for any standard к eN. Since for each a e A, ЗА: e N with ? e *(?*, ? is not infinitely close to any standard point of A. D COROLLARY 3.2. A continuous function f on a compact set A is uniformly continuous on A. PROOF. If we have a ~ ? in *A, then there is an r e A with ? ~ r ~ /3, and so *f(a)~f(r)~*ftf). О Robinson's compactness criterion is used in his proof that a continuous function / defined on [0, 1] assumes its maximum value. First, fix ? e *Ncc, and consider the internal subset S of *[0, 1] described as follows: Vpe'R peS <==> 3k e *N with к sC ? and ? = —
1302 RA. Loeb That is, *=<1Д...,^,1 ? ? ? The set S is in internal one-to-one correspondence with an initial segment {1,2,..., H] of the nonstandard natural numbers *N. A set with this property is called a hyperfinite set, and we think of it as a finite set since it satisfies the formal definition of a finite set. That is, let Vf denote the finite subsets of R. A subset of *R is hyperfinite if and only if it is in *Vf- Any theorem about all elements of Vf is interpreted in V(*R) as a theorem about all hyperfinite subsets of *R. In particular, every hyperfinite set has a maximum element. To return to our problem, consider the hyperfinite set */? = |·/(^).·/(|) v(^).V(D Let */ (k/H) be the maximum element of this set, and let r = st(k/H). Then / takes its maximum value at r, since if s e [0, 1 ] and j/H is the next larger or equal point of S, then /<·>=·/(?)<·/(?)=/(?. Since the first and last terms are real numbers, f(s) ? f(r). The notion of a hyperfinite subset is applicable for the extension of any standard set, not just R. Hyperfinite sets are extremely useful since combinatorial results continue to be valid even when the external cardinality is infinite. The derivative of a function / at a point ? is not the ratio [f(x + ?*) — f(x)]/Ax, it is the standard part of this ratio. For example, if f(x) — x2, then the ratio is (x+AxJ-x2 2xAx + (AxJ = = 2x + Ax — 2x. Ax Ax The integral of a continuous function / on [a,b], is given as follows after fixing ? e *Noo and setting Ax = (b - a)/H: L b f(x)dx=st T,*f(a + l-tr{b-a))Ax Here, we have extended the summation operator to *Vf- In general, in applying the integral, one partitions the interval [a, b] over which one is integrating into subintervals [xi-\, Xi], where Axj =x,- -x,-\ ~0. One then approximates each part <2, of the quantity one wants with the /th term of the hyperfinite Riemann sum. Let us express the error in this approximation as the product e, - ?*, where e, e *R. As in [14], we want to have each ej ~ 0, so that \У2е' ' Ах'\ ^ (maxleil) · ? — ?) ~ 0.
Nonstandard analysis and measure theory 1303 4. Further principles We will from now on assume that we are working with a set X of individuals that contains R but may be larger than R. A principle of nonstandard analysis that generalizes the existence of infinitesimals in *R is the following: If a set S is an element of V(X), then there is a hyperfinite set 5" с *S such that for each a e S,*a e 5". One can say that S c. S' ?*S, and all of the containments are strict if S is infinite. To see that this principle forces the existence of infinitesimals, one can take S equal to the strictly positive real numbers, and let ? be the minimum element in 5". Here is a proof, which generalizes, showing that if А с R is not compact, then there is an element ? e*A that is not infinitely close to any element of A: Let {Oa} be an open covering of A with no finite subcovering. Let {?/?} be a hyperfinite set extending the collection {*Oa}. By assumption, there is a p e *A that is not in any ??? and thus is not in any *Oa. It follows that ? is not infinitely close to any element of A. Finally, to make nonstandard measure theory work, we need an additional property for our nonstandard models, called ? ? -saturation. This means that any ordinary sequence (A„: ? e N) of elements from an internal set ? is just the initial part of an internal sequence {A„: ? e *N) of elements from E. Every "ultrapower" model, for example, has this property. 5. Set-theoretic measure theory We now turn to the nonstandard measure theory initiated by the author in [ 16]. Working in an ? ? -saturated enlargement, we can construct a hyperfinite set Л as the set of elementary outcomes in a conceptual experiment in the "nonstandard world". For coin tossing, for example, Л can be the set of internal sequences of 0 and 1 of length ? e *Noc. Given such a hyperfinite set A, we can let С consist of all internal subsets of A. The collection С is an internal ?-algebra, but it is also an algebra in the ordinary sense. Suppose ? is an internal probability measure on {A,C). For the coin tossing experiment, for example, each internal set A would have internal cardinality \A\ and would be given the probability ?(?) = \?\/2? in *[0, 1]. In the general case, we can form a finitely additive real-valued measure ? on (A, C) with values in the real interval [0, 1] by setting ?(?) = ??(?(?)). We need to extend ? to a countably additive measure on the ?-algebra generated by C. To consider this extension, let's start with an arbitrary internal measure space (A, C, ?). We will assume that the internal measure ? takes only limited values, although a more general theory is also available. When A is a hyperfinite set, one usually sets С equal to the collection of all internal subsets of A, and ? is usually an internal probability measure. In general, С is an internal ?-algebra as well as an algebra in the usual sense in the set A. Let ? be the finitely additive, real valued measure on (A, C) defined at each A e С by setting ?(?) = ??(?(?)). Let a(C) denote the ?-algebra in the ordinary sense (using the ordinary natural numbers) generated by С The collection ?@ is, in general, an external collection of subsets of A. We can extend ? to a measure ?/, defined on the measure completion LM(C) of ?(?), and thus obtain a standard measure space (A, Ltl(C), ??.) on A, by using the Caratheodory Extension Theorem in the following way.
1304 RA. Loeb First note that when a sequence (A,-: /' e N), indexed by the ordinary natural numbers, consists of pairwise disjoint elements of С and the union A is also in С then A is actually a finite union since all but a finite number of the A, are empty. Here is the simple proof: Using ? ?-saturation, we extend the sequence (A,-: /' e N) to an internal sequence (A,: /' e *N); the set S:=|ine*N: AC [J A, | is internal and contains *?^, so, since *N;x is external, S must contain some standard natural number. It now follows that ?(?) = ???? М(Л)- By the Caratheodory Extension Theorem, the finitely additive measure ? has a unique ?-additive extension ?/, defined on the completion L?(C) of the ?-algebra a(C). Now (A,Lfl(C), ?^ is an ordinary finite measure space formed on the internal set A. We note that in much of the literature, (A, L?(C), ??.) is called a Loeb measure space and ?/, is called a Loeb measure. Given ? in ??(?), it is well known that for any ? > 0 in R, there are sets A eCs and В eC„ with А с ? с ? such that ??.(? \ ?) < ?. By saturation, we may assume that A and В are actually sets in C. To see this, we suppose that (Bn: ? e N) is an increasing sequence in С with limit B. We may extend the sequence to an internal increasing sequence indexed by *N, and choose an unlimited integer ? such that ??.(?/) = 11гпиеК^^(В„). A similar proof works for A. It also follows from saturation that there is a set С е С such that the symmetric difference ? AC is a null set in LM(C). These set approximation properties characterize L/((C). They have also been used by several authors to define LM (C). (See, for example, [ 1,29].) Given an *R-valued function / on A, we set °f(x) = st(f(x)) if f(x) is limited in *R. We set °f(x) = +oo if f(x) is positive and unlimited in *R, and we set °f(x) = -oo if f(x) is negative and unlimited in *R. It follows from the set approximation properties that a function ^:X->KU {+oo, —do) is measurable with respect to L/( (C) if and only if there is an internally C-measurable function / such that °f = g ??,-a.e. The function / is called a lifting of g. If such an / takes only limited values, then g is integrable, and by the set approximation properties 0 J /?? = J°fdvLL = jgdvLL. A) If / ^ 0, then for each ? e N, / л /; ^ /, so f"fdiiL ¦= \™? f{°f) ?????. = \imx f °(f ??)???. ? ° f f ??. It follows that Equation A) holds when |[?/|-(?/|?/?)]??~0 ?/,?*?^ B) Condition B) is a simple expression of the condition called S'-integrability in the literature. (See [3]; we have taken a different approach to arrive at this notion.) If, on the other hand,
Nonstandard analysis and measure theory 1305 we know that g is integrable, then it is easy to see that for a sufficiently small ? e *NX, replacing / with the function / - Хц/1^,,) gives an S-integrable lifting of g. That is, with this replacement, the internal integral of / is limited and Equation A) holds. Let X be the nonstandard extension of a compact Hausdorff space Y. For each у е Y, the monad of у is the set m{y):= ? *0. О open ye О Clearly, this definition generalizes the definition of a monad in the real line. Again, Robinson's criterion for compactness holds: A set A is compact if and only if each point in *A is in the monad of a point in A. For each Borel set В с У, set B:=\J{m(y): ye В}. Let ? be an internal Baire probability measure on X. As in [5,19], a Borel measure ? on ? can be defined by setting ?(?) = vl(B) for each Borel set В с ?. The measure ? restricted to the Baire sets is the standard part of ? with respect to the weak* topology on Baire measures. By Robinson's criterion for convergence and clustering, the correspondence between ? and ? yields a nonstandard approach to the weak convergence of measures. We shall take a lattice theory approach for the development of these facts. The above measure construction for hyperfinite coin tossing was used by the author in [16] as the underlying space for standard coin tossing. Similar constructions were used for Poisson processes in [ 16] and to produce representing measures for harmonic functions in [17]. The coin tossing space was then used by Anderson [3] to make rigorous the probabilist's intuitive treatment of Brownian motion as an infinitesimal random walk and the Ito integral as a pathwise integral. We will discuss these constructions below. First, in the next two sections, we will use a lattice approach to the integral to combine both set theoretic measure theory and the construction of measures on topological spaces. 6. A lattice approach to measure theory In classical analysis, the approach to general integration is either through measure theory or through functional analysis with a lattice formulation of the integral. Starting from the notion of "length", for example, one constructs Lebesgue measure and the class of Lebesgue measurable sets. Lebesgue measurable functions form an extension of the class of simple functions (linear combinations of characteristic functions of sets of finite measure), and the Lebesgue integral extends the obvious calculation for simple functions. The Riemann integral can also be viewed as a positive linear functional on the space of continuous functions with compact support; as such, it is represented by a measure. That measure is, of course, Lebesgue measure; it is obtained by extending the Riemann integral
1306 P.A. Loeb from the continuous functions with compact support to the class of measurable functions and then noting the action on the characteristic functions of measurable sets. With this approach the Lebesgue integral is constructed before Lebesgue measure. In the development to follow, we will extend a positive linear functional from an internal lattice of *R-valued functions to an external space of "measurable" real-valued functions. Moreover, ? ? -saturation will allow a development from first principles here with no use of the Riesz Representation Theorem. Indeed, the Riesz theorem will follow as a consequence of our construction. We assume throughout that the constant function 1 is a member of the internal lattice and has a finite integral. This corresponds to starting with a finite measure space or a compact topological space. The original development of these results for more arbitrary measure spaces and locally compact rather than just compact topological spaces can be found in [21,22]. A recent modification by Bianconi et al. in [7] produces a nonstandard approach to the Henstock-Kurzweil integral on a finite, closed interval of the real line. Recall that a vector lattice of functions on a set X is a vector space with a pointwise ordering. That is, (/ +g)(x) = f(x) +g(x), (af)(x) = a(f(x)) and / iC g if f(x) ? g(x) for all ? & X. A lattice is closed under the operations ? and л; i.e., the function / ? g defined by setting (/ ? g)(x) — f(x) ? g(x) at each ? e X is in the lattice; so is the function / л g = — (—/ ? — g). A linear mapping ? from one lattice to another is called positive if T{f) ^ 0 whenever / ^ 0; ? is called a functional if the range is in the scalar field. In our development here we will use the following notation: (a) X will denote an internal set in an ? ? -saturated enlargement of a structure containing the real numbers R. (b) L will denote an internal vector lattice of *R-valued functions on X. (c) / will denote an internal positive linear functional on L. We will assume that 1 e L and /A) is limited in *R. Example 6.1. The set X can be a hyperfinite set and L the space of all *R-valued internal functions on X. The functional / can be determined by uniform counting measure; i.e., 1(f) = A/\?\)??€? f(x) where |X| denotes the internal cardinality of X. Example 6.2. A more general construction starts with an arbitrary internal probability space (?, ?, v). (I.e., ? is an internal ?-algebra and ? an internal measure with v(X) = 1.) Here, L can be the space of internal Д-measurable simple functions and /(/) = f f dv for all / e L. Example 6.3. A third example is constructed on the internal compact set X = *[0, 1] with L the set of continuous functions on X. Here, / can be any positive linear functional on L; a prime example is the nonstandard extension of the Riemann integral: /(/) = *fofWdx- From the internal lattice L, we construct two external vector spaces Lq and L over the real numbers R.
Nonstandard analysis and measure theory 1307 Definition 6.4. The class of null functions Lo is the set of all internal and external *R- valued functions h on X such that for any ? > 0 in R there is a ? e L with | h | < ? and ? (?) ^ ?. The class L is the set of real valued functions / on X having representation / = ? + h for some ? e L and h e Lo- Given such a representation of an / e L, we set I(f) = °I(<p). First we need some preliminary results showing, among other things, that L is a lattice and / is a well-defined positive linear functional on L. Note that when ? e L П Lo, ?(?) ~ 0. Also note that because of our assumption that 1 e L and L(l) is limited, 1 e L. PROPOSITION 6.5. The sets Lo and L are vector lattices over R. /// is in L with f = <p+hfor<p e L andh e Lo, then 1{\?\) is limited in *R. Ifg is also in L with g = ? + к for \j/ e L and к e Lq, then (f ? g) - (? ? ?) e Lo and (/ л g) - (? л тД) e Lq. Moreover, if f =g (that is f = (p + h = \l/+k) then ?-? e Lf)L0, whence 1(f) = °?(?) = °?(?). It follows that I is a well-defined positive linear functional on L. PROOF. It is easy to see that Lq is a vector lattice and L a vector space over R. To show that ?(\?\) is limited, fix ? in L with \h\ ^ ? and ? (?) ^ 1. Then <р-у^/^<р + У- Since / is real valued, the internal set {« e ?: ? — ? ? ?] contains every unlimited element of *N and thus some limited element. Similarly, ? + ? ^ — ? for some ? e N. It follows, since /A) is limited, that ?(\?\) = ? (? ? 0) + ?(-? ? 0) is limited. Now fix ? > 0 in R, and ? in L with \h\ + \k\ ^ ? and ?(?) < ?. From the arbitrary choice of ? and the inequality (? V ?) - ? = (? - ?) ? (? - ?) ?? (? + h) V (? +k) = f V g <: (? V ?) + ? it follows that (/ ? g) - (? ? ?) e Lq. The rest is clear. ? The reader should note that we have assumed no continuity properties for the internal functional /. For standard Daniell or Radon lattice integration theory, one assumes at least that if /„ \ 0, then / (/„) \ 0. This corresponds to the assumption in measure theory that the original finitely additive measure is countably additive on the initial algebra. With K|- saturation, however, we can establish continuity for the external functional / in the form of a monotone convergence property; we need no continuity assumption on the internal functional /. THEOREM 6.6 (Monotone convergence). If {f„: ? e N) is an increasing sequence in L with real upper envelope F and sup /(/„) < +oo, then F e L and 1(F) = lim /(/«)- PROOF. By replacing /„ with /„ - f\, we may assume that each /„ ^ 0. By Proposition 6.5, we may fix ?? e L and hn e Lo for each ? e N so that /„ = ?„ + h„ and 0 ^ ?? ^ ?,?+1 -By the ? ? -saturation of our enlargement, there is a ?? e L with ?? ^?,,??? each ? e N and °?(?,?) = lim„epj °?(?„). We need only show that F - ?? e L0. Fix ? > 0 in R. Choose for each ? e Nai/r„ e L with \h„\ ^ ?„ and ?(?„) ^ ?/2". By ? ?-saturation, we may extend the sequence {?,,: « e ?) to an internal sequence [?,,: ? e *N} с L. We
1308 P.A. Loeb may also choose к е *Мзс so that 0 ^ ?„ and ? (?,,) < ?/2" when 1 ^ ? ^ ? in *N. Setting ? = ?*?=\ ??, we have ? (?) < ?. Now for each ? e N, ?,? - ? < <P„ - V» < <Ри + «» ^ F ^ A + ?)(?? + ?), so (<Pn -<??)-? ^ F -??????+A + ?)?. The rest is clear. D The next two results exhibit the close relationship between the internal lattice L and the external lattice L. This corresponds to the approximation of external but measurable sets by internal measurable sets in the set-theoretic approach to nonstandard theory in Section 5 and in [16]. Theorem 6.7 (Internal approximation theorem). A real valued function f on X is in L if and only if for each ? > 0 in R there exist functions ?\ and ?? in L with ?\ ^ / ^ ?? andI(\//2~ ?\) < ?, in which case, °?(?\)? 1(f) iC °?(?\) +?. PROOF. First assume / = ? + h e L with ? e L and h e Lq. Fix ? > 0 in R. For each ? e N, choose ?„ e L with \h\ ^ ?„ and ?(?„) < ?/?. Setting ?\ = ? - ?2 and ?2 =? +?2, we have ?\ ? / ^ ?? and /(?/? — ?\) < ?. Moreover, given any ?\ and ?? in L satisfying these conditions, we have for each ? e ?, ?\ — ?„ ^ ? ^ ?? + <р,г, whence °/?) - ?/? ^ °/(<p) = /(/) ^ °/(iA2) + ?/? ^ °/(???) + ? + ?/?. It follows that °/(i/f|)<c /(/) sC °/(?/?) +?. Assume now that / is an arbitrary real valued function on X for which there exists an increasing sequence {?„: ? e ?} с L and a decreasing sequence {?,,: ? e ?} с L with ?? ? f ? ?? and ?(?? —?„) < \/n for each// e N. By ? ?-saturation, we may extend both sequences to *N and choose a \j/n e L such that for each ? e ?, ?„ ^ ???^. ??, whence ?? — ?? ^ f — ?? ^ ?? — ??- It follows that f — ?? e Lq and thus / e L. D Given an *R-valued function g on X, we will let °g denote the extended-real valued function on X defined by setting °g(x) — st(g(x)) for each ? in X. Here, st(g(x)) equals +00 or —00 if g(x) is unlimited in *R. For any function g taking values in R or *R, we set g+ — g ? 0 and g~ = — g ? 0. PROPOSITION 6.8. If ? e L and ?(?) is limited in *R for each ? e X, then °? e L and ?(°?) = ??(?). PROOF. By assumption, °/(l) < +00. The proposition follows from the fact that for each ? > 0 in R, \°? - ?\ < ?, so °? - ? e Lq. ?
Nonstandard analysis and measure theory 1309 Definition 6.9. Let M+ denote the set of nonnegative, extended-real valued functions g on X such that for each и е N, g An е L, and set 7(g) = sup/(g ли) for all g in M+. Let ?? = {g- g+ e ??+ andg~ e ??+). For each g in M, set 7(g) = -/(g+)- У (g^) if at least one of the right hand values is finite. Let B={A?X: XAeM+) = {A<zX: ?? e L\. For each A in ? let ?(?) = J(xA). Note that in going from "integrable functions" to "measurable functions", we only need to worry about infinite values. If we do not make our initial assumption that 1 e L and / A) is limited, then this definition must be replaced with a more complicated one (see [21 ]). THEOREM 6.10. The collection В is ? ?-algebra in X and ? is a complete, countably additive, finite measure on (X, B). PROOF. The fact that ? is a ?-algebra and ? is countably additive follows from the monotone convergence theorem. Of course, for each A e ?, ?(?) ^ /A). The completeness of ? follows from the internal approximation theorem and the fact that a real valued function / = / + 0 in Lq is also in L. The rest is clear. ? The next proposition contains preliminary results concerning the space M; from these results it will follow that J(g) = §xgd^ for each g e M+. Note that in the above definition, we may replace the truncations g л ? with truncations g л / ??? arbitrary elements / ^ 0 in L since g л / = sup„g a f An; therefore, J(g) = sup{/(/): / e L, f^gl PROPOSITION 6.11. Fix g e ?+, ? e M+ and a ^ 0 in R. Then g + p, ag, gv ? and g ? ? are in M+. Moreover, J(g + p) = J(g) + J(p), J (ag) =aJ(g), and if g iC ? then J(g) ^ J(p)- Ifign- ? e N} is an increasing sequence in M+ with upper envelope G, then GeM+ and J(G) = supJ(g„). PROOF. For и eN, (g + p) An = [(g An) +(p An)] An e L,and for ? > 0, (ag) ли = a(g A n/a) e L. It is easy to seejhat J(g) + J(p) ^ J(g + p). The reverse inequality follows from the fact that if / e L and / sC g + ? then / л g ^ g and / - (/ л g) ^ p. The rest is clear. ? THEOREM 6.12. A nonnegative extended-real valued function g on X is B-measurable if and only if g e M+ in which case J(g)= f gdii. PROOF. Fix g in ? + and let A = {g > 1}; we will show that ?? e L. Let / = (g л 2) - (g A 1). Then / e L and ?? = lim(l л nf) e L\. Now for any positive a in R, ?? e L
1310 P.A. Loeb when A = {g > a] = {g/a > 1}, and by the monotone convergence theorem, the same is true for ? = 0. Therefore, if g e M+ then g is ?-measurable. The converse and the equality J(g) = f gdp are obtained from the corresponding facts for ?-simple functions. D Remark 6.13. In light of the above theorem, we may now call J an integral. If ? e L then for any ? e ?, °(—? ? ? л п) = -? ? °? л ? e L, so °? e M. We say that ? is ? "lifting" of °?. It may not be the case, however, that J(°\(p\) = °/(|<p|); ? may take a large infinite value on an internal set A with ? (? a) — 0, i.e., ? (A) = 0. Functions for which the standard part of the internal integral equals the integral of the standard part of the function form an important class in this integration theory. DEFINITION 6.14 (Variation of Anderson [3]). We say that ? e L is S-integrable ???(\?\) is limited and J(°<p) = °?(?). Proposition 6.11 shows that a limited valued (and therefore bounded) function ? is S- integrable; the proof is much simpler than that for the measure-theoretic case ([16]). The following criterion for the S-integrability of unbounded functions is an application of the usual procedure for extending integrals from bounded functions to unbounded ones, with the finiteness of the internal integral following as a bonus. PROPOSITION 6.15. A function ? in L is S-integrable if and only if for each ? e *N3C, ?(\?\-\?\??)~0. PROOF. We may assume ? > 0. By the definition of the integral and the equality of the standard part of the integral with the integral of the standard part for bounded functions, J С ?) = sup J (°? An) =sup°/(<p An) =lim°/(<p An). ,i я " Since °?(?) = °?(? - ? л п) + °?(? л п) for each n e *N, if for each ? e *Noc we have ?(\?\ - \?\ л ?) ~ 0 then for some finite /; e ?, °?(?) ^ 1 + °?(? An) < +oo, and moreover, lim„ °?(? An) = °I(?). The converse is clear. ? If / = / dv, then the condition given in some of the literature for ? to be S-integrable is that ?(\?\) is finite and fA \?\ dv^O for any internal set A with v(A) ~ 0. Note that we do not need to check the finiteness, and only the integral over sets of the form {\?\ ^ ?) need be checked. It then follows that for any A with v(A) ~ 0, fA \?\ dv ~ 0. This definition shows that there is a close connection between what is called "uniform integrability" of a family of functions and the S-integrability of the functions in the extension of the family. Uniform integrability for a family of measurable functions is the condition that ./]/,>„ I/I goes to zero uniformly as / ranges over the family. We next address the following question: Given g ^ 0 in ?, is there a lifting in L?
Nonstandard analysis and measure theory 1311 PROPOSITION 6.16. Given g ^ 0 in M, there is ? ?^? in L such that for each ? e N, (g л ?) — (? л п) е Lo, whence for some ? e *?;?, J(g) =sup7(g ли) = sup°/(<p ли) = °?(? A /j). 71 И If A is the ?-null set \J„€N{°(g A n) - °(<p л n) ф 0} аи<* ? = {g = +00} ? {> = +oo), f/геи g = °? < +oo ои X — (A U ?), /fg /ms a finite integral, then ? ? ? is S-integrable and ?(? U ?) = 0. (/и this case, ? ? ? is called an S-integrable lifting of g.) PROOF. We may choose sequences [?„: n e N} and [\j/„: n e N) in L so that 0 ^ ?„ ^ g An ^?,,,?,, ^ ?„+\ and /(i/r„ — <p„) < 1 /и for each n e N. Given к ^ w ^ ? in N, i/fm л ? ^ g л и ^ ?^ a n ^ <pm л ?. By ? ?-saturation, we may choose a <p e L so that for every m and и with m ^ ? in N, ??? A n ^ ? A n ^ <pm A n. Thus, VneN (g ли) -(? ли) e Lo. ? We now return to the case of Example 6.2. Example 6.17. Let X be an internal set, ? be an internal algebra on X, and ? an internal finitely additive measure on (?, ?) with v(X) finite in *R; for example, (?, ?, v) may be an internal probability space. Let / be the ?-integral on the class L of internal ?-simple functions. For each A in ?, let v\(A) = st(v(A)). Then ? is an algebra in the ordinary sense, and v\ is a finitely additive, real valued measure on A. The lattice construction produces a standard measure space {?, ?, ?) that extends the finitely additive space (?, A, v\); i.e., Л с ? and ?(?) = v\ (A) for each A e ? If ? e ? and ? > 0 in R, then from the existence of functions ? and ? in L with ? ^ ?? ^ ? and ? (? — ?) < ?, we obtain sets A\ = {? > 0} and A2 = {? ^ 1} in A. Clearly A|CfiCA2 and v(A2 - A,) iC ? (? - ?) < ?. Using ? ? -saturation, we may extend an increasing sequence A'\ and a decreasing sequence A'2 corresponding to ? = \/n and find а С е Л such that ?(??? = 0. These are the internal approximation results that characterize externally measurable sets in the measure- theoretic approach to this integration theory in [16]. Now, g on X is ?-integrable if and only if g has an S-integrable lifting, and for an S-integrable ? e L, ? °??? = ° ? ???.
1312 PA. Loeb 7. Internal functionals on continuous functions The topological example considered in Example 6.3 is based on the set X = *[0, 1]. The lattice L is the set of internal, continuous functions on X, and / is a positive linear functional on L. An example of / is the nonstandard extension of the Riemann integral. Instead of just the interval [0, 1 ], this section deals with an arbitrary standard set ? supplied with a compact Hausdorff topology ? in an enlargement of a structure containing ? and R. We assume that the enlargement is /c-saturated with ? ^ ? ? and к ^ Card(T), so that in particular, the enlargement is ? ?-saturated. We set X =*Y. Since the topology ? is Hausdorff, each point ? in X is in the monad m(y) of a unique standard point у in У; i.e., у = st(x). With each extended-real valued function g on Y, we associate the extended-real valued function g on X, where g(x) = g(st(x)). That is, we spread the value of g at a standard point over the monad of that point. The standard part map has played an important role in nonstandard measure theory; it's inverse appears here in the form of the mapping #—>¦?. For each set А с У, we set A equal to the union of the monads of the points in A. The set A = if-1 [A]. Moreover, Хл =Xa- We now fix an internal positive linear functional / on *C(Y), with /A) limited in *R (recall that C(Y) denotes the space of continuous, real valued functions on Y). We apply our general results and notation with L = *C(Y). In interpreting the following result, the reader should keep in mind that a subset of ? is compact if and only if it is closed. The result says that the inverse image with respect to the standard part map of each compact set in К is measurable. PROPOSITION 7.1. For each compact set К с. ?, К еВ and if aK=M{°l{*f): feC(Y), ??^/???}, then ?(?) — ??. PROOF. By saturation, there is a function ? e *C(Y) — L with ?*?: ^ ? ^ X? and °?(?) —aK. Given any / e C(Y) with ?? ^ f ^ 1, and given ? > 0 in R, we have V^Xjf <(!+?) ¦*/- It follows that ?? — ? e Lq, ?? e L, and ?{?) = ^??) = 0?{?) = ??. ? Since everything works nicely for compact sets in ?, they also work for the ?-algebra generated by the compact sets, i.e., the Borel sets. (In the more general theory in [21] for a non-compact space, one must work harder.)
Nonstandard anahsis and measure theory 1313 THEOREM 7.2. Let ? = {S с ?: Be В], and let ??(?) = ?(?) for each В e М. Then ? is ? ?-algebra in ? containing the Borel ?-algebra, and ?? is a complete, regular measure on (Y, Л4). A function g on ? is M-measurable if and only ifg is B- measurable on *Y, in which case, ifg ^ 0 then fY g ??? — / ^??. For each f e C(Y), PROOF. It is easy to see that ?4 is a ?-algebra, and we have just shown that it contains the closed subsets of Y. Therefore, it contains the Borel sets. To show that ? у is regular, choose В e ? and ? > 0 in R. Fix ? e L with 0 sC ? sC ?$ and °?(?) ^ J(Xb) - ?. Let К = {st(x): ?(?) > 0). This is the standard part of an internal set of near-standard points, so by a result of Luxemburg (see [12,25]), К is compact. Of course, К с В. Since ??(?) = ?(?)^°?(?), it follows that ??{?) is the supremum of the measure of compact sets contained in B. The same is true for ? \ B. This means that the measure ?? is regular. The relationships between the measures and the integrals on ? and *Y follow from the corresponding relationships for simple functions. If / e C(Y), then */ e L and / = °(*/) e L. Moreover, /- */ e L0. Therefore, fY f ??? = f^, J?? = °I(*f). О Example 7.3. If ? = [0, 1] and / is the extension of the standard Riemann integral on Y, then a real valued g on ? is Lebesgue integrable if and only if 'g = ? + h where ? e *C(Y) and h e Lq. In this case, the Lebesgue integral of g equals the standard part of the internal Riemann integral of ?. A bounded g is Riemann integrable if and only if for any ? > 0, there are continuous functions ? and q on [0, 1] with ? ^ g ^ q and fQ q — pdx < ?. It follows that g is Riemann integrable if and only if °(*g) e L (result of the author and A. Cornea.) If ? is a positive linear functional on C(Y), and / = *T, Then for all / e C(Y), fY f ??? = °I(*f) = T(F). Thus ?? represents T. This is the Riesz Representation Theorem for the Dual space of C(Y) when ? is compact. A similar but more difficult proof in [21 ] works for the continuous functions with compact support on a locally compact space. Numerous applications of nonstandard measure theory to problems in standard analysis have used the standard part map to convert an internal measure to a standard one. (See, for example, the articles of Anderson and Rashid [5] and Loeb [19], as well as the examples below on Lebesgue measure and representing measures in potential theory.) The standard measure on ? is the standard part of the internal measure with respect to the weak* topology. The above correspondence gives weak convergence results since if v„ —> v, that is, for all continuous /, f f dv„ —>¦ f f dv, then ? =st(vtl) for ? e *Мзс. Here the standard part of the internal measure ?? is taken in the weak* topology.
1314 P.A. Loeb 8. Lebesgue measure Another way to construct Lebesgue measure on [0, 1], is to use a hyperfinite partition {0, l/H, 2/H, ...,(#- \)/H} with equal mass \/{H + 1) at each point. One then forms the standard measure from this measure, and projects down by the standard part map. This is the construction of Lebesgue measure used by Anderson in his paper [3]. This measure generates a hyperfinite Riemann sum, so it is clear from the fact that the resulting standard measure on [0, 1 ] is the standard part of the internal one in the weak* topology, that the resulting measure is Lebesgue measure. Anderson later generalized the construction to Radon measures in [4]. 9. Representing measures in potential theory For this example, we work with harmonic functions on the unit disk in the complex plane С Let Dr denote the open disk [z e C: |z| < r], and let D = D\. Let C,- be the circle (геС: |г| = r), and let С = С ?. All measures we consider will be Borel measures. Let Р(г, x) be the Poisson kernel (|г|2— И2)/|г — х\г, and let xq denote the origin. The space ?) consisting of all positive harmonic functions on D taking the value 1 at xo is convex and compact with respect to the topology of uniform convergence on compact subsets of D, i.e., the ucc topology. It is well known that every continuous function on С has a harmonic extension on D and that not every harmonic function on D is obtained in this way. On the other hand, by the Riesz-Herglotz theorem there is for each h e ?' a probability measure v/, on С such that h = j P(z,-)vh(dz). The mapping ? м> Р(г, -) from С into W1 (with the ucc topology) is a homeomorphism. We may think of ?/, as a measure either on С or on the collection of harmonic functions {Р(г, -): z e С). The latter point of view is that of Martin boundary theory and Choquet theory. The simplest realization of Choquet theory deals with a triangle. Each point inside and on a triangle is represented by a unique affine weight on the extreme points of the triangle, i.e., on the vertices. For the compact, convex set ??, the extreme points are the functions {Р(г, -): ? e С), and each h eH] is represented by a unique probability measure vh on this set. While the usual construction of vh is simple for the disk, it does not generalize without going to an ideal boundary. The theory we have discussed in the previous sections does yield a generalizable construction of i>/, by extracting a measure from the function h that would otherwise be lost at the boundary of D. We give a brief description of that construction. More details can be found in the original work [ 17,18,20] from which this example is taken. First, we recall that for each point ? e Dr, there is a Borel measure ??, called harmonic measure, on the circle C,- that gives the value at ? of the harmonic extension of any continuous function on Cr - Moreover, normalized Lebesgue measure on Cr is harmonic measure ?[0 with respect to the origin jco- Given h e ?', the measures h - ???(), 0 < r < 1, are probability measures, and ?/, is the weak* limit as the radius r tends to 1. This
Nonstandard analysis and measure theory 1315 construction of vf, does not work for more general domains and potential theories, but a modification is valid in these more general settings. First we modify the construction of vh for the unit disk. Fix ZieW1. For each r < 1, let {AJ} form an interval partition of Cr, and choose y\ e AJ". Let 8Yr denote unit mass at the point y\. The net of measures ?? Ь(.у\)Ц-гХй(.А\) - <V converges in the weak* topology to the measure v/, on С The directed set for this net is constructed so that the direction is given by letting r tend to 1 and refining the partitions {AJ}. To see that v/, is in fact the weak* limit of this net, note that the integral of any continuous function / with respect to one of these measures with support in Cr is a Riemann sum approximation to the integral of / with respect to the measure h - ?? To obtain a construction that works in general, we do not use finite combinations of measures concentrated on the points of D, instead, we want combinations of point masses on the function space [0, +oo]D. Given r and a partition {AJ} of Cr, the function ? н» ??(??{) is a harmonic function on Dr. It is the solution of the Dirichlet problem for the function that is 1 on AJ and 0 on the rest of C,-. When we divide by ?\ (Ар, the new function is equal to 1 at the origin xr,- Let <5;r be unit mass on the function that is equal to ?\(AJ)^.A(AJ) in Dr and is identically 0 on and outside C,; the point mass <5;r is a measure on the function space [0, +oo]D supplied with the product topology. (The restriction of the product topology is the ucc topology on the set of positive harmonic functions on Dr taking the value 1 at xr,.) Again, the measures J^i Ь(у\)ц.гх (AJ) - <5[ have vh as a weak* limit as r approaches 1 and the partitions {AJ} are refined. The limit measure is on the set {P(z,-): zeCjcH1 C[0,+oo]D. This construction of 17, continues to work in quite general potential theoretic settings; it does not use the Martin boundary. The proof that this construction works was obtained in [18] by interpreting a construction of representing measures given in [17]; that construction was the first application after coin tossing and Poisson processes of the general measure theory from [16] described above. In [17], the measurability of the standard part map was used to replace the standardizations of internal measures with standard measures on the standard compact set ?). Here, for the special case of the disk D, is the construction from [17] of vn. We start with a circle Cr С *D with r ~ 1, and an interval partition {A;r} so fine that every standard harmonic function has infinitesimal variation on each set AJ. We choose a point y;r in AJ for each /. Suppressing the superscript r, we have V*eD, h(x)= f ^(?)???(?)-?^(??)???(?,)^~^-. JCr ??()(??) The family of weights */г(>,;)мЛA(А,) is made into an ordinary probability measure ?? using the general measure theory discussed above. The measure ?? is supported by the set of nonstandard harmonic functions ??{?\)/???(??). This is a set of positive, internal harmonic functions on Dr, with each function taking the value 1 at xr,. The mapping S on this set of functions given by the formula S(g)(x) = °(g(x)), V*eZ),
1316 PA. Loeb is essentially the standard part mapping with respect to ?' supplied with the ucc topology. The measurability of S, established for this special case by Loeb in [17] and generalized by Anderson and Rashid in [5], allows one to project the measure ??, onto ?'. The process preserves affine combinations of harmonic functions and yields representing measures, so by a corollary of a result by Cartier, Fell, and Meyer (see [17]), the final measure is the unique representing measure v\, on the extreme points {P(z, -): ? e C] of ?0. 10. Poisson process The Poisson process is a model for events that occur with frequency ? in time. Here is a way of using an intuitive model for this process to construct an appropriate probability space. It is taken from [16]. Let ? be an infinite factorial in *N. Note that for any standard rational number q, q - ? e *N. Divide the interval [0, ?[ into ?2 equal intervals, [0. \/?[, [?/?, 2/?[,..., [(?' — 1)/ ?, ?[. For simplicity, we fix a standard rational number ?, and we let ? = к - ? e *N. We let ? denote all internal ways that ? distinguishable points can be put into the ?- intervals [k/?, (k + \)/?[. That is, ? consists of internal sequences ? = (?,·: 1 ^ / < ?) with each ?, a nonstandard integer between 1 and ?-. Each point in ? has internal probability \/?'?. We now have an internal probability space (?, ?, ?) from which we obtain a standard probability space (?, Llt (?), ??.)- For any point ? = k/?, let ??(?) be the number of points that the outcome ? puts in the interval [0. kjr}\. It is relatively easy to show that for ?/.-almost all ?, ??(?) is finite for all finite ? and increases at most once in any monad. If we set ?,{?) = sup/VT(a>) for t e R and ? e ?, we get a standard Poisson process with parameter ?. Let us confine our calculation to showing that if t\ and ti are standard rational numbers with t \ < ь, and s = ti — 11, then the ??.-probability of the internal event A that there are exactly к points inside [t\, t2[ is given by the Poisson distribution. There are exactly s - ? subintervals inside [t\, fi[. and the ?-probability of any one of the ? points being put into [t\, ь[ is i/?//?" = s/? = ks/?. Therefore, using the binomial distribution to compute ?(?), we have ?(?) = У·' (у -k)\kl к .? J -?* г (ksy (?*) к I У - ?-Vv* к\ (?-ky.Y к Г ks ? 1 - ks ??-k ? J ks ? ? 1- ks -?-к (??? к\ ? J ¦IJLl(A). That is, ??^(?) is given by the Poisson distribution.
Nonstandard analysis and measure theory 1317 Each outcome of the Poisson process gives an increasing function that increases by 1 at each event. One can show that ?/.-almost surly, only a finite number of events occur in a finite time, and only one event occurs in a monad. (See [16].) Another model for the Poisson process tosses a biased coin at each time change j/?. The coin has very small probability of coming up heads; if it does, one considers that an event has occurred at that time. 11. Anderson's Brownian motion Here is a summary of Robert Anderson's construction of Brownian motion in [3]. Recall that Random variables X\,...,X„ are independent if It Vai,...,a„, P(X| <<*, A---AX„<a„) = Y\P(X, <<*,), ? where ? denotes the probability measure. A Brownian motion is a family of random variables [Xt: t e [0, 1]} such that A) X0 = 0, B) if s ? < t\ ^ S2 < t2 ^ ¦ ¦ -^ s„ < t„ are points in [0, 1], then the random variables Xt] — XSl,..., Xt„ — Xs„ are independent. C) for t > s in [0, 1], the probability that X, - Xs ^ a is ?(?/^/? - s), where ? denotes the standard normal distribution given by ?(?) = 1/V2tt · jtx e~' I dt. The distribution -?{?/?) is the normal distribution with mean 0 and variance ?2. Starting with an infinite coin toss {— 1, 1}'' where ? is an infinite factorial, ? denotes all 2'' internal outcomes of a coin toss, ? is the collection of internal subsets, ? is the internal probability measure, and (?, L?(A), ??.) is the complete standard probability space they generate. We divide *[0, 1] into ? intervals of length At = \/?. At each division point, we toss a coin and move right (if the coin is heads) or left (if the coin is tails) by Ax = s/At. Let Xi(a>) = wi for I ^ i ?. ?. Then for a given coin toss ?, our position at nonstandard time t e*[0, 1] is j Ui'l ?{?,?)=—???{?). Here, [-] denotes the greatest element of *N less than or equal to the argument. For any pair (t,w) e [0, 1] ? ?, let ?(?,?) = °?(?,?). Anderson shows in [3] that ? is a standard Brownian motion built on the nonstandard space ?. Here is a central result in his development; it uses the well-known Reflection Principle. We write C([0, 1]) for the standard space of continuous, real valued functions on [0, 1]. THEOREM 11.1. For iit-almost all ? e ?, ?(?,?) is limited for all t e *[0, 1], and 0(-.")еС([О,1])-
1318 PA. Loeb PROOF. Given m, ? e N, let ?„?? be the internal set defined using internal sups and infs as follows: ????? !o>e ?: 3/ <n with sup ?(?,?)- inf ?(?,?)> — >. reU/«.(/ + l)/"l ie[i/n.(i + l)/n| wj Then for ? = ? ? ?, ?.(?,„?)^??( sup ?(?,?)- inf ?{?,?)>\/??\ \e[0.l/n| re[0.l/n] / ? ^ ? - ? I max ?- 2m ? ? ¦ u\ max У^?, > ) + ? - ?( min > < Ыцл^ 2m/ \|<*<х^ ^«-????,¦> ^\+2«-??[>,<-^ (this is the Reflection Principle) 2m = 4„.? ]?>, /чД ?? 4и · 1 - *? ?? 2т = *-(?-*(?? by the transfer of the central limit theorem and the continuity of i/r. Therefore, An For ?? I{2m) > 1, /2? J^i/ilm) MZ.(^m«)^2« p+oa e 4Ldt =4n ¦ e *«·. •fill Cm) Let ?' = ? - Um (?„ ?««- Then ??,(?') = 1 - sup,„ inf„ ?/.№,«) = 1- Fix ? ? ?. If for some t e *[0, 1], ?(?,?) is unlimited, then ? e ???? for all standard m and и, whence ? g ?'. If for some s and r e *[0, 1] with s_/ we have °|?(?,?) - ?(?,?)| = ? > 0, then for w > 2/a, we have ? e i2m„ for all ? e N, whence ?? ?'. Now suppose ? e ?'. Then ?(?,?) is limited Vf e *[0, 1]. The variation of ?(-,?) on monads in infinitesimal; one says that ?(-,?) is "S'-continuous". Moreover, ?(-,?)
Nonstandard analysis and measure theory 1319 on [0, 1] is the standard part with respect to the uniform topology on C([0, 1]) of this internal S-continuous function on *[0, 1], so ?(-,?) is continuous. ? Anderson also shows in [3] that the projection of the coin tossing measure from ? onto the space of continuous functions is Wiener measure. Anderson continued his work with an application to the stochastic (or Ito) integral. The idea, initiated by Ito, is to form for each path an integral with respect to the changes ??(-,?) in the corresponding Brownian motion. Such a pathwise definition of the integral can not be obtained directly. However, there is no problem with the internal integral with respect to the random walk ? (·, ?). Anderson showed that the corresponding stochastic process equals the Ito integral. Extensions by many authors of Anderson's work on Brownian motion and the stochastic integral continues to this day. See for example [1] and related references. 12. The martingale convergence theorem We next describe the application of a general limit theorem of Jiirgen Bliedtner and the author to the martingale convergence theorem. The original work can be found in [8]. The general limit theorem established in [8] is an approach to the many theorems in analysis and probability theory that say that for each Borel measure ? and for some reference measure ?, the Radon-Nikodym derivative ??/?? is equal ?-almost everywhere to a limit along a directed set I of ratios /?,(?,?), /' e I, defined in terms of ? and ?. In [8], such limit theorems are reduced to an easier form. The essential idea is that the desired limit result is established for all ? once it is shown that for any measurable set ? and any ? with v{E) = 0, lim; /?,(?,?) = 0 ?-almost everywhere on E. Indeed, since the set of Borel measures is closed with respect to scaling, it is enough to show that limsup, Rj(v,a) ^ 1 ?-almost everywhere on E. The formulation of this result is simple and yet general enough so that it is easily applied in quite diverse settings. The technique is applied in [8] to several areas: Boundary limit theorems in potential theory, measure differentiation theorems, the martingale convergence theorem in probability theory. The last, is the topic of this section, but we will present a somewhat simpler form of the full result. We start with a standard probability space (?, Z, P) and an increasing sequence of ?- algebras (?„: ? e N) with ? = °<Х]„€щ?>,)- That is, ? is the smallest ?-algebra containing U„eN ?"· We let ^" and $ denote, respectively, the collections of all finite signed measures on (?,?„) and (?, ?). We will use the notation v\Zn for the restriction to Zn of a measure ? defined on a larger ?-algebra. Recall that an L1 -bounded martingale (/?„: ? e N) is a sequence of functions on ? such that for each ? e N, (i) h„ is Z„-measurable, (ii) ?„ e Sn where VA e Zn, ?„(?) := fA hndP, (iii) Vw > и, ???\?„ =?„, (iv) sup,, fA\h„\dP < +oo. The general technique established in [8] yields an easy proof of the following convergence theorem of Andersen and Jessen [2].
1320 PA. Loeb THEOREM 12.1 (Andersen-Jessen). Fix ? e S. For each ? e N, let ?? = ?\??, and let h„ be the Radon-Nikodym derivative of the nonsingular part of ? with respect to ? and ?n. Then the L] -bounded martingale (hn: ? e N) converges ?-almost everywhere to the Radon-Nikodym derivative of the nonsingular part of ? with respect to P. That is, ?? ??? — (?) = hm h„ (?) = hm —— (?) dP ? ? dP for ?-almost every ? e ?. For the proof, we need some notation and a result in the form of a "maximal inequality." Definition 12.2. For each ? e S, h'- denotes a fixed nonnegative version of ??{/?? (?,-measurable). Also, VMe<S, VAef, VikeN, Ak=\cueA: sup/if (?) > ll. PROPOSITION 12.3. For each ? eS,if AeEk,then ?(?*)> P(Akfl). PROOF. This result follows by taking a finite subset {к <к+ 1 < ··· <Hm)cN, setting ?-1 = 0, Bn = {? e A: h^+n > 1} - U"=-1 Bi forO^n^m, and noting that m . in I in \ n=0JB" n=0 \я=0 / The Anderson-Jessen result follows immediately from the general limit result from [8] discussed above and the following proposition. ? PROPOSITION 12.4. Fix ? e ? with P(E) > 0 and ? e S with v(E) = 0. Then limsupftj'(w) ^ 1 for P-a.e. ? e E. /eN PROOF. We will show that P(f\k=\ El) = °- Fix ? > 0. Since ? is generated by the ?,-, there is а к e N and an A e ?* such that (P + v)(AAE) < ?. By Proposition 12.3, P{Akv) ^ v{Akv), and by definition, ?(?) = 0, so, P(Ek) ^ P(E - A) + P(Ak) ? P(E - A) + v(A - ?) < ?. It follows that Р(Г(к=\ ?,*) = 0- D
Nonstandard analvsis and measure theory 1321 Not all martingales are generated by a measure. An example is "double or nothing": You start with one dollar and at each play you either double your money or lose everything. On the other hand, nonstandard measure theory provides a way to use Andersen and Jessen's result to establish Doob's [11] convergence theorem for L1 -bounded martingales. Fix an L'-bounded martingale (/?„: ? e N). We let (*?,?^, Pl) denote the standard, complete probability space formed from the internal space (*?. *?. *P) using our general measure construction. Similarly, for each ? e *N, we let (*?,??, ?„) denote the standard space formed from (*?,*?„,*?„). Fix an index ? e *Noc. The ?-algebra В = a(\JneNB„) generated by the collection {??„: /; e N) is a subset of ??. That is, for each n e N, ?„????„???. Let ? = ??\?. Given n e N, we have v„ = v\B„ since for each A in the generating algebra *?„, v„(A) = °{*?„(?)) = °{*??(?)) = ?(?). Moreover, for each A e *?„, ?„(?) = °(*?„(?)) = ° [ *hnd*P= ( °{*hn)dPL J A J A since lim f [\hn\-(\h„\Am)]dP=0. in-* oc J ? That is, *h„ is an S'-integrable lifting of °(*hlt). Since the extension of v„ from *?„ to B„ is unique, vn <<C Pl, and the Radon-Nikodym derivative dv„/dPL = °(*h„). By the Andersen-Jessen theorem, °(*/г„) —> dv/dPi ??,-a.e. on *?. Using Proposition 12.5 below, one can "push down" this result and show that the original sequence h„ converges P-a.e. on ?. Moreover, the limit is a function / such that °(*/) = dv/dPi on *? \ U, where U is a set of Pl -measure 0 in ??. PROPOSITION 12.5. Let (?, ?, ?) be a finite measure space and {/?,: /' e 1} a family of finite, measurable functions indexed by a countable, ordered set I с R+ with the ordering inherited from E+ and N С I. Let {* Y, LA (?). ?^.) denote the standardization of the space (*Y, *?, *?) in an ? ? -saturated nonstandard extension of a standard structure containing R and Y. For each i e I, let g( = °(*/г,). Then the functions hi, i e I, converge k-almost everywhere on ? to afunction h ifandonly if the functions gj, i el, converge ??-almost everywhere on *Y to afunction g, in which case g = °(*h) ki-a.e. on *Y. PROOF. If the /?, 's converge almost everywhere to h. then by Egoroff 's theorem, the g, 's converge almost everywhere to °(*h). For the converse, fix ? > 0 in R, m e N, and any finite set w = г ? < г'2 < · · · < i„ in I. Set A = {yeY: 3j, l^j^n. with |/г,„(у) - й,-;0>)| > ?}.
1322 PA. Loeb Set B = {xe°Y: 3j, 1 <;<;«, with \glj(x)\= +00 от \g,„(x) -gij(x)\ > ?}. Then *A с ?, and so ?(?) = ??.(*?) ^ kL(B). By Egoroff's theorem, hi(y), i e I, satisfies a Cauchy condition for ?-a.e. у eY. ? 13. On an infinite number of independent random variables For our last example of the application of nonstandard measure theory, we will discuss the recent work of Yeneng Sun in [31-33] dealing with an infinite number of equally weighted independent random variables. We begin by recalling the difficulties in the classical framework. Rather than speak of one individual tossing a coin over and over again, one would like to speak of an infinite number of equally weighted individuals, acting independently, who each toss a coin once. There are measure theoretic problems. One cannot put a uniform probability measure on a countable set, therefore, the set of individuals must be uncountable. Examples are Lebesgue measure on [0, 1], or the standardization, as discussed above, of an internal uniform measure on a hyperfinite set. Once one gets beyond a countable set, there are classical measurability problems. Here is a sketch of those problems. First a particular case. Let ? = R10·'1 be the space of real-valued functions onT= [0, 1]. Let ? be Lebesgue measure on T, and let ? be a product measure on ? constructed from a non-Dirac probability distribution on R. An example of the measure on R for coin tossing would be weights 1 /2 at 1 and -1. Note that the existence of such a continuum product measure is guaranteed by the Kolmogorov existence theorem. For t e ?, let f, be the tth coordinate function, i.e., ft((o) = w{t), where ? e ? is a function on T. Then, / is a real-valued process with a continuum of independent and identically distributed (iid) random variables. For a fixed ?, the function /(·, ?) on ? is denoted by /?; it is called a sample function. Here is Doob's 1937 result [10] for this case. PROPOSITION 13.1. Let h be any real-valued function on [0, 1]. Let Mh = [?: /? (r) = ?(?) = h(t) except for countably many t e ?}. Then Mf, has ?-outer measure one. PROOF. Let В be any measurable set in the continuum product ?-algebra on ?. The set ? is in the ? -algebra generated by a countable number of cylinder sets. Therefore, В is determined on a countable index set С in the sense that for any ? and ? in ?, ifa(t) = fi(t) for all t e C, then a e В if and only if ? e B. Now suppose that В contains ??/,. Take any ? e ?. Define ? so that it agrees with ? on С and with h on ? \ C. Since w'(t) = h(t) except for countably many t e ?, we have ?' e Mh, and thus ?' e B. On the other hand, since ? and ? agree on C, the property of С implies that ? e В also. This means that ? = ?, and hence the outer measure P*(Mh) of Mh is one. ?
Nonstandard analysis and measure theory 1323 Remark 13.2. If one takes h to be non-Lebesgue measurable, then so is every function in Mh- Thus, the outer measure of all non-Lebesgue measurable samples must be one. On the other hand, if one wants the sample means to be some number a almost surely, then one can take h to be the constant function taking the value a. Since P*{Mh) = 1, one can trivially extend ? to a measure ? with ? (Mi,) = 1, and then claim that almost all sample means are equal to a. Sun notes that the arbitrary choice of a simply makes the statement meaningless. The claim is actually based on a more absurd underlying statement, namely, that almost all sample functions take an arbitrary constant value a. The following proposition of Sun shows that no matter what kind of measure spaces are taken as the parameter and sample spaces of a process, independence and joint measurability with respect to the usual measure-theoretic product are never compatible with each other except for the trivial case. This also means that even if one chooses a very large ? -algebra on the parameter space of a nontrivial independent process so that all the sample functions are measurable, the process itself is still not jointly measurable. The proof can be found in Sun's original work, e.g., [33]. PROPOSITION 13.3. Let (?,?, ?) and (?,?,?) be any two probability spaces with a complete product probability space (I x X,I<g> ?,? <g> v), and let f be a function from ? ? ? to a separable metric space. If f is jointly measurable on the product probability space, and for ? <g> ?-almost all (z'|, b) e / x /, /;, and /,2 are independent, then, for ?-almost all i e /, f is a constant random variable, where f is the function on X defined by /(/, ·)· Yeneng Sun's solution to the measurability problems that occur with ordinary products is to work with the internal product of an internal index set and an internal probability space. He then takes the standardization of that internal product space. That is, he applies to the internal product space the general construction of a standard measure space described in the previous sections of this chapter. It follows from Keisler's Fubini theorem, given below in Theorem 13.6, that this space has the product properties one needs. Now one can speak of each element of the index set representing an independent random variable, or in applications, an independent individual. Also interesting is the fact, as we shall see, that notions of independence that are distinct in the finite case become essentially the same. To expand on these ideas, we must first discuss facts about the internal and external product spaces. Fix two internal probability spaces (?,?,?) and (Y, B, v). A set of the form ? ? ? with A e A and В e Bis called a measurable rectangle and A <g> В denotes the *? -algebra generated by the internal algebra of all * finite disjoint unions of measurable rectangles. The internal product measure of ? and ? on A <g> В is denoted by ? <g> v. Let A <g> В be the completion of A <S> В with respect to ? <g> v; the corresponding extension of the measure ? <g> ? is also denoted by ? <g> v. Analogous notation will be used for the standard product. The next result shows that the usual product ??(?) <g> LV(B) is contained in the product ??®?(?®?). The proof is taken from [26]. Proposition 13.4 (R. Anderson [3]). If ? e ??(?) ® LV(B), then ? e ??®?(?®?) and (? ® ?)?.(?) = ??. ® ??.(?).
1324 PA. Loeb PROOF. We use the notation ??? for the null sets of a measure ?. If N e NVL, then for any standard ? > 0, there is an internal measurable set В з N with the internal measure of ? < ?. It follows that ? ? ? e ?/"(?®,.);. Similarly, if N e ?/"?;, then ? ? ? e ?/"(?®„)?. Suppose that A e LM (Л) and ? e L,,(??). Then there exist sets U e A and V e В such that UAA e7VMt and VAfi e7V,7 . Since (?/ ? V)A(Ax fi)c((f/AA)x У)и (X ? (VAfi)) еЛА(м®,,);. we have ? ? ? e LM®,,(.4 <g> B) and, (?®?)?.(?? ?)~?®?(?/ ? V) = ?(?/) · ?( V) ~ ??.(?) ¦ vL(B) = Mi.® vL(A ? ?). It follows that LM(^)(g) L„(??) С ?,?®,,(? ®?) and (?®?)?. coincides with ?^. ®vl on LM(-4) <8) L^S). Finally, let N с ?? e ?,ДД) ® ?,,,(?) with ?^ ® ?>?.(??) = 0. We have seen that ? e Л/[М®„^. It follows that N is in the complete measure space Llt®v (A <g> B), and^®v)L(N)=0. ? Yeneng Sun has shown that the containment of LM (A)<§Lv (B) in the space LM®„ (A <S> B) is strict if and only if both ?^. and vi have non atomic parts. (See, for example, [33].) The original example showing strict containment is due to D. Hoover. We next give a modification of that example from [26]; it is due to D. Norman. Example 13.5. Fix ? e *?^. Let X be the set {1,2,.... H), and let ? be the internal power set of X. Let ? and ? be the respective counting measures on X and ?. Here, A and В are the internal subsets of their respective sets. Now, LM®,,(.4 <g> B) is formed from the internal counting measure on the internal product. The set F = {(i,S)eX ? Y: i e S} is internal, and by symmetry, the internal cardinality \F\ = ^|Х||У|. It follows that F e L^.iA® B),and ^®v)L(F) = \. We will show that F i LM(*4)®L„ (B) .Assume the contrary; then ? ? ®vl(F) = \. Now, there is a sequence of rectangles ?, ? ?, e.with /' e N and each A,- e L^A) and each ?, e Ltl(B), such that F is contained in the union, and the sum of the product measures ???\ ? ?. (A,)· vl(Bi) < 1. We may even assume that for each /, A, e A, and ?, e ?. As in Section 5, we will only need the rectangles indexed from 1 to some limited integer m to cover the internal set F. The complement of the union of these rectangles is also a finite union of internal rectangles, and for one of these, С ? D, we must have ??(?@) > 0 and st(v(D)) > 0. Now, \{yeY: VjceC, ? i y}\ =2W_|C| = |У| · 2_|C|. Since F П (С ? D) = 0, we have D с {>> e Y\ Vx eC,x <? y], so ?(?))<: ?-! = 2~|C1. \Y\
Nonstandard anah'sis and measure theory 1325 Since 5?(?@) > О, С is not a finite set. But then v(D) ~ 0, so st(v(D)) = 0. This is a contradiction. Even though in general we have strict containment of Lfl(A) <g> LV(B) in the space ??®?(?®?), we do have Keisler's Fubini theorem from [13] for ??®?(?? ?). THEOREM 13.6 (Keisler-Fubini). Let f : ? ? ? -> R be (? ® v)L-integrable. Then (i) For [iL-almost all ? e X, f(x, ·) is vi-integrable. (ii) ? ы>- fY f(x,y)dvL(y) is \xL-integrable. Ciu) fx fY f(x,y)dvL dpL(x) = /??? f ?(? ® v)L. PROOF. First we show that if А с ? ? ? has (? <g>i>)z. -measure 0, then for ??,-a.e. ? e X, the section A(x) = {y e Y: (x, у) е A) has vL-measure zero. Pick a decreasing sequence of internal sets E„ e A <g> В containing A with (? <g> ?) ? -measure going to zero. Let ? be the intersection. For each set En, the internal Fubini theorem holds, so (? ? ?)?. (Ел) = st(ji ® ?)(?„) = ° ? ?(?„(·?)??(.?) = J °?(??(?)) ????) = (' vL {?„(?)) ????). For each ? e ?, the sequence VL{En(x)) is decreasing, and the integrals go to 0. Moreover, for each n, 0 ^ vL{E{x)) ^ у^(?„(л:)). Therefore, vL{E(x)) = 0 ??,-a.e. Since А с Е, A(jc)c ?(jc)forall;c e X, and when vL (?(*)) = 0, A(x) e ?,,,(?) and МА(*)) = 0. Now pick an /S-integrable lifting F of /, and set G() = / F(-, y)dv(y). We need to show that for ??,-a.e. x, the function F(x, ·) is an /S-integrable lifting of f(x, ·) and that G is an /S-integrable lifting of g := //(·, y)dvi(y). This will imply the first two parts of the theorem, and the last part follows from the calculation I f(x,y)d(ji®v)L=0 ? ???®? = ° ? ??)??(?)= /g(*)d/iz.W = ft /"/(*, y)dvz.()')Wz.(*). It follows from the first part of the proof that F(x, ·) is a lifting of/(jc, ·) for ??,-a.e. x. For the S-integrability, we let Fm = —m ? F A m for each m e N. Then, as in the discussion in Section 5, 0 ii / l· J I F - F„, \dv\ ???. ? ° ? [( J | F - Fm \??\??\ = ° /*|F-Fm|d/i®v->-0
1326 PA. Loeb as m —>¦ oo. This means that for ? t-a.e. x, ° f \F — F„,\dv —>¦ 0, and for such an ?, F(x, ·) is S-integrable. A consequence of this is that for ??. -a.e. jc, °G(x) = ° ? F(x,y)dv(y) = ? f(x, y)dvL(y) = g(x), whence G is a lifting of g. If G,„ — f F„,(x, y)dv(y), then \G\ ? m, and 0^° j\G-G,„\d??a j(j\F-F),\dv\d?^O as w —>¦ oo. It follows that G is S-integrable, and the theorem is proved. D 14. Exact law of large numbers and independence Recall that for if random variables X and ? are independent, then the covariance cov(X, Y) = E((X - EX)(Y - ??)) = ?(??) - EX ?? = 0 since E{XY) = EXEY. Also recall the law of large numbers deals with a sequence of random variables X, on a probability space. If the X, are independent with the same distribution and have finite mean ?, then the new sequence of random variables (?','=\ ??)/? tends to the constant random variable ? a.e. This means that for almost all points у of the probability space, the value of the sequence at у tends to the constant ?. For our discussion here, (?,?,?) and (?, ?, ?) are internal probability spaces, and (? ? ?, ? <g> ?, ? ® ?) is the relevant internal product space. Their associated standard measure spaces are respectively (?, L-,??), ??.). (*2. Lp(A), Pi.), and (? ??, L-A®p(T® ?), (? <g> P)z.). All processes on the latter space are assumed to be measurable with respect to the ?-algebra Lk®P{T <& A). Definition 14.1. A real-valued process / on the space (? ? ?, L-A®p(T ® ?), (? ® P)z.) is said to satisfy the law of large numbers if for almost all sample realizations ? e ?, the mean of the sample function f@ on the parameter space ? is essentially a constant. Remark 14.2. When / satisfies the law of large numbers, the relevant constant must be equal to the mean of / viewed as a random variable on the joint space of parameters and samples. This means that for Pi.-almost all ? e ?, fTfC0(t)dkL(t) = ffTxn fd(k®P)L- Note that there is no requirement of identical distributions in the following result due to Sun (e.g., see [33]). THEOREM 14.3. Let f be a real-valued square integrable process on the product space (? ? ?,??®?(? <g> ?), (? <g> P)l)- If the random variables ft are almost surely uncorrelated, then f satisfies the law of large numbers, where the almost sure uncorrelatedness condition means that for (? <g> k)L-almost all (t\,ti) ?? ? ?, ftl and ft2 are uncorrelated.
Nonstandard analvsis and measure theorx 1327 PROOF. This is a consequence of Keisler's Fubini theorem, Theorem 13.6, and the condition of uncorrelatedness. That is, writing f, for f(t, ·), we have (/?@-?/,)??? dPL ( (/(??,?)- Ef,,)dkL{t\) J (/(h,")- Eft2)dkL(t2)\dPL f {f(t\,a>)- Eftl)(f(t2,o>)- Eft2)dkL(ti)dkL(t2)\dPL ( (/(f,, ?) - ?/„ )(/(f2, ?) - Eft2) d(k ® k)LjdPL = jj (J (ft, - Eft] )(/„_ - Efh)dPL\ d(k ® ?)?. = 0. Thus, for PL-almost all ией, /?(/?@ - Ef,)dkL = 0, so ?/? = ?/. ? A classical example of Bernstein shows that there are three events that are pairwise independent but not mutually independent. Similarly, pairwise independence is also strictly weaker than mutual independence for a finite collection of random variables. We finish with statement of Sun's striking result (e.g., see [33]) showing that pairwise independence is, in fact, almost identical to mutual independence in an ideal setting. THEOREM 14.4. Let f be a process from (? ? Q,LA®PG ® A),(k ® P)l) to a separable metric space X. The random variables f, are almost surely pairwise independent if and only if they are almost mutually independent, i.e., for any ? ^ 2, ft,, ft2, ¦ ¦ ·> ft,, ore mutually independent for (k")L-almost all (t\, ti, ¦ ¦ ·, t„) e T". References [ 1 ] S. Albeverio, J.E. Fenstad, R. H0egh-Krohn and T. Lindstr0m, Nonstandard Methods in Stochastic Analysis and Mathematical Physics, Academic Press Series on Pure and Applied Mathematics, Vol. 122, Orlando A986). [2] E.S. Andersen and B. Jessen, Some limit theorems on set-fimctions, Danske Vid. Selsk. Mat. Medd. 25 E) A948), 1-8. [3] R.M. Anderson, A non-standard representation for Brownian motion and ltd integration, Israel J. Math. 25 A976), 15^46. [4] R.M. Anderson, Star-finite representations of measure spaces. Trans. Amer. Math. Soc. 271 A982), 667- 687. [5] R.M. Anderson and S. Rashid, A nonstandard characterization of weak convergence, Proc. Amer. Math. Soc. 69A978), 327-332. [6] L. Arkeryd, Loeb solutions of the Boltzmann equation. Arch. Rational Mech. Anal. 86 A984), 85-97. [7] R. Bianconi, J. Prandini and С Possani. A Daniell integral approach to nonstandard Kurzweil-Henstock integral, Czechoslovak Math. J. 49 A24) A999), no. 4, 817-823.
1328 P.A.Uieb [8] J. Bliedtner and P.A. Loeb, A reduction technique for limit theorems in analysis and probability theory. Ark. Mat. 30A) A992), 25-43. [9] N.J. Cutland and Siu-Ah Ng, The Wiener sphere and Wiener measure. Ann. Probab. 21 A) A993), 1-13. [10] J.L. Doob, Stochastic processes depending on a continuous parameter. Trans. Amer. Math. Soc. 42 A937), 107-140. [11] J.L. Doob, Stochastic Processes. Wiley, New York A953). [12] A.E. Hurd and P.A. Loeb, An Introduction to Nonstandard Real Analysis. Academic Press, Orlando A985). [13] H.J. Keisler, An infinitesimal approach to stochastic analysis, Mem. Amer. Math. Soc. 48 B97) A984). [14] H.J. Keisler, Elementary Calculus, An Infinitesimal Approach. 1st edn A976): 2nd edn, PWS-Kent A986). [15] H.J. Keisler, Infinitesimals in probability theory. Nonstandard Analysis and its Applications. N.J. Cutland, ed., Cambridge University Press, Cambridge A988). 106-139. [16] P.A. Loeb, Conversion from nonstandard to standard measure spaces and applications in probability theory. Trans. Amer. Math. Soc. 211 A975), 113-122. [17] P.A. Loeb, Applications of nonstandard analysis to ideal boundaries in potential theory. Israel J. Math. 25 A976), 154-187. [18] P.A. Loeb. A generalization of the Riesz-Herglotz theorem on representing measures. Proc. Amer. Math. Soc. 71A) A978), 65-68. [19] P.A. Loeb, Weak limits of measures and the standard part map. Proc. Amer. Math. Soc. 77 A) A979), 128-135. [20] P.A. Loeb, A construction of representing measures for elliptic and parabolic differential equations. Math. Ann. 260A982), 51-56. [21] P.A. Loeb, A functional approach to nonstandard measure theory. Proceedings of the S. Kakutani Retirement Conference, Contemp. Math. 26A984), 251-261. [22] P.A. Loeb, Measure spaces in nonstandard models underlying standard stochastic processes. Proceedings 1983 International Congress of Mathematicians in Warsaw, PWN, Warsaw A984), 323-335. [23] P.A. Loeb and H. Osswald, Nonstandard integration theory in topological vector lattices. Monatsh. Math. 124A997), 53-82. [24] P.A. Loeb and M.P.H. Wolff, Nonstandard Analvsis for the Working Mathematician, Kluwer. Amsterdam B000). [25] W.A.J. Luxemburg. A general theory of monads. Applications of Model Theory to Algebra, Analysis, and Probability, W.A.J. Luxemburg, ed.. Holt, Rinehart, and Winston, New York A969). [26] H. Osswald. Measure and probability theory and applications. Nonstandard Analysis for the Working Mathematician, P.A. Loeb and M. Wolff, eds, Kluwer, Amsterdam B000). [27] E.A. Perkins, A global intrinsic characterization of Brownian local time, Ann. Probab. 9 A981), 800-817. [28] A. Robinson, Non-Standard Analysis, North-Holland, Amsterdam, 1966. [29] K.D. Stroyan and J.M. Bayod, Foundations of Infinitesimal Stochastic Analysis. North-Holland Studies in Logic. Vol. 119. Amsterdam A986). [30] K. Stroyan and W.A.J. Luxemburg, Introduction to the Theory of Infinitesimals. Academic Press, New York A976). [31] Y.N. Sun, Integration of correspondences on Loeb spaces. Trans. Amer. Math. Soc. 349 A997), 129-153. [32] YN. Sun, A theory of hyperfinite processes: the complete removal of individual uncertainty via exact LLN. J. Math. Econom. 29 A998), 419-503. [33] Y.N. Sun, Conventional operations on nonstandard constructions. Nonstandard Analysis for the Working Mathematician. P.A. Loeb and M. Wolff, eds, Kluwer, Amsterdam B000).
CHAPTER 33 Monotone Set Functions-Based Integrals Pietro Benvenutia, Radko Mesiarb, Doretta Vivonaa aDipartimento di Metodi e Modelli Matematici, per le Scienze Applicate. Universita degli Studi "La Sapienza ", Via A. Scarpa. 16. 00161 Roma. Italy- E-mail: benvenut@dmmm.imironial.it. vivona@dmmm.uinroinol.it Slovak Technical University, Radlinskeho 11. 81368 Bratislava. Slovakia and Systems Research Institute. PAN, ul. Newelska 6. 01-447 Warszawa. Poland E-mail: mesiar@leo.svf.stuba.sk Contents 1. Introduction 1331 2. Choquet and Sugeno integrals 1333 2.1. Preliminaries 1333 2.2. Basic and simple functions 1334 2.3. Choquet integral 1336 2.4. Sugeno integral 1340 2.5. Similarity of Choquet and Sugeno integrals 1342 3. Pseudo-additions and pseudo-multiplications 1343 3.1. Pseudo-addition ® 1343 3.2. Pseudo-difference ? 1345 3.3. Pseudo-multiplication О 1345 3.4. Additional conditions with pseudo-ring structures 1348 4. General fuzzy integral 1352 4.1. Integral of simple functions 1354 4.2. Integral of measurable functions 1358 4.3. Additional properties of integral with pseudo-ring operations 1361 4.4. Extension of the integral to [-F, F] 1363 5. Examples 1364 6. Conclusions 1376 Acknowledgement 1377 References 1377 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1329
This page intentionally left blank
Monotone set functions-based integrals 1331 1. Introduction The classical probability theory works with ?-additive measures and one of its most important tools is the Lebesgue integral. The Lebesgue integral is a monotone non-negative linear functional on the space of bounded measurable functions; its construction is strongly linked with the additivity of the underlying measure. However, the additivity of a measure may be rather restrictive when modeling several real situations, e.g., the weights of criterion groups in multi-criteria decision making. The interaction phenomena put them out of the framework of additive measures. Further, additive measures are closed under (non-negative) linear combination. However, the application of some other aggregations of measures, e.g., min or max value of two measures of every measurable set, may violate the additivity. To avoid the above mentioned problems, several generalizations of classical probability measures have been proposed so far, such as pseudo-additive measures, possibility measures, belief functions, A:-monotone capacities, к -order additive measures, etc. All these generalizations are covered by fuzzy measures, i.e., monotone set functions vanishing at the empty set (with some continuity properties when the universe is infinite). These set functions are sometimes called also premeasures. When dealing with a pseudo-additive measure, constructions similar to the one of Lebesgue integral were used (Riecanova A982), Weber A984, 1986), Marinova A986), Sugeno and Murofushi A987), Ishihachi, Tanakaand Asai A988), Pap A990), Kolesarova A993)). However, for a general monotone set function (fuzzy measure) we cannot get an acceptable integral applying the above mentioned construction methods. Therefore, some modified integral construction has to be applied, all adapted to the most expressive property of a general fuzzy measure: its monotonicity. There have been a number of contributions dealing with integrals based on a general monotone set function (fuzzy measure), which for the sake of brevity will be called fuzzy integrals through this chapter. The fundamental one and the most important of them is known as "Choquet integral" after its introduction in the fuzzy literature by Schmeidler A986). Really this integral has been defined first by Vitali A925) using an outer measure and successively by Choquet A953-54) in the field of capacities. De Giorgi and Letta A977) and independently Greco A977) have given the definition of the integral with respect to a general fuzzy measure and pointed out his fundamental property: the horizontal additivity. The Choquet integral is defined for non-negative functions; it is comonotone additive (i.e., additive for comonotone functions) and homogeneous with respect to the multiplication by non-negative reals. For any event A, the Choquet integral of the characteristic function 1 a with respect to any fuzzy measure ? is just the measure ? (A) of the event. Among several works devoted to the Choquet integral, we recall, e.g., Wang and Klir A992), Den- neberg A994), Pap A995). For the applications of the Choquet integral, we mention On- isawa et al. A986), Tanaka and Sugeno A991), Murofushi and Sugeno A993), Sugeno and Kwon A995), Grabisch A996). A recent overview of state-of-art in this domain is contained in Grabisch, Murofushi and Sugeno B000). An alternative approach to the Choquet integral was developed by Sipos A979) who considered all real functions, not only the non-negative ones. There are two possible
1332 P. Benvenuti el a]. extensions of the original Choquet integral to the class of all measurable real functions, namely the asymmetric and the symmetric integral: see Sipos A979), Denneberg A994), Pap A995). The asymmetric one is comonotone additive and positively homogeneous; the other is completely homogeneous (with respect to the multiplication by a real number, positive or negative) but it is neither additive nor comonotone additive. Another fuzzy integral important for the applications of fuzzy set theory in many domains was introduced by Sugeno A974). The Sugeno integral (in its original form) deals with functions whose range is contained in [0. 1], i.e., with fuzzy sets, and with normed fuzzy measures. In several publications, this integral is simply called "fuzzy integral". It is comonotone maxitive, i.e., pseudo-additive for comonotone functions with respect to the operation ?: ? ? у = sup{.v, у}. It is л-homogeneous, too, i.e., pseudo- homogeneous with respect to the operation л: х л у = inf{.v. у]. Similarly as in the case of the Choquet integral, also the Sugeno integral gives exactly the value ? (A) when applied to the characteristic function 1 д. Some generalized integrals have been introduced with respect to a general fuzzy measure with properties similar to those of Choquet or Sugeno integrals. We recall the Ralescu and Adams A980) generalization of the original Sugeno integral, the Murofushi and Sugeno integral A991) based on r-conorm, the Choquet-like integrals of Mesiar A995). The similarity between the Choquet and the Sugeno integrals was observed by several authors (Suarez and Gil A986), de Campos, Lamata and Moral A991), Murofushi and Sugeno A991), de Campos and Bolanos A993), Mesiar A995), Benvenuti and Vivona A996a, 1996b)). Taking into account this similarity, Benvenuti and Vivona built a general fuzzy integral based on two binary operations ? (pseudo-addition) and © (pseudo- multiplication), similar to those introduced by de Campos and Bolanos in the finite spaces. We recall furthermore the Pan-integral of Yang A985) (see also Wang and Klir A992)), the multi-linear integral of Matsushita and Kambara A996), but we will not consider them in this chapter since they do not have the characteristic properties of Choquet and Sugeno integrals; namely, no type of additivity can be expected for these and the output for the characteristic function \A is not ? (A), indeed it depends on the set A, not merely by its measure but by the values of the measure on the family of the subsets of A. Another interesting class of fuzzy integrals was introduced by Imaoka A997). Based on a ?-additive atom-less measure on the Borel subsets of the unit square and constructed by means of a copula (Schweizer and Sklar A983), Nelsen A999)), they include the Choquet and the Sugeno integrals, are monotone and, when applied to a characteristic function 1д, they give ? (A). A generalization of Imaoka's approach was recently proposed by Klement, Mesiar and Pap B001). However, Imaoka's integrals do not possess in general any kind of additivity, so we will not consider them, too. In this chapter, using the procedure given by Benvenuti and Vivona, we will introduce a class of general fuzzy integrals with respect to a general fuzzy measure. A fuzzy integral is associated to any couple of fitting operations ? and ©. We request the properties which characterize both the Choquet and the Sugeno integrals. These integrals, the integral of Ralescu and Adams and the one of Weber A986) are particular cases in the class introduced in this chapter. As a matter of fact, in the case when ? = + and © — · our approach results just the Choquet integral (on the interval [0, oo]); in the case ? = ? and © = л working on the unit interval [0, 1 ] we get the original Sugeno integral, while in the case of the
Monotone set functions-based integrals 1333 interval [0, +00] the fuzzy integral of Ralescu and Adams is obtained. Similarly, the case in which ? = ? and © is any strict r-norm on [0, 1] leads to the integral proposed by Weber. Note that if the considered fuzzy measure is finitely ©-additive then our integral is a special case of the Sion integration method introduced in A973). However, the possible lack of ?-additivity by a general fuzzy measure puts our integral out of the Sion concept. In general, fuzzy integrals can be viewed as specific aggregation operators: see Bouchon- Meunier A998) and Benvenuti and Vivona B000). Some aggregation operators (with finite or infinite number of inputs) are represented by integrals on a suitable fuzzy measure space; the fuzzy measure is a tool in this representation; its values come from an interval [0, ?] which need not be linked with the interval [0, F] related to the inputs. To keep this generality, we even abandon the request of reconstruction of the original fuzzy measure from the respective integral. The exposition is organized as follows. In Section 2, the properties of the Choquet and the Sugeno integrals are recalled, discussed and compared. Section 3 is devoted to an appropriate choice of the binary operations ? and 0. These operations strongly influence the properties of the resulting integrals. We assume for the pseudo-multiplication О only the essential properties. Indeed, the additional properties of ? result in the additional properties of the corresponding fuzzy integral, e.g., the associativity of ? results in the ©-homogeneity of the integral and the reconstruction of the original fuzzy measure from its related integral is linked with the existence of a left-neutral element for the pseudo- multiplication which we should work with. The construction of the general fuzzy integral is done in Section 4. Finally, in Section 5 we give several examples of operations ? and ©, with the corresponding fuzzy integrals. As an informative overview of recent results in the fuzzy measures and fuzzy integrals theory and application we recommend to the interested readers the volume of Grabisch, Murofushi and Sugeno B000). 2. Choquet and Sugeno integrals 2.1. Preliminaries Let ? be an abstract space, A a ?-algebra of subsets of ?. Let ? : А -> Ш = [0, oo] be a fuzzy measure, i.e., a non-decreasing set function with ?@) = 0. The triple (?, ?, ?) is called a fuzzy measure space. No assumption is made in general about the continuity of ?; we say that ? is continuous from below if: C„ e A C„ /С => м(С„) / м(С). We recall that a fuzzy measure is additive (is a valuation on A) if ?(???)+?(??\?) = ?(?)+?(?) VA,SeA
1334 P. Benvenuti et al. Moreover, we say that ? is maxitive (v-additive) if ?(???)=?(?)??(?) ??,??? To any function f :? -> R = [-00,+00], the family Cf =: {Cf(x): ? e R) U {CUjc): jc e R) is associated, with C/(jc) = {? e ?: /(?) > jc) (= {/ > *} in short) and С'Лх) = {? e ? | /(?) ^ ?] (=[f ^ ?] in short). The family C/ is a chain, i.e., it is completely ordered with respect to the set inclusion. We call Cf the chain associated to f. In order to examine the additivity or the maxitivity of the integrals it is useful to consider the concept of comonotone (common monotone) functions (see Denneberg A994)). Every function/ : ? -> R introduces on ? a semi-order relation: ?\ <f coi «=> f{cu\) < f{o>i). The comparison between the two semi-order relations, introduced by two functions / and g, suggests a binary relation ~ in Rn which is symmetric and reflexive but not transitive. Definition 2.1. / and g are comonotone (/ ~ g) iff the two corresponding semi- orders are non-contradictory, i.e., there exists no pair ?\, ??_ \? ? such that ?>? </ <?? and a>2 <g <W|. PROPOSITION 2.2. Given two functions f and g in Rn, let Cf and Cg be the two chains associated to f and g, respectively, and C/, the chain associated to the function h = f +g; the following conditions are equivalent (see Dennenberg A994)): (i) f~g\ (ii) CfUCg=Ch; (iii) there exist two continuous non-decreasing functions ?/ and ?? from R tо R such that: <Pf(z) + <pg(z) = z, f(w) = <pf{h(w)), ?(?) = (^(/?(?)). 2.2. Basic and simple functions We shall denote with ?(?), or shortly T, the set of all measurable (Д-measurable) functions from ? to R+: that means that the chain Cf associated to / e ?(?) is a subset of ?. This definition of measurability coincides with the classical one, because Л is a ? -algebra. For any a e R+ and any A e ?, the function b(a, A) defined by: fa ifa>eA, 0 if ? g A, is called basic (simple) function. A measurable mapping s: ? -> R+ is called a simple function if its range is finite. We shall denote by ?(?), or shortly S, the set of all simple functions from ? to R+. The simple functions have several representations by means of basic functions. The most
Monotone set functions-based integrals 1335 known representation from the classical measure theory is based on the notion of the finite partition of ?. To any simple function s, with range {a\, ai, ¦ ¦ ¦, a,,}, a finite partition {A|, ?2,..., A,,} is associated, where A,¦ = {? e ? | /(?) — ?,}. The function s admits the representations: ? « i = J]fo(flf,A/)s\/fo(fli,Ai). B.1) 1=1 1=1 The possible null term in B.1) can be omitted, and by assuming: 0 < ? ? < 02 < ¦ ¦ ¦ < a„, B.2) so we obtain univocally the classical standard representation, with minimal number of used basic functions. The family S of simple functions is fundamental in the construction of an integral because it is dense in the class ? of measurable functions. As a matter of fact, any measurable function is the limit of a non-decreasing sequence of simple functions; therefore any functional defined on ?, which is continuous from below, is completely determined by its values on S. The Lebesgue integral of a simple function s is defined taking its classical representation B.1) and putting: I sdm— II y^fo(a,-, A,) J dm = y~]q,w(A,·). The structure of the classical measure space (the ?-additivity of the measure) allows also the use of any additive representation by means of basic functions, in order to express univocally the integral as similar sum. The lack of additivity in the case of fuzzy measures requires another type of representation of simple functions. In order to express the comonotone additivity of the Choquet integral and the comonotone maxitivity of the Sugeno integral, it is convenient to introduce a representation by means of a system of comonotone basic functions. PROPOSITION 2.3. Any simple function s admits a comonotone additive step representation: s = Yib(ci,Ci) B.3) 1 = 1 and a comonotone maxitive step representation n s = \/b(a,,C,) B.4) 1 = 1 with C\ DC2D -DC,, D 0 (strictly).
1336 P. Bemenuti et al. PROOF. We assume: C, = U"=/A/ and from B1^ and B·2) we obtain directly the expression B.4). We obtain easily the expression B.3) from B.1) taking: c\ = a\, ст = a2-a\,...,c„=a,,-a„-\. ? The representations B.3) and B.4) (again omitting the possible null term) are minimal for the number of basic functions. They are characterized by common conditions С ? D C2 D ¦ ¦ О C„ D 0, and {d} С С,. Note that Cs = {?, C,. C2, ¦ ¦ ¦, C„, 0}. 2.3. Choquet integral We recall the definition of the Choquet integral and its fundamental properties. For more details see Denneberg( 1994) and Pap A995). Definition 2.4. The Choquet integral is the functional ?(?) -> M+ defined in this way: S/??=: f p(Cf(x))dx. B.5) The function ? -+ ?(? f(x)) is non-increasing, so the integral is always meaningful in the sense of Cauchy-Riemann. The fundamental properties of the Choquet integral are the following: (CH. 1) BASIC VALUES. For any basic simple function it is: b{c,C)d? = c¦?{C) VceE+. VCeA B.6) with convention 0 · oo = oo · 0 = 0. / PROOF. Equation B.6) is immediate from B.5) because if / = b(c, C), then we have Cf(x) = С for O^x <candCf(x) = tf, for ? ^ с ? In particular, for с = 1, the basic function coincides with the characteristic function 1 с of the set С and the Choquet integral reconstructs the given fuzzy measure: / \€??=?@ VCeA (CH. 2) MONOTONICITY. fUg => ? /??^? 8??.
Monotone set functions-based integrals 1337 PROOF. The property is immediate consequence of the monotonicity of ?. Indeed, if / ^ g, then it is Cf(x) с Cg(x) and so m(C/(*)) ^ ц.(Сх(х)) for all xeR+. ? (Ch. 3) Homogeneity. ? (с-/Lд = с· ? f ?? VceE*. PROOF. For с = 0 the relation is obvious. Suppose с > 0; from C-/U) = C/(x/c), we obtain: ?(?/)??=? ii{Cf{xlc))dx=c- ? ?(€ f(y))dy=c ¦ j /??. (CH. 4) COMONOTONE ADDITIVITY. f~g=* f (/+8)?? = { fdp + f 8??. B.7) PROOF. First we consider the function h = f + g and the two functions ?/ and ?? given by (iii) of Proposition 2.2. We define a pseudo-inverse of ?( putting: ?)(?) = sup{f: <pf(t) ^ x] = inf{f: <p/@ > x]. So we have: (Pf(y)>x<=*yxp*(x) and f(cu) = (pf(h(w))>xt=$h(co)>(pf(x). From these equivalences and the relation /(?) = <p/ (h(co)) we obtain ? /??= [ vi{Cf(x))dx = ? ?{?„{?}(?)))?? Starting from the Lebesgue measure /, we introduce on the Borel sets of E+ the measure mf defined by mf(B) = 1({?: ?*(?) e B}). This is exactly the Lebesgue- Stieltjes measure induced by the function ?/ because: mf([a, b[) = ?([?/(?), ?/?)[) = ?,?) - ?,(?). By means of the substitution у = ?* (?) (see, for instance, Halmos A950, Sec. 39)) we obtain: / ???= ? ?(?,(>0)^
1338 P. Beiwenuti el al. Equally, using the function <pg we have: where mg is the Lebesgue-Stieltjes measure induced by ??. Now, by Proposition 2.2(iii) we get <pf(y) +<pg(y) = y- So we obtain mf+m4=l and finally /¦00 /-X /-ЭС /· / ?(?,?(?))???/+ n(Ch(y))dmg= / ^{Ch(y))dy = it hdii. D (Ch. 5) Horizontal additivity. Given a measurable function f and с е R+, we obtain a horizontal additive decomposition by cutting the function f at the level с and putting 0 iff(?) <Cc, /(?) - с if /(?) > с. / = (/лс) + /+, withft+(co) = V For any horizontal additive decomposition the Choquet integral is additive: j /?? = {(/ ??)??+? f+??. B.8) PROOF. The functions (/ л с) and /c+ are comonotone; B.8) is a special case of como- notone additivity. ? (CH. 6) CONTINUITY FROM BELOW. If the fuzzy measure ? is continuous from below, then the Choquet integral is also continuous from below, i.e., for any non-decreasing sequence {/„}„ем it is: f (sup f„J ?? = sup ? f„ ??. PROOF. If we assume that the fuzzy measure ? is continuous from below, then we obtain the continuity from below of the Choquet integral from the well known theorem of monotone convergence for the Cauchy-Riemann integral. In fact, if /,, / f pointwise, then Cf„{x) / Cf(x) and by continuity of ? we obtain ?(?/?(?)) /д(С/(х)), too. D THEOREM 2.5. If ? is ? ?-additive measure, the Choquet integral coincides with the Lebesgue integral: ????? fan. ?
Monotone set functions-based integrals 1339 PROOF. Let s be a simple function, from the comonotone step representation B.3) we see that С,(дс): С ? if 0 ^ ? < a\, C, if a,¦ _ ? ^ ? < at, 0 if a„ ^ jc. So, by the definition B.5) we obtain, for any simple function: / sd? =а|Д(С|) + (а2 - ai)M(Ci) + (аз -а?)м(Сз)Н + (а„ -а„_|)м(С„). Now, С„ = А„ and, as the fuzzy measure ? is ?-additive, then ?@ — m(C,-+i ) = ?(?,). So we obtain f ,?<*? = ]?4?(?,) ^ i=l This is as well the value of the Lebesgue integral of the same function s. The identity of the Choquet integral and Lebesgue integrals on ?(?) implies the identity on the entire family ?(?) because both integrals are continuous from below. ? Note also that the additivity of ? is enough to ensure the additivity of the corresponding Choquet integral, see Denneberg A994). Moreover, if the fuzzy measure ? is continuous from below, then the Choquet integral of a given function / can be represented by means of the Lebesgue integral of the same function in a measure space (?,?/,??)-), which is however depending on /. THEOREM 2.6. In a fuzzy measure space (?,?,?), with ? continuous from below, the Choquet integral of any function f coincides with the Lebesgue integral in the measure space (?,A/,m f): ????=? fdmf, ? where A/ is the ? -algebra generated by С / and m / is the ? -additive measure induced on A / from the restriction of ? to the chain Cf. PROOF. If ? is continuous from below, its restriction of Cf really admits a unique extension to A/ (see Halmos A950, Sec. 13)). The Choquet integral of the function / with respect to ? is equal to the Choquet integral of the same function with respect to this extension m/ because ? and m f are equal on Cf. By Theorem 2.5 the latter integral coincides on the family of all Af -measurable functions with the Lebesgue integral in the measure space ^,A/,mf). Obviously, / is A/ -measurable. ?
1340 P. Benvenuti el al. 2.4. Sugeno integral We recall the definition of the Sugeno integral and its fundamental properties. For more details see Pap A995). Let (?,?,?) be a fuzzy measure space with ? : A —>¦ [0, 1]; we consider the family T\ (?) of all measurable functions from ? to [0, 1]. Definition 2.7. The Sugeno integral is the functional ?\(?) -> [0, 1] defined in this way: ? ???=: \J (* ? m(C, (*))). B.9) velO.ll The fundamental properties of the Sugeno integral are the following: (Su. 1) Basic values. / b(c,C)d? = cл?(C) Vc e [0. 1]. VC e A. PROOF. If / = b(c, C), then we have C/(x) = С for 0 ^ ? < с and Cf(x) = ttforx^c. Therefore from definition we obtain: / b(c,C)d?= \J (jc ??(?) =слд(С). ? velO.cl The value 1 is the unit element for the operation л in the interval [0, 1]. The Sugeno integral of the characteristic functions gives back the fuzzy measure: / \???=?@ VCeA. (SU. 2) MONOTONICITY. /^?=> f /??? $ gd?. i /??? i PROOF. If/<g,thenitisC/(Jc)CCs(Jc)andsoM(C/-(Jc))^M(C1?(Jc))foralU e [0, 1[. This inequality involves by means of B.9) the inequality between the two integrals. D (SU. 3) Л-HOMOGENEITY. i (??/)??=??<& f ?? Vc e]0, 1].
Monotone set functions-based integrals 1341 PROOF. We have: Ссл/(х) = С/(?) for* < с and Ссл1(х) = 0 for ? ^ с. So we obtain: У дслд(С?.л/(дс))= V дслд(С/(дс)) = сл( V дг л ? (С/Ч*)) V .telO.I] леЮ.<1 VvelO.ll ' D (Su. 4) Comonotone maxitivity. /~g => i (fvg)dv = f /???? 8??. B.10) PROOF. If / ~ g, by Proposition 2.2 the two sets C/(x) and Cg(x) belong to the same chain and therefore it is C/(x) => Ся(х) or Ск(л) 2 С/Ч*) for every * e [0, 1[. In both cases /i(C/VJf(*)) = M(C/(*) U Q(*)) = д(С/(дс)) ? м((Сч(*)). The equality B.10) follows easily from the definition B.9) and from the algebraic properties of operations ? and л. ? (Su. 5) Horizontal maxitivity. Given a measurable function f e ?\(?) and с е ]0, 1[, we obtain a horizontal maxitive decomposition by cutting the function f at the level с and putting: ? ...,, ,V/ . .0 iffW^c, / = (/Ac)v/(v, withf?(co) = /(?) iff(w)>c For any horizontal maxitive decomposition the Sugeno integral is maxitive: |/^ = |(/ac№v|/>. B.11) PROOF. The functions (/ л с) and fv are comonotone; B.11) is a special case of comonotone maxitivity. ? (Su. 6) CONTINUITY FROM BELOW. If the fuzzy measure ? is continuous from below, then the Sugeno integral is continuous from below, too, i.e., for any monotone non- decreasing sequence {/„}«eN: i isup/„j<iM = supi ?„??. PROOF. If /„ / / pointwise, then it is С/¦„(*) / Cf(x). If we assume that the fuzzy measure is continuous from below, then we have ? л д(С/„(*)) У х л д(С/(*)). ?
1342 P. Benvenuti et al. 2.5. Similarity ofChoquet and Sugeno integrals The properties (Ch.) characterize univocally the Choquet integral. The properties (Su.) are exactly the same as (Ch.) except for replacement of operations + and · with the operations ? and л, respectively. Likewise, they characterize univocally the Sugeno integral. Many characterization theorems are known in the literature. We expose here two theorems, which are formally identical under replacement of the operations + and · in R+ with the operations ? and л in [0, 1 ]. We assume that a fuzzy measure space (?, ?, ?) is given, with ? : ? -> Ш+ continuous from below. THEOREM 2.8. The Choquet integral is the only functional Lch'-F(H) -> E+ which verifies the properties (Ch. 1, 5, 6). PROOF. Let Lch'-F(H) -> K+ be a functional which satisfies the properties (Ch. 1, 5, 6). First we consider a simple function s e <S(i2) and its comonotone additive representation B.3); this representation is obtained by means of ? - 1 repeated horizontal additive decompositions. From the properties (Ch. 1, 5), which are supposed to be true for the functional Lch and are true for the Choquet integral, we obtain for both the same value ons: Lch(s) = ?^ ·?@= ? .?^?. ?=? J The functional Lch and the Choquet integral are identical on the class <S(i2) of simple functions; both are monotone in this class, which is dense in ?(?). By the property (Ch. 6) of continuity from below, which is assumed for the functional Lch and holds for the Choquet integral, the functional Lch and the Choquet integral are identical on the whole ?(?). As a matter of fact, any function / e ?(?) is the limit of a non-decreasing sequence of simple functions: s„ / /; from the property (Ch. 6) we obtain: Lch(f)= Hm LCh(s„)= lim f s„d?= ? f ??. ?—>эс ?—юс J J a If the fuzzy measure is not continuous from below, then the Choquet integral results into a functional in general not continuous from below. Nevertheless the remaining properties (Ch. 1, 2, 3, 4, 5) need not characterize univocally the Choquet integral.1 To keep the universality of properties characterizing the Choquet integral we will deal with fuzzy measures continuous from below. 'Take, e.g., ? defined by: ?@) = 0, ?(?) = ос, ?(?) = 1 if ? ? 0. ?. Then the Choquet integral gives: ???? = sup/, if inf/ = 0, $f ?? = ос otherwise. Let L be the functional defined by: L(f) = sup/, if /"' @) / 0, L(f) = oc else. The functional L fulfills, as well, the properties (Ch. 1. 2. 3. 4. 5). but it is different from the Choquet integral if the space ? is not finite.
Monotone set functions-based integrals 1343 THEOREM 2.9. The Sugeno integral is the only functional Lsu'-^\(^) -> [0. ? which verifies the properties (Su. 1, 5, 6). PROOF. The proof is exactly the same as in the previous theorem, except for replacement of the operations +, · with the operations v, л and the use of the comonotone maxitive step representation B.4) instead of the comonotone additive step representation B.3). D The property of monotonicity for both Choquet and Sugeno integrals is involved by (Ch. 1, 4) or (Su. 1, 4), respectively. The stronger properties of comonotone additivity or of comonotone maxitivity are involved by the weaker properties of horizontal additivity or of horizontal maxitivity in the context of the functionals which verify the axioms (Ch. 1, 5) or (Su. 1, 5), respectively. The above theorem shows the way in order to give a common definition of Choquet and Sugeno integrals. Moreover, it suggests a unique procedure for building up a general integral by means of two pseudo-operations ? (pseudo-addition) and ? (pseudo- multiplication). This integral uses a general fuzzy measure and includes the Choquet and the Sugeno integrals as particular cases. 3. Pseudo-additions and pseudo-multiplications In the previous section, we have seen that the Choquet integral is strongly related to the usual arithmetic operations of addition + and multiplication · on the interval [0, oo]. Similarly, the Sugeno integral depends on the operations ? and л on the interval [0, 1]. In order to find a common framework for both Choquet and Sugeno integrals, we have to deal with a general pseudo-addition ? and a general pseudo-multiplication ? which must be fitting to each other. The algebraic properties of functionals taken into account in the integration procedures result to the corresponding properties of the operations ? and ? which we have to work with. In general, ? is supposed to be a continuous generalized triangular conorm (see Sugeno and Murofushi A987), Pap A990, 1995), Wang and Klir A992), Grabisch, Murofushi and Sugeno A992), Mesiar A995), Benvenuti and Vivona A996a, 1996b), Benvenuti and Mesiar B000a, 2000b)). 3.1. Pseudo-addition ? Definition 3.1. Let F e]0, oo]; a binary operation ? : [0, F]2 -> [0, F] is called a pseudo-addition on [0, F] if the following properties are satisfied: (Al) a® b = b®a (commutativity) (A2) a ^ a', fo < b' =>· ? ? b ^ ?' ? b' (monotonicity) (A3) (? ? b) ? с = ? ? (b ? с) (associativity) (A4) ??0 = 0?? = ? (neutralelement) (A5) a„ -> a, bn -> b =>¦ a„ ? b„ -> ? ? b (continuity). The axioms (A2)-(A5) ensure that a pseudo-addition ? on [0, F] forms an /-semigroup of Mostert and Shields A957). The general form of the pseudo-addition is commutative.
1344 P. Benvenuti el al. So, the commutativity follows from the remaining axioms; we have included also it only for the sake of completeness. In the following proposition we recall the general structure of the operation ? in an /-semigroup; for more details the reader can see Mostert and Shields A957), and the volume of Klement, Mesiar and Pap B000). We note preliminarily that, by virtue of the continuity of the operation ?, the set of ©-idempotent elements C® = {a e [0, F]: a® a = a] is closed. It is never empty because 0, F e C®. So, the complement Ce = {a e [0, F]: ? © a > a} is the union of a discrete system (countable, finite or, in case, empty) of disjoint open intervals: Ce=(J]a*-&[. keK PROPOSITION 3.2. Let ? be a pseudo-addition on the interval [0, F]. There exists a system of disjoint intervals ]<**, Д: [, k e К, К finite or countable, and an associatedfamily of continuous strictly increasing mappings gk : [otk,Pk] -* [0, oo] with gk(ak) = 0 such that: a®b= (**~'((**(?)+**(&)) л g*(ft)) if]a,b[e]ak,Pk[2, (ЗЛ) lavi) otherwise. Each function gk is determined uniquely up to a multiplicative positive constant. For the closed set C® the two extreme cases are possible: C® = {0, F) C$ = ]0, F[ and C® = [0, F], C^ = 0. In the first case the pseudo-addition assumes one of the following forms: Example 3.1. a(Bb = g-](g(a)+g(b)), C.2) being g : [0, F] -> [0, oo] an increasing bijection. Example 3.2. a®b = g-]((g(a)+g(b))Al), C.3) being g : [0, F] -» [0,1 ] an increasing bijection. The form C.2) shows an isomorphism with the usual addition on [0, oo]; this is obtained in particular when g is the identity map. The form C.3) shows an isomorphism with the truncated addition on [0, 1]. In the second case the pseudo-addition is the idempotent operation v: Example 3.3. a®b = avb. C.4)
Monotone set functions-based integrals 1345 3.2. Pseudo-difference © For a given b e [0, F] the function ? н> fo ? ? need not to be strictly monotone. The equation b®x = a, with a^ b has always a solution, but the solution need not be unique. So, the pseudo-difference is not always univocally defined. We shall use, together with a given pseudo-addition ?, the following (minimal) pseudo-difference, compare also Weber A983). Dehnition 3.3. The pseudo-difference is the binary operation ©:[0, F] ? [0, F] -> [0, F] defined by: aQb = inf{x e[0, F]: b®x ^ b}. C.5) The pseudo-difference is defined on the whole square [0. F] ? [0, F], not only if a > b. When a < b, then ? ? b = 0, and when a > b, then ? ? b > 0. More, when a ^ b, then fo© (fl9i>) = fl. 3.3. Pseudo-multiplication © We will build an integral for the Д-measurable functions (with values in the interval [0, F]) on a fuzzy measure space (?,?, ?) with ?(?) — ? @ < ? < ??). The integral will be a functional with some basic properties connected, as well, with the given pseudo- addition ©. Any integration procedure uses another binary operation ©, which is called pseudo-multiplication. The expected properties of the integral determine the next minimal properties of © which we have to require. Our approach assumes that the integral of any basic simple function b(a, A) depends on the event A only by means of its measure ?(?): r® / b(a,A)?d? = I(a,?(A)). This formula should be valid for any a e [0, F], A e ? and for any fuzzy measure ? : ? -> [0, ?], with ? = ?(?). Therefore the function / defines the so called pseudo- multiplication ©: [0, F] ? [0, M] -> [0, F] as aQb=:I(a,b). C.6) We expect that the integration of the zero function results to zero. The zero function can be expressed either as fo@, A) for arbitrary A e ? or as b(a, 0) for arbitrary a e [0, F]. Recall that ?@) = 0 and thus it should be 0©?(?) = ?©0 = 0. C.7) The integral is supposed to be monotone, comonotone ?-additive and continuous from below whenever the underlying fuzzy measure ? is continuous from below. All
1346 P. Benvenuti el al. above mentioned expected properties, when applied merely to the basic simple functions, determine the minimal assumptions on ? which we have to require and we are going to state. Let ?? < a2 and A\ С Ai\ it is ?{?\) < ?(??) and b(a\. A\) ^ b(ai, Ai). For the monotonicity of the integral we must have ????(?|)<;?2??(?2). C.8) For any ??, 02, A, the two functions b(a\, A)andfo(a2, A) arecomonotoneandfo(fii|, ?) ? b(a2, A) = b{a\ ? аг, A). For the comonotone ©-additivity of the integral we must have: (?? ? ?2) ? ?(?) = (?, ? ?(?)) ? (?2 ? ?(?)). C.9) Finally, let ?, / a and ?, / A, then fo(a,. A,) / b(a, A), and, if the fuzzy measure is continuous from below, it is also ?(?,) / ?(?). In order to have the continuity from below of the integral, the following must hold: ?,??(?,)/???(?). C.10) The formulae C.7)—C.10) should be valid for any ?,, for any A, and for any fuzzy measure, i.e., for a, e [0, F] and ?{?,) e [0, M], as ? =?(?). Therefore for a correct building up of a general fuzzy integral the monotonicity, the left distributivity, the left continuity of the pseudo-multiplication © are unavoidable. Dehnition 3.4. Let ? be a given pseudo-addition on [0, F] and let ? e]0, oo]. A binary operation ©: [0, F] ? [0, ?] -» [0, F] is called a ?-fitting pseudo-multiplication if the following properties are satisfied: (Ml) aQ0 = 0Qb = 0 (zero element) (M2) a^a\b^b' =>· a © b ^ a © b' (monotonicity) (M3) (a®b)Oc = aOc®bQc (left distributivity) (M4) (sup,, an) © (sup,„ b,„) = sup,, ,„ a„ © b„, (left continuity). Note that we will follow the usual convention to the priority of operations, i.e., a © с ? b © с means (a © c) ? (b © c). The left distributivity (M3) corresponds to the Cauchy equation in the I-semigroup ([0, F], ?) for all functions fh(x) = xQb,be[0, M\. fb(x®y) = fb(x)®fb(y), x,ye[0,F]. C.11) Consequently we have the following characterization of all ?-fitting pseudo-multiplications. THEOREM 3.5. The binary operation ©: [0, F] ? [0, ?] -> [0, F], defined by cQb = fh(c)
Monotone set functions-based integrals 1347 is a ®-fitting pseudo-multiplication if and only if (f, | b e [О, М}) is a non-decreasing system of left-continuous solutions of the Cauchy equation C.11) such that /b@)=0, /o(c) = 0, Mc) = sup f,(c) a<b for all с e [0, F] and all b e ]0, ?]. The class of solutions of the generalized Cauchy equation is closed with respect to the concerned pseudo-addition (see Benvenuti, Vivona and Divari B002), Proposition 2.1). So the class of ?-fitting pseudo-multiplications is, as well, closed with respect to the operation ?. COROLLARY 3.6. Let Q\ and ©2 be two pseudo-multiplications: [0, F] ? [0, ?] -> [0, F]. If ? ? and ©2 are (B-fitting, then also the operation ?* = (??) ? (©2) given by: a ©* b = (a ©| b) ? (a ©2 b) is a ?-fitting pseudo-multiplication. PROOF. From the two systems (/1./, | b e [0, ?]) and (fu, I b e [0, ?]), associated to the two operations ©| and ©2, we obtain the system (/,* = f\j, ? fi.b I b e [0, M]) which is a non-decreasing system of left-continuous solutions of the Cauchy equation. The pseudo- multiplication ©* is associated to this system. ? However, the structure of all solutions of the generalized Cauchy equation was recently characterized in Benvenuti, Vivona and Divari B002). So, the above theorem allows to describe explicitly the ?-fitting pseudo-multiplications in several specific cases. Example 3.4. If the pseudo-addition has the form C.2), then each left-continuous solution of the Cauchy equation C.11) has the form: /(*) = ?"'(? ·?(*)), ??[0,??], C.12) with the convention 0 · 00 = 00 · 0 = 0. The null solution of the Cauchy equation is obtained in particular for ? = 0. The system (//,) defines a left-continuous non-decreasing map ?: [0, ? ] -> [0, oo] with ?@) = 0. So, in this first example the general form of the pseudo- multiplication is aOb = g-](g(a)-i(b)). C.13) Example 3.5. If the pseudo-addition has the form C.3), then each left-continuous solution of the Cauchy equation C.11) has the form: /W = «"'((^«W)a1), ??{0}?[1,??]. C.14) The system (//,) defines a left-continuous non-decreasing map ?: [0, ?] -> {0} U [ 1, oo]. So, there is a constant bo e [0, M] such that k(b) = 0 (and then //, = 0) whenever b < bo
1348 P. Benvenuti el al. and k(b) ^ 1 whenever b > Ь$. Hence, in this second example the general form of the pseudo-multiplication is ??& = *_?((*(?)·?(&))??). C.15) Example 3.6. For the pseudo-addition ? = ? the left distributivity and the right one follow from the monotonicity of 0. Therefore, any binary operation which satisfies (Ml), (M2) and (M4) is fitting with the pseudo-addition v. Right unit. If the integral is assumed to represent an aggregation operator (Benvenuti and Vivona B000)) then the property of idempotence is requested: the integral of any function constant on the whole ? must give the value of the function. From C.3) we obtain /? /-? с ? ?? = / b(c, ?)??? = ?? ?(?). So, ? = ?{?) must be a right unit for the pseudo-multiplication: (M5) aQ ? = a (right unit element). Nevertheless the idempotence is not necessary in other applications of the integral; so in general we do not require the condition (M5) in the definition of ?-fitting pseudo- multiplication. Example 3.7. In the form C.13) the condition (M5) is equivalent to ?(?) = 1. So the form of the pseudo-multiplication verifies (M5) with any non-decreasing and left- continuous function ?: [0, ?] -> [0, 1 ] with ?@) = 0 and ?(?) = 1. In the form C.15) the condition (M5) is still equivalent to ?(?) = 1 but in this case the condition is very strong because the function ? can assume only the values 0 and 1. The pseudo-multiplication is determined up to the value of the parameter bo e [0, ? [. It is ? ? b = 0 for b ^ bo and a © b = a for b > bo- 3.4. Additional conditions with pseudo-ring structures We assume from now on that the interval [0, M] (values of the fuzzy measure) coincides with the interval [0, F] (values of the functions). This is frequent in the applications except when the integrals represent special aggregation operators (Benvenuti and Vivona B000)). If the binary operation © is defined on the square [0, F]2, as well as the operation ?, the two operations give to the interval [0, F] the structure of a pseudo-ring. In this case, some other properties are meaningful for the integral and some additional conditions must be requested to the pseudo-multiplication for their fulfillment.
Monotone set functions-based integrals 1349 Left unit. Let ?: [0, F] ? [0, F] -> [0, F] be a pseudo-multiplication with left unit u: и ? b = b for all b e [0, F]. The basic simple function b(u. A) = LI? is called the pseudo- characteristic function of the event A. Now, from C.3) we have / иАОац. = иО ?(?) = ?(?). If we expect that integration of a pseudo-characteristic function will result in the underlying fuzzy measure ?, then the existence of a left unit element и for the pseudo-multiplication ? is needed. (M6) There exists и e ]0, F] such that uQ b = b (left unit element). PROPOSITION 3.7. The only pseudo-addition ?, which admits a ®-fitting pseudo- multiplication with left unit и which is ®-idempotent, is ? = v. PROOF. Let и be ?-idempotent and as well left unit for a ?-fitting pseudo multiplication. Then we have: a®a = uQa®uQa = {u®u)Qa = uQa = a for any a e [0, F]. Consequently ? = v. ? Any binary operation non-decreasing and left continuous in both variables, with ? ? 0 = 0 ? b = 0, is fitting with the pseudo-addition v. The equation uQb = b may be true for all b e [0, F] and и running in some interval. Therefore, in a v-fitting pseudo-multiplication the uniqueness of the left unit can be missing. PROPOSITION 3.8. If ? ? ? and ? is a (B-fitting pseudo-multiplication with left unit u, then (with notations of Proposition 3.2) there exists к so that: и e ]<**, Д-[, gkWk) = oo, and the pseudo-multiplication is univocally determined on the strip ]ak, ??<] x [0, F] by: aeb=\ 8^((8k(a)gi,(b)/gk(u)) л gh(M) ifb e ]<*„, ?„[, he К, \b ifbeC®. C.16) PROOF. From Proposition 3.7 the left unit и is not ?-idempotent; so, there exists к such that и e ]?*,&[. If fo e Ce, from и О b = b and Lemma 4.6 of Benvenuti, Vivona and Divari B002), we deduce a Q b = f,,(a) = f,,(u) = b for all a e ]ak, pk[.lf b<?C®, it is b e ]<*/,, /3/,[ for some h e K. Theorem 4.7 of the same note ensures that we have for all a e ]ak, ?? ¦ aQb= fh{a) = g^({k„ ¦ gk(a)) ? g„(fi,)) with some ?& e ]0, oo] and with limitation ?/, · #НД0 > gh(Pn) if gk(fik) < oo. Moreover, the condition uQb = b request that ?* · gk (и) = gi, (b), so we obtain C.16) for a e ]ak ,pk[. But, if gk(Pk) is finite, the limitation . ,a , gh(b)-gk(fik) gk(u)
1350 P. Benvenuti et al. cannot be fulfilled for all b e ]<**, Д [. Therefore gk(fik) must be necessarily oo. Finally, C.16) is true for a = ??-, as well, because of the left continuity of the pseudo- multiplication. D COROLLARY 3.9. 1/(&??, any (B-fitting pseudo-multiplication possesses not more that one left unit. PROOF. Suppose и е]ог*,Д-[ is a left unit of a ?-fitting pseudo-multiplication. Evidently, from C.16), no other different left unit exists in the interval ]at, ? к¦.]¦ Moreover, from C.16) we see that: inf a © b = sup{c eCe: с ^ b}, aQft = ? ?) = : inf{c e Ce: с > b]. a>ak Further, from monotonicity and left continuity (in both variables) of the pseudo-multiplication ? we obtain: ak Ob ? sup( inf a Ob') =??) =: sup{c eC®\c<b]. So, for a < otk, b g Ce, we have aQK <*(b) < & and, for a > &., fo ^ Ce. we have а О b > Д ? b = ??) > b. Therefore no other left unit different from и exist, neither in [0,a*]norin[&,F]. D Remark 3.8. The existence of a left unit element for the pseudo-multiplication ? is really a strong property. Not all pseudo-additions ? admit a fitting pseudo-multiplication with a left unit element. This is, e.g., the case of the pseudo-addition C.3). Example 3.9. The only pseudo-multiplication with left unit и fitting with the pseudo- addition C.2) is defined by: aOb = g-x{{g(a)g(b))lg(u)). C.17) The existence of the left unit element и forces the commutativity of the pseudo-multiplication given by C.13). The pseudo-multiplication C.17) doesn't verify the condition (M5). For the pseudo- multiplications fitting with the pseudo-addition C.2), the conditions (M5) and (M6) are incompatible. Example 3.10. A pseudo-multiplication with left unit, fitting with a general pseudo-addition C.1) with at least а к such that gk(fik) = oo, is obtained assuming the expression C.16) in the strip ]<**, Д] ? [0, F] and, e.g., aOb = 0 if0<^a^oik, aOb = F if/3*<a<F. This is the minimal pseudo-multiplication with left unit и e ]«;·, /3*[.
Monotone set functions-based integrals 1351 An other example is the maximal form of the ?-fitting pseudo-multiplication with left unit u: in association of the expression C.16), Corollary 3.9 and ?0? = ?0?) = ?, we assume for b > 0: aQb = a(b) if0<a^ak, aQb=F if ft <a^F. Associativity. The associativity of the pseudo-multiplication is not required in general but we must assume this property if we want the ©-homogeneity of the integral. r® r® / (cOf) ??? = ?? f Q ??, с е [0, F]. As a matter of fact, the ©-homogeneity restricted only on basic simple functions implies the following equality: (M7) (a © b) © с = a © (b © c) (associativity). Right distributivity. As well the right distributivity of the pseudo-multiplication is not required in general. Nevertheless, it must be required if we want the integral to be ?-additive when the measure is ?-additive. In fact, for А П В = 0, if ?(? U ?) = ?(?)??(?) we get: /? p® (b(a,A)®b(a,B))Od?= / b(a, A U ?) ??? = a © (? (?) ®?{?)). If we request the ?-additivity of the integral, then it must be: /? /-? r® (?>(?,?)??>(?,?))?<*?= / b(a,A)?d?® b(a,B)Od?. So the ?-additivity of integral is possible only if the following equality holds: (M8) aO(b®c) = aOb®aQc (right distributivity). Commutativity. Commutativity of © is not required, in general; furthermore this property does not correspond to any meaningful property of the integral. However, in some special cases, the commutativity may be required, or may be forced by other required properties: (M9) ? ? b = b © a (commutativity). Some authors - Pap A990, 1995), Wang and Klir A992), Mesiar A995), Mesiar and Rybarik A995), Benvenuti and Vivona A996a, 1996b) - have considered the sometimes so-called pan-operations. These operations are a general pseudo-addition ? satisfying the properties (A1-A5) and a ?-fitting pseudo-multiplication satisfying all properties (M1-M9), i.e., all properties seen above. Further, commutative, associative, monotone binary operations © with neutral element и е ]0, F[ are proper generalizations of the uninorms recently introduced by Klement,
1352 P. Bemenuli el al. Mesiar and Pap A996), Yager and Rybalov A996). Therefore, left-continuous uninorms (see De Baets A998, 1999)) are appropriate candidates for pseudo-multiplication when ? = v. On the other hand, if и = F and ? is commutative and associative, ? should be a left-continuous triangular norm (see Schweizer and Sklar A983), Klement, Mesiar and Pap B000)). Remark 3.11. To keep the maximum of generality we assume for the pseudo- multiplication, in the next section, only the properties (M1-M4). These are sufficient to define and build univocally an integral for the Д-measurable functions in a general fuzzy measure space with properties similar to those of Choquet and Sugeno integrals. 4. General fuzzy integral The aim of this section is to define a general fuzzy integral on an arbitrary fuzzy measure space (?, ?, ?) by means of any given pair of fitting pseudo-operations ? and 0. This integral shall be endowed with properties which are corresponding to the characteristic properties of Choquet and Sugeno integrals. It coincides with the former or the later in special cases. The instruments which we use in order to construct the general fuzzy integral are the following: A) A fuzzy measure space (?, ?, ?) with ? :A^> [0. ?], ? = ?(?) e ]0, oo]. B) A pseudo-addition ?: [0, F] ? [0, F] -> [0, F], which gives to the interval [0, F] the structure of I-semigroup. C) ? ?-fitting pseudo-multiplication О: [0, F] ? [0, ?] -> [0, F]. We do not request any property of continuity for the fuzzy measure ?, nevertheless we shall recognize in this section that the best background for the general fuzzy integral is a measurable space equipped with a fuzzy measure continuous from below. The general fuzzy integral is a functional defined on the family ? of Д-measurable functions /: ? -> [0, F]: j /Od/i:.F-*[0,F]. We request for the general fuzzy integral the following properties. (Ge. 1) Basic values. / Ь(с,СH<*м = сОд(С). In particular, if the pseudo-multiplication admits a left unit element и, the integral of the pseudo-characteristic functions Uc = b(u, C) reconstructs the given fuzzy measure: / Uc ???=?@ VCeA
Monotone set functions-based integrals 1353 (GE. 2) MONOTONICITY. /? /-? f?d?? ????. (GE. 3) COMONOTONE ?-ADDITIVITY. /? /-? /-? (/?,?)???=/ /????/ #???. (Ge. 4) Horizontal ?-additivity. Given a measurable function / and a number с e [0, F], we obtain a horizontal ®-additive decomposition by cutting the function / at the level с and putting: / = (/лс)ф(/©с). We request that the general fuzzy integral is ?-additive for any horizontal ?-additive decomposition: /? /-? /¦ f?d? = (/?^???® f?d?= (/лсH<*мф/ (/??)???. Horizontal ?-additivity is a weaker form of comonotone ?-additivity because the two functions (/ л с) and (/ ? с) are comonotone. (Ge. 5) Continuity from below. If ? is continuous from below we request that the integral is continuous from below, i.e., for any non-decreasing sequence {/„}„ен: / (sup/„j ©<*M = sup / /,,???. We define the general fuzzy integral as a functional ? -> [0, F], which satisfies the properties (Ge. 1,2, 3, 4, 5). In reality, this functional is univocally determined by the properties (Ge. 1, 4,5), as well as the Choquet and Sugeno integrals, which are determined by properties (Ch. 1, 5, 6) and (Su. 1, 5, 6), respectively. We begin building the functional on the family S of the (measurable) simple functions from ? to [0, F]. Afterwards, we recognize that the functional so defined in S verifies all properties requested. Finally, by means of the continuity from below, we extend the functional on the whole family ? of all Д-measurable functions and we prove that the so obtained functional verifies as well all properties (Ge. 1,2,3,4,5).
1354 P. Benvenuti et al. 4.1. Integral of simple functions Any simple function s admits the standard classical representation: 11 s = \Jb(aj,Ai), D.1) i=\ with 0 < ? ? < 02 < ¦ ¦ ¦ < a„ and А, П Ay = 0 for i ? j ¦ More, it admits many comonotone ?-additive step representations (shortly ?-step representation) m s = 0fo(c,,C,), withCi 2C2 2---2C,,,. D.2) i=\ The standard ?-step representation ? s=®b(c*,C*) D.3) i=l is the minimal one for the number of steps and for the height of the single steps; it is obtained from the classical one taking in D.2) It m=n, C* = \^JAj and c* = a\, c\ = аг ??\, ..., с* = ?,, ? ?„-? · j=' In order to define the integral of a simple function s we can use its standard ?-step representation because it is obtained by means of ? — 1 repeated horizontal ?-additive decompositions. For any functional L® :S -> [0, F] which satisfies (Ge. 1) and (Ge. 4), by repeated applications of these properties, we obtain necessarily: m ^%) = фс*©М(С*). D.4) Now, we associate to any ?-step representation D.2) of a simple function s the expression: m ? = фс,Ом(С,). D.5) 7=1 LEMMA 4.1. The expression D.5) gives the same result for all ®-step representations of any given simple function s. Therefore m m ? = 0 c; ? ?(?) = 0c,* ? м(С,*) = L®(s) D.6) ?'=? ? = ? for any ?-step representation of the simple function s.
Monotone setfiinctioiis-based integrals 1355 PROOF. 1. First we consider the presence in the representation D.2) of a step subdivided horizontally in two steps. Let CJ+\ = Cj, the two steps b{cj,Cj) and b(cJ+\, C]+\) form a horizontal cut of the single step b(Cj ? cJ+\, Cj). From the left distributivity of ? it is: Cj ? ? (С/) ? с,-+| 0 д(С^) = (с,- ? с,-+|) ? ? (С;)· Therefore the two steps can be replaced by the single one fo(cy ? c,-+i, C/) without modifying the result ? of the expression D.5). 2. Secondly, we consider the possible presence of an inessential step. If c\ ?· · -®Cj-\ = a, and ?,- ? Cj = a,-, the step b(Cj,Cj) is inessential in the representation D.2). We prove that the corresponding term cj ? ? (Су) in D.5) is also negligible. Let Sj = c\ ? ?((?|) ? · · · ? Cj-\ ? ?^-?), from the monotonicity of the two operations and from the left distributivity, it is Sj > ^\®---®?]^)??^]-\) = ????^?-\). As the pseudo-addition ? is continuous, there exists w e [0, F] such that Sj¦ = w ? ?, ? ?^}-\). Again from the monotonicity it holds: Sj ^ Sj+\ = Sj ? Cj ? ?(G7) ^ ?? ? ?, ? ?(?>-?) ? с, 0д(Суч) = ?? ? (?,- ? с,-) ? [i(Cj- \) = w ? ?, ? ?(G7-1) = 5;. So it is S; = Sj+\ and the inessential step can be eliminated without modifying the result ? of the expression D.5). 3. Then we consider the possible presence in D.2) of a non-minimal step. Let Cj be not minimal, the step b(cj,Cj) can be replaced by b(c'-,Cj), with c' < cy, because the difference is the inessential step b(cj ? c',Cj). 4. Finally, starting from any ?-step representation, by elimination of all inessential steps and all subdivision of a single step in two or more steps, we attain the standard one, without modifying the result ? of the expression D.5). ? The functional L® is univocally defined by the expressions D.4) or D.6): it can be evaluated for any given simple function s starting from the standard ?-step representation or from any ?-step representation. LEMMA 4.2. The functional Le, defined on S by D.6), is non-decreasing. PROOF. We consider the representations of two simple function by means of a common partition {A,: / = 1,2,...,«} II И i = \/fo(a,,A,), ?' = \/??\,??) i=\ i=\
1356 P. Bemvnuti et al. with ?| ^ Д2 < · · · ^ a„ and a\ ^ a'-, < · · · ^ a'ir The relation s < s' involves a; < a- V/. For evaluating the integrals we consider two ?-step representations: II II II * = ф/>(с,,С,), i' = 0fo(C;,C,) withC,=[jAy. D.7) i = l , = l j=\ It is sufficient to prove that In m фс,Ом(С,)<фс;Ом(С,) D.8) 7=1 7=1 for a, = a,' for / ? к and а к < a'k. If к = ?, we assume in D.7): c; = c;' for / < ? and hence necessarily c„ < c,',. The inequality D.8) is a direct consequence of the monotonicity of the operations ? and ©. \f к < ?, we assume c-, = c\ for / ? к к + 1 and putting d = a'k Q a* we take Ck =ak ??*-?, 4 =ck®d, ck+\ = d®c'k+r c'k+] =ak+\ Qa'k. The two sides of D.7) have the same terms except Ck О ?(Ck)®Ck+\ ? ?(Ck+\) in the left side and c'k ? ? (С к) ? с'к+, ? ?(Ck+1) in the right one. Now for these terms we have: Ck ? М(С*) ? Cjt+i ??(Ck+\) = «0 д(С*) 0d ?m(C*+i) ? q.+ | ? ?(<7*+?) <с*Од(С*)ФйОд(С*)фс;+|Од(С*+|) = q©M(CjtHq.+ | OM(Cjt+i). The inequality D.7) is now a direct consequence of the associativity and monotonicity of the operation ?. D LEMMA 4.3. The functional L®, defined in S on D.6), is comonotone (B-additive. PROOF. Let s, s' e Sm- If s ~ s' the corresponding chains C, and Cs· belong to the chain CS0j' =: {i2, С ?, Сг, ¦ ¦ ¦, С„, 0). The functions s, s', s ? s' admit the following ©-step representations, in which there are present, possibly, some inessential or null steps: II II II i = 0fo(c,,C,), i' = 0fo(c;,Ci). s®s' = Qb(ci®c'„Ct). 1=1 i=l i=l By means of the left distributivity of ? and the associativity of ? we obtain from D.6): L®(f®g) = L®(f)®L®(g). ? LEMMA 4.4. If ? is ®-additive and © is distributive on both sides, then the functional L®, defined in S on D.6), is ®-additive.
Monotone set functions-based integrals 1357 PROOF. For a given simple function we consider the classical representation and any ©- additive representation, with basic functions not necessarily comonotone: in s = ®Hdj,Dj). D.9) i=\ We express the sets by means of the atoms {P\, Рг, ¦ ¦ ¦, Pn] of the partition generated by the families [Dj} and {A,}: Ai=\JPh, DJ=\Jph heH, neK, where tf, = {h: Ph С A;) and K; = {h: Pn с Dj] are subsets of {1,2, ...,N]. {H\, Нг, · · ·, H„) is a partition of {1, 2,..., N) and D.9) is equivalent to a, = ? dj for all /г such that Ph с А,. Oj2/>,, So we have /1 0а,Ом(А,) = 0фа,Ом(/7,) = 0ф( ф ^JOM(ft) ?=? 1 = 1 he Hi i=\ heH,^ Dj^p,, ? ? = ? ? ^??(^) = ? ? ^??(?,) /?=? Dj2P,, /?=? ?,,??^ = ?^??(??). 7=1 For a given simple function s we obtain the same result when we use the classical representation or any ?-additive representation, comonotone or quite arbitrary. But, if we consider the standard ©-step representation or any ?-step one, the result gives the value of the functional L®; therefore we have for any ?-additive representation of s: 0^0M(D;) = L®(s) = ??, ? ?(?,). j=\ i=\ This formula shows clearly the asserted ?-additivity of the functional Le and the possibility to evaluate it starting, as well, from the classical representation. D
1358 P. Benveiiuti el al. 4.2. Integral of measurable functions In order to define the general fuzzy integral, the functional L® should be extended to the whole family ? of all measurable functions. The procedure is similar to those used sometimes for the Lebesgue integral. Given a function / e f, we consider the class S/ of all simple functions s such that s < /. Dehnition 4.5. We define for any / e T, in agreement with the requested property (Ge. 5): / f(ddii=:%\^{L®{s):s^Ss]. D.10) THEOREM 4.6. The integral defined above verifies the requested properties (Ge. 1, 2, 3, 4). PROOF. We first observe that the integral D.10) is an extension to ? of the functional Le; this is a consequence of Lemma 4.2: f s(ddii = l®(s) VseS. D.11) The property (Ge. 1) is true for Le by virtue of D.6) and for the integral, as well, by virtue of D.11). The monotonicity (Ge. 2) of the integral is evident directly by definition. Now we give the proof of the comonotone ?-additivity (Ge. 3), so (Ge. 4) is proved, too. Given two functions / and g in T, we take /? r® fOd? and ?' < / gOd?^, then there exist two simple functions s eSf and s' e Sg such that L®(s) > ? and L®(s') > ?'. We consider the standard ?-step representations ? ? ?=| ?=? and we construct two modified functions ? and J' by replacing the sets C, and C\ with: C,¦ = ????: /(<й)>фсЛ and С-= \? e ?: g(w) ^ фс'Л.
Monotone set functions-based integrals 1359 It is easy to see that 11 И s <: s = ф/>(с,, Ci) ? f, s' <: Г = 0fo(c,', C\) < g. i=l i=l Now, if / ~ g, then s ~ J' and from Lemma 4.3 we obtain: L®(s®s') = L®(s) ? Le(s') > Le(s) ? Le(s') > ? ? ?'. As (? ? ?') e «S/e^, and ?, ?' are arbitrary we obtain: /? /-? /-? (f?g)Od?^ fQd?®^ #???. We prove now the opposite inequality by using the functions <pj and <ря of Proposition 2.2(iii). To any given s e S/®g we associate two comonotone simple functions Sf = iff о s and Sg =ifsos. We note that Sf eS/ and s g e S,,, and Sf®sg =s. Therefore, from Lemma 4.3, the definition D.10) and the monotonicity of the operation ?, we obtain: /? /-? fOd?? / g?d?. For arbitrariness of s e Sf®g we conclude /? /-? /-? (f®g)Od?^ fOd?®j 8???. ? THEOREM 4.7. If the fuzzy measure ? is continuous from below, then the integral defined by D.10) is continuous from below. PROOF. First, we take a sequence {/itheN of measurable functions which converges pointwise to a simple function s e S and we consider this function given by its classical representation and its standard ©-step one: II It fk/s = \fb(a,,Ai) = ($b(c*,C*). i=\ i=l Let с,,..., c,', be real numbers such that: I c'j < c* V/ and ?,_? < ??c- = a\ < a\ for /' = 2, 3,..., n.
1360 P. Bemenuti el al. We construct the sequence sk =Q)b(c';, ? with C* = {? e ? | /t(<u) > a,'}· We get: • Sk < fk because sk (?) = a] =>· ? e cf =>· /t (?) > ?,', • Cf с С/ because ? e С* =>· 5 (?) ^ ? (?) > ?- =>· 5 (?) > ?;, • С* ?' C, because ? е С,- =>· /* (?) ?' s (?) ^ ?, > ? ¦. Therefore: /¦? " " lim / fk?d?^ lim ? с,' ©м(С*) =0с,' O^Cf). к^+ос J k^ + oc ^^ /=1 ' = 1 From arbitrariness of с' we obtain / /;· © ?? / L®(s). Now we assume /* /* /, with f e J7. Let s e <S/\ then fk As / s and so /? /·? /·? fkOd?^ lim / (fk?s)Od?= ?©???. As s is arbitrary in S/ and because of monotonicity (Ge. 2), we obtain: /? /-? fkOd?/J /???(/). ? The following theorem generalize Theorem 2.8 and Theorem 2.9 and shows that really a measurable space, with a fuzzy measure continuous from below, is suitable for construction of a general fuzzy integral. THEOREM 4.8. In a fuzzy measure space (?,?,?), with ? continuous from below, the general fuzzy integral is the only functional Lce-F -> [0, F] which verifies the properties (Ge. 1, 4, 5). PROOF. The proof is exactly the same as in Theorem 2.8 after replacement of the operations + and · by any couple of fitting pseudo-addition ? and pseudo-multiplication ©. D Remark 4.1. Given any pseudo-addition ? : [0, F] ? [0. F] -> [0, F] and any ©-fitting pseudo-multiplication О: [0, F] ? [0, ?] -> [0, F], the construction of the general fuzzy integral is applicable in any fuzzy measure space (?, ?, ?) with ?(?) ^ ?. The general fuzzy integral possesses all the properties (Ge. 1-5); it is idempotent if ?{?) is a right unit for the pseudo-multiplication. Moreover, the construction leading to the definitions D.4) and D.10) is the only possible in agreement with (Ge. 1,4,5). So the general fuzzy integral defined above is characterized by these properties.
Monotone set functions-based integrals 1361 From the construction of the general fuzzy integral its pseudo-additivity with respect to pseudo-multiplication follows. COROLLARY 4.9. Under the notations of Corollary 3.6, J ?((?\)®(??))??=([ /????\®(? f ?? ??\ D.12) PROOF. For any ?-step representation D.2) of a given simple function s we obtain, by virtue of the associativity of the pseudo-addition ?: in . m \ ? in 0c,-((Oi) ? @2))м(С,) = ( 0c,¦ ?? м(С,)) ? ? 0c, ?2 ? (С/) i=\ 4=1 ' 4=1 This equation shows the relation D.12) for the integrals of all simple functions. The validity of the same relation for any measurable function follows immediately from definition D.10). ? 4.3. Additional properties of integral with pseudo-ring operations In this subsection we consider the structure of pseudo-ring on the interval [0, F]. We assume that the two operations ? and © are both defined on the square [0, F] ? [0, F]. We deal with some properties of the integral, depending on conditions requested to the pseudo-multiplication in addition to the fundamental ones (M1-M4). THEOREM 4.10. If the pseudo-multiplication is associative and right distributive, then the integral is Q-homogeneous, i.e.: /® /-? (??/)??? = ?© / f ???. PROOF. By virtue of the left continuity of ? it is sufficient to prove the equality only for the simple functions. We consider any ?-step representation of a simple function; from the right distributivity we have: It It i = 0fe(C;,Ci) => flOi = 0fe(fl©Ci,Ci). i = l f=l From Lemma 4.1 and by means of associativity and right distributivity we obtain: r® " / " \ / (a?f)?d? = 0(fl ? с,) ? д(С) = ? ? I 0с, ? д(С,) 1 J i=\ 4=? ' г® = aQ /???. ?
1362 P. Benvenuli el al. THEOREM 4.11. If the fuzzy measure ? is ®-additive and continuous from below, and if the pseudo-multiplication ? is distributive on both sides, the general fuzzy integral is (&-additive and coincides with the integral of Sugeno-Murofushi (see A987)): /? /·? /???=? fdii = SM(f,ii,®,Q). PROOF. The integral of Sugeno-Murofushi is ?-additive and continuous from below. It is defined in S by means of the ?-additivity, starting from the classical decomposition. The coincidence of the two integrals on S is given by Lemma 4.4 and on ? by continuity from below. D For the general fuzzy integral a representation theorem similar to Theorem 2.6 is true. THEOREM 4.12. Let ?/ be the restriction of ? to the chain C/, let A/ be the ?- algebra generated by C/ and m? the (B-additive continuous from below measure induced on A/ from ??. If the pseudo-multiplication is distributive on both sides, the general fuzzy integral of the function f coincides with the Sugeno-Murofushi integral on the fuzzy measure space (?,?/, /я?): /? ?-л fOd?= fdmJ=SM(f,mJ,®,0). ?? PROOF. The function / is A/-measurable. ? THEOREM 4.13. In the case of pan-multiplication ? the general fuzzy integral coincides with the so called Choquet-like integral ofMesiar: /? /·? fQd?= / hdk® = SM(h, ??, ?, ?). ?? The function h(x) = ?@/(?)) is the same as in the Choquet integral. The ?-additive measure ?? is defined on the Borel class of [0, +oo], it is continuous from below and determined by the condition ?([0, ?]) = ?, Vjc e [0, +oo] (see Mesiar A995)). The integral of a given measurable function /: ? -> [0, F] is a functional on the set ? of all fuzzy measures ? : A -> [0, F]. The set ? is partially ordered by the relation: Ml -<M2 «=> ??(?)^ ?2{?L? e A. THEOREM 4.14. The general fuzzy integral is a monotone functional on M'- J fOd?^?:J fQd?^ fQd?
Monotone set functions-based integrals 1363 PROOF. The inequality is evident for the simple functions (from monotonicity of the operations ? and ©). For the measurable functions it follows from the continuity from below of the integral. D The set ? is closed with respect to the pseudo-addition ? :? ? ? -> ? defined by: M = Ml0M2 «=> м(А) = Д|(А)фд2(А)УАе Л. THEOREM 4.15. If the pseudo-multiplication is as well right distributive, then the integral is ^-additive with respect to the pseudo-addition of measures: /? /·? /-? /Оф|Фд2)=/ fOd?^(B f ???2. D.13) PROOF. It is sufficient to prove the equality for the simple functions. As a matter of fact, from the right distributivity of О and from the associativity of ? we have: фс,0(М|(С;)Фм2(С,))=ф(с,Ом|(С,)фс,Ом2(С,)) f = l /=1 = (фс,ОМ|(С,))©@с,Ом2(С,)), i.e., by definition D.6) the equality D.13) holds. ? 4.4. Extension of the integral to [— F, F] The Choquet integral, defined for functions whose range is in [0, oo], admits two different extensions to the whole real line [—oo, oo]: the so called asymmetric Choquet integral and the symmetric one (see, e.g., Denneberg A994) and Pap A995)). The asymmetric Choquet integral is defined for finite fuzzy measure and measurable function / : ? -» [—oo, oo] by (a) J fdix = ^ f+?? - j f-??'1 where /+ = / ? 0, /~ = (—/) ? 0 and ?'' is the dual fuzzy measure: ?'1 (A) = ?{?) - м(Ас). However the integral is not defined if the integrals of /+ and /~ are both infinite. The asymmetric integral is comonotone additive and hence shift invariant, i.e., for any real constant c. (a) j(c + f)d? = c?(?) + (a)Jfd?.
1364 P. Bemenuli et al. Therefore, if inf / = с > — oo, (a) I fdiL = ??(?) + I (/ - ?)??. However, the homogeneity of the asymmetric integral holds in general only with respect to multiplication by a constant с > 0 and fails if с < 0. Especially (a) / b{c,C)d? = c^{C) if с > 0, ????@ ifc<0. The symmetric Choquet integral was originally developed as the Sipos integral A979) and is defined for any (even non finite) fuzzy measure by (s) J fdn = j f+??- j ??? again avoiding terms oo — oo. The symmetric integral is homogeneous with respect to multiplication by a constant с positive or negative. However the symmetric integral is neither comonotone additive nor horizontally additive. For general fuzzy integral, related to a given pseudo-addition ? on [0, F], the extension to the measurable functions with values in [— F. F] necessarily requires an appropriate extension of the operations ? and ©. However, such extension with suitable properties need not exist. Note that even in above discussed case of the Choquet integral, the common addition defined on the extended real line is defective in the points (oo, -oo) and (—oo, oo). When extending naturally the idempotent pseudo-addition ? = ? from [0, F] to [- F, F], again the only idempotent extension is v. But now the neutral element will be not more 0 but - F. This approach allows to extend, for example, the Sugeno integral to the interval [- F, F]. However, keeping ?@) = 0, the Sugeno integral is always non-negative, for non-positive functions as well. Recently Grabisch proposed an approach to define a symmetric Sugeno integral on [— F, F]; nevertheless, the pseudo-addition used there is not continuous and the monotonicity of the pseudo-multiplication is violated (Grabisch, 2000). This integral is out of our framework. Above mentioned problems exclude a general method of extending the general fuzzy integral defined on [0, F] to [- F, F] keeping a reasonable number of properties and hence we will not discuss this topic. 5. Examples In this section, we introduce several examples of fitting pseudo-additions and pseudo- multiplications and the corresponding fuzzy integrals. The first examples are about the general form of pseudo-multiplication satisfying only the fundamental conditions (M1-M4).
Monotone set functions-based integrals 1365 To simplify the notations, as well as for the consideration of fuzzy measures not necessarily continuous from below, we put in the following: ?/(?)=:?(G/(?)) and ?}(a) = : sup ?/(?). a <x < F In general for each fuzzy measure it is: ?*??) ^ ?/(a), but if ? is continuous from below, then ?^· (?) = ?^(?). EXAMPLE 5.1 {Continuation of Examples 3.1 and 3.4). a®b = g-l(g(a)+g(b)). aOb = g~] (g(a)k(b)). with g: [0, F] -»¦ [0, oo] continuous and increasing, g@) = 0, g(F) = oo, ?: [0, ?] -> [0, ??'] left continuous non-decreasing, ?@) = 0. The integral is a transformation of the standard Choquet integral: j /0«?? = ^?(/(«?/)^(???)). E.1) The function g acts as a change of scale on the interval [0, F] which is monotonically transformed in the interval [0, oo]. The function ? acts as a change of scale on the values of the fuzzy measure; the function ? ? ?: ? -> [0, ?'] is, as well, a fuzzy measure. In particular, if ?(?(?)) = 1, then the integral satisfies the idempotence property: / ???? = ? Vc e [0, F]. If F = ?, any element и е ]0, F[ may be assumed as the (unique) left unit. Then the function ?„ and the ®-fitting pseudo-multiplication ©„ are uniquely determined: ?„ (&)=——, aO„b = g —— . g(u) \ g(u) ) Putting h(a) =: g(a)/g(u), we obtain h(u) = 1 and ? ? b = h~l(h(a) +h(b)), ? ?„ b = /г (h(a)h(b)). Moreover, j /?„ ?? = ?-?(<[(???/)?·(???)\. E.2) The integrals of the pseudo-characteristic functions Uc = b(u. C) reproduce the fuzzy measure: f ?????? = ?@.
1366 P. Benvenuti el al. The integral E.2) is a generalization of the Pap g-integral A993) which is defined starting from a ?-additive measure. Compare also Markova A996) and Mesiar A996). EXAMPLE 5.2 (Continuation of Examples 3.2 and 3.5). ? ? b = ?"' ((g(a) +g(b)) A g(F)), aQb = g ((g(a)k(b)) A g(F)), with g : [0, F] -> [0, g(F)] continuous and increasing, g@) = 0, g(F) < oo, ?: [0, ?] -> {0} U [1, oo] left-continuous non-decreasing, ?@) = 0. The integral is similar to the previous one: j f?d? = g-i(g(F)A /(??/)?(???)? Moreover, the integral satisfies the idempotence property if the range of ? is {0, 1), i.e., k(b) — 0 for b ^ bo, k(b) = 1 for b > bo, with fro < ?(?). In this case, the integral turns out to be independent of the function g which characterizes the operation ? and we have: f f ??? = esssup(/) = supjjc: ?(?/(?)) > bo}- ??? No ?-fitting pseudo-multiplication with a left unit exists in this case (see Definition 3.1). EXAMPLE 5.3 (Continuation of Examples 3.3 and 3.6). a®b = aw b, aOb = h(a,b) with h:[0,F] x [0,?] -> [0, F] left-continuous, non-decreasing in both variables, /2@, b) = h(a,0) = 0. The distributivity (left and right) is assured by the monotonicity: f f?d?= у h(a^(Cf(a)). E.3) яе[0.Л If h(a, ?(?)) = a for all a e [0, F], then the integral is idempotent. This example shows a generalization of the Sugeno and the Ralescu-Adams integrals A980). Both use h(a,b) = a A b; the first on [0,1], the second on [0, oo]. Next, if ? = F — 1, then the integral E.3) generalize also the Weber's integral A986) in which h(a, b) = ? (a, b) is a strict f-norm. Reasonable candidates for v-fitting pseudo-multiplication are left-continuous f-norms (with unit 1) or left-continuous uninorms (with unit и e ]0, 1 [).
Monotone setfimctions-based integrals 1367 Example 5.3.1. Let® = ? on [0, 1] ? [0, 1], assume on [0,1] ? [0, 1] that a Qb = a b (or the same on [0, oo]). The associated integral ; /??? = \J (fl/i(C/(fl))) ae[0.1] is idempotent whenever ?(?) = 1. It is also ©-homogeneous and reproducing, i.e., the integral of the pseudo-characteristic function Ua = b( 1, A) results in the underlying fuzzy measure. Observe that this integral is a special case of Weber's A986) generalization of the Sugeno integral in [0,1], while on [0, oo] it generalizes the Shilkret integral A971). Example 5.3.2. Let ? = ? on [0, F] ? [0, F], assume ? © 0 = 0 and ?? b = a on [0, F]x]0, M]. The associated integral r f ??? = ess sup/ =: sup {?: ?(?/(?)) > 0} is idempotent, ©-homogeneous but it doesn't reproduce the fuzzy measure. Example 5.3.3. Let ? = ? on [0, F]2, for a e ]0, F[ we define ©„ : [0, F]2 -> [0, F] by \ a /\b if ? < a, aOab=\ I b if a > a. Then ©„ is a v-fitting pseudo-multiplication. All elements и е ]a, F] are left unity for the pseudo-multiplication ©„; each corresponding basic function b(u,A) is a pseudo- characteristic function. Moreover, ©„ is associative and distributive; so the resulting integral is ©„ -homogeneous. For a given measurable function /: ? -> [0, F] and a given fuzzy measure ? : ? -> [0, F], the corresponding integral is given by ? /???? = ?*/(?)? I (f ??)?? = ?}(?) V j f ??. E.4) Note that: /V f fV (/??)????, $ (f ??)??= ? (/??)????. So, the first expression in E.4) correspond to the application of the ©-horizontal additivity. Moreover, if $ f ?? > ?*(/) then it is also S /??= V (aA/i(C/(a))) = i(/Aa)d/i 0<a<a
1368 P. Benvenuti el al. and we obtain the second expression in E.4). Observe also that ©„ = (л) ? (©*) (with notations of Corollary 3.6) and I b it a > a. We obtain again the second expression in E.4) by application of Corollary 4.9 because /v /?*?? = ?*/(?). Example 5.3.4. Let ? = ? on [0, F]2 and let ?:[0, F] -> [0, F] be an increasing bijection. Put aQb = (p{a) Kb, a,be[0,F]. The operation © is a v-fitting pseudo-multiplication and the integral is: / / ?? ??. = & ? ? f ??. For the pseudo-multiplication ?? the only left unity is и = F which is as well the right unity; so if ?(?) = F, the resulting integral is idempotent. The pseudo-multiplication is associative and the integral is ©-homogeneous in the case ? = id only. In this case the integral is just the Sugeno integral on [0, F]. Example 5.4. We assume for ? the general form C.1) ? ? ь = f Sk'\(gk(a)+gk(b)) A gk0k)) if ]a. b[ e ]ak. flt[2, ( a v b otherwise. Let for some к e K, gk(fik) = oo. Consider a left-continuous non-decreasing function ?: [0, F] -> [0, oo], with ?@) = 0, and put d\ =: sup{fo: k(b) = 0), d2 =: inf{b: k(b) = oo}. For a fixed h e K,v/c construct a non-decreasing system of left-continuous solutions of the Cauchy equation putting (see Theorem 3.5): for b^d] fh(a)=0, 0 ifa^ajt, for d\ < b ? d2 Ma) = gbl(gl,Wh) A gk(a)Hb)) if a* < a < Д., ??, ifa>^·, for b>d2 /*(«>= (д, iffl>at. We obtain the ?-fitting pseudo-multiplication ©;¦/,, see Figure 5.1. aQkhb = Gh(gf.(a)k(b)) @ · oo = oo · 0 = 0), E.5)
Monotone setfiincrions-based Integrals 1369 where g>) = Fig. 5.1. 0 if a ^ oik:, gk(a) if a* <a ^ &·, G/,(*) = . oo if ? > Д, 0 if.r=0, g-'U) ifO<*<g/,(fl,), . /3,, if .v > #,(#,). It is easy to verify that /?(???) = G,,(/ (lo^fC/f^'U)))^ E.6) Really the expression E.6) gives a functional on ? which verifies (Ge. 1, 4, 5), therefore it is identical to the integral. Note that: sup M(Cf(^'(*)))= ?* (<**) and inf M(C/-(gi7lW))=M/-(A). Then the value of the integral is zero whenever ?^(?^) ^ d\. It is exactly ?/, if ?/($0 > d\ or ?^(?^) > <i2- Otherwise, the integration gives a value in the interval ]<*/,, ?/,]. Example 5.5. Let the pseudo-addition ®:[0, F]2 -> [0. F] possesses only one non- trivial idempotent a e ]0, F[. Then there are two mappings g\ :[0,a]-> [0, oo], g2 : [<*· Л-> [0, oo] generating ? (see Proposition 3.2 and Figure 5.2). If any of these mappings is unbounded, i.e., if either g\(a) = oo or gi(F) = oo, there exist a ?-fitting associative pseudo- multiplication ? with left unit (see Proposition 3.7 and Example 3.10). Suppose that both
1370 P. Benvenuti el al. Fig. 5.2. Pseudo-addition for Examples 5.5. 5.5.1. 5.5.2. 5.5.3. g\ and g2 are unbounded, then there exist ?-fitting pseudo-multiplications © with left unit element и е]0,a[ and, as well, with left unit element и е]а, F[. In both cases the unit element is unique (see Corollary 3.9). Example 5.5.1. Let и e]0,<*[. With no loss of generality we may suppose g\(u) = 1. The general form of the ?-fitting pseudo-multiplications with left unit и is given by aOb-- Г 0 if a = 0 or b = 0, ? g, \g\(a)g\(b)) if0<a, b^a. ft'diWfiW) ifO<a^a,a<b^F, У 8?* Ш(а)к(Ь)) ifa<a^F, where ?: [0, F] -> [0, oo] is a left continuous non-decreasing map such that ?@) = 0 and k(b) = oo for all b > a. See Figure 5.3 in which d\ = sup{b: k(b) = 0} > 0, and di = inf{b: k(b) = oo) ^ a. The pseudo-multiplication ©H can be obtained (see Corollary 3.6) as ©„ = (©?) ? (?2) ? (©з), where (with notation of Example 5.4): aO\b = G\ (g*\{a)g\(b)), aQ2b = G2{g^a)gUb)), aQ3b = G2(g*2{a)k{b)). The pseudo-multiplication ©|, ©2, ©3 are ?-fitting, nevertheless и is left unit for ©, but not for ©?, never for ©2 or for ©3. We introduce three integrals: h=f /02ЛМ = С2(/(?Г°/)Л(?2°М)). h=j /???? = 02(?(82*?/)?(???)\
Monotone set functions-based integrals 1371 Fig. 5.3. Pseudo-multiplication for Example 5.5.1. and we obtain (see Corollary 4.9): I =¦ / / Ou dix = I\®h® h = h ? (/2 ? h). Note that if ?*, @) < a and ess sup / < a, then h = /3=0 and Moreover, if ?*??) > di, then / = /3 = F. Otherwise / = /2 ? /3 = 821По°°(82 о д)(С/(«Г' (*))) dx + j (?o?)(Cf(gTi(x)))d?\
1372 P. Benvenuli el al. f A Fig. 5.4. Pseudo-multiplication for Example 5.5.: EXAMPLE 5.5.2. Let и e]a, F[. With no loss of generality we may suppose gi{u) = 1. The general form of the ?-fitting pseudo-multiplications with left unit и is given by aQb-. 0 ifa = 0or6 = 0. g-[(g\(a)k(b)) ifO<as:a, g\\g2{a)gx{b)) ifa<a^F,0^b^a, g^\gl(a)g2(b)) if a < a, bs^ F, where ? : [0, F] -> [0, oo] is a left continuous non-decreasing map such that ?@) = 0 for all b ? a. See Figure 5.4 in which d\ = sup{b: k(b) = 0} > a.
Monotone set functions-based integrals 1373 The pseudo-multiplication © can be obtained by the pseudo-addition of three ?-fitting pseudo-multiplications: ? = (??) ? (?2) ? (О.ч), where: ? ?? b = Gi (gf (?)?(?>)), a Q2 b = d (gj(fl)gf(fo)), a03b = G2(g^a)g*(b)). We introduce the integrals: f Q2 ?? = ??{?{81 ? f)d(g*x ? ?) ?)?(?\??) ч ч h = f /03<*m = G2(Y(,?2*° and we obtain: /=:/ /?» ^? = (/|?/2)?/3. Note that if ?*?0) ^ d\ and ess sup/ ^ a then / = 0. Moreover, if ?*??) > a, then I = h > a. Otherwise I = I\®„h = gTl(j (? ? M)(C/(gf' U))) d.r Example 5.5.3. The general form of the ?-fitting pseudo-multiplications with nghtunit F is given by f 0 aQb = if a = 0 or b = 0, *?'(*?(?)?|(&)) if0<a^a, gr'(g2(a)A2(fc)) if<*<aiC F and 0 < b < <f\ . g^'(g2(a^3(W if ? <?^ F and 0 < fo iC F, where ? ? : ]0, F] -> [0, 1], ?2: ]0, d*] -> [0, oo], ?-,: ]<**, F] -> [0, 1] are left continuous non-decreasing maps such that: d\ = sup{b \ k\(b) = 0} > ? = sup{fo | ?2(/>) = 0}, and X[(F) = k2(F) = 1. See Figure 5.5. The pseudo-multiplication © can be obtained as ?„ = (??) ? (?2) ? (Оз), where: aQib = Gi (gf (?)?,(?>)), ? ?2 fo = G, (??2*(?)?2(/>)), а Оз b = G2(g2(a)k3(b))
1374 P. Benvenuti et al. Fig. 5.5. Pseudo-multiplication for Example 5.5.3. with notations of Example 5.4. We introduce the integrals: h=j /???? = ??(?(§??/)?(????)\ h=\ /?2?? = ??(?(8*_?/)?(?2??)\ h= I /03^ = G2(i (*??/)?/(?3??)? and we obtain /=:/ /0»<?? = (/??/2)?/3.
Monotone set functions-based integrals 1375 Fig. 5.6. Pseudo-addition and pseudo-multiplication from Example 5.6. Note that if ?^·@) ^ d\ and ?^-(?) ^ <fr then / = 0. Moreover, if ?^(?) > d*, then / = /3 > a. Otherwise / = /, ? h. Example 5.6 (see Benvenuti and Vivona A996a, 1996b)). Let the pseudo-addition ? on [0, F] be given by a®b = g-\g(a) + g(b)) ifa^a,b^a, ? V b otherwise, where a e ]0, F[ and g: [a, F] —>¦ [0, oo] is an increasing bijection. For a given unit element и we may suppose with no loss of generality g(u) = 1. Any symmetric ©-fitting pseudo-multiplication has the following form, where Г is an (arbitrary) left-continuous triangular norm on [0, a]. See Figure 5.6. aQb = g \g(a)g(b)) ifa^a, b^a, T{a,b) а лЬ if a < a, b < a, otherwise. Put ? = л. Then the resulting integral is given by «4?|(«?/)?^)) ???}(?)>?, $ fdn where g* is defined similarly as in Example 5.4. / ???? otherwise, Example 5.7. Based on Example 5.6, we can introduce a one-parameter family of fuzzy integrals including the Choquet and the Sugeno integrals as the limit members.
1376 P. Bemenuti et al. Namely, for a given a e ]0, oof, put (for a, b e [0, oo]) a+b — a if a > a, b> a, a®a b = ? ?? b = av b otherwise, a + (a — a)(b — a) if a ^ a, b ^ a, a Ab The corresponding integral is otherwise. ????? = ?+?/???? \??*((?)>?, where /„ = (/ - a) V 0 and ?? = (? - a) V 0. The limit members are: /?(/) = ^/^?, ?? =$/??. Example 5.8. Let the pseudo-addition ? on [0, F] be given by ' g'\g(a) + g(b)) if ? ? a, b ? a, a®b = ?? b otherwise, where a e ]0, F[ and g : [0, a] -> [0, oo] is an increasing bijection. Any symmetric ?-fitting pseudo-multiplication © on [0, F] with the unit element и = g~' A) can be written in the following form: aQb = 0 ifa=0or/> = 0, g~] (g(a)g(b)) if a iC a, b^a, S{a,b) if a > a, b > a, avb otherwise, where S is an (arbitrary) left-continuous triangular conorm on [a, M]. Put S = v. Then the resulting integral is given by (using the notation from Example 5.4) / ??? = g '( f (g*°/)<*(?* ? ?)) ifesssup/sCa, esssup/ ? ??-(?) otherwise. 6. Conclusions Based on couples of a pseudo-addition ? and a pseudo-multiplication ©, we have constructed a large class of general fuzzy integrals (on any interval of type [0, F] with
Monotone set functions-based integrals 1377 F e ]0, oo]), including the Choquet and the Sugeno integrals as distinguished members. Several examples were given, including a one-parameter family of integrals with the Choquet and the Sugeno integrals as limit members. Several applications, especially in the area of aggregation operators and multi-criteria decision making can be expected. Our approach allows to reduce the choice of an appropriate integral to the choice of an appropriate pseudo-addition ? (for the processing of input values) and subsequently to the choice of a suitable pseudo-multiplication ©. The pseudo-multiplication must satisfy some fundamental conditions which are necessary in order to construct the integral. Other supplementary conditions can be requested in order to obtain an integral with some additional properties useful for particular applications. Acknowledgement The paper was written during the visit of the second author at the University "La Sapienza", Rome. The support of University "La Sapienza", the G.N.F.M., the MURST of Italy, the grants VEGA 1/8331/01 and 1/7146/20 are kindly announced. References Benvenuti, P. and Mesiar, R. B000a), Integrals with respect to a general fuzzy measure. Fuzzy Measures and Integrals, M. Grabisch, T. Murofushi and M. Sugeno, eds, Physica-Verlag, Heidelberg, 205-232. Benvenuti, P. and Mesiar, R. B000b), Pseudo-arithmetical operations as a basis for integration with respect to a general fuzzy measure. Submitted. Benvenuti, P. and Vivona, D. A996a), General theory of the fuzzy integral, Mathware and Soft Computing 3, 199-209. Benvenuti, P. and Vivona, D. A996b), Integral with respect tofitzzx measure, Proc. IPMU '96, Vol. 1, 231-233. Benvenuti, P. and Vivona, D. B000), Comonotone aggregation operators. Rend. Mat. 20, 323-336. Benvenuti, P., Vivona, D. and Divari, M. B002), The Cauchy equation on I-semigroups, Aequationes Mathe- maticae XX, 1-11. Bouchon-Meunier, B. A998), Aggregation and Fusion of Imperfect Information, Physica-Verlag, Heidelberg. Choquet, G. A953-1954), Theory of capacities, Ann. Inst. Fourier 5, 131-295. De Baets, B. A998), Uninorms: the known classes. Fuzzy Logic and Intelligent Technologies for Nuclear Science and Industry, Da Ruan et al., eds, Proc. Third Int. FLINS Workshop, World Scientific, Singapore, 21-28. De Baets, B. A999), Idempotent uninorms, European J. Oper. Res. 118, 631-642. de Campos, L.M., Lamata, M.T. and Moral, S. A991), A unified approach to define fuzzy integrals. Fuzzy Sets and Systems 39, 75-90. de Campos, L.M. and Bolanos, M.J. A993), Characterization and comparison of Sugeno and Choquet integrals. Fuzzy Sets and Systems 52, 61-67. De Giorgi, E. and Lena, G. A977), Line notion generate de convergence faible pour des fonctions croissantes d ensemble, Ann. Sc. Nor. Pisa (IV) 4, 61-99. Denneberg, D. A994), Non-Additive Measure and Integral, Kluwer Academic Publishers, Dordrecht. Grabisch, M. A996), Fuzzy measures and integrals: a survey of applications and recent issues. Fuzzy Sets Methods in Information Engineering: A Guided Tour of Applications, D. Dubois, H. Prade and R. Yager, eds, Wiley, New York. Grabisch, M. B000), Symmetric and asymmetric integrals: the ordinal cases. Proceedings IIZUKA, CD-room. Grabisch, M., Murofushi, T. and Sugeno, M. A992), Fuzzy measures of fuzzy events defined by fuzzy integrals. Fuzzy Sets and Systems 50, 293-313. Grabisch, M., Murofushi, T. and Sugeno, M. B000), Fuzzy Measures and Integrals, Physica-Verlag. Heidelberg.
1378 P. Benvenuti et al. Greco, G.H. A977), Integrate monotono. Rend. Sem. Mat. Univ. Padova 57, 149-166. Halmos, P.R. A950), Measure Theory, Van Nostrand Reinhold Company, New York. Imaoka, H. A997), On a subjective evaluation model by a generalized fuzzy integral. Int. J. Uncertainty, Fuzziness, and Knowledge-Based Systems 5, 517-529. Ishihachi, H., Tanaka, H. and Asai, K. A988), Fuzzy integrals based on pseudo-addition and multiplication, J. Math. Anal. Appl. 130, 354-364. Klement, E.P., Mesiar, R. and Pap, E. A996), On the relationship of associative compensatory operators to triangular norms and conorms, Intemat. J. Uncertainty. Fuzziness Knowledge-Based Systems 4, 25-36. Klement, E.P., Mesiar, R. and Pap, E. B000), Triangular Norms, Kluwer Academic Publishers, Dordrecht. Klement, E.P, Mesiar, R. and Pap, E. B001), A geometric approach to aggregation, Proc. EUSFLAT '01, Leicester. Kolesarova, A. A993), A note on the ?-measure based integrals, Tatra Mountains Math. Publ. 3, 173-182. Marinova I. A986), Integration with respect to a ?-measure. Math. Slovaca 36, 15-22. Markova, A. A996), A note on g-derivative and g-integral, Tatra Mountains Math. Publ. 8, 71-76. Matsushita, ? and Kambara, H. A996), Partition of attributes and fuzzy integral model for evaluation process, Proc. IIZUKA '96, Vol. 1, 303-307. Mesiar, R. A995), Choquet-like integrals, J. Math. Anal. Appl. 194, 477^488. Mesiar, R. A996), Pseudo-linear integrals and derivatives based on a generator g, Tatra Mountains Math. Publ. 8, 67-70. Mesiar, R. and Rybarik, J. A995), Pan-operations structure. Fuzzy Sets and Systems 74, 365-369. Mostert, PS. and Shields, A.L. A957), On the structure of semigroups on a compact manifold with boundary, Ann. of Math. 65, 117-143. Murofushi, T. and Sugeno, M. A991), Fuzz\ t-conorm integrals with respect to fuzzy measures: generalization of Sugeno integral and Choquet integral. Fuzzy Sets and Systems 42. 57-71. Murofushi, T. and Sugeno, M. A993), Some quantities represented by the Choquet integral. Fuzzy Sets and Systems 56, 229-235. Nelsen, R.B. A999), An Introduction to Copulas, Lectures Notes in Statistica, Vol. 139, Springer, Berlin. Onisawa, Т., Sugeno, M., Nishiwaki, Y, Kawai, H., and Harima ? A986), Fuzzy measure analysis of public attitude towards the use of nuclear energy. Fuzzy Sets and Systems 20, 259-289. Pap, E. A990), Integral generated by decomposable measure, Univ. Novom Sadu Zb. Rad. Prirod.-Mat. Fak. Ser. Mat. 20A), 135-144. Pap, E. A993), g-calculus, Univ. Novom Sadu Zb. Rad. Prirod.-Mat. Fak. Ser. Mat. 23 A), 145-156. Pap, E. A995), Null-Additive Set Functions, Kluwer Academic Publishers, Dordrecht. Pap, E. B002), Pseudo-additive measures and their applications. Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 1403-1468. Ralescu, D. and Adams, G. A980), The fuzzy integral, J. Math. Anal. Appl. 75, 562-570. Riecanova, Z. A982), About ?-additive and ?-maxitive measures. Math. Slovaca 32, 389-395. Schmeidler, D. A986), Integral representation without additivity, Proc. Amer. Math. Soc. 97, 255-261. Schweizer, B. and Sklar, A. A983), Probabilistic Metric Spaces, North-Holland, Amsterdam. Shilkret, N. A971), Maxitive measures and integration. Indag. Math. 33, 109-116. Sion, M. A973), A Theory of Semigroup Valued Measures, Lectures Notes in Math., Vol. 355, Springer, Berlin. SipoS, J. A979), Integral with respect to apremeasure. Math. Slovaca 29, 141-145. Suarez, F. and Gil, P. A986), Two families of fuzzy integrals. Fuzzy Sets and Systems 18, 67-81. Sugeno, M. A974), Theory of fuzzy integrals and applications, Ph.D. doctoral dissertation, Tokyo Inst, of Tech. Sugeno, M. and Kwon, S.H. A995), A new approach to time series modeling withfiizzy measures and the Choquet integral, Proc. Yokohama '95, 799-804. Sugeno, M. and Murofushi, T. A987), Pseudo-additive measures and integrals, J. Math. Anal. Appl. 122, 197- 222. Tanaka, H. and Sugeno, M. A991), A study on subjective valuations of printed color images. Int. J. Approximate Reasoning 5, 213-222. Vitali, G. A925), Sulla definizione di integrate delle funzioni di una variabile, Ann. Mat. Рига ed Appl. IV, 2, 111-121. English translation: On the definition of integral of functions of one variable, Rivista di Matematica per le scienze sociali 20 A997) 159-168.
Monotone set functions-based integrals 1379 Wang, Z. and Klir, G.J. A992), Fuzzy Measure Theory: Plenum Press, New York. Weber, S. A983), A general concept of fuzz\ connectives, negations and implications based on t-norms and t-conorms. Fuzzy Sets and Systems 11. 115-134. Weber, S. A984), L-decomposable measures and integrals for Archimedean t-conorm, J. Math. Anal. Appl. 101, 114-138. Weber, S. A986) Two integrals and some modified version - critical remarks. Fuzzy Sets and Systems 20,97-105. Yager, R.R. and Rybalov, A. A996), Uninorm aggregation operators. Fuzzy Sets and Systems 80, 111-120. Yang Quing Ji A985), The pan-integral on the fuzzy measure space. Fuzzy Math. 3, 107-114 (in Chinese).
CHAPTER 34 Set Functions over Finite Sets: Transformations and Integrals Michel Grabisch LIP6, University of Paris VI, 4 Place Jussieii, 75252 Paris, France E-mail: michel.grabisch@lip6.fr Contents 1. Introduction 1383 2. Set functions over finite sets 1383 3. Transformations of set functions 1385 4. The Choquet and Sipos integrals 1390 5. ?-additive measures 1392 6. The ordinal case: the Sugeno integral 1395 References 1400 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1381
This page intentionally left blank
Set functions over finite sets: transformations and integrals 1383 1. Introduction Measures in a finite setting are positive-valued set functions with some characteristic properties (e.g., additivity for classical measures). Although the finiteness of the universe considerably restricts the interest of the measure concept, it also allows other viewpoints, coming from various fields of mathematics, as combinatorics, game theory, complexity, etc., when viewing measures as particular set functions or pseudo-Boolean functions. We intend to illustrate this viewpoint into this chapter. We will introduce various transformations of set functions, and the notion of integral will appear naturally as an extension of set functions. In a last part, we will consider set functions valued on linearly ordered sets and corresponding transformations and integrals. Throughout the paper, we assume a finite space N with ? elements, denoted simply 1,2, ...,n if there is no fear of ambiguity. In a similar way, s, t,... will denote the cardinality of subsets S,T,... of N. We denote by л, ? the minimum and maximum operators on R. 2. Set functions over finite sets We consider real-valued set functions v.V(N) —>¦ R, and several particular cases. Set functions vanishing on the empty set are called games, while capacities [3] or fuzzy measures [32], which we will always denote by ?, refer to games which are monotonic with respect to inclusion, i.e., ACS =>· ?(?)<:?(?). Consequently, fuzzy measures assume only positive values. In applications, it is often required in addition that ?(?) = 1. For any set function v, the dual set function or conjugate set function of ? is defined by v(S):=v(N)-v(Sc), VScN, A) where Sc is the complement set of S. A particular family of set functions of interest are the unanimity games. For any S с ?, the unanimity game w.r.t. 5" is defined by (?)·=|?, otherwise. B) Another view of set functions which has its own interest is given by pseudo-Boolean functions. We introduce briefly the topic here, more can be found in [12,17]. Any function /: {0, 1)" -> R is said to be a pseudo-Boolean function. By making the usual bijection between {0, 1}" and V(N), pseudo-Boolean functions on {0, 1)" coincide
1384 ?. Grabisch with real-valued set functions on N. It has been shown by Hammer and Rudeanu [20] that any pseudo-Boolean function can be written in a multilinear form: /(?)=??(?)?\??, V^e{0, I}"· C) TcN ieT The coefficients a(T) will be explained below. The monomials П/бГ-*' correspond to unanimity games uj. In terms of game theory, Equation C) gives the decomposition of a game on the basis of unanimity games. Note that C) can be put in an equivalent form, which is /U)= ][>(Г)Д;с,, V*e{0,l}". D) TcN ieT A topic of interest concerns the extension of pseudo-Boolean functions to [0, 1]". An obvious way to do this is to extend expressions C) and D) to the whole hypercube [0, 1]". The first one is called the multilinear extension, while the second is the Lovasz extension. The multilinear extension of /, given by /(*):= ]? ? (Г)[];с„ Voce [0,1]", E) TcN ieT is the only multilinear function which extends /, hence its name (see Owen [27]). It performs the classical linear interpolation of / in [0, 1 ]". The Lovasz extension of / [21,31 ] is given by /(*):= ? а(Г)Ддг/, Voce [0,1]". F) TCN ieT Defining the simplex ?? = [? e [0, 1]" | ???) ^ · · · ^ x* <.,,)} where ? is a permutation on N, the Lovasz extension is the unique affine function which interpolates / between the ? + 1 vertices of ??. By contrast, the multilinear extension interpolates between all vertices. As a preamble to the next section, we introduce informally some linear transforms over set functions. Mobius transform: in combinatorics, the Mobius transform is well-known (see, e.g., Rota [28]). For any set function v, its Mobius transform mv is defined by muE:):= ^(-?^-'???). VS С N. G) TcS The inverse transform is the Zeta transform, expressed by v(S) = ]Гш1,(Г), VSCN. (8) TCS
Set functions over finite sets: transformations and integrals 1385 mv is called the Mobius representation of v. In game theory, this corresponds to the dividends of a game. Comparing (8) and C), we see that ml'{T) is nothing else than a(T). In other words, the Mobius transform is the coordinates of ? in the basis of unanimity games. In Dempster-Shafer theory of evidence, mv is called the basic probability mass assignment [29]. Interaction transform: it has been proposed by Grabisch [7], and is defined for any set function ? by IV(S):= ? (" ?'"'?'',' ? <-1>'~*?<*??>- *SCN- (9) TCN\S {П S+l>- KcS This definition extends in fact the Shapley value ?1' [30] and the interaction index I,j for a pair of elements /, j in N, introduced by Murofushi and Soneda [24]. They are defined by ???)-.= ? -(и~д71)!д!?*?w)-"(*>]. v/eN' A0) ScN\i "" hj:= ? -" 7 'J".?''' [v{SU[i,j}) - v(SU {/}) - v(SU{j}) + v(S)}. ScN\[i.j\ {П '¦ (?) We have /"({'}) = ??(?) and /L'({/',;'}) = /,y. /'' is often referred as the interaction index. The interaction index has a natural interpretation in the framework of cooperative game theory and multicriteria decision making. It has been axiomatized by Grabisch and Roubens[19]. Co-Mobius transform: for any set function v, it is defined by mv(S):= ? (-l)"~'v(T)= ]?(-1)'?(? \ ?), VScN. A2) TDN\S TcS m" corresponds to the commonality function of Shafer [29]. The analogy with the Mobius transform can be noticed. 3. Transformations of set functions We introduce formally the concept of transformation of a set function, as proposed by Denneberg and Grabisch [6], borrowing the formalism of transformation and operators used in combinatorics (see Berge [1]). An operator is a two-place set function ? : V(N) ? V(N) -> R. The multiplication * between operators and set functions is defined as follows, for every A, B,C С ?: (? * ?)(?, ?) := ? ?(?, С)Ф(С, В), CCN
1386 ?. Grabisch (?*?)(?):= ]Г <P(A,C)v(C), CcN (?*?)(?):= ]? u(C)^(C, S). CcN The Kronecker's delta 1, ifA = fi, ?(?,?):=.? 1 0, otherwise, is the unique neutral element from the left and from the right. If it exists, the inverse of ? is denoted ?-1, satisfying ? *?~' = ?, ?-1 *? = ?. It can be shown that the family ?:=\?-.?(?) ??(?)^?\?(?, ?)= 1 VA с ?, ?{?, ?) = 0 if ? <? ?\ of functions of two variables together with the operation * forms a group, and the inverse ?-1 &Q of ? eQ computes recursively through ?-\?,?)= 1, ?"'(?,?) = - ]? Ф~\А,С)Ф(С,В) ifA^B. ACCgB The Zeta operator Z(A, B), defined by 1, if А С S, Z(A,B):=ln 1 0, otherwise, and its inverse the Mobius operator correspond to our previous definitions, i.e., with former notations, m = v + Z~~], v = m-kZ. A3) The next fundamental operator to introduce is the so-called inverse Bernoulli operator ?: 1 ?(?,?): , ifACfi, \B\A\ + \' 0, otherwise. The interaction index is recovered by 1 = Г*т. A4) We turn now to a special class of operators, satisfying ?(?,?) = ?(&,?\?) for А С ?, A5)
Setfiinctions over finite sets: transformations and integrals 1387 i.e., they can be represented by an ordinary set function ?(?) := ?@, A), denoted with the corresponding small Greek letter. In fact, the set of such operations forms an Abelian group, as well as the corresponding set of set functions: with operation * defined by <p*i/r(A):= ]T<p(C)iA(A\C), AcN. CCA The neutral element S of g is 1, ifA=0, 8(A) :=,. . . 1 0, otherwise, and the inverse of ? is denoted <p*~'. Since ? and ? have property A5), we can introduce the corresponding Zeta function and Bernoulli function: ?(?)=? for all A e ?, If moreover ? is a function only of the cardinal of sets, then we call it a cardinality function, and the corresponding ? a cardinality operator. Note that ? and ? have also this property. There is a general formula for the inverse of cardinality operators (which is also cardinal). More specifically, if ?{?) = /(|A|), then ?*~\?) = /*~'(|A|), with /*_l defined recursively by /-40):= 1, m-\ A6) \ к Г k=0 m-\ , ч r-'(m) :=-? (™)/(/n-*)/*-'(*), meN. ?— ? V / With this formula, we get ?*~] the Mobius function, and y* the Bernoulli function, giving rise to the Bernoulli numbers. Coming back to the interaction index, we have / = r*w = r*(u*Z~'), which can be rearranged to obtain / = ? * 1.1 is called the interaction operator, and is not in the group Q. This formalism permits to compute easily, simply by combining and inversing operators, all formulas between the different representations of set functions (Mobius transform,
1388 ?. Grabisch Table 1 Table of conversion between v, m and / v(S) = m(S) = V ?(-\?~'?(?) TcS ? (~1>v"' ^ m TcS m(S) ? ' ,'"'7'» ks'-s+x I ? ^??·) гсл/ ? B'-sHT) TDS 1(S) interaction index, etc.). These formulas are summarized in subsequent tables. We introduce some notations. The Bernoulli numbers В к are defined by *<:=-Edb(/)· *>0· and So = 1- A7) First numbers of the sequence are B\ = -1/2, Bi = 1/6, 63 = О, В4 = -1/30, 65 = 0, etc., and 62^+1 =0 for all к > 0. The numbers ?[ are defined from the Bernoulli numbers by #:=E(*V-;' m = o,i,2.... A8) 7=0 First values of ?? are k\l 0 1 2 3 4 0 1 1 4 { 2 1 6 1 3 1 6 3 0 1 6 1 6 0 4 1 30 1 30 2 75 1 30 1 30 This table has a property similar to the Pascal triangle, i.e., one coefficient ?^] is the sum of the two above ftk, ?1^'. For other properties, see [8]. Another formalization is based on matrices, putting the coefficients ?(S) for all S С N into a vector. Then, transformations leads to peculiar forms of matrices (see [16]). We give some properties of the above introduced transforms (for more details, see [6,10, 12]).
Set functions over finite sets: transformations and integrals 1389 Table 2 Tables of conversion between m and other representations m(S) = V ? (-ly-'viT) TDN\S m TDS I ^2(-\)'-sB,.,l(T) TdS m(S) = KS) = m ? (-1)'ш(Г) rc,v\.v ?(-l)'-V,G·) > m (/ ) ^ f-i+1 IDS PROPERTY 1. For any set function ? and its conjugate v, we have m"(S) = (-l)i+lm"(S), VScN, S^0, w D(S) = (-1)J+I ]Гт"СГ), V5"CN. 5^0. A9) B0) PROPERTY 2. Аи;у set function can be decomposed on the set of unanimity games or their conjugate: B1) B2) B3) v(S)= ]T m"B>r(S), TCN v(S)= ? (-l)'+]m4T)uT(S), TCN.Tj=<fl v(S)= ? m\T)UT(S). TCN PROPERTY 3. For any set function ? and its conjugate v, we have ?@) = ?(??-/?'@), l'v(S) = -(-l)sIv(S), VScN, S^ii. PROPERTY 4. Let ? be a fuzzy measure with ?(?) = 1. Then the interaction index l,j ranges in [—1,+1]. /,y = 1 if and only if ? = u\jj], the unanimity game for the pair i, j. Similarly, l\j = — 1 if and only if ? = й|,.у), the dual measure of the unanimity game for the pair i, j. PROPERTY 5 [2]. A set of 2" coefficients m(S), S С ?, corresponds to the Mobius transform of a fuzzy measure if and only if (i) w@) = O, EscnS)=1, (ii) Y,ieTcSm(T) ^0, for all S<ZN, for alii e S.
1390 ?. Grabisch PROPERTY 6 [10]. A set of 2" coefficients I(S), S С ?, corresponds to the interaction transform of a fuzzy measure if and only if (») ?,-etfJ({'))=!, («0 Escwv fiiSsnT\J(SU {/}) > 0, V/ e N, VT с ? \ {/}. 4. The Choquet and Sipos integrals Definition 1. Let / be a positive real-valued function on N, and ? a set function. Let us denote /(/) by /,-, for every / in N, thus considering / as an element of (R+)". Then the Choquet integral of / with respect to ? is given by II i=l where ·(/) indicates a permutation on N so that /ц, ^ fi) ^ ·¦· ^ /(«>, and A(() := {(;),...,(n)}. Also /,o, :=0. It has been shown by Chateauneuf and Jaffray [2], extending Dempster [4] (see also Walley [34]), that the Choquet integral of positive integrands can be expressed using the Mobius transform: Cv{f)=Yjm\S)-f\f. B5) ScN ieS We recognize here the Lovasz extension of ? (see F)), which shows clearly the link between the set function and the Choquet integral: the latter is an extension of the former, which means that for any S С ?, if /5 is defined as fs = 1 whenever ; e S, and 0 otherwise, then Cv(fs) = v(S). The next step is to define the Choquet integral for real-valued integrands. For any / e E", we introduce /+ := / ? 0, /~ := (-/) ? 0 the positive and negative parts of /, i.e., such that / = /+ - /~. Two definitions exist [5,26]: • the asymmetric integral, which is the usual definition of Choquet integral: Cv(f)-=Cv{f+)~d,(f-), B6) • the symmetric integral, which is also called the Sipos integral [33]: Cv(f)-=Cv(f+)-CAD. B7)
Set functions over finite sets: transformations and integrals 1391 They satisfy for all / e (K+)'\ C,.(-/) = -C,-.(/) and C,(-/) = -C„(/), hence their names. The explicit expression of the Sipos integral is, for any /el": p-\ ^.(/) = ?(/(?)-/(/ + ?))?'({A),...,@}) + /(?)?'({A),···.(?)}) / = l + /(/,+ ?,?({(?+1),...,(«)}) + ? (/(?)-/(?-?))«({? («)}) i=p+2 B8) with /(!)<···< /(,,) < 0 < /(p+i) ^ ··· ^ /(»)· By contrast, expression B4) of the Choquet integral does not change when / e R". This means that the Sipos integral behaves differently on positive and negative numbers (true zero, ratio scale), while the Choquet integral does not differentiate them (arbitrary position of the zero, difference scale) (more on this topic and its relation to decision making in [15]). We turn to the expression of these two integrals using the various transformations introduced above. PROPERTY 7. Let ? be a set function, m, in, I their Mobius, co-Mobius and interaction transforms, and f a real-valued function on N. Then the Choquet integral of f w.r.t. ? is expressed by Cv(f)=Y,m(S)/\f, ScN ieS Cv(f)= ? (-1)?+?^E)\//? ScN. S^0 ieS ^(/)=???*'-'/+<?>??-? B9) C0) SCN^TDS ieS + ? (-!M+| ?#-'/-<?> V/" SCN. S^tA C1) ¦TdS ieS with I+ indicating a restriction so that only terms with positive interaction are taken into account, and similarly for I~. PROPERTY 8. Let ? be a set function, m, m, I their Mobius, co-Mobius and interaction representations, and f a real-valued function on N. Then the Sipos integral of f w.r.t. ? is expressed by ACN Li6A ieA = ]T m{A)f\f+ ]T m(A)\J f, C2) ACN+ ieA ACN~ ieA
1392 ?. Grabisch Uf)= ? (-1)|А|+|т''(А)\//'+ ? Ы)И|+|™"И)Д/;, C3) АПЛ/+^0 ieA ?(??~?? ieA W)= ? (-1)И1+1^'ЧА)Д/,+ ^ (-1),А|+|тг'И)\/^' C4> АСЛ/+ ieA ACN- ieA &(/> = ? ( ? %|/+Ми/?ЛД/, АСЛ/+ ^ScN\A ' ieA + ? ? ? S|B|/+(AUS)W/, ACN" ^BcN\A ' ieA + ? (-D|A|+I( ? ртпАивЛу/, ??\?+?<? ^BcN\A ' ieA + ? (-DIA|+I( ? PlBBlr(AUB)\/\f„ C5) ??\?~?<? ^BCN\A ' ieA with N+ := {i eN \fj^ 0} and N~ := ? \ N+. 5. fc-additive measures In this section, we focus on capacities (or fuzzy measures), although the concept of ?-additivity can be defined for any set function. border additive measures or k-additive measures for short have been introduced by Grabisch in an attempt to decrease the exponential complexity of fuzzy measures in practical applications, since a fuzzy measure defined on a set of// elements requires 2" real coefficients for its definition. A means which has been often used for this is to introduce the property of decomposability: a fuzzy measure ? is decomposable if the measure of any subset can be expressed as a function of the measures of each element in the set. Thus we need only to define the distribution of ? over N, hence // coefficients instead of 2". The most usual example in this category are additive measures. But it appears that this is too drastic a simplification, which is too limitative, especially in multiattribute decision making. One can think of a distribution defined not only for singletons, but also for pairs. An adequate way to define this is to refer to pseudo-Boolean functions, and especially their multilinear form C). An additive measure (defined by a distribution on singletons) has a linear expression f(x) = ???'???- and tne coefficients a,'s (i.e., the Mobius transform) are indeed the distribution itself. Then a fuzzy measure of which the multilinear extension is limited to a degree 2 (or 3, etc.) can be expressed only by a distribution on singletons and pairs (or also on triples, etc.). Equation C) will then provide the value of ? for every subset. Dehnition 2. A fuzzy measure ? is said to be k-additive if its Mobius transform satisfies m(S) = 0 for any S such that s > k, and there exists at least one subset 5Of N of exactly к elements such that m (S) ? 0.
Set functions over finite sets: transformations and integrals 1393 The following property of A;-additive measures is fundamental. PROPERTY 9. Let ? be a k-additive measure on N. Then (i) I (S) = m(S) = Ofor every S С N such that \S\ > k, (ii) / E") = m(S) = m(S)for every ScN such that \S\=k. Thus, A;-additive measures can be represented by a limited set of coefficients, either m(S), \S\ < k, or I(S), \S\ sC k, or equivalently m(S), \S\ ij k, i.e., at most ?*=l ('') coefficients. The case of 2-additive measures is particularly appealing, since it is much more general than additive measures, while remaining of low complexity. We detail properties of this case in the sequel. A first fact is that any 2-additive measure ? can be expressed only by ?((/)),?({/, ;'}),forall /, ;' e ?, hence the notion of "distribution" for singletons and pairs suggested above. PROPERTY 10. Any 2-additive measure can be written as ?(?)= ? ?({/,;·})-(|?|-2)]??({/}), УД С ?, |?|>2. C6) \i.j)eA ieA Moreover, the interaction index /,y reduces to ??=?({?,?)-?({?)-?(?}). C7) The Choquet and Sipos integrals have a particularly interesting expression for 2-additive measure, in terms of the interaction transform. PROPERTY 11. Let f eW and ? a 2-additive measure. Then " ( 1 /„ >0 /,,<0 i=\ ^ ??\ C8) сд(/)= ? (fiAfj)lu+ ? (/^/й i.jeN+.Iij>0 i.jeN-.I,j>0 + ? (???-)?/.7?+ ? (fi*fj)w i.jeN+.Iij<0 i.jeN-.I,,<0 + ?? ? m)+Ef{ ? m ieN+ jeN~.I,j<0 ' ieN~ уеЛГ./,,-<() +????-\?\?'? C9> using previous notations.
1394 ?. Grabisch These expressions are useful for a deep understanding of these integrals, since all coefficients are positive: thus the contribution of each term to the value of the integral becomes clear. The following interpretation can be done: • The Choquet integral is formed of conjunctive terms, disjunctive terms, and linear terms. A conjunctive term appears whenever the interaction index is positive, while a disjunctive one appears when the interaction index is negative. The linear part is weighted by the Shapley value, minus all interactions (positive or negative). Putting the Choquet integral into the form сд (/) = ? aiJ (¦# л fj> + ? ^ (./·¦v /;) + ? и fi ij i.j i and remarking that due to ]?;. ?? = ?(?) = 1, we have ?^?+?^ + ?^1' i-j iJ i we conclude that any Choquet integral w.r.t. a 2-additive measure is a convex combination of conjunctions, disjunctions and linear terms. The reciprocal also holds since any coefficient in the combination can take the value 1. • The Sipos integral, although more complicated, is very similar to the Choquet integral, except that negative and positive numbers are processed differently. In fact, disjunctions turn to conjunctions when the sign changes. Another remarkable fact is that pairs (f\, /2) of values with different signs are not considered. This interpretation is of high interest in decision making. To conclude this section, we present a graphical interpretation of the Choquet integral when ? = 2 [14]. In this case, formula C8) becomes: Д i(/|V/2)/|2 + /|@| + ^/|2) + /j@2 + i/|2), if/|2<0, which is in fact the general expression for Choquet integral when и = 2, since in this case, at most 2-additive measures coincide with general fuzzy measures. Clearly, all possible Choquet integrals are obtained when ф\, фг and 1\г vary on their domain. Using Property 6, this domain is defined by 01 — 5/12 ^ 0, 02 - 5 /| 2 ^ 0, 01+5/12^0, 02+ 5-/12^0. Recalling that ф\ + фг = 1, this domain can be represented with the (?\, /|?) coordinates only, and is the shaded area on Figure 1. In other words, the Choquet integral is the convex closure of the four vertices of Figure 1. We further study these vertices, and the two axes. Let us consider the horizontal axis, where I a = 0. In the case ? = 2, this is equivalent to say that the measure is additive, and thus the Choquet integral is a weighted sum,1 A weighted sum is any expression of the form g{x\ ,??) = w\.x\ + unxi, with w\. wj e M+ and w\ +wi = 1.
Set functions over finite sets: transformations and integrals 1395 !/2(t.;l+ii2i Fig. 1. Interpretation in term of interaction index. with weights ?({1}) and ?({2}), which coincide with ?\ and фг- Thus the horizontal axis represent the set of all possible weighted sums. Let us examine the vertical axis, where ф\—фг = \- Assuming /12 > 0 (the converse case leads to the same result), formula D0) becomes: C^(ai,a2) = з«A)A + /12) + з«B>A - /i2). D1) We recognize here an ordered weighted average,2 with weights 5 A + /12) and ,A — hi)- Moreover, since any OWA operator is such that ?\ = ?? = \ when и = 2, the vertical axis is the locus of all possible OWA operators. The upper vertex (/12= 1) corresponds to the minimum operator, and the lower vertex to the maximum operator, as it can be seen from D1). 6. The ordinal case: the Sugeno integral So far we have dealt with real-valued set functions. In this section, we address the case of ordinal structures. We suppose that set functions are valued on a linearly ordered set E. This corresponds in applications to evaluations on an ordinal scale, where only order makes sense, and any arithmetical operation is forbidden. In this case, the Choquet and Sipos 2An ordered weighted average is any expression of the form g(x\, дгт) = u-ч jt( |) + wi-^B)· wi(n Ш|, шт б R and ш 1 + u>2 = 1.
1396 ?. Grabisch integrals are of no use, and one has to turn to other integrals. The basic material here is the Sugeno integral [32], defined as follows, for any function / valued on [0, 1]: /1 <V/)--=V[/«>AM(A„,)], D2) /=l with same notations as for the Choquet integral B4). This definition uses only minimum and maximum operators, so it can be used for any function and fuzzy measures valued on a linearly ordered set. Using the residuated difference operator -*- defined by fa, if a > b, D3) 0, otherwise, the above expression can be rewritten as [25] /1 <V/):=V[(/(|->^/(/-i))AM(A(/))]. D4) The similarity with the Choquet integral is striking, replacing sum by maximum, and product by minimum. This shows that the Sugeno integral is a translation of the Choquet integral in the ordinal world. Other types of integrals obtained by changing operators are studied at length in [18,25]. In the sequel, we show how constructions similar to the cardinal case can be done in this context, namely set functions transformations and the definition of symmetric and asymmetric integrals. As we need the notion of "negative" number on ordinal scales for the definition of symmetric and asymmetric integrals, we begin by constructing a suitable scale with suitable ordinal operators. Let ? be a linearly ordered scale with 2k + 1 degrees defined by e-k <e-k+\ < ·· <e-\ < e0 <e\ <¦¦¦ < ek-\ < ek centered around e^. We speak of "negative" value for any degree with a negative index, and denote E+ := {eo, e\,..., e^} and E~ := ? \ E+. By commodity, we denote eo by (D and ek by 1. We introduce for every e, in ? its symmetrical element —e, defined —ej := e-i (symmetric ordinal scale). We extend on ? the usual operations v, л defined on E+, so as to have an algebraic structure like a ring. Denoting ©, © these extended operators, called symmetric maximum and symmetric minimum, respectively, it would be desirable to have properties like -(a © b) = (-a) © (—b), a © (-b) = a -"- b, and -(a © b) = (-a) © b = a © (-b), where - is the residuated difference D3) properly extended on E. Here, ©, © play the role of addition and product, respectively.
Set functions over finite sets: transformations and integrals 1397 Table 3 Definition of the symmetric maximum ? a®)b a <© a = <D a ><D b <<D a f\b b a^(-b) b = <D a ? a b><0 b^(-a) b a\ib Table 4 Definition of the symmetric minimum ? a@b a <<D a = <D a ><D b <<D \a\A\b\ ? ~(ал\Ь\) b = <D ? ? ? b><0 -(\а\лЬ) ? а лЬ The following definitions are suitable for our purpose [13]. The symmetric difference is defined for any a, b on E+ by ¦b:= a, if a > b, <D, \ia = b, —b, otherwise. D5) Using this definition, the symmetric maximum is defined in Table 3. Then, the symmetric minimum is defined in Table 4. With these definitions, we can show the following result. PROPERTY 12. The structure (?,©.©) has the following properties: (i) © и commutative; (ii) О и the unique neutral element ofQ, and the unique absorbant element of ©; (iii) a © —a = (D,/or all a e E\ (iv) -(a©fe) = (-a)©(-/>); (v) © is associative on E+ and on E~; (vi) © E commutative; (vii) 1 i5 the unique neutral element of<&, and the unique absorbant element o/©; (viii) © и associative on E; (ix) © 15 distributive w.r.t. © in E+ and E~. The associativity of © and distributivity do not hold on ? in general. The second step is to address set functions (in fact, we restrict to fuzzy measures here) and their transformations. In this ordinal context, a fuzzy measure is defined naturally as a set function ?:?>(?) -> E+ such that ?@) = <D, ?(?) = 1, and А с В implies ? (?) ^ ? (S). Following Berge [1], who presents the Mobius function as a powerful tool for inversion formulas over posets, we define the ordinal Mobius transform of ?, denoted by m v, as the solution of the equation: ?(?)=0)??»(?). BcA D6)
1398 ?. Grabisch In fact, unlike the cardinal case, there is no single solution to this equation. Previous studies by the author [9,11] and related works of Mesiar [23] and Marichal [22] have shown that the smallest positive solution is given by mC(A):=(M(A)' ifM(A)>M(MO, Vi e A, ( 0, otherwise, and the greatest one is simply m^ = ?. Any set function comprised between these two solutions is a solution. We take as definition of the Mobius transform the lower bound, which is a non-negative function, so that we can use from now on usual ? and л operators. If there is no fear of ambiguity, the superscript? can be dropped. Interestingly enough, the Mobius transform can be expressed as "*v(A):= V ?(?)- V ?(?) Вел вел \A\B\ even |?\?| odd which is an exact transcription of the cardinal case (see G)). However, if ? is not monotonic, the above formula is no more solution of D6) in general. The ordinal Mobius transform has many interesting properties. E.g., let us consider any possibility measure ?, i.e., a fuzzy measure such that ?(? U ?) = ?{?) ? ?{?) for all ?, ? с ?¦ Then its Mobius transform m!J is m'J({i}) = n({i}), VieN, and wG(A) = (D, VAcN, \A\>\. Thus we have the same kind of result as with probability measures in the cardinal context. Attempts have been done by Grabisch to define an interaction transform in this ordinal context [11]. Changing in a suitable way operators in formula (9), and taking into account some properties which are counterparts of the cardinal case (see below), the following equation defines the ordinal interaction transform of a fuzzy measure ?: /v(A):= V V M(BUC)- V M(fiUC) CCA CCA |A\C|even \A\C\ odd BcN\A For A = {i}, we get the ordinal Shapley value: D7) 4>v0'):= V [M(AU/)-M(A)j. D8) ACN\i The ordinal Shapley value fulfills the following properties: (Al) V,4=aX@ = M(A0 (sharing of ?(?0); (A2) if i is such that ?(? U i) = ? (A) for every А С N \ i, then ?$(?) = <D (null players); (A3) if /, ;' are such that ?(? U i) = ?(? U ;') for every А с ? \ {i, j), then </#(/) = ?? (j) (symmetric players).
Set functions over finite sets: transformations and integrals 1399 These properties are counterparts of the properties of the (original) Shapley value [30]. However, contrary to what is wrongly claimed in [11], the "maxitivity" does not hold: in general ?^?? ^?^,??^, where ?, ? are two fuzzy measures. Lastly, the ordinal interaction transform can be expressed through the ordinal Mobius transform by /v(A)= V m^(AUB). BcN\A D9) Again, the similarity with the cardinal case has to be noted. The last step is to address the definition of the Sugeno integral over E, the expression over E+ being given by Equation D2), where / is any function from N to E+, and ? is a fuzzy measure in the sense defined above. As in the cardinal case, it appears that the Sugeno integral can be viewed as an extension of fuzzy measures (or pseudo-Boolean functions). Specifically, any fuzzy measure can be expressed on the basis of unanimity games, by ?(?)= у (т$(В)Лив(А)), BcN E0) which is the counterpart of C) and B1) in the ordinal case. The extension of this formula on (E+)" gives rise to the Sugeno integral: ?„(/)= V "МА)л(Д/Л ACN*- \'6A ' E1) Remark the analogy with the Choquet integral (see B5)). Based on formulas D4) and B8), the following expression defines the symmetric Sugeno integral [13]: 5?(/): ©(/(/, ©?({0),...,(«)})) ./=1 © (л»©?({(?,···,(")})) E2) for any / e E", with /(n ^ · · · ^ /<,,> < <D ^ /(,,+ n ^ · · · ^ /(«>· Note that the above expression is well-defined, i.e., there is no problem of associativity of ©, since @f=| operates only on E~, and @"= +| only on E+. The symmetric Sugeno integral indeed satisfies the property of symmetry: 5?(-/) = -5?(/) E3) for any fuzzy measure ? and any f e E". The case of the asymmetric integral is not necessary to consider, if it can be thought, as in the cardinal case, that the expression of the asymmetric integral does not depend on the
1400 ?. Grabisch position of the zero of the scale. Then, the zero can be put at the lower extremity, and we are working on a scale with only positive values, where the usual definition of the Sugeno integral works. Another possibility is to take as definition 4(?) = 5?(?+)^5?(?-), where ?? stands for the asymmetric integral. But this needs a proper definition of Д, which requires more structure on ? than what we have defined so far. References [1] C. Berge, Principles of Combinatorics. Academic Press. San Diego A971). [2] A. Chateauneuf and J.Y. Jaffray, Some characterizations of lower probabilities and other monotone capacities through the use ofMobius iirversion. Math. Social Sci. 17 A989). 263-283. [3] G. Choquet, Theory of capacities, Ann. Inst. Fourier 5 A953). 131-295. [4] A.P. Dempster. Upper and lower probabilities induced by a multivalued mapping, Ann. Math. Statist. 38 A967), 325-339. [5] D. Denneberg, Non-Additive Measure and Integral, Kluwer Academic, Dordrecht A994). [6] D. Denneberg and M. Grabisch, Interaction transform of set functions over a finite set. Inform. Sci. 121 A999), 149-170. [7] M. Grabisch, k-order additive fiizzx measures, 6th Int. Conf. on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU), Granada, Spain (July 1996), 1345-1350. [8] M. Grabisch, Alternative representations of discrete fuzzy measures for decision making. Intemat. J. Uncertain., Fuzziness Knowledge Based Systems 5 A997), 587-607. [9] M. Grabisch, k-additive and k-decomposable measures, Proc. of the Linz Seminar. Linz, Austria (February 1997), 88-94. [10] M. Grabisch, k-order additive discrete fuzzx measures and their representation. Fuzzy Sets and Systems 92 A997), 167-189. [11] M. Grabisch, On the representation of k-decomposable measures. 7th IFSA World Congress. Prague, Czech Republic (June 1997). [12] M. Grabisch, The interaction and Mbbius representations of fuzzy measures on finite spaces, k-additive measures: a survey. Fuzzy Measures and Integrals - Theory and Applications, M. Grabisch, T. Murofushi and M. Sugeno, eds. Physica-Verlag, Heidelberg B000), 70-93. [13] M. Grabisch, Symmetric and asymmetric fuzzv integrals: the ordinal case, 6th Int. Conf. on Soft Computing (Iizuka'2000). Iizuka, Japan (October 2000). [14] M. Grabisch, A graphical interpretation of the Choquet integral, IEEE Trans. Fuzzy Systems, to appear. [15] M. Grabisch and Ch. Labreuche, To be symmetric or asymmetric? A dilemma in decision making. Preferences and Decisions under Incomplete Knowledge, J. Fodor, B. De Baets and P. Pemy, eds, Physica- Verlag, Heidelberg B000), 179-194. [16] M. Grabisch, J.-L. Marichal and M. Roubens, Equivalent representations of a set function with applications to game theory and multicriteria decision making, Proc. of the Int. Conf. on Logic, Game theory and Social choice (LGS'99), Oisterwijk. The Netherlands (May 1999), 184-198. [17] M. Grabisch, J.-L. Marichal and M. Roubens, Equivalent representations of a set function with applications to game theory and multicriteria decision making. Math. Oper. Res. 25 B) B000), 157-178. [18] M. Grabisch, T. Murofushi andM. Sugeno, Fuzzy measures of fuzzy events defined by fuzzy integrals. Fuzzy Sets and Systems 50 A992), 293-313. [19] M. Grabisch and M. Roubens, An axiomatic approach to the concept of interaction among players in cooperative games, Intemat. J. Game Theory 28 A999), 547-565. [20] PL. Hammer and S. Rudeanu, Boolean Methods in Operations Research and Related Areas, Springer- Verlag, Berlin A968).
Set functions over finite sets: transformations and integrals 1401 [21] L. Lovasz. Submodular function and convexity. Mathematical Programming. The State of The Art, A. Bachem. M. Grotschel and B. Korte, eds, Springer. Berlin A983). 235-257. [22] J.-L. Marichal, P. Mathonet and E. Tousset, Mesuresfloues definies sur une echelle ordinale. Working paper A996). [23] R. Mesiar, k-order pan-additive discrete fuzzy measures. 7th IFS A World Congress. Prague, Czech Republic (June 1997), 488^90. [24] T. Murofushi and S. Soneda, Techniques for reading fuzzy measures (III): interaction index, 9th Fuzzy System Symposium, Sapporo, Japan (May 1993), 693-696 (In Japanese). [25] T. Murofushi and M. Sugeno, Fuzzy t-conorm integrals with respect to fuzzy measures: generalization of Sugeno integral and Choquet integral. Fuzzy Sets and Systems 42 A991), 57-71. [26] E. Pap, Null-Additive Set Functions, Kluwer Academic Publishers, Dordrecht A995). [27] G. Owen, Multilinear extensions of games. The Shapley Value. Essays in Honor of Lloyd S. Shapley, A.E. Roth.ed.. Cambridge Univ. Press. Cambridge. UK A988). 139-151. [28] G.C. Rota, On the foundations of combinatorial theory I. Theory of Mobius functions, Z. Wahrschein- lichkeitsth. verw. Geb. 2 A964), 340-368. [29] G. Shafer, A Mathematical Theory of Evidence, Princeton Univ. Press. Princeton, NJ A976). [30] L.S. Shapley. A value for ?-person games. Contributions to the Theory of Games, Vol. II, H.W. Kuhn and A.W. Tucker, eds, Annals of Mathematics Studies, Vol. 28, Princeton Univ. Press, Princeton, NJ A953), 307-317. [31] I. Singer, Extensions of functions o/0-l variables and applications to combinatorial optimization. Numer. Funct. Anal. Optim. 7 A) A984), 23-62. [32] M. Sugeno, Theory of fuzzy integrals and its applications, Ph.D. thesis, Tokyo Institute of Technology A974). [33] J. Sipos, Integral with respect to a pre-measure. Math. Slovaca 29 A979), 141-155. [34] P. Walley, Coherent lower (and upper) probabilities. Technical Report 22, University of Warvick, Coventry A981).
CHAPTER 35 Pseudo-Additive Measures and Their Applications Endre Pap* Institute of Mathematics, University of Novi Sad, Trg D. Obradovica 4, 21000 Novi Sad, Yugoslavia E-mail: pape@eunet.yu, pap@im.ns.ac.yu Contents Introduction 1405 1. Pseudo-additive measures and integrals based on them 1406 1.1. Pseudo operations 1406 1.2. Pseudo-additive measures 1408 1.3. Pseudo-integral 1413 1.4. Pseudo-convolution 1415 1.5. The Riesz type theorem 1422 2. Applications 1424 2.1. Probabilistic metric spaces 1424 2.2. Fuzzy numbers 1426 2.3. Information theory 1428 2.4. System theory 1430 2.5. Optimization and morphism with the probability 1430 2.6. Pseudo-linear operators 1433 3. Idempotent integral as limit of g-integrals 1436 3.1. Pseudo-operations 1436 3.2. Measures and integrals 1438 4. Applications on nonlinear PDE 1440 4.1. Hamilton-Jacobi equation with non-smooth Hamiltonian 1440 4.2. Bellman differential equation for multicriteria optimization problems 1443 4.3. Stochastic optimization 1445 4.4. Option pricing 1448 5. Non-commutative and non-associative pseudo-operations 1449 5.1. Applications on nonlinear PDE 1452 5.2. Corresponding pseudo-measure 1454 6. Conditional distributive real semiring 1458 6.1. Conditional distributivity 1458 6.2. Hybrid probability-possibilistic measure and integral 1458 *The author wants to thank for the partial financial support of the Project in the Fields of Basic Research "Mathematical models of nonlinearity, uncertainty and decision" A866) supported by Ministry of Science, Technology and Development of Serbia. HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1403
1404 ?. Pap 6.3. Hybrid utility function 1463 References 1465
Pseudo-additive measures and their applications 1405 Introduction It will be presented some results on special non-additive measures - pseudo-additive (decomposable) measures and the corresponding integrals, which give a base for the so called pseudo-analysis. There are applications in optimization problems, nonlinear partial differential equations, nonlinear difference equations, optimal control, fuzzy systems, For the range of a set function instead of the field of real numbers in Bacceli et al, A992), Maslov et al. A992), Pap A995), Sugeno et al. A987) it is taken a semiring on the real interval [a,b] с [—oo, +oo], denoting the corresponding operations as ? (pseudo- addition) and ? (pseudo-multiplication). This structure is applied for solving nonlinear equations (ODE, PDE, difference equations, etc.) using now the pseudo-linear principle (Maslov et al,, 1992; Kolokoltsov and Maslov, 1997; Kolokoltsov, 2001; Pap, 1995), which means that if щ and ui are solutions of the considered nonlinear equation, then also ?? ? ? ? ? ?? ? «2 is a solution for any constants a\ and an from [a,b]. Based on semiring structure it is developed in Pap A990, 1995, 2000) the so called pseudo-analysis in an analogous way as classical analysis, introducing ?-measure, pseudo-integral, pseudo- convolution, pseudo-Laplace transform, etc. The advantages of the pseudo-analysis are that there are covered with one theory, and so with unified methods, problems (usually nonlinear) from many different fields (system theory, optimization, control theory, differential equations, difference equations, etc.). Important fact is that this approach gives also solutions in the form which are not achieved by other theories. In some cases it enables for mostly nonlinear equations to obtain exact solutions in the similar form as for the linear equations. Some obtained principles as for example the pseudo-linear superposition principle allows us to transfer the methods of linear equations to the nonlinear equations. Pseudo-analysis uses many mathematical tools from different fields as functional equations, variational calculus, measure theory, functional analysis, optimization theory, semiring theory, etc, and still is in the developing form. There are many investigations from the theoretical point of view as well as in many new applications (related monographs: Baccelli et al. A992), Grabisch et al. A995), Klement et al. B000a, 2000b), Kolokotsov and Maslov A997), Maslov and Samborskij A992), Pap A995), Puhalskii B001), Wang and Klir A992)), The main problem in the application on nonlinear PDE is the identification of operations ? and ©. Goard and Broadbridge A997) have obtained a close connection with Lie symmetry algebras (see Blumen and Kumei A989)). Namely, there are a number of computer added algorithms for finding Lie symmetry algebra (Schwarz, 1988) and therefore this relation can be used for finding © and ©. There are presented also two further generalizations related the pseudo-analysis. First, is related to further generalization of the real semiring structure to the case when the operations ? and © are noncommutative and non-associative, and the (left) right distributivity of © over ? plays a crucial rule. There is given a representation theorem - Theorem 5.2 for such operations and in Theorems 5.6 and 5.3 we give a complete characterization for generalized pseudo-addition and pseudo-multiplication. The approach is based on the functional equation technique, given in Aczel A966). More details can be found in Pap et al. B000).
1406 ?. Pap The second generalization is related to relaxation of the distributivity law, see Klement et al. B000a, 2000b), which enables a generalization of the classical von Neumann- Morgenstern utility theory A944) to hybrid probability-possibilistic utility theory (Dubois et al., 2000). 1. Pseudo-additive measures and integrals based on them 1.1. Pseudo operations Let ? be a ? -algebra of subsets of a given set X. Now we restrict ourselves on special non-additive measures so called pseudo-additive or decomposable measures (see Dubois et al. A982), Grabisch et al. A995), Klement and Weber A991), Klement et al. B000a, 2000b), Murofushi et al. A991), Pap A990, 1993a, 1993b, 1995), Sugeno and Murofushi A987), Wang and Klir A992), Weber A984)). This measures are based on pseudo-addition and serve as an important tool in pseudo-analysis. We shall start with the operations on the real interval [0, 1]. A triangular norm ? (?-norm briefly) is a function ?: [0, 1 ]2 -> [0, 1 ] such that (Tl) T(x,y) = T(y,x); (T2) T(x,T(y,z)) = T(T(x,y),z); (T3) T(x,y)^T(x,z)fory^z\ (T4) T(x, \) = x. Example 1.1. The following are the most important i-norms Тм(х,у) = т\п(х,у), Tp(x,y) = xy, Th(x,y) = тах@,* + у - 1), ?<? у) = { minU^ У) if тах(х,у)= 1, 10 otherwise. The corresponding dual operation is given by A triangular conorm S (?-conorm briefly) is a function S: [0, 1 ]2 -> [0, 1 ] such that E1) S(x,y) = S(y,x); E2) S(x,S(y,z)) = S(S(x,y),z); E3) S(x,y) ? S(x,z) for у <;Z; E4) S(x,0) = x. We see that ?-norms and ?-conorms differs only by boundary conditions. Example 1.2. The following are the most important i-conorms: SM(x,y) = max(x,y), Sp(x,y) = x + y-xy,
Pseudo-additive measures and their applications 1407 5ъ(-*,}0 = тгпA, * + }'), 5d(jc>>;)= Ыах(х,у) if min(.x,H = 0, 1 1 otherwise. 1 otherwise. Many other important f-norms and f-conorms can be found in Klement et al. B000a, 2000b), Golan A992). The basic connectives in fuzzy logic and operations by fuzzy sets are based on triangular norms and triangular conorms (Grabisch et al. A995), Klement et al. B000a, 2000b)). We extend now the considered interval for the previous operations. Let [a,b] be a closed (in some cases semiclosed) subinterval of [—oo, -boo]. We consider here a total order ^ on [a,b] (although it can be taken in the general case a partial order). The operation ? (pseudo-addition) is a function ®:[a,b] ? [a,b] -> [a,b] which is commutative, nondecreasing, associative and has a zero element, denoted by 0. Let [a,b]+ = {?: ? e [a,b], ? ^ 0}. The operation ? (pseudo-multiplication) is a function ?: [?, b] ? [?, b] ->¦ [?, b] which is commutative, positively nondecreasing, i.e., ? ^ у implies ? ? ? ^ у ? ?, ? е [?, />]+, associative and for which there exist a unit element 1 e [a, b], i.e., for each ? e [a, b], 1 ? ? = ?. We suppose, further, 0 ? ? = 0 and that ? is a distributive pseudo-multiplication with respect to ?, i.e., ? ? (y ® z) = (? ? y) ® (? ? z). The structure ([a,b],(B, ?) is called a sem/rmg (see Cuninghame-Green( 1979), Benvenuti et al. B002), Maslov et al. A992), Pap A995)). It can be considered as a general algebraic structure - semiring (?, ?, ?) on an arbitrary set ? endowed with the operations ? and ? which satisfy the previously given conditions (Golan A992), Kuich A986)). In this chapter we will take as basic the following special real semirings (using the equality 0 ? ? = 0; we can consider also closed intervals in the following examples): Case I. Pseudo-addition is idempotent and pseudo-multiplication is not idempotent, e.g·, (i) ? ®y = m\n{x,y], xQy — x +y, on the interval ]—oo, -boo]. We have 0 = +oo and 1=0. (ii) ? ? у = max{;t,)>}, xOy = x+y, on the interval [—oo, +oo[. We have 0 = —oo and 1=0. Case II. Semirings with pseudo-operations defined by monotone and continuous generator g (Aczel, 1966;Mesiaretal., 1995; Pap, 1990, 1993a, 1993b, 1995). In this case we will consider only strict pseudo-addition, i.e., such that the function ? is continuous and strictly increasing in ]a, b[ ? ]?, b[.
1408 ?. Pap By Aczel's representation theorem (see Aczel A966)) for each strict pseudo-addition ? there exists a monotone function g (generator for ?), g : [a, b] -> [—00, 00] (or with values in [0, 00]) such g@) = 0 and ii®u=g-l(g(ii) + g(u)). Using a generator g of strict pseudo-addition ?, we can define pseudo-multiplication ? (Pap, 1990, 1993a): This is the only way to define pseudo-multiplication ©, which is distributive with respect to ? generated by the function g (Mesiar and Rybarik, 1995). Case III. Both pseudo-addition and pseudo-multiplications are idempotent, e.g., (i) Let ? = max and © = min on the interval [-00, +00]. (ii) Let ? = min and © = max on the interval [—00, +00]. An idempotent semigroup (semiring, e.g.. Cases I and III) ? is called an idempotent metric semigroup (semiring) if it is endowed with a metric d: ? ? ? -> Ш such that the operation ? is (respectively, the operations ? and © are) uniformly continuous on any order-bounded set in the topology induced by d and any order-bounded set is bounded in the metric. Let X be a set, and let ? = (?, ©,d) be an idempotent metric semigroup. The set B(X, P) of bounded mappings X -> P, i.e., mappings with order- bounded range, is an idempotent metric semigroup with respect to the pointwise addition (/ ? h)(x) = f(x) ? h(x), the corresponding partial order, and the uniform metric p(/, h) = supx p(f(x), h(x)). If ? = (? ?, ©, d) is a semiring, then ?(?, ?) has the structure of an P-semimodule, i.e., the multiplication by elements of ? is defined on B(X, P) by (aQf)(x) = aQf(x). If X is a topological space, then by C(X, P) we denote the subsemimodule of continuous functions in B(X, P). For X finite, X = {x\,..., *,,}, ? & ?, the semimodules C(X, P) and ?(?, ?) coincide and can be identified with the semimodule P" = {(a\,..., a„): ctj e A]. Any vector a e P" can be uniquely represented as a linear combination a = ?"=| ?, © e-3, where {e/, ;' = 1,..., n\ is the standard basis of P" (the ;'th coordinate of es is equal to 1, and the other coordinates are equal to 0). As in the classical linear algebra, we can prove that the semimodule of continuous homomorphisms F: ?" -> ? (in what follows such homomorphisms are called linear junctionals on P") is isomorphic to P" itself. Similarly, any endomorphism ? : ?" -> ?" (a linear operator on P") is determined by an P-valued ? ? ? matrix, see Cuninghame- Green A979), Kolokoltsov B001), Maslov et al. A992). 1.2. Pseudo-additive measures 1.2.1. S-measures. Let X be a fixed non-empty set. We restrict first on [0, l]-valued set functions. Dehnition 1.3. Let S be a ?-conorm and let ? be a ?-algebra of subsets of X. A mapping m:F-> [0, 1] is called an S-measure (decomposable measure) if it is
Pseudo-additive measures and their applications 1409 continuous from below, w@) = 0 and m is S-decomposable, i.e., for all А, В e ? with А П В — 0 we have m(AU B) = S(m(A),m(B)). Remark 1.4. (i) If S is a left continuous f-conorm, then a set function m: X1 -» [0, 1] satisfying w@) = 0 is an S'-measure if and only if for each sequence ( A„)„6n of pairwise disjoint elements of ? we have / oo m I (J A„ = S m(A„). \n=l / (ii) If the set X is finite or countably infinite and S is a left continuous f-conorm, then each S'-measure m :V{X) -» [0, 1] is uniquely determined by the values m({x}) with xeX. (iii) A set function m : ? -» [0, 1] is SM-decomposable if and only if for all А, В е X1 we have w(A U ?) = 5"M(w(A), m(S)). Example 1.5. (i) Obviously, each measure m : ? -> [0, oo] with Range(w) с [0, 1] is an S^-measure. (ii) Let В be the ?-algebra of all Borel subsets of the real line. The function m:B^> [0, 1] defined by m{A) = ????(?(?), 1), where ?:? -> [0, oo] is the Lebesgue measure, is an S'L-measure, but not a measure. (iii) Fixing a number ieN and defining the set function m : V(N) -> [0, 1 ] by /card(A) m(A) = min where card(A) denotes the cardinality of the set A, it is easily seen that m is always an 5l-measure, and an SO-measure in the case к = 2, but not a measure. Generally, m is an S-measure if and only if the restriction of S on the chain {0, \/k,...,k/k} is divisible Archimedean discrete ?-conorm (see Klement et al. B000a, 2000b)), i.e., it coincide with the restriction of S^ to that chain. (iv) The function m:B^> [0, 1 ] defined by m(A) = 1 — e -?(?) is an Sp-measure. (v) For an arbitrary function /: X -> [0, 1], the set function w/ :V(X) -> [0, l]defined by m/(A) = sup{/(;c): ? e A} is an ^M-measure (Maslov and Kolokoltsov, 1997; Pap, 1995; Shilkret, 1971; Sugeno, 1974). (vi) The set function m : V([0, 1 ]) -> [0, 1 ] defined by ... [sup A if A is finite or countably infinite, m(A) = { , , 1 otherwise,
1410 ?. Pap is an ^M-measure. (vii) Let ?: ? -> [0, 1] be a probability measure. Then for each a e ]-1, oof \ {0} the set function ma : ? -> [0, 1] defined by ma(A)=-U\+a)i'iA)- 1) is an 5,swa-measure for the the Sugeno-Weber f-conorm Sswa(.x, y) = min(.x + у + axy, 1), see Klement et al. B000a). Observe that we have mA = (s^w)-1 ? ?, where s^w is the unique additive generator of Ssw<* which satisfies s^w( 1) = 1 · We remark that mi is subadditive if and only if ? e] —1,0[, and superadditive if and only if ??]0, oof. (viii) If m : ? -> [0, 1] is an S-measure and ?:[0, 1] -> [0, 1] an increasing bijection, then ?-1 ?»?:?^ [0, 1] is an ^-measure, where So; is defined by SOJ(x,y)=co-i(S(w(x),w(y))). The S-measures presented in Example 1.5(i)-(iv), (vii) are special cases of S-measures (decomposable measures) (Dubois and Prade, 1982; Weber, 1984), some of which can be constructed as follows: PROPOSITION 1.6. Let S be a continuous Archimedean t-conorm, s : [0, 1] -> [0, oo] an additive generator of S, ?? ?-algebra of subsets of X and m ; ? -> [0, oo] a measure. Then the function i'~"offl:I'^· [0, l]iian S-measure. We introduce now a generalization of the concept of ?-finiteness of additive measures (Klement et al., 2000a, 2000b). Definition 1.7. Let 5" beai-conorm, ? aa-algebraof subsets of X and m : ? -> [0, 1] an S'-measure. (i) A set A e ? is called S-m-faithful if for each В e ? with ? с A we have S(u, v) < 1 whenever и < m(B) and ? < m(A \ B). (ii) A partition С = {Q: Q e X\ A: e ?} of X, where ? is a finite or countably infinite index set, is called an S-m -partition if, for each к e K, the set Q is S-w-faithful. (iii) The S'-measure w is said to be S-faithful if there exists an S-m-faithful partition С of X. Remark 1.8. LetS beai-conorm, ? aa-algebraof subsets of X and m : ? -> [0, 1] an S-measure. (i) If A e ? is S-w-faithful then each В e ? with ? с A is 5-w-faithful. (ii) If we have S(x, y) = 1 if and only if max(;t, y) = 1 then X is S-w-faithful, and consequently each partition С = {Q: Ck ? ?, к e K} of X, where К is a finite or countably infinite index set, is an S-m -partition. Hence each SM-measure is Sm- faithful and, if S is a strict ?-conorm, then each S-measure is S-faithful too. (iii) Each A e ? with m{A) < 1 is S-w-faithful. (iv) In case of a nilpotent ?-conorm S, the S-faithfulness of an S-measure m is weaker than the requirement that X be w-achievable in the sense of Weber A984).
Pseudo-additive measures and their applications 1411 An interesting relationship between (?-additive) measures and S-measures with respect to some continuous Archimedean f-conorm S is given as follows: PROPOSITION 1.9. Let S be a continuous Archimedean t-conorm, s : [0, 1 ] -> [0, oo] an additive generator of S, ? ? ? -algebra of subsets of X and m : ?1 —» [0, I] an S-measure. Then s о m : ? —» [0, oo] is a measure if and only if X is S-m-faithful. The assumption that X be S'-w-faithful cannot be dropped because of Example 1.5(H), (iii). Observe that, in (ii), a set A e В is S^-m-faithful if and only if ?(?) ^ 1, and in (iii), if and only if card(A) ^ k. We only mention that, if S, s, ? and m are as in Proposition 1.9, X is not S-m-faithful if and only if s о т is not a measure but a continuous from below set function vanishing on empty set and (see case (NSP) in Weber A984)) there exist disjoint elements А, В & ? such that (s о m){A Ufi)<(so m)(A) |(jo m){B). Turning to an example of a non-Archimedean f-conorm, recall that ^M-measures were called (?-) maxitive measures in Shilkret A971), which appear in the context oi fractal dimensions (see Falconer B002), this Handbook). Also the possibility measures, which are special cases of upper probabilities and of plausibility measures and which have a close connection to the theory of fuzzy sets, are of interest. Since, for an arbitrary (even for an uncountable) index set /, we naturally have E???)/6/?, =sup{a,-: i el} for each family (a,)/6/ e [0, 1]', we may even consider the following special case of a maxitive, i.e., ^M-measure. DEHNITION 1.10. An ^M-measure m:V(X) -> [0, 1] is called a completely maxitive measure if for each family (A,),-6/ of subsets of X we have W(UA/) =suP{w(A/): ' e /)· A completely maxitive measure ? :V(X) -> [0, 1] with ?(?) = 1 is called a possibility measure. Example l.l 1. (i) For each function /: X -^ [0, 1], the 5м-measure m/ given in Example 1.5(v)is a completely maxitive measure. Conversely, for each completely maxitive measure m:V(X) -> [0, 1] there exists exactly one function /: X -> [0, 1], which is given by f(x) = m({x}) (and which can be interpreted as the membership function of a fuzzy subset of X in the sense of Zadeh (see Klement et al. B000a)), such that m = m f.
1412 ?. Pap (ii) The ^M-measure m considered in Example 1.5(vi) is not completely maxitive. (iii) One example of an SM-measure is related to the Hausdorff dimension defined on V(W) (see, e.g., Falconer B002), this Handbook). First, for each subset А с R" consider its diameter d(A) = sup{||;c - y\\ \ x, у е A) (where \\x\\ is the usual Euclidean norm of ? e R"), and define for each ae]0,oo[ the function Ha:V(R")->[0,oo]by Ha(A) = \im inf J^d(Akf: (Ak)keN e V(Rnf, А с |J Ak, l/t=l k=\ d(Ak) ? ? for а\\ к eN\ J. Then the Hausdorff dimension dim/y: ^(R") -> [0, oof is given by dimw(A) = inf{ae]0,oo[: Я"(А)=0} (= sup{<* e]0, oof: Я"(А) = oo}), and ^dimw :V(W) -> [0, 1] is an SM-measure (which is not completely maxitive). 1.2.2. ?-measure. We extend now the previous considerations for [a, /?]-valued set function. Let X be a non-empty set. Let ? be a ? -algebra of subsets of X. A set function w : ? -> [?, /?]+ (or semiclosed interval) is a ©-decomposable measure if there hold m@) = 0 (if ? is not idempotent), ffl(AUB)=m(A)$m(8) for ?, ? e ? such that ? ? ? = 0. In the case when ? is idempotent, it is possible that m is not defined on an empty set. A ?-decomposable measure m is ?-?-decomposable if Qa, =0m(A,) / /=l hold for any sequence (A;),-6pj of pairwise disjoint sets from ?. Further on, we shall suppose that m be a ?-?-decomposable measure and < is total order on [a,b]. The construction of pseudo-integral is similar to the construction of the Lebesgue integral. Dehnition 1.12. A set function m: ? -> [a, b]+ (or semiclosed interval) is a ?-?-decomposable measure if there hold: A) m@)=O; B) w((J^| ?? = 0/ = i m(Ai) for any sequence {A,},6n of pairwise disjoint sets from ?, where ?^, *,¦ = Нт,,^^ ?"=? ??- If ® is idempotent operation condition A) and pairwise disjointness of sets can be left out.
Pseudo-additive measures and their applications 1413 1.3. Pseudo-integral Let ([a, b], ?, ?) be a semiring (see the remark at the end of this section), ([a, b], ?) and ([a,b], 0) are complete lattice ordered semigroups. Let interval [a,b] be endowed with a metric d compatible with sup and inf (limsupjt,, = ? and liminf.*,, = ? imply Vim„^ood(x,i,x) = 0) and let the metric d satisfies at least one of the following conditions: (a) d(x®y, x'®y')<:d(x,x')+d(y,y')\ (b) d(x®y, x'®y')^max{d(x,x'), d(y,y')}. Both conditions imply: d(x„, y„) -> 0 =>¦ d(x„ ? ?, у,, ?;)^0. We suppose that metric d is also monotonic, i.e., x±z<y => d(x,y)^max.[d(y,z),d(x,z)}. Example 1.13. Metric with property (b) on the semiring ([-oo, +oo[, max, +) is ddx,y) = \e-x -e~y\. Metric with property (b) on the semiring ([—oo, +oo], max, min) is di(x,y) = | arctanjc — arctan;y|. Both metric are monotonic. The pseudo-characteristic function of a set A is: 1 for ? e A, 0 for ? f A, where 0 is zero element for ? and 1 is unit element for O. For functions defined on X and values in [a, b] we transfer pointwise the operations ? and 0. A mapping s :X -» [a,b] is a simple function if it has the following representation s = ?"=| ?, ? 1A., where ?, e [?, b], A, e ? and if ? is not idempotent then sets A,- are disjoint. An elementary (measurable) function is mapping e.X -> [a, />] that has the following representation oo е = фа, OIa,, A) for a, e [a,b], Aj e ?,? e X and if ? is not idempotent sets A, are disjoint, when the right-hand side of equality A) exist.
1414 ?. Pap Definition 1.14. Let ? be a positive real number and В с [a,b]. A subset {/f},6/ of set ? is a ?-net on ? if for each ? e В there exists If such that d(lf, ?) ^ ?. If we, also, have If < x, then we shall call {If },6/ a /ower ?-иел If /f < /?_, holds, then {/(?} is monotone. We choose a metric d which ensures that each bounded subset of metric space ([a, b], d), is compact. Then this fact would imply the existence of a finite ?-nets in every bounded subset of [a,b], e.g., in Example 1.13 for the semiring ([-oo, +oo[, max, +) for the metric d\ we have that in the bounded set {x: d\ (x, — oo) ^ ?} there exists the following finite ?-net /? _ ? 1?(? - ? ?) forO ^ ? < i 1 \ -oo for г = i 0, where г'о = [?/?] + 1. THEOREM 1.15. Let f : X —> [a, b] be measurable if pseudo-addition is idempotent, and if not, let f be measurable such that for each positive real number ? exists a monotone ?-net in f(X). Then there exist a sequence (<pn)neN of elementary functions such that, for each ? e X, d(<pn(x), /(-*)) -> 0 uniformly as ? -> oo. The proof of Theorem 1.15 that gives the construction of functions <p„ can be found in Pap A990). Definition 1.16. The pseudo-integral of a simple function s (elementary function e) with respect to the ?-?-decomposable measure m is: / sQ<iffl = 0flj0m(Ai), (/ eQdm = QajOm(Ai)\. The pseudo-integral of a bounded measurable function /: X -> [a, />], for which if ? is not idempotent for each ? > 0 there exists a monotone ?-net in /(X), is defined by: / f Qdm = lim / <p„ ? dfi Jx n^xJx where (<^„)/6n is the sequence of elementary functions from theorem above. The pseudo-integral over A, when A is an arbitrary subset of X, is given by: / fQdm= / ./A JX fOdm= / AA ?/) ? dm, where хд is pseudo-characteristic function of set A.
Pseudo-additive measures and their applications 1415 Let (G, +), G С ?.", where + is the coordinatewise addition. Definition 1.17. The semiring B(G. [a.b]) consists in Cases I and III (at least pseudo- addition is idempotent) of the bounded (with respect to the order in [a. b]) functions, and in Case II (pseudo-addition has been represented by its additive generator g) of functions /:G -> [a,b] with property g(|/|) e L\(G) (the space L\(G) consists of Lebesgue integrable functions which satisfy the condition fG \f(x)\ dx < +00). All previous considerations can be transferred to the case [a, b] с К", taking care that the order in M." is a partial order. 1.4. Pseudo-convolution Let G be subset of W and * a commutative binary operation on G such that (G, *) is a cancellative semigroup with unit element e and G+ = {?: ? e G, ? ^ e) is a subsemigroup of G. All considerations can be managed also for a general topological group G. We shall consider functions whose domain will be G. Definition 1.18. The pseudo-convolution of the first type of two functions /: G -> [a, /?] and h:G-> [a, b] with respect to a ?-?-decomposable measure m and * e G+ is given in the following way f® Jg\ f*h(x)= / f(u)Odmh(v), where Gx+ = {(?, ?): и * ? = ?, ? e G+, и е G+}, w/, = w in the case of sup-decomposable measure m{A) = supv6A h{x), in the case of inf-decomposable measure m(A) = infxeA h(x), and dm/, = h ? dm in the case of ?-decomposable measure m, where ? has an additive generator g and g о m is the Lebesgue measure and / e S(G, [a, b]). We consider also f/?e second type of pseudo-convolution when (G, *) is a group and the pseudo-integral is taken over whole set G: f*h(x)= / /(**(-0)©d»M0· where (—f) is unique inverse element for t and jc e G. Remark 1.19. When * is the usual addition on R and G = E, pseudo-convolutions of the first and the second type, for jc e R+, are (f*h)(x)= / f(x-t)Odmh(t), J [0.x] f® = / fix Jg {f*h){x)= / f(x-t)Odmh(t),
1416 ?. Pap respectively (Pap and Stajner, 1999). For both types for Case II we shall use also the notation W-/ (f*h)(x)=l f{x-t)Oh(t)dm{t). Remark 1.20. In Schweizerand Sklar( 1983) the convolution of two functions has been defined as f*h{x)= f f{x-t)dh{t) B) J[0,x[ for all ? e ]0, oof, / * /?@) = 0 and / * h(oo) = 1. Convolution defined in such manner is commutative, associative operation with identity. Using this approach we can introduce a different type of pseudo-convolution of the first type, e.g., for g-case when * = + and G = R: f*h{x) = g-'U g(f(x-t))dg(h(t))\ Convolution B) can be transformed into the form f*h{x)=i dn(f(u),h(v)) JSum\x} where Sumfjc} = {(и, и): и + v < ?; и, ? е R+} and П(х,у) = х у. The main difference between this definition and Definition 1.18 is existence of identity, which is not certain for pseudo-convolution (see g-case). Next definition considers cases when pseudo-addition ? is an idempotent operation. Definition 1.21. Pseudo-delta function is given by 1 for ? = e, ??) = K ' ? forjc^e, where 0 is zero element for ?, 1 is unit element for ? and e is zero element for *. We shall give some examples of pseudo integrals and related pseudo-convolutions for Cases I—III. We restrict here to the case G С R· Example 1.22. In this example we shall give the form of the pseudo-convolution of the first and the second type for some characteristic cases (the relevant semirings are the semirings that are mentioned in the section above) and for * = + and G = R.
Pseudo-additive measures and their applications 1417 Case I. base i. (i) For the semiring ([—oo, oof, max, +) pseudo-integral, with respect to sup-decomposable measure m, m(A) = supv6A h(x), is given by r® / fOdm = sup(f(x)+h(x)), Jm ж and the pseudo-convolution of the first type of the functions / and h will be (f*h)(x)= sup (f(x-t) + h(t)), ?????? and the pseudo-convolution of the second type is (f*h)(x) = sup(f(x-t) + h(t)). Unit element for this pseudo-convolutions is the following pseudo-delta function Лтах. + ,г,_{1(=0) if*=0, w-|0(=-oo) ifx^O. (ii) For the semiring (]—oo, oo], min, +) pseudo-integral is given by / fOdm=mf(f(x) + h(x)), where the function h defines the inf-decomposable measure m. The domain of functions will be [0, oo] (or some subset of [0, oo]) and the domain of the semiring can be any subinterval of ] — oo, oo] which contains 0 and oo. The zero element for the ? is oo and the unit element for the ? is 0. Pseudo-delta function is given by min. + ,^_Jl(=0) if*=0. Г'П' (*)-}0(=oo) ifjt^O. Previous pseudo-delta function is the unit element for the operation of pseudo-convolution of the first type (f*h)(x) = Qmf<p(x-t)+h(t)). and of the second type (f*h)(x)=inf(f(x-t)+h(t)). teR We can consider in analogous way the more general semiring ([a, b], max, ?), such that © is non-idempotent pseudo-multiplication.
1418 ?. Pap Case II. Pseudo-convolution of the first type in the sense of the g-integral (see Pap A993a)), i.e., when the pseudo-operations are represented by a generator g as ? ? у = g~' (g(x) +g(y)) and ? О у = g~] (g(x)g(y)), is given in the following way (f*h)(x)= / f(t)Qh(x-t)dm(t) J[0..x] = 8-*ПХ8(П»)-8(И(х-0)аГу Pseudo-convolution of the second type in the same sense is f® (f*h)(x) = / f(t)Qh(x-t)dm(t) Jm = *"'(/ g(fU))-g(h(x-t))dt\ Case III. For the semiring ([—oo, oo], max, min) pseudo-integral is given by / / О dm = sup(min(/(jc),/?(;c))), JM хеш where the function h defines the sup-decomposable measure m. The domain of functions will be R (or some subset of R) and the domain of the semiring is [-00, 00] (or any subinterval). The zero element for the ? is —00 and the unit element for the © is +00. Pseudo-delta function is given by «jmax.min.4_ J 1 (=+00) if*=0, w~|0(=-oo) if jc ^0. Pseudo-delta function is the unit element for the pseudo-convolution of the first type and the pseudo-convolution of the second type: (f*h)(x)= sup (min(/(;t - t),h(t))) the first type, (/ * h)(x) = sup(min(/(;t - t), h(t))) the second type. We can analogously consider the pseudo-integral for semiring ([—00, 00], min, max) valued functions.
Pseudo-additive measures and their applications 1419 Example 1.23. Let ? *y = x + y +xy, G =R, /:K -> [0,1] and h:R -> [0, l].Let ? be arbitrary f-norm. The pseudo-convolution of the first type for ? = max, ? = ? and ? e G+ = [0, oo] (case (I)) is f -kh(x) = sup[T(f(u),h(v)): и + v + uv = ? for u. ? e [0, oo]}. Another way to write it down is /**и,-.Я,|г(/(тТ7)Н1 or, in the sense of Definition 1.18 f*h(x)=[ f(±^L\odmh(t). J[0.x] \1+'/ The basic properties of the generalized pseudo-convolution for an idempotent pseudo- addition are given in the following theorem (see Pap and Stajner A999)). THEOREM 1.24. Let J7 be a class of functions f such that f:G->[a, b], where (G, *) is a commutative cancelative semigroup with unit element e. Let ? be continuous (up to some distinguished points) pseudo-multiplication of the first or the second type on interval [a,b]. Then the pseudo-convolution of the first type (second type for G a commutative group) for the idempotent pseudo-addition (Cases I andlll) is commutative, associative operation with the unit element <5®°. Remark 1.25. When pseudo-addition ? is max, then, we can take left continuous pseudo-multiplication instead of continuous pseudo-multiplication. Restricting on Case I: P = (min, +), we have the convolution on B(G, P) (f*h)(x)= M(f(y) + h(x-y)). veG This operation turns B(G, P) into an idempotent semiring, which will be denoted by CS(G) and called the convolution semiring. Some subsemirings of CS(G) are of interest in studying multicriteria optimization (Kolokoltsov, 2001). Let L denote the hyperplane in R* determined by the equation L=\(aJ)eRk: JV=o}, and let us define a function и e CS(L) by setting n(a) = maxy(-a^). Obviously, ? *? =n, i.e., и is a multiplicatively idempotent element of CS(L). Let CS„(L) с CS(L) be the subsemiring of functions h such that n*h=h*n=h. CS„(L) contains the function
1420 ?. Pap identically equal to 0 = oo and that the other elements of CSn(L) are just the functions that take the value 0 nowhere and satisfy the inequality h(a) - h{b) ^ n(a - b) for all a,b e L. In particular, for each h e CS„(L) we have \h{a) — h{b)\ ^ max|a^ — ЬЦ = \\a — b\\, j which implies that h is differentiable almost everywhere. Closely related to this convolution semiring is the semiring in the following example. Example 1.26. Pareto order (a = (a\,..., a„) ^ b = (b\,..., b„) if and only if ?, ^ bi for all i = 1,..., n) defines in R" the structure of an idempotent semigroup. For any subset А с Шк, by Min(A) we denote the set of minimal elements of the closure of A in Rk. Let P(Rk) denote the class of subsets А с Шк whose elements are pairwise incomparable, P(Rk) = {Ac Rk: Min(A) = A}. Obviously, P(Rk) is a semiring with respect to the operations ? ? ? ?? = Min(A| U A2) and A1 © A2 = Min(Ai + A2); the neutral element 0 with respect to addition in this semiring is the empty set, and the neutral element with respect to multiplication is the set whose sole element is the zero vector in Rk. The semiring P(R.k) is isomorphic to the semiring of normal sets, i.e., closed subsets N С IR*· such that b e N implies a e N for any a ^ b; the sum and the product of normal sets are defined as their usual union and sum, respectively. Indeed, if N is normal, then Min(N) e P(Rk); conversely, with each A e P(Rk) we can associate the normalization Norm(A) = {a e Rk | Bb e A: a ^ b). The last two semirings CS„ (L) and P(Rk) are closely connected, as shows the following proposition that is a specialization of a more general result given in Samborskii and TarashchanA990). THEOREM 1.27. The semirings CS„(L) and P(Rk) are isomorphic. For functions with values in semirings of type II we have the following THEOREM 1.28. Pseudo-convolution of the first or the second type for g-case (Case II) /5 commutative and associative operation while G is whole set of reals and * is the usual addition on E. We consider now the question when the generalized pseudo-convolution satisfies the cancellation law, see Pap and Stajner A998). First we give a example which shows that generally this law is not true. Example 1.29. Take functions / and g with values in the semiring (]-oo, +00], inf, +) given by h(x) = ex\ f(x) = e~x; k(x) = e~lx. Then (f*h)(x)= inf (f(y) + h(z))=0 \+Z=X
Pseudo-additive measures and their applications 1421 and (k * h)(x) = inf (k(y) + h(z)) = 0, although f фк. The following theorems from Zagordny A994) give us some positive results concerning the (inf, +) pseudo-convolution from Example 1.29. THEOREM 1.30. If f,k,h:W -> IU {+00} are proper lower semicontinuous convex functions such that h is strictly convex and .. 4x) hm = 00 |.r|->-oo |JC I then f -kh =k*h implies f=k, i.e., * does not have a zero divisor for this particular class of functions. THEOREM 1.31. If f,k,h:R" -> IU {+00} are proper lower semicontinuous convex functions. Moreover, suppose h is uniformly convex. Then f *h =k*h implies f = k. Now, let consider pseudo-operations represented by its generator g. It is well-known that Titchmarsh's theorem give us a wide class of functions for which generalized pseudo- convolution with generator g(x) = ? and * = + does not have a zero-divisor. With the same method of proof we can prove more general theorem. Let ?|oc be a class of functions with real values that are Lebesgue integrable over every compact subset of E+. THEOREM 1.32. Let g be a generator for pseudo-operations ? and ?. If compositions g о / and g oh are in C\oc, then generalized pseudo-convolution f*h{x) = g-'(j g(f{t))-g(h(x-t))dt\ does not have a zero-divisor. Definition 1.33. A function /: [0, +oo[^· [a, b] is an idempotent with respect to the generalized pseudo-convolution * if / * / = /. We shall discuss here the idempotents of the pseudo-convolution of first type and * = +, using Markova-Stupftanova A999). First we remark that if there exists a neutral
1422 ?. Pap element <5?·? with respect to the pseudo-convolution, then it is it idempotent. Also, the zero-function f{x) = 0, ? e [0, +oo[, is an idempotent of the pseudo-convolution. We remark that if the pseudo-addition ? has only trivial idempotents, then the pseudo- convolution has no *-idempotents up to zero-function, which we shall call trivial idempotent. Therefore we reduce on the case when ? = sup. THEOREM 1.34. Let ([0, 1], sup, T) be a continuous semiring, where ? is a continuous Archimedean t-norm and let g be an additive generator of T. Then a function f is an idempotent with respect to the corresponding pseudo-convolution * if and only if /@) e {0, 1} and the composite g о / is a subadditive function on [0, +oo[. Moreover, if /@) =0 and f is a non-trivial ^-idempotent, then limv^o+osup/(.x) = 1. THEOREM 1.35. (Necessary condition) Let * be a pseudo-convolution based on the semiring ([a, b], sup, inf). Then if f is an idempotent with respect to the pseudo- convolution *, then for every с е [a, b] we have /(c)+/(c)C/«\ where fu"> = [x e [0, +oo[: f(x) ^ c}. (Sufficient condition) If for every с е [a, b] we have f(c)+f(c) = /"¦', then f is an idempotent with respect to the pseudo-convolution *. 1.5. The Riesz type theorem The Riesz theorem in functional analysis establishes a one-to-one correspondence between continuous linear functionals on the space of continuous real functions on a locally compact space X vanishing at infinity and regular finite Borel measures on X, see Diestel B002), this Handbook. Analogous correspondence exists in idempotent analysis. We restrict here our consideration to the case of the semiring ? = (min, +) using Kolokoltsov B001). Proofs, generalizations and references could be found in Kolokoltsov and Maslov A997). Idempotent measures are absolutely continuous, i.e., any such measure can be represented as the idempotent integral of a density function with respect to some standard idempotent measure. Let us formulate this fact more precisely. Let Co(X, P) denote the space of continuous functions /: X -> ? on a locally compact normal space X vanishing at infinity, i.e., such that for any ? > 0 there exists a compact set К с X such that d@,f(x)) < ? for all ? e X\K. The topology on Co(X, P) is defined by the uniform metric d(f, g) — supx d(f(x), g(x)). The space Co(X, P) is an idempotent semi module. If X is a compact set, then the semimodule Co(X, P) coincides with the semimodule C(X, P) of all continuous functions from X to A. The homomorphisms Co(X, ?) —* ? will be called pseudo linear functionals on Co(X, P). The set of pseudo linear functionals will be denoted by Cq (X, P) and called the dual semimodule of Co(X, P).
Pseudo-additive measures and their applications 1423 THEOREM 1.36. For any m e C^(X. P) there exists a unique lower semicontinuous and bounded below function f : X -> ? such that m(h) = Mf(x)Qh(x) for every heCo(X,P). C) Conversely, any function f: X -*¦ ? bounded below defines an element m e C^(X, P) by C). At last, the functionals m^ and mj2 coincide if and only if the functions f\ and /2 have the same lower semicontinuous closures; that is, CI f\ = CI /1, where (С1/)(д0 = supf^U): ??/,?? C(X, P)}. We can define an idempotent measure ?/ on the subsets of X by ? f(B) = inf{/(.v): ? e ?}. This measure is ?-?-decomposable. Equation C) specifies a continuation of m/ to the set of ?-valued functions bounded below. As in the classical analysis, we say that such functions are integrable with respect to the measure ? f and denote the values taken by m j- on these functions by the idempotent integral r® mf(h)= / h{x)dvf(x) = mif(x)Qh(x). J ? v This integral is just the pseudo-integral, which was defined as the limits of the corresponding idempotent Riemannian (or Lebesgue) sums. Theorem 1.36 is equivalent the statement that all idempotent measures are absolutely continuous with respect to the standard idempotent measure m\. Let Co(K") be the space of continuous functions /: R" -> ? (? is of type (I)(i), (III)) with the property that for each ? > 0 there exists a compact subset К с Ш" such that d@, infv6K„^ /(*)) < ?, with the metric D(f, g) = supv d(f(x), g(x))- Let C0S(R") be the subspace of Co(K") of functions / with compact support supp0 = [x: f(x) ? 0}. One can develop the concept of weak convergence and the corresponding theory of generalized functions. In particular, for X = R", simple delta-shaped sequences can be constructed of smooth convex functions; e.g., ?™,?'+(?) is the weak limit of the sequence (fn)neN defined by f,(x) =n(x - yJ. Therefore each linear functional (or operator) on Co(K") is uniquely determined by its values on smooth convex functions. The dual semimodul (Co(K"))* is the semimodul of continuous pseudo-linear P- valued functionals on Co(K") (with respect to pointwise operations). Analogously the dual semimodul (Cq((E"))* is the semimodul of continuous pseudo-linear P-valued functionals on Cqs(E") (with respect to pointwise operations). We shall need the following representation theorem, which is a consequence of Theorem 1.36, see Kolokoltsov and Maslov A997). THEOREM 1.37. Let f be a function defined on R" and with values in the semiring ? of type (I)(i) or (III) and afunctional mj-: CqS (?") -> ? is given by r® m/(h)= fOdmh=M(f(x)Oh(x)).
1424 ?. Pap Then A) The mapping f н» m/ is a pseudo-isomorphism of the semimodule of lower semicontinuous functions onto the semimodule (C'0S (R"))*· B) The space Cq(R") is isometrically isomorphic with the space of bounded functions, i.e., for every m ft,m f2 e Cq (R") we have suprf(/i (x), f2(x)) = sup{d(mfl (h), mh{h)): h e C0(R"), D(h, OK 1}. .V C) The functionals m ft andmj\ are equal if and only if Cl/i = CI /i, where C\f(x) = $ир{ф(х): ? e C(R"), iA < /}· We remark that Theorem 1.37 is not valid for the semimodule C(R, P) of bounded continuous functions defined on R. 2. Applications 2.1. Probabilistic metric spaces We shall show that the basic notion of the theory of probabilistic metric spaces, the triangle function, is based on the pseudo-convolution of the first type. A distance distribution function is a function F, F:[0,+oo] -> [0, -boo], such that F@) = 0 and F(+oo) = 1 and it is left continuous on ]0, +oo[. The family of all distance distribution functions is denoted by ?+. Now we shall introduce an operation ? on ?+. DEHNITION 2.1. A triangle function ? is a binary operation on ?+ that is commutative, associative and non-decreasing in each place, and has ?? as identity, where . . Г 0 if jc = 0, [1 if 0 < ? < +oo. The following definition introduce the basic space (see Schweizer and Sklar A983)). DEHNITION 2.2. A probabilistic metric space is a triple (?,?,?) where ? is a nonempty set, T:M2 -> ?+ is given by (p,q) м> Fpq, ? is a triangle function, such that the following conditions are satisfied for all p, q. r e M: (i) PpP =??; (?) Fpq ^?0??? p^q; (iii) Fpq = FqP\ (iv) Fpr ^T(Fpq,F4r). Let F, ? e ?+ and u, ?, ? e [0, oo]. Taking for triangle function ? = ??, where tt(F, H){x) = sup{T(F(M), H(v)): и + ? =?],
Pseudo-additive measures and their applications 1425 (pseudo-convolution of the first type with respect to (max, T) and * = +) for a left continuous ?-norm ? we obtain a special important probabilistic metric space, the Menger space. The fact that function zj is triangle function yields from Theorem 1.24 (also, see Schweizer and Sklar A983)). It is important to obtain rich source for different triangle functions which would enable the construction of new probabilistic metric spaces. One of the useful construction goes in the following way. Definition 2.3. С is the class of all binary operators L on [0, +oo[ which satisfy the following conditions (i) L maps [0, +oo[2 onto [0, +oo[; (ii) L is non-decreasing in both coordinate; (iii) L is continuous on [0, +oo[2 (except possibly at the points @. +oo) and (+oo, 0)). We introduce now, using the family С the following binary operation on ?+. Definition 2.4. For a ?-norm ? and L e ?, a function zT L defined on (?+J and with values in [0, +oo[ is given by tt.l(F, G)(x) = sup{r(F(M), G(u)): L(u, v) = x]. In a special case L(x, y) = ? + у, we obtain ??? = *T· The following theorem guarantees under mild conditions the triangularity of zj.l (see Schweizer and Sklar A983)). THEOREM 2.5. If ? is left continuous t-norm and Lfrom С is commutative, associative, has 0 as identity and satisfy the condition if и ? < mi and v\ < ит, then L(u\, v\) < L(ui, vj), D) then zt.l is a triangle function. Condition D) is weaker than strictly increasingness of L in each place. So min and max satisfies it, although they are not strictly increasing in each place. Example 2.6. zT max is a triangle function for any left continuous ?-norm T. The left continuity of Г in the preceding theorem is only sufficient but not necessary condition. It can be proved in a similar way that for ? = min and ? = S. where S is continuous ?-conorm that functions zs(F, H)(x) = inf{S(F(ii). H(v)): и + ? = ?}. zs.l(F, H){x) = inf{S(F(ii), H(v)): L(u. v)=x] are triangular functions, where the binary operation L has all properties from previous theorem and F, ? e ?+.
1426 ?. Pap Example 2.7. rs.mjn is a triangle function for any continuous ?-conorm S. Example 2.8. Let F, ? e ?+ and ? e [0, oo]. If we consider convolution from Remark 1.20, i.e., F*tf@) = 0 and F*tf(oo)=l, F*H(x)= J F(x-t)dH(t) for;te]0,oo[, which has identity ??, we have obtained an triangle function. Remark 2.9. The g-case of pseudo-convolution of functions F and ? is not an triangle functions due to the lack of identity for that particular operation of generalized pseudo- convolution. For a general method for solving nonlinear equations, i.e., fixed point theory, in probabilistic metric spaces see Hadzic and Pap B001). 2.2. Fuzzy numbers Generally, a fuzzy number is a fuzzy subset (represented by its membership function ? a :R -> [0, 1]) of the real line R, see Dubois et al. B000). That is the most basic definition but in practice some less general definitions are being used. If A is a fuzzy set, its membership function ?? (?) will be identified with A{x). Definition 2.10. Let A be a fuzzy subset of the real line. A is a fuzzy number if it is normalized and convex, i.e., if A(x) = 1 for some ? e R and for all x,y,z eR such that ? < у < ?, it is A(y) ^ min(A(;c), A(z)). If A is a fuzzy number, then set Aa = {x e R: A(x) ^ a) for a e ]0, 1] is the ?-cut of fuzzy number A. We will give a few examples of the fuzzy numbers that are usually used in applications. Example 2.11. (i) An LR-fuzzy number A = (a,a,fi)L.R is a function from reals into the interval [0, 1] satisfying A(x) = (a-x\ LI ? ??? — ?^?^?, . 0 otherwise,
Pseudo-additive measures and their applications 1427 where a e R is the center, a > 0 is the left spread, ? > 0 is the right spread of A, L and R are non-increasing and continuous functions from [0,1] to [0, 1] satisfying L@) = R@)= 1 andL(l) = tf(l)=0. (ii) An LR-fuzzy interval A = (/, r, a, /S)z..«. for / ^ r, is a function from reals into the interval [0, 1] satisfying if ? ^ / — a, if ?-? ? ? ? I, ifl^x^r, ifr^x^r+?, ifr + ???, where L and R are non-increasing and continuous functions from [0, 1] to [0, 1] satisfying L(Q) = R@) = 1 and L(\) = R(\) = 0, / and r are called the left and right peak, a > 0 is the left spread, ? > 0 is the right spread of A. Specially, for L(x) = R(x) = 1 — ? trapezoidal fuzzy interval is obtained. The arithmetical operations with fuzzy numbers are based on Zadeh's extension principle (see Юетет et al. B000a)): Let ? be an arbitrary but fixed ?-norm and * a binary operation on R. Then the operation * is extended to fuzzy numbers A and В by A*TB(z)= sup T(A(x),B(y)) .V*Y=:- for ? e R- Some usual operations with fuzzy numbers are following: Addition is obtained for * = +: ?®? B(z) = supA.+v=. T(A(x), B(y)). Subtraction for * = -: ? ?? ? (?) = supv_v=- T(A(x), B{y)). Multiplication for * = ·: A QT B(z) = supv.v=. T(A(x), B(y)). Quotient for * = /: А Сдт B(z) = supv/v=_ T(A(x), B(y)). Multiplication and addition are generalized pseudo-convolutions of the second type based on semiring ([0, 1 ], max, Г) but, due to the fact that subtraction of reals and quotient of reals are not commutative operations, subtraction and quotient of fuzzy numbers are not generalized pseudo-convolutions. As a result of a Theorem 1.24 addition and multiplication of fuzzy numbers for ? being left continuous ?-norm are commutative and associative operations. In practical computation it is natural to require preserving the shape of fuzzy numbers during the arithmetic operations. Preserved shape allows us to control the resulting spread while working with LR-fuzzy numbers or LR-fuzzy intervals (see Dubois and Prade A988), Markova-StupfianovaA997), Mesiar A997)). In the sense of generalized pseudo- convolution, preserving the shape means that generalized pseudo-convolution is closed operation on certain family of functions. The following corollaries that result from some Propositions from Dubois and Prade A988) will give us the resulting spreads for addition based on the strongest Гм and the weakest Tq triangular norm. A(x) = @ L 1 R lo 1-х ? — r
1428 ?. Pap COROLLARY 2.12. Consider ? LR-fuzzy intervals A, = (/,, r,, a,, Pi)l.r, г = 1,...,и, then the sum ф"=| A, based on Гм (pseudo-convolution of the second type on semiring ([0, 1], 5"m, 7m)). where 5"м = max, is given by II ? II II II II \ ??·= ?''.?"·?«'·??- · E) /=1 \?=? ?=? ?=? ?=? I LR COROLLARY 2.13. Consider n LR-fuzzy intervals A, =(//, r/, ?,-, Pi)l.r, /" = 1, — и, ?/геи ?/ге яиш ф"=. A, based ои Го (pseudo-convolution of the second type on semiring ([0, 1], max, To)) is given by II ? II II \ ? At'= ??/(' ?r''' m'ax ?/'max a ) ^^ 1 = 1 \i'=l i=l '~ '~ / LR Formulas E) and F) give the limits for addition of ? fuzzy numbers based on an arbitrary ?-norm ? ((®T)"=iAj). The limit ?-norm based addition preserves the shape of incoming fuzzy numbers. However, for some special shapes, there exist some special ?-norms preserving that shape (see Klement et al. B000a)). Next interesting issue in fuzzy number theory is whether fuzzy numbers are comparable. Set of fuzzy numbers can be ordered partially. The idea is to extend operations min and max in the sense of Zadeh's extension principle: ???(?,?)(?)= sup min(A(jt),B(}')), min(.v.v)=: MAX(A,B)(z)= sup min(A(;c), B(y)). max(.t.\)=c Since min and max are continuous operations MIN(A, B) and ???(?,?) are fuzzy numbers and are pseudo-convolutions of the second type for ? = max and ? = min. Properties of new operations MIN and MAX are commutativity, associativity, existence of idempotent element, absorption, distributivity. The first two properties for MIN and MAX follow from Theorem 1.24 and the rest of those properties is easy to prove. 2.3. Information theory The information theory based on the notion of a probability has been introduced in 1948 by Wiener and Shannon (see for more details Sander B002) in this Handbook). Let (?, ?, p) be a probability space. The mapping J : ? -> [0, oo] is called an information measure if for an event A e ? we have J (A) = — logp(A). Axiomatization of information measure has been introduced in 1967 by Kampe de Feriet and Forte (see Klement et al. B000a)).
Pseudo-additive measures and their applications 1429 Definition 2.14. Let (?, 27) be a probability space. Then the following mapping J : 27 -> [0, +oo] is called an information measure if it satisfy the following conditions: A1) 7(.?) = 0, 7@) = +oo; A2) ? с A implies J(B) ^ 7(A); A3) if A and ? are independent in the sense of Banach-Marczewski, then J (? ? ?) = 7(A) + 7(?); A4) J (A U ?) = FG(A), У (?)) whenever ? ? ? = 0 for some two-place function F; A5) ./«?? A<) = linW+DC ./«?? A/)· Remark 2.15. Two events A and ? are independent is Banach-Marczewski style if they are contained in independent sub-?-algebras ? and ?? of 27, respectively. Two sub-?- algebras ? and N oi ? are independent if and only if for all 0 ^ A e ? and 0 ? В е Л/" is АП ? ^0. The function F that appears in the previous definition is called a composition law. Usually, for F are required: A) continuity, B) symmetry (F(x, y) = F(y,x)), C) associativity (F(F(jc, }>), z) = F(jc, F()·, z))), D) monotonicity (F is non-decreasing in both components), E) neutral element (F(jc, oo) = jc). Schweizer and Sklar A969) proposed that information J should be treated as a random variable on [0, oo] instead as a non-negative real number. The idea is similar to that of Menger's generalization of metric spaces. So, we have distribution function K(A):[0,oo] ->¦ [0, 1] that is a left-continuous non-decreasing mapping with K(A)@) = 0. Now, let 7(A) be a random information on [0, oo] for each AeA, K(A, x) the probability that the information given by the realization of A is less than x. Axioms (Il)-(I5)can be rewritten in the style of Schweizer and Sklar A969). The additivity axiom has been translated into the following form A4') for ?, ? e ? that ? ? ? ? 0 it is K(Al) ?) =?(?(?), ?(?)), where ? is a continuous, associative, commutative, non-decreasing two-place function on the system of distributions on [0, oo], with neutral element 0 and annihilator 1. Let S be a continuous ?-conorm. Then functions ?5(?, G,x) = S(H(x),G(x)), ts(H.G,x)= sup S(H(u),G(v)), where и, ?, ? e [0, +oo] and H, G are distributions on [0. +oo], fulfill the required axiom. The second one is an pseudo-convolution of the first type with respect to (max, S).
1430 ?. Pap 2.4. System theory Pseudo-convolutions are as important tools in the system theory as the classical convolution was (see Baccelli et al. A992), Maslov and Samborskij A992)). Let us consider arbitrary set of events. There is a numbering mechanism which assigns a number to each event in order of these events taking place. It starts from an initial finite value ко e ? (where ? is set of all integers) which can, also, be negative value. There is, also, a special type of event that corresponds to ticks of an absolute clock. These ticks are numbered in increasing order, starting from some value ?? e Z. Mapping from ? into ? such that graph is the set of all pairs (event number, clock value) is dater associated with a certain type of event. Definition 2.16. A signal и is a mapping from (R,+, ·) into ([—oo, oo], max, +). When a signal is a nondecreasing function it is called a dater. Dater d may not be strictly increasing since events of the same type may occur simultaneously. The convention is that ' d(k) = oo when к < ко. +oo event never took place, any finite value otherwise. Domain of d may be extended by setting d(—oo) = —oo and d(oo) = oo. The semiring for the range of d is ([ —oo, oo], max, +). Now, pseudo-addition is the conventional pointwise maximum: (d\ ? di)(k) = max(d\(k),d2(k)). Pseudo-multiplication is: (d\ ? ат)(к) = suP/ez(^i @ +di{k - /)) and that is the pseudo-convolution of the second type (integral is taken over whole group Z) for ? = max, ? = + and * = +. One of elementary systems is shift ?4. This system maps inputs to outputs according to the equation у (к) = и (к - g) for all к, where и is a signal. Definition 2.17. A linear system S is called shift-invariant if it commutes with all shift operators, that is if for all и and for all c, S(rc(u)) = rc(S(u)). Interesting part is that in the shift-invariant case the input-output relation can be expressed as follows: y(k) = supseU(h(k - s) + u(s)). That is, the pseudo-convolution of the second type (for ? = max, ? = + and * = +) and it plays the role of the convolution in conventional system theory. 2.5. Optimization anamorphism with the probability First, we state the problem of finding the maximum of the utility function f\(X\)+f2(X2)+--- + fN(XN), on the domain D = {(x\,X2, ¦ ¦ -,xn)'· x\ +X2 ? b*yv =¦*· *i ^ 0, ? = 1,..., ?].
Pseudo-additive measures and their applications 1431 We can rewrite this problem in such manner that it represent generalized pseudo- convolution of the first type applied on N functions. Semiring that is used for this particular problem as a range of this pseudo-convolution is ([0, oof, max, +)¦ This type of problem often occurs in the mathematical economics and operation research (see Baccelli et al. A992), Bellman and Dreyfus A962)) and it can be solved by applying the pseudo-Laplace transform, the pseudo-exchange formula and inverse of pseudo- Laplace transform (see Pap and Ralevic A998)). DEFINITION 2.18. The pseudo-character of the group (G, +), GcM" is a continuous (with respect to the usual topology of reals) map ? : G -> [a, b], of the group (G, +) into the semiring (fa, b], ?, ?), with property Их+У)=Их)ОИУ), x.yeG. The map ? = 0 is trivial pseudo-character. The forms of pseudo-character in the special cases can be found in Klement et al. B000a), Pap and Ralevic A998). Interesting case for us is ([a,b], ?, ?) = ([0, oof, max, +) and then pseudo-character has the form ?(?, с) = с ¦ ?, for each с е R, where we have taken the dependence of the function ? also with respect to the parameter c. DEFINITION 2.19. The pseudo-Laplace transform ?e(/) of a function / e B(G, [a,b]) is defined by (?e/)(?)(z)=i Hx.-z)Odmf(x). JGn|0,3C|" where ? is the pseudo-character. When at least pseudo-addition is idempotent operation we can consider the second type of pseudo-Laplace transform: (?®f)(№)= f H{x,-z)Odmf(x\ Jg i.e., pseudo-integral has been taken over the whole G. The forms of the pseudo-Laplace transform are known for three special cases and can be found in Pap and Ralevic A998). We shall restrict to the special important case is ([0, oof, max, +) and then pseudo-Laplace transform has the following form (?max/)(z) = sup(-«+ /(*)), .v^O and in и-dimensional case (?max/)(z)= sup (-г,*, z„x„+fW). л,^0./=| /?
1432 ?. Pap The important result that has been proved in Pap and Ralevic A998) (see Kolokoltsov and Maslov A997)) is the pseudo-exchange formula, that transforms operation of generalized pseudo-convolution of the first or second type to pseudo-multiplication: ??(/?*/2) = ??(/?)???(/2), where functions f\, /2 belong to B(G. [a, b]). In order to solve the problem from the beginning of this section we need the next theorem (see Pap and Ralevic A998)). THEOREM 2.20. //?*(/) = F for semiring ([0, oof, max, +), then there exists (?e)~', inverse of pseudo-Laplace transform, and it has the following form: ((C®y\F))(x)=mfo(xz + F(z)). Now, let f(x) = maxD (/1 (x \) + /2 (??) ? l· /n (*n))- This is the pseudo-convolution of the first type of functions /1, · · ·, /yv with respect to (max, +)· Applying the pseudo- Laplace transform (for the case ? = max and ? = +), the pseudo-exchange formula and inverse of pseudo-Laplace transform we obtain N Cmm(f) = Cm^(fdxu + ---+fN(XN))=^Cm'd^f^ and finally the solution /(*)= i(?™*)-' ^?m-(/0J(^) = mini^ + ^?max(/,)(z)j. There is an interesting correspondence principle between probability theory and stochastic processes on the one hand, and optimization theory and decision processes on the other hand (Baccelli et al., 1992; Kolokoltsov and Maslov, 1997). Then the Markov causality principle corresponds to the Bellman optimality principle. For the development of the theory of optimization processes in the spirit of stochastic processes, where the notions of optimization martingales and optimization measurability play the main role, see Akian et al. A998), Kolokoltsov B001), Kolokoltsov and Maslov A997), Puhalskii B001). Formally, in the case of (space and time) homogeneous processes, the connection between Markov and Bellman processes is given by the Cramer transform (—?m,n) о logo?+ (Akian et al., 1998). In general, Bellman processes present a kind of semiclassical approximation to the Markov processes (Kolokoltsov and Maslov, 1997; Kolokoltsov, 2001; Puhalskii, 2001). We are not going into details in this direction, but we shall explain here only the connection between probability and idempotent measures, which is given by the large deviation principle. To this end, let us recall first the following general definition. Let ? be a topological space and ? be the algebra of its Borel sets. One says that a family of probabilities
Pseudo-additive measures and their applications 1433 (??), ? > 0, on (?, ?) obeys the large deviation principle if there exists a rate function I: ? -> [0, oo] such that A) /is lower semi-continuous and i?(i = {a>e ?: ? (?) ? a} is a compact set for every ? < oo, B) - limsup^oelogPftC) ^ infw6c /(?) for every closed set С С ?, C) -liminfj^oelogPHi/) ^ infw6(y /(?) for every open setU с ?. Obviously m(A) = ????6? ? (?) is then a ?-min-additive measure with respect to the operation ? = min function on ?. Therefore, it is naturally to generalize the previous definition in the following way (Akian, 1999; Puhalskii, 2001). For any Borel set A let Pout(A) = limsupelogPf(A). P,n(A) = \immfs\ogP?(A). One says that (??)?>? obeys the weak large deviation principle, if there exists a positive idempotent measure m on (?, ?) such that A) there exists a sequence (i2„)„6N of compact subsets of ? such that ??(?^) -> 0 = +oo as ? -> oo, where С stands for the complimentary set of C, B) m(C) ^ -Pout(C) for every closed С С ?, C) m(t/) ^ -P'n(U) for every open U (??. Applying Theorem 1.36 and its generalizations one can prove (see details in Akian A999), Puhalskii B001)) that the large deviation principle and its weak version are actually equivalent for some (rather general) "good" spaces ?. One can obtain also an important correspondence between the tightness conditions for probability and idempotent measures. 2.6. Pseudo-linear operators Every pseudo-linear operator on the semimodule of functions with values in a semiring with idempotent addition is an integral operator, see Maslov and Samborskii A992), Kolokoltsov B001), Pap and Teofanov A998). Let ? = (min, +) and X a locally compact normal space. An P-linear continuous operator on (or homomorphisms of) semimodules Co(X) is continuous mapping ? : Co(Y) -> Q(X) such that B(a ? h ? с ? g) = a Q B(h) ? с ? B(g) for every а, с е A and h, g e Со(У). THEOREM 2.21. Let B:Co(Y) -> Co(X) be a continuous ?-linear operator. Then there exists a unique function b:XxY—>P (called the integral kernel of B) lower semicontinuous with respect to the second argument and such that (Bh)(x) = mfb(x,y)Oh(y). G) у This result is actually a direct consequence of Theorem 1.36. Properties of the kernel b(x, y) are known, which are necessary and sufficient for G) to define a continuous linear operator. One can also specify the properties of b under which the corresponding operator
1434 ?. Pap acts on the spaces of continuous functions with a compact support, or on the space of functions having a limit at infinity. Moreover, one can describe the properties of the kernel b that ensures that the corresponding operator is compact (or completely continuous), in the usual sense, i.e., it carries each set bounded in the metric to a precompact set. One easily sees that if X is compact, then for any continuous b, G) defines a completely continuous linear operator. The group of invertible linear operators is very slender. For example, any invertible nxn matrix with elements in ? is the composition of a diagonal matrix and a permutation of the standard basis of the free semimodule P". More generally, the following result holds, see Kolokoltsov B001). Theorem 2.22. Let B:C0(Y)^Co(X) and D:C0(X) -»· C0(Y) be mutually inverse ?-linear operators. Then there exists a homeomorphism ? : X —» У and continuous functions ?: X -» ? and ?:? -*¦ ? nowhere assuming the value 0 such that ?(?) ? ? (? (?)) = 1 and the operators В and D are given by the formulas (Bh)(x) = <p(x)Oh(a(x)), (Dg)(y) = V(>0 ©·?(<*"' (>0) for every heCo(Y), geCo(X)- The spectrum of general compact linear operators in idempotent analysis is also very slender. THEOREM 2.23. Let X be a compact set, and let В be a P-linear operator on C(X, P) with a continuous integral kernel b, which is nowhere equal to 0 = oo. Then В has a unique eigenvalue ? ф 0 and a (not necessarily unique) eigenfunction h e C(X, P), which is nowhere equal to 0, such that Bh = ?©/? = ?+h. The application of this result to the optimization theory is based on the following two corollaries. COROLLARY 2.24. Let an operator В satisfy the conditions of Theorem 2.23, so that its eigenvalue ? is unique and the eigenvector h is nowhere equal to 0. Let f ^ h + с with some constant с Then Bmf(x) lim =?. m—*oc m COROLLARY 2.25. Under the conditions of Corollary 2.24, if ? > 0, then the Neumann series /0?(/)??2(/)?··· (8) is finite, that is, is equal to the finite sum ? ? B(f) ? · · · ? Bk (f)for some к.
Pseudo-additive measures and their applications 1435 The ?-series (8) represent the solution of the equation g = Bg® f which describes for the stationary optimization problem corresponding to the Bellman operator B. We consider now operators homogeneous in the sense of the semiring ? = R U {+00}, i.e., operators В on function spaces such that В (a +h)=a+B(h) for any number a and any function h. The theory of such operators is closely related to game theory and there is an analog of the eigenvalue theorem for such operators (Maslov and Samborskii, 1992). This analog is applied to the construction of the turnpikes in stochastic games. For simplicity, we only consider the case of a finite state space X = {1,..., ?} in detail, using Kolokoltsov B001). We define an antagonistic game on X. Let pij (?, ?) denote the probability of transition from state i to state j if the two players choose strategies a and ?, respectively (a and ? belong to some fixed metric spaces), and let bis (?, ?) denote the income of the first player from this transition. The game is called a game with value if ? ? minmaxVV^a, fi)(h> +bjj(a. ?)) = max пипУ^ ;>;,((*, /S)(/?y + bij(a, ?)) с/ ? ? ? for all у e R". The operator В :R" -> R" such that B,(y) is equal to (9) is called the Bellman operator of the game. Applying the dynamic programming method Bellman and Dreyfus A962), we can prove that the value of the A:-step game defined by the initial position i and the terminal income h e R" of the first player exists and is equal to Bkt (h). It is obvious that the operator В has the following two properties: B(ae + h) = ae + B(h), for every a e R, h e R", e = A,..., 1) e R", A0) \\B(h)- B(g)\\ < ||/i-g||, for every h,geR". A1) where \\h\\ = max|/?'|. These two properties are characteristic of the game Bellman operator. Namely, as it was proven by Kolokoltsov in Maslov and Samborskii A992), each map satisfying A0) and A1) can be represented in form (9). Another characterization of homogeneous nonexpansive maps was obtained in Crandall and Tartar A980): if S:R" -^ R" satisfies A0), then В is nonexpansive, i.e., it satisfies A1), if and only if it is order-preserving. In the investigation of homogeneous operators, it is useful to define the quotient space ? of the space R" by the one-dimensional subspace generated by the vector e = A,..., 1). Let ?: R" -> ? be the natural projection. The quotient norm on ? is defined by II Л(/?)|| = inf \\h +ae\\ = - (max/2^ - minhj). " " aeM 2\ j j ' It is clear that ? has a unique isometric (but not linear) section S: ? -> R". The image 5"(?) consists of all h e R" such that max; h' = - min^ h'. By virtue of properties A0) and A1) of ?, the continuous quotient map ?.? -> ? is well defined.
1436 ?. Pap Further, some additional technical assumptions on the transition probabilities: 3<5 > 0: V/,;, a 3/3: ?^(?,?)^?, A2) 3i > 0: Vi,./ 3m:V<*,/3: ????(?,?)>?, pjm(a,fi)>S. A3) Let all |/>iy(a, ?)\ be bounded by some constant С The proof of the following simple fact can be found in Kolokoltsov B001), Maslov and Samborskii A992) and Kolokoltsov and Maslov A997). PROPOSITION 2.26. (i) If A2) holds and ? < \/n, then В maps each ball of radius R ^ C<5 centered at the origin into itself. (ii) If A3) holds, then \\B(H)-B(G)\\ <c A -щн -G\\, for every H,Ge0. As a direct consequence of this proposition, the fixed point theorems, one obtains the following result. THEOREM 2.27. (i) //A2) holds, then there exists a unique ? e R and a vector h e K" such that B(h) = ? + h and for all g e Ш." we have \\B'"g — mk\\ ^ \\h\\ + ||й - g||, and lim,n^oo „ =?. (ii) If A3) holds, then h is unique {up to equivalence), and lim Son(B'"(g)) = Son(h), for every g e E". 3. Idempotent integral as limit of ^-integrals The properties of decomposable measures and the corresponding integrals are closely related to the properties of the pseudo-additions on which they are based. Generally, there is quite different behavior between two classes of decomposable measures: one which is based on generated pseudo-addition (g-case, see Pap A993a, 1995)) and other based on idempotent pseudo-operation (sup and inf, see Baccelli et al. A992), Kolokoltsov and Maslov A997), Maslov and Samborskii A992), Pap A995)). 3.1. Pseudo-operations Starting from a semiring of (II) type ([a, b], ?, ©), where x®y = g~]{g(x)+g(y)) and xQy = g'](g(x)g(y)) with a generator g: [a, b] -> [0, +oo]. For ? e ]0, oof the function gA (the function g on the power ?). Then gk is a generator of the semiring (fa. b], (Bi, ©a), where х®ьУ = №)~1(8к(х)+8к(У)) A4>
Pseudo-additive Measures and their applications 1437 and x ?? У = (/)"' (g4x)gHy)) =xOy< A5) since (gk(x))~i = g~\x]/A). Hence gA is a generator of the semiring ([a,b], ®?, ?). Example 3.1. Let g: [-oo, +oo] -> [0, +oo] be given by g(x) = ex. Then * ? у = ln(ev + ey) and xQy=x+y are operations defined on [-oo, +oo]. We have 0 = -oo and 1 = 0. We have for gA(x) = ln(eAv +ek>) ? ?? у = and ? О у = ? + у. ?. We have by Mesiar and Pap A999). THEOREM 3.2. Let g: [a,b] -> [0, -foo] be a strictly decreasing generator ofthe semiring ([a, b], ?, O) of the (II) type and gA the function g on the power ? e ]0, oof. Then gk is a generator of the semiring {[a, b], ??, ?) and for every ? > 0 and every (x, у) е [a, b] there exists ?? such that \x ?? У - inf(;c, y)\ < ? for all ? ^ ??. For g increasing the same result holds for sup. The same result holds for ? -> — oo changing only the monotonicity of g. COROLLARY 3.3. By the same supposition as in Proposition 3.2 every semiring ([a, b], inf, O), where © is generated by g, and a sequence {?,,} with ?„ -> oo as ? -> oo is the limit of the sequence of semirings (fa, b], ®x„, ??„) ?* ?/? -*¦ oo. w/геге ??„ and Qx„ are given by A4) and A5), respectively. We can partially obtain also a semiring of the type (III) as a limit of a family of semirings of type (II) (Mesiar and Pap, 1999). Example 3.4. We can partially obtain the sup-inf case (on interval [0, +oo]) if we put h„(x)= (exP(-(n/*)") forjr^n, [ x/{ne) for ? > n. Then we take the sequence of semirings {([0, +oo]. ?„, ?,?)}- where x®ny = h~^(hn{x) + hn{y)) and ? Q„ у = h~\h„{x)hn{y)).
1438 ?. Pap Taking ? -> -boo we obtain ? ®„ у -» sup(;c, v) and for all ?, у finite ? ©„ у -> inf(jc, у). If X or у is oo and the other is non-zero then ? ? у = oo. The continuous extension of the limit O,, on open square is just inf. Unfortunately, there cannot exist generators leading to inf on the whole square as always we have for X non-zero oo ? ? = oo. 3.2. Measures and integrals First, remark that if m is a ?-decomposable measure, where ? has a generator g, then g о m is an additive measure. Denote by ? the usual Lebesgue measure on R. The following problem will be investigated: For ®„ generated by g„ (we have ®„ -> sup) let m„ be ?-?„-decomposable measures. Let m„ -> w. Is then w sup-decomposable measure? We have m„ = g~' ? ?„, where ?„ is a ?-additive measure. Note that then ?„ = g„om„. Suppose that ?„ is absolutely continuous with respect to ?. Denote by hn the Radon- Nikodym derivative _ rf(g„ ow„) ?/? Then ™,,(A) = g~ll / /?,,^? where the usual Lebesgue integral is taken into account. First we shall give an example. Example 3.5. Let g: [a, b] -> [0, +oo] be an increasing continuous generator of the semiring ([a, b], ?, ?) of type B). Let m be a ©-decomposable measure on a measurable space (?, ?). Let a family of pseudo-additive measures {m-A} be given such thatgAow^ = g о m for all ? e ]0, +oo[. Then mk=g-]((g°m)]l'A) is a ®i-decomposable measure, ? e ]0, +oo[. Taking ? -> +oo we obtain m\^> M, where ? (A)-. where g '(!) = !. 0 ifw(A) = 0, 1 ifw(A)>0, Example 3.5 shows that the resulting sup-decomposable measure can be reduced to a two- valued set function. However, more general sup-decomposable measures can be obtained, too (Mesiar and Pap, 1999).
Pseudo-additive Measures and their applications 1439 THEOREM 3.6. Let m be a sup-decomposable measure on ([0, +oo], ??([0, +oo]), where m(A) = esssup((p(x): ? e A), ? where ?: [0, +oo] -> [0, +oo] is a continuous densit): Then for any generator g there exist a family {mi} of ^^-decomposable measures on ([0, +oo), B). where ®A is generated by gk, ? e ]0, +oo[, such that lim mi = m. ?—>+oo Remark 3.7. Note that on the intervals sup = ess sup. For the case of arbitrary Borel set, e.g., one point set, we need in the transforming from interval to arbitrary Borel set an essential supremum. Each sup-decomposable measure which is limit of g-measures (?-measures, where ? is generated by g) fulfills the countable chain condition (CCC). Theorem 3.6 shows that some special members of the class of sup-decomposable measures fulfilling CCC are limits of g-measures. Problem: is this true for every sup-decomposable measure fulfilling CCC? Then density ? need not be continuous. Theorem 3.6 gives only sufficient conditions and for measurable ? it was not yet shown, at least by our approach. We can formulate three problems: (a) Is Theorem 3.6 true for every measurable density ?? (b) Is any CCC sup-decomposable measure limit of g-measures? (c) Is any CCC sup-decomposable measure representable as ess supM ? for some measurable density ?? It is obvious that (a) and (c) would imply (b). For any continuous function / : [0. +oo] -> [0, +oo] the integral / sup f Qdm can be obtained as a limit of convenient g-integrals (Mesiar and Pap, 1999). THEOREM 3.8. Let ([0, +oo], sup, ©) be a semiring with О generated by generator g. Let m be same as in Theorem 3.6. Then there exists a family {mA} of ^^-decomposable measures, where (Bi is generated by gA, ??]0.+??[, such that for every continuous function f : [0, +oo] -> [0, +oo] fSUp /sup pis;. fQdm= lim / f Qdmy = ^JsT(fg^ofedx).
1440 ?. Pap 4. Applications on nonlinear PDE We consider here the nonlinear PDE, so called Hamilton-Jacobi-Bellman equation du(x,t) /Эй \ H[—,x,t)=0, A6) dt \dx' (see Maslov and Samborskii A992), Kolokoltsov and Maslov A997), Pap A993b, 1997b, 1997c, 1999), Pap and Ralevic A998)). Hamilton-Jacobi equations are specially important in the control theory. Unfortunately, usually the interesting models are represented by Hamilton-Jacobi equations in which the non-linear Hamiltonian ? is not smooth, for example the absolute value, min or max operations. Hence we can not apply on such cases the classical mathematical analysis. There is so called "viscosity solution" method (see Lions A982)) which gives upper and lower solutions but not a solution in the classical sense, i.e., that its substitution into the equation reduces the equation to the identity. Using the pseudo-analysis with generalized pseudo-convolution it is possible to obtain solutions which can be interpreted in the mentioned classical way. 4.1. Hamilton-Jacobi equation with non-smooth Hamiltonian 4.1.1. Two examples. We start with two examples to illustrate how can be applied the pseudo-linear superposition principle on some non-linear partial differential equations. Example 4.1. An important nonlinear partial differential equation is the Burgers equation for a function и = u(x,t). Burgers A948), Hopf A950) and Cole A951) (see also Ames A965)) investigated as a model of turbulence the following equation dv dv с d2v ,,_. dt dx 2 ox1 where с is a parameter. Putting ? = ди/дх in A7) and integrating with respect to X we obtain the equation Эй 1 /Эи\ с д~и .,„. Э7 + 2Ы -2Э^=°< A8) for ? е Ш and t > 0, with the initial condition u(x,0) = uq(x), where с is the given positive constant, and which models the burning of a gas in a rocket, see Ames A965). We shall apply on this equation by the g-calculus with the generator g(u) = e~"/c. Then, the corresponding pseudo-addition is u®v = -c\n(e-H/c+e-v'c) and the distributive pseudo-multiplication и ? ? = и + v.
Pseudo-additive Measures and their applications 1441 Then for solutions u\ and иг the function (?? ? и\) ? (?2 ? иг) is also a solution of Burgers equation. The solution of the given initial problem is c /-? tx _ s\i u(x,t) = -\nBnct)Q Qu0(s)ds, ???) ? / where the integral is given by f f(x)dx = -c\n(f e-flx)'cdx\ The operator L: и -> ко is self-adjoint with to respect to scalar product f® (u, ?)? = I u(x)Ov(x)dx. Jm Example 4.2. Taking с -> 0 in the Burgers equation we obtain Hamilton-Jacobi equation Эй 1/Эич э7 + 2Ы =°· Then for solutions u\ and «2 the function (?? ? и ?) ? (?2 ? иг), where и ? ? = min(w, ?) and и © ? = и + ?, is also a solution of the preceding Hamilton-Jacobi equation. 4.1.2. General case. We extend now the pseudo-superposition principle to a more general case, see Kolokoltsov and Maslov A997), Pap A997b, 1999). THEOREM 4.3. If u\ and иг are solutions of the Hamilton-Jacobi equation A6), where ? e C(E"+2) anddu/дх is the gradient of u, then (? ? ???)?(?2??2) is also a solution of the Hamilton-Jacobi equation A6), with respect to the operations ? = min and © = +. We consider now the following Cauchy problem for Hamilton-Jacobi (-Bellman) equation Эй /Эи\ ,._, —+//(—j=0, и(дг,0) = ио(дО, A9> where xeE", and the function ? :R" -> R is convex (hence by boundedness of ? it is also continuous). For control theory the important examples of the Hamiltonian ? are non-smooth functions, e.g., max and | · |. The approach with pseudo-analysis avoids the
1442 ?. Pap use of the so called "viscosity solution" method, which does not give the exact solution of A9) (see Lions A982)). We apply now the methods of pseudo-analysis. For that purpose we define the family of operators {R,}, >o, for a function щ(х) bounded from below in the following way u(t,x) = (R,u0)(x)= mf(uo(z)-tC®(H)(^^\), B0) where С is considered on the whole R". The operator Rt is pseudo-linear with respect to ? = min and ? = +. We denote by L (or L(q), q e R") the transformation L{H){q) = supp6R„ (pq — H(p)), which is called also Legendre transform of H. Note that L(H)(q)=-C®(H){q), where C®(H)(q) = mfpew(-pq + H(p)). We note that L is also convex for ? convex, but it can be discontinuous- First we suppose that mo is smooth and strongly convex. We shall use the notations {x, y) and || ? || for the scalar product and Euclidean norm in R", respectively. For a function F: R" -> [-oo, +00] its subgradient at a point и e R" is a point w e R" such that F{u) is finite and {w,v-u)+F(u)^F(v) for all ? e R". Then we have by Kolokoltsov and Maslov A997). LEMMA 4.4. Let щ{х) be smooth and strongly convex and there exists S > 0 such that for all X the eigenvalues of the matrix u'q(x) of all second derivatives are not less than ?. Then A°) For every ? e R", t > 0, there exists a unique ?(t,x) e R" such that (x - ?(?, x))/t is a subgradient of the function ? at the point u'Q{%(t,x)) and (RlUO)(x) = uo(Ht,x))+tL(H)(X~^t,X)\ B1) B°) The function ?(?, ?) for t > 0 satisfies the Lipschitz condition on compact sets, and ???\^0?(?,?)=?. C°) The Cauchy problem A9) has a unique C1 solution given by B1), and ди д~х uch; ди ~dt м@ (',*) =«;>(?('. у problem + я( ,*) = ' ди\ ???)~ "oW. ?)). = ?, B2)
Pseudo-additive measures and their applications 1443 is the adjoint problem of the Cauchy problem A9). The classical resolving operator R* of the Cauchy problem B2) on the smooth convex functions by Lemma 4.4 is given by (R;u0)(x) = M(uo(t)+tL(H) We note that /?* is the adjoint of the resolving operator Rt with respect to bipseudo-linear functional f® / fOhdm. Then we can introduce as in the theory of linear equation (see Pap and Takaci B002)) the notion of generalized weak solution (using Theorem 1.37). Definition 4.5. Let щ be a bounded from below function m0:IR" ->MU {+oo} and w„0 the corresponding functional from Cq(R"). The generalized weak pseudo solution of Cauchy problem A9) is a continuous function from below (Rt uq)(x) which is defined uniquely by m/?,„„(<?) — muo(/?*<p) for all smooth convex functions ?. We can construct the solution for the case when mo is a smooth strictly convex function by Lemma 4.4. Then it follows by Theorem 1.37 and Definition 4.5. THEOREM 4.6. For an arbitrary function щ(х) bounded from below the weak pseudo- solution of the Cauchy problem A9) is given by (R,u0)(x) = (R, CI ио)(дО = inf/ci ii0(z) + tC®{H) where CI/(Jt) = sup{!/.(*): r/,eC(R"), ? ? /}· 4.2. Bellman differential equation for multicriteria optimization problems We present results from Kolokoltsov and Maslov A997), Kolokoltsov B001) obtained for the controlled process in W specified by a controlled differential equation ? = f(x, v) (where ? belongs to a metric control space V) and by a continuous function ? e B(R." ? V,Rk), which determines a vector-valued integral criterion *(*(¦))= ? ?(?(?),?(?))?? Jo ?))¦ (?))
1444 ?. Pap on the trajectories. The problem of finding the Pareto set ?, (?) for a process of duration t issuing from X with terminal set determined by some function ?? e S(R", Шк), i.e., can be described by a>t(x) = Min\J(<P(x(-))Qa>o(x(t))), B3) *(¦) where x(-) ranges over all admissible trajectories issuing from X. Proposition 1.27 enables to encode the functions ?, e B(R", P(Rk)) by the functions (t, x, а) н» и, m:R+ xR" ? L-> R. The optimality principle implies the following equation, which is valid modulo 0(? ) for small ?: u(t, x,a) = Min(/?T(p(v ,V)*u(t — ?,? + ??(?)))(?). Using the representation C) of hTiptXA,) and the fact that ? is the multiplicative unit in CSn(L) we obtain u(t, x,a) = min(r^(;c, v) + u(t — ?, ? + Ax(v), a — ???{?, ?))), к where ? = к~ 2_J'PJ аП(^ 4>L = ? — 4>e- j=\ Substituting Ajc = xf(x, v) into this equation, expanding S in a series modulo 0(?2), and collecting similar terms, we obtain the equation Эй / Эй ди \ „ ^ — + max[<pL (x, v) — - f(x, v) — - Щх, ?) = 0. B4) dt ? \ da ox ) Despite of the presence of a vector criterion has resulted in a larger dimension, this equation coincides in form with the usual Bellman differential equation. Therefore, the generalized solutions can be defined on the basis of the idempotent superposition principle, as in Section 4.1. Then we obtain the main result of this subsection, Kolokoltsov and Maslov A992). THEOREM 4.7. The Pareto set ?,(?) B3) is determined by a generalized solution и, е ?, CS„(L)) of B4) with the initial condition u0(x) = hao(x)eB(R",CS„(L)). The mapping Res '¦ «0 ^ ut is a linear operator on S(R", CSn{L)).
Pseudo-additive measures and their applications 1445 Remark that B(R", S„(L)) is equipped with the CS„(L)-valued bilinear inner product {h,f)=mfx(h*f)(x). We apply the Pontryagin's maximum principle to the problem in question based on the following observation. Let R be the usual resolving operator for generalized solutions of the Cauchy problem for B4), so that R acts on the space B(R" ? L, ]-oo, +00]) of ]-oo, +oo]-valued functions bounded below on R" ? L. There is an embedding m:B(R",CS„(L)) -> B(R" ? L, ]-oo, +00]), which is an idempotent group homomorphism, i.e., preserves the operation ? = min. We have that the diagram Res B(R",CS„(L)) »- B(R",CSn(L)) R B(R" ? L,]-oo, +00]) —*- B(R" ? L,]-oo, +00]) commutes. Namely, for smooth initial data this follows from the fact that a smooth solution of B4) always defines optimal synthesis. This implies commutativity for general initial conditions, since the operators Res and R are uniquely defined by their action on smooth functions and by the property that they are homomorphisms of the corresponding idempotent semigroups, i.e., preserve the operation ? = min. This implies the following result. PROPOSITION 4.8. u(t,x,a) is the minimum of the functional ? ?(?(?), ?(?)) dz + fc^Ud »(?(?)) B5) Jo defined on the trajectories of the system i = f(x,v), a = -<pL(x.v) B6) in R" ? L issuing from (x, a) with free right endpoint and fixed time t. 4.3. Stochastic optimization The method considered in Section 4.1 for constructing generalized solutions to the Hamilton-Jacobi-Bellman equation, can be used in more general situations, for stochastic or infinite dimensional generalizations. Here we formulate some results obtained in Kolokoltsov A998b, 2001), where one can find the proofs as well as the applications of
1446 ?. Pap these results to the construction of WKB-type asymptotics of stochastic pseudodifferential equations. We consider the equation du + H it, x, -^ J dt + (c(t, x) + g(t, x) — J о dW = 0, B7) where ? e R", t ^ 0, W = (W',..., W") is the standard и-dimensional Brownian motion, see Brooks B002), this Handbook (o, as usual, denotes the Stratonovich stochastic differential), u(t,x,[W]) is an unknown function, and the Hamiltonian H(t,x,p) is convex with respect to p. This equation is called the stochastic Hamilton-Jacobi- Bellman equation. First, we explain how this equation appears in the theory of stochastic optimization. Second, we develop the stochastic version of the method of characteristics to construct classical solutions of this equation. Third, on the basis of the methods of idempotent analysis (and on analogy with the deterministic case), we construct a theory of generalized solutions of the Cauchy problem for this equation. We start from the controlled stochastic dynamics which is described by the equation dx =f(t,x,v)dt + g(t,x)odW, where the control parameter ? belongs to some metric space V and the functions / and g are continuous in t and ? and Lipschitz continuous in X. Let the income along the trajectory ?(?), ? e [t,T], defined by the starting point ? = x@) and the control [?] = ?(?), ? e [t, T], be given by the integral lf(x,[v],[W])= f b(z,x(T),u(T))dT+ ? c(T,x(T))odW. We will find an equation for the cost (or Bellman) function u(t, ?, x, [W]) = sup(ll(x, [v], [W]) + u0(x(T))), ? where the supremum is taken over all piecewise smooth (or equivalently, piecewise constant) controls [v] and mo is some given function (terminal income). Our approach is based on the following fact: if we approximate the noise W in some stochastic Stratonovich equation by smooth functions, then the solutions of the corresponding classical (deterministic) equations will tend to the solution of the given stochastic equation. For smooth functions W, we have the dynamics i = f(T,x,v)+g(T,x)W(T) and the integral income j [b(z, ?{?), ?(?)) + c(t,x(t))W(t)] dx.
Pseudo-additive measures and their applications 1447 The Bellman equation for the corresponding deterministic (nonhomogeneous) optimization problem is given by ди / ди\ ( ди\ о — +sup{b(t,x,v) + f(t.x,v)—\ + lc(t,x)+g(t,x)—)W(t)=0. Rewriting this equation in the stochastic Stratonovich form, we obtain B7) with H(t,x, p) = sup(b(t, ?, v) + pf(t,x, v)). V We restrict on the following two particular cases. (i) Let с = 0 and g = g(t) be independent of X. Differentiating B7), we obtain ди (дН ЭН д2и\ д2и дх \ дх др дх1 / дх1 and using the relationship w о dW = w dW + - dwdW 2 between the Ito and the Stratonovich differentials, we obtain the equation for и in the Ito form / du\ 1 ? д~и ди du + H(t,x,ir)dx + -g--^dt+g—dW=0. \ дх/ 2 дх1 дх For the mean optimal cost function и, this implies the standard second-order Bellman equation of the stochastic control theory: ди ( ЭиЛ 1 -, д'и — + ? ?, jc, — + - g2 —^ = 0. dt \ dxj 2s dx2 (ii) For g = 0 B7) reduces to du + H(t,x,-^-}dt+c(t,x)dW=0, B8) since in this case the Ito and the Stratonovich differential forms coincide. Particularly, we formulate a result for the case of B8) with ? and с that do not explicitly depend on t. THEOREM 4.9. For an arbitrary initial function uq(x) bounded below, there exists a unique generalized solution of the Cauchy problem for B8)/or all t ^ 0.
1448 ?. Pap Approximating nonsmooth Hamiltonians by smooth functions and defining the generalized solutions as the limits of the solutions corresponding to the smooth Hamiltonians, Kolokoltsov A998b, 2001) found (on analogy with the deterministic case) that formula for generalized solutions remains valid for nonsmooth Hamiltonians. 4.4. Option pricing We shall use a part of Kolokoltsov A998b, 2001) for describing the subject. The well- known Black-Sholes and Cox et al. A979) formulas are basic results in the modern theory of option pricing in financial mathematics. They were obtained by means of stochastic analysis. Various generalizations of these formulas were proposed using more sophisticated stochastic models for common stocks pricing evolution. The systematic deterministic approach to the option pricing leads to a different type of generalizations of Black-Sholes and Cox-Ross-Rubinstein formulas characterized by more rough assumptions on common stocks evolution. This approach reduces the analysis of the option pricing to the study of certain homogeneous nonexpansive maps, which are "strongly" infinite dimensional: they act on the spaces of functions defined on sets, which are not (even locally) compact. Kolokoltsov A998a) has shown what type of generalizations of the standard Cox-Ross- Rubinstein and Black-Sholes formulas can be obtained using the deterministic (actually game-theoretic) approach to option pricing. Characterizing the class of homogeneous nonexpansive maps which appear in these formulas, considering first a simplest model of financial market with only two securities in discrete time, then its generalization to the case of several common stocks, and then the continuous limit. A simplest model of financial market deals with only two securities: the risk-free bonds (or bank account) and common stocks. The prices of the units of these securities, В = (Bk) and S = (irrespectively, change in discrete moments of time к =0, 1,... by the recurrent equations Bk+\ = pBk, where ? ^ 1 is a fixed number, and Sk+i = &+i &, where & is an (a priori unknown) sequence taking value in a fixed compact set McR. Denote by и and d respectively the exact upper and lower bounds of ? (и and d stand for up and down) with 0 < d < ? < и. There were investigated two cases: (i) ? consists of only two elements, its upper and lower bounds и and d, (ii) ? consists of the whole closed interval [d,u]. Case (i) corresponds to the Cox-Ross-Rubinstein model and case (ii) stands for the situation when only minimal information on the future evolution of common stocks pricing is available. An investor is supposed to control the growth of his capital in the following way. Let Ck-1 be his capital at the moment к - 1. Then the investor chooses his portfolio defining the number у* of common stock units held in the moment к — 1, and we have Ck-1 = YkSk-1 + (Ck-1 - YkSk-1). where the sum in brackets corresponds to the part of the capital laid on the bank account (all operations are friction-free). The control parameter y* can take all real values, i.e., short selling and borrowing are allowed. In the moment к the value ijk becomes known and
Pseudo-additive measures and their applications 1449 thus the capital becomes equal to Ck = YkHkSk-\ + (Ck-1 - YkSk-1)p. The strategy of the investor is any sequence of numbers ?\,..., ?? such that each ?-} can be chosen using the whole previous information: the sequences Co, · · ·, Cj-\ and SO,..., Sj-\. The investor, selling an option by the price ? = Co should organize the evolution of this capital (using the described procedure) in a way that would allow him to pay to the buyer in the prescribed moment ? some premium f(Sn) depending on the price S„. In the case of the standard European call option, which gives to the buyer the right to buy a unit of the common stocks in the prescribed moment of time ? by the fixed price K, the function / has the form f(S„) = nax(Sn-K,0). Therefore the income of the investor will be C„ - f(Sn). The strategy ?\,..., y„ is called a hedge, if for any sequence ?\,..., ?? the investor is able to meet his obligations, i.e., C„ - f(S„) ^ 0. The minimal value of the initial capital Co for which the hedge exists is called the hedging price P/, of an option. The hedging price P/, will be called correct (or fair), if moreover, C„ - f(Sn) = 0 for any hedge and any sequence ?y. The correctness of the price is equivalent to the impossibility of arbitrage, i.e., of a risk-free premium for the investor. It was in fact proven in Cox et al. A979) (using some additional probabilistic assumptions on the sequence ?]) that for case (i) the hedging price P/, exists and is correct. It is known that when the set ? consists of more than two points, the hedging price will not be correct anymore. Kolokoltsov A998a) proved using exclusively deterministic arguments (idempotent analysis) that both for cases (i) and (ii) the hedge exists and is the same for both cases whenever the function / is nondecreasing and convex (possibly not strictly). Some generalizations of the standard Black-Sholes formula (characterized by more rough assumptions on the underlying common stocks evolution) were obtained which reduce the analysis of derivative securities pricing to the study of homogeneous nonexpansive maps, which act in infinite dimensional spaces. 5. Non-commutative and non-associative pseudo-operations We present in this section results obtained in Pap and Vivona B000) related the relaxation of the properties of pseudo-addition and pseudo-multiplication and the application of obtained results on nonlinear PDE. DEnNlTlON 5.1. We call real operations ? and ? generalized pseudo-addition and generalized pseudo-multiplication (from the right), respectively, if they satisfy the following conditions: (i) ? and 0 are functions from C2(E2), (ii) the equation t ? t = ? for given ? is uniquely solvable,
1450 ?. Pap (iii) ? is right distributive over ?: (Dr) (*?H?? = (.???)?()>?;:)· Changing in the previous definition in (iii) that ? is left distributive over ?: (D,) zQ(x®y) = (zQx)®(zQy). we obtain generalized pseudo-addition and generalized pseudo-multiplication (from the left), respectively. Since all considerations are analogous we shall take in the future considerations only on the right case. We shall give a representation theorem for generalized pseudo-addition and generalized pseudo-multiplication. For that purpose first we shall prove several lemmas. First of all for a function of two real variables и =и(х, у) we shall use the following notations Эй Эй Эй (и)\= — =и_х, (иJ= — =иу, (к)|2 = т—г- =".rv дх ду дхду The following representation theorem is obtained in Pap and Vivona B000). THEOREM 5.2. Two real operations ? and ? from C2(E2) with (х<Эу)\ ^Oforallx, y, are generalized pseudo-addition and pseudo-multiplication {from the right), respectively, if and only if they are representable by uQz = h-](f(z)h(u)+a(z)), B9) and Х®у = И~]((И(х)ШИ(у)), C0) where f and a are arbitrary functions from C2(E), the function h is given by h(u) = ("{e-W^'^ds, Jy the operation ?' is given by ? ?' у = ?~' (? ? y), where ? is defined by ? (?) = ? ? ? and ЕВ satisfies the functional equation (u EB v)f(z)+a(z) = (uf(z)+a(z)) EB (vf(z)+a(z)). THEOREM 5.3. If® and ? are generalized pseudo-addition and pseudo-multiplication (from the right), respectively, and additionally that (a) x © 0 = a (constant) for every x;
Pseudo-additive measures and their applications 1451 (b) there exists yo such that (x Qyo)\ ? 0; then we have x®y = h-\h{x) + h{y))x 0у = Г' (f(y)h(x)), C1) where (for ?, ? arbitrary non-zero constants) h(x) = (Y®PJ[{^^dx, C2) J (?®?J and f is an arbitrary function fmm C"(R) and /@) = 0. Remark 5.4. (i) Theorem 5.3 holds also for ?, 0 e C1, then also / e C1. (ii) As a consequence of Theorem 5.3 we obtain that ? is also commutative and associative. Specially, for / = h we obtain the g-calculus, see Pap A993a, 1995). Definition 5.5. For given generalized pseudo-addition ? and generalized pseudo- multiplication О (from the right) we shall call any function h from Theorem 5.2 in B9) and C0) and Theorem 5.3 in C1) and C2) a generator of operations ? and ©. Now we can give a complete characterization of generalized pseudo-addition and pseudo-multiplication THEOREM 5.6. Let ? and © be generalized pseudo-addition and pseudo-multiplication, respectively, with (? ? у) \ ? Ofor all x, y. Then they can be represented in one of the following forms (i) X®y= h~\h(y) + i/(h(X) - h(y))), (ii) ? ? у = h-l(h(y) + Y(h(x) - h(y))), xQy = h-\a{y)-h{x)), with ? odd function; (iii) ? ? у = й-1 (h(y) +a(h(x) - h(y))), xOy = h-i(h(x)f(y)+a(y)), where ?(?) = t ???,? is a constant and ЕБ, a, f, h are functions from Theorem 5.2.
1452 ?. Pap 5.1. Applications on nonlinear PDE First order equations. We shall consider the following nonlinear PDE of the first order, see Goard etal. A997), c\(x,y)u_x +c2(x,y)uy = F(u), C3) where c\, ci and g are given functions. Taking dt о-/ ' F(t) we introduce using (i) from Theorem 5.6 generalized pseudo-addition ? and pseudo- multiplication © (from the left) for arbitrary ? и ? ? = h~\h(v) + \j/e(h(u) - h(v))), и О ? = /г (h(v) +?(?)), respectively, for an arbitrary function a, and ?^(?) = G~](s + G(x)) for arbitrary function G from C1. Then by Goard et al. A997) we have that if и ? and иг are solutions of Equation C3) then also for every ? mi ? u2 = /г (h(u2) + \l/e(h(ui)- h(ui))) is a solution of Equation C3). For arbitrary but fixed real number a and a solution и of Equation C3) we have that also for every function a а<Эи =h~*(h(u)+a(a)) is a solution of Equation C3). Summarizing, we have that if u\ and иг are solutions of Equation C3) and a\ and a2 arbitrary but fixed real numbers, then also for every ? and every a (?? ???)?(?2?) is a solution of Equation C3). Taking specially c\(x, y) = c2(x, y) = 1 and F(u) = u2 we obtain the equation "л + «у = " · Then we have h{u) = -\/u. In this case we have for the operations ? and О 1 1/?-?A/?- 1/и) 1 и 0 ? = 1/? — а(и)
Pseudo-additive measures and their applications 1453 where ?(?) = - \n(e' + 1), by Goard et al. A997), the same statement as for the general case. Multi-dimensional aggregation of solutions. We can consider the extension to «-dimensional case of Equation C3) for F ? 0 " 9 Ycl(x],...,x„)-^ = F(u) C4) ?[ dx' considering the m-dimensional pseudo-superposition principle. We have that if и ?...., u,„ are solutions of Equation C4) then also the generalized geometric mean of solutions G(u ? ,...,«,„) = /-' (/(«?)"' ···/(«,„)"'") and the generalized arithmetic mean of solutions A(« , u„) = (/-' (fl,/(ii,)"+·· ¦+a„J(umY)L,\ where f{x) = exp(/i dt/ F(t)), a; ^0 (i = 1,.. .,m) and ? ? ? \-am = 1, and ? ^0, are solutions of Equation C4). Second order equations. We shall need the following special class of operations ? and ?. THEOREM 5.7. Let ? be an arbitrary but fixed positive real number. If к is a strictly monotone positive function and belongs to C~, then operations defined by x®y=k~] (?k(x) + k(y)), ? ? у = к~' (к(хУк( у)) are generalized pseudo-addition and pseudo-multiplication (from the left), respectively, which belong to the class (i) in Theorem 5.6 (introducing an operation ?' by ? ©' у = У ? ?). Specially for ? = 1 we obtain a semiring which is based on a generator (g-calculus). Example 5.8. We consider again the Burgers equation A8). We introduce generalized pseudo-addition and generalized pseudo-multiplication (from the left) from Theorem 5.7 given by u®v = к~*^к(и)+к^)). uQv = k~\k(u)ek(v)), respectively, for special k(u) = e~"^2A). Then by Goard et al. A997) we have that if u\ and и 2 are solutions of Equation A8) then also for every ? > 0 и ? ? «2 = -2? 1?(?*>-"'/|2?) + е'"-/ак)) is a solution of Equation A8).
1454 ?. Pap For arbitrary but fixed real number a and a solution и of Equation A8) we have that also for every ?, ?© и =?? + u is a solution of Equation A8). Summarizing we have that if и ? and И2 are solutions of Equation A8) and a\ and сь arbitrary but fixed real numbers, then also for every ? > 0, (? ? ? u\) ? (?2 ? u2) is a solution of Equation A8). Note that if и is a solution and a is an arbitrary real number then и © ? is a solution of Equation A8) if and only if ? = 1, which gives us the g-calculus. Example 5.9. Consider the following PDE of the second order ? ? ? ????](?\,..., x„)ux,Xj + Y^bk(x\,..., x„)uXk +c(x\,..., x„)u ?'=? j=\ i=l . ? ? = - ???'^??' ¦ ¦¦'Xn)uXiuyj, C5) " '=1 7=1 where ?,-,-, fa, с are given functions and ? is a constant different from 1. Taking k(x) = ? for ? > 0 and ? ? 1 we consider generalized pseudo-addition and pseudo- multiplication (from the left) from Theorem 5.7 и ? ? = (??'"? + ?'-?)?/(|-?), и © ? = и"?, respectively. If u ? and И2 are positive solutions of Equation C5) then also for every ? > 0 и ? ? И2 = k'] (sk(u ?) + k(u2)) is a positive solution of Equation C5). For arbitrary but fixed positive real number a and a solution и of Equation C5) we have that a © и = aeu is a positive solution of Equation C5). Taking ? ? (? ? ? иг) with a = \????\]/? and ? = \?\ \/\?2\ we obtain Levin's A970) result. We note that if и is a solution and a is an arbitrary positive real number then и ? ? is a solution of Equation C5) if and only if ? = 1, which gives us the g-calculus. 5.2. Corresponding pseudo-measure In this section we shall consider special class of the generalized pseudo-operations ? and О initiated by Theorem 5.7. Dehnition 5.10. Let ? be an arbitrary but fixed positive real number and к a positive strictly monotone continuous function defined on R or [0, oof, then operations defined by x e у = Г' (ek(x) + k(y)), ? ? у = k~' (k(xfk(y)) are called к -pseudo-addition and A:-pseudo-multiplication (from the left), respectively, on ? or [0, oof.
Pseudo-additive measures and their applications 1455 Since generalized pseudo-operations are not necessarily commutative nor associative operations we need to define pseudo-sum of ? elements and pseudo-sum of pseudo- products. Definition 5.11. Let ? and ? be operations from Theorem 5.7 and a, e R (or a, e [0,oo)) i e {1,2,...и). Then 0 щ = (· · · ((?, ? a2) ? a3) ? · · ·) ? a„ = k~' I ^?"-1 ?(?,) J. i=l \?=? / Now, pseudo-sum of pseudo-products is ?(?, 0ft) = r' ij2e"^k(a,Y ¦ *(ft)V i = l \i=l / Let С be a family of semiclosed subintervals of R, then С is semiring of sets, i.e., A) 0eC, B) ?, ? eC=> АП5 eC, C) for all ?, ? e С there exists С ?,..., C„ from С such that Cj П C, = 0 for i ^ ;' and A\« = U*=|C*. Let ]?,/>] с R be an arbitrary but fixed semiclosed interval, let ? be an и-partition of ]a, b], a = xo < x\ ^ · ·· ^ xn =/?,andletC]o.ft] =C(~){]a,b]}. We denote by C? h] the subfamily of the family Су,.ь\ which consists of all subintervals ]*,-, x,+ \ ] obtained by the partition ? and the interval ]a, b]. If a = xq < x\ 4. ¦ ¦ ¦ < jt„ is the «-partition ? of the interval [a,b], the (n + l)-partition p' will be obtained if we keep all the points from previous partition and add one more point and renumerate the points of the new partition in the increasing order. After s-repetition of the preceding procedure we obtain (n + s)-partition pis). First, we shall define by Pap and Stajner-Papuga B001) ?-measure on C'^ fc) with respect to a given partition p. Definition 5.12. A mapping ?? :C!'(l h, -> [0, oo[ is called ?-measure for given «-partition ? if Mp(]^.*,-+.])=*-|(igS?). C6) where i e {0, ?,.,.?- 1), ?;,(]?,?]) = k~\b - a), and if for some у e]a,b] such that ¦*; < У < *i+b we have for ?,,-: C? ,;| -+ [0, oof Мр'(]*.-.У])=*-'(^?) and дИ]^.+|])=Г'(^т)' C7> where p' is the (n + l)-partition obtained from ? partition plus the point y.
1456 ?. Pap Remark 5.13. Equation C7) transform ©-measure ?? to ©-measure ??· on (n + 1)- partition p' that differ from starting «-partition at most one point y, in such manner that following equality is valid ?-?{]??,??+\]) = ?,,'(]??, у]) ® ??'(]}·,??+\]). C8) We remark that with respect to ?? unchanged are the values m p'{]xj,Xj+\\) for j > i; the values mp'(\xj,y]) and тр'(\у,х,+ \]) are given by C7), and for j < i we have mp'(]Xj,Xj+]]) = k-](k(mp(]Xj,Xj+i])/s). Now, for any interval ]c, d] с ]a, b] we can obtain the value ??"{?, d]) by ?? and, at most, (n + 2)-partition. THEOREM 5.14. Let ?? be ?-measure from the previous definition, then following holds (a) ?,,@) = *-'(()), (b) MlXo]*/.*'¦+!]) = ®i=o Vp(]xi,Xi+\])- Now, the ?-integral can be constructed. The construction of ?-integral of continuous function is analogue to the construction of Riemann's integral through the Riemann's integral sums. In order to construct ?-integral we shall need the sequence of partitions of some interval [a, b].The ?-measure ?? defined for и-partition will give us ?-measure ??· defined for (« + l)-partition ? by procedure given in C7). Repeating this procedure s-times we obtain ?-measure ? <,> defined for (« -M)-partition pis). DEnNlTlON 5.15. Let / : [a,b] -> [0, oo[ be a continuous function. The ?-integral of function / is defined by /(?.?) /«+.V-I \ ???.?= lim ?(/(%?H?A?''·?'+?1)) Sx.b\ ??,(,)(].?,..?? + ,|)^*-1@)\ ;=0 / (if the limit exists). The ?-integral over the interval [a,b] does not depend on the partition of interval [a, b]. THEOREM 5.16. The ?-integral ofa continuous function f :[a, b] -+ [0, oo[ with respect to ???)? any partition ? over the interval [a, b] is given by: /¦(?.?) / rh \ / /???=?? (k(f(t)))f dt). C9) J[u.b\ \Ja / Since the integral on the right side of C9) does not depend on the partition of interval [a,b], we shall denoted the right side of C9) by f^f f. For ? = 1 the g-integral is obtained.
Pseudo-additive measures and their applications 1457 Next theorem will give us some basic properties of ?-integral. First, we shall introduce a transformation [·]* in the following manner: [/]*(*) =*_?(*?/?(/(*»). where / is a continuous function that maps interval [a, b] into [0, oof. If this transformation is applied on constant a from [0, oof we obtain: [?]*=*-'(*?/?(?)). THEOREM 5.17. Let f andh be continuous functions that maps interval [a, b] into [0, oof and let a be a constant from [0, oof, then: (b)«oCf)/ = Cf)(^o/). We can introduce a generalization of g-derivative from Pap A993a). Dehnition 5.18. Let / e С' [a, b]. Generalized derivative of / is if it exists. The generalized derivative have the following properties @ C_f Г-е) + k(f(a)) = k(f(x)) and (jj^f /)<'¦'> = *(/(*)). (ii) (/? a)"'f) = /('-f) ? causing the representation of ?-integral obtained in Theorem 5.16, we can introduced a generalization of ^-convolution. Dehnition 5.19. Let /, h : [0, oof-* [0, oof be continuous functions. The pseudo- convolution / * h : [0, oof—> [0, oof of functions / and h is given by f*h(x) = r](j\k(f(x-t))f(k(h(t))ydt\ THEOREM 5.20. Let f, g:[0, oof-* [0, oof be continuous functions. Then the pseudo- convolution of functions f and h has the following presentations: ?-(?.?) f*h(x)= ([/]*(*-0Ой@)¦ J 10.x] It follows that pseudo-convolution is a commutative operation.
1458 ?. Pap 6. Conditional distributive real semiring 6.1. Conditional distributivity A uninorm U (respectively ?-norm T) is a commutative, associative, monotone binary operation on the unit interval [0, 1] and for some 0 < e < 1 we have U(x,e) = ? (respectively T(x, 1) = x) for all ? e [0, 1]. Throughout this section, let U be a left- continuous uninorm or ?-norm and S a continuous ?-conorm such that U is conditionally distributive over S, i.e., they satisfy the property (CD): U(x,S(y,z)) = S(U(x,y),U(x,z)) for all x,y,z e [0, 1] such that S(y,z) < 1. In this way ([0, 1], 5", U) is a conditionally distributive semiring. For ?-norm U = ? there was obtained in Klement et al. B000a) the following characterization THEOREM 6.1. A continuous t-norm ? is conditionally distributive over a continuous t- conorm S if and only if there exist a e [0, 1], a strict t-norm T* and a nilpotent t-conorm S* such that the additive generator s* of S* satisfying s*(l) = 1 is also a multiplicative generator of T* such that T(x,y) = ?·7?(?) forx,ye[0,a], a + (l-a)r*(f^,g) forx,ye[a,l], min otherwise. (t-norm ordinal sum) where T\ is an arbitrary continuous t-norm and [ max otherwise, (t-conorm ordinal sum). If S is a strict ?-conorm generated by an additive generator s, then the only distributive left-continuous uninorms with respect to S are associative compensatory operators (see Klement et al. A996)) generated by multiplicative generator с ¦ s, ce ]0, oof (with the convention 0 · oo = 0). 6.2. Hybridprobability-possibilistic measure and integral Based on the structure ([0,1], S, U) with conditional distributivity it is developed in Klement et al. A996, 2000a, 2000b) the so called E", i/)-integral. We introduce now an integral based on S'-measure, see Section 1.2.1, and a conditionally distributive pair E", U). Denote by ? the set of all .^-measurable functions from X to
Pseudo-additive measures and their applications 1459 [0, 1]. As usual, a measurable step function ?: X ->· [0, 1] is a measurable function which assumes only finitely many values, and the set of all step functions will be denoted S. If Range(<p) = {a\,a2, ¦ ¦ -,an} with ?, ? ?, whenever i ? j, and if A, = ?~] ({a,}), then there is a canonical representation of ? given by /1 ?= S i/(a,,lA,), D0) where iA(X) = {e0 e for ? e A, otherwise. We have U(a, 1д) = ???, where 1 for ? e A, 10 otherwise. Observe that representation D0) is independent of the ?-conorm S under consideration. As classical measure theoretic result we have THEOREM 6.2. For each measurable function f:X -*¦ [0, 1] there exists a sequence (<Pn)n6N of step functions such that ?„ / f uniformly. Dehnition 6.3. Let m : ? -+ [0, 1] be an 5"-faithful 5"-measure. (i) Given an 5"-w-faithful partition С = {Ск- к е К) the E", i/)-integral of a measurable step function ?: ? -> [0. 1] (which is represented as in D0)) is defined by flS.U) ? ? \ / q>dm= S I S i/(flj,m(A,nCi)) )¦ (ii) The (S, i/)-integral of a measurable function / : X -> [0, 1] is defined by •(S.f) /-(S-f) / / iiw = lim / ?„ dr, Jx h^^Jx where (<p„)fieN is a sequence of step functions such that ?„ / f (which exists by Theorem 6.2). (iii) The (S, i/)-integral of a measurable function / : X -> [0. 1] over a set A e ? is defined by •(S.f) /-(S.i;) / /dw= / U(lA,f)dm. J A JX
1460 ?. Pap Integral of a step function given in Definition 6.3(i) is independent of the choice of the 5-m-partition and the E, i/)-integral of a measurable function in Definition 6.3(iii) is independent of the choice of the sequence (<p„)„eN °f steP functions such that ?„ / f. The basic properties of E, i/)-integral are contained in the following theorem. THEOREM 6.4. Let m : ? -> [0, 1] be an S-faithful S-measure and f,geM- (i) Iff ^g then MS,U) MS.U) / fdm ? gdm. Jx Jx (ii) If А, В e ? with ? ? ? = 0 then / fdm =S\ \ fdm, / fdm). Jaub \Ja Jb J (hi) IfU(S(f, g), a) = S(U(f, a), U{g, a)) for all a e [0, 1] then / S(f,g)dm = sll fdm, I gdmj. Remark 6.5. It should be mentioned that for a ?-norm U = ? the condition in Theorem 6.4(iii) is fulfilled for all f,geM only if S equals 5м- Because of conditional distributivity of Г over 5 (iii) in Theorem 6.4 also holds if S(f(x),g(x)) < 1 for all ? e X. For the E, i/)-integral some classical convergence theorems such as the monotone convergence theorem and Fatou's lemma, can be proven, see Klement et al. B000b). THEOREM 6.6 (Beppo Levi). For every non-decreasing sequence of measurable functions (fn)neN, ie-, fn ^ /(,+ i, with Ит^эс f„ = fe M, we have ~(S.U) MS.U) I f dm = lim / /„ dr, Jx "^xJx THEOREM 6.7 (Fatou lemma). Let (/„) be a sequence of measurable functions. Then •(S.G) MS.U) I liminf/,, dm < liminf / /„ dm. Jx "^°° "^x Jx The E, i/)-integral is an extension of 5-measure, see Klement et al. B000b). Theorem 6.8. (i) For each f e Л4 the function m / : ? —> [0, 1 ] defined by (S.U) mj(A) = I f dm, J a is an S-faithful S-measure.
Pseudo-additive measures and their applications 1461 (ii) For all f,geMwe have f(S.U) riS.U) flS.U) / fdmg= gdmf= t/(/, J X J X J X g)dm. Theorem 6.8(i) shows that the (S, i/)-integral is a proper extension of the underlying S'-measure since, in the case where / is the constant function with value 1 on X, we have rrif = m. We remark that for the S'-measure m : V(X) -> [0, 1] specified by m{A) = 1 for all non-empty subsets A of X and for SM-measure given by M/(A) = sup{/(.x): xeA] we always have ?/ = m f. For each S-faithful S'-measure m : ? —>¦ [0, 1] we consider the subfamily Mm of ? defined by M,„ = {f e M: X is S-m/-faithful). Obviously, for each ^M-measure m: ? ->· [0, 1] we have Mm = M. We have that the (S, i/)-integral is {/-homogeneouson the family Mm- THEOREM 6.9. Let m : ? -+ [0, 1] be an S-faithful S-measure. Thenfor every a e [0, 1] HS.U) / r(S.U) / U(a,f)dm = Llla, / fdm\. For special uninorms or ?-norms U and ?-conorms S, we obtain some integrals which are well-known in the literature. Example 6.10. (i) Let m: ? -> [0, 1] be an ^-measure which is also a (?-additive) measure. Then it is a S^-faithful S^-measure and the (Sl, Tp)-integral of a function f &M coincides with the classical Lebesgue integral of /. (ii) For each left-continuous uninorm or ?-norm i/, for each S^-measure m : ? —> [0, 1 ] and for each function / e ? we have /. (SM.G) fdm = sup{U(a,m({xeX: f(x)^a}))\ae[0, 1]}. IX If m is a completely maxitive measure then we have •(SM.f) /, fdm = sup{U(f(x),m({x})): xe X). ? Remark 6.11. (i) The procedure for the construction of the E", i/)-integral used in Definition 6.3 works also if we replace uninorm or ?-norm U by an operation *: [0, !]"—>·
1462 ?. Pap [О, 1] which is left-continuous, monotone, has an annihilator 0 and a left neutral element e e @, 1 ], and which is conditionally distributive over S. Moreover, Theorem 6.4(i), (ii) as well as Theorems 6.6 and 6.7 still hold. (ii) The restriction to S'-measures and measurable functions whose ranges are subsets of [0, 1] causes no loss of generality. Let [a, b] с [-oo, oo] such that ([a, b], ©) is an /-semigroup with neutral element a (see Klement et al. B000a)). Consequently, there is some strictly increasing bijection ?: [a, b] —>¦ [0, 1] such that the continuous r-conorm S defined by S(x,y) = w(w-\x)®w-\y)) is isomorphic to ?. Let * be a binary operation on [0, 1] as in (i), which is conditionally distributive over S, and define the operation ? on [0, 1] by xQy = ?(?~\?)* ?-1 (}>))· Observe that ? is a left-continuous monotone operation on [a, b] which has annihilator a and possesses some left neutral element, and which is conditionally distributive over ?. Then for an ?-measurable function / : X -> [a, b] and an ©-faithful ©-measure m : ? —> [a,b] can be introduced in a complete analogy to our construction the integral jf-Q) fdm (compare Section 1.3, Maslov and Samborskii A992), Pap A990)), and it can be represented by the (S, *)-integral via (S.*) / fQdm=w'][j (wof)d(coom) \. Example 6.12. (i) For [a,b] = [0, oo] we obtain the integral introduced in Sugeno and Murofushi A987), and for ([a, b], ?. ?) = ([0, oo], +, ¦) we again come back to the classical Lebesgue integral. (ii) If the operation ? in the semiring ([a, b], ?, ?) is not idempotent, then the operations ? and ? are generated by some uniquely determined strictly increasing bijection g : [a, b] —> [0, oo] via ^©^^"'(gW + gOO). xOy = g-*(g(x)g(y)). The corresponding (?, ©)-integral was studied in Section 1.4, Pap A990) (called g- integral in Pap A993a, 1993b)), and it has the special form J fOdm = g-](J(g0f)d(gom) where the integral on the right hand side is a Lebesgue integral. Under some additional conditions, also a generalized derivative (g-derivative)can be defined (Pap, 1993a, 1993b) which turns out to be useful when solving some differential equations.
Pseudo-additive measures and their applications 1463 (iii) If 5 is a continuous Archimedean ?-conorm with an additive generator s : [0, 1] —> [0, oo] and if we replace the ?-norm ? by the binary operation * given by a * b = s~ (a ¦ s(b)) and if we assume that X is w-achievable if s(l) < 1 (in the sense of Weber A984)), we can repeat the whole construction of Definition 6.3 for the E, *)-integral, and we obtain the integral introduced in Weber A984). 6.3. Hybrid utility function In order to generalize decision theory to non probabilistic uncertainty, one approach is to generalize mixture sets. In the paper by Dubois et al. B000) there are characterized the families of operations involved in generalized mixtures, due to a previous result on the characterization of the pairs of continuous ?-norm and ?-conorm such that the former is conditionally distributive over the latter, see Theorem 6.1. A basic notion in probability theory is independence. The main issue in probabilistic independence is the existence of special events A\,...,A„ such that P(A\ ? ¦¦¦ ? ??) = ?'/=? ?(??- Such events are called independent events. In order to preserve the computational advantages of independence, any operation * for which it could be established that P(A\ П- ¦ ¦ П A„) = *'.'=| P(Aj), would do. However the Boolean structure of sets of events and the additivity of the probability measure, impose considerable constraint on the choice of operation *. In the paper by Dubois et al. B000) is studied the possible operations * when changing ? for a pseudo-additive (decomposable) measure based on a ?-conorm 5. A first remark is that it is natural to require that * be a continuous triangular norm. If A = X is a sure event, then A and X are independent, and it follows that m(A П X) = m{A) * m{X) = m{A) * 1 = m(A). Commutativity and associativity of * reflect the corresponding properties for conjunctions. It is also very natural that * be non-decreasing in each place and continuous. We try to find which triangular norms can be used for extending the notion of independence for pseudo-additive measures in the sense of a prescribed triangular conorm. Since the term independence has a precise meaning in probability theory, we shall speak of separability in the framework of 5-measures. Two events A and В are said to be ^-separable if т(АП B) = m{A)*m{B) for a triangular norm *. It turns out by Dubois et al. B000) that the only reasonable pseudo-additive measures admitting of an independence-like concept, are based on conditionally distributive pairs E, T) of ?-conorms and ?-conorms in the form (Em, 5l), (T\, Tp))a, see Theorem 6.1, namely. (a) probability measures (and * = product); (b) possibility measures (and * is any i-norm); (c) suitably normalized hybrid set-functions m such that there is a e ]0, 1[ which gives for A and В disjoint (m{A) +m(B) - a if m(A) > a, m{B) > a, max(m(A), m{B)j otherwise,
1464 ?. Pap and for separability: m(Af)B) = а + ША)-а)шВ)-а) if m(A) > a, m(B) > a, 1-я rm(A) mi B) > а-Г|(«1,^) ifin(A)<a,in(B)<fl, min(m(A), m(B)) otherwise, Any probability distribution on a finite set X can be represented as a sequence of binary lotteries. A binary lottery is 4-uple (A,a,x,y) where А с X and a e [0, 1] such that ? (A) = a, and it represents the random event that yields X if A occurs and у otherwise. Let ? be a probability on X such that p, = P(xj), x, e X. Assume X = {x\,xi,xi} then ? can be described by the following. The binary tree is obtained as follows: First partition X into {? |} and {x2, хз} with probabilities ? \ and pi + рз, respectively, then partition ? \ {? ?} into {*2} and {*з} with probabilities pi/ipi + Pi) and pj/ipi + pi), respectively. The two trees are equivalent, provided that the probability of jc, is calculated by performing the product of weights on the path from the root of the tree until the leaf x. More generally, suppose m is a S'-measure on X = {x\,xi,xy\ and rrij = т{{х(}). Suppose we want to decompose the ternary tree into the binary tree so that they are equivalent. Then the reduction of lottery property enforces the following equations S(y\,\>2)=\, ?(?,??) =W2, ?(?,??) = тт,, where ? is the triangular norm that expresses separability for S'-measures. The first condition expresses normalization (with no truncating effect for ?-conorm S allowed). If these equations have unique solutions, then by iterating this construction, any distribution of a S'-measure can be decomposed into a sequence of binary lotteries. This property is basic in probability theory since it explains why probability trees can be used as a primitive notion for developing the notion of probability after Shafer A996). Turning the S'-measure into a sequence of binary trees leads to the necessity of solving the following system of equations <*l = ?(?, ?>|), ?? = ?(?, ??), S(v\,vi)=\ D1) for given ? ? and <*2- Assuming that T\ = min we have solved D1) completely in Dubois et al. B000) and exhibited the analytical forms of (?, v\, vi). We define the set ?$.? of ordered pairs (?, ?) in the following way *s.a = {(<*,?): (?,0)?]?, 1[\ ?+?=\+?] U {(?, ?): min(a, ?) ? a, max(a, ?) = 1}. A hybrid mixture set is a quadruple (<?, M, T, S) where Q is a set, (S, T) is a pair of continuous ?-conorm and ?-norm, respectively, which satisfy the condition (CD) and ? :Q2 ? ?$.? —> Q is a function (hybrid mixture operation) given by M(x,y-a,fi) = S(T(a,x),T(fi,y)).
Pseudo-additive measures and their applications 1465 It is enough to restrict to the case ({Sm, Sl), {T\,Tp))a. Then it is easy to verify that ? satisfies the axioms (M1)-(M5) on 0s.a, where (Ml) M(x, y; 1,0) = л; (M2) M(x, y; ?, ?) = M(y. ?; ?, a); (M3) M(M(x, y; ?, ?), у; ?, ?) = ?(?, у; ? (?, ?), ?(?(?, ?), ?(?, 1))); (?4) ?(?,?;?,?) = ?; (?5) ?(?(?, у; ?, ?), ?(?. у; ?. ?); ?, ?) = ? (?, y; S(T(a, ?), ? (?, ?)). 5"(?(/3, ?), ?(?, ?))) holds for all ?. у e Q and all (?, ?), (?. ?), (?, ?) e 4>s.a ¦ This kind of mixtures exhausts the possible solutions to (M1)-(M5). Let us show the main appeal of (M1)-(M5). Let (S. T) be a pair of continuous i-conorm and ?-norm, respectively, of the form ((SM,SL),(T\,TP))a. Let u\,u2 be two utilities taking values in the unit interval [0, 1] and let ??, ?? be two degrees of plausibility from ?$.? ¦ Then we define the optimistic hybrid utility function by means of the hybrid mixture as U(u |, м2; ? ?, ??) = S(T(u ?, ? ? ), T(u2, ?2)). In the paper by Dubois et al. B000) it is examined in details this utility function. Although the above description of optimistic hybrid utility is rather complex, it can be easily explained, including the name optimistic. Putting together the results of this paper, the utility of a n-ary lottery can be computed by decomposing the S'-measure into a sequence of binary trees and applying the above computation scheme for hybrid utility recursively from the bottom to the top of the binary tree expansion. More details and proofs of theorems stated in this paper can be found in Dubois et al. B000). It remains a pending problem to find a corresponding axiomatization for hybrid utility as was done for classical utility theory in von Neumann and Morgenstern A944) and possibility utility theory in Dubois et al. A998). Cox's well-known theorem, see Cox A946), which justifies the use of probability for treating uncertainty, was discussed in many papers. Some relaxed axioms have been proposed, enabling non-additive functions to be admissible solutions. It is clear that the family of pseudo-additive measures exhibited here is worth studying in Cox relaxed framework, since a natural form of conditioning can be defined on the basis of the triangular norm in the pair (S, T) satisfying (CD), leading to an almost regular independence notion exhibited here. References Aczel, J. A966), Lectures on Functional Equations and their Applications, Academic Press, New York. Akian, M. A999), Densities of idempotent measures and large deviations. Trans. Amer. Math. Soc. 351 A1), 4515^4543. Akian, M. Quadrat, J.-P. and Viot, M. A994), Bellman processes. 11th International Conference on Analysis and Optimization of Systems: Discrete Event Systems. Lecture Notes in Control and Inform. Sci., Vol. 199, Springer, Berlin.
1466 ?. Pap Akian. M., Quadrat. J.-P. and Viot. M. A998). Duality between probability and optimisation, Proc. Intern. Workshop "Idempotency", Bristol. October 1994. J. Gunawardena. ed.. Cambridge University Press. Cambridge, 331-353. Ames, W.F. A965), Ad hoc Exact Techniques for Nonlinear Partial Differential Equations, Nonlinear Partial Differential Equations in Engineering, Academic Press, New York. Azencott, R., Guivarc'h, Y. and Gundy. R.F. (eds) A978). Ecole d'e'te' de Saint Flour 8, Lecture Notes in Math., Springer, Berlin. Baccelli. F, Cohen, G.. Olsder, G.J. and Quadrat. J.-P. A992). Synchronization and Linearity: An Algebra for Discrete Event Systems, Wiley, New York. Bellman, R.E. and Dreyfus, S.E. A962). Applied Dynamic Programming. Princeton University Press. Princeton. NJ. Benvenuti, P., Mesiar. R. and Vivona. D. B002). Monotone set functions-based integrals. Handbook of Measure Theory. E. Pap, ed., Elsevier. Amsterdam. 1329-1379. Blumen. G.W. and Kumei. S. A989). Symmetries and Differential Equations, Springer. New York. Brooks, J.K. B002). Stochastic processes and stochastic integration in Banach spaces. Handbook of Measure Theory, E. Pap, ed.. Elsevier. Amsterdam, 449-502. Burgers, J.M. A948), ? mathematical model illustrating the theory of turbulence. Adv. Appl. Mech. 1. 171. Choquet. G. A953-1954), Theory of capacities. Ann. Inst. Fourier (Grenoble) 5. 131-292. Cole, J.D. A951), On a quasi-linear parabolic equation occurring in aerodynamics, Quat. Appl. Math. 9, 225- 236. Cox, R. A946), Probability, frequency, and reasonable expectation. Amer. J. Phys. 14 A). 1-13. Cox. J.C.. Ross, S.A. and Rubinstein. M. A979). Option pricing: A simplified approach, J. Financ. Econom. 7, 229-263. Crandall, M.G. and Tartar. L. A980), Some relations between nonexpansive and order-preserving mappings, Proc. Amer. Math. Soc. 78. 385-390. Cuninghame-Green. R.A. A979). Minimax Algebra. Lecture Notes in Economics and Math. Systems. Vol. 166. Springer, Berlin. Denneberg. D. A994), Non-Additive Measure and Integral, Kluwer. Dordrecht. Diestel. J. B002). The Riesz Theorem, Handbook of Measure Theory. E. Pap. ed.. Elsevier. Amsterdam. 401^t47. Dubois. D., Godo, L., Prade, H. and Zapico. A. A998). Making decision in a qualitative setting: from decision under uncertainty to case-based decision. Proceedings of the Sixth International Conference Principles of Knowledge Representation and Reasoning (KR '98), A.G. Cohn. L. Schubert and S.C. Shapiro, eds, Morgan Kaufman, San Francisco, 594-605. Dubois, D.. Pap, E. and Prade H. B000), Hybrid probabilistic-possibilistic mixtures and utility functions. Preferences and Decisions under Incomplete Knowledge, J. Fodor. B. de Baets and P. Pemy. eds, Springer, Berlin, 51-73. Dubois, D. and Prade. H. A982). A class of fuzzy measures based on triangular norms. Intemat. J. Gen. System 8,43-61. Dubois, D. and Prade, H. A988). Fuzzy numbers: an overview. Analysis of Fuzzy Information, Vol. 2. J. Bezdek. ed.. CRC Press. Boca Raton. 3-39. Dubois, D. and Prade, H. B002). Qualitative possibility functions and integrals. Handbook of Measure Theory. E. Pap, ed., Elsevier, Amsterdam, 1469-1522. Dubois, D., Kerre, E., Mesiar, R. and Prade, H. B000). Fuzzy interval analysis. Fundamentals of Fuzzy Sets, D. Dubois and H. Prade. eds, Kluwer. Dordrecht, 483-582. Falconer, K.J. B002), Fractal measures. Handbook of Measure Theory. E. Pap, eds, Elsevier, Amsterdam, 1037- 1054. Freidlin, M.I. and Wentzell, A.D. A984), Random Perturbations of Dynamical Systems. Springer, New York. Gaubert, S. and Gunawardena, J. A998). The duality theorem for min-max functions. C. R. Acad. Sci. Paris Ser. I 326. 43^48. Goard, J.M. and Broadbridge, P. A997). Nonlinear superposition principles obtained Lie symmetry methods, J. Math. Anal. Appl. 214, 633-657. Golan, J.S. A992), The Theory of Semirings with Applications in Mathematics and Theoretical Computer Sciences, Pitman Monogr. Surveys Pure Appl. Math., Vol. 54. Longman. New York.
Pseudo-additive measures and their applications 1467 Golan, J.S. A999), Power Algebras over Semirings, With Applications in Mathematics and Computer Science, Kluwer, Dordrecht. Gondran. M. A996). Analyse M1NPLUS. C. R. Acad. Sci. Pans Ser. I 323, 371-375. Grabisch, M.. Nguyen. H.T. and Walker. E.A. A995). Fundamentals of Uncertainty Calculi with Applications to Fuzzy Inference, Kluwer, Dordrecht. Gunawardena, J. (ed.) A998). Proc. Intern. Workshop "Idempotency", Bristol, October 1994. Cambridge Univ. Press, Cambridge. Hadzic, O. and Pap, E. B001), Fixed Point Theory in Probabilistic Metric Spaces. Kluwer, Dordrecht. Hopf, E. A950). The partial differential equation u, + imv = ???? v.Comm. Pure Appl. Math. 9. 201-230. Inselberg, A. A970), Noncommutative superpositions for nonlinear operators. J. Math. Anal. Appl. 29, 294-298. Jones, S.E. and Ames, W.F. A967), Nonlinear superposition. J. Math. Anal. Appl. 17, 484-487. Klement, E.P, Mesiar, R. and Pap, E. A996), On the relationship of associative compensatory operators to triangular norms and conorms. Intemat. J. Uncertainty. Fuzziness Knowledge-Based Systems 4. 25-36. Klement, E.P, Mesiar, R. and Pap, E. B000a), Triangular Norms. Kluwer, Dordrecht. Klement, E.P, Mesiar, R. and Pap. E. B000b). Integration with respect to decomposable measures, based on a conditionally distributive semiring on the unit inten-al. Intemat. J. Uncertain. Fuzziness Knowledge-Based Systems 8. 701-717. Klement, E.P. and Weber, S. A991), Generalized measures. Fuzzy Sets and Systems 40, 375-394. Kolokoltsov, V.N. A998a), Nonexpansive maps and option pricing theory. Kibemetika 34 F), 713-724. Kolokoltsov, V.N. A998b), Stochastic Hamilton-Jacobi-Bellman equation and stochastic WKB method, Proc. Intern. Workshop "Idempotency", Bristol, October 1994, J. Gunawardena, ed., Cambridge Univ. Press, Cambridge, 285-302. Kolokoltsov. V.N. A998c), Semiclassical Approximation for Diffusion and Stochastic Processes, Monograph. Kolokoltsov, V.N. B001), Idempotent structures in optimization, J. Math. Sci. 104 A). 847-880. Kolokoltsov, V.N. and Maslov, VR A992), Bellman's differential equation and Pontryagin's maximum principle for multicriteria optimization problem, Dokl. Alcad. Nauk SSSR 324 A). 29-34. Kolokoltsov, V.N. and Maslov. VR A997). Idempotent Analysis and Its Applications, Kluwer. Dordrecht. Kuich, W A986), Semirings, Automata. Languages, Springer, Berlin. Levin, S.A. A970), Principles of nonlinear superposition. J. Math. Anal. Appl. 30. 197-205. Lions, PL. A982), Generalized Solutions of Hamilton-Jacobi Equations, Pitman, London. Markova-Stupnanova, A. A997), T-sum of L-R fiizzv numbers, Fuzzy Sets and Systems 85. 379-384. Markova-Stupnanova, A. A999), ? note to the idempotent functions with respect to pseudo-convolution. Fuzzy Sets and Systems 102. 417^421. Maslov, VR A987), ? new superposition principle for optimization problems, Uspekhi Mat. Nauk 42 C), 39^48. Maslov, V.P. and Samborskii, S.N. (eds) A992). Idempotent Analysis. Adv. Sov. Math., Vol. 13, Amer. Math. Soc., Providence, RI. Mesiar, R. A997), Triangular-norm-based addition offiizzy intewals. Fuzzy Sets and Systems 91. 231-238. Mesiar, R. and Pap, E. A999), Idempotent integral as limit of g-integrals. Fuzzy Sets and Systems 102, 385-392. Mesiar, R. and Rybarik, J. A995), PAN-operations. Fuzzy Sets and Systems 74, 365-369. Murofushi, T. and Sugeno, M. A991). Furry t-conorm integral with respect to fuzzy measures: Generalization of Sugeno integral and Choquet integral. Fuzzy Sets and Systems 42, 57-71. von Neumann, J. and Morgenstem. O. A944), Theory of Games and Economic Behavior, Princeton Univ. Press, Princeton, NJ. Pap, E. A990), Integral generated by decomposable measure, Univ. u Novom Sadu Zb. Rad. Prirod.-Mat. Fak. Ser. Mat. 20A), 135-144. Pap, E. A993a), g-calculus, Univ. u Novom Sadu Zb. Rad. Prirod.-Mat. Fak. Ser. Mat. 23 A). 145-150. Pap, E. A993b), Solution of nonlinear differential and difference equations, Proc. of EUFIT'9.3, Vol. 1,498-504. Pap, E. A994), Nonlinear difference equations and neural nets, BUSEFAL60, 7-14. Pap, E. A995), Null-Additive Set Functions. Kluwer. Dordrecht. Pap, E. A997a), Pseudo-analysis as a base for soft computing. Soft Computing 1,2, Springer, Berlin. Pap, E. A997b), Decomposable measures and nonlinear equations. Fuzzy Sets and Systems 92, 205-222. Pap, E. A997c), Solving nonlinear equations bx non-additive measures. Nonlinear Analysis 30, 31^40. Pap, E. A999), Applications of Decomposable Measures. Handbook Mathematics of Fuzzy Sets-Logic, Topology and Measure Theory, U. Hohle and S.R. Rodabaugh, eds, Kluwer, Dordrecht, 675-700.
1468 ?. Pap Pap, E. B000), Pseudo-convolution and its applications. Fuzzy Measures and Integrals, Theory and Applications, M. Grabisch, T. Murofushi and M. Sugeno. eds, Physica-Verlag, 171-204. Pap, E. and Ralevic, N. A998), Pseudo-Laplace transform. Nonlinear Analysis 33, 533-550. Pap, E. and Stajner, I. A998), Generalized pseudo-convolution, Proc. Seventh Intemat. Conf. IPMU '98 Paris, La Sorbonne, 1216-1222. Pap, E. and Stajner, I. A999), Generalized pseudo-convolution in the theory of probabilistic metric spaces. information, fuzzy numbers, optimization, system theory. Fuzzy Sets and Systems 102, 393^415. Pap, E. and Stajner-Papuga, I. B001), Pseudo-integral based on non-associative and non-commutative pseudo- addition and pseudo-multiplication, Intemat. J. Uncertainty, Fuzziness, Knowledge-Based Systems, 159-167. Pap, E. and Takaci, A. B002), Generalized derivatives. Handbook of Measure Theory, E. Pap, ed., Elsevier, Amsterdam, 1237-1260. Pap, E., Takaci, Dj. and Takaci, A. A997), Partial Differential Equations through Examples and Exercises, Kluwer, Dordrecht. Pap, E. and Teofanov, N. A998), Pseudo-delta functions and sequences in the optimization theory, Yugoslav J. Oper. Res. 8, 111-128. Pap, E. and Vivona, D. B000). Non-commutative and non-associative pseudo-analysis and its applications on nonlinear partial differential equations, J. Math. Anal. Appl. 246, 390-408. Paris, J.B. A994), The Uncertain Reasoner's Companion, A Mathematical Perspective, Cambridge University Press, Cambridge. Puhalskii, A. B001), Large Deviations and ldempotent Probability. CRC Press. Samborskii, S.N. and Tarashchan, A.L. A990), Semirings occurring in multicriteria optimization problems and in analysis of computing media, Sov. Math. Dokl. 40, 441^45. Sander, W. B002), Measures of information. Handbook of Measure Theory, E. Pap, ed., Elsevier. Amsterdam, 1523-1565. Schweizer, B. and Sklar, A. A969), Measures aleatoires de I'information. C. R. Acad. Sci. 269 A. 721-723. Schweizer, B. and Sklar. A. A983), Probabilistic Metric Spaces. North-Holland, Amsterdam. Schwarz, F. A988), Symmetries of differential equations: from Sophus Lie to computer algebra, SIAM Rev. 30, 450-481. Shafer, G. A996). The Art of Causal Conjecture. The MIT Press, Cambridge, MA. Shilkret, N. A971). Maxitive measure and integration. Indag. Math. 33, 109-116. Sugeno, M. A974), Theory of fuzzy integrals and its applications, Ph.D. thesis, Tokyo Institute of Technology. Sugeno, M. and Murofushi. T. A987), Pseudo-additive measures and integrals. J. Math. Anal. Appl. 122. 197— 222. Wang, Z. and Klir, G.J. A992), Fuzzy Measure Theory, Plenum Press, New York. Weber, S. A984), L-decomposable measures and integrals for Archimedean t-conorm, J. Math. Anal. Appl. 101. 114-138. Zagordny. D. A994), The cancelation law for inf-convolution of convex functions. Studia Math. 110 C), 271-282.
CHAPTER 36 Qualitative Possibility Functions and Integrals Didier Dubois and Henri Prade IRIT-UPS. 118 Rome de Narbonne. 3/062 Toulouse. France E-mail addresses: dubois@irit.fr. prade@irit.fr Contents 1. Introduction 1471 2. Set-relations and ordinal belief structures 1472 2.1. Basic postulates of confidence relations 1472 2.2. Basic examples of confidence relations 1473 2.3. Representing confidence relations by set-functions 1475 3. Qualitative possibility theory 1477 3.1. Possibility distributions 1478 3.2. Information content of a possibility distribution 1479 3.3. Interpretation of possibility distributions 1481 3.4. Refined possibility relations 1483 4. Conditional possibility and plausible inference I486 4.1. Qualitative conditioning 1487 4.2. Plausible inference with a possibility distribution 1488 4.3. Universal possibilistic entailment 1491 4.4. Probabilistic interpretations of plausible inference 1494 5. Independence in qualitative possibility theory 1496 5.1. The paradoxes of probabilistic independence 1496 5.2. The minimum rule in possibility theory 1498 5.3. Possibilistic independence between variables 1499 5.4. Possibilistic event independence based on conditioning 1502 5.5. Qualitative independence and belief revision 1505 6. Qualitative integrals 1506 6.1. Possibilistic integrals 1507 6.2. Axiomatics for possibilistic integrals 1510 6.3. Sugeno integrals 1513 6.4. Comparing functions without commensurateness 1515 7. Conclusion 1518 References 1518 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1469
1470 D. Dubois and H. Prade Abstract Measure theory relies on numerical set-functions. Here we consider relations induced by set-functions on a set of events. Such relations considered as primitive objects are more natural than set-functions for the purpose of modelling subjective uncertainty. Starting from a set of events and a preordering relation among events, some properties are described, that such relations should obey in order to be natural candidates for representing uncertainty. Well-known families of confidence relations on finite sets are surveyed, such as comparative probabilities, first proposed by De Finetti. An important particular case is studied in more details: comparative possibility measures. Their importance stems from the fact that they can be simply induced from a complete preordering between elements of the referential. Such orderings play an important role in Artificial intelligence, in theory revision and nonmonotonic reasoning. Several issues are discussed like information contents and conditioning. It is shown that possibility orderings are representable by means of idempotent measures or maxitive set-functions, called possibility measures, taking values on finite chains. The compared expressive powers between such set-functions and possibility orderings are discussed. Notions of conditioning and independence and integrals on finite ordinal scales are presented.
Qualitative possibility functions and integrals 1471 1. Introduction Measure theory relies on numerical set-functions. However, fields such as decision theory and artificial intelligence consider numerical set-functions as too sophisticated notions for the representation of uncertainty viewed from a subjective point of view. The purpose of uncertainty modelling includes the representation of belief. Quantifying subjective belief without resorting to objective frequency observations is a very difficult task in practice, and a more natural approach is to model belief by means of relations on a set of events. Such relations, considered as primitive objects, are more natural than numerical set-functions since it sounds more reasonable for human agents to assert that an event is more likely than another, than to precisely describe to what extent an event is likely. This chapter, whose topic is related to but not central to measure theory, has but a limited ambition. It proposes an introduction to the comparative counterpart to set-functions, their representations in terms of numerical set-functions, and their connections, and pointers to a rather recent literature which finds its origin in the works of some 20th century scholars such as De Finetti A937), Koopman A940) and Ramsey A963). In particular, this paper has no pretence of being a piece of genuine pure mathematics, only a gateway to interesting mathematical issues. Starting from a set of events and a preordering relation among events, some properties are described in Section 2, that such relations should obey in order to be natural candidates for representing subjective beliefs. Well-known families of confidence relations on finite sets are surveyed, such as comparative probabilities, first proposed by De Finetti A937) and possibility orderings, which are representable by means of idempotent or maxitive set-functions, called possibility measures, taking values on finite chains. This important particular case is studied in more detail in Section 3 devoted to qualitative possibility theory. Its importance stems from the fact that possibility orderings on events can be fully characterized by a complete preordering between elements of the referential. Such orderings play an important role in Artificial intelligence, in theory revision and nonmonotonic reasoning. Several issues are discussed further in this section, like information content of possibility distributions and refinements of possibilistic orderings using lexicographic notions. Section 4 introduces the ordinal version of conditioning. Its links to non-monotonic inference are explored in depth, by means of representation theorems. Some connections to probabilistic reasoning are also outlined. Section 5 introduces qualitative counterparts to probabilistic independence on finite ordinal scales and shows that in the qualitative setting the language of independence is equivalent to possibility theory. Connections to belief revision theory are pointed out. Section 6 describes qualitative counterparts to integrals, focusing on possibility integrals, and generalizing them to any qualitative monotonic set-functions, thus leading to the so-called Sugeno integrals. Properties of such integrals are described and representation results are surveyed, in the scope of decision theory. Lastly, the problem of constructing a partial ordering relation on a set of functions with the same domain and range, from the knowledge of complete preorderings on these domain and range is discussed. This construction can be viewed as an attempt to defining a purely ordinal counterpart to an integral.
1472 D. Dubois and H. Prade 2. Set-relations and ordinal belief structures Let S be a finite set of elements called "states". States encode descriptions of possible situations, states of affairs, etc. Subsets of states are called events. The finiteness assumption is made for the sake of simplicity, and is natural when the description of situations is achieved by means of a formal language such as propositional logic. Then, S is the set of interpretations of the language. It comes down to understanding propositions as ultimately true or false. Here we consider a syntax-free environment, from a logical point of view. In particular, a proposition is understood as the set of states where it is true. A relation, denoted ^L, on the Boolean algebra 2s of events A, B,C,... can be called a set-relation, in analogy with set-functions. We shall focus on a particular type of set- relations called confidence relations, adapted to the operational purpose of representing beliefs. Then, a relation ^ ? is attached to an agent, and A ^ ? ? means "the agent has at least as much confidence in event A as in event B". This relation describes an epistemic state that refers either to a particular situation (for instance, the agent's belief that the next person he will meet today is a close friend rather than an unknown person), or to a collection of situations. In the latter case, the confidence relation pertains to concepts of normality, typicality, as for instance in the statement "it is more typical that birds fly than not". 2.1. Basic postulates of confidence relations With this interpretation in mind there are some naturally expected properties for set- relations modelling relative likelihood: reflexivity: A^l A; completeness: A ^? ? or ? ^? ?; transitivity: A^L В and B^LC ^ A^LC. Under these properties (the first of which is redundant, mathematically speaking), the relation ^ ? is a complete preordering; also called a weak order. While reflexivity is natural because A ^ ? ? is understood as a weak form of dominance, the two other properties are more debatable. They are natural in the scope of representing confidence orderings by means of numerical set-functions, for which these two properties are bound to hold when comparing the levels of confidence of events. Transitivity is also a way of enforcing a form of rationality in pairwise comparisons of events in terms of relative likelihood by an agent. Violating transitivity would be strange. Let the strict part of ^z. be denoted by >l, where A >i В means that ?^? ? holds but not В ^l A. When both hold, this is denoted by ? ~?. ?. This relation is an equivalence relation that partitions 25 into families of equally likely events. The quotient set 2s/~^ forms a chain. At this point, some comments are in order. First, regarding completeness, one may in the principle admit that two events be incomparable in terms of likelihood, and use partial preorderings instead of weak orders (Halpern, 1997). Second, one may start from a strict confidence relation: > ? irreflexive and transitive so as to express strict dominance in terms of likelihood. Then weak dominance ?^? ? defined as "not (B >l A)" is not transitive, generally.
Qualitative possibility functions and integrals 1473 Next come properties that express interactions between the relation and the Boolean structure of its support. Stating from the intuition that if a proposition implies another, the agent should be at least as confident in the former as in the latter, and understanding implication as set inclusion, the following postulate makes sense: Coherence with deduction: if А с ? then B^A. From this postulate and transitivity, the limit conditions S1 ^z. A ^z. 0 follow. S is understood as the ever sure event (the tautology), and 0 the ever false event (the contradiction). Then the following condition will also be requested non-triviality: S >l 0- Lastly there are properties which involve symmetric set theoretic operations like union and intersection. The weakest such property is as follows: Weak stability: If (A U ?) ? С = 0 then A > L В implies AUC^t ? U C. It expresses a minimal stability of the relation forbidding a strict dominance reversal when augmenting the size of the two sets A and В in the same way. Interestingly, using simple contraposition and some renaming, it is equivalent, to a conjunctive stability property: A >L В implies ? ? С ^L В П С, if (А П В) U С = 5". Overall we shall use the following notion: Dehnition 1. A confidence relation is a set-relation which is a weak order, which is non-trivial, coherent with deduction, and weakly stable. 2.2. Basic examples of confidence relations Historically, the oldest set-relation is the comparative probability relation, first suggested by De Finetti A937), and exploited by Savage A972) in decision theory: Dehnition 2. A set-relation ^p is called a comparative probability iff (i) ^p is a non-trivial weak order; (ii) VA, A>p0 (consistency); (iii) if А П (? U С) = 0 then: В ^Р С <S> AU В ^Р AU С (pre-additivity). A comparative probability is a confidence relation. The preadditivity condition is stronger than weak stability since it supports a cancellation property, clearly compatible with the additivity property of probability measures. It is a strong stability condition since augmenting or diminishing the size of the two sets A and В in the same way strictly preserves the ordering relation. The coherence with deduction is a corollary of Definition 2.
1474 D. Dubois and H. Prade So far we did not consider set complementation and the behavior of set-relations with respect to this connective. Comparative probabilities are stable with respect to complementation, in other words, they are self-dual relations: A^pB <s> ?>?? (self duality). A more recent confidence relation was introduced by Lewis A973) in the setting of a modal logic of counterfactuals, and independently rediscovered by one of the authors (Dubois, 1986): the comparative possibility relation. Dehnition 3. A set-relation ^/7 is a possibility relation iff (i) ^/7 is a non-trivial weak order; (ii) VA, A >n 0; (iii) VA, В ^п С =>· A U В ^п A U С (disjunctive stability). В^п С reads В is at least as possible (that is, at least as plausible for the agent) as C, for reasons that will become clear below. Again, the disjunctive stability is stronger than weak stability, but it cannot be compared with the preadditivity property. This kind of set-relation has been more recently used by Grove A988) in the setting of belief revision theory. Possibility relations are not self dual. The dual set-relation defined A ^-n В о· В ^п A is called a necessity relation (Dubois, 1986). It can be independently defined as follows: Dehnition 4. ^N is a necessity relation iff (i) ^-n is a non-trivial weak order; (ii) VA, S^N A; (iii) B^NC^-AnB^NAnC (conjunctive stability). В >yv С reads В is at least as necessary (that is, certain for the agent) as С The conjunctive stability is also stronger than weak stability, but it cannot be compared with the preadditivity property, nor to disjunctive stability. The dual of a necessity relation is a possibility relation and conversely. The duality between possibility and necessity relations is a counterpart of the duality between possibility and necessity modalities in modal logic Hughes and Cresswell A968). Possibility and necessity relations can actually be axiomatized in the framework of modal logic, as first shown by Farinas del Cerro and Herzig A991), based on conditional of Lewis A973) (the so-called VN conditional logic). Other attempts include Boutilier A994), Farinas del Cerro et al. A994), Klir and Harmanec A994), Hajek and Harmancova A994), Hajek et al. A994). It has been pointed out (Dubois and Prade, 1991) that a necessity relation is very close to an epistemic entrenchment relation after Gardenfors A988), except that in the epistemic entrenchment framework, the stronger condition S >n A, for ? ? S, is requested instead of (ii) Definition 4, i.e., non-tautological statements are always strictly less certain than tautologies. This is natural in the framework of Gardenfors' belief revision theory, where any non-tautological sentence may disappear upon a revision process.
Qualitative possibility functions and integrals 1475 The set-relations that are at the same time any two of possibility, necessity or probability orderings reduce to deterministic (truth-like) set-relations, defined as follows: 3!s°eSVA,B, A> В iff s° e A, s° e B, and A ~ В otherwise. The state s° is such that S ~ {s0} > 0, and (s) ~ 0 otherwise for s ^ i°. So, for any event A, A~{i°}ifi°eA, and A ~ 0 otherwise. The name "deterministic" stems from the idea that the agent knows that the real state of the world is 5° so that any event can be told true or false by this agent with certainty. This is the case of no uncertainty. 2.3. Representing confidence relations by set-functions As opposed to set-relations that offer a tool for a relative representation of belief, set- functions are the natural tool for absolute representations of belief. Let L be a totally ordered scale with bottom 0 and top 1. L can be a finite chain, the set of integers, the unit interval, for instance. A set-function is a mapping ? from the set of events 2s to L. It assigns to each event its level of confidence. Some natural conditions must be fulfilled by a set-function that models the epistemic state of an agent: Definition 5. A confidence function is a set-function ? such that (i) <p@) = 0, <p(S) = 1 (limit conditions); (ii) if А с ? then ?(?) ^ ?(?) (monotonicity). The limit conditions are self explanatory and the monotonicity expresses the coherence of confidence functions with deduction see Grabisch B002) for a survey (Chapter 34 in this Handbook). A set-function ? induces a set-relation >? such that В ^l A iff ?(?) ^ ?(?). Conversely a class С of set-functions ? is said to be a representation of a class C' of set- relations ^ i provided that for any set-relation ^ r in C', there is a set-function ? in С such that ?(?)>?(?) iff B^rA. This is the natural bridge between absolute and relative representations of uncertainty. A lot of works exist on the representation of comparative probability. See Fine A973), and Fishburn A986a) for surveys. Of course for the representation of comparative probability by numerical probabilities, one must choose L = [0, 1]. In the infinite case it is possible to represent comparative probabilities by means of probability measures, provided that some continuity axiom is added to the basic properties of comparative probabilities (like Savage A972) did). However, in the finite setting, there is a famous counterexample of Kraft et al. A959) showing that while any probability measure induces a comparative probability, the converse is false, namely there are comparative probabilities which are not repre- sentable by any probability measure. This negative result can be addressed by finding a stronger substitute to preadditivity that guarantees probabilistic representations. However, proposed axioms turned out to be either too strong or unnatural, from a modeling point of view.
1476 D. Dubois and H. Prade On the contrary, possibility and necessity relations on finite sets can be represented by specific classes of confidence functions, and the choice of the scale L is arbitrary (Dubois and Prade, 1998). It can be a finite chain or the unit interval, or even the set of integers. The only set-functions capable of representing any possibility relation are possibility functions (also called possibility measures, Zadeh A978)) that are maxitive set-functions ? satisfying Л@) = 0,17(S) = 1 and П(А U ?) = тах(Л(А), П(В)) (maxitivity). Any maxitive set-function induces a possibility relation and conversely any set-function representing a possibility relation is maxitive (Dubois, 1986). Similarly, the only set- functions capable of representing any necessity relation are necessity functions (also called necessity measures, Dubois and Prade A980)) that are "minitive" set-functions N verifying N@) = O, N(S) = I and N(A П В) = min(N(A), N(B)) (minitivity). When L is embedded into the unit interval, these set-functions are unique up to a monotonically increasing transformation. The duality property between possibility and necessity can be expressed, in the case of set-functions, using an order-reversing bijection ? of L on itself, namely ?(?) = v(N( A)), where A is the complement of set A. Namely ? is a possibility function if and only if its dual ?'' such that ?'(?) = ?(?( A)) is a necessity function. Representing epistemic states by means of necessity or possibility functions, rather than using comparative notions, is a matter of convenience. These set- functions were first suggested as useful models of uncertainty in terms of possibility by the economist Shackle A961). He called the quantity N( A) the degree of potential surprise of A. The decomposability properties of probability, possibility and necessity functions make them attractive because of their structural simplicity. Probability functions ? are completely determined by their distribution on S. Similarly, possibility and necessity functions are completely determined by so-called possibility distributions on S, which are mappings ? from S to L such that n(s) = 1 for some state s. In order to keep this structural simplicity for more general set-functions, the following decomposability axiom can be adopted: there exists an operation * on the unit interval such that ifAnfi = 0 then<p(AU ?) = ?(?) *?(?) (disjunctive decomposability). Such non-additive representations have been introduced by Dubois and Prade A982) and Weber A984), and studied at length by Pap A995) see Chapters 23 (Butnariu and Klement, 2002) and 35 (Pap, 2002) of the Handbook for surveys. Similar ideas have been proposed in the literature of decision theory under the name "distorted probabilities" (Yaari, 1987). An interesting question is then, to find a relational counterpart of disjunctively decomposable measures. The following axiom, a weak form of preadditivity we call "almost disjunctive stability", was proposed by Dubois A986): if An(fiUC) = 0andfi^C thenAUS^AUC
Qualitative possibility functions and integrals 1477 It is weaker than both disjunctive stability and preadditivity. Results in Dubois A986), refined by Chateauneuf A996), indicate that indeed the only numerical counterparts of almost disjunctive stable confidence relations are disjunctively decomposable measures. Of course set-functions dual to disjunctively decomposable measures can be studied too and satisfy a conjunctive counterpart to the above axiom. Other well-known set-functions can be described at the relational level: probability envelopes and belief functions. Upper and lower probabilities (Walley, 1991) can be used as degrees of plausibility and certainty, respectively. This is the path followed by Shafer A976) and Smets and Kennes A994) for whom degrees of subjective belief can be expressed by means of two set-functions induced by a third one called basic probability assignment, which is formally a random set, i.e., a probability distribution m on 2 such that w@) = 0. A belief function is a numerical set-function Bel defined by Bc\(A) = E{m(B), B^A]. The dual set-function P1(A)= 1-Bel( A) = ?{??(?). ВП А фЩ is called plausibility function see Grabisch B002) (Chapter 34 in this Handbook). These set-functions subsume probability functions (obtained when A is a singleton whenever m(A) > 0). Numerical possibility (respectively necessity) functions are special cases of plausibility (respectively belief) functions (obtained if, whenever m(A) > 0 and m(B) > 0, А с В or В с A). Wong et al. A990) have proposed families of set-relations that can always be described by belief functions. Such relations obey the following axiom, for disjoint events А, В, С: A U S >Bei В =>· A U В U С >Bei BUC. Comparative plausibility relations satisfy the dual axiom: A U В U С >pi В U С =>· A U В >pi В. Fishburn A986b) also adopts a purely ordinal view of more general classes of upper and lower probabilities, based on interval-orderings. However his axioms for Shafer belief functions are too technical to be meaningful. Research on the determination of belief function-based orderings is continuing. A natural line of research is to carry an ordering relation satisfying Definition 2 from a set of events to another one, through a multiple-valued mapping; it comes down to using Dempster's A967) definition of belief functions based on multiple-valued mappings where the basic probabilistic assignment is changed into an ordering relation (Wong et al., 1993). 3. Qualitative possibility theory The simplest confidence relations are comparative possibilities and necessities, because they can be fully characterized by complete preorderings of elementary states. Hence the computational complexity of representations of such confidence relations is linear in the number of states while it is exponential in general for confidence relations. In particular, comparative probabilities cannot be defined in this way, contrary to numerical probabilities. The simplicity of possibility theory makes it an attractive tool for representing knowledge. In this section, we present two kinds of qualitative representation of possibilistic
1478 D. Dubois and H. Prade knowledge: comparative possibility distributions and finitely-scaled ones. We show that they are not exactly equivalent, and that the latter are more expressive than the former. The comparison of possibility distributions in terms of their informational content leads to different results in each case. Lastly the interpretation of possibility distributions is discussed, and the basic blocks of possibilistic reasoning are laid bare. It is claimed that possibility theory is a natural setting for plausible reasoning, that is, reasoning under incomplete information, as if the state of the world were systematically normal. 3.1. Possibility distributions A possibility relation ^/7 and its dual necessity relation >yv are fully characterized by a so-called comparative possibility distribution (cpd) ^? which is a complete preordering on the set of states S. Namely, define ^? as s>Rs' iff H^nis')- Then the possibility and necessity relations for non empty sets are recovered as follows: A^n В iff 3i e A, Vs' e B, s ^? s', A^NB iff 3seB, Vi'eA, ?,??'· However the cpd does not tell anything about the possibility relation linking any event A to the empty set 0. Namely, in order to recover the full possibility relation, one has to add 0 ^/7 A or A >/7 0 for each subset A, according to whether event A is impossible or not. A pair (S, ^тг) where S is a finite set and ^ is a complete preordering will be called a possibility structure. The set of states of affairs is then partitioned via ^? into a totally ordered set of ? + 1 clusters of equally possible states S = Eq U E\ U · · · U E„ by means of the equivalence relation ~? induced by the symmetric part of °^?. This set of equivalence classes is linearly ordered by the strict part of the possibility relation ?0 >n E\ >n ¦¦¦ >n E„. The set ?0 contains the most possible states that will be considered as the normal ones. The set E„ contains the least possible states and, by convention, starting from a given cpd, these states are not fully impossible (E„ >n 0), unless the contrary is explicitly said. The family ?o>/7 E\ >n ¦¦¦>n E„ is called a well-ordered partition (wop) by Spohn( 1988). Another kind of qualitative possibility distribution is the finitely-scaled one defined as a mapping ? from S to a finite chain L. Let Я be a set-function representing a possibility relation ^/7, and N its dual possibility function. Then the finite chain L possesses a number of elements equal to the number of equivalence classes of the symmetric part of ^/7. The possibility distribution (pod) ? is then defined as n(s) = T7({s}), Vs e S. Then we can consistently recover ?(?) = max n(s), N(A) = minu(n(s)), s?A s?A where ? is the order-reversing map on L. The cardinality of L is ? + 2, generally, one per element of the wop, plus 0 (except if En ~n 0, and then it is ? + 1). In order to ensure that
Qualitative possibility functions and integrals 1479 n(S) = 1, it is requested that n(s) = 1 for some state, that is, some state is fully possible, normal. The possibility distribution is then said to be normal. There are several noticeable types of possibility distributions: - deterministic possibility distributions such that n(s) = 1 for some state s, and n(s') = 0 for all 5 ^5'. For all events А, П(А)= 1 > ?( A) = 0or ?( A) = 1 > ? (A) = 0; - linear possibility distributions: they are such that the corresponding cpd is a total ranking of S; the associated WOP contains only singletons. They are deterministic by default, in the sense that for all events ?, ?(?) > ?( A) or ?( A) > ?(?); - non-dogmatic possibility distributions: they are such that n(s) > 0 for all states; no state is totally excluded; - subnormal possibility distributions: they are such that n(s) < 1 for all states; no state is totally possible, every state is somewhat abnormal. It expresses a state of partial inconsistency. The logical contradiction corresponds to a subnormal pod n(s) = 0, Vie 5". Are cpd and finitely-scaled pod equivalent representations? Not exactly. The relative expressive power of the set of complete preorderings on S and the set of mappings ? from 5" to L is different from several point of views. First, an ordering ^л of states defined on S cannot express that some states are impossible, as pointed out above. With an absolute scale this is easy: one just has to let n(s) = 0 for impossible states. In particular, it is easy to represent deterministic information with a pod. On the contrary, it is not possible to express this deterministic situation with a cpd alone. One needs the whole possibility relation on the power set Moreover, it is possible to capture the idea of internal conflict with a pod. A subnormal ? is interpreted as a partially inconsistent epistemic state, and the level ConsO) = max565 7r(i) is the degree of consistency of ?. It is easy to see that for any event A, min(N(A), N( A)) = u(ConsO)) > 0. A partially inconsistent epistemic state is one for which an event and its contrary are somewhat certain. On the contrary, cpds cannot be compared with regard to the notion of internal conflict. Two possibility distributions ? and ? may correspond to the same cpd(^T= ^?), while we may have ConsO) < Сош(д). In the scope of information fusion, it is better to have scaled representations under the form of pods with the same range, that can be aggregated because they are commensurate. In contrast, ordering relations are not commensurate: how to combine a confidence relation ^tt and a preference relation ^ ? on SI As first established in social choice theory (Moulin A988), for instance), there are very few reasonable aggregation schemes, if any. The situation is easier if there is a common scale. Lastly, notions of quantity of information contained in a possibility structure cannot be equivalently defined with absolute and relative representations, as seen in the next section. 3.2. Information content of a possibility distribution Now that a model of an epistemic state has been described, the next question is to qualify epistemic states in terms of information content. Namely, how informative is an epistemic state? This notion is crucial in the face of an incompletely specified epistemic state. Then
1480 D. Dubois and H. Prade one might be interested in finding the least informative possibility ordering compatible with the explicit knowledge. A natural idea is to define an ordering relation between epistemic states, whose meaning is "more informative than". Let ^T and ^;< be two cpds corresponding to wops ?oU?| U · · · U E„ and Fq U F\ U · · · U Fm. A first natural idea is to use the notion of refinement of a relation. Namely: a cpd ^? is said to refine ^T iff Vs, s' if s ^? s' then s ^? s'. In terms of wops, one gets more refined by splitting equivalence classes Ei. Namely, for all / = 0,..., n, E, = F}¦ U ¦ ¦ ¦ U Fi for some j and k, j <^k. The wop Fq U F\ U · · · U Fm refines F() U E\ U · ¦ · U E„. The least refined cpd is unique and is obtained when Eq = S: states cannot be distinguished according to plausibility. This cpd brings no information at all. The most refined cpds are such that V/', ?, = {s} for some s: they are all linear possibility orderings (no ties): they are informative as the agent can always point out the most plausible one among any two states. Comparing two normal L -valued pods leads to a different natural concept namely the specificity ordering: ? is said to be less specific than ? in the wide sense iff ?(s) ^ ?(?), Vs. It means that any state is at least as possible according to ? as according to ?. It can be checked that any event is at least as sure according to ? as to тт. The least specific pod is such that every state is fully possible: n(s) = 1, Vs. It corresponds to a fully ignorant agent. The most specific pods are such that ??(?) = 1, for some unique s, and n(s') = 0, Vs' ? s, they are the deterministic pods: the agent precisely knows what is the state of affairs. Interestingly, linear possibility distributions are no longer the most specific ones. They are less specific than deterministic pods. However, the least specific pod does correspond to the least refined cpd. It is interesting to try and relate the two notions of informational content comparisons, the scale-based and the ordinal one. A notion of comparative specificity can be defined as follows: >? is ordinally less specific than >(J, in the wide sense, if and only if, in terms of their associate wops: V/= l,...,min(w,«), F0U F| U---U F; C?0UE| U---U ?;. Here the idea is that the specificity of an epistemic state is decreased by moving states up towards more plausible elements of a wop. Maximally specific wops are also the most refined ones, and the least specific wop is also the least refined. However ordinal specificity and refinement are different orderings of cpds. Note that if ~^? is less specific than ^? in the above sense, then there exist two pods ? and ?, ordinally equivalent to >„¦ and ^;< respectively, such that ? is less specific than ? in the wide sense. However these pods cannot be arbitrarily chosen: one may easily find two pods ? and ?, ordinally equivalent to >T and ^;1 such that ? is not less specific than ? even if ^n is ordinally less specific than >;;. Even more, it is easy to see that if ? is less specific than ?, then ^? may fail to be less specific than ^/(. The best example is when ? is a deterministic pod while ? is a linear pod with n(s) = ?^) = I. Then ? is strictly less specific that ?. However the wop induced by ? is (j)UX\|s), while the one induced by ? is a maximally refined partition, which is ordinally strictly more specific than (i)US\(s). This is partly due to the lack of expressiveness of wops, regarding impossible states. However the same phenomenon occurs if ? is such that Vs' ? s, n(s') = a > 0 and ?(?') > a.
Qualitative possibility functions and integrals 1481 These remarks show that L -valued possibility distributions are more expressive and more flexible than cpds, in terms of information contents. Besides the notion of ordinal specificity is only apparently ordinal. In actual fact, it presupposes the use of a scale L — {?? = 1 > ? ? > ¦ ¦ ¦ > ?„ > · ¦·} where, by convention, states in ?, have plausibility ?,. In the case of the two-element wop (s)US\ [s\, states other than s are always supposed to have plausibility ? ?. So it seems at first glance that the notion of refinement is a more natural tool for comparing the information content of cpds than ordinal specificity. Unfortunately, as shown in Benferhat et al. A999), the problem of finding the least informative cpd obeying constraints of the form А >п В cannot be solved using the least refined cpd obeying those constraints. Such a cpd generally does not exist. However a unique least specific cpd always exists, if the constraints are not conflicting. The principle of minimal specificity argues that every proposition that is not known to be impossible is supposed to be possibly true, thus leading to maximize the possibility of states, each time this possibility is unknown. This principle of minimal commitment is cautious and conservative. This is why wops of the form Eq U E\ are interpreted in such a way that states in Eq are fully plausible and states in ? \ are as plausible as can be even if less plausible than those in Eq. It contrasts with other kinds of informational principles such as the closed world assumption (CWA) according to which any proposition not known to be true is supposed to be false. The principle of minimal specificity and the CWA are actually at odds with respect to each other. The former is more similar to the principle of maximal entropy in probability theory, although this probabilistic principle rather suggests maximal disorder or conflict between pieces of information, and is not concerned by the preservation of information incompleteness. It leads to introducing default independence assumptions. On the contrary the principle of minimal specificity prohibits the introduction of arbitrary additional default information. 3.3. Interpretation of possibility distributions A possibility distribution is supposed to represent the state of knowledge of an agent regarding the state of affairs, either the current one (as of now) or the usual one (generally). The complete preordering of states tells what is likely from what is less likely. Like a probability distribution, it may represent generic information as to what is usually the case and what is less usually the case, except that we do not presuppose the existence of explicit statistics, nor must we refer to a well defined population of cases. Under this view, s ^ s' means that state s is at least as normal or typical as s'. Alternatively the pod may only refer to the current situation and what is currently the most plausible one, according to the agent's subjective beliefs. Let ?(?) = [s. n(s) = 1} be the core of the pod. In the following we assume normal pods. It coincides with the set ?? of most plausible, or most normal states in the corresponding wop (according to whether we adopt the point of view of specific or generic knowledge). This model involves a presupposition, which is considered to be characteristic of possibilistic reasoning, namely that the agent believes that, by default, and in the absence of any other information, the truth lies in S(n), until getting new evidence that it is not the case. That is, the agent expects the
1482 D. Dubois and H. Prade current state of affairs to lie in S(n). With this postulate in mind, the possibilistic ordering possesses a clear meaning: В ^п С means that the expected situation where В occurs is at least as plausible as the expected situation where С occurs. This is really comparing propositions on the basis of their most normal models. This view drastically differs from the probabilistic representation of belief, which cumulates likelihood of occurrence of all situations realizing an event. Now the dual necessity ordering precisely describes what are the propositions the agent accepts as true by default in the face of the available knowledge modelled by a pod. They are the propositions that are true in all normal, or maximally plausible situations, i.e., A such thatSOr) с A, which is equivalent to A >yv 0- Events A such that A >yv 0 correspond to accepted beliefs of the agent. It is easy to prove that for any event A, if A >yv 0 then 0 ^/v A. If the agent accepts one proposition, (s)he cannot at the same time accept its negation. As a consequence there are three epistemic situations only regarding event A: either A >n 0, or A >yv 0, or both 0 ^yv A and 0 ^/v A. At one extreme end, there are the sure events A such that A ># S. In the case of a non-dogmatic pod, the only sure event is S, the tautology. At the other extreme end, there are the impossible events A such that A^n S (equivalently 0 ^/7 A). Again, in the case of a non-dogmatic pod, the only impossible event is 0, the contradiction. There is a third category of events, namely the ones the agent does not know anything about, called the unknown (or uncertain) events, such that 0 ># A and 0 ^n A. These are the propositions the agent is agnostic about. If the pod is deterministic (hence dogmatic) there are only sure events and impossible events, and no unknown events. If the pod is linear and non-dogmatic, then again there are no unknown events, but events are more or less accepted or rejected. Noticeably, accepted events are closed under deduction. This is due to the following obvious properties A >n 0 and Ac В imply ? >? 0, A > ? 0 and В > yv 0 imply А П ? > # 0. The set of accepted propositions induced by a pod forms a belief set in the sense of Gardenfors A988). More generally, fixing any accepted belief С >n 0. the set {A ^ C), called a cut, also forms a belief set for any C. This approach is in agreement with the idea of "reasoning as if the current situation were normal", since, by considering the belief set induced by a pod, the agent is jumping to conclusions obtained from the assumption that the situation is as normal as can be. In summary, the possibilistic representation of belief leads to sharing the set of propositions, or events into 3 subsets: accepted beliefs A: A >^ 0; rejected beliefs TZ: A such that S >n A (or A >^ 0); agnostic beliefs V: A such that neither A >n 0 nor A >yv 0 hold. This partition is similar to the one in classical logic where a belief set is the set of consequences of a set of formulas, the rejected beliefs are the negation of accepted beliefs and the ignored beliefs are sentences which cannot be inferred, nor their negations. So
Qualitative possibility functions and integrals 1483 the possibilistic setting is like the one of classical logic: ? is deductively closed, and incompleteness is captured. Unlike classical logic, ? is ranked in terms of certainty, and 1Z is ranked in terms of impossibility. Any cut of ? is deductively closed as well. This feature confirms the interpretation of the necessity relations in terms of acceptance: the agent reasons with accepted beliefs by means of classical logic, as if they were true. But the complete preordering in A captures the idea that some accepted beliefs are more entrenched than others, following Gardenfors, and that some beliefs may be more easily given up than others upon the incorporation of new pieces of information. 3.4. Refined possibility relations Two events A and В cannot be discriminated using the dual pair (?, П) as soon as ? (A) = ?(?) and N(A) = N(B), which occurs quite often, namely, as soon as тахл6д n(s) = тах.5бВ n(s) < 1 or тахл^д n(s) = тах^д n(s) < 1. The equality ?(?) = ? (?) may be due to a high value of П(В ? A). However, in the spirit of probability theory, this weight should not affect the comparison between A and B, since it is common to both. The same reasoning_applies to П( В ? А) with respect to the equality N(A) = N(B). Only В ? A and ? ? A should matter in telling A from B. Let us introduce the relation >riL on 2 , which is different from, although closely related to the original necessity and possibility orderings modelled by N and its dual ?: A>nLB iff П(ВПА)>ЯEПД), A^nLB iff not (B>nLA). The relation >nt is a refinement of both possibility and necessity orderings in the following sense: N(A) > N(B) => A >nL В. П(А) > П(В) => A >nL B. Of course, it may happen that both N(A) = N(B) and П(А) = П(В) hold and that A >nL B. Note that ^/7 and ^riL coincide when restricting to mutually exclusive events A and В since then П(А П ?) = 0. More generally ^/7 and ^nt coincide whenever ? (А ? B) < min(/7(A П В), П( А П ?)) which is to say when the property ?(? ? ?) = тт(Л(А), П(В)) does not hold, a possibilistic counterpart to independence (see Section 5). Notice that the relation A ~til В iff neither A >r\L B_ nor ? >_?? A hold on 25 is not transitive since Я(А П ?) = ?(?~ ? ?) and ? (В ПС) = П(ВГ)С) do not imply ?(? ? С) = П( А П С). This is because тах(л-?, ?}) = тах(тг4, щ) and тахОз.тгб) = тахОз, ??) do not imply тах(Л4.Л7) = тах^т.л^) denoting ?, = ? (Si), A = {52.53,55}, В = {s2,S4,S(,}, С = {53,54.57} (even if ? takes on two values, 0 and 1 only). Interestingly, the refined possibility and necessity relation ^/7/, satisfies not only coherence with deduction, but also the preadditivity condition (iii) of comparative probabilities, and the relation ^riL turns out to be self-dual see Dubois, Fargier and Prade A998b). Moreover if ? is a proper subset of A then A >nt В always holds for non- dogmatic possibility distributions.
1484 D. Dubois and H. Prade So, the relation ^til possesses all the properties of a comparative probability, except for the transitivity of indifference. In the following, the possibilistic relation ^riL will be called possibilistic likelihood and A ^riL В reads "A is at least as likely as ?", so as to emphasize the probabilistic-like self-duality. However, because of the lack of transitivity of the indifference relation, we cannot generally represent such relations by means of probability measures,' nor can we assume properties that are usually derived from comparative probabilities. For instance, the following property of comparative probabilities is NOT satisfied in general by possibilistic likelihood: If АПВ = 0апс1СПО = 0, then A ~p С and B~P D <S> AU B^pCUD. Counter-example. 5" = [a,b,c,d,e, f.g.h); A = [a,b,c]; В = {e, f, g}; С = [b, g, h}; D = {c, d, e]. Suppose that ?(?) = n(f) = л(с) = n(g) = 1; ??) = ?(?) = л(е) = л(И) = 0. П(АГ\С) = П({а,с}) = \ and П(С ? ? ) = n{{g, h}) = 1: A ~nL С, n(BD~D) = n({f,g}) = \ and Я(ОП ?) = n({c,d}) = 1: ? ~nL D, П((АиВ)Г)СГI>) = n({a,f})= 1 > Я((С U D) П АПб) = n({d,h}) = 0. Thus: AU В >nL CUD. Moreover, the following very strong, non probabilistic property holds for the possibilistic likelihood: For A, S, С disjoint, AUC >?? ? and BUC >nL A imply С >nL AU B. Hence, although similar to a comparative probability relation, possibilistic likelihood noticeably differs from it. In terms of well-ordered partitions, the possibilistic likelihood can be described as follows: A > n L В iff there is an Ek such that ? ? S ? Ek ? 0 and ВПАП?(=0, while А П В П ?? = 0 and ? ? ? ? ?,¦ = 0 for all /' < ?. In particular if ? is uniform (total ignorance), ? >/7?. ? if and only if ? is a proper subset of A. The possibilistic likelihood coincides with the inclusion relation. At the opposite, if ? is linear, then for any ? ? ?, either A >nt В от В >nt ?. More generally the relation A >ni В can be generated from the restriction of the regular possibilistic relation to disjoint events (this is a partial ordering) completed using the preadditivity property for non disjoint events. 'This representation is not always possible with standard comparative probability on finite sets (Fishbum. 1986a).
Qualitative possibility functions and integrals 1485 The problem of axiomatically characterizing these uncertainty relations remains open. However, they belong to a more general family of what can be called partial additive set-relations ^i on events, that are characterized by the same axioms as comparative probabilities but for the completeness of ^/.. A large class of such relations can be defined by means of incompletely specified numerical weights ? = (p\,..., p„) assigned to elements of S, and a set of constraints on the p'ts defining a domain ? of feasible- tuples p. Then define (Lang, 1996): S>z.A iff Y^p(s)^Y^p(s). Vpe/i. seB seA By suitably restricting ?, the possibilistic likelihood relation can be recovered (Benferhat et al., 1998). Similarly, one can further refine the possibility relations between events A and В by considering a lexicographic ranking of reordered elements in A and B. First let us define the leximax ordering of vectors. Let V and W be two vectors having m L-valued components respectively denoted u, and ш,-. Denote V > leximax W whenever Зк, гю[ц = V[i] for/ < к and ищ > w\k\ where W[\] > W[i] > ¦·· > w\m] and t>[i[ ^ f|2| ^ ¦ ¦ · ^ i>[w| result from a decreasing reordering of components. Now if a is the vector (?? a,„) such that, denoting S = {?|,?2. ¦¦¦,Sm}: aj = ??(?,) if Si e A, a; = 0 otherwise, then define an ordering among events by A >/7Lex ? ?· ? >leximax ^· This relation will be called "possibilistic lexicographic likelihood" ("leximax" likelihood for short). The complement ^riLex of this relation is a complete preordering of events which refines the possibilistic likelihood ^ruin terms of well-ordered partitions, the leximax likelihood can be described as follows: A>/7LexS iff 3Ek: \АПЕк\>\В Г) Ek\ and \??\??\ = \? ? Et\ for all i < k. The leximax likelihood relation can be generated from the well-ordered partition {Eq, ..., Ep] of S as follows @) {*}>/7Lex0; (i) ifi,i'e ?,, then {i}~/7Lex {i'l: (ii) if s e Ei, and А с Ei+\ U --·? ?,, then {s} >/7Lex A; (iii) apply additivity: В >nLe\ С О A U ? >/7Lex A U C. For a uniform possibility distribution the leximax likelihood relation coincides with the comparative probability relation that is induced by a uniform probability. This is
1486 D. Dubois and H. Prade not surprising in view of the fact that the leximax likelihood relation is a comparative probability in the usual sense. Even more, any leximax likelihood relation can be represented by a special kind of probability measure. Indeed, consider the well-ordered partition {?0 E„] of S. Let /„ = |?„| and /, = \E,\(fi+\ + 1), for / < ? decreasing to 0. Let К be the sum of the ffs. Define a probability distribution ? such that p(s) = fi/(K -\Ei\) if seEi. Then S^/7LexC О P(B)^P(C). The possibilistic likelihood and the leximax likelihood relations coincide for linear possibility distributions. Under the linearity assumption possibilistic likelihood relations can be represented by a very special kind of probability measures introduced by Snow A994, 1999) and also studied in Benferhatet al. A999). It is called an atomic-bound system by the first author. Definition 6. The atomic bound system on S is the set of probability distributions ? such that p(s) > 0, Vs, each inducing a linear ordering > on S, whereby s > s' if and only if p(s) > p(s') and such that for each interpretation s, p(s) > ]ГД... ?>?. p(s'). A probability distribution in an atomic-bound system is such that the probability of an interpretation is much bigger than the probabilities of each less probable interpretation. Such a distribution is called a big-stepped probability (bsp) by Benferhat et al. A999) and is such that VA с 5", 3s e A, P((s}) > P(A \ {s}) so that ? (A) > P(B) if and only if тахFд P(s) ^ maxv6« P(s): the magnitude of P(s) can never be attained by summing the probabilities of states that are individually less probable than s. Moreover P(s\) > P(s2) > ¦¦¦ > P(s,,-\) > P(s„) for states s\,...,s„ of nonzero probability. Clearly a lot of probability measures are ruled out by these conditions. Example. 5" = {a, b, c, d, e] and ? = (?/0.6, b/0.3, c/0.06/, d/0.03, e/0.01). The above defined probability is an example of a bsp when \E,¦ | = 1 for all /. Given a linear pod ? which induces an linear ordering of states > rs that coincides with the ordering induced by a bsp p, it is obvious that ?(?) ^ ?(?) ? P(A) ^ P(B). In fact the possibilistic likelihood relation family marginally intersects the set of comparative probabilities (and leximax likelihoods), while leximax likelihoods form a subclass of comparative probabilities. 4. Conditional possibility and plausible inference In this section, a qualitative possibilistic counterpart of conditional probability is presented. While this notion is very simple, it is shown that it leads to a form of plausible reasoning that has been axiomatized in the literature of artificial intelligence by Lehmann and colleagues (Kraus et al., 1990; Lehmann and Magidor, 1992), and in philosophy of science by Adams A975) and Lewis A973), under very different frameworks and motivations.
Qualitative possibility functions and integrals 1487 This view of plausible reasoning is also captured by the so-called preferential inference proposed by Shoham A988). It accounts for the idea that tentative conclusions derived in the absence of complete information can be questioned, and even refuted, by the arrival of new information. This non-monotonic behavior is also the one found in Gardenfors A988) theory of belief revision. At the heart of these various systems and application areas, possibility orderings are at work. 4.1. Qualitative conditioning Let ^/7 be a possibility ordering, and С an event, supposed not to be impossible, which characterizes a context of interest. A possibilistic ordering ^cn restricted to the subsets of С can be induced as follows: A is at least as possible as В in the context С > /7 0 (denoted by A ^ сп В) if and only if А П С ^ /7 ? П С. The relation ^ cn is called a conditional possibility relation. This kind of conditioning can be applied to any relation between events. For finitely-scaled normal possibility measures valued on L, the following quantitative conditional possibility function has been defined by Dubois and Prade A988): jl ifA^0. Л(С) = Л(АПС)>0, \П(АГ\С) if П(А Г) С) <П(С). It is also, by virtue of the minimal specificity principle, the greatest solution to the Bayesian-like equation ?(? ? С) = тт(Л(А|С), П(С)). The possibility distribution associated with Л(- \ С) is given by л(и\С) 1 if л(и) = П(С). и еС, л-(и) if л(и) < П(С), и е С, 10 ifu<?C. Note that if П(С) = 0 then ?{- \ С) is still a solution to the Bayesian-like equation and is equal to the characteristic function of C, which is simply substituted to ?. The only difference between conditional possibility and conditional probability is that the renormalization via division cannot be carried out. It is changed into a simple move of the most plausible elements in A to 1. Ramer A989) noticed that only one element with possibility П(С) needs to receive possibility 1 after conditioning by A. in order to minimize the change in the possibility values. The purely ordinal view of conditional possibility theory agrees with the finitely scaled form of conditioning in the sense that if a possibility measure ? represents a possibility ordering ^n then the conditional possibility measure П(- \ С) represents the conditional possibility ordering ^cn. Qualitative conditioning makes sense in a finite setting only. Indeed this form of conditioning applied to infinite numerical settings creates discontinuities, and the infinite maxi- tivity axiom is then not preserved by conditioning (see De Baets et al. A999)). Especially, if
1488 D. Dubois and H. Pnide ?(?) = supiFfi тг(и) holds, П(В\А) = supueBn(u\A) may fail to hold for non-compact events В (De Cooman, 1997). The dual conditional necessity orderings can be unsurprisingly defined as follows: A^CN В if and only if ? >^ A. The relation A^CN В reads "A is at least as necessary as В in the context С >n $"· It is equivalent to С U A ># С U ?, not to А П С ^yv ? ? С. The latter is indeed equivalent to A^yvS or A^n С and is not a proper translation of conditioning as focusing on reference class С For L-valued necessity functions on L, the conditioning of necessity functions by duality property corresponds to the following definition: N(A\C) = ?(?( A\C)). It is easy to interpret the condition N(A\C) > 0 (or A >^ A) by the sentence "A is an accepted belief in the context C". Indeed a basic property of qualitative conditioning is, IfC>/70, thenN(A|C) >0 iff AnC>/jSnC, that is, A and С are conjointly more plausible than С and the negation of A. It easily follows that the set A(C) = {A: N(A\C) > 0} of conditionally accepted beliefs is deductively closed. But it may be the case that N(A) > 0 and N(A\C) > 0. It means that while A is an accepted belief, this belief is rejected, and even refuted in the context C. This is clearly a non-monotonic behavior, that comes close to a form of plausible commonsense inference described in the literature of non-monotonic reasoning, as seen later. One may be tempted to consider that qualitative conditioning on a finite scale is more expressive than ordinal conditioning. This is only partly true because qualitative conditioning cannot express a graceful attenuation of belief, due to the following property (Dubois et al., 1997): THEOREM 1. IfN(A) > 0 then either N(A\B) ^ N(A) orN(A\B) = 0. When ? becomes known, A is either at least as sure as it was, or is canceled (N(A\B) = N(?|?) = 0) or refuted by В (N(A~\B) > 0). Hence, the absolute value of N(B\A) is not meaningful on a finite scale L. When N(A\B) > N(A), one may be tempted to consider that the belief in A is reinforced by the knowledge of B. However if a further piece of plausible knowledge С comes to be known, that somewhat contradicts A, then, N(A\B П С) = 0. This problem does not occur with quantitative conditional possibility defined by ?(?|?) = ?(? ? ?)/?(?), ?(?\?) = 1 - ?(?\?) for which N (A) > N(A\B) > 0 is a feasible situation. 4.2. Plausible inference with a possibility distribution Shoham A988) proposed to understand the default inference of a proposition A from a proposition С in terms of the truth of A in all the best models of C, where the term "best"
Qualitative possibilin functions and integrals 1489 means the maximal models of С in terms of some prescribed relation on the set of states. In the following, this relation is considered to be a non-dogmatic possibility preordering of states ^?, and the best models of a proposition С ? 0 are understood as the maximal models according to this (complete) preordering. These are the most plausible (normal) states when A is true. The restriction to non-dogmatic orderings enables any non empty subset of states С to be considered as a not fully impossible context for conditioning. Let us define a possibilistic consequence relation (=T between propositions, induced by the knowledge of the cpd ^n as follows: С\=ЛА iff СГ\А>пСГ)А. The notation С \=? A, which reads "A is a plausible consequence of С (in epistemic state ?)", is a logical (semantic) substitute to the condition N(A\C) > 0, which stresses the close connection between the idea of conditioning in uncertainty theories and the notion of deduction in non-classical logics. It is easy to see that it exactly matches the intuition of Shoham A988) since it holds that С (=т A iff A is true in the most plausible worlds (in the sense of ^?) where С is true. Using an absolute representation of the possibility ordering on states: С \=? A iff Vs e C, if n(s) = П(С). then s e A. This is pictured on Figure 1. This inference is less demanding than the usual deduction of A from С that requires that all models of С be models of A. It has much intuitive appeal as a reasonable representation of how people make inferences in real life situations, when they have a clear idea of what is normal and what is not. In order to figure out what can be said about the state of the world when the (incomplete) piece of information С is available, an agent will assume the situation is as normal as can be (in the context delimited by C) and will jump to conclusions that can be made under this normality assumption. In order for the inference С \=? A to qualify as a consequence relation of some kind, it is important to characterize it by its properties. For instance it is well known that the standard consequence relation С (= A in logic (corresponding to an inclusion relation of С into A) satisfies the following (semantic) characteristic properties: reflexivity: С \=C; monotonicity: С \= A implies В П С (= A; transitivity: if С (= В and В (= A then С \= A. The relation \=? violates all these properties. In contrast, noteworthy properties of (=,T are as follows: С (=7г С if С ф 0 (restricted reflexivity); ? preferred worlds in С Fig. I. C plausibly implies A.
1490 D. Dubois and H. Prade 0 (=7г A never holds (nihil ex absurdo); if С ? 0, then С (=7г 0 never holds (consistency preservation); if А с ? and С \=? A then С (=? В (right weakening rule RW); if С (=7г A and С (=? ? then ? П С (=л· A (cautious monotony CM); if С (=7г ? and ? П С (=7г A then С (=.т A (cut, or weak transitivity); if С (=7г A and С (=? ? does not hold, then В DC \=? A (rational monotony RM); if ? П С (=7г A then С (=?г ? U A (half of deduction theorem); if ? \=? ?; ? (=? С then ?\=? ВПС (Right AND); if ? (=? С; ? (=? С then A U ? (=,т С (Left OR). Let us comment on these properties so as to assess their cognitive appeal. Reflexivity is restricted to non contradictory propositions. Indeed, the notion of "most plausible model" is not mathematically defined on an empty set. Intuitively С = 0 corresponds to an agent receiving inconsistent information that prevents him from deriving any sensible conclusion. The impossibility to plausibly infer in a conflicting epistemic state is stressed by the "nihil ex absurdo" property. Consistency preservation states that in a coherent context the agent should not infer contradictions. Right Weakening is easily checked from the definition of (=тг. It means that the agent is allowed to make standard inferences from plausible beliefs. The subsequent properties embody the notion of plausible inference in the presence of incomplete information. Namely, they describe the properties of default deduction under the assumption that the state of the world is as normal as can be. The crucial rules are Cautious Monotony and the Cut. Cautious Monotony claims that if in the context C, the normal course of things is that A and В hold, then assuming that В and С hold should not lead us to situations that are exceptional for the context C: A should still normally hold. This property gives some conditions when the monotonicity С \=? A implies В П С \=? ? holds namely, the condition С \=? В. The Cut is the converse rule: If В is believed in context С and A usually holds in the context defined by С and В then, one should take it for granted that A normally holds in context C. The set of these two properties, Cautious Monotony and Cut, is also called cumulativity by Makinson A994). Rational monotony, also introduced by Makinson, is a strengthening of weak monotonicity. The condition under which С \=? A implies В П С Кт A is weakened: it is enough that in the context C, the negation of В is not believed. Rational Monotony is not a condition of the same nature as the other ones, because it uses the negation of a plausible entailment (which differs from the plausible entailment of a negation - remember there are three epistemic attitudes with respect to propositions). The remaining properties are not specific to plausible inference, and hold for standard inference as well. Only half of the deduction theorem holds. Indeed the other half leads to a reduction of the context that is not in agreement with the nonmonotonic nature of plausible inference. The Left OR property enables disjunctive information to be handled without resorting to separately reasoning on cases. The Right Weakening rule, when combined with the Right AND, just ensures that the set A(C) = {A: С \=? A] is deductively closed. The set of plausible inferences ?(?) = {(С, А), С \=? A), considered as default rules, form what Kraus et al. A990) call a rational extension. In fact it has been proved (under various guises, e.g., Lehmann and Magidor A992), Gardenfors and Makinson A994)) that any consequence relation |~ satisfying the above properties and inducing a complete plausibility preordering on S is representable by the
Qualitative possibility functions and integrals 1491 possibilistic consequence relation (=T. There is a representation theorem for possibilistic entailment proper that can be stated as follows (Benferhat et al. A997)). It must first be noticed that given a qualitative possibility ordering ?, the following equivalence hold: A>nB iff ???\=??}. Let |~ be a consequence relation on 25 ? 25; define an induced partial relation on subsets as ?>?? iff AUBhfi- THEOREM 2. A consequence relation |~ satisfies restricted reflexivity, consistency preservation, right weakening, rational monotony, Right AND and Left OR if and only if A > ^ В is the strict part of a possibilistic ordering of events. This theorem precisely describes the nature of possibilistic inference, and the intuitive character of these properties suggest it is an appealing model of plausible reasoning. 4.3. Universal possibilistic entailment In the previous section, a cpd on states induces a set of pairs of events (C, A) corresponding to inferences of the form "in context C, A is typically true". It has been pointed out that the set of beliefs of the agent, obtained as plausible consequences in a given context, is a deductively closed set of propositions. In this section we address the converse problem. Namely suppose a confidence relation ^ is given, that represents the epistemic state of an agent. Define an accepted belief A as one such that A >i A, and an accepted belief A in the context С as one such that АПС >l АПС Suppose the set of accepted beliefs in any non contradictory context, induced by the confidence relation, forms a deductively closed set. What does it enforce on the nature of the confidence relation? The main assumptions we shall use concerning a confidence relation are transitivity of its strict part > ? and its monotony with respect to inclusion. The latter is the orderly property of Halpern A997). Axiom MI. If А с A' and В' с В then A >L B^-A'>L B'. Since a belief set is deductively closed, the compatibility between the confidence relation and the deductive closure property requires that any consequence of an accepted belief is an accepted belief and that the conjunction of two accepted beliefs is an accepted belief (Dubois and Prade, 1995b), hence the two properties: • Consequence stability (CS): if А с ? and ?>? A then В >l B. • AND rule: if A > L A and В > L ? then Anfi>z.AUfi. _ Since the set of accepted beliefs when С holds, Al(C) = {А: А П С >l А П C} is supposed to be closed under deduction the above axioms must be generalized into conditional versions of CS and the AND rule: • Conditional consequence stability (CCS): if Acfi and СПА>^СПА then СП В >LCn~B,
1492 D. Dubois and H. Prade • Conditional AND rule (CAND): if С П A>L С П A~ and СП В >L С Г\~В then С ? ? ? ? >tCll(AUS). CCS and CAND give back the CS and AND rules when С = 5". In Dubois et al. A998a), confidence relations satisfying transitivity and inclusion monotonicity CAND (CCS is implied by MI) are characterized. These kinds of confidence relations are called acceptance relations. It is easy to prove that as soon as transitivity and inclusion monotonicity for >l are assumed, the other properties come down to the following axiom introduced in Dubois and Prade A995b): Acceptance Axiom (Ac). VA, В, С three disjoint events, AUB >L С and AUC>L В =>· A>i5UC. The CCS property is recovered from Axiom MI and the CAND property is equivalent to Ac in presence of Axiom MI. Thus, we can define acceptance preorder relations ^i on a set of events as those relations whose strict part satisfies transitivity, MI and Ac. The relation > l induced by an acceptance preorder will be called a partial acceptance order. Let hi. be the consequence relation induced by this acceptance preorder (C|~z.A iff С П ? >? С П A). Then we get the following result related to Friedman and Halpern A996) results (see Dubois, Fargier and Prade A998a)): THEOREM 3. If the strict part of any confidence preorder satisfies transitivity (T), inclusion monotonicity (MI) and the acceptance axiom (Ac), then it induces a consequence relation that also satisfies restricted reflexivity, Right AND, MI, RW, CM, CUT, Left OR, and CP. Conversely, if a consequence relation |~ satisfies CAND, OR, RW, CM, CUT, restricted reflexivity, and CP, then the confidence relation it defines via (A > l В iff A U ? \^? ?) is a partial acceptance order on disjoint events. Note that this is almost like the properties of the consequence relations induced by possibility orderings except that rational monotony is replaced by the weaker cautious monotony. These properties form the so-called system ? proposed by Kraus et al. A990) who call the consequence relations, satisfying CAND, OR, RW, CM, CUT, and reflexivity, "preferential". Here reflexivity is replaced by CP, only non contradictory contexts are considered and reflexivity is restricted to such non-contradictory propositions. Another result bridges the gap between acceptance orderings and possibility theory, namely, Dubois and Prade A995a) have shown that an inference relation |~ is preferential if and only if there exists a set of nondogmatic possibility relations ? such that if ? ? 0: Ah-? iff V>/7e ?. ??)? >? ???. Hence the only way of representing epistemic states of an agent under the form of a deductively closed set induced by a confidence relation is, under mild assumptions,
Qualitative possibility functions and integrals 1493 by means of a family of possibility orderings. From the point of view of the induced consequence relation, going from a single possibility ordering to a family thereof naturally leads to the loss of the rational monotony property, due to the presence of a negated condition in RM. Only cautious monotony remains. Denote A (=vT В the consequence relation induced by any family VJ of possibility orderings. This consequence relation is called universal possibilistic entailment, and the set A(VJ) = {(?. ?): ? (=?? ?} is called a preferential extension. It can be viewed as a set of default rules. It is the intersection of rational extensions ?(?) induced by each ? in VJ. The universal possibilistic entailment is preferential. The maximal set VJ( |~) of possibility orderings that yield a preferential extension of the form VJ{T) is usually different from and contains ?. Benferhat et al. A999) investigates the structure of VJ( h) from a lattice-theoretic point of view. First, they define the maximum of two comparative possibility distributions, and show that this maximum operation is internal to VJ{ \~~), and that it computes the least upper bound of the two comparative possibility distributions in the lattice-theoretic sense, for the specificity ordering. Dehnition 7. Let ^ and ^?· be two comparative possibility distributions on a set 5", WOPi^,) = (?o ?„) and WOP(^,T) = (E'() ?',,) being their well-ordered partitions. Then Max(^T. ^T) denotes the unique comparative possibility distribution whose associated well-ordered partition is obtained from the set {?,',' ^т\щп.п'^ defined as follows Eq = ?oU Eq, E'l = (Ek UE'k)-( U E'A iork=\ min(/7. n), \i=\.k-\ I by eliminating the empty ?[' and renumbering the non-empty ones in sequence. The following properties of (VJ( h). Max) have been established: • Max is associative. • Max(^,^)e?J(l·)· • Maxj^Tr, ~^?·\ is less specific than ^ and ^-, • If ^тг* is less specific than both >T and ^v (in the wide sense) then it is less specific than Maxj^Tr, ^?·}. It is generally not true that ^,T and ^T- refine Max{^,T. ^v) (in a non-trivial way). To summarize, the set VJ( h) is an upper semi-lattice under operation Max and VJ( h) possesses a least specific comparative possibility distribution ?* (see Lehmann and Magidor A992), Benferhat et al. A998)). The set ?(?*) is called the rational extension induced by VJ( И (hence by VJ). However VJ( |~) has several most specific comparative possibility distributions. A most specific distribution corresponds to a linear ordering on S, or equivalently. to a maximally refined (or informative) comparative possibility distribution. Benferhat et al. A999) have proved that the preferential extension A(VJ) can be generated from the set VJl( h) of linear cpd in VJ( h). neglecting other possibility orderings, namely (А, В) е A{VJ) iff VneVJL(H,A\=n B.
1494 D. Dubois and H. Prade 4.4. Probabilistic interpretations of plausible inference It is tempting to interpret plausible inference in a probabilistic way. Namely, in the spirit of acceptance orderings, the plausible inference of В from A might be interpreted as "the majority of A are B" that is, P(B | A) > 1/2. Unfortunately, the set of plausible inferences in this sense, say [В: Р(В | A) > 1/2} is not closed under deduction. Some properties of preferential inference are violated under this interpretation, like Cautious monotony and Left Or. This is shown by the following counter-examples: Counter-example for CM. 1% of students are male", i.e., P(m \ s) > 0.5, 1% of all students will pass the exam", i.e., P(g \ s) > 0.5, but: % of male student pass the exam", i.e., P(g \ s П m) < 0.5. Counter-example for OR. P(C | A) = 38/66 > 0.5, and P(C \ B) = 38/66 > 0.5, but P(C \ A U B) = 44/100 < 0.5. In order to maintain the semantic link between probabilistic deduction and plausible inference it is necessary either to increase the threshold of plausible inference from 1/2 to a number close to 1, or to restrict the class of probabilities involved. There are two existing approaches: infinitesimal probabilities, and big-stepped probabilities. Adams A975) and more recently Pearl A988) have proposed to interpret the default statement "Generally, if A then B" as P(B | A) ^ 1 - ? where ? is a positive real number arbitrarily close to 0. Let Ve — [?: ?(?, | A,) ^ 1 - ?, / = 1 ?] be a set of probability distributions which are induced by a set of statements of the form: "Generally, if A, then ?,". Define the infinitesimal closure of these statements as the set ?(??) = {(?, ?): VP e VF, P(B | A) ^ 1 - ?(?)} where ???^??(?) = 0. Let ? — {тг, ?, \=? ?,-, / = 1 ?] be the set of cpds induced by the same set of statements expressed as possibilistic inferences. A result in Lehmann and Magidor A992) comes down to proving that the preferential closure ?(?) coincides with A(Ve). However the interpretation of plausible inference in terms of arbitrarily high probabilities is not so satisfactory at the intuitive level. An alternative probabilistic semantics is in
Qualitative possibility functions and integrals 1495 terms of big-stepped probabilities defined in Section 3.4. Under a bsp, each proposition A possesses a "usual" model (it occurs more probably that the disjunction of other models of A). Namely let ? be a linear pod and ? be a bsp inducing the same total ordering of states as the pod. The ordering of events induced by the associated possibility and probability measures are the same because if A and В are two subsets of S, sa and sb being the unique states with maximal probability, P(A) > P(B) is equivalent to p(sa) > P(sb)- In particular, it implies that А \=л В iff P(B \ A) > 1/2. This remark leads to a standard probabilistic semantics for universal inference THEOREM 4 (Benferhat et al. A999), Snow A999)). Let V be a set of big-stepped probabilities, and VJ be the corresponding set of linear pods, then define: A (=pbS В as WP e V, P(B | A) > 1/2. It holds that A (=Pbs В ifandonly if ? (=?,? ?. This result is at the same time good news and bad news for possibility theory. It is good news because qualitative possibility possesses a deep link to probabilistic reasoning. However the above results also point out the limitations of ordinal representations of uncertainty and of the kind of plausible reasoning, based on acceptance and deductive closure advocated in this section. These limitations are closely related to the so-called "Lottery paradox" (Kyburg, 1988). Assume that the statement "A is accepted in context C" is modelled by the property ? (A | C) > a where ? is a fixed threshold close to 1. This model of acceptance fails as illustrated by the following counterexample; consider a lottery game where ? (say 1 000000) lottery tickets are sold. Let А,, В denote the respective events "player / loses" and "all tickets are sold". There is one winner ticket so that the following probabilities can be estimated: Prob(A, | ?) = 1 — l/п. For ? high enough, the statement A, is accepted. Now suppose universal inference is applied, then, using the right AND rule, plausible inference from all В \=yn A,, V/, leads to conclude that ? (=?,? ??=?.« Ai· mat is, all players lose. However, ?,=? n A — 0. since there is one winning ticket, so that in any case Prob(P|, = |.„ A; \ B) — 0. Ordinal possibilistic reasoning fails in this situation. This counterexample has been used to refute the well-foundedness of plausible inference based on acceptance, as opposed to probabilistic reasoning. However the paradox will not be observed if a big-stepped probability is used, since then, {A, P (A | C) > 0.5 (remains consistent and deductively closed. The reason for the paradox is the presence of a supposedly purely random phenomenon in the lottery game, namely a uniform winning probability on the set of players that bought a ticket. In this situation there is nothing typical. The only high probability events are those with many possible realizations, and they are globally inconsistent. On the contrary, big-stepped probabilities account for random phenomena where precise events are observed much more frequently than other ones. Plausible reasoning based on jumping to likely conclusions makes sense only for such situations, not for the lottery game. Bsp's seem to be rare, since the constraints bearing on them are very strong. However in the scope of knowledge representation, the set of states is built in such a way that states represent meaningful clusters of actual situations that can be described using the chosen language. The assumption behind the existence of a bsp is thus that natural languages are built in such a way that they can be used to describe typical course of things. Based on this
1496 D. Dubois and H. Prude assumption, one may even think of deriving a bsp from statistical data in the following way: given a sample space ? and a probability measure P, find a "linguistically meaningful" partition Son ? (in other words, a language), informative enough, such that the coarsening of ? on S is a big-stepped probability. This point of view suggests a way of bridging the gap between statistics and symbolic induction. 5. Independence in qualitative possibility theory The notion of independence has been given a precise definition in the probabilistic framework. Other notions of independence exist, such as logical independence between events. A and В are said to be logically independent if none of the pairs (?, ?), (?, ?), (A, B), and (A, B) is inconsistent. This notion is very weak. Logical independence is implied by stochastic independence. Probabilistic independence has been questioned as to its appropriateness for representing epistemic notions of relevance. When a probability distribution has a frequentist flavor, the meaning of stochastic independence is clear. A and В are independent events if A occurs as often when В occurs as when not, and conversely. But this explanation becomes debatable when the probability distribution is subjective, that is, represents the cognitive state of an agent. Possibilistic independence is a difficult and composite topic whose full elucidation is a big challenge for future research. First, one must distinguish between independence between binary variables and independence between events. Moreover the situation of independence in quantitative possibility theory is significantly different from the one in qualitative possibility theory. 5.1. The paradoxes of probabilistic independence The standard definition of probabilistic independence is via the Multiplication Law: events A and С are probabilistically independent iff Prob( А П C) = Prob( A) · Prob(C). It satisfies the properties: (symmetry) If С is independent of A then A is independent of C. (negation stability) If С is independent of A then С is independent of A. (truth) A and S are independent. Symmetry justifies saying "A and С are independent" instead of "A is independent of C". However interpreting the Multiplication Law in terms of independence is far from being straightforward. It seems more natural to define independence in terms of invariance with respect to conditioning: С is probabilistically independent of A iff Prob(C|A) = Prob(C). It turns out that, for positive distributions, this conditioning-based notion of independence is equivalent to the multiplication law. It follows from the axioms of probability theory that independence defined in this way is a symmetric relation, and that it is not sensitive to negation. However, the symmetry property cannot be guessed by just looking at the equality Prob(C|A) = Prob(C), and negation stability cannot be guessed at all. Both are obtained as a by-product of the additivity axioms of probability theory. They are not intrinsic to the intuitive concept of independence. Moreover, the above
Qualitative possibility functions and integrals 1497 properties (symmetry, negation, truth) are not enough to completely characterize the notion of probabilistic independence (Fine, 1973). In qualitative possibility theory, the two definitions of independence do not coincide. However, when conditioning is defined in the numerical setting via the product rule, then the product-based symmetric independence notion П(А П ?) = ? (A) ¦ ?(?) is trivially equivalent to П(В\ А) = П(В) for positive values and is thus much closer to probabilistic independence. Note that there are no simple properties governing the interplay of probabilistic independence with conjunction and disjunction. For instance it is not true that if A is independent of С and В is independent of С then А П В is independent of C, nor A U В is independent from C. Neither does it hold that if A is independent of С and A is independent of B, then A is independent of С П ?, nor A is independent from CUB. A formal objection to the multiplication Law has been given by Keynes A921), cited in Gardenfors A978). According to Keynes, the following property should be accepted: If С depends on A, and В does not contradict A then С depends on А П B. The intuition behind this postulate is that since A is an argument that affects C, the conjunction of A and В should also be considered as relevant to C, since А П В contains a relevant part, even if P(C) = P(C I An S). Keynes notes that his postulate is not validated by the Multiplication Law. He proposes a stronger definition of probabilistic independence that does it: С is independent of A iff there is no В such that A implies В and Prob(C | ?) ? Prob(C). Carnap A950) has shown that this definition leads to a trivial notion of independence: It entails that С is independent of A only if A = S. A special case of Keynes postulate is the so-called conjunction criterion for dependence (called conjunction criterion for relevance in Gardenfors A978)): (CCD) If С depends on A, and С depends on В and А П ? ? 0 then С depends on ???. Gardenfors A978) has also suggested a conjunction criterion for independence dual to (CCD): (CCI) If С is independent of A, and С is independent of В then С is independent of ???. Moreover Gardenfors A978) considers that the notion of relevance should satisfy the following minimal requirements: (Rl) If A is relevant for С then С is not independent of A. (R2) If A is relevant for С then A is relevant for C. (R3) S is not relevant for С (R4) If С is contingent (= neither S nor 0) then there is some A that is relevant for C. These postulates equate relevance with dependence (i.e., the negation of independence), insist on negation insensitivity (so that 0 is not relevant for C). As seen later, negation insensitivity is not valid for possibilistic independence, and the postulate that there is no middle way between relevance and independence is then questionable. Gardenfors shows that under (Rl )-(R4), CCI + CCD is equivalent to Keynes postulate. Since the conjunction of CCI and CCD leads to trivializations, one has to choose one of them. Gardenfors criticizes the Multiplication Law of probability theory because it does not guarantee any of these principles. He argues that CCD should be given up because it sanctions some events as relevant to С while intuitively they are not. This is due to postulate (R2).
1498 D. Dubois and H. Prude Namely assume A is relevant for C, then generally A U D is also relevant for С even if D is not. Due to (R2), AUOis also relevant for С so that by CCD, we conclude that D is also relevant for C, even if it is not! This remark leads Gardenfors A978) to accept CCI and he proposes a probabilistic concept of independence that satisfies it: С is independent of A iff Prob(A) = 0, or Prob(C|A ? ?) = Prob(C) for all В such that Prob(A П ?) > 0 and Prob(C|fi) = Prob(C). Note that Gardenfors' independence relation is non-symmetric. An alternative attitude is, rather than rejecting CCD, to accept both CCD and CCI and drop (R2). This is allowed for in the possibilistic framework. We show that in an ordinal setting where uncertainty is described by ordering the states of the world according to their plausibility, we capture properties similar to CCI and CCD in terms of disjunction, with much simpler definitions of independence than the above probabilistic one. 5.2. The minimum rule in possibility theory Zadeh A978) introduced a symmetric definition of independence called "non-interactivity" between possibilistic variables that is not based on conditional possibilities. This notion has also been studied by Nahmias A978) for events, under the name "unrelatedness". Dehnition 8. A and В are unrelated propositions in Nahmias's sense (denoted by ALB) iff ? (? ? ?) = ?????(?(?), ? (?)). This definition is clearly the possibilistic counterpart of the multiplication rule of probability theory. It is interesting to characterize the constraints induced by unrelatedness on the ordering of events ???, ???, ???, ??? respectively (that correspond to interpretations if A and В are encoded as literals in logic). The following property is easily obtained: A and В are unrelated if and only if ?(? ? ?) ^ тт(Л(А ? ?), ?{ ? ? ?)). Clearly, A and ? are "related" means that А П ? is an implausible situation (since in any case it holds that П(А П ?) < тт(Л(А), ?(?))), i.e., A and В are (more or less) mutually exclusive: it implies that A and В are "related" if and only if N( B\ A) > 0 and N(A\B) >0. So the terms "related" and "unrelated" proposed by Nahmias are not perfectly appropriate. The joint statement N( B\A) > 0 and N( A\B) > 0 indicates an epistemic form of mutual exclusion between A and В since when A is true it is normally the case that В is true and conversely. On the contrary when A and В are "unrelated", the two propositions are allowed to be true together. So, the minimum rule is a weakening of the logical notion of consistency between A and ?, and we shall call it "epistemic consistency", although the case when ? (A) = 0 or П(В) = 0 is encompassed. This notion is not very demanding. Moreover this notion is local in the sense that it is sensitive to negation: if A and В are epistemically consistent, it does not say anything about the other conjunctions of events A and В, В and A, B and A. Other properties of "epistemic consistency" are as follows (Dubois et al., 1994): A) AJ_C iff CJ_A (symmetry). B) If ALB and ALC then A LB U C.
Qualitative possibility functions and integrals 1499 C) If AJ_fi U С then A±B or AJ_C. D) 0J_A. E) S±A. F) AJ_A. G) AJ_Aiff Л(А) = 0 or Я( A) = 0. (8) A UCLA. Facts B) and C) are disjunction-oriented. However, none of the two conjunction criteria (CCD) and (CCI) are valid with epistemic consistency. The proof of these facts is easy using the max-decomposability of possibility measures and the mutual distributivity of min and max. Note also that fact D) sounds strange in view of the "epistemic consistency" interpretation of J_. In fact A±B means that "either A and В are non-mutually exclusive or at least one of them is impossible". This remark also sheds light on fact G). Facts F) and (8) are certainly strange properties for an "independence" relation, and motivate the search for a stronger notion. De Cooman A997, Part III) adopts the possibilistic independence between events, so as to solve the paradox of Nahmias definition whereby A is unrelated with itself. Lastly, there seems to be no way of characterizing possibility theory by means of J_, the reason being that we cannot express ? (A) = 1. Therefore, we conjecture that (just as for probabilistic independence) J_ cannot be axiomatized alone. 5.3. Possibilistic independence between variables Enforcing epistemic consistency between all values of binary variables corresponds to a generalization of logical independence. It contrasts with the situation in probability where independence between events is insensitive to negation. The variables are said to be non- interactive (Zadeh, 1975). PROPOSITION. Two binary variables .x and у are non-interactive (П(.х. у) — тт(Л(.*). ? (у)) far all ? e {A, A }, у е {?. ? } iff there is a choice of values ? e {A, A] and у e {?, ? }, such that П(х. у) ^ П(.х. у) ^ ? (.?. у) = ?(?. у). When ? takes on values in {0. 1} only, the non interaction between .x and у means that the subset of {?. ?) ? {?. ?) with possibility 1 is itself a Cartesian product of subsets of {A, A } and {?, ? }. Logical independence is when it is {A. A } ? {?, ? } itself. Actually, non-interactivity is a decomposability property of the joint possibility distribution on {A, A } ? {?, ? } into its projections combined by the minimum rule. More generally, let ? and у be two variables (on U and V respectively) linked together through a fuzzy restriction on the Cartesian product U ? V, encoded by a possibility distribution, ?,.?, called a joint possibility distribution. The possibility distribution ?? representing the induced restriction on the possible values of ?, can be calculated as the projection of ??-? on U defined by Zadeh A975): ??(??) = П([и) ? V) — sup7Tv.v(". v).
1500 D. Dubois and H. Prade Generally, ???- ^ ?????(??, ??). When equality holds, ?,·.? is then said to be min- separable. Note that projection is also in accordance with the principle of minimum specificity, since ?? (и) is calculated from the highest possibility value of pairs (x, y) where ? = u. As long as a possibility distribution represents a state of incomplete information, non-interactivity expresses a lack of knowledge about the links between ? and y. Conversely, now consider two variables ? and у with possibility distributions ?? and ?? respectively. The principle of minimum specificity leads us to define the joint possibility distribution ?* ,. induced by these two pieces of information as the "non-informative product" ?*? = minO.y, лу) as a consequence of the inequalities ??.?(?, ?) ^ л>(и), Vu, and лх_у(и, ?) ? ??(?), Vm, which hold for the unknown joint possibility ??.?, since ?? does not presuppose anything about у while ???- does. Note that if there is an unknown link between ? and y, then ?* ? provides upper bounds on degrees of possibility, i.e., the conclusions deduced from the uninformative product are always correct but might be less informative that those deduced from ??_,¦· Indeed some values (и, v) may be considered as possible according to ?* r, while when ??.? < ????(??.??), they are not so. This upper bounding effect is in agreement with the entailment principle which states that if "x lies in A" then "x lies in S", provided that А с ?, or Vm, A(u) ^ B(u) more generally. Note that while in probability theory, independence plays the same role as non-interactivity in possibility theory (probabilistic variables can be assumed to be stochastically independent by virtue of the principle of maximum entropy), stochastic independence does not lead to bounding properties as non-interactivity does. This is because stochastic independence assumes an actual absence of correlation while non-interactivity expresses a lack of knowledge. The projection and the non-informative product extend to several variables and are part of the calculus of fuzzy relations (Zadeh, 1975). The non-interactivity of variables implies that the possibility and necessity measures become decomposable with respect to Cartesian products (? ? ?) and the associated co- products (A + ? = ? ? ? ), respectively: ? (? ? ?) = гат(Д, (А), ПУ(В)), ? (? + В) = max(yVv(A), Ny(B)), where A and В are subsets of the universes U and V (the ranges of ? and )¦) and Д, and Nx are possibility and necessity measures based on the normalized possibility distribution ??. Possibilistic counterparts of probabilistic graphical representations have been proposed. Bayesian networks encode joint probability distributions, by decomposing them on the basis of an arbitrary ordering of the variables, as a product of conditional probability distributions. This decomposition is done in an efficient way, exploiting conditional independence assumptions so as to identify simple acyclic directed graphs representations (DAGs). The local propagation schemes described by Pearl A988) become very efficient if the directed graph has no cycles. These schemes are made possible due to the properties
Qualitative possibility functions and integrals 1501 of the conditional independence relations. A conditional independence relation is a ternary relation relating disjoint subsets of variables ?. ?. ?. denoted (X±Y \ Z) which spells out "X is independent of ? given Z" and is usually understood in the probabilistic sense. The basic properties of these relations are known as "semi-graphoid axioms" (see Pearl A988)): Symmetry: (X±Y | Z) => (Y±X \ Z). Decomposition: (X U W±Y | Z) =>· (X±Y \ Z) and (W±Y | Z). Weak union: (X U W±Y | Z) => (X±Y \Z\JW). Contraction: (W±Y | Z) and (Х±У \Z\JW)=>(XU W±Y | Z). While symmetry is easy to understand, decomposition implies that if X is independent of ? given ? then any variable in X is independent of ? given Z. Weak union implies that learning something about a variable in set X that is independent of ? should not affect our knowledge about Y. The contraction axiom implies that, if a variable x' is independent of ? and if ? is independent of Y, when we learn about x' then it should be that the two variables ? and x' are independent of Y. Probabilistic independence satisfies an additional axiom: Intersection: (X±Y \ZUW) and (X±W \ ? U Y) =>· (Х±У UW|Z). This axiom together with the ones above equips the independence relation with a so-called graphoid structure. The graphoid structure enables a directed acyclic graph Q = (N,A) whose node set N contains variables and A contains directed arcs describing causal links between variables to be read in terms of independence properties. Namely it provides topological separation properties between disjoint sets of nodes of the graph that precisely reflect conditional independence relations among the corresponding sets of variables (See Verma and Pearl A990)). In the possibilistic setting, conditional possibility distributions can be used as a basis for a decomposition of joint possibility distributions, whose structure can be reflected by a DAG Q = (N, A)- Namely using ordinal conditioning, ?(?\ xn) = min ?(jc, | Par(x,)) /=l и where Par(jc() = [x; (x,x,) e A). The corresponding conditional independence notion is of the form n(x,y \z) = mm(n(x \?),?(? \ ;)). This kind of conditional independence based on minimum leads to a semi-graphoid structure. Changing minimum into product (Dempster conditioning) recovers the full graphoid structure. For details the reader is referred to De Campos et al. A995), De Campos and Huete A999). In the min-based framework, hypergraph representations sound more adapted. Recent activity on possibilistic networks rather deals with the problem of learning from data (Gebhardt and Kruse, 1996, 2000; Sanguesa et al., 1998a, 1998b). Benferhat et al. B002) study how to encode directed possibilistic graphs in possibilistic logic.
1502 D. Dubois and H. Prade Other decomposable representations for ordinal possibility distributions on Cartesian products of universes have been much less studied and prove to be much more restrictive than in the finitely scaled or numerical setting due to the non-commensurateness of orderings (see Ben Amor et al. B000)). 5.4. Possibilistic event independence based on conditioning Other notions of qualitative independence between events have been introduced based on ordinal conditioning (Dubois et al., 1994, 1997). They are not equivalent, but stronger than the minimum rule in possibility theory. A first proposal is to define В as independent of A when the conditional measure of В given A is equal to the unconditional measure of B. Here we have two uncertainty functions ? and N. Hence we can define independence either as ?(? \ A) = П(В) or as N(B | A) = N(B). But since N(B \ A) = N(B) is equivalent to ?( ? | A) = ?( ?) one of them is enough. Let us consider the equality N(B \ A) = N(B) in order to express the positive statement that A is irrelevant to (the level of acceptance of) B. It turns out that this notion of irrelevance is rather heterogeneous because it expresses the disjunction of two distinct forms of independence according to whether ? is a priori believed or not: (i) N(B\ A) = N(B) = 0. (ii) N(B | A) = N(B) > 0. The first condition holds if and only if 1 = тах(Я( ? ? ?), ? (? ? ??)) and ? (? ? б) ^П(АГ)В). The second condition holds if and only if Л (? ? ?) > ? (? ? ~?) > ? ( ? ? ?). The two situations (i) and (ii) correspond to (almost) reversed orderings of conjunctions. Moreover in (i), В is either ignored or a priori rejected both a priori and in the context A, again a heterogeneous situation. Worse, the invariance of the degree of certainty via conditioning is not stronger than the minimum rule: N(C\A) = N(C) does not imply AJ_C. For instance if П(А П С) > Я(АПС) and Л( АПС) > Я(АПС) then/V(C|A) = N(C) = 0 while A±C obviously does not hold. Instead, N(C\A) = N(C) implies AJ_C so that П(С\А) = ? (С) is stronger than AJ_C. Now, in possibility theory, the knowledge about В is expressed by the pair (N(B), N(B)) and includes the three situations where С is accepted, rejected or unknown. It leads to recognize three situations of independence in the absolute form: (Al) N(B_\ A) = N(BJ >0(henceyV(fi | A) = N(~B) = 0). (Al') N( В | A) = N( В ) > 0 (hence N(B |_A) = N(B) = 0). (II) N(B | A) = N(B) = N(B\A) = N(B) = 0. _ Condition (Al) (respectively (АГ)) means that believing В (respectively: B) is not affected by A, while the third condition means that A does not inform about B. In the first and the second situation we shall speak of absolute epistemic independence В (respectively B) vis a vis A, where the term "absolute" refers to the stability of the level of acceptance, and the term "epistemic" indicates that it is assumed for this notion that В is
Qualitative possibility functions and integrals 1503 an accepted belief. Absolute epistemic independence N(B \ A) = N(B) > 0 implies A±B since ? (А П ?) > П(А П В) holds, and it is stronger. The property (II), which is never met in the probabilistic case, means that in the presence of A, B, which was originally ignored, is still ignored. In this case, we shall speak of uninformativeness of A about В (or C, equivalently). Condition (II) holds iff Л (А П ?) = Л(АП ?)= 1 or Л( АП ?) = Л(АП ?) > Л(АП В) = Л(АП ?). It is obvious that when A does not inform about В then A±B holds (since Л(АПв) = Л(АПв)). So uninformativeness is stronger than A±B. The contrary of uninformativeness is that the agent starts accepting В when learning A. We shall speak of epistemic causality. These notions are not symmetrical, and the first two are sensitive to negation. A is then said to be irrelevant to С if any one of the three above conditions (??), (??) and (II) holds. Irrelevance is thus taken as a generic notion of invariance of the level of belief with regard to evidence. The absolute epistemic independence may be seen as too strong. Recall that in the ordinal setting the preservation of absolute values does not mean much. Especially the situation where N(B) > N(B | A) > 0 is provably impossible as recalled earlier. There cannot be any form of dependence expressing a belief attenuation effect. So the reinforcement N(B | A) > N(B) > 0 does not make much sense. Moreover, absolute epistemic independence enforces the inequality ?(??)?)^?(??)?) which implies that in the context where В would be false, it is forbidden to conclude that A can be accepted. Indeed, the absolute epistemic independence property reads: В is absolutely independent of A if and only if N(B | A) > 0 and not (N( A \ В) > 0). This may be viewed as counterintuitive for an agent declaring that his acceptance of В is "independent" from the knowledge of A and not wishing to commit himself when В is false. A milder notion of independence is that if В is accepted unconditionally, then if A is true, В remains accepted; then we do away with any commitment in the case when В would turn out to be false. Hence the following definition of qualitative epistemic independence: Dehnition 9. The acceptance of В is qualitatively independent of A (denoted ? ?> ?) iff N(B | A) >0andyV(B) > 0. It expresses that В is an accepted belief independently of A and only requires ? (? ? ?) > ? (? ? ?) and ? (?) > ? (A). It is non-symmetric and sensitive to negation. It can be verified that this notion implies, but is stronger than, the condition ?(? f] В) = тт(П(А),П(В)). An antonym notion of qualitative relevance, denoted A ~> В can be defined as not (А ф> В) and S ф> В. А «> В is not equivalent to not (A j=> B) so that Gardenfors (R2) does not hold. It is such that A *«> В is equivalent to N(A) > 0 and N(A | ?) = 0. There are two situations, according to whether В refutes A (N( A \ B) > 0) or simply cancels it (?(?\ ?) = 0). These notions have been studied in detail and axiomatized by Dubois et al. A997). They examine the validity of Keynes-Gardenfors criteria of Section 1 in the ordinal setting of possibility theory. Namely the following requirements: (CCD) If A «> С and В «> С then А П В «> С. (CCI) If А ф> С and В ф> С then ? ? ? ?> С
1504 D. Dubois and H. Prade They also consider symmetric counterparts of (CCD) and (CCI): (CCD-r) If A %> В and A^> С then A «> В П С. (CCI-r) If ? ?> В and ? ф> С then A ^> В П С. and the corresponding properties changing conjunction into disjunction ((DCD), (DO), etc.). (DCI) If А ф> С and В ф> С then A U В ?> С (DCI-r) \i Аф> В and А ^> С then А ф> В U С (DCD) ИА^> С and ? *«> С then A U ? ^> С. (DCD-r) If А ^> В and А «> С then А «> ? U С. First the minimum rule J_ satisfies the four above criteria concerning disjunctions. Moreover (CCD-r), (CCI-r), (DCI), (DCD) hold for absolute and qualitative epistemic independence. The qualitative independence has the following stronger properties: А ф> В П С iff ? ?> В and А ф> С. АГ\Вф>С iff А ^> С and ? ^> С. They are (DCI) and (CCI-r) with equivalence. This is natural if qualitative independence is considered in terms of belief revision: if I continue to accept В ПС upon learning A, I should continue to accept С and В as well. Overall, eight axioms are used to completely characterize qualitative independence, namely: (QI1) S ф> S (tautologies do not undermine tautologies). (QI2) If А ф> С then ? ?> ? U С (right weakening). (QI3) If S ?> A then ??> A (if С is believed, then it cannot undermine itself). (QI4) If ? ?> ? then 5" ?> ? (if A is known not to undermine В then ? is a priori believed). (QI5) ??> A never holds. (QI6) If А ф> С and В ф> С then A U В ?> С (left OR rule, i.e., (DCI)). (QI7) If A U ? ф> С then A ^> С or A U В ф> A (similar to rational monotony). (QI8) If А ф> В and ? ?> С, then A ?fe- В П С (stability under conjunction for acceptance, i.e., (CCI-r)). Note that S ?> A means N(A) > 0. Especially, (QI4) says that when believing В does not depend on А, В must be an a priori belief. (QI2), (QI3) and (QI7) are typical of qualitative independence. (QI1), (QI2), (QI4), (QI5), (QI6) and (QI8) are valid for absolute epistemic independence. Now the following characterization theorem holds, showing the axiomatic equivalence of ?> with possibility theory (Dubois et al., 1997): THEOREM 5. Let ?> be a relation on events, and N a mapping from the set of events to [0, 1] such that А ф> В iff N(B | A) > 0 and N(B) > 0. Then ф> is a qualitative independence relation iff N is a necessity measure. The proof uses a remarkable property that relates qualitative independence and possibility orderings, namely for any possibility ordering >n, the following equivalence holds: VA,C,AUC^>C iff A>nC
Qualitative possibility functions and integrals 1505 For such pairs of events of the form (A U C. C), qualitative independence A U С ?> С is equivalent to absolute epistemic independence. Hence it is possible to axiomatize qualitative relevance as well. One uses the fact that A U В ~> В iff П( В) > ? (?) ^ ?{ A). It can also be done by translating the relevance relation back to the qualitative independence one, and the result is closely related to the one of Farinas del Cerro and Herzig A996) in their setting of theory revision. 5.5. Qualitative independence and belief revision Several notions of independence and relevance defined above, among which qualitative dependence, can be fully expressed in the framework of revision of propositional theories also called belief sets (Gardenfors, 1988). A central problem for the theory of belief revision is what is meant by a minimal change of a state of belief. As pointed out in Gardenfors A990), "the criteria of minimality that have been used in the models for belief change have been based on almost exclusively logical considerations. However, there are a number of non-logical factors that should be important when characterizing a process of belief revision." Gardenfors focuses on the notion of independence (he considers it as synonymous to the term 'irrelevance') and proposes the following criterion: If a belief state К is revised by a sentence A, then all sentences in К that are independent of the validity of A should be retained in the revised state of belief. This seems to be a very natural requirement for belief revision operations, as well as a useful tool when it comes to implementing belief change operations. As noted by Gardenfors, "a criterion of this kind cannot be given a technical formulation in a model based on belief sets built up from sentences in a simple propositional language because the notion of relevance is not available in such a language". However the above criterion does make sense in the ordinal setting of possibility theory. We suppose given a belief set ?, that is, a set of propositions closed under deduction, and an AGM revision operation * (Gardenfors, 1988). К * A represents the result of revising К by a proposition A. According to Gardenfors and Makinson's characterization theorem, if the revision operation satisfies the so-called AGM postulates (Alchourron et al., 1985) then К and * can be represented equivalently by an epistemic entrenchment ordering, which in turn is nothing else than a qualitative necessity ordering (Dubois and Prade, 1991). Conversely any qualitative necessity ordering leads to an AGM revision operation. Namely, given a necessity function ?, the set К = {С, N (С) > 0} is a belief set. This is because N is an acceptance function (Dubois and Prade, 1995b) that is, {C. N(C) > 0} is closed under conjunction and logical entailment. Moreover, it can be proved that the revision operation * is described in terms of possibility theory as follows: С belongs to К * A is equivalent to N(C\ A) > 0 (Dubois and Prade, 1992) (also equivalent to N( A U C) > N(A)). If we translate the various definitions of possibilistic independence and relevance of this section in terms of revision we get: Qualitative independence: ? ?> С iff С e К and С е К * A. Relevance: A cancels С iff С е К and С <?K*A and С ? К * A. A refutes С iff С е К and С е К * A.
1506 D. Dubois and H. Prade Epistemic causality: A justifies С iff С ф К and С е К * ?. Uninformativeness: A does not inform about С iff С ? К, С ? К, С ? К * A and С <?K*A. Qualitative independence exactly corresponds to Gardenfors' above requirement for revision-based independence. The operation opposite to revision is contraction. Contracting a belief set К by a proposition A means deleting A from К as well as propositions that possibly enable A to be derived from К - {A}, so as to obtain a belief set К - A that does not contain A. The Harper identity (Gardenfors, 1988) defines contraction in terms of revision as follows: К — A= К Г) (К * A), i.e., first revise К to accept A and keep only those formulas in К * A that are in К. Conversely К *A = Cn((K - A)U{A}), where Cn is the consequence operation. This is Levi's identity whereby revising by A means contracting A first and then adding A. Companion definitions of qualitative independence and relevance relations ?>? and ss>(. can be associated to a contraction operation "-" via the following definition: Аф>,С iff CeKmdCeK-A iff С е К - A, A^>CC iff CeK and C?K-A iff N(C) > 0 and N(A) > N(A U C). It is easy to check that Levi and Harper's identities can be written in terms of independence relations as follows: А ф>с С iff ? ?> С and A «*><- С iff A ^> C. Farinas del Cerro and Herzig A996) have proved the equivalence between axioms characterizing possibilistic relevance ~>( and an AGM contraction operator. Similarly, postulating the equivalence between ??> С and С е К * A, it can be proved that axioms (QI1)-(QI8) are equivalent to the AGM postulates for revision. 6. Qualitative integrals Mathematical notions similar to integrals can be defined for finitely-scaled set-functions. Of course, such "integrals" will not be additive, and the role of the addition and the multiplication will be played by the maximum and the minimum indifferently. Moreover, qualitative integrals can be defined only for functions ranging on the same scale as the set- function. Since we shall stick to the finite setting, we shall get qualitative counterparts of weighted averages for possibility and necessity functions. Considering more general set- functions, so-called Sugeno integrals are obtained. They are the qualitative counterparts of Choquet integrals. Qualitative integrals can be used to evaluate the likelihood of fuzzy events. Events, which we would like to evaluate the uncertainty of, are often ill-defined: e.g., we would like to estimate whether somebody is "rich" without being able to describe in precise quantitative terms what it means. Possibility and fuzzy set theory jointly offer a simple tool for modelling both the partial lack of available information and the ill-definition of relevant events. Fuzzy events are represented by a function from the set of states S to a totally ordered scale L (L-fuzzy sets, Goguen A967)). Set-functions representing partial belief can be generalized by extending the usual concept of "event" to fuzzy events via
Qualitative possibility functions and integrals 1507 qualitative integrals and the algebra of events to sets of functions, so as to accommodate fuzzy events. Algebras of fuzzy events are no longer Boolean. Decision theory is also a framework where these notions of integral make sense, and can be axiomatized. However in the purely ordinal case, defining the counterpart of integrals is tricky due to commensurateness problems between orderings. In Savage's A972) framework, the consequence of a decision depends on the state of the world in which it takes place. For instance, in a restaurant one must choose between dishes without being totally sure that the food is good. If 5 is a set of states and X a set of possible consequences, a decision is a mapping from S to X. The decision-maker has some knowledge of the actual state and some preference on the consequences of his decision. The usual approach to decision-making under uncertainty is based on representing uncertainty on states by a unique probability distribution and the decision-maker by a utility function on consequences. Then decisions are selected on the basis of expected utility. When uncertainty is just due to a lack of information, and only the result of the present decision matters, the idea of expectation is less attractive. For this reason a purely possibilistic approach to decision under uncertainty that addresses this kind of situation has been developed (Dubois, Prade and Sabbadin, 1998, 2001). It exploits possibilistic and Sugeno integrals as preference functionals serving as decision rules. 6.1. Possibilistic integrals Consider a pod ? defined on a finite set S and ranging on a totally ordered scale L. Let / be a function from S to L. Let 1 /(i)^a denote the characteristic function of {s; f(s) ^ a]. Note that the obvious identity holds: f(s) = maxmin(a, 1 flf)>a(s)). C/€L So / can always be considered as a multistep function. Then possibility and necessity integrals can be defined in the usual style from the possibility and necessity functions ? and N induced by ?, as: n(/) = maxmin(ci, Я(/ ^?)), N(f) = maxmin(a, N(f ^a)). C/€L C/€L These expressions reduce to what Zadeh A978) and Dubois and Prade A980) respectively call possibility and necessity of fuzzy events, noticing that function / is the membership function of a fuzzy set: Я(/) = maxmin(/(i), n(s)), N(f) = minmax(/(i), v(n(s))), where ? is the order-reversing function of L, noticing that Я(/ ^ a) = v(N(f < a)) and that maxa€L min(a, N(f ^ a)) = min„gi. max(a, N(f > a)). The possibility and necessity integrals can be viewed as prioritized versions of the maximum and the minimum,
1508 D. Dubois and H. Prade the possibility values тг(и) acting as weights (Dubois and Prade, 1986), while the usual integral reduces to a weighted average. Possibility and necessity integrals have been used as a semantic matching process between the representation of the available information (?) and the description of some request in the form of a fuzzy event F. Possibility theory enriches the symbolic pattern matching paradigm of Artificial Intelligence, allowing for graded semantic matching (Cayrol et al., 1982; Dubois et al., 1988). Given a collection of objects for which we have some (possibly incomplete) knowledge about the value of an attribute x, represented by means of a possibility distribution for each object, the set of objects such that "x is F" is an ill-known set described by a pair of fuzzy sets, namely the fuzzy set of objects which more or less possibly satisfy the requirement "x is F" and the fuzzy set of objects which more or less necessarily (i.e., certainly) satisfy it. Membership grades to such fuzzy sets are calculated using possibility and necessity integrals. For two-valued functions fA, that can be used as characteristic functions of a subset A of S, the expression of the qualitative integrals is rather simple: assume fA(s) = a if s e A and fA(s) = ? if s ? A , with a > ?. Observe that n(fA) = max(/3, тт(Я(А), a)) = min(a, тах(Я(А). ?)), N(fA) = max(/3, min(yV(A), a)) = min(a, max(/V(A), ?)). Note that the possibilistic integrals of the binary function are the medians of {а, Я (A), ?], and {a, N(A), ?} respectively, thus contrasting with usual integrals, which are based on averaging. This view of possibilistic integrals as medians remains valid in the general case, as can be seen from the definitions based on level-cuts of /. Assume S = {s\, jt, ..., s,,,}, and denote F, = {s„{i),.. .,s„(,n)}, where ? is a permutation such that /(??(?)) ^ /(¦VB)) ^ ¦ ¦¦ ^ f(sn(m))- ri(f) is actually the median of 2m — 1 terms f(s\),..., f(s,„), n(F2),-..,n(Fi„), and N(F) is obtained similarly. As another illustration, the value f(s) of function / may represent the utility of some decision made in state s. The possibilistic integral N(fA) is a kind of pessimistic preference functional that is easy to understand: if the agent is sure enough that A occurs (N(A) > a) then the utility of the decision is a. If the agent has too little knowledge (max(jV(A), N( A)) < ?), then cautiousness prevails and the utility is /3,_the worst case. Of course the same happens if the agent is at least somewhat certain that A occurs. If the agent's certainty that A occurs is positive but not extreme, the preference functional N(fA) reflects the certainty level and is equal to N(A). The possibilistic integral Я(/А) is a kind of optimistic preference functional: the value of Я(/А) is high (= a) as soon as the agent believes that obtaining ? is possible enough (Я(А) > a). In particular, suppose ? is the characteristic function of a subset ? of states. In other words, the agent only knows that the state of the world is in E, then the possibilistic integrals reduce to: Я(/) = тах/E), N{f) = mmf{s) s e ? jet" which coincide with well-known maximax and maximin criteria in decision theory. More generally, the inequality Я(/) ^ N(f) holds, often with strict inequality, so that N(f) can
Qualitative possibility functions and integrals 1509 be called a pessimistic possibilistic integral and Л(/) an optimistic possibilistic integral. These functionals are special cases of Sugeno integrals studied below. The optimistic criterion has been first proposed by Yager A979) and the pessimistic criterion, under the form of a maximin disutility, by Whalen A984), and also used in Inuiguichi et al. A989). The main properties of possibilistic integrals are: - monotonicity: If / ^ g (statewise) then Л(/) ^ 17(g) and N(f) ^ N(g); - maxitivity of Л(/): Let / ? g be the function from S to L defined, for any s e S, by / ? g(S) = max(/(i), g(s)). Then Л(/ ? g) = max( Л(/), Л(#)); - minitivity of N(f): Let / л g be the function from S to L defined, for any s e S, by / л g(s) = min(/(i), g(s)). Then N(f Ag) = min(/V(/), N(g)); - homogeneity properties: Consider truncated functions /?? and / л ? for any ? e L: weak minitivity of Л(/): Л(/ ? ?) = тт(Л(/), ?), weak maxitivity of N(f): N(f ??) = max(jV(/), ?). More general forms of possibilistic integrals have been studied. The generalized optimistic possibilistic integral can be defined by: Л*(/) = sup f(s)*n(s) where * is a conjunctive-like operation on L called a semi-norm (De Cooman, 1997). This is a fuzzy conjunction that is not necessarily commutative or associative, but is increasing in the wide sense, and such that 1 * ? = ? * 1 = ?. Then, Л*(/) = П(А) when / is the characteristic function of a set A, and, Л*(/) = Л(/) when * = min. Similarly, the generalized pessimistic possibilistic integral can be defined by: ?,(/) = ??(?E),/E)) where / is an implication operation on L, that is, decreasing in the first place and increasing in the second place (in the wide sense), and such that /A,0) = 0, /@,0) = /A, 1) = / @, 1) = 1. Viewing function / as the membership function of a fuzzy set F and choosing / of the form ? (?, ?) = ? (a * ?(?)), the identity ? (A) = v(N(A)) is extended to fuzzy events, i.e., n*(F) = v(N*(~F)) for the usual fuzzy set complementation Fc'(¦) = v(F(-)), where у denotes an order-reversing function on L. If operation * is increasing in the wide sense in both places, the characteristic properties of possibility and necessity measures with respect to disjunction and conjunction of fuzzy sets still hold (respectively defined by the pointwise maximum and the minimum of membership functions). ??? v 8) = тах(Л*(/), Я»(^)). /V»(/ л g) = min(yV*(/), NAg))- Moreover, if ? * ? ^ min(a, ?), then Л(/) > W(/), V/ also holds (if ? is normalized). Inuiguichi and Kume A994) provide a systematic study of possibility and necessity measures of fuzzy events with various implications. They point out three kinds of fuzzy necessity integrals:
1510 D. Dubois and H. Prade - strong ones such that N(f) = 1 if and only if n(s) > 0 implies f(s) = 1 (the support of ? is contained in the core of /); this is when ?(?,?) = max(l -?, ?), for instance; - medium ones such that N(f) = 1 if an only if ? ^ /; this is when ? (?, ?) = 1 as soon as a < ?, especially residuated implications, see Bouchon-Meunieret al. A999); - and weak ones such that N(F) = 1 if and only if n{s) > 0 implies f(s) > 0. Shilkret A971) integral is defined for L = [0, 1] by /prod fdn = sup а П(/> a). It is an optimistic possibilistic integral where the minimum is replaced by a product and values lie in the unit interval. In that case, it is easy to see that / , f dil = Л*(/) with * = product. This notion is also studied by Agbeko A991) under the name "optimal average". More generally, r-norm- and conorm-based possibility integrals have been studied by De Cooman et al. A992), Yan A993,1994), De Cooman and Kerre A996). They correspond to the following integral-like schemes, called normed and conormed possibility integrals, respectively: f*fdn = SUp„;>(,<* * П(/ > <*)- f±fdN = Ma>0a±N(f>a). The first of these expressions is equal to Л*(/) if * is a triangular norm. The second one is of the form N/ (/) for / = v(a) J_ ? if J_ is a triangular conorm-like operation. On such a basis, De Cooman A997, Part I) outlines a theory of possibilistic integration, where possibility levels are taken in a complete lattice, and the operation * is a semi-norm. He proves possibilistic counterparts of Radon-Nikodym and Fubini theorems. In a series of papers, Mesiar A992, 1994, 1997) studies the representation of possibility and Shilkret integrals via Markov kernels. See also Benvenuti, Mesiar and Vivona B002) (Chapter 33 in this Handbook). 6.2. Axiomatics for possibilistic integrals It makes sense, if information is qualitative, to represent the incomplete knowledge of an agent on the state by a possibility distribution ? on S with values in a plausibility scale L and the agent's preference on the set of consequences X of decisions by means of another possibility distribution ? with values on a preference scale U. The utility of a decision d whose consequence in state s is ? = d(s) e X for all states s, can be evaluated by combining the plausibilities n(s) and the utilities ?(?) using the two quantitative criteria based on possibilistic integrals for evaluating the worth of decisions, provided that a commensurability assumption between plausibility and preference is made (Dubois, Prade and Sabbadin, 2001). Maximizing the pessimistic integral means to find a decision d, all the highly plausible consequences of which are also highly preferred. The definition of "highly plausible" is decision-dependent and reflects the compromise between high plausibility and low utility
Qualitative possibility functions and integrals 1511 expressed by an order-reversing map between L and U that make utility values and possibility values commensurate. The pessimistic integral is small as soon as there exists a possible consequence of the decision which is both highly plausible and bad with respect to preferences. This is clearly a risk-averse and thus a pessimistic attitude. However, contrary to the maximin Wald criterion, the pessimistic possibilistic criterion focuses on the idea of usuality and relies on the worst plausible consequences induced by the decision. Some unlikely states are neglected by a variable thresholding and the threshold is determined by the commensurateness mapping between utilities and possibilities. In contrast, the other criterion corresponds to an optimistic attitude since it is high as soon as there exists a possible consequence which is both highly plausible and highly prized. The pessimistic criterion has been axiomatically justified by Dubois and Prade A995c) in the style of Von Neumann and Morgenstern utility theory; see Dubois et al. A999) for the simplified set of axioms given below. The idea is that if the uncertainty on the state is represented by a finitely scaled pod ?, each decision induces, on the set of consequences X. a possibility distribution such that ?(/(?) = ?(?~](л)). So ranking decisions comes down to ranking possibility distributions on X. Assume the decision-maker supplies an ordering between possibility distributions on X. thus expressing his attitude in front of risk, that is, in front of various possibilities of happy and unhappy consequences in X. Let nj(x) e L be the plausibility of getting ? under decision d. The question is to know what kind of axioms on the ordering between possibility distributions on X make it representable by the ranking of decisions according to the above pessimistic or optimistic criteria. Let (?/?, ?/?') denote the "qualitative lottery" yielding ? with plausibility ? e L and ?' with plausibility ? e L. Of course, ???3?(?, ?) = 1. The following are the pessimistic axioms: A) The set of possibility distributions is equipped with a structure (!>=. ~. >-) of complete preordering, where >.~. > are the weak preference, indifference and strict preference. B) Independence: ?\ ~ ?? =>· (?/??. ?/?) ~ (?/??. ?/?). C) Continuity: if ? > ?' then 3? e L. ?' ~ A/?. ?/?). D) Reduction of lotteries: (?/(?/?. ?/?'), ?/?') ~ (?/?. ?/?') where ? = min(a, ?) and ? = ??3?(?, ????(?, ?)). E) Risk aversion: ? ^ ?' =>· ?' )? ?. The risk aversion axiom states that the less informative is ?. the more risky the situation is perceived by the agent: for a pessimistic decision-maker, the worst epistemic state is total ignorance. Continuity says that the utility of ? goes down if the uncertainty about ? raises. It can be proved that if the knowledge is represented by a subset A of possible states, then 3x e A . ? ~ A. This property, violated by expected utility, suggests that contrary to it, the pessimistic utility is not based on the idea of average and repeated decisions, but makes sense for one-shot decisions. The reduction of lotteries suggests that if lottery (?/?, ? /?') is embedded in the lottery (?/?, ?/?') at place ?, ? obtains in two sequential steps with possibility min(a, ?) and ?' obtains either directly with possibility ? or in two steps with possibility ????(?, ?). The reduction of lotteries property is due to the fact that the whole setting relies on a qualitative counterpart of the structure of probabilistic mixtures that underlies the Von Neumann and Morgenstern approach (Herstein and Milnor, 1953). This is due to the closure of the set of possibility measures under weighted maxima: if ? and ?' are
1512 D. Dubois and H. Prade possibility measures, then тах(/(Л), /'(Я')) is a possibility measure again (Dubois and Prade, 1990) where / and /' are monotonic transformations of the possibility scale, with /@) = /'@) = 0, and max(/(l), /'A)) = 1. This is the only aggregation that preserves possibility measures. Let ? с L2 be the set of ordered pairs (?. ?) such that max(a, ?) = 1. A possibilistic mixture set is a pair (G, M) where G is a set of possibilistic lotteries, ? is a function from G ? G ? ? to G that computes for each pair of possibilistic lotteries ? and ?' and each pair of weights (?, ?) in ? another element of G denoted ?(?, ?'; ?, ?) such that (Dubois etal., 1996): Ml: ?(?,?'\ 1,0) = ?; M2: ?(?,?';?,?) = ?(?',?;?.?); ?3: ?(?(?, ?'; ?, ?), ?': ?, ?) = ? (?.?'; min(a, ?), max(min(/3, ?), ?)). Ml says that lottery (?/?,?/?1) is ?; ?2 says that lotteries (?/?,?/?') and (?/?', ?/?) are the same; M3 is axiom 4 (which can be dropped if M3 is adopted as a computation rule). M1-M3 imply the following: M4: ?(?,?;?,?) = ?\ M5: ?(?(?, ?'\ ?, ?), ?(?, ?'; ?, ?); ?, ?) = ?(?, ?'; max(min(a, ?), min(y. ?)), max(min(/S, ?), min(<5, ?))). The last expression explains the form of the optimistic criterion. The form of the other criterion is obtained by working with degrees of necessity (ordered pairs (?, ?) such that min(a,/3) = 0). An alternative axiomatic justification of possibilistic integrals has been developed in the style of Savage (Dubois, Prade and Sabbadin, 2001). The problem is one of representing a complete preordering ^ over the set of functions from a finite set S to a finite set X by means of a functional whose form is dictated by the properties of this preordering. The functions d in D = Xs represent the set of potential decisions of an agent, X being the set of consequences of decision, d(s) being the consequence of choosing decision d in state s. Let А с S be an event, / and g two functions in Xs, and denote by f Ag the function such that fA8(s)-[g(s) if si A. A set of axiom for the pessimistic possibilistic integral is as follows: SI: (D, » is a complete preorder. Let > denote the strict part of this complete preorder. S2: There exist two functions d and d' where d > d' (non triviality). S3: If fx and fy denote constant functions such fx(s) = ? and fy(s) = y, Vs e S, then for any function d e Xs and any event A: fx > fy implies fx Ad > fy Ad. S4: If d > d' and d > d" then d > d' ? d", where d' is a constant function. S5: lfd' >d and d" > d then d' a d" > d. THEOREM 6 (Dubois et al. A998a)). If the preference relation over D satisfies the five above axioms then there exists a possibility distribution ? on S, a preference function ? on X and an order-reversing function between plausibility and preference scales such that d > d' if and only if ?(???) > ?(???') using the pessimistic possibilistic utility.
Qualitative possibility- functions and integrals 1513 The optimistic criterion can be justified likewise, moving the constant act condition from axiom S4 to S5. In fact axioms SI, S2, S3 are enough to show that the uncertainty on states is captured by a monotonic (Sugeno) set-function. We have also proved that SI, S2, S3, S4, and S5 restricted to constant acts d', are enough to show that the preference relation > is representable by a Sugeno integral (Dubois. Prade and Sabbadin, 1998). S5 ensures the min-decomposition of the uncertainty measure (which is then a necessity function). The pessimistic nature of S5 can be guessed from its equivalent form: d' Ad > d implies d ^ d Ad'. If decision d is improved by adopting d' when A occurs it means that A is sure enough to occur and so, there is no way of improving decision d by changing it in case A does not occur (the agent neglects this possibility of non-occurrence). 6.3. Sugeno integrals In this section, the possibilistic integrals are extended to more general confidence functions, also called fuzzy measures (Sugeno, 1974). Fuzzy measures are monotone increasing (with respect to set inclusion) set-functions. Namely if A is a subset of В it is assumed that ?(?) ^ ?(?), where ?(?) is a degree of confidence of A. In his thesis, Sugeno A974) was inspired by fuzzy logic operations in formalizing the concept of integral of a function F valued on a totally ordered scale L, with respect to a fuzzy measure ?. Replacing addition and multiplication of numbers by max and min, respectively, he arrived at the following definition where / ^ a = [u e U: f(u) ^ a) is the level-cut of the function from S to L (Sugeno, 1974, 1977): f f ??= sup min(a, ?(/ ^ a))· For a careful comparison of Sugeno integral and Lebesgue integral, see Ralescu and Adams A980). Note that if ? varies on a numerical, continuous scale, this definition can also be written as jfdq>= inf max(a. ?(/ ^ a))· Consider the case of a finite set S = (s\ , s,,,}. The Sugeno integral reduces to the following expression: ffd<p = max mm(f(s„a)).<p({s„{i).....s„{llu})) i = l »i = min max(f(s„ii)).<p({s„{i-\) s„im)})) l = \ ??? using an increasing reordering of vector (f(s\). f(sm)) according to a permutation ?. f f ?? is actually the median of 2m — 1 terms \f(s\) f(s„,).<p({sa{2) s„{m)}), ¦ ¦ -,??????)})} (Dubois and Prade, 1980). Another form of Sugeno integral is an extension of Boolean polynomials: f f d<p= max mini min/(.?). ?(?)) = minmax(max/(i). ?(?)\. A V.se.A / ? V (?д /
1514 D. Dubois and H. Prade Viewing / as a utility function, these forms suggest that Sugeno integral achieves an optimistic (respectively pessimistic) compromise between the level of confidence in each event and the degree of utility of the worst (respectively best) state where the event occurs (respectively does not occur). Possibility and necessity integrals are particular cases of Sugeno integrals, where ? is a possibility measure and a necessity measure respectively. Namely / fail = 17(f) and / fdN = N(f); see for instance Dubois and Prade A980), Inuiguichi et al. A989), Grabisch et al. A992). Other special cases of Sugeno integral are the ordered weighted min and max (see, e.g., Dubois et al. A988)). If ?(?) only depends upon the cardinality of A, so that <?({??(,-),..., ??(„?)}) = ?? (and фт+\ = 0) one gets of the form: OWmm(u) = min, = | „, max@, + |, f(s„U))), OWmax(u) = max, = | ,„ min(</>,-, f(s„{i))). whose values coincide with the median of 2m - 1 terms f(s\) f(snl),Ф2, ¦ ¦ -,фтН ? (A) = 1 if | A | ^ к and 0 otherwise, it corresponds to the requirement that a decision must be good enough at least in any к states. This will be modelled by m — к + 1 weights equal to 1, i.e., ф, = 1 if 1 ^ i ^ m - к + 1, ф,¦ = 0 for i > m - к + 1. More generally the set of weights can be derived from a fuzzy quantifier of the form "most": a fuzzy subset Q of integers s.t. ??@) = 0, ??(/') ^ ??(/ + 1), ??(??)= 1; then define 0, = 1 - ??(/ - 1), for /' = 1,..., m + 1. For a detailed study of Sugeno integral and its characterization as an aggregation operation, see Marichal B000), Dubois, Prade and Sabbadin A998). For the mathematics of continuous Sugeno integral, see Wang and Klir A992), Pap A995). Sugeno integral satisfies the following properties, where ??? denotes the function / such that f(s) = ? if s e A and ? otherwise: - Increasing monotonicity (in the wide sense): if / ^ h then f f ?? >//? ??. - Idempotency: if / is a constant function and f(x) = ?, then j f ?? — ?. - Non compensation. For any ACS and ? e S, /\????, /????? e [?(?), ?}. - Weak minitivity and maxitivity // /\??? = mini j f ??. ?), // ? ??? = maxi f f ??, а). - Comonotonic minitivity and maxitivity: if / and h are comonotonic that is, Vs, s1 e S, f(s) > f(s') implies h(s) ^ h(s'), then //????? = ?\??(//??,/????) and // ? \??? = maxi j f ??, $\???\. The last property emphasizes the similarity between Sugeno integrals and Choquet integrals which are additive with respect to comonotonic functions. Then, we have the following three characterizations of Sugeno integral, see Marichal A998, 2000): THEOREM 7. Let M(f) be ? functional from Lx to L. The following assertions are equivalent
Qualitative possibility functions and integrals 1515 (i) ? is increasing, idempotent, and поп compensatory, (ii) ? is increasing and weakly minitive and maxitive, (iii) ? is increasing, idempotent, and comonotonic minitive and maxitive, (iv) there exists a confidence measure ? such that M(f) =//??. Similar axiomatization of Sugeno integrals have been obtained in the Savage setting by Dubois, Prade and Sabbadin A998), restricting S4 and S5 above to when d' is a constant act. De Campos et al. A991) and Ralescu and Sugeno A996) prove representation theorems for Sugeno integrals on the basis of comonotonic minitivity and maxitivity. Numerical variants of Sugeno integrals exist such as Shilkret integral introduced above, changing minimum and maximum into more general operations. Later on, generalized variants of Sugeno integral with t -norms and conorms in the finite setting have been considered by Suarez Garcia and Gil Alvarez A986), Grabisch at al. A992), De Cooman and Kerre A996). 6.4. Comparing functions without commensurateness Let S and X be two finite sets (interpreted as a state space and a consequence space in decision theory). Assume there is a cpd >T on S, and another complete preordering ^ ? on X. The former describes the agent's knowledge and the latter describes the preferences of the agent on consequences. Let the set of functions Xs be equipped with a strict preference relation >, that is, a transitive and irreflexive relation. Let ^ denote the associated non strict relation such that / ^ h if and only if h > f does not hold. The relation > represents the agent's preferences among decisions. The question addressed in this section is to what extent such a relation on Xs can be constructed from >T on S and ^/> on X. This problem is easily solved if ^л on S and ^/> on X are modeled by functions ? and / from S to L and X to L respectively, where L is a totally ordered set, using possibility integrals for instance. Here we do not assume the existence of L. To this aim, a set of axioms, that slightly differs from the ones of Savage, is proposed by Dubois etal. A997, 2000): • S'|: (Xs, >) is a transitive, irreflexive, partially ordered set. • St: (Xs, >) satisfies the sure-thing principle: f Ah ^ gAh if and only if /Ah' > gAh'. • WSTP: if fAg > g and gAf ^ g then f)?g. • S3: VA с 5", A not null, (fx > fy)A if and only if л ^Р у. • S4: Vx,y,x',y' e X s.t. ? >p y,x' >P y',xAy )^xBy <?> х'Ау' !>= x'By'. • S'y 3x, y, ? three constant acts such that fx > fx > fz- Axioms of the form S, are originally from Savage. Axioms of the form S- are variants of Savage axioms. Note that the weak preference relation !>= is not transitive. The sure-thing principle enables two notions to be simply defined, namely conditional preference and null events. Function / is said to weakly preferred to g, conditioned on A if and only if V/?, f Ah ^ gAh. This is denoted by (/ !>= g)A. An event A is said to be null if and only if V/, v?. (/ > g)A holds. WSPT is called the weak sure thing principle (Grant et al., 1997). It is a consequence of the sure thing principle if the weak preference relation ^ is transitive.
1516 D. Dubois and H. Prade Again, we identify the set of constant functions [fx, ? e X} and X. The preference on X can be induced from (Xs, !>=) as follows. Given (Xs, », the preference relation ^p on X is of the form Vjc, у e ?, ? ^p у if and only if fx !>= fy. A likelihood relation on 25, that is, a binary relation >l.w among events can be defined by A >lx\ В iff xAy > xBy for ? >p y. Property S4 enables events A and В to be consistently compared by fixing ? > ? у arbitrarily when selecting binary events ? Ay and xBy. All relations >(..„¦ are then the same and denoted >l- Since we are working in a purely ordinal setting, the relative position of / and g with respect to relation > should more generally not depend upon the particular values f(s) and g(s), but only on the fact that f(s) >p g(s) or not. Changing / into /' and g into g' such that the relative positions of images of each state is not altered should not affect the preference between functions. Two pairs of functions (/, g) and (/', g') will be called statewise order-equivalent whenever Vs e S, f(s) ^/> g(s) if and only if f'(s) ^p g'(s). This is denoted (/, g) = (/', g'). It leads us to postulate the following requirement (Fargier and Perny, 1999): Ordinal Invariance axiom. @1). V/, /', g, g', if (/, g) = (/', g') then (/^o The following results can be proved (Fargier and Perny, 1999; Dubois et al., 2000). THEOREM 8. Under assumptions that 01 holds, and that relation ^ on acts is reflexive, it follows that S2, and S4 hold too. Function / is better than g in state s if f(s) > P g(s). Denote [f >p g]= {s, f(s) > ? g(s)} the set of states where / is better than g. THEOREM 9. Under the assumption of ordinal invariance, if the weak preference on acts is reflexive and the induced weak preference on consequences is complete, the weak preference relation on X is complete and transitive, and the only possible definition of the ordering on functions is via the so-called likely dominance rule: f>g&[f>pg]>L [g >p f] where >p is the projection of>onX (due to S3) and >L is the projection of > on events (due to S4). The latter result enables the partial ordering on functions (acts) to be reconstructed from a likelihood relation on events and a preference relation on consequences. It can be proved that if a possibilistic likelihood relation is chosen for >t, then the induced partial ordering on functions does satisfy S\, St, S3, WSTP, S4, S^ and 01. Actually, starting with a regular possibilistic ordering ^/7 on events, lifting it to acts using the likely dominance rule and reprojecting the relation on acts back to events, yields the possibilistic likelihood relation refining ^/7 (see Section 3.4). Moreover, starting from a strict preference ordering satisfying S'h S3, WSTP, S'5 and 01, the induced
Qualitative possibility functions ami integrals 1517 likelihood relation is preadditive and satisfies two already encountered properties that are characteristic of the acceptance partial orderings (see Section 4.3): A >l A and AC; В imply В >l В for А, В, С disjoint. AUC>ifiandfiUC>tA imply С >LAU В. These properties imply that >i is generated by a family of cpds on S (Dubois et al., 1997, 2000) and lead to non-monotonic inference rules in the sense of Kraus, Lehman and Magido (Friedman and Halpern, 1996). However the class of relations on Xs thus induced is rather poor. This is because the considered problem is almost the same as the one addressed by Arrow A951) when he studied the theory of voting in the ordinal setting and proved a famous impossibility theorem. Assume S is a set of voters and XlS a set of candidates. The set (Xs. ^s) is the preference profile of voter s, defined by / ^s g iff f(s) ~^p g(s). Then the likely dominance rule is a generalized Condorcet rule. For instance suppose we start from a single cpd on S. Then the following relations are obtained: Suppose all states are equally possible. Then the Pareto ordering is recovered f>g iff f(s)^P g(s)ys апаЪ. f(s)>P g(s). This is known as the unanimity rule in voting theory. If there is a linear order on states ¦Л >? ^2 >? ¦ ¦ ¦ >? sn then the ordering of functions is defined by: f>g iff 3fc such that/(л,-) ~/>g(i,-).V' <kmaf(sk) >Pg(sk). In other words, the ordering of the functions is the one induced by the most plausible discriminant state under a lexicographic scheme. It corresponds to the presence of a sequence of more or less important dictators in the corresponding voting procedure. In the general case of a well-ordered partition of S: {??· E\ E„}, the functions in Xs must be ordered in the following way: / > g iff 3k such that f(s) ~P g(s), V.v e ?,, V/ < k. and/E) ^p g(s),Vs e Ek and 3j e Ek, f(s) >p g(s) where ~/> is the equivalence relation induced from the complete preordering ^p. It corresponds to an oligarchy Eq of most plausible states where the unanimity rule applies; otherwise, if / and g are statewise preferentially equivalent on E\, the ordering decision is made by ?2; if there is no unanimity when comparing / and g on E\, then / and g are not comparable. These relations, viewed as decision rules are either too adventurous, or too undecisive (Mas-Colell and Sonnenschein, 1972). It is one more piece of evidence showing the difference between the qualitative settings with and without and underlying totally ordered scale. More general results along this line are proved in Dubois, Fargier and Perny B002).
1518 D. Dubois and H. Prade 7. Conclusion This chapter has proposed an overview of set-functions valued on a totally ordered scale, with emphasis on the simplest of such set-functions, found in possibility theory. The corresponding relations on a power set describing the relative confidence of events have been studied. It has been shown that the qualitative counterpart of many measure-theoretic and probabilistic notions such as conditioning, independence and integrals still make sense in a finite ordinal setting. These notions bear strong connections with nonmonotonic reasoning and decision theory. References Adams. E.W. A975). The Logic of Conditionals. Reidel. Dordrecht. Agbeko, K. A991). Optimal average. Ann. Univ. Sci. Budapest. Sect. Сотр. 16. 5-10. (Full version: Report 73/1990. Mathematical Institute. Hungarian Academy of Sciences. Budapest.) Alchourron, C.E.P.. Gardenfors. P. and Makinson, D. A985). On the logic of theory- change: Partial meet functions for contraction and revision. J. Symbolic Logic 50. 510-530. Ben Amor, N.. Benferhat. S.. Dubois. D.. Geffner. H. and Prade. H. B000). Independence in qualitative uncertainty frameworks, Proc. of the 7th Intemat. Conf. on Principles of Knowledge Representation and Reasoning (KR2000). Brekenridge, CO A1 avril-15 avril 2000). 235-246. Arrow, K. A951). Social Choice and Individual Values. Wiley. New York. Benferhat, S., Dubois. D. and Prade. H. A997). Nonmonotonic reasoning, conditional objects and possibility theory; Artificial Intelligence 92. 259-276. Benferhat, S., Dubois. D. and Prade. H. A999). Possibilistic and standard probabilistic semantics of conditional knowledge, J. Logic Comput. 9, 873-895. Benferhat, S. et al. A998). Л general approach for inconsistency handling and merging information in prioritized knowledge bases. Proc. of the Intemat. Conference on Principles of Knowledge Representation and Reasoning (KR'98). Trento. Italy. June 1998. Benferhat, S.. Dubois, D.. Garcia. L. and Prade. H. B002). On the transformation between possibilistic logic bases and possibilistic causal networks. Intemat. J. Approx. Reason. 29. 135-173. Benvenuti, P.. Mesiar. R. and Vivona. D. B002). Monotone set functions-based integrals. Handbook of Measure Theory, E. Pap. ed.. Elsevier, Amsterdam. 1329-1379. Bouchon-Meunier, В., Dubois. D.. Godo. L. and Prade. H. A999). Fuzzy sets and possibility theory in approximate and plausible reasoning. Fuzzy Sets in Approximate Reasoning and Information Systems. J.C. Bezdek. D. Dubois and H. Prade. eds. The Handbooks of Fuzzy Sets Series. Kluwer. Boston. 15-190. Boutilier. С A994). Modal logics for qualitative possibility theory: Intemat. J. Approx. Reason. 10. 173-201. Butnariu. D. and Klement. E.P. B002). Triangular norm-based measures. Handbook of Measure Theory. E. Pap. ed., Elsevier. Amsterdam. 947-1010. Carnap. R. A950). Logical Foundations of Probability. Chicago. Cayrol. M.. Farreny. H. and Prade. H. A982). Fuzzy pattern matching. Kybemetes 11. 103-116. Chateauneuf, A. A996). Decomposable capacities, distorted probabilities and concave capacities. Math. Soc. Sci. 31. 19-37. De Baets. В.. Tsiporkova. E. and Mesiar. R. A999). Conditioning in possibility with strict order norms. Fuzzy Sets and Systems 106, 221-229. De Campos. L.M.. Gebhardt. J. and Kruse. R. A995). Axiomatic treatment of possibilistic independence. Symbolic and Quantitative Approaches to Reasoning and Uncertainty, С Froidevaux and J. Kohlas. eds. Lecture Notes in Artif. Intel.. Vol. 946. Springer. Berlin. 77-88. De Campos, L.M.. Lamata. M.T. and Moral. S. A991). Л unified approach to define fuzzy integral. Fuzzy Sets and Systems 39. 75-90. De Campos. L.M. and Huete, J.F. A999). Independence concepts in possibility theory. Part I: Fuzzy Sets and Systems 103. 127-152: Part II: Fuzzy Sets and Systems 103, 487-505.
Qualitative possibility functions and integrals 1519 De Cooman. G. A997), Possibility theory - Part I: Measure- and integral-theoretics groundwork: Part 11: Conditional possibility: Part III: Possihilistic independence. Intemat. J. Gen. Systems 25 D). 291-371. De Cooman, G. and Kerre. E. A996), Possibility- and necessity integrals. Fuzzy Sets and Systems 77. 207-229. De Cooman. G.. Kerre. E. and Vanmassenhove. F.R. A992). Possibility theory and integral-theoretic approach. Fuzzy Sets and Systems 46. 287-300. De Finetti. B. A937). La prevision, ses his logiques et ses sources subjectives, Ann. Inst. H. Poincare 7, 1-68. Dempster. A.P A967). Upper and lower pwbabilities induced by a multivalued mapping, Ann. Math. Statist. 38. 325-339. Dubois. D. A986). Belief structures, possibility theory and decomposable confidence measures on finite sets. Comput. Artificial Intelligence (Bratislava) 5 E). 403^416. Dubois. D., Fargier. H. and Prade. H. A996). Refinements of the ma.x-inin approach to decision-making in fuzzy environment. Fuzzy Sets and Systems 81. 103-122. Dubois. D.. Fargier. H. and Prade. H. A997). Decision-making under ordinal preferences and comparative uncertainty, Proc. of the 13th Conf. on Uncertainty in Artificial Intelligence. D. Geiger and P.P. Shenoy. eds. Providence. RI, Aug. 1-3. 1997. Morgan Kaufmann. San Francisco. CA. 157-164. Dubois, D.. Fargier. H. and Prade. H. A998a). Comparative uncertainty, belief functions and accepted beliefs. Proc. of the 14th Conf. on Uncertainty in Artificial Intelligence. Madison. WI. G. Cooper and S. Moral, eds, Morgan Kaufmann, San Francisco. CA. 113-120. Dubois. D.. Fargier, H. and Prade. H. A998b), Possihilistic likelihood relations. Proc. of 7th Intemat. Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU'98. Paris). Editions Medicales et Scientifiques. Paris. 1196-1202. Dubois. D.. Fargier. H.. Pemy. P. and Prade. H. B000). Qualitative Decision Theory: From Savage's Axioms to Non-Monotonic Reasoning. IRIT Research report n° 02-04R. Toulouse. Dubois. D.. Fargier. H. and Pemy. P. B002). On the limitations of ordinal approaches to decision-making. Proc. of the 8th Intemat. Conf. on Principles of Knowledge Representing and Reasoning (KR2002). Toulouse. France. April. 2002, Morgan Kaufmann, San Francisco. CA. 133-144. Dubois. D.. Farinas del Cerro. L., Herzig. A. and Prade. H. A994). An ordinal view of independence with application to plausible reasoning. Proc. of the 10th Conf. on Uncertainty in Artificial Intelligence, Seattle. WA, July 29-31, R. Lopez de Mantaras and D. Poole, eds, 195-203. Dubois, D.. Farinas del Cerro, L.. Herzig. A. and Prade, H. A997). Qualitative relevance and independence: A roadmap. Proc. of the 15h Inter. Joint Conf. on Artificial Intelligence (IJCAI'97. Nagoya. Japan, August. 23-29), 62-67. Extended version: A roadmap of qualitative independence. Fuzzy Sets, Logics and Reasoning about Knowledge. D. Dubois, H. Prade and E.P Klement. eds. Applied Logic Series. Kluwer, Dordrecht, 1999. 325-350. Dubois, D., Fodor, J.C.. Prade. H. and Roubens, M. A996), Aggregation of decomposable measures with application to utility theory. Theory Decis. 41. 59-95. Dubois. D., Godo. L., Prade. H. and Zapico, A. A999). On the possihilistic decision model: from decision under uncertainty to case-based decision. Intemat. J. Uncertain., Fuzziness and Knowledge-Based Systems 7 F), 631-670. Dubois. D. and Prade. H. A980). Fuzzy Sets and Systems, Theory and Applications. Academic Press. New York. Dubois, D. and Prade. H. A982). A class of fuzzy measures based on triangular norms, Intemat. J. Gen. Systems 8,43-61. Dubois D. and Prade H. A986), Weighted minimum and maximum operations. Inform. Sci. 39. 205-210. Dubois, D. and Prade, H. A988), Possibility Theory. Plenum Press. New York. Dubois, D. and Prade, H. A990). Aggregation of possibility measures. Multiperson Decision Making Using Fuzzy Sets and Possibility Theory. J. Kacprzyk and M. Fedrizzi. eds. Kluwer. Dordrecht. 55-63. Dubois, D. and Prade. H. A991). Epistemic entrenchment and possihilistic logic. Artificial Intelligence 50. 223— 239. Dubois. D. and Prade. H. A992). Belief change and possibility theory. Belief Revision, P. Gardenfors, ed., Cambridge University Press. UK. 142-182. Dubois, D. and Prade, H. A995a). Conditional objects, possibility theory and default rules. Conditionals: From Philosophy to Computer Science. G. Crocco. L. Farinas del Cerro and A. Herzig. eds. Oxford University Press. 311-346.
1520 D. Dubois and H. Prade Dubois. D. and Prade. H. A995b). Numerical representations of acceptance, Proc. of the 11th Conf. on Uncertainty in Artificial Intelligence. P. Besnard and S. Hanks, eds. 149-156. Dubois. D. and Prade. H. A995c). Possibility- theory as a basis for qualitative decision theory. Proc. of the Inter. Joint Conf. on Artificial Intelligence (IJCAI'95. Montreal. August). 1924-1930. Dubois, D. and Prade, H. A998), Possibility theory: qualitative and quantitative aspects. Quantified Representation of Uncertainty and Imprecision, Handbook on Defeasible Reasoning and Uncertainty Management Systems - Vol. 1, P. Smets, ed.. Kluwer, Dordrecht, 169-226. Dubois, D., Prade, H. and Sabbadin. R. A998). Qualitative decision theory with Sugeno integrals. Proc. of the 14th Conf. on Uncertainty in Artificial Intelligence. Madison. WI. G. Cooper and S. Moral, eds, Morgan Kaufmann. San Francisco. CA. 121-129. Also in: Fuzzy Measures and Integrals. M. Grabisch, T. Murofushi and M. Sugeno. eds, Physica-Verlag. 2000. 314-322. Dubois. D., Prade, H. and Sabbadin. R. B001). Decision theoretic foundations of possibility theory. Eur. J. Oper. Res. 128, 459^478. Dubois. D., Prade, H. and Testemale, С A988), Weighted fuzzy pattern matching. Fuzzy Sets and Systems 28. 313-331. Fargier. H. and Pemy. P. A999). Qualitative models for decision under uncertainty without the commeiisurability assumption, Proc. of the 15th Conf. on Uncertainty in Artificial Intelligence, Providence, RI, K. Laskey and H. Prade, eds. Morgan Kaufmann. San Francisco, CA, 157-164. Farinas del Cerro, L. and Herzig. A. A991). A modal analysis of possibility theory, Proc. of the Inter. Workshop on Fundamentals of Artificial Intelligence Research (FAIR'91). Smolenice Castle. Czechoslovakia. Sept. 8-12. 1991. Ph. Jorrand and J. Kelemen. eds. Lecture Notes in Comput. Sci.. Vol. 535. Springer. Berlin. 11-18. Farinas del Cerro, L. and Herzig. A. A996). Belief change and dependence. Proc. of the 6th Conf. on Theoretical Aspects of Rationality and Knowledge (TARK'96), De Zeeuwse Stromen. The Netherlands, March 17-20, 1996, Y. Shoham, ed., Morgan Kaufmann, San Francisco. CA. 147-161. Farinas del Cerro, L., Herzig, A. and Lang. J. A994). From ordering based non-monotonic reasoning to conditional logics, Artificial Intelligence 66. 375-393. Fine, T.L. A973), Theories of Probability, Academic Press, New York. Fishbum. P. A986a), The axioms of subjective probabilities. Statist. Sci. 1. 335-358. Fishbum, P. A986b). Interml models for comparative probability on finite sets. J. Math. Psych. 30. 221-242. Friedman. N. and Halpem. J. A996). Plausibility measures and default reasoning. Proc. of the 13th National Conf. on Artificial Intelligence (АААГ96. Portland). 1297-1304. To appear in J. Assoc. Comput. Mach. Gardenfors. P. A978), On the logic of relevance. Synthese. 351-367. Gardenfors. P. A988). Knowledge in Flux. MIT Press. Cambridge. Gardenfors. P. A990), Belief revision and irrelevance. PSA 2. 349-356. Gardenfors, P. and Makinson. D. A994). Non-monotonic inference based on expectations, Artificial Intelligence 65. 197-245. Gebhardt, J. B000). Learning from data: possibilistic graphical models. Abductive Reasoning and Uncertainty Management Systems. Handbook on Defeasible Reasoning and Uncertainty Management Systems - Vol. 4. D. Gabbay and R. Kruse, eds. Kluwer. Dordrecht. 315-389. Gebhardt, J. and Kruse. R. A996). Automated construction of possibilistic networks from data. Appl. Math. Comput. Sci. 6. 529-564. Goguen. J.A. A967). L-fuzzy sets. J. Math. Anal. Appl. 8. 145-174. Grabisch. M. B002). Set functions over finite sets: transformations and integrals. Handbook of Measure Theory. E. Pap, ed.. Elsevier. Amsterdam. 1381-1401. Grabisch. M.. Murofushi, T. and Sugeno. M. A992). Fuzzy measure officzy events defined by fuzzy integrals. Fuzzy Sets and Systems 50. 293-313. Grant, S.. Kajii. A. and Polak. B. A997). Weakening the sure-thing principle: decomposable choice under uncertainty, Workshop on Decision Theory, Chantilly. France. June 12-14. A revised version. "Decomposable Choice under Uncertainty", is to appear in J. Econom. Theory. Grove. A. A988). Two modellings far theory change. J. Philos. Logic 17. 157-170. Halpem. J. A997), Defining relative likelihood in partially-ordered preferential structures. J. Artificial Intelligence Res. 7. 1-24. Hajek, p. and Harmancova. D. A994). A qualitative fi<zzv possibilistic logic. Intemat. J. Approx. Reasoning 12. 1-19.
Qualitative possibility· functions and integrals 1521 Hajek, P.. Harmancova, D.. Esteva. F., Garcia, P. and Godo. L. A994), On modal logics for qualitative possibility in a fuzzy setting. Proc. of the 11th Conf. on Uncertainty in Artificial Intelligence. R. Lopez de Mantaras and D. Poole, eds. Morgan Kaufmann. San Francisco. CA. 278-285. Herstein. I.N. and Milnor. J. A953). An axiomatic approach to measurable utility. Econometrica 21. 291-297. Hisdal. E. A978). Conditional possibilities independence and noninteraction. Fuzzy Sets and Systems 1. 283— 297. Hughes. G.E. and Cresswell. M.J. A968). An Introduction to Modal Logic. Methuen. London. Inuiguichi. M.. Ichihashi, H. and Tanaka. H. A989). Possibilistic linear programming with mensurable multiattribute value functions, ORSA J. Comput. 1. 146-158. Inuiguichi, M. and Kume. Y. A994). Necessity measures defined by level set inclusions: Nine kinds of necessity measures and their properties. Intemat. J. Gen. Systems 22. 245-275. Keynes. J.M. A921). A Treatise on Probability MacMillan. London. Klir. G.J. and Harmanec. D. A994), On modal logic interpretations of possibility theory. Intemat. J. Uncertain.. Fuzziness. and Knowledge-Based Syst. 2. 237-245. Koopman. B.O. A940), The bases of probability, Bull. Amer. Math. Soc. 46. 763-774. Kraft. C.H.. Pratt. J.W. and Seidenberg. A. A959). Intuitive probability on finite sets. Ann. Math. Statist. 30. 408^419. Kraus, S.. Lehmann. D. and Magidor. M. A990). Nonmonotonic reasoning, preferential models and cumulative logics. Artificial Intelligence 44, 167-207. Kyburg. H. A988), Probabilistic inference and nonmonotonic inference. Uncertainty in Artificial Intelligence. Vol. 4. R. Schachter et al.. eds. North-Holland. Amsterdam. 319-326. Lang, J. A996), Conditional desires and utilities. Proc. European Conference on Artificial Intelligence (ЕСАГ96, Budapest). Wiley. New York. 318-322. Lehmann. D. and Magidor. M. A992). What does a conditional knowledge base entaiP.. Artificial Intelligence 55 A), 1-60. Lewis. D. A973), Counter/actuals. Basil Blackwell. London. Makinson. D. A994). General patterns in nonmonotonic reasoning. Nonmonotonic Reasoning and Uncertain Reasoning. Handbook of Logic in Artificial Intelligence and Logic Programming -Vol. 3, Oxford University Press. UK. 35-110. Marichal, J.-L. A998), Aggregation operators for multicriteria decision aid. Ph.D. thesis. Institute of Mathematics. University of Liege. Liege. Belgium. Marichal. J.-L. B000), On Sugeno integral as an aggregation function. Fuzzy Sets and Systems 114. 347-365. Mas-Colell, A. and Sonnenschein. H. A972). General possibility theorems for group decisions. Rev. Econom. Stud. 39, 185-192. Mesiar, R. A992). Characterisation of possibility measures offiizzy events using Markov kernels. Fuzzy Sets and Systems 46. 301-303. Mesiar, R. A994). On the integral representation offiizzy possibility measures. Intemat. J. Gen. Systems 21, 109-121. Mesiar, R. A997). Possibility measures, integration, and fuzzy possibility measures. Fuzzy Sets and Systems 92. 191-196. Moulin. H. A988). Axioms of Cooperative Decision Making. Cambridge University Press. Cambridge. MA. Nahmias. S. A978), Fuzzy variables. Fuzzy Sets and Systems 1 B). 97-110. Pap. E. A995). Null-Additive Set-Functions, Kluwer. Dordrecht. Pap, E. B002). Pseudo-additive measures and their applications. Handbook of Measure Theory. E. Pap, ed., Elsevier. Amsterdam. 1403-1468. Pearl. J. A988), Probabilistic Reasoning Intelligent Systems: Networks of Plausible Inference, Morgan Kaufmann. San Mateo. CA. Ralescu. D. and Adams. G. A980). The fuzzy integral. J. Math. Anal. Appl. 75. 562-570. Ralescu. D. and Sugeno, M. A996). Fuzzv integral representation. Fuzzy Sets and Systems 84. 127-133. Ramer. A. A989). Conditional possibility measures. Cybernet. Systems 20. 233-247. Ramsey, F.P. A963), Truth and probability. Studies in Subjective Probability. H. Kyburg and H.E. Smolker, eds. Wiley. New York. 61-92. Roubens, M. and Vincke. P. A985). Preference Modelling. Lecture Notes in Econom. Math. Systems. Vol. 250. Springer. Berlin.
1522 D. Dubois and H. Prade Savage. L.J. A972). The Foundations of Statistics, Dover. New York. Sanguesa. R..Cabos, J. and Cortes. U. A998). Possibilistic conditional independence: A similarity-based measure and its application to causal network learning. Intemat. J. Approx. Reason. 18, 145-167. Sanguesa. R.. Cortes. U. and Gisolfi. A. A998), A parallel algorithm for building possibilistic causal networks. Intemat. J. Approx. Reason. 18. 251-270. Schmeidler. D. A986), Integral representation without additivity. Proc. Amer. Math. Soc. 97 B). 255-261. Shackle. G.L.S. A961). Decision, Order and Time in Human Affairs. 2nd edn, Cambridge University Press. UK. Shafer. G. A976). A Mathematical Theory of Evidence. Princeton University Press. Princeton. NJ. Shilkret, N. A971). Maxitive measure and integration. Indag. Math. 33, 109-116. Shoham, ? A988), Reasoning about Change - Time and Causation from the Standpoint of Artificial Intelligence, The MIT Press. Cambridge. MA. Smets, P. and Kennes, R. A994). The transferable belief model. Artificial Intelligence 66. 191-234. Snow. P. A994). The emergence of ordered belief from initial ignorance, Proc. 11th National Conf. on Artificial Intelligence (AAAI-94. Seattle. WA), 281-286. Snow. P. A999). Diverse confidence levels in a probabilistic semantics for conditional logics. Artificial Intelligence 113, 269-279. Spohn. W. A988). Ordinal conditional functions: A dynamic theory of epistemic states. Causation in Decision. Belief Change and Statistics. W. Harper and B. Skyrms. eds. 105-134. Suarez Garcia. F. and Gil Alvarez. P. A986). Two families offuzzx integrals. Fuzzy Sets and Systems 18. 67-81. Sugeno. M. A974), Theory of fuzzy integrals and its applications. Ph.D. thesis. Tokyo Institute of Technology. Japan. Sugeno, M. A977). Fuzzy measures and fuzzy integrals -A sunvy. Fuzzy Automata and Decision Processes. M.M. Gupta. G.N. Saridis and B.R. Gaines, eds. North-Holland. Amsterdam. 89-102. Verma. T. and Pearl. J. A990). Causal networks: semantics and expessiveness. Uncertainty in Artificial Intelligence, Vol. 4. R.D. Schachter et al.. eds. North-Holland. Amsterdam. 69-78. Walley. P. A991), Statistical Reasoning with Imprecise Probabilities. Chapman and Hall. Wang. Z. and Klir. G. A992), Fuzzy Measure Theory. Plenum Press. New-York. Weber, S. A984). Decomposable measures and integrals for Archimedean t-conorms. J. Math. Anal. Appl. 101. 114-138. Whalen. T. A984). Decision making under uncertainty with various assumptions about available information. IEEE Trans. Systems Man Cybernet. 14. 888-900. Wong. S.M.K.. Yao. Y.Y. Bollman. P. and Burger. H.C. A990). Axiomatization of qualitative belief structure. IEEE Trans. Systems Man Cybernet. 21. 726-734. Wong, S.M.K.. Yao, Y.Y. and Lingras, P. A993). Comparative beliefs and their measurement. Intemat. J. General Systems 22. 69-89. Yaari. M.E. A987). The dual theory of choice under risk. Econometrica 55. 95-115. Yager. R.R. A979), Possibilistic decision making. IEEE Trans. Systems Man Cybernet. 9. 388-392. Yan, B. A993), Semiconormed possibility integrals for application-oriented modelling. Fuzzy Sets and Systems 57, 239-248. Yan, B, A994). Seminormed possibility integrals for application-oriented modelling. Fuzzy Sets and Systems 61, 189-198. Zadeh. L.A. A975), Calculus of fuzzy restrictions. Fuzzy Sets and Their Applications to Cognitive and Decision Processes, L.A. Zadeh, K.S. Fu. M. Shimura and K. Tanuka. eds. Academic Press. New York. 1-39. Zadeh, L.A. A978). Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets and Systems 1. 3-28.
CHAPTER 37 Measures of Information Wolfgang Sander Institute for Analysis, Technical University of Braunschweig. Pockelsstr. 14, 38106 Braunschweig, Germany E-mail: w.sander@tu-bs.de Contents Introduction 1525 1. Probabilistic and non-probabilistic information measures 1526 2. Branching inset information measures 1531 3. Recursivity and generalized additivity 1534 4. Recursive measures of multiplicative type 1536 5. Regular weighted (/,m)-additive measures of degree (?. ?) 1540 6. Subadditive information measures for random vectors 1543 7. Information measures in a theory of evidence 1546 8. Measures of fuzziness 1550 9. Weighted entropies 1555 10. Summary '561 References 1562 HANDBOOK OF MEASURE THEORY Edited by Endre Pap © 2002 Elsevier Science B.V. All rights reserved 1523
This page intentionally left blank
Measures of information 1525 Introduction There is no doubt that the concept of information plays nowadays an important role since it has so many aspects and applications in different areas, see for example the books of Aczel and Daroczy A975), Behara A990), Goodman and Nguyen A985), Martin and England A981), Grabisch et al. A995), Guiasu A977), Taneja( 1989), Klirand Yuan A995). Another commonly used word for information is uncertainty. Notice that we can interpret decrease of information as increase of information. This was the reason of saying that a measure of the amount of uncertainty is the same as a measure of the amount of information. And still a third expression occurs in the connection with information and uncertainty: the name "entropy", which was originally introduced in physics by R. Clausius in 1864, but which was proposed by John von Neumann "because no one knows what entropy really is, so in a debate people using this word will always have an advantage". The most common approach of modeling uncertainty is based upon some form of probability theory. In this framework a measure of the amount of uncertainty (entropy) should use all kinds of information that is provided, which here means: dependence upon the outcomes of an experiment and the corresponding probabilities. But in the early 1980s new types of uncertainty-based information were investigated, due to different deficiencies of information like vagueness, ambiguity, non-specificity, divergence, etc., mainly in fuzzy measure theory and fuzzy set theory. So there are many different mathematical frameworks dealing with uncertainty or information. We here restrict to - the non-probabilistic concept, - the probabilistic concept, - the concept of a fuzzy measure, - the concept of a fuzzy set. Moreover, in this paper we want to focus on attempts of combining the above different concepts so that we arrive at so-called "total measures of uncertainty" or "total information measures" because we feel that there is now more need for generalized measures of information and uncertainty within systems due to randomness, evidence and fuzziness (see also Aczel and Daroczy A978), DeLuca and Termini A972), Forte A977), Klir and Mariano A987), Ramer A987), Pal and Bezdek A994)). The idea is to define an information measure by a set of intuitively, natural properties. These properties can rather often be represented by a system of functional equations and functional inequalities the solutions of which finally lead to characterization theorems for total uncertainties. This axiomatic procedure is probably a most effective and a most useful one for practical applications (see also Aczel and Daroczy A975) and Ebanks et al. A998)). We don't claim completeness in any respect, in contrary, we restrict ourselves to discrete information measures, and also here, only a choice of the (in our opinion) relevant facts and the bibliography is presented.
1526 W. Sander 1. Probabilistic and non-probabilistic information measures In Sections 2-6 we want to present results in a "mixed" theory of information - inset theory for short (the name was chosen because the idea was born at a meeting at the Ecole Normale Superieure de L'Enseignement Technique - ENSET - near Paris, but inset can also be understood as "in set"), where the measures of information may depend on both the probabilities and the events. Thus the probabilistic and non-probabilistic concepts are here combined. The first pioneering result in this direction is due to Aczel and Daroczy A978) and was the basis for many researchers for a long time. It turns out that nearly all problems in this field are solved, and we will present the complete solutions. To do this we start with some preliminaries like notations and important definitions. But we also repeat here briefly results on well-known probabilistic and non-probabilistic information measures. The events are considered as elements of a ring В of subsets (containing, with any two sets, their union and difference, and thus also their intersection and the empty set 0) of some given set (universe) ?. Note that В is an algebra if for each ? e В there exist ?\? e B. Fixing к to be an arbitrary, but fixed positive integer and putting J = ]0, \[k, R0 = [0,oo[. R+ = ]0.oo[, A.1) we denote the set of all finite (и-агу) disjoint classes of nonempty events (which form a partition of ?, in case of an algebra B) and the set of A:-tuples of discrete, complete probability distributions - without zero probabilities - of length ? (и = 2, 3,...) by T>„ = {?=(?,,...,?„): ?,ей\@), ?,П ?,= 0, ? ? j, ij=\,...,n) A.2) (огЯ„= ? = (?,, ...,E„)eV„: \J ?,¦. = ? 1 A.3) and ? " 1 ??=\? = {?\,...,?„): ??, = 1 p\,...,p„eJ , A.4) I i=l J respectively. If in A.2), A.3) and in A.4) empty sets and zero probabilities are allowed then we denote these sets by V„, Я„, Г„, A.5) respectively.
Measures of information 1527 Here 1 represents the /<:-vector A. 1 1), and all operations on vectors are to be done componentwise. We remark that ? e ?„ is а (к х /<:)-matrix where the P,=(P\,,---,Pk,)TeJ, 1</?? A-6) (? denotes the transpose) are column vectors and the Pj = {Pj\,...,pjn), l^J^k. A.7) are row vectors. We agree on writing ? e ?„ as P = (P\,P2,...,Pn) = (P\,Pi Pk). A-8) although /> = (/>,,..., Pk)T. Dehnition 1.1. A sequence (/„) of functions /„:2)„xf„^l or I„ :?„ ? ??-> R. ? = 2,3 A.9) is called a (^-dimensional) inset information measure (or IIM, for short) on the open domain. If in A.9) T>„ is replaced by V„ then (/„) is called an IIM on the closed domain. If к = 1 then it is convenient to consider (/„) as a measure of uncertainty, for instance, in the outcome of an experiment (or message or market situation, etc.) where the possible events E\,..., E„ have probabilities p\ p„, respectively. Let us first give some examples of important information measures in the probabilistic case where we have no dependence upon the events: Hartley A928) essentially introduced the following entropy HZ(p\,...,Pn) = \og\{i:Pi>0. \^i^n}\ A.10) which depends only upon the number of nonzeros in ? e ?,,. The other principal entropy is probably the most famous entropy (it is the entropy for the theory of communications), due to Shannon A948): /7 H„(P) = -Yipi\ogpi. A.11) Further important entropies are the entropies of degree ?, ? eM, я-(Р)=(B^-1Ш=1Р?-1). «? U (U2) \Н„(Р), а=\
1528 W. Sander (see Daroczy A970)), and the entropies of degree (?, ?), ?. ? e R (see Behara A990) and Sharma and Taneja A975)) /,e./J=|B'--2'-/')-|E?=l(pr-pf). ?? ?, A13) Note that lim H%(P) = H„(P), a^ I Н^Р) = Щ and Н^-Р) = НР so that #,, is a two-parametric family generalizing the Shannon entropy and also the entropy of degree a. Examples of so-called deviations (k = 2) and information improvements (k = 3) are the inaccuracy of Kerridge A961) /1 Kn(P\, Pi) = ~??? log p2i A.14) 1 = 1 and the information improvement of Theil A967) // Ln(P\,Pi,P3) = yiP4l°S—· 0·15) ?? p* respectively. Here K„ is given by observing a true probability distribution P\ as Pi, whereas the interpretation of L„ is that a true probability distribution P\ will be predicted by Pi, and then this prediction will be revised by ??,. Note also that K„(P,P) = H„(P). Continuing this last procedure it seems natural to consider information measures depending upon к probability distributions since situations like repeated revision of data collections occur rather often in weather forecast, economics and experiments. If, for example, zero probabilities are allowed in A.15) then we have to require that p\i =0 whenever pit = 0 or рц = О, and also the awkward conventions 01og0 = 0, 01og-=0. A.16) To avoid these restrictions in "closed domains", information measures on the open domain are investigated (and there are well-known techniques to extend results on an open domain to one on an "half-closed" or "closed" domain, if necessary; see Aczel and Ng A983)).
Measures of information 1529 Let us remark that the proofs of characterization theorems of information measures become much more difficult on an open domain than on the closed domain. Now we briefly comment non-probabilistic information measures. For a survey see, for example, the books of Guiasu A977) and Grabisch et al. A995). The two main approaches are due to Ingardenand Urbanik( 1962) and to Kampede Feriet and Forte A963, 1967). Without going into details the first two authors have shown that information and probability are not separated things: starting with information content I{E\) then in a canonical way p(E\) occurs, and reversely. The connection is given by /(?,) =-logP(?,). ?(?|) = «?"?". A.17) In Kampe de Feriet and Forte A963, 1967) the principle of localization is used defining the information measure / on a measure space (?.?) supplied by the events (instead of defining the information of the whole experiment). Here I :B—> [0. oo]. and if E\, ?S are disjoint elements of В then / satisfies I(E\ U E2)= F(I(E\ ),/(F2)) O·18) where F: [0, oo]2 -> [0, oo] is a so-called composition law, that is F is a ?-norm on the interval [0, oo] (F is associative, commutative, monotonic increasing in each place, and oo is neutral element of F). We get immediately (see, for example. Schweizer and Sklar A983)): F is a composition law iff for any continuous and strictly increasing function ?: [0.1 ] —» [0, oo] the function Т(х,у) = ф-*[Р(ф(х),ф(у)]. л, ? e [0.1]. A.19) is a ?-norm (on [0, 1]). Thus we can generate composition laws from f-norms using Е(х,у) = ф[Т(ф-](х),ф-](у))]. л-.уе[0,оо]. A.20) Let us consider two examples (where с > 0 is a fixed constant). A) If T(x,y) = max(.v +y - 1,0), ф(х) = -flogd -л). 0 '(-?) = 1 - e~cx then A.20) implies that F(jc,y) = max^--log(e-<v+e-<v).oY *.ve[0.oo]. A.21) is the Wiener-Shannon law with parameter с B) If T(x, y) = xy, ф(х) = - 1/log.v, 0 (.v) = e"l/v then we arrive at the hyperbolic law F(x,}>)=—^, *.ve[0.oo]. A-22) x+y To connect the local point of view (information of an event) with the global point of view (information of the whole experiment) we may assume that (?.?. ?) is a probability
1530 W. Sander space and we consider a partition (? \ ,...,?„) of ?. Then the amount of information that is provided when one among these events occurs is one of the numbers /(?,), /(F2), ..., /(?„) A-23) while P(?i), P(E2), .... P(E„) A-24) are the corresponding probabilities. Putting p, = P(F,) the expected information uE-p)=!'{pfEi) pfk) pS.))=P'/(?') (L25) is the amount of information one expects to receive from the experiment to find out which of the events E\,..., E„ will occur. Finally, let us introduce some notations which will be needed in the following sections. Let U denote the setoff unit vectors и,- = @, 0, l,0,...,0)eR', 1 ^ /' ^ k. If X is a set then V(X) = [А: А с X) is the power set of X. If Pe r„,then D„ = \(p\,...,p„): pte J, ^,еЛ. A-26) We define for PeThQe Г,„, F e ?>,, F e Vm (F e П,, F e ?>„,) P* Q = (p\q\, ¦ ¦ ·. P\qm, ¦ ¦ ¦, p,q\ ¦ ¦ ¦ ¦, p,qm) e П,„ A.27) and ? ? F = (F, ? F,,..., ?\ ? F,„,..., ?, ? F,,..., ?, ? F,„) e V,m A.28) (F ? F e Л,,„, respectively). Moreover F ? F ^ 0 means that F, ? Ej? 0, 1 ^ /' ^ /, 1 ? j ^ w. We also agree on writing pa = (Pi,...,Pk)iai ai):=pT---Ptk (L29) for all p, a e R*. Note that p"eR whereas p" e R* for и е N. For instance, p2 = (P\,.--,Pk)-(P\,--.,Pk)=(pl--;Pl)- (L3°) A map g: В \ {0} -> R is additive if S(F, UF2) = g(F|)+g(F2), (F|,F2)eP2· d-31)
Measures of information 1531 Moreover mappings aJ,m:J -> R are called additive, logarithmic and multiplicative if a(x+y) = a(x) + a(y). (i.v)eO:. A.32) l(x-y) = !(x)+l(y), x.yeJ. A.33) m(x ¦ y) = m(x) ¦ m(y), x.yeJ, A-34) respectively. The general form of these additive, logarithmic and multiplicative functions are given by к а(дс) = а(дс|,...,/*) = J]fl/(jr/), x e J. A-35) i=l * Kx) = l(x\,...,lk) = Yili(xi). xeJ. A.36) i=l к m(x) = m(x],...,lk) = Y[mi(xi). xeJ. A-37) i=l where each a,, /;, w, is additive, logarithmic and multiplicative, respectively on {{p,q): p,q, ? +q e ]0, l[},on ]0, oo[2, or on ]0. 1[2. So the corresponding regular (say measurable) functions are given by a(x) = cQx, l(x) = cQlogx. m{x) = xl. ceJ. A.38) Here с ? jc is the usual inner product in Rk and log.? is defined componentwise (where log=log2): cQx = (c\,...,ck)(x\ xk) = c\x\ ? hckxk· A-39) logjr = (logor logjrjt). A-40) Finally we denote by к к pOCOq = YiYiPiCijqj A.41) 1 = 1 1 = 1 the multiplication of vectors p, q with a real-valued (к х к)-тгХпх С = (с,/). 2. Branching inset information measures Definition 2.1. An inset information measure (/„)
1532 W. Sander A) has the branching property if there is a function G : Vi_ ? Di -> R such that Pi ·· Pn I \P\+P2 Pi ·· Pn I \P\ for all (?, P) e P„ ? ?„ (и = 3,4,...), B) is /-symmetric for some / e N, / > 3 if ? ? ··· E/\ _ л (??{\) ··· ??(/) ~?(?) -?1" , (?, Р)еР/хГ/, B.2) Pi ··· ?// V^iD '" ???/)/ for all permutations ? on {1, 2 /}. In the following we agree on the notation In(E,P) = I„(E;P) = I„(E] '" E" V (?,/>) e P„ ? ?„. B.3) \?? ·· P«/ The branching property is one of the most fundamental axioms for any information measure having the following interpretation. In an experiment A with possible outcomes E\,..., E„ and corresponding probabilities p\...., p„ the left-hand side of B.1) is the uncertainty of A. The right-hand side contains as first summand the uncertainty of the experimentB with outcomes ? ? UEi, ?3 E„ and corresponding probabilities ? ? +P2, ??,,..., p„. So we are uncertain whether E\ or Ei in the outcome will occur. To overcome the difference of the uncertainties of A and В a third experiment С will be performed to decide whether E\ or Ei will occur. This is taken into account by the second summand ? on the right-hand side of B.1): ? depends only upon E\, E->_ and p\, P2 which is weak assumption. The second property which will be needed for our first main characterization theorem is the symmetry of an IIM: Each //, / ^ 2, is invariant under a simultaneous change in the order of the events and the corresponding probabilities. From a mathematical point of view only a weaker symmetry condition is needed. It is interesting that it is sufficient to use only these two properties, the branching property and the 3- and 4-symmetry for a first important characterization theorem of an IIM (/„). Before we present this result we look more closely at the functional equation B.1) with ? =4 and see that B.1) doesn't require the value of ? on all of ?? ? A>, if Sis an algebra with atoms. It requires values of ?(?\, ?2; p\, pi) only for those (??, ?2) to which ?3, ?4 exist with (?|. ?2, ?3, ?4) e P; or there exist E\ \. ?12 satisfying ?i=?iiU?|2, (?ц,?|2)еГЬ and (?,,, ?,2, ?3) e P.v
Measures of information 1533 Denoting this domain of ? by V ? Di it can be seen that if В either is not an algebra or is a non-atomic algebra then V may be replaced by pairs (??, ??) to which there exists an ?3 such that (?|, ?2. ?3) ? ?>з· Moreover, if В is not an algebra then D = ZV Due to technical difficulties of this type of domains the occurring functions in the following Theorem 2.2 are not defined on V\ ? J, but on appropriate subdomains which will not be given explicitly. They will be indicated by some "primes" on D|. and the interested reader is referred to the original paper of Ebanks A990), where also some refinements of the following result are presented. THEOREM 2.2. A) Let В be a ring of sets. An IIM (/„) on V„ ? ?„ is 3- and Asymmetric and has the branching property, if and only if there exist maps b:T>\ ? J —> ? and с: V'[ -> ? such that /„(?, P) = ][>(?,. Pi) +c((j ?,) B.4) for all (?, ?) e V„ ? ?„, ? ??,? 2. B) If in A) В is an algebra and I„ is defined on ?? ? ?„ then I„ has the form B.4) with с = 0, that is ? I„(E,P) = Yjb(El.Pl) B.5) i = l has the so-called sum-form property with generating function b. Remember that nearly all known information measures have this sum-form property. Thus it is surprising that this result holds without any regularity condition. The proof is rather long, and is based on many earlier work. We mention here the pioneering papers of Jessen et al. A968) and of Ng A974). For a survey in the purely probabilistic case we refer to the book of Ebanks et al. A998). Let us give a very brief outline of the proof to indicate the occurring difficulties and the dominant role of functional equations in this area. Assuming that (/„) satisfies B.1). the 4-symmetry of (/„) leads to the so-called fundamental equation of IIM with the branching property ?,U?2 ?? /E\ E2 Р + Я r ) \p q ?,U?, ?? (?? ?? B6) P + r q ) \p r ) for all (p,q,r) e Dj and all {?\.??,?^) for which there exists an E\ with (E\,Ei. ?3, ?4) e T>4.
1534 W. Sander Moreover, the 3-symmetry leads to '?|U?i ЕЛ Л(Е\ Е2 b ¦ . . - ¦ Р+Я r ) \p q = /?,U?, E2\ (Et ЕЛ 2 So the key is to find the solution ? of B.6) (see Ebanks A989)) because then, substituting the form of ? into B.7) the form of h can be derived, and finally the form of (/„) can be derived inductively via B.1). (We remark that keeping (?|, ??, ? у,) е Рд temporarily fixed in B.6) we arrive, defining i/r,: D2 -> К in a canonical way, B.6) leads to the functional equation ?\ (? + q,r) + ?2(?, q) = ЫР, О + ЫР + г, q) B.8) for all {p, q, r) e D^, which contains simultaneously 4 unknown functions. Having determined these 4 functions and freeing (?|. Ei, ?-0 again, the form of ? can be found.) 3. Recursivity and generalized additivity In Section 2 it turned out that all 3- and 4-symmetric IIM (/„) on T>„ ? ?„ with the branching property have the form I„(E,P) = Ylb(Ei,pi)+cl\jEiY О 2. C.1) ?=? \/=l / for some functions b:V\ ? J -> R and c:V'[ -> R (and с = 0 if (/„) is defined on T>„ x П„). The problem is to determine b explicitly since up to this point b is an arbitrary map. Now there are two main streams to get the explicit form of b: A) One possibility is to require a special form of ? in B.1) This will lead to a natural condition, the so-called Recursivity of multiplicative type. C.2) B) Another possibility is to impose additional conditions, and we consider here Generalized additivity conditions. C.3) Let us explain the ideas behind C.2) and C.3). Remembering the interpretation of the branching property (after Definition 2.1). The difference lJE> ·· E")-lJEiUE2 E* ·· E") C.4) \P\ ·· Pn I \P\+P2 Pi ·· Pn )
Measures of information 1535 in B.1) will now be recaptured by an experiment С having two outcomes E\. ?2 with conditional probabilities q\ = ¦ . 92 = ; C-5) P\+P2 P\+P2 (so that <7i + qi — 1). On the other hand the experiment С must only be performed in case of E\ U Ei occurs, the probability of which is p\ + /r>. Thus we assume that ? is the uncertainty of the experiment С multiplied with a function of p\ + pi'- ?' E2) = M(p]+p2).I2(E> E2). C.6) P\ Pi) У F' \q\ qi) Moreover there is a convincing argument to assume that ? in C.6) is not an arbitrary, but a multiplicative function (for details see the paper of Aczel and Ng A983)). This leads to Dehnition 3.1. A) An IIM (/„) is called recursive of multiplicative type ? if (/„) satisfies B.1) with the function ? given in C.6) with a multiplicative map M. B) If in addition M(x) = xa, ? e J then /„ is called recursive of degree a, a e Rk. For example, the Shannon entropy H„ is recursive of degree 1, - the entropy of degree ? {? 1) is recursive of degree a, - the inaccuracy of Kerridge is recursive of degree A.0), and - the information improvement of Theil is recursive of degree A.0,0). In the next section we will present all 3-symmetric IIM (/„) which are recursive of multiplicative type ? (without and with regularity conditions). Let us remember the idea of additivity. We now assume that we have two independent experiments A and В which are given by (E, P) eT>i ? ? and (F. Q) e T>„, ? ?,„, respectively. Here independence means that we can perform the combined experiment (EnF,P*Q)eV/m ? r,m. Now, additivity says that the information obtained from experiment С is the sum of the amount of information received from A and B: //„,(?¦ П F, ? * Q) = I,(E, P) + I,„(F, Q). C.7) ? e ?/, Q e Гт, ? П F ? 0. More generally we assume that the information obtained from experiment С is a weighted sum of the amounts of information obtained from experiments A and В for some weight functions w\ and u':: //,„(?¦ П F, ? * Q) = w\(P) ¦ I,AF. Q) + w2{Q) ¦ //(?. P), C.8) ? e ?/, Q e ?,?, ? П F фЪ. Since the operations П and * are associative it can be shown (only assuming that the range {/„ (P„ ? ?„): ? = 2, 3 } has infinite cardinality) that w\
1536 W. Sander and wi have the sum-form property with multiplicative generating functions: / m w](P) = YjM\(Pi), w2(Q) = J2M^Pj)· P*rhQerm. C.9) i=l j=\ Here M\,Mi:J -> Ш are multiplicative functions. Note that C.8) and C.9) go over into C.7) if M\ and Mi are projections, that is M(pi) = M(pu, · · ·, pi,) = Psi for some s, M(qj) = M(q\j,..., qmj) = q,j for some t. Dehnition 3.2. A) For a fixed pair (l,m), I ^ 2, m ^ 2, we say that an IIM (/„) is weighted (/, w)-additive of type (?\, ??) if it satisfies / m //„(?¦ П F, ? * 0 = ]? ??? (?) · /„(F. ?) + J] ??2(?>) · ??(?. ?) C.10) ?=? 7=1 for all ? e ??, Qe ?,„, F e D,, F e D,„, F П F ^ 0, where ? ?, ??2 : J -> ? are multiplicative. B) If in C.10) M\(p) = pa, M2(q) = qfi for some a, /3 e R* (p, q e У), C.11) then (/„) is called weighted (/, w)-additive of degree (a, /3). In this survey we will present all regular (here measurable) solutions of C.10) in the case C.11) if (/„) satisfies the sum-form property B.5). We remark that we require C.10) only for a fixed pair (/, w) and not for all / > 2, m > 2. Moreover, it turns out that the case / = 2, m = 2 is the most difficult and surprising one (in both cases) since unexpected solutions occur. 4. Recursive measures of multiplicative type THEOREM 4.1. Let В be a ring of sets ?/?. An IIM (/„) is 3-symmetric and recursive of multiplicative type ? (? ? 0) if and only if it is given by /,,(F, ?) = ? f(E,)M(Pi) - /I \J E,¦ ) +?(?, ?) D.1) i=l \?=| /
Measures of information 1537 for (?, P) e V„ ? ?,„ ? = 2, 3,..., where E'=|L(?„/>). ifM=\, ?(?,?)= ?"=]1(?,)?(??), if ? is additive, D.2) 0, otherwise. Here l:J->Ris logarithmic, L:V\ ? J -> 1 is additive in the first and multiplicative in the second component {on appropriate domains), and f.g:V\ -> R are arbitrary functions. If В is an algebra with atoms then for ? = 2 the domain ofI2 is given by V ? ?2 (instead ofT>2 ? ??) with V=V2\\(E\,E2): E\ is an atom and E\ U E2 = ?]. D.3) Again, let us indicate the proof. Substituting the special form C.6) of ? into B.7) - and using that p+q+r=\-we arrive at the so-called fundamental equation of recursive inset information of multiplicative type: ф(Е]иЕ2,Еур) + М(\-р)ф[Е].Е2: Я \-p = ф{Е\ UE?,E2;p) + M(\ -р)ф(е\.Еу.-^-Л D.4) for (?|, E2; p) eT> ? J, where ф(ЕиЕг,р):=Ь[,Е] ЕЛ. (Ei.E2\p)eT>xJ. D.5) V1"^ ? ) A key step in solving D.4) is to solve the functional equation K{E\ U E2, E3) + K(E\. E2)= K(Et U ?3. E2) + K{EU ?3) D.6) for all (?|, ?2, Ey) e V3, if К : V -> R. It is interesting that only by the additional form C.6) of ? the function b in B.5) (now defined on T>\ ? J instead of V\ ? J) can be given explicitly (note that if (/„) is 3-symmetric then (/„) is 4-symmetric so that (/„) indeed has the form C.5)). Let us now give some examples of Theorem 4.1. A) If ? ? 1 and ? is not additive then the case / = constant = с gives the probabilistic information measures /„(/>) = с(]ГМ(р,)-1). D.7)
1538 W.Sander If we assume recursivity of type ? (?) = pa (with some a priori given a e Шк) then we arrive without any regularity conditions at the regular solutions 1п(Р) = с1?щрЛ-1\=с(?ра11р%---р%-\ D-8) Some well-known special cases of D.8) are: - (for k = 1) the entropy of degree ?, ? el: 1п(Р) = сГ?рГ-\ D·9) - (for к = 2) the inaccuracy of degree a, a e E: 1»(П = сГ?р1р*2Г-Ч- D1°) - (for к = 3) the divergence of degree a, a e E: /»(/,) = с(Ер|^2Г1р'Г-Ч· DЛ1> B) If ? is additive (and nonzero) then ? is a projection. In this case it is not trivial to get the regular solutions from the representation in Theorem 4.1. The reason is the following. Let us assume к = 1 in D.2). Then M(p) = ? and the Shannon entropy H„(P) is very regular having the sum form ??=\ b(pi) with the regular function b(x) = —x log*. Consider now the very "irregular" function c(x) = —x log* + d(x) where d: ? -> ? is any nontrivial derivation (that is, d satisfies d(x + y) = d(x) + d(y) and d(xy) = xd(y) + yd(x) so that we arrive at ?"=] d(pi) = d(\) = 0. But a regular (say measurable) derivation is continuous (being additive) and thus linear, which means d(x) = ? d(\) = 0 for all ? el). Introduce the function c(x) d{x) b{x) — xl{x) where l(x)= = —logjrH · ? ? Then / is logarithmic and ??=\ KPi)M(pi) can be very regular for some irregular function /. If (/„) is measurable, that is, only the function ? defined in D.5) is measurable in ? for fixed (?|, ?2). then some additional machinery is needed to prove that /„(?, ?) = ? f(Ei)M(Pi) - /I (J ?< ) + ?(? °^gPi)M(pi) D.12)
Measures of information 1539 for some constant vector A e Rk (see, for instance, Jarai and Sander A997) for a more general result and the book of Ebanks et al. A998) for historical remarks and further references). Thus we get in the probabilistic case (/ = constant) that D.12) reduces to the last sum in D.12) from which we get for example - with appropriate constants - Hn(P), Kn{P\, Pi) or Ln(P\, P2, P?) (cf. A.11), A.14) and A.15)). C) The real surprise of Theorem 4.1 comes in the case ? = 1: We get an unexpected term /7 ]Tl(?,,/>) D.13) ? = ? which disappears if either zero probabilities are allowed or there is dependence on events (due to the fact that L is logarithmic in the probabilities and additive in the events). If again (/„) is measurable then we get ? / " \ " /„(?, P) = ? /(?,)- /I (J ?, +]Ta(?,)Ologp, D.14) i = l \/ = l / i'=l for some additive set function a:T>\ -» Rk (see Ebanks A984)). Note that if a = (??,...,?*) for additive set functions aj\V\ -> R, \ ? j ? k, then (with our notations A.6) and A.7)) we have /7 ? k П к ]Tl(?,, p,) = ]??(?,) ©log?, = ]? ]??,(?,I?§?„ =: ]?/,/,(?. />,·). ?' = ? ?'=? j = \ 7=1 y' = l D.15) Finally we remark that many authors have contributed to these results, and it took a long time until the above Theorem 4.1 was proved by Ebanks et al. A988). A complete list of (nearly) all contributors can be found, for example, in Sander A987) and in the book of Ebanks etal. A998). It is interesting that we can get the recursive inset entropies also by the well-known "how to keep the expert honest" method (see the book of Aczel and Daroczy A975), Aczel A980) and Kannappan and Sahoo A989)): An expert predicts q\,..., qn as probabilities of the events E\...., E„ which in reality are p\,..., p„ (here it is assumed that E, e В where В is not an algebra). The expert agrees to be paid only after the occurrence of the event and that his payoff will be /(?,) if the z'th event happens. So his expected gain is ?''_| Pif(Pi)· 1° order to keep the expert honest the payoff function / will be chosen so that the expert's earnings will be maximum if he predicts the "true" probabilities, that is we have 77 ? ]? Pif(Ej, q,¦) < ]T pi/(?,, pi) /=l i=l
1540 W. Sander or more generally 77 77 / = l ? = ? for a fixed я ^ 3, ? > 1 and for all ? eT>„, ? e ?„. Without any assumption on / it was shown by Aczel A980) and Kannappan and Sahoo A989) that the last two inequalities lead to the inset entropies I(E Р)=\сН"(П + Я=\я№)· a = l> 'Л ' \cHX(P) + r;=iP7f(E0 + b. a>\, where b, с are constants. 5. Regular weighted (/, m)-additive measures of degree (?, ?) Dehnition 5.1. An IIM (/„) on ?„ ? ?„ has the measurable sum-form property if there exists a map b: T>\ ? J e ? satisfying B.5) for ? = 2. 3.... such that ? -> b{\, p) is measurable on J for each E\ &T>\. We remark that rather often the sum-form property will be considered as an "artificial" property but we have pointed out that this property is a consequence of the very natural (and weak) conditions: Branching property and weak symmetry. Let us now fix /, m e N, / ^ 2, m ^ 2 and ?, ? e Шк. In this section we present all weighted (/, w)-additive IIM (/„) of degree (?, ?) which have the measurable sum-form property. Substituting B.5) and C.11) into C.10) we arrive at the fundamental equation of weighted (/, w)-additive inset information of degree (?, ?): I m ???(?, П Fj^iq^-q^biEi. ?,)) - ?? (b(Fj¦, qj))) = 0 E.1) / = l y = l for all ? en,,F e ?,„, ? ? F ? 0 and for all ? e ?, Q e ?,„. The problem is to solve this functional equation for fixed / ^ 2, m ^ 2: If b is known then we get /„ from B.5). The main result here is a result of Gross A993) and Gross and Sander A991) which is an extension of the first results in this area by Ebanks A986, 1994) who investigated the case (/, m) = B, 3) and ?, ? e U. THEOREM 5.2. Let В be a ring of sets, let I. m e N, / ^ 2, m ^ 2 and let ?, ? e Rk be arbitrary, but fixed.
Measures of information 1541 An IIM (/„) on ?„ ? ?„ has the measurable sum-form property and is weighted (I, m)- additive of degree (?, ?) if and only if A) there is a purely probabilistic information measure (J„) with J„ : ?„ —* ?, ? — 2,3,..., which has also the measurable sum-form property (with a purely probabilistic generating function) and which is weighted (I. m)-additive of degree (?, ?), and B) if in the case ?, ? e U there is an additive set function a:V\ -+Rk such that IAE,P)=UP) + \T!Ua{E')OXOgP'· ?-?"?· <5·2> 10. otherwise, if В is a non-atomic algebra and I„ (?, P) = J„(P) if В is not an algebra. E.3) Ebanks A994) remarked that if В is an algebra with atoms the same methods as in the proof of Theorem 5.2 can be applied but this case was not investigated because of its complexity and the lack of applications. The case E.3), where В is not an algebra, is uninteresting from the inset point of view: it reduces to the purely probabilistic case, that is /„(?. P) = J„(P) is given by ? J„(P) = Y^f(Pi) for some function/: J -» ?. E.4) ? = ? where ? e ??, and where / satisfies / 111 ? YXfiPW) - ??/(Л") - P?f(qj)) = 0· E-5) i = l 7=1 If ? is a non-atomic algebra we have again the purely probabilistic case with the exception of ?, ? e U where surprisingly the same term occurs like in Section 4 (see the last sum in D.14)). Since the solutions of E.5) are known in nearly all cases (for further details we refer to the survey article of Ebanks et al. A997)) we will present the most interesting solutions. Again, in the case / = m = 2 the behavior of the solutions is somewhat surprising since new unexpected solutions arise, namely polynomials up to degree 3 if ?. ? in U and polynomials up to degree 5 if a = ? (in the case к = 1). If ?, ? e U we get from E.2) /„(?, P) = J2Dd ? Pi ~9dO PI +ЫО Pi) + 0m -l-m+ n)b i = l II /I + ^2piOCO\ogPi+^2a(Ei)Q\ogpi. E.6) i = l 1 = 1
1542 W. Scolder Here a : V\ -> R* is an additive set function, b eR, d = (d\,... ,dk) eRk and С = (c,y) is a (& ? A:)-matrix. Moreover, a = 0 if/w ^ 6. This shows that the first sum in E.6) occurs only in the exceptional case / = m = 2. Note that /„ can be rewritten as (putting d\ = -jdj, 1 < /' ^ k) к к к I„(?, P) = 2J2d',H*{P,) - 3 ]????^??) + - ? ?»(?,) ? = ? ?' = | ? = ? к к к + ?????(?.) + ? ? cuK„(Pi,Pj) ? + {lm-l-m + \)b + Y^a(E,H logp/. E.7) Let us assume for the moment that the set function a = 0. Then all measures J„ satisfying E.4), E.5) and the usual normalization condition /2G, 4) = 1 for all / ^ 3, w ^ 3 are given by ¦/„(/>) = E"=iP;OCOlogp,, 1OC0U-I, ?,???, TH=\^(bO\ogPi), bO\ = -2a~], a = peRk\U, .B*-°-2*-?)-?(?!!^??-?!=???)' ???,???? or ?^?. Here fo is a A ? A:)-vector and С = (c,y) is a (? ? A:)-matrix. In this last formula for Jn the first sum represents a natural к -dimensional analogue of the Shannon entropy, whereas the last two sums are a natural ^-dimensional analogue of the entropies of degree (?, ?) (cf. A.13)). Finally we remark that it is an interesting open problem to determine the general solution of E.5) if ?, ? e U (in the case a <? U or ? <? U the general solution of E.5) is known!!). As an indication of the occurring difficulties we consider in the one-dimensional case the map /: @, 1) -> R defined by D2(x) /00 = —^, *e@.1). E.8) ? where D is a nonzero real derivation. Then a calculation shows that / satisfies E.5) with a = ? = 1 for all fixed pairs (/, m) of integers, / ^ 2, m ^ 2. Thus / is not measurable (even nonnegative), but it is not of the form E.6) (with a = 0). On the other hand it was pointed out in the book of Ebanks, Sahoo and Sander that all remaining problems in the characterization of functions / satisfying E.5) (and more generalized functional equations) if the following one functional equation /(W)+ /(/>(!-?))+/((l-/>)?) + /((l-/>)U-?))=0, P,q e J, E.9)
Measures of information 1543 can be solved. Here f(p) = d(p)+aOp + b E.10) where a e Шк, b e R and d: J -> R is a non-measurable solution of E.9). Summarizing the results in Sections 4 and 5 we can say that from the two results in Theorems 4.1 and 5.2 we can derive nearly all relevant and well-known characterizations for (inset) information measures. 6. Subadditive information measures for random vectors Let us briefly remember the concept of subadditivity of information measures using an appropriate notation. Let ?, ? be two real-valued discrete random variables (d.r.v., for short) on a probability space (?, ?, P) with values in two finite sets of real numbers i* ·= ?(?) = {?],...,?„] and ??:=?(?) = {}?,...,?„,}, F.1) respectively. Let (??,?>(??), ??) and (??,?(??), Qy) be the probability spaces induced by X and Y, so that pi = P(X=xi) = Px(xi), 1^ /<;„, q]=Q{Y = yj)=QY(yj), \<:j^m. F.2) Definition 6.1. The amount of uncertainty induced by the random vector (X, Y) with range Аху.Ву = \(х\,у\),...,(х„,ут)\, \AX ? BY\=nm F.3) and the joint probability distribution Pxy = {Рц = P(X =x,,Y = }'j): l^i^n, K/^m) F.4) is defined by , ?? ^R ? ч_/ {(х\,У\) ·· (хп,У„,)\ /6сч 1„??(?? ? ??, ???) = 1„т I „ )· УЬ--1) \ Р\\ ¦¦¦ Рпт ) We will denote Inm also as the information measure of (X, Y). Definition 6.2. If Inm is the information measure of (x, y) Then /„,„ is (n, w)-subaddi- tive if Inm(Ax ? ??, ???) < I„(AX, Px) + Iln(BY, Qy) F.6)
1544 W.Sander or , ((х\,У\) V Pw \P\ where m ? ?? = ??4' 1? = ??'?' l^i^n,l^j^m. F.8) 7=1 ?=? Now the property of subadditivity is self-explanatory. Aczel et al. A974) gave a simultaneous characterization of the Shannon entropy (H„) and the Hartley entropy (#,*) (see A.10)). More generally, it could be proven that every probabilistic one-dimensional information measure J„ : ?„ -> Ш which is A) /-symmetric for all / e N, B) (/, 2)-subadditive for all / e N, / > 2, and C) (/, w)-additive for all /, m e N, / ^ 2, w > 2 has the form /„(/>) = A ·#„(/>)+0(л), РеГ„, и = 2,3,..., F.9) where ? ^ 0 is a constant and ? is an arbitrary additive number theoretical function: ф(пт)=ф(п)+ф(т), п^2,т^2. F.10) This result is interesting because it was difficult to derive an equality from the inequality (in the definition of subadditivity). In the paper of Forte and Gupta A985) an inset version of the above result is shown, but the main steps of the proof can already be found in another paper of Forte A977). For simplicity (and to avoid long and heavy notations) we present their result in the case of a 2-random vector instead of а к -random vector. The following properties are assumed: (Al) Subadditivity. /„„, is subadditive for all ? ^ 2, m ^ 2. (A2) Additivity. If X and ? are independent then the uncertainty about (?? ? ??, ???) should be maximal (cf. C.7)): I„m(Ax ? ??, Px * QY) = I„(AX, ??) + ?,,???, Qy). F.11) Note that now ??? = ?? * Qy. (A3) Weak expansibility. If pnj = 0 for all 1 ^ ;' < m or pim = 0 for all 1 ^ /' < ? then /„„, is weak expansible if , {(х\,У\) ··· (*„,?», )\ , {(х\,У\) ·¦· (Хп-\,Ут)\ \ Pi ? ¦·· Pnm ) V Pi ? ¦·· P(n-\)m ) (Xi,}'i) PU Pn) iyl (х„,Ут) Pnm ¦ ¦ ¦ Ут ¦ ¦ ¦ Чт F.7)
Measures of information 1545 or T {(Х\,У\) ··· (Xn,y„,)\ _ , f(x\,y\) ··· (хп,Ут-\) V ?? ? ··· Pnm ) V Pi ? ·¦· Аи/л-п (A4) Partial symmetry. Let (X, У) and (X', У") be two independent random vectors with the same ranges ?? = ??· and ?? = ??· where \??\ = ?, \??\ = т. Then ?????' = ??? ¦ ???' and it seems naturally to assume that for all fixed (r,s) and (t,u) , ? (¦*/-, }\, xt, y„) · · · (??. y„, .v,-, у ?) · V",M- prsP'tH ··· a„p;s ? ,/'" С*/" > У-? > *f > У» ) ··" (jfi. ,V||.^/·. V.v) ••? F 12") /?-//?- PtuPrs ¦¦¦ PrsP,,, Here the dots in F.12) indicate the unchanged part of the domain of /„:,„:. Thus only the probabilities PrsP,,, and plup'rs are exchanged. (A5) Boundedness from below. Define p, s = 1 - ? and p!u = ? for some fixed 1 ? r, t ^ ? and 1 ^s, и ^m and p,y = 0 otherwise. Then the functions ?^??,?(?? ? BY, PXY) on [0,1] F·13) are assumed to be bounded from below. Now we are ready to present the main result of Forte and Gupta A985) (in a slightly different but equivalent form). THEOREM 6.3. The measures of uncertainty (?,,??) satisfy the properties (Al)-(A5)/or all n,m e N if and only if him (A x x ??, ???) = -а ]Г ]Г pulog ?? ? = ? 7=1 + ]? F/(jti. · · ·, Xi)Pi + ? F>(-V| · · · ·' yJ)qJ i=l 7=1 + G„(.vi, ....*„) + Я,„(У|,..., y,„) F.14) for all ??? such that pij > 0, 1 ^ /' ^ //, 1 ^ j ^ m, where a is a nonnegative real constant, Ft: №.'' -> E, / = 1, 2,..., Pi, qj satisfy F.8), G„ : ?? -> К аиЛ Я,„ : By -> R. The idea of the proof is the same like in the paper of Aczel et al. A974) but is now much more technical. Let us mention that the investigations lead to a generalized version of Equation D.4) where in the purely probabilistic case the function ? will be replaced by 4 different functions (see, for example, Jarai and Sander A997)).
1546 W. Sander Let us finally present the following the following special case of Theorem 6.3. If we require a slightly extended version of expansibility and if the boundedness condition will be replaced by a continuity condition at 0 then F.14) reduces to him {? ? ? ??,???) ? m ? m = -aYJYJPij\ogplj+YjF(xl)pl+YjF(yj)qj F.15) i = l 7=1 /=l 7 = 1 for some function F: ? -> ? where the first sum is of course the Shannon entropy and the last two sums represent the expected information (cf. A.25)). 7. Information measures in a theory of evidence A more general concept than the concept of measure - the concept of a so-called fuzzy measure - takes into account that in many applications no "additivity" is present: Fuzzy measures are monotonic increasing functions m : V(X) —> [0, 1 ]. Here X is an arbitrary set. These fuzzy measures occurred already in other fields of mathematics, take for example capacities or characteristic functions in cooperative game theory. Thus a fuzzy measure assigns to each subset of a universal set a value signifying the degree of evidence or belief that a particular element belongs to the subset. The monotonicity says that, if we know with some degree of certainty that the element in question belongs to a set, then the degree of certainty that it belongs to a larger set (containing the former set) cannot be smaller. In the following we drop the monotonicity and consider a set X (which is assumed to be finite here) and a basic assignment m:V(X)^[0, 1] G.1) such that m@) = O, ]T m(A)=\ G.2) ???,,??) where f,„(X) = {AcX:m(A)>0} G.3) is the set of all focal elements. Any set function m satisfying G.1) is called a belief structure, which presents an approach to representing uncertainty and imprecision within the available evidence. Here m(A) is interpreted as the degree of evidence or belief that a specific element ? of X belongs to the set A. Together with two quantities, called belief of A (collecting all
Measures of information 1547 focal elements of subsets of A) Bel(A)=Y^m(B), AcX, G.4) BcA and plausibility of A (collecting all focal elements of sets which are not disjoint with A) Pl(A)=l-Bel(A)= ]T m(B). А С X, G.5) a basic assignment m forms the core of the Dempster-Shafer theory (Dempster A967) and Shafer A976)); A denotes the complement of Л in X). Note that m need not to be additive or monotonic. The basic assignment defines a probability measure on V(X) but (in general) not on X since the focal sets neither need to be disjoint nor form a partition of X. Whenever all focal elements are singletons then we can define a probability measure P:V(X)-> [0, l]via P(A) = ]T/w ({*}), ACX G.6) .veA (since a probability measure is uniquely determined by the values on the finite set X). Thus Bel(A) (and ?I (A)) are probability measures iff m({x}) = Bel({x}) (m({x}) = Pl({x})) and m{A) = 0 for all subsets ? of X with \A\ > 1. In the Dempster-Shafer theory also measures of uncertainties are investigated: - Uncertainty about which element A e T,„(X) contains a specific element ? e X and uncertainty about the location of ? within A. - Measures which contain both types of uncertainties will be called measures of total uncertainties. In several papers total information measures were proposed (see the book of Klir and Yuan A995) and Ramer A993)), for example, T(X,m) = - ? m(A)logi ]T m(B)———\ D(in) - ]P m(A)log(m(A) G.7) AeF,„{X) ? (in) or T(X,m) = - ? w(A)log|A|- ]T w(A)logw(A). G.8) AeF,„{X) ???,,,??) Nun) Hon)
1548 W. Sander In G.7) and G.8) the last term is the Shannon entropy applied to focal elements and is a measure of non-specifity (since H{m) is not specific about ? e A for all sets A with \A\>\). In G.7), D(m) is a measure of conflict among given focal elements satisfying ?\???\ ,„„, т(В)*——^аР1(А) G.9) BeF„,(X) and thus ]T m(A)Bel(A) ^ D(m) <: ]T m(A)Pl(A). G.10) D(m) expresses the sum of individual conflicts of focal elements m(B) (each of which is multiplied with the degree of subsethood \A П B\/\B\ of the set A in the set ?) with respect to a particular focal element m{A). Moreover, p(A) defined in G.9) is a probability measure so that T(X, m) in G.7) has the form E.2) of a (one-dimensional) additive inset entropy T(X,m) = - ]T m (A) log p(A)- ]T m(A)\ogm(A). G.11) AeF„,(X) A?F,„(X) In G.8) N(m) is a weighted mean of a Hartley-type information and was characterized by Ramer A987) and Klir and Mariano A987). But the author is aware of only one characterization result concerning total uncertainties in the theory of evidence. We present the main result of Sander A997). The main idea is to reformulate the property of subadditivity appropriately and then to combine it with ideas presented in the last section. If X is a finite set of cardinality ? we introduce X„ = {x\,...,x„}, Xk = {x\,...,Xk} and X„-k={xk+i,...,x„], G.12) where к < ?, ? > 2. Dehnition 7.1. In a belief structure on X„ the amount of total uncertainty is measured by a real-valued function T„ depending upon the ordered tuple ??„ =(A|,A2,...,A2") G·13) of the 2" subsets A,- of X„, 1 < /' ^ 2", and a basic assignment ???? = (m |, mi, ¦ ¦ ¦, mp) G.14) where w, = w(A,).
Measures of information 1549 We now identify T(X„) := P(X„) \ {0} with ~P(X„) = (F(Xk) ? {0}) U ({0} ? P(X„_*)) U (P(Xk) ? T(X„-k)). G.15) Notethat|7>(X„)|=2* - 1 +2"-k - 1 +Bk — 1)B" A — 1) = 2" — 1. Definition 7.2. A sequence (Г„) is subadditive - or (T„) satisfies (A'l)-if for all ? e N, и ^ 2 and for all 1 < к ^ ? - 1 the condition ~P(Xk) ? T>(X„-k) С ?,,?„ (??) implies ii(^„mx») < 7*(А^,/я^) + Тп-к(АХп_к,тХп^) G.16) where »??„ is a given basic assignment and mXk and т^_, are the usual marginal basic assignments, that is, the projections of mXn on Xk and X„-k, respectively (cf. F.8)) In the same manner like (Al) is replaced by (A'l), the axioms (A2)-(A5) can be modified by (A'2)-(A'5) (where in the symmetry axiom an additional assumption is included) and we arrive at the following result of Sander A997). THEOREM 7.3. In a belief structure on X„ the amount of total uncertainty satisfies the properties of (A'l) subadditivity, (A'2) additivity, (A'3) expansibility, (A'4) symmetry and (A'5) one-side boundedness if and only if there exist real constants a, b e E, a ^ 0, and a ?-additive function g:V(X„) -> К such that T„(AXn,mXn)= -a ]P m(A)\ogm(A) A?F„,(X„) + ? m(A)g(A) + b-log\Fm(X„)\. G.17) ???,„{?„) Here the P-additivity (P stands for powerset) of g means that g(AxB) = g(A) + g(B), AeP(Xk), BeP(X„-t). G-18) for all 1 ? к -? ?- 1, ? ^ 2 (identifying A with ? ? {0} and ? with {0} ? ?, cf. G.15)). Now we present some remarks and examples concerning Theorem 7.3. (a) In G.17) the last term is nothing else but the "Hartley entropy" (compare log\fm(X„)\ with H* (see A.10)). It represents the non-continuous part of the total uncertainty. In different applications we can have different interpretations for this term. For example, when determining the center of a certain pain the term b ¦ log|:F,„(X„)| can be interpreted as the "intensity of the pain" which is an additional effect besides the localization of the center of the pain. (b) We don't need a continuity assumption for deriving G.17), the above rather weak boundedness assumption is sufficient to prove the result.
1550 W. Sander If for example, T„ has values in the nonnegative reals then the boundedness assumption is satisfied. Thus in this case the result is true assuming only (A'1MA'4). (c) If in D) g{A) = log \A\ and if Tn is continuous at 0 (this means Mm ? ( A] Al A* '" Al-"\ _ ? (A\ Al- A* '" Al"\ ,,l™+ "\l-u и 0 ... 0 J' "{ 1 0 0 .·· 0 ) (? e ?)) then b = 0 and G.17) goes over into G.8). Moreover, if in addition a = 0 then we get a characterization theorem for J2A€yrm{X)m(A)log\A\ with a specified continuity assumption, which was not known before (see the paper by Ramer A987)). (d) If we assume that m{A) > 0 only on singletons then our sequence (?„) reduces to a sequence (T„) dependent only upon X„ and a positive probability distribution Pn on X„. Thus we arrive at the inset case, and in particular we obtain the result of Aczel et al. A974) (with g = 0). For further examples and results we refer to the papers of Dubois and Prade A987), Ramer and Lander A987), Ramer A993), and the book of Klir and Yuan A995), where, for example, possibility and necessity measures are discussed (when all focal elements in the sums in G.10) are nested). 8. Measures of fuzziness Let X be a finite set. A fuzzy set Л in X is a generalized subset of X which is characterized by a membership function fA which associates to each ? e X a number of the "valuation set" [0, 1], that is, /a(x) represents the grade of membership of jc in A. When ? is a set whose membership function has the valuation set {0, 1} (with /a(jc) = 1 or /л(х) = 0 according to whether ? does or does not belong to A) then fA reduces to the well-known characteristic function of a set Л с X. Since a fuzzy set is completely characterized by its membership function /д we define: Any map / from X into [0, 1 ] is called a fuzzy set. (8.1) The fuzzy sets were first introduced by Zadeh A968) and since then fuzzy set theory became an attractive new mathematical theory, see, for example, the books by Dubois and Prade A980), Klir and Yuan A995) and Grabisch et al. A995). In the following we first discuss "natural axioms" for so-called measures of fuzziness, and then we report on attempts to combine probabilistic and fuzzy uncertainties. If/,ge[0, \]x mdhe[0, if then 1 -/, fvg, f/\g e [0, 1]X and fxg e [0, \]XxY are given by A-/)(*)= 1-/(*), (/v *)(·*) = тах{/(д0,*(д0}.
Measures of information 1551 (fAg)(x) = mm{f(x).g(x)}. (fxg)(x,y)=f(x)g(y). moreover ?(?)=??(?) (8-2) xeX is the power of /, which is a natural extension of the notion of cardinality for crisp sets. Now we assign a nonnegative number to each fuzzy set / e [0, \]x which characterizes the degree of fuzziness of /. DEHNITION 8.1. If X = {л-1, ...,x„] is a finite set then every function ?: [0. l]x -> Eo := [0, oo [ satisfying M(f) = M,,(f(x\l...,Axn)). /e[0. l]x. (8.3) for some mapping M„ : [0, 1]" -> R(> is called a measure of fuzziness or a fuzzy entropy. This definition says that a fuzzy entropy can be regarded as an entropy in the sense that it measures the uncertainty about the presence or absence of a certain property over the set X. In case of a finite set X a measure of fuzziness M(f) is determined by the function values of the fuzzy set /. The first list of desiderata for measures of fuzziness was presented by DeLuca and Termini A972): (PI) Sharpness: ??(/) = 0 iff /(X) с @, 1). / e [0, 1]X. (P2) Maximality: M{f) is maximum iff /(X) = \, / e [0. \]x. (P3) Resolution: ?(/) ^ ?(/*) where /* is a "sharpened" version of /. (P4) Symmetry: M(f) = M(\ - /), / e [0, \]x. (P5) Valuation: M(f ? g) + M(f лg) = M(f) + M(g), f,ge [0, l]x. Here /* is a "sharpened" version of / means that \f* - \\~^ \f - {\- The properties (Pl)-PD) seem quite natural, (P5) states that the sum of the degrees of fuzziness doesn't change if / and g are exchanged with the two operations of the lattice ([0, 1]?, ?, л), / ? g and / л g. As examples, Deluca and Termini A972) introduced ? ж/) = -?/(*.-) log/и-) (8·4> i = l and showed that H\f):=H(f) + H(\-f) (8.5) satisfies (P1)-(P5). There is an interesting connection between #'(/) and the Shannon- type functional H(m) defined in G.7) the idea of which goes back to Forte's personal communication in the paper by Ebanks A983):
1552 W. Sander Let us interpret f(x) as the probability that ? e X possesses a certain property P, and think of the process of deciding whether x\ ,x„ do or do not possess ? as an experiment. If we assume that f(x) and f(y) are independent for ? ? у, х, у е X, then / induces a basic assignment on ^(Х) by m{A)=Y\f{x) Y\ {I-f(x)), ACX, (8.6) x€A .\eX\A and a calculation shows that H(m) = H'(f), that is (with the convention 0 log 0 = 0) — 2_. rn(A)\ogm(A) AeFm(X) = - ]T f(x)logf(x) - ]TA - /U))log(l - f{x)) .xeX .\eX = E5(/w) (8·7) xeX where S(t) = H2(t, 1 -i)=-ilogi-(l -i)log(l -0, re [0,1], (8.8) is the so-called Shannon function. Concerning characterizations for fuzzy entropies, it turns out that (P5) plays a key role because of the following result (Sander A989)): If X = {jc, ,..., x,,} and ?: [0, l]x -> E() is a fuzzy entropy then ? satisfies (P5) iff there exist generating mappings <pj: [0, 1] -> Eo A ^ /r ^ n) such that ? M(/) =]>>(/(*,)). (8-9) ? = ? If M„ in (8.3) is a symmetric function in its variables then all functions ??, ? i ^ и, coincide (see Ebanks A983)) and we get with ?, =? ?' = ? In a recent paper, Ebanks A998) replaced (P5) by (P6) Localization: There exists a continuous map ?: [0, 1 ]2 -> R such that ??(/) = ??(/') + p[f(x,). f\xi)] (8·1 D whenever / and /' differ at only one point x, e X.
Measures of information 1553 This states that the difference in the degrees of fuzziness of / and of /' depends only on their values at the single point where they differ. The condition in (8.11) is a branching-type property for "fuzzy uncertainties" (like in (P5) also "fuzziness" is "exchanged"). It is interesting that (P5') is again equivalent to the existence of a generating (continuous) function ?:[0, 1] -> ? such that M(f) has the sum-form property (8.10) (see Ebanks A998)). Starting from the representation (8.9) or (8.10) we have numerous proposals for measures of fuzziness since only the functions 0, and ? must be specified and characterized. For a survey see the book of Dubois and Prade A980) and the survey article of Pal and Bezdek A994), which contains several lists of interesting measures of fuzziness. We here present a recent result of B. Ebanks which is based on the following observation. The functional H(f), given in (8.4), satisfies H(fxg) = P(g)H(f) + P(f)H(g) (8.12) (and also (PI) and (P5), but not (P2)-(P4)), which is an weighted additivity condition where the weights correspond to the "size" of / and g. Thus he assumes more generally that the amount of fuzziness of / ? g is a function of M(f), M(g) and of the "sizes" P(f), P(g): (P7) Direct Product Composition Law: There exists a function G:M+xR+^R+ such that M(f xg) = G(P(f), P(g), M{f), M(g)). (8.13) Moreover, a natural symmetry condition is supposed: (P8) M(fxg) = M(gxf). (8.14) The property (P7) - together with (P8) - seems to be a more natural assumption than (8.12) and the two generalizations of (8.6), considered in Ebanks A983) and Sander A989). The following result is the announced (slightly modified) result of Ebanks A998) which generalizes a result of Sander A989). THEOREM 8.2. (I) A measure of fuzziness M{f) satisfies (PI) and (P6)-(P8) ifand only if ? has either the form ? M{f)=-aYjf(x,)\ogf(x,) (8.15) i = l for some a > 0, or else there is some a e @, 1) U A. oo) such that ? M{f) = cYj(f(xir-f(xl)) (8.16) 1 = 1 with some constant с (с < 0 if a > 1, с > 0 //0 < a < 1).
1554 W. Sander B) The only measure offuzziness satisfying (P1)-(P4) and (P6)-(P8) is given by ? M(f) = bYjf{xl)(\-f{xl)) (8.17) i=\ for some constant b > 0. For some further results we refer to two papers by Ebanks A983, 1998). Let us now consider an experiment with outcomes E\,..., E„ (where ?, are elements of a ring В of sets of a universal set ?) and corresponding probabilities p\,..., p„. If in addition a fuzzy set /: X = {E\ ,...,?„} -> [0, 1 ] is given then we have two kinds of uncertainty: A) The uncertainty of random nature concerns the particular event E, which will occur. The average uncertainty will be measured by Shannon's entropy H„. B) The uncertainty of "fuzzy nature" is related to the interpretation of ?, as 1 or 0. This amount of uncertainty can be measured by S(f(Xi)) (see (8.8); note that S:[0, 1] -> [0, 1] satisfies S@) = S(\) = 0, S is concave on [0, 1 ] having a unique maximum at \ with S( \) = 1). Thus the statistical average information in taking a decision A or 0) on the possible outcomes of the probabilistic experiment is II ?(?,?) = ???5(???)), PeT„. (8.18) ?'=? Therefore a total entropy Hlol can be defined as Hm(f,P)=H„(P) + L(f, P) ? ? = -^p,logp,+^AEo/)(?,). (8.19) 1=1 ?=? This seems to be an interesting total entropy for several reasons: A) If there is now fuzziness, that is /(X) С {0, 1} then Hm(f,P) = H„(P). (8.20) B) If there is no uncertainty, that is pj — 1 and p, = 0 if ?'? j then Hlol(f,P) = (Sof)(Ej). (8.21)
Measures of information 1555 C) IfP = (I ... I)then 1 Hl0l(f, ?) = Щ{Р) + - Y(So /)(?,). (8.22) П 1 = 1 D) If we don't know the probability distribution ? and if ? = 1 and E\ = ? then we get H101(f,P)=(Sof)(Q). (8.23) E) There is a connection with inset entropies which indicates a characterization of #I01. The difference between the two uncertainties in (8.19) and (8.23) is the amount Pi ··· Pn /(?,) ··· /(?„) = -I> log ? +? A (^ о /)(?,) - (So /)( [J ?,- ) <8·24) of information gained from the knowledge of the probability distribution, and it has exactly the form of a recursive inset entropy. Because of Theorem 4.1 it seems naturally to introduce a one-parametric family of total entropies (Hl) where H^{ = ??? if a = 1 and + ? № о /)(?,)- (So/) [J ?,) (8.25) i=l \?=? if ? ? 1. It is an open problem to chatacterize H"n, a e R. For further total entropies combining probabilistic and fuzzy uncertainties and applications of these entropies we refer to Pal and Bezdek A994). 9. Weighted entropies Let us consider the following example which comes from an unpublished manuscript of B. Forte. A real estate agent receives (commission and bonus) $ te, if he sells a house at the date ?,. A measure of the information content of the statement
1556 W. Sander "?, = the house was sold on the date T, and the probability for that to occur is pf should take into account the gain w, and the probability p,. A good candidate is I(E()=-cwilogpi, KD«. (9.1) Thus the measure of expected information (see A-25)) leads to the so-called weighted entropy ? I„{w\,...,w„;p\,..., p„) = - ]P WiPi log p,. (9.2) / = l Here w; e [0, oo[ denotes the subjective importance of different events of a ring В of sets, that is w, = w{E;) for some function w:B -> [0,oo[. This weighted entropy was introduced and characterized by Belis and Guiasu A968). The idea was that the occurrence of an event ?, is connected with two types of uncertainty. - the quantitative type related to the corresponding probability p, of an event ?,, and - the qualitative type relative to the utility w,- = »(?,) which the experimenter has in mind. Special cases of (9.1) are for example (if ? e ?„): ? In(w,...,w;p\,...,p,,)= -w]P PiTog pi, (9.3) i = l /,,(^,...,^) = -X>logp,, (9.4) \P\ PnJ f~^ /-i-r?L----rA-) = E"?· (95) V log ?, logp,,/ f^ In the following we denote и-tuples (w\,..., w„) of RJj or R'| by V and define: A weighted entropy (/„) is any sequence of functions where /„:Rgxr„-»-Ro. (9.6) We now present some axioms which are natural extensions of axioms characterizing the entropies of degree a, so that in particular we get the result of Belis and Guiasu A968): (Bl) Continuity. /"(»], w2; p, 1 — p) is continuous on [0, 1] for all им, хиг е Ко· (B2) Symmetry. We have for all permutations on {1, ..., n] /"(»!,..., w„; p\, ...,pn)= I" (w„ \,..., ???„; ??\, ...,???).
Measures of information 1557 (B3) Hw'=(pa]w[ +p>2)/(pi +p2)then " \P\ рг ръ ?,, _ ,a ( w' W2 Ws wA,„a,a( W] W2 \ /q 7ч _/"U+W P2 P3 ?») ??-\7$?- PW+P*) for all WelJ.Pe T„ with p\ + p2 > 0. (B4) Uniformity. If ? = (^, ..., 1) then /,?(? " "i"^L,)W'+"' + W" (9-8) for some function L" : N \ {0} -> R+ and for all WeKJ, that is /" is proportional to the mean value of the weights. (B5) Normality anddecisivity. LaB) = 1 and I%(w\, w2; 1,0) = 0. In (B2) the definition of w is a natural one since, e.g., for a = 1 this means that W(E,UE2)=P|W(?|) + P2W(?2) (9.9) P\ + Pi is satisfied by utilities in decision theory, or in game theory. The following result for a = 1 is due to Belis and Guiasu A968) whereas the case ??1+,?^1, can be found in Kriimpelmann A991) (I was not aware of another reference): THEOREM 9.1. LetaeR+.A weighted entropy (/,?) with /,?: R? ? T„ satisfies (Bl)- (B5) iff П Iaa(W,P) = -2x-aYdwip^\0gPi. (9.10) i = l If a = 1 then we obtain (9.2) (without using (B5)). In a similar way the weighted entropies of degree (?, ?) сап be characterized. We refer to the paper by Dial and Taneja A981). In the characterization of weighted entropies using weighted additivity conditions we get similar to the considerations in Section 3 the functional equation Ih„(V *W,P*Q,) = /,(V, P)I„,(W, Q)+g,„(W, Q)I,(V, P), (9.11) ? e ?/, Q e T,„, V e M!+, ff e R'J, where V * W = (v\ w\,..., v,wm) e R1"' and g,„ : R™ ? T,„ -»¦ R0, /,: R'+ ? ?, -»· R0.
1558 W. Sander If /„, // and gm have the sum-form property with generating functions I, f,g:R+ ? [0, 1] -> R, respectively, then (9.11) leads to the functional equation / m ^^{HviWj^iqj)- g(wj,qj)I(Vi,pi)- f(vl,pi)I(wj,qj)) = 0. (9.12) i=l 7 = 1 In Sander A985) it was shown that all measurable solutions /, /, g of Equation (9.12) (assumed for / = m = 1 or for some fixed pair (l,m), I > 3, m > 3) if ? e [0, 1]', ? e [0, 1]'", V e R'+, W e R'^ are given by I! ^pfuflogpfuf. (9.13) ? = ? II fc^^ufsinOog^uf). (9.15) i = l where k, a, b, c, d are constants (? ? 0). The same result was obtained in Dial and Taneja A981) if / = gin (9.12). There is another interesting result (proved in the theory of gambling) because it leads again to recursive inset entropies (see Meginnis A976)). If W is a utility function defined on a ring В of sets (that is, W : В -> К) satisfying a certain compound gamble rule and if /„ has the form II /,I(W,P) = ^/(W(?,),A) (916) i = l for some regular function / then /„ is given by ,(W p)_\ ?- = , PiW(E,) +a LIU, ?, log Pi, « = 1. where ? is a constant. Here the first term for a — 1 is the expected gain whereas J.R. Meginnis interprets the second sum as the "joy in gambling". Weighted entropies have many applications, for instance, Gil et al. A989) introduced and characterized the following information measures ? - ?!'-1 Pi log(w,7?(W)), a = 1, IH(W,P)=\ , -'" , (9.18) (where E(W) = ?? = ? wiPi) an<J presented applications to statistics.
Measures of information 1559 Different weighted entropies were proposed by Zadeh A968) II M{f,P) = -Yjf{El)Pl\ogPl (9.19) 1 = 1 where /: {E\,..., E„} -> [0, 1 ] or by expressions of the form ? /,,(W,/) = ]>>,#(/(?,)) (9.20) i=l for ffeMJ and some function g : [0. 1] -> E. These last two measures are not yet characterized but they were applied for big classes of functions (see, e.g., Pal and Dutta Majunder A986)). To be convinced that these measures are "good" measures some efforts should be made to characterize them. We close this section by considering information measures which are defined on ? " 1 ?„ = \(??,...,?„): ?#<1. Pi>0\. (9.21) I 1 = 1 J that is, probability distributions are allowed to be incomplete. The question arises (we formulate here only the one-dimensional case) of characterizing real-valued information measures of the the form /,,WP) = W ? u-j *(/,(/>,¦))) (9·22) where ?: ? -> ? is a strictly monotonic function, and where (/„) is polynomially additive: /„„(V * W, ? * Q) = G{Ii(V, P), I,„(W, Q)) (9.23) (V e E'+, W e R™, PeA,,Qe ?,„\ (9.23) is valid for all /, m e N). Here G:RxR^R is a polynomial in two variables. The property (9.21) of an information measure (/„)is called ^-average, and is well- known (see, for example, the book by Aczel and Daroczy A975)). The condition on G implies (assuming a fullness condition of the range of (/„)) that (9.23) has only the form I,m(V *W,P*Q)= I,(V, P) + I„,(W, Q) + ?/,( V, P)Im(W, Q) (9.24) for some ? e ? (see Behara and Nath A973)). In this generality the problem is unsolved, in particular if no regularity assumption is made. To indicate that this problem may lead to interesting information measures we consider a special case, where the weights are given by PS ил = _Г ., <5eE, (9.25) Lj = \Pj so that /„ : ?„ -> ?.
1560 W. Sander The following result is based on a result of Veres A982) (see KrumpelmannA991)). Theorem 9.2. Let S e R. An information measure (/„) with I„ : A, -> R, ? e N, й ш ?-average with weights w, = ??/?/ = ? P^ ш^ /'? polynomially additive satisfying the normalization condition I\ (\) = 1 /y /„(/>) = (??=,?-'«(-??8?))/??=,???. ^??((??=,?-2"?(-|08?'-,)/??=??-). :(((??=,^2"'.(-?8?))/??=|^) 54lOg(l+A)/y у eM, ? = -1, у = 0, ? = 0, У^О, ? = 0, у = 0, ?^?, ?>-1, 1), у^О, ?^?, ?> -1, (9.26) w/геге у e R, ? e [— 1, ??[ a/id h is an arbitrary additive function satisfying h( 1) = 1. We denote /„ in the following by U„'k' saying that it is an entropy of order 1 - у of type(l-log(l+A),a). Concerning continuous solutions of Theorem 9.2 we get that 1/%'*' is continuous if h is continuous. Thus (because of h(\)= 1) we get h{x) = x. With the bijections ]-l,oo[- we get the following result у -> 1 — ?, (9.27) THEOREM 9.3. Let S e R. A nonconstant information measure (/„) with I„ ¦ ?,, -> R, ? e N, where I\ is continuous is an ?-average with weights u>, = pf /?/ = ? Pj an^ 's polynomially additive satisfying the normalization condition I\ (^) = 1 iff I„(P) = ?(-?7=????<*?)/?7=???. ^????-?/^-'?/??-???). B(|-^(-?"=?^?°8?.·)/?:=?/'?_?)) 21 —/? ? ? =1,0=1, ??? 1,0=1, ? г'-/1-! 1 (((??=???+?-,)/??=???),,_/',/(,",,,-?). «* 1.0*1. (9.28) w/геге ?, /3 e R. We denote I„ by ?" , the entropy of order a of type {?, <5). We remark that H„' is 3-parametric generalization of H„ since lira„^i Д, ' = H„ (? constant), lim/j^ ? ?"?-& = H"u& (a constant).
Measures of information 1561 Thus this result contains many results as special cases: Renyi entropies and the results characterized by Sharma and Mittal A975), Van der Pyl A976) and Arimoto A971) (put 5= 1 and 1/a = 2-/3). 10. Summary In this paper we were mainly interested to give a survey on "total entropies", which are useful and may have applications. A useful measure here means that it comes from a real problem, or it has natural and intuitive properties which lead to a characterization (see also AczelA984)). Of course, there are other "justifications" like the maximum entropy principle. For details we refer to the books by Guiasu A977) and Kapur A989). Thus we have gathered some interesting results which perhaps can be a basis for further research, and mainly in Sections 7-9 there is room for additional work which seems worth to be done. Only sometimes we have given applications, but there are a lot of hints and references for those who are interested. The emphasis was on the mathematical background without which there are no applications. We close with two remarks. A) We have not reported on so-called continuous analogous for entropies. Let us only mention that for instance the "continuous" analogue of the Shannon entropy #„, Hp = - j p(x)\ogp(x)dx A0.1) Ja for some continuous probability density function ? is not the limit of the Shannon entropy for discrete distributions ? -5>(дг,-Iоер(дг,-). (Ю.2) i = l but the approximating sums of the integral A0.1) are inset entropies ¦]Tp(&)logp(S/)(.x/ -*,--i) 1 = 1 ? ^F(xi)-F(xi-i)l F(Xi)-F(Xi-i) = -> log (x,--Xj-\) ^ Xi-Xi-i X,~Xi-\ 1 = 1 ? ? = - ? ? log ?, + ? ^ l0SKEi), (Ю.З) i=l i=l where ?,¦ e ]jc,- - jc,_i ], F is the probability distribution function (Ff = p, F(a) = 0, F(b)= 1) with probabilities p, = F(Xj) - F(x,-_i) A ^ /' ^n), sets ?,¦ = ]*,¦_?,*,-] with union [a, b] and Lebesgue measure /(?,¦) = Xj — x\-\.
1562 W. Sander See Aczel A978) for details and further applications, and the books by Guiasu A977) and Kullbacket al. A980) for "entropy as variation of information". For characterization theorems in the "continuous" case we refer to Forte and Sastri A975), Forte and Hughes A989) and Ebanks A998). B) Probably this field of research will never come to an end. Always new and interesting things are coming up. Let us mention here an idea of combining probabilistic concepts and concepts from cooperative game theory (which in some sense is "an application" of fuzzy measures). If m is a fuzzy measure on a finite set N = {1,..., n) (or: the characteristic function of a cooperative game on N) then the Shapley value of m is given by ф(т)= (</>|(m), ...,<Mm)), Ф-Ю, l]P(yV)-> R", 4>i-.[0, lflN)^R, A0.4) where *i \ ТГ (\?\-\?\-1)\\?\\, , 4>i(m)= у -— (m(TUi)-m(T)). A0.5) t—i \N\\ TcN\[i) '' ' But a calculation shows that the Shapley value is a probability measure on N and we can ask for an interpretation and characterization of ? Н(Ф(т)) = -]Г </>,-(/и) log <fc(/w). A0·6) i = l For more details and similar problems we refer to Marichal B002) and Kojadinovic et al. B002). References Aczel, J. A978). A mixed theory of information. VI: An example at last: A proper discrete analogue of the continuous Shannon measure of information (and its characterization). Univ. Beograd. Publ. Elektrotehn. Fak. Ser. Mat. Fiz. 602-633. 65-72. Aczel, J. A980). A mixed theory of information. V How to keep the (inset) expert honest. J. Math. Anal. Appl. 75. 447^453. Aczel. J. A984). Some recent results on information measures, a new generalization and some 'real life' interpretations of 'old' and new measures. Functional Equations: History. Applications and Theory. J. Aczel. ed., Reidel. Dordrecht. Aczel, J. and Daroczy, Z. A975). On Measures of Information and Their Characterizations. Academic Press, New York. Aczel, J. and Daroczy, Z. A978). A mixed theory of information. I. Symmetric, recursive and measurable entropies of randomized systems of events, RAIRO Informat. Theor. 12. 149-155. Aczel, J., Forte. B. and Ng, C.T. A974). Why the Shannon and Hartley entropies are 'natural'. Adv. Appl. Probab. 6. 131-146. Aczel, J. and Ng. C.T. A983). Determination of all semisymmetric recursive information measures of multiplicative type on ? positive discrete probability distributions. Linear Algebra Appl. 52/53. 1-30. Arimoto, S. A971). Information theoretical considerations on estimation problems. Inform. Control 19. 181-194. Behara, M. A990), Additive and Nonadditive Measures of Entropy. Wiley. New York.
Measures of information 1563 Behara. M. and Nath. P. A973), Additive and non-additive entropies of finite measurable partitions. Probability and Information Theory. II. Lecture Notes in Math.. Vol. 296. Springer. Berlin. 102-138. Beli§, M. and Guia§u. S. A968). A quantitative-qualitative measure of information in cybernetic systems. IEEE Trans. Inf. Theory IT-14. 593-594. Daroczy. Z. A970), Generalized information functions. Inform. Control 16. 36-51. DeLuca. A. and Termini, S. A972), A definition of nonprobabilistic entropy in the setting of fiizzy sets theory. Inform. Control 20, 301-312. Dempster, A.P. A967), Upper and lower probabilities induced bx multivalued mappings, Ann. Math. Statist. 38. 325-339. Dial. G. and Taneja, I.J. A981), On weighted entropy of type (?. ?) and its generalizations. Appl. Math. 26. 418^425. Dubois. D. and Prade, H. A980), Fuzzy Sets and Systems, Academic Press. New York. Dubois. D. and Prade, H. A987), Properties of measures of information in evidence and possibility theories. Fuzzy Sets and Systems 24 B). 161-182. Ebanks, B. A983), On measures offuzziness and their representations, J. Math. Anal. Appl. 94, 24-.37. Ebanks, B. A984). Measures of inset information on open domain. 111. Weakly regular, symmetric, ?-recursive entropies, C. R. Math. Rep. Acad. Sci. Canada 6. 159-164. Ebanks. B. A986). Measures of inset information on open domains, 11. Additive inset entropies with measurable sum property. Probab. Theory Related Fields 73, 517-528. Ebanks, B. A989). Generalized characteristic equation of branching information measures. Aequationes Math. 37, 162-178. Ebanks, B. A990), Branching inset entropies on open domains, Aequationes Math. 39. 100-11.3. Ebanks. B. A994). Additive k-dimensional inset entropies with measurable sum property. J. Math. Anal. Appl. 187, 952-960. Ebanks. B. A998), Localizable composable measures offuzziness, Publicationes Math. Debrecen 52, 343-355. Ebanks, В., Kannappan. PI. and Ng. C.T. A987). Generalized fundamental equation of information of multiplicative type. Aequationes Math. 32. 19-31. Ebanks, В.. Kannappan, PL and Ng. C.T. A988), Recursive inset entropies of multiplicative type on open domains, Aequationes Math. 36. 268-293. Ebanks, В., Kannappan. PL, Sahoo. P. and Sander. W. A997). Characterizations of sum form information measures on open domains. Aequationes Math. 54. 1-30. Ebanks, В.. Sahoo, P. and Sander. W. A998). Characterizations of Information Measures, World Scientific. Singapore. Forte. B. A977). Subadditive entropies for a random variable. Boll. Un. Mat. Ital. В E) 14. 118-133. Forte, B. and Gupta, R. A985). Additive and subadditive entropies for discrete n-dimensional random vectors. Aequationes Math. 28. 269-287. Forte, B. and Hughes, W. A989). A representation theorem for entropies with the branching property. Aequationes Math. 37, 219-232. Forte, B. and Sastri. C.C.C.A. A975). Is something missing in the Boltzmann entropy?. J. Math. Phys. 16, 1453- 1456. Gil. M.A.. Perez. R. and Gil. P. A989). A family of measures of uncertainty involving utilities: definition. properties, applications and statistical inferences. Metrika 36. 129-147. Goodman. I.R. and Nguyen, H.T. A985), Uncertainty Models for Knowledge-Based Systems. Elsevier. Amsterdam. Grabisch. M.. Nguyen, H.T. and Walker. E.A. A995). Fundamentals of Uncertainty Calculi with Applications to Fuzzy Inference. Kluwer. Dordrecht. Gross. H. A993). Inset-lnformationsmasse. Ph.D. thesis, TU Braunschweig. Germany. Gross. H. and Sander, W. A991). Determination of all additive inset information measures with measurable sum property, Tech Rep., Technical University Braunschweig 5, 1-28. Guia§u, S. A977), Information Theory with Applications. McGraw-Hill, New York. Hartley, R.V.L. A928), Transmission of information. Bell System Tech. J. 7, 535-563. Ingarden. R.S. and Urbanik. K. A962). Information without probability, Colloq. Math. 9. 1.31-150. Jarai. A. and Sander, W. A997), A regularity theorem in information theory, Publ. Math. Debrecen 47, 1-13.
1564 W. Sander Jessen, В., Karpf, J. and Thorup, A. A968). Some functional equations in groups and rings. Math. Scand. 22, 257-265. Kampede Feriet, J. A963), Theorie de Г Information. Principe du Maximum de VEntropie et ses Applications a la Statistique et a la Mecanique. Publications du Laboratoire de Calcul de la Faculte des Sciences de l'Universite de Lille, Lille. Катаре de Feriet, J. and Forte, B. A967), Information et probabilite 1, 11. Ill, C. R. Acad. Sci. Paris Ser. A 265, 110-114, 142-146,350-353. Kannappan, PI. and Sahoo, P. A989), On a generalization of the Shannon functional inequality, J. Math. Anal. Appl. 140, 341-350. Kapur, J.N. A989), Maximum-Entropy Models in Science and Engineering, Wiley Eastern. New Delhi. Kerridge, D.F. A961), Inaccuracy and inference, J. R. Stat. Soc. Ser. В 23, 184-194. Klir, G.J. and Mariano, M. A987), On the uniqueness of possibilistic measure of uncertainty and information. Fuzzy Sets and Systems 24, 197-219. Klir, G.J. and Yuan. B. A995), Fuzzy Sets and Fuzzy Logic. Theory and Applications, Prentice Hall, Upper Saddle River, NJ. Kojadinovic, I.. Marichal. J.-L. and Roubens. M. B002). An axiomatic approach to the definition of the entropy of a discrete Choquet capacity, Manuscript. Kriimpelmann, M. A991), lnformationsmasse und Distanzmasse. Master thesis, Technical University, Braunschweig, Germany. Kullback, J., Keegel, J.С and Kullback, J.H. A980). Topics in Statistical Information Theory. Lecture Notes in Statistics, Vol. 42. Springer. Berlin. Marichal, J.-L. B002), Entropy of discrete Choquet capacities, European J. Oper. Res. 137, 612-624. Martin, N.F.G. and England, J.W. A981), Mathematical Theory of Entropy, Addison-Wesley, Reading, MA. Meginnis. J.R. A976), A new class of symmetric utility rules for gambles, subjective marginal probability functions, and a generalized Bayes rule. Bus. Econ. Stat. Sec. Proc. Amer, Stat. Assoc.. 471^476. Ng, C.T. A974), Representations for measures of information with the branching property. Inform. Control 25, 45-56. Ng, C.T. A979), Measures of information with the branching property over a graph and their representations. Inform. Control 41, 214-231. Pal, N.R. and Bezdek, J.C. A994). Measuring fuzzy uncertainty. IEEE Trans. Fuzzy Systems 2, 107-118. Pal, S.K. and Dutta Majumder, D. A986). Fuzzy Mathematical Approach to Pattern Recognition. Wiley. New York. Pap, E. B002), Pseudo-additive measures and their applications. Handbook of Measure Theory. E. Pap, ed.. Elsevier, Amsterdam, 1403-1468. Ramer, A. A987), Uniqueness of Information Measure in the Theory of Evidence. Fuzzy Sets and Systems 24. 183-196. Ramer, A. A993). Total uncertainty of possibility assignments. Proc. 2nd IEEE Int. Conf. on Fuzzy Systems. San Francisco, 1220-1224. Ramer, A. and Lander, L. A987). Classification of possibilistic uncertainty and information functions. Fuzzy Sets and Systems 24 B), 221-230. Sander, W. A985), On a sum form functional equation, Aequationes Math. 28, 321-326. Sander, W. A987), The fundamental equation of information and its generalizations, Aequationes Math. 33, 150-182. Sander, W. A989), On measures offuzziness. Fuzzy Sets and Systems 29. 49-55. Sander, W. A997), A characterization theorem in the Dempster-Shafer theory. Abstracts of Linz-Seminar: Enriched Lattice Structures for Many-Valued and Fuzzy logics, Linz, 47-50. Schweizer, B. and Sklar, A. A983). Probabilistic Metric Spaces. North-Holland, Amsterdam. Shafer, G. A976), Л Mathematical Theory of Evidence, Princeton University Press, Princeton. Shannon. C.E. A948), A mathematical theory of communication. Bell System Tech. J. 27. 379^423, 623-656. Sharma, B.D. and Mittal, D.P. A975). New non-additive measures for discrete probability distributions, J. Math. Sci. 10, 28-40. Sharma, B.D. and Taneja, I.J. A975), Entropy of type (?, ?) and other generalized measures in information theory, Metrika 22, 205-215.
Measures of information 1565 Taneja, I.J. A989), On generalized information measures and their applications. Adv. Electron. Electron Phys. 76,327^413. Theil, H. A967), Economics and Information Theory, North-Holland. Amsterdam/Rand McNally. Chicago. Van der Pyl, T. A976). Axiomatique de /'information d'ordre a et de type ?. С. R. Acad. Sci. Paris Ser. A 282, 1031-1033. Veres, S. A982), On afunctional equation in connection with information theory, Aequationes Math. 24, 179- 186. Zadeh, L.A. A968), Probability measures of fuzzy events, J. Math. Anal. Appl. 23, 421—427.
Author Index Roman numbers refer to pages on which the refer to reference pages. Aaboe, A. 25 Aarnes, J.F. 842, 866 Abraham, P. 174 Abramov, L.M. 1227, 1229, 1231 Aczel, J. 972, 1007, 1405, 1407, 1408, 1465. 1525, 1526, 1528, 1535, 1539, 1540, 1544, 1545, 1550, 1559, 1561, 1562,7562 Adams, E.W. 1486, 1494, 1518 Adams, G. 1332, 1366, 1378, 1513, 1521 Adams, J.F. 90, 120 Adams, R.A. 1243, 1244, 1259 Adamski, W. 506, 526, 783, 783 Adian, S.I. 96, 120 Adler, R.J. 1044, 1051 Adler, R.L. 1225,1231 Agbeko, K. 1510, 1518 Ajupov, S.A. 741 Akcouglu, M.A. 153,774 Akian, M. 1432, 1433, 1465,1466 Albeverio, S. 1304, 1319, 1327 Alchourron, C.E.P 1505, 1518 Aleksjuk, V.N. 174 Alexiewicz, A. 610 Alfsen, E.M. 227, 231, 245, 438, 444, 512, 526 Aliprantis, CD. 714, 716, 741, 796, 797, 805, 822 Almgren, F.J., Jr. 1028, 1032, 1034, 1035 Alo, R. 660, 666 Ambrosio, L. 1034, 1035 Amemiya, I. 814, 822 Ames, W.F. 1440, 1466, 1467 an der Heiden, U. 507, 512-514, 518, 526 Anantharaman, R. 439, 444 Andersen, E.S. 1319,1327 Andersen, K. 25 Anderson, R.M. 1304, 1305, 1310, 1313, 1314, 1316, 1317, 1319, 1323, 1327 Ando,T. 127.174 Andrews, K.T. 543, 559, 560, 580 Anger, B. 512, 520, 526, 526 ir (or his/her work) is mentioned. Italic numbers Aniszczyk, B. 677, 700, 773, 783 Antosik, P. 128, 135, 146, 149, 150, 157, 162, 170, 174, 175 Anzai. H. 739, 741, 1221,1231 Appell, J. 807, 823 Arbeiter, M. 1050, 1051 Arias de Reyna, J. 440, 444, 741, 741 Arimoto, S. 1561.1562 Arkeryd. L. 1327 Armstrong, Т.Е. 741, 742 Arnold, K. 1102, 1127 Amoux, P. 1227, 1228. 1232 Arrow, K. 1517, 1518 Artstein, Z. 628, 639, 643, 646, 647, 650, 651, 653, 654. 656, 662-666, 666 Arveson, W. 776, 783 Asai, K. 1331, 1378 Ascherl, A. 724, 742 Ash, R.B. 517, 519, 520, 526 Asplund, E. 510, 526 Assani, I. 1214, 1232 Attouch, H. 655, 664. 665, 667 Aubin, J.R 619, 666. 667 Aumann, G. 508, 511 -513, 524, 525, 526 Aumann, R.J. 619, 636-638, 662, 667, 949, 1007 Aversa. V. 238, 245, 699, 700 Azencott, R. 1466 Babiker, A.G. 1152-1154, 1170-1172, 1175, 1177, 1178 Baccelli, F. 1405, 1430-1432, 1436, 1466 Bachelier. L. 342 Bagchi, S. 660, 667 Balder, E. 638, 657, 662, 663, 667 Ban, A.I. 905, 906 Banach, S. 85, 90,92, 117, 120, 224,245, 436,444, 783, 1093, 1096, 1112, 1121, 1127, 1263, 1290 Bandt. С 436, 444, 1112, 1121, /127 Baraki, G. 1121. 1127 1567
1568 Author Index Barbati, A. 629, 633, 634, 637, 667 Barbieri, G. 914, 935, 940, 944, 949, 1006, 1007, 1007 Barcenas, D. 634, 667 Baron, M.E. 25 Bartle, R.G. 81, 398, 439, 444, 564, 580, 595, 610, 1083, 1090 Basile, A. 261, 262, 292, 741, 742, 944 Bator, E.M. 558, 580, 581 Batt, J. 514, 520,526 Bauer, H. 68, 81, 956, 961, 963, 1007 Bayod, J.M. 1304, 1328 Beck. A. 576, 581 Becker, H. 118, 120, 1110, 1115-1117, 1121, 1123,7/27 Bedford, T. 1044,705/ Beer, G. 629, 633, 634, 637, 655, 657, 667 Behara, M. 1525, 1528, 1559, 1562, 1563 Beli§, M. 1556, 1557, 1563 Bellman, R.E. 1431, 1435, 1466 Bellow, A. 1133, 1170, 1171, 1178 Belluce, L.P. 956, 958, 1008 Beltrametti, E.G. 866, 866 Ben Amor, N. 1502, 1518 Benes, V.E. 638. 667, 776, 783 Benferhat, S. 1481, 1485. 1486, 1491, 1493, 1495, 1501, 1502, 1510, 1518 Bennett, M.K. 171, 176, 834, 867 Benvenuti, P. 288, 292, 1332, 1333, 1343, 1347- 1349, 1351, 1375, 1377, 1407, 1466. 1518 Berberian, S.K. 175, 520, 527, 1112, 1127 Berg, С 520, 527 Berge, С. 1385, 1397, 1400 Bergelson, V. 1202, 1205, 1211, 1232 Bergh, J. 813, 823 Bemau. J. 918, 944 Bemik, V.I. 1044, 1051 Berti, P. 263, 292 Besicovitch, A.S. 228, 233, 242, 245, 680, 700, 1017, 1020, 1022, 1024, 1035, 1044, 1051 Bessaga, С 438, 440, 444, 553, 581 Bezdek, J.C. 1525, 1553, 1555, 1564 Bharucha-Reid, A.T. 1177, 1183 Bhaskara Rao, K.P.S. 81, 175, 261, 262, 292, 730, 732, 742, 769, 779, 780, 783, 786, 940, 944, 1178 Bhaskara Rao, M. 81,175, 730, 732, 742,940, 944, 1178 Bianconi, R. 1306, 1327 Bichteler, K. 508, 509, 511-513, 522-525, 527, 1140, 1154, 1178, 1179 Bierlein, D. 724, 742 Bigard, A. 879, 906 Billingsley, P. 313, 337, 342, 647, 667, 767, 769, 783 Bilyeu, R.G. 581 Bing, R.H. 1290 Birkhoff, G.D. 19, 25, 175, 269, 292, 534, 581, 797, 823, 830, 836, 866, 871, 906, 956. 1008, 1193, 1232 Bishop, E. 437, 444 Bismut, J.M. 639, 667 Bjomsson, O.J. 771, 783 Blackwell, D. 772, 779, 783, 783, 784, 992, 1008 Bliedtner, J. 229, 245, 245,1179, 1319, 1320,1328 Blum. JR. 1205, 1232 Blumberg. A. 679, 700 Blume, F. 1223, 1232 Blumen, G.W. 1405, 1466 Blumenthal, R.M. 342 Bobillo Guerrero, P. 513, 526, 527 Boccuto, A. 287, 292 Bochner, S. 260, 266, 293, 399 Boehme, Т.К. 1254, 1259, 1260 Bogdan, V.M. 513, 527, 742 Bogdanowicz, W.M. 506, 513, 527 Bogoliouboff, N.N. 1117.1127, 1199, 1234 Bolanos, M.J. 1332, 1377 Bollman, P. 1477, 1522 Bombieri, E. 1035, 1035 Bongiomo, B. 82, 234, 245, 371, 383, 399, 581, 592-599, 610 Bonsall, ?.???,444 Boole, G. 834, 866 Borkowski, L. 1008. 1009 Bouchon-Meunier, B. 1333, 1377, 1510, 1518 Bourbaki, N. 25, 82, 256, 293, 399, 527, 749, 762, 940, 944, 1057-1059, 1062. 1072, 1086, 1090 Bourgain, J. 438, 441. 444, 557, 568, 581, 1198, 1210, 1213, 1232 Bourgin, R.D. 438, 444 Boutilier, С 1474, 1518 Bowen, R. 1044, 1051, 1225, 1232 Boyer, C.E. 26 Boyvalenkov, P. 228, 245 Brehmer, S. 512, 527 Breiman, L. 297, 308, 313, 342, 1222, 1232 Broadbridge, P. 1405, 1452, 1453, 1466 Brook, С 742 Brooks, J.K. 127, 128, 139, 149, 153, 175, 283, 284,293, 297, 313, 330, 342, 342,399,441,444, 451,452, 454,455,458,469,477^479, 482,485, 492, 495, 500, 501, 581, 1446, 1466 Brothers, J.E. 1013, 1035 Brown, G. 1050, 1051 Brown, R. 342
Author Index 1569 Bruckner, A.M. 194, 201, 206, 225, 231, 232, 235, 242-244, 245, 247, 596, 610, 677, 687-689, 700 Bruckner, J.B. 194, 235, 243, 244, 245, 247 Brudno, A.A. 1226, 1232 Buczolich, Z. 245, 590, 591, 600, 604, 606, 610 Bukhvalov, A.V. 813, 823 Bullen, PS. 592, 598, 610 Bunce, L.J. 842, 866 Bungart, L. 510, 526 Burger, H.C. 1477, 1522 Burgers, J.M. 1440, 1466 Burke, M.R. 87, 120, 757, 758, 762, 1133, 1137, 1144, 1153, 1156, 1157, 1159, 1179 Burkill, J.C. 598, 599, 610 Burkinshaw, O. 714, 716, 741, 796, 797, 805, 822 Bumes, J.A. 628, 666 Burzyk, J. 1259, 1259 Busemann, H. 205, 231, 241, 245 Butnariu, D. 876, 879, 906, 913, 918, 925, 927, 930,940, 944,949,951,952,954-957,964, 965, 973, 983, 988, 991. 997, 1008, 1476, 1518 Byrne, C.L. 638, 667 Cabos, J. 1501,7522 Cafiero, F. 127, 130, 153, 175 Calderon, A.P. 1099,1127 Campbell, J.T. 1213, 1232 Candeloro, D. 260, 263, 275, 276, 278, 279, 283, 284, 289, 290, 293. 477, 478, 501, 524, 527 Cao, S.S. 608, 610 Caratheodory, С 82, 240, 246, 693, 700, 747, 762, 889, 907 Carlson, Т. 724, 742,1179 Carnap, R. 1497, 1518 Carothers, D.C. 1179 Carrillo, M.D. 259, 293 Carrington, D.C. 598, 610 Cartan, H. 1113,7/27 Cassels, J.W.S. 654, 667 Cassinelli, G. 866, 866 Castaing, С 284, 293, 619, 622, 630-633, 638, 639, 643, 657, 662. 664, 666, 667 Cawley, R. 1050. 1052 Cayrol, M. 1508, 1518 Ceccherini-Silberstein, T. 85, 96, 120 Cembranos, P. 440, 444 Cerf. R. 666, 667 Chacon, R.V. 153, 175, 297, 313, 330, 342, 342, 1195, 1206, 1232 Chakrabarti, S. 677, 701, 1181 Chakraborty, N.D. 581, 583 Chang, C.C. 175, 830, 853, 860, 866, 878, 879, 905, 907, 913, 914, 944, 956, 1008, 1267, 1290 Chateauneuf, A. 1389, 1390, 1400, 1477, 1518 Chatterji, S.D. 268, 273, 274,293, 581, 1175, //79 Chelidze, V.G. 594, 610 Chemoff, H. 992, 1008 Chew, T.S. 594, 598, 600, 602, 610. 611, 613 Chictescu, I. 524, 527, 581 Chistensen, J.R.P. 520, 527 Chlebik, M. 1045,7052 Choban, M.M. 753, 762 Choe, G.H. 1206, 1232 Choirat, С 643, 664-666, 667 Choksi, J.R. 748, 749, 752, 753, 755, 758, 760, 762, 1175,7/79 Chong, K.M. 813, 823 Choquet, G. 437, 438, 444, 527, 652, 655, 668, 1331, 1377, 1383, 1400, 1466 Choukain-Dini, A. 660, 668 Chovanec, F. 171, 177, 834, 867, 889, 907 Christensen, E. 842, 866 Christensen, J.PR. 1114, 1127, 1147, 1168, 1179 Chuaqui, R.B. 117. 120, 1096, 1100, 1102, 1127 Cichon, J. 1106,7/27 Cicogna, G. 513, 527 Ciesielski, K. 117,120,682, 700, 1124, 1127,1127 Cignoli, R. 878, 879, 905, 907, 944 Cilin, V.I. 741 Clauzure, P. 662, 667 Clemens, J.D. 1110, 1127 Clifford, A.H. 951, 1008 Climescu, A.C. 950,1008 Cohen, G. 1405, 1430-1432, 1436, 1466 Cohen, P.J. 1290 Cohn, D.L. 82, 516,527, 769,778, 784, 1169,1179 Cole, J.D. 1050, 1051, 1052, 1440, 1466 Collet, P. 1050, 1052 Collins, H.S. 552, 581 Colombeau, J.F. 1254,1259 Colubi, A. 657, 668 Comfort, W.W. 242,246, 754, 762, 1279, 1290 Constantinescu, С 128, 144. 175, 509, 510, 527, 742 Contreras, M.D. 441, 444 Conze, J.-P. 1211,/2i2 Cook, A.T./75 Cooke, R. 842, 866 Comfeld. I.P 1189, 1191, 1206, 1209, 1210, 1221, 1232 Corson, H.H. 534, 581 Cortes, U. 1501, 1522 Coste, A. 643, 646, 647, 660, 668 Collar, M. 1214, 1232 Couvreux, J. 655, 665, 668 Cox, J.C. 1448, 1449, 1466 Cox, R. 1465, 1466
1570 Author Index Crandall, M.G. 1435, 1466 Cressie, N. 653, 662. 668 Cresswell, M.J. 1474, 1521 Csomyei, M. 700, 700 Cuninghame-Green, R.A. 1407, 1408, 1466 Cutland, N.J. 1328 Cutler, CD. 1043, 1048. 1052 Dal Maso, G. 656, 664, 665, 668 Dalgas, K.P. 1152,7/79 Dall'Aglio, G. 776, 784 d'Andrea, A.B. 170. 175 Dang-Ngoc-Nghiem 1102, 1127 Daniell. P.J. 507, 510,527 Darmawijaya. S. 598, 611 Daroczy, Z. 1525, 1526, 1528, 1539, 1559, 1562, 1563 Darst, R.B. / 75, 772, 784 Dashiell, F.K. 175 Dauben, J.W. 119,720 Daures, J.P. 660, 668 Davies, R.O. 591,611, 1047, 1052 Davis, W.J. 272, 293 De, A. K. 691.70/ de Amo, E. 522, 524. 527 De Baets, B. 954, 1008. 1352. 1377, 1487, 1518 de Branges, L. 438, 444 de Campos, L.M. 1332. 1377. 1501, 1515. 1518 De Cooman, G. 1488, 1499, 1509, 1510, 1515, 1519 DeFinetti, B. 1471, 1473,75/9 De Giorgi, E. 656, 664, 668, 1025, 1028, 1034, 1035. 1035. 1331, 1377 de Glas. M. 956, 1008 de Guzman. M. 226, 228. 231, 240-242, 245. 246, 677,691,70/ de Korvin, A. 583, 660, 666 de la Harpe, P. 85, 96,120 de la Vallee Poussin, Ch. 247 de Leeuw, K. 437, 444 DeLia, A. 513. 527 de Lucia, P. 170, 173. 174,175,176. 742, 836.867, 941,944 de Maria, L.J. 176 dePossel, R. 212, 230, 231,247 Debreu, G. 619, 622, 626, 643. 647, 668 Dedekind, R. 85. 120 Defant, A. 437, 444 Dekiert, M. 773, 784 Dekker, T.J. 103. 104. 120 Dekking, F.M. 1223, 1232 delJunco, A. 1229, 1231, 1233 Dellacherie, С 301, 342, 386, 393, 399, 451, 453, 477, 491, 494, 495, 501, 630, 668 DeLuca, A. 1525, 1551, 1563 Dempster, A.P. 1390, 1400, 1477, 1519, 1547, 1563 Denjoy, A. 589, 599. 611, 683, 686-688, 700 Denneberg, D. 252, 293, 512, 518. 527, 1331, 1332, 1334, 1336, 1339, 1363, 1377, 1385, 1388, 1390, 1400, 1466 Deuber, W.A. 96, 120 Devlin. K.J. 1290 Di Nola, A. 889, 898, 907, 944, 956, 958, 1008 Di Piazza, L. 234, 245, 581, 592-599, 601, 603, 610,611 Dial, G. 1557, 1558, 1563 Diaz, S. 440,441,444, 574, 575. 581 Diaz Carrillo, M. 513, 514, 522, 524-526, 527 Dichtl, M. 832, 866 Dickman, M.A. 1290 Diestel, J.J. 82, 128, 150, 151, 153, 154, 176. 266, 272,293, 362, 379,399,437,439^*41, 444,445, 520, 527, 5.33, 536, 539, 540, 542, 572, 581, 627, 636,646,668,709,712, 741, 742, 937, 944, 1081,/090, 1422, 1466 Dieudonne, J. 128, 163, 176, 1081, 1083, 1090, 1179 Dilworth, S.J. 570, 572, 582 Dimitrov, D.B. 540, 582 Dinaburg, E.I. 1225, 1232 Dinculeanu, N. 56, 82, 146, 175, 176, 347. 356. 360-364, 368, 371, 372, 379, 383,399, 432.440. 445, 451, 452, 454, 455, 458, 469,478, 479, 482, 485.492,495, 500,501,524, 527, 560, 581, 582. 646, 668, 1058, 1060, 1080, 1085, 1090. 1177, 1178, 1179 Dim G. 805, 823 Ditkin, V.A. 1258, 1260 Ditzen, A. 1120,//27 Divari, M. 1347, 1349, 1377 Djvarsheishvili. A.G. 594, 610 Dobbertin, R. 1050, 1052 Dobrakov, I. 176, 399, 442, 445, 534, 582, 742. 1085, 1090 Dobric, V. 576. 582 Dodson, M.M. 1044. 1051, 1052 Dominguez-Menchero, J.S. 657, 668 Dong. W.L. 660, 668 Donoghue, W.F. 1179 Doob, J.L. 779. 784, 1321, 1322, 1328 Dorofeev, S.V. 843, 866 Doss, H. 6.39, 668 Doss, S. 638, 668 D'Ottaviano, I.M.L. 878, 879, 905. 907, 944 Dougherty, R. 108, 120, 1114, 1127 Douglas, J. 1027, 1035
Author Index 1571 Dowling, P.N. 440. 445 Drake, F.R. 1267. 1284, 1285, 1290 Drewnowski, L. 127, 137, 153, 176, 534, 541-543, 572, 573. 575.582, 643. 644, 668, 705, 706, 723, 738,740,741, 742,944 Dreyfus, S.E. 1431. 1435, 1466 Drisch. T. 844, 866 Dubins. L. 110,720 Dubois, D. 954, 1008, 1406, 1410, 1426, 1427, 1463-1465, 1466, 1474, 1476, 1477, 1481, 1483, 1486, 1488, 1491-1493, 1495, 1498, 1501-1505, 1507, 1508, 1510-1517, 1518- 1520, 1550. 1553, 1563 Dudley, R.M. 82 Dumitrescu, D. 905, 907, 954, 1008 Duncan, R. 1115,1179 Dunford, N. 82. 127, 129, 146, 149, 152, 153, 176, 266, 269, 279, 293,399.439,444. 513, 514, 520, 528, 539, 561, 564, 580.582, 669, 740, 742, 944, 1059, 1083,7090, 1144,7/79 Dupacova, J. 665, 669 Dutta Majumder. D. 1559, 1564 Dvoretzky, A. 571, 582, 949, 993, 1006, 1008 Dvurecenskij. A. 171, 174, 176, 830. 831, 835, 836, 840, 842-845, 848, 850, 851, 857, 858, 860, 862, 865, 866, 866, 867, 879, 898, 907, 919, 944 Dye, HA. 1102,7/27 Dynkin, E.B. 297, 329, 332. 342, 639, 669 Easton, W.B. 1290 Ebanks, B. 1525, 1533, 1534, 1539-1541, 1551- 1554, 1562, 1563 Edgar, G. 533, 536-538, 540, 543, 559, 560, 582 Edgar, G.A. 206, 246. 437, 445, 781, 784, 1039, 1042, 1049, 1050, 1052, 1133, 1170, 1177, 1178, 1179 Edwards, C.H., Jr. 10, 16. 18,26 Edwards, R.E. 1083, 1085, 1090 Egghe. L. 582 Eggleston, H.G. Ill, 120 Egorov. D.F. 72, 82 Ehrenpreiss, L. 1252,1260 Eifrig, B. 1179 Eigen, S.J. 755, 762 Eilers, M. 844. 867 Einstein, A. 342 El Amri. K. 637. 669 Elek. G. 95, 120 Elliott, G.A. 878, 907 Ellis. H.W. 1140, 1179 Eisner, J. 522, 528 Elstrodt, J. 26, 68, 82 Elton, J. 438. 445 Emch, G.G. 878. 907 Emmanuele, G. 440, 445, 575, 582 Ene, V. 234, 246 Engelking, R. 82. 760. 762 England, J.W. 1525, 1564 Erben, W. 534, 582, 1149, 1179 Erdos, P. 749, 762. 1104, 1127 Erochin, V.D. 772, 784 Ershov, M.P. 781, 784 Es-Sahib, A. 639. 669 Esteva, F. 1474, 1521 Etemadi, N. 656. 669 Evans, L.C. 1013, 1025, 1035 Evans, L.E. 225, 228, 240, 246 Evstigneev, I.V. 639, 643, 669 Ezzaki. F. 657, 659, 667, 669 Faden. A. 779, 784 Faires, B. 176,581 Falconer. K.J. 246, 1013, 1022, 1035, 1039, 1040, 1042-1044, 1046, 1048-1051, 1052, 1411. 1412, 1466 Fan, A.H. 1048, 1052 Farag, H.M. 1025, 1035 Farah, I. 1179 Farell, R.H. 1118, 1127 Fargier, H. 1483, 1488, 1492, 1502-1504, 1512, 1515-1517, 1519,1520 Farinas del Cerro, L. 1474, 1498, 1502, 1505, 1506, 1519, 1520 Farkova, J. 176, 742 Farmaki, V. 582 Farreny, H. 1508, 1518 Fatou, P. 1143,7/79 Federer, H. 182, 205, 220, 225, 226, 234, 246, 1013, 1015, 1017-1019, 1024-1026, 1028, 1031-1033, 1035, 1039,/052 Fefferman, C.H. 524, 528 Fejzic. H. 604, 611 Feldman, J. 1122, 1128 Feledziak, K. 801, 805, 813, 815, 823 Feller, W. 82, 205, 231, 241, 245, 297, 342, 343 Fenstad, J.E. 1304, 1319, 1327 Ferenczi, S. 1198, 1221. 1232 Fernandez, A. 440, 444, 574, 575, 581 Filipczak, M. 690, 700 Filipczak, T. 690, 691, 700 Fillmore, PA. 1179 Filter, W. 510, 527 Fine, T.L. 1475. 1497, 1520 Fink, J.P 582 Fishbum, P. 1475, 1477, 1484, 1520 Fisk, D.L. 501 Flachsmeyer, J. 520. 528
1572 Author Index Fleischer, I. 594, 611, 944 Fleissner, R.J. 596. 610 Fleissner. W.G. 1281, 1282, 1290 Fleming, W.H. 1028, 1031-1033, 1035 Florencio, M. 440, 444, 573-575, 581, 582 Floret, K. 437. 444, 507, 521, 523, 528 Fodor, G. 1290 Fodor, J.C. 1519 Foias, С 379. 399, \\??,1179 Folmer, ?. 501 Fomin, S.V. 1117, 1128, 1189, 1191, 1206, 1209, 1210. 1221, 1232 Foran, J. 596. 610. 690. 700 Foreman, M. 108, 119,120, 121 Forte, B. 1525, 1529. 1544. 1545, 1550, 1562. 1562-1564 Foulis, D.J. 171,176, 178, 833, 834, 867 Fourie. J.H. 437, 444 Frank, M.J. 949, 951, 952, 1008 Frankiewicz. R. 677, 700, 1179 Frankowska. H. 619, 667 Frechet, M. 638, 669, 705, 742 Freedman, D. 297, 301, 343 Freidlin, M.I. 1466 Freiling, С 599, 604. 611 Fremlin, D.H. 29, 42, 44, 48-50, 59, 63, 65, 66, 68, 73-75, 81, 82, 257, 293, 516, 528, 533, 535, 548, 550, 553, 557, 559, 568, 581- 583, 608, 611, 747-751. 753, 755-757, 760, 761. 762, 763, 796, 797, 801. 805, 823, 889, 899, 906, 907, 919, 944, 1102-1104, 1108, 1124, 1128, 1133-1135, 1138-1140, 1150- 1157, 1159-1161, 1163. 1165, 1170. 1177, 1179, 1288, 1290,7290 Freniche, F.J. 128, 144,176, 440, 445, 575, 583 Freudenthal, H. 797, 822, 823 Fric, R. 907 Friedman, N.A. 1093, 1099, 1128, 1189, 1221, 1232, 1492, 1517,7520 Friedrichs, K. 1239,7260 Frisch, U. 1049,1052 Fu, S. 598,6/7 Fuchs, L. 836, 867, 879, 898, 907 Fuglede, B. 1180 Fiiredi, Z. 228, 246 Furstenberg, H. 755, 763, 1187, 1199, 1201, 1202, 1208. 1209. 121 \, 1232, 1233 Futas, E. 512, 528 Galileo Galilei 85,121 Gamelin, T.W. 438, 445 Gao, S. 1110, 1121, 1127, 1128 Gapaillard, J. 1140, 1180 Garcia, F. 176 Garcia, L. 1501, 1510,1518 Garcia, P. 1474, 1521 Gardenfors, P. 1474, 1482, 1487, 1490. 1497, 1498, 1505, 1506, 1518,1520 Gardner. R.J. 101, 110, 115. 116, 121. 747. 748, 763, 111, 784, 1180 Garg. K. 232. 246 Gariepy, R.F. 225, 228. 240,246, 1013, 1025,1035 Garling, D.J.H. 439. 445 Garsia, A.M. 1195, 1233 Gaubert, S. 1466 Gebhardt. J. 954, 1009, 1501, 1518, 1520 Geffner, H. 1502, 1518 Geitz, R. 542, 543, 550, 556, 583 Gelfand, I.M. 539, 583, 1247. 1260 George, С 82 Georgescu, G. 830, 860, 861, 865, 867, 889, 907, 944 Georgiou, P. 1154, 1180 Gerla, B. 904, 907 Getoor, R.K. 342 Ghoussoub, N. 583 Gibson, F.G. 692, 700 Gil, M.A. 657, 668, 1558, 1563 Gil, P. 1332, 1378, 1558, 1563 Gil Alvarez, P. 1515, 1522 Gil de Lamadrid, J. 439, 445 Gillman, L. 1162, 1180 Gine, E. 647, 669 Girardi, M. 570, 572, 582 Gisolfi, A. 1501,7522 Gitik, K. 1288,7297 Giuntini, R. 834, 867 Giusti, E. 1013, 1025, 1034, 1035, 1035 Glasner, E. 1208, 1231, 1233 Gleason, A.M. 830, 841, 867 Glicksberg, I. 514, 520, 528 Gnedenko, B.V. 767, 779, 784 Goard, J.M. 1405, 1452, 1453, 1466 Godel, K. 1265,7297 Godet-Thobie, С 643, 647, 666, 669 Godin, V. 754, 763 Godo, L. 1465, 1466, 1474, 1510, 1511, 1518, 7579. 7527 Goffman, С 683, 689, 692, 693, 707 Goguen, J.A. 1506,7520 Golan, J.S. 1401,1466,1467 Goldman, A. //SO Gondran, M. 1467 Goodearl, K.R. 867. 878, 879, 890, 904, 907 Goodman, I.R. 1525, 1563 Goodman, T.N.T. 1225. 1233 Goodwyn. L.W. 1225, 1233
Author Index 1573 Gordon, H. 242, 246 Gordon, R.A. 236, 246, 583, 591. 592, 594. 595, 597, 598, 608, 611 Gordon, Y. 439, 445 Gottwald, S. 949, 950,1008 Gould, G.G. 506, 513, 518, 528 Gowurin, M. 399 Grabisch, M. 954, 1008, 1331, 1333, 1343, 1364, 1377, 1383, 1385, 1388, 1390, 1391, 1394, 1396-1399, 1400, 1405-1407, 1467, 1475, 1477, 1514, 1515,1520, 1525, 1529, 1550,1563 Graf, S. 1044,7052, 1133, 1136, 1137, 1139, 1144, 1147, 1149, 1151, 1173, 1177, 1178, 1178,1180 Graham, C.C. 443, 445 Grant, S. 1515,7520 Grattan-Guinness, I. 26 Gravereaux, B. 451, 501 Graves, W.H. 583 Graziano, M.G. 830, 857. 860, 867 Greco, G.H. 252, 293. 512, 518, 528, 1331. 1378 Greechie, R. 832, 867 Greenleaf, F.P. 1118.1128 Grekas, S. 522, 528, 750, 751, 753-756, 758-761, 762,763, 1133, 1153, 1154, 1180 Greschonig, G. 1120, 1128 Gretsky, N.E. 776, 784 Greuling, H. 834, 867 Gribanov, Yu.I. 798, 823 Grigorchuk, R.I. 85, 96, 120 Grimeisen, G. 534, 582 Gross, H. 1540,1563 Grothendieck, A. 135, 176, 437, 439, 445, 550, 583, 1081-1085, 1087, 1088, 1090 Grove, A. 1474,7520 Gruenhage, G. 1150, 1180 Gryllakis, С 748, 750, 751, 755, 758, 759, 761, 763, 1133, 1153,1180 Guariglia, ?. ПО, 176 Gudder, S. 866, 867, 907 Guiasu, S. 1525, 1529, 1556, 1557, 1561, 1562, 1563 Guivarc'h, Y. 1466 Gulisashvili, A. 583 Gunawardena, J. 1466,1467 Gundy, R.F. 1466 Gunson, J. 842, 867 Gunzler, H. 506, 509, 513, 514, 516, 518, 525, 526, 527, 528 Gupta, R. 1544, 1545, 1563 Gurevich, B.L. 510, 529 Guselnikov, N.S. 176 Haar, A. 1110, 1128 Habil, E.D. 176 Hackenbroch. W. 774, 784 Hadjiev, D. 741 Hadzic, O. 1426, 1467 Hagler, J. 436, 445 Hagood, J.N. 279, 293 Hagood, J.W. 524, 528 Hahn, F.J. 1199,1233 Hahn, H. 176 Hahn, M.G. 647, 669 Hajek, P. 949, 950, 1008, 1474, 1520, 1521 Hajian, A. 1093, 1099, 1100, 1128 Hall. M., Jr. 97, 121 Halmos, P.R. 50, 59, 82. 177, 251, 255, 291, 293, 399, 520, 528, 748, 752, 763, 767, 769, 784, 889, 907, 956, 992, 995-997, 1004, 1005, 1008, 1060. 1061, 1071. 1090, 1095, 1100. 1101, 1113, 1128. 1140, 1180, 1189, 1208, 1214, 1233, 1337, 1339, 1378 Halpem, J. 1472. 1491, 1492, 1517. 1520 Halsey, T.C. 1049, 1052 Hammer, PL. 1384,7400 Hanf, W.P 7297 Hansel, G. 775. 784,1180 Hansell, W. 7290 Hansen, J.C. 654, 666 Hanson, D.L. 1205,/2i2 Hardt, R. 1013, 1028, 1035 Harima, ? 1331, 1378 Harmancova, D. 1474, 7520, 7527 Harmanec, D. 1474, 7527 Harrison, J.M. 331, 335, 343 Hart, S. 647, 650, 651, 656, 666, 669 Hartley, R.V.L. 1527, 756i Hashimoto, H. 699, 707 Hashimoto, K. 583 Hasselblatt, B. 1044, 1053, 1189, 1199, 1233 Haupt, O. 683, 707 Hausdorff, F. 26, 121, 1039, 7052 Hawkins, T. 22, 25, 26 Haydon, R. 777, 546, 548, 558. 583, 776, 784 Hayes. C.A. 181, 187, 201, 203, 206, 225, 226, 231, 238, 241, 242, 244, 245. 246 Heath, T. 8, 26 Hebert, D.J. //SO Heilio, M. 575. 583 Heinich, M.H. 568, 583, 639, 669 Heisenberg, W. 829, 867 Hejcman, H. 177 Hejcmann, J. 711, 742, 940, 944 Heller, G. 1170-1172, 1175, 1178, 1180 Helmer, D. 583 Hensgen, W. 440, 445 Hensley, D. 1044, 1052
1574 Author Index Henstock, R. 188, 201, 236, 246, 589, 591, 592, 598, 600, 608-610, 611, 612 Herer, W. 638, 669, 705, 723, 742 Herstein, I.N. 1511,752/ Herzig, A. 1474, 1498, 1502, 1505, 1506, 1519, 1520 Hess, С 285. 293, 623. 628. 629. 633. 634. 637. 639, 643, 647, 650, 654-657. 660. 662-665. 667-670 Hewitt, E. 98, 121. 752, 763. 1059. 1060, 1090. 1106, 1113, 1126, 1128 Hiai. F. 619, 629, 635, 637-641. 643. 645. 646. 651, 654, 656. 657. 660. 663, 670 Hildenbrand, W. 619, 628, 647, 662, 670 Himmelberg. C.J. 629. 632, 670 Hirsch, M. 110,720 Hisdal, E. 1521 Hobson, E.W. 679. 701 H0egh-Krohn, R. 1304, 1319. 1327 Hoffmann. D. 509. 512. 522, 525, 528 Hoffmann, K. 1143, 1149, 1180 Hoffmann-J0rgensen, J. 575, 576, 583. 742, 775, 776, 778, 784, 1114, 1130, 1175, 1180 Hofmann, K.H. 754, 762 Hohle, U. 949, 954, 956, 1008 Holmes, R.B. 550, 583 Hopf, E. 1093, 1096,1128, 1195,1233, 1440,1467 Hormander, L. 1244, 1247, 1260 Horst, E. 844, 867 Host, B. 1209, 1231, 1233 Howard, E.J. 246 Howroyd, J.D. 1046, 1047, 1052, 1053 Hrbacek, K. 1293 Hu, X. 1050,1053 Hudetz, T. 907 Huete, J.F. 1501, 1518 Huff, R.E. 272, 293, 541, 543, 583 Hughes, G.E. 1474,752/ Hughes, W. 1562,1563 Hukuhara, M. 628, 670 Hulanicki, A. 1108. 1124, 1128 Hunt, BR. 1051, 1053 Hurd, A.E. 1297. 1313, 1328 Hutchinson, J.E. 1043, 1044,1053 Ichihashi, H. 1509. 1514.1521 Imai, Y. 830. 856, 867 Imaoka, H. 1332. 1378 Ingarden, R.S. 1529, 1563 Inoue, H. 653, 672 Inselberg, A. 1467 Inuiguichi. M. 1509. 1514.1521 Ioffe, A.D. 631.670 lonescu Tulcea, A. 82, 211, 293, 363, 399, 538, 545, 583, 611, 701, 755, 756, 763, 1133, 1139, 1141, 1143, 1146-1148, 1151, 1153- 1158, 1172, 1174, 1175, 1177, 1178,1180,1181 lonescu Tulcea, С 82, 111, 293, 363, 399, 538, 545, 583, 611, 701, 755, 756, 763, 1133, 1139. 1141. 1143. 1146-1148, 1151. 1153- 1158, 1172, 1174. 1175. 1177. 1178, 1181 Iorgulescu, A. 830, 860. 861. 865. 867 Iseki. K. 177, 246. 830. 856. 867 Ishihachi, H. 1331.1378 Ito. K. 297. 313. 331, 343 Ito. Y. 1093. 1099, 1128 Jacobs, K. 506, 528 Jaffray. J.Y 1389. 1390. 1400 Jahnke, H.N. 26 Jajte, R. 848. 867 Jaker. A.Sk. 581, 583 Jakubik. J. 879. 898. 907, 914. 944 James. R.C. 550. 583 James, R.D. 599. 612 Janicka, L. 565, 572, 573, 583 Jarai, A. 1545, 1563 Jarchow, H. 439, 444 Jarnik, J. 246, 593, 598, 601-605, 612, 613 Jarvenpaa, M. 1047, 1053 Jasmski, J. 770, 784 Jech.TJ. 1266-1268, 1277, 1284, 1285, 1291 Jedrzejewski, J.M. 686, 701 Jeffery, R. 693, 701 Jensen, M.H. 1049,7052 Jensen, R.B. 1291 Jessen, B. 241, 246, 693, 701, 773, 779, 784, 786, 1319, 1327. 1533, 1564 Jewett, R.I. 1200, 1233 Jewett, R.S. 127, 139,775 Jirina, M. 1175, 77S7 Johnson, R.A. 291. 292.293, 748, 763, 1133, 1156. 1181 Johnson, W.B. 436. 445 Jones. F.B. 7297 Jones. N.D. 1225,/2ii Jones. S.E. 1467 Josefson. B. 436. 445 Jovanovic, A. 1275, 7297 Joyce. H. 1047, 1053 Junnila. H.J.K. 7290 Jureckova. M. 890. 899. 907. 943, 944 Jurkat, W.B. 603. 607, 672 Just, W. 87. 97. 727. 757. 762, 1133, 7779 Kacian. A. 907 Kadanoff. LP. 1049, 7052
Author Index 1575 Kadec', M.I. 439, 445 Kadets, V.M. 570, 583 Kahane, J.-P. 597, 612, 1044, 1047, 1050, 1053 Kajii, A. 1515,7520 Kakutani, S. 436, 445, 749, 752, 763, 1099, 1100, 1126,1128, 1163, 1181, 1194, 1235 Kalikov, S.A. 1209, 1233 Kalitvin, A.S. 813, S24 Kallianpur, G. 772, 774, 784 Kalmbach, G. 177, 866, 867 Kaloshin, V.Y. 1051, 1053 Kalton, N.J. /77, 440, 444, 572, 573, 5S3 Kambara, H. 1332, 1378 Kampe de Feriet, J. 1529, 1564 Kanamori, A. 1268,7292 Kannan, V. 1107, 1128 Kannappan, PI. 1539-1541, 1563, 1564 Kantorovich, L.V. 776, 784, 797, 822, 823 Kaplansky, I. 932, 944 Kapur, J.N. 1561,7564 Karatzas, I. 343 Kargapolov, M.I. 1109, 1128 Karlin, S. 343 Karolis, P. 760, 763 Karpf, J. 1533,7564 Karush, J. 110,720 Katok, A. 1044, 1053, 1189, 1199, 1233 Katz, V.J. 26 Katznelson, Y. 1201, 1202, 1211, 1233 Kaufman, R. 1046, 1053 Kawada, Y. 1093, 1096, 1098, 1100, 1102, 1128 Kawai, H. 1331,7373 Keane, M. 842, 866, 1222, 7233 Kechris, A.S. 118, 720, 1094, 1110, 1113, 1115- 1118, 1121, 1123,1127,1128 Keegel, J.C. 1562,7564 Keimel, K. 879, 906 Keisler, H.J. 1267, 7290, 7297, 1302, 1325, 1328 Kellerer, H.G. 775-777, 784 Kelley, J.L. 82, 252, 293, 522, 524, 528 Kendall, M.G. 619, 666, 670 Kendall, W.S. 666, 672 Kennes, R. 1477, 7522 Kenyon, H. 181,231,245,246 Kerre, E. 7466, 1510, 1515, 7579 Kerridge, D.F. 1528, 7564 Ketonen, J. 7297 Keynes, J.M. 1497,7527 Khan, M.A. 638, 670 Khararazishvili, A.B. 1104-1106, 1124, 1126, 1127,7727,7725 Khintchine, A.I. 237, 246, 598, 672, 679, 707, 1200, 7233 Khurana, S.S. 438, 445, 742 Kifer, Y.I. 1120,7725 Kindler, J. 516, 528 King, J.L. 1050, 7053, 1208, 7233 Kingman, J.F.C. 1195, 1196, 7233 Kirchheim, B. 1015, 1022, 1024, 1034, 7035 Kirk, R.B. 748, 763 Klawonn, F. 954, 7009 Klei, HA. 638, 643, 647. 670 Klein, E. 619, 670 Klement, E.P. 876, 879, 906, 908. 913, 918, 925, 927, 930, 940, 944, 949-952, 954-958, 961, 965, 970, 973, 991, 1008. 1009. 1332, 1344, 1352, 1378, 1405-1407, 1409-1411, 1427, 1428, 1431, 1458, 1460, 1462, 7467, 1476, 1518 Klimkin, V.M. 128, 777 Klir, G.J. 1331, 1332, 1343, 1351, 7379, 1405, 1406, 1468, 1474, 1514, 7527, 7522, 1525, 1547, 1548, 1550, 7564 Klis, С 777 Kluvanek, I. 399, 513, 528. 583. 1085, 7090 Knight, F.B. 330, 343 Knorr, W.R. 5, 26 Knowles, G. 583 Knowles, J.D. 748, 763. 1152, 7/7S Kochen, S. 842, 867 Kodaira, K. 752, 763, 1126, 7/2S Kohlberg, E. 647, 650, 669 Kohnheim, A.G. 1225,7237 Kojadinovic, I. 1562, 7564 Kolesarova, A. 1331, 1378 Kolmogorov, A.N. 26, 767, 779, 784. 829, 867, 905, 908, 1188, 1210, 1214, 1219, 7233, 7234 Kolokoltsov, V.N. 1405, 1408, 1409, 1419, 1422, 1423, 1432-1436, 1440-1445, 1448, 1449, 7467 Kolzow, D. 207, 244, 246. 257, 259, 293. 1133, 1140, 1146, 1172, 1181 Komlos, J. 583 Konig, D. 97, 98, 727 Konig, H. 439, 446, 511, 516-518, 520, 522, 528 Koopman, B.O. 1471,7527 Корка, F. 171, 777, 834, 867. 908 Komfeld, I. 1205,7232 Kotz, S. 776, 784 Koumoullis, G. 748, 750, 763. 772, 784 Kowalski, O. 1021,7035 Kraft, C.H. 1475,7527 Krasnoselskil, M.A. 797, 813, 819, 823 Kraus, S. 1486, 1490, 1492, 7527 Krawczyk, A. 1108, 1125, 7/2S Krein, S.G. 813, 823 Krengel, U. 1204, 1205, 7234 Krieger, W. 1102, 7729, 1200, 1220, 7234 Kriimpelmann, M. 1557, 1560, 7564
1576 Author Index Krupa, G. 654, 657, 660, 671 Kruse, R. 639, 671, 954, 1009, 1501, 1518, 1520 Kryloff, N.M. 1117, 1127, 1199, 1234 Kubota, J. 594, 612 Kubota, Y. 613 Kudo, H. 619, 67/ Kuich, W. 1407, 1467 Kuipers, L. 114,72/ Kullback, J.H. 1562, 1564 Kume, ? 1509,1521 Kumei, S. 1405, 1466 Kunen, K. 1266, 1291 Kunita, H. 477, 501 Kunze, M.A. 516,529 Kunze, R.A. 1112, 1121,1129 Kupka, J. 150, 169, 177, 561, 583, 756, 763, 1133, 1145, 1156, 1161, 1177,//S/ Kuratowski, K. 631, 671, 768, 769, 784, 1291 Kurepa, D. 1291 Kuriyama, K. 908 Kurzweil, J. 246, 589, 592, 593, 598, 601-605, 608, 612, 613 Kushnirenko, A.G. 1223,1234 Kussmaul, A.U. 399, 451, 478, 484-486, 499, 500, 501 Kutateladze, S.S. 797, 823 Kuz'minov, V. 763 Kwon, S.H. 1331, 1378 Kyburg, H. 1495, 1521 Labreuche, Ch. 1391, 1400 Labuda, I. 741, 742 Lacava, F. 944 Lacey,M. 1212, 1234 Laczkovich, M. 85, 89, 91, 96, 98, 101, 103, 105-107, 110-117, 120, 121, 1095, 1099, 1103, 1129, 1263, 1291 Ladyzhenskaya, O. 1044, 1053 Lahiri, B.K. 677, 701,1181 Lamata.M.T. 1332, 1377, 1515, 1518 Lander, L. 1550, 1564 Landers, D. 153,177, 742 Lang, J. 1474, 1485,1520, 1521 Larsen, L. 682, 700 Larson, L. 682, 700 Lavie, M. 647, 660, 671 Leader, S. 203, 246, 594, 613 Lebesgue, H. 26, 82, 589, 613, 679, 701, 1142, 1143, 1181 Lee, P.Y 591-594, 598, 600, 601, 611, 613 Lee, T.Y. 600, 613 Leese, S.J. 634, 671 Lehmann, D. 1486, 1490, 1492-1494, 1521 Lehn, L. 724, 742 Leibman, A. 1202,1232 Leibniz, G.W. 16,26 Leinenkugel. С 511, 528 Leinert, M. 511,516, 528 Lembcke, J. 723. 724, 742, 775, 784 Lepellere, M.A. 914, 944 Lesigne, E. 1211, 1232 Lena, G. 1331, 1377 Lettieri, A. 889, 907, 944 Levi, B. 1260 Levin. S.A. 1454, 1467 Levin, V.I. 776, 785, 1154, 1178, 1181 Levy, A. 1291 Levy, P. 298, 343 Lewis, D. 1474, 1486, 1521 Lewis, P.W. 441. 444, 446, 581, 1085,1090 Li, S. 660, 671 Liao, K.C. 594. 598, 613 Liapounoff, A.A. 992, 1006, 1009 Lifshits, Je.A. 797, 823 Lima, A. 439, 446 Lindenbaum, A. 86, 87, 121 Lindenstrauss, J. 813, 823, 992, 1009 Lindstr0m,T. 1304, 1319. 1327 Ling, CM. 951,/009 Lingras, P. 1477. 1522 Lions, PL. 1440, 1442,1467 Lipecki, Z. 572, 573, 575, 582, 705, 724, 726, 743, 1178, 1181 Liu.G.Q. 590, 591, 613 Livshits, A.N. 1228, 1235 Lloyd, S.P 1152, 1181 Lodkin, A.A. 842, 868 Loeb, PA. 228, 229, 245, 245, 246, 1179, 1297, 1303, 1305, 1306, 1308-1317, 1319, 1320,1328 Loeve, M. 297. 343 Lofstrom.J. 813, 823 Lomonosov, V. 438,440, 444, 446 Loomis, L.H. 513, 525, 526, 528 Loomis, L.L. 1121, 1129 Lopes, A.O. 1050,1053 Lopez-Diaz, M. 657. 668 Lorent, A. 1023, 1035 Lorentz, G.G. 813, 823 Los, J. 1108, 1123,1129 Losert, V.L. 758.759, 763,1133, 1152, 1153, 1156, 1177, 1181, 1182 Lovasz, L. 97,121, 1384, 1401 Lu, J. 600, 613 Lu, S.P 591, 592, 598. 613 Lubotzky, A. 85. 121 Lugovaja. G.D. 845, 846, 868 Lukasiewicz, J. 879, 908, 950, 1009
Author Index 1577 Lukes, J. 677, 686, 701 Lusin, N.N. 592, 613, 679, 687, 700, 701 Luther, N.J. 290, 293 Luu, D.Q. 583, 660, 671 Luxemburg, W.A.J. 169, 170, 177, 796-798, 805, 813, 814, 817, 822, 823, 824, 1288, 1291, 1313, 1328 Lyashenko, N.N. 654, 661, 671 Macdonald, A.L. 813, 824 Macheras, N.D. 277,294, 534, 547, 583,586, 755- 757, 763, 764, 1145, 1150-1152, 1159-1162, 1164-1168, 1171,1182, 1183 Mackey, G.W. 830, 868, 1113, 1129 Maeda, S. 842, 844, 868 Magidor, M. 1291, 1292, 1486, 1490, 1492-1494, 1521 Maharam, D. 177, 256, 277, 293, Ы1, 701, 705, 723, 743,749,752,755, 764, 774,781, 785,905, 908, 1139, 1145, 1163, 1173, 1114,1182 Maher, R.J. 1150, 1151, 1154, 1157, 1181, 1182 Maitland Wright, J.D. 1182 Maitra, A. 783, 784, 785 Makinson, D. 1490, 1505, 1518, 1520, 1521 Malgrange, B. 1252, 1260 Malicky, P. 905, 908 Mallory, D. 772, 785 Maly, J. 677, 686, 701 Mandelbrot, B.B. 1039, 1044, 1049, 1050, 1053 Mandl, P. 343 Mangani, P. 914, 945 Manning, A. 1044,1053 Marcinkiewicz, J. 241, 246, 599, 613, 693, 701, 813,524 Marczewski, E. 770, 774, 785, 1108, 1123, 1129, Wll, 1182 Marczewski (Szpilrajn), E. 785 Marechal, O. 1178, 1182 Mariano, M. 1525, 1548, 1564 Marichal, J.-L. 1383, 1388, 1398, 1400, 1401, 1514,752/, 1562,7564 Marinova, I. 1331, 1378 Markechova, D. 905, 908 Markova-Stupnanova, A. 1366, 1378, 1421, 1427, 1467 Marra, V. 905, 908 Marstrand, J.M. 1020-1022, 1024, 1035, 1036, 1046, 1053 Martellotii, A. 260, 263, 275, 276, 278-284, 286, 293 Martin, N.F.G. 677, 701, 1525, 1564 Mas-Colell, A. 1517,752/ Maslov, V.P. 1405, 1407-1409, 1422, 1423, 1430, 1432, 1433, 1435, 1436, 1440-1444, 1462,1467 Massopust, PR. 1044. 1053 Matheron, G. 652, 666, 671 Mathonet, P. 1398, 1401 Matousek, J. 106, 121 Matsuda, M. 567, 584 Matsushita, Y. 1332, 1378 Manila, P. 225, 228, 229, 240, 246, 1013, 1017, 1020-1022, 1024, 1036, 1039, 1040, 1042, 1044-1047, 1053 Mattioli, J. 666, 672 Matvejchuk, S.M. 842, 868 Mauldin, R.D. 109, 121, 1044, 1050, 1052, 1053, 1104, 1127, 1153, 1182 Maurey, B. 439. 446 Mawhin, J. 603, 613, 614 Maynard, H.B. 270, 279, 294, 524, 528 Mazur, S. 436, 444 Mazurkiewicz, S. 87, 121 Mc Shane, E.J. 628, 671 McAloon, K. 1292 McAndrew, M.H. 1225, 1231 McCluskey, H. 1044, 1053 McCutcheon, R. 1202, 1232 McGehee, O.C. 443, 445 Mckean, H.P., Jr. 297, 313, 331, 343 McLeod, R.M. 236, 246, 591,614 McMillan, B. 1222, 1234 McMinn, T.J. 1182 McMullen, C.T. 1044, 1053 McShane, E.J. 188, 238, 246, 247, 512, 528, 591, 596,614, 1057, 1073, 1076, 1090 Meaya, K. 661, 671 Mecke, J. 666, 672 Medvedev, F.A. 26 Meginnis, J.R. 1558, 1564 Menas, Т.К. 1292 Mendoza, J. 583, 608, 611 Menger, K. 949, 950,1009 Mercourakis, S. 754, 755, 763, 764 Mertens, J.P. 662, 670 Merzbach, U.C. 26 Merzljakov, Ju.I. 1109, 1128 Mesiar, R. 288, 292, 898, 899, 905, 906, 908, 952, 954,957,973,975,983,1008,1009, 1332, 1343, 1344, 1351, 1352, 1362, 1366, 1377, 1378, 1398, 1401, 1405-1411, 1427, 1428, 1431, 1437-1439, 1458, 1460, 1462, 1466, 1467, 1487, 1510, 1518, 1521 Metivier, M. 399, 451, 478, 482, 490, 498, 501 Meyer, P.-A. 301, 342, 386, 393, 399, 437, 444, 451, 453, 477, 491, 494, 495, 501, 630, 668 Meyer-Nieberg, P. 797, 824 Mibu, Y. Wll, 1129
1578 Author Index Michalak, A. 572, 573, 584 Miction, G. 1050, 1051 Mijajlovic, Z. 1291 Mikusinski, J. 128, 156, 177, 1254, 1258, 1259, 1260 Mikusinski, P. 513, 527 Miller, A.W. 770, 785 Milman, M.M. 813, 824 Milnor, J. 1511, 1521 Misiurewicz, M. 1095, 1099, 1129 Mitchell, W. 1291 Mitial, D.P. 1561, 1564 Mityagin, B. 1205, 1232 Moedomo, S. 274,294, 584 Mokobodzki, G. 1133, 1139, 1152,1183 Molchanov, IS. 666, 671 Molto, A. 177 Monakov-Rogozkin, A. 1174, 1175, 1183 Monge, G. 776, 785 Montagna, F. 898, 899, 908 Montgomery, D. 752, 764 Montgomery-Smith, S.J. 177 Moore, C.C. 1122, 1128 Moore, E.F. 1020, 1022, 1036 Moore, G.H. 119,/2/ Moral, S. 1332, 1377, 1515, 1518 Morales, P. 170, 175,176 Moran, P.A.P. 619, 666, 670 Moran, W. 842, 866, 1171, 1183 Morayne, M. 245, 247 Morgan, F. 1013, 1028, 1036 Morgenstern, O. 950, 1009, 1406, 1465, 1467 Morris, A.S. 755, 764 Morse, A.P. 181,201,225,226,228,231,241,242, 245,246,247, 1020, 1036 Morters, P. 1045, 1053 Mosco, U. 656, 664, 671 Mostert, PS. 1343, 1344, 1378 Mostowski, A. 1291, 1292 Moulin, H. 1479, 1521 Moussa, P. 1050, 1052 Muldowney, P. 610, 614 Mullins, CW. 822, 824 Mundici, D. 830, 853, 854, 856, 868, 878, 879, 889, 898-900, 904-906, 907, 908, 914, 919, 944, 945 Muni, G. 513,529 Mufioz Rivas, P. 513, 514, 525, 526, 527 Munroe, M.E. 82, 247, 570, 584 Murofushi, T. 290, 294, 1331-1333, 1343, 1362, 1377, 1378, 1385, 1396, 1400, 1401, 1405, 1406, 1462, 1467, 1468, 1514, 1515, 1520 Musial, K. 277, 280-282, 293, 294, 533, 534, 537, 541, 547, 549, 550, 553, 554,556, 561-563, 565, 566, 569, 576-580, 581-586, 636, 671, 755- 757, 764, 771,774, 779, 785, 1151, 1152, 1160, 1161, 1164-1168, 1172, 1182,1183 Mycielski, J. 86,91,92, 107-109,/2/, 122, 1111, 1121, 1129, \2Ю,1292 Nachbin, L. 708, 743, 1113, 1129 Nadkarni, M.G. 774, 785, 1115, 1117, 1122, 1129 Nahmias, S. 1498, 1521 Naimark,M.A. 1059, 1090 Nakanishi, S. 608, 609, 614 Nakano, H. 797, 824 Nath, P. 1559, 1563 Natkaniec, T. 692, 700 Navara, M. 832,868, 879, 898, 906, 908, 950, 957, 958,973,975,983, 1009 Neal, D. 477, 501 Negrepontis, S. 1279, 1290 Nelsen, R.B. 951, 1009, 1332, 1378 Neubrunn, T. 879, 890, 899, 905, 909 Neugebauer, C.J. 683, 689, 693, 701 Neveu, J. 517, 529, 625, 639, 660, 671, 767, 769, 785 Newton, D. 1223,1234 Ng, C.T. 1528, 1533, 1535, 1539, 1544, 1545, 1550, 1562-1564 Ng, W.L. 601, 613 Nguyen, H.T. 652, 671, 814, 824, 1405-1407, 1467, 1525, 1529, 1550,1563 Nguyen, NT. 652, 671 Niederreiter, H. 114, 121, 122 Nikodym, O.M. 127, 135,177, 184, 247, 251, 294, 743 Nikol'skii, S.M. 1244, 1260 Nirenberg, L. 1253, 1260 Nishiura, T. 683, 689, 693, 701 Nishiwakj, Y. 1331, 1378 Nissenzweig, A. 436, 446 Nonnenmacher, D.J.F. 603, 604, 607, 612, 614 Norberg, T. 652, 671 Novikov, A. 247 Nowak, M. 801, 805, 813-815, 823, 824 Nowik, A. 1106,1129 Nygaard, O. 439, 446 Nyikos, P.J. 1281,/292 Oberle, R.A. 742 Ochoa.J.P 441,446 Odell, E. 585 Ogura, Y. 660, 671 Oharu, S. 583 Oja, E. 439, 446 Okada, S. 513, 529
Author Index 1579 Olsder, G.J. 1405, 1430-1432, 1436, 1466 Olsen, L. 1049-1051, 1053 Ol'shanskii, A.Yu. 96, 122 O'Malley, R.J. 685, 690, 693, 701 O'Neil, R. 813,824 O'Neil.T.C. 1050, 1051, 1052. 1053 Onisawa.T. 1331, 1378 Orey, S. 500, 501 Orlicz, W. 146, 177 Omstein, D.S. 1102,1129. 1195, 1206, 1221, 1222, 1227, 1228, 1232,1234 Osswald, H. 1323, 1324, 1328 Ostaszewski, K.M. 608, 614, 682, 684, 700. 701 Ostroy, J.M. 776, 784 Owen, G. 1384, 1401 Oxtoby, J.C. 117, 122, 678, 683, 684, 687, 701, 749, 755, 762, 1113, 1114, 1126, 1128, 1129, 1136,1183 Pachl, J.K. 771-775, 781-783, 785, 1175-1177, 1183 Pacquement, A. 598, 610 Pal, N.R. 1525, 1553, 1555,1564 Pal, S.K. 1559, 1564 Pallares, A.J. 585 Pallu de la Barriere, R. 643, 647, 668, 671 Panchapagesan, T.V. 1057-1059, 1070, 1081, 1083-1086, 1088-1090, 1090 Panti, G. 878, 908 Pap, E. 128, 135, 145, 146, 150, 156, 157, 160, 162-164, 169, 173, 174, 175-178, .350, 399, 512, 529, 836, 867, 908, 941, 944, 952, 1009, ПАЛ, 1252, 1254, 1259, 1260, 1331, 1332, 1336, 1340, 1343, 1344, 1351, 1.352, 1363, 1366, 1378, 1390, 1401, 1405-1411, 1414, 1416, 1418-1420, 1426-1428, 1431- 1433, 1436-1441, 1443, 1449-1451, 1455, 1457, 1458, 1460, 1462-1465, 1466-1468. 1476, 1514,1521,1564 Papageorgiou, N.S. 639, 643, 647, 666, 671, 672 Papangelou, F. 506, 529 Paplauskas, A.B. 26 Papoulis, A. 241,247 Paris, J.B. 1468 Parisi, G. 1049, 1052 Parry, W. 1192, 1197, 1200, 1204, 1205, 1210, 1219,1221,1234 Parthasarathy, K.R. 1111, 1113, 1120,7/29 Parthasarathy, T. 632, 670 Paszkiewicz, A. 842, 868 Paterson, A.L.T. 85, 95-97, 122, 1118, 1129 Patzschke, M. 1050,705/ Pauc, C.Y. 181, 187, 201, 203, 206, 225, 226, 231, 238, 241, 242, 244, 245, 246, 683, 701 Paul, P.J. 440, 444, 573-575, 581, 582 Pawlikowski, J. 119, 122 Pearl, J. 1494, 1500, 1501, 1521, 1522 Peck, C.V. 755, 764 Pelc, A. 117, 120, 1106-1108, 1124, 1127, 1127, 1129 Pelczynski, A. 438^*40, 444, 446, 55.3, 581, 1085, 1090 Pellaumail, J. 399, 451, 478, 498, 501, 1175, 1183 Penconek, M. 118, 122, 1105, 1129 Peres, Y. 1044, 1054 Perez, R. 1558, 1563 Perkins, E.A. 1050, 1054, 1328 Pemy, P. 1515-1517, 1519, 1520 Perron, O. 589, 614 Pesin, I.N. 26 Pesin, YB. 1044, 1049, 1050, 1054 Petersen, K.E. 1187-1189, 1193, 1194, 1196- 1198, 1204-1207, 1210, 1212-1214, 1217, 1220-1222, 1232,1234 Petrovicova, J. 905, 908 Pettis, B.J. 146, 178, 269, 293, 533, 535, 540, 559, 570, 573, 585 Petunin, Ju.I. 813, 823 Peyriere, J. 1044, 1050, 1051, 1053, 1054 Pfanzagl, J. 779, 785 Pfeffer, W.F. 2.34, 236, 245, 247, 505-508, 510, 515, 516, 519, 521,529, 590, 603, 604, 606, 608, 610, 613, 614, 748, 763 Phakadze, S.S. 1124, 1129 Phelps, R.R. 272, 293, 438, 446 Phillips, R.S. 135, 149, 178, 534, 570, 585 Piasecki, K. 879, 908 Pier, J.-P. 26 Pietsch, A. 28.3, 294, 439, 446 Pincus, D. 126.3, 1288, 1292 Pirogov, S.A. 1120, 1128 Piron, С 842, 868 Pisier, G. 439, 446 Plante, J.E. 1118, 1129 Plappert, P. 100, 122 Plebanek, G. 556, 558-560, 585, 775, 776, 785 Plummer, M.D. 97, 121 Poincare, H. 1200, 1234 Pol, R. 534, 585 Polak, B. 1515, 1520 Pollard, D. 518, 529 Poncet, J. 1121, 1129 Popa, D. 585 Poprugenko, G. 992, 993, 1009 Portenier, С 512, 520, 526, 526 Portinko, N.I. 331, 343 Possani, С 1306,1327
1580 Author Index Postnikov, A.G. 1199.1234 Povolotskii, A.I. 813, 824 Prade, H. 954, 1008, 1406, 1410, 1426, 1427, 1463-1465, 1466, 1474, 1476, 1481, 1483, 1486, 1488, 1491-1493, 1495, 1498, 1501- 1505, 1507, 1508, 1510-1517, 1518-1520, 1550, 1553, 1563 Prandini.J. 1306, 1327 Prasad, V.S. 755, 762, 764 Pratelli, M. 451,502 Pratt, J.W. 1475, 1521 Preiss, D. 106, 122, 234, 238. 245. 247. 596, 597, 599, 610. 614, 1020-1022, 1024, 1025, 1035. 1036,1045-1047, 1053,1054 Preston, C.J. 178 Price, J.F. 754, 764 Prikry,K.L. 724, 741, 742, 1133, 1156,1181, 1277. 1291, 1292 Procaccia, I. 1049,1052 Protter, P. 502 Prudnikov, A.P. 1258, 1260 Przymusinski, T. 1292 Ptak, P. 178, 866, 868 Pucci, S. 289, 290, 293 Puhalskii, A. 1405, 1432, 1433, 1468 Pujie, W. 594, 614 Pulmannova, S. 178, 851, 858, 866, 867, 868, 889, 907, 908 Puri.M.L. 647, 654, 661,672 Pustylnik, E.I. 813, 819, 823 Qu, Y.S. 517, 529 Quadrat, J.-P. 1405, 1430-1432, 1436, 1465, 1466 Rachev, ST. 776, 785 Rachunek, J. 860, 868 Rado, R. 97, 122 Rado, ? 1027,1036 Radon, J. 133, 178, 436, 446 Radon, M. 251,294 Rahe, A.M. 1231, 1233 Raikov, D.A. 1113, 1129 Rainwater, J. 438, 446 Raj'u, S.R. 1107, 1128 Ralescu, D.A. 647, 654, 661, 672, 1332, 1366, 1378, 1513, 1515,752/ Ralevic, N. 1431, 1432, 1440, 1468 Ramachandran, D. 767, 770, 774, 776-783, 784- 786, 1095, 1099, 1129 Ramakrishnan, S. 783, 785 Ramer, A. 1487, 1521, 1525, 1547, 1548, 1550, 1564 Ramsey. F.P. 1471. 1521 Randall, C.H. 178. 833, 867 Randolph, J.F. 1020, 1036 Randrianantoanina, N. 441, 446 Rao, B.V. 769, 783 Rao, K.M. 502 Rao, M.M. 178, 399. 517. 523, 529. 812, 813, 824 Rashid, S. 1305, 1313, 1316, 1327 Raynaud de Fitte, P. 639, 647, 657, 672 Regazzini, E. 263, 292 Reidi, R. 1044, 1050. 1053 Reinhardt, W.N. 1268, 1292 Remus, D. 754, 762 Remy, M. 773, 783, 786 Ren, Z.D. 812, 813, S24 Renyi, A. 82, 1205. 1234 Ressel.R 520,527. 748. 764 Retherford. S.R. 441, 447 Rice, N.M. 813, 823 Richter, H. 619, 637. 672 Rickart, C.E. 127, 129, 135, 178, 709, 743 Riddle, L.H. 544, 546. 547, 554, 557, 560, 566, 569, 585 Riecan, B. 512, 529, 856, 868, 879, 889, 890, 898, 899, 905, 907-909. 914, 945 Riecanova, Z. 1331, 1378 Rieffel, M.A. 270. 294. 542, 585 Riesz, F 436, 446. 692, 701. 797, 822, 825 Rigo, P. 263, 292 Rinkewitz, W. 1154, 1175, 1183 Rinne, D. 599, 604, 611 Robbins. H.E. 619. 672 Roberts, Ch.E. 583 Roberts, J.W. 177. 1121, 1129 Roberts, R. 660, 666 Robinson, A. 1297, 1328 Robinson, R.M. 103, 122 Rockafellar. R.T. 631, 656, 664, 672. 1004, 1009 Rodine, R.H. 781.7S6 Rodriguez-Piazza, L. 440, 444 Rodriguez-Salinas. ?. ??83 Rogers, C.A. 206, 247. 1013, 1036, 1039, 1047, 1048, /052, 1054 Rogers, L.C.G. 343 Rogge. L. 153, 177 Rokhlin (Rohlin). V.A. 780. 786. 1208, 1220, 1221, 1227, 1229, 1231, 1234 Ro.se. A. 945, 949. 1009 Rosenblatt, J.M. 97, 102, /22, 1102, 1129, 1211, 1234 Rosenfeld, M. 201, 245 Rosenkrantz, W. 331, 343 Rosenthal. H.P. 128, 150, 178, 436, 446, 545, 556, 557. 566, 567, 585 Ross, D.A. 652, 672. 1183
Author Index 1581 Ross, K.A. 98, 121, 752, 763, 1106. 1113, 1126. 1128 Ross, S.A. 1448, 1449, 1466 Rosser, J.B. 945, 949, /009 Rota, G.C. 1384, 1401 Roubens, M. 1383, 1385, 1388, 1400, 1519, 1521, 1562, 1564 Rowbottom, F. 1292 Royden, H.L. «2,505,507, 510,516,517,520,529 Rubinstein, M. 1448, 1449, 1466 Rudeanu, S. 1384,7400 Rudin, V.N. 513, 529 Rudin, W. 50, 82, 347, 399, 802, 804, 825, 1059, 1090, 1198, 1199, 1234 Rudolph, D.J, 1189, 1209, 1212, 1218, 1219, 1222, 1229-1231, 1233, 1234 Ruelle, D. 1044, 1054 Ruess, W. 552, 581,583 Ruess,W.M.441,445 Ruschendorf, L. 776-778, 785, 786 Ruspini, E. 905, 909, 954, 1009 Rusu, D. 677, 701 Rutickii, Ya.B. 813, 823 Ryan, R. 437,446, 1183 Rybakov, VI. 278, 279, 294, 533, 540, 560, 566, 568, 573, 585 Rybalov, A. 1352, 1379 Rybarik, J. 905, 908, 909, 954, 1009, 1351, 1378, 1407, 1408, 1467 Rybarikova, E. 907 Ryll-Nardzewski, С 565, 566, 585, 631, 671, 770, 771, 774, 779, 784-786, 1104, 1129 Rynne, B.P. 1044, 1052 Ryzhikov, V.V. 1209, 1234 Saab, E. 440, 441, 444, 446, 544, 546, 547, 554, 557, 560, 566, 569, 583, 585 Saab, P. 440, 441,444, 446, 585 Sabbadin, R. 1507, 1510, 1512-1515, 1520 Sacks, G.E. 1292 Saeki. S. 157. 175. 178 Sahoo, P. 1525, 1533, 1539-1541. 1552, 1553. 1563. 1564 Saint Raymond. X. 1045,1054 Saint-Pierre, J. 639, 672, 1175, 1183 Saint-Raymond. J. 633. 672 Sajid, S. 639. 672 Sakai, K. 1100. 1129 Saks. S.26. 127. 135. 178. 184. 206. 232. 234. 240, 243. 247, 255. 294. 436. 446. 598. 614. 692. 701 Salama. I. 1198, 1234 Saleski. A. 1223, 1234 Salinetti. G. 647. 652. 655. 672. lib. 784 Salvati. S. 176 Samborskii, S.N. 1405, 1407, 1408, 1420, 1430, 1433, 1435, 1436. 1440. 1462, 1467. 1468 Sambucini, A.R. 280-282, 284, 286, 293 Sander, W. 1428, 1468. 1525, 1533, 1539-1541, 1545, 1548, 1549, 1552, 1553, 1558, 1563. 1564 Sanguesa, R. 1501,7522 Santalo, L. 666, 672 Sapounakis, A. 1152, 1183 Sarkhel, D.N. 691,70/ Sarymsakov, T.A. 741 Sastri, C.C.C.A. 1562. 1563 Savage. L.J. 1473. 1475, 1507. 1522 Sazenkov. A.N. 178 Sazonov, V 772, 786 Scepin, E.V 760, 764 Schachermayer. W. 128. 178, 441,445, 585 Schaefer, H.H. 640, 672, 797, 805, 825 Schafke, F.W. 509. 511-513, 522, 524, 525, 528, 529 Scheinberg. S. 684, 685, 701 Schief, A. /183 Schmechel, N. 954, 1009 Schmeidler, D. 512, 529, 643, 647, 662, 668, 672, 1331, 1378. 1522 Schmidt, K. 1120, 1128, 1129 Schmidt. K.D. 774, 786, 945, 954, 959, 983, 1009 Schmidt, W. 114, 122 Schmitt, M. 666, 672 Schreiber. B.M. 1177. 1183 Schurger, K. 654, 672 Schurle. A.W. 592,614 Schuss, Z. 591,6// Schwabik. S. 603, 604, 612 Schwartz, J.T. 82, 127, 129, 146, 149, 152, 153, 176,399, 439,444, 513. 514,520, 528, 561, 564, 580, 582, 669, 740, 742, 944, 1059, 1083, 1090, 1144,1179 Schwartz, L. 266, 279, 293, 439, 446, 520, 529, 1139. /183. 1260 Schwarz. F. 1405. 1468 Schweizer. B. 776, 786. 949. 951. 1009. 1332. 1352. 1378. 1416. 1424, 1425. 1429. 1468. 1529. 1564 Scott. D.S. 1268. 1291. 1292 Segal. I.E. 257. 294. 516. 529. 1112, 1121. 1129, 1140. 1183 Seidenberg. A. 1475. 1521 Seifert, C.J. 439, 445 Semadeni, Z. 797. 805. 825 Semenov, E.M. 813. 823 Sentilles, D. 543, 585. 586 Sen. R. 643, 664-666. 667 Serra, J. 672
1582 Author Index Serres, M. 26 Sersouri, A. 585 Shackle, G.L.S. 1476, 1522 Shafer, G. 1385, 1401, 1464, 1468. 1477, 1522, 1547, 1564 Shannon, C.E. 1222, 1234. 1527, 1564 Shapiro, L.B. 760, 761, 764 Shapley, L.S. 949, 1007, 1385, 1399, 1401 Sharma, B.D. 1528, 1561, 1564 Shelah, S. 583, 757, 764, 1133, 1139, 1144, 1157, 1173, 1179. 1183. 1288, 1291, 1292 Shepp, L.A. 331, 335, 343 Sherman, G.A. 87, 89, 91, 122 Sherstnev, A.N. 843, 845, 866. 868 Shieh, N.-R. 1050, 1054 Shields, A.L. 1343, 1344, 1378 Shields, P.C. 1222, 1235 Shilkret, N. 1367, 1378, 1409, 1411, 1468, 1510, 1522 Shilov, G.E. 510, 529, 1247, 1260 Shiryayev, A.N. 82, 1191, 1235 Shoenfield, J.R. 1292 Shoham, Y. 1487-1489, 1522 Shorn, R.M. 770, 772, 775, 779, 780, 786 Shraiman, B.J. 1049, 1052 Shreve, S. 343 Shulman, V. 776, 784 Shwartz, L. 748, 764 Sierpinski, W. 85-87, 90, 100, 103, 119, 121, 122, 586, 679, 701, 770, 786, 1292 Sikorski, R. 586, 797, 825, 831, 868, 890, 909 Silver, J.H. 1122, 1129, 1274, 1284, 1292 Simha, R.R. 752, 762 Simon, B. 846, 868 Simon, J. 1035, 1036 Simon, K. 1044, 1054 Simon, L. 1013, 1028, 1035, 1036 Simonnet, R. 517,529 Simonovits, M. 96, 120 Simpelaere, D. 1050, 1054 Sinai, Y.G. 1189, 1191, 1206, 1209, 1210, 1219, 1221, 1232, 1234,1235 Singer, I. 379, 399, 440, 446, 1384, 1401 Sion, M. 723, 743, 1085, 1090, 1146, 1184, 1333, 1378 Sipos, J. 512, 519, 529, 1331, 1332, 1364, 1378, 1390, 1401 Siu-Ah Ng 1328 Sklar, A. 949, 951, 1009, 1332, 1352, 1378, 1416, 1424, 1425, 1429, 1468, 1529, 1564 Skvortsov, V.A. 245, 592-595, 597-599, 608, 610, 614 Smets, P. 1477, 1522 Smith, B. 441,446 Smorodinsky, M. 1222, 1233, 1235 Snobar, M.G. 439, 445 Snow, DO. 1140, 1179 Snow, P. 1486, 1495, 1522 Sobolev, A.V. 797, 823 Sobolev, S.L. 1239, 1260 Sobolevskii, P.E. 813, 819, 823 Sokal, A.D. 779, 786 Solecki, S. 245, 247, 1106, 1114, 1129, 1130 Solodov, A.P. 608, 614 Solomon, B. 1032, 1034, 1036 Solomyak, B. 1044, 1054 Solovay, R.M. 49, 82, 1263, 1268, 1286, 1288, 1292 Soneda, S. 1385, 1401 Sonnenschein, H. 1517, 1521 Sonntag, Y. 655, 657, 672 Sos, V.T. 96, 120 Sparre-Anderson, E. 773, 786 Specker, E.R. 842, 867 Spohn.W. 1478, 1522 Srinivasan, T.P 522, 528 Stajner-Papuga, I. 1416, 1419, 1420, 1455, 1468 Stefansson, G.F. 541, 556, 558, 586 Stegall, C.P 272, 294, 441,447 Stegall, Ch. 536, 586 Steinhaus, H. 1114, 1130 Steinlage, R.C. 436, 447, 1121, 1130 Stepan, J. 776, 783 Stepanoff, W.W. 687, 701 Stone, C.J. 330, 343 Stone, M.H. 508, 529, 1136, 1139, 1173, 1183 Stout, E.L. 438,447 Stoyan, D. 666, 672 Strassen, V. 775,776, 786 Straus, E.G. 86, 122 Strauss, W. 277, 294, 534, 586, 755-757, 763, 764,1140,1145,1150-1154, 1159-1162, 1164- 1168, 1170-1172, 1175, 1178, 1182-1184 Strieker, С 502 Stromberg, K. 1059, 1060, 1090 Stroock, D.W. 297, 343, 507, 508, 516, 517, 529, 774, 786 Stroyan, K.D. 1304, 1328 Struik, D.J. 16,26 Suarez, F. 1332, 1378 Suarez Garcia, F. 1515, 1522 Sucheston, L. 153, 174 Sudakov, V.N. 776, 786 Sugeno, M. 288, 290, 294, 949, 1009, 1331-1333, 1343, 1362, 1377, 1378, 1383, 1396, /400, 1401, 1405, 1406, 1409, 1462, 1467, 1468, 1513-1515, 1520-1522
Author Index 1583 Sullivan, D. 1041, 1054 Sullivan, J.M. 228, 247 Sun, T.C. 1177, 1183 Sun, Y.N. 638, 670, 1322-1324, 1326, 1327, 1328 Swanson, L. 1231, 1233 Swart, J. 437, 444. 520, 527 Swartz, С 128, 135, 148-150, 157, 162, 170, 175. 177, 178,441.447 Swierczkowski, S. 1287, 1292 Szemeredi, E. 1201, 1235 Szpilrajn, E. 1094, 1107, 1123, 1124, 1126, 1130 Takaci, A. 1244, 1252, 1259, 1260, 1443, 1468 Takaci, Dj. 1244, 1252, 1259, 1260. 1468 Talagrand, M. 438, 441, 444. 447. 533, 535, 537, 543, 546, 548-550, 553, 554, 557, 559, 561, 568, 569,575, 576,578,580,581.583.586. 751, 757, 758,764, 1133, 1153, 1156, 1158, 1159, 1169, 1170, 1177, 1178, 1179. 1184 Tall, F.D. 682, 684, 686, 691, 701, 1281, 1283, 1292, 1293 Tanaka, H. 1331, 1378, 1509, 1514, 1521 Tanaka, S. 177 Taneja, I.J. 1525, 1528, 1557, 1558, 1563-1565 Tarashchan, A.L. 1420, 1468 Tarski, A. 85, 87, 90, 97, 98, 100, 110, 117, 120, 122, 879, 908, 909, 1093, 1096, 1099, 1103, 1127, 1130, 1274, 1291, 1292 Tartar, L. 1435, 1466 Taylor, A.E. 508, 510, 520, 529 Taylor, H. 343 Taylor, M. 776, 786 Taylor, R.L. 653, 672 Taylor, S.J. 233, 247, Ы1, 694, 695, 701, 1041, 1044, 1048, 1050, 1053, 1054 Telgarsky, R. 1104, 1129 Temam, R. 1044, 1054 Teofanov, N. 1433, 1468 Terepeta, M. 694, 696, 698, 699, 701 Termini, S. 1525, 1551, 1563 Terpe, F. 520, 528 Testemale, С 1508, 1514, 1520 Theil,H. 1528, 1565 Thiam, D.S. 513, 529, 643, 647, 672, 673 Thiele, H. 954, 1009 Thomas, E. 570, 572, 573, 576, 586, 1057, 1085- 1089, 1090 Thompson, A. 619,670 Thomson, B.S. 194, 231, 233-235, 237, 239, 243, 244,245,247, 598, 599,610,611,614,615, 687, 701, 1260 Thorup, A. 1533, 1564 Thouvenot, J.-P. 1231, 1235 Tiser, J. 1022, 1025, 1036, 1045, 1054 Tits, J. 96, 122 Tkacenko, M.G. 760, 764 Tomczak-Jaegermann, N. 439, 447 Tong, A.E. 439, 447 Tonge, A. 439, 444 Topsoe, F. 517, 518, 529, 773, 786, 1114, 1130 Tortrat, A. 586, 1171, 1184 Tousset.E. 1398, 1401 Touzani, A. 284, 293 Traynor, ? 153, 176, 178, 705, 738, 743, 944, 945, 1136, 1184 Tricot, C, Jr. 122, 233, 247, 1041, 1045, 1054 Triebel, H. 1244, 1260 Troallic, J.P. 775, 784 Trotter, H.F. 309, 343 Troyer, R.J. 677, 701 Truss, J.K. 101, 122 Tsiporkova, E. 1487, 1518 Tsukada, M. 67i Tweddle, I. 148, 178 Tzafriri, L. 813, 823 Uemura, T. 653, 673 Uhl, J.J., Jr. 82, 128, 150, 151, 153, 154, /76,266, 272, 274,293, 294, 362, 379,399, 439,445, 533, 536, 539,544, 546, 547, 554,557,560, 566,572, 581,582,584-586, 627, 636, 646,668,709, 712, 741, 742,937, 944, 1081, 1090 Ulam, S.M. 261, 294, 1212, 1235, 1268, 1284, 1285, 1293 Ulger, A. 441,447 Umegaki, H. 619, 629, 635, 637-641, 643, 660, 670 Urbanik, K. 1529, 1563 Urbanski, M. 1044, 1050, 1053 Urbina, W. 634, 667 Uspenskii, V.V. 760, 764 Ustunel, S. 477, 502 Valadier, M. 284, 293, 619, 622, 630-633, 639, 641, 643, 664, 666, 667, 673, 781, 786, 1175, 1184 Valdivia, M. 520, 529 Valko, S.98, 121 Van Cutsem, B. 619, 639, 643, 673 van Dantzig, J. 1110, 1130 VanderPyl.T. 1561, /565 van der Waerden, B.L. 26, 1110, 1130, 1201, 1235 van Dulst, D. 582 van Neerven, J.M.A.M. 585 Van Vleek, FS. 632, 670 Vanmassenhove, F.R. 1510, 1519 Varadajan, VS. 520, 529
1584 Author Index Varadarajan, VS. 559, 586, 747, 764, 866, 868, 890,909, 1118, 1130 Varadhan, S.R.S. 297, 343 Varga, R.S. 1191, 1235 Vasak, L. 559, 586 Vath, M. 789, 795, 798, 805, 813, 814, 821, 822, 823, 825 Veech, W.A. 755, 764 Vera, G. 585 Veres, S. 1560, 1565 Verma.T. 1501, 1522 Vershik, A.M. 1228, 1235 Vervaat, W. 652, 672 Vesterstr0m, J. 1177, 1184 Vetterlein, T. 866, 867 Vickers, J.A.G. 1044, 1052 Vincke, P. 1521 Vind, K. 643, 647, 673 Viot, M. 1432, 1465, 1466 Vitale, R.A. 653, 661, 666, 673 Vitali, G. 89, 122, 133, 178, 1103, 1130, 1331, 1378 Vivona, D. 288, 292, 1332, 1333, 1343, 1347- 1349, 1351, 1375, 1377, 1407, 1449, 1450, 1466, 1468, 1518 Vladimirov, D.A. 706, 743, 1102, 1130 Vladimirov, VS. 1247, 1250, 1260 Volcic, A. 255, 291, 292, 294, 510, 523, 524, 527, 529,530, 1184 Volkmer, H. 743, 932, 945 Volpert, A.J. 1025, 1036 von Neumann, J. 91-93, 104, 105, 107, 122, 277, 294, 830, 866, 868, 871, 906, 950, 1009, 1097, 1102, 1110, 1130, 1136, 1137, 1139, 1141, 1153, 1173, 1183, 1189, 1193, 1212, 1214, 1233-1235, 1406, 1465, 1467 von Weizsacker, H. 538, 586, 1136, 1137, 1139, 1144, 1147, 1157, 1172, 1174, 1178, 1180, 1184 Vopenka, P. 1293 Vrabelova, M. 909 Vulikh, B.Z. 918, 945 Vybomy, R. 592, 593, 610, 613 Wagner, D.H. 631,638, 673 Wagner-Bojakowska, E. 694, 696-699, 701, 702 Wagon, S. 85-87, 89-91,93,95-97, 101-104, 107, 108, 111, 115, 117, 119, 122, 1095, 1096, 1100, 1118, 1123, 1130 Wajch, E. 748, 751,760, 763 Wald, A. 949, 993, 1006, 1008 Waldschaks, G. 774, 786 Walker, E.A. 1405-1407, 1467, 1525, 1529, 1550, 1563 Walley, P. 1390, 1401, 1477, 1522 Walsh, J.B. 331, 3.35, 343 Walters, P. 1204, 1235 Wang, P.J. 594, 615 Wang, S.P. 1120, 1130 Wang, Z.P. 178, 660, 668, 673, 1331, 1332, 1343, 1351, 1379, 1405, 1406, 1468, 1514, 1522 Ward, A.J. 247 Warner, S. 707, 739, 743, 929, 945 Waterman, D. 683, 692, 701 Weakley, C.B. 743 Weber, H. 128, 144, 156, 157, 160, /75,705,711, 718, 72,3, 724, 7.38, 741, 742, 743, 913-915,932, 935, 938, 944, 945, 949, 1006, 1007, 1007, 1010 Weber, K. 509, 510, 527 Weber, S. 956, 1008, 1331, 1332, 1345, 1366, 1367, 1379, 1406, 1410, 1411, 1463, 1467, 1468, 1476, 1522 Weglorz, B. 1106, /127 Wehrung, F. 100, 108, 116, 119, 121-123 Weil, A. 26, 1113, 1114, 1120, 1130 Weil, W. 662, 673 Weir, A.J. 505, 510, 511, 514, 516, 520, 523, 530 Weiss, B. 1211, 1227, 1228, 1232, 1233 Weiss, H. 1050, 1054 Weiss, T. 770, 784 Wentzell, A.D. 1466 Wenzel, J. 439,446 Werner, E. 585 Wets, R.J.-B. 647, 652, 655, 656, 663-665, 666, 669, 672 Weyl.H. 1198, 1199, 1235 Whalen, T. 1509, 1522 Wheeden, R. 700, 702 Wheeler, R.F. 586 White, B. 1034, 1036 White, H.S. 1225, 1226, 1235 Whiteside, D.T. 26 Widom, H. 82 Wiener, N. 298, 343, 1194, 1196, 1213, 1235 Wierdl, M. 1211, 1212, 1234 Wijsman, R.A. 632, 673 Wilczynski, W. 682, 694, 697, 699, 700-702 Wilhelm, M. 512, 530 Williams, D. 343 Williams, S.C. 1044, 1052, 1053 Williamson, J.H. 82 Willis, G.A. 95, 123 Wills, J.M. 114, 122 Wils, J. 1177, 1184 Wintner, A. 1213, 1235 Wisjman, R.A. 632, 673 Wittaya Naak-in 592, 613 Wloka, J. 1244, 1260
Author Index 1585 Wnuk, W. 805, 825 Wojdowski, W. 691, 702 Wqjtaszczyk, P. 439,447 Wolfenstein, S. 879, 906 Wolff, M.P.H. 1297, 1328 Wolfowitz, J. 949, 993, 1006, 1008 Wong.S.M.K. 1477, 1522 Wright, J.D.M. 842, 866, 899, 909 Wu,T.S. 1120, 1130 Wyler, O. 956, 1010 Xue, X.H. 660, 673 Yaari, M.E. 1476, 1522 Yager, R.R. 1352, 1379, 1509, 1522 Yan, B. 1510, 1522 Yand, J.S. 1120, 1130 YangQuingJi 1332, 1379 Yannelis, N. 662, 673 Yao, YY. 1477, 1522 Yeadon, F.J. 842, 868 Yor.M. 451,502 Yosida, K. 1194, 1235 Yuan, B. 1525, 1547, 1550, 1564 Zaanen, A.C. 26. 169. 170. 177. 523, 530. 796- 798, 805, 813, 814, 819, 822, 823-825 Zabrelko, P.P. 805, 807, 813, 814, 819, 822, 823- 825 Zadeh, L.A. 945, 949, 953, 956, 1010, 1476, 1498- 1500, 1507, 1522, 1550, 1559, 1565 Zagordny, D. 1421, 1468 Zahorski, Z. 689, 702 Zajicek, L. 677, 679, 686, 701. 702 Zakrzewski, P. 117, 118, 123. 1100-1105, 1107, 1108,1125-1127, 1128-1130 Zalinescu, С 655, 657, 672 Zame, W.R. 776, 784 Zapico, A. 1465. 1466, 1511, 1519 Zbierski, P. 1179 Zermelo, E. 119, 123 Ziat, H. 657, 673 Ziemer, W.P. 225, 228, 240, 247, 677, 701, 1013, 1025,1036 Zinn, J. 647,669 Zippin, L. 752, 764 Zygmund, A. 241, 246, 599, 613, 693, 700, 701, 702, 1143, 1184
Subject Index ?-orthogonality, 171 ®-step representation, 1354 ®-sum, 171,837 ? functor, 874 <5-ring, 30 ?-bracket, 509 ?-independent partitions, 1221 ? -almost everywhere (?-a.e.), 41 ? -chained, 731 ?-conegligible set, 38 ?-integrable function, 56 ?-integrable set, 39 ?-locally, 514 ?-measurable set, 39 ?-measurable step functions, 41, 351 ?-negligible sets, 38 ?-null set, 262 ?-topology, 708 ?-uniformity, 934 ?-continuous, 936 ?-additive, 748 ?-additive measure, 1134 ?-additive product, 749 .^-measurable function, 35, 348 ?-Borel measure, 1060 ?-Borelset, 1058, 1081, 1082 ? -algebra, 30 ? -bounded, 1060, 1061 -set, 1058 ? -complete MV-algebra, 873 ?-homomorphism, 877 ?-ideal, 798, 1096 (a-)order continuity, 716 ?-order continuous, 922, 929 - operator, 819 - submeasure, 716 ? -representation integral, 518 ?-Riesz homomorphism, 799 ?-ring, 30 ?-subadditive function, 713, 921 ?-submeasure, 713, 921 ?-density topologies, 693-699 ? ?-saturation, 1303 a.e. topology, 690 Aarnes theorem, 844 absolute value of a linear functional, 1058 absolutely - p-summable sequence, 426 - continuous, 251, 285, 288, 997 - continuous ?[,-measure, 991 -continuous functional, 523 absorbing, 306 absorbing subset, 285 abstract -Li'(l) spaces, 510 - Daniell-Loomis spaces, 526 - Fubini theorem, 521 - Lebesgue integral, 516 - Riemann integral - improper, 513 - proper, 513 - Riemann^-integrable functions, 514 - Riemann-type approach, 520 AC, 1265 acceptance preorder relations, 1492 action, 87 action of a group, 1094 - m-free, 1095 - by measurable transformations, 1093 - free, 1094 - transitive, 1094 addition of observables, 886 additive - measure, 1333 - set-valued function, 643 adic transformation, 1228 admissible density, 1138 admissible extension of a Daniell system, 510 admissible linear liftings, 1144 admissibly generated lifting, 1139 AF C*-algebra, 871 1587
1588 Subject Index affine function, 413 Aleksandrov theorem, 79 algebra of sets, 30, 1526, 1533, 1537, 1541 -non-atomic, 1533 almost perfect, 808 almost strong -density, 1149 -lifting, 1149 - lifting property, 1150 almost-invariant set, 1095 o?-almost perfect, 808 o?-perfect, 808 ambiguous set, 86 amenable group, 92 ample product system, 521 analytic space, 769 Andersen and Jessen theorem, 1319 aperiodic Markov shift, 1206 aperiodic transformation, 1227 approximate - finitely additive Radon-Nikodym property, 275 - functional Radon-Nikodym, 524 - limit, 686 - Radon-Nikodym-Bochner theorem, 277 - tangent m -plane, 1019 - tangent vectors, 1018 approximately continuous, 686-690 - functions, 677 approximation property, 430 Archimedean - Riesz space, 792 - t-norm, 951 area minimizing current, 1034 arithmetic progression, 1201 arithmetical mean of observables, 886 AS LP, 1150 associate space, 815 associativity, 1351 asymmetric integral, 1390 atom, 43, 628, 646, 729, 797, 920 atomic - 7"L-measure, 998 - lattice, 729, 920 atomistic lattice, 729 atomless lattice, 729, 920 attractor, 1043, 1044 Aumann integral, 636 Aumann-Pettis integrable, 636 Aumann-Pettis integral, 636 auxiliary function, 1049 average density, 1045 axiom - K, 535 - L,535 -Martin's, 1287 ?-exceptional sets, 509 Baire - a -algebra, 275 -lifting, 1139 - measure, 522, 748, 1060, 1061, 1071, 1134 -set, 1058, 1081-1084 - strong liftings, 757 Banach - algebra, 407 -density, 1201 - function space, 813 - lattice, 406 - measure, 90 Banach-Schroder-Bemstein theorem, 90 Banach-Tarski paradox, 90 band, 798 Bartle-Dunford-Schwartz - integral, 277, 280 -representation theorem, 1083 bases associated with a lifting, 206 basic -assignment, 1546, 1547 - function, 1334 -probability mass assignment, 1385 - values of Choquet integral, 1336 - values of general integral, 1352 - values of Sugeno integral, 1340 basically disconnected space, 875 basis -decomposable, 187 - pointwise character, 187 BCK-algebra, 856 belief, 1546, 1547 - function, 1477 - revision, 1505 Bellman operator, 1435 Bernoulli - numbers, 1388 -shift, 1190, 1198, 1206, 1220 Besicovitch-Morse property, 228 Bessaga-Pelczynski theorem, 155 big-stepped probability, 1486 bimorphism, 898 bipartite graph, 97 bipolar, 282 Birkhoff ergodic theorem, 654 Biting Lemma, 153 Blackwell space, 779 block, 832 Blumenthal 0-1 law, 305
Subject Index 1589 Bochner - integrable, 275 - integrable functions, 268, 273, 509, 524 -integral, 266, 351,354 - Lebesgue space, 806 - type derivative, 284 Boolean algebra, 873 - of projections, 573 Boolean partition of unity, 903 Boolean valued satisfaction, 1266 Borel - ? -algebra, 31 -?-field, 268 - function, 35 -lifting, 1139 -measure, 519, 522, 1014, 1060, 1110, 1134 - Radon measure induced by a positive linear functional, 1059, 1061, 1062 - regular, 1040 -set, 31, 1058 - space, 1095 bound, lower, 791 bound, upper, 791 boundary of current, 1029 bounded -complex Radon measure, 1071 - linear functional in /CG")*, 1071 - linear lifting, 1149 - Radon operator, 1087 -set, 711 - variation, 269, 285 - order - from above, 791 - from below, 791 --operator, 818 --set, 791 Bourbaki integral, 506 Bourgain property, 544 box dimension, 110 Brooks-Jewett theorem, 139 Brownian - local time, 309 - motion, 1044, 1050, 1297, 1317 - natural stretched, 332 - normalized, 298 - - stretched, 332 -path, 1041 Burgers equation, 1440 Busemann-Feller bases, 205 C*-algebra, 407 e.g. ?-algebra, 767 c.l.d., 1139 - product measure, 72 Cafiero uniform exhaustivity theorem, 130 cancellation law, 99, 101 canonical extension, 368 - of additive measures, 370 Cantor -diagonal argument, 1264 - function (Devil's Staircase), 50 -set, 50, 1039, 1043 - theorem, 1284 capacities, 1383 capacity - approach, 517 - functional, 652 Caratheodory, 10.39 -construction, 1014 -extension, 257, 517 -extension theorem, 1303 cardinal -compact, 1267 -hige, 1279 -Mahlo, 1285 - medium compact, 1266 - real valued compact, 1276 - real valued strongly compact, 1276 - rel valued large, 1276 -RV large, 1275 - strongly compact, 1266, 1275 - supercompact, 1267, 1275 - weakly compact, 1266 cardinal - invariants, 1275 - norm,1267 cardinality -function, 1.387 -jump, 1279 - operator, 1.387 cascade, 1199, 1224, 1226 Castaing representation, 631 categorical equivalence, 874 category measure, 11.35 Cauchy sequence, 624 Central Limit Theorem, 886 centre, 916 Chacon transformation, 1206 Chacon-Omstein theorem, 1195, 1197 chained, 728 - Гм-clan, 1006 - uniform space, 940 change of variables, 68 character of a measure space, 1126 characteristic function, 876 charge, 768 - extension property, 778
1590 Subject Index Choquet - integral, 252, 289, 1336, 1390, 1393-1395 - theory, 1314 classical representation, 1335, 1354 closed -convex hull, 620, 621 - order, 796 - subset, 263 closure - of a Daniell system, 509 - theorem, 1033 CMEA, 1283 co-Mobius transform, 1385, 1391 commonality function, 1385 commutativity, 1351 comonotone - ®-additivity of general integral, 1353 - additivity of Choquet integral, 1337 -function, 1334 - maxitivity of Sugeno integral, 1341 compact, 1301 - charge, 770 - class, 769 - measure, 770 -Polish G-space, 1117 - range property, 554 compactness, 278 -theorem, 1033, 1266 comparative - possibility distribution, 1478, 1493 -probability, 1473 compatible element, 832 complement of a fuzzy set, 954 complete - algebra, 255 -ergodicity, 1206 - Dedekind, 793 --?-, 793 - super, 793 - measure, 515 - MV-algebra, 873 completeness, 261 completion - of FN-topologies, 727 -of MV-algebra, 933 - regular, 748 - regular measure, 1134 complex Radon measure induced by a positive linear functional, 1063 complex-valued function - integrable, 59 - measurable, 59 composite observable, 885 compressible equivalence relation, 1122 conditional - entropy, 1216 - entropy of a partition of unity, 902 - expectation, 67, 273, 523, 893 - information, 1216 - possibility, 1486 - possibility function, 1487 cone, 790, 836 - positive, 790 confidence - function, 1475 -relation, 1472, 1473 congruence property, 1041 conjecture of J.R. Choksi, 747 conjugate set function, 1383 conservative ergodic, 1197 consistent -density, 1159 -lifting, 1158 content, 33 continued fractions, 1044 continuity - of Choquet integral, 1338 - of general integral, 1353 - of Sugeno integral, 1341 continuous - measures, 263 - operator, order, 819 --?-,819 - projective system, 1163 continuum displacement, 1286 controlled convergence theorem, 593 convergence -a.e., 888 - a.e., in an MV-algebra, 888 - in distribution, 652, 888 - in distribution, in an MV-algebra, 888 - in measure, 71, 888 - in measure, in an MV-algebra, 888 - in the mean, 57, 352 - in the mean of order p, 354 - order, 796 - theorems, 390 convex measures, 995 convexification property, 653 convolution - of functions, 77 - of measures, 81 - semiring, 1419 coordinate - random variable, 885 - random vector, 885 corresponding ?-group with unit of an MV-algebra, 875
Subject Index 1591 Corson space, 534 countable - chain condition, 1101 - convexity, 58, 352 - equivalence relation, 1122 countably - 7"-additive function on a ?-tribe, 959 - equidecomposable sets, 117, 1096 - paradoxical set, 117, 1096, 1103, 1104, 1115, 1117,1122 - supported, 634, 637 counting measure, 36, 258, 291 Cousin's lemma, 235, 589 covariance function, 661 cover, 1040 covering relation, 185 crisp set, 953 critical point, 1266 CRP, 554 crude - inner envelope, 517 - outer envelope, 517 cumulative hierarchy, 1265 current, 1029 cutting-and-stacking construction, 1206, 1227 cylinder, 885 d-bounded set, 172 D-poset - complete, 172 - quasi-?-complete, 172 Daniell - integral, 506 - lattices, 506 - lower integral, 507 - mean, 508 - system, 506 - upper integral, 507 Daniell-Loomis integrals, 524 Daniell-Stone theorem, 514 Darboux - lower sum, 60 - property, 993 - sum, 20 - upper sum, 60 De La Vallee Poussin theorem, 243 decisivity, 1553 decomposability, 1392 decomposable, 638 decomposition, 292 -(D), 1139 -(ND), 1139 - of a measure space, 1138 -Jordan, 818 Dedekind - complete, 793 --?-, 793 - - super, 793 - vector lattice, 513 definable, 1265 defining sequence, 276, 286 degree of membership, 953 ?j-condition, 812 Dempster-Shafer theory, 1547 densities, 1017 density - function, 881 - point, 677-686 - property for ?, 212 - respecting coordinates, 1159 - topology, 677, 682-697, 1147 deniable, 270, 272 denting point, 270 dependent choice, 1264 derivation, 1538, 1542 derivation basis, 187 derivative, 1302 deterministic possibility distributions, 1479 deviation, 1528 Diagonal Theorem, 162 - Antosik's, 161 - Mikusinski-Antosik-Pap's, 161 diameter, 1040 - based packing measure, 1042 difference poset (D-poset), 171, 834 differentiable form, 1028 differentially equivalent, 202 differentiation - basis, 597-599 - theory of Daniell integrals, 523 diffusion, 305 - canonical form, 321 - regular, 306 dimension decomposition, 1048 Diophantine approximation, 1044 direct -density, 1145 -lifting, 1145 - sum property, 795 directed - downwards, 791 -set, 791 - upwards, 791 discrepancy, 112 discrete - measure, 36, 768 - spectrum, 1214, 1221
1592 Subject Index disintegrable, 782 disintegration, 782 disjoint, 799 - complement, 799 - measure-preserving systems, 1230 dispersion - of an observable, 882 - point, 678-686 distance function, 620 distribution, 1248 - of a measurable multifunction, 652 - of a multifunction, 648 distributivity under subtraction, 891 divergence of degree a, 1538 dividends, 1385 divisible - MV-algebra, 878 - probability MV-algebra, 880 division, 609 Dixmier's condition, 95 Doleans - function, 480 - measure, 397 domain, 629 dominance relation, 978 dominated -convergence theorem, 812 - ergodic theorem, 1196 Doob-Meyer decomposition, 397 Dorofeev-Sherstnev theorem, 844 Downward Transfer, 1300 downwards, directed, 791 drastic product, 951 dual -basis, 190 - nuclear space, 283, 284 -order, 818 -set function, 1383 - space, 270 duality, 282 - space, 777 Dunford integral, 538 Dunford-Schwartz integral, 514 dynamical system, 1044, 1187 Dynkin system, 32 ? -ergodic measure, 1123 ? -invariant set, 1122 effect algebra, 834 Effros -?-field, 629, 633, 648, 649 - measurable, 629, 631 Egorov theorem, 71, 790 elementary - embeddings, 1266 - integral system, 508 Elliott's partial addition, 872 embedding, 626 entire topology, 804 entropy, 1188, 1214, 1224 -convergence rates, 1223 - function, 902 - fuzzy, 1551 -Hartley, 1527, 1548, 1549 - of a partition of unity, 902 -of degree a, 1527, 1538 -of degree (?. ?), 1528, 1542 -of order a of type (?, у), 1560 -Shannon, 1527, 1535, 1551, 1561 -total, 1554 - weighted, 1554 epigraphical - convergence, 663 - multifunction, 664 equally weighted independent random variables, 1322 equiconsistency, 1268, 1274 equicontinuous norm, 811 equidecomposable sets, 86 equidistribution theorem, 1198 equimeasurable - function, 807 - set of functions, 798 equivalent projections, 871 ergodic, 1196 - Hubert transform, 1213 -hypothesis, 1187, 1188 - Szemeredi theorem, 1201 - theorem, 1193 - theory, 1187 ergodicity, 1189, 1196 essential - boundary, 1026 - integration, 512, 513 - measures, 256 - Radon integrals, 513 - supremum, 43 - upper functional, 513 - variation, 595 essentially i'-integrable functions, 513 evanescent set, 386 exact dimension, 1048 excess, 621 exhaustion, 524 - theorem, 798 exhaustive, 709, 922, 935 expectation of an observable, 882
Subject Index 1593 expected information, 1530, 1556 exponentially bounded group, 88 exposed points, 263 extension - of ? -additive measure, 369 - of an invariant measure, 1107 - of positive measures, 368 - property, 778 external, 1299 F(S), 848 factor map, 1230 factorization, 783 faithful - measure, 900 - state, 880 Fatou property, 805 Fatou's lemma, 55, 58, 373, 507 Feller's theorem, 333 filtration, 386 finite - measure property, 35, 795 - subset property, 795 - variation, 268 finitely - 7"-additive function, 958 - additive measure, 253, 260-263, 285, 514, 837, 900 firefly in a box, 834 first category, 875 Fischer, Riesz-property, 811 fixed point, 1100, 1118 flat norm, 1030 Fleissner theorem, 1282 FN-topology, 706 focal element, 1546 Fourier transform, 77 - exchange formula, 78 - inverse, 78 fractal, 1039 frame function, 842 Frank t-norms, 952 Fubini theorem, 74 fuge, 1287 full - integral, 512 - tribe, 876 function -АССд, 598 -AC*. ACG*. 592 - ACS,ACGS, 592 - absolutely continuous, 64 -additive, 1531, 1537 - almost bounded, 1272 - Borel measurable, 48 -continuum displacement, 1286 - convex. 67 - essentially bounded, 69 - incompresible, 1272 - Lebesgue integrable, 59 - Lebesgue measurable, 48 -logarithmic, 1531, 1537 - lower density, 1149 -multiplicative, 1531, 1536, 1537 -nowhere bounded, 1272 - of an observable, 885 - upper density, 1149 function-lifting, 1140 functional, 313 -determined by a complex measure, 1075, 1077 fundamental equation - 11M with the branching property, 1533, 1534 -of recursive 11M, 1537 -of weighted additive 11M, 1540, 1541 fundamental solution, 1251 - of the Cauchy-Riemann operator, 1252 - of the heat equation, 1254 - of the Laplace operator, 1254 - of the one-dimensional wave equation, 1253 - of the two-dimensional wave equation, 1253 fuzzy - integration, 252 - measure, 288, 290, 1333, 1383, 1546 - measure space, 288, 1333 -set, 517, 953, 1550 - subset, 953 G-? -bounded set, 1100 G-absolutely negligible set, 1124 G-atom, 1097 G-bounded set, 1096 G-equidecomposable sets, 88 G -invariant a -algebra, 1093 C/-invariant (linear) lifting, 1157 G-negligible set, 10% G-partition, 1108 G-space, 1093, 1094 game, 1383 - with value, 1435 gauge, 589 - essential, 595, 606 - function /i, 1041
1594 Subject Index Gauss - measure, 1191 - Green theorem, 1027 Gaussian - multifunctions, 660 - system, 1191 GCH, 1265 Gel' fand - derivatives, 278 - integrable, 277 - integral, 277, 538 Gel'fand-type derivative, 277 general - duality theorem, 777 - fuzzy integral, 1352 generalized - Bartle-Dunford-Schwartz representation theorem, 1083 - derivative - distributional, 1249 - in the Sobolev sense, 1241 - dynamical system, 902 - product, 898 generated - (r-)tribe, 956 - ?-measure, 977 - tribe, 876 generator, 903 generic - extensions, 1266 - ultrafilter, 1266 geometric measure theory, 1044 Gleason theorem, 841 graph, 629 graph-measurable, 630 graphoid structure, 1501 Greechie logic, 832 Greenleaf's problem, 107 Gribanov, Luxemburg theorem, 795, 817 Grothendieck space, 431 group - of Borel automorphisms, 1122 - of homeomorphisms, 1110, 1121 - of isometries, 1110, 1121 - of permutations, 1094 group-valued measure, 882 Holder - exponent, 1047 -inequality, 71,356, 815 H6rmander formula, 622 Haar measure, 1093, 1106, 1110, 1112, 1113 - existence of, 1111 Haar null set, 1114 Hahn decomposition, 40, 252, 254, 262, 523 halo properties, 225 harmonic measure, 1314 Hashimoto topology, 699 Hausdorff -dimension, 1041, 1042, 1046, 1412 - distance, 621, 655 - measure, 255, 1015, 1039, 1042, 1046, 1050 - outer measure, 1040 - type, 1050 Heaviside's function, 1240 helical transform, 1213 Henstock's lemma, 591, 608 Hewitt-Yosida theorem, 153 Hubert space, 269, 871, 1192 HOD, 1265 HODR, 1265 homogeneity - of Choquet integral, 1337 - of Sugeno integral, 1340 homogeneous G-space, 1120 homomorphism, Riesz, 799 - normal, 799 -?-, 799 Hopf-Kawada theorem, 1097, 1099, 1100 horizontal - ®-additivity of general integral, 1353 - additive decomposition, 1338 - additivity of Choquet integral, 1338 - integral, 517 - maxitivity of Sugeno integral, 1341 Hudetz entropy, 904 hull-kemel topology, 873 hyper-stonian space, 751, 1135 hyperfmite set, 1302 / continuous -at oc, 518 -at 0,518 /-integrable function, 510 /-measurable function, 510 /-summable function, 507, 514 ideal, 798, 800 - null, 820 -?-, 798 - of an MV-algebra, 873 - space, 806 idempotent - integral, 1423 - semimodule, 1408 - on pseudo-linear functionals, 1422 image measure, 47 imaginary part of ?, 1058
Subject Index 1595 improper integrable functions, 526 inaccuracy, 1528, 1535, 1538 indecomposable integral current, 1032 indefinite integral, 251, 359 independence, 883 - space, 779 independent - complement, 780 - multifunctions, 648 - observables, 883 - sequence of observables, 884 indicator, 876 - function, 623 induced transformations, 1229 inductive limit locally convex topology, 1057 inequality, Holder, 815 infimum, 791 infinitary - distributivity law, 873 - languages, 1266 infinite product measure, 885 infinitesimal, 1300 - numbers, 1298 information, 1187 - function, 1215 - improvement, 1528, 1538 - measure, 1429 --additive, 1535, 1544, 1548, 1549 - - branching, 1532, 1534, 1540 - expansible, 1544, 1549 --inset, 1527 - non-probabilistic, 1529 - on closed domain, 1528 - on open domain, 1528 - - polynomially additive, 1559 --probabilistic, 1528 --recursive of degree a, 1535, 1541, 1555, 1558 --recursive of multiplicative type, 1535, 1536 --subadditive, 1543, 1549 - - sum form, 1533, 1536, 1540, 1541 --symmetric, 1532-1534, 1536, 1545, 1549, 1551,1556 --weighted additive, 1535, 1536 injective tensor norm, 411 inner - product, 1531 - product space, 848 - regular, 519 instantaneous, 305 integrable, 287 - function, 251 - multifunction, 635, 657 - observable, 882 integrably bounded multifunction, 635, 654, 655 integral, 286, 375, 819, 1302, 1310 - C, 596 - M|,604 - ?-regular Henstock-Kurzweil, 601 - BV, 606 - F, 606 - GP, 603 - PU, 605 - i>, 607 - p, 602 - g,606 - approximate, 597 - approximate symmetric, 599 - bilinear form, 412 -Bochner, 351, 354 - current, 1031 - dyadic, 597 - flat chain, 1032 - Henstock-Kurzweil, 590, 600, 608, 609 - indefinite, 66, 359 - induced by measure, 515 - McShane, 596 - measure, 251, 252 - metric, 525 - Nonnenmacher, 604 - normal, 819 - norms, 524 - of step function, 349 - Pfeffer, 604 - regular Henstock-Kurzweil, 602 - repeated, 73 -representation, 514 - stochastic, 390, 461 - symmetric, 598 - with respect to a finitely additive measure, 93 integralgeometric measure, 1016 integration - lattice, 524 - with respect to a measure with finite variation, 357 interaction -index, 1385, 1386, 1388, 1389, 1393, 1394 -operator, 1387 -transform, 1385, 1390, 1391, 1393 internal, 1299 -description, 1299 intersection of fuzzy sets, 953 interval - effect algebra, 835 -exchange, 1192, 1206, 1221, 1228 -MV-algebra, 891 invariant - mean, 95
15% Subject Index - measure, 900 - measure extension theorem, 92 - set, 1094 inverse - Bernoulli operator, 1386 -density, 1145 - lifting, 1145 irrational rotation, 1198 isomorphism -invariant, 1189, 1218 - metric, 1188 - spectral, 1189 isoperimetric - inequality, 1027 - theorem, 1032 iterated - function system (IFS), 1043 - (local) integral norms, 522 - upper and lower integrals, 521 Jacobian, 69 Jensen's inequality, 67 joining, 1230 joint observable, 883 Jordan -decomposition, 41, 265, 818, 1058 - upper norm,525 - m-integrable functions space, 525 Julia sets, 1044 ?-additive measures, 1392 К -automorphism, 1209, 1220 К -tight, 724 (K) property, 534 К weak precompactness, 545 Keisler's Fubini theorem, 1323, 1325 kernel, 953 Kolmogorov -complexity, 1225 - construction, 884 - probability space, 885 Kolmogorov-Sinai entropy, 903 Kolmogorov-Sinai theorem, 903, 1219 Komlos theorem, 657 Kothe space, 813 Krieger generator theorem, 1200, 1220 «-group, 853, 874 LP -spaces, 523 Laplace transform, 79 large deviation principle, 1433 - weak, 1433 large set, 1103 lattice, 791 - norm,806 --pseudo-, 801 - - quasi-, 806 lattice-order of an MV-algebra, 873 law of large numbers, 575 Lebesgue -completion, 1059, 1063, 1066, 1073-1075, 1077 - completions, 1076 - density, 1136 - density theorem, 63, 678, 692 - dominated convergence theorem, 23, 58, 507 -integral, 23,59, 1313 - measurable set, 48 - measure, 23, 251, 258, 268, 291, 769, 1093, 1094, 1102, 1105-1107, 1123, 1124, 1314 - measure on ?'", 48 - outer measure, 48 - PIP, 559 -set, 182 - spectrum, 1206 - theorem, 355, 376 Lebesgue-Bochner space, 806 Lebesgue-Radon completion, 1089 Lebesgue-Stieltjes integral, 379 left unit, 1349 Legendre transform, 1050 lifting, 756, 1135, 1304 - compact map, 1170 - respecting coordinates, 757 - theory, 524 - topology, 1147 limited, 1299, 1300 linear - differentiation, 259 - extensions of set functions, 508 -lifting, 1142 - lifting respecting coordinates, 1159 - possibility distributions, 1479 Lipschitz - mapping, 1040 - neighbourhood retract, 1032 local - convergence in measure, 514 - dimension, 1047 - integral metric, 524, 525 - null set, 795 localizability, 523 localizable, 794 - measure. 1134 -uppergauge, 524 localization. 1529, 1552 localized Lebesgue-Radon completion, 1089
Subject Index locally - commutative action, 103 -determined measure, 1134 - finite graph, 97 - integrable function, 522 - solid topology, 800 - - entire, 804 Loeb - measure, 1304 - measure space, 1304 Loomis system, 514 Loomis-Sikorski theorem, 877 Lorentz space, 807, 810 Lorentz-Luxemburg theorem, 816 lottery paradox, 1495 Lovasz extension, 1384, 1390 lower -bound, 791 - density, 682, 692, 1136 - variation, 984 lower-upper aproximant, 518 Lukasiewicz - t-conorm, 951 - t-norm, 951 Lusin theorem, 43, 81 Luxemburg - Lorentz theorem, 816 -norm, 807 Luxemburg-Gribanov theorem, 795, 817 Lyapunov convexity theorem, 637 .M(M). 873 ш-almost everywhere convergence, 896 m-almost everywhere property, 372 m-measurable function, 372 m-negligible set, 372 m-set, 1023 OT-complete homomorphism. 1266 Mobius - operator, 1386 -representation, 1385 -transform, 1384-1387, 1389-1392 Maharam measure, 257 Marcinkiewicz space, 807 Marczewski's problem, 108 marginal measure, 773 marginal problem, 775 Markov -kernel, 961 - property - - strong, 300 -shift, 1190, 1198, 1206, 1220 - time, 300, 305 Martin boundary theory, 1314 martingale, 273, 396 -convergence theorem, 894, 1319 - square integrable, 397 mass of current, 1029 maximal - ergodic theorem, 1194 - function, 1194 - ideal, 873 -inequality, 1195, 1320 maximality, 1551 maxitive measure, 1334, 1411 -completely, 1411 Mazur property, 534 meager set, 875 mean ergodic theorem, 1192 mean functional, 665 mean variation, 480 measurable, 251 -cylinders, 75 - function, 789 - Implicit Function Theorem, 633 - multifunction, 629 - network, 1150 -partition, 1190, 1214 - rectangle, 1323 - space, 34 -sets, 36,515, 1095 -space, 34, 1095 measure, 36, 251, 916 - G-ergodic, 1095 - absolutely continuous, 65 - additivity, 1267 -algebra, 255, 1269 - associated to a function, 382 - atomless, 43 - Cantor, 80 - continuous, 839 - countable-cocountable, 44 -diffuse, 1048 -diffused, 1095 - Doleans, 397 - dominated, 534, 839 - finitely additive, 900 - image, 47 - indefinite-integral, 66 - induced by integrals, 514 - inner regular for the compact sets, 80 -invariant, 1093, 1095 - invariant on a Boolean algebra, 1102 - localizable (Maharam), 43 - locally determined, 43 - locally finite, 80 - nonatomic, 1110
1598 Subject Index - normal, 1274 - normalized, 789 -of fuzziness, 1550, 1551, 1553 - on BCK-algebra, 858 - on projections, 901 -onX, 1095 - optional, 394 - outer, 38 - outer regular, 81 - predictable, 394 - product, 883 - purely atomic, 43 -quasi-invariant, 1095 - Radon, 80 -regular, 79, 1267 - self-affine, 1050 -self-similar, 1049, 1050 - semi-finite, 43, 1107, 1114, 1127 - semiregular, 1107 - singular, 65, 839 - space, 36, 515 --complete, 38 - completion, 39 - Stieltjes, 46 - stochastic, 394 - strictly localizable (decomposable), 43 - topological, 80 - totally finite, 36 - truly continuous, 65 - uniform real-valued, 1275 - uniformly ? -additive, 355 - universal on a set, 1102 - usual, 76 -vector-valued, 1050 measure-preserving -flow, 1187, 1196 -system, 1187 - transformation, 897, 903, 1093, 1187 measure-theoretic -boundary, 1026 - exterior normal, 1026 membership function, 953 metric approximation property, 430 metric outer measure, 1040 Mikusinski's - differential operator, 1256 -integral operator, 1256 - operator field, 1255 mild mixing, 1208 minimal homeomorphism, 1199 minimal self-joinings, 1231 minimal surface, 1034 minimal unbounded function, 1272 Minkowski - addition, 620, 623, 625 - functional, 282 mixed norm space, 817 mixing, 1189 mixture problem, 781 model, 1265 modular - function, 1112 -space, 813 moments of an observable, 882 monocompact - class, 772 - measure, 773 monoid, 99 monotone -class, 30, 31 -convergence theorem, 58, 373, 507 -differentiation, 259 - function on a ?-tribe, 959 - sequence, 876 monotonically irreducible ?-measure, 977 monotonicity - of Choquet integral, 1336 - of general integral, 1352 - of Sugeno integral, 1340 monotonous lifting, 1140, 1149 monotony, 1274 morphism of unital i-groups, 874 Mosco convergence, 656 multifractal, 1049 -measure, 1049, 1051 -spectrum, 1049-1051 multifunction, 285, 622, 629 multilinear extension, 1384 multimeasure, 286, 643 multiple recurrence, 1201 multiplicative MV-algebra, 898 multiset, 905 Murray-von Neumann order, 871 mutual independence, 1327 MV-algebra, 853,872,913 - ? -complete, 873 - boolean, 873 - complete, 873 - divisible, 878 - faithful state, 880 - ideal, 873 -interval, 891 - multiplicative, 898 -observable, 881 - product, 891 - semisimple, 873 - spectral topology, 873
Subject Index 1599 - state, 879 - weakly ? -distributive, 875 - with product, 891 *N,c, 1299 narrow convergence, 647 natural - scale, 306 - time, 341 necessity - measures, 1476 - relation, 1474 net bases, 206 Nikodym -convergence theorem, 132 - Radon theorem, 821 NMSC, 1281-1283 non-additive integration, 512 non-continuity integration, 513 - process, 525 non-dogmatic possibility distributions, 1479 non-lattice integration, 511 non-singular transformation, 1095 nonatomic, 646 - 7"L-measure, 998 - measure, 768 - part, 628 - vector r^-measure, 998 nondogmatic possibility relations, 1492 nonmeasurable set, 1094, 1102, 1103. 1105-11 1123, 1124 nonperfect measure, 771 nonstandard - extension, 1299 -natural numbers, 1299 -real numbers, 1299 norm,706,1267 - equicontinuous, 811 - lattice, 806 - Luxemburg, 807 - pseudo-, 801 --lattice, 801 - quasi-, 806 - - lattice, 806 normal -current, 1030 -ideal, 1272 -integral, 819 - Riesz homomorphism, 799 - singular operator, 819 normalization, 1557, 1560 normalized measure, 789 norming - space, 365, 381 - subspace, 348 nuclear - map, 283 - spaces, 282, 283 null -function, 510, 1307 - ideal, 820 - set, local, 795 numerical index of contraction, 203 o-topology, 800 observable, 881 - addition, 886 - arithmetical mean, 886 - composite, 885 - dispersion, 882 - expectation, 882 - independence, 883 - independent in probability MV-algebra, 883 - independent sequence, 884 - integrable, 882 -joint, 883 - mean, 882 - moment, 882 - product, 886 - square integrable, 882 - variance, 882 - zero, 896 OD, 1265 OML, 831 OMP, 831 open set condition, 1043 operator - absolutely p-summing, 419 - order continuous, 819 --?-,819 - positive, 818 - regular, 818 -singular, 819 - unconditionally converging, 423 optimistic hybrid utility function, 1465 optional - ?-algebra, 386 - measure, 394 - process, 386 - projection, 393 - stochastic measure, 393 orbit, 1094 -equivalence relation, 1122 - of a state, 1188 order - bounded - from above, 791
1600 Subject Index - from below, 791 - - operator, 818 - set, 791 - closed, 7% - continuous, 922, 929, 935 -continuous operator, 819 --?-,819 - convergence, 796 -dual, 818 - separable, 792 - topology, 7% order-complete, 287 order-continuous systems, 506 order-determining system, 835 ordered -semigroup, 836 - weighted average, 1395 ordinal, 1264 -interaction transform, 1398 - invariance, 1516 - Mobius transform, 1397 -Shapley value, 1398 ordinary density point, 691 Orlicz space, 807 Orlicz-Pettis theorem, 146 orthoalgebra, 833 orthogonal lifting, 1140 orthogonally closed subspace, 848 orthomodular - lattice, 831 - poset, 831 outer measure, 38, 255 packing, 1042 -dimension, 1042, 1046 -measure, 1041, 1042 Painleve-Kuratowski - convergence, 654, 655 - lower limit, 655 - upper limit, 655 pairwise independence, 1327 paradox, 85 - of infinity, 85 paradoxical set, 87 part, regular, 811 partially ordered vector space, 790 partition, 589 - a-fine, 589 - McShane, 5% - partial, 589 -of unity, 901 perfect - almost, 808 --a-, 808 -a-, 808 - matching, 97 - measure, 771 - probability space, 771 - semi-, 808 perimeter, 1025, 1026 Pettis - derivative, 274, 278 - integrable, 274 - integral, 274, 538 - integral property, 558 Pietsch domination theorem, 420 PIP, 558 Plateau problem, 1027 plausibility, 1547 - function, 1477 plausible inference, 1486, 1488 PMEA, 1281, 1282 points of density, 678 Poisson process, 1316 Polish - G-space, 1115 -group, 1110 -space, 663, 1110 polynomial recurrence, 1202 positive - cone, 790 -contraction, 1195 - linear functional, 1058, 1059, 1062, 1064, 1066, 1071, 1074, 1084 - operator, 818 possibilistic - event independence, 1502 -independence, 1499 -integrals, 1507, 1510 -likelihood, 1484, 1485 - likelihood relation, 1516 possibility -distribution, 1478, 1481, 1488 -measure, 1411, 1476 -relation, 1474 pre-density, 1140 predictable -a-algebra, 386 - measure, 394 - projection, 393 preferential extension, 1493 preideal space, 806 pretopological measure space, 1134 primitive product measure, 72 principle -Vopenka, 1266, 1268, 1284
Subject Index 1601 probabilistic -independence, 14% - metric space, 1424 probability, 33 - ?-measure, 959 - distribution, 883 - measure, 517 - MV-algebra, 880 - MV-algebra with product, 892 - space, 1187 problem -of coefficients, 599 - of primitives, 589, 5% process, 387 - p-summable, 388 - adapted, 386 -cadlag, 452 -evanescent, 387 - indistinguishable, 387 - locally p-summable, 391 - modification, 387 - with finite variation, 392 product -density, 1158 -joint refinement, 901 - lifting, 758, 1158 - linear lifting, 1158 -measure, 75,521,748, 883 - of observables, 886 - system, 521 -transformation, 1221, 1226 projection - equivalent, 871 - property, 778 - theorem, 632 projective - limit of a self-consistent family of densities (liftings), 1164 -tensor norm, 411 prolongable, 1088, 1089 - Radon operator, 1087-1089 - Radon vector measure, 1089 proper ideal in MV-algebra, 873 proper Riemann ?-integrable functions, 514 properly measurable function, 549 property - Baire, 1287 - direct sum, 795 - Fatou, 805 - finite measure, 795 - finite subset, 795 - Riesz-Fischer, 811 - V, 423 primings, 186 pseudo- - addition, 1343 - Boolean functions, 1383-1384 - convolution - of the first type, 1415 - of the second type, 1415 -difference, 1345 -multiplication, 1346 - MV-algebra, 860 -norm, 706, 801 --Riesz. 801 pure 7"; -measure, 976 purely - atomic part. 628 - finitely additive measure, 736 - ???-?-order continuous, 926, 938 - unrectifiable set, 1017 qualitative conditioning, 1487 quantum logic, 831 quasi- - compact measure, 771 - complete space, 284 - dyadic, 750 - norm, 806 - Radon measure space, 1135 - singular, 291 quasimartingale. 480 ^-regular. 1062 -complex measure, 1060 - measure, 1060 ^-regularity, 1085 r topology, 690 r-dimensional volume - of half open interval, 47 radical, 854 radius of a subset, 653 Radon - integral, 520 -measure, 508, 519, 520, 1014, 1314 -operator, 1086-1089 - probability space, 1289 -regular, 1060-1062, 1064, 1066, 1068-1071, 1073-1076 - regularity, 1073 - vector measure, 1087-1089 Radon-Nikodym -derivative, 66, 251, 257, 258, 268 - property, 252, 269, 272, 362 -theorem, 361,522, 647, 821 random - ergodic theorem, 1212
1602 Subject Index - sets, 623 - vector, 883 - walk - natural stretched, 336 - stretched, 336 range, 263, 282 rank one transformation, 1198 rate function, 1433 rational extension, 1490, 1493 real -form, 801 - part of a linear functional, 1058 real-valued-measurable cardinal, 1103 rearrangement invariant, 807 rectifiable, 1044, 1045 - current, 1031 - measure, 1020 -set, 1017 recurrence, 1188, 1200 reduced boundary, 1026 refined possibility relations, 1483 refinements of partitions, 1215 reflection principle, 1317 reflexive, 272 region - starshaped, 1244 regular -conditional probability, 778 - distribution, 641 - linear functional, 520 - measure, 519 - operator, 818 -part, 811 - space, 811 - state (charge), 852 relative cancellation property, 857 relatively compact, 274 relatively dense set, 1200 Rellich's lemma, 1244 representation by Radon integrals, 520 representing measure, 1088, 1316 -of «, 1084 residuated difference, 1396 residuation in Elliott's addition, 872 resolution, 1551 return-times theorem, 1213 rich multimeasure, 645 Riemann -integral, 19,60 -sums, 589 - functionally small, 592 - locally small, 592 Riemann-integrable functions space, 525 Riemann-Liouville operators, 1256 Riemann-Stieltjes integral, 508 Riemann-type integral, 287 Riesz - homomorphism, 799 --a-, 799 - normal, 799 - operations, 791 - pseudo-norm, 801 -space, 791 - Archimedean, 792 - of measurable functions, 792 - space (vector lattice), 69, 282, 287, 406 -representation theorem, 519, 1306, 1313 - theorem, 523, 790 Riesz-Dedekind complete, 69 Riesz-Fischer property, 811 Riesz-Herglotz theorem, 1314 right - distributivity, 1351 -unit, 1348 ring of sets, 30, 1526, 1533, 1540 Rokhlin tower, 1229 Rosenthal's lemma, 150 Rudin-Keisler order, 1279 Rybakov control, 279, 282, 284 ?-bounded measure, 279 ?-dimensional Hausdorff (outer) measure, 1040 ?-dimensional packing (outer) measure, 1042 5-integrability, 1304 5-integrable, 1305, 1310 -lifting, 1311 5-measure. 1408 saturated measure, 515 saturation, 1271 scalar - integrability, 538 - measurability. 535 scalarly - concentrated measure, 556 - measurable, 634 - uniformly absolutely continuous, 281 scaling property, 1041 SDP-property, 1083, 1085 segment property - of a region, 1244 selection, 631,645, 648 selector, 1103 self-affine functions, 1044 self-affine set, 1044 self-conformal set, 1044 self-consistent family -of densities, 1163
Subject Index 1603 -of liftings, 1163 -of linear liftings, 1164 self-dual bases, 207 self-joining, 1231 self-similar set, 1043, 1044 selfsupporting measure, 545 semi-finite measure, 256, 1134 semi-perfect, 808 semiconvex vector 7"L-measure, 993 semigenerated G"-)tribe, 957 semimartingale, 398 semimodule -dual, 1422 seminorm, 706 semiring, 30, 1407 semisimple MV-algebra, 873 semivariation, 706 -of a function, 380 - of a vector measure, 364 separable, 270 -a-algebra, 768 - dual, 273, 278 - dual spaces, 274 - order, 792 separably valued function, 267 separately additive density, 1160 separating subalgebra, 874 sequence - asymptotically- ACG*, 595 - control convergent, 594 - entropy, 1223 - equi-convergent, 594, 603 - of set functions - uniformly absolutely continuous, 133 - uniformly ACG*. 594 - uniformly ACG$, 594 set - a, -definable, 1273 -constructible, 1265 set function -?-additive, 33 -?-finite, 34 - ?-triangular, 157 - continuous from above, 33 - absolutely continuous, 133 -additive, 33, 1530, 1541 - continuous from below, 33 -exhaustive, 129 -exhaustive (s-bounded), 34 - extended real valued, 32 - (finite) real-valued, 32 -finitely additive, 33 - measure, 33 - monotone, 32 - nonnegative, 32 - normalized, 32 - positive, 32 -purely finitely additive, 152 - subadditive, 32 - submodular (concave), 32 - superadditive, 32 - supermodular (convex), 32 - totally monotone, 33 - triangular, 157 set-valued - central limit theorem, 662 - conditional expectation, 639 - expectation, 623 - Fatou Lemma, 663 - integral, 623 - martingale, 658 - measures. 643 - strong law of large numbers, 652 sets of finite perimeter, 1025 Shannon function, 1552 Shannon-McMillan-Breiman theorem, 1222 Shapley value, 1385, 1394 Shapley-Folkmann inequality, 653 sharpness, 1551 Sierpinski sets, 770 signed measures, 265 simple - function, 1334 - multifunction, 623-625 singular, 291,723 - 7"i,-mea.sure, 992 - element, 836 - operator, 819 - normal, 819 Sipos integral, 1390-1395 skew product, 1227 slice convergence, 656 small set, 1103 smooth equivalence relation, 1123 Sobolev - lemma, 1244 -space, 1242, 1243 solid, 798, 800 - locally topology, 800 - - entire, 804 - set. 706 space - associate, 815 - Banach function, 813 - ideal, 806 -Kothe, 813 - Lebesgue-Bochner, 806
1604 Subject Index -Lorentz, 807, 810 - Marcinkiewicz, 807 - mixed norm, 817 - modular, 813 - normal Moore, 1281 - Orlicz, 807 - partially ordered vector, 790 - preideal, 806 - product measure, 1269 - regular, 811 -Riesz, 791 - Archimedean, 792 - of measurable functions, 792 - symmetric, 807 spectral -invariant, 1189 - topology, 873 speed measure, 307 splicing, 774 splitting subspace, 848 square integrable - martingale, 397 - observable, 882 stabilizer, 1109 stable family of functions, 548 standard, 1299 - Borel G -space, 1114 - Borel group, 1113 - Borel space, 768, 1110 - measure space, 780 - part, 1300 - part map, 1312 - part mapping, 1316 state, 832, 854, 863 - in an MV-algebra, 879 -in Hilbert space, 871 - on a tribe, 876 -space, 1188 state-morphism, 855 stationary random process, 1189 step - function, 34, 348 -representation, 1335 Stieltjes integral, 383 stochastic -integral, 390, 392, 461 - locally p-summable, 392 - interval, 386 -matrix, 1191 - measure, 394 - process, 387 Stone -extension, 275 - representation theorem, 1080 - space, 275 Stone-Weierstrass theorem, 417 Stonian, 505 - Daniell system, 515 - lattice cone, 518 stopping time, 386 -predictable, 386 Stratonovich stochastic differential, 1446 strict t-norm, 951 strictly - localizable, 259 - localizable measure space, 1138 - positive state, 880 strong - approximate continuity, 693 - Daniell lattices, 506 - density, 692, 1147 - dual space, 282 - duality space, 777 - extension of measure, 1126 - law of large numbers, 889 -lifting, 1147 - lifting property, 756 - Lusin condition, 593 - mixing, 1203 - mixing of higher order, 1209 - unit, 835, 874 - Vitali assumptions, 198 strongly - Blackwell space, 779 - comparable measures, 257 - equivalent measures, 1102 - integrable, 286 - lifting compact map, 1170 - lifting compact topological space, 1170 - measurable, 267, 626, 654 - measurable multifunction, 623, 625, 629 subadditive, 706 -ergodic theorem, 1195 - function, 921 - law, 902 - measures, 512 - sequence, 1195 submeasure, 706 subnormal possibility distributions, 1479 Sugeno integral, 282, 288, 1340, 1396-1400, 1513 summability - of processes with integrable semivariation, 395 -of the stochastic integral, 391 super Dedekind complete, 793 superadditive set function, 357 superatomic Boolean algebra, 730 superstructure, 1297
Subject Index 1605 support, 953 - function, 620 - of a continuous function, 1239 -of a function, 792 -of a set, 792, 793 - of a topology, 805 supramenable group, 89 supremum, 791 Suslin topological space, 633 symmetric - difference, 1397 -integral, 1390 -maximum, 1396-1397 -minimum, 1396-1397 - space, 807 - Sugeno integral, 1399 Szemeredi's theorem, 1201 ?-clan, 956 ?-disjoint, 954 ?-measure, 959 ?-partition, 954 Г-tribe, 956 ?-valuation, 958 t-norm, 950 tangent measure, 1021, 1045 Tarski's circle-squaring problem, 110 tensor quadratic variation, 471 theorem - Central Limit, 886 - dominated convergence, 812 - Egorov, 790 - exhaustion, 798 - Hahn-Banach, 1288 - Kolmogorov-Sinai, 903 - Loomis-Sikorski, 877 - Lorentz-Luxemburg, 816 - Luxemburg-Gribanov, 795, 817 - martingale convergence, 894 - Nikodym convergence, 132 - Radon-Nikodym, 821 - Riesz, 790 - strong law of large numbers, 889 - Vitali's convergence I, 812 - Vitali's convergence II, 813 - weak law of large numbers, 889 thick subspace, 768 Titchmarsh's theorem, 1255 Tonelli theorem, 74 topological - entropy, 1224 -ergodicity, 1210 - measure space, 1134 - probability space, 748 - strong mixing, 1210 - weak mixing, 1210 topologically amenable group, 1117, 1118 topology - compatible with density, 1147 - locally solid, 800 - - entire, 804 - o-, 800 - order, 7% total -uncertainty, 1547, 1548 - variation, 706, 985 totally - measurable, 286 - ordered measure space, 1152 trace, 1245 - of a function, 1245 - ?-algebra, 767 - operator, 841 - probability, 767 tracial state, 901 transfer principle, 1298 translation invariant lifting, 1155 transportation problem, 777 transversal, 1103 triangular -conorm,950 -conorm corresponding to a t-conorm, 950 - functional, 160 - norm,950 tnbe. 876 Trotter's theorem, 309 truncation properties, 518 Turing machine, 1225 type - I convergence, 1259 - ? convergence, 1259 - Maharam, 1288 - semigroup, 98 W-bounded set, 172 n-integrable, 1086 и-measurable, 1088 ucc topology, 1314 ultrarilter -?-weakly normal, 1279 - measure, 709 - nonregular, 1277 - nonregular countably incomplete, 1277 - uniform, 1267 - topology, 709 -weakly normal, 1267 unanimity game, 1383, 1384, 1389
1606 Subject Index uncertainty, 1547, 1554 unconditional convergence, 643, 645 unconditionally convergent -map, 1085 -operator, 1085 underlying lattice of an MV-algebra, 873 uniform absolute continuity - of the indefinite integral, 355 uniform -convergence, 376 - integrability, 1310 -measure, 1021, 1045 - MV-algebra, 927 uniformity, 1557 uniformly ? -additive set of positive measures, 367 uniformly exhaustive, 941 - measures, 732 unimodular group, 1112 union of fuzzy sets, 953 unique ergodicity, 1199 unit, weak, 807 unitary operator, 1189 universal - independence space, 779 - possibilistic entailment, 1491 - strong lifting property, 1150 - Turing machine, 1226 universally - measurable, 630 - measurable function, 557 - measurable set, 1157 unlimited, 1298-1300 unrelatedness, 1498 upper -bound, 791 - density, 1136 - envelopes of upward-filtering families, 520 - functional, 512 -gauge, 525 - integral, 513 - norm,524 - S-norm,511 --S-norm, 524 - variation, 984 upwards, directed, 791 usual conditions for filtration, 386 USLP, 1150 ?-integrable functions, 513 valuation, 900, 1551 van der Waerden's theorem, 1201 variance of an observable, 882 variation, 39, 193, 267 -function, 45, 46, 381 - of a function, 44, 380 - of a multimeasure, 645 - of vector measure, 357 variational - characterization of the integral, 208 - measure, 593, 597, 599 - principle, 1224 vector - T\ -measure, 993 - lattice, 505 -measure, 1083 - locally bounded, 366, 370 - quadratic variation, 471 -state, 841 Vitali, 1286 -convergence theorem I, 812 -convergence theorem II, 813 - covering theorem, 183, 679 -set, 1103 - theorem, 63, 376 Vitali-Hahn-Saks theorem, 133 von Neumann's paradox, 104 W*RNP,561 weak -convergence, 1305, 1313 - extension of measure, 1126 - halo properties, 231 - law of large numbers, 889 - mixing, 1200, 1203 - mixing of higher order, 1208 - normality trace, 1279 - precompactness, 545 - Radon-Nikodym set, 554 - RNP, 554 - unit, 807 -uppergauge, 525 - Vitali property, 230 weak* - Radon-Nikodym property, 561 - scalar integrability, 538 - scalar measurability, 535 -topology, 1305, 1313 weak** - Radon-Nikodym set, 554 - RNP, 554 weakly -? -distributive i-group, 875 -a-distributive MV-algebra, 875 -a-distributive probability MV-algebra, 880 - p-summable, 426 - Bernoulli partition, 1221
Subject Index 1607 -compact, 1089 - compact Radon operator, 1087, 1088 - compact Radon vector measure, 1089 -comparable measures, 258 - complete spaces, 274 - measurable function, 267 - wandering set, 118, 1099 weight, 842 well-ordered partition, 1478 Wiener measure, 1319 Wiener-Shannon law, 1529 Wiener-Wintner theorem, 1213 Wijsman - convergence, 656 - topology, 632 WMEA, 1283 Young function, 807 Young-Fenchel transform, 663, 665 zero observable, 896 Zeta - operator, 1386 -transform, 1384