Author: Dold A.   Eckmann B.  

Tags: mathematics  

ISBN: 3-540-07163-6

Year: 1975

Text
                    Lecture Notes in
Mathematics
Edited by A. Dold and B. Eckmann
Series: Department of Mathematics, University of Maryland,
College Park
Adviser: L Greenberg
458
Peter Walters
Ergodic Theory -
Introductory Lectures
Springer-Verlag
Berlin • Heidelberg • New York 1975


Dr. Peter Walters Mathematics Institute University of Warwick Coventry/England Library of Congress Cataloging in Publication Data Walters, Peter, 1943- Ergodlc theory. (Lecture notes in mathematics ; 458) Bibliography: p. Includes index, 1. Ergodic theory. I. Title. II. Series: Lectures note's in mathematics (Berlin) ; 458. QA3.L28 no. 458 ,QA313] 510'.8s c515'.42] 75-9853 ISBN O-387-O7163-6 AMS Subject Classifications (1970): 28A65 ISBN 3-540-07163-6 Springer-Verlag Berlin - Heidelberg • New York ISBN 0-387-07163-6 Springer-Verlag New York ■ Heidelberg ■ Berlin This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically those of translation, reprinting, re-use of illustrations, broadcasting, reproduction by photocopying machine or similar means, and storage in data banks. Under § 54 of the German Copyright Law where copies are made for other than private use, a fee is payable to the publisher, the amount of the fee to be determined by agreement with the publisher. © by Springer-Verlag Berlin • Heidelberg 1975. Printed in Germany. Offsetdruck: Julius Beltz, Hemsbach/Bergstr.
Preface These are notes of a one-semester introductory course on Ergodic Theory that I gave at the University of Maryland in College Park during the fall of 1970. I assumed the audience had no previous knowledge of Ergodic Theory. My aim was to present some of the basic facts in measure theoretic Ergodic Theory and Topological Dynamics and show how they are related so that the audience would have the foundations to read the research papers if they wished to pursue the subject further. At the beginning of Chapter 1 I give a list of examples of measure-preserving transformations and at the end of each section of Chapter 1 I investigate whether these examples have the properties discussed in that section. These examples were chosen because of their varied properties and importance in the subject. Similarly in Chapter 5, on Topological Dynamics, a list of examples is given and the properties discussed in that chapter are considered for these examples. I tried to deal with entropy as simply as possible. In the discussion of entropy I have inserted without proof some of the more difficult theorems when I thought they were relevant to the discussion. In particular I have discussed the recent deep results of D. S. Ornstein on Bernoulli automorphisms and Kolmogorov automorphisms. In the final chapter I have presented the new treatment of topological entropy due to R. E. Bowen. One of the beauties of this treatment is that topological entropy can be defined for a uniformly continuous map of any metric space and that its value remains unchanged under certain types of covering maps. This enables one to give an elegant calculation of the topological and (Haar) measure
IV theoretic entropies of affine transformations of finite-dimensional tori. Since these notes have not been fully edited many references are missing and it is likely that credit is often not given where it is due. The theorems and definitions are numbered independently, but a corollary is given the same number as the theorem to which it is a corollary. Thanks are due to Victor Charles Stasio and Suellen Eslinger who took notes of the course and also to Allan Jaworski for editing and compiling the bibliography. Special thanks are due to Betty Vander- slice for her superb typing. —Peter Walters
Contents Chapter 0 : Preliminaries 1 §1. Introduction 1 §2. Measure Theory 3 §3. Hilbert Spaces 8 §4. 'Haar Measure 9 §5. Character Theory 10 §6. Endomorphisms of Tori 12 Chapter 1: Measure-Preserving Transformations 16 §1. Examples 16 §2. Problems in Measure Theoretic Ergodic Theory 19 §3. Recurrence 20 §4. Ergodicity 21 §5. The Ergodic Theorem 29 §6. Mixing 37 Chapter 2: Isomorphism and Spectral Invariants 51 §1. Isomorphism of Measure-Preserving Transformations 51 §2. Conjugacy of Measure-Preserving Transformations 53 §3. Spectral Isomorphism 54 §4. Spectral Invariants 57 §5. Examples 59 Chapter 3 : Measure-Preserving Tran s format ions 6 3 with Pure Point Spectrum §1. Eigenfunctions 63 §2. Pure Point Spectrum 64 §3. Group Rotations 67 Chapter 4: Entropy 70 §1. Partitions and Subalgebras 70 §2. Entropy 72 §3. Conditional Entropy 76 §4. Properties of h(T,A) 80 §5. Properties of h(T) 83 §6. Examples 92 §7. How good an invariant is entropy? 9 6 §8. Bernoulli and Kolmogorov Automorphisms 9 8 §9. Pinsker Algebra 107 §10. Sequence Entropy 108 §11. Comments 109 §12. Non-invertible Transformations 110
V! Chapter 5: Topological Dynamics 112 §0. Introduction 112 §1. Minimality 113 §2. Topological Transitivity 117 §3. Topological Conjugacy and Discrete Spectrum 122 §4. Invariant Measures for Homeomorphisms 128 Chapter 6: Topological Entropy 140 §1. Definition by Open Covers 140 §2. Bowen's Definition 146 §3. Connections with Measure Theoretic Entropy 155 §4. Topological Entropy of Linear Maps and Toral Affines 160 §5. Expansive Homeomorphisms 168 §6. Examples 182 Bibliography 18 5 Index 197
Chapter 0 ; Preliminaries §1. Introduction Generally speaking, ergodic theory is the study of transformations and flows from the point of view of recurrence properties, mixing properties, and other global, dynamical properties connected with asymptotic behavior. Abstractly,'one has a space X and a transformation T of X (or a family {T : t € R} of transformations of X) with some structure on X which is preserved by T (or by {T }). The nature of most of the work so far can be categorized into one of the four following types: (1) measure theoretic : Here one deals with a measure space X and a measure preserving transformation T: X •* X. (2) topological: Here X is a topological space and T: X -* X is a continuous map. (3) mixture of (1) and (2): In this situation X is a topological space equipped with a measure m on its Borel sets while T: X -* X is continuous and preserves m. (4) smooth: One considers a smooth manifold X and a smooth map T: X -> X. We shall deal with some topics from (1), (2), and (3). To see how this study arose consider, for example, a system of k particles in 3-space moving under known forces. Suppose that the phase of the system at a given time is completely determined by the
2 positions and the momenta of each of the k particles. Thus, at a given time the system is determined by a point in 6k-dimensional space. As time continues the phase of the system alters according to the differential equations governing the motion, e.g., Hamilton's equations dqi 9H dpi 9H dt 3p. dt 3q. If we are given an initial condition and such equations can be uniquely solved then the corresponding solution gives us the entire history of the system, which is determined by a curve in phase space. If x is a point in phase space representing the system at a time tQ, let T (x) denote the point of phase space representing the system at time "t+tn- From this we see that T. is a transformation of phase space and, moreover, Tn = id., T., = T.°T . Thus r 0 t+s t s {T.: t € R} is a one-parameter group of transformations of phase space. In dynamics one is interested in the asymptotic properties of the family {T }. It seems reasonable to study the system at discrete times tn, 2tQ, 3tQ,..., i.e., study the family {T^ } , , since we expect the properties of {T } to be reflected in those of {T^ }. r r0 For this reason, as well as the fact that it is simpler, one studies individual transformations and their iterates. One is particularly interested in the flow on an energy surface, which is sometimes smooth (hence considerations of type (4) arise), and sometimes is not (one then investigates along the lines of (2) ). Measure theory enters the picture via a theorem of Liouville which tells us that if the forces are of a certain type one can choose coordinates in phase space so that the usual 6k-dimensional measure in these coordinates is preserved by each transformation T.. Around 1900 Gibbs suggested using the measure-theoretic approach
3 in mechanics because of the difficulty of solving the equations of motion and also because this deterministic approach does not answer several important questions in mechanics. In discovering statistical mechanics Gibbs suggested looking at what happens to subsets of phase space. For example, if A and B are subsets of phase space what is the probability that the system is in B at the time t given that the system is in A at the time t-.? Given that the system begins in A at time tQ what is the average time the system spends in B? Such questions motivate the type of study undertaken in ergodic theory. We now list some general references for the material we shall discuss : For topics of types (1) and (3) mentioned above see Halmos [2], Billingsley [1], Hopf [1], Jacobs [1] [2], Parry [3], Rohlin [3][4][5], Friedman [1], Shields [2]. In addition to the Shields notes, further details on the results of Ornstein described in Chapter 4 may be found in a forthcoming book by Friedman and Ornstein [2]. For material of type (2) see Gottschalk and Hedlund [1], Nemytskii and Stepanov [1], and Ellis [1]. And material of type (i+) may be found in Avez and Arnold [1], Smale [1], Abraham [1], Abraham and Robbin [1], Nitecki [1]. Khinchin [1] provides a good sketch of the foundations of ergodic theory. For extensive bibliographies see Jacobs [1][2], Gottschalk [1], and Smale [1]. A recent survey is Mackey [1]. §2. Measure Theory General reference - Halmos [1]. We recall some fundamental notions from measure theory. Let X be a set. A a-algebra of subsets of X is a collection B of subsets of X satisfying:
4 (1) X € B, (2) B € B = X\B € B, (3) B € B, n > 1 = (J B € B. n , n n=l We then call (X,B) a measurable space. A measure space is a triple (X,B,m) where X is a set, B is a a-algebra of subsets of X, and m is a function m: B •*■ R satisfying m( U B ) = E m(B ) n=l n n=l n if {B } is a pairwise disjoint sequence of elements of B. We say that (X,B,m) is a probability space, or a normalized measure space, if m(X) = 1. We shall usually deal with such spaces. A collection A of subsets of a set X is an algebra if: (1) X € A, (2) A € A = X\A € A, n (3) A ,...,An € A = U A. € A. x n i=l x When one is trying to equip a measurable space (X,B) with a measure one usually knows what the measure should be on an algebra A £. 8, and so, one would like to know when this function defined on A can be extended to a measure on B. The following result deals with this situation. Hahn-Kolmogorov Extension Theorem: Given a set X, an algebra A of subsets of X, let m: A ■* R be a function satisfying m(X) = 1, m((jAn) = Lm(An) n n whenever A € A V n, I J A € A, and the (A } are disjoint. Then n there is a unique probability measure m defined on the a-algebra generated by A such that m(A) = m(A) whenever A € A.
5 A monotone class of subsets of X is a collection C of subsets of X such that if E, c E„ c ... belong to C then U E € C and n if F. D F, D ... belong to C then (~) F € C. n Theorem: If A is an algebra of subsets of X then the a-algebra generated by A equals the monotone class generated by A. If (X,B,m) is a finite measure space, one can easily deduce from this theorem that if A is an algebra generating the a-algebra B then for any B € B and e > 0 there exists A € A with m(AAB) < e. (In fact, C = {B€B|Ve>0 ] B € A with m(BAB ) < e} is a monotone class and contains A.) Notation: If A is an algebra we shall write a(A) for the a-algebra generated by A. Direct Products : Let (X.,8.,m.), i € Z be probability spaces. Their direct product (X,B,m) = TT (Xi,Bi,mi) i=-~ is defined as follows: (a) X = U X± (b) Let n, < n„ < ... < n be integers, and A € 8 i = 1,...,r. i i We define a measurable rectangle to be a set of the form { (x. ) € X: x € A V i: 1 S i S r} . 1 n. n. J il Let A be the collection of all finite unions of measurable rectangles. A is an algebra: (1) and (3) are obvious; to show (2)
6 observe that r X\{(x.) |x €A , lïiîr} = U {(x.) | xn € Xn \ A } € À a ni ni i=i -i ni ni ni and that A is closed under finite intersections. Let B be the a-algebra generated by A. (c) Each element of A can be written as a disjoint finite union of measurable rectangles so that we define r m({(x.) | xn_ €A , lsisr}) = J[ mn (A ) J i i i=l i i and then extend m to A in the obvious manner. The conditions of the Hahn-Kolmogorov Theorem can be shown to be satisfied, and thus we can extend m to B. Hence, we obtain a probability space. Measurable Functions : A Borel subset of R is a member of the a-algebra generated by the open sets. Let (X,B,|j.) be a measure space; f: X -* R is measurable if for all c € R, f~ (c,-) € B, or equivalently f~ (D) € B V Borel sets D c R. A function f: X -* C, f = f. + if„ is measurable if f,: X -* R and f„ : X -* R are measurable. If X is a topological space and B is the a-algebra of Borel subsets of X (the a-algebra generated by the open subsets of X) then each continuous function f: X -* C is measurable. Integration : n A simple function is a function of the form ]T a.X. , where i=l 1 i a. € R, the {A.} are disjoint members of B, and X denote s l the characteristic function of A.. Simple functions are measurable. We define the integral for simple functions by: (• n n ( E a.X. )dm = E a.mCAO. J i=l 1 Ai i=l 1 1
7 Suppose f: X -* R is measurable and f > 0; then there exists an increasing sequence of simple functions f j> f. For example we could take fn(x) i-1 ■ .c i-1 «- et \ i • t ^ri if S f(x) < — i=l,.. . ,n2 2n 2n In if f(x) > n. We define / fdm = lim / f dm and note that this definition is înde- * n pendent of the chosen sequence {f }. Suppose f: X •*■ R is measurable; then f = f - f where f+(x) = max {f(x),0} > 0 f"(x) = max {-f(x),0} 5r 0. We say that f is integrable if / f dm, / f~dm < », and we define [ fdm = I f+dm - [ f"dm. Now if f: X -* C is measurable, f = f, + if„, f is integrable if f, and f„ are integrable and we define I fdm = I fxdm + i I dm + i | f„dm. Observe that f is integrable if and only if |f| is integrable. Let L (m.) denote the class of all integrable functions (X,B,|j.) -* C. Lebesgue Dominated Convergence Theorem : Suppose {f } is a sequence of measurable functions, f -*f a.e. and there exists an integrable function g such that |f (x)| s g(x) a.e. for all n; then f and each f are integrable and f fn^ - } fd,
§3. Hilbert Spaces General reference - Halmos [3]. H is a Hilbert space if it is a Banach inner product space, i.e., (1) H is a vector space over the complex numbers C (2) There is an inner product on H, i.e., a map (•,•): H x H -* C such that : (a) (•,■) is bilinear (b) (f,f) ï 0 for all f in H (c) (f,f) = 0 if and only if f = 0 (d) (f,g) = (g,f). (3) ||f|| = (f,f) is a norm on H inducing a complete metric topology on H. Let (X,B,m) be a measure space. Consider all measurable func- tions f: X •*■ C such that / |f| dm < »; we define an equivalence relation on this set by saying that f ~ g if and only if f = g a.e. The set of equivalence classes relative to this relation forms a Hil- 2 2 bert space which we denote by L (X,B,m) = L (m), where the inner product of two functions is given by (f,g) = j fgdm. Recall: The Schwarz Inequality In any Hilbert space H |(f,g)| S ||f||-|lgll for all f,g in H. A unitary operator U on a Hilbert space H is an isomorphism of H, i.e., U is a linear bijective map preserving the inner product ((Ux,Uy) = (x,y), x,y in H). It follows that U is continuous .
9 §4. Haar Measure General reference - Pontrjagin [1]. Theorem: Let G be a compact topological group; then there exists a finite measure m defined on the Borel subsets of G such that m(xE) = m(E) for all x in G, and for all Borel sets E. We call such a measure a Haar measure. E.g., let K = {z € C: |z| = 1} and m denote normalized circular Lebesgue measure. Then m(aU) = m(U) for all sets U measurable on K. Theorem; If m and p. are both finite Haar measures on the compact topological group G then m = cm. where c > 0. Thus there exists only one normalized Haar measure on G. Remarks : (1) If U c G is a non-empty open set then it has positive Haar measure. This is because G = U gD=g,U0g,U0.,.0gU g€G X n by compactness. 2 (2) In the Hilbert space L (K,B,m) where m is the Haar measure on the unit circle K, the functions f(z) = z11, n € Z form an orthonormal basis.
10 §5. Character Theory General reference - Pontrjagin [1]. Many of our examples will be rotations, endomorphisms or affine transformations of compact groups. (We mean endomorphism in the sense of topological groups, i.e., an abstract group endomorphism v;hich is continuous.) In some proofs we will use the character theory of compact abelian groups, which we summarize in this section. For those not familiar with character theory, proofs in the later sections involving characters will usually be preceded by the proof in a special case where the group used is the unit circle and then classical Fourier analysis will be used. Let G be a locally compact abelian group with a countable topological base. Let G denote the collection of all continuous homo- morphisms of G into the unit circle K. The members of G are the characters of G, G is an abelian group under the operation of pointwise multiplication of functions. With the compact open topology G becomes a locally compact abelian group with a countable topological base. We have the following results: (1) G is compact «• G is discrete. (2) (G) is naturally isomorphic (as a topological group) to G, the isomorphism being given by the map a ■* a where a(y) = y(a) for all y € G. (3) If G is compact then G is connected «» G is torsion free. (4) G, xG„ = G, xG„ where "x" denotes direct product. So in some sense we can study compact abelian groups by studying discrete countable groups. Examples: (See §6 for proofs.) (i) Let G = K = {z € C: |z| =1}. Each homomorphism of K to itself is of the form z -* z11 (n € Z) so that G ^ Z.
11 (ii) Let G = Kn the n-torus. By (1+) Kn ^ Zn and in fact each member of Kn has the form f\ z-]_ ) z2 ' • * • ' zn' ~ z]_ z2 *"* n where (p ,...,p ) € Zn. (5) If H is a closed subgroup of G and H i G there exists a Y i G, y * 1 such that y(h) =1 V h € H. (We shall write this Y(H) = 1.) (6) Let G be compact. The members of G are mutually orthogonal 2 members of L (m), where m is Haar measure. Proof. It suffices to show Y(x)dm(x) = 0 if Y ¥ !• If a € G then since m is Haar measure Y(x)dm(x) Y ( a.x ) dm ( x ) = Y(a) Y(x)dm(x) Choosing a so that y(,&) i 1 we have Y(x)dm(>0 = 0. // (7) If G is compact, the members of G form an orthonormal basis o for L (m) where m is normalized Haar measure. This is part of the Peter-Weyl theorem and can be easily deduced from the Stone-Weierstrass theorem, which implies that finite linear combinations of characters are dense in C(G) = space of complex- valued continuous functions of G. (8) If A: G -* G is an endomorphism we can define an endomorphism A: G -* G by Ay = Y°A, y i G. It is easy to see that A is one- to-one if and only if A is onto and A is onto if and only if A is one-to-one. Therefore A is an automorphism if and only if A is an automorphism. Recall that for compact groups G, G is metric iff G has a
12 countable topological base. §6. Endomorphisms of Tori We shall view the n-torus in two ways:- multiplicatively as Kn = KxKx...xK , and additively as Rn/Zn where Rn is n-space and n times Z is the subgroup of R consisting of points with integer coordinates. A topological group isomorphism is given by K -* R /Z , (e 2rrix, 2Trix_ , „ n 1,...,e n) I* (Xl,...,xn) + Zn. Theorem: (1) Every closed subgroup of K is either K or is a finite cyclic group consisting of all p-th roots of unity for some integer p > 0. (2) The only automorphisms of K are the identity and the map z K- z (3) The only homomorphisms of K are the maps 4> : z h- z , n € Z. (4) The only homomorphisms of K to K are maps of the form (z, ,...,z ) >->■ z, •...•z where m, , . . . ,m_ € Z. In 1 n in Proof: (1) Let H be a closed subgroup of K; if H is infinite it has a limit point so, V e > 0 3 a,b € H 3 d(a,b) < e, a t b. Then d(b~ a,l) < e and therefore the elements of H are e-dense in K V e. Thus, H = K. If H is finite and has p elements then a^ = 1 V a € H. So each element of H is a p-th root of unity, and since there are p elements in H, H must consist of all the p-th roots of unity. (2) Let 9: K -* K be an automorphism. 9(1) = 1. Since -1 is the only element of K of order 2 we have 9(-l) = -1. Since i,-i
13 are the only elements of order 4 either 9(i) = i and 9(-i) = -i or 9(i) = -i and 9(-i) = i. Consider the first case. Since 9 maps intervals to intervals, the interval Cl,i] from 1 to i is either mapped to itself or to [i,l] (all intervals go anticlockwise). But since [l,i] does not contain -1 it cannot be mapped to Ci,l] so 9[l,i] = [l,i]. The only element of order 8 in Cl,i] is s and so this must be fixed by 9. Therefore gCl^"1 ] :—>■ . k . k Cl,eTrl/1+]. By induction one shows that 9(e2Trl/2 ) = e2Tri/2 for v each k > 0. It follows that 9 fixes all the 2 -th roots of unity V k > 0 and hence is the identity. In the second case one shows k . k that 9(e2Trl/2 ) = e"2Tri/2 V k > 0 and hence 9(z) = z-1, z € K. (3) Let 9: K -* K be an endomorphism. If 9 is non-trivial, its image, 9(K), is a closed connected subgroup of K and so 9(K) = K by (1). The kernel Ker 9 is a closed subgroup of K so either Ker 9 = K or Ker 9 = H , the group of all p-th roots of unity, for some p. The first case corresponds to trivial 9. If Ker 9 = H let a : K/H •* K be the isomorphism given by a (zH ) = zP, and let 9,: K/H -* K be the isomorphism induced by 9 (9,(zH ) = 9(z) ). 1 p r J 1 p Then 9]_aZ is an automorphism of K and by (2) either ®-\aZ (z) = z V z € K or 9,a-1(z) = z-1 V z € K. Hence either 9(z) = 9,(zH ) = 1 p 1 p 8,a"1(zP) = zP V z € K or e(z) = z~P V z € K. 1 P (4) Let Y..-: K -► Kn be defined by y-Cz) = (1,1, . . . ,l,z ,1, . . . ,1). i-th place If 6: K -* K is a homomorphism then 9oy-: K -* K is an endomorphism m. and so eoY^(z) = z 1 for some m. € Z by (3). Hence e(z1,...,zn) = e(Y1(z1)-Y2(z2)-...-Yn(zn)) = 9y1(z1)-9y2(z2)'-• •'9Yn(zn) m m, m_ = z/-z,2'....znn . // 1 l n
14 Theorem: (1) Every endomorphism A: K -* K is of the form: a a a a ac > ,11 in m nn, A(z1,...,zn) = (z1 -...-zn ,...,z1 -...-zn ) where a.- € Z. In additive notation, *.»)-^(T n (2) A maps Kn onto Kn iff detCa..] i 0. (3) A is an automorphism of Kn iff det[a..] = ±1. Proof: (1) Let tt . : K -* K be the projection to the i-th coordinate. Then tt.oA: Kn -* K is a homomorphism, so by (i+) of the previous theorem rr.oACz,,...^) - ^■^■...■^ where a.. € Z. (2) Assume detCa..] =0. 3 nu ,. . . ,m integers not all zero 3 m,A, + ... +m A =0 where A. is the i-th row of A. Then each 11 n n l n mi mn point (<u,,...,<u ) of K in the image of A satisfies co, ...m =1. Thus A(Kn) t Kn. Conversely if A(Kn) t Kn then the points of A(K ) are annihilated by a nontrivial character of K , say m m (z1,...,zn) -> zx -...-znn (this is by (5) of §5). Then mnA, + . . . +m A =0 and so det[a. - ] = 0 . il n n i] (3) If A is an automorphism represented by a matrix [A] then A is also an automorphism represented by a matrix [B], and since AA = I = A A we have that [B] = [A]~ . Since [B] is an integer matrix, det[A] = ±1. Conversely, if det[A] = ±1, [A]~ has integer entries and if B is the endomorphism of Kn it defines we have AB = BA = I. //
15 Notation : If A is an endomorphism of the n-torus, [A] will always denote the associated matrix and A will denote the linear transformation of Rn determined by [A]. So if tt : Rn -* Rn/Zn is the natural projection (tt(x) = x + Z ) we have ttA = Att . Let A: K -* K be an endomorphism. We now consider how the map A: K -* Kn (introduced in §5) acts as a map of Zn when Kn is identified with Z by the isomorphism: 2. .- n when y(z1)z2'-••>zn) = z±1'z2~'' " 'zn One readily checks that the endomorphism À: Z -* Z is given by CA]t ,mls n where [A] denotes the transpose of the matrix [A].
Chapter 1 : Measure-Preserving Tran s fo rmat ions §1. Examples Suppose (X,,8.,m,), (X„,B„,m„) are probability spaces. Definition 1.1: a) T: X1 ■* X„ is measurable if T-1(B2) c %1 (i.e., B2 € B2 = T-1B2 € B1). b) T: X, ->■ X2 is measure-preserving if T is measurable and m1(T"1(B2)) = m2(B2) V B2 € Bj. c) We say that T: X, -* X„ is an invertible measure-pre serving transformation if T is measure-preserving, bijective, and T~ is also measure-preserving. Remarks : (1) We should write T: (X. ,B, ,m., ) -* (X2,B2,m„) since the measure- preserving property depends on the B's and m's. (2) If T: X, -* X2 and S: X, -» X, are measure-preserving so is SoT: Xx - X3. (3) Measure-preserving transformations are the structure preserving maps (morphisms) between measure spaces. (4) We shall be mainly interested in the case (X.,B,,nu) = (X2,B2,m2) since we wish to study T (see §1, Ch. 0). In practice it would be difficult to check, using Defs. 1.1, whether a given transformation is measure-preserving or not since one usually does not have explicit knowledge of all the members of B. However we often do have explicit knowledge of an algebra A generating B (for example, when X is the unit interval A may be all finite unions of intervals, and when X is a direct product space
17 A may be the collection of all finite unions of measurable rectangles). The following result is therefore desirable in checking whether transformations are measure-preserving or not. Theorem 1.1: Suppose (X-, ,8..,m.. ), (X2,B2,m„) are probability spaces and T: X, ■* X„ is a map. Let A„ be an algebra which generates B„. If A2 € A2 » T_1(A2) € B1 and m1(T-1(A2)) = m2(A2) then T is measure-preserving. Proof: Let C2 = {B€B2: T_1(B) €B , m1(T-1(B)) =m2(B)}; we want to show that C„ = B„. However A„ £. C„ and C„ is easily seen to be a monotone class, so the result follows since the a-algebra generated by C„ is the monotone class generated by A2. // Examples of Measure-Preserving Transformations : (1) I = identity on (X,B,m) is obviously measure-preserving. (2) Let K = {z€C: |z| =1}, 8= Borel sets, and m = Haar measure. Define T:K-*K by T(z)=az where a is a fixed element of K. T is measure-preserving since m is Haar measure. (3) The transformation T(x) = ax defined on any compact group G (where a is a fixed element of G) preserves Haar measure. (4) Any continuous endomorphism of a compact group onto itself preserves Haar measure. Proof: Let A: G -* G be a continuous endomorphism, and m = Haar measure on G. Let |x(E) = m(A~ (E)). p. is a Borel probability measure and M-(Ax-E) = m(A_1(Ax-E)) = m(x-A_1E) = m.(E). Since A maps G onto G, \j. - m by the uniqueness property of Haar measure. // For example, T(z) =z preserves Haar measure on the unit circle.
18 (5) Any affine transformation of a compact group G preserves Haar measure. An affine transformation is a map of the form T(x) = a-A(x) where a € G is fixed and A: G -* G is a surjective endomorphism. T is measure-preserving because it is the composition of measure- preserving transformations. When A = I we have example (3) and when a is the identity element of G we have example (4). (6) Let Y = {0,...,k-l}, and give measure p. to i such that k-1 Y. P- - I- We let X = Tf Y together with the direct product i=0 1 measure. Define T: X -* X by: T({xi>) = {yi>, where yi = xi+1- T preserves the measure of each measurable rectangle and thus it preserves the measure of sets which are finite disjoint unions of measurable rectangles. By Theorem 1.1 T is measure-preserving. We call T the two-sided (p..,...,p. ..)-shift. (7) Let Y be as above, X = IT Y with the direct product measure. 0 Let T: X -> X be defined by (xQ,x1, . . .) h- (x1,x2, . . . ) . By an analogous argument to the one in example (6) we see that T is measure-preserving. We call T the one-sided (pQ,...,p, .)-shift■ 2 (8) Let I be the unit square equipped with Lebesgue measure and I 2 the unit interval with Lebesgue measure. Then p: I -* I defined by p(x,y) = x is measure-preserving. Given any set X,, any probability space (X„,B„,m2) and any onto map T: X. -* X„ we can choose a a-algebra 8. and a measure m1 on X-, to make T measure-preserving. In fact let 8, = T 8„ and define m, by nuCT B„) = nuCB,). Conversely, if (X, ,B., ,m., ) is any probability space, X2 any set
19 onto and T: X, -* X„ any map, then we can choose a a-algebra B„ and a sure m„ on X„ so that T is measure-preserving. Put B2 = {B: B c x2 and T-1B € B1> and m2(B) = m1(T~1B) for B € B2- §2. Problems in Measure Theoretic Ergodic Theory (a) External Problems : How do we apply measure theoretic ergodic theory to other branches of mathematics and physics? In these applications one has a space X with some structure on it, and a map T of X which preserves this structure. To apply the theory of measure preserving transformations one needs an invariant measure for T which acts "nicely" with respect to the structure on X. For example, if X is a topological space we would like the measure to be a Borel measure which is positive on non-empty open sets. Examples : (1) Hamiltonian Mechanics : Here one has a one-parameter group of diffeomorphisms of a manifold and there is a smooth measure on the manifold preserved by each diffeomorphism. (2) Number Theory: To study continued fractions one studies T: [0,1) -> [0,1) given by: f 0 if x = 0 T(x) = < [{1/x} if x ?! 0} where the {y} denotes the fractional part of y. T preserves the Gauss measure on [0,1) which is given by: m(A) = lo -— [ — dx, A = [0,1). g 2 JA 1+x
20 (b) Internal Problems : The main internal problem in measure theoretic ergodic theory is: Given two measure preserving transformations when are they isomorphic? (i.e., when can we consider them to be the same?) We look for invariants. A property P is an invariant if when T, has the property P and T„ is isomorphic to T, then T„ has the property P. Invariants give good negative answers, i.e., if T, has property P and T„ does not, then T, and T„ are not isomorphic. The invariants we shall study are of two types:—spectral invariants and entropy. Before discussing the notion of isomorphism we shall introduce some general concepts such as recurrence and mixing. §3. Recurrence Theorem 1.2: (Poincaré Recurrence Theorem) Let T be a measure-preserving transformation of a probability space (X,B,m). Let E € B, m(E) > 0. Then almost all points of E return infinitely often to E under positive iteration by T, (in fact, we have the stronger result that: 3 F c E, m(F) = m(E) 3 if x € F 3 integers 0 < n1 < n2 ... 3 T -"-(x) € F V i). oo Proof: For N =: 0 let EN = U T~n(E). We have T-1(EN) = EN+1, n=N EN C EN-1 C EN-2 C ■•• C V E C V and m(EN+l) ^(T_1(EN)) = m(EN) since T is measure-preserving. Therefore, for each N, m(EN) = m(EQ) and, m( D EM) = m(En). N=0 N u O EM = 0 U T~nE, which is the set of all points entering E N=0 N N=0 n=N infinitely often under positive iteration by T. Moreover F = E n ( O EM) consists of all points of E which enter E N=0 N
21 infinitely often under positive iterates of T. Since O EM c En N=0 N U and both sets have the same measure, m(F) = m(E fl 0 E ) = m(E n E ) = m(E). N=0 N u It remains to show a point of F returns to F infinitely often. n ■ Let x € F, then 3 0 < n, < n ... 3 T -"-(x) € E V l. Consider T '(x). T '(x) € E and enters E infinitely often under positive iterates, namely, n„-n,, n.-n,, ... since T -1 l (T x(x))=T 1(x) €E; n, n-; thus T x(x) € F. Similarly one shows T (x) € F V l. // Remark: We do not need that T be measure-preserving in the hypothesis; we need only assume that T is incompressible, i.e., if B € B, T-1B c B then m(B) = m(T-1B). §4. Ergodicity Let (X,B,m) be a probability space and T: X -* X be a measure- preserving transformation. If T~ B = B for B € B, then T~ (X\B) = X\B and we could study T in two separate parts, namely T|B and tIv\r- If ° < m(B) < 1 this has simplified the study of T. We need a concept of irreducibility for measure- preserving transformations, such that if T has this irreducibility property then the study of T cannot be split into two parts as above. Ergodicity is such a concept. Also, we would like some way of splitting a measure-preserving transformation into ergodic parts in a canonical way. This can be done in reasonably well behaved measure spaces. (See Rohlin [3].) Definition 1.2: T: (X,B,m) -* (X,B,m) is ergodic if for B € B, T-1B = B =»m(B) = 0 or m(B) = 1.
22 Theorem 1.3: The following are equivalent for measure-preserving T: X -* X: (1) T is ergodic. (2) m(T-1BAB) = 0, B € B => m(B) = 0 or 1. (3) V A,B € B, m(A),m(B) > 0 3 n > 0 3 m(T~nA fl B) > 0. Proof: (1) => (2). Suppose m(T-1BAB) = 0. Let oo oo . B_ = O U t~1b « B- Then n=0 i=n B_ = (BO T-1B U T~2B U . . . ) fl (T-1B U T~2B U . . . ) fl (T~2B U T~3B U . . . ) fl . . . ; and therefore T~ B_ = B_ and m(B_) = m(B). Hence, by (1), m(B_) =0 or 1, and therefore m(B) = 0 or 1. (2) =• (3). Let m(A) > 0, m(B) > 0 and suppose (3) is false, i.e., V n > 0 m(T-nAriB) = 0. Then m(( U T"nA) fl B) =0. n = l oo Let A' = U T~nA. Then T-1A' c a' and m(A') = m(T A'); so that n=l m(T-1A'AA') = 0, and by (2) we have m(A') = 0 or 1. But T-1AcA' and T is measure-preserving, so that m(A ) = 1. But this contradicts the above fact that m(A' (IB) = 0. (3) => (1). Suppose (1) is false, i.e., 3 B € B, T-1B = B and 0 < m(B) < 1. Then m(T~nB fl (X\B)) =0 V n > 0 which contradicts (3). // A characterization of ergodicity in terms of functions is given by the following results.
23 Theorem 1.4: Let T: (X,8,m) -* (X,B,m) be measure-preserving; then the following are equivalent: (1) T is ergodic. (2) Whenever f is measurable and (foT)(x) = f(x) V x € X then f is constant a.e. (3) Whenever f is measurable and (foT)(x) = f(x) a.e. then f is constant a.e. (4) Whenever f € L2(m) and (f°T)(x) = f(x) V x € X then f is constant a.e. (5) Whenever f € L2(m) and (f°T)(x) = f(x) a.e. then f is constant a.e. Proof: (1) => (3). Suppose f is measurable and foT = f a.e.; while T is ergodic. We can assume that f is real-valued for if f is complex-valued we can consider the real and imaginary parts separately. Define X(k,n) = {x: k/2n £ f(x) < (k+l)/2n} k € Z, n > 0. We have T-1X(k,n)AX(k,n) c {x: (foT)(x) i f(x)} and hence m(T~ X(k,n)AX(k,n)) = 0 so that by (2) of Theorem 1.3 m(X(k,n)) =0 or 1. Fix n, then \J X(k,n) = X which is a disjoint union; so for k€Z each n 3 unique k 3 m(X(k ,n)) = 1. Let Y = (~) X(k ,n). n n ' , n n=l Then m(Y) = 1 and f is constant on Y so that f is constant a.e. Trivially we have (3) => (2) => (4), (5) =» (4), and (3) =» (5). So it remains to show: (4) => (1). Suppose T-1E = E, E € B. Then X-, € L (m) and
24 (XE°T)(x) = X£(x) V x € X so, by (4) X£ is constant a.e. Hence ■f m (E) = X^dm =0 or 1. // Note: A similar characterization in terms of L (m) functions is true, since in the last part of the proof X„ is in L (m) as well as 2 12 L (m). Also we could use real L (m) or L (m) spaces. Remark: The following remark comes later (Theorem 5.5) but we preview it now in order to analyze our examples. Let X be a compact metric space and m a Borel probability measure on X which gives positive measure to every non-empty open set. If T: X -* X is continuous and ergodic with respect to m then m({x | {Tnx | n 2: 0 is dense}}) = 1. Proof: Let {U } -, be a base for the topology of X . n'n=l eo ea ince {Tnx |nî0} is dense in X ~ x € (~) U T~kU . S n=l k=0 n ea ea T~ ( U T~ U ) c (_) T~ U and T is measure-preserving and ergodic k=0 n k=0 ea ea we have m( U T~ U ) = 0 or 1. Since U TU is a non-empty k=0 n k=0 n ea open set we have m( \J T~ U ) = 1. The result follows. // k=0 n Note that this result is applicable when m is Haar measure on a compact metric group and T is an affine transformation. Examples : We shall now see when the examples of § 1 are ergodic. (1) I on (X,B,m) is ergodic iff all members of B have measure 0 or 1. (2) Consider T: K -* K, T(z) = az. T is ergodic iff a is not a root of unity. Proof: Suppose a is a root of unity, then ap = 1 for some
25 p i 0. Let f(z) = zp; then clearly f i constant a.e. and f°T = f. 2 Conversely, suppose a is not a root of unity and foT =f, f €L (m). ee Let f(z)= ]T bz be its Fourier series. Then by above, n=-~ E b anzn = E b z11 and therefore b (an-l) = 0. If n t 0 then n n n b =0, and so f is constant a.e. // (3) Let T(x) = ax on a compact metric group G, then T is er- godic iff {a } _ is dense in G. In particular, T ergodic => n cZi G is abelian. Proof: Suppose firstly that Tx = ax is ergodic. By the above remark it follows that {a 'X-C]}7, is dense for some xn and so taI1}n n-r n-r is dense since if y € G 3 {n.} with a -"-x. -* yxn i.e. , a ± -* y. Conversely, suppose {a } .„ is dense in G. This implies G is abelian. Let f (. L2(m) and foT = f. By (7) of §5 of Chapter 0 f can be represented as 2_ b.y^) where y- i G. Then T. b.Y-(a)r-Cx) = ^- b.y(x) so that if b. i 0 then y(a) = 1 i i and so y. = 1. Therefore only the constant term of the Fourier series of f can be non-zero, i.e., f is constant a.e. // (4) For an endomorphism A of a compact metric group G necessary and sufficient conditions for ergodicity are known. When G is abelian, Halmos [4] and Rohlin independently proved that A is ergodic «» whenever y°A = y> n > 0, then y = 1• Before proving the general result we will illustrate the proof by showing that 2 A(z) = z on K is ergodic. Suppose foA = f, f € L (m). We have that if f(z) = Y. a z" n=-» then 1- a z = *— a z11 and therefore a = a„ = a,, = . . . . So n n n ^n 4n ea if n i 0 we must have a = 0 because 2_ la I < ». Therefore n ' n ' — ea f is constant a.e.
26 Proof of the general result : Suppose that whenever yAn = y we have y = 1> and let f°A = f with f € L (m). Let f(x) have the Fourier series *- a y where y € G and 2_ I a I < ». Then n n n ' n ' £- a y where y € G and 2_ I a I n n n ' n ' anYn(Ax) = 2_anYn(x)> so that if Yn > Yn°-A, Yn°A , ... are all distinct their corresponding coefficients are equal and therefore zero. So if a i 0, y_(A^) = y for some p > 0. Then y = 1 by assumption and so f is constant a.e. Conversely let A be ergodic and yA = y> n > 0. If n is the least such integer, f = y +yA + .•. +yA is invariant under A and not a.e. constant (being the sum of orthogonal functions), contradicting the ergodicity. // Consider now the case when G is the n-torus K . The ergodicity condition then becomes: A: K -* K is ergodic «• the matrix [A] has no roots of unity as eigenvalues. Proof: Recall that, under the identification Kn ^ Zn, if *!\ m / n € Z then [A] m = Â(m). If A is not ergodic then k k [A] m = m for some m ?! 0 and k > 0. Then [A] has 1 as an t - - - - t eigenvalue so [A] has a k-th root of unity as an eigenvalue. Conversely if [A] has a k-th root of unity as an eigenvalue then [A] has 1 as an eigenvalue so that [A] - I induces a singular linear transformation R . Hence [A] - I induces a many- to-one map of Z into Z and so there exists 0_ i m € Z with k [A] m = m. // So, for example, all the endomorphisms z-*z, |k|>l, of K are ergodic. Chu [1] has considered the case when G is nonabelian. He has shown that a continuous endomorphism of a compact group G onto itself is ergodic if and only if the induced map on the representation
27 ring R(G) has no finite orbit except the constant functions. The representation ring, R(G), is the ring generated by the coefficients of all irreducible unitary representations of G over the complex field. (5) For affine transformations of compact metric groups necessary and sufficient conditions for ergodicity are known. The simplest case is when G is a compact, connected, metric, abelian group. If Tx = a>A(x) is an affine transformation of the compact, connected, metric, abelian group G then the following are equivalent: (a) T is ergodic. (b) (i) Whenever Y°A = y f°r k > 0 then Y°A = Y, and, (ii) the smallest closed subgroup containing a and BG (where Bx = x-1-A(x) ) is G (i.e., [a,BG] = G). (c) -3 xQ € G with {Tn(xQ): n 2 0} dense in G. (d) m({x: {Tnx: n 2 0 is dense}}) = 1. (Note that conditions (i) and (ii) reduce to the conditions given in (3) and (4-) in the special cases. The equivalence of (a) and (b) was investigated by Hahn, Hoare and Parry.) Proof: First note that B is an endomorphism of G and commutes with A. (b) => (a). Suppose (i) and (ii) of (b) hold. If foT = f, f € L2(m) let f = L biY-, Yi € G. Then £ biYi(a)Y;L(Ax) = £ biYi(x). (*) 2 So if Yi, Y-°A, Y-°A , ... are all distinct then b, = 0 or else YL |b.| < » is violated. Hence, if b. i 0 then Y.°An = Y- for some n > 0, and by (i) YoA = Y. But then (*) implies Y.(a) = 1 and so Y^(x) =1 V x € [a,BG] and by (ii) Y. =1. So f is constant a.e. (a) => (d). This follows by the remark above.
28 (d) => (c) is trivial. (c) => (b). It remains to show that if 3 x. € G with {T xQ: n 2; 0} dense in G then conditions (i) and (ii) of (b) hold. k ~ k Suppose y°A = Y» k > 1, y € G. Let y-, = Y°B. Then YtCT x) = k-1 k -1 k Y-jCa-Aa- . . . -A a)Y (A x) = Y(a a a)Y.(x) = Y (x). Hence Y assumes only the finite number of values Y..(xQ), ri(Txn)> •••> Y-.CT xn) on the dense set {T x. : n £ 0} and hence assumes only these values on G. Since G is connected y-, must be constant, i.e., Yn = 1» Hence YA = Y and condition (i) holds. If Ca,BG] ?! G 3 y * 1. Y « G, with Y(a) = 1, and Y(Bx) = 1. Then Y(Tx) = y(*) and so y assumes only the value Y(xn) on the dense set {T xn: n 2 0} and therefore y is a constant. Hence Y s 1, a contradiction, and we have shown that condition (c) implies (ii). // When G is Kn the equivalence of (a) and (b) becomes: T =a-A is ergodic iff (i) the matrix [A] has no proper roots of unity (i.e., other than 1) as eigenvalues, and (ii) [a,BKn] = Kn. This is easily proved by a method similar to the one used in (M-) for the endomorphism case. Conditions for ergodicity of affine transformations of compact nonabelian groups may also be found in Chu [1]. (6) The 2-sided (pQ,... ,p, _1)-shift is ergodic. Proof: Let A = the algebra generated by finite unions of measurable rectangles. Suppose T E = E, E € B. Let e > 0 be given, and choose A € A 5 m(EAA) < e; thus |m(E)-m(A)| = |m(EHA) + m(E\A) -m(AflE) - m(A\E) | < m(E\A) + m(A\E) < e.
29 Choose n so large that B = T nA depends upon different coordinates 2 from A; so, m(B(lA) = m(B)m(A) = m(A) . m(EAB) = m(T~nEAT~nA) = m(EAA) < e and since EA(AflB) c EAA U EAB we have m(EA(AnB)) < 2e , hence |m(E) - m(Afl B) | < 2e and |m(E)-m(E)2| £ |m(E) - m(A nB)| + |m(AnB) -m(E)2| < 2E + |m(A)2 -m(E)2| £ 2e + m(A)|m(A) -m(E)| +m(E)|m(A) -m(E)| < te 2 since m(A),m(E) s 1. Since e is arbitrary m(E) = m(E) which implies that m(E) =0 or 1. // (7) By a similar argument, we see that the 1-sided (p.,... ,p, _,)- shift is ergodic. §5. The Ergodic Theorem The first major result in ergodic theory was proved in 1931 by G. D. Birkhoff [1]. Theorem 1.5: (Birkhoff Ergodic Theorem) Suppose T: (X,B,m) -* (X,B,m) is measure-preserving (where we allow (X,B,m) to be a-finite) and f €Lx(m). Then - £ f(T1(x)) n i=0 converges a.e. to a function f* € L (m). Also, f*oT = f* a.e., and if m(X) < -, f f*dm = f fdm.
30 Note: If T is ergodic then f = a constant a.e. and if m(X) < » (X) J ,,,, , f dm a.e. m(I Motivation: (i) Suppose T: (X,8,m) ■* (X,B,m) is measure-preserving and E € B. For x € X, we could ask with what frequency do the elements of the 2 set {x, T(x), T (x), ...} lie in the set E? Clearly T1(x) € E iff XET1(x) = 1, so the number of elements , n-1 , of {x, T(x), . .., T (x)} in E is Z X T (x); and so the rela- k=0 L tive number of elements of {x, T(x), ..., Tn~ (x)} in E equals - Y X TK(x). If m(x) = 1 and T is ergodic then - T_ X„T1(x) -> ni=0 ' n i=0 L m(E) a.e. by the note; and thus the orbit of almost every point of X enters the set E with asymptotic relative frequency m(E). (ii) We define the time mean of f to be n-1 lim - E f(T1(x)) and the phase or space mean of f to be f(x)dm. -i-f (i(X) J- X The ergodic theorem implies these means are equal if T is ergodic. (The converse is also true.) So, it is important to verify ergodicity for transformations arising in physics. This application to time means and space means is more realistic in the case of a 1-parameter flow {T.} of measure-preserving transformations. The ergodic theo- 1 fT 1 rem then asserts lim =- f(T x)dt exists a.e. for f € L (m) and T-*~ l h r equals m/v\ fdm i-n the ergodic case if the map (t,x) -* T.x -is
31 measurable. An Application to Number Theory Borel's Theorem on Normal Numbers : Almost all numbers in [0,1) are normal to base 2, i.e., 1_ , I the number of l's in the first n digits \ _^ 1_ n \ of the binary expansion of x € [0,1) J 2 a.e. Proof: Let T: [0,1) -* [0,1) be defined by T(x) = 2x mod 1. We know that T preserves Lebesgue measure and is ergodic, by example 4- at the end of §4. al a2 . • Suppose x = — + —=- + ... has a unique binary expansion. Then 2 2 /al a2 a3 \ a2 a3 T(x) = T — +-i-+-4+...=-£• +-|+... . Let f (x) = \ 2 2 2 / 2 2 Xr, ,n(x). Then f1 iff a. ., = 1 ' a. ,., a. ._ \ l+l ftécx)) = f(îi+A + îiîi+ ...) iff ai+1 Hence, the number of l's in the first n digits of the dyadic ex- n-1 pansion of x is £ f(Tx(x)). Dividing both sides of this equality i=0 by n and applying the ergodic theorem we see that -, n-1 . /■ -, - Z f(T x) ► Xr, n.dm = =■ n .t"n a.e. J [Jj,l) 2 (using the fact that the binary-rational points form a set of Lebesgue measure zero). // The ergodic theorem can be applied to give other number theoretic results. Some are obtained in Billingsley [1] and Avez-Arnold [1], We now consider some preliminaries to the proof of the ergodic theorem.
32 Definition 1.3: Let T: (X,8,m) -* (X,8,m) be measure-preserving. Define an operator U_ on complex-valued functions on X by: (UTf)(x) = f(T(x)). We have UTLp(m) c L^(m) and, since T is measure-preserving llU-fll = ||f|| . Let L£(m) denote the real-valued Lp(m) functions, 1 p p K then UTL^(m) c L^(m). To prove Birkhoff's theorem we need: Theorem 1.6: (Maximal Ergodic Theorem) Let U: LR(m) -* L_(m) be a positive linear operator (i.e., f£ 0 » Uf ; 0) which has norm £ 1. Let N > 0 be an integer. Define fn = 0, f = f + Uf + U2f + ... + Un-1f, and F„ = max f ï 0. 0 n ' N Q£n£N n Then fdm 2: J{x:FM(x)>0} 0. Proof: (due to A. Garsia) Clearly F.. € LR(m). We have for 0 < n £ N F.,ïf so, UFN 2: Uf by positivity, and hence UFM + f > f .. . Therefore N n+1 UFM(x) + f(x) > max f„(x) N IsnsN n = max f (x) when F.,(x) > 0 OsnsN n N FN(x). Thus f > FN - UFN on A = {x: F.,(x) > 0}, so
33 I f dm > [ F dm - [ UFMdm - \x FNdm " |A UFNdm - I FNdm " L UFN' since F., = 0 on X\A. N dm since F., î 0 => UFN 2 0. > 0 since IIUII < 1. // Remark : The conditions of Theorem 1.6 hold if U = U_ for measure- preserving T. Corollary 1.6 : Let T: X -* X be measure-preserving. If g € LR(m) and n n_1 ES = {x €X: sup ±. E g(Tm(x)) > a} then n>l n m=0 g dm > am(B (1A) JB HA a a if T-1A = A and m(A) < -. Proof: We first prove this result under the assumptions m(X) <» and A = X. Let f = g -a, then B = U {x: F.,(x) > 0} so that a N=0 N fdm > 0 by Theorem 1.6 and therefore g dm > am(B ). In Jb jb a a a the general case, using T|. in the place of T we see that gdm î am(A PI B„) . // a 'AflB a Proof of Birkhoff's Theorem: It suffices to prove the theorem ]_ , n-1 for f € L_(m). Let f*(x) = lim - £ f(T1(x)) and f,(x) = R n n i=0 n-1 lim è Y. f(T1(x)). We have f*oT = f*, f.°T = fA because if n i=0
34 a„(x) = i- L fCT^-x) then (^) a„.. (x) - a(Tx) = ^^- . For real n n .~_ V n ; n+1 n n numbers j} < a, let E„ B = {x€X: f.(x)<p, a<f*(x)}. a ,p ft Then T^E^p = E^ and E^ n MX: sup ^ fCT^x)) > a} -- E^. We now prove that m(E B)<" so that we can apply Corollary 1.6. a j p Suppose a > 0. Let C c E „ with m(C) < -. Then h = f - aX„ a ,p L is integrable and by the maximal ergodic theorem (f -aX )dm î 0. (HN defined analogously to the F.. in the U {x:H'(x)>0} N = 0 maximal ergodic theorem.) But C c 1J {x: HM(x)>0} so that N=0 N |f|dm ^ am(C). Therefore m(C) s — |f|dm for every subset JX a Jx of E Q with finite measure and hence m(E .)<». If a<0 then a, p a,p (3 < 0 so we can apply the above with -f and -p replacing f and a to get m(E .) < ». Œ j p n-1 Let B = {x € X: sup i £ fCT^O > a}. Then by Corollary 1.6: a n>l i=0 f dm = fdm > am(E .(IB ) = am(E D) , i.e. , Je q Je 0ob a'P a a'P a, p a, p a [ fdm > am(E .) . (*) Je a'P If we replace f,a,p by -f,-p,-a respectively we get that (-f)* = -f:i, (-f)ft = -f* and I fdm s pm(E B). (**) J -p aiP o,p
35 So, if a > p then m(E a) = 0, and since {x: fft(x)<f*(x)} ç (J Eajp a, (3 rational we have m{x: fft(x)<f*(x)} = 0 i.e., f*(x) = f (x) a.e. Therefore n-1 -i 11 X . — Yl f(T1(x)) converges a.e. i=0 ft 1 To show f € L (m) we use the part of Fatou's Lemma that says for non-negative iiytegrable functions g lim g dm < » implies lim g is integrable. Let gn(x) i Z f(T1(x)) n i = 0 Then | gndm =| ^1 f(T1(x)) dm < f |f I dm so that lim g dm < », and by Fatou's Lemma lim g = If I is J n n ' ft ' integrable. Hence fft is integrable. It remains to show that f dm = f*dm if m(X) < •». Let D^ = {x €X: -<f*(x)<^i} where k € Z, nil. For each small k n n ' e > 0 we have D, fl B , = D? and by Corollary 1.6 (H-e) I fdm > (--e)m(DÎ1) so that _ n k fdm > - m(D^) n k (Aftft) Then [ f*dm £ *±Im(Dn) < I„()f) + f fdm (by (***)) J ^n '„n Summing over k we get that
36 L f*dm £ 2L12LL + | f dm V n > 1: thus f*dm < fdm since m(X) < -. Applying this to -f instead JX JX of f gives (-f) dm £ -fdm i.e. , - fAdm - - fdm. Since Jx Jx Jx * Jx f = f* a.e. we get that f f*dm =: * Jx '* Hence, j f*dm = [ fdm. // Corollaries 1.5: (i) Let (X,B,m) be a probability space and T: X -* X measure- preserving, then T is ergodic iff V A,B € B i n_1 - Y. m(T XA n B) - m(A)m(B). n i=0 Proof: (=>) Suppose T is ergodic. Putting f = X. in Theo- 1.5 gives i £ X.(Tx(x)) -> m(A) a.e. Multiplying by Xn: n i=0 A B , n-1 i- JT XA(T1(x))XB -> m(A)XB a.e. By the dominated convergence theorem if we integrate we get n-1 - 5~ m(T XA n B) -> m(A)m(B). n i=0 (=) Let T E = E, E € B. Let A = B = E. Then - £ m(E) n i = 0 2 2 m(E) so m(E) = m(E) , hence = 0 or 1. // (ii) L^ Ergodic Theorem: (Von Neumann [1], [2]) Let 1 £ p < ». Let T be measure-preserving on the probability space (X,B,m). If f € Lp(m), 3 f* € Lp(m) 5 f*oT = f* a.e. n n-1 and || i Z fCT^-x) - f*(x) || - 0.
37 Proof: If g is bounded and measurable then g € Lp V p and 1 n_1 i by the ergodic theorem we have that — £_ g(T x) -* g (x) a.e. n i=0 Clearly g* € L°°(m) and hence g* € Lp(m). Also, I — Z! g(T1x) - g*(x) | -* 0 a.e. and by the bounded convergence n i=0 n theorem, || - T g(T x) - g*(x)|| -+ 0 i.e., V e > 0 3 N(e,g) 3 n i=0 P if n > N(e,g) and k > 0 i n_1 ,• n n+k-1 i Z g(Txx) - -4- I gcrSon < n i=o n K i=o p , n-l Let f € Lp(m), and M (f)(x) = - T f(T1x). We must show that n n ito {M (f)} is a Cauchy sequence in Lp(mJ. Note that ||M (f)|| < ||f|| . n n p p Choose g € L°°(m) 3 ||f - gjj < eA; then P llMnf-Mn+kf"p £ "Mnf -"n^llp + »MnS " Mn+kS"p + "Mn+kS " Mn+kf «p £ e/4- + e/2 + e/4- = e if n > N(e/2,g) and k > 0. We have f*oT = f* a.e. because (^)(M_inf)(x) - (M f)(Tx) - -(X- n n+1 n n §6. Mixing We have seen that T is ergodic iff V A,B € B, N-l i £. m(T ^OB) -> m(A)m(B). N i=0 Definitions 1. 4- : (i) T is weak-mixing if V A,B € B N-l èr H |m(T XA PI B) - m(A)m(B) | -> 0. N i=0
38 (ii) T is strong-mixing if V A,B € B m(T"NA n B) -> m(A)m(B). Note: (i) T strong-mixing =» T weak-mixing. (ii) T weak-mixing => T ergodic. This is so because if {a } is a sequence of real numbers then -. n-1 -, n-1 a -> 0 =» i Y |a.| -> 0 => - Y a. -+ 0. n i=0 x n i=0 x (Put a = m(T~nA flB) - m(A)m(B).) n (iii) An example of an ergodic T which is not weak-mixing is given by T(z) = az on K, where a is not a root of unity. (See the end of this section for the proof.) (iv) There are examples of weak-mixing T which are not strong- mixing. Kakutani has an example constructed by combinatorial methods, and Maruyama constructed an example using Gaussian processes. Chacon and Katok-Stepin also have examples. Indeed, if (X,B,m) is a probability space, let t(X) denote the collection of all invertible measure-preserving transformations of (X,B,m). If we topologize t(X) with the "weak" topology (see Halmos [2]), the class of weak- mixing transformations is of second category while the class of strong mixing transformations is of first category. The following result shows it suffices to check the convergence properties on an algebra generating B. Theorem 1.7 : If T: X -* X is measure-preserving and A is an algebra generating B then
39 (i) T is ergodic iff 'V A,B € A n-1 - Z m(T-1AnB) ■* m(A)m(B), n i=0 (ii) T is weak-mixing iff V A,B € À , n-1 - £ |m(T-1AnB) - m(A)m(B)| -> 0, and n i=0 (iii) T is strong-mixing iff V A,B € A m(T~nAnB) -> m(A)m(B), Proof: Let e > 0 and E,F € B. Choose En>Fn € A with m(EAEQ) < e, m(FAFQ) < e. Then and therefore Hence m((T nE nF)i(T_nEQ fl FQ)) < 2e |m(T"nE HF) - m(T nEQ fl FQ) | < 2e. n n_1 -v =■ Z m(T E HF - m(E)m(F) n k=0 |i £ Cm(T KE flF) - m(T"KEn f1Fn)] '0 " '0' - l£ jï. m(T-kE flF > - m(E0)m(F0) n n_1 + \~ T. m(E0)m(FQ) - m(EQ)m(F)| n-1 + |i I m(En)m(F) -m(E)m(F)| n k= 0 ° n-1 - 2e + | ^ L m(T-kE0 fl FQ ) - m(E0)m(FQ) j + £ + e for each n. (i) follows by the known behavior of the right hand side
to of this inequality. To prove (ii) first show |m(T"kEnF) - m(E)m(F)| £ te + |m(T""kEQ fl FQ) -m(E0)m(FQ)| (*) 1 n_1 by leaving out " — H " from the above inequalities, and then take n k=0 Cesaro averages of each side of (*). (iii) follows immediately from the inequality (*). // Theorem 1.8: If {a } is a bounded sequence of real numbers then the following are equivalent: n-1 (1) i I |a£| - 0. n i=0 1 (2) 3 J c Z , J of density zero, i.e., cardinality (J H{0,1,■■■ ,n-l}) n ) - », such that lim a = 0 provided n i J. n n (3) J Z Kl2 - 0. n i = 0 1 Proof: If M c Z let aM(n) denote the cardinality of {0,1,...,n-l} n M. (1) = (2). Let Jk = {n € Z+: | an | > £•} (k > 0). Then i=0 — j- a. (n). Therefore there exist integers 0 = ZQ < Z-. < Z„ J, c J c . . . . Each Jv has density zero since such that for n t t, , 1 , -, 1 — aT (n) < rpr . n Jk+1 k+1 Set J = U [Jk+1 H C^v>^k+i^' We now show that J has density
m zero. Since J c j c ..., if I < n < ^k+1 we have j n [o,n)=[jn[o,y] u CJ n [£k,n)] c [jkn [o,-£k)] u [Jk+1 n [o,n)], and therefore i- aT(n) S kaT (.1.) + aT (n)] £ ka (n) + aT (n)] < i + -i_ . n J n Jk k Jk+1 n Jk Jk+1 k k+1 Hence — aT(n) ■* 0 as n •*■ ~, i.e., J has density zero. If n J n > I. and n t J then n / J, ., and therefo k k+1 re lan' < k+T* Hence lim la I = 0. t-» n ' J/n-*» (2) => (1). Suppose |an| £ K V n. Let e > 0. There exists N such that n î N , n i J imply |a | < e, and M such that cij(n) n î M implies < e. Then n 2 max(N ,M ) implies £ n £ £ i n-1 - I |a.| n iTo l! I |aj + L |a.| jn{0,l,.-. ,n-l} x if?jn{0,l,...,n-l} 1 < - aT(n) + e < (K+1)e. n J (1) «» (3). By the above it suffices to note that lim |a | = 0 Jjin-*» iff lim |a | = 0. // Jj>n— n Corollary 1.8 : T is weak-mixing iff V A,B € B 3 J(A,B) of density zero in Z+ 5 lim m(T"nAnB) = m(A)m(B) iff V A,B € B n/J(A,B) n-1 i- Y. |m(T_:LA HB) - m(A)m(B)|2 -> 0. i=0 Remark : To say T is strong-mixing means that any set B € B as it
42 moves under T becomes, asymptotically, independent of a fixed set A € B. T is weak mixing means B becomes independent of A if we neglect a few instants of time. T is ergodic means B becomes independent of A on the average. The next result expresses the mixing concepts in functional form. Recall that UT is defined on functions by Umf = f°T. Theorem 1.9 : Suppose (X,B,m) is a probability space and T: X -* X is measure-preserving. Then (a) T is ergodic iff for all f,g € L (m) n n_1 ± Z (U^f.g) -> (f,l)(l,g) n i=0 T iff for all f € L2(m) i n"1 i Z (l£f,f) -> (f,l)(l,f). n i = 0 i 2 (b) T is weak-mixing iff for all f,g € L (m) i Z |(U™f,g> - (f,l)(l,g) n i=0 iff for all f € L2(m) - Z |(i£f,f) - (f,i)d,f) n i=0 iff for all f € L2(m) i- Z l(UJf.f) " (f,D(l,f)|2 n i=0 i (c) (1) T is strong-mixing iff (2) for all f,g € L2(m), (Ui£f,g) - (f,l)(l,g) iff (3) for all f € L2(m), (U^f.f) - (f,l)(l,f),
43 Proof: (a), (b), and (c) are proved using similar methods. We shall prove (c) to illustrate the ideas. Slight modification of this proof will prove (a) and (b). (2) => (1). This follows by putting f =XA> g=*B> for A,B € B. (1) => (3). We easily get that for any A,B € B, (UTX.,XR) -> (XA,1)(1,XR). Fixing B, we get that (u"h,XR) -> (h,l)(l,XR) for any simple function h. Then, fixing h, we get that (U_h,h) T A' B' T..,/»B/ - vi.,J./vl,XB) mh (h,l)(l,h). So (3) is true for all simple functions. o Suppose f € L (m) , and let e > 0. Choose a simple function h 3 ||f-h|L < e, and choose N(e) so that n > N(e) implies |(u£h,h) - (h,l)(l,h)| < e. Then if n>N(e) .|(u!Jf,f) - (f,l)(l,f)| £ | (u£f,f) - (u£h,f)| + |(u£h,f) - (u£h,h)| + |(u£h,h) - (h,l)(l,h)| + |(h,l)(l,h) - (f,l)(l,h)| + |(f,l)(l,h) - (f,l)(l,f)| £ |(UT(f-h),f)| + |(UTh,f-h)| + e + | (l,h)| | (h-f,l) I + |(f,l)| |(l,h-f) | < ||f-h||2||f||2 + ||f-h||2||h||2 + e + ||h||2||f-h||2 + ||f||2||h-f||2 by the Schwartz inequality £ e||f||2 + e(||f|l2 +e) + e + (||f||2 +e)e + e||f||2. Therefore (UTf,f) -> (f,l)(l,f) as n -> -. (3) => (2). Let f € L2(m) and let Hf denote the smallest 2 (closed) subspace of L (m) containing f and the constant functions and satisfying UTHf c Hf. Ff = {g € L2(m): (U^f,g) -> (f,l)(l,g)} 2 is a subspace of L (m), is closed, contains f and the constant
44 functi ons and is U™ invariant so i"t contains H^ • If g x H^ "then (U^f.g) = 0 for n î 0 and (l,g) = 0 and therefore H^ c Ff. Hence Ff = L2(m). // Definition 1.5: If T: X -* X is measure-preserving, define TxT: XxX -* XxX by (TxT)(x,y) = (T(x),T(y)). This is a measure-preserving transformation on (XxX,8x8,mxm) by Theorem 1.1 since it is measure-preserving on measurable rectangles and hence on finite disjoint unions of such rectangles. Theorem 1.10 : If T is a measure-preserving transformation on a probability space X then the following are equivalent: (1) T is weak-mixing. (2) TxT is ergodic. (3) TxT is weak-mixing. Proof: (1) => (3). Let A,B € B, C,D € B. 3 J±^2 of density zero such that lim m(T"nAriB) = m(A)m(B) n-*» lim m(T-ncriD) = m(C)m(D). n(J2 n-*» Then lim (mxm){(TxT) n(AxC) fl ( BxD) } = lim m(T_nA n B)m(T nC n D) n^JjUJ2 n<JjUJ2 = m(A)m(B)m(C)m(D) = (mxm)(AxC)(mxm)(CxD). Thus the proper relationship holds for rectangles and hence for finite disjoint unions of these rectangles. These we know form an algebra F
45 which generates the a-algebra B. By Corollary 1.8 we have , n-1 - Z |m(T-1AnB) - m(A)m(B)| -* 0 V A,B € F and the result follows n i=0 by Theorem 1.7. (3) ■=> (2) is clear. (2) => (1). Let A,B € B. We have that , n-1 _. , n-1 - Z m(T 1AnB) = - Y. (mxm)((TxT)-:L(AxX) H (BxX)) n i=0 n i=0 -* (mxm)(AxX)(mxm)(BxX) by (2) = m(A)m(B). Also -, n-1 . „ . n-1 £ Z (m(T-1A flB)r = - E (mxm)((TxT)-:L(AxA) n (BxB)) n i=0 n iTo (mxm)(AxA)(mxm)(BxB) by (2) = m(A)2m(B)2. Thus -, n-1 . „ =- Z {m(T_:LA flB) -m(A)m(B)r n i=0 k Z {m(T-1AnB)2 -2m(T XA nB)m(A)m(B) +m(A)2m(B)2} n i=0 2m(A)2m(B)2 - 2m(A)2m(B)2 Therefore T is weak-mixing by Corollary 1.8. // Definition 1.6: Let T: (X,B,m) -* (X,B,m) be a measure-preserving transformation on a probability space. We say that X is an eigenvalue of T, (X € C) if 3 f i 0 € L2(m) 3 UTf = Xf in L2(m); i.e., f(Tx) = Xf(x) a.e. We call f an eigenfunction corresponding to X.
46 Remarks : (i) If X is an eigenvalue of T then |\| =1 since ||f||2 = ||uTf||2 = (uTf,uTf) = (\f,\f) = |x|2||f||2. (ii) X = 1 is always an eigenvalue corresponding to any constant eigenfunction. Definition 1.7 : We say that T: X -* X has continuous spectrum if 1 is the only eigenvalue of T and the only eigenfunctions are the constants. Observe that T has continuous spectrum iff X = 1 is the only eigenvalue and T is ergodic. We shall need the following result from spectral theory to prove the next theorem. The proof can be found in Halmos [3]. Spectral Theorem for Unitary Operators : Suppose U is a unitary operator on a complex Hilbert space H. Then V f € H, 3 a unique finite Borel measure \if on K 5 (Unf,f) = [ Xnd|^(X) V n € Z. JK r If T is an invertible measure-preserving transformation then Um is unitary, and if T has continuous spectrum then M.f has no atoms for all f € L (m) with (f,l) = 0. Theorem 1.11: If T is an invertible measure-preserving transformation of a probability space then T is weak-mixing iff T has continuous spectrum. Proof: (=>). Suppose fT = Xf a.e., f € L2(m). If X ?! 1 then integration gives (f,l) = 0 and by the weak-mixing property
1*7 , n-1 i I |(l£f,f)| - 0 i.e. , n i = 0 i n-1 n i=0 Since |X1| = 1, this gives (f,f) = 0 and therefore f = 0 a.e. If X = 1 then f = constant a.e. by the ergodicity of T. (=). Suppose T has continuous spectrum. We show that if f € L2(m) then , n-1 . „ £ I! |(U^f,f) - (f,l)(l.f)| - 0. n i=0 i If f is constant a.e. this is true. Hence, all we need to show is that (f,l) = 0 implies k% i«4f,f)i2 - o. By the spectral theorem it suffices to show that if p._ is a continuous (non-atomic) measure on K then n-1 £ f if ^dMX)!2 - 0. n i=0 J r We have n-1 r on n-1 \ L if xVfU)|2 = i. £ ^J X^^CX) • J X ^(X)) = ^ Zo(f xid"fU) ■ I T"id^f(T)) = J7 .Z (XT)1d(|j.fxM.f.)(X,T) (by Fubini's Theorem) 1-0 KxK \\ ( H- .? C^)1) d(nfxnf)(X,T). KxK
48 If (\,t) is not in the diagonal of KxK then -> 0 as n -* ». Since |j.f has no atoms ( |j.fX|j.f) (diagonal) = 0 and therefore the integrand ■* 0 a.e. The integrand has modulus £ 1, so that we can apply the bounded convergence theorem to obtain the result. // We now investigate the mixing properties of the examples mentioned in §1. Examples : (1) I = identity on (X,B,m). I is ergodic iff all the elements of B have measure 0 or 1 iff I is strong-mixing. (2) T(z) = az on K. T is never weak-mixing since if f(z) = z then f(Tz) = f(az) = af(z) and f t constant. (This has used the trivial part of Theorem 1.11.) (3) No rotation on a compact group is weak-mixing. We have already mentioned that if T is ergodic then the group G is abelian; and then if Tx = ax and y is any character of G we have y(Tx) = Y(a)y(x), which shows that T does not have continuous spectrum. (M-) Endomorphisms of compact metric groups are strong-mixing iff weak-mixing iff ergodic. Proof : We shall give the proof when G is abelian. It suffices to show that if the endomorphism A: G -* G is ergodic then A n is strong-mixing. If y,6 * G then (U-y,6) = 0 eventually unless n Y = 6 = 1. So always (U y,6) -* (y,l)(l,6). Fix 6 € G. The collection H5 = {f € L2(m): (U^f,6) -> (f ,1) (1,6 ) } 2 is a closed subspace of L (m). (To check H is closed, suppose t n-1 n i = 0 n 1 - (Xt)1 1 -(Xt)
49 f, € H and f, -* f € L2(m). For 6=1 it is clear that H, = k k 6 L2(m). So suppose (1,6) = 0. Then |(U^f,6)| < |(u£f,6) - (U^fk>6)| + |(U^fk>6)| S llf-fkll2l|6|l2 + |(UAfk,6)| (by the Schwarz inequality) |f-fkll2+ KUAfk'6)| < e if n 2 N(e) where k is chosen so that ||f-fkH2 < e/2 and N(e) is chosen so that n > N(e) implies |(U?f.,6)| < e/2.) Since H ^ 2 2 contains G, it is equal to L (m). Fix f € L (m) and consider 2 n Lf = {g € L (m) : (UAf,g) -* (f,l)(l,g)}. Lf is a closed subspace of 2 * • 2 L (m), contains G by the above, and so equals L (m)• Hence A is strong-mixing. // (5) An affine transformation T = a-A on a compact metric abelian group is strong-mixing iff it is weak-mixing iff A is ergodic. Proof: We shall give the proof in the case when G is connected. Let Bx = x~ A(x) and recall that T is ergodic iff (i) y°A = y k > 0 implies y°A = y, a^d (ii) [a,BG] = G. If A is ergodic then BG = G since the endomorphism B of G is one-to-one. Choose b € G so that B(b) = a. Put 4>(x) = bx; then 0T = A0. ^ preserves Haar measure m on G and hence induces 2 a unitary operator U on L (m). We then have that U„ and U. are isomorphic as Hilbert space operators by the induced conjugacy vu* -- vv Now A is strong-mixing by (M-), and hence U. satisfies the conditions of part (c) of Theorem 1.9. But then UT satisfies these conditions, and hence T is strong-mixing.
50 Conversely if T is strong-mixing and A is not ergodic then by (i) y°a = Y for some y i 1. But then |(U?Y.Y)| = |(Y(a)Y(Aa)...Y(An"1a)Y,Y)| = HyIIj = X which does not converge to |(y,D(1,y)| = 0> contradicting the mixing of T. So if T is strong-mixing then A is ergodic. // (6) The 2-sided (p.,...,p, _.)-shift is strong-mixing. This is proven by doing the easy verification on measurable rectangles, then on their disjoint finite unions, and then applying Theorem 1.7. (7) Similarly, the 1-sided (pn,...,p, _.)-shift is strong-mixing.
Chapter 2 : Isomorphism and Spectral Invariants § 1. Isomorphism of Measure-Preserving Transformations What should we mean by saying that two measure-preserving transformations are the "same"? We must bear in mind that sets of measure 0 do not matter from the point of view of measure theory. Examples : 2 (1) Let T be the transformation Tz = z on the unit circle K with Borel sets and Haar measure, and let S be given by Sx = 2x mod 1 on [0,1) with Borel sets and Lebesgue measure. Consider the map <j> : [0,1) ■* K defined by x-*enx. 4> is a bijection and preserves measure (check on finite unions of intervals and use Theorem 1.1). Also <t>S = T*. So, we want to regard T and S as the "same". (2) Again, let S be the transformation Sx = 2x mod 1 on [0,1) with Borel sets and Lebesgue measure, and let T„: X -* X be the 1-sided (Jj,Jj)-shift. Define \|r : X -* [0,1) by a a„ a ^(a1,a2,a3,...) = _ + -^ + _ + . . . . \|r is not one-to-one only at points (a. ,a.,...) whose coordinates are constant eventually. \|r, though, is onto and \|rT„ = S\|r. Also \|r preserves measure; we can check this out on dyadic intervals and then on their finite disjoint unions and apply Theorem 1.1. Suppose D is the set of points of the space X of the 1-sided (Js,Js)-shift which have constant coordinates eventually. Then T„ D = D and so T~ (X\D) = X\D. Let D„ consist of the dyadic rationals in [0,1). Then S-1D2 = D2, so that S-1([0,1)\D2) = [0,1)\D2.
52 We see that the diagram T X\D - > X\D \|r I I \|c 1:1 onto [0,1)\D2 —^—> [0,D\D2 commutes. We would like to consider these transformations as isomorphic since, after removing sets of measure zero, we can throw one to the other by an invertible measure-preserving transformation. Definition 2.1: Suppose (X,,B..,m,) and (X2,B2,m„) are probability spaces together with measure-preserving transformations T, : X, -* X,, T„: X„ -* X„. We say that T, is isomorphic to T„ if 3 M, € B , m1(M1) =1, M2 € B2, m2(M2) = 1 3 (i) T1M1 ^ M]_, T2M2 i M2, and (ii) 3 an invertible measure-preserving transformation 4>: M]_ -> M2 3 *T1(x) = T2«(x) V x € M]_. We write T, = T2. (In (ii) the set Mi (i=l,2) is assumed to be equipped with the a-algebra Mi fl Bi = {M. fl B | B € B^ and the restriction of the measure m. to this a-algebra.) Remarks : (a) ^ is an equivalence relation. (b) T± ^ T2 =• Tn ^ Tn, V n > 0. (c) If T, and T„ are invertible we can take M1,M2 so that k k 1M1 = Ml' T2M2 = M2' we 3ust take H TiMi' H T2M2 as the new T sets .
53 § 2. Conjugacy of Measure-Preserving Transformations Although the notion of isomorphism, introduced above, is useful in practice the following is mathematically more natural. Given (X,B,m) we define an equivalence relation on B by saying that A ^ B iff m(AAB) = 0. Let B denote the set of equivalence classes. g is a Boolean a-algebra under the operations induced from the usual operations on B. m induces a measure m on 8. We call (B,m) a measure algebra. Note that for B € 8 , X„ 2 is a uniquely defined member of L (m). Suppose T: X -* X is measure-preserving. If A ^ B then T-1A ^ T~ B; so we have a map T~ : B -* B which is defined by T~ (?) = î^_B- T. preserves unions, intersections, and complements, and m(T-1B) = m(g). Definition 2.2: A map * : (B„,m„) -* (8. ,51.) of measure algebras is called an isomorphism of measure algebras if * is a surjective bijection and preserves complements and countable unions and n^CSCBj)) = m2(g2) V §2 € g2. Definition 2.3: We say that T±: X± ■* X±, T2 : X2 -♦ X2 are conjugate if 3 measure algebra isomorphism * : (B„,m„) ->■ (B,,^) such that -1 -1 *T2 = T1 *. Remarks : (1) Conjugacy is an equivalence relation. (2) T1 ^ T2 =» T1 and T2 are conjugate. Just let * = <t> , which is uniquely defined although <t> is not defined on the whole of X,.
54 In Lebesgue spaces (i.e., probability spaces isomorphic to a sub- interval of [0,1] with Lebesgue measure possibly together with countably many atoms) conjugacy implies isomorphism. In particular, a compact separable metric space with a completed Borel measure is a Lebesgue space. §3. Spectral Isomorphism Suppose T: X ■* X is a measure-preserving transformation on a 2 2 probability space (X,B,m). We have defined U_: L (m) -* L (m) by f h- foT, and noted that V f,g € L2(m) we have (U f,U„g) = (f,g). Also, if T is one-to-one, U_ is unitary. A spectral property of T is a property of U_. Definition 2.4: Measure-preserving transformations T, on (X-,8.,m1), and T„ on (X„,B„,m„) are spectrally isomorphic if 3 a linear operator 2 2 W: L (m2) -* L (m ) such that (i) W is invertible (ii) (Wf,Wg) = (f,g) V f,g € L2(m2) (iii) UT W = WUT . (The conditions (i), (ii) just say that W is an isomorphism of Hil- bert spaces.) Remarks : (1) Spectral isomorphism is an equivalence relation. (2) If *: (8.,m.) -* (B.jm,) is a measure algebra isomorphism then * induces an invertible linear maj VX„ = X.,Rv, with the properties: 2 2 * induces an invertible linear map V: L (m„) -* L (m.), by (a) (Vf,Vg) = (f,g) V f,g € L2(m2).
55 (b) V, V- map bounded functions to bounded functions. (c) V is multiplicative on bounded functions. Proof: V is defined as follows. Let B„ ç B2> then V(x„ ) = x /n > which is unambiguous in L (m„). We then extend V 2 to simple functions and then to L (m„) functions. The properties of $ guarantee this can be done. (a), (b), and (c) are proved by checking first for characteristic functions, then for simple functions, 2 and then extending to the whole of L (m„). // (3) If T, and T„ are conjugate then they are spectrally isomorphic. Proof: Suppose $ : (B2>m2^ "* C B -■ » Ji-, ) is an isomorphism of measure algebras such that $T~ = T, *. Let V be defined as in remark (2). It remains to check that VUT = UT V. 2 ''I First, on characteristic functions UT V(X ) = UT (X.R ) = X _ = X _ = V(X ) = VU„ (X ). 1 ~2 1 ~2 Ti *?2 *~2 ~2 ~2 ~2 2 2 Therefore UT V and VU_ agree on characteristic functions and 1 i2 hence on linear combinations of characteristic functions. By their continuity we have UT V = VU_ . // il i2 The following tells us when spectral isomorphism implies conjugacy. Theorem 2.1: 2 2 An invertible isometry V: L (m„) -* L (m ) is induced by an isomorphism of measure algebras (in the same sense of remark (2)) if both V and V take bounded functions to bounded functions and V(fg) = 2 V(f)V(g) whenever f and g are bounded and in L (m„).
56 Proof: Let B„ € B„ . We have Xn = Xn so that V(Xn ) = V(XR )V(Xn ) = V(XR ), B2 B2 B2 B2 and we see that V(XD ) takes 1 and 0 as its only values. Thus B2 3 B.. € B.. such that V(Xn ) = Xn a.e. We define $: (B„,m„) -* 1 1 D« D, ~Z ~Z C B- ,m, ) by *(B_) = B,. This is unambiguous since if i,(B JAJ = 0 then ||x„ - X. || = 0 so that ||V(xn ) - V(x. ) || = 0. Clearly, V B2 A2 2 A2 is induced by * in the sense of remark 2. We now show that $ is an isomorphism of measure algebras. First, $ is invertible by doing the above for V . Also, m2(B2) = m2(B2) = J X^ dm2 = (X^.x^) = (VX§2,VXR2) = (Xi(§2),Xi(|2)> = mi(*B2). It remains to show that $ preserves complements and countable unions. First note that since V is norm-preserving and maps characteristic functions to characteristic functions, V(l) = 1. 2 Since Xn + Xv . D =1 in L (m„) applying V to both sides ~2 wo^S? gives X$B + X$(x B j = 1 so X^Sj^ = *(X2\B2). Therefore $ preserves complements. Suppose B,C € B2. Then XBUC XB + XC XBflC XB + XC XBXC" Taking V of both sides we get: X*(BUC) = X$(B) + X*(C) X*(B)X$(C) = X$(B)U*(C)' Thus $(B UÇ) = *(§) U*(C) and hence, (by induction) preserves all
57 finite unions. Let B1,B2,...,Bn,... € B2, then X -* X a.e. Û B£ Û gi i=l i=l 1 2 and also in L (m„) by the bounded convergence theorem. Since V is an isometry it is continuous, so, v/x \ -> v/x \ = X .in L2(m1). i=l On the other hand, iW \ 9w <9^) v/x \ = x = x 2 by the above and so converges to X in L (m.). Therefore Û *§• i=l 1 *( U §i) = U *B£. // i=l i=l Corollary 2.1: If T, : X, ->■ X, , T„: X„ ->■ X„ are measure-preserving and if U V = VU for V: L2(m ) -♦ L (m ) satisfying the conditions of il i2 l x Theorem 2.1, then T, and T„ are conjugate. §4. Spectral Invariants Definition 2.5: A property P of measure-preserving transformations is a isomorphism conjugacy invariant if the following holds: spectral
58 Given T1 has P and T2 is isomorphic conjugate to T,, _spectrally isomorphic then T„ has property P. Note: A spectral invariant is a conjugacy invariant, and a conjugacy invariant is an isomorphism invariant. Theorem 2.2 : The following are spectral invariants of measure-preserving transformations : (i) Ergodicity (ii) Weak-mixing (iii) Strong-mixing. Proof; (i) T is ergodic iff {f € L2(m): UTf = f} is a one- dimensional subspace. (ii) T is weak-mixing iff 1 is the only eigenvalue and T is ergodic. (iii) Suppose WUT = U_ W and T. is strong-mixing. We have i2 il x to show that (U h,k) -+ (h,l)(l,k) V h,k € L2(m ). i2 L This is true if h is constant or if k is constant, so assume (h,l) = 0 = (k,l). Since 1-^ is ergodic then T„ is ergodic by (i) and since W sends the invariant functions for T„ onto those for 2 T-,, W maps the subspace of constants in L (m„) onto the subspace of constants in L (n^). So (Wh,l) = 0 = (l,Wk). Since W preserves the inner product, (UT h,k) = (WU h,Wk) = (UT Wh,Wk) ■*■ 0 i2 L2 il since T, is strong-mixing. Therefore T„ is strong-mixing. //
59 §5. Examples Recall that T, is isomorphic to T„ =» T, is conjugate to T„ => T, is spectrally isomorphic to T„ and the converse of the first implication holds in all "decent" measure spaces. (1) Consider T1>T2: K -* K given by T-^z) = a^, T2(z) = a2z where a, is a root of unity and a„ is not a root of unity. T. is not ergodic while T„ is ergodic. Hence they cannot be spectrally isomorphic. (2) Let T(z) = az where a is not a root of unity. We know that 2 2 T is ergodic but not weak-mixing. Consider A: T -* T defined by A(z,w) = (zw,z). Since none of the eigenvalues of the matrix I 1 are roots of unity, A is weak-mixing. Hence T and A are not spectrally isomorphic. (3) Let at least two of the numbers {p-, >P2 > • • • >P } be non-zero, and n ^T p. = 1. Let the same be true for the numbers {q,,...,q }. We i=l 1 i m claim that the 2-sided (p,,...,p )-shift and the 2-sided (□.,...,q )- 1 n ^1 ^m shift are spectrally isomorphic but not necessarily conjugate. A consideration of entropy shows that they need not be conjugate. (See Chapter 4.) Consider the special case of the (Js,Js)-shift T, with X = TT {-1,1}. A basis for L ({-1,1}) consists of the constant function 1 and the identity map sending (-1) h- (-1), 1 h- 1. 2 -t-r- 2 Moreover, L (TTX.) is the tensor product of the spaces L (X.) so 2 that we have an orthonormal basis for L (X) consisting of all
60 functions X -* C of the form: g0({xn}) - 1 and, for n, < n„ < ... < n g ( {x }) = x -x •. . . -x 6n1>...,nr n1 n± n2 np Note that Vn1,...,nr^xn>) = ^n,,. . . ,nr°T> < <xn» = xn1+l'xn2+l'-"'xnr+l = gni+l ,n2+l,. . . ,nr+l( {xn}) ' that is, UTgn1,...,nr = gn1+l,...,nr+l* 2 So we have an orthonormal basis for L (X) of the form; f0 5 i, {u;fl}n€Z , {u;f2>n€Z , ... . Diagramatically, the basis has the form f0 5 X .., u~ f1, u~ f1, f1, uTf1, uTf19 • • • , Um o' T 2' 2' ^t,-^2' Urn^2' Definition 2.6: An invertible measure-preserving transformation T: X ■* X h countable Lebesgue spectrum if there exists an orthonormal basis L2(X) of the form: fQ = 1, {U^fj} j 2 1. n € Z, i.e., a basis as in (*) above.
61 Remarks : (1) Any two transformations with countable Lebesgue spectrum are spectrally isomorphic. Proof : If T.-X-X, S: Y -* Y have bases: n n€Z 2 f0 s 1, {UTfj>J€Z+ for L (X) n n€Z 9 g0 s 1' <USgj>J€Z+ for L (Y) we define W: L2(Y) -> L2(X) by gQ -> fQ, Ugg. -> UTf. and extend by linearity. Thus WU<, = UTW and S and T are spectrally isomorphic. (2) Using a similar method to the one used above for the (Jj ,Jj)-shift one can show that if at least two of {p,,p„,...,Pn> are non-zero then the 2-sided {p, ,p„ ,...,p }-shift has countable Lebesgue spectrum. Theorem 2.3: If T has countable Lebesgue spectrum it is strong-mixing. Proof: Let {f ,U„f : n€Z, m>0} be the basis. Then, clearly, as p -* » (UT°UTfm'UTfq) - (UTfm5l)(1'UTfq} V k'n € Z' m'q " °' Fix k and q and consider Hk,q = <f € L2(m): (U^f,U^fq) -> (f,1)(l.U^f )}. 2 tf, is a closed subspace of L (m) (c.f. proof in example (4) §6 k ,q Ch. 1) and contains the basis {fQ,UTf : n €Z, m>0} and hence is equal to L2(m). Fix f € L2(m) and let Lf = {g € L2(m): P 2 (Urpf,g) -» (f,l)(l,g)}. Lf is a closed subspace of L (m) , contains the basis by the above, and therefore is equal to the entirety of 2 L (m). Hence (U^f,g) -> (f,l)(l,g) V f,g € L2(m). //
62 Suppose A: G -* G is an ergodic automorphism of a compact abelian metric group. Then the automorphism A: G -* G has no finite orbits except for the orbit of the identity. (This is what the er- 2 godicity of A says.) Since the characters form a basis for L (m) we can conclude that A has countable Lebesgue spectrum if we can show there are infinitely many distinct orbits of A. This is proved in Halmos [2]. In Chapter 4 we shall consider a whole class of transformations with countable Lebesgue spectrum.
Chapter 3 : Measure-Preserving Transformations with Pure Point Spectrum In this chapter we study a class of measure-preserving transformations for which the conjugacy problem is solved and for which spectral isomorphism implies conjugacy. §1. Eigenfunctions Suppose T is an ergodic measure-preserving transformation of a probability space (X,B,m). Suppose \ is an eigenvalue correspond- 2 ing to the eigenfunction f, i.e., f t 0, f € L (m), U_f = \f € L2(m) ( (foT)(x) = \f(x) a.e.). Then (1) |\| = 1 and |f| is a constant a.e. We have f(T(x))f(T(x)) = \\f(x)f(x) a.e. Integrating both sides we get that ||f||2 = |\|2||f||2. Therefore |\| = 1. Also |f(T(x))| = |\||f(x)j a.e. = |f(x)| a.e. Thus |f| is a T-invariant function and, since T is ergodic, |f| = a constant a.e. (2) Eigenfunctions corresponding to different eigenvalues are orthogonal. Suppose \ i |j., U_f = Xf, UTg = |j.g. Then (f,g) = (UTf,UTg) = (Xf,^g) = \il(f,g) and X|j. i 1 implies (f,g) = 0. (3) If f°T = Xf, goT = Xg then f = c-g where c is some constant. By (1) g i 0, so (f/g)°T = f/g which must be constant since T is ergodic. So (2) and (3) show that eigenspaces are 1-dimensional and
64 mutually orthogonal. (4) The eigenvalues of T form a subgroup of K. If foT = Xf, g°T = M-g then (fg)°T = Xjlfg. 2 By (2) if L (m) is separable then the group of eigenvalues is countable. §2. Pure Point Spectrum Definition 3.1: An ergodic measure-preserving transformation T on a probability space (X,B,m) has pure point spectrum (discrete spectrum) if 2 there exists an orthomormal basis for L (m) which consists of eigen- functions of T. The following theorem shows that the eigenvalues determine completely whether two such transformations are conjugate or not. Theorem 3.1: (Discrete Spectrum Theorem - Halmos and Von Neumann [1], 1942) The following are equivalent for ergodic measure-preserving transformations T, and T„ each having pure point spectrum: (1) T. and T„ are spectrally isomorphic. (2) T, and T„ have the same eigenvalues. (3) T, and T„ are conjugate. Proof: (1) =» (2) is trivial. (3) => (1) is always true (see §3 of Chapter 2). 2 (2) => (1). For each eigenvalue X, choose f, € L (nu.) , 2 g, € L (m„) such that and \fX - XfX> UT/X = XS> IM "" l*xl = x-
65 2 2 We define W: L (m„) -* L (m ) by W(g,) = f. and extending by linearity. We readily see that W is a bijective isometry; moreover WUT = UT W by checking this on the g. . i2 il X (2) => (3). To prove this we need the following result: Theorem 3.2 : Let H be a discrete abelian group and K a divisible subgroup of H (i.e., V k € K and Vn>0 3 a € K 3 an=k). Then there exists a homomorphism ^ : H -* K such that <t>\K = identity (i.e., K is an algebraic retract of H). Proof: Let R consist of all retracts onto K from supergroups of K in H, i.e., R consists of all pairs (M,*) where H>M>K and <t> : M -* K is a homomorphism such that <j> | „ = identity. R is non-empty as (K,idK) € R. We order R by extension, i.e., (M1,«1) < (M2,«2) if M1 s M2 and *2Im = *i* This is a partial ordering and every linearly ordered subset has an upper bound. So, by Zorn's Lemma there exists a maximal element, say (L,p), of R. We claim that L = H. Suppose not, then consider g € H\L and let M be the group generated by g and L. Case 1: If no power of g lies in L then every element of M can be uniquely written in the form g a where a € L, i € Z. We define \|r: M -* K by \|r(g a) = p(a). We can easily check that \|r is a homomorphism and that \|r | „ = id,,. This then contradicts the maximality of (L,p). Case 2: Let n be the least positive integer such that g € L. Each element of M can be uniquely written as g a, where a € L, 0 s i S n-1. Since K is divisible, let g. € K be such that p(g ) = gn« Then \|r(g a) = g.p(a) defines a homomorphism of M into K such that \|r | „ = id, . Again, we have contradicted the maximality of (L,p).
66 Thus it follows that L = H. // We now prove that (2) => (3). Let A denote the group of eigenvalues of T, = the group of eigenvalues of T„. Fix X € A. Let 2 f. € L (nu) be chosen so that |f. | = 1, UT f. = \f. and observe 2 2 that {f\: X € A} is a basis for L (m,). Also, choose g, € L (m„) so that |g.| = 1, U_ g. = Xg and observe that {gx: ^ € A} is a 2 basis for L (m„). UT.fX^ = X^ V \„x € A l and also UT (f,-f ) = f,(T)-f (T) = (X|^)(f.-f ). By (3) of §1 there exists a constant r(X,|j.) € K such that f,(x)f (x) = r(X,u)f, (x) a.e. X |j. X|j. Let H denote the collection of all functions X •*■ K. Clearly, H is an abelian group under pointwise multiplication. Moreover, K is a subgroup of H if we identify constant functions with their values. By the previous Theorem 3.2 there exists a homomorphism ^: H -* K such that, *IK = idK. Let f* = $(fx)fx; then |f*| = 1, UTf* = \f* and {f?: X € A} is a basis for L (m.). Also, f*f* = *(f,)«(f )f.f = 0(f.f )f,f 0(r(\,M.))*(fx )fXM_r(\,M.) r(\,M.)*(fx )r(\,M.)fx Thu s for all intents and purposes we can assume that ^\^u ~ ^\u an<^ gXg|^ = gX|T
67 2 2 Define W: L (m„) -* L (m.) by W(g)) = f\ and extend by linearity. W is bijective, linear and preserves the inner product. Also, WU_ = UT W. If we can show that W is multiplicative then i2 il W is necessarily induced by an isomorphism of measure algebras (by Corollary 2.1) and hence T and T„ are conjugate. But, W(g,g ) = W(g. ) = f, = f,f = W(g,)W(g ). 2 Let h,k be bounded functions in L (m„). If we fix g and let a ' 2 finite linear combination of g s converge to h in L (m„) we ob- tain that W(hg ) = W(h)W(g ). Then if we let a finite linear combi- nation of g 's converge to k in L (m„) we get that W(hk) = W(h)W(k). It follows from this that W maps bounded functions to bounded functions since this is also true for bounded h and any 2 2 k € L (m„), and then W(h) is bounded since W(h)f € L (nu) for all f in L2(m1). // Corollary 3.3: If T is an invertible ergodic transformation with pure point spectrum then T and T are conjugate. Proof: They have the same eigenvalues. // §3. Group Rotations Example •■ Let T: K -* K be defined by T(z) = az where a is not a root of unity. We know that T is ergodic. Let f : K -* C be defined n by f (z) = z where n € Z. n f (Tz) = f (az) = anzn = anf (z). n n n Thus f is an eigenfunction with eigenvalue a . Since the i^-n) 2 form a basis for L (K) we see that T is ergodic and has pure
68 point spectrum. These ideas carry over to rotations on compact abelian groups. Theorem 3.3: Let T, (T(g) = ag) be an ergodic rotation of a compact abelian group G. Then T has pure point spectrum. The eigenfunctions of T all consist of constant multiples of characters, and the eigenvalues are {y(a): y € G}. Proof: Let y € G. Then Y(Tg) = r(ag) = r(a)Y(g). Therefore each character is an eigenfunction and so T has pure point spectrum. If there is another eigenvalue besides the members of {y(a): y € G} then the corresponding eigenfunction would be orthogonal to all members of G, by CO of §1, and so is zero. Hence {y(a): y € G} consists of all the eigenvalues of T and the only eigenfunctions are constant multiples of characters, using (3) of §1. // Theorem 3.4: (Representation Theorem) An ergodic measure-preserving transformation T with pure point spectrum is conjugate to an ergodic rotation on some compact abelian group. Proof: Let A = the group of all eigenvalues for T and give A 2 the discrete topology. If L (m) is separable then A is countable. In the other case we shall need to use the character theory of groups without a countable basis. Let G = Â, the character group of A. G is compact and abelian. The map a : A -* K given by a(X) = X is a homomorphism of the discrete group A and hence, by (2) of §5 of Chapter 0, 3 a € G so that a(X)=X(a) X € A (where we write X when we wish to consider "X" as a homomorphism of G to K).
69 Define S : G •*■ G by S(g) = ag. We claim that S is ergodic. Suppose fS = f, f € L2(G), f = Hb.X., X. € A. Then the above gives us that Eb.X.(a)X.(g) <\- Eb.X.(g) so, b.X.(a) = b.. But X.(a) = X. and therefore b.X. = b.. If b. t 0 then necessarily X. = 1. Thus X.(g) = 1 for all g € G, and we get that f = a constant a.e. We know then that S is ergodic, and by the previous theorem has pure point spectrum. Again by the previous theorem the eigenvalues of S = {Y(a): y € G} = {a(X): X € A} = {X : X € A} = A. So, S and T have the same eigenvalues and both have pure point spectrum. Hence the Discrete Spectrum Theorem tells us that they are conjugate. // Theorem 3.5: (Existence Theorem) Every subgroup A of K is the group of eigenvalues of an ergodic measure-preserving transformation with pure point spectrum. Proof: The desired transformation is the rotation S constructed in the proof of Theorem 3.4-. // The conjugacy problem for ergodic measure-preserving transformations with pure point spectrum is completely solved. We have some very simple invariants, namely the eigenvalues, which determine when two such transformations are conjugate. Also there are some simple examples, namely group rotations, such that each ergodic measure- preserving transformation with pure point spectrum is conjugate to one of these examples. So each conjugacy class of ergodic measure- preserving transformations with pure point spectrum is characterized by a subgroup of K, and each subgroup of K corresponds to a conjugacy class.
Chapter M- : Entropy We are searching for conjugacy and/or isomorphism invariants. In 1958 Kolmogorov [1] introduced the concept of entropy into ergodic theory, and this has been the most successful invariant so far. For example, in 194-3 it was known that the two-sided (1/2,1/2)-shift and the two-sided (1/3,1/3)-shift both have countable Lebesgue spectrum and hence are spectrally isomorphic; but it was not known whether they were conjugate. This was resolved in 19 5 8 when Kolmogorov showed that they had entropies log 2 and log 3 respectively and hence are not conjugate. Von Neumann had had the same idea considerably earlier, but he was unable to prove that entropy was a conjugacy invariant. The notion of entropy now used is slightly different from that used by Kolmogorov - the improvement was made by Sinai [1] in 1959. §1. Partitions and Subalgebras Throughout, (X,B,m) will denote a probability space. Definition 4- .1 : A partition of (X,B,m) is a disjoint collection of elements of B whose union is X. We shall be interested in finite partitions. They will be denoted by Greek letters, e.g., Ç = {A,,...,A, }. If Ç is a finite partition of (X,B,m) then the collection of all elements of B which are unions of elements of Ç is a finite sub-a-algebra of B. We denote it by A(ç). Conversely, if C is a finite sub-a-algebra of B, say C = {C.: i=l,...,n}, then the nonempty sets of the form B, fl . . . fl B where B. = C. or X \C. form r J 1 n l i. l a finite partition of (X,B,m). We denote it by Ç(C). Thus we have
71 a one-to-one correspondence between finite partitions and finite sub-a-algebras of B. Definition 4.2: Suppose Ç and t| are two finite partitions. Ç ; tj means that each element of Ç is a union of elements of r|. Note: Ç ïti « A(ç) £. A(-q) A £ C « £(A) s Ç(C). Definition M-. 3 : Let Ç = {A1,...,An}, ti = {C1,...,Ck}. Then Ç v ti = {Ai n C. : 1 < i < n, lsjsk}. If A and C are finite sub-a-algebras of B then A v C denotes the smallest sub-a-algebra of B containing A and C. Note: Ç(A v C) = Ç(A) v Ç(C) A(Ç v ti) = A(Ç) v A(ti). Suppose T: X -* X is a measure-preserving transformation. If Ç = {A19...,Am}, then by T~nç we mean {T~nA1,...,T-nAm} and by T~n(A) we mean {T~nA: A € A} (n > 0). Note: If n > 0 ÇCT nA) = T nç(A) A(T nO - T nA(ç) T"n(A v C) = T~nA v T"nC T"n(Ç v-q) = T nÇ v T-nT! Ç £ ti ■=> T nÇ £ T_nT! A £ C - T nA c- T nC.
72 Definition 4.4: If V and E are (not necessarily finite) sub-a-algebras of B, then we write OIE if V D € P 3 E € E such that m(DAE) = 0 and V E € E 3 D € P such that m(DAE) = 0. If 0 and E are finite, 0 2 E, and if Ç(0) = {Dl5...,D ,D +1,...,D } where m(D^) > 0 for 1 2 i £ p and m(Di) = 0 for p+1 £ i £ q, then Ç(E) = {E^ . . . ,E ,E +1, . . . ,Eg} where mtE^AD^) = 0 for 1 £ i £ p and m(E.) = 0 for p+1 £ i £ s. V c C means V D € V 3 C € C such that m(DAC) = 0. §2. Entropy All logarithms are to base 2 and 0•log 0=0. Let A S. B be finite. Let Ç(A) = {A., , . . . ,A, }. Then k H(A) = H(Ç(À)) = - E m(A.) log m(A.), i = l is called the entropy of A (or of Ç(A) ). (This means that if A,,...,A, denote the outcomes of an experiment then H(A) measures the uncertainty removed (or information gained) by performing the experiment. H(A) is a measure of the uncertainty about which A. a general point of X will belong to.) Remarks : (1) If A = {X,*} then H(A) = 0. Here A represents the outcomes of a "certain" experiment so there is no uncertainty about the outcome. (2) If Ç(A) = {A1,...,A]<} where m(Ai) = 1/k V i then k , H(A) = - E è log £ = log k. i = l K K
73 Thus, we gain a lot of information if k is large. (Since all the members of Ç(A) have equal measure there is much uncertainty about which A. a point will belong to.) (3) H(A) s 0. (4) If A I C then H(A) = H(C). Suppose T: X -* X is measure-preserving. If A is a finite sub-a-algebra of B we define h(T,A) = lim j^HCA v T_1A v ... v T~(n_1)A) n-*» n-1 = lim ±H( V T 1A), n->- n i=o which we call the entropy of _T_ with respect to _A_. (Later (in Corollary 4.4) we will show that the above limit always exists.) (This means that if we think of an application of T as a passage of n-1 _i one day of time, then \/ 1 k represents the combined experiment i=0 of performing the original experiment represented by A on n consecutive days. h(T,A) is then the average information per day that one gets from performing the original experiment daily forever.) Remarks : (5) h(T,A) £ 0. n-! _i n-! _i (6) The elements of Ç( V T A) = V Ç(T A) are all the sets of i=0 i=0 n-1 _. the form (~\ T 1A where Ç(A) = {A,,...,A,}. i=0 mi Ik We define h(T) = sup h(T,A) where the supremum is taken over all finite sub-a-algebras A contained in B and call this the entropy of _J_. ( h(T) is the maximum average information per day obtainable by performing a finite experiment.)
74 Remarks : (7) h(T) 2: 0. h(T) could be +-. (8) h(idv) =0. If h(T) = 0 then h(T,A) = 0 for every finite A, A n-l _± which implies that V T A does not change much as n -* ». i=0 Theorem 4.1: Entropy is a conjugacy invariant and hence an isomorphism invariant. Proof: Let T, : X, -* X. , T„: X„ -* X„ be measure-preserving and let $ : (8„,m„) -* (8..,m,) be an isomorphism of measure algebras such that ST"1 = T"1*. Let A be finite, k^ c B , and £(A2) = {A15.,.,A }. Choose B^ € B such that B^ = *(A.) and so that r\ - {B.,...,B } forms a partition of (X,, B.. ,m.. ) . Let A. = A(t|). n_1 -i Now O T, B (where the q. € {l,...,r}) has the same measure i=0 X qi n-l _. as Pi T„XA since i=0 qi ♦(HA ) = *(OT:iA ) = Ht"1*(A ) = "fi T^B = ^ T^B . i=0^1^qi i = 0 qi i = 0 qi i = 0 ~x ~qi i = 0^L^qi n-l _. n-l _. Thus, H( V T XA ) = H( V T„1A ) which implies that h(T ,A ) = i=0 x x i=0 l L x x h(T„,A„) which in turn implies h(T,) ï h(T„). By symmetry we then get that h(Tx) = h(T2). // Theorem 4.2: The function 4> : [0,-) -* R defined by: 0 if x = 0 4>(x) = I x.log x if x ?! 0 is convex, i.e., 0(ax+py) £ a*(x) + p*(y) if x,y € [0,-), a,p > 0, a+p = 1.
75 By induction k k *( Z a-;xJ - E a.-*Cx. ) i=l 1 1 i=l 1 k if x. € CO,-), a. > 0, Z a. = 1. 1 1 i=l 1 Proof: <t>' (x) = log e + log x 0"(x) = i£S_Ë. > o on (0,-). x Suppose y > x; by the mean value theorem 4>(y) - 4>(ax+{3y) = <t> ' (z)a(y-x) where ax+py < z < y and 4>(ax+py) - 4>(x) = 4>'(w)p(y-x) where x < w < ax+py. Since 4," > 0 4>'(z) 2; *'(w), thus pU(y) - 4>(ax+py)) = 0'(z)ap(y-x) > 4> ' (w)ap(y-x) = a(*(x+py)-0(x)). Therefore 4>(ax+{3y) 2 a^(x) + p*(y) if x,y > 0, and hence also if x,y so by continuity of <t>. II Corollary 4.2: If Ç = {A1,...,A]<} then H(Ç) £ log k. Proof: Put a. = 1/k and x. = m(A.) 1 £ i S k. Then HCO s log k. // Combined with remark (2) this corollary shows that among all the partitions of X into k sets, the largest entropy is obtained when all the sets have equal measure. This fits in with our intuitive interpretation of entropy.
76 §3. Conditional Entropy Let A,C £ B be finite. 5(A) = {A1,...,Ak}, ç(C) = {C15...,Cp}. We define the entropy of A given C to be p k m(A. PI C. ) m(A. fl C. ) H(A/C) = - £_ m(C.) Y ï 1- log ï 3_ j=l 3 i=l m(Cj) m(Cj) m(A. fl C. ) = - 2. m(A. fl C . ) log J— 2: 0 i,j x 3 mCCj) omitting the j-terms when m(C.) = 0. So to get H(A/C) one considers C. as a measure space with normalized measure m(-)/m(C.) and calculates the entropy of the partition of C. induced by £(A) (this gives k m^ fl C.) m(Ai fl C.) - L ^— log 3— ) and then averages the answer taking i=l m(Cj) m(Cj) into account the size of C. ( H(A/C) measures the average information obtained from performing the experiment associated with A given the outcome of the experiment associated with C.) Let W denote the a-field {*,X}. Then HCA/W) = H(A). (Since N represents the outcome of the trivial experiment one gains nothing from knowledge of it.) Remarks : (1) HCA/C) > 0. (2) If A 2 V then H(A/C) = H(P/C). (3) If C 2 V then H(A/C) = H(A/0).
77 Theorem 4.3 : If A,C,P are finite subalgebras of B then: (i) H(AvC/P) = H(A/P) + H(CMvp) (ii) H(A vc) = H(A) + HCC/A) (iii) A ^ C = H(A/0) s H(C/0) (iv) A £• C = H(A) s H(C) (v) C £. 0 = HCA/C) > H(A/0) (vi) H(A) > H(A/0) (vii) HCAvC/O) < H(A/0) + H(C/0) (viii) H(A vC) s H(A) + H(C). (ix) If T is measure-preserving then: H(T"1A/T"1C) = H(A/C) and (x) H(T_1A) = H(A). (The reader should think of the intuitive meaning of each statement. This enables one to remember these results easily.) Proof: Let Ç(A) = {A.}, Ç(C) = {C.}, 5(0) = {D, } and assume, x 3 k without loss of generality, that all sets have strictly positive measure (since if Ç(A) = {A,,...,A,} with rc(A.) > 0 1 < i 2 r and m(A.) =0 r £ i £ k we can replace £(A) by «Al Vl'N. UAr+l U ••• U V }" m(A. n C. n D.) (i) H(Av C/V) = - L_ m(A. fl C . n D, ) log ± J — . i,5,k x 3 K m(Dk) mCA.ncnD.) mCA.ncnD.) m(A. n D, ) But 3 £_ r i 3 !S_ i iS_ unless m(A. n D. ) = 0 m(Dk) m(AinDk) m(Dk) and then the left hand side is zero and we need not consider it; and therefore
78 m(A. fi D, ) H(AvC/0) = - E m(A. n C. n D ) log x K i.j.k X 3 k m(Dk> m(A. fl C fl D ) - I. m(A. fl C. fl D, ) log i 3 £- i,j,k 3 mCA^ Dk) m(A. fl D, ) = - L m(A. fl D, ) log i — + HCC/A v 0) i,k 1 K m(Dk) = H(A/P) + HCC/Av 0). (ii) Put V - H = U,X}. (iii) By (i) H(C/0) = H(AvC/0) = H(A/0) + H(C/Avp) > H(A/0). (iv) Put V = H in (iii). m(D, fl C. ) m(A. fl D,) (v) Fix i,j and let a = - ^—, x, = = —. Then K m(C.) K m(Dk) by Theorem 4.2 / m(Dknc.) m(Ai n Dk) \ y m(Dk fl C. ) /itAJD^ \ \ k m(C.) m(Dk) J'y. m(C.) \ m(Dk) J but since C £ V the left hand side /m(A.nc.)\ m(A.nc) i(A.OC) = 4, Ï 1- = i 3_ log i 2_ . \ m(C.) J m(C.) m(C.) Multiply both sides by m(C.) and sum over i and j to give _ m(A. PC.) _ m(A. (1 D, ) m(A. fl D, ) 2_ m(A. HO log 1- < 2_ m(D, n O — log 1 i,j J m(C.) i,j,k ^ J m(Dk) m(Dk) _ m(A. n D, ) m(A. fl D, ) = 2. m(D. ) i — log i k_ i,k m(D,) m(D, )
79 or -HCA/C) S -H(A/Î>). Therefore H(A/Î>) S HCA/C). (vi) Put C = H in (v). (vii) Use (i) and (v). (viii) Set V - H in (vii). (ix), (x) Clear from definitions. // Theorem 4.4: If {a } , satisfies a ï 0, a. sa +a V n,m, then n n>l n n+m n m lim a /n exists and equals inf a /n. Ti~*-°° n Proof: Fix m > 0. For each j>0 j=km + n where 0 5 n < m. Then a. a .. a a, a ka a a j _ n+km < _n_ km ,. _n_ m _ _n_ , _m j n + km km km km km km m a • a a. a As j -* » then k-*~ so lim — £ — and therefore lim -1 £ inf —. j m j m a a. a. a. But inf — - lim —1 so that lim -J- exists and equals inf -1. // m j 3 j Corollary 4.4: 1 n"1 -i If A £ B then lim - H( V T A) exists. n-*- n i=0 (See also the remark after Theorem 4.6.) n-1 _. Proof: Let a = H( V T 1A) > 0. n i=0
80 n+m-1 an+m = H< V T"lA) 1 = 0 n-1 _. n+m-1 _. by (viii) < H( V T 1À) + H( V T 1À) „ „ . n „•_„ of Theorem 4.3. i=0 l-n m-1 _. = a + H( V T A) by (x) in Theorem 4.3. n i = 0 = a + a . n m We then apply Theorem 4.4. // §4. Properties of h(T,A) n-1 _. Recall that h(T,A) = lim - H( V T 1A) . n-*~ n i=0 Theorem 4.5: Suppose A,C are subalgebras of B and T is measure- preserving. Then (1) h(T,A) < H(A). (2) h(T,AvC) £ h(T,A) + h(T,C). (3) A £ C = h(T,A) < h(T,C). (4) h(T,A) < h(T,C) + H(A/C). (5) If T is invertible and mil then m h(T,A) = h(T , V T1A). i = -m Proof: (1) - H( V T 1A) s ± £ H(T 1A) by (viii) of Theorem 4.3. n i = 0 n-1 i £ H(A) by (x) of Theorem 4.3. = H(A).
81 (2) H( V T_i(A^O) = H( V T_iA v V T_:LC) i=0 i=0 i=0 n-1 . n-1 £ H( V T_1A) + H( V T" C) by (viii) of Theorem 4.3. i=0 i=0 (3) If A £. C then n-1 . n-1 V T_1A £ V T_1C V n £ 1 i=0 i=0 so one uses (iv) of Theorem 4.3. n-1 _. n-1 . n-1 (4) H( V T 1A) <H(( V T_1A) v ( y T_1C) ) by (iv) of Theorem 4.3 i=0 i=0 i=0 n-1 , n-1 . n-1 _, = H( V T_1C) + H(( V T xA)/( V 1 C)) by (ii) of Theorem 4.3. i=0 i=0 i=0 But, by (vii) of Theorem 4.3 n-1 . n-1 . n-1 . n-1 . H((V T1A)/(V T_1C)) S Y. H(T_1A/(V T3C)) i=0 i=0 i=0 j=0 n-1 £ Z. H(T 1A/T 1C) by (v) of Theorem 4.3 i = 0 = nH(A/C) by (ix) of Theorem 4.3. n-1 . n-1 _. Thus, H( V T XA) < H( V T C) + nH(A/C). i=0 i=0 m . , k-1 . m (5) h(T , V T1A) = lim ^ H( V T"3 ( V T1A) ) -m k-*~ j=0 i = -m , m+k-1 = lim i H( V T_:LA). k-»~ -m
82 m+k-1 . m . m+k-1 H( V T_1À) = H( V T_1À v V T_1À) -m -m m m+k-1 • m . m+k-1 = H( V T_1A) + H(( V T_1À)/( V T 1A)) m -m m by (ii) of Theorem 4.3 k-1 . m . m+k-1 H( V T_1A) + H(( V T 1A)/( V T 1A)) i=0 -m m by (x) of Theorem 4.3. We want to show that , m . m+k-1 ^H((VT:lA)/( V T 1A)) -> 0 as k->-. -m m , m . m+k-1 _• -, m . But, i- H(( V T 1A)/( V T 1A)) < i- H( V T" A) by (vi) of Theo- -m m -m rem 4.3 and so the result follows. // Theorem 4.6 : If A £ B is finite, and T is measure-preserving then h(T,A) = lim H(A/( V T 1A)), n-*» i=l Proof: The limit exists since the right hand side is non- increasing in n by virtue of (v) of Theorem 4.3. We show by induction that for n î 2 n-1 . n-1 j H( V T 1A) = H(A) + Z H(A/( V T 1A)). i=0 j=l i=l This is true for n = 2 by (ii) of Theorem 4.3. Assume that this equality is true for n. We shall prove that it holds for n+1.
83 H( V T_iA) = H( V T_1A) + H(A/( V T_1A)) i=0 i=l i=l by (ii) of Theorem 4.3 n_1 -i n -i = H( V T 1A) + H(A/( V T 1A)) i=0 i=l by (x) of Theorem 4.3 = H(A) + Z H(A/( V T_iA)). (*) j=l i=l Dividing the above by n and taking the limit the result follows, using the fact that the Cesaro limit of a convergent sequence is the ordinary limit. // Remark : 1 n_1 -i ± H( V T A) decreases to h(T,A). n i = 0 n-1 . n Proof: By (*) H( V T 1A) > nH(A/( V T 1A)) using (v) of i=0 i=l Theorem 4.3. Hence n-1 . n . n-1 n[H( V T-1A) + H(A/( V T_1A))] s (n+l)H( V T_1A) i=0 i=l i=0 n _. n-1 nH( V T 1A) < (n+l)H( V T 1A). i=0 i=0 §5. Properties of h(T) Theorem 4.7: (1) For m > 0, h(Tm) = mh(T). (2) If T is invertible then h(Tm) = |m|h(T) V m € Z. Proof : We first show that m-1 h(Tm, V T 1A) = mh(T,A). i=0
84 This follows since , k-1 . m-1 . km-1 lim ± H( V T ml( V T_:là)) = lim r^- H( V T 1A) k-~ j = 0 i=0 k-~ Km i=0 = mh(T,A). m-1 Thus, mh(T) = m • sup h(T,A) = sup h(Tm , V T_1A) A finite A i=0 < sup hCI^.C) = h(Tm). C m-1 _. Also, h(Tm,A) 2 h(Tm, V T 1A) = mh(T,A) by (3) of Theorem 4.5 and i = 0 so, h(T ) £ mh(T). The result follows from these two inequalities. (2) It suffices to show that h(T~ ) = h(T); and all we need to show is that h(T_1,A) = h(T,A) for all finite A. But n-1 . c _n 1 n-l H( V T1A) = H(T"tn l; V TA) by (x) of Theorem i+. 3 i=0 i=0 = H( V T 3A). // j = 0 Theorem 4.8: Let (X,B,m) be a probability space and Bn be an algebra such that the a-algebra generated by 8Q (denoted by CT(8Q) ) satisfies aCB.) I B. Let C be a finite subalgebra of B. Then for every e > 0, there exists a finite algebra 0, V £ BQ such that H(0/C) + H(C/0) < e. Proof: Let Ç(C) = {C,,...,C } and assume without loss of generality that each C. has positive measure. We can do this since if C,,...,C have positive measure and C .,...,C have zero measure where 1 £ s < r then Ç(C') = {Cl9 . . .,Cg_1 ,Cg U ... U Cr> is such that H(C'/0) = H(C/0) and H(0/C') = H(0/C). Since *(x) = x-log x is continuous and 4> ( 0 ) = 0, <*>(!) = 0,
85 3 0 < 6Q < 1 such that -*(x) < e/2r if 0 £ x £ 6Q or l-60 £ x £ 1. We first show that if we choose a partition Ç(0) = {D-,,...,D } m(C.) 3 m(C.AD.) £ min 6n — =6 V i, then H(C/P) < e/2. For, if 1 X lsi<r ° 2 Ç(0) satisfies the above inequality, then m(C. ) < m(D-) + 6 < m(D. ) + m(C.) which implies < m(D.) and hence, 2 ! m(Di) - m(Dinci) £ m(DiAC.) < 6 < 6Qm(Di). m(D. fi C. ) Thus i — > l-6n. m(Di) U m(C. PI D- ) Therefore, if j i i then ^ — £ 6n and hence, m(Di) U v ,m(C. flD.)\ i j \ m(D.) / S Y. m(.H.) T. e/2r = e/2. i j So m(C.AD.) < — min m(C) implies H(C/P) < e/2. If 1 2 l£i£r 1 60 m(C.iD.) £ -j— min m(C.) then this still holds and also m(C.iD-) < 1 l£isr 3 m(C.)/4 which implies m(D.) > r- m(C.) and hence 6 n - 6 n m(C.AD.) < -7- ' T min m(D. ) < -~- min m(D. ). 11 4 d lsisr 1 ^ l£i£r 1 Therefore H(î>/C) < e/2 also. So it suffices to show we can choose D. € 8. with m(C.AD.) £ -£- min m(C.) = a. 4 l£i£r
86 Choose X > 0 3 X(r-l)[1+ r(r-l)] < a; for each i, choose Bi € BQ 3 m(CiiBi) < X. If i i j then B± fl B. c ( B^C.^ U ( B. AC • ) so that m(B. (IB.) < 2X. Let N = IJ (B. FIB. ). We have r-1 m(N) < r(r-l)\. Set D. = B.\N for 1 s i< r and D = X \ (J D.. i=l {D,,...,D } is a partition of X and each D. € B.. If i < r then D.AC, c (BiiCi) U N and so, m(D.AC) < X[l+r(r-l)] < a. r-1 However D AC c (J (D.AC.) and therefore r i=l 1 1 m(DrACr) < (r-l)X[l +r(r-l)] < a. So the theorem is proved. // (If {A } is a sequence of subalgebras of B then V A de- n ^ = n n notes the sub-algebra of B generated by the A .) Corollary 4.8. If {An} is an increasing sequence of finite algebras and C c V An then H(C/A ) -> 0 as n -> -. n n n Proof: Let Bo = U Am 5 Bo is an algebra and C ° ct(Bq) by m=l hypothesis. By Theorem 4.8 V e > 0 3 finite V £ BQ 5 H(C/0 )< e. But V -A for some m. since V is finite, so, if m î m„ H(C/A ) < H(C/A ) < H(C/0 ) < e. m mn e Thus, H(C/A ) tends to zero as m -* ». // ' m For entropy to be useful we require methods of calculating its value. The following is one of the main tools for calculating entropy.
87 Theorem 4.9: (Kolmogorov-Sinai Theorem) Let T: (X,8,m) -* (X,B,m) be an invertible measure-preserving transformation and let A be a finite subalgebra of B 3 oo V TnA ? b. Then h(T) = h(T,A). n = -» Proof: Let C 5 B be finite. We want to show that h(T,C) < h(T,A). m m • h(T,C) < h(T, V T1A) + H(C/ V TA) i=-m i = -m by (i+) of Theorem 4.5, m = h(T,A) + H(C/ V T1A) i = -m by (5) of Theorem 4.5. m Let Am = V TXA. It suffices to show that H(C/A ) goes to zero i = -m m as m -> ». This follows by Corollary 4.8. // A similar result holds when T is not necessarily invertible:- Theorem 4.10 : If T: X -* X is a measure-preserving transformation (but not necessarily invertible) and if A is a finite algebra contained in B with V T1 2 B then h(T) = h(T,A). i = 0 m-1 _. Proof: This is similar to the previous theorem; use V T A i=0 m . in the place of V TA, and the formula i = -m m-1 _. h(T,A) = h(T, V T 1A) i = 0 (the proof of which is similar to (5) of Theorem 4.5). // The following is sometimes useful in showing transformations
88 have zero entropy. We shall use it later to show a rotation of the unit circle has zero entropy. Corollary 4.10 : If T is invertible and V T_1A % 8 for some finite A then i=0 h(T) = 0. Proof: By Theorem 4.10 h(T) = h(T,A) n = lim H(A/_V T_1A) by Theorem 4.6. n-*» i=l » . , n . But V T-1A % T_1B = B. Let A = V T_:LA ; then k± c A c ... i=l i=l 00 and V A % B. By Corollary 4.8 H(A/A ) - o and h(T) = 0. // , n J J n n=l Remarks : Entropy can be defined for any countable partition of (X,B,m) as follows: If Ç = {A,,A,,...} then H(Ç) = - I! m(Ai)log m(Ai) (which may be infinite). A countable partition Ç of X is called a generator for an invertible measure-preserving transformation T if ea V TnA(Ç) 2 B. n = -- As in Theorem 4.9, one can prove that if Ç is a generator and- H(Ç) < - then h(T) = h(T,Ç). The basic theorem on existence of generators Ç with H(ç) < » was given by Rohlin in 1963. To state it we need the following definitions: We say that (X,B,m) is countably generated if there exists a countable collection {B } of elements of B such that the
89 cr-algebra generated by the {B } % 8. (X,B,m) is complete if every subset of a set of measure zero is measurable. Suppose T: X -* X is a measure-preserving transformation, and X is countably generated and complete. We say that T is aperiodic if m( \J {x € X: Tn(x) = x}) = 0. n€Z n*0 (The countably generated and completeness conditions ensure that this set is measurable.) Note that T ergodic implies T is aperiodic (unless the space is finite). Theorem 4.11: (Rohlin [2], 1963) Suppose (X,B,m) is a countably generated complete non-atomic probability space and T: X -* X is an invertible measure-preserving transformation. Then T has a generator Ç with H(Ç) < » iff h(T) < » and T is aperiodic. Thus, if T is ergodic and h(T) < » then T has a generator Ç with H(Ç) < -. Recently Krieger [1] proved: Theorem 4.12: If (X,B,m) is countably generated and T is an invertible ergodic measure-preserving transformation such that h(T) < » then T has a finite generator Ç = {Al5...,An}. In fact ç may be taken so that eh( < n < e + 1. Hence finite generators exist in the most interesting cases although they may be difficult to find. We now prove some more results that are useful for computation of entropy.
90 Theorem 4.13: If BQ is an algebra and ct(Bq) g B then h(T) = sup h(T,A) where the supremum is taken over all finite subalgebras A of Bn. Proof: Let e > 0. Let C £ B be finite. By Theorem 4.8 there exists a finite V c Bn such that e 0 H(C/0 ) < e. e Thus, h(T,C) < h(T,P ) + H(C/0 ) by (4) of Theorem 4.5 < h(T,P ) + e. e Therefore, h(T,C) < sup h(T,0) + e, V-BQ V finite and thus h(T) < sup h(T,0). PSBQ V finite The opposite inequality is obvious. // Theorem 4.14 : Let A be finite subalgebras of B such that A, £ A £ ... and V A ° B. Then h(T) = lim h(T,A ). n n n n=l n-*« Proof : We note that h(T,A ) is an increasing sequence by (3) ea of Theorem 4.5. B = U A is an algebra and a(B ) ? B. By Theo u n=l n u rem 4.13 h(T) = sup h(T,C). If C ^ Bn is finite then C £ A C£BQ ° n C finit for some n.. Thus C£BQ u n0 h(T,C) s h(T,A ) n0
91 which implies h(T) < lim h(T,À ) n n-v» and hence h(T) = lim h(T,A ). // n-*« Theorem 4.15: h(T1><T2) = h(Tx) + h(T2). Proof: Let T : CX,^,!^) -* (X-^B^n^) and T2: (X2,B2,m2) -> (X2,B2,m2). If Ai - Bi' A2 £ B2 are finite then AixA9 """S fi-ni-te' Ç(A1xA2) = {A1xA2: A1€Ç(A1), A2€Ç(A2)}. Let Tn = U A,xA„ u A CB A co x *■ A1_e1,A2-e2 finite = the algebra of finite unions of measurable rectangles. Thus, CTC'n^ " Bi*B2 ^y definition of 8,xB„, and by Theorem 4.13, h(T,xT„) = sup h(TnxT„,C). X A (iCT -1- *■ 0 C finite But if C is finite, C ç TQ then C £ A-xA. for some finite A, ç; B,, A t B2. Hence h(T,xT ) = sup h(T,xT,A,xA„). 1 l A!-Bi L I L I A2iB2 A. ,A_ finite
92 n-1 H(y (T1xT2)~1(À1xA2)) i=0 n-1 _. n-1 = H(( V T,^) x ( y T^A.)) i=0 x i=0 L A = - E (m1><m2)(CkxD- ) -log (m^nijXC^D.) n-1 _. where the C, are the members of Ç(V t7XA, ) , * i=0 1 1 n_1 -i and the D. are the members of Ç ( V T„ A„) 3 i=0 l l - - L m1(Ck)m2(D.)-log(m1(Ck)m2(D.)) = - E m1(Ck)m2(D.)-[log m^C^) + log m2(D.)] = - E m1(Ck) -log n^CC^) - E m2(D-) -log m2(D-) n-1 . n-1 = H( V T^AJ + H( V T^A.). i=0 x x i=0 L A Thus h(T1xT2,A1xA2) = h(T1,A1) + h(T2,A2) so, h(T]_xT2) = h(Tx) + h(T2). // §6. Examples We shall now calculate the entropy of our examples. (1) If I: (X,B,m) -*■ (X,B,m) is the identity, then h(I) = 0. This is because h(I,A) = lim - H(A) = 0. Also, if Tp = I for some p^O then h(T) = 0. This follows since 0 = h(Tp) = p-h(T) by Theorem 4.7. Hence any measure-preserving transformation of a finite space has zero entropy. (2) Let T: K -> K be T(z) = az. Case 1: Suppose {a : n € Z} is not dense, i.e. , a is a root of
93 unity. Thus a^ = 1 for some p t 0 ; and T^(z) = a^z = z so h(T) = 0 by example (1). Case 2: Suppose {an: n € Z} is dense in K. Then {a11: n<0} is dense in K. Let Ç = {A.,A„} where A. = upper half circle [1,-D A„ = lower half circle [-1,1) For n > 0 T~~ Ç consists of semi-circles beginning at a and -a~n. Since {a : n > 0} is dense any semi-circle belongs to V T_nAU). Hence any arc belongs to V T_nA(Ç). Thus, n=0 n=0 ea B = V T~nAU) and so, h(T) = 0 by Corollary 4.10. // n=0 (3) Any rotation of a compact metric abelian group has entropy zero. Proof : (a) Suppose X = Kn, the n-torus, and T(z,,...,z ) = (a,z,,...,a z ). Then T = TnxT0x...xT where T. : K -* K is defined linn il n î by Ti(z) = aiz. By example (2) h(Ti) = 0 for all i so by Theorem 4.15 n h(T) = Y. h(T.) = 0. i=l 1 (b) General Case. Let T: G -* G be T(x) = ax. Let G= {Yn sY, > • • • }• Let H = Ker Yn H ... n Ker y • H is a closed subgroup of G and n 1 n n (G/H ) is the group generated by {y.,...,y }. Thus (G/H ) = finite group * Z n, n i so G/H = F x K n n n where F is a finite group and K is a finite-dimensional torus. T induces a map T : G/H -*■ G/H by T (gH) = agH . T is a c n n n-^n°n °n n
94 rotation on G/H , so that it can be written T = T , x T „ where n' n n,l n, 2 T , is a rotation of F and T , is a rotation of K n. Thus, n,1 n n, 2 h(T ) = h(T ,) + h(T „) = 0 n n ,1 n,i by example (1) and case (a) of this proof. Note that H \ {e} so that V A(G/H ) % B where A(G/H ) n n n n denotes the a-algebra consisting of those elements of B that are unions of cosets of H . Therefore, if B = \J A(G/H ) then by Theorem 4.13 u n=l n h(T) = sup h(T,C). C-BQ C finite However, if C £ BQ is finite then C £ A(G/H ) for some n and so, h(T,C) < h(Tn) = 0. Thus, h(T) = 0. // Corollary: Any ergodic transformation with pure point spectrum has zero entropy. This follows from Theorem 3.4. (Actually we have shown the re- 2 suit only when L (m) is separable since the above calculation was for a metric group G.) (4) Endomorphisms of compact groups: If A is an endomorphism of the n-torus K onto K we shall show in Chapter 6 that h(A) = L. log|X| where the summation is over all eigenvalues of the matrix [A] with absolute value greater than one. One can write down a complicated formula for the entropy of an endomorphism of a general compact metric abelian group. See Juzvinskii CI].
95 (5) Affine transformations: We shall show in Theorem 6.10 that when T = a-A acts on Kn then h(T) = h(A). (6) The two-sided (p.,... ,p, _,)-shift has entropy ¥,. P,- • log p.. i=0 1 1 Let X = IT {0,1,... ,k-l} and T be the shift. Let Ai = — oo i{x,}: xQ = i>, 0 < i < k-1. Then Ç = {AQ ,. . . >A]<-_1} is a partition of X. Let À = À(Ç). By definition of B, V TXA = B. i=-» By the Kolmogorov-Sinai Theorem (4.9), h(T) = lim i H(À v T-1À v . . . v T~(n-1)A) . n-*» Atypical element of Ç ( A v T-1A v . . . v T-^-15 A) is a. n t-1a. n ... n T~(n-1)A. 10 11 n-1 {{xn}: x0=iQ, ^1--i1,---^n.1--±n_1} which has measure p. -p. ...-p. . Thus, 10 11 n-1 H(A vT-1A v ... vT"'""1^) = - 11 (p. •••••Pn- )-log(p. -...-p. ) 10 n-1 0 n-1 k-1 = - I! (P.: ■■•••Pi )[l0g p. + ... + log p. ] in=...=i ,=0 x0 xn-l x0 n-1 0 n-1 k-1 = - n Y. Pi • log p • • i=0 1
96 k-1 Therefore, h(T) = h(T,A) = - Z p.•log p.. // i=0 1 1 Corollary: The 2-sided (1/2,l/2)-shift has entropy log 2; the 2-sided (1/3,1/3,l/3)-shift has entropy log 3. Thus these transformations cannot be conjugate. (7) The 1-sided (pQ,...,p, _.)-shift has entropy k-1 - I Pi-log p-• i=0 1 1 The proof is very similar to the one in example (6) but Theorem 4.10 is used instead of Theorem 4.9. An example of a transformation with infinite entropy is the following: (8) Let I = (0,1] with Borel sets and Lebesgue measure. Let oo X = TT ! with product measure and let T be the shift on X. Then h(T) = -. To see this let A • = < {x. }: i^i < xn 5 - n>0 1 < i 5 n > . n,i\l3Jn On J Then m(A .) = 1/n and Ç = {A .,...,A } is a partition of X. n,3. n n,x n,n Hence h(T,£ ) = log n by the same argument as used in example (6) -1 -k (using the independence of Ç ,T Ç ,...,T Ç ). Therefore, h(T) 2 log n for each n, and so h(T) = -. // §7. How good an invariant is entropy? Definition 4.5: An invariant P for an equivalence relation is a complete invariant if whenever T and S both have the property P then T and S are equivalent.
97 Entropy is, in general, far from complet-». (a) An example of two ergodic measure-preserving transformations with equal entropy which are not conjugate. Let T: K -* K be defined by T(z) = az, a € K, where {a11} is dense in K, and let S: K -* K be defined by S(z) = bz, b € K, where {bn} is dense in K. T and S are ergodic and h(T)= 0=h(S) by example (2) of the previous section. If we choose a,b so that {a11}"^ i {b }_^ then T and S are not conjugate (in fact, they are not even spectrally equivalent) by Theorem 3.1. // (b) An example (due to Anzai) of two ergodic and spectrally equivalent measure-preserving transformations with equal entropy which are not conjugate. Let T: K2 -> K2 and S : K2 -> K2 be defined by T(z,w) = (az,zpw) S(z,w) = (az,zqw) n °° • where {a }__ is dense in K and p,q are non-zero integers. Observe that T and S are affine transformations, T and S are ergodic, and h(T) = h(S) = 0 by example (5) of §6. By considering 2 the characters of K one can easily show that L (m) has a basis of the form {gn: n 2: 0} U {U^f : j € Z, q i 0} where gn(T) = angn. 2 3 Similarly L (m) has a basis {g } U {Ugh : j € Z, q > 0}. One then 2 2 defines a unitary operator W: L (m) -* L (m) by W(g ) = g and W(UTf ) = Ugh and extending. Clearly WUT = UgW showing T and S are spectrally isomorphic. However if p i +q T and S are not conjugate. As mentioned before conjugacy and isomorphism coincide for measure-preserving transformations of K equipped with completed Haar measure m. We shall show T and S are not isomorphic. Suppose 4>T = S* and 4>(z,w) = (f ( z ,w) ,g( z,w) ) . f and g are only defined almost everywhere
98 but this will not affect our argument as we shall consider them as members of L (m). We have f(T) = af and g(T) = fqg. Since f is an eigenfunction with eigenvalue a, by remark (3) of §1 of Chapter 3, f(z,w) = c*z for some c € K. The second equation then becomes g(T(z,w)) =cqzqg(z,w). If one now expresses g as a Fourier series, then it is straightforward to show that g(z,w) = kz w where k € K, pm = q for some m € Z and a = c^. So 4>(z,w) = (cz,kz w ) is an affine transformation and for <j> to be an invertible measure-preserving transformation one needs m = ±1, i.e., p = ±q. // However we can consider the problem of completeness of entropy for certain collections of measure-preserving transformations, and this we do in the next section. §8. Bernoulli and Kolmogorov Automorphisms (As general references for this section see Shields [2] and Friedman and Ornstein [2].) Definition 4.6: Let (Y,F,|j.) be a probability space. Let (X,B,m) = "(T (Y.F.m.) and let T: X -* X be the shift T({yn>) = {xn> where xn = yn+1 n € Z. T is an invertible measure-preserving transformation and is called the Bernoulli automorphism with state space (Y,F,m.). Examples of Bernoulli automorphisms : (1) the 2-sided (pQ,...,pk_1)-shift. Here Y = {0 ,1,... ,k-l} . (2) the example (8) of §6. Here y = (0,1]. 2 (3) If T is a Bernoulli automorphism so is T .
99 (i+) If T, and T„ are Bernoulli automorphisms so is T,xT Remark : If T is a Bernoulli automorphism then h(T) < » iff 3 a countable partition -q on (Y,F,|j.) 3 H(t|) < » and A(t|) = F. In this case h(T) = H(F). We shall call a probability space a Lebesgue space if the identity map of it is isomorphic (as a measure-preserving transformation) to the identity map on a probability space consisting of a subinterval of [0,1] with Lebesgue measurable sets and Lebesgue measure together with some atoms. Recently entropy has been shown to be a complete invariant for the class of Bernoulli automorphisms: Theorem 4.18: (Ornstein [1] and [2]) Two Bernoulli automorphisms (whose state spaces are Lebesgue spaces) with the same entropy are conjugate; i.e., entropy is a complete invariant for the conjugacy of Bernoulli automorphisms. (Isomorphism follows from conjugacy by the assumption on the state spaces.) Certain special cases had been worked out earlier by Meshalkin [1] and by Blum and Hanson [1]. This result reduces the conjugacy problem for Bernoulli automorphisms to their state spaces, since the entropy depends only on the state space. It is possible, for example, for a Bernoulli automorphism with a state space of two points to be conjugate to a Bernoulli automorphism with a countably infinite state space. Note : Given any x > 0 one can find n > 0 and {p-,p„ ,...,p }, n n p. £ 0 JT p. = 1 such that - £ p.-log p. = x. Hence V x > 0 1 i=0 1 i=0 1 1
100 3 a Bernoulli automorphism with entropy x. Corollary 4.16: (i) Every Bernoulli automorphism has an n-th root. (ii) Every Bernoulli automorphism can be written as a direct product of two Bernoulli automorphisms. Proof: (i) Let T be a Bernoulli automorphism and n > 0. Let S be a Bernoulli automorphism with h(S) = -h(T). Then Sn is a Bernoulli automorphism with entropy h(T), and therefore S and T are conjugate. (ii) Let T be a Bernoulli automorphism. Let S be Bernoulli with h(S) = Jj-h(T). Then h(SxS) = h(T) and since SxS is Bernoulli, SxS is conjugate to T. // Ornstein has proved many deep results about Bernoulli automorphisms , some of which we now summarize : Theorem 4.17: (Ornstein [3] etc.) (i) Every root of a Bernoulli automorphism is a Bernoulli automorphism. (S is an n-th root if Sn = T.) (ii) Let T be a Bernoulli automorphism. If F is a sub-a-algebra of B with TF = F then T restricted to the measure space (X,F,m|r) is a Bernoulli automorphism (i.e., a factor of a Bernoulli automorphism is a Bernoulli automorphism), (iii) If F ? 8 is an increasing sequence of a-algebras with TF = F and T restricted to (X,F ,m|E ) is a Bernoulli automor- n n n r n phism V n > 0, then T is a Bernoulli automorphism (i.e., an inverse limit of Bernoulli automorphisms is a Bernoulli automorphism). The following class of transformations were introduced by Kolmogorov in 1958 by analogy with regular stochastic processes.
101 Definition 4.7: An invertible measure-preserving transformation T of a probability space (X,B,m) is a Kolmogorov automorphism (K-automorphism) if 3 a sub-a-algebra K of B such that: (i) K c TK. (ii) V TnK 2 B. n=0 oo (iii) O T"nK 2 W = {X,*}. n=0 (If A,C are a-algebras A c C will mean V A € A 3 C € C with m(AiC) =0. If B.jB, € B then B1 £ B2 means m(B1\ Bj) = 0 and B. ? B„ means m(B,AB2) = 0.) Theorem 4.18 : Every Bernoulli automorphism is a Kolmogorov automorphism. Proof: Let the state space for T be (Y,F,m.). If F € F, let F < {xn> €X: x € F > € B. Let G = {F: F€ F}, which is called the 0 time-0 a-algebra. Let K = V T1G. We now verify that K satis- fies the conditions for a Kolmogorov automorphism. 0 . 1 . (i) K = V T1G c V 1 G = TK. i=-« i=-« oo oo jl . oo , (ii) V TnK = V V T1G = V T1G = B by definition of B. n=0 n=0 i=-» -» 00 00 (iii) We have to show f~) T~nK S H = {X,4>}. Fix A € P) T~nK = 0 0 oo -n oo D V T1G. Let B € V T*G, j € Z. Since A € V T1G, A and n=0 -" k=j i<j B are independent, and therefore m(A(lB) = m(A)m(B). The collection of all sets B for which m(An B) = m(A)m(B) is a monotone class,
102 and, by the above, contains U V T G. Therefore V B € B, j=-- k=j m(Af)B) = m(A)m(B). Put B = A, then m(A) = m(A)2 which implies m(A) =0 or 1. Hence H T"nK ? W. // n=0 It was an open problem from 1958 to 1969 as to whether the converse of Theorem 4.18 was true, i.e., whether a Kolmogorov automorphism is conjugate to a Bernoulli automorphism. This was shown to be false by Ornstein. Theorem 4.19: (Ornstein [6]) There is an example of a Kolmogorov automorphism T with the following properties: (i) T is not conjugate to a Bernoulli automorphism, (ii) T does not have a square root, (iii) T is not conjugate to T~ . Corollary 4.19: Entropy is not a complete invariant for the class of Kolmogorov automorphisms. Proof: Let T be the example of Ornstein. 3y Theorem 4.9 h(T) > 0. Choose a Bernoulli automorphism S with h(S) = h(T). S and T are not isomorphic by (i). Remarks : (1) Property (iii) of T contrasts with the behavior of ergodic transformations with pure point spectrum. (See Corollary 3.3.) (2) Ornstein's example is defined by induction and so is fairly complicated to describe. It is therefore important to check whether the more "natural" examples of Kolmogorov automorphisms are Bernoulli
103 automorphisms or not. We consider some of these at the end of this section. (3) Sinai has proved that if T is an ergodic invertible measure- preserving transformation of (X,8,m), which we assume is isomorphic to the unit interval with Lebesgue measure, and h(T) > 0 and if S is a Bernoulli automorphism with h(S) £ h(T) then there exists a measure-preserving transformation 0 such that 4>T = S*, i.e., S is a factor of T. The next theorem shows that Kolmogorov automorphisms and Bernoulli automorphisms are spectrally the same: Theorem 4.20: (Rohlin) A Kolmogorov automorphism T of a probability space (X,B,m), 2 where L (m) is separable, has countable Lebesgue spectrum. Proof: Assume B i {X,4>} = N or else the result is trivial. We have (i) K ° TK (ii) V TnK ? B (iii) D T~nK ? W. We split the proof into three parts : a) We first show that K has no atoms, i.e., if C € K and m(C) >0 then 3 D € K with D c C and m(D) < m(C). Suppose C is an atom of K with m(C) > 0. Then TC is an atom of TK and since K c TK either TC £ C or m(CnTC) = 0. If TC c C then TC % C since both sets have the same measure so that C ç° OT~nK and therefore m(C) = 1. Hence B % {X,*}, a contradiction. On the other hand, suppose m(TC (1C) = 0. Then either for some k > 0 T C c C (and we use the above proof to get a contradiction) or m(TkC n C) = 0 V k>0 and then C U TC U T2C U . . . has infinite measure, a contradiction. b) Let H = {f € L2(m): f is K-measurable}. Then U^H c H. Let _ m . H = V (6 UTH. From U~nH = © ujv (8 U^ ^ (n,m > 0) we conclude
104 that L (m) = (+) ILV © C where C is the subspace of constants. It suffices to show V is infinite-dimensional since if {f.,f„ ,f.,...} is a basis for V, then {fQ = 1, U-f : n €Z, m>0} is a basis for L2(m). o c) We now show V is infinite-dimensional. Since TK t K (we are o assuming B t H) we know V t {0}. Let g € V, g t 0 and then G = {x: g(x) i 0} satisfies m(G) > 0. Since G € K and using a) we know XQH = {xQf: f i H} is infinite-dimensional. XQH = v' (ê XQUTH where V c v so either V is infinite-dimensional (and hence V is) or X„UTH is infinite-dimensional. In this case there is a linearly independent sequence of functions {x„U_f } where the f J bin n are bounded functions in H. Then {gUTf } are linearly independent in H. It suffices to show these functions are in V. But if f € H then (guTfn,uTf) = (g,uT(ffn)) = o so gUTfn € V. // Corollary 4.20: A K-automorphism is strong mixing. Proof: By Theorem 2.3. // Kolmogorov automorphisms are connected to entropy by the following result (half of which was proved by Pinsker). Theorem 4.21: (Rohlin S Sinai [1]) Let (X,B,m) be isomorphic to [0,1] with Lebesgue measure. Let T: X -* X be invertible and measure-preserving. Then T is a K-automorphism iff h(T,A) > 0 V finite À % N; i.e., T has completely positive entropy.
105 Remark: This shows that K-automorphisms are "the opposites" of transformations with zero entropy (since h(T,A) = 0 V A in the zero entropy case). Examples : (1) Group Automorphisms. Rohlin proved that any ergodic automorphism of a compact abelian metric group is a K-automorphism and later Yusinskii proved the theorem in the non-abelian case. Katznelson [1] has shown that ergodic automorphisms of finite-dimensional tori are conjugate to Bernoulli automorphisms. Chu [l] and Lind [1] have independently extended Katznelson's results to the (countably) infinite- dimensional torus. Katznelson and Weiss [2] have also solved the case where the dual group is the discrete rationals but whether an ergodic automorphism of a general compact abelian metric group is Bernoulli is not yet known. (2) Markov Chains. Consider a two-sided Markov chain with transition matrix [p..]. The shift T on the space of sequences of integers becomes a measure-preserving transformation for the Markov measure defined by tPi-;^ and an initial vector [p-i » • • • >Pk] satisfying [pl5...,Pk][p^.] = Cpl5..-5pk]. (See Billingsley [1].) It is known that T is ergodic iff the chain is irreducible (i.e. , V pairs of states i,j 3 n > 0 with p.. > 0) and T is strong mixing iff the chain is irreducible and aperiodic (i.e., V states i g.c.d. {n: p.. > 0} = 1). Friedman and Ornstein [1] have shown that if T is strong mixing then it is conjugate to a Bernoulli automorphism. Therefore, from the point of view of ergodic theory mixing Markov chains are the same as Bernoulli automorphisms, i.e., we can represent the space as a direct product measure space so that T becomes the shift on the new space.
106 (3) One can generalize the notion of a finite-dimensional torus to obtain another kind of homogeneous space:--namely a nilmanifold. Let N be a connected, simple connected, nilpotent Lie group and D a discrete subgroup of N so that the quotient space N/D is compact. N/D is called a nilmanifold. When N = Rn and D = Zn we get an n-torus. The Haar measure on N determines a normalized Borel measure on N/D. If A: N ■* N is a (continuous) automorphism with AD = D then this induces a map A: N/D -* N/D, which we call an automorphism of N/D. A always preserves the measure m. Parry has investigated the ergodic theory of such maps and has shown that if A is ergodic then A is a K-automorphism. A subclass of the automorphisms of N/D, the Anosov ones, are known to be conjugate to Bernoulli automorphisms, but it has not yet been proved that the others are. The simplest examples are as follows: Let N satisfies the above conditions with the operation of matrix multi- 3 plication and the natural topology from R . Let f/1 mpX D = < 0 1 n : m,n,p € Z l\0 0 1/ Then N/D is a nilmanifold. The automorphism 2 1 x+y 0 1 1 2x+y z+x +xy+^4- of N induces an ergodic automorphism of N/D, and it is unknown if this is Bernoulli.
107 §9. Pinsker Algebra Let T be a measure-preserving transformation of a probability space (X,B,m) which is isomorphic to [0,1] with Lebesgue measure. Let P(T) = V {A: A c B, À finite, h(T,A) = 0}. This is called the Pinsker a-algebra. One can show that T_1P(T) = P(T). One can also prove that if À is finite then A c P(T) iff h(T,A) = 0. Thus, P(T) is the maximum a-algebra such that T restricted to (X ,P(T),m|p,_.,) has zero entropy. Note that P(T) = B iff h(T) = 0 and P(T) = N iff T is a K-automorphism (by Theorem 4.21). Theorem 4.22: (Rohlin) If T is an invertible measure-preserving transformation with h(T) > 0 then U™ has countable Lebesgue spectrum in L2(B) 0 L2(P(T)). This reduces the study of the spectrum of invertible measure- preserving transformations to those with zero entropy. Corollary 4.22: Transformations with pure point spectrum have zero entropy. 2 Proof : L (m) cannot have a subspace on which U™ has countable Lebesgue spectrum. The types of spectrum that can occur for zero entropy transformations are unknown. There are examples of zero entropy transformations with countable Lebesgue spectrum (from Gaussian processes and horocycle flows). In the space of invertible measure-preserving transformations of (X,B,m) with the weak topology, the set of transformations of zero
108 entropy forms a dense G,. Pinsker [1] conjectured that any ergodic measure-preserving transformation could be written as a direct product of one with zero entropy and a K-automorphism. However, (ii) of Theorem 4.19 allows us to obtain a counterexample for if T: X ■* X is the example of Ornstein with no square root then the transformation S of the space {0}xX|J{1>><x given by S(0,x) = (l,x). S(l,x) = (0,Tx) provides a counterexample to the Pinsker conjecture. This example is not 2 mixing (since S is not ergodic), but Ornstein has constructed a mixing transformation which violates the Pinsker conjecture. §10. Sequence Entropy (See: Kushnirenko [1].) Let (X,B,m) be a probability space isomorphic to [0,1] with Lebesgue measure. Let T: X -* X be an invertible measure-preserving transformation. Let r = {t.,t„,...} be a sequence of integers. Let A be a finite algebra A <= 8. Define hr(T,A) = lim sup ± H(T XA v . . . v T nA) n-*~ and define hp(T) = sup hr(T,A). A finite It is easily shown that h„(T) is a conjugacy invariant for each r. Entropy and spectral properties are connected by the following: Theorem 4.23 : (Kushnirenko [1]) T has discrete spectrum iff h„(T) =0 V r. One can also show that sup_h_(T) = » or log k, for some k > 0, k € Z; moreover, those T with suprh„(T) = log k are easy to describe.
109 Problem: If T has quasi-discrete spectrum (see Hahn and Parry [1]), what sequences give h„(T) > 0? h_(T) has been calculated except in the cases when T has zero entropy and r has "large gaps". h_(T) will only give new information when h(T) = 0. (See: Newton [1].) §11. Comments Entropy was introduced as a conjugacy invariant for measure- preserving transformations. It was soon realized that entropy was more than just an assignment of a number to each transformation. Kolmogorov automorphisms and transformations with zero entropy have received the most treatment. They are "opposites" from the point of view of entropy. Kolmogorov automorphisms are important for applications as it seems that the most interesting smooth systems are Kolmogorov and even Bernoulli. By Theorem 4.22 we know that the spectral theory of invertible measure-preserving transformations reduces to that for the zero entropy case. The following is still an open problem: If h(T) = 0 what kind of spectrum can UT have? For transformations with zero entropy the isomorphism problem is only solved for ergodic transformations with discrete spectrum, totally ergodic transformations with quasi-discrete spectrum and some other special cases. Sequence entropy may play a role in the isomorphism problem for zero entropy transformations. We note again that in the weak topology on the set of all invertible measure-preserving transformations on a given space (X,B,m), the set of transformations with zero entropy is a dense G. (countable intersection of open sets) and the set of weak mixing transformations is also a dense G.. Since the set of strong mixing
110 transformations is a set of first category it follows that "most" transformations are weak mixing, have zero entropy, but are not strong mixing. The main problem to consider for Kolmogorov automorphisms seems to be to find more examples of Kolmogorov automorphisms that are not conjugate Bernoulli automorphisms. One should first check this fact for all the usual ways of constructing new transformations from old ones (e.g., is a weak mixing group extension of a Bernoulli automorphism a Bernoulli automorphism?). Then one might hope to find a new invariant that may be complete for Kolmogorov automorphisms. §12. Non-invertible Transformations Suppose T: (X,B,m) -* (X,B,m) is measure-preserving but not necessarily invertible; assume that (X,B,m) is isomorphic to [0,1] with Lebesgue measure. Note that B :> T_1B :> T~2B ^ ... Let B = Pi T_nB; so, T_1B = B , and T|,v „ , is invertible. n=0 " " (X,B_,m; One can show that U™ has countable Lebesgue spectrum in 2 2 L (B) 0 L (B_) where countable Lebesgue spectrum in this situation means there is a basis of the form {Unf : n>J and m >0}. m This reduces the study of spectral properties of measure- preserving transformations to those which are invertible (in fact, invertible ones with zero entropy by Theorem 4.22). One can also show that P(T) c B (i.e., if h(T) = 0 then T is invertible modulo sets of measure zero; more precisely T is a measure algebra isomorphism).
Ill The analogous concept to a K-automorphism is an exact endomor- phism. Definition 4.8: T: X -* X is an exact endomorphism if O T"nB 2 H ; i.e., B_ % N. n=0 So exact endomorphisms are as far from being invertible as possible. Examples of exact endomorphisms are the one-sided Bernoulli shifts. Exact endomorphisms are strong mixing (by the above remarks about spectrum and a proof like that of Theorem 2.3). It was conjectured that every ergodic measure-preserving transformation is a product of an exact endomorphism and an invertible measure-preserving transformation. This is not so (Parry and Walters). Also, one-sided Bernoulli shifts with the same entropy are not necessarily conjugate since an m-to-1 map cannot be conjugate to an n-to-1 map if m i n. So entropy is far from complete for exact endomorphisms. Parry and Walters (1971) constructed two exact endomorphisms S,T with S"nB = T_nB V n > 0, S2 = T2 (=. h(S) = h(T)) but with S and T not conjugate. The method used involved the Jacobian of an endomorphism, a concept which was introduced in Parry [3]. (It is not known if there are two K-automorphisms S,T with 2 2 S = T but with S and T not conjugate.) Also, exact endomorphisms need not be conjugate to one-sided Bernoulli shifts; in fact a one-sided Markov chain which is exact need not be conjugate to a onesided Bernoulli shift.
Chapter 5 : Topological Dynamics §0. Introduction In measure theoretic ergodic theory one studies the asymptotic properties of measure-preserving transformations. In topological dynamics one studies the asymptotic properties of continuous maps. Theorem 5.0: Let X be a compact Hausdorff space. The following are equivalent: (1) X is metrizable. (2) X has a countable base. (3) C(X) (the space of all complex-valued continuous functions on X) has a countable dense subset. Proof: See Kelley [1]. We shall consider compact metric spaces X and homeomorphisms T: X -* X. C(X) is a Banach algebra with ||f|| = sup |f(x)| . x€X The map UT: C(X) -> C(X), defined by (UTf)(x) = f(Tx) is clearly a multiplicative linear isometry of C(X) onto C(X), i.e., IL, is a Banach algebra automorphism. Remarks : Compactness is a "finiteness" condition and corresponds to the assumption of a finite measure in the measure theoretic work. The assumption of metrizability is not needed for many of the results but it often shortens proofs and most applications are for metric spaces.
113 We assume that T is a homeomorphism, rather than a continuous map, for simplicity. Typical examples that we shall study are: Examples : (i) I on any X. (ii) Tx = ax where X is a compact metric group. (On such a group there exists a left invariant metric d, i.e., d(bx,by) = d(x,y) V b,x,y € X ). (iii) an automorphism of a compact metric group. (iv) an affine transformation Tx = a-A(x) where A is an automorphism of a compact group G and a € G. (v) Let Y = {0,1,...,k-l} with the discrete topology. Let 00 X = (f Y with the product topology. A metric on X is given by: — ea " Ix ~y I d({x},{y}) = £ n , " . The shift T ( T{x} = {y } with n n nr:. 2 ' ' y = x ., ) is a homeomorphism of X. Note that here we have a Jn n+i c special case of (iii) since X is a compact group under the operation {xn} + {yn} = {(xn+yn)mod(k)}, and T is an automorphism of X. d is an invariant metric on X. §1. Minimality X will denote a compact metric space and. T : X ■*■ X a homeomorphism. We would like to find a concept of "irreducible piece" to play the role ergodicity played for measure-preserving transformations. Definition 5.1: T is minimal if V x € X {Tnx: n € Z} is dense in X. 0_(x) = {Tnx: n € Z} is called the T-orbit of x.
114 Theorem 5.1: T is minimal iff TE = E and E closed =» E = * or X. Proof: Suppose T is minimal and suppose E is closed, E i <t> and TE = E. Choose x € E. Then 0T(x) c e by the T-invariance of E, and X = 0_(x) so E = X i.e., E = X. Conversely, V x € X, 0_(x) is a closed T-invariant non-empty set, and hence is all of X. // Definition 5.2: A closed subset E of X which is T-invariant is called a minimal set with respect to T: X -* X if T|_ is minimal. Theorem 5.2 : Any homeomorphism T: X •* X has a minimal set. Proof: Let E denote the collection of all closed non-empty T-invariant subsets of X. Clearly E i $ since X belongs to E. E is a partially ordered set under inclusion. Every linearly ordered subset of E has a least element (the intersection of the elements of the chain. The least element is non-empty by Cantor's intersection property.) Thus, by Zorn's Lemma, E has a minimum element. This element is a minimal set for T. // Remark : Ergodicity has the properties: (i) An ergodic transformation is indecomposable. (ii) Every measure-preserving transformation on a decent space can be decomposed into ergodic pieces in a nice way. By its definition, a minimal transformation is indecomposable. We know that each homeomorphism T: X •*■ X has a minimal set. Hswever, in general, one cannot partition X into T-invariant closed sets E such that X=Ue, TE =E V a and TI „ is minimal (although a a a ' h ° a a
115 we can in some important cases). If T has such a decomposition it is sometimes called semi-simple. An example of a transformation not admitting such a decomposition is an ergodic automorphism of a compact metric group. This will be clear from the next section. Definition 5.3: A point x € X is a periodic point of T if T x = x for some n i 0. The least such positive n with this property is called the period of x under T. Theorem 5.3: Let T: X •*■ X be a minimal homeomorphism. Then: (1) T has no nonconstant invariant continuous functions. (2) If X is not finite T has no periodic points. Proof: (1) fT(x) = f(x) implies fTn(x) = f(x) V n € Z, and so f is constant on the dense subset 0_(x) of X. Thus f is constant on X. (2) If Tnx = x, n i 0 then {x,Tx,...,Tn_1x} is a closed T-invariant set and by the minimality condition it is the whole space X. // Remarks : (i) If T has no nonconstant T-invariant functions then T need not be minimal. As an example of this, let A be an ergodic automorphism of a compact metric group G i {e}. Then A is not minimal since A(e) = e. But A satisfies property (1), since if fA(x) = f(x), f continuous, then by ergodicity, f = constant a.e. and, since Haar measure is positive on open sets f is constant everywhere . (ii) The fact that a minimal homeomorphism of a non-finite space has no periodic points rules out many important examples, such as ergodic automorphisms of compact metric groups.
116 We now check whether the examples mentioned in §0 are minimal or not. Examples : (i) I is minimal iff X = a single point. (ii) Let G be a compact metric group and T(x) = ax. T is minimal iff {a : n € Z} is dense in X. Proof: ( = ) 0T(e) = {an : n€Z}. (=) Let x € X. We must show that 0 (x) = X. Let y € X. 3 n- 3 a -* yx i.e. , n. a -x -* y ii n. T 1(x) - y. Therefore 0„(x) is dense in X. // (iii) An automorphism of a compact metric group G is minimal iff G = {e}. The proof is trivial. (iv) For affine transformations of compact metric groups necessary and sufficient conditions for minimality are known. For example, if G is also abelian and connected then T = a-A is minimal iff O BnG = {e} and [a,BG] = G n = 0 where B is the endomorphism of G defined by B(x) = x~ -A(x) and [a,BG] denotes the smallest closed subgroup of G containing a and BG. This was proved by Hoare and Parry [1], (v) The shift on k symbols is minimal iff k = 0. This is seen from (iii) above.
117 §2. Topological Transitivity Definition 5.4: T: X -* X is topologically transitive if 3 xQ € X 3 0T(xn) is dense in X. Note: T minimal => T topologically transitive. Theorem 5.4: The following are equivalent: (1) T is topologically transitive. (2) TE = E, E closed, E i X => E is nowhere dense (or, equivalent- ly, if TU = U, U open, U i <t> then U is dense). (3) If U,V are nonempty open sets then 3 n € Z 3 Tn(U) fl V t 4,. (4) {x € X: 0T(x) ?! X} is a set of first category. Proof: (1) =» (2). Suppose 0T(xQ) = X and let E i «, E closed, TE = E, E i X. Suppose U ç E is open, U i 4>. Then 3 p 5 Tp(xQ) € U £ E so that 0T(xQ) ç E and X = E, a contradiction. Therefore E has no interior. (2) =» (3). Suppose U,V i « are open sets. Then \J TnU is an open T-invariant set; so, it is necessarily dense by condition (2). Thus U TnU fl V i <t>. n = -~ (3) => (4). Let Ul5U2,...,U ,... be a countable base for X. Then 0^7x7 i X » 3 n 5 0T(x) fl Un = <t> ~ 3 n 5 Tm(x) € X\U V m € Z n
118 3 n with x € (~) Tm(X\U ) x € (J p| Tm(X\U ). n=l m=-» It suffices to show (~) T (X\U ) is a nowhere dense set V n. Its m=-» n 00 complement is {J T (U ) which is clearly dense by condition (3). m=— n Hence the result follows. (4) => (1). This is clear since a compact metric space is of second category. // The following theorem gives many examples of topologically transitive homeomorphisms. Theorem 5.5: Let X be a compact metric space and T: X ■* X a homeomorphism, m a Borel probability measure on X giving positive measure to every non-empty open set. If T is ergodic with respect to m, then m{x € X: 0T(x) = X} = 1. In particular, T is topologically transitive . Proof: Let U,,U„,... be a countable base for the topology. By the previous proof {x: 0T(x) i X} = U Pi TK(X\U ). 1 n = l k=— n The closed set (~) T (X\U ) is T-invariant, so by ergodicity has k=-- n measure 0 or 1. But U is contained in complement of this set and m(U ) > 0, since U is open. Therefore n n c ea m( O Tk(X\U )) = 0 k=- and so m{x: 0T(x) i X} = 0. Hence m{x: 0T(x) = X} = 1. //
119 Corollary 5.5: Let G be a compact metric group and T: G -* G an affine transformation. T is ergodic with respect to Haar measure «• T is topo- logically transitive. Proof: (=>) This is obvious as Haar measure is positive on non-empty open sets. (=) This proof is like the last part of the proof in example (5) of §4 Chapter 1, which deals with the case when G is connected and abelian. // Theorem 5.6: If T is topologically transitive then T has no nonconstant continuous invariant functions. Proof: Suppose f € C(X), fT(x) = f(x). If 0T(xQ) = X then f is constant on 0_(xQ), a dense set, and hence is constant on X. // Remarks : (1) If all the T-invariant continuous functions are constant then T need not be topologically transitive. The following is an example to illustrate this: Let X = K2x{0} U K2x{l} / / (e,0) ~ (e,l) i.e., two copies of the two-torus joined at the identity. Let 2 2 A: K -* K be an ergodic automorphism and define T: X -* X by T(x,0) = (Ax,0), T(x,l) = (Ax,l). 2 Then T is not topologically transitive since T preserves K x{0} 2 and K x{l}. However, each continuous T-invariant function is con- 2 2 stant since it must be constant on both K x{0} and K *{1}, because A is ergodic, and these two constants must be the same because they must agree at the point (e,0) s (e,l). //
120 (2) T can be topologically transitive and have a dense set of periodic points. To illustrate this we prove that any automorphism A 2 of K has a dense set of periodic points. 2 Fix n > 0. Consider the finite subgroup of K consisting of points of the form (w,,w„) where w, = wn = 1. These are all the 2 elements of K of group order n. Since A is an automorphism it preserves this finite subgroup and hence, each member of this subgroup is a periodic point for A. If we now vary n we obtain a dense set. This proof can obviously be extended to an automorphism of K , n > 1. // (3) Topologically transitive homeomorphisms enjoy some of the properties of minimal homeomorphisms and also allow other interesting things to occur; e.g., a dense set of periodic points. (2) and (3) of Theorem 5.4 show that topological transitivity is (in some sense) a topological analogue of ergodicity. Also, topologically transitive homeomorphisms are "indecomposable"; i.e., we cannot write X = U E , TE = E and E closed w a a a a a when T is topologically transitive. So it seems that topologically transitive homeomorphisms are better building blocks than minimal homeomorphisms. If T has a decomposition into minimal pieces then each piece is also topologically transitive. So, the best thing to do is to try to get a decomposition of T into topologically transitive pieces, and then see if these pieces are also minimal. A distal homeomorphism (i.e., x^y => 3 6>0 5 d(Tn(x) ,Tn(y)) > 6 V n € Z) can be decomposed into minimal pieces (Ellis [1]). An Axiom A homeomorphism can be decomposed into topologically transitive pieces (Smale [1]). But, not all homeomorphisms can be decomposed into topologically transitive pieces; e.g., see the example in remark (1) above.
121 The following gives a sufficient but not necessary condition for a topologically transitive homeomorphism to be minimal. Theorem 5.7 : If X is a compact metrizable space, T: X ■* X a topologically transitive homeomorphism, and if there exists a metric on X making T an isometry, then T is minimal. Proof: Suppose such a metric is d, i.e., d(Tx,Ty) = d(x,y). Let 0T(xQ) = X and consider x € X. We want to show that 0_(x) =X. Let y € X and let e > 0. There exist n,m € Z such that d(x,Tm(xQ)) < e, d(y,Tn(xQ)) < e so, d(y,Tn_m(x)) < d(y,Tn(xQ)) + d(Tn(xQ),Tn_m(x)) = d(y,Tn(xQ)) + d(Tm(xQ),x) < 2e. Therefore 0T(x) = X. // We now check our examples for topological transitivity. Examples : (i) I is topologically transitive iff X = one point. (ii) T(x) = ax is topologically transitive iff T is minimal iff T is ergodic iff {a11: n € Z} is dense in X. ni - ni a -xQ ■* yxQ i.e., a So, {a : n € Z} is dense in X. (Another proof would be to apply Theorem 5.7 or Corollary 5.5.) // (iii) An automorphism A of a compact metric group is topologically
122 transitive iff A is ergodic. (See Corollary 5.5.) (iv) An affine transformation T of a compact metric group X is topologically transitive iff T is ergodic. (See Corollary 5.5.) (v) The shift on k symbols is topologically transitive. Consider 00 X = "U" {0,1,...,k-l}, T = shift. We know T is an automorphism of — ea the compact metric group X. The Haar measure on X is the measure given by the weights 1/k,...,1/k. (To check this, fix x € X and show, by checking on rectangles and using Theorem 1.1, that this measure is invariant under translation by x.) T is ergodic with respect to Haar measure, and therefore T is topologically transitive by Corollary 5.5. §3. Topological Conjugacy and Discrete Spectrum When should we consider two homeomorphisms of compact spaces to be the "same" from a dynamical point of view? The following seems the most suitable: Definition 5.5: Let T: X -* X, S: Y -* Y be homeomorphisms of compact spaces. T is topologically conjugate to S if there exists a homeomorphism 0 : X -* Y such that «T = S$. Notes : (1) This is an equivalence relation. (2) If T and S are topologically conjugate then T is minimal iff S is minimal and T is topologically transitive iff S is topologically transitive. Definition 5.6: Let X be a compact metric space, T: X -* X a homeomorphism, f a complex-valued continuous function on X. We say that f is an
123 eigenfunction for T if 3 X € C 3 f(Tx) = Xf(x) V x € X, and f i 0. We then call X the corresponding eigenvalue for f. Remarks : Suppose T is topologically transitive. (1) f(Tx) = Xf(x), f € C(X) - |\| = l and |f(x)| = constant. Proof: f(Tx) = Xf(x) => |f(Tx)| = |x||f(x)|. Therefore, sup |f(Tx)| = |X|sup |f(x)| X€X x€X and since T is onto = sup |f(x)|. x€X Therefore |X| = 1. Hence, |f(Tx)| = |f(x) | and by Theorem 5.6 |f(x)| = constant. // (2) If fT = Xf, gT = Xg, f,g € C(X) then f = constant-g. Proof: By (1), |g(x)| i 0 V x € X since g é 0. Therefore (f/g)(Tx) = (f/g)(x) =» f/g = constant by Theorem 5.6. // (3) Eigenfunctions corresponding to distinct eigenvalues are linearly independent in C(X). Proof: Let f (Tx) = X f (x) where {X } are all distinct for n n n n n = l,...,k. Suppose V x € X, a1f1(x) + a2f2(x) + ... + 'akfk(x) = 0 where the a. € C for i = l,...,k. By applying the above equation to T x instead of x, we get alXlfl(x) + a2X2f2(x) + ••• + akXkfk(x) = ° V x € X- Hence
124 / 1 :v ,k-l ,k-l Xl X2 a-f-(x) a2f2(x) £~7VkV*7 \°/ a, f, (x) All the X. s are distinct so the matrix is nonsingular. Therefore V V x € X, akfk(x)/ \o i.e., a -f.(x) a. = 0, l 0 V x € X, i = 1,...,k, i.e i i = l,...,k since f- ^ 0. Hence, the f. s are linearly independent in C(X). // (4) The eigenvalues form a subgroup of the circle group K. Under our assumptions T has only countably many eigenvalues. To check there are only countably many eigenvalues it suffices to show that if h: X -* K is an eigenfunction corresponding to an eigenvalue t i 1 then ||h - 1|| > 1/4. For then two eigenfunctions, with values in K, corresponding to different eigenvalues will be greater than distance 1/4 apart and, since C(X) has a countable dense set, there can only be countably many eigenvalues. So let h(Tx) = xh(x) , t i 1. Choose xQ € X and p so that xph(x ) is in the left-hand half of the unit circle. Then |h -1|| = sup ||h(x) -1|| x€X ï ||h(Tpxn) -11 TPh(xQ) -1|| > 1/4. // Definition 5.7: Let T: X -* X be a homeomorphism of the compact metric space X.
125 We say that T has topological discrete spectrum if the smallest closed linear subspace of C(X) containing the eigenfunctions of T is C(X), i.e., the eigenfunctions span C(X). Note: When T is topologically transitive and has topological discrete spectrum, 3 f : X -* X, n = 1,2,... linearly independent, spanning C(X), such that f T(x) = X f (x). The following is a representation n n n ° theorem for topologically transitive homeomorphisms with topological discrete spectrum. Theorem 5.8: (Halmos and von Neumann Cl]) The following are equivalent for a homeomorphism T of a compact metric space X: (1) T is topologically transitive and is an isometry for some metric on X. (2) T is topologically conjugate to a minimal rotation on a compact abelian metric group. (3) T is minimal and has topological discrete spectrum. (4) T is topologically transitive and has topological discrete spectrum. Proof: (1) => (2). Let d be the isometry metric for T. Suppose 0T(xQ) = X. Define a multiplication * in Om(xJ by Tnx0*TmxQ = Tn+mxQ. We have d(Tnxn*Tmxn,TPxn*Tqxn) = d(Tn+mxn,TP+qxn) "O"-1- ^O'-1- "Q"1- "Q < d(Tn+mx0,TP+mxQ) + d(TP+mx0,TP+qx0) = d(Tnx0,TPxQ) + d(TmxQ,Tqx0), Hence, the map * : 0T(xQ) xOT(xQ) -* 0_,(xQ) is uniformly continuous and therefore can be extended uniquely to a continuous map a: XxX -» X.
126 Also, d(T nxQ,T mxQ) = d(Tmx0,TnxQ) and so, „ , , inverse „ . . 0T(xQ) > 0T(xQ) is uniformly continuous and can be uniquely extended to a continuous map of X. Thus we get that X is a topological group and is also abelian since it has a dense abelian subgroup {T xn: n € Z}. Since T(TnxQ) = Tn+ xQ = Tx fcTnxQ we have Tx = TxQftx and so T is the rotation by Tx0- (2) ■» (3). if T is a minimal rotation on a compact abelian group G then each character of G is an eigenfunction. Let A be the collection of all finite linear combinations of characters. Then A is a subalgebra of C(X) , contains the constants, is closed under complex conjugation, and separates points. Applying the Stone- Weierstrass Theorem we see that A = C(X). (3) =» (4) is trivial. (4) => (1). We can choose eigenfunctions f : X -* K, nil, with f (T) = X f and where the f are linearly independent and n n n n span C(X). Define a metric on X by: - |f (x) - f (y)| d(x,y) = I —IL n n=l 2n - |\f (x) - \f (y)| Then d(Tx,Ty) = I -^-2 —2-H = d(x,y). n = l 2n It remains to check that d gives the topology on X. If d(x ,x) -*0 then for all nil, as m -* », pr lfn(V - fn(x)l £ d(xm'x) "* °" Thus, V nil, f„(x„) -* f (x) as m-*» and since {f„} separates n m n n ' c
127 points, x -* x as m -* ». Conversely, suppose x -* x. Let e > 0, and choose N such that £ 4- < L- n=N+l 2n 2 By the continuity of the functions f..,..., f„ 3 M 3 m > M => |fi(xm) - fi(x)| < e/2 i = 1,...,N. If m > M then dUm'X) = i?i ? |fiUm) " fi(X)' i=l 21 2 2 i.e. , d(xm,x) -♦ 0. // Remark : If Tx = ax is a minimal rotation of a compact metric abelian group G it is straightforward to show that the set of eigenvalues of T is {y(a): y € G} and every eigenfunction is a constant multiple of a character. In fact, this follows from Theorem 3.3 since 2 each continuous eigenfunction is an L -eigenfunction. We have the following isomorphism theorem. Theorem 5.9: (Topological Discrete Spectrum Theorem) Two minimal homeomorphisms of compact metric spaces both having topological discrete spectrum are topologically conjugate iff they have the same eigenvalues. Proof: (1) The proof is along the lines of the proof of Theorem 3.1, but instead of using Theorem 2.1 we use the Banach-Stone Theorem. This says that if X,Y are compact spaces, $: C(Y) ■* C(X) is a bijective linear isometry, and $(f-g) = *(f)*(g), then there exists a homeomorphism ^ : X -* Y such that <Mf)(x) = f(^(x)).
128 (2) This theorem can also be proved using Theorem 5.8 and character theory. By Theorem 5.8 we can suppose T is a minimal rotation of a compact abelian group G, Tx = ax, and S is a minimal rotation of a compact abelian group H, Sy = by. We are assuming {Y(a): y € G} = {6(b): 6 € H}. Define a map 9 : H -* G by 9(6)(a) = 6(b). This is well-defined and a bijection. Moreover, 9 is easily checked to be a group automorphism and hence induces an automorphism C: G -* H. It is easy to show that CT = SC. // Remark: Thus the theory of topologically transitive homeomorphisms with topological discrete spectrum is entirely analogous to that of ergodic measure-preserving transformations with pure point spectrum. §4. Invariant Measures for Homeomorphisms In this section we consider some connections between the topological and measure theoretic systems. We first prove some results about Borel measures including the fact that a Borel measure on a metric space is determined by how it integrates continuous functions. By a Borel measure on X is meant a measure defined on the Borel subsets of X, (i.e., the smallest a-algebra containing the closed sets). Theorem 5.10: A Borel probability measure m on a metric space X is regular (i.e., if B denotes the Borel sets then V B € B and V e > 0 3 an open set U and a closed set C with C Ç B ç u and m(U \C ) < e). e e Proof: (The proof does not require X to be metric but that each closed set be a G..) Let R be the collection of all sets such that the regularity condition holds, i.e., R={A€8:Ve>0 3 open U , closed C with C Ç A ç U and m(U\C)<e}. We show
129 that R is a a-algebra. Let A € R", we show that X\A € R. Let e > 0. 3 open U , closed C with C Ç A £ U 3 m(U \C ) < e. Thus, X\U ç X\A £ X\C and (X\C ) \ (X\U ) = U \C , so e e eeee m((X\Ce) \ (X\Ue)) = m(Ue\Ce) < e. Therefore X\A € R. We now show R is closed under countable unions. Let 00 A,, A ,... € R and let A = U A.. Let e > 0 be given. There 1 i=l 1 exist open U , closed C such that C £ A Ç u and e,n e ,n e,n n e,n ea m(U \ C ) < e/3n. Let U = U U (which is open), e,nxe,n e , e, n r> n=l ~ k C = U C , and choose k such that m(C \ I I C ) < e/2. n=l n=l Let C e = I J C (which is closed). We have C Ç A ç U . Also, nVi e>n m(UAC ) < m(U \C ) + m(C \C ) e e e e e e nÇ1m(Ue,n\Ce,n) +m(£eXCe} E — + - = e. n=l 3n 2 Therefore R is a a-algebra. To complete the proof we show that R contains all the closed subsets of X. Let C be a closed set and e > 0. Define Un = {x € X: d(C,x) < 1/n}. This is an open set, U12 U2 2 ... 2 U 2 .. ea and O U. = C. Choose k such that m(U, \C) < e and let C = C i=1 i k e and Ue = Uk. This shows C € R. //
130 Corollary 5.10: For a Borel probability measure m on a metric space X we have that for a Berel set B m(B) = sup m(C) and m(B) = inf m(U). C closed U open C Ç B U 2 B Theorem 5.11: Let m,|j. be two Borel probability measures on the metric space X. If f dm = f d|i V f € C(X) then m = p.. Jx Jx Proof: By the above corollary it suffices to show that m(C) = M-(C) for all closed sets C £ X. Suppose C is closed and let e > 0. By the regularity of m there exists an open set U, U 2 C such that m(U\C) < e. Define f: X -> R by 0 if x ^ U d(x,X\U) f(x) = I. d(x,X\U) + d(x,C) if x € U. f is well-defined since the denominator is not zero. Also f is continuous, f = 0 on X\U, f = 1 on C, and 0 < f(x) < 1 V x € X. Then, M.CC) < f f dp. = I f dm 5 m(U) < m(C) + e. JX JX Therefore m.(C) < m(C) + e V e > 0, so |j.(C) S m(C). By symmetry we get that m(C) < |j.(C). // Theorem 5.12: (Riesz Representation Theorem) Let X be a compact metric space and J: C(X) -► C a continuous linear map such that J is a positive operator (i.e., if f > 0 then J(f) 5: 0) and J(l) = 1. Then there exists a Borel probability
131 = f f du measure n on X such that J(f) for all f in C(X). Proof: See Halmos [1], p. 247. // The next theorem expresses the fact that the unit ball in the dual space of C(X) is weakly compact. Theorem 5.13 : If {y. } is a sequence of Borel probability measures on a compact metric space X, then there is a subsequence (Mv,} which converges in the weak topology, i.e., 3 a Borel probability measure |j. on X such that [ f d^ - [ f d^ Jx ni JX for all f in C(X). Proof: We write |j.(f) = f du when f € C(X) and (i is a JX Borel measure. Choose f.,f2,... dense in C(X). Consider the sequence of complex numbers {a ( f, ) } . This is bounded by l|f-,||, and so has a convergent subsequence, say {|j. (f,)}. Consider {p. (f„)}; this is bounded and so has a convergent subsequence (2) (2 ) {p. (f„)}. Notice that {u: (f-,)} also converges. We proceed in this manner, and for each i î 1, construct a subsequence {|j. } of {|^n> such that i^^} £ {^1-1)} Ç ... Ç {|^(1)} £ {|^n>, and so that {M. (f)} converges for f = f, ,f „ ,. . . ,f. . Consider the diagonal {u:11 }. The sequence {|J. (f-)} converges for all i; thus (m- (f)} converges for all f € C(X) (by an easy approximation
132 argument). Let J(f) = lim |j.nn)(f). Clearly J: C(X) -* C is linear n-v» and bounded, as |J(f)| £ ||f||. Also J(l) = 1, and if f > 0 then J(f) a 0. By the Riesz Theorem, there exists a Borel probability measure n on X such that J(f) = f du for all f € C(X), i.e., JX f f d^n) - [ f d,x. // Jy " Jy X Corollary 5.13: The space of Borel probability measures on a compact metric space X is itself a compact metric space under the weak topology. Proof: Let f.,f„,... be dense in C(X). Define - |f fi dm " f fi d^ D(m,n) = Z • i = 1 2i||fil| D is a metric en the space of Borel probability measures which gives rise to the weak topology. The compactness follows from the previous theorem. // Theorem 5.14: (Krylov and Bogolioubov [1]) If T is a homeomorphism of a compact metric space X then there exists a Borel probability measure on X which is preserved by T. Proof: Fix x € X. For f € C(X) and n £ 0, define n-1 J„Cf) = =• T. f(T1(x)). n n i = o J : C(X) -* C satisfies the conditions of the Riesz Representation n Theorem (note that |j (f)| £ l|f||)3 so there exists a Borel probability measure u. on X such that J n
133 J (f) = I f du for all f € C(X), n J n By Theorem 5.13 there exists a subsequence {|j. } and a Borel probability measure n on X such that J (f) = [ f dp. + fd|i for all f € C(X). nj Jx nj Jx , n. Since |j (foT) - J (f)| = — | fT D(x) - f(x)| n3 n. n. < ^-2||f|| -> 0 as j -> - we have foT dp. = f d|x, JX JX i.e., J f d|^T-1 = f d|i V f € C(X). jX JX So by the uniqueness Theorem 5.11 for Borel measures we have that M.(T-1B) = |j.(B) for all Borel sets B. // Theorem 5.15: Let T be a homeomorphism of a compact metric space X, and let M„ denote the collection of all T-invariant Borel probability measures on X (by Theorem 5.14, M_ i 0). Then (1) MT is closed in the weak topology, (2) MT is a convex set, and (3) if m € Mm then m is an extreme point of Mm iff m is ergodic with respect to T. Proof: (1) Suppose {|j. } c M_ converges to |j. in the weak topology. Then
134 J fT d^n "* J fT d II f f d,n - J f d, so that |j. is T-invariant. (2) is obvious. (3) Suppose m € M™, m not ergodic. There exists a Borel set E such that T~ E = E a.e. and 0 < m(E) < 1. Define measures m, and m„ by m ,„-, - m(B n E) an , m ,R, _ m(B n (X\E)) m. (B; = and m„(B; = 1 m(E) 2 m(X\E) for B a Borel set. Note that m. and m„ are in M_, nu i m„, and m(B) = m(E)m1(B) + (1 - m(E))m2(B), so that m is not an extreme point of M_. Conversely, suppose m € M_ is ergodic, and m = am. + (l-a)m„ where m.,m„ € MT, 0 < a < 1. We must show m. = m„. m. « m (m. is absolutely continuous with respect to m) so that the Radon- r dm.(x) Nikodym derivative dm-,/dm exists, (i.e., m-.CE) = dm(x) ). 1 JE dm -1 f aml So, m, (E) = m.CT E) = —- dm(x) 1 ! J _x dm T XE f dmi -i .n _L(T xy) dm(T Xy) ]■£ dm f dml -1 —-CT y) dm(y). J-p dm
135 dm, _, dm. Therefore -j—— (T y) = g=—Cy) a.e.(m) (by uniqueness of the Radon- Nikodyin derivative). But, m is ergodic, so that dm,/dm = constant = k a.e.(m). Therefore 1 = m,(X) = J k dm = k-m(X) = k. Jx Since k = 1, m. = m and therefore m„ = m = m, . // Definition 5.8: T is uniquely ergodic if there is only one T-invariant Borel probability measure on X, i.e., M„ = one point. Remark: T is uniquely ergodic with respect to m implies that T is ergodic with respect to m. Unique ergodicity is connected to minimality by: Theorem 5.16 : Suppose T is uniquely ergodic and m is its unique invariant measure. T is minimal iff m(U) > 0 for all nonempty open sets U. Proof: Suppose T is minimal. If U is open, U i ^ then X = U Tn(U) ; so if m(U) = 0 then m(X) =0, a contradiction. n=-» Conversely, suppose m(U) > 0 for all open nonempty U. Suppose also that T is not minimal, i.e., there exists a closed set K such that TK = K, K i X. T|K has an invariant Borel probability measure p.K on K by Theorem 5.14-. Define |j. on X by |j.(B) = HK(K(1 B) for all Borel sets B. Then \i € MT and p. i m because m(X\K) > 0 as X\K is nonempty and open while |j.(X\K) = 0. This contradicts the unique ergodicity of T. // The following results formulate unique ergodicity in terms of
136 ergodic averages. Theorem 5.17: The following are equivalent: n-1 . (1) V f € C(X), =• Y. f(T x) converges uniformly to a constant. n i = 0 i n_1 i (2) V f € C(X), =• Y. f(T x^ converges pointwise on X to a n i = 0 constant. (3) 3 m € MT 3 V f € C(X) and V x € X, n-1 dm. n . r n iTo J (4) T is uniquely ergodic. Proof: (1) =» (2) holds trivially. (2) = (3). Define k: C(X) ■* C by 1 n"1 k(f) = lim - Y. fT (x). Observe that k is a linear operator and is continuous since | J"!1 fT^x) | < |f|. n i = 0 Also k(l) = 1 and f i 0 => k(f) ï 0. Thus by the Riesz Representation Theorem there exists a Borel probability measure m such that k(f) = I f dm. But k(fT) = k(f) and so, I fT dm = If dm, i.e., f dmT~ = f dm which implies that mT~ = m by 5.11, so that m € MT- (3) =» (4). Suppose that v € MT- We have i F fTx(x) - f* V x, n i=0 where f = f dm. Integrating with respect to v, and using the
137 bounded convergence theorem we get that [ f dv = [ f* dv = f* = J f dm V f € C(X). Hence v = m by 5.11. Therefore T is uniquely ergodic. 1 n_1 i (M-) =• (1). If - ^ fT (x) converges uniformly to a constant n i=0 then this constant must be f dm, where m is the unique T-invariant measure. Suppose (1) is false. Then 3 g € C(X), 3 e > 0 3 V N 3 n > N and 3 x € X 3 n n-1 dm I > e . n" £0 gTi(V - J 8 n-1 1 * Define J : C(X) -> C by J (f) = - F fT1(x ). J satisfies the n J n n . _ « n n conditions of the Riesz Representation Theorem. Hence, J (f) = f da for some Borel probability measure |j. . Moreover, there exists a subsequence {a } such that i Jn_(f) = J f d,n_ -. \ f d, for all f € C(X) and for some Borel probability measure \i (by Theorem 5.13). Then | J (fT) - J (f)| = -jL |fT 1(xn ) - f(xn )| ni ni ni ni ni l so that fT d(i = f dp.. Hence, p. € M_. But, | g dp. - g dm| > e so that p. i m contradicting the uniqueness of m. // We now see which of our examples are uniquely ergodic.
138 Examples : (i) I is uniquely ergodic iff X = one point, since M_ = all Borel probability measures. (ii) T(x) = ax on a compact group is uniquely ergodic iff T is minimal. Proof: (=>) follows from Theorem 5.16 and the fact that Haar measure is positive on open sets. («=) T is minimal «• {a11} is dense in G. Therefore G is abelian. If 1 i y € G then \ ¥ rC^x) = i "i rCaScx) - ^ (r(a)"-i) . 0 as n _ n i=o n i=0 n y(a) - 1 (note that y(a) i 1). So (2) of Theorem 5.17 holds when f is a character and the condition (2) will hold for each f € C(X) by approximation, since finite linear combinations of characters are dense in C(X). // (iii) An automorphism of a compact group G is uniquely ergodic iff G = {e}, since Haar measure is preserved and so is the point measure concentrated at e. (iv) An affine transformation of a compact connected abelian metric group is uniquely ergodic iff it is minimal. Proof: (=>) follows by Theorem 5.16. (=) follows, as in example (ii), by checking that (2) of Theorem 5.17 holds. This was done by Hahn and Parry [1]. // (v) The Bernoulli shift on k symbols is uniquely ergodic iff k = 1. This is by example (iii). An excellent survey of unique ergodicity and related topics can be found in J. C. Oxtoby [1].
139 Recent results of Jewett [1] and Krieger [1] imply that any ergodic invertible measure-preserving transformation of a Lebesgue space is isomorphic in the sense of Chapter 2 to a uniquely ergodic system. This indicates a certain lack of measure-theoretic import for the concept of unique ergodicity. Hahn and Katznelson [1] have found uniquely ergodic transformations in shift spaces with arbitrarily large measure-theoretic entropies.
Chapter 6 : Topological Entropy Adler, Konheim, and McAndrew [1] introduced topological entropy as an invariant of topological conjugacy and also as an analogue of measure theoretic entropy. §1. Definition by Open Covers All logarithms are to base 2. Let X be a compact topological space. We shall be interested in open covers of X which we denote by a,p, . . . . Definition 6.1: If a,p are open covers of X their join a v p is given by: avp={AnB:A€a,B€p}. Definition 6.2: An open cover p is a refinement of an open cover a, written a < p, if every member of (3 is a subset of a member of a. In particular, a < a v p, {3<av(3. Definition 6.3: If a is an open cover of X and T: X -* X is continuous then T~ a = {T~ (A): A € a} is an open cover of X. Note: T-1(a v p) = T-1(a) v T-1(p) and a < p => T-1a < T-1p . Definition 6.4: If a is an open cover of X let N(a) = the number of sets in a finite subcover of a with smallest cardinality. We define the entropy of a by: H(a) = log N(a).
141 Remarks : (1) H(a) Ï 0. (2) H(a) = 0 iff N(a) = 1 iff X € a. H(a) is small means that there are a few sets in a which cover X. H(a) is large means that some part of X is covered by a large number of sets in a and not by a small number. (3) a < p = H(a) £ H(p). Proof: Let {B,,...>BN,g)} be a subcover of p with minimal cardinality. V i 3 A. € a 3 A. 2 B.. So, {A, ,. . . ,Ay,, „, } covers X and is a subcover of a. Thus N(a) £ N({3). // (4) H(a v p) s H(a) + H(p). Proof: Let {A, , . . . ,A.,, .. } be a subcover of a of minimal car- 1' ' N(a) dinality, and {B, , . . . >BNf8^ } be a subcover of (3 of minimal cardinality. Then {Aj 0 B.: 1 £ i £ N(a), 1 < j < N(p)} is a subcover of a v {3 so, N(a v |3 ) £ N(a)N((3). // (5) If T: X -<• X is a continuous map then H(T~ a) £ H(a). If T is also surjective then H(T a) = H(a). Proof: If {A-, , . . . ,AN(. -, } is a subcover of a of minimal cardinality then {T~ A,,...,T~ A.,, . } is a subcover of T~ a, so N(T-1a) £ N(a). If T is onto, and {T-^, . . . ,T-1AN(T-ia) } is a subcover of T~ a of minimal cardinality then {A, ,. . . ,A.,,T-i ■.} also covers X, so N(a) £ N(T-1a). // Theorem 6.1: If a is an open cover of X and T: X -<• X is continuous, then lim - H(a v T-1a v ... v T-(n-1)a) exists, n— n
142 Proof: Recall that if we set an = H(a v T-1a v ... v T (n 1}a) then by Theorem M-.M- it suffices to show that: a ï 0, and a. sa +a V nun. n n+m n m By (1), an ï 0, and a +m = H(o v T_1a v ... v T-(n+m-:L)a) n+m £ H(a v T-1a v ... v T~(n-1)a) + H(T"na v ... v T-(n+m-:L)a) by U) = a + H(T_na v ... v T"(n+m"1) n a) a„ + H(T"n(a v ... v T (m"1)a)) n an + H(a v ... v T (m-1)a) by (5) = a + a . // n m Definition 6.5: If a is an open cover of X and T: X ■* X is a continuous map then the entropy of _J_ relative to _a_ is given by:- h(T,a) = lim i. H(a v T-1a v ... v T~(n-1)a). n-»-» Remarks : (6) h(T,a) i 0 by (1). (7) a < p = h(T,a) < h(T,p). n-1 . n-1 _. Proof: a < p =» V T-:La < V T 1p , so by (3) we have that i=0 i=0 H( V T a) £ H( V T p). Hence h(T,a) < h(T,p). // i=0 i=0
143 Note that if {3 is a finite subcover of a then a < (3 so then h(T,a) £ h(T,B). (8) h(T,a) < H(a). Proof: By (4) we have H(a v T-1a v ... v T~(n-1)a) n-1 < £ H(T a) i = 0 < n-H(a) by (5). // Definition 6.6: If T: X ■* X is continuous, the topological entropy of T is given by: h(T) = sup h(T,a) a where a ranges over all open covers of X. Remarks : (9) h(T) 5: 0. (10) In the definition of h(T) one can take the supremum over finite covers of X. This follows from (7). (11) h(I) = 0 where I is the identity map of X. The next result shows that topological entropy is an invariant of topological conjugacy. Theorem 6.2: If X,,X„ are compact spaces and T • : X. -<• X. are continuous for i = 1,2, and are topologically conjugate, then they have the same entropy. Proof: Suppose ^: X, ■* X„ is a homeomorphism such that *T1 = T2*" Let a ^e an °Pen cover °f X9' Then>
mk h(T2,a) = lim i- H(a v T21a v ... v T2(n 1}a) n = lim i- H(«-1(a v T^a v ... v T~(n-1)a)) by (5) n = lim - HU-1a v T^1«"1a v ... v T^(n-1)0"1a) n = h(T1,«"1a). By taking suprema the result follows. // Adler, Konheim, and McAndrew proved several results about h(T). In the next section we give a definition of topological entropy for any uniformly continuous map of a metric space (not necessarily compact). This definition can also be given for uniform spaces. The definition will reduce to the previous definition in the compact case. We shall prove the properties of h(T) with this new definition. However, one result we would like to note is the following: Theorem 6.3: If T: X ■* X is a homeomorphism of a compact space X, then h(T) = h(T-1). Proof: h(T,a) = lim - H(a v T-1a v ... v T~(n-1)a) n = lim i- H(Tn-1(a v T-1a v ... v T-(n-1)a)) n n = lim i H(a v Ta v ... v Tn-1a) n n = h(T-1,a). // Adler, Konheim, and McAndrew could not answer the following questions which they stated as conjectures in their paper.
145 Notation : From now on the measure theoretic entropy of a transformation T preserving a measure m will be written h (T). r ° m Conjecture 1: Let X be compact and m a regular Borel measure on X. If T: X ■* X is a homeomorphism preserving m then hm(T) £ h(T). m [This has been proved by Goodwyn [1]. T need only be continuous. We shall give a proof when X is a finite-dimensional torus. (See Theorem 6.9.] Conjecture 2 : Let X be a compact metric space and T: X ■* X a homeomorphism. (By Theorems 5.10 and 5.14 we know that MT, the set of Borel measures on X invariant under T , is nonempty.) Then h(T) = sup h (T). m€MT [Partial contributions were made by Goodwyn [1] and Dinaburg [1], but Goodman [1] finally proved it. We can drop the condition of X being metric and allow T to be only continuous, provided we define M„ to be all T-invariant regular Borel probability measures on X.] Conjecture 3 : If X is a compact metric group and T is an automorphism of X then h(T) = h (T) where m is Haar measure, m [This was shown by Berg [1] and generalized by Bowen [i+]: T can be an affine transformation and need not necessarily be invertible. We shall prove this. (See Theorem 6.10.)] Conjecture 4 : Let {T.} be a one-parameter group of homeomorphisms of a compact space X. Then h(T ) = | -t | hCTx).
146 [This was proved by Bowen when X is metric] Conjecture 5 : Let X,Y be compact spaces. Let {T : x € X} be a family of homeomorphisms of Y so that T(x,y) = (x,Tx(y)) is a continuous map of XxY. Then h(T) = sup h(T ). x€X x [This was proved in the case where X and Y are metric by Bowen.] §2. Bowen's Definition If (X,d) is a metric space and x € X then B (x) will denote the open ball centered at x and of radius e. UC(X,d) will denote the collection of all uniformly continuous maps <t> : X -<• X. Let T € UC(X,d); n € Z, n > 0; and e > 0. If K ç X, a set F s X is said to (n,e)-span K with respect to T if V x € K 3 y € F such that max d(T1x,T1y) < e. 0<i£n-l For K compact, let r (e,K) be the smallest cardinality of any (n,e)-spanning set for K with respect to T. We show later (Theorem 6.4) that r (e,K) < -. Set r_(e,K) = lim sup — log r (e,K). n A set E c x is (n,e)-separated with respect to T if, whenever x,y € E, x ^'y then max d(T1x,T1y) > e. 0si£n-l
147 For K compact, let s (e,K) denote the largest cardinality of any (n,e)-separated subset of K with respect to T. We later show (Theorem 6.4) that s (e,K) is finite. Set sT(e,K) = lim sup — log s (e,K). We define h(T,K) = lim rT(e,K) = lim sT(e,K). These limits exist and are equal by Theorem 6.4. We then define hd(T) = sup h(T,K). K compact Remarks : (1) This definition can also be given in the context of uniform spaces. (2) h,(T) measures the amount of expansion in T (for the metric d). For r (e,K) and s (e.K) to increase as n increases we need n n some expansion for T. (3) The ideas for this definition come from the work of Kolmogorov on the size of a metric space. If (X,p) is a metric space then a subset F is said to e-span X if V x € X 3 y € F with p(x,y) £ e, and a subset E is said to be s-separated if whenever y,z € E, y i z, then p(y,z) > e. The e-entropy of (X,p) is then the logarithm of the minimum number of elements of an e-spanning set and the e-capacity is the logarithm of the maximum number of elements in an e-separated set. So in the above definitions we are considering the metric spaces (K,d ) where d is the restriction to the com- n n pact set K of the metric p (x,y) = max d(T1x,T1y). n 0£i<n-l Then h(T,K) = lim lim sup — [e-entropy of (K,p )]. (It follows from e^O n^- n n
148 the proof of the next theorem that to define h(T,K) it suffices to consider spanning sets for K which are subsets of K.) sT(e,K) is the average e-capacity of the spaces (K,d ) and h(T,K) is the limit of sT(e,K) as e -* 0. Theorem 6.4: Suppose K is compact. Then (i) r (e,K) £ s (e,K) £ r (e/2,K) < - and (ii) if e.^ < e then r*T(e ,K) > rT(e„,K) and IT(e1,K) ï sT(e2,K). Proof: (i). We first show r (e,K) < ». There exists a 6 > 0 such that d(x,y) < 6 implies max d(T1x,T1y) < e. 0£i£n-l Then r (e,K) is less than the number of 6-balls needed to cover K and hence is finite. We shall now prove s (e,K) £ r (e/2,K). Suppose E ç K is an n n (n,e)-separated set and that F (n ,e/2)-spans K. Define 0 : E -* F by choosing for each x € E some point 4>(x) € F with max d(T1«(x),T1(x)) £ e/2. If *(x) = $(y) then 0£i£n-l max d(T1x,T1y) £ e/2 + e/2 = e 0£i£n-l so that x = y. Hence ^ is one-to-one and the cardinality of E is less than or equal to the cardinality of F. Therefore s (e,K) £ rn(e/2,K). Finally we show r (e,K)£s (e,K). Let E be an (n,e)-separated subset of K of maximum cardinality. We claim that E (n,e)-spans K, since if not 3 x € K 5
149 max d(T1x,T1y) > e V y € E. 0<i<n-l Then E U {x} is an (n, e)-separated subset of K, contradicting the choice of E. (ii) is obvious. // Hence the definition of h(T,K) makes sense. Remarks : (1) hd(T) depends on d. (2) If K ç y. u .. . U Km are all compact then h(T,K) £ max h(T,K.). l£i£m Proof: Certainly, s (e,K) £ s (e,^) + ... + sn<e>Km>' Fix e > 0. V n choose K. (e) 5 n s (e,^ (e)) = max s (e,K. ), Then s (e,K) <m-s (e,K^ (e)) and so, n log s (e,K) £ log m + log s (e,K. (e)). Choose n. -* » such that — log s (e,K) ->■ lim sup — log s (e,K) n. n. n n and so that K. (e) does not depend on j (i.e., K. (e) = K(e) n3 n. V j). Thus, sT(e,K) £ sT(e,K(e)). Choose e ->■ 0 so that K(e ) is constant ( = K. , say). Thus, h(T,K) £ h(T,K. ) £ max h(T,K.). // 0 j 3
150 (3) V 6 > 0, in order to compute h, (T) it suffices to take the supremum of h(T,K) over compact sets of diameter less than 6. This is true by (2). U) If X is compact, hd(T) = h(T,X). Proof: By (2), if K c x, K compact, then h(T,K) £ h(T,X). // Definition 6.7 : Two metrics d and d' on X are uniformly equivalent if id. : (X,d) -> (X,d') and id. : (X,d') -> (X,d) are both uniformly continuous. In this case, T € UC(X,d) iff T € UC(X,d'). Theorem 6.5: If d and d are uniformly equivalent and T € UC(X,d) then hd(T) = hdi(T). Proof: Let e. > 0. Choose e„ > 0 5 d (x,y) < e„ => d(x,y) < e. and choose e„ > 0 5 d(x,y) < e„ => d (x,y) < e„. Let K be compact. Then r (e1,K,d) £ rn(e2,K,d ) and rn(e2,K,d') £ rn(e3>K,d). Hence, r^Ce^K.d) £ rT(e2,K,d') £ ?T(e3,K,d).
151 If e, -* 0, then e„ -* 0, and e_ -* 0 so we have hd(T,K) = hd.(T,K). // Remark: If X is compact and if d and d' are equivalent metrics then they are uniformly equivalent. Also, each continuous map T: X -* X is uniformly continuous. Theorem 6.6: (Lebesgue Covering Lemma) If (X,d) is a compact metric space and a is a finite open cover of X then there exists a 6 > 0 such that each subset of X of diameter s 6 lies in some member of a. Proof: Let a = {A, ,...,A }. Assume the theorem is false. Then for all n there exists B ç X such that diam(B ) < 1/n and B n n n is not contained in any A.. Choose x € B and select a subse- ^i n n quence {x } which converges, say x •*■ x. Suppose x € A. € a. Let a = d(x,X\A.) > 0. Choose n. such that n. > 2/a and d(x ,x) < a/2. Then if y € B n. J n • l l d(y,x) < d(y,x ) + d(x ,x) < — + - < a. 3 ni ni' ni 2 So y € A.. Hence B £ A., a contradiction. // ^ i ^ Theorem 6.7 : When X is compact, Bowen's definition of entropy coincides with the open cover definition. Proof: For the duration of this proof let h*(T,a) and h*(T) denote the numbers that occur in the open cover definition. Let a - {A,,...,A } be an open cover of X. We shall show that h*(T,a) S h(T). Let 6 be a Lebesgue number for a. Let F be a
152 (n,6/2)-spanning set for X of minimum cardinality. For z € F choose A. (z),...,A. (z) in a so that B,,„(T z) £ A. (z). Let x0 xn-l 6/2 xk C(z) = A. (z) fi T-1A. (z) H ... H T~(n-1)A. (z), 0 1 n-1 which is a member of a v T~ a v ... v T ~ a. We have X = U C(z) since if x € X 3 z € F 3 z€F max d(T1x,T1z) £ 6/2 Osi<n-l and hence x € T~k(Bg/2(Tkz)) ç l~kA± (z), 0 s k < n-1; so x €C(z). k Hence N(a v T-1a v ... v T~(n-1)a) < |F| = r (6/2,X), and h*(T,a) S rT(6/2,X) £ h(T,X) = h(T) since X is compact. Therefore h*(T) < h(T). To prove the converse let 6 > 0 be given. Choose an open cover a - {A,,...,A } of X such that diam(A.) < 6 for all i. Let E be an (n,6)-separated subset of X with maximal cardinality. Two members of E cannot belong to the same element of a v T-1a v ... v T~ a since if 11-1 --i x,y € Pi T JA. x,y € E j = 0 xj then max dCT^x^-^y) < 6 and so x = y. 0£j<n-l So, N(a v T-1a v ... v T~(n-1)a) > |E| = s (6,X). Therefore h*(T) > h*(T,a) ï s"T(6,X). Letting 6 -<• 0 we have h*(T) > h(T,X) = h(T) . //
153 Notes : (1) If we had set up the definitions using uniformities we would get the above for compact Hausdorff spaces. (2) Since a maximal separated set is spanning we get by the first part of the proof of Theorem 6.7 that s (6/2,X) ï N(a v T a v ... v t~ a) where 6 is a Lebesgue number for a. Theorem 6.8: (1) If T € UC(X,d) and m > 0 then h(Tm) = m-h(T). (2) Let Ti € UC(Xi,di) i = 1,2. Define a metric on X1xX2 by d((x1,x2) , (y1,y2) ) = max {d^x^y^ ,d2 (x2 ,y2) } . Then hd(VT2) ïh^CV +hd2(T2). If X, and X„ are compact then equality holds. Proof: (1). Since r (e,K,Tm) £ r (e,K,T) we have n mn ilog rn(e,K,Tm) £ ^ log r^Ce ,K,T) and therefore h,(Tm) < m-h,(T). d d Since T is uniformly continuous, V e>0 3 6>0 3 d(x,y) < 6 =» max dCT^x^^y) < e. 0£j£m-l So an (n,8)-spanning set for K with respect to Tm is also an (nm,e)-spanning set for K with respect to T. Hence, r (6,K,Tm) ï r (e,K,T) so, m-r_(e,K) £ r _(6,K). Thus, m-hd(T,K) < hd(Tm,K). (2). Let K. ç x. be compact, i = 1,2. If F. is an (n,e)- spanning set for K. with respect to T. then F,xF„ is an
154 (n,e)-spanning set for K,xK with respect to T,xT Hence, rn(e,K1xK2,T1xT2) < rn(e,K1,T1)•rn(e,K2,T2) which implies rT xT (e,K1xK2) £ ?T (e,^) + ?T (e,K2). Therefore WWV - ^^l^^ +hd2(T2'K2}- Let tt.: X,xx -* X. i = 1,2 be the projection map. If K ç X1*x2 ^-s compact then K, = tt.(K) and K„ = tt„(K) are compact and K ç ICxK Hence hd(T1xT2,K) £ hd(T1xT2,K1xK2). Therefore h (T xT ) = sup h,(T xT ,K) K£X1xX2 x L compact sup hd(T1xT2,K1xK2) Kl-Xl K2çX2 cpt. sup h, (T,,K,) + sup h, (T„,K„) Kl-Xl 1 K2çX2 a2 l l cpt. cpt. - h^CT^ + h^C^). Now suppose X, and X„ are compact. Let a. be an open cover of X. and have Lebesgue number 6. (i =1,2). If S. is a maximal (n,6. /2)-separated set for X. with respect to T. then S-,xS2 is an (n,6)-separated set for X-i*X2 with respect to T-i*T9 where
155 6 = min(61/2,62/2). Therefore sn(6,X1xX2) 5: sn(61/2,X1)-sn(62/2,X2) > N(a1 vT11a1 v . . . vT^11 1)a1)-N(a2 v T"1^ v . .. vT2(n 1)a2) by note 2 above. Hence h(T,xT„) > lim sup i log s (6,X,xX„) 1 I nn 1 I lim - log N(a1 v T^a, v . . . v T (n "^O + lim i- log N(a2 v T"1^ v . . . v T2(n 1}a2) h(T1,a1) + h(T2,a2). Since a1>a2 were arbitrary we get hCT.xT) > h(T.) + h(T2). // Remarks : (1) If T is a homeomorphism and T € UC(X,d), T-1 € UC(X,d) then h (T) i hd(T-1) in general. We shall show later that if T: R -* R is defined by T(x) = 2x then h(T) = log 2 while h(T~ ) = 0 using the usual metric on R. (Note that T has expansion but T~ does not.) However, on compact spaces h(T) = h(T~ ) (Theorem 6.3). This is because on a compact space T has "as much expansion" as does T. (2) Equality probably holds in (2) for non-compact X, and X„ but I do not know a proof. §3. Connections with Measure Theoretic Entropy In this section we shall prove conjecture 3 (assuming conjecture 1 is true) and we shall prove conjecture 1 when X is a
156 finite-dimensional torus. Theorem 6.9: (Goodwyn) Let X be a compact space and T: X -*■ X continuous. If m is a T-invariant regular Borel probability measure on X, then hm(T) £ h(T). We shall prove this theorem when X is a finite-dimensional torus since the proof is easier in this case. k k k Proof: Let X = K, T: K -<• K be any continuous map, and let m be,any T-invariant Borel probability measure on K . We wish to show that h (T) £ h(T). m Consider Kk as Rk/Zk with metric d(x+Zk,y+Zk) = inf ||x-y+v|| x,y € Rk v€Zk where || • || denotes the usual Euclidean norm. Fix an integer q > 0. Consider a decomposition of the unit v k-cube in R into all sets of the form r PX PX+1 P2 P2+l < (x, , . . . ,x, ) : — £ x, < , — £ x„ < , \ 1 k 2q 2q 2q 2q pk pk+1 — £ x, < where 9q k oQ 0 < pi < 2q for i = 1,.. . ,k | . This induces a partition of the torus K which we denote by k 1 Ç = {A, , . , . ,A , }. Any ball in K of radius —t-=- intersects at q l 2K1 2q k k most 2 members of Ç . Let y = {C,,...,C } be a cover of K by open balls of radius —-pr. Let 2 6 be a Lebesgue number for y For all x € Kk, fix some C(x) € Y with Bg(x) Ç c(x). Let F be
157 k . an (n,6)-spanning set for K with respect to T of minimal cardi- n-1 . nality. Let x € Pi T A. , A. € ç . Choose y € F with max dCT-'x.T-'y) £ 6, 0£j£n-l and hence, TJx € C(TJy). Thus TJx € A. fl C(TJy). Hence if 15 Un = «V""'^-!^ npQ T"JAi3 * *> then |Un| £ 2kn|F| = 2knrn(6,Kk). So, using Corollary 4.2, Hm(Ç vT-1£ v... vT"(n-1)Ç ) £ log |Un| £ n-log 2k + log rn(6,Kk) Thus, hm(T,Ç ) £ log 2k + lim i- log r (6,Kk) q n n = log 2k + ?T(6,Kk) £ log 2k + h(T) = k + h(T). (Note that we are taking logarithms to base 2.) But, A(Ç ) / B as q ■* -. So, by Theorem 4.14 h (T) = lim hm(T,£ ) s k + h(T). m m q But this holds for any continuous T, so, in particular, for Tn, n > 0. If n > 0 h (T) = - h (Tn) £ - [k + h(Tn)] = - + h(T) m n m n n so, by letting n ■* » we get the desired result. //
158 Theorem 6.10: (Bowen) Let X be a compact metric group and T: X -* X, T = a-A an affine transformation. If m denotes Haar measure on X then h (T) = h(T) = h (A) = h(A). m m Proof: By the previous theorem h (T) £ h(T), and so it remains r m to prove that h(T) £ h (T). Suppose d is a left invariant metric on X. Let B (y) = {x: d(x,y) < e} and n-1 _, , D(x,e,T) = O T KB (T x). n k=0 e By induction we shall show that T-kBe(Tkx) = x-(A"kBe(e)). It is true for k = 0 by the invariance of the metric d. Assuming it holds for k we prove it for k+1. T-(k+l)B (Tk+lx) = T-1(T-kBe(Tk(Tx))) = T-1(Tx-A-kBe(e)) - x.(A-(k+1)Be(e)). n-1 Hence, D (x,e,T) = x • C] A B (e) = x-D (e,e,A). n k=0 e n Also, m(D (x,e,T)) = m(D (e,e,A)). Let e > 0. Let £ = {A, ,...,A } be a partition of X into Borel n-1 , n-1 _, sets of diameter < e. If x ( f| T~ A. then O T A. S. k=0 1k k=0 \ n-1 _, v v x-D (e,e,A), since if y i C] T KA. then TK(x),TK(y) € A. V k, n k=0 xk xk and hence y € T~kB (Tkx) V k, i.e., y € D (x,e,T) = x-D (e,e,A).
159 n-1 . Thus, m( O T~ A. ) < m(D (e,e,A)) and taking logs we see that k=0 k n n F (0 T"kA. ) log m( 0 T kA. ) i0'"-'in-l = 1 "k "k n YL m(PlTkAi ) log m(Dn(e,e,A)) io'--^in-r1 log m(Dn(e,e,A)). Thus, hm(T) - hm(T'^) = lim n H(^ V '"" V T (n-1)£) n ï lim sup [- - log m(D (e,e,A))] Hence, since e was arbitrary we obtain that h (T) > lim lim sup [- - log m(D (e,e,A))] . m e-*0 n n n (The limit clearly exists.) Consider now an (n,e)-separated set with respect to T, E ç X, having maximal cardinality. Then U D (x,e/2,T) = (J x-D (e,e/2,A) x€E n x€E n is a disjoint union because of the choice of E. Therefore s (e,X)-m(D (e,e/2,A)) < 1 and so s (e,X) £ m(Dn(e,e/2,A)) Therefore sT(e,X) S lim sup [- - log m(D (e ,e/2,A))] n and letting e •*■ 0 we see that
160 h(T) = hd(T,X) £ lim lim sup C- ± log m(D (e,e/2,A))] e-*0 n m Thus, hm(T) = h(T) = lim lim SUP t_ n1 loS m(D (e,e,A))]. e-<-0 n We can replace T by A here since the right hand side is independent of a. // Note: The formula h(T) = lim lim sup [- - log m(D (e,e,A))] e-<-0 n illustrates that T measures "the amount of expansion" in T. §4-. Topological Entropy of Linear Maps and Toral Affines Our aim in this section is to compute the topological entropy (and hence by Theorem 6.10 the measure theoretic entropy) of affine transformations of finite-dimensional tori. Recall (see §6 of Chapter 0) that we can view the n-torus K either multiplicatively as KxKx...xK (n factors) or additively as Rn/Zn. Each endomorphism A of Kn onto Kn is given, in the additive notation, by A(x + Zn) = [A]-x + Zn x € Rn, where [A] is an n*n nonsingular matrix with integer entries. [A] determines a linear transformation A of Rn and ttA = Att where tt : Rn -* Kn is the natural projection given by tt(x) = x + Zn. Let || • || denote the usual Euclidean norm on Rn. We define a metric d on Rn/Zn by
161 d(x+Zn,y+Zn) = inf ||x-y+v|| x,y € Rn. v€Zn d is left and right invariant and, for every x € R tt maps the ball of radius 1/4- about x in Rn isometrically onto the ball of radius 1/4- about tt(x) in R /Z . The next theorem deals with such a situation and asserts that h,(A) = h~(Â) in this case, where d denotes the metric on Rn in- a d duced by the Euclidean norm || • ||. (Since ||Ax - Ay|| £ ||A|| • ||x - y|| we know A € UC(Rn,d).) This will reduce the problem of calculating the entropy of A to that of calculating the entropy of A. Theorem 6.11: Let (X,d),(X,d) be metric spaces and tt : X -* X a continuous surjection such that there exists 6 > 0 with "Ibr(2)! V*> -B6(n(x)) o an isometric surjection for all x € X. If T € UC(X,d) and T € UC(X,3) satisfy ttT = Ttt then h,(T) = h~(T). d d Proof : If K is compact in X of diameter < 6 then tt(K) is compact in X of diameter < 6. Every compact subset of X of diameter < 6 is of this form. Let e > 0 be such that e < 6 and if d(x,y) < e then d(Tx,Ty) < 6. Suppose E ç. K is an (n,e)-separating set with respect to T. We first prove that tt(E) is an (n ,s )-separating subset of tt(K) with respect to T. To prove this, let x i y belong to E. Then tt(x) i rr(y). Let iQ be chosen so that d(T x,T y) < e if i < iQ - ~V1~ ~i0+1~ and d(T x,T y) > e. (This can be done since E is an (n,e)- separating set with respect to T.) By our choice of e,
162 „ „i +1^ .i.+l^ ~i0+1~ ~i0+1~ d(T x,T y) < 6 and so T y € B (T x) which is mapped V1 ~ isometrically onto BS(T tt(x)). So, iQ+l „ iQ+l „ „ ~in+1~ ~in+1~ d(T U tt(x),T U rr(y)) = d(T U x,T U y) > e. Thus tt(E) is (n,e)-separated with respect to T. Therefore, s (e,K,ï) < sn(e,rr(K) ,T). To prove the converse inequality, suppose E is an (n,e)- separated subset of tt(K) £ X with respect to T, where K is compact and of diameter < 6. Let È = tt~ (E) n K. Then È is an (n,e)- separated set with respect to T since if d(T x,T y) £ e where x,y € È then d( T1^ (x) ^tt (y) ) < e. Hence, sn(e ,tt(K),T) £ sn(e,K,T). Therefore s (e,K,T) = s (e,n(K),T) and hence h„(T,K) = h,(T,rr(K)). d d By remark (3) of §2 h„(T) = h^(T). // Corollary 6.11: If A: Kn -<• Kn is an endomorphism then hH(A) = h~(A) where A a d is the linear map of Rn covering A, d is the metric on R determined from the Euclidean norm and d is any metric on K . We shall now proceed towards calculating the entropy of a linear map of Rn.
163 Theorem 6.12: Suppose A: Rp -* Rp is a linear map, and p a metric determined by a norm on Rp. Then: (i) h (A) > log |det A| if det A i 0, and (ii) if all the eigenvalues of A have the same absolute value t then h (A) = max {0,p-log t}. Proof: All norms on Rp are uniformly equivalent, so, by Theorem 6.5 we can assume that p is the metric given by the Euclidean norm. Obviously, A € UC(R ,p) as p(Ax,Ay) = ||Ax -Ay|| £ ||A||||x-y|| = ||A||p(x,y). (i). Let m denote Lebesgue measure on R . Then m(A(E)) = |det A|-m(E) for all Borel sets E Ç Rp. Let K £ Rp be compact and m(K) > 0. If F (n,e)-spans K then K ç IJ D (x,e,A) = U [x +D (0,e,A)] x€F x€F n-1 . . where D (x,e,A) = (~) A-1B (A x) (as in the proof of Theorem 6.10). n i=0 e Thus, m(K) < m(D (0,e,A))-r (e,K) = m(D (0,e,A))-r (e,K) . „. m(K) r (e,K) > m(Dn(0,e,A)) Therefore r.(e,K) 5r lim ^- [log m(K) - log m(D (0,e,A))] An n n = lim [- ^ log m(D (0,e,A))]. n = n n ., _n m(B (0)) But, m(D(0,e,A)) < m(A ln l;B (0)) = —-T n e i . . ,in-l det A
164 so that rA(e,K) ï lim - [log | det A|n -1" - log m(B (0))] n = log |det A | . Therefore h (A) ï h(A,K) > log |det A|. (ii). In this case, jdet Aj = |product of eigenvalues! = t". So, by (i) h (A) ï p log t and then, clearly, h (A) ï max {0 , p log t}. We now have to show the opposite inequality. Assume first that ||A|| > 1. Let K be a compact subset of R^ of diameter < 1/2. Choose b € RP such that 0 € b + K = K, . The diameter of K, < 1/2 b b p so that K, c I where b 1 ij e {(x1,...,xp) € RP: |Xi| £ 1 V i}. For 6 such that 0 < 6 < 1 let F(6) = {(n16,...,n 6): n^ € Z, |ni6| < 2}. Observe that |F(6)| £ (5/6)p, and 3 c > 0 5 V y € 1^, 3 x € F(6) 5 p(x,y) < c6. F(6) is an (n, ||A||nc6) -spanning set for K, with respect to A, since if y € K, 3 x € F(6) 5 pCaSc.A^) £ ||Ai||p(x,y) < HAl^cS £ [|A||nc6, for 0 < i £ n-1. But then, F(6) - b is an (n,||A|| c6)-spanning set with respect to A e for K. Let e > 0, and set 6 = < 1, for n sufficiently HA||n-c large. Thus
165 rn(e,K) £ I e \ < I 5llAHnc V l||A||nc/ ~ \ e ) for sufficiently large n. Also, r.(e,K) = lim i log r (e,K) a _ n n n £ lim 2- [log 5 + n log ||A|| + log c - log e] n P log ||A|| So, h (A) £ p log ||A|| and hence h (A) < max {0 , p log ||A||} if ||A|| > 1. If ||A|| < 1 then h (A) = 0 since a (1 ,e )-spanning set is an (n,e)- spanning set. Thus, in all cases h (A) s max {0 , p log ||A[[}. However, h (A) = - h (An) for n > 0, P n p < - max {0 , p log ||An|| } = max {0 , p log [|An|[ n>. But, ||An|| -+ the spectral radius of A, which here is precisely t. Therefore h (A) £ max {0 , p log t}. // Remark : If A: rP -<• rP is linear and the metric p is determined by a norm then h (A) = lim lim sup [- — log m(D (0,e,A))] where m is p e-<-0 n-*»
166 n-1 _. Lebesgue measure on Rp and D (0,e,A) - (~) A-1B (0). n i=0 e Proof: We can suppose p is determined by the Euclidean norm. In the proof of (i) of Theorem 6.12 we showed r.(e,K) ï lim sup [- - log m(D (0,e,A))] and hence a n n n-*™ h (A) ï lim lim sup [- - log m(D (0,e,A))]. Let K be the p-cube P e->0 n— n n q in Rp with center 0 and side length 2q. If E is an (n,e)- separated subset of K then {J D (x,e/2,A) is a disjoint union q x€E n and U D (x,e/2,A) = N x + D (0,e/2,A) ç K x. . Hence xTe n xTe n q+2e sn(e/2,K )-m(D (0,e/2,A)) £ (q+2e)P and hence s.(e/2,K ) £ lim sup [- - m(D (0,e/2,A))]. A <1 n-*~ n n Therefore h (T) = sup h (K ) < lim lim sup [- - m(D (0,e/2,A))]. // P pq n nn Theorem 6.13: Suppose A: Rp -* Rp is linear and p is a metric coming from a norm. Then E ic h (A) * *- log |\.| where X,,...,\ are the eigenvalues of A. Proof: By the Jordan Decomposition Theorem (Jordan Canonical Form), we can write Rp as a direct sum of subspaces RP = E, * ... ® E, 1 k where A(E.) ç Ei for i = l,...,k and A. = A|£ has all its i
167 eigenvalues with the same norm t.. Thus and A = A. (6 ... m A, 1 k h (A) s f h (A.) P ^i P ! by use of Theorem 6.8 and (since 6.8 is stated in terms of a specific metric) the fact that all norms on R^ are equivalent. By Theorem 6.12 k h (A) £ Y. max f° > dim E- - loS T-^ = L. (dim E. • log t.) T.>1 1 1 X = E log |\.|. |Xi|>l We can suppose p is determined by the Euclidean norm. By the above remark we have h (A) = lim lim sup [ log m(D (0 ,e ,A) )]. Write e-<-0 n-*» Rp as a direct sum of two subspaces Rp = F, ® F„ so that AF. Ç f. (i = l,2) and A. = A|_ has eigenvalues with absolute value greater 1 tl than one and A„ = A|_ has eigenvalues with absolute value less than or equal to one. Since D (0,e,A) Ç B (0) PI A~ B (0) we have ne e m(D (0,e,A)) £ c Idet A~ I for some c independent of n. n e ' 1 ' e r Therefore lim sup [- - m(D (0,e,A)] ï log |det A11 = 2- log \\^\ . Therefore n-H- " " Uil>l h (A) » I. log |\.j. //
168 Theorem 6.14: Suppose T: K^ ■* K^ is an affine transformation, Tx = a-A(x) where a € K^ and A is a surjective endomorphism of K^. If m is Haar measure, then h(T) = hm(T) = hm(A) = h(A) = L log |\i|, the X.'s being the eigenvalues of the matrix [A] which represents A. Proof: We know by Theorem 6.10 that h(T) = h (T) = h (A) = h(A) m m and by Corollary 6.11 that h(A) = h(A), where A denotes the covering linear map of A. h(A) is calculated in Theorem 6.13. // Note: We have given a full proof of this result when the space is a finite-dimensional torus (since we proved Theorem 6.9 only in this case). The above proof is due to Bowen. This formula for the entropy of an automorphism was first stated by Sinai [1]. §5. Expansive Homeomorphisms As an analogue of the measure theoretic concept of a generator, one could make the following definition: Let (X,d) be a compact metric space, and T: X -* X a homeo- morphism. Definition 6.8: A finite open cover a of X is a generator (weak generator) for T if for every bisequence {A } of members of a,
169 O T Â is at most one point ' n n=-~ (||TA is at most one point 1. n=— n / These concepts are due to Keynes and Robertson [1]. Theorem 6.15: T has a generator iff T has a weak generator. Proof: (=>) is trivial. («-). Let p be a weak generator for T, p = {B1,...,Bs}, and let 6 be a Lebesgue number for p. Let a be a finite open cover by sets A. having diam(A.) 5 6. So if A. is a bisequence n in a then V n 3 j 3 A. £ B. . Hence, """n •'n n T~nÂi ç ô T_nB- which is either empty or a single point. So a is a generator. // The following shows that a generator determines the topology on X. Theorem 6.16: Let a be a generator for T. Then V e>0 3 N>0 3 each N set in V T"na has diameter < s. Conversely, V N>0 3 e>0 -N such that d(x,y) < e implies N x,y € OT"nA -N n for some A ..,...,A„ € a. -N N
170 Proof: Suppose the first part of the theorem does not hold. 3 e>0 3 V j>0 3 x.,y., d(x.,y.) > e and 3 A. • € a, 3 3 J J J j1 A „-i. -j < i < j with x.,y. € (] T A. .. We can suppose that x. -* x, 3 3 i=-"i '1 y. •* y since X is compact, and hence x i y. Consider the sets A. n. Infinitely many of them coincide since a is finite. Thus x.,y. € A., say, for infinitely many j and hence x,y € A.. Similarly, for each n, infinitely many A. coincide and we obtain A € a with x,y € T~n . Thus, n ,J n x,y € H T"nÂn — ea contradicting the fact that a is a generator. To prove the converse let N > 0 be given. Let 6 > 0 be a Lebesgue number for a. Choose e > 0 such that d(x,y) < e implies d(T1x,T1y) < 6 for -N S i £ N. Hence if d(x,y) < e and |i| £ N then T1x,T1y € A. for some A. € a. Hence N x,y € O T_1A.. // -N The .analogue of the Kolmogorov-Sinai Theorem is: Theorem 6.17: If a is a generator for T then h(T) = h(T,a). Proof: Let {3 be any open cover. Let 6 be a Lebesgue number for p. Choose N > 0 so that each member of V T~ a has diameter N "N < 6. Then p < V T na, and so, -N
171 N h(T,p) < h(T , V Ta) -N , n-1 . N = lim 4 H( V T_1( V T~na)) k— i=0 -N , N+k-1 = lim 4 H( V T"na) k-~ K -N , 2N+k-l = lim ~ H( V T~na) k-- K 0 ,„., . . 2N+k-l - n. 2N+k-l 1 „/ \/ _-n v ■ iiz —*~' 2N+kTïH( Y = h(T,a). Therefore, h(T,{3) £ h(T,a) for all open covers p. Hence h(T) = h(T,a). // Remark: The same result holds for weak generators. Generators are connected with the notion of expansive homeomor- phism, which was studied long ago. Definition 6.10: A homeomorphism T: X -* X is expansive if 3 5 > 0 5 if x ?! y then 3 n € Z 5 d(Tnx,Tny) 5 6. We call 5 an expansive constant for T. Remark : Another way to define an expansive homeomorphism is as follows. Consider X*X with TxT acting on it. Define a metric D on XxX by D((u,v),(x,y)) = max {d(u,x),d(v,y)}. Then T is expansive «» 3 6 > 0 such that if (x,y) is not an element of the diagonal,
172 some power of TxT takes (x,y) out of the 6-neighborhood of the diagonal. The following theorem is due to Reddy, and Keynes and Robertson. Theorem 6.18: T is expansive iff T has a generator iff T has a weak generator. Proof: By Theorem 6.15 it suffices to show T is expansive iff T has a generator. (-) Let 6 be an expansive constant for T and a a finite 00 cover by open balls of radius 6/2. Suppose x,y f f) T A where A € a. Then, d(Tnx,Tny) £ 6 V n, so, by assumption x = y. Therefore a is a generator. («■) Conversely, suppose a is a generator. Let 6 be a Le- besgue number for a. If d(Tnx,Tny) < 6 V n then V n 3 A € a 3 Tnx,Tny € A and so, ea x,y € H T"nAn which is at most one point. Hence x = y and T is expansive. // Corollary 6.18 : (1) Expansiveness is independent of the metric (however, the expansive constant does change). (2) T is expansive iff T is expansive, k i 0- (3) Expansiveness is a topological conjugacy invariant. Proof: (1) This is trivial, since having a generator has nothing to do with the metric. (2). If a is a generator for T then a v Ta v ... v T~^k~ a
173 is a generator for T . If a is a generator for T then a is also a generator for T. (3) is trivial. // The next result shows how to find measure theoretic generators for expansive homeomorphisms. Theorem 6.19 : Let T be expansive with constant 6- If Ç = {C1,...,Cs} is a partition of X into Borel sets of diameter < 6, then Ç is a measure theoretic generator for any T-invariant Borel probability measure. Proof: Let C. be a bisequence of members of Ç. If n x>y € H T~nc- then Tnx,Tny ç Ci for all n, and hence -» n n ea d(Tnx,Tny) < 6 V n. By expansiveness x = y. Thus P)T~nC. = * -» n or = one point. Hence ee V TnA(ç) = B. // — ea Thus, for expansive homeomorphisms there are many measure theoretic generators. Examples : (1) Isometries are never expansive except on finite spaces. (2) Let A be an automorphism of the n-torus, and [A] the corresponding matrix. Then A is expansive iff [A] has no eigenvalues of modulus 1. Sketch of proof: One first shows that A is expansive iff the linear map A of R that covers A is expansive. Then show that
174 A is expansive iff the complexification of À is expansive. Then one shows that the complexification of A is expansive iff the transformation given by the Jordan normal form is expansive. Lastly, one shows that the normal form is expansive iff there are no eigenvalues of modulus 1. (Note: By Theorem 6.19, any partition of Kn into sufficiently small n-rectangles is a measure theoretic generator for an expansive automorphism of Kn.) (3) The two-sided shift on k symbols is expansive. Proof (1): Let the state space be {0,1,...,k-l}. Let Ai = |{xn>: xQ = i|, i = 0,1,...,k-l. Then AQ U A± U ... U Ak_± = X and each A. is open. a - {A-.,. . . ,A, -, } is a generator for the shift since if x € (~\ T~nAi where the A. € a then -» n n X = (•••,l_2'1_T51Q51T512''' We then use Theorem 6.18. // Proof (2): Let d be the metric given by: |x - y d({x >,{y }) = Z n -1 n' n Jn nr- 2'n' Suppose {x } i {y }. Then for some nn, x i y and n n u n. n. n„ n d(T °{x>,T °{yn}) - Z -err l*r y„ n"~ "n" n = _- 2|n| n+n0 n+n0 lxn " yn n0 n0 Thus 1 is an expansive constant. // Remarks : (a) If T: X -* X is expansive and Y is a closed subset of X with
175 TY = Y then T|„ is expansive. (b) If T,: X, -* X,, T„: X„ -* X„ are expansive then so is TjXT,: X,xX„ -* X,xX„. Any finite direct product of expansive homeo- morphisms is expansive, but infinite products are not. (c) If T: X -* X is expansive and S : Y -* Y is a homeomorphism with T X >-X * [ [ * Y >Y commutative for <t> : X -* Y a continuous surjection then S need not be expansive. S is expansive if ^ is a k-to-one covering map. So expansiveness is not preserved under the operation of taking factors. Example : (Parry and Walters) 2 - - Consider the 2-torus K ; identify (z,w) with (z,w). The map <t> is a 2-to-one covering map except at four points: (1,1), (-1,1), (-1,-1), and (1,-1). 2 2 . . .2 Let A: K -* K be an automorphism. The quotient space is S and 2 A induces a homeomorphism on S which can be shown to be non- expansive . Another example can be constructed as follows: Let Tz = az be a minimal rotation of K. We shall represent T as a factor of a subset of the two-sided shift on two symbols. Consider the cover of K by the closed intervals (arcs) between -1 and 1 on K. Call one of them AQ and the other A,. n n f ^*\ A If z € K\{a ,-a : n € Z} we can uniquely associate n a member of ~[T {0,1} to z by z iely associate / \ 1 Tnz € A . Let A denote the subset of Tf {0,1} n -00
176 arising in this way. The map *: A -> K \{an,-an: n € Z} defined above satisfies tf>S(x) = Ttf>(x), x € A where S denotes the shift, and if we can show <t> is uniformly continuous then <t> extends uniquely to a continuous map tt : A -* K with ttS = Ttt . Suppose s > 0 is given. Choose N > 0 so that {l,a~ ,a~ ,. . . ,a~ } is e/2-dense in K. Suppose {b } and {c } are two members of A such that b = c for |n| s N. We then have to show d(tf> ( {b }) ,<t> ( {c }) ) < e. Let x = tf>({b }) and y = +C{c }). The assumption b = c , |n| < N means that anx and any belong to the same element of the cover for |n| i N. If y = -x then this clearly cannot happen. So suppose the counter-clockwise distance from y to x is smaller than the clockwise distance. For some n with |n| £ N anx is in the open interval of length e starting at 1 and going counter-clockwise. Hence a y must also be in the upper half of the circle and by the assumption about the relative positions of x and y, ay must be between 1 and anx. Hence d(a x,a y) < e and so d(x,y) < s. We shall now show that every expansive homeomorphism is a factor of a subset of a two-sided shift. Theorem 6.20: Let T: X -* X be an expansive homeomorphism. Then 3 an integer k > 0, a closed subset 2 of oo xk = TT{o,i,-.-,k-i} _ oo such that ct2 = 2, where a is the shift on X, , and a continuous surjection tt : 2 -* X such that rra(y) = Trr(y) y € 2.
177 Proof: The proof will resemble that of the preceding example. Let 6 be an expansive constant for T. Choose a cover a = {A.,...,A, _.} by closed sets such that diam(A.) < 6 V i and so that the A. intersect only in their boundaries. Let D denote the union of the boundaries of the A.. Then D„ = U TnD is a first category set and so X\D_ is dense in X. For each x € X\D_ we can 00 • n assign, uniquely, a member of X, by x -* {a }_„ iff T x € A . n Let A denote the collection of all sequences arising in this way. If * : A -* X\D_ is the map defined above then *a(y) = T*(y) V y €A and if we can show $ is uniformly continuous it will then follow that tf> can be uniquely extended to a continuous map tt : A ■* X such that rra(y) = Trr(y) V y « Î. Let s > 0 be given. Choose N > 0 so that each member of N V Tna has diameter less than s, which can be done by Theorem 6.16, -N since we can enlarge each A. to an open set to obtain a generator (remembering diam(A.) < 6). If {a >,{b } € A and a = b for l n n n n N |n| £ N then *({a }), ({b }) are in the same member of V Tna n -N and so d(tf>{a },*{b }) < e. Hence <t> is uniformly continuous. // The periodic points of T are associated with a generator as follows : Theorem 6.21: Let T: X -* X be expansive and let a be a generator (or a weak generator). Then T x = x iff x = O Tk,1(A0 n ta1 n ... n t Ak_1) i = -» where A. € a j = 0,...,k-l.
178 k k-1 Proof: Suppose T x = x. Since a vTo v ... vl a is a cover k-1 t of X, x € An n TA. n ... n T A, . for some collection of A. s ' 0 1 k-1 -i : in a. Thus x € Tk,1(A0 H TA1 H ... H Tk_1Ak_1) V i, and this implies x € P) Tk'X(AQ n TA1 fi ... H Tk_1Ak_1) which is at most one point. Therefore, x = n Tk-:L(A0 n ta n ... n Tk-1Ak_1). The converse is trivial. // The following gives an estimate on the number of fixed points of Tn. Corollary 6.21: If T is expansive and a is a generator for T with M members, then Nn(T) = |{x: Tnx=x}| < Mn (n > 0). Topological entropy is connected to periodic points by Theorem 6.22 : If T is expansive, then h(T) > ca(T) = ïïm - log N (T). n— n n n-1 _. Proof: Let a be a generator for T. Each element of V T~ a 0 contains at most one point fixed by T , since if Tx=x, Ty=y
179 n-1 and x,y € P) T-1A. then i=0 :i x,y € O T"nk(A. n t_1a. n ... n t"^"1^. ) k=-- 30 :1 ^n-l which is at most one point. Therefore x = y. Thus, N (T) s N(a vT_1a v . . . v T-^-1^) n which implies that i log N (T) ; - H(a vT_1a v ... vT~(n_1)a) n n n so, lim - log N (T) < h(T,a) = h(T), Therefore co(T) 2 h(T). // Consider our examples: (1) Let A be an expansive automorphism of the torus & . N (A) = |{x: Anx = x}| = |Kernel (An-I)| |det ([A]n - I)| (by the proposition below) I 71" u? - i>| 1 1 whe re the X. are the eigenvalues of the matrix [A]. So, cù(A) = lim - £ (log |xn - l|), n— n X. l If |\.| > 1 then i log |Xn - 1| = £ Clog |X.|n + log |1 - X7n|] log |Xi| + 0 = log |X£|
180 If |\.I < 1 then £ log |Xn - 1| - 0. So, cù(A) = L log |\.| = h(A) by Theorem 6.14. |X.|>1 Therefore, in this case we have equality. (2) If T is the two-sided shift on k symbols then NR(T) = kn and co(T) = log k = h(T), so that here too we have equality. In these examples co(T) and h(T) coincide. However, this is not true for all expansive homeomorphisms as there are examples of minimal expansive homeomorphisms ( co(T) = 0 since minimality implies there are no periodic points when the space is infinite) with positive topological entropy (due to Furstenberg). Problem: If T is expansive and has a dense set of periodic points then is it true that co(T) = h(T)? Bowen has shown this to be true under the stronger assumption * * ft that T is an axiom A homeomorphism. In example (1) we used the following: Proposition : If B: K ■* Kn is an endomorphism of Kn onto Kn with corresponding matrix [B], (so det [B] i 0) then B is a |det [B]|-to- one map. (We used the case B = An - I where [A] has no roots of unity as eigenvalues.)
181 Proof: [B] is an invertible matrix since det [B] ï 0. Thus, we can write [B] = E,E„...E where the E. are elementary matrices. Since [B] has integer entries, the E. have rational entries. Each E. is one of the following forms: (1) I with two rows interchanged, (2) I with one row multiplied by c € Q, or (3) I with the j-th row replaced by j-th row + c(k-th row), c € Q. For each E., choose e. € Z such that e.E. has integer entries. In case (1), e.E. induces an |e.| -to-one map of K , i.e.., an |e.|n|det E.|-to-one map. In case (2), eiE4 induces an |e.| |c|-to- one map of Kn, i.e., an |e.|n|det E.|-to-one map. In case (3), e.E. induces an |e.| |det E.|-to-one map. Let C be the endomor- phism of Kn determined by e.E.. Let b' be the endomorphism determined by e....e [B]. Then B' = C, o .. . °C . If B' is a b'-to-one map and B is a b-to-one map then since B = C, o ... o r But, also since [B ] = e, ...e [B], so r IT i=l r TT i=l r TT i=l b' |e.|n|det E.| |e. |n • TT |det E i=l |e.|n|det [B]|. = TT |e.|n-b i=l x b = Idet [B]|. // Consider <b(T) = lim ±. log N (T). n-*» n n co(T) is connected with the ^-function which was introduced (for
182 diffeomorphisms) by Artin and Mazur [1]: If T: X -* X is a homeomorphism such that N (T) < - for all n > 0 then we set ÇT(z) - exp(nÇin-znNn(T)), z € C. Note that <b(T) = log radius of convergence of £T (Artin and Mazur showed that "most" diffeomorphisms have a ^-function with positive radius of convergence, and Smale suggested that the ^-function might be a rational function of z for "most" diffeomorphisms. Manning [1] has shown this to be the case for axiom A diffeomorphisms but results of Simon [1] have answered Smale's conjecture 3 negatively (for K ).) It is known that there are no expansive homeomorphisms of S , 2 but not known if there are any on S . It seems reasonable to ask whether a compact metric space admitting an expansive homeomorphism is finite-dimensional. §6. Examples We consider the topological entropy of some examples. (1) Isometries have zero entropy. This is clear from Bowen's definition. Hence rotations on compact metric groups and all topologi- cally transitive homeomorphisms with topological discrete spectrum have zero entropy (Theorem 5.8). (2) The two-sided shift on k symbols has entropy log k. This is proved by considering the obvious generator. (3) Any homeomorphism of K has zero topological entropy.
183 Proof: Let T: K -* K be a homeomorphism. T maps intervals to intervals as the intervals are the connected subsets. Suppose the circle has length 1. Choose e > 0 such that d(x,y) < e ■=> d(T"1x,T"1y) 2 1/"+. Consider spanning sets for K with respect to T. Clearly, r,(e,K) S Cl/e] + 1, where [•] denotes the least integer function. We estimate r (e,K). Suppose we have an (n-l,e)-spanning set F of minimal cardinality r _,(e,K). Consider the points of Tn F and the intervals they determine. Add points to this set so that the new intervals have length < e. We have added at most I — + 1 points. Let F' = F U T"(n-1)(these new points). We claim that F* is an (n,e)-spanning set for K. Let x € K. Then 3 y € F 3 max d(T1x,T1y) £ e. 0£i£n-2 If d(Tn~ x,Tn~ y) £ e then our claim is proved. If there is no y € F with both these properties, choose a y € F 3 max d(T1x,T1y) £ e. 0<i£n-2 Consider the interval between T ~ x and T y which is mapped by T~ to the e-interval [Tn~ x,Tn~ y]. Choose a point Tn~ z, z € F1 inside the chosen interval [Tn~ x,Tn~ y] and with d(Tn x,Tn~ z)se. Then Tn-2z lies in the e-interval [Tn_2x,Tn-2y] and so, d(Tn_2z,Tn_2x) s e. The e-interval [Tn_2x,Tn_2y] is mapped by T_1 to an interval of length £ 1/M-, and hence to the e-interval
184 [Tn_3x,Tn_3y]. So, since Tn-3z is in this interval, d(Tn-3x,Tn~ z) £ e. Similarly, by induction d(T1x,T1z) £ e V i, 0 £ i £ n-1. Thus, F is an (n,e)-spanning set for K. So, r (e,K) S rn_1(e,K) + [1/e] + 1 2 n([l/e] + 1). Therefore, r_(e,K) = lim -log r (e,K) = 0, so, h(T) = 0. // Corollary: Any homeomorphism of [0,1] has zero topological entropy. Proof: T: [0,1] -> [0,1] has either T(0) = 0 and T(l) = 1 2 or T(0) = 1 and T(l) = 0. In both cases T induces a homeomorphism of K. // (4) If T: Mm -* Mm is a differentiable map of an m-dimensional Riemannian manifold M111 with Riemannian metric || • || , then h (T) < max {0 , m log sup ||dT || } p x€M x where dT : M ->■ M_, , is the derivative of T at x and p is the x x T(x) metric on M determined by the Riemannian metric. This has been proved by several people. --2--
Bibliography R. Abraham: [1] Foundations of Mechanics, Benjamin, 1967. R. Abraham and J. Robbin: [1] Mappings and Flows, Benjamin, 1967. R. L. Adler: [1] Skew products of Bernoulli shifts and rotations, Israel Journal of Mathematics, vol. 12, pp. 215-222, 1972. R. L. Adler, A. G. Konheim and M. H. McAndrew: [1] Topological entropy, Transactions of the American Mathematical Society, vol. lit, pp. 309-319, 1965. R. L. Adler and P. C. Shields: [1] Skew products of Bernoulli shifts with rotations, Israel Journal of Mathematics, vol. 12, pp. 215-222, 1972. R. L. Adler and B. Weiss: [1] Similarity of Automorphisms of the Torus, Memoirs of the American Mathematical Society, no. 98, 19 70. H. Anzai: [1] Ergodic skew-product transformations on the torus, Osaka Mathematics Journal, vol. 3, pp. 83-99, 1951. M. Artin and B. Mazur: [1] On periodic points, Annals of Mathematics, vol. 81, pp. 82- 99, 1965. L. Auslander, L. Green, and F. Hahn: [1] Flows on Homogeneous Spaces, Annals of Mathematics Studies, no. 53, 196 3.
186 A. Avez and V. I. Arnold: [1] Ergodic Problems in Classical Mechanics, Benjamin, 1968. K. Berg: [1] Convolution of invariant measures, maximal entropy, Mathematical Systems Theory, vol. 3, pp. 146-150, 1969. P. Billingsley: [1] Ergodic Theory and Information, Wiley, 1965. G. D. Birkhoff: [1] Proof of the ergodic theorem, Proceedings of the National Academy of Sciences USA, vol. 17, pp. 656-660, 1931. J. R. Blum and D. L. Hanson: [1] On the isomorphism problem for Bernoulli schemes, Bulletin of the American Mathematical Society, vol. 69, pp. 221-223, 1963. Bowen: [1] Topological entropy and axiom A, Proceedings of the Summer Institute on Global Analysis, Berkeley, California, pp. 23- 42, 1968. [2] Markov partitions for axiom A diffeomorphisms, American Journal of Mathematics, vol. 92, pp. 725-747, 1970. [3] Markov partitions and minimal sets for axiom A diffeomorphisms, American Journal of Mathematics, vol. 92, pp. 9 0 7- 918, 1970. [4] Entropy for group endomorphisms and homogeneous spaces, Transactions of the American Mathematical Society, vol. 15 3, pp. 401-414, 1971. [5] Periodic points, measures and axiom A, Transactions of the American Mathematical Society, vol. 154, pp. 377-397, 1971. [6] Periodic orbits for hyperbolic flows, American Journal of Mathematics, vol. 94, pp. 1-30, 1972. [7] Entropy-expansive maps, Transactions of the American Mathematical Society, vol. 164, pp. 323-331, 1972. [8] The equidistribution of closed geodesies, American Journal of Mathematics, vol. 94, pp. 413-423, 1972. [9] One-dimensional hyperbolic sets for flows, Journal of Differential Equations, vol. 12, pp. 173-179, 1972.
187 [10] Symbolic dynamics for hyperbolic flows, American Journal of Mathematics, vol. 45, pp. 429-460, 1973. [11] Topological entropy for noncompact sets, Transactions of the American Mathematical Society, vol. 184, pp. 125-136, 1973. [12] Entropy versus homology for certain diffeomorphisms, Topology, vol. 13, pp. 61-67, 1974. R. Bowen and P. Walters: [1] Expansive one-parameter flows, Journal of Differential Equations, vol. 12, pp. 180-193, 1972. R. V. Chacon: [1] Change of velocity in flows, Journal of Mathematics and Mechanics, vol. 16, pp. 417-431, 1966. Hsin- Chu: [1] Some results on affine transformations of compact groups, to appear. J. P. Conze: [1] Entropie des flots et des transformations affine sur les espaces homogènes, Compte Rendu, vol. 270, pp. 547-548, 19 70. E. I. Dinaberg: [1] The relation between topological entropy and metric entropy, Doklady Akademii Nauk SSSR, vol. 19 0, 19 70 (Russian), Soviet Mathematics, vol. 11, pp. 13-16, 1970 (English). [2] A connection between various entropy characterizations of dynamical systems, Izvestija Akademii Nauk SSSR, Serija Matematiceskaja, vol. 35, pp. 324-366, 1971 (Russian). R. Ellis: [1] Lectures on Topological Dynamics, Benjamin, 1969. N. A. Friedman: [1] Introduction Ergodic Theory, Van Nostrand, 1970. [2] Bernoulli shifts induce Bernoulli shifts, Advances in Mathematics, vol. 10, pp. 29-48, 1973.
188 N. A. Friedman and D. S. Ornstein: [1] An isomorphism of weak Bernoulli Transformations, Advances in Mathematics, vol. 5, pp. 365-394, 1970. [2] Entropy and the Isomorphism Problem, to be published by Springer. [3] Ergodic transformations induce mixing transformations, Advances in Mathematics, vol. 10, pp. 147-163, 1973. H. Furstenberg: [1] Strict ergodicity and transformations of the torus, American Journal of Mathematics, vol. 83, pp. 573-601, 1961. [2] Disjointness in ergodic theory, Mathematical Systems Theory, vol. 1, pp. 1-50, 1967. T. N. T. Goodman: [1] Relating topological entropy and measure entropy, Bulletin of the London Mathematical Society, vol. 3, pp. 176-180. 19 71. [2] Maximal measures for expansive homeomorphisms, Journal of the London Mathematical Society, vol. 5, pp. 439-444, 1972. L. W. Goodwyn: [1] Topological entropy bounds measure-theoretic entropy, Proceedings of the American Mathematical Society, vol. 23, pp. 679-688, 1969. [2] A characterization of symbolic cascades in terms of expan- siveness and topological entropy, Mathematical Systems Theory, vol. 4, pp. 157-157, 1970. [3] Some counterexamples in topological entropy, Topology, vol. 11, pp. 377-385, 1972. [4] Comparing topological entropy with measure theoretic entropy, American Journal of Mathematics, vol. 74, pp. 366- 388, 1972. W. H. Gottschalk: [1] Bibliography for Dynamical Topology, fourth edition, Wesleyan University, 1969. W. H. Gottschalk and G. A. Hedlund: [1] Topological Dynamics, American Mathematical Society Colloquium Publications, 1955.
189 C. Grillenberger: [1] Constructions of strictly ergodic systems I, II, Zeit- schrift fur Wahrscheinlichkeitstheorie, vol. 25, pp. 323- 334, 1973. F. Hahn: [1] On affine transformations of compact abelian groups, American Journal of Mathematics, vol. 85, pp. 42 8-446, 19 63. F. Hahn and Y. Katznelson: [1] On the entropy of uniquely ergodic transformations, Transactions of the American Mathematical Society, vol. 126, pp. 335-360, 1967. F. Hahn and W. Parry: [1] Minimal dynamical systems with quasi-discrete spectrum, Journal of the London Mathematical Society, vol. 40, pp. 309-323, 1965. [2] Some characteristic properties of dynamical systems with quasi-discrete spectrum, Mathematical Systems Theory, vol. 2, pp. 179-190, 1968. P. R. Halmos: [1] Measure Theory, Van Nostrand, 1950. [2] Lectures on Ergodic Theory, Chelsea, 1953. [3] Introduction to Hilbert Space and the Theory of Spectral Multiplicity, Chelsea, 1957. [4] On automorphisms of compact groups, Bulletin o'f the American Mathematical Society, vol. 49, pp. 619-624, 1943. P. R. Halmos and J. Von Neumann: [1] Operator methods in classical mechanics, II, Annals of Mathematics, vol. 43, pp. 235-247, 1942. H. Hoare and W. Parry: [1] Affine transformations with quasi-discrete spectrum, I, Journal of the London Mathematical Society, vol. 41, pp. 88-96, 1966.
190 [2] Affine transformations with quasi-discrete spectrum, II, Journal of the London Mathematical Society, vol. 41, pp. 529-530, 1966. [3] Semi-groups of affine transformations, Quarterly Journal of Mathematics, Oxford, vol. 17, pp. 106-111, 1966. E. Hopf: Cl] Ergodentheorie, Chelsea, 1937. K. Jacobs: [1] Neue Méthode und Ergebnisse der Ergodentheorie, Springer, 1960. [2] Lecture Notes on Ergodic Theory, Aarhus University, 1962-3. R. Jewett: Cl] The prevalence of uniquely ergodic systems, Journal of Mathematics and Mechanics, pp. 717-729, 19 70. S. A. Juzvinskii: [1] Calculation of the entropy of a group endomorphism, Sibirskii Matematiceskii Zurnal, vol. 8, pp. 230-239, 1967 (Russian); Siberian Mathematical Journal, vol. 8, pp. 172- 178, 1967 (English). [2] Metric properties of endomorphisms of compact groups, Izvestija Akademii Nauk SSSR, Serija Matemasiceskaja, vol. 29, pp. 1295-1328, 1965 (Russian); American Mathematical Society Translations, Series 2, vol. 66, pp. 63-98, 1968 (English). S. Kakutani: [1] Induced measure preserving transformations, Proceedings of the Imperial Academy of Tokyo, vol. 19, pp. 635-641, 1943. [2] Examples of ergodic measure preserving transformations which are weakly mixing but not strongly mixing, Recent Advances in Topological Dynamics, Springer Lecture Notes, no. 318, pp. 143-149, 1973. Y. Katznelson: [1] Ergodic automorphisms of Tn are Bernoulli shifts, Israel Journal of Mathematics, vol. 10, pp. 186-195, 1971.
191 Y. Katznelson and B. Weiss: [lj Commuting measure-preserving transformations, Israel Journal of Mathematics, vol. 12, pp. 161-173, 1972. t2] Ergodic automorphisms of the solenoid are Bernoulli, unpublished. J. L. Kelley: Cl] General Topology, Van Nostrand, 1955. H. B. Keynes and J. B. Robertson: Cl] Generators for topological entropy and expansiveness, Mathematical Systems Theory, vol. 3, pp. 51-59, 1969. A. I. Khinchin: [1] Mathematical Foundations of Statistical Mechanics, Dover, 1949. A. N. Kolmogorov: [1] A new metric invariant of transient dynamical systems and automorphisms of Lebesgue spaces, Doklady Akademii Nauk SSSR, vol. 119, pp. 861-864, 1958. [2] On the entropy per time unit as a metric invariant of automorphisms, Doklady Akademii Nauk SSSR, vol. 124-, pp. 754-755, 1959. W. Krieger: [1] On entropy and generators of measure-preserving transformations, Transactions of the American Mathematical Society, vol. 149, pp. 453-464, 1970. [2] On unique ergodicity, Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability, 1970. N. Kryloff and N. Bogoliouboff: [1] La théorie générale de la mesure dans son application à l'étude des systèmes dynamique de la mécanique non linéaire, Annals of Mathematics, vol. 38, pp. 65-113, 1937. A. G. Kushnirenko: [1] Metric invariants of entropy type, Uspehi Matematiceskih Nauk, vol. 22, no. 5, pp. 57-65, 1967 (Russian), Russian Mathematical Surveys, vol. 22, no. 5, pp. 53-61, 19 67 (English).
192 D. A. Lind: [1] Ergodic automorphisms of the infinite torus are Bernoulli, to appear. G. W. Mackey: [1] Ergodic Theory and its significance for statistical mechanics and probability theory, Advances in Mathematics, vol. 12, pp. 178-268, 1974. A. Manning: [1] Axiom A diffeomorphisms have rational zeta functions, Bulletin of the London Mathematical Society, vol. 3, pp. 215-220, 1971. L. Markus: [1] Lectures in Differentiable Dynamics, American Mathematical Society Regional Conference Series, no. 3, 1971. R. McCabe and P. C. Shields: [1] A class of Markov shifts which are Bernoulli shifts, Advances in Mathematics, vol. 6, pp. 323-328, 1971. L. D. Meshalkin: [1] A case of isomorphism of Bernoulli schemes, Doklady Akademii Nauk SSSR, vol. 128, pp. 41-44, 1959. V. Nemytskii and V. Stepanov: [1] Qualitative Theory of Differential Equations, Princeton, 1960. J. von Neumann: [1] Proof of the quasi-ergodic hypothesis, Proceedings of the National Academy of Science USA, vol. 18, pp. 263-266, 1932. [2] Zur Operatorenmethode in der klassichen Mechanik, Annals of Mathematics, vol. 33, pp. 587-642, 1932. D. Newton: [1] On sequence entropy I, II, Mathematical Systems Theory, vol. 4, pp. 119-128, 1970.
193 Z. Nitecki: [1] Differentiate Dynamics, M.I.T. Press, 1971. D. S. Ornstein: [1] Bernoulli shifts with the same entropy are isomorphic, Advances in Mathematics, vol. 4, pp. 337-352, 1970. [2] Two Bernoulli shifts with infinite entropy are isomorphic, Advances in Mathematics, vol. 5, pp. 339-348, 19 70. [3] Factors of Bernoulli, shifts are Bernoulli shifts, Advances in Mathematics, vol. 5, pp. 349-364, 1970. [4] Imbedding Bernoulli shifts in flows, contributions to ergodic theory and probability, Lecture Notes in Mathematics, Springer Berlin, pp. 178-218, 1970. [5] Some new results in the Kolmogorov-Sinai theory of entropy and ergodic theory, Bulletin of the American Mathematical Society, vol. 77, pp. 878-890, 1971. [6] An example of a Kolmogorov automorphism that is not a Bernoulli shift, Advances in Mathematics, vol. 10, pp. 49- 62, 1973. [7] The isomorphism theorem for Bernoulli flows, Advances in Mathematics, vol. 10, pp. 124-142, 1973. [8] A K-automorphism with no square root and Pinsker's conjecture, Advances in Mathematics, vol. 10, pp. 89-102, 1973. [9] A mixing transformation for which Pinsker's conjecture fails, Advances in Mathematics, vol. 10, pp. 103-123, 1973. D. S. Ornstein and P. C. Shields: [1] An uncountable family of K-automorphisms, Advances in Mathematics, vol. 10, pp. 63-88, 1973. [2] Mixing Markov shifts of kernel type are Bernoulli, Advances in Mathematics, vol. 10, pp. 143-146, 1973. D. S. Ornstein and B. Weiss: [1] Geodesic flows are Bernoullian, Israel Journal of Mathematics, vol. 14, pp. 184-197, 1973. J. C. Oxtoby: [1] Ergodic sets, Bulletin of the American Mathematical Society, vol. 58, pp. 116-136, 1952.
194 W. Parry. [1] Symbolic dynamics and transformations of the unit interval, Transactions of the American Mathematical Society, vol. 122, pp. 368-378, 1966. [2] Compact abelian group extensions of discrete dynamical systems, Z. Wahrscheinlichkeitstheorie, vol. 13, pp. 9 5- 113, 1969. [3] Entropy and Generators in Ergodic Theory, Benjamin, 1969. [4] Ergodic properties of affine transformations and flows on nilmanifolds, American Journal of Mathematics, vol. 91, pp. 757-771, 1969. [5] Spectral analysis of G-extensions of dynamical systems, Topology, vol. 9, pp. 217-224, 1970. [6] Dynamical systems on nilmanifolds, Bulletin of the London Mathematical Society, vol. 2, pp. 3 7-40, 19 70. M. S. Pinsker: [1] Dynamical systems with completely positive and zero entropy, Doklady Akad. Nauk SSSR, vol. 133, pp. 1025-1026, 1960 (Russian), Soviet Mathematics Doklady, vol. 1, pp. 937-938, 1960 (English). L. S. Pontrjagin: [1] Topological Groups, Gordon and Breach, 19 66. W. Reddy: [1] The existence of expansive homeomorphisms on manifolds, Duke Mathematical Journal, vol. 32, pp. 627-632, 1965. V. A. Rohlin: [1] A general measure-preserving transformation is not mixing, Doklady Akademii Nauk, vol. 60, pp. 349-351, 1948. [2] Generators in ergodic theory, Vestnik Leningradskogo Universiteta, vol. 18, no. 1, pp. 26-32, 1963. [3] Selected topics in the metric theory of dynamical systems, American Mathematical Society Translations, Series 2, vol. 49, pp. 171-240, 1966. [4] Metric properties of endomorphisms of compact commutative groups, Izvestija Akademii Nauk, Serija Matematiceskaja, vol. 28, pp. 867-874, 1964 (Russian); American Mathematical Society Translations, Series 2, vol. 64, pp. 244-252, 1967 (English).
195 [5] Lectures on the entropy theory of transformations with invariant measure, Uspehi Matematiceskih Nauk, vol. 22, no. 5, pp. 3-56, 1967 (Russian); Russian Mathematical Surveys, vol. 22, no. 5, pp. 1-52, 1967 (English). [6] Metric properties of endomorphisms of compact commutative groups, Izvestija Akademii Nauk, Serija Matematiceskaja, vol. 13, pp. 329-340, 1949 (Russian). A. Rohlin and Ja. G. Sinai: [1] Construction and properties of invariant measurable partitions, Doklady Akademii Nauk SSSR, vol. 141, pp. 10 38-1041, 1961 (Russian); Soviet Mathematics, vol. 2, pp. 1611-1614, 1961 (English). Ruelle: [1] Statistical mechanics on a compact set with Z action satisfying expansiveness and specification, Transactions ■of the American Mathematical Society, vol. 185, pp. 2 3 7- 252, 1973. C. Shields: [1] Bernoulli shifts are determined by their factor algebras, Proceedings of the American Mathematical Society, vol. 41, pp. 331-332, 1973. [2] The Theory of Bernoulli Shifts, University of Chigago Lecture Notes, 19 74. P. Simon : [1] Instability in Diffr(T3) and the nongenericity of rational zeta functions, Transactions of the American Mathematical Society, vol. 174, pp. 217-242, 1972. G. Sinai: [1] On the concept of entropy of a dynamical system, Doklady Akademii Nauk SSSR, vol. 124, pp. 768-771, 1959. [2] On flows with finite entropy, Doklady Akademii Nauk SSSR, vol. 125, pp. 1200-1202, 1959. [3] On a weak isomorphism of transformations with an invariant, Doklady Akademii Nauk SSSR, vol. 147, pp. 797-800, 1962 (Russian), Soviet Mathematics Doklady, vol. 3, pp. 1725- 1729, 1962 (English). [4] On a weak isomorphism of transformations with an invariant measure, Matematiceskii Sbornik, vol. 63, pp. 23-42, 19 6 4 (Russian).
196 [5] Classical dynamical systems with countably-multiple Lebesgue spectrum, II, Izvestija Akademii Nauk SSSR, Serija Matematiceskaja, vol. 30, pp. 15-68, 1966 (Russian); American Mathematical Society Translations, Series 2, vol. 68, pp. 34-88, 1968 (English). [6] Construction of Markov partitioning, Funkcional'nyi Analiz i ego Prilozenija, vol. 2, pp. 70-80, 1968. [7] Markov partitions and u-diffeomorphisms, Functional Analysis and Applications, vol. 2, pp. 61-82, 1968 (English); Funkcional'nyi Analiz i ego Prilozenija, vol. 2, pp. 64- 89, 1968 (Russian). S. Smale: [1] Differentiable dynamical systems, Bulletin of the American Mathematical Society, vol. 73, pp. 747-817, 1967. M. Smordinsky: [1] On Ornstein's isomorphism theorem for Bernoulli shifts, Advances in Mathematics, vol. 10, pp. 1-9, 1973. [2] Ergodic Theory, Entropy, Springer Lecture Notes, no. 214, 1971. B. Weiss: [1] The isomorphism problem in ergodic theory, Bulletin of the American Mathematical Society, vol. 78, pp. 668-684, 1972.
Index affine transformation 18 aperiodic 89 Bernoulli shift 18, 98 character theory 10 complete invariant 96 completely positive entropy 104- conditional entropy 7 6 conjugate transformations 53 continuous spectrum 4-6 direct product of probability spaces 5 direct product of transformations 44 discrete spectrum, measure theoretic 64 topological 12 5 distal 12 0 eigenfunction, measure theoretic 45, 63 topological 12 3 eigenvalue, measure theoretic 45 topological 123 endomorphisms of tori 12 entropy 72 entropy of affine transformations of tori 168 of Bernoulli shifts 95 of rotations 93 ergodicity 21 ergodic theorem, Birkoff 29 maximal 32 LP 36 exact endomorphism 111 expansive homeomorphism 171 generator, measure-theoretic 88 topological 168 Haar measure 9 Hahn-Kolmogorov Extension Theorem 4 Hilbert spaces 8 induced operator on Lp 3 2 invariant 5 7 invariant measures for homeomorphisms 128 isomorphism 5 2 Kolmogorov automorphism 101 Kolmogorov - Sinai Theorem 87 Krylov and Bogolioubov Theorem 132
198 Lebesgue Covering Lemma Lebesgue spectrum Markov chain measure algebra measure preserving transformation measure theory minimal homeomorphism minimal set mixing nilmanifold non-invertible transformation normal number orbit partitions periodic point Pinsker algebra Poincafe Recurrence Theorem pure point spectrum recurrence refinement of open covers Riesz Representation Theorem rotations on groups semi-simple homeomorphism separated set sequence entropy a-algebra spanning set spectral isomorphism spectral theorem sub-a-algebras topological conjugacy topological entropy connection with measure theoretic entropy topological entropy of affine transformations of tori of Bernoulli shifts of homeomorphisms of the circle topological transitivity uniformly equivalent metrics uniquely ergodic 151 60 105 53 16 3 113 lit 37 106 110 31 113 70 115 107 20 64 20 140 130 67 115 14-6 108 3 146 54 46 70 122 143, 147 155 168 182 182 117 150 135
Vol. 277: Séminaire Banach. Edité par C. Houzel. VII, 229 pages. 1972. DM22,- Vol. 278: H. Jacquet, Automorphic Forms on GL(2) Part II. XIII, 142 pages. 1972. DM 18,- Vol. 279: R. Bott, S. Gitler and I. M. James. Lectures on Algebraic and Differential Topology. V, 174 pages. 1972. DM 20,- Vol. 280: Conference on the Theory of Ordinary and Partial Differential Equations. Edited by W. N. Ever* and B. D. Sleeman. XV, 367 pages. 1972. DM 29,- Vol. 281 : Coherence in Categories. Edited by S. Mac Lane. VII, 235 pages. 1972. DM 22,- Vol. 282: W. Klmgenberg und P. Flaschel, Riemannsche Hilbertman- mgfaltigkeiten. Periodische Geodatische. VII, 211 Seiten. 1972. DM 22,- Vol. 283; L. Illusie, Complexe Cotangent et Déformations II. VII, 304 pages. 1972. DM 27,- Vol. 284: P. A. Meyer, Martingales and Stochastic Integrals I. VI, 89 pages. 1972. DM 18,- Vol. 285: P. de la Harpe, Classical Banach-Lie Algebras and Banach- Lie Groups of Operators in Hilbert Space. Ml, 160 pages. 1972. DM 18,- Vol. 286: S. Murakami, On Automorphisms of Siegel Domains. V, 95 pages. 1972. DM 18,- Vol. 287: Hyperfunctions and Pseudo-Differential Equations. Edited by H. Komatsu. VII, 529 pages. 1973. DM 40,- Vol. 288: Groupes de Monodromie en Géométrie Algébrique. (SGA 7 I). Dirigé par A. Grothendieck. IX, 523 pages. 1972. DM 55,- Vol. 289: B. Fuglede, Finely Harmonic Functions. Ill, 188. 1972. DM 20,- Vol. 290: D. B. Zagier, Equivanant Pontr|agin Classes and Applications to Orbit Spaces. IX, 130 pages. 1972. DM 18,- Vol.291: P.Orlik,SeifertManifolds.VIII,i55pages.1972.DM 18,- Vol. 292: W. D. Wallis, A. P. Street and J. S. Wallis, Combinatorics: Room Squares, Sum-Free Sets, Hadamard Matrices. V, 508 pages 1972. DM 55,- Vol. 293: R. A. DeVore, The Approximation of Continuous Functions by Positive Linear Operators. VIII, 289 paqes. 1972. DM 27,- Vol. 294: Stability of Stochastic Dynamical Systems. Edited by R. F. Curtain. IX, 332 pages. 1972. DM 29,- Vol. 295: C. Dellachene, Ensembles Analytiques Capacités. Mesures de Hausdorff. XII, 123 pages. 1972. DM 18,- Vol. 296: Probability and Information Theory II. Edited by M. Behara, K. Krickeberg and J. Wolfowitz. V, 223 pages. 1 973. DM 22,- Vol. 297: J. Garnett, Analytic Capacity and Measure. IV, 138 pages. 1972. DM 18,- Vol. 298: Proceedings of the Second Conference on CompactTrans- formation Groups. Part 1. XIII, 453 pages. 197 2. DM 35,- Vol. 299: Proceedings of the Second Conference on Compact Transformation Groups. Part 2. XIV, 327 pages. 1972. DM 29,- Vol. 300: P. Eymard, Moyennes Invariantes et Représentations Unitaires. 11,113 pages. 1972.DM 18,- Vol. 301: F. Pittnauer, Vorlesungen ûber asymptotische Reihen. VI, 186Se.ten.1972.DM18,- Vol. 302: M. Demazure, Lectures on p-Divisible Groups. V, 98 pages. 1972. DM 18,- Vol. 303: Graph Theory and Applications. Edited by Y. Alavi, D. R. Lick and A. T. White. IX, 329 pages. 1972. DM 26,- Vol. 304: A. K. Bousfield and D. M. Kan, Homotopy Limits, Completions and Localizations. V, 348 pages. 1972. DM 29,- Vol. 305: Théorie des Topos et Cohomologie Etale des Schémas. Tome 3. (SGA 4). Dirigé par M. Artin, A Grothendieck et J. L Verdier. VI, 640 pages. 1973. DM 55,- Vol. 306: H. Luckhardt, Extensional Godel Functional Interpretation. VI, 161 pages. 1973. DM 20,- Vol. 307: J. L. Bretagnolle, S. D. Charterji et P.-A. Meyer, Ecole d'été de Probabilités: Processus Stochastiques. VI, 198 pages. 1973. DM22,- Vol. 308: D. Knutson, A-Rings and the Representation Theory of the Symmetric Group. IV, 203 pages. 1973. DM 22,- Vol. 309: D. H. Sattinger, Topics in stability and Bifurcation Theory. VI, 190 pages. 1973. DM 20,- Vol. 310: B. Iversen, Generic Local structure of the Morphisms in Commutative Algebra. IV, 108 pages. 1 973. DM 18,- Vol. 311: Conference on Commutative Algebra. Edited by J. W. Brewer and E A. Rutter. VII, 251 pages. 1973. DM 24,- Vol. 312: Symposium on Ordinary Differential Equations. Edited by W. A. Harris, Jr. and Y. Sibuya. VIII, 204 pages. 1973. DM 22,- Vol. 313: K. Jôrgens and J. Weidmann, Spectral Properties of Hamil- tonian Operators. Ill, 140 pages. 1973. DM 18,- Vol. 314: M. Deuring, Lectures on the Theory of Algebraic Functions of One Variable. VI, 151 pages.1973.DM 18,- Vol. 315: K. Bichteler, Integration Theory (with Special Attention to Vector Measures). VI, 357 pages. 1973. DM 29,- Vol. 316: Symposium on Non-Well-Posed Problems and Logarithmic Convexity. Edited by R. J. Knops. V, 176 pages. 1973. DM 20,- Vol. 31 7: Séminaire Bourbaki - vol. 1971 /72. Exposés 400-41 7. IV, 361 pages. 1973. DM 29,- Vol. 318: Recent Advances in Topological Dynamics. Edited by A. Beck VIII, 285 pages. 1973. DM 27,- Vol. 319: Conference on Group Theory. Edited by R. W. Gatterdam and K. W. Weston. V. 188 pages. 1973. DM 20,- Vol. 320: Modular Functions of On» Variable I. Edited by W Kuyk. V, 195 pages. 1973. DM 20,- Vol. 321 : Séminaire de Probabilités VII. Edité par P. A. Meyer. VI, 322 pages. 1973. DM 29,- Vol. 322: Nonlinear Problems in the Physical Sciences and Biology. Edited by I. Stakgold, D. D. Joseph and D. H. Sattinger. VIII, 357 pages. 1973. DM 29,- Vol. 323: J. L. Lions. Perturbations Singulières dans les Problèmes aux Limites et en Contrôle Optimal. XII, 645 pages. 1973. DM 46,- Vol. 324: K. Kreith. Oscillation Theory. VI, 109 pages. 1973. DM 18,- Vol. 325: C-C. Chou, La Transformation de Fourier Complexe et L'Equation de Convolution. IX, 137 pages. 1973. DM 18,- Vol. 326. A. Robert, Elliptic Curves. VIII, 264 pages. 1973. DM 24,- Vol. 327: F_ Matlis. One-Dimensional Cohen-Macaulay Rings. XII, 157 pages. 1973. DM20,- Vol. 328. J. R. Buchi and D. Siefkes, The Monadic Second Order Theory of All Countable Ordinals. VI, 217 pages. 1973. DM 22,- Vol. 329: W Trebels, Multipliers for (C, «(-Bounded Fourier Expansions in Banach Spaces and Approximation Theory. VII, 103 pages. 1973. DM 18,- Vol. 330 Proceedings of the Second Japan-USSR Symposium on Probability Theory Edited by G. Maruyama and Yu. V. Prokhorov. VI, 550 pages. 1973.DM 40,- Vol. 331. Summer School on Topological Vector Spaces. Edited by L. Waelbroeck. VI, 226 pages. 1973. DM 22,- Vol. 332: Séminaire Pierre Lelong (Analyse) Année 1971-1972. V, 131 pages. 1973. DM 18.- Vol. 333: Numensche, insbesondereapproximationstheoretischeBe- handlung von Funktionalgleichungen. Herausgegeben von R. An- sorge und W. Tornig. VI, 296 Seiten. 1973. DM 27,- Vol. 334: F. Schweiger, The Metrical Theory of Jacobi-Perron Algorithm. V, HI pages. 1973.DM 18,- Vol. 335: H. Huck, R. Roitzsch, U. Simon, W. Vortisch, R. Walden, B. Wegner und W. Wendland, Beweismethoden der Differentialgeome- tne im GroSen. IX, 159 Seiten. 1973. DM 20,- Vol. 336: L'Analyse Harmonique dans le Domaine Complexe. Edité par E J. Akutowicz. VIII, 169 pages. 1973. DM 20,- Vol. 337: Cambridge Summer School in Mathematical Logic. Edited by A. R. D. Mathias and H. Rogers. IX, 660 pages. 1973. DM 46,- Vol: 338 J. Lindenstrauss and L Tzafnn, Classical Banach Spaces IX, 243 pages. 1973 DM 24,- Vol. 339: G. Kempf, F. Knudsen, D. Mumfordand B. Samt-Donat, Toroidal Embeddings I. VIII, 209 pages. 1973. DM 22,- Vol. 340: Groupes de Monodromie en Géométrie Algébrique. (SGA 7 II). Par P. Deligne et N. Katz. X, 438 pages. 1973. DM 44,- Vol. 341; Algebraic K-Theory I,'Higher «-Theories. Edited by H. Bass. XV, 335 pages. 1973. DM 29,-
Vol. 342: Algebraic K-Theory II, Classical" Algebraic K-Theory, and Connections with Arithmetic. Edited by H. Bass. XV 527 pages. 1973. DM40,- Vol. 343: Algebraic K-Theory III, Hermitian K-Theory and Geometric Applications. Edited by H. Bass. XV, 572 pages. 1973. DM 40,- Vol. 344: A. S. Troelstra(Editor), Metamathematical investigation of Intmtionistic Arithmetic and Analysis. XVII, 485 pages. 1973. DM 38,- Vol. 345: Proceedings of a Conference on Operator Theory. Edited by P. A. Fillmore. VI, 228 pages. I973. DM 22,- Vol. 346: Fucik et al., Spectral Analysis of Nonlinear Operators. II, 287 pages. 1973. DM 26,- Vol. 347: J. M. Boardman and R. M. Vogt, Homotopy Invariant Algebraic Structures on Topological Spaces. X, 257 pages. I973. DM24,- Vol. 348: A. M. Mathai and R. K. Saxena, Generalized Hyper- geometric Functions with Applications in Statistics and Physical Sciences. VII, 314 pages. I973. DM 26,- Vol. 349 : Modular Functions of One Variable II. Edited by W. Kuyk and P. Deligne. V, 598 pages. 1973. DM 38,- Vol. 350: Modular Functions of One Variable III. Edited by W. Kuyk and J.P. Serre. V, 350 pages. 1973. DM 26,- Vol. 351 : H. Tachikawa, Quasi-Frobenius Rings and Generalizations. XI, 172 pages. 1973. DM 20,- Vol. 352: J. D. Fay, Theta Functions on Riemann Surfaces. V, 137 pages. 1973. DM 18,- Voi. 353: Proceedings of the Conference, on Orders, Group Rings and Related Topics. Organized by J. S. Hsia, M. L. Madan and T. G. Ralley. X, 224 pages. 1973. DM 22,- Vol. 354: K. J. Devlin, Aspects of Constructibihty. XII, 240 pages. 1973. DM 24,- Vol. 355: M. Sion, A Theory of Semigroup Valued Measures. V, 140 pages. 1973. DM18,- Vol. 356: W. L J. van der Kallen, Infinitesimally Central-Extensions of Chevalley Groups. VII, 147 pages. 1973. DM18,- Vol. 357: W. Borho, P. Gabriel und R. Rentschler, Primideale in Einhullenden auflbsbarer Lie-Algebren. V, 182 Seiten. 1973. DM 20,- Vol. 358: F. L Williams, Tensor Products of Principal Series Representations. VI, 132 pages. 1973. DM 18,- Vol. 359 U. Stammbach, Homology in Group Theory. VIII, 183 pages. 1973. DM20,- Vol. 360: W. J. Padgett and R. L. Taylor, Laws of Large Numbers for Normed Linear Spaces and Certain Frechet Spaces. VI, 111 pages. 1973. DM I8,~ Vol. 361 : J. W. Schutz, Foundations of Special Relativity: Kinematic Axioms for Minkowski Space Time. XX, 314 pages. 1973. DM 26,- Vol. 362. Proceedings of the Conference on Numerical Solution of Ordinary Differential Equations. Edited by D. Bettis. VIII, 490 pages. 1974. DM 34,- Vol. 363. Conference on the Numerical Solution of Differential Equations. Edited by G. A. Watson. IX, 221 pages. 1974. DM 20,- Vol. 364: Proceedings on Infinite Dimensional Holomorphy. Edited by T. L. Hayden and T. J. Suffridge. VII, 212 pages. I974. DM 20,- Vol. 365: R. P. Gilbert, Constructive Methods for Elliptic Equations. VII, 397 pages. 1974. DM 26,- Vol. 366: R. Steinberg, Conjugacy Classes in Algebraic Groups (Notes by V. V. Deodhar). VI, 159 pages. 1974. DM 18,- Vol. 367: K. Langmann und W. Lutkebohmert, Cousinverteilun- gen und Fortsetzungssatze. VI, 151 Seiten. 1974. DM 16,- Vol. 368: R. J. Milgram, Unstable Homotopy from the Stable Point of View. V, 109 pages. 1974. DM 16,- Vol. 369: Victoria Symposium on Nonstandard Analysis. Edited by A. Hurd and P. Loeb. XVIII, 339 pages. 1974. DM 26,- Vol. 370: B. Mazur and W. Messing, Universal Extensions and One Dimensional Crystalline Cohomology. VII, 134 pages. 1974. DM 16,- Vol. 371 : V. Poenaru, Analyse Différentielle. V, 228 pages. 1974, DM 20- Vol. 372: Proceedings of the Second International Conference on the Theory of Groups 1973. Edited by M. F. Newman. VII, 740 pages. 1974. DM 48,- Vol. 373- A. E. R. Woodcock and T. Poston, A Geometrical Study of the Elementary Catastrophes. V, 257 pages. 1974. DM 22,- Vol. 374: S Yamamuro, Differential Calculus in Topological Linear Spaces. IV, 179 pages. 1974. DM 18,- Vol. 375- Topology Conference 1973. Edited by R. F. Dickman Jr. and P. Fletcher. X, 283 pages. 1974. DM 24,- Vol. 376- D, B. Osteyee and I. J. Good, Information, Weight of Evidence, the Singularity between Probability Measures and Signal Detection. XI, 156 pages. 1974. DM 16.- Vol. 377: A. M. Fink, Almost Periodic Differential Equations. VIII, 336 pages. 1974. DM 26,- Vol. 378: TOPO 72 - General Topology and its Applications. Proceedings 1972. Edited by R. Alo, R. W. Heath and J. Nagata. XIV, 651 pages. 1974. DM 50,- Vol. 379: A. Badnkian et S. Chevet, Mesures Cylindriques, Espaces de Wiener et Fonctions Aléatoires Gaussiennes. X, 383 pages. 1974. DM32,- Vol. 380: M. Petrich, Rings and Semigroups. VIII, 182 pages. 1974. DM 18,- Vol. 381 : Séminaire de Probabilités VIII. Edité par P. A. Meyer. IX, 354 pages. 1974. DM 32,- Vol. 382' J. H. van Lint, Combinatorial Theory Seminar Eindhoven University of Technology. VI, I31 pages. 1974. DM 18,- Vol. 383 Séminaire Bourbaki - vol. 1972/73. Exposés 418-435 IV, 334 pages. 1974. DM 30,- Vol. 384: Functional Analysis and Applications, Proceedings 1972. Edited by L. Nachbin. V, 270 pages. 1974. DM 22,- Vol. 385: J. Douglas Jr. and T. Dupont, Collocation Methods for Parabolic Equations in a Single Space Variable (Based on C- Piecewise-Polynomial Spaces). V, 147 pages. 1974. DM 16,- Vol. 386: J. Tits, Buildings of Spherical Type and Finite BN- Pairs. IX, 299 pages. 1974. DM 24,- Vol. 387. C P. Bruter, Elements de la Théorie des Matroïdes. V, 138 pages. 1974. DM 18,- Vol. 388: R. L. Lipsman, Group Representations. X, 166 pages. 1974. DM20,- Vol. 389: M.-A. Knus et M. Ojanguren, Théorie de la Descente et Algèbres d' Azumaya. IV, 163 pages. 1974. DM 20,- Vol. 390: P. A. Meyer, P. Pnouret et F. Spitzer, Ecole d'Eté de Probabilités de Saint-Flour III - 1973. Edité par A. Badnkian et P-L. Hennequin. VIII, 189 pages. 1974. DM 20,- Vol. 391. J. Gray, Formal Category Theory Adjointness for 2- Categones. XII, 282 pages. 1974. DM 24,- Vol. 392: Géométrie Différentielle, Colloque, Santiago de Compostela, Espagne 1972. Edite par E. Vidal. VI, 225 pages. 1974. DM 20,- Vol.393: G. Wassermann, Stability of Unfoldings. IX, 164 pages. 1974. DM 20,- Vol. 394: W.M.Patterson 3rd. Iterative Methods for the Solution of a Linear Operator Equation in Hilbert Space - A Survey. Ill, 183 pages. 1974. DM 20,- Vol. 395: Numensche Behandlung ntchthnearer Integrodifferen- tial- und Differentialgleichungen. Tagung 1973. Herausgegeben von R. Ansorge und W. Tôrnig. VII, 313 Seiten. 1974. DM 28,- Vol. 396 K. H. Hofmann, M Mislove and A. Stralka, The Pontry- agin Duality of Compact O-Dimensional Semilattices and its Applications. XVI, 122 pages. 1974. DM 18,- Vol. 397: T. Yamada, The Schur Subgroup of the Brauer Group. V, 159 pages. 1974. DM 18,- Vol. 398: Theories de l'Information, Actes des Rencontres de Marseille-Luminy, 1973. Edite par J. KampedeFérietetC. Picard. XII, 201 pages. 1974. DM 23.-
Vol. 399: Functional Analysis and its Applications, Proceedings 1973. Edited by H. G. Garnir, K. R. Unni and J. H. Williamson. XVII, 569 pages. 1974. DM 44,- Vol. 400: A Crash Course on Kleiman Groups - San Francisco 1974. Edited by L Bers and I. Kra. VII, 130 pages. 1974. DM 18,- Vol. 401: F. Attyah, Elliptic Operators and Compact Groups. V, 93 pages. 1974. DM 18,- Vol. 402: M. Waldschmidt, Nombres Transcendants. VIM, 277 pages. 1974. DM 25,- Vol. 403: Combinatorial Mathematics - Proceedings 1972. Edited by D. A. Holton. VIII, 148 pages. 1974. DM 18,~ Vol. 404: Théorie du Potentiel et Analyse Harmonique. Edité par J. Faraut. V, 245 pages. 1974. DM 25,- Vol. 405: K. Devlin and H. Johnsbrâten, The Souslin Problem. VIII, 132 pages. 1974. DM 18,- Vol. 406: Graphs and Combinatorics - Proceedings 1973. Edited by R. A. Bari and F. Harary. VIII, 355 pages. 1974. DM 30,- Vol. 407: P. Berthelot, Cohomologie Cristalline des Schémas de Caractéristique p > o. VIII, 598 pages. 1974. DM 44,- Vol. 408: J. Wermer, Potential Theory. VIII, 146 pages. 1974. DM 18,- Vol. 409: Fonctions de Plusieurs Variables Complexes, Séminaire François Norguet 1970-1973. XIII, 612 pages. 1974. DM 47,- Vol. 410: Séminaire Pierre Lelong (Analyse) Année 1972-1973. VI, 181 pages. 1974. DM 18,- Vol. 411: Hypergraph Seminar. Ohio State University, 1972. Edited by C. Berge and D. Ray-Chaudhun. IX, 287 pages. 1974. DM 28,- Vol. 412: Classification of Algebraic Varieties and Compact Complex Manifolds. Proceedings 1974. Edited by H. Popp. V, 333 pages. 1974. DM 30,- Vol. 413: M. Bruneau, Variation Totale d'une Fonction. XIV, 332 pages. 1974. DM 30,- Vol. 414: T. Kambayashi, M. Miyanishi and M. Takeuchi, Uni- potent Algebraic Groups. VI, 165 pages. 1974. DM 20,- Vol. 415: Ordinary and Partial Differential Equations, Proceedings of the Conference held at Dundee, 1974. XVII, 447 pages. 1974. DM 37,- Vol. 416: M. E. Taylor, Pseudo Differential Operators. IV, 155 pages. 1974. DM 18,- Vol. 417: H. H. Keller, Differential Calculus in Locally Convex Spaces. XVI, 131 pages. 1974. DM 18.- Vol. 418: Localization in Group Theory and Homotopy Theory and Related Topics Battelle Seattle 1974 Seminar. Edited by P. J. Hilton. VI, 171 pages. 1974. DM 20,- Vol. 419: Topics in Analysis - Proceedings 1970. Edited by O. E. Lehto, I. S. Louhivaara, and R. H. Nevanlinna. XIII, 391 pages. 1974. DM35,- Vol. 420: Category Seminar. Proceedings, Sydney Category Theory Seminar 1972/73. Edited by G. M. Kelly. VI, 375 pages 1974. DM32,- Vol. 421: V. Poénaru, Groupes Discrets. VI, 216 pages. 1974. DM 23,- Vol. 422: J.-M. Lemaire, Algebres Connexes et Homologte des Espaces de Lacets. XIV, 133 pages. 1974. DM 23,- Vol. 423: S. S. Abhyankar and A. M. Sathaye, Geometric Theory of Algebraic Space Curves. XIV, 302 pages. 1974. DM 28,- Vol. 424: L. Weiss and J. Wolfowitz, Maximum Probability Estimators and Related Topics. V, 106 pages. 1974. DM 18,- Vol. 425: P. R. Chernoff and J. E. Marsden, Properties of Infinite Dimensional Hamiltoman Systems. IV, 160 pages. 1974. DM 20,- Vol. 426: M. L. Silverstein, Symmetric Markov Processes. IX, 287 pages. 1974. DM 28,- Vol. 427: H. Omori, Infinite Dimensional Lie Transformation Groups. XII, 149 pages. 1974. DM 18,- Vol. 428: Algebraic and Geometrical Methods in Topology, Proceedings 1973. Edited by L. F. McAuley. XI, 280 pages. 1974. DM 28,- Vol. 429: L Cohn, Analytic Theory of the Harish-Chandra C- Function. IV, 154 pages. 1974. DM 18,- Vot. 430: Constructive and Computational Methods for Differential and Integral Equations. Symposium 1974. Edited by D. L Cotton and R. P. Gilbert VII, 476 pages. 1974. DM 37,- Vol. 431 ; Séminaire Bourbaki - vol. 1973/74. Exposés 436-452. IV, 347 pages. 1975. DM 30- Vol. 432: R. P. Pflug, Holomorphiegebiete, pseudokonvexe Ge- biete und das Levi-Problem. VI, 210 Seiten. 1975. DM 23,- Vol. 433: W. G. Faris, Self-Adjoint Operators. VII, 115 pages. 1975. DM 18,- Vol. 434: P. Brenner, V. Thomée, and L a Wahlbin, Besov Spaces and Applications to Difference Methods for Initial Value Problems. II, 154 pages. 1975. DM 18,- Vol. 435: C. F. Dunkl and D. E Ramirez, Representations of Commutative Semitopological Semigroups. VI, 181 pages. 1975. DM 20,- Vol. 436 :LAuslander and R.Tolimieri,Abetian Harmonic Analysis, Theta Functions and Function Algebras on a Nilmanifold. V, 99 pages. 1975. DM18,- Vol. 437: D. W. Masser, Elliptic Functions and Transcendence. XIV, 143 pages. 1975. DM 18,- Vol. 438: Geometric Topology, Proceedings 1974. Edited by L G Glaser and T. B. Rushing. X, 459 pages. 1975. DM 37,- Vol. 439: K. Ueno, Classification Theory of Algebraic Varieties and Compact Complex Spaces. XIX, 278 pages. 197S DM 28,- Vol. 440: R. K. Getoor, Markov Processes: Ray Processes and Right Processes. V, 118 pages. 1975. DM 18,- Vol. 441 : N. Jacobson, Pi-Algebras. An Introduction. V, 115 pages. 1975. DM 18,- Vol. 442: C. H. Wilcox, Scattering Theory for the d'Alembert Equation in Exterior Domains. Ill, 184 pages. 1975. DM 20,- Vol. 443: M. Lazard, Commutative Formal Groups. II, 236 pages. 1975. DM 23,- Vol. 444; F. van Oystaeyen, Prime Spectra in Non-Commutative Algebra. V, 128 pages. 1975. DM 18,- Vot. 446: Partial Differential Equations and Related Topics, Proceedings 1974. Edited by J. A. Goldstein. IV, 389 pages. 1975. DM 32,- Vol. 447: S. Toledo, Tableau Systems for First Order Number Theory and Certain Higher Order Theories. Ill, 33g pages. 1975. DM 30- Vol. 448: Spectral Theory and Differential Equations. Proceedings 1974. Edited by W. N. Everitt. XII, 321 pages. 1975. DM 30,- Vol. 449: Hyperfunctions and Theoretical Physics. Proceedings 1973. Edited by F. Pham. IV, 218 pages. 1975. DM 23,- Vol. 450: Algebra and Logic. Proceedings 1974. Edited by J. N. Crossley. VIII, 307 pages. 1975. DM 28,- Vol. 452: Combinatorial Mathematics III. Proceedings 1974. Edited by Anne Penfold Street and W. D. Wallis. IX, 233 pages. 1975. DM23,- Vol. 453 : Logic Colloquium. Symposium on Logic Held at Boston, 1972-73. Edited by R. Parikh. IV, 251 pages. 1975. DM 25,- Vol. 457: Fractional Calculus and Its Applications. Proceedings of the International Conference Held at the University of New Haven, June 1974. Edited by B. Ross. VI, 381 pages. 1975. DM 32,- Vol. 458: P. Walters, Ergodic Theory - Introductory Lectures. VI, 198 pages. 1975. DM23,-
This series aims to report new developments in mathematical research and teaching - quickly, informally and at a high level. The type of material considered for publication includes: 1. Preliminary drafts of original papers and monographs 2. Lectures on a new field, or presenting a new angle on a classical field 3. Seminar work-outs 4. Reports of meetings, provided they are a) of exceptional interest and b) devoted to a single topic. Texts which are out of print but still in demand may also be considered if they fall within these categories. The timeliness of a manuscript is more important than its form, which may be unfinished or tentative. Thus, in some instances, proofs may be merely outlined and results presented which have been or will later be published elsewhere. If possible, a subject index should be included. Publication of Lecture Notes is intended as a service to the international mathematical community, in that a commercial publisher, Springer-Ver- lag, can offer a wider distribution to documents which would otherwise have a restricted readership. Once published and copyrighted, they can be documented in the scientific literature. Manuscripts Manuscripts should comprise not less than 100 pages. They are reproduced by a photographic process and therefore must be typed with extreme care. Symbols not on the typewriter should be inserted by hand in indelible black ink. Corrections to the typescript should be made by pasting the amended text over the old one, or by obliterating errors with white correcting fluid. Authors receive 75 free copies and are free to use the material in other publications. The typescript is reduced slightly in size during reproduction; best results will not be obtained unless the text on any one page is kept within the overall limit of 18x26.5 cm (7x10'/2 inches). The publishers will be pleased to supply on request special stationery with the typing area outlined. Manuscripts in English, German or French should be sent to Prof. A. Dold, Mathematisches Institut der Universitât Heidelberg, 69 Heidelberg/Germany, TiergartenstraBe, Prof. B. Eckmann, Eidgenôssische Technische Hochschule, CH-8006 Zurich/Switzerland, or directly to Springer-Verlag Heidelberg. Springer-Verlag, D-1000 Berlin 33, Heidelberger Platz 3 Springer-Verlag, D-6900 Heidelberg 1, Neuenheimer LandstraBe 28-30 Springer-Verlag, 175 Fifth Avenue, New York, NY 10010/USA ISBN 3-540-07163-6 ISBN 0-387-07163-6