Text
                    LONDON MATHEMATICAL SOCIETY
MONOGRAPHS NEW SERIES
Series Editors
P. M. Cohn H. G. Dales

LONDON MATHEMATICAL SOCIETY MONOGRAPHS NEW SERIES Previous volumes of the LMS Monographs were published by Academic Press, to whom all enquiries should be addressed. Volumes in the New Series will be published by Oxford University Press throughout the world. NEW SERIES 1. Diophantine inequalities R. C. Baker 2. The Schur multiplier Gregory Karpilovsky 3. Existentially closed groups Graham Higman and Elizabeth Scott 4. The asymptotic solution of linear differential systems M. S. P. Eastham 5. The restricted Burnside problem Michael Vaughan-Lee 6. Pluripotential theory Maciej Klimek 7. Free Lie algebras Christophe Reutenauer
Free Lie Algebras Christophe Reutenauer Univer site du Quebec d Montreal CLARENDON PRESS OXFORD 1993
Oxford University Press, Walton Street, Oxford 0X2 6DP Oxford New York Toronto Delhi Bombay Calcutta Madras Karachi Kuala Lumpur Singapore Hong Kong Tokyo Nairobi Dar es Salaam Cape Town Melbourne Auckland Madrid and associated companies in Berlin Ibadan Oxford is a trade mark of Oxford University Press Published in the United States by Oxford University Press Inc., New York © Christophe Reutenauer, 1993 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, without the prior permission in writing of Oxford University Press. Within the UK, exceptions are allowed in respect of any fair dealing for the purpose of research or private study, or criticism or review, as permitted under the Copyright, Designs and Patents Act, 1988, or in the case of reprographic reproduction in accordance with the terms of licences issued by the Copyright Licensing Agency. Enquiries concerning reproduction outside those terms and in other countries should be sent to the Rights Department, Oxford University Press, at the address above. A catalogue record for this book is available from the British Library Library of Congress Cataloging in Publication Data Reutenauer, Christophe. Free lie algebras/Christophe Reutenauer. (London Mathematical Society monographs, new series, no. 7) Includes bibliographical references and index. 1. Lie algebras. I. Title. IL Series. QA252.3.R48 1993 512'.55-dc20 92-27318 ISBN 0-19-853679-8 Typeset by Integral Typesetting, Great Yarmouth, Norfolk Printed in Great Britain by St Edmundsbury Press, Bury St Edmunds, Suffolk
Ce livre est dedie d Arthur, Victor, Emile, Eva

Preface Lie polynomials first appeared at the turn of the century in the work of Campbell, Baker, and Hausdorff on exponential mapping in a Lie group, which led to a result known as the Campbell-Baker-Hausdorff formula. About thirty years later, Witt showed that the Lie algebra of Lie polynomials is actually the free Lie algebra, and that its enveloping algebra is the (associative) algebra of noncommutative polynomials; he answered a ques- tion of Magnus—who had himself arrived at the solution—on the lower central series of the free group. Some years earlier, in 1933, P. Hall had begun commutator calculus in the free group, which led M. Hall to construct his basis of the free Lie algebra; the link between the latter and the free group is given by the work of Witt and the Magnus transformation. In 1942 and 1944, Thrall and Brandt studied the free Lie algebra from the point of view of representation theory of the linear group, and Brandt computed the character—a formula closely related to the Witt formula. At the end of the forties, Dynkin, Specht, and Wever simultaneously discovered the characterization of Lie polynomials through the ‘left to right bracketing mapping’. Some years later, Friedrichs gave his characterization for Lie polynomials; his criterion fascinated many mathematicians, who actually proved it. After that, the subject was studied by many people, often independently. Recently, it has had a new impulse, from the point of view of representation theory of the symmetric group. As far as we know, no book exists that exclusiyely treats free Lie algebras. Bahturin, in his recent book on varieties of Lie algebras, devotes two chapters to free Lie algebras; Bourbaki, Jacobson, and others, give an introduction to the subject. It seems to us that the theory has become so extensive with existing results so widely scattered, to justify the publication of a book on the subject. The book is partly written in the spirit of Lothaire’s Combinatorics on words, with emphasis on the algebraic point of view; it can be considered as a series of variations on Lyndon words; the presentation of the latter is rather indirect, so the interested reader could begin by reading the corresponding section by Lothaire. In Chapter 0, we give without proof the Poincare Birkhoff Witt theorem,
viii Preface which enables us to prove that the Lie algebra of Lie polynomials is the free Lie algebra; this necessitates a basis construction (in cases where the ring of scalars is not a field), which is done through Lazard elimination. The impatient reader may proceed directly to Chapter 1, where things really begin. We give several characterizations of Lie polynomials, introduce the shuffle product and present Hopf algebra-like properties of free associative algebras. Chapter 2 is devoted mainly to two results: subalgebras of free Lie algebras are free; automorphisms of free Lie algebras are products of elementary automorphisms. The related problem of characterizing free sets of Lie polynomials is also treated. In Chapter 3 we characterize exponentials of Lie series, and give several results on the Hausdorff series, after having connected it to the canonical projections of the free associative algebra. The study of Hall bases begins in Chapter 4: we construct the Hall basis of the free Lie algebra, and the corresponding Poincare-Birk hoff-Witt basis of the free associative algebra. We show also that this basis construction is identical to the one arising from Lazard elimination. Chapter 5 gives some applications of Hall sets: the Lyndon basis, which is a particular case of a Hall basis; the calculation of the dual basis in the shuffle algebra; the construction of a Hall basis compatible with the derived series of the free Lie algebra; and the order on the free monoid associated with a Hall set and the associated codes. In Chapter 6 we give some properties of the shuffle algebra: it is freely generated by Lyndon words, and has a remarkable presentation. Related to shuffles is the concept of subword. This leads to subword functions, a generalization of binomial coefficients, and the Magnus transformation of the free group. Commutator calculus is presented, and connected with the Hall basis and the algebra of subword functions. Chapter 7 studies circular words: after giving the formulas enumerating them, we relate them to Hall sets. Two algorithms on Lyndon words are presented, and we give a natural bijection between words on an ordered alphabet and multisets of primitive necklaces. The Lie representation of the symmetric group (or the linear group) is considered in Chapter 8. Its character and the multiplicities of the irreducible representations are given. Almost all of them occur. Remarkable Lie elements, the Lie idempotents, are studied in the symmetric group algebra. Representations on the components of the canonical decomposition of the free associative algebra are also studied. Chapter 9 shows the close connection between the free Lie algebra and the Solomon descent algebra of the symmetric group. The primitive idempotents of the latter represent the canonical projections, and the dimension of the corresponding quasi-ideals has an interpretation in terms of necklaces. The action on Lie monomials of elements of the descent algebra characterizes this
Preface ix algebra, which has as a natural homomorphic image the ring of symmetric functions, and is itself dual to the ring of quasisymmetric functions. Each chapter ends with an appendix: each subsection can be thought of as an exercise, with hints or references, and gives some information on related subjects; sometimes it is simply a review of related work. Montreal C.R. July 1992
Acknowledgements I first discovered the subject of this book in Gerard Lallement’s book on semigroups and in Dominique Perrin’s chapter on factorization of free monoids in Lothaire’s book; thanks to Adriano Garsia’s Combinatorics of the free Lie algebra and the symmetric group, this subject was given a new impulse, in the direction of algebraic combinatorics. While writing this book, I had innumerable discussions with Marco Schutzenberger, who is for me the initiator of the subject, and gave me useful advice as well as some unpublished results. During this same time, Guy Melancon wrote his Ph.D., and was of considerable help, by discussions, reading and correcting the successive versions of the manuscript. Paul Cohn kindly accepted this book in the London Mathematical Society Monographs series and carefully read the whole manuscript; he also communicated some unpublished results of his Ph.D. thesis. I also thank, for many discussions and correspondence, Sheila Sundaram, Andre Joyal, Pierre Leroux, Xavier Viennot, Hartmut Laue, Pierre Bouchard, Francois Bergeron, and Nantel Bergeron. Special thanks to Daniel Krob, who carefully read the manuscript, and found many mistakes. I also want to thank the whole Department of Mathematics and Computer Science of the Universite du Quebec a Montreal, for excellent working conditions, and especially Dominique Chabot, Sonya Comtois, Lucie Lortie, Marlaine Grenier, and Diane Amatuzio for their typing. Finally, I was supported by a grant of NSERC (Canada) during the three years I wrote this book.
Contents Index of notation xv 0. Introduction 1 0.1 The Poincare-Birkhoff-Witt theorem 1 0.2 Free Lie algebras 4 0.3 Elimination 7 0.4 Appendix 12 0.5 Notes 13 1. Lie polynomials 14 1.1 Words, polynomials, and series 14 1.2 Lie polynomials 18 1.3 Characterizations of Lie polynomials 19 1.4 Shuffles 23 1.5 Duality concatenation/shuffle 26 1.6 Appendix 33 1.7 Notes 39 2. Algebraic properties 40 2.1 The weak algorithm 40 2.2 Subalgebras 44 2.3 Automorphisms 45 2.4 Free sets of Lie polynomials 49 2.5 Appendix 50 2.6 Notes 51 3. Logarithms and exponentials 52 3.1 Lie series and logarithm 52 3.2 The canonical projections 57 3.3 Coefficients of the Hausdorff series 61 3.4 Derivation and exponentiation 76 3.5 Appendix 80 3.6 Notes 82
xii Contents 4. Hall bases 84 4.1 Hall trees and words 84 4.2 Hall and Poincare-Birkhoff-Witt bases 89 4.3 Hall sets and Lazard sets 98 4.4 Appendix 101 4.5 Notes 103 5. Applications of Hall sets 105 5.1 Lyndon words and basis 105 5.2 The dual basis 108 5.3 The derived series 112 5.4 Order properties of Hall sets 114 5.5 Synchronous codes 119 5.6 Appendix 124 5.7 Notes 126 6. Shuffle algebra and subwords 127 6.1 The free generating set of Lyndon words 127 6.2 Presentation of the shuffle algebra 129 6.3 Subword functions 131 6.4 The lower central series of the free group 136 6.5 Appendix 147 6.6 Notes 152 7. Circular words 154 7.1 The number of primitive necklaces 154 7.2 Hall words and primitive necklaces 158 7.3 Generation of Lyndon words 161 7.4 Factorization into Lyndon words 163 7.5 Words and multisets of primitive necklaces 166 7.6 Appendix 170 7.7 Notes 174 8. The action of the symmetric group 176 8.1 Action of the symmetric group and of the linear group 176 8.2 The character of the free Lie algebra 180 8.3 Irreducible components 185 8.4 Lie idempotents 194 8.5 Representations on the canonical decomposition 201 8.6 Appendix 206 8.7 Notes 215
Contents xiii 9. The Solomon descent algebra 217 9.1 The descent algebra 217 9.2 Idempotents 224 9.3 Homomorphisms 233 9.4 Quasisymmetric functions and enumeration of permutations 242 9.5 Appendix 248 9.6 Notes 254 References 256 Index 267

Notation К ^К(Л), ^(Л) М(Л) t', t" Id ad(c) (st”) Un A Л* 1 A+ = Л*\1 IM 14 к<л> к«л» (S, P) (S, w), (P, w) (s, 1),(P, 1) 5 x <5 = (id ® a) c <5 Ad r LU 6P a~lS,a~lP cone <5' sh E * ,<y commutative ring with unit free Lie algebra over К on the set A 4, 18 free magma on the set A 4 immediate left (right) subtree of the tree t 5 degree of the tree t 5 mapping у [c, y] 7 the tree (.. .((s, t), t),... ,t) 10 the submodule of an enveloping algebra generated by nth powers of Lie elements 13, 57 alphabet, set of noncommuting variables 15 free monoid on Л 15 empty word, neutral element in Л* 15 15 length of the word w in Л*1.1; weight of the word w in T* 224 number of occurences of the letter a in w 14 K-algebra of noncommutative polynomials on Л, free associative К-algebra on A 15 K-algebra of noncommutative formal series on Л 17 pairing К<<Л>> x К<Л> -> К 17 coefficient of the word w in S, P 15, 17 constant-term of S, P 17 coproduct a t—> a ® 1 4- 1 ® a 19, 52 principal anti-automorphism of К<Л> 19, 52 19, 52 K-algebra homomorphism К<Л> -> EndK(K<^>), ai—>ad(a) 19, 53 right normed bracketing, bracketing from right to left 20, 52 shuffle product 24 p-fold coproduct 25 26, 46 concatenation product K(A) ® K(A) K(A) 27 dual coproduct 27 shuffle product К<Л> ® К<Л>-> К<Л> 26 maps a polynomial to its constant term 27 convolution product in End(K(/l>) 28 complete tensor product К(Л)®К(Л) (shuffle at the left, con- catenation at the right) 28 dual convolution product
xvi Notation concp 6'p shp I Pu~l S„ 7Ti D(w) Uff D£S e f(t) н h’h" i\ Xi(s), pi(s) T(s) S„ C) F(A) M I F„(A) lhn nh 0(4") Pn, Рл st(w) K<4>„ En ch In л(Т) D(T) maj(T) Xм Хл "-Л Ui /(>) g°f p-fold concatenation product K<4>®p -» K<4> 31 p-fold dual coproduct 31 p-fold shuffle product K<4>®p -> K<4> 31 left normed bracketing, bracketing from left to right 37 41 symmetrized product 57 symmetric group of order n canonical projection K<4> -» U„ 58 canonical projection K<4> -> £f(A) 58 descent set of w 62 right action of permutation о on word w 63 sum of permutations whose descent set is contained in S 65 anti-automorphism <rt—><r~1 in KSn 65 foliage of tree t 84 Hall set 84 rewriting rule on standard sequences 86 standard factorization of Hall word h 85 Hall polynomial 90 mappings defined on standard sequences 91 derivation tree of a standard sequence 5 91 dual basis of K<4> 108 derived ideal of the free Lie algebra 112 binomial coefficient on words 131 free group on A Magnus transformation F(A) -» Z<<4>>, at—> 1 4- a 132 infiltration product 134 lower central series of F(A) 136 Hall words of length <n 136 Hall exponent 136 series without term of degree < n 138 power sum symmetric function 156, 178 standard permutation of word w 167 space of homogeneous polynomials of degree n multilinear part of K<4)„ 176 characteristic of a representation 178 characteristic of the nth Lie representation 180 shape of a standard tableau T 185 descent set of standard tableau T 185 major index of standard tableau T 185 irreducible character of Sn 186 value of irreducible character of S„ 188 space of homogeneous Lie polynomials of degree n 194 sum of permutations whose descent set is S 195 Dynkin-Specht-Wever Lie idempotent 195 Lie idempotent of Klyachko 196 subspace of K<4> 201 length of partition Л 201 plethysm of symmetric functions
Notation xvii hn qn Qc complete homogeneous symmetric function 202 projection of К<Л> on its graded component of degree n 217 convolution product of the qn corresponding to the composition C 218 Г Г„ C(a) S(C) Ac К Я д T C(vr) /(w) M„ K, z(C), z(P) •i-M *P| к > p. с С convolution subalgebra generated by the qn 217 graded component of Г 217 descent composition of permutation о 222 subset associated to a composition 223 is Dss(c> 223 Solomon descent algebra of S„ 223 is 0(Zn) 224 projection onto U, 224 is {г,, t2, t$,...} 224 composition associated with w in T* 224 length of w in T* 224 element of Q<r> 224 element of Q<r> 224 algebra homomorphism -* Г, t, qt 225 partition associated to composition C, polynomial P 227 subspace of Q<T> 227 weight of polynomial P in Q<7"> 227 partition Л is finer than p 227 automorphism of Q<7’> 228 is C ° 7 228

о Introduction The aim of this preliminary chapter is to show that the Lie algebra of Lie polynomials, which is introduced in Chapter 1 and which is the subject of this book, is indeed the free Lie algebra. Apart from some elementary universal constructions, two results are required: the Poincare-Birkhoff-Witt theorem (actually only its consequence, that a Lie algebra is embedded in its enveloping algebra), and the fact that the free Lie algebra is a free module. The first result will not be proved here, as it is included in many textbooks. The second one is proved in Section 0.3. 0.1 THE POINCARE-BIRKHOFF WITT THEOREM Let К be a commutative ring with unit. A Lie algebra over К is a К-module together with a K-bilinear mapping x -> !£, (x, y) [x, y], called a Lie product or Lie bracket, satisfying the two following relations, for any x, y, z in Lf: [x, x] = 0, (0.1.1) [[x, y], z] + [[y, z], x] + [[z, x], y] = 0. (0.1.2) The latter identity is called the Jacobi identity. Note that (0.1.1) implies antisymmetry, i.e. [x,y] + [y,x] = 0, (0.1.3) because 0 = [x + y, x + y] = [x, x] + [x, y] + [y, x] + [y, y] = [x, y] + [y, x], by (0.1.1), bilinearity, and (0.1.1) again. In view of (0.1.3), we may rewrite (0.1.2) as [[x, y], z] = [x, [ y, z]] + [[x, z], у]. (0.1.4) Subalgebras of Lie algebras and homomorphisms between Lie algebras are defined as usual. Given an associative algebra over K, it acquires a
2 0 Introduction Fig. 0.1 natural structure of Lie algebra when [x, y] is defined by [x, y] = xy — yx. Indeed, [x, y] is bilinear, (0.1.1) is immediate and (0.1.2) is easily verified. Let У be a Lie algebra, and consider a Lie algebra homomorphism from /.T into an associative algebra .ч/, with its natural Lie algebra structure. Among all these algebras j/, there is one which has a universal property, stated in the next result. Proposition 0.1 Let <£ be a Lie algebra over K. There exists an associative algebra si0 over К and a Lie algebra homomorphism q>0: -+ si 0 having the following property: for any associative algebra si and any Lie algebra homomorphism <p: -> si, there is a unique algebra homomorphism f: si0 -► si making the diagram of Fig. 0.1 commutative. The algebra siQ is unique up to isomorphism. This algebra .я/0 is called the enveloping algebra of J?. Proof We first prove uniqueness up to isomorphism. Let six be another couple like <p0, ,У0. Then, using Fig. 0.2, we deduce the existence of algebra homomorphisms ft si0 -> six and gt six -+ .я/0 such that f ° (p0 = <Pi and g°<Pi = <Po- Then id ° <p0 = <p0 = g ° (px = g°f acp0, and id, g°f are both Fig. 0.2
0.1 Poincare-Birkhoff-Witt theorem 3 Fig. 0.3 algebra homomorphisms. By looking at Fig. 0.3, and by uniqueness, we find that g ° f = id. Similarly, f ° g = id. This shows that ,c/0 and are isomorphic. We now prove the existence of за/0. Let T = T(ST} be the tensor algebra of ST over K, that is T(^) = © ^®". n>0 Then T has a natural structure of associative algebra with unit. Let I be the ideal of T generated by the elements x®y — y®x — [x, j](x, у e ST); finally, let з/0 = T/I and <p0: ST -+ з/0 be the composition <p0 = p ° i, where i is the canonical injection ST -+ and P the canonical surjective algebra homomorphism T -+ T/I. Note that since Ker p = I and by the definition of I, we have p([x, yj) = p(x ®y — y®x). Hence, for any x, у in ST, (p0 ([x, Я) = p([x, у]) = p(x ® у - у ® x) = p(x)p( j) - pO)p(x) = [p(x), = [p° i(x), p° i(y)] = [<p0(*)> <Po(j)l Hence, <p0 is a Lie algebra homomorphism. Next, let з/ and <p be as defined in Proposition 0.1; then, since T is the tensor algebra of ST, there is a unique extension of cp to an algebra homomorphism (p:T->sf. Now, for, x, у e ST, we have <p(x ® у — у ® x — [x, XD = <p(x)<p(У) ~ <p(y)<p(x) ~ 4>{\x, у]) = 0, because (p is a Lie algebra homomorphism. Thus, I <= Ker ф, which implies that ф defines an algebra homomorphism f\ sT0 = T/I -> st, defined by ф = f ° p. Now, for any x in ST, we have f ° <p0(x) = f °p° i(x) = ф ° i(x) = <p(x), which shows that the diagram in Fig. 0.1 is commutative. The homomorphism f is unique, because (Po{ST) generates з/0. □ The next result is the Poincare-Birkhoff-Witt theorem. We do not give a proof, because many books contain one (e.g. Cartier 1954/55; Jacobson 1962; Humphreys 1972; Dixmier 1974; Hochschild 1981; Lothaire 1983). Theorem 0.2 Let ST be a Lie algebra over K, and suppose ST is a free К-module with a totally ordered basis (Xi)ieI. Let be its enveloping algebra
4 0 Introduction Fig. 0.4 and (pQ: be the natural Lie algebra homomorphism. Then is a free К-module with basis the set of decreasing products <р0(хь) • • • <Po(xin)(n 0, ii,..., in g I, f > • • • > i„). Corollary 0.3 Let <£ be a Lie algebra over K, and suppose it is a free К-module. Let be its enveloping algebra and <p0: <£ -> .c/0 be the canonical Lie homomorphism. Then <p0 is injective. This result allows us to consider a Lie algebra as a Lie subalgebra of its enveloping algebra, especially in the case when К is a field. 0.2 FREE LIE ALGEBRAS Let JTq be a Lie algebra over K, A a set and i: A -+ a mapping. The Lie algebra .У20 is called free on A if for any Lie algebra and any mapping ft A -+ <£, there is a unique Lie algebra homomorphism f\ У20 -+ <£ such that the diagram in Fig. 0.4 is commutative. Theorem 0.4 For each set A, there exists a free Lie algebra ^f(A) on A, which is unique up to isomorphism. Moreover, <S?(A) is naturally a graded K-module, i is injective, the component of degree 1 of Jzf(A) is the free submodule generated by A = i(A), and ^(A) is generated, as a Lie algebra, by A. We denote the free Lie algebra by LK(A) or JT(A), and we also say that =^(A) is freely generated by A. Recall that a magma is a set with a binary operation. The free magma M(A) over A may be identified with the set of binary, complete, planar, rooted trees with leaves labelled in A. Equivalently, trees may be identified with well-formed expressions over A, which are recursively defined by the following: each element of A is a well-formed expression; if t', t" are well-formed expressions, then t = (t', t") is a well-
0.2 Free Lie algebras Fig. 0.5 formed expression, which is identified with the tree obtained by taking a new root, with immediate left subtree t' and immediate right subtree t". The binary operation of M(A) is the mapping M(A) x М(Л) -+ М(Л), (f, t") i—► t. We define the degree |t| of a tree t to be the number of its leaves, i.e. |t| = 1 if te A and |(t', t")l = |t'| + |t"|. Proof of Theorem 0.4 (i) We prove first uniqueness of the free Lie algebra. Let i:A -+ and j: A -+ where У20, are free on A. By the diagrams in Fig. 0.5 we deduce the existence of Lie algebra homomorphism j-. and i: -+ such that j°i = j and i°j= i. By Fig. 0.6 we deduce that id: =% -+ is the unique Lie algebra homomorphism such that id ° i = i. Since we have i = i°j = i°j° i, we therefore must have i°j = id. Similarly, j i = id, which shows that i, j are isomorphisms. Hence and are isomorphic. (ii) Let D(A) be the free (noncommutative, nonassociative) K-algebra over A. One may view D(A) as the К-module freely generated by M(A), the free magma over A. Multiplication in M(A) is linearly extended to D(A). An ideal in D(A) is a submodule of D(A) which is closed under multiplication on the left or right by any element of D(A). Let I be the ideal of D(A) generated by the elements (xy)z + (yz)x + (zx)y, (0.2.1) Fig. 0.6
6 0 Introduction and xx, (0.2.2) with x, у, z g D(A). Let &(A) be the quotient module D(A)II. It is immediate that ^(A) has a multiplication inherited from D(Aj, and that with this multiplication, &(A) is a Lie algebra over K. Moreover, with the canonical mapping A -+ &(A), £f(A) is clearly the free Lie algebra on A. Now, with the degree defined on M(A), D(A) is a graded К-module, with О(Л)1 = ©aeAKa. Since the relations (0.2.1) and (0.2.2) are homogeneous of degree >2 and since multiplication increases the degree, £f(A) is also a graded module with ^(A\ = ©aeAKa. □ Let л/0 be an associative algebra over К and j: A -> a mapping. Then X) is called free on A if for any associative algebra and any mapping /: A -► «я/, there exists a unique homomorphism of algebras f such that the diagram in Fig. 0.7 is commutative. Uniqueness up to isomorphism is proved as in the case of free Lie algebras, or enveloping algebras. Existence is shown below, and a direct construction is done in Chapter 1. Let JTfA) be the free Lie algebra on A, i: A -> JT(A) the corresponding mapping, and л/0 the enveloping algebra of £f(A) with <p0: f?(A) ->• л/0 the corresponding Lie algebra homomorphism. Then we have a mapping j = cp0 A:A^ j/0. Theorem 0.5 The enveloping algebra of the free Lie algebra £^(A) is a free associative algebra on A. The Lie algebra homomorphism q>0: ^(A) -> is injective, and cp0(JT(A)) is the Lie subalgebra of .g/0 generated by j(A). Proof (i) Let л/ and f be as shown in Fig. 0.7. Then we have Fig. 0.8, in which we show the existence of g and f, homomorphisms of Lie algebras and associative algebras, respectively. By the universal property of the free Lie algebra <T(A), we deduce existence and uniqueness of g. Then by the universal property of the enveloping algebra, we deduce the existence of
0.3 Elimination 7 Fig. 0.8 /.To prove uniqueness of f, suppose we have the commutative diagram in Fig. 0.7. Then, as above, we find a unique Lie algebra homomorphism g such that g°i = f. Since j = <p0°i, we have f = f°j = f°(Po°i, and by uniqueness of g, we deduce g = / ° <p0. Now, by the universal property of the enveloping algebra, we deduce uniqueness of f. (ii) In view of Corollary 0.3, it is enough to show that ^T(A) is a free К-module to deduce that <p0 is injective. This will be done independently in the next section (see Corollary 0.10). Now, the proof of Theorem 0.4 shows that ^T(A) is generated, as a Lie algebra, by i(A). Hence, ср0(^(А)) is generated by <p0° i(4) = j(A). □ 0.3 ELIMINATION We begin by stating a theorem which allows us to ‘eliminate’ one variable. Recall that a derivation of a Lie algebra is a linear endomorphism D such that D([x, yj) = [Dx, y] + [x, Dy], If c is an element of a Lie algebra JT, we denote by ad(c) the linear mapping <£ -+ defined by ad(c)(y) = [с, у]. By (0.1.4) and (0.1.3), ad(c) is a derivation of Recall that A is canonically embedded in the free Lie algebra (4). Theorem 0.6 Let c g A. Then the К-module ^f(A) is the direct sum of Kc and of a Lie subalgebra which is freely generated, as a Lie algebra, by the elements (-ad(c))n(b), n>0, be A\c. (0.3.1)
8 0 Introduction Note that an element (0.3.1) is of the form [.. .[[b, c], c],..., c], with n cs. The reader may verify that the subalgebra of the theorem is the Lie ideal generated by Л\с. We begin with a lemma. Lemma 0.7 Let be a Lie algebra. (i) The set Der(Jzf) of derivations of is a Lie subalgebra of the algebra of linear endomorphisms of . (ii) If V is another Lie algebra and h: -+ Der(Jzf) is a Lie algebra homomorphism, then there is a unique Lie algebra structure on = <£ © <£', extending that of and and such that Vx g Vx' g [x', x] = /i(x')(x). (0.3.2) (iii) If is free on T, then each mapping T -+ extends uniquely to a derivation of УТ. The Lie algebra is called the semidirect product of and ST' with respect to the homomorphism h: ST' -+ Der(^f). Proof (i) Let D15 D2 be two derivations. We show that DX°D2 — If °D2 is again a derivation. We have D, - D/[x,)']) = D,([D,x, v] + [x, = [D,D,x, y] + [D;X, D,y] + [Цх, D,y] + [x, ЦВД. This implies that [Di, ВД([х, y]) = D, о D2([x, y]) - D2 о D,([x, y]) = у] + lx [£>i, аду)], hence [D15 D2] is a derivation. (ii) It is clear that the Lie algebra structure on is completely defined by (0.3.2), because the Lie bracket must be distributive and satisfy [u, v] = — [и, и]. Conversely, define the bracket by (0.3.2). We verify the Jacobi identity. We must show that [[x + x', у + у'], z + Z] + [[у + y',z + z'],x + x'] + [[2 + z',x + x'],y + y'] = 0, where x, y, z g ST, x', y', z' g ST'. By multilinearity and antisymmetry, we only have to consider four cases: (1) [[*',/],/] + • • • (2) [[x', y’], z] + • • • (3) [[x', y], z] + • • • (4) [[x, y], z] + • • •
0.3 Elimination 9 Cases (1) and (4) are immediate consequences of the Jacobi identity in and У. In case (2), we have, by antisymmetry: [[%', y'], z] + [[y', z], x'] + [[z, x'], у'] = 6([x', y'])(z) - /i(x')((6(y')(z)) which is 0, because /i([x', y’]) = 6(x')6(y') — h(y')h(x'). In case (3), we have [[< У], z] + [[y, z], x'] + [[z, x'l y] = [6(x')(y), z} - h(x')([y, z]) - [6(x')(z),y], which is 0, because /i(x') is a derivation and by antisymmetry. (iii) We use the previous part with the following: jSf = as a К-module, with trivial Lie bracket, i.e. [x, y] = 0 for any x, у in jSf' = Jf(T) as Lie algebra; 6(x')(x) = [x', x] for any x in <£' and any x in where the Lie bracket is taken in the free Lie algebra ^(T); then h(x) is a derivation of <£ (actually any endomorphism of is a derivation of because the Lie bracket is trivial), and x'i-> /i(x') is a Lie algebra homo- morphism (see Section 0.4.1). So^ = x gets a Lie algebra structure with [(x, x'), (y, y')J = ([x', >0 + [x, У'Ъ [x', У])» where brackets are taken in У(Т). Define a Lie homomorphism f : ^(T) -* by f(t) = (d(t), t), where d is the given mapping T -+ exists, because У is free on T. We may write /(x) = (D(x), u(x)) for any x in <£(T\ By the definition of the product in u(x) is a Lie homomorphism ^f(T) -+ <£". Moreover, u(t) = t for t in T. Hence, и is the identity and f (x) = (D(x), x). Then №,y]), [x,y]) = /([x,y]) = [/(X),/(y)] = [(Dx,x), (Dy,y)] = ([Dx,y] + [x, Dy], [x, y]), which shows that D is a derivation of Jz?(T) extending d. □ Proof of Theorem 0.6 Let = ^f(c), the free Lie algebra on c (which is of dimension 1), and У = ^f(T), the free Lie algebra on T= N x B, with В = A\c. By Lemma 0.7(iii), there is a unique derivation I) o! !/’ such that D(n, b) = — (n + 1,6), for (n, 6) in N x B. The linear mapping h: <£' -+ Der(J$f), c i—> D is a Lie homomorphism. Hence, by Lemma 0.6(ii), we may define the Lie algebra = <£ © <£', whose product extends that of <£ and <£', and such that [c, (n, 6)] = h(c)((n, b)) = D((n, b)) = — (n + 1, 6). Now, since 3\A) is free on A, there is a unique Lie algebra homomorphism 1Д: ^f(/l) -+ such that <Д(с) = c, and ф(Ь) = (0, b) for be B. Similarly, there are unique Lie homomorphisms <p': -+ £^(A) and cp: -+ Jz?(/1), such that <p'(c) = c and <p((n, 6)) = (— ad(c))"(6). Let E be the subset of У defined by E = {x g \(p ° D(x) = [c, <p(x)]}. It is a linear subspace, containing T: indeed, (p°D(n,b) = <p( —(n + 1,6)) = -(-ad(c))n +1(b)= —[( — adc)n(b), c] = [c, <p(n, 6)]; moreover, since
10 0 Introduction у|—► [с, у] is a derivation, Е is a Lie subalgebra of У'. х, у е Е implies д> ° D([x, у]) = <p([Dx, у] + [х, Dy]) = [<р ° Dx, <ру] + [<рх, <р ° Dy] = [fc, <рх], <ру] + [<рх, [с, <ру]] = [с, [<рх, <ру]] = [с, <р([х, у])]. Непсе, Е is equal to <£, and we have <p ° Dx = [c, <px] for any element of Define a linear mapping a: -+ ^(Л) by a(c) = c and a(x) = <p(x) if x is in . Then a is a Lie homomorphism: indeed, we have a([c, x]) = a(/i(c)(x)) = a(Dx) = <p ° Dx = [c, <px] = [a(c), a(x)]. The homomorphisms i/r°a and a°i/r are the identity of and ^(Л), respectively. Indeed, ф ° a(c) = <Д(с) = c, and ф ° a((n, b)) = ф ° <p((n, b)) = <Д(( — ud(c))"(b)) = — (аЛ(фс))п(фЬ) (because ф is a Lie homomorphism) = (— ad(c))"(0, b) = (n, b) (by a straightforward induction, using the definition of the Lie bracket in J^); since c and the (n, b) generate the Lie algebra JS^, we deduce that ф ° a is the identity of . A similar argument shows that a ° ф is the identity of &(A). Thus, a is a Lie isomorphism -+ &(A). Observe that a maps T onto the set of elements (0.3.1), and onto Kc. Since = <£ © JS?' as a К-module and since JZ is free on T, the theorem is proved. □ Consider again the free magma M(A) (see Section 2). There is a canonical mapping ф: M(A) -> ^(Л), which is the identity on A and which sends each tree (t', t") to [фЬ, <Дг"]. If s, t are two trees, we denote by (stp) the tree (.. .((s, t), t),..., t). Consider trees t0,..., tn and subsets T0,...,T„+l of М(Л) with to e = > G e Л = {(ttg) | p > 0, t g T0\t0}, tneTn = {(tt^JIp^O^Gr^At^J, Tn + l = {(ttpn)\p>Q,tcT„\t„}. (0.3.3) Corollary 0.8 With the above notations, one has a direct sum decomposition (as K-module) ^(Л) = Кф(10) ©© Кф(1„) ©&n + 1, where ^n + l is a Lie subalgebra freely generated by ф(Т„ + 1). Proof For n — — 1, this is clear. Let us assume that this is true for n — 1 (n > 0), and we prove it for n. Since <£n is freely generated by ф(Тп), we have
0.3 Elimination 11 Now, we have t„ g T„ and Tn+l = {(tt^)\p > 0, t g Tn\t„}. Thus, by Theorem 0.6, ^(Tn) is the direct sum Kt„©^', where jjf' is freely generated by the elements [.. .[[t, tj, t J,..., t„] in ^(T„), t g T„\t„. Return- ing to ^(Л), we find that <£n = Ki//(tn) © JS£+ n where &n + l is freely generated by ф(Тп + 1). □ We say that a subset E of M(A) is closed if for each tree (t', t") in E, one has t', t" g E. In other words, E contains all the subtrees of its elements. A Lazard set is a totally ordered subset L of Л/(Л) such that for any finite, nonempty and closed subset E of М(Л), one has L n E = {t0 > tj > • • • > t„}, (0.3.4) for some n > 0, such that (0.3.3) holds, and that moreover Tn+1r\E = 0. (0.3.5) Corollary 0.9 IfL is a Lazard set, then ф(Ь) is a basis of the К-module (A). Proof The set ф(Ь) is linearly independent: indeed, it is enough to show that it is the case for each finite subset К of E; then we can find a finite closed subset E 0 containing K, and it is enough to show that ф(Е n E) is linearly independent; this is a consequence of (0.3.4), (0.3.5), and Corollary 0.8. Now, let P 0 be in ^f(4); the latter is linearly generated by ф(М(А)), hence we may find a finite nonempty subset В of A such that P is a linear combination of <Д(М(В)). Denote by d the total degree in the variables in B. Let d(P) = d, and define E= {t g M(B)\d(t) < d}. Then E is a nonempty, closed and finite subset of М(Л), so we have (0.3.4), (0.3.3), and (0.3.5). Note that for each t = t0,..., t„, d(t) < d, and, hence, also d(<A(O) < d because ф does not increase degrees. We have, by (0.3.5), t g Tn + i => d(t) > d. Hence, since ф is homogeneous, each nonzero element of ф(Т„+1) is homogeneous of total В-degree >d. Since the Lie bracket is homogeneous, the same holds for each homogeneous component of each element of the subalgebra generated by ф(Тп+1). Hence, by Corollary 0.8 and the degree assumption on P, the latter is a linear combination of <А(г0),...,1Д(гя). □ Corollary 0.10 ^(A) is a free K-module. Proof In chapter 4, we shall define Hall sets, show that they exist (Proposition 4.1) and that each Hall set is a Lazard set (Theorem 4.18(i)):
12 0 Introduction these two results will be proved independently of this chapter. Hence, Lazard sebexist, which implies by Corollary 0.9 that У(А) is a free K-module. □ 0.4 APPENDIX 0.4.1 Variants of the Jacobi identity The Jacobi identity (0.1.2) may be rewritten, using (0.1.3): [X, = [[x, y], z] + [y, [x,z]]. This means that the linear endomorphism ad(x): у i—> [x, y] is a derivation of the Lie algebra У2, i.e. ud(x)([y, z]) = (ad(x)(y), z~] + [y, ad(x)(z)]. The Jacobi identity may also be written [*, [У,2]] - [у, [X, z]] = [[x,y],z]. This is equivalent to (ad(x), ad(y)](z) = ad([x, y])(z), and means that x i—> ad(x) is a Lie algebra homomorphism -► End(jSf). Another equivalent form of the Jacobi identity is [x, [y, z]] = [[x, y], z] - [[x, z], у]. This may be used to show by induction that if a set X generates a Lie algebra, then the latter is linearly generated by the left to right bracketed elements (or left normed elements) [.. .[[x15 x2], x3],..., x„], Xi g X. 0.4.2 Witt formula Let A have q elements and a„ denote the dimension of the homogeneous component of degree n of ^(A) over a field K. Then Theorems 0.2 and 0.5 imply that п = E p.x", n > 1 л > 0 where /?„ is the dimension of the component of degree n of the free associative algebra on A. Now, it is not difficult to see that /3„ = qn (see Section 1.1). Then, by taking the logarithm of the previous formula and by Mobius
0.5 Notes 13 inversion, one obtains the formula of Witt (1937): = - E /'(W7*'- П d | л 0.4.3 Canonical decomposition Let <£ be a Lie algebra over K, and its enveloping algebra. We assume that J2? is a free К-module, hence <£ is embedded in л/. Suppose that К contains Q and define Un as the linear span of the elements Г ч 1 V (x1,...,x„) = - X *,(1) • • • *<7(л)> n! <reS„ for each choice of xn ..., xn in <£ (S„ is the symmetric group). Then Uo = K, Ux = & and = © U„. (0.4.1) л>0 Indeed, denote by Vn the linear span in л/ of the elements xv .. ,xp, with p < n and x, g Jzf. Then x1...xn = (x1,...,x„)mod Vn.x. (0.4.2) This is a consequence of the formula xy = [x, y] + yx, which implies that all products xff(1)... xa(n} are congruent to Xj ... xn mod К-p Summing up, we get (0.4.2). Now, take the хг in an ordered basis В of JZ. Then eqn (0.4.2) gives triangular relations between the elements x1...xn (хг g B, Xj > • • • > x„) and the elements (xn ..., x„). Hence, Theorem 0.2 implies that the latter form a basis of <s/, which implies (0.4.1). Note that Proposition 3.6 shows that Un coincides with the submodule of generated by the elements x", x g . 0.5 NOTES In Sections 0.1 and 0.2, we have followed Bourbaki (1972). Note that Corollary 0.3 is true under weaker hypotheses (Cohn 1963). For the Lazard elimination process (Section 0.3), we have followed Viennot (1978). Theorem 0.6 and its corollaries are due to Lazard (1960).
1 Lie polynomials After introducing words, noncommutative polynomials and series, we define Lie polynomials. One of the main results presents the various characteriza- tions of Lie polynomials. This naturally leads us to define the shuffle product, and to study the duality between concatenation and shuffle product; in other words: the Hopf-algebra-like properties of the free associative algebra. Related questions are treated in the appendix, including the support of the free Lie algebra (the set of words which may appear in Lie polynomials), the free Lie p-algebra and the Jacobson identities, the kernel of the left to right bracketing mapping and a brief excursion into automata theory. 1.1 WORDS, POLYNOMIALS, AND SERIES Let A be a set, which we call an alphabet, whose elements are letters', for the main applications, the alphabet will be finite, but it is convenient to admit infinite alphabets. A word on the alphabet A is a finite sequence of elements of A, including the empty sequence, called the empty word. With the concatenation product, the set of all words over A gives rise to a monoid called the free monoid on A and denoted by A*; the neutral element is the empty word, denoted by 1. The set of nonempty words is denoted by A + . Each letter is itself a word, and each word w is the product of its letters, from left to right: w = ava2 (ate A). Here, n is the length of w, denoted by |w|. For each letter a, we denote by |w|a the number of occurrences of a in the word w: it is the length of w with respect to the letter a. A factor of a word w is a word и such that w = xuy for some words x, y, if, moreover, x = 1, и is called a left factor, or a prefix of w; a right factor (or suffix) is defined similarly. A factor и of w is called proper if и / w, and nontrivial if и / 1. The terminology ‘free monoid’ is justified by the following universal property.
1.1 Words, polynomials, and series 15 Proposition 1.1 For any mapping f from A into a monoid M, there is a unique extension of f to a monoid homomorphism f : A* -+ M such that the diagram in Fig. 1.1 is commutative (where i is the natural injection A -+ 4*). Proof If w = ara2 . a„(at e A) then clearly we must have f(w) = f(ai)f(a2) - - • Л“Л So f is unique. Moreover, it is easily verified that if f is defined in this way, then 7 is a monoid homomorphism and satisfies f = f°i. □ Let К be a commutative ring with unit. A noncommutative polynomial on A over К is a linear combination over К of words on A. We simply say polynomial when no confusion arises. If P is a polynomial, we write it as P= £ (P, w)w. we A* Thus, (P, w) is the coefficient in P of the word w; all but a finite number of the (P, w)'s are zero. The set of all polynomials is denoted by K<4). It has a K-algebra structure, with componentwise addition, and product defined by (PQ,y»)= z (p.uxe.0. w = uv In other words, it is the K-algebra of the monoid A*. As we shall consider other products on K<A>, we call the product just defined the concatenation product and K<4> with this product the concatena- tion algebra. Note that A* is contained in K<4>, as a submonoid: K<4> is a free К-module with basis A*. The algebra K<4> is the free associative K-algebra generated by A, as is shown by the following universal property. Proposition 1.2 For each mapping f from A into an associative K-algebra , there is a unique extension off to a K-algebra homomorphism f: K(Aj> -> .<з/ such that the diagram in Fig. 1.2 is commutative (where i is the natural injection AK(Af).
16 1 Lie polynomials Proof If f is such an extension, then f\A* is a monoid homomorphism A* -> jrf (multiplicative structure of зз/), so f\A* is unique by Proposition 1.1. Since A* linearly generates over K, f must be unique. Now, denote by g the extension of f to a monoid homomorphism A* -> stf (g exists by Proposition 1.1). Define f to be the linear extension of g to зз/: it exists because A* is a basis of the К-module Х<Л>. Given two polynomials P = ^ueA, (P, u)u, Q = (Q, v)v, we have PQ = ^u,v (P, u)(Q, v)uv, hence /те) = Е(Л«)(С, »)»(«>) U, V = E (p’ U)(Q’ v)g(u)g(v) U, V = E (p’ MW) E (G’ v^(v) U V which shows that f is an algebra homomorphism. Evidently, fi = f □ The degree of a nonzero polynomial P is deg(P) = sup{|w|, w g A*, (P, w) / 0} and deg(O) = — oo. Similarly, for a letter a in A, the partial degree of P with respect to a is dega(P) = sup{|w|a, w g A*, (P, w) / 0} and dega(0) = - oo. A polynomial P is homogeneous of degree n if P is a linear combination of words of length n, and finely homogeneous if P is a linear combination of words having all the same partial degrees with respect to all letters; homogeneous components and finely homogeneous components are defined as usual.
1.1 Words, polynomials, and series 17 A formal series (or series) on A over К is an infinite formal linear combination S = £ (S, w)w. we A* The constant term of S is (S, 1). The set of all series is denoted by K((A>>. It acquires a K-algebra structure, with product (ST,w)= Y (S,u)(T,v). w — uv As before, we use the word ‘concatenation’ when confusion with other products may arise. Note that К<Л> is a subalgebra of К<<Л>>. There is a natural duality between К<Л> and К<<Л». Indeed, define the pairing К«Л» x К<Л> -> К, (S,P)^(S,P)= E (S,w)(P,w). we A* This sum is finite because P is a polynomial. It is easily seen that with this pairing, may be identified with the dual space of К<Л>. When restricted to K(A), this pairing gives a scalar product on К<Л> with A* as orthonormal basis. We need some topological remarks, but we shall not go into too much detail. Put on К the discrete topology, and consider on K<<4>> the smallest topology such that each mapping Si->(S,w), K«A»->K, is continuous. Equivalently, a fundamental system of neighbourhoods of a series S is the family of subsets, indexed by finite subsets L of A*, VL(S) = {TeK((A)) | VwgL,(P, w) = (S, w)}. Then becomes a complete topological ring, and К<Л> is a dense subring of This topology is sometimes called the A-adic topology. When A is finite, it is defined by the ultrametric distance d(s, t) = for some fixed 6,0 < в < 1, where for any nonzero series S, co(S) is the length of the shortest word w such that (S, w) / 0, and w(0) = + co. This topology has the following nice property, with respect to infinite sums: if (5,)Ie/ is a family of series such that for each neighbourhood of 0, all but a finite number of these series are in this neighbourhood, then the family
18 1.2 Lie polynomials (S^ is summable, and its sum 5 is defined for any word w by (S, w) = £ (S„ w). ie I Observe that only finitely many terms in the right-hand sum are nonzero, by hypothesis. This condition may also be expressed by saying that the family (5,) is locally finite. In particular, if a series S has constant term 0 (equivalently co(S) > 1), then each family (a„S")„>o is summable, and one may define a„S". In particular, (1 — S)-1 = £„>0 Sn, and if К contains Q, one defines СЛ exp(S) = es = £ —, log(l + S) = £ ------------Sn, n>o n! „>i n and one has the usual formulas log(es) = S, exp(log(l + S)) = 1 + S. Similar consideration apply to infinite products, and to other rings of formal series, like the complete tensor product 1.2 LIE POLYNOMIALS Given two (noncommutative) polynomials P, Q in К<Л>, their Lie bracket (or Lie product) is as usual defined by [Л Q1 = PQ-QP- A Lie polynomial is an element of the smallest submodule of К (Ay containing A and closed under the Lie bracket. By Theorem 0.5 it is the free Lie algebra on A, so we denote it by ^K(A) or 5£(A). Moreover, K(Ay is the enveloping algebra of ^K(A). The next result is elementary, but useful. If L is another commutative ring with unit and <p: К -> L a Z-linear mapping, then we still denote by <p the mapping К<Л> -+ £<Л> defined by <p(P) = £we?1* <p((P, w))w. Note that if (p is a ring homomorphism, then so is its extension to K(Ay. Lemma 1.3 (i) ^(A) is finely homogeneous, that is, if P is a Lie polynomial, then each finely homogeneous component of P is a Lie polynomial. (ii) With L and <p as above, <p(P) is a Lie polynomial in L(Ayfor each Lie polynomial P in К (Ay. Proof (i) is true when P is a letter. Moreover, if it is true for P, Q, and if cte K, then it is also true for P + Q, ctP and [P, Q] = PQ — QP. So it is true for any Lie polynomial.
1.3 Characterizations of Lie polynomials 19 (ii) If (p is a ring homomorphism, then so is its extension to K<A>, and clearly the lemma holds in this case. Let и: Ж -> K,v: £ -> L be the canonical ring homomorphisms sending 1 to 1. Then each Lie polynomial P e &k(A) may be written as a finite sum P = Y.<‘I“(PI), (1.2.1) where a,- g K, P{e This is clear when P is a letter, and is easily extended by induction to all of &K(A). Now, if QgZ<A>, then for a in K, (p(au(Q)) = cp(ct)v(Q); indeed Q = ^n7w7(n7eZ, w7g Л*), hence <p(au(0) = <p(^ n7avv7) = <p(n7a)vv7 = £ n7<p(a)w7 = <p(a)u(0, <p being Z-linear. Thus, by (1.2.1) and linearity of <p, <p(P) = X <P(aiu(-f/)) = E ф(а/М^)- Since Pt is in J^Z(A) and as v is a ring-homomorphism, we conclude that и(Д) is in У\(А)', hence so is <p(P). □ 1.3 CHARACTERIZATIONS OF LIE POLYNOMIALS We shall characterize Lie polynomials among the set of all polynomials. For this, we need to define several linear mappings. Define a homomorphism of K-algebras (sending 1 to 1) <5: К<Л>-> К<Л>®кК<А>, <5(a) = a ® 1 + 1 ® a, for any letter a. Such a homomorphism exists by Proposition 1.2. Define a linear mapping a from К<Л> into itself, in the following way: for any word w = aY ... a„(a{g A), let a(w) = ( — l)"a„.. .av In other words, a is the anti-automorphism of K<A) which sends each letter «to —a. In particular, a(PQ) = a(Q)a(P) for all polynomials P, Q. Now, define J = (id® a) ° d, where id is the identity of К<Л>. For any polynomial P, recall that ad(P) denotes the linear mapping K<A) -+ К<Л) defined by ad(P)(Q) = [P, Q] = PQ- QP. (13.1) Moreover, define a homomorphism of K-algebras Ad from К<Л> into End(K</l» by Ad(a) = ad(a) for any letter a; Ad exists by Proposition 1.2. Note that Ad # ad in general: e.g. ad(ab) is the endomorphism Q i—> [ah, Q] = abQ — Qab, while Ad(ab) is the endomorphism Q i—> [a, [b, Q]] = abQ — aQb — bQa + Qba. Let D: К (A) -> K(A> be the linear mapping which sends each word w of length n into nw. It is easily verified that D is a derivation of KfAy, that is, D(PQ) = D(P)Q + PD(Q).
20 1 Lie polynomials Actually, D is the unique derivation of К (A) such that D(a) = a for any letter a. Finally, define the ‘Lie bracketing from right to left’ or the ‘right normed bracketing’ to be the unique linear endomorphism r of К<Л> such that for any word w = a{ ... an of positive length, one has r(w) = [a15..., [u„_15 «„]. .Moreover, r(l) = 0. For example, one has for a, b, c in A: r(abc) = [a, [b, c]] = [a, be — cb] = abc — acb — bca + cba. In the next result, the ring К is assumed to be a Q-algebra, that is, a ring containing Q as a subfield. We shall also assume that A has at least two letters; the theorem is trivial in the one-letter case (except for condition (ii), which is not equivalent to the others). Theorem 1.4 For P in K(A), the following conditions are equivalent: (i) P is a Lie polynomial', (ii) ad(P) = Ad(P) and (P, 1) = 0; (iii) b(P) = P ® 1 + 1 ® P; (iv) b(P) = P ® 1 - 1 ® P; (v) (P, 1) = 0 and r(P) = D(P). The equivalence of (i) and (iii) is often expressed in the following way: let A' be a copy of A, with bijection a i—► a'; let each letter in A commute with each letter in A'. Then, denoting a polynomial P by P(a, b,...), (iii) is rewritten as P(a + a', b + b',...) = P(a, b,...) + P(d, b',...). Thus, a poly- nomial is a Lie polynomial if it is additive, in the above sense. Theorem 1.4 is still true if К is a Z-algebra without torsion (see Corollary 4.17). However, it is not true if К is a field with nonzero characteristic; when the characteristic is a prime number p, it characterizes the free p-Lie algebra (see Section 2.5.2). Lemma 1.5 Define two linear mappings A, cone: K(A) ® KfA) -> KfA) by z(P ® Q) = D(P)Q and cone (P ® Q) = PQ. Then for any polynomial P, one has 2°<5(P) = r(P) and cone ° <5(P) = (P, 1). Proof We have b(l) = 1 ® 1 and D(l) = 0 so that z°<5(l) = 0 and cone ° <5(1) = 1. It remains to show that for n > 1 and an,..., аг in A, one has Л°д(а„... аг) = r(an.. .af), (1.3.2) and cone ° b(a„... aj = 0. (1.3.3) We do it by induction on n. The case n = 1 is easy: b(«!) = (id (x)a)°b(«i) = (id ® a)(aj ®1 + l(g)zi1) = zz1<g>l — 1 ® aj,
1.3 Characterizations of Lie polynomials 21 so that A ° 5(aJ = D(ar) — £>(1)^ = аг = г(аг) and cone ° 5(aJ = at — at = 0. Suppose that eqns (1.3.2) and (1.3.3) are true for n > 1; we prove them for n + 1. Let . ,«i) = Qt i Then S(an ...«i) =£-Д® a(Q,), hence we have by induction E D(Pi)<*(Qi) = Ao d(a„ ...ar) = r(a„... aj, (1.3.4) and E Pi°L(Qi) = cone ° d(a„... af) = 0. (1.3.5) Now, we have <5(«n+i .. .«J = (id® a)o<5(an+1 .. .aj = (id ® ct)(d(a„+1)d(a„... a J) = (id ® a)((an+ j ® 1 + 1 ® an+ j)(£ Pf ® Q,)) = (id ® a)(£ «„ +1^- ® Qi + £ Л ® «л+ iQ.) = E a» + ipi ® a(Gf) “ E pi ® a(&K +1, because a is an anti-endomorphism of the algebra К<Л> and a(an+1) = — an+l. Thus, we have ... af) = £ D(an + 1^.)a(Qi) - £ WXQ.K+i = E «n + i^(Q.) + E - E WMQ.X + i = 0 + a„+lr(an ...aj- r(a„ ...a1)a„ + 1 = [a„+1,r(a„.. .ui)] = r(a„+1... aj, where we have used in the second equality the fact that D is a derivation such that D(an+ J = an+ n and (1.3.4) and (1.3.5) in the third one. Moreover, we have cone ° <5(u„ 4-i ...Ui) = £ «л +1 ^f^(Qi) E + i 0» by (1.3.5). □
22 1 Lie polynomials Define a linear mapping p: K(A) ® K(A) -> End(K</l» by pU\®P2)(Q) = PiQP2- Lemma 1.6 (i) For any polynomial P, one has ad(P) = p(P (x) 1 — 1 (x) P) and Ad(P) = ц°<5(Р). (ii) p is injective if A has at least two letters. Proof (i) The first equality is immediate. The second one holds if P is a letter, and so it is enough to show that p ° d is multiplicative, because Ad is, so both are algebra homomorphisms which coincide on the generators, and thus are equal. Now, p ° 6 = p ° (id ® a) ° d, and d is multiplicative. A routine verification shows that p ° (id ® a) is multiplicative: p о (id ® a)((Pt ® Р2МГ ® Q2))(P) = p ° (id ® «XPiQi ® P2Q2)(P) = р(Ле1®а(е2)а(^))(Ю = P1Q1Pa(Q2)a(P2) = M(P1®a(P2))(Q1Ra(Q2)) = p(J\ ® ^PJ^plQ^ ® a(Q2))(K) = (^o(id®a)(P1®P2)) °(^°(id ® a))(Qi ® Q2)(K). □ (ii) Note that A* x A* is a basis of the free К-module K(A) (x) К<Л>. Take an element x # 0 in this space; it may be written x = <i<n * ut ® v(, where * indicates a nonzero coefficient, and (uf, v{) are distinct couples of words. Let ur be of minimal length among all the uh take N greater than the lengths of all these words, and let a, b be two distinct letters in A. We show that u1aNbv1 is different from each word for i > 2: this will imply that p(x)(aNb) / 0, hence p is injective. Suppose u1aNbvl = ща'^Ьь^. Then, by minimality of u15 ux is a left factor of uf: uf = urs => aNbvt = saNbvt. Because N is big, s is a left factor of aN, hence s = aj, which implies aNbvr = aj+NbVi. Since a / b, we must have j = 0; therefore = uY and Vj = vt. □ Lemma 1.7 a(P) = — P for any Lie polynomial P. Proof This is clear for each letter, by definition of a. Furthermore, if this equality is true for polynomials P, Q, and if a e K, then it is also true for P + Q, aP and for [P, QJ: indeed, a([P, Q]) = a(PQ - QP) = a(Q)a(P) - a(P)a(0 = ( —Q)( —P) — ( —P)( —Q) = — [P, Q], because a is an anti- endomorphism of the K-algebra К<Л>. □
1.4 Shuffles 23 Proof of Theorem 1.4 (i) => (ii) The set of polynomials P satisfying ad(P) = Ad(P) is a submodule of К (A) containing A and closed under Lie bracket: indeed if ad(P{) = Ad(Pi), i = 1,2, then for any polynomial Q ad([Pv P2])(Q) = ad(P.P2 - P2P^Q) = P1P2Q - p2p& - QP,p2 + Qp2p^ and P2])(Q) = Ad(P,P2 - P2P^Q) = (Ad(J\) > Ad(P2) - Ad(P2)^Adm)(Q) = (ad(P1) о ad(P2) - ad(P2) о ^(PJXQ) = LA, [P2, Q]] - [P2, ЕЛ, QUl = Pffl2Q - PrQP2 - P2QPr + QP2Pr - P2P,Q + P2QPr + PrQP2 - QPrP2 = p,p2Q + QP2P, - p2p,Q ~ QPffli- Hence, ad([Pi, P2]) = Л^([Р15 P2]) and the set of polynomials satisfying (ii) contains all Lie polynomials. (ii) => (iv) We have by hypothesis and Lemma 1.6(i) that fflP ® 1 — 1 0 p) = ц о <5(P). Then by Lemma 1.6(ii), we obtain P ® 1 — 1 ® P = d(P). (iv) => (v) Apply the mappings z and cone of Lemma 1.5 to the equality J(P) = P ® 1 - 1 ® P. We obtain r(P) = D(P) and (P, 1) = P - P = 0. (v) => (i) Write P as £„>0 P„, where P„ is the homogeneous component of degree n of P; then r(P) = D(P) = £ nP„ (by definition of D). Thus nP„ = r(P)„, the homogeneous component of degree n of r(P). Since r(P) is evidently a Lie polynomial and since &(A) is homogeneous by Lemma 1.3(i). nPn is a Lie polynomial. Finally (since we may divide by n in K), Pn is a Lie polynomial, for each n > 1. To conclude, observe that Po = (P, 1) = 0. (i) => (iii) We have shown that (i) => (iv). Hence, if P is a Lie polynomial, we have <5(P) = (id ® a)°5(P) = (id ® a)(P® 1 — 1 ® P) = P ® 1 + 1®P, because a is an involution, and by Lemma 1.7. (iii) => (v) We have 3(P) = (id ® a) ° д(Р) = P ® 1 + 1 ® oc(P). Applying the mappings z and cone of Lemma 1.5 to this equality we get r(P) = D(P) and (P, 1) = P + a(P). Taking constant terms, and observing that (a(P), 1) = (P, 1), we have (P, 1) = 2(P, 1), hence (P, 1) = 0. □ 1.4 SHUFFLES Let w = aY ... a„ be a word of length n and I be a subset of {1,..., n}. We denote by w\l the word ah ... aik, if I = {ix < i2 < • • • < ik}; in particular, w\I is the empty word if / = 0. Such a word w\l is called a subword of w.
1 Lie polynomials 24 Note that when = U Ij, j=i then w is determined by the knowledge of the p words w|/7 and the p subsets Ij. Given p words ur,..., up of respective lengths n15..., np, their shuffle product, denoted by u{ lli - • -lliup, is the polynomial W1 LU - • -LUUp = X Ip) where the sum is extended to all p-tuples (/15..., Ip) of pairwise disjoint subsets of {1,..., и} (n = nx + • • • + np) such that and |/j| = n} for any j = 1,..., p, and where the word w = w(Ilf... ,Ip) is defined by w\Ij = Uj for j = 1,..., p. Note that иг ш• • -шup is the sum of words of length n, so it is a homogeneous polynomial of degree n. In particular, if one of the Uj is empty, it may be omitted without changing the shuffle product. Moreover, the product does not depend on the order of the words Uj, as the reader may verify. A word appearing in the shuffle product uY ш • • ш up is called a shuffle of «i,..., up. Thus, a shuffle of иг,..., up is a word obtained by ‘shuffling’ together the words ut,..., up without changing each word Uj. The shuffle product is then the sum of all shuffles, with multiplicities. For example, with a, b, c 6 A, we have: ab ш ас = abac + 2aabc + 2aacb + acab. The shuffle product ш is extended to K<A> and K<<4>>, by the formula 51lu---lu5p = Y (5i> Mi) • • (Sp, wp)wx ш • • • uj up. Ml, ... ,Up This infinite sum makes sense, because it is locally finite: indeed, for any word w, there are only finitely many p-tuples (uY,... ,up) such that w is a shuffle of u15..., up. If each Sj is a polynomial, then so is their shuffle product. Note that ur ш (w2 ш w3) = (ux ш w2) ш w3 = иг ш u2 ш w3. Hence, the shuffle product is associative. As it is also clearly distributive with respect to addition, we obtain on K<A> and K<(4>> a commutative algebra structure called the shuffle algebra. The neutral element is the empty word. The shuffle product is intimately related to the homomorphism 6 of the previous section. We consider the complete tensor product fP=K(A>
1.4 Shuffles 25 with its concatenation structure. An element of #p is an infinite linear combination E aui....,UpWi ® • • -®«p- ui, ..., upeA* The product in Jp is defined by (iq ® ® ® ® vp) = («1^1) ® ® (upvp) and extended by infinite linearity (this always makes sense). Define an homomorphism of concatenation algebras dp. K((A>> -> by bp(a} = a ® 1 ® ® 1 + 1 ® a ® ® 1 + ••• + 1 ® 1 ® ® a for any letter a. Indeed, dp so defined extends to a unique algebra homomor- phism К<Л> -> /р (by the universal property of K<4>, Proposition 1.2); this extension being homogeneous and degree-preserving, it may be extended by infinite linearity to К<<Л>>. Note that d = <52. The following result shows that dp has another equivalent definition, in terms of the shuffle product. Proposition 1.8 For each series S, one has dp(S) = (S, иг ш-• -Lu up)u1 ® up. (1-4.1) Ul ,..., upe A * Recall that the scalar product of a series 5 and a polynomial P is (S,P)= X (МЛ4 WE A* Proof We have only to show that eqn (1.4.1) holds for any word w in place of 5. Let w = aY ... а„(а{ e A). Then, since dp is an homomorphism, <5P(ai a„) = ^p(a1).. .dp(a„) = П (ai ® ® ® ® ai ® ’ ’ ’ i = 1 ® 1 + • • + 1 ® 1 ® -® at) E (О®- -® (w|/p) {1....»| = /|U • <j/p(disjoint) = £ (w, UY LU- • -LU Up)Ui ® Up, ui....upeA* the last equality by definition of the shuffle product. □ The shuffle product is sometimes defined recursively in the following asymmetric way: 1 ш w = w ш 1 = w for any word w, and if и = au',
1 Lie polynomials 26 v = bv'(a, b e A, u', v'eA*), then (au') lli (bv') = a(u' lli (bv')) + b((au') ш v'). (1-4.2) This definition is equivalent to the previous one: indeed, a shuffle of au' and bv' is either an a followed by a shuffle of u' and bv', or a b followed by a shuffle of au' and v'. Equation (1.4.2) expresses the fact that a certain mapping is a derivation of the shuffle algebra К<<Л>>. Indeed, fix a letter a and define a continuous linear endomorphism S^a~1S of К<<Л>> by its effect on each word: a~Yw = и if w = au and ue A*, a~Yw = 0 if w does not begin with a. Then a xS is a derivation of the shuffle algebra, i.e. ан(5ш T) = (а~15)шТ + Хш(а'Т). (1.4.3) Indeed, eqn (1.4.3) follows easily from eqn (1.4.2). 1.5 DUALITY CONCATENATION/SHUFFLE Recall that there is a canonical scalar product к<л> x к<л> к, (p,e)= e (f. »xe. ") weA* In other words, it is the unique scalar product on K(A) for which A* is an orthonormal basis. Similarly, one defines on К<Л>®Р the scalar product for which {«i ® ® up, Up ..., up 6 A*} is an orthonormal basis. Let us view the shuffle product as a linear mapping sh: K(A) ® K<(A> -> K(A>, и ® v -> и ш v. This mapping is homogeneous and degree-preserving for the canonical degree on К<Л> ® К<Л>, i.e. deg(u ® v) = deg(u) + deg(v) = |u| + |v| for any words u, v. Consider the adjoint (for the above scalar products) of the mapping sh, that is the unique mapping sh*: K(A) -> К<Л> ® К<Л> such that for any polynomials P, Q, R (sh(P ® Q), R) = (P® Q, sh*(R)). Note that by Proposition 1.8, we have b(R) = b2(R) = £ (R, и ш v)u ® v. u,veA*
1.5 Duality concatenation/shuffle Thus, for any words u, v 27 (sh(u ® v), R) = (w lli v, R) = (w ® v, d(R)). Hence, by linearity, for any polynomials P, Q, R (sh(P®Q), R) = (P®Q, <5(R))- In other words, d is the adjoint of sh. Note that by definition, d is a homomorphism for the concatenation structure (see Section 1.3). We obtain a similar result if we interchange concatenation and shuffle. Indeed, denote by д' the adjoint of the concatenation product, viewed as a linear mapping cone: ® K(A) -> K(A), u®vi—>uv. Thus д' = cone*. We have for any polynomials P, Q, R (PQ, R) = (conc(P ® Q), R) = (P ® Q, conc*(R)) = (P ® Q, d'(R)). This implies that for any word w d'(w) = E (u ® v’ d'(w))u ® v u, ve A* = £ (uv, w)u ® V = £ u®v. w= uv This formula—which completely describes д'—means, informally, that d'(w) is equal to all decompositions in two words with respect to the concatenation product. Similarly, <5(w) is equal to all decompositions in two words with respect to the shuffle structure, with multiplicities. Define a linear mapping e: K(A> - K, e(P) = (P, 1). We have partially proved the following result. Proposition 1.9 The adjoint d of the shuffle product is a homomorphism for the concatenation product. The adjoint д' of the concatenation product is a homomorphism for the shuffle product. The mappings d and д' are described by d(w) = £ (w, и ш v)u ® v, (1.5.1) u, ve A* d'(w) = £ (w, uv)u®v. (1.5.2) u, ve A* Proposition 1.9 implies that there are two bialgebra structures on К<Л>: one with the concatenation product and d as coproduct, the other with the shuffle product and д' as coproduct.
28 1 Lie polynomials Proof It is enough to show that д': K(A) -> К<Л> ® K(A) is a shuffle homomorphism. Let us write that d is a concatenation homomorphism, i.e. 5(xy) = <5(x) <5(y). By eqn (1.5.1), this means that £ (xy, U LU v)u ® V = I £ (x, Ui LU fJUi ® Vi jl £ (y, U2 Ш V2)u2 ® V2 I U,V \ut,Vt J\U2,V2 / = J (x,ulmvl)(y,u2mv2)(u1u2®v1v2). U2.V2 Hence, for any words x, y, u, v, by taking the scalar product of both sides with и ® v we obtain (xy, wujv)= Y (x,Ui ^nvl)(y,u2mv2)(u1u2,u)(v1v2,v). Ui , V] , U2, V2 Note that this formula has a complete symmetry concatenation/shuffle. We just have to reverse the computation, interchanging shuffle and concatena- tion. Thus, by (1.5.2) and linearity, д'(и ш v) = £ (w ш v, xy)x ® у х.У = £ (Wj Ш Up x)(l42 Ш V2, y)(u, UiU^V, v1v2)x ® у X, у, ui, U2, , V2 = £ (u, UjUjXv, ViV2)l £(Uj LU vl,x)x\®l £(m2lu v2, y)yj Wl,U2,Vi,U2 \ X / \ У / = £ (u,u1u2)(v,v1v2)(u1wv1)®(u2ujv2) Ul,U2,Vi,V2 = 1 £ (w, ® U2 j Ш I Y, (Г’ yly2)yl ® V2 ) \W]*W2 / / = д'(и) ш <5'(0> where of course K(A) ® К (A) has the shuffle structure defined by («! ® U2) LU (Vj ® V2) = (Ui Ш Vj) ® (u2 Ш v2). □ We define now on End(K</l», the К-module of linear endomorphisms of К (A), a K-algebra structure, different from its usual algebra structure defined by composition of endomorphisms. For f, g in End(K</l», define f *g in End(K</l» by the formula f *g = conc-^ {f ® g)° d. (1.5.3) In other words, for any polynomial P, one has by Proposition 1.8: X <P,u4iv)f(u)g(v). (1.5.4) u, ve A* We call the product * the convolution. Recall that the endomorphism а: К < A) -> К (A) sends each word w = ax ... an (a{ g A) on its signed rever-
1.5 Duality concatenation/shuffle 29 sal a(w) = (— l)"a„... Now define an associative algebra j/, as the complete tensor product = К<Л>® К<Л>, with the shuffle product on the left of ®, and the concatenation on the right of ®. More formally, an element of si is an infinite linear combination au,rM ® v and the product in si is defined by X ct„fVu® V V X Px,yx®y I = X iuJxfumx)®(vy). u,v / \x,y J u,v,x,y Note that each endomorphism f of К<Л> is completely described by its canonical image £иеЛ* и ® /(w) in si. Note also that the identiy endo- morphism is mapped onto £u и ® u. Proposition 1Л0 With the convolution, End(K<4>) acquires an associative algebra structure, whose unit element is e. The inverse in this algebra of the identity of KfA") is a, which is an anti-automorphism of K(A)> for the concatenation product, and an automorphism for the shuffle product. The canonical embedding End(K<^>) -> si, f i—► £u и ® /(u), is an algebra homo- morphism for this product. In other words E w® ((/*#)(w)) = /(M) r ® #00 ) C1-5-5) w \ u J \ V J where the product is taken in si (shuffle on the left of ®, concatenation on the right). This proposition implies in particular that К<Л> has a Hopf algebra structure, with concatenation as product, 5 as coproduct, and a as antipode. Proof We start with eqn (1.5.5). The right-hand side is £ (w ш v) ® (f(u)g(v)) = £ (w, и ш v)w ® (f(u)g(v))j U,V U, V \ w / = E w ® (E u ш OM») )• w \u,v / Hence, eqn (1.5.5) is equivalent to (/ *0)(w) = £ (w, и ш v)f(u)g(v), U, V which is the definition of f *g (see eqn (1.5.4)). Since the product in .я/ is associative and since /h->£u®/(u), End(K<^>) -> si
30 1 Lie polynomials is injective, the convolution is also associative. Moreover, the image of e is 1(8)1, which is the neutral element of stf. Hence, e is the neutral element for the convolution. The fact that a is an automorphism of the shuffle algebra (and an anti-automorphism of the concatenation algebra) is clear by inspection. Now, by Lemma 1.5, we have for any polynomial P, conc°<5(P) = (P, 1), where <5 = (id ® a) ° b. In other words, by eqn (1.5.3), we have id * a = e. So a is the right inverse of id for the convolution. This may be written as <x(u) j = 1 ® 1. v / \ и / Now, a is an automorphism for the shuffle, hence a ® id is an automorphism of . Applying a ® id to the last equation, we obtain £ a(v) ® v Y £ a(u) ® a(u) j = 1 ® 1. Now, £u a(w) ® a(w) = Lu и ® и and a(v) ® v = v ® a(v) because a is an involution. Hence (£ v ® a(v))(£ и ® u) = 1 ® 1, which means that a is also a left inverse of id. □ If h is an endomorphism of the concatenation algebra K<A>, we say that h is a Lie endomorphism if h(A) У2(Т), or equivalently, if ^f(A). This is particularly the case if h is homogeneous, i.e. h preserves homogeneous polynomials and degrees; equivalently, h(a) is a homogeneous polynomial of degree 1 for each letter a; a particular case of this is when h is a substitution of letters, i.e. h(A) A. For later use, we prove the following lemma. Lemma 1.11 If f, g 6 End(K</l» commute with a Lie endomorphism h, then so does f *g. Proof Since h(a) is a Lie polynomial for any letter a, we have by Theorem 1.4(iii) 3 -> h(a) = h(a) ® 1 + 1 ® h(a). Furthermore, (h ® h) о 3(a) = (h® h)(a ® 1 + 1 ® a) = h(a) ® 1 + 1 ® h(a), since /1(1) = 1. Thus 3 h = (h ® h) 6, both being concatenation
1.5 Duality concatenation/shuffle 31 homomorphisms. Hence, by (1.5.3), (/ * g) ° h = cone ° (/ ® g) ° 3 ° h = cone ° (/ ® g) ° (h ® h) ° 3 = cone ° ((/ ° h) ® (g о h)) ° 3 = cone ° ((/i ° /) ® (h ° g)) ° 3 (by hypothesis) = cone ° (h ® h) ° (/ ® g) ° 3 = h° cone °(f ®g)°3 = h°(f*g), because cone ° (h ® h) = h ° cone, h being a concatenation homomorphism. □ There is a simple extension ofeqn (1.5.3). As in Section 1.4 we shall denote by 3p the concatenation homomorphism K<A> - such that for any letter a, 3p(a) = a ® 1 ® ® 1 + 1 ® a ® ® 1 + • • + 1 ® 1 ® ® a. It is easy to establish the following slight extension of Theorem 1.4(iii): if P is a Lie polynomial, then 3p(P) = P®1®--®1 + 1®P®---®1 + -- -+1®1®---®P. (1.5.6) Moreover, for any endomorphisms /15..., fp, their convolution is /i*- • •*/₽ = Сопере/! ®- • ®fp)°3p, (1.5.7) where concp(w1 ® • • ® up) = ur... up. This may be seen for example by expressing the product j\ * • • • * fp in the algebra stf and using Proposition 1.10. Because of the symmetry concatenation/shuffle in the algebra stf, there is another Hopf algebra structure K(A), with shuffle as product, 3' as coproduct and the same antipode; the convolution *' in this case is defined by f *' g = sh°(f ®g)°3', and it corresponds to the product in л/, under the embedding End(K<A>) i—► .$/, ve A* Furthermore, define 3P and shp by <5p(w) = X shp(ul ® • • -® Up) = ur LU- • -Ш Up. W = U 1 . . .Up
32 1 Lie polynomials The mapping b'p is a shuffle homomorphism, and we have /1 *'. .. *' fp = Shp о (Л ® • • ® 4) о 4, (1.5.8) for any endomorphisms ., fp of К<Л>. It may be worthwhile to note that if f, f* are adjoint endomorphisms, i.e. (/(w), v) = (u, /*(v)) for any words u, v, then one has in d Yu® = Yf*(v)®v (1.5.9) Conversely, this equality implies that f and f* are adjoint. The foregoing results have as applications some combinatorial identities on words. By Lemma 1.5, we have the identity A ° <5(P) = r(P) for any polynomial P, where A, J are the linear mappings defined by: D(w) = |w|w for any word w, z(P ® Q) = D(P)Q, <5 = (id ® a) ° 5, and r(«x ...«„) = [als..., [a„_i, a„] ...] for any letters аг,...,а„ (Lie bracketing from right to left). We may write A = cone ° (D ® id), hence the above identity may be rewritten as: r = cone ° (D ® id) ° (id ® a) ° <5 = cone ° (D ® a) ° d = D*tx. Since id is the inverse of a for convolution, we obtain: r * id = D. We write the latter equality in the algebra si with the help of eqn (1.5.5), and use the fact that if p denotes the adjoint endomorphism of r, then by eqn (1.5.9): £ w® r(u) = Yp(v)®v- U V Thus £ |w|w ® w = £ w ® D(w) = I £ p(v) ® v II £ и ® и I w w \ V J \ U J = s (p(v) ш M) ® vu- U, v Hence, we deduce £ p(v) ш и = |w|w w = vu for any word w. It is interesting to note that p has an effective recursive definition p(l) = 0, p(a) = a ifaeA, (1.5.10) p(aub) = ap(ub) — bp(au) if и e A* and a, b e A. Indeed, since p and r are adjoint we have for any words v, w (r(v), w) = (v, p(w)). (1.5.11)
1.6 Appendix 33 It is enough to check that p defined by (1.5.10) satisfies (1.5.11). Note first that p(w) and r(w) are homogeneous polynomials of degree |w|, equal to 0 if w = 1, equal to a if w is the letter a. Hence (1.5.11) is true for |w| = 0, 1. Note also that (1.5.11) is true if |v| / |w| since in that case both sides vanish. So we need only consider the case |v| = |w|. We shall use the following identity (xy, zt) = (x, z)(y, t) if x,y, z, te A*, |x| = |z|, |y| = |t|. (1.5.12) Suppose now |w| > 2, hence w = aub with a, b in A. Then by (1.5.10), the right-hand side of (1.5.11) is (v, p(w)) = (v,ap(ub)) — (v,bp(au)). Since |v| = |w|, we have v = ex, c g A, x g A*. Then by (1.5.12), (v, p(w)) = (c, a)(x, p(ubj) — (c, b)(x, p(au)). By induction, this is equal to (c, n)(r(x), ub) — (r(x), au)(c, b). This in turn is equal, by (1.5.12), to (cr(x), aub) - r(x)c, aub) = (cr(x) — r(x)c, aub) = (r(v), w), which completes the proof of (1.5.11). We can thus state the following result. Theorem 1.12 Define the linear endomorphism p of К<Л) by (1.5.10), and the endomorphism r by r(aY ...«„) = [a15..., [a„_ 15 «„] ...] for any letters аг,... ,a„. Then r and p are adjoint endomorphisms of К<Л>, and for any word w, one has £ p(v) lli и = |w|w, (1.5.13) vv = vu or equivalently, in the convolution algebra K^A), r*id = D, where D(at ...a„) = naY ... an. 1.6 APPENDIX 1.6.1 Support of the free Lie algebra We call support of the free Lie algebra the subset 5 of A* consisting of those words which appear (with a nonzero coefficient) in some Lie poly- nomial over Z. It was shown by Duchamp and Thibon (1989) that a word w is in Л*\5 if and only if w is either of the form an (ae A, n> 2), or a palindrome (i.e. a word equal to its reversal) of even length. One part of this result goes as follows: if w = an (a g A, n > 2), then by projection &(A) -> ^(a) = T.a, one sees that w is not in 5; if w is a palindrome of even length, then a(w) = w, and a(P) = - P for each Lie polynomial by Lemma 1.7, so that w has zero coefficient in P, and w is not in 5. In order to show that each word not of the previous form is in 5, the authors construct an ad hoc family of Lie polynomials. A more general problem is the following: define for each word w, the ideal
34 1 Lie polynomials (n) of Z consisting of the coefficients of w in all Lie polynomials over Z. The previous result characterizes those words with n = 0. Which words have n = 1? (This problem was posed by M.-P. Schiitzenberger.) 1.6.2 Formulas in characteristic p For any letters a, b, and integer n, one has the identity ad(a)n(b) = [a, [a,..., [a, b] ...]] = £ ( " )( - 1)W i + j = n \ I / If К is of prime characteristic p, one deduces, for any n > 0, ad(a)pn(b) = [u<b], (1.6.1) and od(a)p-1(b) = £ a‘baj. (1.6.2) i+j=p-1 This is because, in characteristic p, one has (p") = 0 for i = 1,..., p" — 1, (p;1) = (-1)* for / = l,...,p- 1 and(—l)p-1 = 1. Formula (1.6.1) implies that for any n, m, one has [a"", b₽m] = ad(a)pn(bpm) = ad(a)pn~1 ° ad(a)(bpm) = ad(a)p”~\[a, bpm]) = —ad(a)pn~1([bpm,a']) = - ad(d)pn ~1 о ad(by”\a). (1.6.3) Define a polynomial Ap(«, b) by the formula (a + b)p = ap + bp + Ap(a, b). (1.6.4) Then Ap(a, b) is a Lie polynomial (in characteristic p). Indeed, by definition, Ap(a, b) is the sum of all words of length p on the letters a, b, except the two powers ap, bp. Hence Ap(a,b) = £ Ap k(a, b), к = 1 where Ap k is the sum of all words of length p and partial degree к in b. So it suffices to show that each Apk is a Lie polynomial. Observe that £ kAPik(u, b) = (a + ь?ь(а + bY- к=1 i + j = p— 1
1.6 Appendix 35 Since 1,..., p — 1 are invertible in characteristic p, and since the free Lie algebra is finely homogeneous by Lemma 1.3(i) it is enough to show that the right member is a Lie polynomial. But it is equal to ad(a + b)p~ \b), by (1.6.2). The previous formulas are called the Jacobson formulas (Jacobson 1937; see also Zassenhaus 1939). 1.6.3 Free Lie p-algebra Let К be of prime chracterisitic p and denote by J?P(A) the К-subspace of К<Л> generated by the p"th powers of Lie polynomials (n > 0). By (1.6.3), the Lie bracket of two elements of ^fp(A) lies in ^(A). This shows that Ур(А) is a Lie subalgebra of К<Л> containing ^(A), called the free Lie p-algebra. See Section 2.5.2 for a further study. 1.6.4 Identities on words The fact that a is the antipode for the two Hopf algebra structures on К<Л>, i.e. that £ и ® a(w) = £ a(v) ® v is the inverse of £ w ® w in the algebra 3/, may be expressed in each of the four following equivalent identities on words (w is a nonempty word): 0 = £ (— l)|u|u lu v w = uv = £ (— l)|l,|w ш v w~uv = £ (— l)|u|(w, и ш v)Uv U, V = £ (— l)|,?l(w, и Ш v)uv, U, V where u denotes the reversal of и (see Schmidt 1990). 1.6.5 Derivations Theorem 1.4(iii) means that the primitive elements for the coproduct 6 are the Lie polynomials. If P is a Lie polynomial, then the adjoint of the left multiplication by P (in the concatenation algebra) is a derivation in the shuffle algebra. This is a consequence of (1.4.3) and the fact that the Lie bracket of two derivations is a derivation. Conversely, if the adjoint of the left multiplication by P is a derivation in the shuffle algebra, then P is a Lie polynomial. For the proof, use Theorem 1.4(iii). Similar results hold of course for the adjoint of the right multiplication.
36 1 Lie polynomials Symmetrically, the primitive elements for the coproduct д' are the linear combination of letters. If P is such a polynomial, then the adjoint of the shuffle product by P is a derivation for the concatenation structure. If P = a is a letter, it is the usual derivation d/да which maps each word w on £w=uei? uv- & has characteristic zero, the intersection of the kernels of all these derivations, for a in A, is equal to the (concatenation) subalgebra M of К<Л> generated by the Lie polynomials without linear part: indeed, it is easy to see that the derivative of each element of M vanishes; conversely, each element of К (A) has a unique expansion p = E p. П a aeA where the sum is over all a in N(j4), where Pa 6 M, and where the product is taken in decreasing order, for some total order on A (because К (A) is the enveloping algebra of &(A); see Theorem 0.5 and Theorem 0.2); if dIdaP = 0, for any letter a, then it is easy to see that all Pa = 0, except Po. For related problems, see Lenormand (1969/70). 1.6.6 An identity of Baker (1905) Let /: К<Л> -> У(А) be the ‘Lie bracketing from left to right’, i.e. the linear function such that /(1) = 0, 1(a) = a for any letter a, and l(Pa) = [l(P), d] for any polynomial P. Then, for any polynomials P, Q /(W)) = m/(0]. (1-6.5) It is enough to check this identity when P, Q are words; then, a few lines of computations and an induction on the length of Q give the result. From (1.6.5), one deduces that l(U(P), ПОЛ) = U2(P), 1(Q)1 + U(P), Z2«2)J - (1.6.6) By Jacobi’s identity, each Lie polynomial is of the form l(P) (see Section 0.4.1). Thus (1.6.6) implies that 1\.У(А) is a derivation of Lie algebra, that is /([Л QD = W>), QI + [Л №)], (1.6.7) for any Lie polynomials P and Q. Now, (1.6.7) implies easily, by induction on n, that for each homogeneous Lie polynomial P of degree n, one has l(P) = nP. This is Theorem 1.4(v).
1.6 Appendix 37 1.6.7 Kernel of the left to right bracketing The kernel of the linear mapping /: К<Л> -> У2(Л) is the right ideal of К<Л> (concatenation algebra) generated by the polynomials P/(P), РеК<Л>. (1.6.8) This was proved by Cohn (1951). We outline a proof which works only in characteristic 0. Observe that Ker / is a right ideal: this is because l{Pd) = [/(P), a] for any letter a. Now, Baker’s identity (1.6.5) shows that each polynomial of the form (1.6.8) is in Ker /. For the converse, we use the identity nP = у (P, и lu v)ul(v) (1.6.9) u, veA* for any homogeneous polynomial of degree n (proof below). Identity (1.6.9) may be rewritten (n - 1)P = Z(P) + У (P, и ш v)ul(v) u,v # 1 = /(P) + У (P, U LU u)ul(u) + У (P, U LU v)ul(v). U # 1 U # V U, V # 1 The second summation is a linear combination of ul(y) + vl(u), because the shuffle product is commutative. The latter polynomial is equal to (u + v)l(u + v) — ul(u) — vl(y). Thus, if P is in Ker /, it is a linear combination of polynomials of the form (1.6.8). To prove identity (1.6.9), we use the identity r*id = D of Theorem 1.12. By symmetry, we have id* / = D, which by (1.5.5) may be rewritten in the algebra за/ у и ® V ® /(r)^ = y |w|w®w. Recall that за/ is the complete tensor product of the shuffle algebra by the concatenation algebra. Thus we obtain for any word w |w|w = У (w, U LU v)ul(v). U, V This implies (1.6.9), by linearity. For related work, see Labute (1978) and Patsourakos (1987).
38 1 Lie polynomials 1.6.8 Sweedler dual coalgebra The linear mapping д' of Section 1.5 is extended to formal series by the formula d'(S) = £ (S, uv)u ® v. U, V Hence, the image d'(S) is an element of the complete tensor product К<<Л» ® К«Л», of the form i'(S) = £ S, ® Tt. (1.6.10) i The dual coalgebra (see Sweedler 1969, Chapter VI) of the concatenation algebra К<Л> is by definition the set of formal series (identified as in Section 1.1 with linear functions on such that the sum (2.6.10) is finite; that is, the series 5 such that d'(S) actually lies in the tensor product К<<Л» ® К«Л>>. Denote by R this set of series. Since д' is a homomor- phism for the shuffle product (Proposition 1.9), R is closed under shuffle product. Hence R becomes a bialgebra, with shuffle product and д' as coproduct. It is easily verified that a(R) cz R, hence R is actually a Hopf algebra. It is interesting to note that these series occur in a quite different context, derived from automata theory. It is known, and easily shown, that condition (1.6.10), with a finite sum, is equivalent to the fact that the kernel of S, considered as a linear form on K(A), contains an ideal (concatenation structure) of К<Л> of finite codimension (we assume for simplicity that К is a field; this ideal may be equivalently left, right or two-sided). This condition is also known in automata theory, where such a series 5 is called recognizable. These series have an equivalent definition: indeed, by the Kleene- Schiitzenberger theorem, a series is recognizable if and only if it is rational, i.e. belongs to the smallest subalgebra of (concatenation structure), which contains the polynomials, and which contains the inverse of all its invertible elements. One connection with automata is the following result of Schiitzenberger (1961): a language L (i.e. a subset of A*) is recognizable by a finite automaton if and only if its characteristic series is rational (or recognizable). References on this subject are Eilenberg (1974), Salomaa and Soittola (1978), Lallement (1979), and Berstel and Reutenauer (1988). Similarly, one may raise the question which are the series 5бК((Л)) such that Ж)6К«/1»®К«Л». (1.6.11) Among these series are the Lie series (e.g. the Hausdorff series) and the
1.7 Notes 39 exponentials of Lie series (see Section 3.1). Moreover, the set of series satisfying (1.6.11) is a concatenation subalgebra of K<<4>>, because 3 is a homomorphism. If К is an algebraically field of characteristic 0, then the converse holds: the set of series satisfying (1.6.11) is the concatenation subalgebra generated by the Lie series and their exponentials. This is a consequence of Sweedier (1969, Section 13.1 together with Lemma 8.0.l.c and Section 7.2). Note that in the case of a one-letter alphabet, this result expresses the well-known fact that a series £ antn satisfies a linear differential equation with constant coefficients if and only if it belongs to the subalgebra °f К [[t]] generated by t and the series e°" (a e К). 1.7 NOTES Theorem 1.4 characterizing Lie polynomials has different sources: charac- terization (ii) is from Finkelstein (1955), (iii) is due to Friedrichs (1953) and proved by Cohn (1954), Magnus (1953, 1954), Lyndon (1955b), and Finkelstein (1955), (iv) is from von Waldenfels (1966b), and (v) is due to Dynkin (1947), Specht (1948), and Wever (1949). For the proof, we have followed von Waldenfels (1966b), who establishes Lemma 1.5. Other proofs may be found in Ree (1958) (his proof rests essentially on eqn (1.5.13)), and Garsia (1990); see also Section 8.6.5. The introduction of the shuffle product for the study of Lie polynomials is due to Ree (1958). A signed shuffle product appears in earlier papers of MacLane (1950) and of Eilenberg and MacLane (1953). Today, the shuffle product is well understood in the light of Hopf algebras. An analogue of this product is defined more generally on the dual of each enveloping algebra (Dixmier 1974): it is simply the adjoint of the coproduct 3, defined for any Lie element as here by <5(P) = P ® 1 + 1 ® P. Proposition 1.9 is a particular case of a result in bialgebras: if an associative algebra has a coproduct which is an homomorphism, then its product is a coalgebra homomorphism (see Abe 1980, Theorem 2.1.1). The fact that 3’ is a shuffle homomorphism was also noted by Chen (1968, Theorem 1.8). The convolution product * in End(K<4>) is defined more generally in any bialgebra. For more on coalgebras, bialgebras and Hopf algebras, see Milnor and Moore (1965), Sweedler (1969), Bourbaki (1972), Abe (1980), and Hochschild (1981). Theorem 1.12 is due to Ree (1958). This result and other identities of the same kind were rediscovered and applied in control theory by Crouch and Lamnabhi-Lagarrigue (1989); one reason for this is that the shuffle product corresponds to the product of iterated integrals (Chen 1957; Fliess 1981; see also Section 6.5.4).
2 Algebraic properties In Section 2.1, we introduce the weak algorithm for noncommutative polynomials, and a similar tool for Lie polynomials. In Section 2.2, we prove the theorem of Shirshov-Witt that each Lie subalgebra of a free Lie algebra is free. In the next section, we show that the automorphism group of JT(A) is generated by elementary automorphisms, and give a jacobian-like condition characterizing automorphisms. In the final section, we characterize free sets of Lie polynomials and prove the defect theorem. In the appendix, we discuss uniqueness of rank and restricted Lie algebras in characteristic p. In this chapter, К is a field. 2.1 THE WEAK ALGORITHM Cohn’s weak algorithm is an extension to noncommutative polynomials of the Euclidean algorithm. We say that a finite family P1;..., Pn of polynomials in K(A > is (right) dependent if either some Pj = 0 or if there exist polynomials Qi,..., Qn such that deg(£7 PjQj) < maxJ(deg(PJQJ)). Observe that if non- zero polynomials P1;..., Pn are right К<Л>-1теаг dependent, then they are dependent. Further, a polynomial P is (right) dependent on Pt,..., Pn if either P = 0 or if there exist polynomials Q1;..., Qn such that deg(P - X PjQj\ < deg(P), (2.1.1) and for j = 1,..., n <teg(PjQj) < deg(P). The next result is due to Cohn (1961). Theorem 2.1 Let Pt,..., Pn be a dependent family of polynomials with deg(Pj) < • • • < deg(P„). Then some Pt is dependent on Ръ..., P^. For the proof of Theorem 2.1 we need a linear operator on К<Л) (symmetric to the operator introduced at the end of Section 1.4). Let и be any word.
41 2.1 The weak algorithm Then define the polynomial Pw-1 by Pn 1 = S (P, wu)w. we A* It is clear that P i—► PtT1 is a linear endomorphism of К<Л>; if P = w is a word, then wu~1 = x if w = xu, and wu~1 = 0 if w does not have the suffix u. The following relations hold: deg(Pw "1) < deg(P) - |u|, (2.1.2) P(ur)1 = (Pr^u1. (2.1.3) Moreover, for any letter a (PQ)a-1 = P(Qa"1) + (Q, l^a"1, (2.1.4) where (Q, 1) is as usual the constant term of Q. All these relations are easily verified when P, Q are words, and then extended by linearity. Lemma 2.2 If P, Q are polynomials and w is a word, then there exists a polynomial P' such that (PQ)w~1 = P(Qw^ + P', and P = P' = 0 or deg(P') < deg(P). Proof We may assume that P 0. If w = 1, then the lemma is evident, because P i—► Pw~1 is then the identity. Otherwise, let w = au, for some letter . Then we have by induction (PQ)u-1 = P(Qw"1) + P', deg(P') < deg(P). By (2.1.3), we have (PQ)w-1 =((PQ)w-1)a~1 =(P(Qu-1))a-1 +P'a~1. Hence, by (2.1.4) and (2.1.3) (PQ)w1 = P((Qw-1)a-1) + (Qm1. l)Pa-1 + P'a-1 = P(Qw~1) + P", where P" = (Qw-1, l)Pa“1 + P'a-1. By (2.1.2), deg(P") < deg(P), which proves the lemma. □ Proof of Theorem 2.1 We may suppose that no P, = 0. Hence deg(£, ^Q,) < maxi(deg(^Q,)) for some polynomials Q1;..., Q„. Let r = max(deg(^Q,)) and I = {i|deg(^Qi) = r}. Then R = Xiei PiQihas degree < r. Let к = sup(/);
42 2 Algebraic properties then i e I => deg(f,) < deg(Pk). Let w be word such that |w| = deg(Qk) and 0 / (Qk, w) = a-1 e K: this is possible, because Qk T- 0 (otherwise deg(P) < r = deg(PkQk) = — oo, which is not possible). By Lemma 2.2 we have iel iel for some polynomials P't with degiP'J < deg(^). Since 1 = a*", we have A + « E = (2.1.5) ie/\k iel Now, by (2.1.2) deg(Pw-1) < deg(P) - |w| < r - |w| = deg(PkQk) - deg(Qk) = deg(Pk). Furthermore, deg(P-) < deg(Pf) < deg(Pk). Hence, the degree of the right- hand side of (2.1.5) is <deg(Pk). Moreover, by (2.1.2) we have for every i e I: deg(^(Q,w_1)) = deg( Pf) + deg(Qfw-1) < deg(^) + deg(Q,) - deg(Qk) = r - deg(Qk) = deg(Pk). This shows that Pk is dependent on Ph i e I \k; a fortiori, Pk is dependent on Рр-.-’Л-Р □ We say that a Lie polynomial P is Lie-dependent on Lie polynomials Рг, . . . , Pn if P = 0 or if there exists a Lie polynomial f(x1,...,x„) in ..., xn) such that deg(P — /(P1;..., P„)) < deg(P), and that each word in the X( appearing in f is of some degree d, in with £ d{ deg(/<) < deg(P). Theorem 2.3 Let Pt,..., Pn be a dependent family of Lie polynomials, with deg(Pj) < • • • < deg(P„). Then some P( is Lie-dependent on Рг,..., Р(_г. We need the following result. It is valid mutatis mutandis in any enveloping algebra. Lemma 2.4 Let LT be a Lie subalgebra of LT(A). Then each Lie polynomial which is in the right ideal of К {A} generated by LT is already in LT. Proof Let LA be a totally ordered basis of LT {A), containing a basis of LT, and such that P, Q&LA. PeLT. Q$LT => P >Q. (2.1.6) We claim that the set of polynomials of the form Л ...P„, n > 1,Ле< Л Л > • • • > P„, (2.1.7)
2.1 The weak algorithm 43 linearly generates the right ideal I of К<Л> generated by <£. Indeed, by Theorem 0.5 К<Л> is the enveloping algebra of ^(Л), so that by Theorem 0.2,1 is linearly generated by the polynomials P = ... Pn, n > Ute.A Pt (2.1.8) So it is enough to show that each polynomial (2.1.8) is a linear combination of polynomials (2.1.7); we do this by induction on (n, t(P)), ordered lexico- graphically, where t(P) is the number of (i, j), 1 < i < j < n, such that P{ < Pj. If n = 1, then P is of the form (2.1.7), and there is nothing to prove. If n > 2, and t(P) = 0, then P again is of the form (2.1.7). Hence, we may assume t(P) > 1: then there exists i such that P( < Pi+ P If i = 1, since Pt e .У. we deduce by (2.1.6) that P2 e ЛР. Then [РрРгЗб^, because is a Lie subalgebra, hence [P1; P2] = ayQXQj e <£ n ccj e K). We have P = 1Л Л1Л • • • + P2PtP3 ... P„ = + - р,- j Polynomial QjP3 ... P„ is of the form (2.1.8), with a smaller n; moreover, P2PiP3 ... P„ is also of the form (2.1.8), with the same n and smaller t(P). Hence, by induction, all these polynomials are linear combinations of polynomials of the form (2.1.7), and so is P. Suppose now that i > 2. Then, we have [/>. Pi+ J = flkRk (Rk e <£), and P = Л • • • Pi- 10Л Pi+ il+Pi+ iPi)Pi+2 - Pn = X PkPl • • • Pi-lPkPi+ 2 • • • Pn + Pl • • • Pi- 1^ + 2 • • • Pn- к Then, a similar argument shows by induction that P is a linear combination of polynomials of the form (2.1.7). This proves the claim. Now, let К be a Lie polynomial which is in I. Then we may write *=Z«ce=ZM’ (2.1.9) Qt.H P where aQ, fiP are in K, and where the second summation is over polynomials P of the form (2.1.7). Now, the decreasing products 2,6^, Qi > • • • > Qn, are linearly independent (Theorem 0.2). Hence, (2.1.9) implies that К is a linear combination of polynomial Q e & n In particular, R is in %?. □ Denote by P the highest homogeneous component of a polynomial P, with P = 0 if P = 0. Proof of Theorem 2.3 We may suppose that no Pf = 0. By Theorem 2.1, some P, is dependent on P^..., Р^. In other words, we have
44 2 Algebraic properties <teg(pi PjQj) < de8 for some polynomials Q7, and deg(PyQ?.) < deg(PjQj) for j = 1,..., i — 1. We deduce that P.-UA, where the sum is over those j such that 1 < j < i and deg(PjQj) = deg (Pf). Hence P( is in the right ideal of К (A) generated by P1;..., P(_ P Observe that these polynomials are Lie polynomials, by Lemma 1.3(i). This implies by Lemma 2.4, applied to the Lie subalgebra generated by P1;..., 1; that Pi is in this subalgebra. We have Pt = д(Ръ ..., Pt- J for some Lie polynomial g in .y'f.Xj...., J. Let Pj = deg(Pj), p = deg(Z-). Let g = £(<0 g(d), where the sum is over all (i - l)-tuples (d) = (d^ .. and where g(d} is the homogeneous compo- nent of g of degree dj in Xj, j = 1,..., i — 1. Let f = g(d} where the sum is over all (d) with djPj = p. By homogeneity, we have Pi = f (Pi,..., P(-J. Moreover, £P(xr,..., x,_ J is homogeneous (Lemma L3(i)), so that each gid} is a Lie polynomial, hence f is too. Now, by construction, f(Pr,..., Pi-f) is equal to f(Pt,..., Р(-г) plus a polynomial of degree less than p. This shows that P( is Lie-dependent on P1; P □ 2.2 SUBALGEBRAS In this section and the following, К is a field. The next result is due to Shirshov (1953) and Witt (1953, 1956). Theorem 2.5 Each Lie subalgebra of a free Lie algebra is free. Proof Let J*9 be a Lie subalgebra of the free Lie algebra £P(A). Denote by En the subspace E„ = {Pe^|deg(P)<n}. Let <E> denote the Lie subalgebra generated by £c £P(A). Moreover, let E'„ be the subspace of En defined by E'n = En и <E„-i>- We have of course {0} = Eo = E\ s £1 s E'2 S E2 S S £„_ , £ E; s £„ s • • •. Let X„ be a subset of E„ which defines a basis of £„ mod E'„. Define
2.3 Automorphisms 45 = Un> 1 X*- We show that J2? is free on X, it is enough to show that PP is isomorphic with ^f(B), where В is an alphabet with a bijection h i-> xb, В -> X. For this, it is enough to show that: (i) X generates У: (ii) for each nonzero Lie polynomial f(b)heB e P(B), one has f(xh)heB ± 0. (i) Let P e LP with deg(P) = n. Then P is in En, hence for some scalars ax, one has Q = P - E xxxeE'n. xeXn Thus Q is in the subalgebra generated by En.x, hence by induction in the subalgebra generated by X. This shows that P is in <X>, hence X generates <P. (ii) Arguing by contradiction, suppose that /(P1;..., Pq) = 0, for some nonzero Lie polynomial f(bx,.. .,bq)e ^f(B) and some P1;..., Pq e X with deg(Pj) < • • • < deg(Pg). A fortiori, there exists a nonzero polynomial f in KfBy such that /(P1;..., Pq) = 0. Take such a polynomial with least degree, and write it as f = E bi9i- i = 1 By minimality, some , Pq) is nonzero. Since 0 = /(Л,,..,/>,) = ^ЛК. (2-2.1) i we deduce that P1;..., Pq are dependent. By Theorem 2.3 some polynomial Pi is Lie-dependent on P1;... ,P(_ P This may be written: P{ plus a linear combination of those P,, k < i, of the same degree as P{ = a Lie expression of the others (which are of degree less than P{) plus an element of EB_X, with n = deg(/<). This implies that the polynomials in Xn are not linearly independent mod E’n, which is a contradiction. □ 2.3 AUTOMORPHISMS If this section the alphabets are finite. Given two alphabets A, B, a (concatenation) algebra homomorphism f: K(A> -> K(By is completely specified by the image f(d) of the letters a in A; this is because K<A) is the free associative algebra on A. Similarly, a Lie algebra homomorphism (p: &(A) -> У'(В) is defined by its effect on the letters, because ^P(A) is the free Lie algebra on A. Note that a Lie algebra homomorphism J^(A) -> У'(В) uniquely extends to an algebra homomorphism f: К (A) —► and that an algebra homomorphism f: K<A) —► К (By is such an extension if and only if /(А) с У’(В).
46 2 Algebraic properties Recall that in Section 1.4 we have defined a linear mapping К (A) -> К<Л>, P i—> a~ гР, for each letter a in A, by a~1P= (P,aw)w. (2.3.1) we A* Given an algebra homomorphism f: K(A) -> K<B>, we define its jacobian matrix to be the following В x A matrix over K<B>: J(n = (b-1f(a))beB,aeAeK<ByBx\ The terminology stems from the analogy with the commutative case. As in the latter case, the chain rule holds. We denote as usual by J9 the matrix obtained by applying homomorphism g to each entry of the matrix J. A homomorphism g is called proper if g(a) has no constant term, for any letter a. Proposition 2.6 Let А, В, C be three alphabets and f: K<A> -> K.(By, g: К<B) -> K<C> be algebra homomorphisms with g proper. Then J(g°f) = J(g)J(f)9- (2.3.2) In proof, we need the following identities: P = (P, 1)+ E a(a*IP), (2.3.3) ae A for any polynomial P in K<A>, and a~1(PQ) = {a-xP)Q + (P, l)(a-1Q), (2.3.4) for any polynomials P, Q in K<A> and any letter a in A. Recall that (P, 1) is the constant term of P. Identities (2.3.3) and (2.3.4) are easily verified when P, Q are words, and then extended by linearity ((2.3.4) is the left-right dual of (2.1.4)). Note that a lP = 0 when P is constant. Proof Let a e A, c e A. Then c \g°f){a) = с ^(/(a))) = c 1 b (/(a), I) + E bib Via)) \ L beB by (2.3.4) = c 1 (/(a), i) + E gib)gib 7W _ beB _ = E + (g(b)’ i)c-1^_1/(«))] beB by (2.3.4) = E c гд(ь)д(ь Via)), beB
2.3 Automorphisms because g is proper. Equality of the extreme members means that the (c, a)-entry of the matrix 3{g ° /) is equal to the (c, a)-entry in the product of 3(g) by J(f)9. This proves (2.3.2). □ Let V denote the vector space £оеЛ Ka. A Lie algebra automorphism (p: &(A) -> (A) is called elementary if either qj \ Lis a linear automorphism of V, or if for some letter a, (p(a) = a + P, where P is in <f(A\a), and (p(b) = b for any letter b^a. Note that, in the second case, (p1 is defined by (p~1(a) = a — P, <p~ 1(b) = b for b + a; hence, if q> is an elementary automorphism, so is <p~ L We call jacobian matrix of a Lie algebra endomorphism of JP(A) the jacobian matrix of its unique extension to an algebra endomorphism of K<A>. Theorem 2.7 Let <p: У2(Л) -> =^(Л) be a Lie algebra endomorphism. The following conditions are eguivalent: (i) (p is an automorphism', (ii) (p is surjective', (iii) the jacobian matrix of (p is right invertible in К<Л)ЛхЛ; (iv) (p is a product of elementary automorphisms. The equivalence of (i) and (iv) is due to Cohn (1964). Proof We tacitly use the following fact: if a square matrix over К<Л> is right or left invertible, then no column or row in this matrix is 0; this may be seen by taking the image of this matrix in К [Л j. (i) => (ii) is evident. (ii) => (iii) because, since (p is surjective, it has a right inverse «Д. Indeed, define ф for any letter a by ф(а) = P for some Lie polynomial P such that (p(P) = a', since ^(Л) is the free Lie algebra, ф extends uniquely to a Lie endomorphism of £P(A). Let f, g be the algebra endomorphisms of Х.'<Л> extending tp, ф respectively. Then (p ° ф = id, hence f ° g = id. By Proposition 2.6, we obtain J(f)J(g)f = IA, the A x A identity matrix. Hence J(/) is right invertible. (iii) => (iv) by induction on d((p) = ^aeA deg(<p(a)). Since no (p(a) is zero (otherwise the jacobian matrix J(<p) of (p is not right invertible) and since (p(a) is a Lie element, we have deg(<p(a)) > 1. If d(q>) < | A I, then deg <p(a) = 1 for each letter a and we may write g>(b) = Цаел^а.ь0 for some scalars ccah. Then a-1<p(h) = ctah, which shows that J(cp) = (txa b')a beA. By hypothesis, J(<p) is right invertible in К<Л>ЛхЛ; taking constant terms of the polynomials involved, we see that J((p) is invertible in KA*A. Hence, (p defines an automorphism of the vector space £аеЛ Ka, and is elementary.
48 2 Algebraic properties Suppose now that d((p) > |Л|. Let (Pa,b)a.beA be the right inverse of J(<p). Then we have, for any letters a, c, X (a~1 <рФ))Рь,с = ba c. be A We multiply by a on the left, we sum over all letters a, and we note that (p(b) has no constant term (it is a Lie polynomial), so that (2.3.3) gives E <p(b)Pb,c = c. be A Since d(yp) > |Л|, there is some letter b0 such that deg(<p(b0)) > 2; moreover, there is some c such that PbOtC 0, the matrix (Pb c) being left invertible. Hence, for this c, deg( E <P(b)Pb'C) = deg(c) = 1 < deg(<p(h0)Pi(o>c) < sup (deg(<p(h)Pft>c)). \ЬеЛ / b This shows that the Lie polynomials <p(h) are dependent. Note that none of them is 0. Hence, by Theorem 2.3, there exists a in A and a Lie polynomial P(b)fteB in £f\B) (with В = Л\а) such that deg(<p(a) - P(<p(b)beB)) < deg(<p(a)). (2.3.5) Define an elementary automorphism ф by Ф(а) = a - P(b)beB, фф) = b if b * a. Define an endomorphism a of £P(A) by a = <p ° ф. Then we have a(a) = (р(ф(а)) = (p(a — Рф)ЬеВ) = (p(a) — P((p(b)beB) because (p is a Lie algebra endomorphism. Moreover, if b a, then a(h) = <p(<A(h)) = (рф). Hence, by (2.3.5), d(a) < d((p). Moreover, by (2.3.2), J (a) = /(<р)/(<Д)ф. The matrices J(<p) and are right invertible in К<Л>лхл. Moreover, M -> AP is a ring endomorphism of К<Л>лхЛ, so that 7(<Д)Ф is right invertible. Hence J(a) is right invertible. By induction, we conclude that a is a product of elementary automorphisms. Hence, so is (p. (iv) => (i) is evident. □ Corollary 2.8 If n polynomials generate the free Lie algebra £P(ar,..., a„), then they generate it freely. Proof Let P1;..., P„ be these n polynomials. Then (p(at) = Pt defines a Lie endomorphism of У2( Л). It is clearly surjective, so that (p is an automorphism by Theorem 2.7. Hence, P1;..., Pn generate &(A) freely. □
2.4 Free sets of Lie polynomials 49 2.4 FREE SETS OF LIE POLYNOMIALS We say that a set E of Lie polynomials is free if it generates freely a Lie subalgebra of £F(A). Recall that the derived ideal [У2, У2] of a Lie algebra У is the linear span of the elements [P, Q], P,Qe Theorem 2.9 Let Ebe a subset of ^(A) and £? the Lie subalgebra generated by E. The following conditions are eguivalent: (i) E is free-, (ii) E is linearly independent modfy2, Jzf]; (iii) E is right K(Ay-linearly independent. Proof If E is not free, then clearly E is right K<4>-linearly dependent. Suppose that this is the case: then some finite subset of E is К (A >-linearly dependent, and we may suppose that E is finite. Then E is dependent, and by Theorem 2.3 some polynomial P in E is of the form P = Q + P', where Q, P' are in ttf Q is in the Lie subalgebra generated by E\P, and deg(P') < deg(P). Note that Q may be written Q = Qr + Q2, where Qr is a linear combination of elements of E\P, and Q2 e [J*9, If P' = 0, then P — Qi e and we deduce that E is linearly dependent modf^, JS?]. If P' У 0, then replace P in E by P': we obtain a set E' which still generates and which is still K(A )-linearly dependent. By induction on £RgE deg(R), we deduce that E' is linearly dependent modfJZ2, У]. Hence + 12кеЕ\р PrR e .У], where a, fiR are scalars, not all equal to 0. This implies that txP — txQi + 0rR 6 [J*9, JS?]; this is a nontrivial linear combination of E (if a 0, this is clear, and if a = 0, it is a nontrivial linear combination of E\P), hence E is linearly dependent mod [J*9, Suppose that E is linearly dependent mod [.У9, ^]. Then clearly E is not free. □ The next result is the defect theorem. Theorem 2.10 If n Lie polynomials generate поп-freely a Lie subalgebra of &(A), then this subalgebra may be generated by fewer than n elements. Proof As in the previous proof, we obtain that one of these polynomials, P say, is of the form P = Q + P', where Q is in the Lie subalgebra generated by the others, and deg(P') < deg(P). If P' = 0, we are done. Otherwise, we replace P by P' and conclude by induction. '
50 2 Algebraic properties Corollary 2.11 Let E be a set of homogeneous Lie polynomials; then there exists a subset of E which is free and generates the same Lie subalgebra. Proof If E is not free then, as in the proof of Theorem 2.9, some polynomial P in E is of the form P = Q + P', where Q is in the Lie subalgebra generated by E\P and deg(P') < deg(P). By homogeneity, we may assume that P' = 0, hence we can remove P from E. □ 2.5 APPENDIX 2.5.1 Rank of a free Lie algebra If A generates a free Lie algebra &(A), then it also generates its enveloping algebra К<Л>. Thus A generates the algebra of commutative polynomials К[Л], because К[Л] is the quotient of К<Л> by the relations PQ = QP. We deduce that the cardinality of A is equal to the transcendence degree of К[Л] over K. Thus, this cardinality depends only on £?(A); we call it the rank of зг/(Л). A particular case is when is a Lie subalgebra of &(A). Then it is a free Lie algebra (Theorem 2.5) and, hence, has unique rank. By Theorem 2.9(iii) this rank is equal to the rank of the right ideal ^K(A) as a free K<?l)-module (each right ideal in K(A) is a free K<T4)-module, by a theorem of Cohn (1985)). The latter rank is unique, because each invertible matrix over К<Л> is square (Cohn 1985). This again shows the uniqueness of rank in free Lie algebras. 2.5.2 Restricted Lie algebra of characteristic p Let К be a field of characteristic p 0. A restricted Lie algebra of characteristic p is a Lie algebra over K, together with a mapping -> x H-* xlP] such that (i) (ax)lp] = apxfpl, for any a in K; (ii) (x + y)[pl = x[pl + y[pl + Ap(x, y), where Ap is the Lie polynomial of eqn (1.6.4); (iii) [x[p], y] = ad(x)p(y). If за/ is an associative algebra over K, then it has a natural structure of restricted Lie algebra of characteristic p, with x[pl = xp ((ii) and (iii) are satisfied by eqns (1.6.4) and (1.6.1)). Given a restricted Lie algebra let за/ be its enveloping algebra, and the quotient of за/ obtained by identifying xp and x[pl, for each x in Then sfp is called the restricted enveloping algebra of If (x,)lg/ is a totally ordered basis of ЛР, then the decreasing products <p(xfl)... <p(xln), n > 0, > • • • > i„,
2.6 Notes 51 where each x, appears at most p - 1 times and where <p is the canonical mapping У -> л/р, form a basis of Vp (see Jacobson 1962, Theorem V. 11; Bourbaki 1971, Exercise 6 of Section 2; Abe 1980). In particular, <£ is naturally embedded in its restricted enveloping algebra. With this last result, one shows that the free Lie p-algebra ^,(Л) (see Section 1.6.3) has the desired universal property in the category of restricted Lie algebras, and that K(A) is its restricted enveloping algebra. In particular, if (Pi)tei is a totally ordered basis of &P(A), then the set of elements Pft in, 0 < ..., jn < p is a basis of К<Л>. Theorem 1.4 has the following version in &P(A): a polynomial P is in Ур(А) if and only if д(Р) = P 0 1 + 1 0 P (for the proof, use the Poincare-Birkhoff- Witt theorem and the fact that d(Ppn) = Ppn 0 1 + 1 0PP" for each Lie polynomial). Equivalently, if A has at least two elements, ad(P) = Ad(P) and (Л 1) = 0. 2.6 NOTES Other proofs of Theorem 2.5 are given by Cohn (1964) and Bahturin (1987); our proof is close to the proof of Cohn. It was shown by Kukin (19776) that the intersection of two finitely generated subalgebras of the free Lie algebra is again finitely generated. A similar result holds in the free Lie p-algebra (Witt 1956; see also Kukin 1972a; Bahturin 1987, Theorem 11.8). A kind of Schreier formula holds for subalgebras of the free Lie p-algebra: if is a subalgebra of Ур(А), of codimension d, then the rank of <£ is equal to 1 + ра(|Л| — 1) (Kukin 1972a; Bahturin 1987, Theorem II.9). In the case of free Lie algebras, a subalgebra of finite codimension is in general not necessarily finitely generated. Proposition 2.6 has an equivalent form with Fox derivatives (Fox 1953). The equivalence of (i), (ii), and (iii) in Theorem 2.7 is from Reutenauer (1992) and Shpilrain (1990) (see also Shpilrain 1992 for a generalization). This result has some similarity with the jacobian conjecture in the (commutative) polynomial algebra (Bass et al. 1982). Note that a matrix over К<Л) is right invertible if and only it is left invertible; this follows because К<Л> is embeddable in a (skew) field (Cohn 1985). Equivalence of (i) and (ii) in Theorem 2.9 is due to Kukin (1977a, 1978). The analogy between free groups and free Lie algebras, as shown in the results on subalgebras and auto- morphisms, is further discussed by Baumslag (1972). For other results of algebraic nature involving the free Lie algebra, see Kukin (19726), Baumslag and Baumslag (1971) (who show that for any n, any ascending chain of Lie subalgebras of a free Lie algebra, that are generated by n elements, is finite), Yunus (1984), Unlu and Ekici (1986), Zerck (1989), Bryant (1991), and Drensky (1992).
3 Logarithms and exponentials The various characterizations of Lie polynomials obtained in Chapter 1 are extended to Lie series, in a straightforward way. These results become really interesting when one considers exponentials of Lie series. This leads naturally to the Campbell-Baker-Hausdorff formula. Section 3.2 is devoted to the canonical projections of the free associative algebra, especially the first one, whose image is the free Lie algebra and which is obtained by a logarithm. Section 3.3 is devoted to the computation of the coefficients of the Hausdorff series. A generalization of this series leads again to the canonical projection. The computation is done using descent numbers of permutations and shuffle algebra. Several properties of the coefficients are established: symmetries, generating functions, and recursions. The last section presents the original methods of Campbell, Baker, and Hausdorff, which explains the introduction of Bernoulli numbers. A closed formula is also given for the Hausdorff series. 3.1 LIE SERIES AND LOGARITHM Let К be a commutative Q-algebra. We define Lie series. First, suppose that the alphabet A is finite, and let s= z S, и>0 in К((Л>> be written as sum of its homogeneous components. Then S is a Lie series if each Sn is a Lie polynomial. If A is infinite and В c A, denote by SB the projection of 5 in That is, SB is the image of 5 under the algebra homomorphism defined by a i—> 0 if a e A\B, b i-> b if b e B. Then 5 e K<<A>> is a Lie series if, for each finite В c A, SB is a Lie series in K«B>>. The mappings b, a, b, D, and r of Section 1.3 are all homogeneous and degree-preserving. Hence, they extend (by infinite linearity) to K«A>>. Furthermore, ad and Ad extend naturally to mappings К<<Л>> ->
3.1 Lie series and logarithm 53 Епс1к(К<<Л>» by the formulas ad(S)(T) = [5, Г], 4d(S)(D= £ (S,u)(T,v)Ad(u)(i,). u, ve A* The last sum is well defined, because it is locally finite; indeed, a given word w appears only in finitely many polynomials Ad(u)(v). This being done, we easily obtain (from Theorem 1.4) the following result. Again, we assume that A has at least two elements. Theorem 3.1 Let Sbea formal series. The following conditions are eguivalent: (i) 5 is a Lie series; (ii) Ad(S) = ad(S); (iii) b(S) = 5 ® 1 + 1 ® S; (iv) 5 is orthogonal to each shuffle и ш v with u, ve A + , and (S, 1) = 0; (v) b(S) = S ® 1 - 1 ® S; (vi) (S, 1) = 0 and r(S) = D(S). Proof (a) We suppose first that A is finite. Then for each series S= E S, n>0 written as the sum of its homogeneous parts, Sn is a polynomial because there are only finitely many words of a given length. Note that the mappings d, a, d, D, and r are homogeneous and degree- preserving; more precisely, with evident notations, one has b(S)„ = b(S„), (S ® 1 + 1 ® S)„ = S„ ® 1 + 1 ® S„, and so on. Hence, the equivalence of (i), (iii), (v), and (vi) follows directly from Theorem 1.4 and the previous definition of Lie series. In order to prove the equivalence of these condi- tions with (ii), observe that, for any word w, ad(S)(w) = ad(Sn)(w) and Ad(S)(w) = £n Ad(S„)(w); since ad(S„)(w) and Ad(S„)(w) are both homo- geneous polynomials of degree n + |w|, (ii) is equivalent to saying Vn, ad(S„) = Ad(S„), which by Theorem 1.4 is equivalent to (i). Now, by Proposition 1.8, we have 3(S) = X (S, и lli v)u ® v u,veA* = £ (S, иш1)и®1+ Y, (5,1шг)1®1!-(5,1ш1)1®1 ueA* veA* + Y, (S,umv)u®v u,i>eA* = 5® 1 + 1 ® 5 - (S, 1)1 ® 1 + X <S,umv)u®v. u.ve A +
54 3 Logarithms and exponentials Hence, (iv) implies (iii). Finally, if (iii) holds, the above equation gives us -(S, 1)(1 ® 1) + X (S, иши)и®и = 0. u.veA* Hence (iv) follows. (b) Suppose now that the alphabet A is infinite. Let В be a finite sub-alphabet of A. Note that all the mappings of the theorem commute with the canonical projection К<<Л>> -> K<<B>>, S SB. Hence, the general case follows directly from the first part of the proof. □ Theorem 3. 1 implies in particular that if a polynomial is orthogonal to each Lie polynomial, then it is a linear combination of the empty word and of the polynomials и ш v, и, v e A+. Given a formal series 5 with constant term equal to 1, we may form in К<<Л>> its logarithm; indeed, write 5 = 1 + T where Г has zero constant term. Then the following infinite sum makes sense and defines the logarithm of 5: log(S) = log(l + n = L r. (3.1.1) п>1 n Similarly, one defines for each formal series U with zero constant term its exponential by the formula U" eu = exp(U)= X -г- (ЗЛ.2) n>o n! As usual, one has the formulas exp(log(S)) = S, log(exp(t/)) = U. (3.1.3) In the next result, we assume that A has at least two letters. Theorem 3. 2 Let S be a series with contant term 1. The following conditions are eguivalent: (i) log(S) is a Lie series; (ii) d(S) = S®S; (iii) the linear mapping К(АУ -> К which sends each word w on (S, w) is a homomorphism from the shuffle algebra К(АУ into K; (iv) for any series T, Ad(S)(T) = STS ~ L Proof By Theorem 3.1, log(S) is a Lie series if and only if 5(log(S)) = log(S) ® 1 + 1 ® log(S). (3.1.4)
3.1 Lie series and logarithm 55 Since d is a continuous homomorphism, we have d(S) = 5(exp(log(S)) = exp(5(log(5)). Moreover, since log(5) (x) 1 and 1 (x) log(5) commute, we have the usual identity of the exponential function: exp(log(S) ® 1 4- 1 ® log(S)) = exp(log(5) (x) 1) exp(l ® log(S)). Since T -> T ® 1 and T -> 1 (x) T are con- tinuous homomorphisms, this is equal to (exp(log(S)) ® 1)(1 ® exp(log(5)) = (S ® 1)(1 ® S) = S ® S. Finally, taking exponentials of both sides in (3.1.4), we find that log(S) is a Lie series if and only if d(S) = S ® S. Thus (i) and (ii) are equivalent. Now, by Proposition 1.8, we have 5(5) = £ (S,uujv)u®v. u, ve A* Since S®S = £ (5, u)(S, v)u ® v, u.veA* we deduce that (ii) is equivalent to Vu, v e Л*, (S, и ш v) = (S, u)(S, v). But this is equivalent to (iii). Let 5 = eu and denote by g (respectively d) the continuous linear operator on К<<Л>> defined on any word w by g(w) = Uw (respectively d(w) = wU). Then g and d commute with each other, and hence 5w5-1 = euwe~u = e9 e~d(w) = e9~d(w) = ead(V\w). Moreover, Ad: -> End(K<<4)>) is a continuous homomorphism, so that Ad(S) = Ad(eu) = eAd(U), hence Ad(S)(w) = eAd{U}(w). Thus, condition (iv) is equivalent to Ad(U) = ad(U), and we apply Theorem 3.1 (ii). □ The set of formal series with constant term 1 is a group under multiplica- tion: indeed, the constant term of a product is the product of the constant terms, so this set is closed under product; moreover, if S = 1 + T with (Г, 1) = 0 then S-1 = Xn>o (— 1)"T" also has constant term 1. The group described previously has a remarkable subgroup. Corollary 3.3 The set of series S with constant term 1 such that log 5 is a Lie series is a group under multiplication. Proof Indeed, by Theorem 3.2(ii), log 5 is a Lie series if and only if 5(S) = S® S. Now, if we also have 5(T) = T® T, then d(ST) = (ST)® (ST). Moreover, d(S ~J) = 5(5)~1 = (5 ® 5)“1 = 5 “1 ® S"1. Thus, the corollary follows. □
56 3 Logarithms and exponentials The following result is the famous Campbell-Baker-Hausdorff formula. Here, a, b are two letters. Corollary 3.4 The series log(e° eb) is a Lie series. This series is called the Hausdorff series. Proof Since log(e°) = a is a Lie series, it suffices to apply the previous result. □ Let A = {ar,... ,am}. Let a = (a1?..., am) be a path in Rm, where each function ctj(t) is real, defined on the segment [a,/?], is continuous and of bounded variation. Define for each word w in A* and t e [a, b] the iterated integral (relative to a) J' dw recursively by, J' dw = 1 if w is the empty word and if w = then f' dw is defined by the Stieltjes integral J' dw = ft (ft du) daXs). Corollary 3.5 The series log(£we4. (ft dw)w) is a Lie series. Proof By Theorem 3.2(iii), it is enough to show that for t e [a, b], the linear mapping (pt: Й<Л> -> R, wh>ftdw is a shuffle homomorphism, that is, <p,(w lu w') = <p,(w)<p,(w') for any words w, w'. If one of these words is empty, this is clear. Otherwise w = uu,, w' = vaj. Then by definition of the iterated integral, we have for any letter ak and word x: <Pt(xak)= (ps(x) dak(s), J a hence, by linearity, <Pt(Pak)= f <ps(P) dotk(s), (3.1.5) J a for any polynomial P. Now, we have from eqn (1.4.2) and by symmetry w lu w' = (u lu va^Oj + (иа, lu г)а^. Thus, by linearity and eqn (3.1.5), (pt(w lu w') = (ps(u lu va^ da,(s) + <ps(uaf lu v) da/s). J a J a This is equal, by induction on |w| + |w'|, to <ps(ru7)<ps(u) da,(s) + (ps(uai)(ps(v) da/s). J a J a
3.2 The canonical projections 57 Since by eqn (3.1.5), ф,(ш^) = J* (pr(u) da,(r), we have (viewing <ps(Mai) as a function of s) dt/p^ua,)) = <ps(u) da^s); similarly d(<ps(ray)) = <ps(v) da;(s). Hence, the previous sum of integrals is equal to (ps(vaj) d(<ps(ucZj)) + (p^uaj d((ps(vaj)) = dfp^uajq^vaj)). J a J a J a Thus, we obtain (pt(w ш w') = (pfua^qjfvaj) = (pt(w)(pt(w'), which completes the proof. □ 3.2 THE CANONICAL PROJECTIONS We assume that К is a Q-algebra. Define subspaces Un of К<Л> by letting Un be the submodule of К (Ay generated by the nth powers of Lie polynomials. This submodule has another equivalent definition. If PY,..., Pn are polynomials, define their symmetrized product by (Pt,...,P.) = ( X (3.2.1) nl aes„ where Sn is the symmetric group of order n. Proposition 3.6 Un is the submodule of К (Ay generated by the elements (Pn ..., Pn), where Pt are Lie polynomials. Proof The identity Pn = (P,..., P) (n times) proves one inclusion. The other inclusion follows from the identity X E P,Y. /£{1...n) \ie/ / which is a consequence of the inclusion-exclusion principle. □ We shall see that there is a direct sum decomposition К<л>= © un, n>0 and that the corresponding projections may be computed using the convolu- tion product ♦ of End(K<?l» defined in Section 1.5. Note that_L70 = K, If = JT(A). Observe that the complete tensor product л/ = K(Ay ® K(Ay (see Section 1.5) is a graded algebra, complete with respect to the graduation. Hence, we may define log ( x u ® w I = x —,—I E м ®u \иеЛ* / k> 1 ft \иеЛ +
58 3 Logarithms and exponentials where A+ = The right-hand side may be written as (_ i )* -1 X---------- X (a^ -•шак)®(а1...ак) = к > 1 К u i,..., Uk e A + X X ------------,-----X (W’U1 Ш' ' 'Ш w*)wi • • •)• we A * \k > 1 к / Observe that the sum at the right of ® is finite, and is equal to a homogeneous polynomial which has degree |w|. Hence, the formula log! X и ® и j = X w ® ni(w) (3.2.2) \иеЛ* / w defines a linear endomorphism of K<A>. Actually, (_ I)*-1 7ti(w)= X ------7--- E (w,Ui ш-•-ш uju! . ..uk. (3.2.3) k> 1 К u].........uke A + This formula shows that ^(w) is a linear combination of words obtained by permuting the letters of w. It is equivalent to define щ using the formula nj = log(id) in the algebra End(K<4>) with the convolution product. Now, define for each n > 0, an endomorphism by я, = 1 „Г, (3.2.4) n! (nth power for the convolution product), or equivalently in j/ (see Pro- position 1.10), 1 / X w ® 7t„(w) = - £ a® 7ti(a) , (3.2.5) w n\\„ J which may also be written as nn=\ conc„ ° (л^п) ° dn, (3.2.6) n! by eqn (1.5.7). With these definitions, we have the following result. Theorem 3.7 The module K<A> has the direct sum decomposition K(Aj = ф Un. n>0 The corresponding projections К(АУ -> U„ are the endomorphisms nn.
3.2 The canonical projections 59 The projections are called the canonical projections. The proof of Theorem 3.7 may be simplified, if one admits the first assertion, which is one version of the Poincare-Birkhoff-Witt theorem and valid in any enveloping algebra (see Section 0.4.3). We first prove two lemmas. Lemma 3.8 The image of щ is contained in £T(A). If q>: L -> К is a Z-linear mapping between two rings, then its canonical extension £<<Л>> -> К<<Л>> maps Lie series onto Lie series. This is a slight extension of Lemma 1.3(ii) which we need in the following proof. Proof Let L be the shuffle algebra К<<Л>). Then we have a canonical isomorphism л/ ~ £<<Л>). In this isomorphism, X«® « becomes the series Хиел* auw e where au = и e L. The /.-linear mapping £<Л> -> L which maps each word и onto au is a shuffle homomorphism, because au = и and L is the shuffle algebra К<<Л>>. Thus, by Theorem 3.2(iii), its logarithm is a Lie series: log(Xu auu) = fvv e J%(4). Let w be a fixed word and (p: L -> К be the mapping S i—> (S, w). Then, by the remark before the proof, X <P(fib)v is a Lie series. But by (3.2.2) this series is precisely nfw), which is therefore a Lie polynomial. □ Let I: К<Л> -> К (A) be the linear mapping defined by I(w) = w if we A+, /(1) = 0. Lemma 3.9 Let <p: К <B> -> К(Л) be a concatenation homomorphism such that (p(b) is a Lie polynomial for any letter b in B. Then <p commutes with I *k and nt, for any к > 0. Proof We have = log(id) = log(l + I) = X ( — l)k~1I*k/k, so it is enough to show that <p ° I *k = I*k°tp. Now, dk ° <p = <p®‘ ° dk because both sides are concatenation homomor- phisms and for any letter b: bk ° (p(b) = (p(b) ® 1 ® • • • ® 1 + 1 ® (p(b) ©•••@1 +------1-1 ® 1 ® • • • ® (p(b) (by eqn (1.5.6), because (p(b) is a Lie polynomial) = (p®k ° bk(b). Moreover, I ° <p = (p ° I (because (p preserves constant terms), hence I®k ° (p®k = ip®k l®k. and conck ° (p®k = <p- conck, cp being a concatenation homomorphism. Finally by eqn (1.5.7), I*kocp = conck ° I®k ° 5k ° (p = cp ° I*k, by putting together the previous equalities. □ Proof of Theorem 3.7 We show that (i) 7in restricted to Un is the identity; (ii) any polynomial P is equal to the sum X« > о
60 3 Logarithms and exponentials (iii) the image of лп is contained in l/„; and (iv) if к / n, then 7tk(Un) = 0. This will imply the theorem. (i) We have лх = log(id) = log(l + I), hence Я, = Z '/**, (3.2.7) к > 1 к in End(K</4» with its convolution product structure. This is, by eqn (1.5.7) equivalently written as (-1)*-1 7li = X ----------СОПСко I®k ° dk. k> i к Now, if P is a Lie element then, by eqn (1.5.6), dk(P) = Р®1®---®1 + 1®Р(8)---(х)1ч-------------h 1 ® 1 ® ® P (fc terms). (3.2.8) This shows that I ° dk(P) = 0, unless k= 1 (because /(1) = 0). Hence, Щ(Р) = P. Now, we have by (3.2.8) that 5„(P") = 5„(P)" = nlP ® P ® • - • (x) P + £, where X is a sum of terms ® Q2 ® ® Qn with at least one Qt equal to 1. Thus 7i^"°<5„(P") = n!P ® ® P, because 7^(1) = 0 and лх(Р) = P. This implies by (3.2.6) that л„(Р") = (1/n!) conc„ ° л^п ° <5„(P") = Pn- Since Un is generated by the polynomials P", (i) follows. (ii) We have in End(K<?l>) with the convolution structure id = exp(log(id)) = ехр(лг) = X 1 л*п = X л„, n>o«! «>о by (3.2.2), Proposition 1.10, and (3.2.4), which proves (ii). (iii) By Lemma 3.8, we have Imt^) с ^(A) - Ц. Now, we have by (3.2.5) X w ® n„(w) = X (ui ш---ши„)® (л JmJ... л1(ип)) Since the shuffle product is commutative, we can group the terms cor- responding to the same set of words iq,..., u„; hence, using the notation of (3.2.1): rc„(w) = X ---------------------r^(^l(Wi),...,7l4(u„)) ui <••<«„ N(M!,...,W„)
3.3 Coefficients of the Hausdorff series 61 where < is some total order on A*, and where N(u15..., u„) is the cardinality of the subgroup {ст e 5„ | (uff(1),..., ua<„}) = (u15..., u„)} of S„. Hence, by Proposition 3.6 7t„(w) is in Un and 1т(л„) c Un. (iv) Suppose now that В = {a}, and that in K(B) we have nfa") = 0 if n / 1. Let P be some Lie polynomial and let be defined by <p(a) = P. Then, by Lemma 3.9, itfPn) = °<p(a") = nfa”) = 0. So, it remains to treat the case of a one-letter alphabet. Observe that in j/, (a (g) a)n = ашп ® a” = nla” (g) an; hence, £ an (g) nfa") = logl £ an ® an j = log(exp(a ® a)) = a (g> a. n > 0 \n > 0 / So we deduce that nfa”) = 0 if n 1. Now, let к / n. Then by (3.2.8), bk(Pn) = bk(P)n is a sum of terms of the form P'1 (g> • • • (g> Pik, where at least one ij does not equal 1. By what we have just seen, nfP1) = 0 unless i = 1; hence, we obtain я,(Р") = 1сопс,оя®‘ог1(Р') = 0. nl This proves (iv). □ 3.3 COEFFICIENTS OF THE HAUSDORFF SERIES Recall that the Hausdorff series is the series log(ea eft), where a and b are two letters. We shall be slightly more general and consider an alphabet A = {ar,..., a„} of n distinct letters. By Corollary 3.3, the series H(a15..., an) = log(eai... ea") is a Lie series. We may write H = Z (p) where (p) is a multi-index of length n and where H{p) is the finely homogeneous component of H of multi-degree (p) = (pn ..., p„). Recall that is the canonical projection of Q<4> on to &(A) (see Section 3.2). Lemma 3.10 The homogeneous component H<p} is equal to (a? a?"\ 7li----------. \Pi! ...p„!/
62 3 Logarithms and exponentials Proof By eqn (3.2.3), the linear mapping ях is degree-preserving and even finely homogenous. We extend it to by continuity. Since is the identity on ^(A), we have 7^(5) = 5 for each Lie series S; moreover, if к 1, 7^(5*) = 0 because пл(Рк) = 0 for each Lie polynomial and Q<4> is dense in О«Л». This implies that УН-Н = пЛ У -H‘) \Р) 1 I Д-* / । / р \Л>0 *'• / = nt(exp(H)) / а1' = л1(еа' ...еа") = 7t1 X — \(р) Pi- • а"п\ V (a*1 • • аРЛ ---- = / л1|---------- Pn'J fa XpJ-.-Pn'J Since я! is finely homogeneous, we obtain the lemma. □ We shall give a formula for the coefficients of the words in H. For this let A be ordered by ar < • • • < an, and consider a word w of length p in A*. A descent (resp. a rise) of w is an index i in {1,..., p — 1} such that the ith letter in w is followed by a smaller (resp. a greater) letter. The descent set D(w) (resp. rise set R(w)) of w is the set of these indices. Let d(w) = |D(w)| and r(w) = |/?(w)|, respectively the number of descents and rises of w. Let w be written in the form w = off. aqf... (3.3.1) *1 *2 x ' where two consecutive indices are always distinct, and where all exponents are > 1. Then D(w) = {qx + • • + qj, 1 < j < m - 1, > iJ+1), R(w) = {<h + • ’ • + Qj, 1 <j < m - 1, ij < ij+l}. Moreover, d(w) + r(w) = m — 1. For example, if w = a1a1a3a2a2a1, then m - 4, D(w) = {3, 5}, R(w) = {2}, d(w) = 2, r(w) = 1. Similarly, we define the number of descents and of rises of a permutation in S„, viewed as a word of length л on naturally ordered. Note that if we S„, then d(w) + r(w) = n — 1. The eulerian polynomial En(x) is the polynomial £,(x)= X x*'1. <reS„ For example, Ex = 1, E2 = 1 + x, E3 = 1 + 4x + x2, £4 = 1 + 1 lx + 1 lx2 + x3, Es = 1 + 26x + 66x2 + 26x3 + x4.
3.3 Coefficients of the Hausdorjf series 63 The homogeneous eulerian polynomial En(x, y) is = /-*£, - = X x1My™ \y / aeSn (3.3.2) It will be convenient to put Eo = 0. Define a linear function r. Q[x] -> Q by s(xk) = (-0* к + 1 ’ In other words, we have for any P in Q[x]: r ° J -1 s(P) = P(x) dx. (3.3.3) Theorem 3.11 Let w be written as in (3.3.1), and Pi,.--,p„ its respective lengths in the letters ax,... ,a„. Then the coefficient of w in log(efll ... e°"), or equivalently the coefficient of w in the projection nfa^' ... a^/p^.... p„l), is equal to s(xd(l x x)r [J Eq.(x, x + \)/qf.), (3.3.4) \ 1 < j < m J where d (respectively r) is equal to the number of descents (respectively rises) of w. We need some preliminary results. Given a word и = b{... bp of length p, and a permutation о of Sp, define ucf = ba(V)... ba(p}. This is a right action of Sp on the words of length p: indeed, if we denote by u(i) the ith letter of u, then we have (uo)(i) = u(oi), so that (u(oaf)(i) = u(oa(i)) = (uo)(oti) = ((uo)a)(i), hence u(oa) = (uo)a. If W is a set of permutations, we define its eulerian polynomial by E„(x)= X x*”. aeW and its homogeneous eulerian polynomial by E^(x. y) = £ хд',У”- aeW Lemma 3.12 Let w be as in (3.3.1), of respective length Pi,...,p„ in the
64 3 Logarithms and exponentials letters аг,..., a„, and let и = a{'... a?". Let IV = {cr e Sp | uo = w}, where p is the length of и and w. Then E^fx) = Pl' Pn' xdtw> П M*)' - l<j<m and E^(x, y) = Pf Pd xd1w>yriw> П q1'... qm! i <j<m Proof Let /15..., Im be the consecutive intervals of {1,..., p}, of respective lengths q^... ,qm. Let G be the subgroup of S„ which leaves these intervals invariant; hence о e G if and only if ст(/у) = f, for j = 1,..., m. Since и i—> uo is a right action, and since G evidently fixes w, we have о e IV, aeG => ста e IV. Hence, W is a union of left cosets ctG. In each coset C, there is a unique element ctc whose restriction to each interval f is increasing. We claim that there are pf.... p„l/qf.... qm\ such cosets in W. Indeed, W is also a single right coset modulo the subgroup fixing u; the latter has pf.... p„! elements, hence so does W and so W is union of pt!... p„!/|G| cosets of G. The claim then follows because G has qf... .qm\ elements. Observe that for r = qr + • • • + qp i.e. the greatest element of Ij, and for any ст in W, one has r e D(ct) ore D(w). Indeed r e D(ct) о ст(г) > ст(г + 1) => и(ст(г)) > и(ст(г +1)) (because и is an increasing word) о w(r) > w(r + 1) (because w = ист) о w(r) > w(r + 1) (by the choice of г) о r e D(w); similarly, ст(г) < ст(г + 1) implies w(r) < w(r + 1), hence the reverse implications also hold. In particular, since ctc and w have no descent in Ij\{r}, we have D(ctc) = D(w). Next we claim that if a e G, then Р(стса) is the disjoint union D(w) о D(a). From the previous observation, we deduce that Л(стса) n + + q-, 1 < j < m - 1} = D(w). Since ctc|/7 is increasing, and since a permutes f, we have for i in /7\{qi +------h qj}: ct(i) > a(i + 1) о CTca(i) > + 0- This proves the claim. From the latter claims, we deduce that E„.M = Eac(x)Ea(x) = xdloc>Ea(x) = хл">Еа(х), hence E„(C(x) = xdMEc(x). Evidently, E0(x) = E„(x) ... E,„(x), so that EM = У E„c(x) = хл” У Ec(x) = xJ<’> р‘---р"\ E „(X)... E,_(x). For the second identity of the lemma, note that ст e Sp implies г(ст) + d(o) =
3.3 Coefficients of the Hausdorjf series 65 p — 1; thus by (3.3.2) ( y\ n I n ! / Ew(x,y) = yp'1Ew( I = yp~l — /".( ) П /'’'E (x,y) \y/ qf- qm' \y/ i<j<m because r(w) = m — 1 — d(w) and p = qr + + qm. □ The right action of the symmetric group Sp on the words of length p extends linearly to a right action of the group algebra Q[5p] on the linear span of the words of length p. It is convenient to extend it to all of О<Л> by the formula: wo = 0 if о e Sp and w is a word of length / p. If 5 is a subset of {1,..., p — 1}, we denote by D-s the sum, in Q[5p], of the permutations whose descent set is contained in 5. In the next lemma, we identify each permutation in Sp with the corresponding word on {1,... ,p}. Recall that the convolution product * has been defined in Section 1.5. Lemma 3.13 Let p1,...,pk (k > 1) be positive integers of sum p and S = {pi,Pi + p2,---,Pi + ' + Pk-i} the corresponding subset {1,...,p — 1}. Factorize the word 1 2 ... p as ur... uk with ImJ = ph for each i. (i) One has DsS = 0(ui ш‘ ‘ ’ш ukf where в is the linear involution of Q[5p] sending each permutation onto its inverse. (ii) Let qn denote the linear endomorphism of such that qn(£i Pi) = Pn, for each polynomial P = £ Pt written as the sum of its homogeneous compo- nents. Then, for each polynomial of degree p, one has pdsS = (%>* *qPk)(P)- The right action of Q[5p] defined previously leaves the canonical scalar product on invariant (this scalar product has A* as orthonormal basis; see Section 1.1). This implies that for any words u, v of length p and any permutation о in Sp, one has (u, vo) = (uo~ l, v). Thus, if x is in Q[Sp], we obtain (u, vx) = (ив(х), v), (3.3.5) which means that the adjoint of the linear endomorphism of Q<4>: v i-> vx is и i—> u6(x). Proof (i) A permutation о appears in the sum Dt S if and only if its descent
3 Logarithms and exponentials 66 set is contained in S, that is a(i) > o(i + 1) => i e S, for any i in {1,..., p — 1}. Moreover, a permutation a appears in lu • • lu uk if and only if for any i in {1,..., p — 1 }\5, the digit i + 1 appears at the right of i in the word a(l)... a(p), that is i ф S => a - 1(i) < a - 71 + 1), or equivalently a-1(i) > a-1(i + 1) => ie S. This implies the lemma, since the sums D-s and lu- • -lu uk+ j are multiplicity-free. (ii) The adjoint of the linear mapping P^PD-S is by (i) and (3.3.5) the linear mapping P i—> P(u1 ш -lu uk). The adjoint of the mapping qPi* - *qPk is the mapping qpi *'• • -*'qPk, where *' is the convolution product defined in Section 1.5; indeed, the adjoint of cone and d are respectively 3' and sh (Proposition 1.9), so that the adjoint of f*g = conc(/ ®g)cd is sh ° (/♦ ® g*)° д’, where /♦, g* denote the adjoint of f, g; hence, the above assertion is implied by the fact that qn is self-adjoint. Thus, it is enough to show that for any word w, one has w(u j lu lu uk) = (qpi qPk)(w). (3.3.6) Suppose that |w| = p. Let w = vx ... vk be the factorization such that |rf| = p,. Then, by definition of the right action of Sp, one has w(ul LU • • - LU Uk) = LU • • - LU Vk. On the other hand, by eqn (1.5.8), we have (qPi *' ’ -*'qPk)(w) = shk°(qpi®- • ® qPk) ° d'k(w) = shk°(qpi®- ®qPk)( X Wi®--®wk] = ^P,(W1) ® • • ® qPk(wk)^ = shk(Vi ® - ®vk) = Vi LU- -LU Vk. If |w| / p, then the same calculation shows that both sides of (3.3.6) vanish. □ Lemma 3.14 Let I: K(A) —► be the linear mapping which sends each
3.3 Coefficients of the Hausdorff series 67 nonempty word onto itself and the empty word on 0. Then for any к > 1 and any word w of length p, one has I*k(w)= J wD^s, |S| = k-l where the summation is over subsets S of {1,..., p — 1}. Moreover S<={1...р-I} |dI + 1 Proof Because of (3.2.7), we have only to verify the first equality. We have I = Хл > i Яш with the notations of Lemma 3.13(ii). Thus ^k= £ pi,•.•, Pk > i Since qPi * • • • ♦ qPk = cone ° (qPi ® • • • ® qpk) ° d sends w onto 0 if рг + • • • + pk / p, we deduce that I*k(w) = X (<?₽> *• ’ -*^)(w). pi + • • • +pk =p Since (p15..., pk) и-► {pi, pi + p2,..., Pi + • • • + pk_ J is a bijection from the set of sequences of length к of positive integers of sum p onto the set of subsets of cardinality к — 1 of {1,..., p}, we obtain the first equality from Lemma 3.13(ii). □ Proof of Theorem 3.11 The equivalence in the statement follows from Lemma 3.10. Let x = a[' ... a*”lpt\ ... pn\. We have only to show that (л i(x), w) is equal to the last formula of the theorem. By Lemma 3.14, we have (-l)|s| (n1(x),w)= X ------------------~(xD^s,w), S<=(1...p- 1} |o| + 1 where p is the common length of и = nf1 ... a$n and w. By definition of DsS, this is equal to ( — nisi ( — nisi s |S| + 1 ~sp s |S| + 1 D(a)^S ua = w with h = \/pYl.. .pn'.. This is equal to ua = w D(a) S S |S| +1 Since each subset D(o) is contained in (p~1 [d{a>) subsets of {1,..., p — 1} of
68 3 Logarithms and exponentials cardinality d(o) + i, and since i may be equal to 0,..., p — 1 — d(o), we obtain (7t!(x), w) = h X U(T= w = hsl X \U<7 = W p-l-d«7) (_ + i Л, _ 1 _ i=o d(o) + i + 1 \ i i = 0 \ i / / = hsl X xd(<T)(l \U<7=W = hsl X + x)r(ff) \U<7 = W , = hs(Ew(x, x + 1)), where IV = {a e Sp | uo = w}. Hence the theorem follows from Lemma 3.12. □ Let m, qu..., qm, d, r be non-negative integers. We denote by A(d, r;qY,..., qm) the expression (3.3.4). By Theorem 3.11, it is the coefficient of w, written as in (3.3.1), in the series log(efll ... efl"), if d (respectively r) is the number of descents (respectively of rises) of w; note that in this case d + r = m — 1. Corollary 3.15 The coefficient k(d, r; qY,..., qm) does not depend on the order of the sequence ql,...,qm. Moreover, if d,r are interchanged, then it is multiplied by (— I)41 + ’ ’ ’ Proof The first assertion is an immediate consequence of the definition of z. For the second, observe first that En(x, y) = En(y, x); indeed, the mapping which sends each digit i onto n + 1 — i is a bijection S„ -> Sn (where each permutation is viewed as word on {!,...,«}) which interchanges descents and rises. Moreover, by (3.3.2), we have En( — x, —y) = (—i)n~1En(x,y), because d(o) + r(o) = n — 1 for any о in S„. Now, using (3.3.4) and interpret- ing the linear form s as an integral (see (3.3.3)), one performs the change of variables x = — 1 — y, and the result follows by the previous observations. An alternative proof (in the case where d + r = m — 1) is to apply to log(efll ... efl") the anti-automorphism a of Q<<4>> which sends each Lie series S onto — S (Lemma 1.7). □ The previous corollary shows that if w is a word of even length with an equal number of descents and rises, then its coefficient in log(eai ... ea") is 0. This is the case in particular for the coefficient of a word of even length beginning and ending with the same letter in the Hausdorff series log(ea eb).
3.3 Coefficients of the Hausdorff series 69 Observe that by definition (3.3.4) of A, and the fact that Er = 1, we have A(d, qm) = Jfd, r; 1,..., 1, <h, • • •, qm) (any number of ones). We write X(d, r) for X(d, r; 1,..., 1). Corollary 3.16 The coefficient X(d, r) is equal to (— i)ddlrl/(d + r + 1)!. In other words, if each permutation in S„ is considered as a word on A = one has in 0<Л> n1(12...n) = £ ---------- x <*• aeSn П \ d{O) ) Proof By definition of A and the fact that Efx, y) = 1, we have f 0 A(d, r) = s(xd(l + x)r) = xd(l + x)rdx. J -1 Note that A(d, 0) = x xd dx = (— V)d/d + 1. Hence the result is true for r = 0. Now, let r > 1. Then f 0 X(d, r) = xd(l + x)rdx J -1 ’1 T f° r = ------x‘, + 1(l+x)r — I -------xd + X(1 + x)r"1 dx, |_d+l J-i J-i^+1 by integration by parts. Hence, by induction on r, we have X(d,r)=-----r—X(d+ l,r- 1) d + 1 r (-\)d+1(d+ l)’(r — 1)! “ ~d+\ (d + r + 1)! (- l)dd!r! “ (7+7+1)! ’ The last assertion of the lemma follows from the fact that 7^(12... n) is a linear combination of permutations in Sn, by (3.2.3). Corollary 3.17 Let d, r, m be nonnegative integers with d + r = m — 1. The generating function in Q[[f 15..., tm]] of the numbers 2.(d, r; qY,..., qm) is
70 3 Logarithms and exponentials In order to understand why the right-hand side is a formal power series in t15..., tm, we make the following observations: if a formal power series is an alternating function of t15..., tm, then its quotient by A = — rj) *s still a formal power series; now, the right-hand side is symmetric in t15..., tm, hence its product by A is alternating; furthermore, the quotient (Г; — tj)/(efi — e'J) is a formal power series, hence this product too. We begin with a classical lemma. Recall that we have put £0 = 0. In particular, л vanishes if some qt = 0. Lemma 3.18 The exponential generating function of the eulerian polynomials is tn i i) „>i nl ец — x Proof Let us write En(x) = a(n, k)xk, where for n > 1, a(n, k) is therefore the number of permutations in Sn with к descents. Each permutation in S„ +15 when viewed as a word, is obtained by shuffling a permutation in S„ with the digit n + 1. A closer look then shows that a(n + 1, к + 1) = (к + 2)a(n, к + 1) + (n — k)a(n, k). We have, moreover, the initial conditions a(n, 0) = 1 if n > 1 and a(Q, k) = 0 for к > 0. On the other hand, we write 1 _ pMx- 1) fn S = ----------= У b(n, k)xk —. e,(x — x „Л и! Then it is seen (by putting t = 0, then x = 0) that the same initial conditions hold for the b(n, k). Moreover, a simple computation shows that S + 1 + (x — x2) — + (tx — I) — = 0. dx dt By taking the coefficient of xk+1tn/nl, we obtain b(n, к + 1) + (к + l)b(n, к + 1) — kb(n, к) + nb(n, к) — b(n 4- 1, к + 1) = 0. Hence the b(n, k) satisfy the same recursion as the a(n, k). We deduce that a(n, k) = b(n, k), hence £ En(x)tn/nl = S. □ From the lemma, we easily deduce the exponential generating function of the homogeneous eulerian polynomials: fl- eHx-y) t(y-x) _ j У En(x, y)- =----------------=---------------. (3.3.7) n > i nl у e‘(x y) — x у — x et(y x)
71 3.3 Coefficients of the Hausdorjf series Indeed, we have by (3.3.2) and Lemma 3.18 tn i x\ tn X En(x,y)~ = X i>1 n'. „>i \yj n'. = 1 V F (A (^)" Уп>1 \y) n'. | 1 _ Qyt[(xly)~ 1] ” у еУЫхМ-и _ x/y | _ gt(x-y) ^Цу-х) _ | У qUx-у) _ x у — x et(y~x> We deduce from (3.3.7) the formula tn e' — 1 X En(x, x + 1) =----------------------= F(x, t). n! 1- x(e - 1) (3.3.8) Proof of Corollary 3.17 We have by Theorem 3.11 X z(d, r; , qm)tr ... C qi....qm = X /xd(l+x)r П EqfX,X + 1)/<7j4rT •••*£” 41..qm \ 1 < j<m / = s(xd(l+x)r X П Eq.(x,x + lyt^/qf) \ 41,...,4m 1 < j<rn j where we have extended the linear function s: Q[x] -> Q to s: QM((ri> • • • Л™)) Q, coefficient-wise. This is by (3.3.8) s(xd(l + x)r П X Eqfx, x + Ow) / Q(J - | \ = si xd(l + x)r П ----------------- . \ iZUl -x(e''- 1)/ Now, we have, for some elements Nj in the field of fractions of Q((t b ..., tm)): л т-г e,J - 1 ™ Nj xd(l + x)r П ------------------= У----------------, 1 - x(etJ - 1) J=1 1 - - 1) because the left-hand side is a rational fraction in x with degree of numerator equal to d + r = m — 1, which is smaller than the degree of the denominator, equal to m, and because the latter is a polynomial in x with simple roots.
72 3 Logarithms and exponentials To compute Nh multiply both sides by 1 — x(en — 1) and put x = \/(eh — 1), hence 1 + x = e,7(e,i — 1): (e'1 - 1)V because d + r = m — 1. Now, observe that for T in Q[x]((t!,..., tm)), s(T) = — h(T)\x= _15 where h is the Q((rls..., tm))-linear operator of О[х]((П,..., tm)) which maps xk onto xk + 1/(k + 1). Hence, h is an integration operator, and so M s _____N<_ 1 — x(e" \ m — N- Thus, the generating series of the z’s is In order to justify completely the previous computations, the reader may find it useful to perform them in L ®QWf)) Q[x]((t)), with Q((0) = Q((tb ..., tm)) and L the field of fractions of the latter. □ Recall that the Bernoulli numbers Bn are defined by the exponential generating function У и _ X „Го т cx - 1 ’ (3.3.9) It is easily verified that Bo = 1, Вг = —1/2 and that x/(ex — 1) + x/2 is an even function, which shows that Bn = 0 for odd n > 3. The first few values are B2 = 1/6, B4 = -1/30, B6 = 1/42. Corollary 3.19 For p,q > \,the number A(0, 1; p, q), i.e. the coefficient of apbq in log(ea eb), is equal to (~1)P f <f\R plql Л1Ш P+^k' Proof By Corollary 3.17 and (3.3.9), we have
3.3 Coefficients of the Hausdorjf series 73 ey _ 1 X Ж1; i, j)xlyJ = x ex ---------- j> i e — ey ex — 1 ey — C ey — e' ey — e: ex(y — x)(ey — 1) ey — ex e^x-l = “}' + (£ ХУW ~ \i>0 t' / = -y + (X X 1) \i>0 0<j<i \J/ I- / = -y+I z I ^‘+< i>0 j<ik> 1 V/ l- K- By change of variable p = i — j, q = к +j (hence к < q, j = q — к and i = p + q — k), this sum becomes cP -У+ X X , „„m».-»- P>O,<J>1 p'-q'. i <k<q(q — ky.k' Thus, we have for p,q> 1: Л(о,1;А,) = Ц^ f □ p'q' k = i\kj Let us call size of an и-tuple (qY,..., <?m) of positive integers the number (t/i — !) + ••• + (qm — 1). If this size is zero, then the qt are all equal to 1, and the coefficient A(d, is given by Corollary 3.16. The next results give recursion relations between these coefficients, allowing us to compute them by induction on the size. Corollary 3.20 The following relations hold, for non-negative integers d, r, Яи-чЯт with qm> 1: (i) i(d + l,r;qlt... ,qm)= X ~ #>Ж П <h, • • •, qm-1, qm + 1 - if 0 <i <,qm (-1)' (ii) X(d, r + 1; qY,..., qm) = X ——Bjfd, r; q^,..., qm-p qm + 1 - i).
74 3 Logarithms and exponentials Proof The first lines of the proof of Corollary 3.17 give us X Ж r; qY,..., qjtf .. .t4” = st xd(\ + x)' ...q™ X etj — 1 1 — x(eh — 1), П < i< (3.3.10) Now, we have etm — 1 1 — x(e,m — 1) so that etm — 1 1 etm — 1 1 — x(e'm — 1) etm — 1 1 — x(etm — 1) Since s is Q((tm ))-linear, we deduce et] — 1 1 1 — x(e° — 1)/ etm — 1 = si xd(l + x)r etj — 1 \ 1 - x(e° - 1)/ etj — 1 :m 1 - x(e° - 1). Note that the first series of the right-hand side does not involve the variable tm. Now, take on both sides the coefficient of t]1... t*'”, using that by (3.3.9) l/(e‘m — 1) = Xr>o (Br/r!)t^71. On the left-hand side this coefficient is, by (3.3.10), X ~A(d,r;q1,...,qm_1,qm + 1 - i). 0 < i < qm I- On the right-hand side, it is A(d + 1, r; qr,..., qm) by (3.3.10). This proves the first equality. To prove the second, use Corollary 3.15. □ Corollary 3.21 (same hypothesis as in Corollary 3.20). The following relations hold: (0 (Qm + 0Ж C qY,..., qm- 15 qm + 1) = -dt(d, r + \;qY,...,qm)~ rA(d + 1, r; qt,..., qm) X t(d + l,r + 1; <?i, •- ---, <7m); 1 < к < m — 1 qfc + «fc' = 4k
3.3 Coefficients of the Hausdorjf series 75 (ii) + W Г 4i, • • •, qm-1, qm + 1) = /.(d + 1, r; Qi,..., qm) + Ж r + 1; qu ..., qm) + E i(d + 1, r + q'm,q'^). q'm +q'm=qm Proof A straightforward computation shows that F(x, t), given by (3.3.8), satisfies dF/dt = (x + (x + 1))F + x(x + 1)F2 + 1. Hence, we deduce by (3.3.10) and because s is Q[[tm]]-linear: ~ ( E Ж C qY,..., qJW ... t4”] \<Ii./ = / (V-^1 + -v)r П ж g-)Y) \ \ l<j<m J J = sfxd+1(l + x)r fl F(x, tj) j + sfxd(l + x)r+1 fl F(x, tj)\ + s[xd + 1(l + x)r+1 П F(x, tj)F(x, tm)2 ) + s(xd(l + x)r П F(X, tj)Y \ 1 < j < m - 1 / By (3.3.8), the third term is equal to s(xdtI(l + x)'+1 I ( П (?'EJx,x+I)/,/) X t^^E^X + 1)E^(X’X + Hence, by taking the coefficient of t^' ... t’"1 in the above equality, we obtain the second relation of the corollary. Since s(/(x)) = fLi /(x) dx, we have s( ~ (*d+ ‘(1 + *)'+ ‘F(x, F(X, („))) = 0. \(7X / Now, we have ^=Fix,,r, dx which implies, similarly to what has been done above, (d + 1)Ж r + 1; qY,..., qm) + (r + 1)Ж + 1, r; qt,..., qm) + X E A(d+ l,r+l;qi,...,qfq'j,...,qm) = O. 1 < j < m q’j + q’j = qj
16 3 Logarithms and exponentials The first equality of the corollary follow from this and the second one. □ 3.4 DERIVATION AND EXPONENTIATION We begin by a general formula on the derivative of an exponential in a noncommutative algebra. We concentrate only on the algebraic aspect and omit the convergence conditions; in our applications all infinite sums are, as usual, locally finite. So, let M be an associative algebra over some commutative ring К containing Q. Let cp be an algebra homomorphism M -» M. A((p, cp)- derivation D is a function M -> Al, linear over K, and such that Vx, у e Al, D(xy) = D(x)(p(y) + <p(x)D(y). (3.4.1) We note as before by ad(a) the operator: b h-► [a, b]. Theorem 3.22 Let f(t) = Xn > о antn be a formal series with coefficients in K. With the previous definitions, one has for x in M D(f(x))= X ^ad((px)k~\Dx)f(k\<px). к > 1 к- Observe that when Al is commutative and D a derivation (i.e. cp = id), then one obtains the classical formula D(/(x)) = D(x)/'(x). Proof The formula is К-linear in f, so it is enough to prove it when f(t) = tn. Let us write (px = a, Dx = b. Then the right-hand side is X — ad(a)k~ 1(b)n(n — 1) ... (n — к + l)an~k, k>i k! which is equal to " (n\ X , \ad{a)k-\bffin-k. (3.4.2) к = 1 \kj We show that (3.4.2) is equal to D(x"), by induction on n. The case n = 0 is clear; let us assume equality for n. Then, by (3.4.1) and induction n (и\ D(x"+1) = D(xx") = ban + aD(xn) = ban + X ]a ad(a)k~1(b)an~k. k = l \kj
3.4 Derivation and exponentiation 77 Because au = [a, u] + ua = ad(a)(u) + ua, this is equal to ban + \'\ad(a)k(b)an k + J \]ad(a)k \b)a‘ k = i\kj k = i\kj I n \ = ban + У I )ad(a)k~1(b)an+1~k k=2 \k ~ 1/ " I n\ + X ( , }ad(a)k~1(b)a‘ к = 1 \kj = ba" + £ k = 2 n \ n к — 1/ \ к ad(a)k 1(b)a' n)ad(a)n(b) + \]ban nJ \1/ и + 1 /n + = E ( , \ad(a)k 1(b)al к = 1 \ к J □ Corollary 3.23 (same hypothesis) D(ex) = ead(<px) _ |\ ----------- (Dx) e*x. ad(tpx) J Proof In the theorem, let f(x) = ex. Then we obtain D(ex) = У — ad((px)k~ Y(Dx) е9Х k> i kl which proves the corollary. □ Consider again the Hausdorff series H defined by e° eb = eH in Q<<n, b>>. Let H = I Hn, n>0 where Hn is the part of H which is homogeneous of degree n with respect to a. In particular, Ho = b. Recall that the Bernoulli numbers Bn are defined by (3.3.9). Corollary 3.24 One has HY = f= a + b] + У ~ad(b)2n(a). 1 \eadW- 1J n>i(2n)l
78 3 Logarithms and exponentials Proof Let (p: Q<<n, b>> -» Q<<n, b>>, the homomorphism defined by <p(a) = 0, (p(b) = b. Let D be the linear endomorphism such that jw if |w|a = 1, D(w) = < [0 otherwise, for any word w. Thus D keeps only the words which have one a. Then Hr = D(H). Observe that for any words и and v, one has D(uv) = (p(u)D(v) + D(u)(p(v). Hence, D is a (<p, <p)-derivation of Q<<n, b>>. By Corollary 3.23 we thus obtain /ead(<pH) _ l\ a eb = D(ea eb) = D(eH) = -------------- (DH) е”я. \ ad(<pH) J Since (pH = Ho = b and DH = we deduce /pad(b) _ i\ o= ----------- (Я,), \ ad(b) / which implies H' = (e^,t, _ , )W- The second equality is a consequence of the definition (3.3.9) of the Bernoulli numbers. □ If D is a derivation of the algebra M (as above), then the linear mapping exp(D): M -> M given by exp(D)(x) = X “ и>0 И! is an algebra homomorphism M -> M. This is a consequence of the well-known Leibnitz formula в”(х>’)= E ("Wu In the case of Q<<n, b>>, we denote by S(d/db) the derivation of Q<<n, b>> which maps a on to 0, and b on to S (where S e Q<<n, b>>). Corollary 3.25 The Hausdorff series is equal to ( d \ H = exp Hr — (h). \ dbJ
3.4 Derivation and exponentiation 79 In particular, 1 / а у Hn = Hi — (b). nl V dbj Observe that this result shows again that H is a Lie series. Indeed, we have Hn + 1 = l/(n + 1)(HX a/ah)(H„), so it is enough to verify that H1 d/db maps Lie series into Lie series. But 0 = 1^ d/db is a derivation of Q«a, b>>, so its restriction to 3\a, b) is a derivation of Lie algebra, i.e. D(IP, QI) = LDP, Q] + [P, DQ], and the generators a and b are mapped onto 0 and which is a Lie series by Corollary 3.24. Thus Lie series are mapped into Lie series. Proof Let D = H^d/db) and p = exp(D). Then D is a derivation and p a homomorphism. We have by Corollary 3.23 (with (p = id) and Corollary 3.24 D(eb) = ~ad(b) _ < \ / ad(b) _ -------)(Db)eb =-------------- ad(b) ) \ ad(b) (Hx)eb ead(b) _ j u</(b) ad(b) Qad(b) _ | (a) eb = a eb. Thus, by induction, because Da = 0, we obtain Dn(eb) = an eb. Hence, p(eb) = X 1 Dn(eb) = X an eb = ea eb. n>0 til n>Otll Since p is a continuous homomorphism, we have p(eh) = eM<b), which implies eM<b) = e" eb. Hence, p(b) = H, as required. All the previous results have a dual version. Indeed, we have with the notations of Corollary 3.23: /1 D(ex) = e4 _ Q~ad(<px)' ad(<px) , (Dx). Let us write h= £ h; л>0
80 3 Logarithms and exponentials where H'n is the part of H which is homogeneous of degree n in the letter b. Then = (—= ^ + Ka’ M + У ad(a)2n(b), 1 _ e-ad(a)J^ 2L „^o (2м)! 7 and ( d\ H = exp Hj — (a). \ da J Finally, H’ = 1 n! ( , d\n - (a). \ da J 3.5 APPENDIX 3.5.1 Friedrichs’ criterion generalized Theorem 3.1 implies that if P is a polynomial with constant term 0, then P is a Lie polynomial if and only if P is orthogonal to each proper shuffle и ш v (и, v e A +). The following more general result (essentially from Cohn (1954)) holds: let и be a positive integer and denote by the linear span of products PY ... Pq, where each Pi is a Lie polynomial and q < n. Then a polynomial P is in =5? if and only if P is orthogonal to each shuffle products ux ш - • ш мл+1, with u,e A + . This may be proved by using the Poincare- Birkhoff-Witt theorem (Theorem 0.2) and the generalized coproduct 3n + x of Section 1.4. It is also a consequence of Theorem 5.3(iii). 3.5.2 Chen series Let a = (an..., am) be a path in IR"1, as in the hypothesis of Corollary 3.5. We call the Chen series of a the formal series (fa dw)w; by Corollary 3.5, the logarithm of this series is a Lie series. If /?: [b, c] -> is another such path, whose initial point is equal to the endpoint of a, then the Chen series of the path obtained by concatenating a and f is equal to the concatenation product of the Chen series of a and f. If m = 2 and a is the horizontal path from (0,0) to (0, 1) and ft the vertical path from (0, 1) to (1, 1), then their Chen series are respectively eai and e°2; hence, the path obtained by concatenating them has Chen series eai e°2; hence the latter is the exponential of a Lie series by Corollary 3.5. This shows that this result extends the formula of Campbell-Baker-Hausdorff (for these results see Chen (1957)).
3.5 Appendix 81 3.5.3 Free group Let F(A) denote the free group over A. Then the mapping а не" (a e A) extends uniquely to an injective homomorphism from F(A) into the group of formal power series in Q<<4>> with constant term 1. Each element of the image is of the form es, for some Lie series S (Corollary 3.3); see Magnus et al. (1976, Section 5.10) for results on this image. Actually, each mapping of the form a 1 + a + a2S(a) extends to an injective homomorphism from F(A) into (see Bourbaki 1972, Theorem II.5.1); for example, the Magnus transformation (see Section 6.3). 3.5.4 Dynkin’s formula The following formula for the Hausdorff series H is obtained by an application of Theorem 1.4(v) (which states that for each homogeneous Lie polynomial P of degree n, r(P) = nP, where r is the right-to-left bracketing mapping), and an easy computation: H = \og(eaeb) = X (^-1)* = X —X —J fc>i к k>i к \P+q>o Р-q'/ = X v----------------------ap'bqi ... aPkb4k к pM- • • • pM- where the summation is over all к > 1, , pk, qr,... ,qk>0 with p,r + q,; > 0 for i = 1,..., k. Hence, one obtains H = ^^-_____________________1______________________1________ к Pi + • ' • + Pk + qY + • • • + qk Pi'- • pk'-qi'- - - - qk'- x r(ap'bq' ... aPkbqk) (Dynkin 1947, 1949, 1950). 3.5.5 Zassenhaus formula There exists a unique sequence (Рл)„>2 of homogeneous Lie polynomials in a, b such that Pn is of degree n and that e°+b = e° eb ePi ePi... Let us write S = P + 0(4") if S — P has no terms of degree < n. The main lemma to prove the previous formula is to show that if S = 1 + 0(4") is the exponential of a Lie series, than S = epT, where P is a homogeneous Lie polynomial of degree n and T = 1 + 0(4" + 1) is the exponential of a Lie series; use Corollaries 3.3 and 3.4, and the fact that log(eaeb) = a + b + 0(42).
82 3 Logarithms and exponentials A similar product formula may be proved by applying the homomorphism (p ® id: .e/ -» to the formula of Corollary 5.6 where (p is any shuffle homomorphism О<Л> -» Q, e.g. cp defined by ea+b = (p(w)w (cf. Theorem 3.2(iii)). See also Fer (1958). 3.6 NOTES Condition (iii) in Theorem 3.2 is due to Ree (1958), and condition (iv) to Cartier (1956), who also proved Corollary 3.3. Corollary 3.4 is the formula due to Campbell (1897, 1898), Baker (1905), and Hausdorff (1906). Corollary 3.5 is from Chen (1957). The formula in the proof of Proposition 3.6 is taken from Higman (1956). The second part of Theorem 3.7 is from Solomon (1968b) and Reutenauer (1986b), while the first part is an equivalent version of the Poincare-Birkhoff-Witt theorem; see Section 0.4.3 or Dixmier (1974, 2.4.6). A formula similar to eqn (3.2.3) may be found in Hain (1986). Lemma 3.8 is from Ree (1958). Joyal (1986) shows that the analytic functor ‘tensor algebra’ is the exponential of the analytic functor ‘free Lie algebra’. Lemma 3.10 is taken from Bourbaki (1972, II.6.4 Remarque 1). Theorem 3.11 is due to Goldberg (1956). For its proof, we have followed Helmstetter (1989), who proved Lemma 3.12. Lemma 3.13(i) is from Garsia and Remmel (1985), and Lemma 3.14 is from Garsia (1990). Corollaries 3.15, 3.17, and 3.19 are due to Goldberg (1956). Corollary 3.16 is from Solomon (1968b). The recursion formulas of Corollaries 3.20 and 3.21 are due to Helmstetter (1989). Another proof of Theorem 3.11 was given by von Waldenfels (1966a): he considers more general series of the form 0(/i(«i). • •/(«„) — 1), where the ft are one-variable power series with constant term 1, and g is any series in one variable. The fact that the computation of the Hausdorff series may be an application of a computation in the algebra of Sn was noted by Dynkin (1949). For more on eulerian polynomials, see Foata and Schiitzenberger (1970). In Section 3.4, we have followed more or less the original proofs of Baker (1905) and Hausdorff (1906). Theorem 3.22, in particular, is due to Baker (1905, p. 340), and Corollary 3.23 is essentially already in Campbell (1897); this result explains why the Bernoulli numbers step in. Corollary 3.24 is in all of the three papers (Campbell 1898; Baker 1905; Hausdorff 1906). Finally, the formula of Corollary 3.25 is from Baker and Hausdorff. For more on the Campbell Baker- Hausdorff formula, including a history of it, see Czyz (1992). For results on exponentials of derivations of the free associative algebra, see Lenormand (1969/70). We have not discussed convergence of the Hausdorff series. For this, the reader may see Dynkin (1950), Cartier (1956), Bourbaki (1972, Chapter II, Sections 7 and 8), Lazard (1963), Michel (1973/74, 1974, 1976), and Postnikov (1986, Lecture 6).
3.6 Notes 83 Several numerical computations have been made for the coefficients of the Hausdorff series: Poetsch and von Waldenfels (1964), Michel (1973/74, 1974, 1976), Maltey (1988), and Koseleff (1991). The Campbell Baker Hausdorff formula has been applied to the Burnside problem for groups: Magnus (1950) and Michel (1976). A continuous Campbell Baker Hausdorff formula may be found in Magnus (1954), Bialinicki-Birula et al. (1969), and Michel (1974).
4 Hall bases Hall sets of trees are introduced in Section 4.1. A rewriting system on standard sequences of Hall trees is defined and shown to be confluent, invertible, and always terminating. As a consequence, each word has a unique decreasing factorization into foliages of Hall trees. By interpreting the nodes of a Hall tree as the Lie bracket, one obtains a basis of the free Lie algebra, and the corresponding Poincare- Birkhoff-Witt basis, in Section 4.2. In the last section it is shown that this basis construction gives exactly the same bases as those obtained by elimination in Chapter 0. In the appendix it is shown amongst other things that Hall bases are in some sense optimal. 4.1 HALL TREES AND WORDS Recall that the free magma M(A) has been defined in Section 0.2. Each tree t of order at least 2 will be written t = (t', t"), where t' (respectively t") is its immediate left (respectively right) subtree. There is a canonical mapping from M(A) onto the free monoid A*, defined by f(a) = a if a is in A, and f(t) = f(t')f(t"), if t = (f, t") is of degree >2. Note that |t| = |/(t)|. We call f(t) the foliage of t. Let H be a subset of M(A). Then H is called a Hall set if the following conditions hold: • H has a total order <; • A is contained in H; • For any tree h = (h', h") in H\A, one has h" e H and h < h". (4.1.1) • For any tree h = (h', h") in M(A)\A, one has h e H if and only if h’,h"eH and h'ch”, (4.1.2) and either h'eA, or h' = (x, y) and у > h"; (4.1.3)
4.1 Hall trees and words 85 Observe that each subtree of a Hall tree is again a Hall tree. If a Hall set is fixed, we call Hall tree an element of this set. Proposition 4.1 Hall sets exist. Proof Let < be a total order of M(A) such that for any tree t of degree >2, t = (t', t"), one has t < t". Such a total order surely exists. Define recursively a set H by A £ H and, if h = (h',h”), then heH if and only conditions (4.1.2) and (4.1.3) are satisfied. Then it is clear that H is a Hall set. □ Actually, there are continuously many Hall sets, for each finite A with |Л| > 2; this is left as an exercise. The reader may also verify that each Hall set is obtained as in the proposition. Example 4.2 The alphabet is A = {a,b}. In Fig. 4.1 we give the trees of degree < 5 of a Hall set in M(A), ordered by < from left to right and line after line. Fix a Hall set H in M(A). We define a standard sequence of Hall trees to be a sequence s = (/ip ...,/i„), (4.1.4) with n > 1, hi e H and for any i, either hi is a letter, or hi = (/i-, h”) with (4.1.5) a b Fig. 4.1
86 4 Hall bases In other words, a sequence of Hall trees is standard if for each tree h in it, which is not a letter, its immediate right subtree h" is greater or equal to every tree located at the right of h in the sequence. Observe that a sequence of letters is always standard; moreover, each decreasing sequence of Hall trees (i.e. hr > • • • > hn) is standard; indeed, if /i, = (h'h h"\ then by (4.1.1) we have h" > h(, hence h” > hi + l,..., hn. We call rise of a sequence (4.1.4), an index i such that ht < hi + 1; by abuse, we also say that the rise is the couple (hhhi + l). We call inversion of (4.1.4) a couple (i,j) such that i < j and ht < hj. Note the opposite inequality, contrary to the usual inversions; this is because our ‘ideal’ Hall sequences are decreasing. Thus, a Hall sequence is decreasing if and only if it has no rise, or equivalently, no inversion. A legal rise of (4.1.4) is a rise i such that ^i+ 1 + > К- (4.1.6) We now define a rewriting system on the set of standard sequences. Let s in (4.1.4) be standard, and i a legal rise of s. Then we write s s', where s' = (/ip ..., hi_1, (hi, hi+1), hi + 2,..., hn). (4.1.7) In other words, s' is obtained by composing, in M(A), the two trees which form the legal rise in s. Observe that s' is standard: indeed, we have either ht;e A, or hi= (h'i, h'-) and then by (4.1.5), which shows that (hi,hi+l) is a Hall tree, by (4.1.2) and (4.1.3); moreover, (4.1.6) holds, and for j ~ 1,..., i — 1, either hj is a letter, or h'- > hi+l (by (4.1.5), because s is standard) > (/i,, hi+l), by (4.1.1). Hence s' is standard. We denote by the reflexive and transitive closure of the binary rela- tion Theorem 4.3 (i) For any standard sequences s, s15 s2 with s st and s1* s2, there exists a standard sequence t such that s{ t and s2 t. (ii) For each standard sequence s, there exists a sequence of letters t such that t s. (iii) For each standard sequence s, there exists a decreasing standard sequence t such that s t. Property (i) is expressed by saying that is confluent. Property (ii) means that is invertible. Proof (i) Write s t if t is obtained from s by applying the binary relation p times. Then We have s s{, s s2, and we may suppose p, q 0 and 4- s2.
4.1 Hall trees and words 87 Fig. 4.2 Suppose first that p = q = 1. Let s be as in (4.1.4), Sj as in (4.1.7) and s2 = (Лр ..., hj_ p (hj, hj+1),..., hn) where j is a legal rise of the sequence s. We may suppose i < j. We claim that i + 1 <j: suppose indeed that i + 1 = j; then hi+ j = hj, hence hi+l < hj+ p because j is a rise; on the other hand, i is a legal rise of 5, so that by (4.1.6), hi+i > hi+2 = hj+a contradiction, and the claim is proved. We show that (hj, hJ+1) is a legal rise of s, and (ht, hi+l) is a legal rise of s2. Indeed, the first assertion is clear, because what is at the right of hj+1 in Sj is the same as what is at the right of hj+ j in s, and because (hj, hj+ J is a legal rise of s. The second assertion follows from the fact that (hh hi+ J is a legal rise in s, and that hi+1 > hj+l (by (4.1.6), because j + 1 > i + 1) > (hj, hj+1), by (4.1.1) because (hj, hj+ J is a Hall tree. Define now t = (hp..., h,_ p (ht, hi+1), hi+2, ...,hj_!, (hj, hj+ J, hj+2,..., hn). Then we have st -> t and s2 -> t. Suppose now that p = 1 and q > 2. Then we have for some s2: s -> s'2, s'2——► s2. By the previous part of the proof, we have Sj -> t', s'2 -> t' for some t'. By induction on q, we find t such that t’ t, s2*+ t. Thus Sj t, which concludes this case (see Fig. 4.2). We now treat the general case s sx, s s2. We may find s', with s -> s\, s\ ——► Sp By the previous case, we find t' such that s\ t', s2 t'. By induction on p, we find t such that s^t, t' t. Hence, s2 t, which concludes the proof (see Fig. 4.3). (ii) Let s be as in (4.1.4). We argue by induction on £"= j |/i, | — n. If this number is 0, i.e. s is a sequence of letters, we are done. Otherwise take i such that hi is not a letter, and for any j = 1,..., i — 1, either hj is a letter, or
88 4 Hall bases h] > h". Such an i exists: take the left-most ht which is not a letter. Then let t' = (h15..., /i,_ 15 h'h h", hi+l,..., hn). The sequence t' is standard: indeed, either /1- is a letter, or (Л'-)" > h" (by (4.1.3), because (Л-, h") is a Hall tree) > hi + 1,... ,hn (by (4.1.5), because s is standard); moreover, either h" is a letter, or (h")" > h" (by (4.1.1)) > hi +15..., h„; finally, for j = I,... ,i — 1, either hj is a letter or h'j > h" (by the choice of i) > h\, by (4.1.2). This shows that t' is standard. Furthermore, (h'b hf) is a legal rise of t', because If < h" by (4.1.2) and h" > hi+1,... ,hn, by (4.1.5), s being standard. Hence, t’ -> s. By induction, we find a sequence of letters t such that t t'. Hence, t s. (ii i) If s is not decreasing, then s has at least one rise. Let i be the right-most rise. Then ht < hi+1 >hi+l>---> h„, with s as in (4.1.4). This shows that this rise is legal, hence we find s' with s -> s'. By induction on the length of the sequence, we find a decreasing sequence t such that s' t. Hence, s t. □ Corollary 4.4 Each word win A* has a unique factorization w = f(hf)... f(h„), H,hi > >hn. Before proving this, we call foliage of the sequence s in (4.1.4) the word f(s) = f(hf>... f(hn). Observe that by (4.1.7) s —» t implies f(s) = f(t). (4.1.8)
4.2 Hall and Poincare-Birkhoff-Witt bases 89 Proof Let w = ar ... ap, for some letters a,, and consider the standard sequence s = (<ц,..., ap). By Theorem 4.3(iii), there exists a decreasing sequence t = (7ц,..., hn) of Hall trees such that s t. By (4.1.8), we deduce that w admits the desired factorization. Suppose now that w has two factorizations as above: w = /(h1).../(h„) = /(Ц)... f(tp). Consider the standard sequences s = (7ц,..., hn), t — (Ц,..., tp). By Theorem 4.3(ii), we find sequences s', t' of letters such that s' s,t' t. By (4.1.8), we have f(s') = f(s) = w = f(t) = f (t'), hence s' = t'. By Theorem 4.3(i), we conclude that s u, t и for some sequence u. Since s, t have no rises, this is only possible if s = t. □ We call Hall word the foliage of a Hall tree. Corollary 4.5 Each Hall word is the foliage of a unique Hall tree. Thus, we may identify Hall trees and words. We call Hall set in A* the image under f of a Hall set in M(A), with the corresponding total order. Example 4.6 The Hall words corresponding to the trees of Example 4.2 are a2ba, a2ba2, a2bab, a2b, a, a2b2, a2b2, abab2, ab, ab2, a2b2, ab2, b. Corollary 4.7 Each word has a unique decreasing factorization into Hall words. If 71 is a Hall word, let t be the unique Hall tree such that h = f(t). If h is not a letter, then t — (t', t"\. let h' = f(t'), 7i" = f(t"\, then h = h'h”, and we call this factorization of h its standard factorization. For later reference, we note several inequalities on Hall words, which are immediate consequences of the definitions and of (4.1.1), (4.1.2), and (4.1.3): if h is a Hall word with standard factorization h = h'h", then h'<h”, (4.1.9) and h<h". (4.1.10) Now, let к be another Hall word with h < k. Then hk is a Hall word with standard factorization hk if and only if either h e A, or h" > k. (4.1.11) 4.2 HALL AND POINCARE BIRKHOFF WITT BASES Let К be a commutative ring with unit and consider a fixed Hall set. We define for each Hall word h a Lie polynomial Ph in &K(A): if a is a letter,
90 4 Hall bases then Pa = a; if h is a Hall word of length >2 with standard factorization h = h'h", define Pb = [P'b, P*]. It is clear by induction that each Ph is an homogeneous Lie polynomial of degree equal to the length of h; furthermore, Ph has same partial degree with respect to each letter as h. Example 4.8 The Lie polynomials of the Hall set of Examples 4.2 and 4.6 are Pfl2fcfl = [[a, [a, b]], a] = 3a2ba — 3aba2 + ba2 — a2b, Paiba2 = EEEfl, Ea, a], a] — 6a2ba2 — 4aba2 + ba4 — 4a2ba + a4b, Pa2bab = [[a, [a, 6]], [a, 6]] = a2bab — 3aba2b + 2ba2b — a2b2a + 4ababa — 3ba2ba — ab2a2 + baba2, ?а2ъ = Ea, b]] = fl2b ” 2aba + ba2, Pa = a, Ра3ь2 = Ea- Efl, EEa, b], b]]] = a2b2 — 2a2bab + Aababa — ab2a2 — a2b2a — 2baba2 + b2a2, Pa2b* = Efl, EEEfl, ^3, b], b]] = a2b2 — 3abab2 + 3ab2ab — 2ab3a + 3bab2a — 3b2aba + b2a2, Pabab2 = EEfl, b], EEfl, b], 6]] = abab2 — 3ab2ab + 2ab2a — ba2b2 + Ababab — 3bab2a — b2a2b + b2abab, Pab = Efl, b] = ab - ba, Pab3 = EEEfl, b], b], b] = ab2 — 3bab2 + 3b2ab — b2a, Pab. = EEEEfl, b], b], b], b] = ab4 - 4bab3 + 6b2ab2 - 4b3«b + b4a, Pa2b2 = Efl, EEfl, b], b]J — a2b2 — 2abab + 2baba — b2a2, Pab2 = EEfl, b], b] = ab2 — 2bab + b2a, Pb = b. The previous example shows a fact which is clear from the definition: in order to compute Ph, one has simply to interpret in the tree t corresponding to h each node as a Lie bracketing. We call the polynomials Pb the Hall polynomials. These polynomials form a basis of the free Lie algebra, as the next result indicates. Recall that the free associative algebra K.(A') is a free К-module having the canonical basis A*. Theorem 4.9 (i) The Hall polynomials form a basis of the free Lie algebra (viewed as a K-module).
4.2 Hall and Poincare-Birkhoff-Witt bases 91 (ii) The decreasing products of Hall polynomials Phi...Phn, l^e H, hx> >hn (4.2.1) form a basis of the free associative algebra (viewed as a K-module). The second part of the theorem is the Poincare-Birkhoff-Witt theorem applied to the basis (Ph)heH of 2T(A). We shall not use the latter theorem here and actually prove (ii) first and then deduce (i). The proof is constructive and allows us (i) to express each Lie polynomial in the basis of Hall polynomials, without computing these, and (ii) to express each polynomial in the basis of decreasing product of Hall polynomials, again without computing these products. Consider again a standard sequence of Hall trees (or words) s = (h1,...,hn), (4.2.2) with a legal rise i. We define 2,(s) = (Zip • • •, h^ 15 /i./il + 1, /il + 2, • • •, hn), (4.2.3) which is a standard sequence (see (4.1.7)); we also define Pi(s) = (/i15..., hi_ 15 /il + 1, hi, hi+2,h„), (4.2.4) obtained by interchanging /j, and /il + 1 in s. Observe that p,(s) is a standard sequence: indeed, either hi+1 is a letter, or /i"+1 > hi+1 (by (4.1.1)) > h;, because i is a rise. Observe also that p,(s) has one inversion less than s. Let s be a standard sequence. Define a derivation tree T(s) of s to be a labelled rooted tree with the following properties: if s is decreasing, then T(s) is reduced to its root, labelled s; if not, T(s) is the tree with root labelled s, with left and right immediate subtrees T(s') and T(s"), where s' = ^(s), s" = p;(s) for some legal rise i of s (e.g. the right-most rise, cf. proof of Theorem 4.3(iii)) and where T(s'), T(s") are derivation trees of s', s" respectively. Observe that T(s) always exists, and is finite. Example 4.10 With the Hall set of Example 4.2, a derivation tree associated with s = (a, b, b, b) is shown in Fig. 4.4. Observe that in a derivation tree, the leaves are labelled by decreasing sequences. For s as in (4.2.2), define P(s) = Phl ... Phn. Lemma 4.11 For each standard sequence s, P(s) is the sum of all P(t), for t a leaf in a fixed derivation tree of s. Example 4.12 From Example 4.10 and Lemma 4.11 we deduce abbb = Pabbb + 3PbPabb + 3P2bPab + P2bPa.
92 4 Hall bases Fig. 4.4 Proof The lemma is a consequence of the definitions (4.2.3) and (4.2.4) of 2i(s) and pt(s), of that of T(s) and P(s), and of the identity in K<A>: PhPhl+t = LP^Phl+3 + Phl+iPhi = Phih. + 1 + Pht+1Ph.- □ Proof of Theorem 4.9 If w = аг ... a„ (a, e A), then s = (at,..., an) is a standard sequence, hence w = P(s) is, by Lemma 4.11, a sum of polynomials of the form (4.2.1). We have now to show that the polynomials (4.2.1) are linearly indepen- dent. For this, we may assume that the alphabet is finite. Note that, in this case, the К-module of homogeneous polynomials of degree d admits the basis Ad. Since Ph is a homogeneous polynomial of degree |Л|, Corollary 4.7 gives a canonical bijection between the polynomials (4.2.1) which are of degree d and the words of length d. Hence, these polynomials form a basis: indeed, if M is a free К-module of rank n, then each family of n elements which generates M is a basis of M. A particular case of (ii) is that the Hall polynomials are linearly indepen- dent. Hence, it remains to show that £P(A) is linearly generated by the Hall polynomials. We may suppose that the alphabet A is finite. Since each letter is a Hall polynomial, it suffices to prove the following claim (here and in the sequel, h = h'h" denotes the standard factorization of a Hall word /1). For any two Hall polynomials Ph, Pk, their Lie bracket [Ph, Pfc] is a linear combination over Z of Hall polynomials P( with |/| = \h\ + \k\ and /" < sup(/i, k). This will be shown by induction on the couple (|h| + |k|, sup(/i, k)), where these couples are lexicographically ordered: (d, k) < (d15 kJ if either d < dr or d = di and к < kP As there are only a finite number of Hall words of bounded length, the induction is correct. We may suppose that h < k, because [Ph, Pk] = — [Pk, Ph], and [Ph, Ph] = 0. If h is a letter, or if h = h'h" with h" > k, then by (4.1.11), hk is a Hall word and hk is its standard factorization. Thus [Ph, Pk] = Phk and we have (hk)" = к < sup(/i, k), because h < k. So we
4.2 Hall and Poincare-Birkhoff-Witt bases 93 may assume that h = h'h" and h" < k. By (4.1.10), we have also h < h”, hence h<h" <k. (4.2.5) Using the Jacobi identity [[Л Q], К] = [[P, /?], Q] + [P, [Q, /?]], we obtain ЕЛ, Pkl = [IA, P„J, Pkl = [ЕЛ-, Pkl, Ph I + LPh , [Рл , PJ]• Since |h'| + |fc| and \h" | + |fc| are both strictly less than |/i| + |fc|, the induction hypothesis implies that [ft. ft] = Z «,Pu„ [ft--. ft] = Z № ‘ j for some integers a,-, and Hall words ut, Vj such that: = l^'l + I&I, lul = \h"I + I&I, u" sup(h', k) and v'- < sup(/i", k). Note that by (4.1.9) h' < h", hence by (4.2.5), the two previous inequalities imply: tff<k, v';<k. (4.2.6) Thus we obtain [ft, ft] = Z «<[ft,, ft ] + Z Mft-, ft,]- (4-2.7) ‘ j We have |u,| + |h"| = lh'| + |fc| + |/f'| = |h| + |/c|, and sup(u,, h") < к = sup(/i, k) by (4.1.10), (4.2.6) and (4.2.5). Hence [PUi, Ph.J is by induction a Z-linear combination of Hall polynomials P( with /" < sup(u,, h") < sup(/i, k), and |/| = |uj + |/i"| = |h| + \k\. Similarly, we have |h'| + |i?;| = |/f| + |h"| + |fc| = |/i| + |fc| and sup(/i', Vj) < к = sup(/j, k) because of (4.1.9), (4.2.5), (4.1.10) and (4.2.6). By induction we deduce that [Ph,, PVj] is a Z-linear combination of Hall polynomials Pt with |/| = |h'| + |гу| = |h| + |fc| and I" < sup(/i', Vj) < sup(/i, k). With (4.2.7), the previous discussion proves the claim. □ It is worthwhile to write down explicitly the algorithm underlying the proof of Theorem 4.9(i). For this, let us denote by Pt the Lie polynomial obtained by interpreting in the tree t each node as a Lie bracketing; formally, Pa = a if a e A, and Pt = [Pt, Pr] if t = (tr, t"). The algorithm takes as input a linear combination of trees J att and gives as output a linear combination of Hall trees £ [ihh such that £ atPt = £ PhPh. Consider the linear combination J att. If all the trees involved are Hall
94 4 Hall bases Fig. 4.5 (4.2.10) trees, then we are done. Hence, take some tree t which is not a Hall tree, and consider a subtree s = (s', s") of t which is not a Hall tree and such that s’, s" are Hall trees (each leaf is in A, so is a Hall tree, hence s exists). If s' > s", then replace s by (s", s') in t and at by — at. (4.2.8) If s' = s" then remove t from the linear combination. (4.2.9) So, we may assume that s' < s". If s' is in A, or s' = (x, y) with у > s", then s is a Hall tree, which was excluded. Thus у < s", and replace t by the sum of the two trees and t2 obtained by replacing s = ((x, y), s") respectively by ((x, s"), y) and (x, (y, s")). Then go back to the beginning. The fact that this algorithm stops and gives the desired output is implicit in the proof of Theorem 4.9(i). We illustrate the algorithm by an example. Example 4.13 We take the Hall set described in Example 4.2. The input linear combination is the tree of Fig. 4.5. After step (4.2.8) applied to the subtree whose root is circled, we obtain the output given in Fig. 4.6 (the coefficient is indicated at the root). After step (4.2.8) again, we obtain the output given in Fig. 4.7. Now, step (4.2.10) gives the linear combination of Fig. 4.8. Finally, step (4.2.8) again gives the final linear combination of Hall trees (these are not indicated in Example 4.2, but are Hall trees by definition of the latter); see Fig. 4.9. As another example, take as input the tree in Fig. 4.10. After step (4.2.10), we obtain the linear combination in Fig. 4.11. Step (4.2.9) removes the first tree, and the output is the remaining one, which is a Hall tree.
4.2 Hall and Poincare-Birkhoff Witt bases 95 Fig. 4.6 Fig. 4.8 A closer look at the proof of Theorem 4.9 shows that the Lie algebra of Lie polynomials is the free Lie algebra, independently of the Poincare- Birkhoff-Witt theorem and of the proofs in Chapter 0. Indeed, it shows that each relation between the elements Pt (t in the free magma on A) may be deduced from the defining relations of Lie algebras: distributivity of the bracket, antisymmetry, and Jacobi identity.
96 4 Hall bases Fig. 4.9 Corollary 4.14 Let A be finite with q elements. The number of Hall words of length n, and the dimension of the space of homogeneous Lie polynomials of degree n are equal to 1E MQn'd U d\n where p is the Mobius function.
4.2 Hall and Poincare-Birkhoff-Witt bases 97 Proof Call a„ the number of Hall words of length n. By Corollary 4.7, we have the following identity of generating series Indeed, Corollary 4.7 implies that one has in 1 X«<= A a weA* h 1 h where the product is taken over all Hall words in decreasing order; eqn (4.2.11) follows by applying the homomorphism 2<<Л>> -> Z[[t]J which sends each letter onto t. Take the logarithmic derivative of (4.2.11) and multiply by t: qt „ kak tk C^qt ~ l~t*' Hence, X ^" = X X kaktki= X f"(x^aA n> 1 k> 1 i> 1 n> 1 \d|n / Thus q” = Xd|n dad, which implies by Mobius inversion nan = X p(d)qnld. d | n The fact that an is equal to the dimension of the space of homogeneous Lie polynomials of degree и is a consequence of Theorem 4.9(i). □ In the course of the proof of Theorem 4.9, we have also obtained the following result. Corollary 4.15 Each word, when written in the basis (4.2.1), has coefficients in ГЧ1. Corollary 4.16 If L is a subring of K, then ^K(A) n L(A) = J?L(A). Proof Let Ре^(Л)п£<Л>. Then by Theorem 4.9(ii) P is a linear combination over L of polynomials of the form (4.2.1). Since P is a Lie polynomial over K, Theorem 4.9(i) implies that only those polynomials (4.2.1) with n = 1 may appear in this linear combination. Hence, P is in &l(A). The reverse inclusion is clear. □
98 4 Hall bases Corollary 4.17 If К is a ring without torsion over %., then Theorem 1.4 is valid for K. Proof Indeed, К is a subring of the Q-algebra К Q and we apply Theorem 1.4 and Corollary 4.16. □ 4.3 HALL SETS AND LAZARD SETS The following result shows that the bases of £f(A) constructed in Sections 0.3 and 4.2 are the same. The proofs are very technical. Theorem 4.18 (i) Each Hall set is a Lazard set. (ii) Each Lazard set is a Hall set. For trees s, t in M(A) and n > 0, recall the notation (st") for the tree (.. .(s, t),..., t), with n ts. We begin by a lemma. Lemma 4.19 Let H be a Hall set in М(Л) and suppose that A has a greatest element c. Let X = {(acn) | a e Л\с, n > 0}. Then H' — H n M(X) is a Hall set in M(X), where M(X) is identified with a submagma of М(Л). Moreover, H = {c} u H', and c is the greatest element of H. Proof Observe that by (4.1.1) each tree in H is smaller than or equal to its right-most leaf. Since с = тах(Л), we deduce that c is the greatest element of H. Observe also that the submagma of Af(A) generated by X is a free magma, isomorphic with M(X). Let heH \X, with h = (hf, h"), h',h"eM(X). Then by (4.1.2), h', h" e H n M(X) = H' and h' < h"; moreover, if h' is not in X, then ti = (x, y) with у > h” by (4.1.3). Conversely, let h', h" e H' with h' < h" and either /f e X or h' = (x, y) with x, yeM(X) and у > h". Then by (4.1.2) and (4.1.3), h = (Л', h") is in H n M(X) = H', except perhaps in the case where h' eX\A. But in this case, we have h' = (x, y) with у = c, hence certainly у > h", because c is the greatest element of H. This shows that H' is a Hall set in M(X), because (4.1.1) is evidently satisfied. Let t eH\c. We show that t e M(X). If |t| = 1, then t e Л\с x. If |t| > 2, t = (t', t") with t', t" e H and t' < t" by (4.1.2); hence t' / c. Suppose t" Ф c. then t', t" are in H\c, hence in M(X) by induction, hence t e M(X). Suppose t" = c; we show by induction on |t'| that teX: if [f| = 1, then t' e A\c, hence (t', t") = (f, с) e X; if |t'| > 2, then (r')" > t" = c by (4.1.3), which implies (t')" = c and by induction t'eX; this implies t = (f, с) e X, by
(4.3.3) 4.3 Hall sets and Lazard sets 99 definition of X. Thus, finally, H\c £ M(X) n H = H'. The reverse inclusion is clear. □ Let t0,..., tn be elements of M(A). Define subsets To,..., Tn+1 of M(A) recursively by To = A. and Tk+ i = {ttD \p>0,te Tk\tk} for к = 0,..., n. (4.3.1) Observe that Tk\tk £ Tk+l. It is immediate to deduce that for i > 1, Tk\{tk,...,tk + i^}^Tk + i. (4.3.2) Observe also that if x e Tk+1, then either x = (rtf) as in (4.3.1), with p > 1, or x e Tk. Hence x e Tk+1\T0 => xeTi+1 for some i < k, x = (ttf) for some p > 1 and t e TfL- Suppose now, that for к = 0,..., n, one has tk e Tk. Then he Tk+1,..., Tn+ j. (4.3.4) This is actually a consequence of Corollary 0.8. A different proof is the following: observe that 7^ + 1 £ M(Tk), the submagma generated by Tk. Hence, Tk+1 £ for к > 0. Now, each tree in T, either has t0 as a proper subtree, or is in Л\Г0, so a fortiori in M(A\t0); thus, the same holds for each tree in M(Ti), which implies t0 ф hence t0 ф Tk+1, for any к > 0. This proves (4.3.4) for к = 0. For к > 1, one proceeds inductively, by noticing first that M(7]) is canonically isomorphic to the free magma generated by Tr, and similarly for each M(Tk). Suppose now that £ is a Lazard set and E a closed subset of M(A) such that eqns (0.3.3), (0.3.4), and (0.3.5) hold. Then we have Tk n E £ L for к = 0,..., n. (4.3.5) Indeed, if t e Tk n E and if t ф L, then t Ф t0,..., tn by (0.3.4), so that teTn+1 by (4.3.2). But this contradicts (0.3.5). Proof of Theorem 4.18 (i) Let E be a finite, nonempty and closed subset of M(A). We show by induction on |E| that for each Hall set H, H satisfies conditions (0.3.3}-(0.3.5) defining a Lazard set (with H in place of L). Denote by A' the finite alphabet of letters which actually appear in E. Let с = шах(Л') and X = {(acn) | a e A'\c, n > 0}. Let H be a Hall set in M(A). Observe that H n M(A') is a Hall set in M(A'). Then by Lemma 4.19 applied to M(A') and its Hall set H n M(A'), H' = H n M(X) is a Hall set in M(X), where M(X) is identified to a submagma of M(A).
100 4 Hall bases The set £' = E n M(X) is a finite closed subset of M(X), of smaller cardinality than E, because ceE\M(X). If E' = 0, then each letter b appearing in E is not in M(X), hence equal to c: this shows that E involves only the letter c, and E n H = {c}; since c e A and X n E = 0, conditions (O.3.3)-(O.3.5) defining a Lazard set are satisfied in this case. If E' / 0, then by induction on |£|, we conclude that for some n > 1, H' n £' = {tj > • • • > t„}, with tj e X = T\, tt e £• = J | p > 0, te \t, _ j} for i = 2,..., n, and Г'+ j n £' = 0. Let t0 = c e A, To = A and for i = 1, . . . , и 4- 1,7] = {tt/L J | p > 0, t e 7]_ Дг,- Д. Then Т\ £ 7] for i = 1,..., n, and an easy induction on i shows that if t e 7Д 7Д then t involves some letter in A\A'. Hence, we have Tn+1 n E = T'n+1 n £ (because £ £ M(A')) = 0. Since c is the greatest element in A', it is also the greatest element in £ n 77, by (4.1.1). If t e £ n H and t / c, then t e M(A'), and by Lemma 4.19 applied to M(A'), we deduce t e H'; in particular, t e M(X), hence t e H' n £' and t = t, for some i = 1,..., n. Thus EnH — {t0 > > tn}, which completes the proof of (i). (ii) Let £ be a Lazard set in M(A). We show that it is a Hall set. Suppose that h e L\A. Then h = tk +15 for some closed subset £ of M(A) and with the notations of (0.3.3), (0.3.4), and (0.3.5). Since he Tk+1by (0.3.3) we deduce by (4.3.3) that for some i < k, h = (ttP) with p > 1. Hence by (0.3.4) h" = ti > tk + 1 = h and (4.1.1) holds. Clearly L contains A, because for any letter a, L n {a} is nonempty by (0.3.4). Now let h = (h', h") be in L. We show that (4.1.2) and (4.1.3) hold. Choose a closed finite subset £ of M(A) containing h and take the notations of (0.3.3), (0.3.4), and (0.3.5). By hypothesis, we have h = tk+ x e Tk+ x for some k, and by (4.3.3), we have h = (rtf), p > 1, t e T^. Thus h' = (ttP~J), h" = t,. Since h’ e Ti+l n £, we deduce from (4.3.5) that h' e L, hence that h’ = tj for some j. By (4.3.4), we must have i 4- 1 < j 4- 1, hence by (0.3.4) h" = t; > tj = h'. Suppose that h' ф A. Then either p > 2, and (h')" = tt > t, = h", or p = 1, h' = t e Th hence by (4.3.3), h' e Th for some / < i, with h' = (st?_ Д q > 1. Hence, (Л')" = t(_ x > t, = h" by (0.3.4). Hence, (4.1.2) and (4.1.3) hold (with L in place of H). Conversely, suppose that (4.1.2) and (4.1.3) hold. We may find a finite closed subset £ of М(Л) containing h = (h', h"). Taking the notation of (0.3.3) and (0.3.4), we have to show that he L. We have h', h" e L with h' < h". Hence, h' = t,, h" = tj with n > i > j. Suppose that h' e A. Then h' e To, and t0 > • • • > tj > tt = h', hence, h' e T0\{t0,..., tj}. By (4.3.2), we deduce h’eTj+l. This implies h = (h', tj) e Tj+ p by definition of this set. Thus h e L by (4.3.5). Suppose now that h’ ф A. Since h' = t{eTi by (0.3.3), we have by (4.3.3), h' = (ttg-^e Tk for some к < i, p > 1, t e Tk_ i\tk_ P By assumption, we have tk_! = (h')" > h" = tj, hence, к — 1 < j. Thus h' e Tk with к < j 4- 1. If к = j 4- 1, then h' e Tj+1. Otherwise, к < j, hence h' = t,- is not in
4.4 Appendix 101 {tk, tk+1,..., tf, hence, by (4.3.2), h' e Tj+ j also. Thus h = (h', tj) e Tj+l, by definition of this set, and we conclude by (4.3.5) that he L. □ 4.4 APPENDIX 4.4.1 Optimality of the Hall bases The construction of the bases of the free Lie algebra presented in this chapter is optimal, in some sense. Indeed, the following result, due to Viennot (1978, Theorem 1.1.2), holds: let H be a totally ordered subset of M(A), containing A, and such that (4.1.2) and (4.1.3) hold; define the Lie polynomials Ph (he H) by: Pa = a if aeA and Ph = [Ph., Ph..J if h = (h’,h”). Suppose that these polynomials form a basis of the free Lie algebra. Then H satisfies (4.1.1). We sketch the proof, which is divided into three parts. (i) Suppose that w in A* has a factorization w = /(/ij ... f(hn), ht e H, hY > • • • > hn. Then for each word u, uw has a factorization uw = /(tj... f(tp) with |tp| > |/i„| and tr > • • • > tp, ц e H. Indeed, it is enough to show this when и = a e A. Then, either a > /i15 and f(a)f(hi)... f(hn) is the desired factorization, or a < hi which implies by (4.1.2) and (4.1.3) that ki = (a, hj e H. Then either kr > k2, and the desired factorization is f(ki)f(h2) • • • f(h„), or ki < h2, and since ht > h2, (4.1.2) and (4.1.3) imply that k2 = (k^ h2) e H. Continuing in this way gives the result. (ii) Each word in A* has a unique factorization as in (i). Indeed, (i) shows that each word has the desired factorization (take w = 1). To prove uniqueness, we may assume that A is finite, |Л| = q. Then the numbers of trees of degree n in H is, by assumption, equal to the number in Corollary 4.14. The proof of this result then shows that the number of tuples (hv ..., hp), h( e H, hi >•••> hp, of total degree n, is equal to qn. As this is the cardinality of An, the result follows. (iii) Suppose, by contradiction, that for some tree h in H, one has h > h". Denote by r(t) the immediate right subtree of t. Then, for some p > 1, we have rp(h) = a e A. If a < h, then by (4.1.2) and (4.1.3), (a, h) e H. If on the contrary a > h, then for some m in {l,...,p}, we have rm(h) > h and rm~1(h) < h (because rp(h) > h and h” = r(h) < h); then, by (4.1.2) and (4.1.3), we have (rml(/i). h)e H (because rm~1(h)" = rm(h) > h). In both cases, we find a nontrivial right factor v of f(h) such that vf(h) e f(H). Hence, by (i), the word f(h)f(h) has a factorization /(/ij... f(hn), hteH, hx> >hn, with \h„\ > |p| + \h\ > |h|. This contradicts the uniqueness in (ii). 4.4.2 Elimination in free partially commutative Lie algebras Let в be a subset of A x A, not intersecting the diagonal. The free partially commutative monoid M(A, в) is defined as the quotient of A* by the
102 4 Hall bases congruence = generated by the relations ab = ba for (a, b) e 0. Similarly, the free partially commutative Lie algebra L£(A, 0) is the quotient of &(A) by the Lie ideal 1 generated by the Lie polynomials [a, b], for (a, b) e 0. The monoid M(A, 0) and the Lie algebra ^(A, 0) are clearly free, each in the appropriate category. The following result, due to Duchamp and Krob (1992a), extends the Lazard elimination method (Theorem 0.6). For m in M(A, 0), denote by TA(m) the set of letters b in A such that me M(A, 0)b. Note that if (a, b) e 0, then we have for any Lie polynomial: ad(a) о ad(b)(P) = ad(b) > ad(a)(P) mod I. Indeed, by Jacobi’s identity, ad is a Lie homomorphism, so that ad(a) ° ad(b) — ad(b) о ad(a) = ad([a, b]). From this we deduce that и = v implies ad(u)(P) = ad(v)(P) mod 1. Thus ad(x) is a well-defined endomorphism of £L(A. 0), for any x in M(A, 0). With these notations, the elimination result is the following. Let C £ A such that 0 n В x В = 0, where В = 4\C. Then the К-module £f\A, 0) is the direct sum of 0 n C x C) and of the Lie ideal J generated by B. Moreover, J is freely generated, as a Lie algebra, by the elements ad(m)(b), where m is any element of the submonoid of M(A, 0) generated by C, and b in В is such that {b} = TA(mb). For this and related results, see Schmidt (1990), Duchamp and Krob (1992b, c, 1991a,b, 1992J), and Duchamp (1989). 4.4.3 Another rewriting rule There is another rewriting rule for sequence of Hall words, which plays the same role as the relation -> of Section 4.1. It is due to Schiitzenberger (1958, 1986), works only for Hall sets H with the further property h < h', (4.4.1) for any h = (h’, h") in H, but has the advantage of being local. Instead of standard sequences, one considers sequences (4.1.4) with the property that for i = 1,..., n — 1, b, < bi+1 implies (h,, bi+1) e H. Instead of legal rises, one considers rises (b,, hi+1) such that either i + 1 = n, or hi+ j > hi + 2. Then Theorem 4.3 holds for this new rewriting rule. Note that property (4.4.1) also appears in Theorem 5.16(vi). 4.4.4 Free Lie superalgebra Let A be an alphabet with a mapping у: A x A K, where К is a field of characteristic 0 such that /(a, b)/(b, a) = 1 for a, b in A. For finely homo- geneous polynomials P, Q, one defines X(P,Q) = П x(a,b)d^Pde^Q. a. be A
4.5 Notes 103 Then one defines a bilinear operation on K<A> by IP,QJx = pq-AP,<2)QP. Denote by ^Z(A) the smallest subspace of containing A and which is closed under this operation. The previous construction is due to Ree (1960), who gives a generalization of the Friedrichs, Dynkin-Specht-Wever and Poincare BirkhofT Witt theorems and the Witt formula. The construction of Hall bases extends to УХ(А) as follows (Melancon 1991). Let H be a Hall set in M(A). Let x(t15t2) = /(Ж),/(t2)X for any tree t15 t2 in M(A). Let H_ = {he H | h) = — 1}, and define Hx = H<j {(h, h) | heH_}. Extend the order of H to Hx by h > (h, h) > к if h e H. к e H and h> k. Then each word has a unique factorization /(^.../(hj, hieHx,h1>-->hn, where each h in H_ appears at most once. Defining Ph, for h in Hx, as in Section 4.2, the set of polynomials Phl... Phn, with the same condition as above, forms a basis of К<Л>. Moreover, the Ph, heHx, form a basis of £fx(A). See also Mikkalev (1986). 4.5 NOTES The story of Hall bases of the free Lie algebra is not a simple one. They appear in a paper of M. Hall (1950), with condition (4.1.1) replaced by the stronger condition that the order be compatible with the degree. However, similar constructions of ‘basic commutators’ in the free group had already been done by P. Hall (1933) and Magnus (1937). M. Hall’s construction was generalized by Meier-Wunderli (1951), Witt (1956), and Schiitzenberger (1958), by weakening his degree condition. The present condition (4.1.1) was shown to be sufficient to give bases of the free Lie algebra by Viennot (1978); this condition was also known to Shirshov (1962) and Michel (1974). Actually, Viennot shows that this condition is in some sense optimal (see Section 4.4.1). Condition (4.1.1) is so general that it includes the Lyndon basis (which is a basis of the free Lie algebra constructed by Viennot (1978) and Lothaire (1983) by following the lines of the commutator calculus of Chen et al. (1958)) and the Shirshov basis (1958); this was not the case with the original bases of M. Hall. We warn the reader that there are four symmetries which may change the presentation of Hall bases: one can reverse the words and also reverse the order. For instance, our presentation is compatible with that of Viennot
104 4 Hall bases (1978) after these two reversals. We have chosen this presentation to make it compatible with the Lyndon basis of Lothaire (1983). Theorem 4.3 is due to Melancon (1992), who extended a method of Schiitzenberger (1958), itself related to the collecting process of P. Hall (1933); see also M. Hall (1959). It allows one to quickly prove Corollaries 4.4, 4.5, and 4.7, which were known to all the previous authors. The proof of Theorem 4.9(ii) follows Melancon and Reutenauer (1989), and that of part (i) of this theorem, together with the underlying algorithm, follows Schiitzenberger (1986). The assertion on the Lie polynomials in Corollary 4.14 is due to Witt (1937) and Corollary 4.15 is due to Schiitzenberger (1958). The fact that the bases obtained by Lazard are the same as the generalized Hall bases (Theorem 4.18) is due to Viennot (1978). Other bases of the free Lie algebra were constructed by Kukin (1978) and Blessenohl and Laue (1990b), the latter by purely group theoretic methods. See also Corollary 8.20. A formula similar to that of Corollary 4.14 appears in Witt (1956) in the case of the free Lie p-algebra.
5 Applications of Hall sets We begin by introducing Lyndon words and the Lyndon basis of the free Lie algebra; this is a particular Hall set, with special properties, such as triangularity of the change of basis. The dual basis of the Poincare-Birkhoff- Witt basis associated to a Hall basis may be computed recursively, by using the shuffle product; this is done in Section 5.2. In the next section a special Hall basis is constructed, which is compatible with the derived series of the free Lie algebra. The special properties of Lyndon words, with respect to the alphabetical order, are also true for Hall words once the appropriate order on words is defined: to obtain it, one factorizes each word, and then orders sequences of Hall words alphabetically (Section 5.4). In the final section, we show how Hall sets lead to the construction of variable-length codes, with various synchronization properties. 5.1 LYNDON WORDS AND BASIS Let A be a totally ordered alphabet. We order the free monoid A* with alphabetical order, that is, и < v if and only if either v = их for some nonempty word x, or и = xau', v = xbv for some words x, u', v' and some letters a, b with a < b. Observe that this order on words is simply the order in which they would appear in some dictionary. A Lyndon word on A* is a nonempty word which is smaller than all its nontrivial proper right factors; in other words, w is a Lyndon word if vv / 1 and if for each factorization w = uv with u, v Ф 1, one has w <v. Theorem 5.1 The set of Lyndon words, ordered alphabetically, is a Hall set. The corresponding Hall basis has the following triangularity property: for each word w = f ... ln written as a decreasing product of Lyndon words, the polynomial Pw = Ph ... Pln is equal to w plus a Z-linear combination of greater words. We need some properties of the alphabetical order, gathered in the next lemma.
106 5 Applications of Hall sets Lemma 5.2 (i) If и < v and и is not a prefix of v, then их < vy for all words x, y. (ii) If и < v < uw, then v = uv' for some word v' such that v' < w. Proof (i) This is an immediate consequence of the definition of the order. For (ii), imagine the words u, v, uw written in a dictionary, and the assertion immediately follows. □ Proof of Theorem 5.1 (a) Define the standard factorization of each word w of length >2 to be the factorization w = uv, where v is the smallest nontrivial proper right factor of w for the alphabetical order. Then define recursively for each nonempty word w a tree t(w) in M(A) by t(d) = a if a is a letter, and t(w) = (t(u), t(v)) if w = uv is its standard factorization. Evidently, the foliage of r(w) is w, so w i—> t(w) is injective and the set {t(w)|we4*} inherits the alphabetical order. We show that the set {t(w) | w Lyndon} is a Hall set. In view of eqns (4.1.9)-(4.1.11), it is enough to prove the following two assertions. If w is a Lyndon word with standard factorization uv, then u, v are Lyndon words with и < v, w < v, and either и is a letter, (5.1.1) or the standard factorization of и is xy with у > v. If u, v are Lyndon words with и < v, then uv is a Lyndon word. (5.1.2) Let us prove (5.1.1): we have и < uv = w < v, the first inequality by definition of the order, and the second because w is Lyndon. So и < v, w < v. Moreover, v is smaller than all its nontrivial proper right factors, because these are nontrivial proper right factors of w and v is by definition the smallest among them. Hence г is a Lyndon word. Let у be any nontrivial proper right factor of u. Then yv > v. because v is the smallest nontrivial proper right factor of w. If we had у < v, then Lemma 5.2(ii) and the inequalities у < v < yv would imply v = yv' for some word v' with v' < v, a contradiction (because v' would be a proper nontrivial right factor of w = uv = uyv'). Hence, у > v. This shows at the same time that и is a Lyndon word (because и < v < y), and that if и = xy is its standard factorization, then у > v. Hence, (5.1.1) is proved. In order to prove (5.1.2), let s be a proper nontrivial right factor of uv. Then we have three cases. (i) 5 is longer than v, hence 5 = u'v for some nontrivial proper right factor u' of u. Since и is Lyndon, we have и < и', and since и is not a prefix of u', we have by Lemma 5.2(i), uv < u'v = 5. (ii) s = v: if и is a prefix of v, then v = uv'; since v is Lyndon, we have
5.1 Lyndon words and basis 107 v < v', hence uv < uv' = v; on the other hand, if и is not a prefix of v, then from и < v and Lemma 5.2(i), we deduce uv < v. (iii) 5 is shorter than v: then 5 is a nontrivial proper right factor of v, hence v < s (because v is Lyndon); since by (ii) uv < v, we deduce uv < s. This proves that uv < s, hence also (5.1.2), and we conclude that the set of Lyndon words is a Hall set. (b) Note that if / is a Lyndon word written as a nontrivial product / = uv, then by definition / < v; since v < vu, we have / < vu. We show first that for each Lyndon word /, the corresponding Lie polynomial Pt is equal to / plus a Z-linear combination of greater words of the same length as /. This is clear for I = a e A, because Pa = a. If |/| > 2, let I = uv be its standard factorization; then u, v are Lyndon words and by induction Pu = w + E ax^ pv = v + E РУу- x>u у > V Then, by definition of P(, we have P^lPu^l = PUPV - PVPu = uv + e РуиУ + E axxy + E ахРухУ -vu - E a*vx y> V x>u x>u x>u y>v - E РуУи - E ахРУух. y> V x> и У> V Then, the assertion follows from the inequalities I = uv < vu and U < X, V < у => uv < uy, UV < XV < xy, and, similarly, и < x, v < у => vu < vx, vu < yu < yx. In these implications, we have used the following trivial fact: r < s, r’ < s', |r| = |r'|, |s| = |s'| => rs < r's', with strict inequality if one of the first two inequalities is strict. The latter fact (extended to arbitrarily long products) will also imply the theorem; indeed, take w as in the statement of the theorem. Then P(l = /.+ E *x<- x,>l, |X,| = |I.| where * denotes coefficients in Z, whose exact values are not important. Thus Pw = fl Р,. = ^---1п + ^У1-- Уп : — 1
108 5 Applications of Hall sets where the summation runs over words y15 ..., yn with y, > |y,-| = |/,-| and у j > Ij for at least one j; in this case, yx ... yn> f ... ln = w. This proves the theorem. □ 5.2 THE DUAL BASIS We now return to arbitrary Hall sets. By Theorem 4.9(ii) the decreasing products of Hall polynomials Phi ... Phn (h1> > hn) form a Z-basis of Z</1>. By Corollary 4.7, each word w in A* has a unique decreasing factorization into Hall words w = h{ ... hn (hr > >hn}, thus, we define Pw = Phl-.Phn and we obtain a well-defined basis (Pw)weA* of the Z-module Z</1>, indexed by words. In this section, we shall consider the dual basis. Recall that by Section 1.1, the dual space of is canonically isomorphic to Thus the dual basis of the basis (Pw)weA* is the family (Sw)weA* of formal series such that for any word и U= E (S„,u)P„. (5.2.1) we A* Note that, since Pw is a finely homogeneous polynomial of same partial degrees as w, so is Sw. Moreover, by Lemma 4.11, the following result holds: let и = аг ... ap (a{ e A) and w = hr ... hn (ht e H, hr > > hn); then (Sw, u) is equal to the number of leaves labelled (hl,..., hn) in a derivation tree of(a15..., ap). See Examples 4.10 and 4.12. We shall give formulas for the series Sw. It will be more convenient to work in although the series have coefficients in Z; this could be avoided by defining a structure of algebra with divided powers in the shuffle algebra Z<</1>>. Recall that the shuffle product ш has been defined in Section 1.4. (5.2.2) Theorem 5.3 (i) = 1. (ii) For any Hall word h = av (a e A), one has Sh = aSv. (iii) For any word w = h\'... hk written as a decreasing product of Hall words (/q > • • • > hk, f,..., ik e N), one has
5.2 The dual basis 109 We need a lemma. Denote by => the relation on standard sequences, which is defined by 5 => t if either t = 2f(s) or t = pt(s) for some i; see eqns (4.2.3) and (4.2.4). Denote by => the transitive closure of =>. Derivation trees are defined in Section 4.2. Lemma 5.4 (i) Let s be the standard sequence (7i15..., hn) with n>2 and suppose that hr > h( for i = 2,..., n. Then for any standard sequence t with s^> t, t is of length at least 2. (ii) Let s be the standard sequence (h^...,^) with h2> - >hn. If ht... hn is a Hall word, then there is exactly one chain s =>•••=> (ht ... hn). If s^> t and t (7ц ... hn), then t is of length at least 2. Proof (i) By assumption, for any rise h{ < hi+1 in 5, one has i > 2. This implies that Af(s) and p,(s) are of length >2. Moreover, they satisfy the same condition as s, that is, their first element is greater than or equal to the others. This is clear for p,-(s), and for z/s), note that by eqns (4.1.7) and (4.1.10), hi+l > hihi+l. Hence, (i) follows by induction on the length of the chain s=> t. (ii) If и = 1, there is nothing to prove. Suppose n > 2. If hx > h2, then 5 is decreasing and so there is no nontrivial chain starting from s. Moreover, 7ц ... hn is not a Hall word, by Corollary 4.7. So we may assume that 7ц < h2. This is surely the only rise, so that 2,(s) = (h2, hi, h3,..., h„) and p,(s) = (h^, h3,..., 7i„). Note that h2 is the greatest element of x;(s), so by (i), s => t implies that t is of length at least 2. Moreover, pfs) is shorter than s and satisfies the same hypothesis. So we may conclude by induction. □ Proof of Theorem 5.3 (i) This is clear because = 1 and the other Pw are homogeneous of degree > 1. (ii) Let h = av be a Hall word with first letter a. Since Sh has no constant term, the equality Sh = aSv is equivalent to saying that for any word и and any letter b, (Sh, bu) = batb(Sv, u). We have by (5.2.1) w = E <s*- u)p„- w Hence, bu = £(sw,u)bpw. <5-2-^ w Choose a word w and write it as a decreasing product of Hall words, w = hi ... hn. Then the sequence s = (b,ht,... ,hn) is standard, and by Lemma 4.11 bPw = bPln...Ph, = PW = Y.«,pW-
НО 5 Applications of Hall sets where the summation is over all decreasing sequences t of Hall words and where a( is the number of chains s t in a fixed derivation tree of s. By Lemma 5.4(h), each t is of length >2, except when bw = bhr ... hn is a Hall word, in which case there is exactly one chain from s to (bw). When we put this into (5.2.3), we obtain bu = Y, (Sw, u)Pbw + sum °f decreasing products of length >2 bwHaiiword of Hall polynomials. Hence, the coefficient of Ph = Pav in this sum is equal to 0 if a b, and to (Sv, u) if a = b. In other words, (Sav, bu) = 3a b(Sv, u), as required. (iii) Note that, by definition of the dual basis, we have (Sw, Pu) = dw u. In particular, if w is a Hall word and и is not, then (Sw, Pu) = 0. Write w = Wj ... w{ as a decreasing product of Hall words, hence i = ц 4- • • • + ik. We evaluate (SW1 lu- • -ш SWi, Pu), which is equal to (SWI ® • • • ® SWi, bi(Pu)) by Proposition 1.8. By (1.5.6) 3i(P) = P®1®---®1 + 1®P®---®1+--+1®1®---®P, (5.2.4) for each Lie polynomial P. Write и = Uj ... un as a decreasing product of Hall words. Then Pu = PUl... PUn. Now, 3; is an homomorphism and each PUj a Lie polynomial. Hence, Wu) = (5-2.5) By inspection of (5.2.4) and (5.2.5), we find that f(Pu) is a sum of terms of the form Qr ® • • ® Qt, hence that (SWI ш • ш SWi, 3((Ри)) is a sum of terms of the form (SWI, Qt)... (SWi, Q,): if i > n, then in each term, at least one Qj is equal to 1, hence, since (SWi, 1) = 0, we have (SWi ш- • -ш SWi, Pu) = 0; if on the contrary, i < n, then in each term, at least one Qj is a decreasing product Pu = P„ ... PUir with r > 2, hence, since (SWt, Pu.) = 0, we also have (SW1 ш • • • ш Sw , Pu) = 0. In the remaining case i = n, we obtain, because (SWi, 1) = 0, (SWI ш • • ш SWn, Pu) = £ (SWI, PUaJ ... (SWn, PUaJ aeS„ W1 , Ufr( 1 j ... , Urrfn } * <reS„ If w / u, then (w15..., vv„) (ub..., un), and since both sequences are decreasing, the right-hand side vanishes. If w = u, then by Corollary 4.7 (w15..., w„) = (iq,..., u„); since (wp ..., w„) = (7q, ..., 7ц,..., hk,..., hk), each hj repeated ц times, the right-hand side is equal to the number of
5.2 The dual basis 111 permutations fixing the previous sequence, that is, ij ... ik!. Finally, — ,№7 . ,ik! ш---ш^,Ри) = ^, which proves (iii), by definition of the dual basis. □ Corollary 5.5 The shuffle algebra is a free commutative algebra over the set Sh (h Hall word). Proof The families (Sw) and (Pw) are dual bases. Hence, besides eqn (5.2.1), we also have the dual relation, for any polynomial Q c= Z (p»,e)s„. we A* This shows that the polynomials Sw form a basis of the space <0><Л>. Because of Theorem 5.3(iii), this implies that the polynomials Sh (h Hall word) form a free generating set of the shuffle algebra 0»<Л>. □ In the next result, we again use the algebra .2/ introduced in Section 1.5. Corollary 5.6 The following identity holds in the algebra sd E W® w = П exp(Sh® Ph), weA* heH where the product has to be taken in decreasing order. This result could also be stated in the algebra End(Q<4>) with the convolution product * (see Section 1.5): it gives a formula for the identity as a product of exponentials of elementary endomorphisms. Proof The right-hand side is h \i>oi! / hl > > hk ll- • • lk- ij..ik > 1 Because of the definition of PH, and Theorem 5.3(iii) this is E sw®p... we A*
5 Applications of Hall sets 112 which is equal to Eu ® E u)pw I = e u ®u by (5.2.1). □ 5.3 THE DERIVED SERIES Define a sequence of subspaces of £f(A), called the derived series, in the following way: = ^(Л), £Fn +1 = <£"]. The latter means that ^fn+1 is the subspace of £\A) generated by the polynomials [P, Q] for P, Q in <£n. Each subspace P£n is a Lie ideal of ^(Л), and in particular ^fn+1 c P£n-, this is an easy consequence of the Jacobi identity, left as an exercise. We define a particular Hall set H which will be shown to be compatible with the derived series. Define Ho = A and order it totally; now define recursively Hn+1 as the set of trees of the form h = (.. .((hr, h2), h3),..., hk), (5.3.1) where к > 2 and where ht,..., hke Hn with /ii < h2 > h3 > • • • > hk. (5.3.2) Now order Hn+l totally. Finally, let H = (Jn>0Hn and extend the order of the Hn to H by the condition h e Hn, к e Hp, n < p => h > k. Symbolically, this may be written as H0>H1> Equation (5.3.1) is illustrated in Fig. 5.1. (5.3.3) >Hn>Hn+1> Theorem 5.7 The set H is a Hall set. For each n > 0, the set of Hall polynomials {Pw | w e ljp>„ Hp} is a basis of У?п. Proof (a) In order to prove the first assertion, we have only to verify conditions (4.1.1)—(4.1.3). Suppose h e H is of the form fh’, h"). Then h is in Hn+l for some n > 0, and is of the form (5.3.1). Then h" = hk is in Hn, so h < h" by (5.3.3), and (4.1.1) holds.
5.3 The desired series 113 If к > 3, then h' = (.. .(hv h2),... ,hk_l) is in Hn+l. Hence, h' < h" by (5.3.3). Moreover, h' = (x, y) with у = hk_r, and by (5.3.2), у > h". So (4.1.2) and (4.1.3) hold in this case. On the other hand, if к = 2, then h' g Hn and h' = hx < h2 = h" by (5.3.2), and (4.1.2) holds. Moreover, if h’ is not a letter, then n > 1 and h' = (x, y) must also be of the form (5.3.1), so that у is in Hence, у > h" by (5.3.3), and (4.1.3) holds. Suppose now that (4.1.2) and (4.1.3) hold for h = (h', h"). We have to show that h is of the form (5.3.1) and that (5.3.2) holds. If h', h" are in the same Hn, then h is of the form (5.3.1) and (5.3.2) holds with к = 2, because h' < h" by (4.1.2). Otherwise, because of (4.1.2) and (5.3.3), we have h' g Hn+l, h" g Hp with n + 1 > p. Then h' is of the form (5.3.1) and condition (5.3.2) holds. Moreover, writing h' = (x, y), we have у > h" by (4.1.3), and у = hk by (5.3.1). Since hk is in H„ and h" in Hp, we deduce from (5.3.3) that n < p. This, together with p < n + 1, shows that p = n. Then we set h" = hk + l, so h is of the form (5.3.1), and condition (5.3.2) holds (with к + 1 instead of k). (b) Define the level l(t) of a tree t inductively by /(t) = 0 if t g A and, otherwise, t = (t', t"), and let fl + /(t') if I(t') = /(*"); tsup(/(t'), I(t")) if/(f) * l(t").
114 5 Applications of Hall sets The following facts are easy to verify: if s is a subtree of t, then l(s) < l(t); (5.3.4) if ti is obtained from t by replacing its subtree s by Sj and if) /(5) < /(sj, then l(t) < 1(h). J We claim that Hn is the set of trees in H which are of level n. Indeed, note first that, by definition of the level, a tree which is not reduced to a single letter is of level > 1. Hence, Ho is the set of Hall trees of level 0. Arguing by induction, let h be in Hn+ p hence of the form (5.3.1); then h b..., hk are in Hn, hence of level n by induction; an immediate induction on к (starting from к = 2) and the definition of the level shows that h is of level n + 1. This proves the claim. Note that the second assertion of the theorem is true for n = 0, by Theorem 4.9(i). The general case will follow by induction if we prove the following. Let h, к be Hall trees of level >n; then [Ph, Pk] is a linear combination of Hall polynomials Pt, with t of level >n + 1. Note that under the previous hypothesis the tree (h, k) is of level > n + 1. Thus, in view of the algorithm stated after the proof of Theorem 4.9 it is enough to show that each step of this algorithm, when applied to a tree t, produces only trees of level > l(t). This is clear for step (4.2.8) and (4.2.9). For step (4.2.10), we have s = (s', s"), where s' = (x, y) and s" are Hall trees and x < у < s", s' < s'. By the claim and (5.3.3), we have /(x) > l(y) > l(s") and /(s') > /(s"). If we had /(s') = /($"), then we would deduce /(s') = l(s") < l(y) < /(x) < /(s') (the latter inequality by (5.3.4)), hence, equality everywhere; since s' = (x, y), this would imply, by the definition of the level, /(s') = /(x) + 1 > /(x), a contradiction. Thus we have /(s') > l(s") which implies l(s) = /(s'), by definition of the level. Now, by (5.3.4) and (5.3.5), /(((x, s"), y)) > /((x, y)) = /(s') = /(s), and /((x, (y, s"))) > /((x, y)) = /(s') = /(s). Hence, the trees tj and t2 of step (4.2.10), obtained by replacing the subtree s of t by ((x, s"), y) and (x, (y, s'')), respectively, are by (5.3.5) of level >/(t). This proves the theorem. □ 5.4 ORDER PROPERTIES OF HALL SETS Let H be a Hall set in A*. According to Corollary 4.7, each word w in A* has a unique factorization w = hr .. .hn, hte H, hr >• - >hn. We use this bijection between A* and the set of decreasing sequence of Hall words to define a total order on A*, which extends the order on Hall words, and which we therefore still denote by <. This order is obtained by carrying to A* the alphabetical order on the sequences of Hall words. This means that if z is another word in A*, factorized as z = kr ... km, kte H, kx >• - >km, then
5.4 Order properties of Hall sets 115 (5.4.1) w < z if and only if either n < m and = fc,- for i = 1,..., n, or for some j, hx = ki,..., hi_l = k^ t and < kt. In the next result, H is a fixed Hall set, with the corresponding order < on Л*, defined in (5.4.1). Theorem 5.8 Let h be a word in A*. Then the following conditions are equivalent: (i) h is a Hall word; (ii) for each nontrivial factorization h = uv, one has h < v; (iii) for each nontrivial factorization h = uv, one has h < vu. We need several lemmas. The first one has a geometrical interpretation (see Fig. 5.2).
116 5 Applications of Hall sets Lemma 5.9 Let h be a Hall word and h = uv a nontrivial factorization. Then there exist Hall words kr,..., km, ht,... ,h„ such that u = kx...km, v = hl...hn, and kt,..., km < ht, ht > • • > hn> h". Furthermore: (i) if v is longer than h", then hr > (h')" and n > 2; (ii) if v is shorter than h", then hn > (h")". Proof Let h = uv, as in the lemma. Then h = f(t), for some Hall tree t. We prove by induction on \h\ that u =f(si). ..f(sm), v = f(tj ...f(tn) for some Hall trees s19..., sm, tl,...,tn with s1?..., sm < tx, >•••>!„> t". We have t = (f, t") where t', t" are Hall trees by (4.1.2), hence we have ЛОЖ) = uv. (4.2) It may happen that и = f(tf and v = f(f), in which case we take m = n = 1 and Sj = t', tj = t". Then (4.1.2) gives us and we are done. Otherwise, we have two cases, according to the relative lengths of v and h" = f(t"). In case 1, v is longer than h". Then by (5.4.2), v = xf(t"), f(t') = их, for some nonempty word x. By induction, applied to the Hall word t', we obtain u = /(sj ... f(sm), x = f(ti)...f(tn-i), for some Hall trees sl,...,sm, tl,...,tn_l such that s19..., sm < tx, G > • • > t„_ i > (t'K and n > 2. We have (t')" > t" by (5.1.3), hence tn-i > t", and we may take tn = t". Moreover, (i) holds. In case 2, v is shorter than h". Then by (5.4.2) for some nonempty word x, f(t") = xv, и = f(t')x. Then, by induction applied to the Hall word t", we obtain x = f(s2)... f(sm), v = f(tf)... f(t„), for some Hall trees s2,..., sm, t1,...,tn with s2,..., sm < t19 tx > • • • > tn > (t")". By (4.1.1), we have (t")" > t", and by (4.1.2), t" > t'. Hence, t' < tt, and we may take = t'. Moreover, (ii) holds. □ Recall that standard sequences of Hall words (or trees—we identify once more Hall words and trees—see Corollary 4.5) were defined in Section 4.1, where the binary relations -> and its transitive closure are also defined. We denote by w(s) the word w(s) = /ц ... hn, for a sequence of words s = (hi,..., hn), and by max(s) the greatest word in the sequence. Lemma 5.10 Let s = (hi,..., h„) be a sequence of Hall words. Then there
5.4 Order properties of Hall sets 117 exists a standard sequence t = (kt,... ,km) of Hall words such that w(s) = w(f) and max(s) = max(t). Moreover, max(s) = hY if and only if max(t) = kr. Proof (induction on <5(s) = j |/i{| — n). If s is standard (in particular if <5(s) = 0), there is nothing to prove. Otherwise, there are some i, j such that h" < hj. Replace in s the word by the sequence (Jih h"), and call the new sequence s'. Evidently, w(s') = w(s). Note that /i; < h" by (4.1.1), hence hi < hj which shows that /i; max(s); since by (4.2), /1- < h", we deduce that max(s) = max(s'). Moreover, if max(s) = ht, then i 1, hence is the first element of s'; conversely, if max(s) hu then either i ± 1, and hr is the first element of s', and not its maximum because max(s') = max(s); or i = 1, and h\ is the first element of s' with h\ < h’[ (by (4.1.2)) <hj < max(s) = max(s'), and h\ is not the maximum of s'. Now, <5(s') = <5(s) — 1, which allows us to conclude by induction. □ Lemma 5.11 Let s = (ht,..., hn), t = (kv ..., km) be standard sequences such that s —+ t. Then (i) max(s) > max(t); (ii) if max(s) = hr, then hr = кг = max(t); (iii) if max(s) Ф hY and t is decreasing, then max(t) < max(s). Proof (i) This is a consequence of (4.1.7) and of the inequality (/ii,/ii +1) < hi +1, a consequence of (4.1.1). (ii) In this case, (ht, h2) is not a rise, so that hr still appears as the first element of t', for any t' with s -> t'. Hence, (ii) follows from (i), by induction. (iii) Let s = s0 -> st ->----> sn = t. Let h = max(s), and denote by g(sf the first element of sf. We show by induction on i that g(sf < h. This is true for i = 0, by hypothesis. Suppose it is true for sf; then either g(si + J = g(sf < h; or g(si + 1) is by (4.1.7) the concatenation kl of the two first elements of s{, hence, by (4.1.1), g(si+l) < I < тах(х,) < max(s0) (by (i)) = h. The previous argument shows that g(t) < max(s). Since t is decreasing, max(t) = g(t) and we are done. □ Lemma 5.12 Let h be a Hall word of length >2. Then h" is its smallest nontrivial proper right factor. Proof Let v be a nontrivial proper right factor of h. If v is longer than h", then by Lemma 5.9 we have v = hx ... hn for some Hall words ht,..., hn with hr > • • • > hn > h" and n > 2. Hence, we deduce from (5.4.1) that v > h". If v is shorter than h", then by Lemma 5.9 we have v = h{ ... hn for some Hall words hr,... ,hn with ht > • • •> hn> (h")". Since hY > (h")" > h" by (4.1.1), we deduce from (5.4.1) that v > h". □
118 5 Applications of Hall sets Proof of Theorem 5.8 (i) => (ii) This is a consequence of (4.1.1) and Lemma 5.12. (ii) => (i) Suppose w is not a Hall word. Then by Corollary 4.7 we have w = hi ... hn for some Hall words й19..., hn with hr > • • • > hn and n > 2. Hence, by (5.4.1), the right factor hn of w is smaller than w, because hn < (i) => (iii) By Lemma 5.9, we have и = kr ... km, v = hY ... hn for some Hall words kt, hj which are all <hr and such that hY > h". Hence, vu = hi ...h„ki ...km. By Lemma 5.10, there exists a standard sequence of Hall words t = (/p ... ,lp) such that vu = f ... lp and hY = f = max(/t). Hence, Theorem 4.3(iii) and Lemma 5.11 (ii) imply that t t', for some decreasing sequence of Hall words, such that the first element in t' is hp We have hi > h" > h, by (4.1.1), so that by (5.4.1), vu > h. (iii) => (i) Let w be a word which is not a Hall word. Then by Corollary 4.7 we have w = ht ... hn for some Hall words ht with hr > • > hn, and n > 2. If hi = hn, then these Hall words are all equal and w = hnhi ... hn_i. Otherwise, we have hn < hr. Let w' = hnhi ... hn_i. Then, by Lemma 5.10, applied to s = (hn, hr,..., hn_ J, we may find a standard sequence t = (ki,..., fcOT) such that w' — kr .. . km and hi = max(s) = max(t) > kv By Theorem 4.3(iii) and Lemma 5.1 l(iii), we find a decreasing sequence t' of Hall words such that t t' and max(t') < max(t) = hr. This implies by (5.4.1) that w' < hi < w. Hence, we have found in all cases a nontrivial factorization w = uv such that w > uv. □ The next result shows that the standard factorization of a Hall word, and the factorization of a word into a decreasing product of Hall words, may be found with the order < on A* introduced above. Theorem 5.13 (i) Let h be a Hall word of length at least 2 and h'h" its standard factorization. Then h" is the longest proper right factor of h which is a Hall word, and also the smallest nontrivial proper right factor of h. (ii) Let w be a word and w = hi ... hn be its decreasing factorization into Hall words. Then h„ is the longest right factor of w which is a Hall word, and also the smallest nontrivial right factor of w. Proof (i) Let г be a proper right factor of h, longer than h". Then by Lemma 5.9 the decreasing factorization of v into Hall words is hr ... hn, with n > 2. Hence, v is not a Hall word. Now, the remaining assertion follows from Lemma 5.12. (ii) Let у be a right factor of w, longer than hn. Then for some i = 1,..., n — 1, у = vhi+i ... hn where г is a nontrivial right factor of h,. If
5.5 Synchronous codes 119 v = hh then у is not a Hall word, and by (5.4.1), у = h{... hn, because й,- > h„. If v is a proper right factor of hh then by Lemma 5.9, у = k} ... kmhi+1... h„ with kt > > km > h" > h{ > hi+l > • • > hn, by (4.1.1). Hence, again, у is not a Hall word and у > hn by (5.4.1). Now, let у be a nontrivial right factor of w, shorter than hn. Then у is a nontrivial proper right factor of hn, so that hn < у by Theorem 5.8(ii). □ Corollary 5.14 Let h = aw be a Hall word, where a is a letter, w is a nonempty word, and let w = hr ... hn be the decreasing factorization ofw into Hall words. Then h" = h„. 5.5 SYNCHRONOUS CODES A submonoid M of A* has a unique minimal generating set X. Such a set is called a code if M is a free monoid, hence freely generated by X. In other words, a set X is a code if whenever ..., xn, yp ..., yp are elements of X with . x„ = yt ... yp, then n = p and xt = yf for i = 1,..., n. When X is a code, the monoid homomorphism from the free monoid X* into A*, which extends the identity on X, is an isomorphism from X* onto M: it ‘encodes’ X* into A*. We shall identify both monoids, and denote M also by X*. Let X be a subset of A*, distinct from *1}. If X satisfies v, uv e X => и = 1, (5.5.1) for any words u, v in A*, then X is easily seen to be a code: such a code is called a suffix code; this is because (5.5.1) states that no word in X is a right factor of another word in X. It is easily verified that a submonoid M of A* is generated by a suffix code if and only if M is left unitary, i.e. r, uv e M => we M (5.5.2) for any words u, v in A*. A suffix code is called complete if each word w in A* is the right factor of some word in X*; in other words, we have for some words и in A* and y1?..., yp in X\ uw = yt ... yp. This implies that we may write w = sxj ... xn, s e S, n > 0, v, e X, (5.5.3) where S is the set of proper right factors of the words in X. It is easily seen that the representation (5.5.3) is unique, since no word in X is a right factor of another word in X. A word tn in A* is called synchronizing for X if inveX* (5.5.4) for any word г in A*. In particular tn is then in X*. If a word w contains
120 5 Applications of Hall sets as a factor a synchronizing word m, then the computation of the representa- tion (5.5.3) is divided into two parts; indeed, we have w = umv, hencemv e X* by (5.5.4), hence the knowledge of the representation (5.5.3) for и and mv implies that of w. Example 5.15 Let A = {a, b}, M the submonoid {1} и {aw | w e A*}. Then M is generated by the suffix code X = {abn | n > 0}. The set S of proper right factors of X is {bn | n > 0}. Each word in A* has a unique representation bn(abn')... (abnk). Each nonempty word in M is synchronizing. We consider now a Hall set H in A*, and the total order < associated with it, as in Section 5.4. Recall that each word w in A* has a unique representation as a decreasing product of Hall words. Theorem 5.16 Let К be an upwards closed subset of H, i.e. \/k e K.,Vhe H, к < h => he K. Let M = {w e A* | Vk e K, w < k}. Then the following properties hold: (i) M is a submonoid of A*; (ii) M is free, generated by the suffix code X = {he H\K | h e A or h" e K}; (iii) the subset H\K of X* is a Hall set in the free monoid X*; (iv) X is complete if and only if the set S = {kY ... kn | к, e K, n > 0, kr >• >kn} is the set of proper right factors of the words in X. This is the case in particular if К is finite and |Л| > 2. (v) if К is of cardinality p, then for each nonempty word m in M, mp is synchronizing; (vi) suppose that H satisfies the following property: for any h in H\A, with standard factorization h'h", one has h' > h. Then for each nonempty word m in M, there exists p such that mp is synchronizing. If К Ф H, X is complete. Note that, in view of (5.4.1), M is by definition the set of words of the form hr ... hn, n > 0, A, e H\K, /ц > • • • > hn. (5.5.5) Assertion (i) of the theorem is that M contains all the words /ц ... hn with ^Н\К. Note that Lemma 4.19 is a particular case of the theorem. Proof (i) By definition (5.4.1) of the order < in A*, the empty word is in M. Let w, z be two elements in M: then, by (5.4.1), w = /ц ... hn, z = kr ... km with hi,...,hn, k!,...,kmeH, /ц > • • • > hn, ki>--->km; moreover kp kj < к for any к in K, by definition of M. Then wz = hi... h„k! ... km, and by Lemma 5.10 we have wz = f ... lp,
5.5 Synchronous codes 121 for some standard sequence (/19... ,lp) with max(/t) < к for any к in K. Hence, by Theorem 4.3(iii) and Lemma 5.1 l(i), we have wz = . mq with g H, > • > mq and < к for any к in K. Thus by (5.4.1), we deduce wz < к for any к in K, i.e. wz e M. (ii) By Lemma 5.12, for each element h in X and each nontrivial proper right factor v of h, one has v > h"; since h" e К and since К is by hypothesis upwards closed in H, we cannot have vgH\K, and a fortiori not v e X. Hence, X is a suffix code. It generates M: indeed, M is generated by H\K, and for h in H\K, either h g X, or h" g H\K, henceh'eH\K (because h' < h" by (4.1.9) and К is upwards closed) and by induction h', h" e X*, implying h = h'h"eX*. (iii) Let h g H\(K и X). Then h" ф К, which by (4.1.9) implies that h' ф К, because К is upwards closed. So h', h" g H\K and h' < h". If h' ф X, then Кф A and (h')" > ft" by (4.1.11). Conversely, let h,kG H\K with h < к and either h g X or h" > k. We show that hkGH\K. In the first case, either heA and then hkGH because of (4.1.11), or h g X\A, hence h" g K, implying h" > к (because к ф К, and К is upwards closed), thus hkGH by (4.1.11). In the second case, (4.1.11) gives directly hkGH. Moreover, hk < к by (4.1.10), so that ЬкфК, and finally hkGH\K. To conclude that H\K is a Hall set in X*. note that X is contained in H\K, and H\K is contained in M = X*, by definition of M. (iv) Let S be as in the theorem. Then each word w in A* has a unique decomposition w = sxt ... xn, s g S, n > 0, xtG X. Indeed, this is a con- sequence of Corollary 4.7, of the definition (5.5.5) of M, and of the fact, proved above, that M is freely generated by X. If X is complete, then by uniqueness of (5.5.3), we must have necessarily that S is the set of proper right factors of words in X. Conversely, if each word in S is the right factor of some word in X, then each word in A* is the right factor of some word in X*, because of the representation sxj ... xn above. Hence, X is a complete suffix code. If К is finite, then X is complete because of (v) and the fact that M / {1}: indeed |Л| > 2, hence H is infinite and 0 X H \K M. (v) Let К = {kp > • • • > ki}. We show by induction on i that ... /с/1 (Js > 1) is in M. This is enough in view of (5.5.5) and Corollary 4.7. We may write m = hi ... hn with hr > • > hn, ht g H\K, n > 1. Then, in view of Lemma 5.10, there exists a standard sequence of Hall words (^,...,10 such that w = mk{'.. .k{' = .. .lp with kt = max(/7) f. Hence, Theorem 4.3(iii) and Lemma 5.11 (iii) imply that w = pY ... pq for Hall words p19..., pq with Pi >• - >pq and <kt. Thus, we conclude by induction on i that ml~1w is in M, as desired. (vi) If H has the stated property, then so has the Hall set described in (iii). Let m in M, m * 1. We show that for p = |m|, mp is synchronizing, by induction on |m|.
122 5 Applications of Hall sets If ж is a letter, then for any word w = hr ... hn, written as a decreasing product of Hall words, the sequence s = (m, hlf ..., hn) is standard. Then either m > and by (5.5.5) mw e M, or m < hr, and s -> (mhl, h2, • •., h„): since m/ij < m by hypothesis, we have mhi eH\K and we deduce from Theorem 4.3 (iii) and Lemma 5.1 l(i) that mw e M. Now, let m be of length >2, and choose c = the greatest letter in m. If c e H\K, then each letter of m is in H\K, hence m = m'b, m' e M, b e Л\К. By the case |m| = 1, we have bw e M, hence mw = m'bw e M for any word w. Suppose now that с e K. Let K' = {h e H | h > с}, M’ = {w e A* | Vk e K', w < k}; then К' К, M' M, and by (i) and (ii), M' is a submonoid of A*, generated by a suffix code X'; moreover, H\K' is a Hall set in the free monoid X'* by (iii), K\K' is an upwards closed subset of H\K' and M = {w e X'* | Vk e K\K', w < k}. Hence, M is obtained from X' in exactly the same way as M from A. Write m = hi... hn with h^H, hx > > hn. Then by (5.5.5) we have hi gH\K, hence ht gH\K'. Thus m e M' = X'*, by (5.5.5). The X'-length of m is strictly less than its Л-length, otherwise each letter of m is in X' c M', in particular c, a contradiction. Hence, by induction and the previous remarks, we conclude that mp~1X'* M. Observe that, since c is the greatest letter in m, m does not begin with c: otherwise its decreasing factorization into Hall words begins by c, and m > c, hence m ф M‘, a contradiction. Hence m = bm, b < c, thus b e M'. By the case |m| = 1, we have ЬА* M', hence mA* M' = X'*. This shows that mpA* M. If К Ф H, then Л/#{1} and the synchronizing property implies clearly that X is complete. □ Let X be a set of words of equal length n. Then evidently X is a suffix code. We say that X is comma-free if whenever a word x of X is a factor of a message (i.e. a word in X*), then the latter may be cut into two submessages at x; formally, it means that Vx e X, Vu, v e A*, uxv e X* => u,v e X*. (5.5.6) Note that if X contains two words x, y, with x = uv, у = vu for some nonempty words u, v in A*, then X is not comma-free. Indeed, we have that the word uvuv is in X* and contains the inner factor y, but u, v are not in X*. This implies that the cardinality of X does not exceed the number of primitive conjugation classes of length n. Hence |.Y| < 1 EnW", П djn where q = |Л| (see Theorem 7.1). The next result shows that this bound may be achieved, when n is odd. Theorem 5.17 Let n be odd and q = |A|. Then there exists a comma-free code, consisting of words of length n, and of cardinality (l/м) p(d)qnld.
5.5 Synchronous codes 123 The result is not true when n is even; see Berstel and Perrin (1985, p. 346). To prove the theorem, we construct a special Hall set H in A, and show that the set of words of length n in H is a comma-free code. This is enough, because the number in the theorem is equal to the number of Hall words of length n (Corollary 4.14). Recall that M(A) denotes the free magma. We say that t g M(A) (respec- tively A*) is even (respectively odd) if |t| is. Lemma 5.18 There exists a Hall set H such that for any h, к in H: (i) h even, к odd implies h < k; (ii) h, к odd and \h\ > \k\ implies h < k. Proof Let N = {t g M(A) | t even, or t odd and t" odd). Define a binary relation z on N by s z t if either s is even and t odd, or if s, t have the same parity and |s| > |t|. Then the reflexive and transitive closure of z is a partial order on N. Extend it to a total order < on N. Note that for h, к in N, one has (i) and (ii), and that h < Л" for each h in N. Define recursively a subset H of N by A c H, and for any t = (f, t") in N\A, t is in H if and only if t', t" g H,t' < t" and either t' g A, or (t')" > t". Note that h, к e N and h < к implies (h, k) g N. Then H is a Hall set with the desired properties. П Proof of Theorem 5.17 Let H be the Hall set of Lemma 5.18. Let К = {h g H | h odd). Then К is upwards closed in H, thus satisfies the hypothesis of Theorem 5.16 and M = {w g A* | V/c g K, w < k} is a sub- monoid of A*. Note that M is by (5.5.5) generated by the even Hall words. By Corollary 4.7, each word w in A* has a unique representation w = kt . .. knm, with g K, m g M, кг > • > k„. Moreover, if w is a right factor of some word к in K, then by Lemma 5.9 we have w = kY ... k„ with ktGH and kr > > kn> k"; by (4.1.10) k" > k, so that each k, is in K, because К is upwards closed. We claim that each left factor of a word in H is in M u KM. Suppose the claim is proved. Then suppose that the set of Hall words of length n is not a comma-free code. This means by (5.5.6) that there are three such Hall words h, к, I say, and a factorization hk = ulv for nonempty words u, v. Hence, I = WiW2 with h = uWi and к = w2v. Since / is odd, one of and w2 is even, and so there exists an even word w which is both right and left factor of some word in K. Thus, by a previous remark, and the claim, we have w = ki ... kn g M и KM, with kt g K, kt > • • • > kn. Since w is even, we must have w g M, which contradicts the uniqueness of Corollary 4.7, in view of (5.5.5).
124 5 Applications of Hall sets To prove the claim, we prove first that for Hall words h, к one has h, к odd, h < к => hk e M, (5.5.7) and h even, к odd => hke KM. (5.5.8) Let us prove (5.5.7): if h e A or h" > k, then by (4.1.11), hk is a Hall word, evidently even, so that hk e M. If h" < k, then, since h" is odd (because h" > h, and К is upwards closed), we have by induction h"k g M; but h' is even, so that h' g M, and hk = h'(h"k) g M, the latter being a submonoid. We prove (5.5.8): we have h < к by Lemma 5.18(i). If h e A or h" > k, then hk g H by (4.1.11), and hke К c KM. Otherwise h" < k: either h" is odd, so that h' is too and, by (5.5.7), hk = h'(h"k) g KM; or h" is even, so that h' is too, and by induction on |Л|, h"k = klm with кг g K, meM. Then, by induction again, h'^ g KM, so that hk = h'h"к = h'^m g KMm 5= KM. We now prove the claim: let w be a left factor of a Hall word h. If w is a left factor of h', we are done by induction. Otherwise w = h'w', where w' is a left factor of h". If w' = h", we are done because then w g H £ M и KM; so we may suppose w' h". By induction, we have w' = m or km (m g M, kGK). Thus w — h'm or h'km. If h' is even, w belongs to M и KMm, by (5.5.8), hence to M и KM. If h' is odd, then, since h' < h" by (4.1.9), h" is odd too; moreover, к is shorter than h" (because w' h”), so that by Lemma 5.18(ii), h" < k, hence h' < k; thus w g KM и M by (5.5.7). This proves the claim. □ 5.6 APPENDIX 5.6.1 Long products of Lyndon words There exists a function N(q, k) such that for any totally ordered alphabet A with q elements, any word w in A* of length at least N(q, k) has a factorization w = uli ... lkv, where each /t is a Lyndon word and f > • • > lk (Reutenauer 1986a). For the proof, one introduces the code В = a(A\a)*, where a is the smallest letter in A. Then В may be considered as a totally ordered alphabet. The free monoid B* is embedded in A*, and one shows that: (i) the alphabetical order in B* is the restriction to B* of that of A *; (ii) each Lyndon word in B* is a Lyndon word in A*. Then the existence of N(q, k) is proved by lexicographical induction on (k, q). The previous result has as consequence a theorem of Shirshov, which has itself applications in rings with polynomial identities (Shirshov 1957; see also Lothaire 1983, Chapter 7). It has been extended to the Viennot factorizations by Varrichio (1990). It is not known if a similar result holds for Hall words.
5.6 Appendix 125 5.6.2 Multilinear Lie polynomials Let A = {a15 and call a polynomial P multilinear if it is a linear combination of words aa(l)... aa(n), <reS„. Denote by Fn the space of multilinear Lie polynomials. Then Fn is of dimension (n - 1)! and admits as basis the set [• • • a<r(3)J’ • • • » a<r(n)l’ a 6 S„, cr( 1) = 1. Indeed, an inductive use of Jacobi’s identity shows that these polynomials generate F„. Moreover, they are linearly independent, because the above polynomial is the only one involving the word ae(1)... aa(n). Alternatively, one can use the fact that the number of multilinear Lyndon words is (n — 1)!. Let A as above be naturally ordered, and consider the set of Lyndon words as a Hall set, and let Pw denote the corresponding polynomial (as in Theorem 5.1). Then one has the following identity (Melancon and Reutenauer 1989): 1) • flcr(n) * <reSn 5.6.3 Another approach to Hall sets Example 5.15 is directly related to the Lazard elimination process (see Section 0.3). It allows a quite different approach to Hall sets. Let Я be a Hall set in A*. If A has a greatest element z, then by Theorem 5.16(iii) the set H n B* is a Hall set in B*, with В = (A\z)z*. Similarly, if (azn) denotes the tree (.. .(a, z),..., z), then the homomorphism of magma h: M(B) -> M(A) sending azn e В on to (azn) is injective, and the set Л-1(Я) is a Hall set in M(B). This allows us to prove all the results on Hall trees and words. Let us sketch for instance the proof of Corollary 4.4. Let w be a word in A* and let z be the greatest letter in w. We may replace A by a finite alphabet with greatest element z. Let В be as above. Then we may write w = znxr ... xk with Xi e B. Then the word ... xk in B* is strictly shorter (as a word on the alphabet B) than w, hence we conclude by induction that w has a decreasing factorization into foliages of Hall trees. To prove uniqueness, one notes that n is necessarily unique (it is the number of z’s at the beginning of w, because each x in В begins by a letter distinct from z); then one uses induction by passing to the word x'i ... xk. To prove Theorem 4.9 one proves first that, with the previous notations, one has the isomorphism of K-modules: K[z](x)K<B> -+ K(A), zn0xl ...xk^ znPXl... PXk(Xi e B), where, for x = azr, Px is the Lie polynomial [.. .[[a, z], z],..., z], with r zs. Compare this with Theorem 0.6.
126 5 Applications of Hall sets The factorization A* = z*((A\z)z*)* is a special case of bisection. Viennot (1978) gives similar isomorphisms in the case of general bisections (see also Lothaire 1983, Proposition 5.3.11), and more generally, for left regular factorizations of the free monoid. In particular, У (a, b) has a canoni- cal decomposition as the (module) direct sum of the Lie subalgebras ^(re Q> + u oo), where is the space generated by the homogeneous Lie polynomials P such that dega(P)/degfc(P) = r (see also Viennot 1974). 5.7 NOTES Lyndon words appear in the work of Lyndon (1954, 1955a), and they are used by Chen et al. (1958) to construct basic commutators of the free group; Viennot (1978) and Lothaire (1983) also construct the Lyndon basis of the free Lie algebra; an equivalent basis had been constructed by Shirshov (1958). It was Viennot who showed that the Lyndon basis is a particular Hall basis, once Hall sets have been properly generalized (see Section 4.5). Theorem 5.3 is due to Schiitzenberger (1958), with improvements from Melancon and Reutenauer (1989) and Melancon (1991). Note that condition (iii) holds in any enveloping algebra, as the proof shows. Theorem 5.7 is from Reutenauer (1990). The assertion on the length in Theorem 5.13(i) is due to Viennot (1978), and Theorem 5.13(ii) was proved by Duval (1983) in the case of Lyndon words. All other results of Section 5.4 are due to Melancon (1992), who also proved that the order obtained in (5.4.1) in the case of Lyndon words is the alphabetical order. Theorem 5.16 follows an idea of Schiitzenberger (1958) and part (vi) is especially due to him (1986). Theorem 5.17 is due to Eastmann (1965); the proof by Scholtz (1969) of this result consists in constructing a special Hall set, so we have included it in this book (see also the book on codes by Berstel and Perrin (1985, Theorem 5.3.8)). The problem of factorizing matrices of the form 1 — a — b — • • • leads also to the construction of special Hall sets, by Good (1971), who calls them standard lists.
6 Shuffle algebra and subwords The shuffle algebra is a free commutative algebra over the set of Lyndon words; this result is presented in Section 6.1, together with a precise identity on the shuffle product of Lyndon words, which implies that actually there is a canonical structure of algebra of divided powers. In Section 6.2 a remarkable presentation of the shuffle algebra is given; the generators are the nonempty words, and the relations the nontrivial shuffle products. In Section 6.3 we introduce subword functions on the free group, the Magnus transformation of the free group, the algebra structure on the module of subword functions, and the fact that this algebra is generated by the particular subword functions corresponding to Lyndon words. The main tools are the concept of representative, or recognizable functions on the free group, and the infiltration product of Chen, Fox and Lyndon. Section 6.4 presents the commutator calculus of P. Hall, and its generalizations. There are many results involving the lower central series of the free group, the Magnus transformation, and the algebra of subword functions. 6.1 THE FREE GENERATING SET OF LYNDON WORDS Recall that the shuffle product ш was defined in Section 1.4, and that with this product is a free commutative algebra (Corollary 5.5). We show here that the set of Lyndon words is a free generating set of the shuffle algebra. Let A be totally ordered and put on A* the alphabetical order. Recall that a Lyndon word is a word w on A* such that Vu, v e A +, w = uv => vv < v (see Section 5.1). This means that w is smaller than all its nontrivial proper right factors. Recall that each word w in A* has a unique decreasing factorization into Lyndon words; this is a consequence of Theorem 5.1 (the set of Lyndon words is a Hall set), and of Corollary 4.7 (each word has a unique decreasing factorization into Hall words). We assume that К is a Q-algebra. Theorem 6.1 (i) The shuffle algebra К (A) is freely generated by the Lyndon words. (ii) For each word w, written as a product of Lyndon words vv = l\ ... Г?
6 Shuffle algebra and subwords 128 (/j > • • • > lk; ir,... ,ik> 1), one has --------— ш • • ш l^ik = w + £ auu, i i • i)c и < w for some natural integers au. Proof Note that (i) is an immediate consequence of (ii): by triangularity, the polynomials Qw = —^-~l^'in---inirk q!... ikl form a Z-linear basis of Z<4). First note that Qw has coefficients in N, and that w appears in Qw with coefficient a > 1. This is a consequence of the following general fact (which may be verified directly from the definition of the shuffle product): if u15..., uk are words and i15... ,ik are natural integers, then the coefficients of the polynomial are natural integers all divisible by if.... ikl, and u'i ... uk appears in this polynomial. Consider the set of Lyndon words as a Hall set (Theorem 5.1). This implies, by Section 5.2, the existence of two dual bases, (Pw)weA- and (Sw)weA*, of К<Л>. By Theorem 5.1, the basis (Pw) has the following triangularity property: Pw = w + greater words. By duality, we deduce Sw = w + smaller words. Now, with w as in the statement of the theorem, we have by Theorem 5.3(iii) s„= tl!... ikl Moreover, the same theorem shows that Sw has nonnegative coefficients. Thus Sw = w + sum of smaller words. (6.1.1) In particular, S) = / + other words, hence we obtain SW = QW + QW (6.1.2) where Q'w has non-negative rational coefficients. Comparing (6.1.1) and (6.1.2), we deduce that all the words appearing in Qw are <vv; moreover, w appears in Qw with coefficient a > 1, hence we must have a = 1. □ The following result will serve us later.
6.2 Presentation of the shuffle algebra 129 Corollary 6.2 A word w is a Lyndon word if and only if for each nontrivial factorization w = xy, there exists a shuffle of x and у which is greater than w. Proof If w is a Lyndon word and w = xy (x, у / 1), then by definition w < у < yx, and we conclude because yx is a shuffle of x and y. Conversely, suppose that w is not a Lyndon word. Then w is a decreasing product of Lyndon words: w = l\'... If (f > > lk; . . . , ik > 1). Since w is not Lyndon, we have ц + • • • + ik > 2. Let w = xy, with x = f: this is a nontrivial factorization of w. Then each shuffle и of x and у appears in the polynomial If" ш • ш lff'k. By Theorem 6. l(iii), this implies that и < w. 6.2 PRESENTATION OF THE SHUFFLE ALGEBRA Here, К is still a Q-algebra. Define, for each nonempty word w in A*, a variable xw. Denote by X the set of these variables, and consider the algebra of commutative polynomials K[X]. We have a linear mapping ф: K(A) - K[X], wh+xJw/ 1), 1 t— 0. Let J be the ideal of K[X] generated by the polynomials ф(и ш r), u, v e A + . We shall see that the shuffle algebra is isomorphic with K[X]//. Actually, the isomorphism may be precisely described, by the use of the logarithm. As in Section 1.5, consider the complete tensor product = к<л>® к<л>, with the shuffle product at the left of ®, and the concatenation at the right. Then, as in Section 3.2, consider the element log(£u6?1* и ® u); we saw there that i°g( E u(x)u) =Ew®n1(w), \ueA* J »’ where лу is a degree-preserving linear endomorphism of Q<>4>, whose image is in ®(.4) (Lemma 3.8). The adjoint endomorphism Ttf of is completely defined by the equality E ^(w) (x) w = log! X u®ul, (6.2.1) w \иеЛ* / see (1.5.9).
130 6 Shuffle algebra and subwords Theorem 6.3 Considering К (A) with its shuffle structure, let f: -> be the K-algebra homomorphism defined by f(xw) = ft*(w), for any nonempty word w. Then Ker(/) = J, f is surjective, and K(Aj ~ K[X]/.^. Proof Let u, v be nonempty words. Then by Theorem 3.1(iv), we have (ttfiw), и ш v) = 0 for any word w, because nfw) is a Lie element. Hence, by duality (л*(и ш v), w) = 0, which shows that я*(и ш v) = 0. Observe that = f ° ’A; hence, we deduce f ° i/z(u ш v) = 0, and У Ker f. For the opposite inclusion, define L = and let v: -> L be the canonical projection. We show that the series £ H(w)w = £ v(xw)w, (6.2.2) weA* weA* is a Lie series in £<Л). Indeed, from Lemma 1.5, we have the identity 2 ° b(P) = r(P), for any polynomial P. We need the following facts: r(P) is a Lie polynomial, <5 = (id (x) a) ° <5, a(l) = 1, b(P) = (P, u ш v)u (x) v, Л(и (x) v) = |u|ur, and all these mappings are if near, homogeneous, and degree-preserving; see Section 1.3 and Proposition 1.8. The above identity means that for any word w £ (w, и ш г)|м| ua(v) = r(w). U, V In the sum, separate the term corresponding to v = l,u = w. Thus we obtain |w|w = r(w)— £ (w, и ш r)|u|ua(v), i since the terms with u= 1 vanish because |1| =0. The series (6.2.2) is therefore equal to £ vo|/z(w)(|w|-1r(w) - |w|-1 £ (w, и ш r)|u|ua(v)) W^l U, V # 1 = £ V°l//(w)\w\~ir(w) — T, w# 1 where T is T = £ v°^(w)|w|-1(w, u ш r)|u|ua(r) U, V, w # 1 = £ |u|ua(r)|iw|~ 1 v° I Yj (w,uinv)w u,v # 1 \w # 1 because (w, u ш r) / 0 => |w| = |ur|. Now, the second summation is equal to и ш v, so that by definition of./ and v, the series Tis equal to 0. This shows that (6.2.2) is a Lie series. Hence, by Theorem 3.2(iii), its exponential is
6.3 Subword functions 131 defined by a shuffle homomorphism KfA) -> L, that is, E v(xjw = log( X w #1 \ue A* / Applying the homomorphism fi ® id: и ® v —► fi(u)v, -> LfA), we obtain from (6.2.1) !°g( E PMuj = £М(Ф = \иеЛ* / w к because rcf(w) = f(xw). So we deduce that v = fi° f, which implies that Ker f c Ker v = It remains to show that f is surjective. We deduce from (6.2.1) and from the definition of the product in j/ (_ l)*-i f(xw) = nf(w) = £ --------WjUJ---UJWt, W = и 1 . . . Uk К where the sum is over к > 1 and u, / 1. Hence, (- l)fc ~1 f(xw) = w + E , к > 2 к W-Ui . . .Uk By induction on the length, each u, is in Im(/), hence so is their shuffle product. Thus we deduce that w is in Im(/). □ 6.3 SUBWORD FUNCTIONS A word и is a subword of a word w if и = ar ... an (n > 0, at g A) and if w = гоа1Г1«2 • • vn-ia„v„, (6.3.1) for some words in A*. The binomial coefficient (") is defined as the number of factorizations (6.3.1). Observe that if a is a letter, then a" ap the ordinary binomial coefficient. A function A* -»• 7L of the form w >—> is called a subword function. Denoting by A* the characteristic series of A*, i.e. A- = v. it is easy to verify that the subword functions are defined
132 6 Shuffle algebra and subwords by the shuffle product U LU A* = ). I Iw. weA* \ W/ Similarly, a simple verification shows that, if w = a^ ... an (n > 0, a, e A), then (1 + «i)(l + a2)... (1 + a„) = £ Пи. ueA* \ W/ We call Magnus transformation the homomorphism M from A* into the multiplicative monoid defined by M(a) = 1 + a, for any letter a in A. Then we have M(w) = £иеЛ» („)u for any word w in A*. Actually, let F(A) denote the free group generated by A; it contains A* as a submonoid. Since the series 1 + a are invertible in the Magnus transformation may be extended to a group homomorphism, still denoted by M: M: F(A) -> Z«4», a^\+a. For an element g of the group F(A) and a word и in A*, we denote (®) the coefficient of и in M(g). Thus W)= Z (6.3.2) ueA* \W/ We still call subword function a function F(A) —► Z of the form g i—► (®). We shall see that these functions have close connections with Lyndon words and the free Lie algebra. We call space of subword functions the subspace over Q spanned by all the subword functions on F(A). If a, are two functions F(A) -> <Q>, their (pointwise) product is the function ffl:F(A)^Q>, (ffl)(g) = a(g)f(g). This is the way the Q-algebra of functions is defined on F(A). We suppose that A is totally ordered and that A* gets the alphabetical order; Lyndon words are defined in Section 5.1 (see also Section 6.1). Theorem 6.4 The space of subword functions on the free group is a subalgebra of the Q-algebra of functions on the free group. It is generated by the particular subword functions gi—+1 j, и Lyndon word. \u/
6.3 Subword functions 133 Fig. 6.1 We shall see in the next section that the subword functions corresponding to Lyndon words are actually a free generating set of this subalgebra (Corollary 6.19). In order to illustrate the theorem, consider the following example. Example 6.5 Let a, b be two distinct letters and let w be a word in A*. Then one has w \Z w abj \a This relation is proved by seeing how a and ab are relatively located, as subwords of w; there are four cases, shown in Fig. 6.1. The above relation shows that the product of the two subword functions Q) and (*’), on the free monoid, is in the space of subword functions. It also shows that (X) may be expressed by a polynomial in the subword functions corresponding to the Lyndon words a, ab, aab (we suppose a < b). Let us call a function a: F(A) -> Q representative (or recognizable) if there exists a finite dimensional vector space E over Q, a right action of F(X) on E, a vector e in E and a linear function f on E such that for any g in F(/4) «(0) = f(eg). (6.3.3) Lemma 6.6 (i) Representative functions form a subalgebra of the Q-algebra of functions on F(A). (i) If a representative function vanishes on A*, then it is the zero function. (iii) Each subword function is representative.
134 6 Shuffle algebra and subwords Proof (i) If a', a" are as in (6.3.3), then (a' + «")(<?) = f(eg), where £ = £'©£" (with action under F(A): (x1 + x”)g = x'g + x"g), e = e' + e", and f(x' + x") = f(x') + f"(x"). Moreover, we have («'«" )(g) = f(eg), with £ = £' (x) £" (with action (x' (x) x")g = (x'g) (x) (x"g)), e = ё (x) e", and f(x ®x") = f(x')f"(x"). (ii) Each element g in F(A) may be written g = u0V! ги^ ... uk^1vk luk, with к > 0, u,-, Vj g A*. We prove that a(g) = 0 by induction on k. If к = 0, it is true by assumption. Let к > 1; let n be the dimension of £, where £,/, e are as above. By the Cayley-Hamilton theorem applied to the endomorphism x —► xv[ 1 of £, we have for some rational numbers rb..., r„ xv^n = r1xvi”+1 + r2xvi” + 2 + • • • + r„x, for any vector x in £. With x = cuqv""1, this is eu0Vi 1 = r^euQ + r2eu0v1 + • + r„euov"~1. Multiplying on the right by иг . .. vk fik and taking the image under f, we obtain »(g) = n«(w0«i • • • Vk'uk) + r2ct(u0vlul ... 4) + • • + '•na(uoi’i~1Hi • • • Л). By induction on k, the right-hand side is 0. Thus x(g) = 0. (iii) Let a(g) = (2) for some word и in A*. Let £ be the finite-dimensional subvector space of generated by the set P of words which are left factors of u. Let v: 0<<Л>> -> E be defined by v(S) = £reP (S, v)v. Note that v(ST) = v(v(S)T): indeed v(ST) = £reF (ST, v)v = Y»ep Yv = xy (S, x)(T, y)v. Since xyeP=>xeP, this is equal to £rep Ei =xy(v(S), x)(T, y)v = YveP(v(S)T,v)v = v(v(S)T). Define a right action of F(A) on £ by the formula: Xg = v(XM(g)) for any X in £, g in F(A). This is indeed an action, because (Xg)h = v(v(XM(g))M(h)) = v(XM(g)M(h)) = v(XM(gh)) = X(gh). Let e = 1 e £, and f:E-> О, X н-+ (X, u). Then by (6.3.2), ct(g) = (M(g), u). Since и is in P, this is equal to (v(M(g)), u) = (v(eM(g)), u) = /(eg). Hence, a is a representative function. □ Recall the notation w\I, for a word w of length n and a subset 1 of [и] = {1,..., и} (see Section 1.4). Given p words u15..., up of respective lengths ..., np, their infiltration product, denoted by ur | | up, is the polynomial “i !• • 4 “p = E НЛ» • • •
6.3 Sub word functions 135 where the sum is extended over all n < nr ч----+ np and all p-tuples of subsets of [n] such that [n] = (J i < j< p Л’ l;J = nJ for 7 = 1, • • •, n, and where w = w(f,..., Ip) is defined by w\Ij = ujt for j = 1,..., p. The infiltra- tion differs from the shuffle in that there may be overlappings between the Uj when they appear as subwords of w (we do not require the Ij to be pairwise disjoint). We call infiltration of ul,...,up a word appearing in their infiltration product. Each shuffle of u15..., up is an infiltration, with the same multiplicity, and each infiltration of u15..., up is either a shuffle, or of length < luj + • • • + |up|. Examples: ab | ac = abac + 2aabc + 2aacb + acab + abc + acb = ab ш ac + abc + acb, ab la = aba + 2aab + ab = abtna + ab. Lemma 6.7 Let g, x, у be words in A*. Then one has PYH = E СФ> w)(g\ (6.3.4) \Xj\y/ weH* \W/ Proof The formula is clear by inspection (compare with Example 6.5). Proof of Theorem 6.4 Consider the function a: F(A) -► <□, defined by , a / 9 \ 9 «(<?) = WV. E (xly,w)(g\ we A* \W/ By Lemma 6.6(i) and (iii), it is a representative function. It vanishes on A* by Lemma 6.7. So, by Lemma 6.6(ii), it is the zero function. In other words, (6.3.4) holds for any element g of the free group F(/l). This shows that the space of subword functions is closed under pointwise product, and proves the first assertion of the theorem. We show now, by induction, that each function (») is in the subalgebra M generated by the functions (?), I Lyndon word. There is nothing to prove if w is a Lyndon word, or if w = 1 (because (?) = 1). Suppose that w is not a Lyndon word. Then, by Corollary 6.2, there exists a nontrivial factorization w = xy such that each shuffle и of x and у is < w. Let к = (x ш у, w) > 0. Then, by the first part of the proof, PY^ = + \Xj\y J \W / ueA* \^z u # H' Since each infiltration of x and y, is either a shuffle of x and y, or has a length <|x| + |y| = |w|, we deduce that the summation is over words и with either |и| = |w| and и < w (alphabetical order), or with |u| < |w|. By induction, the corresponding functions (®) are in M', similarly, (?) and (?) are in M. Hence, so is (®), because к # 0.
136 6 Shufflle algebra and subwords 6.4 THE LOWER CENTRAL SERIES OF THE FREE GROUP If g, h are elements of any group, we denote as usual their commutator by (g, h), that is (g, h) = g~lh~lgh. For future reference, we recall the easily verified identity (f, gh) = (f, h)(f, g)((f, g), h). (6.4.1) Define the subgroups F„(.l) of F(A) recursively by FfA) = F(A), and + i(A) = subgroup generated by the elements (g, h), for some i, j with g e Ft(A), h e FfA) and i + j > n + 1. This sequence of subgroups is called the lower central series of the free group. We now define special elements of F„(4). For this, take a Hall set H in A* (see Section 4.1), and denote by the set of Hall words of length <n. For h in H, define recursively the element (h) of the free group F(A) by (Л) = h if h is a letter in A and, if h is of length >2, let h = h'h" be its standard factorization. Then (й) = ((hz), (h")). Observe that if n = \h\, then (A)eF„: this is shown by a straightforward induction. Recall that the Magnus transformation M\F(A) -> Z«4», and the subword functions F(A) -> Z have been defined in the previous section. Theorem 6.8 For each g in F(A), there exists a sequence (nh(g))helI of integers such that for any N > 1 9= П W-'”modFN+1(/l), (6.4.2) where the product is taken in decreasing order. For any N, the exponents nh(g) in this equality are unique. The functions nh: F(4) -> Z so defined form a free generating set of the algebra of subword functions. In order to prove the theorem, we introduce a binary relation depending on the fixed integer N, on the set of sequences s = h?) (6.4.3) of Hall words of length <N and their inverses, which are standard; that is,
6.4 The lower central series of the free group 137 for each i, either ht is a letter, or /if > hi+15..., h„, where h{ = hfi" is the standard factorization of /1, (see Section 4.1). Observe that if each /1, is a letter, or if the sequence s is decreasing, that is, hx > • • • > hn, then it is standard. Observe also that a subsequence of a standard sequence is standard. A rise of (6.4.3) is an index i such that hi < hi+l. An inversion of (6.4.3) is a couple (i,j) such that i < j and /i,- > /i . A legal rise is a rise i such that ^i + i ^i + 2, • • •» Л„. (6.4.4) Note that these notions coincide with those introduced in Section 4.1 by simply replacing (6.4.3) by (/i15..., hn). Let the sequence s in (6.4.3) have the legal rise i. Then we replace the elements hf1, hf+\ in s by another sequence, depending on the value of the exponents, thereby obtaining a new sequence s'. We write s -> s' hence defining the binary relation To simplify, write Л,- = h, hi+1 = k. We have several cases: the simplest one is when \hk\ > N; then h±r, k±Y is replaced by k±r, h±Y. (6.4.5) For the remaining cases, we suppose |/i£| < N. Then h, к is replaced by k, h, hk; (6.4.6) /i-1, к is replaced by k, (hk)~l, h~l. (6.4.7) For the remaining cases, let e (respectively o) be the greatest even (respec- tively odd) integer such that \hke\ (respectively \hk° |) is < N. Then h, k'1 is replaced by k~ \ h, hk2,..., hke, (hk°)~ \ ..., (hk)"1; (6.4.8) h~ \ k~1 is replaced by k"1, hk,..., hk°, (hke)~ \ ..., (hk2)"1, h"1. (6.4.9) Lemma 6.9 (i) The sequence s' is standard. (ii) There is no infinite chain s0 -> st sn -► •• • Proof We have h < k, because h, к is a rise of s. Moreover, if h is not in A, then h" > k, because 5 is standard. Also, by (4.1.10), к” > к hence k" > h. By (4.1.11), hk is a Hall word, and inductively, hkr is a Hall word for each r > 1, (hkr)" = k, and hkr < к by (4.1.10). We use these facts without reference in the sequel of this proof. The sequence s is the concatenation of the three sequences u,(h±1,k±1) and v; the sequence s' is the concatenation of u, x, and v, where x is one of the sequences replacing (h±1,/c±1) and given by eqns (6.4.5>-(6.4.9). Note
138 6 Shuffle algebra and subwords that if w is a word such that vv *1 appears in x, then w = h, k, or hkr~, к > w and if w ф A, w" > k. (6.4.10) (i) In order to show that s' is standard, we have to verify that if I, m are two words in s' with l ф A and m at the right of I in s', then I" > m. This is clear if I and m are in the sequences и or v, because s is standard. So we have three cases: (a) I is in u, m is in x—then I" > к because s is standard, hence I" > m by (6.4.10); (b) I, m are both in x—then Г > к > m by (6.4.10); (с) I is in x and m is in v—then к > m by (6.4.4) because h, к is a legal rise, hence I" > m by (6.4.10). (ii) We may assume that the alphabet A is finite. Then H<N is finite. Let E = {(h, k) | h,k &H<N,h < k}. Then E is finite. Order E by (hr, kr) > (h2, k2) if either kY < k2 or kr = k2 and hr >degh2, where >degh2 means either > |й2| or l/ij = |й2| and h1 > h2. Now order N£ lexicographically: then there is no infinite strictly decreasing chain in N£. We show the existence of a mapping v from the set of standard sequences into N£ such that s -> s' implies r(s) > v(s'). This will prove (ii). Define v(s\h k) to be the number of subsequences (h, k) in s; in other words, it is the number of inversions (h, k) in s (note that h < к because (h, k)e E). In view of eqns (6.4.5)-(6.4.9), we have v(s')(h k) = v(s)(hk) — 1. Suppose that I < m: we show that for each inversion (/, m) in s', not already in s, we have (/, m) > (h, k). This will imply r(s') < v(s). We have three cases. (a) I is in u, m is in x: then, since the inversion was not in s, we have by (6.4.10) m = hkr, r > 1; thus m < к and (/, m) > (h, k). (b) I, m are both in x: then, since in each replacing sequence (6.4.5)-(6.4.9), к appears only at the beginning, we have m k, hence m < к by (6.4.10). Thus (/, m) > (h, k). (с) I is in x and m is in v: since h, к is a legal rise of s, we have by (6.4.4) к > m. If m < k, then (/, m) > (h, k). Ifm = k, then I < k, hence by (6.4.10) I = hkr, r > 0; moreover, r > 1, otherwise the inversion (/, m) is already in s; hence, |/| > \h\, which implies I >deg/i and finally (/, m) > (h, k). □ Recall that in Section 4.2 we defined a homogeneous Lie polynomial Ph of degree |/i| for each Hall word h, and that the family (Рй)йен forms a basis of the free Lie algebra. If S is a series in and P a polynomial in Z<X>, we write S = P + 0(Л"+1), to express the fact that 5 — P is a series having no term of degree <n.
6.4 The lower central series of the free group 139 Lemma 6.10 (i) For a, b in A, one has (1 + «)-1(l + b)-1(l +«)(1 + b)= 1 + £ (- l)i+W[«, b]. (6.4.11) i,J>0 (ii) For each Hall word h of length n, one has M((h)) = 1 + P„ + 0(4"+1). (iii) If g 6 FN(A), then M(g) = 1 + 0(4N). (iv) For g e FN(A), let M(g) = 1 + P(g) + 0(4N+1). Then g^ P(g) is a homomorphism from FN(A) into the additive group of homogeneous Lie polynomials of degree N over Z. Proof (i) The left-hand side of (6.4.11) is equal to £ (- l)i+W )(1 + b + a + ab) i,j>0 / = E (- l)i+Jaibj + (-l)i+JalbJ+l i,j>0 i,J>0 + E (-1), + W«+ E (-l)(+7a'^ab. i.J>0 i,J>0 The sum of the first two summations is equal to Ei>o ( — The third summation may be rewritten E«>o (— 1 + Eu>o ( — )),+j+1a,bJ'ba. Hence, the whole sum is X ( —1)W + E (-l)‘«i+1+ E (-l)'+;+Wba i > 0 i > 0 i,j>0 + £ (—l),+7aibJab = 1 + E (~l)i+Jaibj(ab - ba), i,j>0 i.J>0 which is as required. (ii) Let h = h'h" a Hall word of length n > 2, written in standard factorization. Then, by induction, we have M((h')) = 1 + Ph + 0(4" + J) and M((b")) = 1 + Ph" + 0(4" +1), where ri = \h'\, n" = \h"| and n = n' + n". Then, M((h)) = M(((h’), (h"))) = M((h'yl(h")-l(h')(h")) = M((h’)) -1 M((h")) -1 M((h' ))M((h")) = (1 + Ph. + 0(4"' + 1))-1(l + Ph. + 0(/1"'+1))-1 x (1 +Ph. + 0(4" +1))(l + Ph„ +0(4" +1)).
6 Shuffle algebra and subwords 140 By (6.4.11), this is equal to 1+ z (-1)‘+чра, + 0(лп'+1)йрй., + 0(л""+1)У i,J>0 x [P„, + О(Л" +1), Ph„ + О(Л""+1)]. Observe that the term corresponding to i, j involves only words of length > in' + jn" + n' + n”. Hence, for i or j > 1, this term is O(An + J). This implies that М((Л)) = 1 + [P„, + О(Л" +1), ph.. + О(Л"" + 1)] + О(Лп+1) = 1 +[РЙ.Р^] + О(ЛП+1) = 1 + Ph + О(Лп+1), by definition of Ph (Section 4.2). (iii) This is clear for N = 1. For the general case, take g e Fh heFj with i +j > N. Then, induction and eqn (6.4.11) show that M((g,h)) = 1 + О(ЛЛ). Moreover, if M(gr), M(g2) are both of the form 1 + О(ЛЛ). then so is M(grg2 x). This proves (iii) by definition of FN. (iv) This is the consequence of a straightforward computation. □ Proof of Theorem 6.8 (i) Observe that if a standard sequence is not decreasing, then it has a legal rise, e.g. the right-most rise. This implies by Lemma 6.9 that each standard sequence may be rewritten, using the binary relation into a decreasing sequence. In particular, this is the case for any sequence of letters, with exponents +1. Now, define for the sequence s in (6.4.3), (s) = (h^...(hnr. We show below that if s -> s', then (s) = (s')mod FJV+1(X). (6.4.12) This will imply the existence of the expansion (6.4.2). Eqn (6.4.12) is obvious by definition of FN+l when (6.4.5) is applied, because uv = vu(u, v), (6.4.13) hence (h)±l(k)±l = (k)±l(h)±l modFN+l. When (6.4.6) is applied, (6.4.12) holds too, by (6.4.13). We also have v(u, v)~ lu~1 = vv~ lu~ lvuu~1 = lv, which implies that (6.4.12) holds when (6.4.7) is applied. Now, by (6.4.1), 1 = (u, vv~l) = (u, v~ x)(u, r)((u, v), v~l),
6.4 The lower central series of the free group hence 141 (u,v !) = ((u, v), v '(u,v) \ (6.4.14) Writing (uvn) for (.. .((u, v), v),..., v), we obtain from this identity, applied to (uvn) and v instead of и and v, ((uvn), v~l) = ((iw"+1), 1)~1(iw"+1)-1. Hence, by (6.4.14) again, (u, r"1) = ((uv), v~l)~l(u, r)-1 = (uv2)(uv2, v~ l)(uv)~1 = (uv2)(uv3, v~i)~l(uv3)~ fuv)'1 = (uv2)(uv4')(uv4',v~1)(uv3)~1(uv)'1 = = (uv2)(uv4)... (uv2n )(uv2n, v ' ^(ur2" ~1) ~1 ... (uv3)'1 (uv) ~1. (6.4.15) Thus, by (6.4.13) uv1 = v~ lu(uv2)(uv4)... (uv2n)(uv2n, v~1)(uv2n~1)~1 ... (uv3)~ l(uv)~1. The latter identity shows that (6.4.12) still holds when (6.4.8) is applied. For (6.4.9), one argues similarly, using the identity u~lv~l = v~l(u, v'r)~ lu~l = v~ l(uv)(uv3)... (uv2n~ 1)(uv2n, v~ J)~ l(uv2n)~ 1 . . . x (uv4)~ l(uv2)~ lu~ \ by (6.4.15). (ii) We prove at the same time uniqueness of the exponents nh(g), and the fact that nh belongs to the space of subword functions. This will be done by induction: assume the result is true for N — 1 (N > 1). We know that an expansion (6.4.2) exists. By definition, FN+ i is contained in FN, so that (6.4.2) implies g = П mod fn- By induction on N, we know that the exponents nh(g) are unique and that the functions g h-+ nh(g) belong to the space of subword functions, for \h\ < N — 1. Apply the Magnus transformation M to (6.4.2), using the identity (Lemma 6.10(ii)): M((h)) = 1+Ph+Th,
142 6 Shuffle algebra and subwords where Th = О(Л|Л|+1). By Lemma 6.10(iii), the image under M of both members of (6.4.2) coincide up to words of length N; let us express this with the symbol =N: М(й) =„ п *))" = п (1 + Л + г»)"' heH.\ heH<H = 111 ("‘\а + nr heH.\ i>0 \ I / (Hl \ (Hi. \ 1 ‘ (Л, + Л,)" (П. + ll / \ Ik / where the second sum is over all hx > • • • > hk, Hall words of length <N, and integers i15..., ik > 1. Denote by M(g)N the homogeneous part of degree N of M(g). We obtain M(g)H= E I ЕМ-M*- (64-16) |*|=N k>0 Vi / \ h / where the second sum is subject to the further condition that l/ij,..., \hk\ < N, and where * is a polynomial of degree N depending solely on ...,hk and ip...,/*. We know by Theorem 4.9(i) that the polynomials Ph are linearly indepen- dent. Fix h0 of length N; then there exists an homogeneous polynomial Q of degree N such that (Pho, Q) = 1 and (Ph, Q) = 0 for any other h of length n. Take the scalar product of Q with the last identity. We obtain that nho is equal to (M(g), Q) plus a linear combination of products nhl ... nhj, with \ht\ < N. Observe that in the previous computation, only M(g) and the exponents nh depend on g: this proves uniqueness of the functions nh. Moreover, by (6.3.2), induction, and Theorem 6.4, we deduce that nh belongs to the space of subword functions. (iii) Equations (6.4.16) and (6.3.2) show that the subword functions belong to the algebra generated by the functions nh. Hence, the latter functions generate the algebra of subword functions. They generate it freely, because for any finite subset H' с: H, one can find, by (6.4.2), an element g in F(X) such that nh(g) takes, for h in H’, arbitrary values in Z. □ Remark 6.11 In practical computations, it is useful to add to eqns (6.4.5)- (6.4.9) the rule h, h"1 or Л"1, h is deleted in s. (6.4.17) Indeed, Lemma 6.9 still holds, as is easily verified (for (ii) one has to add to the vector v(s) the length of s as a new component, at the extreme right). Moreover, it is clear that rule (6.4.17) does not change (s), with the notations of the proof of Theorem 6.8.
6.4 The lower central series of the free group 143 Example 6.12 Take the Hall set of Example 4.6, g = bab1. and N = 4. We only need to know the inequalities b > ab2 > a2b2 > ab3 > ab > a > a2b. Then, by using at each step the right-most rise, we have Hence, we have (b, a, b"1) -► (b, b~l, a, ab2, (ab3)~ \ (ah)~l) by (6.4.8) -> (a, ab2, (ab3)"1, (ab)"1) by (6.4.17) -> (ab2, a, a2b2, (ab3)-1, (ab)-1) by (6.4.6) -> (ab2, a2b2, (ab3)- \ a, (ab)-1) by (6.4.5) -> (ab2, a2b2, (ab3) - \ (ab)"1, a, (a2b)~1 ) by (6.4.8) bab 1 = (ab2)(a2b2)(ab3) fab) га(а2Ь) ^odFj. Corollary 6.13 (We use the notation of Theorem 6.8.) There exist nonnegative integers kh u (b e H, ueA*, 1 < |u| < |b|) such that for any g in F(A) и \U/ (6.4.18) Proof In view of Theorem 6.8 and Lemma 6.6 it is enough to prove (6.4.18) when g is a word in A*. Observe that when dealing with standard sequences in A*, the only rule of the rewriting system -» which is used is , . . ... fk,h,(hk) if\hk\<N, h, к is replaced by< И }[k,h if \hk\>N. We shall use a modified version of this rewriting system, which works on labelled standard sequences, i.e. sequences of the form S = ((hl,El),...,(hn,En)), (6.4.19) where s = (hx,..., hn) is a standard sequence and each £, is a subset of M. If hh hi+l is a legal rise of s, then we define 5' = (..., (/!,•_!,£,•_!), (hi+l,Ei+1),(hi, Ei),(hihi+l, £,• u £1 + 1), (hi + 2,Ei + 2),...), (6.4.20) where the term with A,b1+1 has to be omitted if this word has length >N. Then S' is still a labelled standard sequence, and we write 5 -» S'. It is easy to verify that -> is confluent, because legal rises do not overlap (cf. proof of Theorem 4.3(i)). Then one shows that the reflexive and transitive closure of -> is also confluent (cf. proof of Theorem 4.3(i)) and that there is no infinite chain So Sj S3... (cf. Lemma 6.9(ii)). Thus, for any S, there is
144 6 Shuffle algebra and subwords a unique final S', i.e. a sequence such that S S' and that S' -> S" for no sequence S". We write S' = f(S). Now, let и = ar ... an e A+ (a; e A, n > 1) and 5 = ((fli, (a2, {i2}),..., (a„, {i„})), where i15..., i„ are distinct numbers. With f(S) given by (6.4.19), let kh,u = |{i | 1 < i < n, ht = h and £f = {i\,..., in}}|. Observe that khu is well defined, i.e. does not depend on the sequence i15..., in of distinct numbers. Observe also that by definition of f(S) and of we have |£,-| < |/i,|, so that kh u # 0 implies |u| = n < |/i|. We show that (6.4.18) holds. For a sequence 5 as in (6.4.19) and £ с N, define S|£ to be the sequence obtained by keeping only those i with £; c £. We claim that if 5 T, then S|£ T\E. Indeed, we may suppose 5 -► S’, that hf, hi+l is a legal rise of 5 and that S' is given by (6.4.20). Then either £,• and Ei+1 are both contained in £, so that £,• u £i+ j is too, hence S\E -> S’\E; or one of £, or £i+1 is not contained in £, so that neither is £;u£i+1, and S|£ = S’\E. From the claim, we deduce that for any word w = a^ ... an, Ec{l,2,...,n] and S = ((«i, {1})’ • • • ’ («„’ {"}))’ we have /(S|£)=/(S)|£, because the underlying standard sequence of f(S) is decreasing, hence so is that of/(S)|£. Recall the notation w\E, defined in Section 1.4. Then we have nh(w) = number of (h, £) in f(S), with E {1,..., n} = E E number of (h, E) in f(S)\E ueA* w|£ = u = E E number of (h, E) in f(S\E) ueA* w|£ =u = £ Z Z □ ueA* w|£=u ueA* \H/ Corollary 6.14 (i) For geF(A) and heH, the number nh(gn) is a linear combination over Z of ("), 1 < i < \h\. (ii) For gx, g2 6 F(A), and heH, the number п^д^^1) is a polynomial over Q in the numbers n^g^, nh2(g2), /ц, h2 e H. Proof (i) We have, by Corollary 6.13, nh(g”)= E kh,u(g\ 1 < |u| < |h| \U /
6.4 The lower central series of the free group 145 Let M(g) = 1 + T with (T, 1) = 0. Then M(g") = Ei>o (")T‘ and (0"') = (M(9").“)= 1 ("V',U)= E f"V',u), \U / i>O\l/ 0<i<|u|M/ hence the result follows. (ii) This has a similar proof. Corollary 6.15 An element g in F(A) is in Fn(A) if and only if M(g) = 1 +0(Л"). In this case, nh(g) — 0 for any Hall word h with \h\ < n — 1, and M(g) = 1 + P + 0(Л"+ 1)/or some homogeneous Lie polynomial P of degree n. Proof The direct part follows from Lemma 6.10(iii). Conversely, suppose that M(g) = 1 + 0(Л"). By Theorem 6.8, we have g = П|я| < л-i (h)nh{e) mod F„(X). Let |h| < n — 1: by Corollary 6.13, we have nh(g) = Ei <|U| <:«-1 fch,u(u) = °, by hypothesis. Hence g = 1 mod. Fn(A). The last assertion follows from Lemma 6.10(iv). □ Corollary 6.16 The group Fn + 1(A) is contained in F„(A), and F„(A)/Fn + 1(A) is an abelian group, freely generated by the classes of the elements (h), for h a Hall word of length n. This group is canonically isomorphic with the ^.-module of homogeneous Lie polynomials of degree n. The dual group (Fn/Fn+l)* is freely generated by the functions nh, for h a Hall word of length n. Proof That Fn+ 1(Л) c Fn(A) is an immediate consequence of our definition of Fn. If g,he Fn(A), then (g, h) g Fn+ fA), hence gh = hg mod Fn+ fA), which shows that Fn(A)/Fn+ fA) is abelian. Each element g in F„(X) has by Theorem 6.8 and Corollary 6.15 a unique expansion в = П (M"h<9)modFn+1, |Л| =" which shows that the classes of the (h) (respectively the nh) such that \h\ = n freely generate Fn/Fn + l (respectively its dual group). The assertion about Lie polynomials follows from Lemma 6.10(ii), (iv): the mapping g P(g) defines an isomorphism between the two /-modules, because the basis ((h)) of Fn/Fn+ j is mapped on to the basis (Ph) of the space of homogeneous Lie polynomials of degree n. □ Take now as a Hall set the set L of Lyndon words. Corollary 6.17 The dual group (Fn(A)/Fn+l(A))* is freely generated by the subword functions (9, I a Lyndon word of length n. Proof By the previous proof, Fn/Fn+l is isomorphic with <fn, the /-module
146 6 Shuffle algebra and subwords of homogeneous Lie polynomials of degree n, via the mapping g P(g) of Lemma 6.10(iv) with N = n. Now, the dual group of Tn is freely generated by the linear functions P h-+ a(, where a, is the coefficient of P when expressed in the basis (P(), / a Lyndon word of length n. Since Pt = l± greater words (Theorem 5.1), we deduce by triangularity that JSf* is also generated by the functions P (P, /). To conclude, we observe that (M(g), I) = (?), by definition of the subword functions. □ Corollary 6.18 For any finite set L' of Lyndon words and any sequence (ai)ieL' °f integers, there exists an element g in F(A) such that 9 I = Otl, VI eL’. Proof Let n be the maximum length of the elements in L. By induction on n, there exists an element gx in F(A) such that: (?) = az for any / in L of length <n. By Corollary 6.17, there exists an element g2 of F„(4) such that (?) = — (?) f°r any I in L' of length n. By Corollary 6.15, we have M(g2) = 1 + 0(4"). Let g = gvg2- Then for / in L = (W), /) = (M(9l)M(g2), I) = E (M(^i)’ w)(W2), v) I — uv = (М(д,'),Г)+ £ (M(91),u)(M(92),i.). I — uv М>л If \l\ < n - 1, the second sum is empty and (?) = (М(^/), /) = (»/) = az. If \l\ = n, then the second sum reduces to (M(g2), I), hence (?) = (?) + (9Z2) = az. □ Corollary 6.19 The subword functions on F(A) (respectively on A*) cor- responding to Lyndon words are algebraically independent in the algebra of functions over F(A) (respectively 4*). Proof Let P(xt) be a polynomial in the commuting variables xh I g L. Define Ф(д) = P(ff)) for any g in F(A). We have to show that Vw g 4*, <D(w) = 0 => P = 0. Now, by Lemma 6.6, Ф is a representative function on F(4), vanishing on 4*, so Ф(д) = 0 for any g in F(4). Let L' be the finite set of Lyndon words such that %,, I g L', appears in P, and let (az)Zgr be any sequence of integers. Then, take g as in Corollary 6.18; hence, 0 = Ф(д) = P(az). This implies that P vanishes for any integer choice of the variables. Hence, P = 0. □
6.5 Appendix 147 Corollary 6.20 Suppose that A is finite. A series S in 2«Л» with constant term 1 is in the A-adic closure of M(F(A)) if and only if for any words x, у in A*, one has (S, x)(S, y)= £ (x j y, w)(S, w). we A* Proof The conditions are necessary by Lemma 6.7. Conversely, let S satisfy these conditions. Let n > 0. Then we may find g e F(A) such that (?) = (S, I), for any Lyndon word / of length < n (Corollary 6.18). An argument similar to that used in the proof of Theorem 6.4 shows that, in fact, we then have (5, w) = (®) for any word w of length <n. Hence, S and M(g) coincide up to words of length n. This shows that 5 is the limit of a sequence of elements in М(Р(Л)). □ 6.5 APPENDIX 6.5.1 Lie polynomial basis of the shuffle algebra Let L be any basis of the free Lie algebra over Q. Then L is a free generating set of the shuffle algebra 0<Л> (Perrin and Viennot 1981). We may assume that the alphabet is finite. Then the space of polynomials of degree < n is of finite dimension; since the scalar product (, ) is positive definite, Theorem 3.1(iv) implies that each polynomial P of degree n, with (P, 1) = 0, may be written P = Q + £ * u ш r, where Q is a Lie polynomial and u, v are nonempty words with |u| + |v| < n. By induction on n, we conclude that P may be expressed as a linear combination of shuffle monomials in the elements of L. A counting argument, using the Witt formula (Corollary 4.14), shows that these monomials are linearly independent. Let Vn denote the subspace of 0<Л> generated by the shuffle products of n Lie polynomials. The previous result shows that there is a direct sum а<л> = ф v„. л>0 6.5.2 A shuffle subalgebra Let L be a subset of Л* which contains each left factor of each word in Ц then the shuffle subalgebra of О<Л> generated by L is a free commutative algebra, over some subset of L. The main step to this result is the proof of the following lemma: if V c A* and ueA* are such that V u {u} contains
148 6 Shuffle algebra and subwords all its left factors, and if и is algebraically dependent on V in the shuffle algebra, then и is in the shuffle subalgebra generated by V. Hint: apply the derivation Qi-'+Qa-1 of the shuffle algebra (cf. (1.4.3)) to an algebraic dependence relation of и on V, for some letter a, and use induction. A particular case of the previous result is when L is the set of factors of a given word w = a{ ... an (af g A). Define for each letter a a square matrix <pa of order n + 1 by ((ра){ i+l = 1 if a{ = a, and (cpa)i j = 0 otherwise. The dimension of the Lie algebra generated by the matrices <pa is equal to the transcendence degree of the shuffle algebra generated by the factors of w (Reutenauer 1985b). 6.5.3 Homomorphism Let Ml be an algebra over K, and p: KfAy -> End(M) a homomorphism from the concatenation algebra into the algebra of linear endomorphisms of M. Suppose that ffla) is a derivation of M for any letter a. Let (p: M -> К be an algebra homomorphism. Define a mapping p: M -> К ((A}} by p(m) = J (p(p(w)(m))w. weA* Then p is a homomorphism of K-algebras, from M into the shuffle algebra Kf(A')') (Fliess 1981, Proposition III.l). A particular case of this result is when A has only one letter a. Then the shuffle algebra К<Л> is isomorphic with the algebra К [[a]], via the isomorphism £n>0 an«" £n>o (ап/и!)а" is supposed to contain Q). Then, if A is a linear differential operator of K[x15..., x„] and a15..., a„ elements of K, the mapping K[xi,...,x„] -> zH->exp(aA)z|Xi = ai, is a homomorphism of K-algebras (see Grobner 1967, pp. 16 17). 6.5.4 Causal analytic functionals Let A = {«j,..., am}, and (ui)l^i^m a family of piecewise continuous functions [0, T] -> (R. For each word w and t e [0, T], define the iterated integral J'o dw recursively by J'o dw = 1 if w is the empty word and, if w = uah then J'o dw = J'o (Jo du)u;(s) ds. Note that this definition agrees with the definition in Section 3.1 when a,(t) = J'o u,(s)ds. Let 5 g such that |(S, w)| < C|w|!r|w| for some constants C and r. Define y(t)= X I dvv’ (6.5.1) weA* Jo
6.5 Appendix 149 which is a convergent series. Then the functional (ub ..., u„) у is called a causal analytic functional, with generating series S. The product of two such functionals corresponds to the shuffle product of their generating series (Fliess 1981). The proof is similar to that of Corollary 3.5, using integration by parts. Among these functionals, there is the special class of those which correspond to a differential system of the form m I 4(0 = E ихолм, • = i ? (6.5.2) y(t) = h(q), J where q(t) belongs to an analytic variety Q over R, and where the vector fields Л15..., Am, and the function h: Q -> R are analytic in a neighbour- hood of q(0). The corresponding generating series 5 is of finite Lie rank, i.e. the vector space {/’ 5 | P g 5T(A)} is finite dimensional, where P S = Е»<=л* (S, wP)w, i.e. 5 h-+ PS is the adjoint of the right multiplica- tion by P (Fliess 1983). The finiteness of the Lie rank is equivalent to the following condition: 5 belongs to a finitely generated shuffle subalgebra of R«4», closed under the operations Tt—>T°P (Pg (R<X>) and closed in the Л-adic topology (Reutenauer 1985«). 6.5.5 Differential algebra A special case of system (6.5.2) is the case where Q is a finite-dimensional vector space, and At,h are linear. Such a system is called bilinear in control theory. It corresponds to the case where the generating series S is recogniz- able (see Section 1.6.8). Let A = {u15..., am} and R{u} = R{uH..., um} the (R-algebra of differen- tial polynomials, i.e. the algebra of (commutative) polynomials in the variables ul,...,um and their formal derivatives u\, u'{,.... Consider the algebra M = R«/l»® R{u}, with the shuffle structure on R<</1>>. It is isomorphic with (R{u})<4>, with its shuffle structure, which is isomorphic with an algebra of formal power series in (infinitely many if m > 2) commuta- tive variables, by Theorem 6. l(i). In particular, M is without zero divisors, and we may form its field of fraction K. The algebra M (hence the field K) becomes a differential ring if one defines as derivation the unique derivation D extending that of R{u}, and which is defined on R<<4>) by m D(S) = E (Sarl)®uh (6.5.3) i= 1 where Sa-1 = Е»<=л* (S, wa)w. Observe that eqn (6.5.3) is motivated by the
150 6 Shuffle algebra and subwords functional intepretation (6.5.1): there, the derivative of у is given by m / (*t \ y(t) = X ( E (s>wai) dwjUi(t). i = 1 \weA* Jo / The differential field К has no constants others than R, i.e. if a g К and D(a) = 0, then a e R. This may be shown by proving first a lemma in the shuffle algebra, interesting in itself: if Sa~1 ш T = 5 ш Ta~1 for any letters a, then 5, T differ by a multiplicative constant. The result on constants in К allows to apply the Picard-Vessiot theory of linear differential equations (see Kaplansky 1957). It is shown in Fliess and Reutenauer (1983) that recognizable series satisfy a linear differential equation with coefficients in R{u}, which has all its solutions in K. Furthermore, the Galois group of the splitting field of this equation, and its Lie algebra, are characterized via the syntactic algebra of the series. * For the differential algebra approach to control theory, see Fliess (1989). 6.5.6 Subword order Write и < v if и is a sub word of v: this is the subword order. A theorem of Higman asserts that if the alphabet is finite, then each set of pairwise incomparable words is finite (see Lothaire 1983, Theorem 6.1.2). The Mobius function of this order has been computed by Bjorner (1989); it uses a variant of the binomial coefficient („) of Section 6.3. Define („)„ to be the number of factorizations (6.3.1) such that: Vi = 0,..., n, \/aeA, A*aaA* and г(^я,Л*. Then the Mobius function fflu, w) is equal to (— l)|u| + |w|(u)„. In other words, for u / w, one has E« < <> < » F(v-> w) = ® = Ци<»<» F(u-> r)- shown in Bjorner and Reutenauer (1992) that the series (w \ j и ® w It / n is rational. 6.5.7 Recognizable subsets of Г(Л) A subset L of F(A) is recognizable, or representative, if for some finite group G and some homomorphism <p: F(A) -> G, one has L = (p~v(p(L). In this case, we say that L is recognized by G. The family of subsets of F(A) which are recognized by finite p-groups (respectively nilpotent groups) is equal to the boolean algebra generated by the particular subsets g e F(^) I = i mod p и/ (u g Л*, i > 0)
151 6.5 Appendix (respectively the particular subsets g g F(/l) g \ , J = i mod n и/ (u g Л*, n > 1, i > 0). This may be shown by using the Magnus transformation, considered with coefficients in Z/pZ (respectively Z/nZ), and mod O(Ad+ 1) for suitable d, to show one inclusion. For the other, use Theorem 6.8, Corollary 6.13, and the following result, proved similarly to Lemma 6.6: the function is an N-linear combination of subword function (“'). This result allows one to reduce the knowledge of n = („) mod pk to that of n \ , . mod p, .P'J because n \ л . | = и,- mod p, P'J if n = £ n.p1 is the p-adic expansion of n (the latter congruence is obtained by expanding (1 + x)" in characteristic p). 6.5.8 Quotients of the lower central series and free Lie algebra Let (F„)„>i be the lower central series of the free group F over A and consider the set gr(F) = £„> i Fn/Fn+ P Then gr(F) becomes a natural structure of graded Lie algebra over Z. Indeed, FnFn.{ is an abelian group, hence a Z-module. Now, let x e F„/F„+1, у e Fp/Fp+15 respectively represented by f e Fn, g e Fp. Then (/, g) e Fn + p and formula (6.4.1) shows that the class mod Fn + p+ j of (/, g) depends only on the class of g mod Fp+l, i.e. on y. A symmetric argument shows that (/, g) depends only on x. Hence we have a well-defined mapping (Fn/Fn +1) x (FP/FP +1) Fn + P/Fn + P+^ (x, у) н-» [x, y]. This mapping is Z-linear, in view of (6.4.1). Extend this mapping to gr(F) by linearity. Since (/,/) = 1 and (g,f) = (f,gY\ we have [x. x] = 0 for any x in gr(F). Now, the Jacobi identity is a consequence of the following identity, where fe denotes g~\fg:
152 6 Shuffle algebra and subwords From this, one may deduce that the mapping g i—»P(g) of Lemma 6.10(iv) induces an isomorphism from gr(F) onto ^(Л), and give another proof of Corollary 6.15. In particular, gr(F) is the free Lie algebra over Z; see Serre (1965). This method is actually the original proof of Witt (1937); see also Lazard (1954) and Bourbaki (1972). 6.5.9 Image of the Magnus transformation on A* The following result (Ochsenschlager 1981; see also Lothaire 1983, Theorem 6.3.22), characterizes the image of the restriction of the Magnus transforma- tion to A*: a polynomial P in М<Л> is in M(A*) if and only if for any words x, у one has (P, x)(P, y) = X (* 1 b H’)(p’ w)- weA* In contrast to Corollary 6.20, no closure is needed here. 6.6 NOTES Theorem 6.1 is due to Radford (1979), who proved combinatorially the triangular identity in the statement. The first assertion was also proved by Perrin and Viennot (1981). The proof given here follows Melancon and Reutenauer (1989). It is not true in general that any Hall set freely generates the shuffle algebra. Corollary 6.2 is due to Chen et al. (1958). Theorem 6.3 and its proof are due to Ree (1958); note the unusual role played by the logarithm in this proof (as Ree observes). We have borrowed the terminology ‘subword function’ and ‘binomial coefficient’ from Eilenberg (1976); the extension of this terminology to the free group is done via the Magnus transformation. Another approach is to take the free differential calculus of Fox (1953). Theorem 6.4 is due to Chen et al. (1958); they introduced the infiltration product and proved Lemma 6.7. Lemma 6.6 follows Melancon and Reutenauer (1993). See Chapter VI of Lothaire (1983) for more on subwords. The commutator calculus of Section 6.4 has its origin in a paper of P. Hall (1933); he essentially proved eqn (6.4.2) of Theorem 6.8, by the use of his ‘collecting process’; this process generates particular Hall sets: those where the order is compatible with the length (cf. the discussion in Section 4.5). See also P. Hall (1957) and M. Hall (1950, 1958, Chapter 11), where uniqueness of the exponents in Theorem 6.8 is also proved, for these particular Hall sets. Here, we work with the general Hall sets, as generalized by Viennot (see Section 4.5); the corresponding group commutator calculus was developed
6.6 Notes 153 by Melancon (1991), and Melancon and Reutenauer (1993); Gorchakov (1969) already gives some of the results (see also Ward 1969). The algorithm presented here is a generalization of the collecting process of P. Hall, with ideas of M. Hall (1959), Schiitzenberger (1958), and Melancon and Reutenauer (1989) where standard sequences of Lyndon words are introduced. Lemma 6.10 is due to Magnus (1937). Corollary 6.13 is due to Therien (1983), in the case of the particular Hall sets (see above) and when g is a word in A*; its generalization to general Hall sets, and to the free group follows Melancon and Reutenauer (1993). Corollary 6.14(i), which is an immediate consequence of Corollary 6.13, is related to an identity of P. Hall (1933; see also Magnus et al. 1976, Theorem 5.13B). Corollary 6.15 is due to Magnus (1937) and Witt (1937); they also proved the existence of the canonical isomorphism between Fn/Fn+l and the Z-module of homogeneous Lie polynomials of degree n (Corollary 6.16). Corollaries 6.17-6.20 are all due to Chen et al. (1958). For applications to the Burnside problem and the theory of p-groups, see Magnus et al. (1976, Chapter 5) and M. Hall (1959, Chapters 12, 18).
7 Circular words There are many links between the free Lie algebra and circular words; the most immediate is the equality of the homogeneous dimension of the former, given by the Witt formula, and of the number of primitive necklaces. The aim of this chapter is to study circular words as an end in itself. We begin by computing the number of primitive necklaces, and of necklaces. Then, we describe the bijection between primitive necklaces and Hall words. In the next two sections two efficient algorithms are described; the first generates Lyndon words up to a given length, and the other computes the factorization into Lyndon words of a given word. The decreasing factorization of a word into Hall words provides a bijection between words and multisets of primitive necklaces; this bijection depends of course on the chosen Hall set. In the final section we give another bijection, which leaves invariant the associated permutation, and which has applica- tions in the study of the various symmetric functions related to the free Lie algebra. 7.1 THE NUMBER OF PRIMITIVE NECKLACES We say that two words u, v in A* are conjugate if for some words x, y, one has и — xy and v = yx. The relation ‘u and v are conjugate’ is an equivalence relation, called conjugation. An equivalence class is called a conjugacy class, a circular word, or a necklace. Geometrically speaking, a necklace is a regular и-gon in an oriented plane whose vertices are labelled in Л; two such n-gons are considered as identical if they may be superposed by applying a rotation, a translation and a homothety (see Fig. 7.1). A necklace is called primitive if no nontrivial rotation leaves it invariant; a word is called primitive if its conjugacy class is a primitive necklace. More generally, a necklace has always a smallest period d, dividing its length n, and in this case the corresponding conjugacy class has d elements, and is of the form C = {u”/d,..., unJd}, where {tq,..., ud} is a primitive conjugacy class. We say in this case that each word w in C has period d and exponent n/d (see Fig. 7.2).
7.1 The number of primitive necklaces 155 ababb babba abbab bbaha babab Fig. 7.1 The conjugacy class of ababb. Theorem 7.1 Let A = {a1?..., aq} be an alphabet with q elements. The number of primitive necklaces of length n is 1 E Kd)qn/d. П d\n The number of primitive necklaces having щ occurrences of the letter a^ (i = 1,..., q) is (7.1.1) (7.1.2) 1 У (d)[ nld nd|^, \njd,..., nq/d) ’ with n = щ + • • • + nq. Here, p and ( ) stand, respectively, for the Mobius function and the multinomial coefficient.
156 7 Circular words Theorem 7.1 may be proved directly, but we prefer use the formalism of generating functions, which will also be useful in the sequel. Let xb ..., xq be commutative variables. The evaluation of a necklace и is the monomial x"1 ... xq4 in Q[x1?..., xj if и has occurrences of the letter ah for i = 1,..., q. The evaluation of a word is defined similarly. If L is a set of necklaces (respectively of words), then the generating function of L is the sum of the evaluations of the elements of L. For example, the generating function of the set of primitive necklaces of length 2 is Z i<J because each such primitive necklace has a unique representative of the form 0,0^, i < j. Define the nth power sum pn to be the symmetric polynomial p„(xi,...,xe) = x'; + --- + xj. Theorem 7.2 The generating function of the primitive necklaces of length n is 1 X pWpT- (7.1.3) П d | n Proof Let Ce(n) denote the set of words of length и and period e. We have a partition of the set An of words of length и A" = U e| n Let P(e) denote the set of primitive words of length e. The mapping и н» unle, P(e) - Ce(n), is a bijection. Observe that each word in P(e) has e conjugates, each with the same evaluation, and that er(u"/e) = er(u)|Xi_x?,e, where ev denotes the evaluation. Moreover, the generating function of An is clearly (Xj + • • • + xq)n = p"(xi, • • •, xq). Thus, denoting by /e(x1?..., xq) the gener- ating function of the primitive necklaces of length e, we have rt(x„ ..., x,) = £ ele(x"C, • , <')• <71-4> e | n Note that pd(xl,..., xq) = pfx*, , xq). So we have - E • • • ,xq) = - £ n(d)p"l,d(xdl, ...,xd) n d | n П d | л = 1 X M(d) X ................xj"), П d | л e| n/d by (7.1.4) with n replaced by n/d and x, by xf.
7.1 The number of primitive necklaces 157 Since d\n and e\n/d is equivalent to e\n and d\n/e, this is equal to - E e/e(x"/e’---’x«/e) E /*00- 71 e | n d | n/e The second sum vanishes, unless n = e in which case it is 1, so that the whole expression is equal to l„(xb ..., xq). □ Proof of Theorem 7.1 In formula (7.1.3), put xb ...,x = 1; this gives formula (7.1.1). Moreover, the number of primitive necklaces with n, occurrences of the letter at (i = 1,..., q) is the coeffcient of x"1 ... xnqq in (7.1.3); this coefficient is (7.1.2), because p^Xj,...,^) Г E f W’.-.x^’. □ ri + • • • + rq = n/d vp • • » rqJ Corollary 7.3 The generating function of the necklaces of length n is - E <p(d)Pdd, П d\n where (p is the Euler function. The number of necklaces of length n on an alphabet with q letters is 1 E 4>(d)qn/d. П d\n The number of necklaces having П; occurrences of the letter ahi = 1,..., q, is 1 E П d | л, / n/d Xnjd,..., nq/d. Proof For each conjugacy class C of words of length n, there is a unique conjugacy class C of primitive words of length dividing n, n/e say, such that C = | и e C'}. This implies the following identity between the generating functions kn (respectively /„) of necklaces (respectively of primitive necklaces): k„(x1,...,x,)= E U(xi’• • • ’-x«)- е|л By Theorem 7.2, we deduce
158 7 Circular words kn(xl,...,xq)=Y-iT E е|л П/e f\n/e = E - E e | n П ef\n = 1 E pnAxi’- • ->*q) E ft d\n e\d \ej = - E ft d [п because p/xl,..., xq) = pe/(Xi,..., xq) and <p(d) = E e\d ep(d/ef as is well known. In order to obtain the other two formulas, we follow the proof of Theorem 7.1. □ 7.2 HALL WORDS AND PRIMITIVE NECKLACES Conjugacy of words is an equivalence relation, which preserves primitivity and periodicity (see Section 7.1). Observe that if a set H of words is a set of representatives of the primitive conjugacy classes, then the set {hn | heH,n > 1} is a set of representatives of all the conjugacy classes of positive length. The following result is a particular case of a theorem of Schiitzenberger (1965). Theorem 7.4 Let H be a subset of A*, with a total order <. Suppose that each word in A* has a unique factorization Iii .. .hn,n>0,hiE H,hx > • • > hn. (7.2.1) Then H is a set of representatives of the primitive conjugacy classes. Recall that Hall sets are defined in Section 4.1. Then the previous theorem and Corollary 4.7 immediately imply the next result. Corollary 7.5 Let H be a Hall set in A*. Then each word in H is primitive, and each nonempty word is conjugate to a unique word of the form hn, he H, n > 1. For the proof of Theorem 7.4, observe that a series of the form [5, Г] = ST — TS is a linear combination (possibly infinite) of polynomials uv — vu, u, v e A*.
7.2 Hall words and primitive necklaces 159 Proof of Theorem 7.4 Let X = {xh | h g H} be an alphabet in bijection with H. Then, an easy extension of the Baker-Campbell-Hausdorff formula (Corollary 3.4) shows that the series log I f ] eXh I \heH J is a Lie series. It is easy to see that its component of degree 1 is Ehew xh. Thus, the series log (fl eXh)- E xh (7.2.2) \heH / heH is a Lie series without component of degree 1, and is therefore an infinite linear combination of polynomials of the form uv — vu, u, ve X*. Since each word in A* has a unique factorization (7.2.1), we have in О«Л» (l - Л)-1 = П (1 - hr1, heH where we write A for Еаел a. Taking logarithms, we obtain £-'/!" = logfnd-*)’1)- (7.2.3) n > 1 n \heH / Consider the (continuous) homomorphism f: -* xh i—> log( 1 — h)-1. Under this homomorphism, (7.2.2) becomes !og( П 0 “ ~ E log(1 -hy1, \heH / heH which is therefore an infinite linear combination of polynomials of the form uv — vu, u, v e A*. By (7.2.3), we deduce that so is the series E -An- e E -hm- (7.2.4) n> 1 n heH m> 1 m Let C be any conjugacy class of length n. Taking scalar product of E«ec u with (7.2.4), we find that ^|C| = E^ec m- If C is primitive, |C| = n and the left-hand side equals 1; by primitivity, the right-hand side is Ehec b hence there is exactly one h in H n C. If C is not primitive, |C| < n, hence H n C is empty. □ Let H be a fixed Hall set in A*. There is a simple algorithm which gives for each word vv / 1 the unique hn (heH,n> 1) conjugate to w. This algorithm works on sequence of Hall words, takes as input w considered as
160 7 Circular words the sequence of its letters, and produces as output the sequence (h,..., h) of h repeated n times, such that he H, n > 1 and w is conjugate to hn. As in Section 4.1, let h = h'h" denote the standard factorization of a word he H\A. We say that a sequence s = (ftp ..., h„) of Hall words is cyclically standard if for any i=l,...,n, either h, is in A, or h, = h\h" with h'- > hp ..., hn. Observe that a sequence of letters is always cyclically standard. Moreover, if the h, in s are not all equal, then there is an index i such that h,-<hi+i =sup{hp...,h„}, where the indices are taken modulo n. For such an i, replace s by s with _ J(hp...,hi_ph1hi+phi+2, ...,h„) if i < n; ((h„hph2,. ,.,h„_1) if i = n. Observe that either h, is a letter, or hf = h\h", in which case h'- > hi+l because s is cyclically standard: in both cases, h = hthi+ , is a Hall word with h' = hf, h" = hi+p by (4.1.11). Moreover, s is cyclically standard, because on one hand h" = hi+l > hp ..., hn (by assumption on i) and h" > h by (4.1.10), and on the other, s being cyclically standard, we have h" > hk, for any j, k, and in particular hj > hi+x = h" > h. For s as above, let w(s) = h{ ...hn. Then clearly, w(s) and w(s) are conjugate words. The algorithm consists to apply the transformation s -» s repeatedly, starting from a sequence of letters, until a sequence of the form (h,..., h) is obtained. Example 7.6 Consider the Hall set of Example 4.6; we want to find the Hall word conjugate to the word abaahaba. Knowing that aabab, aab, a, ab, b are Hall words, written in increasing order, the algorithm gives: (a, b, a, a, b, a, b, a) -* (ab, a, a, b, a, b, a) -» (ab, a, ab, a, b, a) -» (ab, a, ab, ab, a) -» (aab, a, ab, ab) -* (aab, aab, ab) -» (aab, aabab) -» (aababaab). The latter word is the desired Hall word. Let the alphabet A be totally ordered, and order A* alphabetically (see Section 5.1). Recall that a Lyndon word is a word which is smaller than all its nontrivial proper right factors. Corollary 7.7 A word w is a Lyndon word if and only if it is primitive and if it is the smallest word in its conjugacy class. Proof The set L of Lyndon words is a particular Hall set (Theorem 5.1). Hence, by Corollary 7.5, L is a set of representatives of the primitive
7.3 Generation of Lyndon words 161 conjugacy classes. Now, if w is a Lyndon word, let vu be a conjugate of w, with w = uv, u, v / 1. Then, by definition of a Lyndon word, we have w < v. Since v is shorter than w, w is not a prefix of v, so that by Lemma 5.2, we deduce w < vu. Hence, each Lyndon word is the smallest element of its conjugacy class, which concludes the proof. □ 7.3 GENERATION OF LYNDON WORDS Let A be a totally ordered finite alphabet, let A* be ordered with the corresponding alphabetical order, and L be the corresponding set of Lyndon words. We fix an integer N > 1, and denote by LN the set of Lyndon words of length < N. Define a function In- Ln\{z} -* Ln, (where z is the greatest letter in A) by the condition: lN(u) is the smallest word, for the alphabetical order, in the finite set {x e LN \ x > u}. Observe that lN is simply the ‘next element function’, in the finite totally ordered set LN (whose greatest element is z). Hence, the knowledge of lN will provide a simple algorithm for the generation of all Lyndon words of length <N: starting by a (the smallest letter), compute и = lN(a), then lN(uf and so on, until z is obtained. The next result, due to Duval (1988), shows that the function lN may be computed very efficiently. For this, we need two definitions. Given a word u, call periodic expansion at length N of и the word DN(u) = ukp, where к and p are uniquely defined by к e N, p is a nonempty prefix of u, |t/p| = N. In other words, DN(u) is the unique prefix of length N of the ‘infinite word’ uuu .. .u ... Note that such a prefix, being of length N > 1, may always be written as ukp, with p a nonempty prefix of u. Now, define a function 5: A*\z* -* A*, where z* denotes the set of powers of z, by 5(w) = min{x g A* | x > w, |x| < |w|}. There is an easy algorithm to compute 5(w). Let w = vaz', where a is a letter distinct from z. Let b be the letter following a in the finite totally ordered set A. Then S(w) = vb. (7.3.1) Theorem 7.8 The function lN is equal to the composition S DN. We first prove several lemmas.
162 7 Circular words Lemma 7.9 If u, v are Lyndon words such that и < v and n, p are integers > 1, then unvp is a Lyndon word. Proof The case n = p = 1 is eqn. (5.1.2). The general case follows by induction, once it is observed that и < unvp < v (the latter inequality because unvp is a Lyndon word). □ The next lemma shows that there is no Lyndon word between a Lyndon word and its periodic expansion at any order. It is a first step towards the proof of Theorem 7.8. Lemma 7.10 Let u, w be Lyndon words with |u| < N, and и < w < DN(u). Then w = u. Proof Suppose that w satisfies и < w < DN(u). We show that w has a nontrivial suffix s and и has a prefix p such that s < p. Hence, s < p < и < w and w is not a Lyndon word. First, suppose that w is a prefix of DN(u). Then, since DN(u) is a prefix of some power u“ of u, we deduce that w is also; hence, w = ukp for some prefix p of и and some integer к > 0; we have w 1 and we may choose p to be nontrivial (replacing if necessary p = 1 by p = и and к by к — 1). Hence, p = s is a nontrivial suffix of w = ukp. Suppose now that w is not a prefix of DN(u). Since w DN(u), we have by definition of the alphabetical order, w = xaw', DN(u) = xbt for some words x, w’, t and letters a < b. We have, by definition of DN, x = uku', with к 0 and p = u'b prefix of u. Let s = u'aw'; then w = uku'aw' = uks, and s is a nontrivial suffix of w. Moreover, s = u'aw' < u'b = p. □ Lemma 7.11 Let и = ps be a Lyndon word, s 1. Let c be a letter such that s < c. Then pc is a Lyndon word. Proof Let t be a nonempty proper suffix of pc. We show that pc < t, which will prove that pc is a Lyndon word. We have t = t'c, where t' is a proper suffix of p. Then t’s is a nonempty proper suffix of ps, and therefore ps < t's, because ps = и is a Lyndon word. Moreover s < c, hence t's < t'c, and p is a proper prefix of ps, hence p < ps. Thus p < ps < t's < t'c = t, and p < t. Suppose that p is a prefix of t; since t is a proper suffix of pc, we have | p| + 1 = |pc| > |t| => | p| > t|, and we would have p = t, which is not true because of the previous inequality p < t. Hence, we can use Lemma 5.2(i) to deduce that pc < t. □ Corollary 7.12 Let и be a Lyndon word, which is not the greatest letter of the alphabet, p a nonempty prefix of u, and к an integer. Then S(ukp) is a Lyndon word.
7.4 Factorization into Lyndon words 163 Proof Let z be the greatest letter in A. Then p does not begin with z: indeed, otherwise и = zu' and и > z > last letter of u; since и is a Lyndon word, this implies that и is equal to its last letter, hence и = z, against the assumption. In particular, p is not a power of z. Hence, we have p = p}azl. where a g A\z, and by (7.3.1), S(p) = pffi, where b is the letter after a in A. We have и = p^ for some word Sj beginning with a. Hence, < b, and by Lemma 7.11, we have that pffi is a Lyndon word. Now, by (7.3.1) again, we have S(ukp) = S (икр{аг1) = tZpjb. Since pta is a prefix of u, we have и < ргЬ, and we conclude by using Lemma 7.9. □ Proof of Theorem 7.8 Let и be a Lyndon word of length <N, и / z, and w = lN(u). Then by definition w is the smallest Lyndon word in the set (x g A* \ x > u, |x| < N}. By Lemma 7.10, we have w > DN(u). Hence, w is the smallest Lyndon word in the set {x e A* | x > DN(u), |x| < N}. But S(DN(u)) is the smallest word in this set. Since DN(u) = for some integer к and some nonempty prefix p of u, we know by Corollary 7.12 that 5(DN(u)) is a Lyndon word. Hence, S(Djv(u)) = w = lN(u). □ Example 7.13 N = 9, A = {a < b}. Then и = aabbb is a Lyndon word. One has /9(u) = S D9(u) = Sfaabbbaabb) = aabbbab. 7.4 FACTORIZATION INTO LYNDON WORDS As in the previous section, L denotes the set of Lyndon words relative to the alphabetical order on A*, where A is a totally ordered finite alphabet. By Theorem 5.1, L is a particular Hall set, hence, by Corollary 4.7, each word w in A* has a unique decreasing factorization. w = f...ln, (7.4.1) Existence and uniqueness of the factorization (7.4.1) may be proved directly—indeed, each word has a factorization into Lyndon words (e.g. w is the product of its letters). Now, take such a factorization, with a minimal number of factors. Since for к, I Lyndon words, к < I implies kl g L (Lemma 7.9), this minimal factorization must be decreasing. This proves the existence of factorization (7.4.1). Uniqueness is a consequence of the following result. Lemma 7.14 For a factorization of the form (4.1) of the word w, the following properties hold: (i) ln is the smallest nontrivial suffix of w; (ii) ln is the longest suffix of w which is a Lyndon word; (iii) li is the longest prefix of w which is a Lyndon word.
164 7 Circular words Proof Let s be a nontrivial suffix of w. Then s = //, + j ... where /• is a nonempty suffix of /, and 1 < i < n. (i) Ц is a Lyndon word, hence we have /, < < Г{11+ { . ln = s, and /„ is the smallest nontrivial suffix of w, because l„< (ii) Suppose that s is longer than Then i < n, which implies /'• < s. Arguing as in (i), we deduce ln < s, which shows that s has a nontrivial suffix smaller than itself, and is therefore not a Lyndon word. (iii) Let p be a prefix of w, strictly longer than /P Then p = . lj-il'j, where l'j is a nonempty prefix of lj and 2 < j < n. Using (7.4.1), we deduce l'j < Ij < ... lj-il'j = p, which shows that p is not a Lyndon word. □ In order to find the Lyndon factorization of a word, one may apply the algorithm described in the proof of Corollary 4.4. However, there is a much more efficient algorithm due to Duval (1978, 1983), which we describe now. For this purpose, call sesquipower of a word и any word of the form ukp, for some integer к > 0 and some prefix p of u, which we may assume to be proper (i.e. p / u), without loss of generality. Such a sesquipower is called nontrivial if к > 1. Denote by 5 the set of nontrivial sesquipowers of Lyndon words. An element и of 5 has always a representation и = lkp, I e L, p proper prefix of I, к > 1. (7.4.2) For a given element и of S, the representation (7.4.2) is unique: indeed, let p = ... hq be the decreasing factorization of p into Lyndon words; then hi < p < I, hence the decreasing factorization of и into Lyndon words is lkhi ... hq, and we conclude by uniqueness of this factorization. In the sequel of this section, it should be understood that when we deal with elements of S, we deal actually with their unique representation (7.4.2). For a word и in 5, having the representation (7.4.2), we may write I = pas for some letter a and some word s. We denote a = g(w). Now, define a binary relation on the set 5 x A*, denoted by and defined for any и in S, b in A, and v in A* by (u, bv) -» (ub, i?) if s(u) < b. This is well defined, because e(u) < b implies ub g 5: indeed, let и = lkp as in (7.4.2) and I = pas, thus a = e(u). Then either a = b, hence ub = (pas)kpa is clearly in S; or a < b and as < b, hence by lemma 7.11, pb is a Lyndon word with I < pb, which implies by lemma 7.9 that ub = lkpb is a Lyndon word, hence in 5. Denote by the transitive closure of This is clearly a partial order on 5 x A*, and we say that xeS x A* is maximal if for no у in 5 x A*, one has x -* y. It is clear that for each x in 5 x A*, there is a unique maximal у in 5 x A* such that x y. The factorization algorithm is described in the next theorem. Note that
1А Factorization into Lyndon words 165 the decreasing factorization into Lyndon words of a word w may be written w = lkl...lk?, Ц > • > lp, kl?..., kp> (1A3) Theorem 7.15 Let w be a word, factorized as in (7 A3), c its first letter, with w = cw', (u, v) the unique maximal element in S x A* (where и is as in (7.4.2)), such that (c, w')^ (u, v). Then l{ = I and = k. In other words, the rewriting system -» allows us to compute the power of the first Lyndon word in the factorization of w; then, one continues with pv instead of w, and so on. Example 7.16 w = abbabbababb, a < b. For each и in S, written as in (7.4.2), we put the letter e(u) in bold face. Then we have: (a, bbabbababb) -» (ab, babbababb) -» (abb, abbababb) -* ((abb)a,bbababb) -» ((abb)ab, bababb) -» ((abb)2, ababb) -» ((abb)2a, babb) -» ((abb)2ab, abb). The latter is maximal, because b > a. So w = (abb)2s, and we continue with s: (a, babb) -* (ab, abb) -* ((ab)a, bb) -» ((ab)2, b) -» (ababb, 1). Hence, s is a Lyndon word, and the factorization of w is (abb)2(ababb). Proof Since (c, w')^ (u, v) we have w = cw' = uv = lkpv. Let I = pas, ae A. Let pv = hi ... hq be the decreasing factorization into Lyndon words of pv. Then either hi is a prefix of p, hence h{ < p < I, or p is a proper prefix of hi: pbh', where b is the first letter of v; then, since (u, v) is maximal, we must have a > b, hence hi = pbh' < pas = /. In both cases, hi < I, which shows that the decreasing factorization into Lyndon words of w is /% ... hq and that lk = lkl, hi...hq = lk22... If. □ The proof shows that this algorithm is linear in time: more precisely, to factorize w, one needs at most 2|w| comparisons between letters in A. 7.4.1 Applications (a) We know by Corollary 7.5 that each primitive word is conjugate to a unique Lyndon word. To find it, it is enough to factorize ww into Lyndon words, and to extract from this factorization a Lyndon word of length |w|. Indeed, such a word will clearly be conjugate to w. Moreover, it exists: indeed, let I be the unique Lyndon word conjugate to w. Then w = xy, ух = I. Hence, ww = xly. If x or у is empty, we are done because ww = //. So we may suppose that x, у / 1. The last factor in the Lyndon factorization of x is a suffix of x, hence of /, so greater than /. The first factor in the Lyndon factorization of у is a prefix of y, hence of I, so smaller than /. Thus, the Lyndon
166 7 Circular words factorization of xly is obtained by concatenating that of x, I, and that of y: hence, I appears in the Lyndon factorization of ww = xly. (b) Each Lyndon word I of length >2 has a standard factorization I = IT' (see Section 4.1), which according to the proof of Theorem 5.1 is given by: I" is the smallest nontrivial proper suffix of /. In order to find Г, I", let I = aw, ae A, and let w = f ... ln be the Lyndon factorization of w. Then I" = ln. This is an immediate consequence of Lemma 7.14(i). Hence, one can quickly compute the standard factorization of any Lyndon word w, and by iterating this process, its associated tree t(w) (cf. the proof of Theorem 5.1). 7.5 WORDS AND MULTISETS OF PRIMITIVE NECKLACES Recall that, for a set E, a multiset of elements of E is a mapping M: E -» N. For e in E, M(e) is the multiplicity of e in M. It is finite if its cardinality, i.e. £ee£M(e), is <oo. Given a subset H of A* which satisfies the hypothesis of Theorem 7.4, there is an evident bijection, given by (7.2.1), between the words of A* and multisets of elements of H. This is true for example for any Hall set H, or for the set H = L of Lyndon words (cf. Corollary 4.7 and Theorem 5.1). Moreover, Corollary 7.5 shows that there is a bijection between H and the set of primitive necklaces. Hence, we have the following result, where the evaluation (respectively length) of a multiset is the product (respectively sum) of the evaluations (respectively lengths) of its components, with multiplicities. Theorem 7.17 Given a Hall set on A* (especially the set of Lyndon words), there is a canonical evaluation-preserving bijection between the three following sets. (i) The set of words of length n. (ii) The set of multisets of length n of Hall words. (iii) The set of multisets of length n of primitive necklaces. We give now another bijection between words and multisets of primitive necklaces, which has better invariance properties than the previous ones. This bijection will be useful in the study of the various symmetric functions related to the free Lie algebra. Let A be a totally ordered alphabet. Let w = at ... an in A* (a; e A). Let [n] = {1,..., n] and define a function dw: [n] i-> A x [n] by d(i) = (ah i). Evidently, bw is injective. Order A x [n] with lexicographic order. Then the condition dw(i) < &WU) defines a total order on [и]. Note that this condition is equivalent to («,<«;) or (a( = a}and i <j). (7.5.1) This total order on [n] is called the standard numbering of w: it consists of
7.5 Words and multisets of primitive necklaces 167 numbering the positions of the letters in w, from left to right, starting with the smallest letter in w, continuing with the second smallest, and so on (see Example 7.18). The standard permutation of w is the unique permutation st(w) = a of [n] such that a(i) < a(j) is equivalent to bw(i) < bw(j). One obtains a directly, viewed as a word on [n], if one numbers the positions of the letters in w, as above. We leave the verification of the following fact to the reader: a(i) = |{j | 1 < j < n, as < a{}| + |{j | 1 < j < i, at = a7}|. Example 7.18 1 2 3 4 5 w = b b a a b a = 3 4 1 2 5 6 7 8 9 10 11 12 d d d b d b c 9 10 11 6 12 7 8 With w and a as before, consider a cycle c = (f,, ik) of a. Then denote by vf(w) the necklace which is the conjugation class of the word ahai2... aik. Then define a multiset of necklaces M(w) to be the multiset {vf(w) | c cycle of st(vv)}. More formally, M(w) is defined for any necklace v by M(w)(v) = number of cycles c of st(w) such that vf(w) = v. Observe that the evaluation of w is equal to the evaluation of M(w). Example 7.19 With w and a as in Example 7.18, the cycles of a are shown in Fig. 7.3. The multiset M(w) is obtained by replacing each digit i by a„ i(i) (instead of a{—this does not change the necklaces); see Fig. 7.4. Fig. 7.4
168 7 Circular words Theorem 7.20 The mapping w i—> M(w) is an evaluation-preserving bijection from the set of words onto the set of finite multisets of primitive necklaces. The inverse bijection W is described in the proof. Proof (a) We construct below an evaluation-preserving mapping W from the set of finite multisets of primitive necklaces into A* such that M ° W = id. Then W will be injective, hence surjective by Theorem 7.17, because the set of words with a given evaluation is finite. Hence, M is a bijection from A* onto the set of multisets of primitive necklaces. (b) Let P be the set of periodic sequences in ЛЧ Each element in P is of the form л(и), where n: A+ -* P is the mapping n(b1 ... bk) = (b1,...,bk, bx,..., bk,...), (bi-e A). Observe that л defines a bijection between primitive words and periodic sequences. Let z: Л141 -* Л141 be the shift mapping, defined by z(a0, ar, a2,...) = (ar,a2, a3,...). Its restriction to P is a permutation of P. One has z(n(b1 ... bk)) = Ti(b2 ... bkbf), (7.5.2) and z~1(ri(bl ... bk)) = ittb^ ... Ьк_г). (7.5.3) This shows that the orbit under z-1 of л(и), и primitive, consists in the sequences л(г), v conjugate to u, and has precisely |u| elements, the number of conjugates of u. Put on the lexicographic order, and do the same for Л141 x N. Let f\ Л 4 -♦ A be the mapping sending each sequence on its first element. Extend z and f to P x N by z(s, /) = (z(s), /) and f(s, I) = f(s). (c) Denote by (u) the conjugation class of a word и and by u the reversal of и (defined by u = ak... аг if и = ax ... ak, a,-e A). Let N be a finite multiset of primitive necklaces of length n. We associate with N the subset £ of P x N defined by E = IJ {л(й)} x [(V((u))], (7.5.4) ueA + where [0] = 0. Note that E is of cardinality n, because there are |u| elements in the conjugation class of a primitive word u. Note also that E inherits the total order < of Л4 x N; we write E = [ег < e2 < • • • < en}. If и and v are conjugate words, then Ar(('<)) = N((v)) and й and v are conjugate. Hence, by (7.5.2) and (7.5.3), the restriction of z to E is a permutation E -* E. Let a be the permutation of [n] defined by a = 1 ° z~ 1 ° b, where b(i) = e{. Observe that <5, b~1 are increasing functions. Let w = at ... an, with a{ = f °z~ We show that M(w) = N, which will finish the proof: it suffices to pose w = W(N). (d) We claim that a is the standard permutation of w. Indeed, o(i) < a(j) is equivalent to z~1(ei) < z~1(ej) (because <5, <5-1 are increasing),
7.5 Words and multisets of primitive necklaces 169 which by the properties of lexicographic order, is in turn equivalent to {/(^-1(e,)) </(z-1(e7))} or {/(z-1(e,)) =/(z-1(e;)) and e{ < ef, that is, (a, < af) or (a, = aj and i < j), which is (7.5.1). So the claim is proved. By (7.5.3) and (7.5.4), a cycle of z1 is of the form (л(Ьк... bj, /), (Ti(blbk... b2), I),..., (n(bk_1 ... bf>k), I), for some primitive word и = b{ ... bk and some I, 1 < I < M((u)). Note that conjugate words determine the same cycle. Since a = d1 °z1 ° <5, the cycles of a are of the form с = (ц,..., ik) with *i = ...bt),Z)), i2 = b'1((Ti(bl...b2),l)),...,ik = d~1((Tt(bk_l...bk),l)). Note that a, = f °z-1(e,.) = f °z~x °<5(0, hence by (7.5.3) a,. = fo l((it(bk ... bf>, I)) = f^b. ...), 0) = b., ai2 = f°z~1((-n(b1 ... b2), /)) = /((л(/>2 ...),/)) = b2, aik = foZ-\(n(bk^ ... bk), I)) = f((n(bk ...), /)) = bk. This shows that the corresponding necklace vf(w) = (ait ... aik) is equal to (u), and we conclude that M(w) = M. □ Example 7.21 The inverse mapping W of M is described on the multiset of primitive necklaces shown in Fig. 7.4. Label each vertex of each necklace by the periodic sequence obtained by reading, in the opposite direction, the necklace (see Fig. 7.5). Now, order these sequences lexicographically (when multiplicities occur, one has to order the multiple necklaces first; see Fig. 7.6). One obtains a permutation a in cycle form; write it as a word, and replace each digit i by the label of the corresponding vertex: 12 3 4 <7 = 3 4 1 2 w = b b a a 5 6 7 8 5 9 10 11 b d d d 9 10 11 12 6 12 7 8 b d b c One obtains the word of Example 7.18, as expected. Fig. 7.5
170 7 Circular words Fig. 7.6 7.6 APPENDIX 7.6.1 Lyndon elements in free partially commutative monoids We use the same notation as Section 4.4.2. Let A be totally ordered, consider the alphabetical order < on A*, and order M = M(A, 6) in the following way: denote by ip: A* -* M(A, 0) the canonical morphism and let st(m) = max(<p~ ^m)) for any m in M; then m < p if st(m) < st(p). An element m of M is called a Lyndon element if for any nontrivial factorization m = pq, one has m < q. The properties of Lyndon elements in M are quite similar to those of Lyndon words in A*. For instance, m is Lyndon if and only if m is primitive (i.e. m cannot be written m ~ pq, where p and q commute and p, q / 1), and if it is the smallest element in its conjugacy class (conjugation is the equivalence relation ~ in M generated by the relations pq ~ qp, p, q 6 M). A technical lemma, which is not obviously equivalent to the latter, is that m is Lyndon if and only if m < qp for any nontrivial factorization m = pq. Define IA(m) to be the set of a in Я such that m e aM. A pyramid is an element m such that |M(m)| = 1. A pyramid m is admissible if IA(m) consists in the smallest letter appearing in m. Each Lyndon element is an admissible pyramid. The set of admissible pyramids m such that IA(m) = {a}, for a fixed a in A, is a free monoid Ma, and an admissible pyramid m e Ma is a Lyndon element in M if and only if it is a Lyndon word in the free monoid Ma. Each element in M has a unique factorization into a decreasing product of Lyndon elements. If m is Lyndon, not in A, then it has a unique nontrivial factorization m = pq, where q is chosen minimum for the total order < in M. This factorization is called the standard factorization of m. By iteration of the standard factorization, one associates with each Lyndon element m a binary tree; this tree, when the nodes are interpreted as the Lie bracket in the free partially commutative Lie algebra cf(A, 0), defines an element of У(А, 0). The set of all these elements forms a basis of cf(A, 0). All these results are due to Lalonde (1992). His theory is expressed in the geometrical language of heaps of pieces (see Viennot 1986). Conjugation in
7.6 Appendix 171 M(A, 3) was studied first by Duboc (1986). A consequence of this is that, when A is finite, the dimension of the space of homogeneous elements of degree n in ^F(A, 3) is equal to the number of primitive conjugation classes of degree n in M(A, 3). This number bn may be computed as follows. Denote by a„ the number of elements of degree n in M(A, 3). Then the generating function of the a„ is given by the following formula, due to Cartier and Foata (1969): X anXn = fl - X ’ л > 0 \ л > 1 / where cn is the number of subsets В of A of cardinality n and such that ab = ba mod 3 for any a, b in В (see also Lallement 1979, Section XI.3). By a result quoted above, there is a bijection between the set of elements in M of degree n, and the set of decreasing products of Lyndon elements, of total degree n, so that Then, to compute the bn, one takes the logarithmic derivative and applies Mobius inversion, as in the proof of Corollary 4.14. The same computation, using the method of Witt (1937), gives the dimension of the space of homogeneous elements of £f(A, 3); see Duchamp and Krob (1992c). 7.6.2 Irreducible polynomials over a finite field Let F be the field with q elements. Then the number a„ of irreducible monic polynomials in F[x] of degree n is equal to (7.1.1), a formula which was known to Gauss. Indeed, F[x] is a unique factorization domain and there are qn monic polynomials of degree n, so that So one proceeds as in the proof of Corollary 4.14. This shows that the number a„ is equal to the dimension of the space of homogeneous Lie polynomials of degree n: this was noted by Witt (1937). A direct bijection between primitive necklaces of length n over F and the set of irreducible polynomials of degree n in F[x] may be described as follows: let К be the field with qn elements; it is a vector space of dimension n over F. There exists in К an element 3 such that the set [3, 34,..., 3qn is a linear basis of К over F: such a basis is called a normal basis, and always exists (see Lidl and Niederreiter 1983, Theorem 2.35). With each word w = a0 ... a„_! of length n on the alphabet F, associate the element of К
172 7 Circular words given by /? = ao0 + + • • • + an_ *. It is easily shown that to con- jugate words w, w' correspond conjugate elements ft in the field extension K/F, and that w i—► is a bijection. Hence, to a primitive conjugation class corresponds a conjugation class of cardinality n in K; to the latter cor- responds a unique irreducible polynomial of degree n in F[x], This gives the desired bijection. Another bijection using, instead of a normal basis, a generator of the cyclic group K O, is given in Golomb (1967). 7.6.3 Determinant of a sum of matrices Given a square matrix x over a commutative ring, define the function A,(x) by det(l -00=1 + X (-l)VA.(x), n> 1 where t is a commuting indeterminate. Note that A„(xy) = A„(yx) for any matrices x, y. Let x15..., xk be square matrices of the same size; we consider also {xn ..., xk} as an alphabet, to simplify notations. For each primitive necklace v = (xfl... xlr), the matrix function A„(v) = A„(xfl ... xir) is well defined. If M is a multiset of primitive necklaces, let A(M) be the matrix function ||v AM(v)(v), where M(v) is the multiplicity of v in the multiset M. Let sgn(M) be the sign of M, that is the product of the signs of the necklaces in M, where the sign of (xfl... xlk) is ( — I)*-1 Then the following formula holds: Л,(х1 +••• + xJ) = £sgn(M)A(M), (7.6.1) M where the sum is extended to all multisets of primitive necklaces M of length n over the alphabet {x15..., xk} (see Amitsur 1980; Reutenauer and Schiitzenberger 1987). For example, A3(x + y) is the sum of eight terms, given in Fig. 7.7. For the proof, one uses the identity in Z<<Xp ..., xk>) 1 - Xi-------xk = П U - й)> heH obtained in the proof of Theorem 7.4, where H is a Hall set. Then one applies the homomorphism sending the letter x, onto the matrix txo takes the determinant, and uses the fact that Hall words are in one-to-one correspon- dence with primitive necklaces (Corollary 7.5). The Cayley-Hamilton theorem may be deduced from (7.6.1). Indeed, let x, у be и by и matrices. Then A„+1(x + y) = 0; take in this equation the terms of degree n in x, 1 in y, using (7.6.1). This gives X (-1)'Л„.Хх)Л,(х'у) = 0. 1=0
7.6 Appendix 173 Fig. 7.7 Since At is the trace, we have, for any matrix у tr( ( E (- l)iA„_,(x)xi)y) = 0, \\i = 0 / / which implies the desired identity, by nondegeneracy of the trace. By taking in the equation A„+ ..., xn+ J = 0 (х,- n by n matrices) the multilinear part, one obtains the multilinear version of the Cayley-Hamilton theorem (see Procesi 1976, Theorem 4.3(b)). 7.6.4 Factorizations of the free monoid A family (Xt)ie/ of subsets of A* is called a factorization of the free monoid A* if / is totally ordered and if each word w in A* has a unique factorization w = Xi ... x„, x,. 6 Xjt, j\ > >]„. (7.6.2) In this sense, the family ({h})heH is a factorization of A*, for any Hall set H (Corollary 4.7). A theorem of Schiitzenberger (1965)—see also Lallement 1979, Theorem XI.5.7; Lothaire 1983, Theorem V.4.1—asserts that for a family of subsets (Xi)ieJ, indexed by a totally ordered set I, any two of the
174 7 Circular words following conditions imply the third: (i) each word has at least one factorization (7.6.2); (ii) each word has at most one factorization (7.6.2); (iii) each submonoid X* is freely generated by Xi; conjugation within Xf coincides with conjugation in A* and each conjugation class in A* meets exactly one submonoid Xf. Theorem 7.4 is a particular case of this theorem. The link between factorizations of the free monoid and the free Lie algebra has been intensively studied by Viennot (1978). 7.6.5 Zeta functions of cyclic languages The zeta function of a language L (i.e. a subset of Я*) is C(£) = exp ( X \n> i n where an is the number of words of length и in £ (we assume that A is finite). For £ = (ab)*, i.e. the set of powers of ab, £(L) = (1/1 — t2)1/2 and for £ = {a}, C(£) = e'. A cyclic language is a subset £ of A* which is conjugation-closed and power-closed; that is, uv & L о vu 6 £, and w 6 £ <=> wn e L, for any words u, v, w and any integer n > 1. The zeta function of a cyclic language has integer coefficients; in fact, one has the formula where a„ is the number of primitive conjugacy classes contained in £. The main result of Berstel and Reutenauer (1990) is that if L is a cyclic language which is recognizable by a finite automaton, then lj(L) is a rational power series. For example, £(£) = 1/1 — t2, for £ = (ab)* и (ba)*. 7.7 NOTES Formulas (7.1.1) and (7.1.2) must have been known for a long time. The first formula of Corollary 7.3 appears in the book by Lucas (1891, p. 501), who attributes it to Colonel Moreau. For the proof of Theorem 7.2, we have followed Garsia (1990), with the help of Pierre Leroux. The algorithm of Section 7.2 is from Melancon (1991), who extended an algorithm of Meier-Wunderli (1951) and Schiitzenberger (1958). Corollary 7.7 is actually
7.7 Notes 175 the original definition of Lyndon (1954, 1955a). The results of Sections 7.3 and 7.4 are due to Duval (1978, 1983, 1988); see also Berstel and Pocchiola (1992). The bijection of Theorem 7.20 is from Gessel in an unpublished manuscript dated 1981, rediscovered by several people and published by Gessel and Reutenauer (1992). Related bijections are in Metropolis and Rota (1984), and Dress and Siebeneicher (1988). The standard permutation of a word is introduced in Schensted (1961).
8 The action of the symmetric group In Section 8.1 we present the duality between the representations of the symmetric group and the linear group. From this, we quickly deduce the character of the free Lie algebra, already knowing the generating functions of primitive necklaces. This character is induced by a faithful one-dimen- sional representation of the subgroup generated by a circular permutation (Section 8.2). In Section 8.3 we give the combinatorial interpretation of the multiplicities of irreducible representations in the Lie representation. It is also shown that almost all irreducible representations actually appear. Section 8.4 introduces remarkable Lie polynomials, the Lie idempotents. In the last section, we consider the representations arising from the canonical decomposition of the free associative algebra. Throughout Chapter 8, we assume that К is a field of characteristic 0. 8.1 ACTION OF THE SYMMETRIC GROUP AND OF THE LINEAR GROUP Denote by K(Ayn the subspace of K{A) spanned by the words of length n. There is a right action of the symmetric group S„ on defined by (wa). = i=l,...,n, (8.1.1) for any word w of length n, where w, denotes the г th letter of w. Equivalently, if w = ... an, we have («i ... a„)o = a„(1)... a„(n). (8.1.2) This is indeed a right action, i.e. (wa)a = w(aa), for any a, a in Sn (see Section 3.3). We call it sometimes the place permutation action of the symmetric group, because the letters of a word are permuted according to their position in the word. As an example, we denote by 2 3 1 the 3-cycle (123) and have (abc)231 = bca and (aah)231 = aba (and not baa). This right action of Sn on the words of length n extends by linearity to a right action of the group algebra KSn on K{A~)n. The group End( V) of endomorphisms of the К-vector space V = ®aeA Ka
8.1 Action of the symmetric group and of the linear group 177 acts naturally on the left on K<A>; each endomorphism of V extends uniquely to a K-algebra endomorphism of KfA); the algebra endomor- phisms obtained in this way are exactly the homogeneous algebra endomor- phisms of K<A>. The previous extension from V to K<A> preserves composition, as the universal property of implies, so that it is a left action of End(E) on K<4>. This action preserves each submodule X<A>„. These two actions on commute with each other. Indeed, an endomorphism f of Иis given by a matrix (ka b)a beA such that each column has only finitely many nonzero elements and that /(b) = X k.,ba. aeA for any letter b. Then the word w = br ... bn of length n is mapped by the extension of f onto /(w) =f(bl)...f(bn) E ^01,bi • • • kantb„ai • • • ai....aneA Hence, for a in Sn, f(w)a = X kai.bi • • • • • • a<rn- ai,..., ane A On the other hand, we have /(wct) = f(bal ...ban) = i b i * • • b Д1 • • • * ai...a„eA Since ст is a bijection of {1,. .., n} with itself, we deduce (by the change of variables a, = cai) f(wa) = У kc , b , ... kc b cal ... c„n •s \ Cal, Off I "on о 1 url Cl...cneA = E kci.bi - kCn,bnc(T1 ...can=f(w)a, Cl.....c„eA because К is commutative. A particular case of endomorphism of V is induced by a permutation of A. Hence, we obtain an action of the symmetric group SA on the left on K<A>„, given by a«i • • • an = a(Ui ... a„) = «(aj ... a(a„). (8.1.3) More generally, each element of the group algebra KSA acts on the left on K(A). When A contains {1, 2,..., n}, we have therefore two actions of KSn on K<A>„, one on the right, one on the left. We call sometimes the left
178 8 The action of the symmetric group action (8.1.3) the variable permutation action of Sn because the letters of a word are permuted according to their actual value. Still assume that A contains {1,..., n}. Denote by En the linear span of all words w„ = <r( 1)... a(n), for a e Sn. Then £„, viewed as К-vector space, may be identified with the group algebra KS„: under this identification, the left (respectively right) action of a on a permutation a, identified with an element of £„, corresponds to the left (respectively right) multiplication of a with a in S„. More formally = waa, waa = waa, (8.1.4) because (w^a), = (vvff)al = oai = (wffa),-, since (wff)j = oj by definition, and (awff)i = «((wjj) = aoi = (w^f. We identify £„ and KSn, and in particular a permutation with the corresponding word; e.g. (134) (25) in S6 is identified with 354126. This allows us to speak of Lie elements in KSn; e.g. 1 — (12) — (132) + (13) is a Lie element KS3, because it is 123 — 213 — 312 + 321 = [[1, 2], 3] (we write 1 for the identity permutation). It is important to note that the left action of Sn on £„ is equivalent to the left action of Sn on its group algebra, hence the corresponding representation of S„ is the regular representation. We shall need in the sequel a result of representation theory, which relates representations of the symmetric and the linear groups. Let £ be a subspace of К {A) which is invariant under the previous left action, i.e. under each algebra endomorphism of K(/l) sending each letter onto a linear combina- tion of letters. Let n>0 and suppose that A contains {l,...,n}. We call £ n En the multilinear part (of degree n) of £. The spaces £„ and £ are both invariant under the left action of Sn: hence we obtain a repre- sentation of Sn on £„ = £„ n £. Denote by the character of this action. The Frobenius image of or the characteristic ch(xn) of is the symmetric function ch(Xn) = -*. E Хп(ст)Рл(<7), (8.1.5) n! aes„ where л(ст) = z = Г‘2"2... is the cycle type of the permutation о (i.e. a has n, cycles of length i) and рл = p" lfi2 • • • is the corresponding product of the power sum symmetric functions Pi = x) (see Macdonald 1979). Let a = (ай)оел be some multi-index, with the afl almost all zero, and denote by Ea the space of finely homogeneous polynomials of partial degree afl in each letter a. Let na = dim(£a n F) and consider the series Z и, П u-eZ[[4]], (8.1.6) |a|=n aeA
179 8.1 Action of the symmetric group and of the linear group in the commuting variables a 6 A, with |a| = Iv a This series is symmetric in the variables a e A, because F is invariant under any homomorphism sending each letter onto another one. We call it the generating function of F n Hence we have associated with F and the given integer n two symmetric functions of degree n (we look at symmetric functions regardless of the set of underlying variables; this is possible if there are at least as many variables as the degree of the symmetric function, here n; see Macdonald (1979)). Theorem 8.1 The symmetric functions (8.1.5) and (8.1.6) are equal. This is the Schur-Weyl duality between the representations of the symmetric group and the linear group (Weyl 1946, Theorem 7.6.F; Macdonald 1979, A7 in Chapter 1). The following example is typical of the use of Theorem 8.1. Example 8.2 Let F be the free Lie algebra and n = 3. Then the multilinear part £3 n F of F admits as basis the two Lie polynomials [[1, 2], 3] and [[1, 3], 2] (see Section 5.6.2). Now, we have (12)[[1, 2], 3] = [[2, 1], 3] = -[[1,2], 3], (12)[[1, 3], 2] = [[2, 3], 1] = [[2, 1], 3] + [2, [3, 1]] (Jacobi identity) = — [[1, 2], 3] + [[1, 3], 2]. Hence the character y3 of the representation of S3 on £3 n F satisfies Z3((l,2))=-1 + 1=0. Moreover, (123)[[1, 2], 3] = [[2, 3], 1] = —[[1, 2], 3] + [[1, 3], 2], (123)[[1, 3], 2] = [[2, 1], 3] = -[[1, 2], 3]. Thus, Хз(123))= -1. Evidently y3(id) = 2. Since there are 3 transpositions in S3, and 2 circular permutations, we obtain by (8.1.5) сй(Хз) = i(l -2-P? + 3 0-PjPj + 2(— l)-p3) = 34p?-p3). (8.L7)
180 8 The action of the symmetric group On the other hand, F admits as basis a set of finely homogeneous polynomials which is in bijection with Lyndon words with the same multi- degree (Theorem 5.1). A word of length 3 in A*, with A totally ordered, is Lyndon if and only if it is of the form abc, a<b,a<c,b/c, or aab, a < b, or abb, a < b. Thus, the generating function (8.1.6) is the sum in Z[[AJ] of the correspond- ing monomials, that is £ labc + £ (a2b + ab2). (8.1.8) a < Ь <c a <b To verify that (8.1.7) and (8.1.8) are equal, we compute (8.1.7): (/ \ з \ (Eu) -Ед3) \ a / a / = + 3 E a2b + 3 E ab2 + 6 E abc-^a2] \a a<b a<b a<b<c / which is indeed equal to (8.1.8). 8.2 THE CHARACTER OF THE FREE LIE ALGEBRA Let A be an alphabet containing {1,..., n}. Recall that the multilinear part of degree n of the free Lie algebra &K(A) is the space &K(A) n En of Lie polynomials that are linear combinations of words <t(1) ... o(n), a e Sn; moreover, there is a left action of Sn on this space. We call Lie representation of degree n or n-th Lie representation this representation of Sn. We denote by Xn the corresponding character of Sn, and ch(xn) the characteristic of xn (see Section 8.1). Theorem 8.3 The characteristic of the representation of Sn on the multilinear part of degree n of the free Lie algebra is n dfn (8.2.1) Equivalently, for each permutation a in Sn, /„(a) = 0 unless a has only cycles
8.2 The character of the free Lie algebra 181 of length d, for some d dividing n, in which case M ’ d^n/diWd). (8.2.2) n We shall give three proofs of this theorem. The first one rests on the duality between representations of the symmetric and the linear group (Theorem 8.1), while the other proofs are self-contained. First proof of Theorem 8.3 (a) By Theorem 8.1, we have only to verify that the generating function (8.1.6) of the space of homogeneous Lie polynomials of degree n is equal to (8.2.1). But this space possesses as a basis the set of Hall polynomials corresponding to Hall words of length n (Theorem 4.9(i)). Observe that these polynomials are finely homogeneous, with same partial degrees as the corresponding Hall words. Moreover, the Hall words of length n are in evaluation-preserving bijection with primitive necklaces of length n (Corollary 7.5) and the generating function of the latter is given by (8.2.1); see Theorem 7.2. (b) Denote by the number of permutations which commute with a given permutation of cycle type z. If 2 = dn/d, then it is straightforward to show that = dnld(n/d)\. Moreover, there are n\/zx permutations of cycle type z in Sn, for each partition z of n. Hence, by (8.1.5) and the fact that /„ is constant on conjugacy classes ch(xn)= X |A| =n where Xn is the common value of %n of the conjugacy class consisting of permutations of cycle type z. Thus E XnZL'Px = 1 E P(d)pnd/d, | A| = n И d | n by the first part of the proof. This proves (8.2.2) because the рл are linearly independent. Conversely, (8.2.2) implies (8.2.1). □ For the second proof of Theorem 8.3, we need two lemmas. The first one is a result of linear representation theory. Lemma 8.4 Let G be a finite group, e an idempotent in the group algebra KG, and x the character of the action of G on the left ideal KGe. (i) Then E x(g~1)g= E xex"1. geG xeG In particular, dim(KGe) is the product by |G | of the coefficient of 1 in e.
182 8 The action of the symmetric group (ii) If G = Sn and e = E<res„ then the characteristic of % is given by ch(x) = E a»Pw rreS„ In particular, dim(KGe) = Proof (i) Let g 6 G. Then %(g) is the trace of the linear endomorphism u: x i—* gx, KGe -► KGe. Denote by v the linear endomorphism v: x i—* gxe, KG -► KG. We have v\KGe = u, because e is idempotent. Moreover, if И is a subspace of KG supplementary to KGe, e.g. KG(1 — e), then u(L) £ KGe. This shows that tr(v) = tr(u). We compute tr(v). For this, let us compute first the trace of the endomorphism wh: x i—> gxh, where h is in G. We have, because G is a basis of KG: tr(wh) = number of x in G such that gxh = x; the latter equality is equivalent to g~1 = xhx}. Hence, if we denote by (P, Q) the scalar product in KG with G as orthonormal basis, we have tr(wh) = E (xhx-^g-1). xeG Since r = E (g /l)vvh, he G we deduce by linearity X(g) = tr(v) = £ £ (e, h)(xhx'\g~v) = E (*?*" \ g~l), heG xeG xeG which implies E x(g)g~l = E geG geG as was to be shown. (ii) In Sn, g and g~1 are conjugate, so that we have by (i) x(g)= E (xex~^g)= E (^x^gx). xeG xeG
183 8.2 The character of the free Lie algebra Thus, by the definition of the characteristic of %, ch(x)= 1 £ X(g)p^g) = * E x 1^)/’л(х >gX]= E (e' h)Pw)’ П. geG g.xeG he G as was to be shown. For the second lemma, let c, = (1 2 ... i) 6 Sn, for i = 1,..., n. In particular, Cj = 1. Recall that we identify each permutation with the corresponding word on {1,2,..., n] A, and KSn with the space En c К (A). Lemma 8.5 For p = 0,..., n — 1, the following element of ZfA>: с: -(-I)” E n>ip> ><2> ii > 1 is a linear combination of shuffle products иш v, u, v e A + . The following example contains essentially the proof of this lemma. Example 8.6 n = 5, p = 2. c5 — С4С3 — e4C2 ~ — ^‘зС2 — C3C1 — C2CI = 34512 - 34215 - 32415 - 23415 - 32145 - 23145 - 21345 = 34512 - (34 ш 21)5 = 34512 - 345 ш 21 + (345 ш 2)1 = 34512 - 345 ш 21 + 3451 ш 2 - 34512 = -345 ш 21 + 3451 ш 2. We have used the dual identity of (1.4.2), defining the shuffle product. Proof We write P = Q to express the fact that P — Q is a linear combina- tion of и 111 v, и, и 6 A + . Then we have for any words u, v and letters a, b (u ш vb)a = —(ua ш v)b, (8.2.3) by (1.4.2) and symmetry. Let n > ii > i2 > • • > ip> 1. Then a straightforward induction, which is left to the reader, shows that the permutation chci2.. .cip is of the form upp ... u2 2 Uj 1 vn, where the word Uj has length ij — ij+ j — 1, with ip+l = 0 and where ... Ujt' = (p + l)(p + 2)... (n - 1). Hence, these permutations are all distinct. Since they are ("~ *) in number and since they appear all in the polynomial [(p + l)(p + 2)... (n - 1) ш (p ... 2 l)]n, which itself has
184 8 The action of the symmetric group (" p1) terms, we deduce that P= E. CiP---Ci2Ch = [(P + l)(p + 2)... (n - 1) Ш (p... 2 l)]n. Now, an iterative application of (8.2.3) shows that F= -(\У(Р+ l)(p + 2)... (n — l)n 1 2 ... p = (-\yc£. □ Second proof of Theorem 8.3 (a) Let Gn denote the intersection of En and of the space generated by the elements и ш v, и, v e A +, and Fn the space of Lie polynomials in £„. By Theorem 3.1, Gn and Fn are the orthogonal space each of another, for the scalar product which admits A* as an orthonormal basis. This scalar product is invariant under the left action of Sn on En, hence the action of Sn on Fn is equivalent to the action of Sn on En/Gn. We compute the character of the latter. For this, we may take К = C. (b) Let £ be a primitive nth root of unity and define A simple computation shows that e is an idempotent. The left action of Sn on the left ideal CSne has a character /, whose characteristic is, by Lemma 8.4(ii) ch(x) = - E ГкРл(С*) = 1 E E ГкРл(С*)- П к = 0 ft d | n gcd(k, n) = n/d Observe that the cycle type of c* depends only on gcd(k, ri) = n/d, and is equal to dn/d. Moreover, gcd(k, ri) = n/d is equivalent to: £~* is a primitive dth root of unity. Thus ch(x) = - E Pdd E (1) = - E PdldP(d^ (8-2-4) П d | n ы primitive Udfn d th root of 1 as is well known. (c) Denote by = the equality mod Gn in En (identified with CS„). We have by Lemma 8.5 = (-D'C' L ft p = 0 n> ip> > i2> i i > 1 = -c‘e2)(i -r'e,). n
8.3 Irreducible components 185 Call и this latter element. Then и is invertible in CS„, because for p = 1,..., n - 1, 0 / e - 1 = e - C”p = (C - cp)(Cp_1 + • • • + c^1), hence C - cp is invertible, as is 1 — £ lcp. Since Gn is invariant under the left action of Sn, we obtain CS„e = CS„u = CS„ mod Gn. Observe that CS„e and En/Gn both have dimension (n — 1)! (Lemma 8.4(ii), orthogonality of Gn and Fn, and Section 5.6.2). Hence, the restriction to CS„e of the canonical mapping En -> En/Gn is a linear isomorphism. This shows that the left action of S„ on En/Gn is equivalent to that on CS„e, and concludes the proof. □ Corollary 8.7 Let a be an n-cycle in Sn and p: (ст) -> C a faithful representa- tion of the subgroup generated by o. Then the representation induced by p to Sn is equivalent to the Lie representation of degree n. Proof Let co = р(ст). Then co is a primitive nth root of unity, and the representation p is equivalent to the representation of (ст) on the (left) ideal Kf of K(ct), with / = „ £*=0 co~kak, because of = o)f. Now, by definition of the induction, the representation of Sn obtained by inducing p is equivalent to the representation of Sn on the left ideal KSnf. With the notations of part 2 of the second proof of Theorem 8.3, we have that e and f are conjugate idempotents. Hence, the characters of the corresponding representations are the same, and their common characteristic is given by (8.2.4). This concludes the proof, by Theorem 8.3. □ 8.3 IRREDUCIBLE COMPONENTS Recall that for a standard tableau T of shape z(T) = 2, where z = (z15..., zk) is a partition of n, a descent in T is an index i in {1,..., n — 1} such that i + 1 is located in a lower row than i in T (in the English way of depicting tableaux, i.e. rows increase in length from bottom to top). The descent set of T is the set of descents of T, denoted by D( T), and the major index of T is the number maj(T) = £ i. ieD(T) For example, for the tableau T below, we have z(T) = (3,2, 1, 1),
186 8 The action of the symmetric group D(T) = {2, 4, 6} and maj(T) = 12. 124 36 5 7 Recall also that the irreducible representations of the symmetric group Sn are in one-to-one correspondence with the partitions of n, and that the characteristic of the character corresponding to the partition A is the Schur function (see Macdonald 1979). The Lie representation has a special link with the representation of Sn on a quotient ring of K[x15..., x„], which we study first. The action of Sn on К [xb . .., x„] is given by = P(xffl,..., xffn), for any polynomial P in К [xb ..., x„] and any permutation a in Sn. Denote by A(x15 ..., x„) the fixed subring of this action, i.e. the ring of symmetric polynomials, and by I the ideal of R[xp..., x„] generated by the symmetric polynomials without constant term. Let R = KtX1,...,xnyi. Since I is invariant under the action of Sn, R inherits the action of Sn. Moreover, R inherits the graduation of K[x15..., x„]: R = ф R,. i>0 Theorem 8.8 The multiplicity of the irreducible character yf of S„ in its representation on Rt is equal to the number of standard tableaux of shape p and major index equal to i. Proof (a) It is well known that K[x15..., x„] is a free A(xp ..., xn)-module (see Bourbaki 1981b, Chapter IV, Section 6, Theorem 1). We show that there is a K-linear isomorphism R A(x)-+К [x]. (8.3.1) Indeed, let (Pj)jeJ be a basis of R[x] over A. We show first that (Pj is a basis over К of К [x] mod I. This is because each polynomial P may be written P = PjQj for some symmetric polynomials Q,; hence, with a, = constant term of Q}, we have P = a7P7 mod I. Now, suppose that ctjPj = 0 mod /; then ctjPj = QkRk for some symmetric polynomials Qk with- out constant term, and some polynomials Rk, the latter may be written
8.3 Irreducible components 187 XjPjQkj for some symmetric polynomials Qkj, hence X;a;P; = XjPjYjcQkQkr which implies = 0 because the Qk are without constant term and that (PJ is a A(x)-basis. Since R = K[x]//, we may define (8.3.1) by (P,-mod I) (x) Q i—» PjQ, and it is indeed a К-linear isomorphism. This shows that for any choice of a basis (Pj) of K[x]/7, the latter mapping is an isomorphism. Since I is invariant under the action of Sn, and since this action is homogeneous, we may find a homogeneous subspace of К [x] which is invariant under this action and which is complementary to I. Take a homogeneous basis of this subspace: then (8.3.1) preserves the grading and the action of Sn. (b) The isomorphism (8.3.1) preserves the grading and the action of Sn. For a graded S„-module M = ®Mn, where each Mn is of finite dimension, and for о in Sn, let us call generating series of the character of о on M the series n>0 where /„ is the character of Sn on Mn. Then, the tensor product corresponds to the product of generating series. We apply this observation to the isomorphism (8.3.1). The generating series of ff on A(x) is П?=1 О — /)because A(x) is freely generated by the n elementary symmetric functions e1,...,en, of degree l,...,n, as is well known. (c) We compute the generating series of the character of о on К [xn ..., x„]. It is equal to £d>0 qd x (number of monomials xp = xf1 ... xPn left fixed by о and which are of degree d). The action of о on xp is xp'(l)... xp”n hence this monomial is fixed by о if and only if for any i and j in the same cycle of ст,. one has p-t = pf, hence, fixed monomials of о are in one-to-one correspondence with mappings f: {cycles of ст} -> N, and the degree of the monomial is the sum X /(c) x length (c). ccycle of a If ст is of cycle type z = ... лк, we deduce that the generating series of ст is From (8.3.1), we thus have that the generating series of the character of ст on R is n Ik П (' -«')/ П (1 -<Л). (8.3.2) 1 = 1 /1=1
188 8 The action of the symmetric group In particular, for о = id, we obtain Y dim R{ = (1 + <?)(1 + q + q2)... (1 + q + • • • + <-1), i>0 which shows that К, = 0 for i > (2). (d) Let z be a partition of n. Then the symmetric function pA has the following expansion in terms of Schur functions: Pa = Z SpXA, д where Xa is the value of the irreducible character at a permutation of cycle type z, and where the sum is over all partitions y. of n; see Macdonald (1979, Chapter 1, (7.8)). Taking the value of these symmetric functions at 1, q, q2,..., and using the identities Jk 1 рл(1, q, q2,...) = П Рл,(1, 4, • • •) = П ;----Г’ i = i i = il-<T and s/l, q, q2,...) = Y qmai(T)l (1 - q‘), т I 1 = 1 (where the sum is over all standard tableaux of shape /z; see Macdonald (1979)), we find that (8.3.2) is equal to Z Z 4mi'T'xi- (8.3.3) д А(Т) = д Let nlfl denote the multiplicity of the character хц of Sn in its representation on K,. Then (8.3.2) is equal to i > 0 fi By linear independence of the irreducible characters, the nifl are completely determined by this equality. Hence, comparing (8.3.3) and the previous expression, we get nifl = number of standard tableaux of shape у and major index i. □ Theorem 8.9 Let i be in the range 0 < i < n — 1, c an n-cycle in Sn, C the subgroup generated by c and co a primitive n-th root of unity. The representation of Sn on ®p = imOdn Rp is equivalent to the representation induced from the 1 -dimensional representation of C given by c —> co1. In particular, it depends up to equivalence only on the subgroup generated by i mod n. Using Corollary 8.7 and Theorem 8.8, we obtain the beautiful combina- torial interpretation of the multiplicities of the Lie representation.
8.3 Irreducible components 189 Corollary 8.10 Let i, n be relatively prime integers. The multiplicity of the character %* of Sn in the Lie representation of degree n is equal to the number of standard tableaux of shape л and of major index congruent to i mod n. We need to consider again, for each partition A = lai2a2... n"n of n, the polynomial in q nAq) = П i = 1 1 - (i-qT' (8.3.4) This is indeed a polynomial because qn~^ + 1) nAq) = itM (1 -Oi -<T1)..-(i 1 - q*k (8.3.5) with A = (A*,..., AJ, A' = (zk_ 15..., AJ. Lemma 8.11 Let в be a primitive p-th root of 1, where p divides n, and л a partition of n. Then tiA&) = 0, unless /. = pnlp, in which case лАО) = Pnlp{u/p)\. Proof Let z = (A15..., Ak). We have (1-«)... (l-<f) кАч) =----->--------- (1 -<z)...(!-q“) The multiplicity of 6 as a root of the numerator is n/p; at the denominator, it is equal to the number of A{ divisible by p; since £ A; = n, we conclude that if лА@) t4 0, then each A; = p, thus A = pn/p. In this case, a simple computation, using the identity П (1 - «') = ₽. i = 1 shows that ял(0) = pnp(n/pj\. □ Proof of Theorem 8.9 Let /‘-denote the character of Sn induced by the one-dimensional character a{ of C given by c i—> co1. Let a be a permutation of cycle type A. By the proof of Theorem 8.8, the generating function of the character of a on the spaces K, is given by (8.3.2), i.e. it is TiAq)- Thus, all we have to show is that л - 1 ^Aq)= E mod qn - 1. i = 0 (8.3.6) It is enough to show that for each nth root в of unity, both sides of (8.3.6) take the same vaue for q = в.
190 8 The action of the symmetric group By Lemma 8.11, = 0 unless в is a primitive pth root of unity, z = pn/p and in this case ял(0) = p"/p(n/p)!. We claim that if x is the characteristic function of the conjugacy class of cycle type z in Sn, then Ео<г<л-1 ^‘<z‘, Z> vanishes, unless the following conditions hold: 0 is of order p dividing n, к = p"/p; in this case this sum is equal to 1. Suppose this is proved. We have <Z\ Z> = E z'(x)z(x) = ^z'ftf), nl xesn where nl/zA is the cardinality of the conjugacy class of cycle type z. Thus, we find that 1 Eo < i < n -1 0*X*(v) vanishes, unless the previous conditions hold, in which case the sum is equal to 1. Since z = pnlp implies zx = p"/p(n/p)!, (8.3.6) is proved. We prove now the claim. By Frobenius reciprocity ”E^<x\x> ='f^.xlQc i = 0 i = 0 = E 011 E <^’)zW- i = 0 n xec This sum vanishes, unless z is the cycle type of some element in C, i.e. z = pnlp for some p dividing n. Suppose that this holds. Since for x in C, x of cycle type z <=> x of order p, and since c1 and co have the same order, we obtain E0,<z\z> = -E E i= 0 П i = 0 <i>J of order p =1 E ”e П a>J of order p i = 0 If £ is an nth root of unity different from 1, then Eo<i<n-i e‘ = 0, because £ is a primitive <7 th root for some d dividing n and Eo<i<d-i e‘ = sum °f all <7 th roots = 0. Hence, the previous sum vanishes, unless 0 is of order p, in which case it is equal to 1. This proves the claim. The last assertion of the theorem is a consequence of the following remark: if i and j generate the same subgroup mod n, then coj = co\, form some primitive nth root of unity со1Ф □ The next result shows that, except in a few cases, every irreducible representation of Sn occurs in the nth Lie representation. Theorem 8.12 Let X be a partition of n. Then the irreducible representation of Sn corresponding to A appears in the nth Lie representation if and only if A is
8.3 Irreducible components 191 not of the following form: (i) z = (n), n > 2; (ii) z = 1", n > 3; (iii) z = 22; (iv) z = 23. By Corollary 8.10, this result is equivalent to the following corollary. Corollary 8.13 Let i, n be relatively prime integers and a a partition of n. Then there exists a standard tableau of shape a and major index congruent to i mod n if and only if A is not of the form listed in Theorem 8.12. The proof of Theorem 8.12 is rather technical. Proof of Theorem 8.12 (a) We begin by the four cases. Let T be a standard tableau of shape z. In case (i), maj(T) = 0. In case (ii), maj(T) = 1 + 2 + • • • + n — 1 is congruent to n/2 if n is even, and to 0 if n is odd. In case (iii), T is of one of the following tableaux: 1 2 13 34 24 whose major indices are 2 and 4 (the descents have been underlined). In case (iv), T is one of the following tableaux: 1 2 1 2 13 13 11 34 3 5 2 4 2 5 2 5 56 46 56 46 36 whose major indices are 6, 10, 8, 9, 12. In all these cases, maj(T) is not relatively prime to n, so that, by Corollary 8.10, the multiplicity of yf in the nth Lie representation is equal to 0. hence the corresponding irreducible representation of Sn does not appear. (b) From now on, we exclude cases (i)-(iv). We begin by the cases n = 1, 2, 4, 6, that is, we consider the partitions z = 1, I2, 31, 212, 51, 42, 412, 32. 321, 313, 2212, 214. For this, is is enough to exhibit a standard tableau of shape z and major index congruent to 1 mod n (Corollary 8.10). These tableaux are: 1 1 134 12 13456 1245 2 2 3 2 36 4
192 8 The action of the symmetric group 1 245 1 2 5 1 2 5 146 14 13 3 346 34 2 2 6 2 6 6 3 3 4 5 5 5 6 The reader may verify that in each tableau the sum of the descents is = 1 mod n. (c) It remains to show that in all other cases, the irreducible representation VA of Sn corresponding to z appears in the nth Lie representation. By Corollary 8.7 and Frobenius reciprocity, it is enough to show that the restriction VA | C of VA to the subgroup C generated by an n-cycle c contains a faithful one-dimensional representation of C. Recall that for n / 4, the only nontrivial normal subgroup of Sn is the alternating group. From this, we deduce that if an irreducible representation of Sn is not faithful, then it is one-dimensional or n = 4. Thus, by 1 and 2, we may therefore suppose that the representation VA is faithful on Sn. (d) Suppose that n = pr, p a prime. Then FJC is a sum of one-dimensional representations of C; since the sum is faithful on C, one of these one- dimensional representations is faithful on C (otherwise cpr~1 is mapped onto the identity in the representation F., and the latter is not faithful). Thus, we conclude in this case. (e) By (b), we may assume that n Ф 6. Hence, by (d), we may assume that n = dq, d / 1, q = pr > 3, p a prime not dividing d. Since c is an n-cycle, cd has d cycles, each of length q-, call them ol,...,ad. The group C acts transitively on the set £ = ..., ad] by conjugation, and cd acts as the identity of since q and d are relatively prime, cq generates C modulo the subgroup generated by cd, and we conclude that cq cyclically permutes, by conjugation, the elements of We thus may assume that a. = c~q‘(T]Cq‘, i = 0,..., d - 1. (8.3.7) Let G be the commutative subgroup of Sn generated by Стр..., od. The restriction VfG of V, to G splits into a direct sum of one-dimensional representations of G. Since Ул is faithful on G and since cd is of order q = pr, one of them is faithful on the subgroup of G generated by cd. Let x be the character of this representation, and v a basis of this representation; hence av = x(a)v f°r апУ in G. In particular, we have :v = /(ffjt', x(ai) = is a gth root of unity, and x(cd) = x(&i ... trj = ^ ... = £ is a primitive qth root of unity. (f) For a in Sn, such that a~1Go £ G, denote by xf the one-dimensional character of G given by /’(a) = z(<r Suppose first that £p ..., are
8.3 Irreducible components 193 not all equal. Then we can find a permutation i15.. ., id of 1,..., d such that the cyclic permutations of the sequence , £id) are pairwise distinct. Let tr in Sn be such that a~ 1oko = aik, for к = 1,..., d. Then хЧъ) = /(<т~Ч<т) = /(ct.-J = £ik. Thus, replacing / by /’, we may assume that the cyclic permutations of (£i, • •, ^d) are pairwise distinct. Since these d sequences coincide with the d sequences (xcqk(ai))o<i<d-1 = (£;+jk)o<;><<! -1> by (8.3.7), with indices taken mod d, we conclude that the d characters /'’’‘(О < к < d — 1) of G are pairwise distinct. Hence, by a well-known theorem of Artin (see Lang 1965, Theorem VIII.7), these characters are linearly independent. Let и = v + e~ lcqv + • • + e.~(d~ l)cq(d~ l’ve f<, where e is a primitive dth root of unity. Then for a in G, we have au= £ c~kocqkv 0 < к <d- 1 = £ s~kcqk(c~qkocqkv) = £ E~kcqkx(c~qkacqk)v = Y^kXc4k(^)c4kv. Taking any linear form (p on V2, we obtain <p(ou) = ^с.~к(р(сцксУ/У‘,к(о). Thus, if и = 0, we obtain <:k(p(cqkv) = 0, by linear independence of the characters; hence cqkv = 0, which is not true. We conclude that и / 0. Now, since cq is of order d, we have cqu = eu, and since cdv = c,v, we have cdu = cu. Since q and d are relatively prime, we deduce that си = сои for some (gd)-th root of unity; but co must be primitive, because co4 = s (respectively cod = c) is a primitive <7 th (respectively gth) root of unity. Hence, и is the basis of a faithful one-dimensional representation of C contained in 1<, which proves the theorem in this case. (g) Suppose now that = • • • = cd = 0. Then в is a primitive gth root of unity, because so is ()d = We may find numbers гг,... ,rd relatively prime to p, which are not all equal mod q, and whose sum is d. Indeed, either p / 2 and we take = — 2, r2 = 4, r3 = • = rd = 1 (then гг ф r2 mod q because q / 3), or p = 2 and we take rx = — 1, r2 = 3, r3 = • • • = rd = 1 (then d > 3 and q > 4, hence r2 ф r3 mod q). Then of* is a g-cycle, and we may find a in Sn such that a~ 1oio = o'1, 1 < i < d; we thus have /’(o’,) = /(o’- = /(of*) = 0r‘, and these d num- bers are not all equal. Furthermore, xa(cd) = 0ri ... 0rd = 0d is a primitive gth root of unity. Since /’ is a faithful one-dimensional character of the subgroup generated by cd, we are back to (f) by replacing / by /’. □
194 8 The action of the symmetric group 8.4 LIE IDEMPOTENTS К is still assumed to be a field of characteristic 0. An element e of KSn is called a Lie idempotent if e is idempotent and if the left ideal KSne is equal to the multilinear part of the free Lie algebra over К on the alphabet {1,..., n}, where as usual, each permutation is viewed as a word. Theorem 8.14 Let e be a Lie idempotent in KSn, and Сл the conjugacy class of cycle type /, in Sn. Then v _Jn-1/z(p) if л = pnlp aec, (0 otherwise, where e„ is the coefficient of a in e. Proof The theorem is an immediate consequence of Lemma 8.4(ii), of eqn (8.2.1) in Theorem 8.3 and of the linear independence of the symmetric functions рл. □ Denote by 2\A)n the space of homogeneous Lie polynomials of degree n. Lemma 8.15 Let A be an alphabet with at least n elements, and e an element in KSn. Then e is a Lie idempotent if and only if the linear mapping n:K(A>n - K(Ajn,P^>Pe, is a projection onto £P(A)n. In this case е = я(12...м), assuming that {1,..., n} <= A. Recall that Pe denotes the result of the right action of e on P (see Section 8.1). Proof We may suppose that A contains {1,..., n}. (a) Let e be a Lie idempotent in KSn. Let w = ax ... an be any word of length n in A, and define an algebra endomorphism (p of K(Aj> by (p(i) = a{, (p(a) = 0 for any other letter. Then we have we = <p(12 ... n)e = <p(12 ... n e) (by commutation of the left and right action on see Section 8.1) = (p(e), by (8.1.4). Thus we = <p(e) 6 LT(A)n, because (p(ff(A)) <f(A). Any Lie polynomial of degree n is a linear combination of P = [cq ... an], where [...] means some Lie bracket arrangement. We have Pe = [<p(l)... <p(n)]e = <p([l ... n])e = <p([l ... n]e). Now, [1 ... n] is a multilinear Lie polynomial, hence in KSne, by definition of a Lie idempotent. Thus [1 ... n]e = [1 ... n], and Pe = <p([l ... nJ) = P. This shows that P i-> Pe is a projection from onto ^(A)n.
8.4 Lie idempotents 195 (b) Let л: P i-» Pe be a projection from К<Л>„ onto У(А)п. Denoting by Fn the space of multilinear Lie polynomials, we have Fn = tt(Fn) = Fne £ KSne. Moreover, KSne c KSn n n(KSn) c KSn n ^(A)n = Fn; hence, Fn = KSne. In particular, e is a Lie element, and we deduce that e = л(е) = ее, which shows by (8.1.4) that e is a Lie idempotent. Moreover, n(12 ... n) = (12 ... n)e, by (8.1.4) again. □ Recall that D(o) (respectively d(a)) denotes the set (respectively the number) of descents of the permutation a; see Section 3.3. If S c {1,..., n — 1} we denote by Ds the following element of KSn: Ds= £ ff. rreS„ D(a) = S (8.4.1) Theorem 8.16 The two elements of KSn below are Lie idempotents .[[1,2]. 3], -(I — (21))(1 -(321))...(I -(„...21)) n n = |ЛХ(-0‘В,1........(8.4.2) n k = 0 (n- A-1 (-1)|S| (n- A-1 <reS„ n \ d(a) J S<=(1.7\n-1) n \ |S| J (— 1)ISI = X + <8A3> Here we use the notation Ds for DT, T^S (see Section 3.3). Note that in (8.4.2), we identify KSn with the space spanned by multilinear words of length n on the alphabet {1, 2,..., n}. Proof (a) The linear mapping itn, defined for any word ar... an of length n, by 7tn(al ...an) = 1[.. .[[flj, a2f a3],..., a„], is by Theorem 1.4(v) (or rather of its symmetric version), a projection from onto ^(A)n. By definition of the right action, we have 7tn(al ... an) = (ax ... an)6n, with вп defined in (8.4.2). Hence, by Lemma 8.15 л„(12 ... n) = if.. .[[1,2], 3].n] is a Lie idempotent. Observe that, by definition of the right action of Sn on the words of length
196 8 The action of the symmetric group n, the right action of the cycle (n ... 2 1) on a word is given by a j ... an(n ... 2 1) — апаг ... an _ x, (8.4.4) see (8.1.2). Thus, for any homogeneous polynomial P of degree n — 1 and any letter a, we deduce by linearity (Pa)(l - (n...2\)) = Pa-aP = [P, a]. This shows that, with the canonical embedding 8п_г -> S„, one has я„(1 2 ... n) = IX-jO 2 ... n - 1), n] = 7T„_1(12...n- 1)(1 — (n...2 1)), and proves the second equality of (8.4.2) by induction, because kx(1) = 1. Observe that if w is a permutation in Sn_^ with descent set {1,..., k] {1,..., n — 2}, then the permutation wn in Sn has the same descent set, and the permutation nw has descent set {1,..., к + 1}. Moreover, there are (" ~k ') permutations in Sn with descent set {1,..., k], because such a permutation is of the form f ... ik 1 Л ... j„_ x with h > • • • > 4, Ji < • ’ ’ < Л- i -*• Since [.. .[[1, 2], 3],..., n] is of the form £ ±<t, with 2"~ 1 terms, all this shows that this element is equal to Z q( — 1 )kD{ x k} and it proves (8.4.2). (b) Let Kj be the canonical projection -► LT(A), as defined in Section 3.2. Let e = %x(l 2 ... ri). Then, by Lemma 3.14 and Corollary 3.16, e is equal to any one of the expressions in (8.4.3). It remains to show that for any P in К (/!)„, one has ях(Р) = Pe (then e is a Lie idempotent by Lemma 8.15). But such a P is a linear combination of polynomials of the form Q = <p(l 2 ... ri), where cp is an algebra endo- morphism of К (A) which sends letters to linear combinations of letters. By Lemma 3.9, cp commutes with я15 so that я1(2) = я1'>ф(12...п) = (p - xfl 2 ... n) = cp(e) = <p(l 2 ... n • e) = <p(l 2 ... n)e (by the commutation of actions—Section 8.1) = Qe. □ We define the major index maj(a) of a permutation a to be the sum of its descents: maj(ff) = '• ie D(<t) Theorem 8.17 Lew co be a primitive n-th root of unity in K. Then кп = - П creS, is a Lie idempotent.
8.4 Lie idempotents 197 For example, |(1 2 3 + j2 1 3 2 + j 2 1 3 + j22 3 1 + j 3 1 2 + 3 2 1) is a Lie idempotent, for j a primitive cubic root of 1. Lemma 8.18 Let S {1,..., n — 1} and P an homogeneous Lie polynomial of degree n. Then PD S = 0 if S / 0, and in general PDS = (- 1)IS|P. Proof Let w = Uj ... an be a word of length n. Let S = {n15 пг + n2,..., «!+••• + nk_ J, where the n-t are positive. Let 1 2 ... n = «j ... uk, where each word u: is of length nh with nk = n — пг — • • • — nk _ P Then, by Lemma 3.13, we have = 0(ui ш- • -ш uk), where в is the linear endomorphism of KSn sending each permutation on its inverse. Observe that by definition of the right action of Sn, we have w(u1 Ш • • • Ш Uk) = Wj Ш • • • Ш wk, where w = Wj ... wk, with |w;| = n-r Thus, if S / 0, hence к > 2, we have by eqn (3.3.5): (D^s, w) = (P0(Ui ш • • -Ш uk), w) = (P, w(w1 Ш- • -LU икУ) = (P, Wj ш - • - ш wk) = 0, by Theorem 3.1 (iv). This shows that PD S = 0- Now, by inclusion-exclusion, we have Ds= Z (-l)|s|-'^r. This implies the lemma, in view of D0 =1. □ Denote by c the cycle (1 2 ... n) in Sn. Lemma 8.19 Let 1 "-1 . . C„ = - £ W ‘c‘. n i = 0 Then Сл^Л ^лСл Cl’ and £п, кп are idempotents.
198 8 The action of the symmetric group Proof Define d(a) by J(ct) — d(a) + £, where c = 0 or 1 according to a(n) < <t(1) or a(n) > In other words, d(a) is the number of descents in a, viewed cyclically. We claim that for a in Sn: maj(tfc) = maj(tf) — d(a) mod n, maj(ctf) = maj(tf) — 1 mod n, d(ac) = d(a). Suppose the claim is true. Then Kn = 4 E Г wmaj('T) ’ ‘ac‘ = Д Г o)mai(ac‘ ’+ i(d(a) ~1 >ac! n2aeSni = o n2 a,i = — E ^mai(ac,>^i(d(ac‘)~1)ffCi n2 o,i = A E wmaj(a,a J (a/,3,,“1);. П meSn i = 0 Now, for an nth root of unity p, one has E"=o Pl = 0, except when p = 1, in which case it is n. Moreover, d(ct) = 1 if and only if i = c1 for some j, and d(i) e {1,..., n}. Since maj(cJ) = n — j, for j e {1,..., n}, we obtain 1 n _. . k'nSn ~ E П j=l Furthermore, <Л = ‘/if E И i = 0 a e S„ = \ E wrnaJ(c''T)^ n i.a = 1 £ Wmaj,3,,7 = Kn. tt meSn Now, кп is idempotent, because кпкп = кп^пкп = с,пкп = кп, and similarly for *эи' It remains to prove the claim. Let о = аг ... an. Then ac = a2 ... anal, which shows that D(ac) = [i — 1 |z e D(a), i > 2} и E,
8.4 Lie idempotents 199 where E = 0 or {n — 1}, depending on whether an < or an > ax. Thus maj(or) = J (z - 1) + (n - 1)|£| = J (z - 1) + (n - 1)|£| ieD(a) ieD(a) i>2 = maj(tf) — d(o) — |£| mod n = maj(o-) - d(o). Also, co = (ai + 1)... (an + 1), where digits are taken mod n. If an = n, then maj(ca) = maj(a) + n — 1 = maj(a) — 1 mod n. Otherwise, о = ... in j..., co = ... (z + l)l(j + 1) • • • and the descent nj in о is replaced by the descent (z + 1)1 in co; thus maj(ca) = maj(a) — 1. The last equality of the claim is immediate. □ For S {1,..., n — 1}, denote by maj(S) the sum of its elements. Proof of Theorem 8.17 Let e be any Lie idempotent. We show that екп = e. This will show that the space KSne is contained in the space KSnKn. Now, кп is idempotent by Lemma 8.19, hence the dimension of KSnKn is (n — 1)!, by Lemma 8.4(ii). Similarly, the dimension of KSne is also (n — 1)!, hence the two spaces are equal and кп is a Lie idempotent. We have by definition of кп, by Lemma 8.18 and by the fact that e is a Lie element: eKn = e(1 £ comaj(S)Ds^ = - ( £ wmaj,s,(- l)|s|^)e \n S c (1.Л- 1) / n \ S / = -( E wil + "'+Zk(-l)*je П \0 < ii < • • < ik < n / = - (1 — co)( 1 — co2)... (1 — co"- l)e = e, n because (1 - co)(l - co2)... (1 - co"-1) = П (x - coz)|x=1 Z = 1 = (x"-1 + • • • + X + 1 )|x= j, co being primitive. □ Corollary 8.20 Let Hn be a set of representatives of the primitive conjugacy classes of words of length n. Then the set {wk„ | w e Hn} is a basis of ^T(A)n. For the definition of a conjugacy class, see Section 7.1. As the proof will
200 8 The action of the symmetric group show, if a word w in Hn is replaced by a conjugate word w', then wKn is replaced by w'k„ = Оюкп for some nth root of unity 0. This basis is therefore canonical, up to constant factors which are roots of unity. Proof Let be as in Lemma 8.19. Then c\n = a)^n. Hence, by this lemma, скп = a)Kn. Two words w, w' of length n are conjugate if and only if w' = wc‘ for some i; in this case w'k„ = со1юкп. If w is not primitive, then w = c‘w for some z, 1 < i < n — 1; then wk„ = o)‘wk„ => wk„ = 0. These observations prove the corollary, because ^T(A)n = K^A^Kn is spanned by the set {wk„| w word of length n}, and by equality of the number of primitive conjugacy classes of length n and dimension of ^T(A)n (Corollary 4.14 and Theorem 7.1). □ Third proof of Theorem 8.3 By Lemma 8.19, the mapping К8„Цп ~~* KS„K„, X 1 * XKn is a linear isomorphism. It is evidently an isomorphism of left S„-modules. Hence, the two representations of Sn on these left ideals of KSn are equivalent. The second one is the Lie representation, because кп is a Lie idempotent. Hence, one concludes as in the second part of the second proof of Theorem 8.3. □ Let q be a variable, and denote by rn(q) the unique polynomial of degree < <p(n) such that tM = <T mod Ф„(з), where Ф„(д) is the nth cyclotomic polynomial, and <p(ri) = degФn(^) is the Euler function. Define = - E rn(q)°- U ae Sn Theorem 8.21 Let f = <p(ri) — 1, and Kn{q) = <50 + 6tq + • • • + dfqf. Then (i) Kn(q) is a Lie idempotent in K(q)Sn; (ii) d0 + + • • • + cf()f is a Lie idempotent for any choice of scalars ct,... ,cf in K. Proof (i) Let e be a Lie idempotent in KSn. Then KSne = KSnKn.
8.5 Representations on the canonical decomposition 201 This implies that екп = e and кпе = кп. Since, by Theorem 8.17, k„(cu) = кп, we obtain ек„(4) = e, Kn(q)e = Kn(q) mod Ф„(^). Since both sides of each congruence are of degree < <p(n), we actually obtain ек„(4) = e, Kn(q)e = Kn(q). (8.4.5) These equations imply that Kn(q) is idempotent and that K(q)SnKn(q) = K(q)Sne, hence Kn(q) is a Lie idempotent. (ii) Eqn (8.4.5) shows that ed0 = e, ed-t = 0 if i > 1, and 6-te = if i > 0. Hence, e = e(<50 + + • • • + cf8f) and <50 + cl8l + • • • + cf8f = (<50 + cl8l + • • • + cfdf)e. This proves (ii). □ 8.5 REPRESENTATIONS ON THE CANONICAL DECOMPOSITION Let z = Xj ... лк be a partition of n. We define Uk to be the subspace of linearly generated by the polynomials k'. aeSk where each P; is an homogeneous Lie polynomial of degree z{. Lemma 8.22 К<Л> = In particular, with the notation of Section 3.2, let Uk be the subspace of К<Л> generated by the kth powers of Lie polynomials. Then Uk= ® Uk, ia>=k where 1(a) is the length of z (see Proposition 3.6). Proof We may assume that the alphabet is finite. Let (Ph)heH be a Hall basis of &(A). To each word w, which has the decreasing factorization w = hx ... hk (see Corollary 4.7), associate the polynomial Qw = (Phi,...,Phk). By multilinearity of the operator (P1?..., Pk), and by Theorem 3.7 and Proposition 3.6, the polynomials Qw generate the vector space К<Л>. Since they are in degree-preserving bijection with the words, they form a basis of К<Л>.
202 8 The action of the symmetric group The same argument shows that the Qw, for which (deg Phi,..., deg Phk) is the unordered partition z, form a basis of If. Thus the lemma follows. □ Evidently, each If is invariant under each homogeneous algebra endo- morphism of К<Л>; hence under the left action of Section 8.1, we obtain a representation of Sn on the multilinear part of If, when |2| = n (see Section 8Л). To describe this representation, we need the plethysm of symmetric functions (see Macdonald 1979). Let x2,... be infinitely many commuting variables, and g(xx, x2,...) a symmetric function in the xr Each symmetric function has a unique expansion as a series in the power sums p2,.... If g = G(p1? p2,...) and f = F(Pi, p2, •) are two symmetric functions, then the plethysm of f in g is defined to be the symmetric function where f = F(Pi, p2i,...). When f has coefficients in N, there is an equivalent definition. Let (8.5.1) 16 where each m, is a monomial in the Xj. Then g°f = д(тг,т2,...). (8.5.2) Since g is symmetric, the choice of the representation (8.5.1) is immaterial. Observe that g i—► g ° f is for fixed f an homomorphism of the algebra of symmetric functions into itself; furthermore, the mapping f pn ° f is linear, for a fixed integer n. We denote by hn the complete homogeneous symmetric function of degree: hn is the sum of all monomials of degree n in the x-r i.e. hn = Z (8-5.3) i 1 < < i„ Theorem 8.23 Let /. = lni2"2... knk be a partition of n. Then the characteris- tic of the representation of Sn on the multilinear part of If is к П V'- I = 1 where ln denotes the characteristic of the n-th Lie representation, given by (8.2.1). Proof By the proof of Lemma 8.22, the space If admits as a basis the set of polynomials (Plk,..., Plni, P2l,..., P2ni,..., Pkl,..., Pknk), where the polynomials PtJ are chosen in some Hall basis of LF(A), with the condition <teg(Pif) = i, and Pn < • • • < Plni, P2l < • • •< P2n2,..., Pkl < • • < Pknk,
8.5 Representations on the canonical decomposition 203 where the order is that of the Hall basis. The previous polynomial is finely homogeneous, and Hall polynomials are in evaluation-preserving bijection with Hall words. The generating function of the Hall words of degree n is equal to Hence, by definition (8.5.2) of the plethysm and (8.5.3) of hn, the theorem follows from Theorem 8.1. □ Denote by ZA the centralizer of some permutation of Sn of cycle type 2. The next theorem extends Corollary 8.7. We suppose that К contains all roots of unity. Theorem 8.24 Let 2 be a partition of n. The character of Sn on If is induced by a one-dimensional character of Z;. We call characteristic of an element e = ^aeSn eaff of the group algebra KSn, the symmetric function ch(e) = Y^sn Ww- Lemma 8.25 Let e, f be idempotents in KSn, KSP respectively. Denote, for о in Sn, a in Sp, by о x a the permutation co in Sn+p such that: co(i) = a(i) if 1 < i < n, and co(i) = %(i — n) + n if n + 1 < i < n + p. Then e x f = x a) (8.5.4) <reS„ aeSp is an idempotent in KSn + p, and ch(e x f) = ch(e)ch(f). (8.5.5) Proof A straightforward computation shows that e x f is idempotent. Observe that the cycle type of a x a is the union 2(a) u 2(a), hence Pz(<7 X a) = Рл(<7)Рл(а)- This implies (8.5.5). □ Lemma 8.26 Let e, f be idempotents in KSm, KSP respectively. For о in Sm, ulf...,um in Sp, denote by (a; u1,..., um) the permutation of Smp defined (a; и,,..., um) ((i - l)p + j) = (a(i) - l)p + Ui(j),for 1 < i < m, 1 <j < p. Then e°f = Z ?afUi ...fUm((j;u1,...,um) (8.5.6) aeSm ui...umeSp is an idempotent in KSmp and ch(e ° f) = ch(e)n ch(f). (8.5.7) In order to understand the previous definition of (a; u17..., um), consider the partition {l,...,mp} = IJ {(i-l)p + —l)p + pj. (8.5.8) 1 < i < m
204 8 The action of the symmetric group Then щ permutes the ith block of this partition, while о permutes the blocks themselves. Proof Let a, a e Sm, Up ..., um, vx,... ,vme Sp. Then (a; u19, um)(a; , vm) = (aa; ..., ua(m)vm). (8.5.9) Indeed, for 1 < i < m, 1 < j < p (a; Up ..., um)(a; ..., vm)((i - l)p +j) = (a; Up ..., um)((a(z) - l)p + vf(J)) = (aa(z) - l)p + u^v-Xj) = (aa; ua(1)t?p ..., ux(m)vm)((i - l)p + j). Since e, f are idempotents, we have Z eeea = ev, £ fufv = fw. (8.5.10) ff(X = (p uv — w Now, we have = Z e„fUi fUmeJVi um)(cc; vx,..., vm) where the sum is over all a, a in Sm, ub ..., um, vr,..., vm in Sp. Hence, by (8.5.9), the coefficient of (<p; wb ..., wm) in (e f)2 is Z • fvm Z ^a^x Z fu^iyfvi • • • (JH = (p (JO. ~<P U2(i)Vi = Wi Vi, = wt = Z e^x Z frJV1 • • • frmfVm m = Z e^x П Z fr.fv, ax = <p i = 1 r,v, — wt Z ^a^x J”[ fwi e<pfwi • fwm’ ox — <p i = 1 by (8.5.10). This proves that e°f is idempotent. Let us view the set {l,...,mp} as {1,..., m} x {1,..., p}, and the partition (8.5.8) as x {l,...,p} = (J {z} x {l,...,p}. 1 < Z < m We denote by Bt the z th block of this partition. With this identification, the permutation (a; up ..., um) acts as (a; Up ..., um)(i,j) = (a(z), u;(j))- (8.5.11)
8.5 Representations on the canonical decomposition 205 We prove formula (8.5.7) without the assumption that e is idempotent; since this formula is linear in e, we may assume that e is a single permutation: e = e Sm. Write a = Cj ... ck as a product of disjoint cycles; each cycle is of the form c: = (rb..., r() and we denote u|c; = uri... uri, where и = (ub..., um) e S”. We claim that к ch((o;ul,...,um)) = П Рк(°сИ(и\с{), (8.5.12) i = 1 where z; is the length of the cycle c;. Suppose this is true; for и as above, let fu = fUi ... fUm and (a; u) = (a; ub..., um). Then ch(oof)= fu fl P^ach(u\c:) = Z fu П Pk,°ch(Vi) ueSp i ~ 1 vi, VkeSp ueS? i = 1 Vi,u|c, = v> к Z П p^° ch(vf Z fu- vi,..., VkeSp i ~ 1 Vi,u|c, = t’, Now, f is idempotent, so that fl = f for any I > 1, hence £W) ...W, = K,/W1 ... fwt = fw- This implies that the last sum in the previous expression is equal to fVi ...fVk. Thus к к ch(a°f) = ПАРл,°^.)=П Z ft’,Pi,°ch(Vi) vi..Vke5pi = l vteSp к / \ к = П Pit ° ch\ Z fv,vi) = П Рл, ° ch(f) = рЛ(а) ° ch(f) i - 1 \vteSp / i - 1 = c/i(a)oC/i(/). It remains to prove the claim. It is equivalent to show that 2((a; u)) is the disjoint union of the partitions z,z(u|c;), i = 1,..., k, where the factor z; means that each part of z(u|c;) is multiplied by z;. It is enough to show that each cycle of length h of u|c; defines a cycle of length z;/i of (a; u); so let С; = (r15..., r,), with I = /.h and c = (sb ..., sh) a cycle of u|c;. Then by (8.5.11) the successive images of (rpsj under (a; u) are: (rb s,)(r2, (r3, ur2uri(s i)),..., (rp un_, ... un(Sj)), (rp s2), (r2, uri(s2)),..., uri(s2)), (rP sh), (r2, u„(sh)), ...,(r„uri_,... urj(sh)), (Гр sj. Hence, we obtain a cycle of length fh of (a; u). These cycles are all distinct, and their total length is £‘=1 z;p = mp, so that we have described all the cycles of (a; u). □ Proof of Theorem 8.24 (a) Suppose first that n = mp and tht z = pm. We
206 8 The action of the symmetric group apply Lemma 8.26 to 1 1 p~1 . . e = — Z a and f = ~ Z 0) >c‘’ ml aeSm Pi-0 where co is a primitive pth root of unity and c the cycle (1, 2,..., p). Then e ° f is in the group algebra KZ, where Z is the set of permutations in Smp of the form (a; ch,..., ctm), with the notation of Lemma 8.26. In other words, Z is the centralizer, in Smp, of the permutation (1, 2,..., p) (p + 1,..., 2p)... ((m — l)p + 1,..., mp), of cycle type A. We have e°f = —?— £ w"'1 ‘m(a; c11,..., clm). m\pm o<i....im<P aeSm The multiplication rule (8.5.9) shows that the ideal KZe° f is of dimension 1. Furthermore, the characteristic of the representation of Sn on KSne° f, which is induced by the representation of Z on KZe ° f, is by Lemma 8.4(ii) equal to ch(e° f). By Lemma 8.26 ch(e ° f) = ch(e) ° ch(f) = hm ° Ip, because hm = Y^sm Piw as is wel1 known, and ch(f) = lp, by (8.2.1), Corollary 8.7, and Lemma 8.4(ii). (b) Let Z = Z; be the centralizer of some permutation о of cycle type к Since conjugate permutations give conjugate centralizers, and equivalent induced representations, we may assume without loss of generality that the fixed points of a are {1,..., mr}, the 2-cycles of a act on + 1,..., тг + 2m 2}, and so on. Hence, we take a in the subgroup Smi x S2mi x S3m3 x • • • of Sn, о = (cr 1? a2, a3,...), where op e Spmp is the product of the p-cycles of a and is of cycle type pmp. It is well known that Z = Zj x Z2 x • • •, where Zp is the centralizer of op in 8pmp. In part (a), we have constructed an idempotent ep of KZp which is one-dimensional (i.e. dim(KZpep) = 1) such that the characteristic of the representation of Spmp on KSpmpep is hmp°lp. Using Lemma 8.25, we construct a one-dimensional idempotent e of KZ, such that ch(e) = ПР ch(ep) = fL This implies the theorem, by Lemma 8.4(ii), Theorem 8.23, and the fact that representations of Sn are equivalent if and only if they have the same characteristic. □ 8.6 APPENDIX 8.6.1 Multiplicities Denote by aA the multiplicity of the irreducible character /' of Sn in the nth Lie character. Then aA is by Corollary 8.10 equal to the number of standard tableaux of shape z and major index congruent to 1 mod n. For
8.6 Appendix 207 practical computations, one can use the formula *a = - Z (8.6.1) П d | n where %* is the value of the irreducible character /z at an element of cycle type p. This formula is obtained by expressing the symmetric function ln in terms of the Schur functions: ln = E aASA> |A| =n and by using the identities Рд = I X^A A and (8.2.1). Thrall (1942) and Brandt (1944) already give tables for these multiplicities, up to n = 10. Davis (1958) gives explicit formulas for ал when z has at at most two parts. In order to use formula (8.6.1), Foulkes (1959) gives a rule to compute the numbers derived from the Jacobi-Trudi identity: ^A ^®l(^Ai-i + j)l < i, j< n’ which expresses in terms of the complete symmetric functions hk. For d a divisor of n, replace in the above determinant each hk by l/(k/d)! if d divides k, and by 0 otherwise; then x'd„d is (n/d)l multiplied by this new determinant. When n is prime, then ал = ^(/t„ — Xa<">))- Since /(zn) is always equal to 1, 0, or — 1, we have that ал is the nearest integer to (n'1/^) (Brandt 1944), i.e. the order of the irreducible representation corresponding to z, divided by n. Table 8.1 gives the multiplicities of the irreducible representation of Sn in the nth Lie representation, for n = 1-7; they were computed by Thrall (1942) and Brandt (1944). We also indicate the order of the irreducible representa- tion corresponding to the partition z. 8.6.2 Lie invariants Let the alphabet A have q elements, and let V — \aeA Ka. The group GL( V) acts on К<Л> (see Section 8.1). A Lie invariant is a nonzero homogeneous Lie polynomial which is left invariant, up to a scalar, under this action. For example, with A = {a, b}, the Lie polynomials [a, b] and [[a, [a, />]], [6, [a, />]]]
208 8 The action of the symmetric group Table 8.1 Multiplicities of the irreducible represen- tation of Sn in the nth Lie representation, for n = 1-7 (after Thrall (1942) and Brandt (1944)). z Л Lie A A Lie 1 1 1 32 5 1 2 1 0 321 16 3 I2 1 1 313 10 1 3 1 0 23 5 0 21 2 1 2212 9 2 I3 1 0 214 5 1 4 1 0 I6 1 0 31 3 1 7 1 0 22 2 0 61 6 1 212 3 1 52 14 2 I4 1 0 512 15 2 5 1 0 43 14 2 41 4 1 421 35 5 32 5 1 413 20 3 312 6 1 321 21 3 221 5 1 322 21 3 213 4 1 3212 35 5 I5 1 0 314 15 2 6 1 0 231 14 2 51 5 1 2213 14 2 42 9 1 215 6 1 412 10 2 I7 1 0 are Lie invariants (Magnus 1940). With A = {a, b, c}, the Lie polynomial [[[[a, bf [a, c]], [[a, b], [b, c]]], c] + [[[[/>, c], [b, a]], [[/>, c], [c, a]]], a] + [c,b]], [[c, a][a,b]]], b] is a Lie invariant, indicated by Wever (1949). He observed also that the degree d of a Lie invariant must be a multiple of q, and that there are no Lie invariants in the following cases: q = 2, d = 4; q = 3, d = 6; d = q > 3. He gave a formula for the number of invariants (counted up to a scalar factor) of a given degree, for q = 2. Burrow (1958) constructs a Lie invariant for any q > 4 and any degree mq. m > 2. The representation theory of the linear group shows that the number of Lie invariants of degree d = mq is equal to the multiplicity, in the Lie representation of degree d, of the irreducible representation of the symmetric group Smq corresponding to the partition m4: by Corollary 8.10 this dimension is therefore equal to the number of standard tableaux of shape mq and major
8.6 Appendix 209 index congruent to 1 mod d. In particular, Theorem 8.12 shows that there is always a Lie invariant, except in the cases indicated by Wever. 8.6.3 Conjugation on circular permutations Conjugation on the set of circular permutations defines a representation of Sn. In this action, the stabilizer of a single circular permutation c is the subgroup C generated by c. Hence, this representation is equivalent by the one induced from the trivial representation on C. By Theorems 8.9 and 8.8, we deduce that the multiplicity of the irreducible character in this representation is equal to the number of standard tableaux of shape z and major index congruent to 0 mod n. 8.6.4 Lie orthogonal idempotent By Theorem 3.1 (iv), Q<A> is the direct sum Q<A> = ^(A) ф S, where S is the subspace generated by 1 and the shuffle products и ш v, и, v nonempty words. Let ti: Q<4> -» ^(A) be the projection corresponding to this direct sum. Then n commutes with the homogeneous algebra endomorphisms of K<A>, because &(A) and S are closed under these endomorphisms. For each n > 1, we thus have by Lemma 8.15 that %(12 ... n) is a Lie idempotent (this observation is due to Garsia). Some results on this idempotent are given by Duchamp (1991); he shows in particular that the set of polynomials of the form и ш nv, uv e и 1 (uv denotes here the concatenation of the words и and r), is a basis of the multilinear part of degree n of S. More general idempotents may be obtained by using the direct sum of Section 6.5.1. 8.6.5 On the idempotence of 0„ The fact that the element 0„ in (8.4.2) is idempotent (which is essentially equivalent to Theorem 1.4(v)) is proved directly by Garsia (1990) as follows: 0„ is a Lie element, hence by Lemma 8.18 and (8.1.4), we have 0nDs = ( — l)|s|0„, for any S c {1,..., n — 1}. Since e. = "f <-')%......M, Jk = O we deduce e.».= - <-!>*<-1)4 = e„. nk = 0 A direct proof of the idempotence of 0n/n is given as follows by Wever (1949). Let yn be the cycle (n ... 21). Then he shows, by the use of the Jacobi identity,
8 The action of the symmetric group 210 that 0п + еп.1Уп0п = 0, (8.6.2) where is as usual embedded in Sn. Moreover by (8.4.2) on-1on = o2„-1^-yn) = (n — 1)0„_1(1 — yn) by induction = (n-l)0„. (8.6.3) Thus, в2п=0п-1в-Уп)вп by (8.4.2) = вп-1®п - 0п-1Уп®п = (n - 1)0„ + 0n by (8.6.2) and (8.6.3) = n0n. Another direct proof is given by Specht (1948), who actually proves a particular case of Lemma 8.18 (see his eqn (24), p. 375). 8.6.6 An idempotent for the canonical decomposition Let n = mp and let A be the partition pm of n. Let Sm[Sp] denote the wreath product of Sm by Sp: it is the subgroup of Smp normalizing its subgroup Sp = Sp x • • • x Sp, naturally embedded in Smp. Equivalently, Sm[Sp] is the set of permutations which leaves invariant the partition (8.5.8); hence, Sm [Sp] is the set of permutations (a; u19..., um), a e Sm, ui9..., um e Sp, with the notation of Lemma 8.26. Let g = -— У wrnaj('T)cr, Pmm! aeSm[Sp] where w is a primitive pth root of unity and maj(a) the major index of a. Then g is an idempotent for If (i.e. If = and g is idempotent). Indeed, let e = — X <*,/ = - S comaj(u)u. ml (zeSm P ueSp Then maj(cr; ub ..., u„) = maj(ux) + • • • + maj(u„) mod p, because the per- mutation (ст; ux,..., u„), viewed as a word, is the concatenation of words of length p, each one of the form к + u,(l), к + u;(2),..., к + u:(p), for some i and k. This implies that g = e° f is idempotent and that
8.6 Appendix 211 ch(g) = ch(e) ° ch(f) (Lemma 8.26). Hence, the representation of Sn on the multilinear part of Ux is equivalent to that on the left ideal KSng, by Lemma 8.4(ii), Theorem 8.3, Theorem 8.23, and the fact that ch(e) = hm, ch(f) = lp, because f is the Lie idempotent кр of Theorem 8.17. Actually, K(A\g is equal to Ux: indeed, let cp: denote the algebra endomorphism of K<A> sending the letter j onto j + pi. Denote by f: the element Ф ,(./). Then it is easily verified that which implies that g is in U,. hence K(A\g = U, . by equality of dimensions. 8.6.7 Generalized Jacobi identities The left to right bracketing of words defines a linear mapping /: KSn -> KSn, defined by /(al ... an) = [.. .[al, a 2],..., an], for any permutation a in Sn (considered as a word). The kernel of I is clearly a left ideal of KSn. Define elements Oj(j = 2,..., n) of KSn by 0j= 12 ... n + J/(12 ... (J — 1))G+ l)...n. Then Ker I admits as a basis the (n — l)(n — 1)! elements aOj, 2 <j < n, a e Sn, aj = 1, where the product means the product in KSn (Blessenohl and Laue 1988). Indeed, recall Baker’s identity l(Pl(Q)) = [l(P)> K2)J> f°r апУ polynomials P, Q (see Section 1.6.6). It implies that 12... J + .//(12 .. .j — 1) is in the kernel of /, hence Oj is also, because Ker I is a right ideal in the concatenation algebra. Hence, the (n — l)(n — 1)! indicated elements all lie in Ker /. Observe that aOj is of the form a + an element in KT, with T = {a e Sn, al = 1}. Thus, each element of KSn is congruent to an element of К Г mod Ker /. But the (n — 1)! elements 1(a), aeT, are linearly independent mod Ker / (see Section 5.6.2), which concludes the proof. Examples of elements in Ker I are 12 + 21, 123 + 231 + 312, 1234 + 2143 + 3412 + 4321. The two first ones are the defining identities of Lie algebras (antisymmetry and Jacobi identity), while the third was observed by Wever (1949 (9a)); he also observed that the sum of all permutations in Sn(n > 2), and the sum of all even permutations (n > 3), are both elements of Ker / (Wever 1949, Chapter 1). 8.6.8 Subspaces Let Г be a complete binary, rooted, planary, unlabelled tree. Each labelling in A of the leaves of T defines a tree t in the free magma M(A) on A (see
212 8 The action of the symmetric group Fig. 8.1 The left comb. Section 0.2), hence a Lie polynomial in ^(A), by intepreting each internal node of t as the Lie bracketing. The linear span of all these Lie polynomials is a subspace of ^T(A), stable under the action of the linear group (see Section 8.1). Denote by VT the multilinear part of this subspace: the symmetric group Sn (n = number of leaves of Г) acts on VT, as observed by Barcelo and Sundaram (1992). They show that VT is the whole multilinear part of degree n of T£{A) if and only if T is the ‘left comb’ (see Fig. 8.1) or a tree obtained from it by reflections around the nodes: the if part follows from Jacobi’s identity, or from Theorem 1.4(v); the only if part follows from Theorem 5.7 because if T is not of the indicated form, then VT J271 = [J?(A), 2T(A)], and ^f(A)\^f1 contains polynomials of any degree n > 1, by the theorem. Among other results, they also show that if S, T are trees and S[T] is the tree obtained by replacing each leaf of S by T, then the representation VS[Ti is the wreath product of the representations Ts and VT; in other words, it corresponds, via the characteristic map, to the plethysm of the characteristic of VT into that of Vs. Moreover, they call generator of VT the Lie polynomial obtained by labelling T with 1, 2,..., n from left to right, and denote it by wr; they show that if ws, wr are both idempotents up to a scalar factor (e.g. if S is the left comb), then so is ws[r]. Sundaram (1992) continues the study of the subspaces VT. She shows that if Г = (Tj, T2), where TJ is the left comb with n; leaves, then the representation VT is equivalent to the external tensor product of the representations VTl and VT1 if and only if none of no n2 divides the other. Let S be any tree and define the tree S' be as in Fig. 8.2. Sundaram studies properties of restriction and induction of the representation Vs„. In particular, if S = ТЦТ], where T, U are left combs with respectively m, n leaves, and m > 2 then Т/ |Smn+n ____ fiz I C \ fSmn + и- 1 FSn iSmn+n- 1 — V i лтл+л-2>> I For an extension of this study, see Calderbank et al. (1992).
8.6 Appendix 213 8.6.9 A problem of ^-enumeration In the proof of Theorem 8.17 we established that KSne^KSnKn for any Lie idempotent e. For the reverse inclusion, we have used a dimension argument. As shown by Bergeron et al. (1988), this inclusion leads to a problem of enumeration of permutations by major index. Indeed, by Theorem 1.4(iii), it is enough to show that д(кп) = кп® 1 + 1 ® кп. By eqn (1.4.1), we have д(Кп) = E (К'л’ u ш r)M ® v- U. V Hence, we have to show that for any nonempty words u, v with n = |u| + |r|, one has £ оГа](») = 0 (864) But it is known that, for an indeterminate q, maj(vv) weu lu v maj(u) + maj(v) Y я = Q where [] denotes the ^-multinomial coefficient: = п!ч/р1ч(п — p)!q, with nlq = 1(1 +<?)... (1 + q + • • • + qn~J); see Garsia and Gessel (1979). Thus, (8.6.4) easily follows. 8.6.10 Conjugation The symmetric function ln is invariant under the automorphism co of the ring of symmetric functions if and only if n is not the double of an odd number (Foulkes 1959). Recall that ш(рк) = ( — V)k~lpk, and that co sends a Schur function onto the Schur function of conjugate shape (see Macdonald 1979). The first equality, together with eqn (8.2.1), easily implies the above assertion.
214 8 The action of the symmetric group 8.6.11 Representation on the canonical projections The symmetric group Sn acts on the multilinear part of degree n of the space Uk, generated by the /с th powers of Lie polynomials (see Section 3.2). Since Uk= © U„ ia>=k Theorem 8.24 implies that the order of this representation is the Stirling number of the first kind s(n, k) (Reutenauer 1986b). Denote by fn k the characteristic of this representation. Then one has the following generating series for the fn k: E fn.ktk = expf e E n, к \i> 1 d\n ГН / (Hanlon 1990). This may be established by using Theorem 8.23, which implies that E fnjf = f E M') ° f E - vW1/) , \ / \d | n П / and by using the identity EV = exp( £ yf'pj, \i > 1 I / and the definition of the plethysm in terms of the power sum symmetric functions. The representation of the symmetric group on the multilinear part of the space F = ©klf, where the sum is extended to all partitions whose parts are 1 or 2, contains each irreducible representation exactly once; this is because the generating function of F is by Theorem 8.23 equal to n — n - . a 1 - a a<b 1 - ab which is by an identity of Littlewood equal to the sum of all Schur functions (see Macdonald 1979, p. 45). The same property holds for the shuffle subalgebra generated by the words of length 1 and 2. Consider the space Gk = ©; If, where the sum is over all partitions whose parts are divisible by k; denote by ynk the corresponding character of Sn. Then a result of Scharf (1991) states that xn/[(a) is equal to the number of solutions in Sn of the equation a* = a; this proves a conjecture of Kerber stating that this central function is actually a character.
8.7 Notes 215 8.6.12 Plethysm and derived series Let (^f") denote the derived series of the free Lie algebra (see Section 5.3). Denote by dnk the characteristic of the action of the symmetric group Sk on the quotient ^nl^n+i, and by dn the sum ^kdnk. Then, a consequence of Theorem 5.7 is that d{ is the following sum of Schur functions: = sl,n- 1 • n > 2 Furthermore, dn = d1n —d1o- -od1 (n times), and '= Y ',= Y *" n > 1 n > 1 For these results, see Reutenauer (1990). 8.7 NOTES The first study of the free Lie algebra in connection with representation theory (of the linear group) seems to be a paper of Thrall (1942). Formula (8.2.1) giving the character of the Lie action appears in Brandt (1944), although it is already implicit in the Witt formula (1937). For the second proof of Theorem 8.3 we have used an argument similar to that of Lehrer and Solomon (1986) in the proof of their Theorem (3.9). Another proof of Theorem 8.3 is given by Joyal (1986), as an application of his theory of analytic functors and their indicator series. Corollary 8.7 is from Klyachko (1974); see also Blessenohl and Laue (1989). Theorem 8.8 and 8.9 and Corollary 8.10 are due to Kraskiewicz and Weyman (1987); for the proofs, we have followed Garsia (1990), who himself uses ideas of Stanley and of Stembridge (1989). Theorem 8.12 is due to Klyachko (1974). Theorem 8.14 is due to Garsia (1990); in the case of the idempotent (8.4.2), it was also obtained by Blessenohl and Laue (1990a, 1991). The idempotent (8.4.2) was introduced by Dynkin (1947, 1949), Specht (1948), and Wever (1949); the first equality in (8.4.2) is due to the latter two authors, while the second equality is due to Blessenohl and Laue (1991), and to N. Bergeron (see Garsia 1990, Lemma 1.1): the element (8.4.3) is from Solomon (1968b); see also Reutenauer (1986b) and Helmstetter (1989). The element кп is the idempotent of Klyachko (1974), to whom are due Theorem 8.17, Lemma 8.19, and Corollary 8.20. Lemma 8.18 is from Garsia (1990). Theorem 8.21 is from Bergeron et al. (1988), who also proved Theorems 8.23 and 8.24 (the latter was conjectured by Stanley).
216 8 The action of the symmetric group It is interesting to note that the representation of Sn which is conjugate to the nth Lie representation appears at various places: the top homology of the lattice of (set) partitions of {1,..., n} (Hanlon 1981; Stanley 1982); the Stanley-Reisner ring of this lattice (Garsia and Stanton 1984); the Orlik- Solomon algebra of this lattice (Lehrer and Solomon 1986). Barcelo (1990) gives a direct bijective construction between these representations; see also Barcelo and Bergeron (1990); for similar work on the group Bn, see Bergeron (1991).
9 The Solomon descent algebra The convolution subalgebra generated by the projections of the free associa- tive algebra arising from its graduation is also an algebra under composition, and is canonically isomorphic with a subalgebra of the symmetric group algebra, called the descent algebra; the latter was introduced by Solomon for each finite Coxeter group. The canonical projections of the free associa- tive algebra, arising from its structure of enveloping algebra of the free Lie algebra, correspond to the primitive idempotents of the descent algebra; this is established in Section 9.2, where some insight into the structure of this algebra is also given. In the next section, we study various homomorphisms of this algebra: one is a mapping into the ring of symmetric functions, and its kernel is the radical of the descent algebra; another maps each descent algebra into another one, and is a derivation with respect to the convolution product. Elements of the descent algebra are also characterized, in the symmetric group algebra, by their action on Lie monomials. In the final section, we introduce quasisymmetric functions, which are closely related to the descent algebra, and we give an application to the enumeration of permutations. 9.1 THE DESCENT ALGEBRA Let A be an alphabet. Recall that Q<?4> is a graded Q-algebra, and denote by <□</!>„ the space of homogeneous polynomials of degree n. We thus have <□</!> = © Q<4>„. (9.1.1) n > 0 Corresponding to this direct sum, we have for each n a projection д„:О<Л> -» <□</!>, which is the identity on Q</4>„ and which sends Q<?4>m, т Ф n, to 0. Recall that in Section 1.5 we have defined an associative product on Endo(Q</l», the convolution, denoted by ♦. Let Г denote the convolution subalgebra of End(Q</4>) generated by the qn; that is, Г is the linear span of the elements
218 9 The Solomon descent algebra ♦ • • • ♦ Qiky h,. • •, > 0. Note that the identity is not in Г. We may write Г = © Г„, n > 0 where Г„ is the linear span of the qix ♦ • • • ♦ qik with + • • • + ik = n: indeed, such an endomorphism acts on homogeneous polynomials of degree n, and sends the others to 0 (see the definition of the convolution). A pseudocomposition of n > 0 is a /с-tuple H = (/ij,..., hk) of natural integers whose weight \H\ = hx + • • • + hk is equal to n; the length of the pseudo-composition is 1(H) = k. A composition of n is a pseudocomposition without zeros. To each pseudocomposition H is naturally associated the composition C(H) obtained by removing the zeros in H. Given a pseudo-composition H as above, we denote by qH the endo- morphism of Q<A> defined by Qh = Qhl*' ’ '*Qhk- (9.1.2) Observe that q0 is the endomorphism e of Section 1.5, which sends each polynomial to its constant term. Hence, q0 is the neutral element for the convolution, by Proposition 1.10. This implies that for each pseudo- composition H, one has Qh = Qc(H)- (9.1.3) For example, g(3 j 0 2 0) = q(3 1>2). Given a matrix M = (m:j) (with 1 < i <, k, 1 < j < p) of natural integers, its row sum is the pseudocomposition (mu + m12 + • • • + mlp, m21 + m22 + • • • + m2p,..., mkl + mk2 + • • • + mkp). Similarly, its column sum is the pseudocomposition (mM + m2 !-)-••• + mtl,m12 + m22 + • • • + mk2,... ,mlp + m2p + • • • + mkp). The pseudocomposition associated with M is H(M) = (mir,ml2,... ,mlp,m21,m22,.. .,m2p,... ,mkl,mk2,... ,mkp). In other words, H(M) is the pseudocomposition obtained by reading the entries of M, row by row. Example 9.1 1 1 M = 0 3\ 2 0/
9.1 The descent algebra 219 The row sum of M is (4, 3), its column sum is (2, 2, 3), and its associated pseudocomposition is (1,0, 3, 1, 2,0). Theorem 9.2 Let H, L be two pseudocompositions. Then the composition of the endomorphisms qH and qL is given by Ян° Ql = Y Яшм), where the sum is extended to all matrices M whose row sum and column sum are respectively H and L. Example 9.3 Let H = (2, 2), К = (3, 1). Then the possible matrices are o) and (f O) Thus, 2)° <?<з, i) = <?<i.1,2,0) + 9<2,o,i,i) = <7(i. 1,2) + ^(2,1.1) by the theorem and (9.1.3). The proof of Theorem 9.2 requires several lemmas. Recall that the k-fold coproduct bk has been defined in Section 1.4. Lemma 9.4 One has the formula dk°qi= E (4mi ®-• ® mi + ’ • • + mu = I Proof The lemma simply expresses the fact that is degree preserving, which is a consequence of eqn (1.4.1). □ Recall that conct: -> K<4> denotes the к-fold concatenation pro- duct Pj ® ® Pk i-> Pj ... Pk (see Section 1.5). If (il,...,ik) denotes a sequence of positive integers, all < N, we denote cone,...K<4>®" -а К<л>, P, ® • ® P„ P,,... P,K. Lemma 93 <Vconcp = [conclit + 1.(P-I)t + 1 ® • • -®conct.2t.рк]Ь^р. Proof We have, because is a concatenation homomorphism, .cone,!?, ® •• ® P,) = <>,(?,... P,) = «.(PJ... Й.(РГ) = ( E (Pl, Un ш-••-®и1Л L.. \U1 Ulk / X( E (pp’upl ш’ • ’ш MPJWP1 ® • • -® “pk ) \Upl....Upk /
9 The Solomon descent algebra 220 by Proposition 1.8. Hence, «VconCpCPj ® • • - ® Pp) = £ (Pl, «11 ш • • • Ш Ulk) . . . (Pp, Upl Ш • • • Ш upk) Ulj X (uu ...Wp1)®-- ®(wlt...Wpt). On the other hand, we have (<5fc)®P(P! ® • • • ® Pp) = <5t(Pi) ® • • • ® <5t(Pp) = ( E (Л, ш-• • ш рц)рп ® • • -® vlk I ® • • • El’ll.I’lk / L (Pr. ">,1 ш - • - ш vpk)vpl ® • • -® vpk I. \Vpl, Vpk / Thus, [conclit + 1...(p-in+1 ® • • -® conct>2....pt] ° (<5t)®p(Pi ® • • • ® Pp) = E (Л, fl 1 Ш • • • Ш vlk)... (Pp, upl ш • • • ш vpk)(vl! ... rpl) ® • • • ®(vlk...Vpk), which proves the lemma. □ Lemma 9.6 We have for natural integers f,..., iN, j\,..., jN, (qit ® ® qij (qjt ®---®qjJ = \qit ®---®qiv if 0’i, ...JN) = (h, Jn), (0 otherwise. Proof This is evident. □ Lemma 9.7 3fp -3P = 3kp. Proof This follows from Proposition 1.8 and the associativity of the shuffle product. □ For a in SN, we also denote by a the mapping Q<A>® f Pam ® • • • ® Pa(N}. Lemma 9.8 conc(<T1 aN) = concN a. □ Proof This is clear.
9.1 The descent algebra 221 Lemma 9.9 For endomorphisms j\,..., fN of Q</1) and a in SN, one has оc (Л ® • • • ® fN) = (Li ® • • • ® Ln) ° °- Proof Indeed f °(f, ® • • -®АХЛ ® -® PM) = a(fAPi)® ® fM) and (Ai ®- ®/<,»)«»№ ® ® P„) = (Z,i®- ® /.„MP., ®-"® P,x) Lemma 9.10 For a in SN, one has OcdN = <>N- Proof By Proposition 1.8, we have <т<5л(Р) = crl £ (P, иг ш • • • ш uN)ul ® • • -® uN \ui u \ e Л* = £ (P, Uj ш • • • ш uN)ual ® • • • ® uaN = E (Л Utfi Ш • • • Ш ® • • • ® uaN (because the shuffle product is commutative) = E (P, 1’1 Ш- • • ш VN)Vi ® • • -® vN = dN(P). □ vi..... r,x e A* Proof of Theorem 9.2 Let H = (hr,..., hk), L = (/p..., lp). Then by (9.1.2) and (1.5.7) qH-qK = conct (qhi ® • • -® qhk)' dk concp (qh ® • • -® qlp) dp = comv(ghl ® • • -® qhk) (conclit + 1.(р-1)Л + 1 ® • • -® conct 2t pk) ^®^(qh®---®qip)6p (by Lemma 9.5)
222 9 The Solomon descent algebra = conct о E (сопс1Д + 1 4- n i p = hi (p- 1)4 + 1 ® ’ ’ ’ ® COnCt,2t, ...,pk) nk i + + nk p — hk 4Qnil®Qn2l®---®Qnkt®Qni2®---®Qnkp) ° E 4m,, ®4m2, ® • • -® 4mk, ®4m,2® ' • ’® Qmkp _mi i + - - • + mki = li _ TH 1 p “I- ’ 4“ MJc p Ip ° »?- ° 6P (because multiplication preserves the degree, and by Lemma 9.4) = conck ° E (conc1Jt + 1..............(p-ixt + 1 ® • • -® concti2t.pt) mj i 4- • • • 4- mj p = hi mk i + 4- mu p = hk mi i 4- • • • 4-mki = h m i p 4“ 4" mkp Ip °(4m,,® 4m2,® • • - ® 4mkpWkP (by Lemmas 9.6 and 9.7) = COnC1<t + 1..(p-1)(t+l...k, 2k....pk°l E 4m,, ® Qm2t ® • • -®Qmkp )°^kp (by associativity of the concatenation product) = concpk°<7 J £ qmii ®qm2x ®- • ® qmkp b<5tp \mu / (by Lemma 9.8, with о = l[/c + 1] ... [(p — l)/c + 1] ... /c[2/c] ... [p/c]) = СОПСрк J E 4m,, ®4m,2®’ ’ ’® 4m,p®421 ®’ ’ ’®4ткр )0<7°<5tp (by Lemma 9.9) = concpt э(е qmtt ® 4m,2®- • -® 4mk₽y tkp (by Lemma 9.10). □ Corollary 9.11 The convolution subalgebra Г of End(Q></l» generated by the qn (respectively the subspace Г„ generated by the qH with \H\ = n) is closed under composition. Note that Г„ has a unit element, namely qn, and that Г = ®„>0 Г„, as algebra (under composition). Given a permutation a in S„, its descent
223 9.1 The descent algebra composition is the composition of n C(ct) = such that when viewed as a word, о = иг ... uk where each word is increasing and of length c,, and к is minimal. For example, C(21534) = (1, 2, 2). In other words, the descent set of о is S(C(a)), where С h-► S(C) is the canonical bijection between compositions of n and subsets of {1, 2,..., n — 1} given by (Ci, . . . , Ck) —► {c15 Cj + c2,..., C‘i + c2 + • • + ck_ j}. Recall that for a subset S of {1, 2,..., n — 1}, we denote by (respectively Ds) the sum in <Q>S„ of all permutations whose descent set is contained in S (respectively is equal to S). The linear span in <Q>S„ of the 2n l elements Ds is a subspace, freely generated by them; it is equal to the linear span of the elements Ds. because of the triangular relations = £ DT. T^S Denote this subspace of QS„ by If C is the composition corresponding to S, we write Ac = DsS. Recall that we have defined a right action of <0>S„ on the space of homogeneous polynomials of degree n (see Section 8.1). Corollary 9.12 The subspace of <0>S„ is a subalgebra of QS„. If |Л| > n, then the linear mapping £,^r„ (9.1.4) for any composition C of n, is an anti-isomorphism of algebras. If {1, 2,..., n} A, then the reverse mapping is given bvf*-+ f(12 ... n), for any f in Гп. Proof By Lemma 3.13(H), we have for any homogeneous polynomial of degree n, P\c = qc(P), (9.1.5) where we use the right action of <Q>S„ on <□</!>„. The linear mapping QS„ -+ Endo(Q</l>n), x^(P^ Px), in an anti-homomorphism of algebras, which is injective if HI > n. Since T„ is, by Corollary 9.11, a subalgebra of End(Q</l», we deduce that is a subalgebra of <Q>S„ and that (9.1.4) is an anti-isomorphism if |Л| > n.
224 9 The Solomon descent algebra Suppose that A contains {1,..., n}. Then, by (8.1.4), we have (12... n) Дс = Дс, once more viewing each permutation as a word. This proves the last assertion. □ Denote by the subspace &(£„) of where в is the anti-auto- morphism of <0>S„ sending each permutation on its inverse. By Lemma 3.13(i), is linearly generated by the elements Uj ш-• ш for all possible concatenation decompositions 12 ... n = щ .. .uk. Corollary 9.13 The subspace of Q>S„ is a subalgebra, isomorphic with T„. Corollary 9.14 If A is infinite, then the convolution algebra Г is freely generated by the qn, n> 1. Proof By Corollary 9.12 the elements qc are linearly independent, because so are the Дс. □ 9.2 IDEMPOTENTS By Lemma 8.22, we have a direct sum decomposition Q<4> = © If, (9.2.1) where the sum is over all partitions z, and where, for z = (z15..., Ak), If is linearly spanned by the polynomials (P], . . . , Pt) = ~ У Pa(l) • • Pa(k)’ Kl aeSk for each choice of homogeneous Lie polynomials of respective degree z15..., zt. We denote by л/ <□</!> -*•<□</!> the projection onto If, cor- responding to this direct sum. Let T = {f1512,..., tn,...} be a new alphabet, and x, x15 x2,. .., x„,... central variables. To each word w in T* is naturally associated a composition C(w); we therefore denote by /(w) and |w| the length and the weight of C(w), respectively. As an example, w = t2t2t3, C(w) = (2. 1, 1. 3), and l(w) = 4, |w| = 7. We define elements Mn and Kk of Q<T> by the following generating series: У xnM„ = log( 1 + txx + t2x2 -I----1- t„xn -I-). (9.2.2) n > 1 У xkKk = expCxjMj -I- x2M2 + + xnMn + f. (9.2.3)
9.2 Idempotents 225 where for z = Г'2"2..., we denote xA = x'px"2 • • • • Observe that K{n} = Mn. Example 9.15 Mx = M2 = t2 — jtf M3 = t3 — |гхг2 — 2t2t! 4- 3tf and K(3) = M3, K(21) = 2MrM2 + \М2МХ = + 2^2^ 1 — 2G5 ^(1’) = 6-^1 = 1,3 6‘ 1 • We define an algebra homomorphism from Q< T> with concatenation into Г with convolution: С-О<Г>^Г, Ш = Thus, we have С(гп ... tik) = q(l * -*qik, or equivalently C(w) = qC(w), for any word w in T*. Note that C is surjective, and is an isomorphism if A is infinite, by Corollary 9.14. Theorem 9.16 The projections лА belong to Г, and are given by = (9.2.4) The projections are mutually orthogonal idempotents of the composition algebra Г, and their sum for |z| = n is the identity ofTn. We need the following result, where 2T(A)l denotes the space of homogeneous Lie polynomials of degree I. Lemma 9.17 Let л = (zh ..., zt) be a partition and let r15... ,rk be endo- morphisms of К (A) such that 1т(г,) 2T(A)Xi. Then the image of r — '£taesk ral *• • -*rak is contained in If. Proof We have by (1.5.7) and by Proposition 1.8 for any polynomial P r(P)= £ conck (rffl ® • • -® rak) bk(P) <reSk = £ COnCj'(/‘(T1 ® • • -® r^)! £ (P,Ul Ш- • -Ш ujuj ® • • -® uk aeSk \ui....ui< = £ £ (P, Uk ш---шик)га1(и1)...гак(ик). aeSk u 1.Ur Since the shuffle product is commutative, the products rh(ujt)... rik(ujk) corresponding to the same multiset [uh,...,ujk} all appear with the same coefficient in this sum. Hence, this sum is a linear combination of (r^u J,..., rk(uk)), and therefore in L/z, by hypothesis. □ Proof of Theorem 9.16 With the notation of Section 3.2, we have Uk = and therefore nk = £/(А)=л ял. In particular, щ = £„> j я(п).
226 9 The Solomon descent algebra Furthermore, л(п) = q„ ° because £ &(A)„. Thus, by (3.2.3), for any word w (-I/-1 „ Ww) = E —7— E <w’ w*)wi • • • uk k> 1 К U1..........uke4 + |u 1 . • -Ufcl =n у (-1Г1 *>1 к У £ (w, Uj ш- • -шик) . , Uk^A* 7, • • ., ik > 1 11 + • • - + ik = n x сопс4 ° (qh ® • • • ® qik)(ul ® • • • ® uk) = E — E сопск°(дг1 ® • --® qik)°bk(w) к>1 К h............,\>l i i + + ik = n (—I?-1 = E 1—i— E (^h*-'-*^kXw)’ к > 1 К it.......ik > 1 by Proposition 1.8 and (1.5.7). On the other hand, by (9.2.2) hence, (__ I)*-1 C(M„)= £ —--------<?,,*• • -*qik. k, ii...... ik > 1 К ii + + ik = n Thus, 7t(n} = C(M„). In particular, Im(n(n}) £ <?(А)Й. By (8.2.3), we have that for z = (z15..., zt), is, except for a constant factor, equal to the sum of the product MX) ... MAk and its permutations. Applying C, we find that C(KJ is of the form indicated in Lemma 9.17, and we conclude that its image is in Ux. Now, by putting x = 1 in (9.2.2) and xt — 1 in (9.2.3), we find that Ел = 1 + ^ + t2 + ‘ ‘ ‘ • Applying C we deduce that C(K;) is the identity of K<A>. Thus, by (9.2.1), we deduce that C(KA) = The remaining assertions are immediate. □ Since the are idempotents which decompose the identity of the algebra under composition Г„(|2| = n), we have the direct sum Г„ = © (9.2.5) Ц| =|д| -n We shall compute the dimensions of these subspaces of Г„. For this, we decompose the space Q<T> into a direct sum, similarly to (9.2.1), but using weight instead of degree.
9.2 Idempotents 227 To each composition C (respectively to each word w in Г*), is naturally associated a partition, denoted by 2(C) (respectively 2(w)). If 2(C) = 2, we say that C is compatible with 2. Since a nonzero finely homogeneous polynomial P 6 Q<7> is a linear combination of words in T*, all associated to the same partition, we may denote by 2(P) this common partition. On the other hand, such a polynomial has a weight |P|, equal to the weight |z(P)| of the partition 2(P). For two partitions 2, p, define a subspace M of Q<T>: it is generated by the polynomials (9.2.6) where each Pj is a nonzero finely homogeneous Lie polynomial in ^f(P), with 2(Pi ... Plw) = 2 and /((IPJ,..., |P/(M)|)) = p. As an example, the subspace Из211 43 contains the polynomials ([tb t3], [г15г2]) and (t3, [ti,[ti, t2]]). We say that a partition 2 = (z15..., 2t) is finer than a partition p = (p^ ..., ph) if for some partition {1,..., к} = Ex u • • • u Eh, one has p{ = YsjeEi^j. We denote this by 2 > p. For example, 3211 >43. From the definition of the subspace д, it follows that В<_д/0=>2> д. (9.2.7) Let H be any Hall set in T* (see Chapter 4). Recall that each word w in T* has a unique decreasing factorization w = ... hk, H, hj > • • • > hk. (9.2.8) Besides the partition 2(w) associated with w, we associate with w another partition p(w), its type, which is the partition associated with the compositon (|/ii|,..., |hfc|), and which depends of course on H. As an example, H is the set of Lyndon words on T, naturally ordered. Then w = t3t2t4t3tit2 has the decreasing factorization w = (ti)(t2t4ti)(t1t2), and 2(w) = (4, 3, 3, 2, 2, 1), p(w) = (9, 3, 3). Let Qw denote the polynomial Qw = (Phi,..., Phk), where Ph is the Hall polynomial corresponding to the Hall word h; see Section 4.2. Lemma 9.18 The polynomials Qw, with 2(w) = 2 and p(w) = p,form a basis of . In particular, one has the direct sum Q<r>= © W^.
228 9 The Solomon descent algebra Proof It was shown in the proof of Lemma 8.22 that the Qw form a basis of Q<T>. Since each Hall polynomial Ph is finely homogeneous and has the same partial degrees as the Hall word h, we have z(Ph) = 2(h) and |Ph| = |h|; hence, X(Phi ... Phk) =/fhi .. .hk) = a(w) = л, and z((|Phl|,..., |PhJ)) = zOil,..., \hk I)) = g(w) = g. Thus, Qw e Now, each finely homogeneous polynomial is a linear combination of Hall polynomials having same partial degrees as it. Thus, by multilinearity and symmetry of the operator (,...,), we conclude that д is generated by the Qw with z(w) = z and g(w) = g. The last assertion follows immediately. □ In eqn (9.2.2), we have defined elements M„, n > 1, of Q<T>. They are given by Define a (concatenation) algebra endomorphism у of K(Ty by 7(t„) = M„,n> 1. (9.2.9) Then, since Mn = tn + a Q-linear combination of words involving only the Гг with i < n, we conclude that у is an automorphism of Q<T>. Define an algebra homomorphism if from Q<T> onto the convolution algebra Г by if = C ° 7- Observe that, since у is weight preserving, f maps Q<T>„ onto T„. Theorem 9.19 For any two partitions f, g of n, we have = СЖ.Д Before proving the theorem, we derive several consequences. Corollary 9.20 Let H be a Hall set in T*, let z, g be two partitions of n, and suppose that |4| > n. Then the dimension of лдГплА is equal to the number of words w in T* such that z(w) = z and g(w) = g. In particular, dim(nAr„nA) = 1. Proof We have seen in Section 9.1 that the space admits as a basis the elements Ac, C composition of n. Since the linear mapping Ac ь-► qc is by Corollary 9.12 an isomorphism from onto Tn, we conclude that the qc are linearly independent for |C| = n. Hence, the linear mapping C, when restricted to the space Q<T>„ of homogeneous polynomials of weight n in Q<T>, is an isomorphism onto
9.2 Idempotents 229 Г„. Since у is a weight-preserving automorphism of and С = C 7, we conclude that is an isomorphism Q<T>„ -»• Г„. Thus, the first assertion is a consequence of Theorem 9.19 and Lemma 9.18. Suppose now that for some word w in T*, we have z(w) = p(w) = A. Then with the notation of (9.2.8), ht must be a word of length 1 of each i; hence, if z = (Л15..., zfc), z15..., zfc > 1, we have by (9.2.8) that w is equal to the unique product of the rAt which is decreasing in the order of H. This shows that dim(B< A) = 1, which implies the last assertion by Theorem 9.19. □ Given a multiset of primitive necklaces M on the alphabet Г, we define z(M) and p(M), similarly to what we have done previously: each letter in M defines a part of A(Mf each necklace in M, which is the conjugation class of L,... tik, defines the part ц + • • • + ik of p(M). In view of Theorem 7.17, Corollary 9.20 implies the following result. Corollary 9.21 Let |Л| > n.) Let A, p be two partitions of n. Then the dimension of лдГп is equal to the number of multisets M of primitive necklaces on T such that A(M) = z and p(M) = p. Denote by p(n) the number of partitions of n and recall that the number of compositions of n is 2"-1. By (9.2.5), (9.2.7), and Theorem 9.19 we immediately have the following result. Corollary 9.22 (ii) (i) If A < p then лдГплА = 0. Tn © 71дГп71^. x > д |Л| = |д|=п Corollary 9.23 The radical of T„ is equal to ©л>д ядГ„ял. Z/MI n> then T„ is of dimension 2"-1 and its radical of dimension 2"-1 — p(ri). Proof Let R = ®л>д ЛдСЛл and S = ©лГл, where the sum is over all partitions of n. Since the are orthogonal idempotents, Corollary 9.22 implies that R is an ideal in T„ such that some power of R vanishes. Hence, R is contained in the radical of T„. Now S is a semisimple algebra, being isomorphic to some power of the field Q. By Corollary 9.22, the canonical projection Г„ -► S (according to the direct sum T„ = R © S, by (9.2.5)), is a homomorphism of algebra. Since its kernel is R, we deduce that Tn/R is semisimple, hence R contains the radical. Finally, R is the radical of T„. If |Л|^и, then T„ is of dimension 2"-1: indeed, so is having a basis indexed by subsets of {l,...,n — 1}; moreover, they are isomorphic by Corollary 9.12. Now 5 is of dimension p(n), hence the corollary follows. □ For the proof of Theorem 9.19, we need several lemmas.
230 9 The Solomon descent algebra Lemma 9.24 Let QY,... ,Qp be Lie polynomials and к > 1. Then •Wi. • •, e,)) = L <Q,.) ® • • ® (Q, where the summation is over all к-tuples (/b..., Ik) such that {1,..., p] = f u • • u Ik (disjoint union), with (в-) = ~ E ne.w. \1 |! aeS, iel Note that the last product is in the natural order of I. Proof We may consider Qi,...,Qp as noncommuting variables. The equality in the lemma is symmetric in QY,..., Qp, so it is enough to check taht the coefficients of Er ® ® Ek on both sides are equal, for each choice of a factorization . Qp = Ex . .. Ek. Now, at the left this coefficient is, by Proposition 1.8, equal to (p!)-1 multiplied by the number of permutations о e Sp which appear in the shuffle product Er ш • • -ш Ek; that is, by definition of the shuffle product £____P'_______ p!|Ei|!.7.|£fc|!’ On the right, the tensor Er ® ® Ek appears only once, namely in the term with Ij = set of indices of the Q{ appearing in Ej (and then |/y| = |£;|), with coefficient 1 1 |/il’''' 141?' This proves the lemma. □ Recall that to each composition C is canonically associated a partition x(C). For two compositions С, С', /.(C) = z(C') means that C' is a rearrangement of C. Lemma 9.25 Let Qi,... ,QP be homogeneous Lie polynomials in <Q><4> of degree dlf..., dp. Then for f,.. . ,ik> 1, the polynomial (n(il) * • • * 7t(ik)) ((0i, • • •, Qp)) is nonzero only if к = p and A((ir,..., ik)) = z((d15..., dk)). In this case (л(й> * *7I(ik))((Qi, • • , 0л)) = X Qsi • • Qsk- H....= l-si 5 k I VJ. d,j =ij
231 9.2 Idempotents Proof By (1.5.7) and Lemma 9.24, we have (7i(il) * • • • * ..., Qp)) = conck о ® ® n(ik)) ° <5fc((2b • • •, Qp)) = concfc ° (n(il) ® ® n(ik))^X (G/.) ® ® (Qik)^ = concfc^X 7i(il)((QIt)) ® • • • ® n(ik)((Q/k))^ where the summation is over all Ц,..., Ik as indicated in Lemma 9.24. Observe that n(0 is the projection onto U(i), according to the direct sum (9.2.1); hence, ti^R^ ..., Rt)) = 0 if I > 2 and if the Rj are Lie polynomials. Moreover, л(1)(К) = 0 if R is homogeneous of degee / i. Thus, if some term in the previous summation is nonzero, we must have V/ = 1,..., k, |/J = 1, and for Ij={l], deg(Q() = ij. This implies k = p and A((i\,..., ik)) = Ж-4))- The previous discussion also shows that the last formula holds, because we then have n(ij)((Qfj)) = Q,. □ We say an algebra homomorphism </r. <□<£> -> <Q><4> is weight preserving if for each i, (p(ti) is 0 or a homogeneous polynomial of degree i. Lemma 9.26 Let Л = Г‘2"2.. .be a partition, P be a finely homogeneous polynomial in Q<T>, of degree n( in each th Qt,... ,Qk be homogeneous Lie polynomials such that 2((deg Qr,..., deg Qk)) = A. Then C'(P)«ei,.. •, 2.» = E +<P(P), <9.2.10) <P where the sum is over weight-preserving algebra homomorphisms Q<T> -»• Q</1> such that cp(T) =^(Л), and which depend only on Qt,..., Qk. Proof Suppose first that P = t(1... tik. Then £'(P) = C0 У(Р) = C(Mh... Mik) = n(il) * • • • * n(ik), by (9.2.9) and (9.2.4), because KM = M„. By Lemma 9.25, we thus have c(PX(ei.- -.e»))=Ee.,- e». where the summation is over all permutations .s^ ... sk of 1 ... к such that V/, deg(Q;) = ij. Using inclusion-exclusion, this may be rewritten sf П ...>,.) \i>l / where the sum is over all £\, E2,... such that Et is a subset of {gy| deg Qj = i},
232 9 The Solomon descent algebra and where (p is the homomorphism Q><P> -»• Q><4> such that <p(L) = X<2e£. Q- Since such a <p is weight preserving, (9.2.10) follows in this case. The general case follows by linearity. □ Proof of Theorem 9.19 (a) Let Pe then P is a linear combination of L, • • 4 with 2((1\,..., ifc)) = z. Let Q e Uv; then Q is a linear combination of (Qb..., Qn), where the Qt are homogeneous Lie polynomials in ^(4), and 2((deg Qi,..., deg Qn)) = v. Since • • • tik) = я01)* • we deduce from Lemma 9.25 that C'(P)(Q) = 0, unless X = v. Now let Q any polynomial in Q<4>. By (9.2.1), Q = £v Qv, with Qv e Uv. By what we have just seen cw( e e,) = o, \v^z / hence, c(p)° ял(2) = c'oej = ewe). Thus, C'(P)° = C'(P), which implies that C'(P) e Г„лл. (b) Let P e д again. Let 2 = Г‘2"2... and p = pr ... pt. Then P is finely homogeneous of degree n( in each ti5 and also a linear combination of (Pb...,?,) for some finely homogeneous Lie polynomials in Q<T> with |Р;| = МЛ observe that if <p is any weight-preserving algebra homomorphism, then <p((Pi,..., P,)) = (<z>(Pi),..., </>CP/)) and deg(</>(^)) = Щ1 = if more- over </>(Л) c y7(7’), then each </>(Pf) is a homogeneous Lie polynomial of degree pt. Thus, </>(Р)е Ц,. Let Q be any polynomial in K<4>. By (a), we have C'(P)(Q) = C(P)° ttfiQ), hence we may suppose that Q is a linear combination of (Qb ..., Qk) with z((deg Qi,..., deg Qfc)) = z. This implies by Lemma 9.26 that C'(P)(Q) is a linear combination of <p(P), hence is in Lf. We deduce that ° C'(P)(Q) = C'C^XQ) for any Q in K<4>, hence С(Р) = я/'(Р)еядГ„. (c) By (a) and (b), С'(^,д) - ядГ„яА. Conversely, let f e лдГ„лА. Since £' is surjective, and by Lemma 9.18, we have f = ^v>aC(Pva), with Pva e Wvx. But we have seen that C(Pvat) e явГ„я„, and the sum (9.2.5) is direct. Thus, f = С(Р^) e C'( И;д), and we deduce лдГ„ = £'( Иу. □ The previous results may be interpreted in the descent algebra because of Corollary 9.12. Theorem 9.27 Let r: Q<T> = ©n>o he the linear isomorphism such that T(tit... tik) = Лс, C = (ii,. .., ik). Define E- = т(Кх), where K, is given by (9.2.3). (i) The elements Ex are, for |z| = n, orthogonal idempotents of sum 1 in the algebra
9.3 Homomorphisms 233 (ii) The linear mapping P i—► PEX, Q(Ajn -* 2<4>„ is the canonical projec- tion corresponding to the direct sum K<A\= © Ц. |A|=r> (iii) For |Л| = \p\ = n, the space Ex is equal to т°у(И^д), where is defined in (9.2.6) and у in (9.2.9). Its dimension is equal to the number of multisets M of primitive necklaces such that /.(M) = Л and p(M) = p. In particular, dim(£A Ef) = 1. (iv) The radical of^n is of dimension 2"-1 — p(n). Similar results hold of course for the algebra (cf. Corollary 9.13). 9.3 HOMOMORPHISMS Recall that A denotes the ring of symmetric functions over <□, and A„ the subspace of homogeneous symmetric functions of degree n. Recall also that A„ has another product, called inner product and which we denote by л, defined by (РлАл) л (pM/zM) = b^pjz^, (9.3.1) where zz = 1"'2"2... njnj!..., for z = 1"'2"2.... Theorem 9.28 Let |4| > 2. There exist, for |Л| = n, one-dimensional represen- tations (р/. Гп ->Q such that: (i) The linear mapping q>„: Г„ -+ A„ defined by ipn = Jw=lt is an homomorphism from Г„ onto A„ with inner tensor product, Ker (p is the radical ofTn and (p„(nw) = pjzv (ii) For any polynomial P in Ux, |2| = n and any element f in Гп, one has f(P) s Vl(f)P mod X 0.3.2) n< i. If p / 0, then (pfif) is completely determined by this equality. Proof By Corollary 9.20, we have dim(nAr„nA) = 1. Since nz is idempotent, we have = 0>ял. Define by = </>л(/)ял. Then is well defined, because / 0 (indeed If / 0, because |Л| > 2), and linear. More- over, by Corollary 9.22(ii), we have Гп © ttp Г, 71^. Ш = 1д1 =П
234 9 The Solomon descent algebra Hence, each f in T„ has an expansion f = with ядГплл. In particular, Then, if g is another element of Г„, we have f9 X Ад £ 9av = E a>v Л > д > v because the n/( are orthogonal idempotents. Thus Qfe^i-V У, fi.n9nv1 А>д >v and in particular (fg)u = f^g^. This is rewritten 4h(f9)^ = (pJJ^xcp^g^x = <Pi(f)4h(9)7h. This shows that <p; is an homomorphism, because лА / 0. (i) The multiplication rule (9.3.1) shows that <p is an homomorphism. By Corollaries 9.22 and 9.23, its kernel is the radical of T„. Since we deduce </>д(лл) = 6^ and </>„(ял) = pjz^. (ii) Since = 0 if 2 < p, we have, for P in If (thus лл(Р) = P), f(P) = MP) = X nJn^P) = £ д л. > д = 9>л(/)^(Л+ X = 4>Af)P mod X Цх- A > д д < А The last assertion follows from (9.2.1). □ With the notation of the theorem, define <p = © </>„: Г = © Г„ —> A = © A„. л > О n> 0 n > 0 Corollary 9.29 (|4| > 2). The linear mapping (p is an algebra homomorphism from Г with convolution onto A with the usual product. One has <P&M) = Pn/п, q>(q„) = hn. We first prove the following lemma. Lemma 9.30 Let PY,..., Pk be homogeneous Lie polynomials such that 2((deg P15..., deg Pk)) = L Then p1...pk = (p1,...,pt)mo<i £ i/,. д < Л Proof (induction on к = /(2)). If /(A) = 1, there is nothing to prove. Otherwise, the identity P^. = P^. + [^,PJ
9.3 Homomorphisms 235 shows that the products Pa(1)... Pa(k), ct e Sk, differ from Pj ... Pk only by a linear combination of products of homogeneous Lie polynomials Qi ... Qi such that 2((deg ..., deg Qt)) < 2. By induction, these products are in X/(< Цг Thus we have, for any a 6 Sk: Pi Pk = Л(1> • • • ^a(fc) mod X Ч- д < Л Summing over all a in Sk, we get the lemma. □ For a composition C = (i15 ..., ik), write тсс = n(h)* • • • * л(1к). Proof of Corollary 9.29 Let C be a composition such that 2(C) = 2. Let Q e Ц(. Then Lemma 9.25 implies that nc(2) = 0, unless p = 2. Hence, by (9.3.2), we have </>M(nc) = 0 for p / 2. Suppose now that Q e If and put 2 = Г’2"2.... Then Q is a linear combination of (Q15..., Qk), with the notation of Lemma 9.25. By the latter, we obtain that nc((2i, • •, Qk)) is equal to a sum of s = ... permutations of the product Qx ... Qk. Thus, by Lemma 9.30, we have , Qk)) = s(Qn • • •, 2fc)mod£M<A Ц, and we conclude that nc(Q) = SQ m°d £д<л Цх- This implies, by (9.3.2), that (p^(nc) = s. Thus, by Theorem 9.28, we have 4>„(nc) = spx/zx = njnj ... • • • 1"'2"2... = (Р1/1)П,(Р2/2Г. • • In particular, </>(я(в)) = p„/n and </>(n(i)) * • • • * n(ik)) = </>(n(il >)*•••* (p(it(M). Since the nc span Г, we conclude that ip is multiplicative. Now, by (9.2.2), we have x'hf = log(l + tjx + • • • + tnxn + •••). i> 1 Applying the homomorphism (p°f. A, and noticing that C(L) = Чь C(Mi) = 7i(i) by (9.2.4), we obtain X x‘Pi/i = X xi(P(^i)) = log(\ + (p(qf)x + • • • + (p(q„)xn + •••)• i > 1 i > 1 We deduce that X 4>(qn)xn = exp(£x'p,/i) = X hnx”, as is well known. Hence, cp(q„) = h„. □ Corollary 9.31 (|Л| > n). The radical of T„ is generated by the elements tic — 7tC' (respectively qc — qc ) with |C| = |C'| = n, 2(C) = 2(C')-
236 9 The Solomon descent algebra Proof We have nc — itc- (respectively qc - qcfe Ker </>, under the stated conditions, by Corollary 9.29. By Theorem 9.28(i), Ker is the radical of Г„. Moreover, it is of dimension 2"-1 — p(n), by Corollary 9.23. Since the elements nc (respectively qc) are linearly independent, there are 2" 1 — p(n) linearly independent elements among the elements лс — itc> (respectively Qc ~ Qc) satisfying the condition in the corollary. □ All the results we have stated so far for Г„ have an interpretation in the Solomon descent algebra S„, via the anti-isomorphism of Corollary 9.12. We leave this translation to the reader, and now give a result which is better formulated in terms of S„. For this, we call Lie monomial of type /. and degree n a product Pr ... Pk of homogeneous Lie polynomials in ^f(A) such that z((deg P15..., deg Pfc)) = z, and |Л| = deg(Pj ... Pk) = n. Theorem 9.32 (|И| > n). An element x in QSn is in if and only if for each Lie monomial Pr ... Pk of degree n, (PY ... Pk) x is a linear combination of ^a(l) • • • a G $k- The direct part may be proved by using Lemma 9.25. However, we give another method, interesting in itself. Lemma 9.33 Let C = (ji, , ji) be a composition of n and Px ... Pk be a Lie monomial of degree n. Then Р„)ЛС = qc(P, . .. P.) = X Ps, ... PS1, where the summation is over all partitions {1,..., k} = u • • • u S1 such that deg PSi = j(, with Ps = P|ieS Pi (product in the natural order of S). An example of such a formula is the following (where subscripts indicate degrees): (?1^262)^23 = P2P1Q2 + Q2P1 ^2- Proof The first equality is (9.1.5). Now, for С = (jb..., J,), we have <7с(Л . .. Pk) = (qh * • • • * qjl)(Pl ... Pk) (by (9.1.2) = cone, - (qh ® • • • ® qjt) ° ^(Pi Pk). by (1.5.7). Now, if P is a Lie polynomial, then by (1.5.6) 3,(Pi) = Pi ® 1 ® ® 1 + 1 ® Pi ® • • • ® 1 + • • • + 1 ® 1 ® ® Pf This shows that <5,(P,... P,) = . <5,(PJ = X Ps, ®' • -® Ps„
9.3 Homomorphisms 237 where the sum is over all decompositions {1,..., k} = SY и • • • и Sh with Ps = FLes ?t- Applying cone, ° (<?j, ® ® q71), we get the lemma. □ Proof of Theorem 9.32 Since the Ac, for |C| = n, generate linearly E„, the only if part of the theorem follows from Lemma 9.33. Conversely, let x be as in the theorem. We may suppose that A contains {1,..., n}, and we identify a permutation S„ with the corresponding word in A*. Let C be a composition of n, C = (c15..., ck), and factorize the word 12 ... n into Uj ... uk with |u,-| = c(. By (3.2.1), is a homogeneous Lie polynomial of degree cf. By hypothesis, we have (^C1)(M1) • • • ^(C(<)(Wfc))x E aeSk for some coefficient fxC in <□. Since the q, commute with substitution of letters, the same holds for the qc by Lemma 1.11; the latter span Г, hence each n(i) commutes with substitution of letters. Moreover, so does the mapping P и-> Px. Hence, if v1,...,vk are words of respective lengths c15..., cfc, we obtain Wl) . . . 7lM(vk))x = X . .7l(c,lk>)(Vxlk)). aeSk Now, observe that by (1.5.7) and Proposition 1.8 Ko*' • •* я(Ск))(12 ... n) = X (12 ... n, Vi ш- • -ш ^Х^)... 7iM(vk). kJ =g Applying x on the right, we obtain [(я(С))*-• •*я(Ск))(12 ... n)]x = X E (12---n, Ui Ш-•mvJ/JiCn(cJt,a(1))... n(f,lk))(ra(fc)) |i’,| -c, aeSi< = E fa.C E (12 ••• П, Vx(l) Ш • • • Ш VxW)n(c,a/Vx(l)) . . .7CM(vx(k)) «eSk |r3(1|| =c3(,) = E ••*Я^к,))(12...И). aeSi< Let Ic denote the element of corresponding to л(С1) * • • • * n{Cki through the linear isomorphism of Corollary 9.12. Then Ic = •* W12...4 and we deduce from (8.1.4) that Icx = Y^xfx.c^ca (product in KS„), where Cot = (ca(n,..., ca(fc)). Since the Ic span and since has a unit element, we deduce that x is in S„. □ Suppose that the alphabet A is infinite. Then, by Corollary 9.14, the «□-algebra Г (under convolution) is freely generated by the elements qn, n > 1. We may therefore define for each s > 1 a derivation of the convolution
= I 238 9 The Solomon descent algebra algebra Г by Qn-S' where qk = 0 if к < 0. Theorem 9.34 For each s > I, the derivation f of the convolution algebra Г is also a homomorphism of the algebra Г under composition, sending r„ + s into Г„. Moreover, for each partition /. ifs is a part of 2, 0 otherwise. Proof (a) We have to show that £s(qH °qK) = £S(</H)0 for any pseudo- compositions H, K. Since cs is a derivation, we have for H = (hY,..., hk), L = (ll,...,lp): UQh) = * • • • * = X 1 <i<k s <h, where Ht is obtained from H by replacing h, by h( — s. Similarly, SM = X i <j<p S <lj where Lj is obtained from L by replacing lj by Ij — s. Thus, by Theorem 9.2, = E E«««,». <9-3-3’ hi,l} >s where the second sum is extended to all matrices Afy whose row sum is and column sum is Lj. On the other hand, we have by Theorem 9.2 ^s(Qho(1l) = , where the sum is extended to all matrices M = (mf) whose row sum is H and column sum is L. Thus, £sG?h0#l) = X X (9.3.4) M m,j > s where the matrix M(j is obtained from M by replacing the entry mtj by mtJ — s. Note that has row sum Ц and column sum Lj, so that the sum (9.3.4) is contained in the sum (9.3.3). Conversely, if a matrix Мц how row sum Hi and column sum Lj, then by adding s to its (i,j)-entry, we obtain a matrix of row sum H and column sum L. Hence, the reverse inclusion also holds, and (9.3.3) is equal to (9.3.4).
9.3 Homomorphisms 239 (b) Since A is infinite, the algebra homomorphism £Q<T> -* Г (con- volution), with C(t„) = q„, is an isomorphism (Corollary 9.14). Define a derivation ds of Q><T> by ds(tn) = tn_s, with t0 = 1, tk = 0 if к < 0. Then, by Theorem 9.16, all we have to show is that ds(Ki) ifsisapart ofA, otherwise. (9.3.5) Extend ds to the variables x, x15 x2,..., used in (9.2.2) and (9.2.3), by ds(x) = ds(xi) = ds(x2) = = (). Then ds( E ) = E ds(ti)x‘ = X 11-*х1 = xs X Ьх1. \i > O / i>0 i>0 i>0 Hence, X tix' and its image under ds commute. Observe that if у and ds(y) commute then ds(y”) = nyn 4(y). Thus, by (9.2.2) X xnds(Mn) = dA X x"M„ = dA log! 1 + X *<х‘ n > 1 \л >1 / \ \ i > 1 \k > 1 К with S = X;> i h*1- Now, the latter sum is X —y— ds(Sk) = X -7- kS'-'dtf) = (1 + SyXCS) k>i к k> i к = (i + s)~4(i + s) (\ -1 / \ X ttx‘I xsl X hx* ] = xs. i>0 / \i>0 / This shows that X xU(JW,) = x-, n > 1 which implies W) = Now, let U = X«>i х(М(. Then ds(U) = X
240 9 The Solomon descent algebra so that U and its image under ds commute. Thus, Js(exp((/)) = J E 7/''*)= E \k > О к'. / к > 0 KI = X /~кик~Ч5(и) = exp(U)xs. fc>o kl With (9.2.3), we deduce E x^ds(Kx) = ds\Y = ds(exp((7)) = (£ x2K^xs = X x^sK^. >. \ Л / \Л/д Thus (9.3.5) holds. □ Further properties of the mapping cs are given in the following result. Theorem 9.35 The mapping cs is surjective. For s = 1, 2, it admits a right inverse, which sends T„ into Tn+s, and which is a homomorphism for the algebra Г under composition. In particular, each algebra Tn is embedded in Г„+1. We begin by a lemma. Denote by c/ca the derivation of Q<4> (concatena- tion algebra), with A = {a, b,...}, such that d/да sends a to 1 and the other letters to 0. Lemma 9.36 Let E denote the subspace of generated by the poly- nomials (a,P2,...,Pn), (9.3.6) where n > 1 and where P2,..., Pn are Lie polynomials in ^(A). Then the restriction of d/да to E is a linear isomorphism onto Proof Observe that the kernel of c/ca is a subalgebra of Q<Acontaining b, c,..., and closed under the operation P [а, Р]. Hence, each finely homogeneous Lie polynomial, which is not a scalar multiple of a, is in this kernel. Observe also that (1, P2,..., P„) = (P2,..., P„), and that in the definition of E, we may suppose that each P{ in (9.3.6) is finely homogeneous, and is equal to a if it is a scalar multiple of a. Under these conditions, let j > 0 denote the number of Pt in (9.3.6) equal to a. Then, by symmetry and multilinearity of the operator (,....,), and by the previous remarks, we
9.3 Homomorphisms obtain 241 3 — (a, P2,..., P„) = (j + l)(P2,...,f>,). da Thus, d/da is surjective, because Q<A> is generated by the polynomials (P2,...,£„), by (9.2.1). A counting argument, as in Lemma 8.22, shows that d/da | E is injective. □ Proof of Theorem 9.35 (a) The convolution algebra Г is isomorphic with <□<£>, via the isomorphism <□<£> Г. Let ds be the derivation of Q<£> sending t„ onto t„_s, with t0 = 1, and tk = 0 if к < 0. We thus have Let E be the subspace of Q<£> generated by the polynomials (ts, P2,..., P„), where the Pt are Lie polynomials in ^f(T). By Lemma 9.36 the restriction of d/dts to £ is a linear isomorphism £ <□<£>. With у the algebra automorphism of <□<£> defined by (9.2.9), let d's = 7-1 °ds°y. Then d's is a derivation of Q<T>, and we have d'^t^ = y-1 ° ds(Mn). In the proof of Theorem 9.34, we have seen that ds(Mn) = dsn. Thus d's(t„) = dns, and we conclude that d's = d/dts. Hence, d's\E is a linear isomorphism £ ->• <□<£>. This implies that ds\y(E) is a linear isomorphism y(£) -► Q<T>. We conclude that £s\£°y(E) is a linear isomorphism £ ° y(£) Г, and in particular £s is surjective. (b) Observe that a finely homogeneous Lie polynomial P in Q<T> of weight 1 (respectively 2) must be a scalar multiple of (respectively t2). Thus, for s = 1, 2, the subspace £ is equal to © seA, sejt by definition (9.2.6) of W2 Hence, we obtain by Theorem 9.19 C"T(£) = ('(£)= © явГЛ. se Л. se д Since the лл are orthogonal idempotents, the subspace C (£) is therefore a subalgebra of Г under composition, with neutral element We have seen that es|C'(£): C'(£)-r in a linear isomorphism. Since cs is an homomorphism for composition by Theorem 9.34, £S\C(E) is an isomorphism of algebras, and £s has a right inverse. □
242 9 The Solomon descent algebra 9.4 QUASISYMMETRIC FUNCTIONS AND ENUMERATION OF PERMUTATIONS Let X be a totally ordered infinite set, which will serve as an alphabet, and also as a set of commuting variables. A formal power series F in Z[[X]] is called a quasisymmetric function if for any x15..., x„, yY,..., yn in X, with Xi < • • • < x„, yr < • • < y„, and any positive integers kl,..., kn, the coeffi- cients of x*‘... x*" and y?' ... y*H in F are equal. We denote by Qsym the ring of quasisymmetric functions, and by Qsymn the Z-module of homo- geneous quasisymmetric functions of degree n. If C = (i15..., ik) is a composition of n, we define Mc by Мс= X X? •••4. (9.4.1) Xi < ’ • • < xk Clearly, the Mc form a basis of Qsymn, for |C| = n. Define Fc by Fc= Z MD, D>C where D > C denotes that the composition D is finer than C, i.e. S(D) 3 S(C), where S(C) = {i\, + i2,..., + • • • + ik-i} is the subset of {1,..., n — 1} associated to C. By inclusion-exclusion, we have Mc= £ (- 1)“di’w’Fd. (9.4.2) D>C where /(C) is the number of parts of C. This shows that the Fc are a basis of Qsym„, for |C| = n. Let С = (ц,..., ik) be a composition of n, and 5 = S(C). The definition of Fc shows that Fc = £ x, ... x„ (9.4.3) xieX where the summation is subject to the condition: for each i = 1,..., n — 1, x, < x1 +15 and xf < xi+ i if i 6 S. For example, Л1.2) = M1.2) + A/d.i.D = S xy2 + £ xyz= X XiX2X3, X<y X<y<Z X1<X2<*3 and the set associated to the composition (1, 2) is {!}. Let У be a second infinite totally ordered alphabet, disjoint from X. We may consider the set Z = {xy|x 6 X, у 6 Y} (9.4.4) as an alphabet, totally ordered by xy < x'y' if either x < x' or x = x' and у < у'. (9.4.5) Thus, if FeZ[[X]] is a quasisymmetric function, then it makes sense to
9.4 Quasisymmetric functions and enumeration of permutations 243 consider F(xy) e Z[[Z]]. There is a canonical algebra homomorphism u У]]. We may without ambiguity identify F(xy) with its image under this homomorphism. Indeed, the latter maps homogeneous components of degree n on homogeneous components of degree 2n; further- more, if F(x) is a homogeneous quasisymmetric function of degree n, then for fixed x0 in X, we have F(xy) = x^F(y) + terms of lower degree in x0. Recall that C(cr) denotes the descent composition of о (see Section 9.1). Theorem 9.37 Let о be a permutation in S„. Then FC(a)(xy) = X FCM(x)FC(P)(y). a = fla Recall that we have defined a basis (Ds) of the Solomon descent algebra En, indexed by subsets of {1,..., n — 1}; see Section 9.1. The theorem shows again that is a subalgebra of <Q>S„: indeed, since the series Fc(x)FD(y) are linearly independent, the number of pairs (a, ft) e S„ x S„ with C(a) = C', C(f) = C" and fa = о depends only on C = C(a), C', and C". We thus have the following result. Corollary 9.38 Let S, S', S" be subsets of (1,..., n — 1} and С, С, C" the corresponding compositions. Then the coefficient of Ds in the product Ds. Ds, (expanded in the Ds basis) is equal to the coefficient of Fc-(x)Fc--(y) in Fc(xy) (expanded in the Fc(x)FC"(y) basis). For the proof of Theorem 9.37 we shall need a couple of lemmas. Recall that the standard permutation a = st(w) of a word w = Xj ... x„ is defined by <r(0 < </) (*> < Xj) or (Xi = Xj and i <j). (9.4.6) Denote by ev(w) the evaluation of w, i.e. the monomial Xj ... x„ in Z[[X]]. Lemma 9.39 Let a be a permutation in Sn. Then FC(a-')= E W(w). St(W) = <7 Proof We have by (9.4.3), with C = C(cr-1) and S = S(C): Fc = where the sum is over all increasing words .. t„ of length n in X* such that for each i in S, < ti+ P By putting xf = ta(i), we deduce Fc = X w(xi • • • x„)’
244 9 The Solomon descent algebra where the sum is over all words such that < < xff-1(ll) and V/ceS, x„-i(k) < x„-i(k+1). (9.4.7) We show that (9.4.7) is equivalent to the condition that a = st(xr ... x„), which will prove the lemma. Since C is the descent composition of a"1, S is the descent set of a-1, and we have keSoo'^k) > o~fk + 1). (9.4.8) This means that к is at the right of к + 1 in the word cr(l)... cr(n). Suppose that (9.4.7) holds. We show that o(i) < a(j) implies the right-hand side of the equivalence (9.4.6): this will imply (9.4.6), because each side defines a total order on {1,..., n}, and thus о = st(w). So let cr(i) < o(j). By the first condition in (9.4.7), we have — Xa - 1 (<r(i)+ 1) < ' ' ‘ - i(<r( j) - 1} — %j- By the second condition in (9.4.7), we deduce that, if x, = x7, then S n {o-(i), o-(i) + 1,..., <t(j) - 1} = 0. Thus, by (9.4.8), a(i) is at the left of o(j) in the word cr(l)... cr(n), showing that i < j. Hence, the right-hand side of (9.4.6) holds. Conversely, suppose that a = st(xY ... x„), i.e. that (9.4.6) holds. We show that (9.4.7) holds. We have i < i + 1, hence aafi)<aa *(i + 1). By (9.4.6), this is equivalent to (x.x-ni) < xff-1(i + 1)) or (xff-1(i) = xff-1(i + 1) and a-1(i) < o~1(i + 1)). Because of (9.4.8), we deduce that (9.4.7) holds. □ Lemma 9.40 Let и = xt ... x„ e X*, v = y1...y„eY* and w(u,v) = (Х1У1)... (x„y„) e Z*. Then st(w) = st(ust(v)~l)st(v), where ust(v)~r denotes the right action of the permutation st(r)”1 on the word u. Proof We have st(w)(i) < st(w)(j) (х.У.- < Xjyj) or (x.y,- = x}y} and i <j) (by (9.4.6) (Xj < x^ or (x; = Xj and y, < yj) or (xt- = x7 and yt = y7 and i <j) by (9.4.5) (X,- < Xj) or (Xj = Xj and (y, < y7 or (y, = y7 and i < 'Ш (xf < Xj) or (x(. = Xj and st(v)(i) < st(y)(J)) by (9.4.6)
9.4 Quasisymmetric functions and enumeration of permutations 245 Denote ust(v)-1 =tr...tn, hence xt = tst(v)(i). Then the previous condition is equivalent to (^st(v)ti) or Qst(v)(i) and st(u)(i) < st(u)(j)) О st(ti ... tn)(st(y)(i)) < st(ti ... t„)(st(v)(j)) by (9.4.6) оst(ust(v)~ 1)st(y)(i) < st(ust(y)~ 1)st(v)(j). This shows that st(yv) = st(ust(v)~ r)st(v). □ Proof of Theorem 9.37 By Lemma 9.39, Fc(a)(xy) = X ev(wF weZ* st(w) = a ~ 1 where Z is defined in (9.4.4). For u, v, w as Lemma 9.40, write w = w(w, v), and let ev(w(u, v)) = ev(u)ev(v). Then FC(a)(xy) = X ev(yv(u, v)) ueX* vet* st(w(u, i»)) = a ~ 1 = X ev(u)ev(y) st(ust(v) " 1 )st(v) = a ~ 1 = X X ev(ust(v)~ ^eviv) a = fix st(ust(v) ~ 1) = a 1 st(v) = fl- > = X X ev(v>) X evfuf'1) a = fix st(v) =fl~ 1 stfufl ~ 1) = x ~ 1 = x ( x ^(y)Y x ev(u')) a = f}x = / \st(u') = x ~/ — X FC(p)(y)FC(ai)(x), a = fix by Lemma 9.39 □ Let П be a set of permutations, i.e. П c U">o We call quasisymmetric generating function of П the quasisymmetric function I Fc,.,- леП We compute below this generating function for special sets П, and it will turn out that it is actually symmetric and equal to the characteristic of a representation of the symmetric group. Recall that we have defined subspaces If of K<4> in Section 8.5. The evaluation of a multiset of primitive necklaces has been defined in Section 7.5. For a composition C = (i15..., ik), denote by hc the product П1 < j<k hi} of the corresponding complete symmetric functions.
246 9 The Solomon descent algebra Theorem 9.41 (i) Let л be a partition of n. Then the quasisymmetric generating function Pk of the set of permutations of cycle type 1 is equal to the generating function of the set of multisets of primitive necklaces of cycle type z, and also to the characteristic of the representation of S„ on the multilinear part of If. (ii) Let C be a composition of n. Then the quasisymmetric generating function Sc of the set of permutations whose inverse has descent composition C is equal to Sc = E (-1)*'-^,, (9.4.9) D<C and is the characteristic of some representation of S„. Proof (i) The bijection of Theorem 7.20 maps bijectively words w whose standard permutation has cycle type A onto multisets of primitive necklaces of cycle type z. Since this bijection preserves the evaluation and since the inverse of a permutation has the same cycle type as it, we obtain the first assertion as a consequence of Lemma 9.39. The polynomials Qw introduced in the proof of Lemma 8.22 have the same partial degrees as w. Hence, by Theorem 8.1, the characteristic of the representation of Sn on the multilinear part of If is equal to the sum of the evaluations of the words w whose decomposition into Hall words w = iq ... hk satisfies: z((/q|, • • •, |Л*0) = z. Since Hall words are in evaluation- preserving bijection with primitive necklaces (Corollary 7.5), this sum is equal to P,. (ii) Let С = {q,..., ik} and S = {q, f + i2,... + • • • + ik_ J. We claim that C(st(w)) < C is equivalent to Vi£S, x, < xi+1, where w = xt ... x„. Indeed, C(st(w)) < C is equivalent to D(st(w)) £ s, i.e. to Vi ф S, st(w)(i) < st(w)(i +1), and this condition is, by (9.4.6), equivalent to Vi ф S, Xi < xi+1 or Xi = xi+ j and i < i + 1. Hence, the claim follows. It implies that E ev(w) = hil...hik = hc. C(st(w)) < C This implies that hc= E E E ev^ = E E Fc(a-q= E sd- D <C C(a) = D st(w) = a D<C C(a) = D D<C by Lemma 9.39 Hence, we obtain (9.4.9) by inclusion-exclusion.
9.4 Quasisymmetric functions and enumeration of permutations 247 The same argument shows that Sc is the sum of the evaluations of the words w = *! ... xn satisfying the condition: Vi ф S, x( < xi+1 and Vi e S, x( > xi+1. (9.4.10) This shows that Sc is the skew Schur function corresponding to the skew-hook whose lengths of rows are determined by C (from top to bottom, in the French way of depicting tableaux); in particular, Sc is a sum of Schur functions, and hence is the characteristic of a representation of S„; see Macdonald (1979). □ Recall that the space Л of symmetric functions has a scalar product <, > for which the bases (hf) and (шл) of complete and monomial symmetric functions are orthogonal to each another. Observe that each symmetric function is quasisymmetric. The next result gives its expansion in the Fc basis. Theorem 9.42 Let g be a symmetric function. Then c Proof We have 4 = E = E <<Л M E MC л л л(С) = л by (9.4.1) X <9.Лл>Мс = Х<»,Лс>М< л(С) = Л С because z(C) = z implies /ic = /iA. Hence by (9.4.2) 0 = E <0’ hc> X (- W)FD C D>C = Y.Ffe. L (-l)'<D,-"ohc) = LF„<</,Sd> D \ C < D ID by (9.4.9). □ Corollary 9.43 If the quasisymmetric generating function g of a set П of permutations is symmetric, then for any composition C, the number of permutations in П whose descent composition is C is equal to the scalar product ((J, Sc>. Proof Let the number in question be ac. Then g = xcFc. Since the Fc are linearly independent, we obtain from Theorem 9.42 that ac = <3, Sc>. U
248 9 The Solomon descent algebra Corollary 9.44 Let C be a composition and A a partition. The number of permutations of cycle type A whose descent composition is C is equal to the scalar product Sc). Corollary 9.45 Let C, D be two compositions. The number of permutations which have descent composition C and whose inverse has descent composition D is equal to the scalar product <SC, SD). Both corollaries are an immediate consequence of Theorem 9.41 and of Corollary 9.43. 9.5 APPENDIX 9.5.1 Graded bialgebras The projections q„ and the convolution subalgebra Г generated by them may be defined in each graded bialgebra. Theorem 9.2 and Corollary 9.11 are valid if the component of degree 0 of the bialgebra is К and if it is cocommutative (the latter property is used in Lemma 9.10). When the bialgebra is commutative instead, Theorem 9.2 and Corollary 9.11 are still valid if one replaces ‘row’ by ‘column’ in the definition of the pseudocomposition associated to a matrix. The proofs of these generaliza- tions follow closely the proof given here. The previous remarks where suggested by an idea of A. Joyal. 9.5.2 Mackey formula Let C = (i15..., ik) be a composition of n, f о • • - о Ik the partition of {1,..., n} into consecutive intervals such that the length of Is is is, and Sc the subgroup of S„ consisting of the permutations which keep this partition invariant. Such a subgroup is called a Young subgroup. Denote by Xc the set of permutations о such that cr| 7S is increasing, for any s. In other words, Xc is the set of permutations whose descent set is contained in S(C). Then Xc is a set of representatives of the right cosets aSc. Let D = (jji) be another composition of n, f и • - о J, the cor- responding partition, SD the corresponding Young subgroup and XD the corresponding set of representatives of the right cosets. Then XCD = Xc1 n XD is a set of representatives of the double cosets ScoSD (see Bourbaki 1981a, Chapter 4, Exercise 1.3). For each о in XCD, a~lSco n SD is a Young subgroup SE, and the multiplication table of the basis elements Ac of the Solomon descent algebra Sn is given by (9.5.1)
9.5 Appendix 249 where the summation is over all these Young subgroups SE; this is established in the case of a finite Coxeter group by Solomon (1976); recall that the symmetric group is such a group. The link with the multiplication table given here (Theorem 9.2 together with Corollary 9.12) is given by the following result (James and Kerber 1981, Section 1.3): the mapping (|JS n <l(J()|)i <s<k. is a bijection from XCD onto the set of к by I matrices over N of row sum C and column sum D (actually, the matrix above depends only on the double coset of cr). The precise link with (9.5.1) is that, if E denotes the composition obtained by reading the matrix above column by column, and omitting zeros, then SE = (TlSca n SD. \ Recall that the algebra Sn and Tn are anti-isomorphic, via the linear mapping Ac -> qc (Corollary 9.12). Using this bijection and the homomorphism <pn of Theorem 9.28, (9.5.1) implies by Corollary 9.29 that the inner product of the symmetric functions h, and is given by Ал л *„ = £*,,, (9.5.2) where the sum is over all matrices over N whose row sum is z and column sum is p, with v the partition determined by the nonzero entries of this matrix. Recall that h; is the characteristic of the representation of Sn(|z| = n) induced from the trivial representation of the Young subgroup Sx (Macdonald 1979, Section 1.7), and that the inner tensor product л of the symmetric functions corresponds to the inner tensor product of representations. Trans- lated in this way, (9.5.2) may be established directly (James and Kerber 1981, Section 2.9). On the other hand, in the language of double cosets, (9.5.2) is a special case of the Mackey formula (Curtis and Reiner 1962, 44.3). 9.5.3 Idempotents of the canonical projections Denote by En k(l < к < n) the following element of Sn: E.,t = L E,. |л|=и, iw = k Then the En k are for к = 1,..., n orthogonal idempotents of S„, whose sum is the identity (see Theorem 9.27). They correspond to the canonical projections of the free associative algebra К (A), corresponding to the direct sum K<4>= © Uk k>0
250 9 The Solomon descent algebra (see Section 3.2), in the following sense: if P is a homogeneous polynomial of degree n, then its image under the canonical projection К (A) -> Uk is PEn k (right action). This follows from Theorem 9.27, because C4= ® и,. lW = k If we replace x,- by txl in the definition (9.2.3) of Kx, we obtain where s(m, k) denotes the Stirling number of the first kind (see Comtet 1970, p. 47). Thus, we obtain, for any k and n, 1(Л) = к т>к m. Щ = n ii + • • • + ik = л Applying the mapping r of Theorem 9.27, we obtain an expression for En к in the group algebra of Sn: E,.b= E £ Ac. (9.5.3) m>k ml l(C) = k |C| = n The idempotents En k were introduced and computed by Reutenauer (1986b), in connection with the canonical projections, the Baker-Campbell- Hausdorff formula and certain representations of the symmetric group. The idempotents En t appear implicitly in Solomon (1968b), and were also considered by Helmstetter (1989). Formula (9.5.3) is given by Garsia (1990), who also gives the following generating series: let x be a variable; denote (x) f" = x(x + 1)... (x + n - 1), and let d(o) be the number of descents of ere S„. Then Ё xkEn,k=~ E (x-d(a))]no. k= 1 П! aeSn
9.5 Appendix 251 This may be deduced from (9.5.3), by using the well-known formula (*)?"= X s(n,k)xk, (9.5.4) к = 1 (see Comtet 1970, p. 48). For details, see Garsia (1990). A remarkable fact is that these idempotents appear, up to the auto- morphism of the algebra <Q>S„ which multiplies each permutation by its sign, in the quite different context of Hochschild homology. We give, without proofs some of the results (the reader has to apply everywhere the previous automorphism in order to recover the original results). Gerstenhaber and Schack (1987) define an element s„ of <Q>S„ by n -1 sn = X i= 1 They show that the minimal polynomial of s„ in QS„ has the n distinct roots 2‘ — 2, 1 < i < n. From this they deduce that the n elements e.(k) = ( П 2‘ - 2') ‘ П (s. - 2'' + 2), 1 < к < n, \i^k / i ^k are orthogonal idempotents whose sum is 1. They show that e„(2) + • • • 4- en(ri) is the Barr idempotent (Barr 1968). Loday (1989) shows that the idempotents e„(k) satisfy the following equation: (-1)‘- = ke„m + кге„(2) + • + k’eM, where and /^(-l)*’1 X Ds. |S| =*- 1 He shows that the Я* satisfy Garsia and Reutenauer (1989) show that en(k) = En k. See also Patras (1990, 1991) for the study of these idempotents. 9.5.4 Multiplication rule Let s be an element of the free magma M(T), and denote also by s its canonical image in the free Lie algebra Denote by A(s) the partition Aj ... xk, if tA1,..., tAk are the leaves of s, with multiplicities. As an example,
252 9 The Solomon descent algebra z([[? p hl, G33) = 3211. Define the weight |s| of s to be |A(s)|. Now, let s15..., st be several elements in M(T) and let S = (s15..., s^. Observe that if g(S) = A((|sj|,.. .|s,|)) = g and A(S) = A(Sj ... sf) = Л, then S 6 M. We give a multiplication rule for the elements £'($) in Г. Let R = (r15..., rk) another such element. Define R' = (r'15..., r'k), where each r'- is in M(T), with T = {/1, tp ..., t2, t2, -, tn, t'n,...}, in such a way that the letters in R' are distinct, that T' is the set of letters appearing in R' and that R is obtained from R' by forgetting the primes. Then where the summation is extended to all weight-preserving magma homo- morphisms <p: M(T) -> M(T) such that for i = 1,..., I, there is exactly one t in T' with <p(t) = s,. As an example, S = (Sp S2, $3, s4) = ([tp t2], t3, [t2, [t2, 11 ]], [t2, L^2’ f 133), Я = (G’ E^ ^3, t5). Then R' = (t3, [t'3, t53, t'5). There are four possible homomorphisms <p, given by (pt3 = Sj or s2, (pt'3 = s2 or s15 <pts = s3 or s4, cpt'5 = s4 or s3. Thus £'(R)£'(S) is the image under £' of 2([tp ^гЗ, Е*з> E^2’ G333, E^2’ E^2’ G33) + 2(t3, EEg, ^3, E^2’ E^2, G333, E^2’ E^2’ ^133)- The proof of this multiplication rule may be done by using Lemma 9.25 and by adapting the proof of Lemma 9.26. Observe that the elements S are not linearly independent, but one can use a Hall set in order to obtain a basis (cf. Lemma 9.18); in this case, one has to apply the algorithm of Section 4.2 after the multiplication rule. 9.5.5 Another calculation of the multiplities By Foulkes (1980), Zelevinsky (1981), or Gessel (1984), the scalar product <sA, Sc~) is equal to the number of standard tableaux T of shape A, whose
9.5 Appendix 253 descent composition is C (the descent composition of T is the composition associated to the descent set of T; see Section 8.3). Since the Schur functions form an orthonormal basis, this remark implies that Corollary 8.10 is equivalent to — X Sc, maj(C) = i mod n (9-5-5) where maj(C) = i (ci + ''" + ci), for C = (c15..., ck), and where i is a fixed integer relatively prime to n. A direct proof of this formula is due to Gessel and goes as follows. Let Gn(q) = X Scqm^c\ |C|=n Hence, by (9.4.10), we have Gn = X ev(x^ ... x„)qmaj<X1 Xn\ where the sum is over all words ... xn on X and where majC^ ... x„) is the sum of the elements of the descent set of Xj ... x„. By a result due to MacMahon, the number of words in {1,..., r} with evaluation Г'2П2... гПг and major index к is the coefficient of qk in the q-multinomial coefficient ni + n2 + ''' + nr n15 n2,..., nr (9.5.6) see Stanley (1986, Proposition 1.3.17). Thus, where we put X = {x15 x2,..Let to be some primitive Jth root of unity, for some d dividing n. Then, a result of Gloria Olive (1965) implies that the q-multinomial coefficient (9.5.6), evaluated at q = to is zero, unless each n, is divisible by d, and in this case it is equal to the ordinary multinomial coefficient (n/d \ njd, n2/d,..., nr/dj Thus, G» = £ f n/d mi + m2 + ' ' ' =n/d ^2’ • / = (xd + xd+---)"/d = pndld. (9.5.7)
254 9 The Solomon descent algebra Now, if f(q) = fkqk is a polynomial in q, then Лп= z A = ‘(/(i) + m'i) + m_2)+ " + ci"‘iW’”*1)), к = i mod n И where C is a primitive nth root of unity. Thus, by (9.5.7), I n- 1 /7 _ У rkingcd(k,n) '-6i[i]-Д'» Pn/gcd(k,n) П k = O gcd(r,d)= 1 If a) is a primitive Jth root of unity, then the Ramanujan sum Z 0 < r < d gcd(r, d)= 1 is equal to n(d). Hence, G„[.i = 1 E PdW) = П d I n which proves (9.5.5). A by-product of this proof, or of Theorem 8.9, is that for a symmetric function f of degree n, the polynomial g(q) = Жq2, )(i -4)0 -q2) 0 -qn) satisfies the following property: the sum of the coefficients in g of the powers qk with к = i mod n depends only on gcd(i. n). For this and a similar proof of (9.5.5), see Desarmenien (1989). 9.6 NOTES Theorem 9.2 is reminiscent of eqn (4.5) of Garsia and Remmel (1985): they give a formula for the product 0(Ac)0(AD), in terms of matrices with given row and column sums, showing directly that is a subalgebra of QSn. The fact that £„ is a subalgebra of <Q>S„ (Corollary 9.12) was first proved by Solomon (1976), in the more general case of finite Coxeter groups; his result was generalized by Moszkowski (1989). In the case of the symmetric group, besides the proof of Garsia and Remmel, there is another proof due to Gessel (1984, Corollary 12; see also the remark following Theorem 9.37 in this chapter). All the results of Section 9.2, together with Theorem 9.28, Corollary 9.29, Lemma 9.30, Theorem 9.32, and Lemma 9.33 are from Garsia and Reutenauer
9.6 Notes 255 (1989). Corollary 9.31 is already in Solomon (1976). Theorems 9.34 and 9.35 are from Bergeron et al. (1992b). Theorem 9.37 and Corollary 9.38 are due to Gessel (1984), who also proved a result equivalent to Lemma 9.40. The idea behind Lemma 9.39 goes back to MacMahon and was extensively developed by Stanley (1972). Theorem 9.4 l(i) appears essentially in Bergeron et al. (1988). The symmetric functions Sc where considered by MacMahon (1960), and the corresponding representations by Foulkes (1980) and Solomon (1968a). Theorem 9.42 and Corollary 9.45 are due to Gessel (1984), and Corollaries 9.43 and 9.44 are from Gessel and Reutenauer (1992). Many results concerning the Solomon algebra have been extended to other finite Coxeter groups. The case of the hyperoctahedral group Bn has been intensively studied; see N. Bergeron (1991,1992) and Bergeron and Bergeron (1992a, 1992b). A global construction of the idempotents of the descent algebra is made for all finite Coxeter groups by Bergeron et al. (1992a).
References Abe, E. (1980). Hopf algebras. Cambridge University Press. Amitsur, S. A. (1980). On the characteristic polynomial of a sum of matrices. Linear and Multilinear Algebra, 8, 177 -82. Bahturin, Yu. A. (1987). Identical relations in Lie algebras. VNU Science Press, Utrecht. Baker, H. F. (1905). Alternants and continuous groups. Proceedings of the London Mathematical Society, (2), 3, 24-47. Barcelo, H. (1990). On the action of the symmetric group on the free Lie algebra and the partition lattice. Journal of Combinatorial Theory, A55, 93-129. Barcelo, H. and Bergeron, N. (1990). The Orlik-Solomon algebra on the partition lattice and the free Lie algebra. Journal of Combinatorial Theory, ASS, 80-92. Barcelo, H. and Sundaram, S. (1992). On some submodules of the action of the symmetric group on the free Lie algebra. In press. Barr, M. (1968). Harrison homology, Hochschild homology ano triples. Journal of Algebra, 8, 314-23. Bass, H., Connell, E. H., and Wright, D. (1982). The Jacobian conjecture: reduction of degree, and formal expansion of the inverse. Bulletin of the American Mathe- matical Society, 7, 287-330. Baumslag, B. (1972). Free Lie algebras and free groups. Journal of the London Mathematical Society, II, Series 4, 523-32. Baumslag, B. and Baumslag, G. (1971). On ascending chain conditions. Proceedings of the London Mathematical Society, III, Series 22, 681-704. Bergeron, F. and Bergeron, N. (1992a). A decomposition of the descent algebra of the hyperoctahedral group I. Journal of Algebra, 148, 86-97. Bergeron, F. and Bergeron, N. (1992b). Orthogonal idempotents in the descent algebra of Bn and applications. Journal of Pure and Applied Algebra, 79, 109-29. Bergeron, F., Bergeron, N., and Garsia, A. M. (1988). Idempotents for the free Lie algebra and ^-enumeration. In Invariant theory and its applications (ed. D. Stanton), pp. 166-90. IMA Volumes in Mathematics and its Applications, Vol. 19. Springer, Berlin. Bergeron, F., Bergeron, N., Howlett, R. B., and Taylor, D. E. (1992a). A decomposi- tion of the descent algebra of a finite Coxeter group. Journal of Algebraic Combinatorics, 1, 23-44. Bergeron, F., Garsia, A. M. and Reutenauer, C. (19926). Homomorphisms between Solomon’s descent algebras. Journal of Algebra, 150, 503-19. Bergeron, N. (1991). An hyperoctahedral analogue of the free Lie algebra. Journal of Combinatorial Theory, 58, 256-78. Bergeron, N. (1992). A decomposition of the descent algebra of the hyperoctahedral group II. Journal of Algebra, 148, 98-122.
References 257 Berstel, J. and Perrin, D. (1985). Theory of codes. Academic Press, Orlando, FL. Berstel, J. and Pocchiola, M. (1992). A verage cost of Duval's algorithm for generating Lyndon words. Laboratoire d’Informatique de 1’Ecole Normale Superieure. Pre- print no. 92-8. Paris. Berstel, J. and Reutenauer, C. (1988). Rational series and their languages. EATCS Monographs on Theoretical Computer Science. Springer, Berlin. Berstel, J. and Reutenauer, C. (1990). Zeta functions of formal languages. Transactions of the American Mathematical Society, 321, 533-46. Bialinicki-Birula, I„ Mielnick, B., and Piebanski, J. (1969). Explicit solution of the continuous Baker-Campbell-Hausdorff problem. Annals of Phvsics, 51, 187-200. Bjorner, A. (1989). The Mobius function of the subword order. In Invariant theory and its applications (ed. D. Stanton), pp. 118-24. IMA Volumes in Mathematics and its Applications, Vol. 19. Springer, Berlin. Bjorner, A. and Reutenauer, C. (1992). Rationality of the Mobius function of subword order. Theoretical Computer Science, 98, 53-63. Blessenohl, D. and Laue, H. (1988). Generalized Jacobi identities. Note di Matematica, 8, 111-21. Blessenohl, D. and Laue, H. (1989). On Witt’s dimension formula for free Lie algebras and a theorem of Klyachko. Bulletin of the Australian Mathematical Society, 40, 49-57. Blessenohl, D. and Laue, H. (1990a). Symmetric groups and free Lie algebras. Proceedings of the 2nd International Group Theory Conference. Bressanone/Brixen. 1989. Supplement© ai Rendiconti del Circolo Matematico di Palermo. Serie II, 23, 201-8. Blessenohl, D. and Laue, H. (1990b). A basis construction for free Lie algebras. Expositiones Mathematicae. In press. Blessenohl, D. and Laue, H. (1991). On the coefficients in the associative expansion of a Lie word. European Journal of Combinatorics, 12, 205-10. Bourbaki, N. (1971). Groupes et algebres de Lie, Chapter I. Diffusion CCLS, Paris. Bourbaki, N. (1972). Groupes et algebres de Lie, Chapters II, HL Diffusion CCLS, Paris. Bourbaki, N. (1981a). Groupes et algebres de Lie, Chapters IV-VI. Masson, Paris. Bourbaki, N. (1981b). Algebre, Chapters IV-VII. Masson, Paris. Brandt, A. (1944). The free Lie ring and Lie representations of the full linear group. Transactions of the American Mathematical Society, 56, 528-36. Bryant, R. M. (1991). On the fixed points of a finite group acting on a free Lie algebra. Journal of the London Mathematical Society (2)43, 215-24. Burrow, M. D. (1958). Invariants of free Lie rings. Communications in Pure and Applied Mathematics, 11, 419-31. Calderbank, A. R., Hanlon, P., and Sundaram, S. (1992). Representations of the symmetric group in deformations of the free Lie algebra. In press. Campbell, J. E. (1897). On a law of combination of operators bearing on the theory of continuous transformation groups. Proceedings of the London Mathematical Society, (1), 28, 381-90. Campbell, J. E. (1898). On a law of combination of operators (second paper). Proceedings of the London Mathematical Society, (1), 29, 14-32. Cartier, P. (1954/55). Le theoreme de Poincare-Birkhoff-Witt. Seminaire Sophus Lie. Ecole Normale Superieure, Paris.
258 References Cartier, P. (1956). Demonstration algebrique de la formule de Hausdorff. Bulletin de la Societe Mathematique de France, 84, 241-9. Cartier, P. and Foata, D. (1969). Problemes combinatoires de commutation et de rearrangements. Lecture Notes in Mathematics, Vol. 85. Springer, Berlin. Chen, К. T. (1957). Integration of paths, geometric invariants and a generalized Baker-Hausdorff formula. Annals of Mathematics, 65, 163-78. Chen, К. T. (1968). Algebraic paths. Journal of Algebra, 10, 8-36. Chen, К. T., Fox, R. H., and Lyndon, R. C. (1958). Free differential calculus, IV: the quotient groups of the lower central series. Annals of Mathematics, 68, 81-95. Cohn, P. M. (1951). Integral modules, Lie rings and free groups. Ph.D. Thesis, University of Cambridge. Cohn, P. M. (1954). Sur le critere de Friedrichs pour les commutateurs dans une algebre associative libre. Comptes Rendus de I 'Academe des Sciences de Paris, 239, 743-5. Cohn, P. M. (1961). On a generalization of the Euclidean algorithm. Proceedings of the Cambridge Philosophical Society, 57, 18-30. Cohn, P. M. (1963). A remark on the Birkhoff-Witt theorem. Journal of the London Mathematical Society, 38, 197-203. Cohn, P. M. (1964). Subalgebras of free associative algebras. Proceedings of the London Mathematical Society, (3), 14, 618-32. Cohn, P. M. (1985). Free Rings and their Relations, (2nd edn.). Academic Press, London. Comtet, L. (1970). Analyse combinatoire, Vol. 2. Presses Universitaires de France, Paris. Curtis, C. W. and Reiner, I. (1962). Representation theory of finite groups and associative algebras. John Wiley, New York. Crouch, P. E. and Lammabhi-Lagarrigue, F. (1989). Algebraic and multiple integral identities. Acta Applicandae Mathematicae, 15, 235-74. Czyz, J. (1992). Paradoxes of measures and dimensions originating in Felix Hausdorff''s ideas. World Scientific, Singapore. In press. Davis, R. L. (1958). A special formula for the Lie character, Canadian Journal of Mathematics, 10, 33-8. Desarmenien, J. (1989). Etude modulo n des statistiques mahoniennes, 22eme Seminaire Lotharingien de Combinatoire. Publication de 1’Institut de Recherche Mathematique Avancee, Strasbourg, 414/S-22, pp. 27-35. Dixmier, J. (1974). Algebres enveloppantes. Hermann, Paris. Drensky, V. (1992). Fixed algebras of residually nilpotent Lie algebras. In press. Dress, A. W. M. and Siebeneicher, C. (1988). Eiri Lemma fiber Perlenketten. 20eme Seminaire Lotharingien de Combinatoire. Publication de 1’Institut de Recherche en Mathematique Avancee, 372/S-20, Strasbourg, pp. 47-55. Duboc, C. (1986). On some equations in free partially commutative monoids. Theoretical Computer Science, 46, 159-74. Duchamp, G. (1989). On the free partially commutative Lie algebra. Preprint nr. 89-74. Laboratoire d’lnformatique Theorique et de Programmation, Paris. Duchamp, G. (1991). Orthogonal projection onto the free Lie algebra. Theoretical Computer Science, 79, 227-39. Duchamp, G. and Krob, D. (1991a). Factorisations dans le monoide partiellement commutatif libre. Comptes Rendus de Г Academic des Sciences de Paris, 312, Serie I, 189-92. Duchamp, G. and Krob, D. (1991/?). Lazard’s factorizations of free partially
References 259 commutative monoids. Proceedings of the 18th International Congress on Automata, Languages and Programming, (ed. J. Leach-Albert, B. Monien, M. Rodriguez). Lecture Notes in Computer Science 510, 242-53. Springer, Berlin. Duchamp, G. and Krob, D. (1992a). Free partially commutative structures. Journal of Algebra. Duchamp, G. and Krob, D. (1992b). The lower central series of the free partially commutative group. Semigroup Forum. In press. Duchamp, G. and Krob, D. (1992c). The free partially commutative Lie algebra: bases and ranks. Advances in Mathematics. Duchamp, G. and Krob, D. (1992d). On the partially commutative shuffle product. Theoretical Computer Science 96, 405-410. Duchamp, G. and Thibon, J.-Y. (1989). Le support de 1’algebre de Lie libre. Discrete Mathematics, 76, 123-9. Duval, J.-P. (1978). Algorithme de factorisation d’un mot en mots de Lyndon. Actes du ler colloque AFCET-SMT de Mathematiques Appliquees, Vol. 2, pp. 15-26. Duval, J.-P. (1983). Factorizing words over an ordered alphabet. Journal of Algo- rithms, 4, 363-81. Duval, J.-P. (1988). Generation d’une section des classes de conjugaison et arbre des mots de Lyndon de longueur bornee. Theoretical Computer Science, 60, 255-83. Dynkin, E. B. (1947). Calculation of the coefficients in the Campbell-Hausdorff formula. Doklady Akademii Nauk SSSR (N.S.), 57, 323-6. (In Russian.) Dynkin, E. B. (1949). On the representation by means of commutators of the series logfe^e*) for noncommutative л and y. Matematiceskii Sbornik N.S. 25 (67), 155-62. (In Russian.) Dynkin, E. B. (1950). Normed Lie algebras and analytic groups. Uspekhi Matemat iche- skihk Nauk (N.S.) 5, 1 (35), 135-86. (In Russian.) English translation in; Transla- tion series 1, Vol. 9, Lie groups. American Mathematical Society (1962), 470-534. Eastman, W. L. (1965). On the construction of comma-free codes. IEEE Transactions on Information Theory, 11, 263-7. Eilenberg, S. (1974). Automata, languages and machines, Vol. A. Academic Press, New York. Eilenberg, S. (1976). Automata, languages and machines, Vol. B. Academic Press, New York. Eilenberg, S. and MacLane, S. (1953). On the groups Н(П, и), I. Annals of Mathematics, 58, 55-106. Fer, F. (1958). Resolution de I’equation matricielle dC/dt = pU par produit infini d’exponentielles matricielles. Academic Royale de Belgique, Bulletin de la Classe des Sciences, (5) 44, 818-29. Finkelstein, D. (1955). On relations between commutators. Communications in Pure and Applied Mathematics, 8, 245-50. Fliess, M. (1981). Fonctionnelles causales non lineaires et indeterminees non com- mutatives. Bulletin de la Societe Mathematique de France, 109, 3-40. Fliess, M. (1983). Realisation locale des systemes non lineaires, algebres de Lie filtrees transitives et series generatrices non commutatives. Inventiones Mathematicae, 71, 521-37. Fliess, M. (1989). Automatique et corps differentiels. Forum Mathematicum, 1. 227-38. Fliess, M. and Reutenauer, C. (1983). Picard-Vessiot theory of bilinear systems. Proceedings of the IEEE Conference on Decision and Control. Institute of Electrical and Electronics Engineers, New York, pp. 1153 7.
260 References Foata, D. and Schiitzenberger, M.-P. (1970). Theorie geometrique des polyndmes euleriens. Lecture Notes in Mathematics, Vol. 138. Springer, Berlin. Foulkes, H. O. (1959). The analysis of the characters of the Lie representations of the general linear group. Proceedings of the American Mathematical Society, 10, 497-501. Foulkes, H. O. (1980). Eulerian numbers, Newcomb’s problem and representations of the symmetric group. Discrete Mathematics, 30, 3-49. Fox, R. H. (1953). Free differential calculus. Annals of Mathematics, 57, 547-60. Friedrichs, К. O. (1953). Mathematical aspects of the quantum theory of fields V. Fields modified by linear homogeneous forces. Communications in Pure and Applied Mathematics, 6, 1-72. Garsia, A. M. (1990). Combinatorics of the free Lie algebra and the symmetric group. In Analysis, et cetera ..., Jurgen Moser Festschrift. Academic Press, New York, pp. 309-82. Garsia, A. M. and Gessel, I. (1970). Permutation statistics and partitions. Advances in Mathematics, 31, 288-305. Garsia, A. M. and Remmel, J. (1985). Shuffles of permutations and the Kronecker product. Graphs and Combinatorics, 1, 217-63. Garsia, A. M. and Reutenauer, C. (1989). A decomposition of Solomon’s descent algebra. Advances in Mathematics, 77, 189-262. Garsia, A. M. and Stanton, D. (1984). Group actions on Stanley-Reisner rings and the construction of invariants. Advances in Mathematics, 51, 107-201. Gerstenhaber, M. and Schack, S. D. (1987). A Hodge-type decomposition for commutative algebra cohomology. Journal of Pure and Applied Algebra, 48, 229-47. Gessel, I. (1984). Multipartite P-partitions and inner product of skew Schur functions. Contemporary Mathematics, 34, 289-301. Gessel, I. and Reutenauer, C. (1992). Counting permutations with given cycle structure and descent set. Journal of Combinatorial Theory, Series A. In press. Goldberg, K. (1956). The formal power series for log(exey). Duke Mathematical Journal, 23, 13-21. Golomb, S. W. (1967). Irreducible polynomials, synchronization codes, primitive necklaces and the cyclotomic algebra. University of North Carolina. Monograph Series in Probability and Statistics 4, pp. 358-70. Good, I. J. (1971). The factorization of a sum of matrices and the multivariate cumulants of a set of quadratic expressions. Journal of Combinatorial Theory, 11, 27-37. Gorchakov, Y. M. (1969). Commutator subgroups. Sibirskii Matematicheskii Zhurnal (translation), 10, 5, 1023-33. Grobner, W. (1967). Die Lie Reihen und Ihre Anwendungen, (2nd edn). VEB Deutscher Verlag der Wissenschaften, Berlin. Hain, R. M. (1986). On the indecomposable elements of the bar construction. Proceedings of the American Mathematical Society, 98, 312-6. Hall, M., Jr. (1950). A basis for free Lie rings and higher commutators in free groups. Proceedings of the American Mathematical Society, 1, 575-81. Hall, M., J. (1959). The theory of groups. Macmillan, New York. Hall, P. (1933). A contribution to the theory of groups of prime-power order. Proceedings of the London Mathematical Society, (2), 36, 29-95. Hall, P. (1957). Nilpotent groups. Queen Mary College Mathematics Notes. The Edmonton Notes on Nilpotent Groups.
References 261 Hanlon, P. (1981). The fixed point partition lattice. Pacific Journal of Mathematics, 96, 319—41. Hanlon, P. (1990). The action of S„ on the components of the Hodge decomposition of Hochschild homology. Michigan Journal of Mathematics, 37, 105-24. Hausdorff, F. (1906). Die symbolische Exponentialformel in der Gruppentheorie. Leipziger Berichte, 58, 19-48. Helmstetter, J. (1989). Serie de Hausdorff d’une algebre de Lie et projections canoniques dans I’algebre enveloppante. Journal of Algebra, 120, 170-99. Higman, G. (1956). On a conjecture of Nagata. Proceedings of the Cambridge Philosophical Society, 52, 1-4. Hochschild, G. P. (1981). Basic theory of algebraic groups and Lie algebras. Springer, Berlin. Humphreys, J. E. (1972). Introduction to Lie algebras and representation theory. Springer, Berlin. Jacobson, N. (1937). Abstract derivations and Lie algebras. Transactions of the American Mathematical Society, 42, 206-24. Jacobson, N. (1962). Lie algebras. Interscience, New York. James, G. and Kerber A. (1981). The representation theory of the symmetric group. Encyclopedia of Mathematics, Vol. 16. Addison-Wesley, Reading, MA. Joyal, A. (1986). Foncteurs analytiques et especes de structures. In Combinatoire enumerative, Proceedings. Montreal, (1985) (ed. G. Labelle and P. Leroux). Lecture Notes in Mathematics, Vol. 1234, pp. 126-59. Kaplansky, I. (1957). An introduction to differential algebra. Hermann, Paris. Klyachko, A. A. (1974). Lie elements in the tensor algebra. Sibirskii Matematicheskii Zhurnal (translation) 15, 1296-304. Koseleff, P. V. (1991). Jeux de mots dans les algebres de Lie fibres: quelques bases et formules. Theoretical Computer Science, 79, 241-56. Kraskiewicz, W. and Weyman, J. (1987). Algebra of coinvariants and the action of Coxeter elements, manuscript. Kukin, G. P. (1972a). The subalgebras of free Lie p-algebras. Algebra i Logika, 11, 535-50. (In Russian.) Translation: Algebra and Logic, 11, 294-303. Kukin, G. P. (1972b). Subalgebras of a free Lie sum of Lie algebras with an amalgamated subalgebra. Algebra i Logika, 11, 59-86 (In Russian.) Translation: Algebra and Logic, 11, 33-50. Kukin, G. P. (1977a). On the equality problem for Lie algebras, Sibirskii Matematiche- skii Zhurnal, 18, 1192-5. Kukin, G. P. (1977b). Intersection of subalgebras in free Lie algebras. Algebra i Logika, 16, 577-87. (In Russian.) Translation in: Algebra and Logic 16, 387-95. Kukin, G. P. (1978). Bases of free Lie algebras. Matematicheskie Zametki, 24, 375-82. Labute, J. P. (1978). Free Lie algebras as modules over their enveloping algebra. Proceedings of the American Mathematical Society, 68, 135-9. Lallement, G. (1979). Semigroups and combinatorial applicatons. John Wiley, New York. Lalonde, P. (1992). Lyndon heaps: an analogue of Lyndon words in partially commutative monoids. In press. Lang, S. (1965). Algebra. Addison-Wesley, Reading, MA. Lazard, M. (1954). Sur les groupes nilpotents et les anneaux de Lie. Annales Scientifiques de PEcole Normale Superieure, (3), 71, 101-90. Lazard, M. (1960). Groupes, anneaux de Lie et probleme de Burnside. Istituto Matematico dell’ Universita di Roma.
262 References Lazard, M. (1963). Quelques calculs concernant la formule de Hausdorff. Bulletin de la Societe Mathematique de France, 41, 535-51. Lehrer, G. I. and Solomon, L. (1986). On the action of the symmetric group on the cohomology of the complement of its reflecting hyperplanes. Journal of Algebra, 104, 410-24. Lenormand, C. (1969/70). Exponentielle de la derivation et integration des series en variables non commutatives. In Problemes, Mathematiques de la Theorie des Automates. Seminaire Schiitzenberger-Lentin-Nivat, Paris. Lidl, R. and Niederreiter, H. (1983). Finite fields. Encyclopedia of Mathematics, Vol. 20. Addison-Wesley, Reading, MA. Loday, J. L. (1989). Operations sur 1’homologie cyclique des algebres commutatives. Inventiones Mathematicae, 96, 205 -30. Lothaire, M. (1983). Combinatorics on words. Encyclopedia of Mathematics, Vol. 17. Addison-Wesley, Reading, MA. Lucas, E. (1891). Theorie des N ombres, Vol. 1. Gauthier-Villars, Paris. Lyndon, R. C. (1954). On Burnsides problem. Transactions of the American Mathe- matical Society, 77, 202-15. Lyndon, R. C. (1955a). On Burnsides problem IL Transactions of the American Mathematical Society, 78, 329-32. Lyndon, R. C. (1955/?). A theorem of Friedrichs. Michigan Mathematical Journal, 3, 27 9. Macdonald, I. G. (1979). Symmetric functions and Hall polynomials. Oxford University Press. MacLane, S. (1950). Cohomology of abelian groups. International Congress of Mathematicians, Proceedings, 2, pp. 8-14. MacMahon, P. A. (1960). Combinatory analysis. Chelsea, New York. (Origin- ally published in two volumes by Cambridge University Press in 1915 and 1916.) Magnus, W. (1937). Uber Beziehungen zwischen hoheren Kommutatoren. Journal fur die Reine und Angewandte Mathematik, 177, 105-15. Magnus, W. (1940). Uber Gruppen und zugeordnete Liesche Ringe. Journal fur die Reine und Angewandte Mathematik, 182, 142-9. Magnus, W. (1950). A connection between the Baker-Hausdorff formula and a problem of Burnside. Annals of Mathematics, 52, 111-26. Errata: Annals of Mathematics, 57, 606 (1953). Magnus, W. (1953). Algebraic aspects in the theory of systems of linear differential equations. Mathematical Research Group. Washington Square College of Arts and Science. New York University. Research Report No. BR-3. Magnus, W. (1954). On the exponential solution of differential equations for a linear operator. Communications in Pure and Applied Mathematics, 7, 649-73. Magnus, W., Karrass, A., and Solitar, D. (1976). Combinatorial group theory, (2nd edn). Dover, New York. Maltey, F. (1988). Calcul des formules de Baker-Campbell-Hausdorff dans une base de Lyndon. Etude No. F. 137. Centre scientifique I.B.M., Paris. Meier-Wunderli, H. (1951). Note on a basis of P. Hall for the higher commutators in free groups. Commentarii Mathematici Helvetia, 16, 1-5. Melancon, G. (1991). Reecritures dans le groupe libre, I’algebre libre et I’algebre de Lie libre. These Mathematique de 1’Universite du Quebec a Montreal. Melancon, G. (1992). Combinatorics of Hall trees and Hall words. Journal of Combinatorial Theory, A59, 285-308.
References 263 b- Melancon, G. and Reutenauer, C. (1989). Lyndon words, free algebras and shuffles. Canadian Journal of Mathematics, 41, 577-91. Melancon, G. and Reutenauer, C. (1993). Computing Hall exponents in the free group. International Journal of Algebra and Computation. To appear. Metropolis, N. and Rota, G.-C. (1984). The cyclotomic identity. Contemporary Mathematics, 34, 19-24. Michel, J. (1973/74). Base des algebres de Lie et series de Hausdorff. Seminaire Dubreil, 27eme annee, No. 6. Paris. Institut Henri Poincare. Michel, J. (1974). Bases des algebres de Lie, etude des coefficients de la formule de Campbell-Hausdorff. These de 3eme cycle. Universite Paris XI. Michel, J. (1976). Calculs dans les algebres de Lie fibre: la serie de Hausdorff et le probleme de Burnside. Asterisque, 38-39, 139-98. Mikhalev, A. A. (1986). Free color Lie superalgebras. Doklady Akademii Nauk SSSR, 286, no. 3, 551-4. Translated in: Soviet Mathematics Doklady 33, no. 1, 136-9 (1986). Milnor, J. and Moore, J. (1965). On the structure of Hopf algebras. Annals of Mathematics, 81, 211-64. Moszkowski, P. (1989). Generalisation d’une formule de Solomon relative a 1’anneau d’un groupe de Coxeter. Comptes Rendus de Г Academic des Sciences de Paris, 309, Serie I, 539-41. ' Ochsenschlager, P. (1981). Binomialkoeffizienten in freien Monoiden. Dissertation. Technische Hochschule Darmstadt, Fachbereich Informatik. Olive, G. (1965). Generalized powers. American Mathematical Monthly, 73, 619-27. Patras, F. (1990). Filtration du groupe des polytopes et /.-structure du groupe symetrique. Comptes Rendus de Г Academic des Sciences de Paris, 310, Serie I, 501-4. Patras, F. (1991). Construction geometrique des idempotents euleriens, Filtration des groupes de polytopes et des groupes d’homologie de Hochschild. Bulletin de la Societe Mathematique de France, 119, 101-26. Patsourakos, A. (1987). Sur la representation adjointe d’une algebre de Lie libre. Communications in Algebra, 15, 2199-207. Perrin, D. and Viennot, X. G. (1981). A note on shuffle algebras, manuscript. Poetsch, X. G. and von Waldenfels, W. (1964). Maschinelle Berechnung der Koeffizienten der formalen Potenzreihe log exe>. Bericht der Kernforschungsanlage Julich HUL-206-PP. Postnikov, M. (1986). Lie groups and Lie algebras. Mir publishers, Moscow. Procesi, C. (1976). The invariant theory of n x n matrices. Advances in Mathematics, 19, 306-81. Radford, D. E. (1979). A natural ring basis for the shuffle algebra and an application to group schemes. Journal of Algebra, 58, 432-54. Ree, R. (1958). Lie elements and an algebra associated with shuffles. Annals of Mathematics, 68, 210-20. Ree, R. (1960). Generalized Lie elements. Canadian Journal of Mathematics, 12, 493-502. Reutenauer, C. (1985a). The local realization of generating series of finite Lie rank. In Algebraic and Geometric Methods in Nonlinear Control Theory, Colloque du CNRS, (ed. M. Fliess and M. Hazewinkel). D. Reidel, Dordrecht. Reutenauer, C. (1985b). The shuffle algebra on the factors of a word is free. Journal of Combinatorial Theory, A38, 48-57.
264 References Reutenauer, C. (1986a). Mots de Lyndon et un theoreme de Shirshov. Annales des Sciences Mathematiques du Quebec, 10, 237-45. Reutenauer, C. (1986/?). Theorem of Poincare-Birkhoff-Witt, logarithm and represen- tations of the symmetric group whose order are the Stirling numbers. In Combina- toire Enumerative, Proceedings, Montreal, (1985), (ed G. Labelle and P. Leroux). Lecture Notes in Mathematics, pp. 267-84. Springer, Berlin. Reutenauer, C. (1990). Dimensions and characters of the derived series of the free Lie algebra. In M. Lothaire, Mots, Melanges offerts a M.-P. Schiitzenberger, pp. 171-84. Hermes, Paris. Reutenauer, C. (1992). Applications of a noncommutative jacobian matrix. Journal of Pure and Applied Algebra, 77, 169-81. Reutenauer, C. and Schiitzenberger, M.-P. (1987). A formula for the deteminant of a sum of matrices. Letters in Mathematical Physics, 13, 299-302. Salomaa, A. and Soittola, M. (1978). Automata-theoretic aspects offormal power series. Springer, Berlin. Scharf, T. (1991). Die Wurzelanzahlfunktion in symmetrischen Gruppen. Journal of Algebra, 139, 446-56. Schensted, C. (1961). Longest increasing and decreasing subsequences. Canadian Journal of Mathematics, 13, 179-91. Schmidt, W. (1990). Hopf algebras and identities in free partially commutative monoids. Theoretical Computer Science 73, 335-40. Scholz, R. A. (1969). Maximal and variable word-length comma-free codes. IEEE Transactions on Information Theory, 15, 300-6. Schiitzenberger, M.-P. (1958). Sur une propriete combinatoire des algebres de Lie libres pouvant etre utilisee dans un probleme de mathematiques appliquees. Seminaire P. Dubreil. Faculte des Sciences, Paris. Schiitzenberger, M.-P. (1961). On the definition of a family of automata. Information and Control, 4, 245-70. Schiitzenberger, M.-P. (1965). On a factorization of free monoids. Proceedings of the American Mathematical Society, 16, 21-4. Schiitzenberger, M.-P. (1986). Fourbi, manuscript. Serre, J.-P. (1965). Lie algebras and Lie groups. Mathematical Lecture Note Series. Benjamin Cummings, Reading, MA. Shpilrain, V. E. (1990). On automorphisms of relatively free groups and Lie algebras. Communication at the International Congress of Mathematicians, Kyoto. Shpilrain, V. E. (1992). On generators of L/R2 Lie algebras. Proceeding of the American Mathematical Society. In press. Shirshov, A. I. (1953). Subalgebras of free Lie algebras. Matematicheskii Sbornik, 33, 441-52. (In Russian.) Shirshov, A. I. (1957). On rings with identity relations. Matematicheskii Sbornik, 43, 277-83. (In Russian.) Shirshov, A. I. (1958). Free Lie rings. Matematiceskii Sbornik, 45, 113-22. (In Russian.) Shirshov, A. I. (1962). Bases of free Lie algebras. Algebra i Logika, 1, 14-9. Solomon, L. (1968a). A decomposition of the group algebra of a finite Coxeter group. Journal of Algebra, 9, 220-39. Solomon, L. (1968/?). On the Poincare-Birkhoff-Witt theorem. Journal of Combina- torial Theory, 4, 363-75. Solomon, L. (1976). A. Mackey formula in the group ring of a Coxeter group. Journal of Algebra, 41, 255-68.
References 265 Specht, W. (1948). Die linearen Beziehungen zwischen hoheren Kommutatoren. Mathematische Zeitschrift, 51, 367-76. Stanley, R. P. (1972). Ordered structures and partitions. Memoir of the American Mathematical Society, No. 119. Stanley, R. P. (1982). Some aspects of groups acting on finite posets. Journal of Combinatorial Theory, A32, 132—61. Stanley, R. P. (1986). Enumerative Combinatorics, Vol. I. Wadsworth & Brooks/Cole Mathematic Series, Monterey. Stembridge, J. (1989). On the eigenvalues of representations of reflection groups and wreath products. Pacific Journal of Mathematics, 140, 359-96. Sundaram, S. (1992). Decompositions of some ^„-modules arising in the free Lie algebra. Journal of Algebra. In press. Sweedler, M. (1969). Hopf algebras. Benjamin. Therien, D. (1983). Subword counting and nilpotent groups. In Combinatorics on Words, Progress and Perspectives, pp. 297-305. Academic Press, New York. Thrall, R. M. (1942). On symmetrized Kronecker powers and the structure of the free Lie ring. American Journal of Mathematics, 64, 371-88. Unlii, Y. (1985). Graded subalgebras associated with a subalgebra of a graded free Lie algebra. Journal of Pure and Applied Algebra, 37, 103-10. Unlii, Y. and Ekici, N. (1986). Quasiprimeness of lower central terms of an ideal in a free Lie algebra. Journal of Pure and Applied Algebra, 43, 299-302. Varrichio, S. (1990). Factorization of free monoids and unavoidable regularities. Theoretical Computer Science, 73, 81-9. Viennot, X. G. (1974). Algebres de Lie libres et monoides libres. Thesis, University Paris 7. Viennot, X. G. (1978). Algebres de Lie libres et monoides libres. Lecture Notes in Mathematics, Vol. 691. Springer, Berlin. Viennot, X. G. (1986). Heaps of pieces I: basic definitions and combinatorial lemmas. In Combinatoire enumerative, Proceedings, Montreal, (1985), (ed. G. Labelle and P. Leroux). Lecture Notes in Mathematics, Vol. 1234, pp. 321-50. Springer, Berlin. von Waldenfels, W. (1966a). Die Koeffizienten gewisser Potenzreihen in nicht vertauschbaren Unbestimmten. Archiv der Mathematik, 17, 302-7. von Waldenfels, W. (1966/?). Zur Charakterisierung Liescher Elemente in freien Algebren. Archiv der Mathematik, 17, 44-8. Ward, M. A. (1969). Basic commutators. Philosophical Transactions of the Royal Society of London, A264, 343-412. Wever, F. (1949). Uber Invarianten in Lieschen Ringen. Mathematische Annalen, 120, 563-80. Weyl, H. (1946). The classical groups, their invariants and representations. Princeton University Press. Witt, E. (1937). Treue Darstellung Liescher Ringe. Journal fur die Reine und Angewandte Mathematik, 177, 152-60. Witt, E. (1953). Uber freie Liesche Ringe und ihre Unterringe. Mathematische Zeitschrift, 58, 113-4. Witt, E. (1956). Die Unterringe der freien Lieschen Ringe. Mathematische Zeitschrift, 64, 195-216. Yunus, I. A. (1984). On Fox’s problem for Lie algebras, Uspekhi Matematicheskihk Nauk, 39, nb. 3 (237), 251-2. (In Russian.) Translation in: Russian Mathematical Surveys, 1984, 39 nb. 3, 221-2.
266 References Zassenhaus, H. (1939). Uber Liesche Ringe mit Primzahlcharakteristik. Abhandlungen des Mathematischen Seminars Hamburg, 13, 1-100. Zelevinsky, A. V. (1981). A generalization of the Littlewood-Richardson rule and the Robinson-Schensted-Knuth correspondence. Journal of Algebra, 69, 82-94. Zerck, R. (1989). On the fixed points of the Lie ring associated with a free presentation of a finite group. Bulletin of the London Mathematical Society, 21, 381-6.
Index A-adic topology 17 admissible pyramid 170 algebra of divided powers 108 algebra of functions on the free group 132 alphabet 14 alphabetical order 105 antipode 29 antisymmetry 1 Baker’s identity 36 Bernoulli numbers 72 bialgebra 27 bilinear system 149 binomial coefficient 131 bisection 126 decreasing factorization into Hall words 89 decreasing sequence 86, 137 defect theorem 49 degree 5, 16 dependent 40 derivation 7, 19, 36, 76 derivation tree 91 derived ideal 49 derived series 112 descent 62, 185 descent composition 222 descent set 62, 185 differential polynomial 149 dual coalgebra 38 Dynkins formula 81 Campbell-Baker-Hausdorff formula 56 canonical projections 59 cardinality of a multiset 166 causal analytic functional 148 characteristic 178 Chen series 80 closed 11 circular word 154 code 119 column sum 218 comma-free 122 compatible 227 complete 119 complete symmetric function 202 complete tensor product 28 composition 218 concatenation algebra 15 concatenation product 14, 15 confluent 86 conjugate words 154 conjugation 154, 170 conjugacy class 154, 170 constant term 17 convolution product 28 cycle type 178 cyclic language 160 cyclically standard 174 elementary automorphims 47 empty word 14 enveloping algebra 2 eulerian polynomial 62, 63 evaluation 156, 166 exponent of a word 154 exponential 54 factor 14 factorization of the free monoid 173 finely homogeneous component 16 finely homogeneous polynomial 16 finer 227 finite Lie rank 149 foliage 84 formal series 17 free associative algebra 6, 15 free Lie algebra 4 free Lie p-algebra 35 free Lie superalgebra 102 free magma 4 free monoid 14 free partially commutative Lie algebra 102 free partially commutative monoid 101 free set of Lie polynomials 49 Frobenius image 178
268 Index generating function 156,179 graded bialgebra 248 Grobner formula 148 Lyndon element 170 Lyndon word 105 Hall polynomial 90 Hall set 84, 89 Hall tree 85 Hall word 89 Hausdorff series 56 highest homogeneous component 43 homogeneous component 16 homogeneous endomorphism 30, 177 homogeneous eulerian polynomial 63 homogeneous polynomial 16 Hopf algebra 29 immediate subtree 84 infiltration 135 infiltration product 134 inner product of symmetric functions 233 inversion 86, 137 invertible 86 iterated integral 56, 148 Jacobi identity 1 jacobian conjecture 51 jacobian matrix 46, 47 Jacobson formulas 33 labelled standard sequence 143 language 38 Lazard set 11 left action 176 left comb 212 left factor 14 left unitary 119 legal rise 86, 137 length 14, 166, 218 letter 14 level 113 Lie algebra 1 Lie bracket 1, 18 Lie dependent 42 Lie endomorphism 30 Lie idempotent 194 Lie invariant 207 Lie monomial 236 Lie polynomial 18 Lie product 1,18 Lie representation 180 Lie series 52 locally finite 18 logarithm 54 lower central series 136 magma 4 Magnus transformation 132 major index 185 message 122 Mobius function 150 multilinear part 178, 180 multiliner polynomial 125 multiplicity in a multiset 166 multiset 166 necklace 154 noncommutative polynomial 15 nontrivial factor 14 nontrivial sesquipower 164 normal basis 171 order associated to a Hall set 114 palindrome 33 partial degree 16 period of a necklace, a word 154 periodic expansion of a word 161 plethysm 202 polynomial 15 power sum 156, 178 prefix 14 primitive element 35, 170 primitive necklace, word 154 product of functions on the free group 132 proper factor 14 proper homomorphism 46 pseudocomposition associated to a matrix 218 pseudocomposition 218 pyramid 170 quasi-symmetric generating function 245 quasi-symmetric function 242 rank of a free Lie algebra 50 rational series 38 recognizable function 133 recognizable series 38 recognizable subset 150 representative function 133 representative subset 150 restricted enveloping algebra 50 restricted Lie algebra of characteristic p 50 reversal 28 right action 177
Index 269 right factor 14 rise 62, 86, 137 rise set 62 row sum 218 Schur function 186 semi-direct product 8 series 17 sesquipower 164 shift mapping 168 shuffle 24 shuffle algebra 24 shuffle product 24 size 73 space of subword functions 132 standard factorization 89, 170 standard list 126 standard numbering 166 standard permutation 167 standard sequence 85, 136 standard tableau 185 subword 23, 131 subword function 131, 132 subword order 150 substitution of letters 30 suffix 14 suffix code 119 support of the free Lie algebra 33 symmetrized product 57 synchronizing 119 tree 4 type 227 weight 218, 224, 227 weight-preserving 231 word 14 Young subgroup 249 Zassenhaus formula 81 zeta function 174