/
Text
Pi and the AGM
A Study in Analytic Number Theory
and Computational Complexity
JONATHAN M. BORWEIN
PETER B. BORWEIN
Department of Mathematics
Dalhousie University
Halifax, Nova Scotia
A Wiley-Interscience Publication
JOHN WILEY & SONS
New York • Chichester • Brisbane • Toronto • Singapore
Copyright © 1987 by John Wiley & Sons, Inc.
All rights reserved. Published simultaneously in Canada.
Reproduction or translation of any part of this work
beyond that permitted by Section 107 or 108 of the
1976 United States Copyright Act without the permission
of the copyright owner is unlawful. Requests for
permission or further information should be addressed to
the Permissions Department, John Wiley & Sons, Inc.
Library of Congress Cataloging-in-Publication Data:
Borwein, Jonathan M.
Pi and the AGM.
(Canadian Mathematical Society series of monographs
and advanced texts = Monographies et etudes de la
Societe mathematique du Canada)
"A Wiley-Interscience publication."
Includes index.
1. Numbers, Theory of. 2. Computational complexity,
3. Functions, Elliptic. 4. Pi. I. Borwein, Peter B.
II. Title. III. Series: Canadian Mathematical Society
series of monographs and advanced texts.
QA241.B774 1986 512'.7 86-15811
ISBN 0-471-83138-7
Printed in the United States of America
10 9876543
To our mathematician father, David Borwein
Five thousand digits of pi.
1415926535 8979323846
8214808651 32S2306647
4428810975 6659334461
7245870066 0631558817
3305727036 5759591953
9833673362 4406566430
0005681271 4526356082
4201995611 2129021960
5024459455 3469083026
5982534904 2873546873
3809525720 1065485863
5574857242 4541506959
8583616035 6370766010
9331367702 8989152104
6782354781 6360093417
3211653449 8720275596
8164706001 6145249192
4547762416 8625189835
8279679766 8145410095
0674427862 2039194943
9465764078 9512694683
4962524517 4939965143
6868386894 2774155991
4390451244 1365497627
0168427394 5226746767
1507606947
9009714909
5428584447
0374200731
8191197939
5679452080
0306803844
1005508106
2305587631
7229109816
8932261834
2332609729
1809377344
2131449576
6655730925
3348850346
7002378776
6343285878
0990796547
9389713111
8530614228
9769265672
6171196377
6222247715
9451096596
6759852613
9526586782
0578539062
9520614196
9514655022
7734549202
6587969981
7635942187
9091528017
8658516398
8963213293
9712084433
4030707469
8572624334
4711055785
1136576867
5913440171
5698305235
3761255176
7904297828
8137585043
1463853067
9213373751
8915049330
2643383279
0938446095
2847564823
4881520920
0921861173
8602139494
7783771342
8640344181
4252230825
1159562863
2788659361
5082953311
4710181942
7521620569
2164121992
0236480665
1732172147
6948556209
3883786360
0471237137
9835259570
1429809190
8559252459
8079771569
8895252138
0940252288
6554978189
1051141354
1983874478
6342875444
5231603881
6054146659
6357473638
3125147120
3506712748
3150197016
3089857064
5732654893
2112019130
4189303968
3763466820
3324944166
2749470420
8089330657
5675135751
5647303203
0633217518
3609657120
14959501 56
9844489333
5028841971
5058223172
3786783165
9628292540
8193261179
6395224737
7577896091
5981362977
3344685035
8823537875
5338182796
6861727855
9555961989
6602405803
4586315030
4991198818
7235014144
9219222184
9506800642
8696095636
9825822620
6592509372
5395943104
1435997700
5225499346
7971089314
3129784821
7357395231
0847848968
064374S123
9301420937
2520149744
4052571459
5329281918
5832228718
5151168517
2046752590
8239119325
2033038019
6426243410
6531098965
8039626579
5622303389
5740679545
7829666454
1986915140
2979866223
9180763832
6049631862
0963408780
6939937510
5359408128
2712019091
9171536436
3105118548
1907021798
7363717872
4771309960
2619311881
9375195778
8230301952
8890750983
4676783744
8150193511
2861829745
3479775356
1973568548
2725502542
2512520511
4371917287
5224894077
2169646151
9972524680
1296160894
6672782398
5669136867
6829989487
1342716610
3321445713
7181921799
6213785595
2850732518
1028970641
2618612586
3520935396
1437657618
7091548141
9746366730
7621101100
7732269780
2691862056
7877185560
9456131407
7163775254
7791745011
2870808599
7172159160
7166416274
9472654736
7693259939
5820974944
4811174502
4564856692
7892590360
0744623799
6094370277
1468440901
5187072113
7101000313
1857780532
0353018529
8175463746
9448255379
2533824300
5570674983
6369807426
1613611573
5688767179
7392984896
4677646575
2671947826
5709858387
8459872736
4169486855
6456596116
2287489405
2265880485
2135969536
8687519435
9839101591
6636937787
6660021324
4011097120
7321579198
5725121083
3515565088
6549859461
5836041428
4492932151
2807318915
4769312570
8455296541
1127000407
2021149557
2996148903
0480109412
7716692547
8888007869
4252308177
7805419341
592307S164
8410270193
346034S610
0113305305
6274956735
0539217176
2249534301
4999999837
7838752886
171226S066
6899577362
4939319255
7747268471
3558764024
8505494588
542527S625
5255213347
0494601653
0841284886
7396241389
8482601476
410597S859
4469584865
5848406353
3548862305
6010150330
7564014270
2314429524
0643021845
9561814675
0830390697
3408819071
6280439039
4148488291
5791513698
4909989859
6371802709
1388303203
6084244485
4411010446
5863566201
2665408530
8547332699
6158140025
0463994713
1472213179
4873898665
2560290228
0367515906
4473774418
0628620899 8628034825
8521105559 6446229489
4543266482 1339360726
4882046652 1384146951'
1885752724 8912279381
2931767523 6467481846
4654958537 1050792279
2978049951 0597317328
5875332083 8142061717
1300192787 6611195909
3421170679
5493038196
0249141273
9415116094
8301194912
7669405132
6892589235
1609631859
7669147303
2164201989
2599413891 2497217752 8347913151
0604009277 0167113900 9848824012
0404753464 6208046684 2590694912
7496473263 9141992726 0426992279
5869269956 9092721079 7509302955
5181841757 4672890977 7727938000
5741849468 4385233239 0739414333
4668049886 2723279178 6085784383
2694560424 1965285022 2106611863
0865832645 9958133904 7802759009
9909026401 3639443745 5305068203
5977297549 8930161753 9284681382
3836736222 6260991246 0805124388
4220722258 2848864815 8456028506
7745649803 5593634568 1743241125
8617928680 9208747609 1782493858
4775551323 7964145152 3746234364
8493718711 0145765403 5902799344
3191048481 0058706146 8067491927
1426912397 4894090716 6494231961
9207734672
0486331734
7595156771
6447060957
3209144421
9823873455
8199430992
8249037589
9637669838
8232527162
8558100729
6143444318
3908145466
0126228594
2962107340
4764777262
4949450114
4721040317
7350235072
4263129860
2182562599
6496514539
5770042033
5270695722
0067510334
2833163550
4488957571
8524374417
9522868478
0105265227
3606598764
5867697514
4645880797
1302164715
4375189573
2414254854
6540628433
2118608204
8354056704
8099888687
6615014215
0579626856
7869936007
0917567116
6711031412
7647918535
2828905923
0291327656
3123552658
2111660396
8611791045
5661406800
2708266830
5097925923
5961458901
5403321571
6639379003
1900042296
0386743513
4132604721
From the eleventh iteration of
x„., := (xj'2 + x;"2)/a
yo«:=(y„xJ'2 + xS"2)/(y» + 1)
xo := a"2
y, := a"4
Preface
When I was a student, abelian functions were, as an effect of the Jacobian
tradition, considered the uncontested summit of mathematics and each of us
was ambitious to make progress in this field. And now? The younger
generation hardly knows abelian functions.
How did this happen? In mathematics, as in other sciences, the same
processes can be observed again and again. First, new questions arise, for
internal or external reasons, and draw researchers away from the old
questions. And the old questions, just because they have been worked on so
much, need ever more comprehensive study for their mastery. This is
unpleasant, and so one is glad to turn to problems that have been less
developed and therefore require less foreknowledge—even if it is only a
matter of axiomatics, or set theory, or some such thing.
And so there is nothing for it but to collect together the old subjects in
good references... so that later developments may continue them, if fate
should so decree.
Felix Klein (1849-1925) [79 p. 294]
A central thread of this book is the arithmetic-geometric mean iteration
of Gauss, Lagrange, and Legendre. A second thread is the calculation of v.
The two threads are intimately interwoven and provide a remarkable
example of the application to twentieth-century computational concerns of
the type of nineteenth-century analysis whose neglect Klein so deplores. The
calculation of digits of v has had a fascination that has far exceeded
utilitarian concerns—a fascination that has driven some to dedicate their
lives to calculations we may now electronically effect in seconds. The
methods that make the computation of hundreds of millions of digits of v or
any elementary function within our grasp are rooted in the AGM and this is
where our interest in the material began. Making sense of this material took
us in three directions and motivated our writing this book.
The first direction leads to nineteenth-century analysis and in particular
the transformation theory of elliptic integrals. This necessitates at least a
vii
Preface
ix
tions helped produce a better book. The cheerful technical assistance of P.
Flemming, of our departmental office, and of the staff at Wiley is also
gratefully acknowledged, as is the assistance of the Canadian Mathematical
Society and the support of the Natural Sciences and Engineering Research
Council of Canada.
Jonathan M. Borwein
Peter B. Borwein
Halifax, Nova Scotia
November 1986
1
Complete Elliptic
Integrals and the
Arith metic-Geometric
Mean Iteration
\
2
Theta Functions,
TheAGM
and
Algorithms for Pi
\
3
Jacobi's Triple-
Product,
Theta Functions and
Number Theoretic
Applications
1
4
Modular Equations
and the
Transformation
Theory of
Elliptic Integrals
1
5
Modular Equations
and Algebraic
Approximations
to Pi
^--,
<.
6
The Complexity
of Calculating
Algebraic Functions
♦
7
The Complexity
of Pi and the
Elementary Functions
1
8
General Means
and
Mean Iterations
9
Further
Applications of
Theta Functions
->
^
^-^
->
10
Other Approaches
to the
Elementary Functions
I
1
11
The Story of Pi
Computation
and
Transcendence
- Necessary Reading Order
- Possible Reading Order ,
V1U
Preface
brief discussion of a number of topics including elliptic integrals and
functions, theta functions, and modular functions. These attractive and once
central concerns of analysis have been dropped from the standard
curriculum—and much that is beautiful has become relatively inaccessible
except to the expert or the archivist. In presenting this material we have not
striven for generality. This is available in the specialty literature. At times
we have settled for giving only a taste of the material and a few pointers on
where it can be pursued.
We have found this excuse to consult the nineteenth-century masters a
pleasurable and rewarding bonus—as Hermann points out in his
introduction to Klein [79], "We are so used to thinking in terms of the 'progress' of
science that it is hard for us to remember that certain matters were better
understood one hundred years ago."
The second direction takes us into the domain of analytic complexity.
How intrinsically difficult is it to calculate algebraic functions, elementary
functions and constants, and the familiar functions of mathematical physics?
Here part of the attraction is the surprising answers—the familiar methods
are often far from optimal.
Finally, an honest treatment invites exploration of applications and
ancillary material, particularly the rich and beautiful interconnections
between the function theory and the number theory. Included, for example, are
the Rogers-Ramanujan identities; algebraic series for v; results on sums of
two and four squares; the transcendence of v and e; and a discussion of
Madelung's constant, lattice sums, and elliptic invariants.
Our primary concern throughout has been the interplay of analysis and
mathematical application. We hope we have elucidated a variety of useful
and attractive analytic techniques. This book should be accessible to any
graduate student. Only rarely does it assume more than the content of
undergraduate courses in real and complex analysis. It is, however, at times
terse, at times computational, and some of the exercises are difficult. A fair
amount of the material, particularly on the approximation and computation
of v and the elementary functions, is new and only partially available in
research papers.
The accompanying flow-chart gives possible routes through the material.
Chapters 8 and 11 are largely self-contained. Chapter 8 is a treatment of
general mean iterations, while Chapter 11 sketches some of the history of pi,
its calculation and its transcendence. There are numerous exercises
(frequently with hints). The exercises often develop substantial additional
examples and bodies of theory, and even the casual reader is encouraged to
look at them.
The contributions of family, friends, students, and colleagues have been
many and varied and have greatly facilitated the production of this book. To
all these people we offer our thanks. A particular debt of gratitude is owed
to Professors R. Askey, B. Berndt, R. Brent, K. Dilcher, W. Gosper, Y.
Kanada, D. Shanks, and J. Zucker. Their thoughtful comments and sugges-
Biographical Information
Jonathan M. Borwein was born in St. Andrews Scotland in 1951. In 1971 he
obtained an Honours B.Sc. in mathematics from the University of Western
Ontario, where his father is still head of the mathematics department. He
was awarded an Ontario Rhodes Scholarship that year which he held at
Jesus College Oxford, where he was awarded a mathematics D.Phil, in 1974
under the supervision of Michael Dempster. Since then he has been on the
faculty of Dalhousie University and has been Professor of Mathematics since
1984. He has also been on faculty at Carnegie-Mellon University (1980-
1982) and has spent visiting research periods at Cambridge, Limoges, and
the University of Montreal. His other research interests are in classical
analysis, functional analysis, and optimization theory.
Peter B. Borwein was born in St. Andrews Scotland in 1953. He obtained an
Honours B.Sc. in mathematics from the University of Western Ontario in
1974, and a Ph.D, under the supervision of David Boyd, from the University
of British Columbia in 1979. He spent 1979-1980 as a NATO research
fellow in Oxford. Since then he has been a faculty member at Dalhousie
University and is now Associate Professor of Mathematics. He has spent a
sabbatical year at the University of Toronto. His other research interests are
in approximation theory, classical analysis, and geometry.
xi
Contents
Chapter 1 Complete Elliptic Integrals and the Arithmetic-Geometric
Mean Iteration 1
1.1 The Arithmetic-Geometric Mean Iteration 1
1.2 Gauss's Derivation of the Fundamental Limit Formula 5
1.3 Basic Properties of Complete Elliptic Integrals 7
1.4 Quadratic Transformations and Iterations and a Third
Proof of the Fundamental Limit Formula 11
1.5 Jaeobi's Differential Equation and a Fourth Proof of the
Fundamental Limit Theorem 18
1.6 Legendre's Relation 23
1.7 Elliptic Functions 28
Chapter 2 Theta Functions and the Arithmetic-Geometric Mean
Iteration 33
2.1 A Theta Series Solution to the AGM 33
2.2 Poisson Summation 36
2.3 Poisson Summation and the AGM 40
2.4 The Derived Iteration and Some Convergence
Results 44
2.5 Two Algorithms for it 46
2.6 General Theta Functions 52
2.7 The Landen Transformation 57
Chapter 3 Jaeobi's Triple-Product and Some Number Theoretic
Applications 62
3.1 Jaeobi's Triple-Product Identity 62
3.2 Some Further Theta Function Identities 67
3.3 A Combinatorial Approach to the Triple-Product Identity 76
3.4 Bressoud's "Easy Proof of the Rogers-Ramanujan
Identities 78
xiii
XIV
Contents
3.5 Some Number Theoretic Applications 81
3.6 The Mellin Transform and the Zeta Function 87
3.7 Evaluation of Sums of Reciprocals of Fibonacci
Sequences 91
Chapter 4 Higher Order Transformations 102
4.1 A First Approach to Higher Order Transformations 102
4.2 An Elementary Transcendental Approach to Higher
Order Transformations 109
4.3 Elliptic Modular Functions 112
4.4 The Modular Equations for A and j 119
4.5 The Modular Equation in u-v Form 126
4.6 The Multiplier 136
4.7 Cubic Modular Identities 142
Chapter 5 Modular Equations and Algebraic Approximations to it 152
5.1 Singular Values of the Second Kind 152
5.2 Calculation of a 155
5.3 Further Formulae for a 164
5.4 Recursive Approximation to it 169
5.5 Generalized Elliptic Integrals and Rational and Algebraic
Series for 1/tt and \IK 111
5.6 Other Approximations 191
Chapter 6 The Complexity of Algebraic Functions 200
6.1 Complexity Concerns 200
6.2 The Fast Fourier Transform (FFT) 204
6.3 Fast Multiplication 209
6.4 Newton's Method and the Complexity of Algebraic
Functions 212
Chapter 7 Algorithms for the Elementary Functions 219
7.1 77 and Log 219
7.2 Theta Function Algorithms for Log 224
7.3 The Complexity of Elementary and Elliptic
Functions 226
Chapter 8 General Means and Iterations 230
8.1 Abstract Means 230
8.2 Equivalence of Means 239
8.3 Compound Means 243
8.4 Convergence Rates and Some Examples 249
Contents
xv
8.5 Carlson's Integrals and More Examples 256
8.6 Series Expansions of Certain Means 263
8.7 Multidimensional Means and Iterations 266
8.8 Algebraic Iterations and Functional Relations 273
Chapter 9 Some Additional Applications 281
9.1 Sums of Two Squares 281
9.2 (Chemical) Lattice Sums 288
9.3 Odd-Dimensional Sums and Benson's Formula 301
9.4 The Quintuple-Product Identity 306
9.5 Quintic and Septic Multipliers and Iterations 309
Chapter 10 Other Approaches to the Elementary Functions 316
10.1 Classical Approximations 316
10.2 Reduced Complexity Methods 326
Chapter 11 Pi 337
11.1 On the History of the Calculation of ir 337
11.2 On the Transcendence of v 347
11.3 Irrationality Measures 362
Bibliography 387
Symbol List 396
Index
405
Chapter One
Complete Elliptic Integrals and
the Arithmetic-Geometric
Mean Iteration
Abstract. The focus of this chapter is the arithmetic-geometric mean (AGM)
iteration of Gauss, Lagrange, and Legendre and its relationship to elliptic
integrals. The iteration converges quadratically to a nonelementary
transcendental function simply expressible in terms of complete elliptic integrals. This
result, which is fundamental to this monograph, is established in a variety of
ways. Some of the basic properties of elliptic integrals and functions are
discussed.
1.1 THE ARITHMETIC-GEOMETRIC MEAN ITERATION
One of the jewels of classical analysis is the arithmetic-geometric mean
(AGM) iteration of Gauss. It is the following two-term recursion:
C1 1 1\ „ ■ an + bn
(1-1-1) «»+!•= n
(1.1.2) *W=VoA.
It is customary and useful to introduce an auxiliary variable,
(1-1-3) W=i(a„-6„).
If we assume that 0<60^a0, then from the arithmetic-geometric mean
inequality we have
1
2
The Arithmetic-Geometric Mean Iteration
and
(1.1.4) 0^«„+1-6„+1--(v_; + v^)2.
Whence we observe that an and bn converge to a common limit determined
uniquely by a0 and b0. This common limit will be denoted (on letting a'= a0
and b '• — b0) by
(1.1.5) M(a, b) := lim an = Urn bn .
We will refer to this limiting process as the AGM and usually reserve the
symbols an, bn, and c„ for variables bound by the AGM relations. We also
reserve the symbol AG(a, b) for the common limit.
We say that a„ —> a ,with pth-order convergence if
(1.1.6)
(a„ - a)p
= 0(1)
where, as usual, an = 0(j8„) means that, for some constant c and for all n,
an :£ cj8„ . If the an are functions defined for all x in a set jRT, and if the
implicit constant concealed by the O symbol in (1.1.6) is independent of x,
then we say that the convergence is uniformly pth order. Roughly speaking,
quadratic (second-order) convergence doubles the number-of-digits
agreement between successive iterates and the limit, cubic (third-order)
convergence triples the agreement, and so on.
It is an easy observation from (1.1.4) that the AGM converges uniformly
quadratically for a0, b0 restricted to compact subsets of (0, °°). Very precise
estimates for the rate of convergence will be established later.
We also observe that M(a, b) is homogeneous, that is, for A > 0
(1.1.7) AM(a, b) = M(Xa, Xb)
and thus there is little loss of generality, though often some loss of
symmetry, to setting a = 1.
The function M satisfies
(1.1.8) M{a,b) = M(~^-,VaT))
or
(1.1.9) M(l,6) = i^M(l,|^).
1.1 The Arithmetic-Geometric Mean Iteration
3
The analysis of the limit of the AGM rests on finding a two-variable function
M invariant under the transformation (1.1.8) or, equivalently, on finding a
function / satisfying the functional relation (1.1.9), namely,
(1.1.10) /W-HMlS)-
If we set k0 '■= x G (0,1) and
(1.1.11) kn+1: =
*/K
l + kn
then since
"+1 (i + VU2(i + U
we have that kn-*l quadratically. In fact, the function g defined by
(1.1.12) g(x):=af[^r^ k0:=x
n=0 z
is the unique solution of (1.1.10) analytic in a neighbourhood of 1 that
satisfies g(l) = a. (See Exercise 4.) This form of the AGM, as a single
variable iteration, is usually called the Legendre form.
It is convenient and standard to define the complement x' of x by
x . Differentiation will be denoted by /.
Comments and Exercises
The early history of transformations of elliptic integrals, of which, as we
shall see in the next section, the AGM is an example, is laid out in an
entertaining article by G. N. Watson [33] entitled "The Marquis and the
Land-Agent: A Tale of the Eighteenth Century." The marquis is Fagnano
and the land-agent is Landen. Landen's transformation, published in 1775,
will be discussed later. The names of Euler and Lagrange should also be
associated with the early transformation theory. Lagrange uncovered the
AGM iteration sometime before 1785. Gauss rediscovered it independently
in the 1790s. He apparently first considered the iteration in 1791 at the age
of 14 (Almquist and Berndt [Pr]). It is, however, Gauss and Legendre who
develop the theory fully. As Watson [33] points out, "in the hands of
Legendre, the transformation became a most powerful method for
computing elliptic integrals." Gauss is unique in having deduced the invariant
function from the functional equation rather than proceeding in the opposite
(and easier) direction.
The Arithmetic-Geometric Mean Iteration
Deduce that, for the AGM,
an = an+1 + cn+1 bn = an+1-cn+1 c2n = a2n-b2n.
Hence the AGM is well defined for negative n. Show that
a_„ = 2"a„* 6_B=2"c; c_n = 2nb*n,
where a*, b* and c* are generated from the AGM commencing with
a% := a0, b% '■= c0, and cj ".= b0. Show that
c»-i 1
c = =S — c
"■ 4a„ 2 "-1-
Observe that this formula avoids the subtractive cancellation problems
inherent in calculating the very small number c„ from cn+1 = \{an - bn).
Show that
«n + V«n-i(2«n-«n-i)
a„^ = r
Consider the harmonic-geometric mean iteration
._ 2a„)3„
«» + A,
Show, for a0, /30E(0, °°), that the above iteration converges quadrati-
cally to H(a0, )80), where
H(a0, &)'=
M(l/a0,l/P0) ■
Consider the arithmetic-harmonic mean iteration
.= <*„ + /?,,
«»+i • 2
2«„)8„
A,+i: =
«» + A,
Show, for a0, )80 G (0, °°), that the above iteration converges quadrati-
cally and that
lim an = lim /3„ = V^A ■
1.2 Gauss's Derivation of the Fundamental Limit Formula
5
Show that the AGM is a well-defined quadratically convergent iteration
for starting values a0 '■= 1, b0- = z, where re(z) > 0. Likewise the
function g of (1.1.12) is a single-valued analytic function on re(z)>0. In
both cases the convergence is uniformly quadratic on compact subsets.
[The root must always be chosen to lie in re(z) > 0.] ■ Show that
g(z) = M(l, z) for re(z)>0.
1.2 GAUSS'S DERIVATION OF THE FUNDAMENTAL
LIMIT FORMULA
By May 30th, 1799, Gauss had observed, purely computationally, that
2 f1 dt
(1-2.1) ^77-^ -d " J0 ^^
M(l, V2)
agreed to at least eleven decimal places. He commented in his diary that this
result "will surely open up a whole new field of analysis"—a claim
vindicated by the subsequent directions of nineteenth-century mathematics. The
inverse of the above (indefinite) integral is the lemniscate sine, a function
Gauss studied in some detail. He had recognized it as a doubly periodic
function (see Section 1.7) by the year 1800 and hence had anticipated one of
the most important developments of Abel and Jacobi: the inversion of
algebraic integrals.
We now outline Gauss's derivation of the limit of the AGM (Gauss
[1866]). This is not the easiest development but it may be the most
motivated.
Theorem 1.1
2 r12 dd
Jo
M(l, x) it Jo yi - (1 - x2) sin2 8 '
First proof. Observe that [M(l + x, 1 -x)]~\ defined by (1.1.5), is
analytic and even in some neighbourhood of zero. (See Exercise 4, Section
1.) Thus we may suppose that
(1.2.2) 777— : r = 1 + d,x2 + d2x4 + d,x6 + ■■•.
v ' M(l + x, 1-x) x 2 3
Upon application of the AGM transformation we have
1 1
(1.2.3)
M(l + 2Vx/(l + x), 1 - 2V*/(1 + x)) M(1> yi-4x/(l + xf)
- 1 + x
~ M(l + x, 1 - x) "
6 The Arithmetic-Geometric Mean Iteration
Comparing (1.2.2) and (1.2.3) gives
(1 +x)(l + dlX2 + d2x4 + ---) = 1 +d^—j +d2(TT^) +■■-.
(1.2.4)
We leave it as an exercise to the reader to follow Gauss's footsteps by
solving the above equation for dt. In fact,
t2
42'-1
(1-2.5) -J'-L «0^1)1
If we observe, as in (1.2.3), that
(1,2'6) M(l,Vl^x~2) = M(l + x,l~x)
we see by (1.2.2) that we are finished if we show that
/1 o ^ 2 r'2 de V
(1.2.7) - . =Z I .,,. 1X,
irJ° Vl-*2sin20 #_0 L*!(» — 1)!
2 I"*'* <*» _y f(2f-l)I12 -a
-1)!
2 _■ 2 /i\-l/2
4«-»
This final equation requires expanding (1 - x sin 8)~ and integrating
term by term.
It is perhaps possible to be guided to the limit of the AGM by the above
method. If, however, one has correctly guessed the limit, then proving it
correct is much more straightforward and only involves establishing the
invariance of the limit under the transformation.
Second proof. Let
(1.2.8) na,b):=lfQ
r/2
de
Va2 cos2 e + b2 sin2 0
Then, as the substitution f- = b tan 6 shows,
dt
(1.2.9) T{a, b) = -\
IT J -
V J— V(«2 + t2)(b2 + t2) '
Now the substitution u- = \(t — abli) (and some care) yields
(1.2.10) T(a,b)=T(~~-,V^b).
It follows that T(an, bn) is independent of n and hence, since (1.2.9)
1.3 Basic Properties of Complete Elliptic Integrals 7
evaluates as an arctan when a = b, we have on interchanging limit and
integration
(1.2.11) T(a0, b0) = T(M(a0, b0), M(a0, b0)) = ^ . □
Comments and Exercises
An excellent account of the development and the importance of elliptic
function theory in the nineteenth century is to be found in Felix Klein's
classical work, "Development of Mathematics in the 19th Century" (Klein
[79]). Gauss's works are, of course, available in collected form (Gauss
[1866]). Before establishing the limit formula, Gauss produces partial
expansions of M(l, x) and M(l + x, 1 - x) plus a number of AGM
calculations carried to as many as 20 decimals. It is apparent that both the
observation of the limit and the route to a proof were indicated by
prodigious numerical experimentation.
The second proof may be found in Carlson [71], Newman [82, 85], Todd
[79], or Wimp [84]. Carlson [71] offers some interesting generalizations.
These are discussed in Section 8.5.
1. Fill in the details in the above proofs. In particular prove (1.2.5),
(1.2.7), and (1.2.10).
2. Show that [M(l + x, 1 - x)]'1 and [M(l, x)]"1 both solve the second-
order differential equation
(*3 - *) 71 + (3*2 - 1) T~ + xy = 0 .
dx dx
Hint: For the second solution consider equation (1.2.6).
1.3 BASIC PROPERTIES OF COMPLETE ELLIPTIC INTEGRALS
The two basic integrals we will encounter are the complete elliptic integral of
the first kind,
dd
rn/2 j.
2 • 2 n
sin 8
- f1 dt
~J° V(l - t2)(l - k2
V(l - t2)(l - k2t2)
and the complete elliptic integral of the second kind,
8 The Arithmetic-Geometric Mean Iteration
ir/2 /
de
ctt/2
(1.3.2) EW:=)o Vl - k2 sin2 8
-I
VT^tV
o VT1
<fc.
r
The complementary integrals E' and £' are the integrals in the
complementary variable k' = VI - k2,
(1.3.3) K'(k):=K0/l-k2) = K(k')
(1.3.4) E'(k):=E0/l-k2) = E(k').
As is traditional, we willuse the notation f'(k)'-=f(k') to indicate any
function in the complementary variable. The variable k is often called the
modulus, and k' is the complementary modulus.
The second integral arises in the rectification of ellipses. The arc-
length A of an ellipse with semiaxes a and b is given by
A = 4jo W cos2 8 + b2 sin2 8 d8 = 4aEM-) .
The first integral has the following physical interpretation. If p is the
period of a pendulum with amplitude a and length L, then
K-WfiMD)
8
where g is the gravitational constant. Note as a—>0, K(sm(a/2))—> v/2,
and we are left with a simple harmonic approximation.
The Gaussian hypergeometric series, discussed by Gauss in 1812 in what
is one of the first rigorous discussions of convergent series, is defined by
,..<,« ., ^ , a-b a(a + \)b(b + V) 2
(1.3.5) F(a,b;c;z):=l+ — z+ \.2.^+1)^2
a(a + l)(a + 2)b(b + !)(& + 2) 3
l-2-3-c(c+l)(c + 2) Z •
For the complete elliptic integrals we have the series expansions, for
1*1 <1,
1.3 Basic Properties of Complete Elliptic Integrals 9
where (2/ - 1)!! := 1 • 3 • 5 • • • (2/ — 1). The derivations are left as Exercise 1.
We have, as in Exercise 2 of Section 1.2, that K and K' are both solutions
of
(1.3.8) (k3-k)^+(3k2-l) ^~ + ky = 0.
This equation has a regular singular point at zero (the roots of the indicial
equation are both 0) which, as the reader familiar with the elementary
theory of second-order differential equations knows, says that
(1.3.9) K'(k) = alogk K(k) + f(k)
where /is analytic in a neighbourhood of zero. (See, for example, Birkhoff
and Rota [69].)
In fact,
(1.3.10) K'(k)=llog(f)K(k)
This logarithmic asymptote at zero will be of considerable interest later in
the discussion of the complexity of log. Also,
(1.3.11) ^) = 1+1(^)-^
^M-iVsV^K---
Once again the above verifications are left as exercises.
The first and second integrals are linked by the equations
/.o,^ dE E~K
<L3-12> ik = -nr
/--,.,^ dK E-k'2K
(L3-13) * = "W~-
(See Exercises 2 and 3.)
10
The Arithmetic-Geometric Mean Iteration
Comments and Exercises
The functions K and E are nonelementary transcendental functions. This is
a result of Liouville's. In general an elliptic integral is an integral of the form
J R(x, y)dx,
where R is a rational function of x and y and where y2 is a quartic
polynomial in x. Except in special cases, such as repeated factors, this is
always a nonelementary function of u. Incomplete elliptic integrals of the
third kind are of the form
f" dx
•>° (1 - t,*2)V(1 - x2)(l - k2x2) '
The integral is complete when u = 1. (The complete third integral can be
expressed in terms of K and E.) Analogously we may define incomplete
elliptic integrals of the first and second kind. The basic result due to
Legendre is that any elliptic integral may be algebraically reduced to a linear
combination of elliptic integrals of the first, second, and third kind. (See
also Exercise 5 of Section 1.4 and Exercise 6 of Section 1.6.)
The formulae in the section may be found, in tabular form, in Ab-
ramowitz and Stegun [64], Gradshteyn and Ryzhik [80], and the Bateman
project (Erdelyi et al. [53]), so may transformation formulae for the
hyper geometric functions. Of course the indispensible companion volume is
Whittaker and Watson [27].
A number of the seminal nineteenth-century papers, including Gauss's on
the hypergeometric series and Jacobi's on theta functions, are available in
translation in Birkhoff [73].
1. Establish the series expansions (1.3.6) and (1.3.7) for K and E by
expanding the radical by the binomial theorem and integrating term by
term.
2. Establish, by differentiating the integral (1.3.2), that
dE E-K
dk~ k '
3. Verify, from the series expansions (1.3.6) and (1.3.7), that
dK _E- k'2K
dk kk'2 "
4. From the integral representations show that
1.4 Third Proof of the Fundamental Limit Formula
11
a) £(0) = f £(0) = f £(1) = 1.
b) K'(k) = log(|) + 0(A:2|log k\) k I 0 .
Hint:
r12 de = r
J° Vl - (1 - it2) sin2 e ->c
A:' sin 8 d8
Vl - (1 - A:2) sin2 8 Jo Vit2 + (it')2 cos2 8
+
r/2 /r=~F
Jo V l + if
sin 0
sin 8
dd .
The second integral evaluates to log [(1 + k')/k\. (See Borwein and
Borwein [84a].)
c)
*'(*)- log (I)
;4A:2(8 + |logA:|) A: G (0,1].
5. Verify the expansions (1.3.10) and (1.3.11). (See also Exercise 1,
Section 2.3.)
6. Establish the relation, for re(c) > re(6) > 0,
F(a, b; c; z) = C:jQ t"-\l - t)c-b~\l - tz)~
dt
where C is a constant independent of z. [In fact C = T(c) /T(b)T(c - b)
where, as usual, T is the gamma function. (See Section 1.6.)] Note that
this provides an analytic continuation of F to C — [1, °°).
7. Show that F(a, b;c;z) satisfies
z(l - z) ^4 + [c - (a + 6 + l)z] ~^-aby = 0.
This is the hypergeometric differential equation.
1.4 QUADRATIC TRANSFORMATIONS AND ITERATIONS AND A
THIRD PROOF OF THE FUNDAMENTAL LIMIT FORMULA
The complete elliptic integrals satisfy the following functional relations
which we collect together as
12 The Arithmetic-Geometric Mean Iteration
Theorem 1.2
For A: £(0,1),
(«) m = ^-kK{^) (upward)
(b) TO = T^7*(yTf) (downward)
(c) E(k) = ^E(^) + ^K(k) (upward)
(d) E(k) = (l + k')E(~j^j-k'K(k) (downward).
Proof. Parts (a) and (6) are equivalent and follow from the previous
discussion. We give a direct proof of (b) based on Ivory [1796]. This is also a
third proof of Theorem 1.1. Let 1:=(1-k')/(1 +k'). Then we have
V7 = kl(l + k') and 1 + /2 = 2(1 + k'2) /(1 + k'f. Thus
d6
m^-u'^?
V(l + /2) + 2/cos20
on replacing sin2 0 by (1 - cos 20) 12. Then
(~r")*w=\ Jo*(1+/e~2l'flr1/2a+/e2l'fl)~1/2 d0
2m,»=o \ m /\ n /Jo
Here we have used the binomial theorem twice. Since only the terms with
m = n are nonzero, we have
(^■fjO-f'd.!*')-'"
on using (1.3.6). This completes (b). If g(k): = 2Vk/(l + k), then g'1 =
(l-k')/(l + k') and [1 +g'(k)]/2= 1/(1 +k). Now (a) follows by
substituting g(k) for /: in (6). To derive part (c), we differentiate (a) to get, for
K = dK/dk,
(1.4.1) (1 + /:)¾ + K(k) = K(g(k))g(k) .
This is coupled with the differential equation (1.3.13) in the forms
1.4 Third Proof of the Fundamental Limit Formula 13
(1.4.2) E(k) = kk'2K(k) + k'2K(k)
and
(1.4.3) E(g(k)) = g(k)[g'(k)]2k(g(k)) + [g'(k)fK(g(k)) .
We now use (1.4.1) to eliminate K(g(k)) from (1.4.3) and then employ
(1.4.2) and (a) to solve for E(g(k)) in terms of K(k) and E(k). The
algebraical details are left to the reader. Part (d) may be derived
analogously from (b) or by substituting g~1(k) for k in (c). □
The transformations are termed upward and downward because, for
k G (0,1), iterating (a) leads to a sequence of k values increasing to 1 and
iterating (b) forms a sequence of k values decreasing to 0.
It is convenient to introduce homogeneous forms of E and K and to
recast Theorem 1.2 in AGM terms.
Let
(1.4.4) Ha,*):-!"* ., ,« , , .ilffl
Jo Va2 cos2 8 + b2 sin2 8 a Vfl/
(1.4.5) J(a,b):=JQ ■ Va2 cos2 8 + b2 sin2 8 d0 = «£'(-) •
Theorem 1.3
If a„+1 := (a„ + 6„)/2, 6„+1 : = V«A. and 0 < 6„ < a„, then
(«) '(«n+i,fen+i) = /(«n>&n)
(6) 2/(a„+1, 6„+1) - /(a„, 6„) = anbnI(an, bn).
Proof. Part (a) has been observed in the second proof of the
fundamental limit theorem (Theorem 1.1).
To see part (b), notice that if kn ■= cnlan, then k'n= bjan,
2/(«B+i, bn+1) = 2an+1E(yjl - ^) = 2an+1E(kn+1)
and
J(aH,bn) = aHE(^l- ^)= <*„£(*„).
Recall that c2 = a2 — b\. The relationship between kn+1 and A:„ is given by
. =Qn-bH = l-k'n
Kn+1 a„ + b„ 1 + k''
14 The Arithmetic-Geometric Mean Iteration
Thus, establishing (b) is equivalent to establishing
2an+1E(kn+1)-anE(K) = bnK(kn)
which on setting an+1 = (an + bn)l2 and dividing by an may be seen to be
equivalent to part (d) of Theorem 1.2. □
These transformations may be iterated to produce quadratically
convergent algorithms for K and E. (See also Exercise 1.)
x I! (1 + k'„) (upward iteration)
2 „=i
*oe(o,i].
CO
x ll (1 + kn) (downward iteration)
2 „=i
where
*"+i:=TT*^ and ^o e [0,1).
The first product is the unique solution of Exercise la) analytic in a
neighbourhood of 1 that takes the value irl2 at 1, while the second product
is the unique solution of functional relation (b) of Theorem 1.2 that takes
the value v/2 at 0 and is analytic in a neighbourhood of 0. This observation
verifies that the above products are analytic in neighbourhoods of 0 and 1,
respectively, since they are uniformly convergent products of analytic
functions. Also, specifying the value of an analytic solution of either functional
relation at any point in (0,1) in fact specifies the function at an infinite set of
points with limit point within the domain of analyticity and hence uniquely
defines the function up to a constant multiple of the value at the limit point.
(See Exercise 4.)
The algorithms for E and K in AGM form are particularly attractive.
Algorithm 1.2
For a0:= 1, b0 ■= k'E.(0,1], and c0:= k,
Algorithm 1.1
(a) *'(*„)= fn^nr—
z „=0 l ^ Kn
where
(6) ^)-111^ =
1.4 Third Proof of the Fundamental Limit Formula
15
(b) ^"fi-irV.W
Proof. Part (a) is Theorem 1.1. For part (6) we use Theorem 1.3,
and since 4a2+1 - 2a2 - 2a„6„ = -c2,
2"+1[/(a„+1, 6„+1) - a2+1/(a0, &„)] - 2"[/(a„, 6„) - «^(«0, 60)]
= 2-^/(^,6,,).
Thus on summing
(1.4.6) J(a0, b0) = (a2 - 2 2^)/(^, 60)
N n=0 '
which specializes to (6). Here we must observe that
An:=2n[a2J(an,bn)-J(an,bn)]
^ («2 - 62) sin2 8
y/a2n cos2 0 + 62 sin2 0
dd
sin2 0
Va2, cos2 0 + 62 sin2 0
Thus Os A„ <2"c2/(an, 6„), and A„ tends to zero as n—>°° since c2 tends to
zero quadratically. D
Comments and Exercises
The algorithms of the section provide remarkably efficient methods for the
calculation of E and K and related functions. The analysis of these
algorithms will be reserved for later chapters. We have restricted our attention
in this section primarily to a real variable k. This is simplifying but entirely
unnecessary. All of the algorithms and functional relations extend naturally
to the complex domain. In fact, all the algorithms and functional equations
of this section hold at least for kE.{re(z)>0} — [1,00). The interested
reader may readily establish the exact domains of validity for the various
relations. The analysis of the AGM iteration for complex starting values is
reasonably complicated. (See Cox [85].) The problem is to decide which
16
The Arithmetic-Geometric Mean Iteration
root is appropriate in the computation of 6„+1 = ^Janbn. The right choice is
made to ensure \an+1 - bn+1\ < \an+1 + bn+1\ [with im (6„+1/a„+1) >0 in the
case of equality]. The surprise is that no matter how the roots are chosen,
the AGM iteration converges (provided a0^—bQ) though unless the right
choice is made all but finitely often, the limit will be zero.
Exercise 5 on the Landen transform provides an algorithm for calculating
incomplete elliptic integrals. We will revisit Landen's transform in Chapter
2. A wealth of formulae on the calculation of elliptic integrals may be found
in King [24].
1. Show that
a) K'(k)
L_ r.( *rt\
K
l + k"\l + k/
b) kW^^TTf)
c) E'(k) = (l + k)E'(j^)-kK'(k)
d) ^)-(^)^)4^)
and observe that K'lK satisfies a multiplication theorem, namely,
Show that for every integer n there is a unique algebraic x G (0,1) so
that
Show that if kn+1 :=(1- k'n)/(1 + k'n) and )n : = yfk^H, then
and hence
b) K(x) =
i+^w:
(1 + VF)2*VVL1 + ^T7
K
)■
Also
c) *(/&=? n
1.4 Third Proof of the Fundamental Limit Formula
77 TT 4
17
f3 = f ri(i + ;„)2
2-0(1 + ^1^)2 2^
Show that /„ converges quartically to zero.
(77ie quartic AGM) Let a„, 6„, and c„ satisfy the AGM relations. Set
an := a2n and Pn'-=b2„2- ShoW that
a) «n+l =
and
<*n + Pn
b) ft + l = (^^)
1/4
Show also that
c) s 2-¾=s 4-«:- (^ .
n=0 n = 0 L \ Z, / J
Show that the convergence is governed by
4 _ (<*„ ~ ft,)4
d) a»+i-jSll+i*
16
Derive the following quartic algorithms. For a0:=l and j80:=V^fe7(
(0,1],
e) K(k) =
21im«:
f)
K(k)
-14-(^)1
Consider a functional relation
F(z) = s(z)F(g(z)) F(0) = a*0
where s and g are analytic in some complex neighbourhood U of zero.
Suppose that, for some p>\, limn_>eo g(n)(z)—>0 with />th order
convergence uniformly on U. [g(n) denotes g composed with itself n times.]
Show that the above relation has a unique nonzero analytic solution on
U if and only if s(0) = 1.
(The Landen transform) Consider the incomplete elliptic integral of
the first kind
P(4>:
■'*>:-j«
de
0 Vl - k2 sin2 6
kE[0,l), ^0.
18 The Arithmetic-Geometric Mean Iteration
Show, as in the second proof of Theorem 1.1, that if we set kn+1 •'=
(1 - k'n)l{\ + k'n) and tan (<£„+1 - <£„) = /^ tan <f>n, then
a) F(<t>n+1,kn+1) = (l + k'n)F(<l>n,kn) kn+1<kn, 4>„+1^„
and
b) ^.*.)=(noT^)um(^).
Show that if kn+1 : = 2yfTj(l + kn) and sin(2<£„+1 - <f>n) : = /:„ sin <£„,
then
c) F(<£„+1,/;„+1) = |(l + *„W„,/;„) kn+1>kn, <£„+1^„
and
d) J**,, k0) = ( fi -^) log tan (f + I lim *„) .
e) Establish the quadratic convergence.
Similar methods for calculating incomplete second and third integrals
may be found in King [24]. (See also Section 2.7.)
6. Prove Euler's addition theorem. Let g(x) : = (1 - x2)(l — k2x2). Then
Jo VgW+Jo V^W J° Vg(x)
where c : = [by/g(a) + ay/g(b)] /V1 - k2a2b2. This result, which dates
from 1753, is, according to Birkhoff [73], the "first notable theorem
about elliptic integrals."
1.5 JACOBVS DIFFERENTIAL EQUATION AND A FOURTH PROOF
OF THE FUNDAMENTAL LIMIT THEOREM
The second-order linear differential equation (1.3.8) satisfied by j&Tand K' is
(*3 - k)^2 +(3k2 -1) ^ + ky = 0 .
Equivalently
G(k):=kV2k'K(k) and G*(k):= kll2k'K'(k)
satisfy
1.5 Fourth Proof of the Fundamental Limit Theorem 19
d*y 1 /i + £2\2
Also, the functional relations [(Theorem 1.2(a) and Exercise la) of Section
1.4] become
and ,
rTTTT /2Vk\
(1.5.3) G*(k) := V2(l + Qyjj^ G*[TTk)-
If we set g(fc) : = 2Vk/(l + k), these become
(1.5.4) <Kk) = £^<K&))
and
(1.5.5) G*(k) = {j^G*(g(k))
2_
8(k)
Theorem 1.4
Suppose that /, g, and a are all in C2(0,1) and that g maps [0,1] into [0,1],
If, for x G (0,1),
. d2f(x) .
(«) -^2 = «(*)/(*)
and
(fe) /(*) = V5a /(#(*)) c a constant
then
(c) a(x) = (g(x)Ta(g(x))-
1
*(*) 2 \g(x)
\g(x)J J
Proo/ Use (b) to change variables in (a). Then, on suppressing x,
2£
(1.5.6) «/ = /= (2pg +P8) ^f+pf(8) + 82P ^fi
where p '- = y/clg (remember that a is the derivative with respect to x). Also
from (a) and (b),
(1.5.7) f=af=apf{g).
Substituting (1.5.7) into (1.5.6) to eliminate / yields
20 The Arithmetic-Geometric Mean Iteration
dg2 g2p dg \ g2p IJK6)
which on comparison with (a) gives
ap-p ,
•2„ =<*(g)
8 P
since Ipg + pg = 0. Finally we compute p, p and p in terms of g to get
(c). D
The bracketed quantity on the right of (c) is often called the Schwarz
derivative of g.
Corollary 1.1 (Jacobi's Differential Equation)
Suppose G(x)'- = xll2x'K(x). Up and g are algebraic functions that map
[0,1] into [0,1] and
G(x)=p(x)G(g(x))
then g satisfies an algebraic differential equation
1 T a(r\ ^ f S(y\^2
r(x) = [g(x)fr(g(x))
where
M-1( MX
g(x) 2 \g(x)I
... 1 (l + x2\2
r{x) -=-^1^72)
Proof. We may assume that G(x) does not satisfy any equation of the
form
G(x) = p(x) G(x)
with )8 algebraic since by (1.3.13), (1.3.10), and (1.3.11) GIG has a
logarithmic singularity at 1. Thus since G satisfies equation (1.5.1), as in
(1.5.6) we must have
2pg+pg = 0
or
[c
P <S
The result now follows from Theorem 1.4 applied to (1.5.1). □
1.5 Fourth Proof of the Fundamental Limit Theorem 21
This corollary has a converse which provides an algorithmic check on
whether g is an algebraic transform of K. (See Exercise 1.) Theorem 1.4 has
a partial converse.
Theorem 1.5
ippose that j
Suppose that
Suppose that/, g, and a are all in C2(0,1) and that g maps [0,1] into [0,1].
2 . ... 1
(a) a(x) = (g(x)ya(g(x))-
8(x)
M)2]
g(x)J J
2 Lg(*) 2 \g(*)
and
(b) ^ = «(*)/(*).
Then
(<0 \/?/U(*))
is also a solution of (b).
Proof. This is essentially the computation of (1.5.8). Replacing / by
yfcig /(g) leaves (6) invariant provided (a) holds. □
We now offer a proof of Theorem 1.1 based on Theorem 1.5.
Fourth proof of the fundamental limit theorem. For
r \. l (1±jl\2 a t \- 2V*
aW-=~i?il^7i and ^-=1^
condition (a) of Theorem 1.5 holds. This is a tedious though (eventually)
entirely rational calculation. Thus if G is a solution of
d2G(x) ,
v2 =a(s)G(s)
a*
then so is
>SS <***»■
f2gW
However, the above second-order linear differential equation has a regular
singular point at zero and has fundamental solutions of the form
22
The Arithmetic-Geometric Mean Iteration
Vxgt(x)
and
Vxg^x) log x + Vxg2(x)
where gt and g2 are analytic and nonzero. Now observe that
is analytic and nonzero in a neighbourhood of zero, and hence G(x) must be
a constant multiple of the first fundamental solution. Moreover,
_L /ZL
Vx V 26(x
2g(x)
G(g(x))
cannot have a logarithmic singularity at zero, and hence (2g(x)y1/2G(g(x))
must also be a constant multiple of the first fundamental solution. In
particular for some c,
G(x) = cy[^~G(g(x)).
If we multiply by yg(x)7x and take the limit as x—>0, we deduce that c = 1.
We have thus proven that G satisfies equation (1.5.4). Equation (1.5.4)
transforms into
and we have shown that K, defined as an analytic solution of the differential
equation (1.3.8), satisfies the above functional relation, and hence we have
found an analytic invariant for the Legendre form of the AGM. [See
(1.1.10).] □
Comments and Exercises
A form of Jacobi's highly nonlinear differential equation may be found in
Cayley [1895], which is an excellent account of nineteenth-century elliptic
function theory with particular emphasis on the transformation theory.
The general question of when a functional relation
f(x) = p(x)f(g(x))
has a closed-form solution in terms of familiar functions is difficult. We shall
consider it again later. Theorems 1.4 and 1.5 do, however, suggest how one
1.6 Legeudre's Relation
23
might proceed to check whether a solution of the above functional relation
satisfies a second-order differential equation with rational coefficients. (See
Exercises 2 and 3.)
1. Suppose that g(x) is an algebraic function, g maps [0,1] into [0,1], and
g(0) = 0. Suppose that g satisfies the algebraic differential equation of
Corollary 1.1. Show, as the fourth proof of Theorem 1.1, that there
exists a constant c so that
G(x) = 4^-G{g{x))
2. Suppose that
F« = ^F^*»
and that
d2F(x) , , ^ .
where F{x)IVx is analytic in a neighbourhood of zero, r is a rational
function, and g(x) '■ = 2Vxl{\ + x). Without identifying F explicitly,
show that r has double poles at 0 and 1. (Considerations of this nature
can turn the fourth proof into less of a verification.)
Suppose that a, p, and g mapping [0,1] into [0,1] are algebraic, that
4-£$- = a(x)f(x) x G (0,1)
and that
dx>
f(x) = p(x)f(g(x)) * £(0,1)
Suppose also that / does not satisfy a linear differential equation with
algebraic coefficients of order less than 3. Show that
g(x) = ~^2 and P(x) = <cx + df ■
1.6 LEGENDRE'S RELATION
The four quantities K, E, K', and E' are related by a remarkable relation.
24 The Arithmetic-Geometric Mean Iteration
Theorem 1.6 (Legendre's Relation)
For 0<fc<l,
E(k)K'(k) + E'(k)K(k) - K(k)K'(k) = | .
Proof. Let G : = A^'ViC and G* := kU2k'K', as in Section 1.5. Then,
by (1.5.1),
G = G^
G G*
and hence there is a constant c so that
(1.6.1) GG*-G*G = c.
In other words, the Wronskian of G and G* is constant. On writing G and
G* in terms of K and £' this becomes
(1.6.2) k{\ - k2)(KK' - KK') = c .
We now use equation (1.3.13),
dK = E-(l-k2)K
dk k(l - k2)
to eliminate the derivatives in (1.6.2) and deduce that
(1.6.3) EK' + E'K-KK' = c.
It remains to evaluate the constant c. This may be done directly from the
series expansion at zero (see Section 1.3, Exercise 5), or by using Exercise 4
of Section 1.3. □
We define the gamma and beta functions by
/* CO
(1.6.4) rW:=j0 e~'f~ldt re(*)>0
(1.6.5) p(x, y) := jQ f~\\ - 0'"1 dt re(«), re(y) > 0 .
The function T satisfies the functional relation
(1.6.6) r(*)r(i - *) = "
sin TTX
1.6 Legendre's Relation 25
The relationship between T and /3 is
These standard results will be discussed further in Section 3.6. Our present
objective is to evaluate K(l/V2) and £(1/a/2).
Theorem 1.7
«-■ T'(i)
<*)
and
V2/ 4V^?
. , 4r2(|) + r2(i)
<£)-
a/2/ 8V¥
Proo/.
^(V2/ Jo a/7T"Z
dt
V(i-*2)(i-i*2)
V(l-^2)(2-r2)
The change of variables x2 '■= t2/(2— t ) gives
*
a/2/ Jo a/J~
t
The above arclemniscate (giving the arclength of a lemniscate) can be
evaluated in terms of /3. We set u ■ = t4 and see that
I 1 \ V2 f 1/4-1/! sl/2-1 ..
V2 /i i\ a/2 r(j)r(i)
a/2 /i i\
= T Ki« 2)
4 r(3)
Finally, by (1.6.6),
r(|)^ ana r(|) = ^
The evaluation of £(1/a/2) is left as Exercise 1. □
26
The Arithmetic-Geometric Mean Iteration
Since £(1 A/2) = £'(1 A/2) and £(1 A/2) = £'(1 A/2), the evaluation of
K(l A/2) and £(1 A/2) gives an alternate route to evaluating the constant in
Legendre's identity that avoids developing the asymptotic expansion of K'
at zero.
Comments and Exercises
We observe that at 1A/2 Legendre's identity reduces to
*(£)M£M£)
2L
2
or
2£(lA/2~) p£(lA/2) £(1 A/2)
IT \- IT IT
Now KIit and £/77 can be calculated quadratically by Algorithm 1.2 using
only the operations of addition, multiplication, division, and square-root
extraction and commencing with a rational starting value. This provides an
excellent approach to calculating v—an approach we will pursue in some
detail in Chapters 2 and 5.
Legendre derives the constant in (1.6.3) by evaluating K((V3± l)A/8).
(See Whittaker and Watson [27] and Exercise 6.) Note that (V3-l)A/8
and (V3 + l)A/8 are complementary. It transpires that
(4^.
EL ^-
K \ V
Values of k for which K'IK is a rational surd are of considerable interest and
importance. (See Chapter 4.) Such k are called singular values, the simplest
of which is 1A/2, where
-(—) =
K \V2/
1. Show that
Start by writing
«&)-
2fl\,r2fV
1 \ 4^(3) +r(i)
V2/ 8V7F
VV2/ Jo vrr^
Then set u •'= Vl — f2 and show that
1.6 Legendre's Relation 27
^)=711 vrb^"4"^ vrb^"
Show that
KvlMvl)-^
Show that
* 1 •■"
2 , V?T
Show first that
f{X):={\oe~'2dt) and ^):=^-^
V+i)
satisfy /' + g' = 0 and f + g = it I A.
b) Use a) to show that Y{\) = Vtt .
Use Theorem 1.2 to deduce that
^(V2-1) = V2.
[In fact K(V2- 1) = (V2> 1)^(1)1^(1)/213^172.] Thus V2- 1 is a
second singular value. We return to both singular values and T function
evaluations of K subsequently.
Establish Legendre's relation directly from (1.3.12) and (1.3.13) applied
to E, E', K, and K'. Use Exercise 4 of Section 1.3 to determine the
constant.
From the general theory of the Weierstrass function developed in the
next section, or more directly, one can show (Whittaker and Watson
[27, p. 516]) that for a, b >0,
J-" \l(n2 - Y2\(h2 4- y2\ Je
dt
yj{a2-x2){b2 + x2) •>' y4t3-g2t-g3
where e is the real root of the above cubic and
g2:=h(a2-b2)2-(abf
g3-=-ie (a2- b2)[(a2 - b2)2 + 36(afe)2] .
a) If g2 = 0, then
a2 + b2 = 2V3|2g3|1/3 a2 - b2 = -3(2g3)113
28 The Arithmetic-Geometric Mean Iteration
b) If g2 - 0 and g3 > 0, then
•^ f" dt - 2 K( a )
■^ r dt = 2 K( b )
J-V^+ft V7TV2 ^VT+T2''
c) Let e: = g3 ■ = \. Then a + b2 = 2V5, and bi) gives
Also, bii) gives
2X3-^'= f _*_= p-^L.+ r *
J-i VFTT J° Vi- *3 ^°
VT+
r
Now substitution of s : = 1 /f and s-=(1 + t ) , respectively, leads
to
2X3-1/4K' = !
>(|.i)-"(|.§)]-^"(i.i)--
d) Thus when k := sin(ir/12) = (V5 - 1)/2V2
K'(k) = V3K(k)
and
r,/V3-i\_,_1/4 r(|)r(|) 31/4r(i)3
*\ 2V2 I 4V¥ 27/3tt '
This evaluates K in terms of T functions and shows that sin(7j712) is
a third singular value.
1.7 ELLIPTIC FUNCTIONS
The study of elliptic functions began in the 1820s when Abel and Jacobi
independently discovered that the inverses of elliptic integrals are doubly
periodic functions. As noted earlier, they had been anticipated by Gauss
who at the turn of the century had studied a particular elliptic function, the
lemniscate sine. This work, however, had not been published. This was one
of the critical discoveries of the era and was vital to the concomitant
development of complex analysis and the later development of modular and
automorphic functions.
1.7 Elliptic Functions
29
Except for minimal application in Sections 2.6 and 2.7, we have no great
need for this body of material and offer only a brief sketch of this attractive
theory, primarily in the exercises at the end of this section. Details may be
found in Whittaker and Watson [27] and, more recently, in Bowman [53],
Du Val [73], Eagle [58], and Lang [73].
An elliptic function is a meromorphic function with two periods wt and
w2, with imiwjwj^0. That is, for all z, f(z)=f(z + w^^ftz + w2). We
assume that wt and w2 are minimal in the sense that they are not multiples
of any smaller period. The function / is completely determined by its
behaviour on any parallelogram that is a translate of the parallelogram with
vertices 0, wx, w2, and wt + w2. Any such parallelogram is called
fundamental. Associated with the periods wx and w2 there is the lattice L := {nw1 +
mw2\m, n integers}. It transpires that given any lattice L there exist elliptic
functions with the appropriate associated periods. However, there are not
many such functions. As we shall indicate in the exercises, any two such
functions are connected by an algebraic equation. The circular functions
may be thought of as degenerate elliptic functions (one of the periods is
infinite), as may rational functions (both periods infinite).
A function / is said to have an algebraic addition theorem if there exists a
polynomial il in three variables with complex coefficients so that for
x,yEC,
il(f(x),f(y),f(x + y)) = 0.
One of the many remarkable facts, due in part to Weierstrass, is that a
meromorphic function has an algebraic addition theorem if and only if it is
elliptic or degenerate elliptic. (See Exercises 11, 12 and 13 or Hancock [09].)
The Jacobian elliptic function sn is defined by
(1.7.1) u = [
sn(u,k) dt
V(l - t2)(l - k2t2)
The value k is thought of as a parameter and is often omitted. It is true,
though not obvious, that sn(w) is a meromorphic function with periods AK
and 2iK' and, hence, is elliptic. (See Exercise 2 of Section 2.7.) The two
other basic Jacobian elliptic functions are en and dn, defined by
(1.7.2) „=/;
cn(u,k) ^
V(i - t2)(k'2 + k2t2)
(1.7.3) « = Ji
and
•dn(u,k) df
y/(i-t2)(t2-k'2)'
Note that sin and tanh are limiting cases
30
The Arithmetic-Geometric Mean Iteration
sn(w, 0) = sin u
and
sn(w, 1) = tanh u
A "half-angle" formula for sn is
.2/ i
(1.7.4) snzl - = -====== .
■ ' \o) . + yi - A:2 sn2(w)
Again this is not obvious. It is worth observing that a half-angle formula is a
specialization (x = v) of an algebraic addition theorem. These matters are
pursued further in Sections 2.6 and 2.7. In particular, Exercise 6 of Section
2.7 presents an addition formula for sn.
Comments and Exercises
The following exercises develop some of the elementary theory of elliptic
functions. The approach is via the Weierstrass p function and is standard.
See, for example, Erdelyi et al. [53], Du Val [73], or Lang [73].
1. Show that the elliptic functions (with respect to a fixed lattice L) form
a field that is closed under differentiation.
2. Show that an entire elliptic function is constant.
3. Let/be elliptic with respect to L. Assume/is nonconstant. Let P be a
fundamental parallelogram whose boundary B contains no zeros or
poles of/.
a) Show that the sum of the residues of / in P is 0 by using the
periodicity of/to show that jB /=0. Thus any nontrivial elliptic
function has at least two poles in P. The number of poles in P,
counted according to multiplicity, is called the order of /
b) Show that / has the same number of zeros as poles in P counted
according to multiplicity. Hint: Consider $B flf and observe that
///is elliptic.
c) Show that / assumes every complex value exactly order of / times
in P.
4. Show that the Weierstrass function
p(z) :=-2+ 2
Z weL'
1 1
(z - wf w2
is meromorphic, even, and has double poles at each lattice point
[L' := L -(0,0)].
5. a) Show that
1.7 Elliptic Functions
31
p(z) = -2 2 7 ~3 ■
Show that p is an odd elliptic function of order 3.
b) Use a) and the fact that p is even to deduce that p is elliptic.
Observe that p has order 2.
Show that every even elliptic function / is a rational function of p.
Hint: Observe that if z is a zero of /in P (a fundamental
parallelogram) then so is — z mod L. (By — z mod L we mean the unique point in
P, that is equivalent to — z with respect to L; that is, there exist m and
n so that mw^ + nw2 — z = —z mod L and - z mod L G P.) Observe
that there are exactly four points in P where z =
— z mod L (0, wxl2, w2/2, (vvt + w2)/2) and that if/has a zero at one
of these four points, it must have even multiplicity. Consider
g(z):=U[p(z)-p(zi)f
where the product is extended over the zeros of /, choosing only one
representative from each pair (z,-zmodL) and where 8i is the
multiplicity of the zero (except in the case zt = -z,modL, in which
case 8i is half the multiplicity). Now shqw that / and g have the same
zeros with the same multiplicities. Treat the poles similarly.
Show that any elliptic function / is a rational function of p and p.
Hint: Consider
.ffr,-_Az) + A-z) , ([/(*)-/(-*)]/2)/K*)
A) 2 + p(z)
which decomposes / into an even elliptic function and an even elliptic
function divided by p.
Show that there exist constants g2 and g3 so that
[p(z)f = 4p\z)-g2p(z)-g3.
Hint: Consider the order of the pole of p2 — 4p3 at zero. Observe that
g2 and g3 depend on L. Different lattices lead to uniformizations
(parametrizations) of different cubics.
Show that any two nonconstant elliptic functions / and g (with respect
to the same lattice) satisfy an algebraic equation
ft(/,*) = 0
where il is a polynomial in two variables with complex coefficients.
Suppose that f is meromorphic and singly periodic with period w.
32
The Arithmetic-Geometric Mean Iteration
11.
Suppose that / assumes no value infinitely many times in any period
strip. Show that/is a rational function of el7rlzlw. (One can use this as
a definition of the class of trigonometric or circular functions.)
Prove that the function p satisfies the algebraic addition theorem
tK " 4lp(x)-p(y)
Hint: Show that, as a function of x,
-p(x)-p(y)
12.
13.
'Kx)-p(y)
p(x)-p(y)
-p(x)-p(x + y)
has no singularities at the points 0, ±y and, hence, is independent of x.
With Exercise 8, this is an addition theorem.
Show that every elliptic function/ has an algebraic addition theorem.
Hint: Connect f to p algebraically and use the addition theorem for p.
Show that if / meromorphic and has an algebraic addition theorem SI,
then either
a) / is a rational function,
b) / is a trigonometric function, or
c) / is an elliptic function.
Hint: Case a) occurs if / has a pole at infinity. So suppose to the
contrary that infinity is an essential singularity. Now for some w,
f(z) = w has infinitely many solutions z1; z2,. ... Choose z* so that
z*, z* + zx, z* + z2,.. . are not poles of/. Show that there exist m and
n, both less than a constant depending only on SI, so that/(z* + zn) =
f(z* + zm). Show that in any neighbourhood there exist infinitely many
points and some n and m so that f(z + zn)=f(z + zm). Thus / has a
period zn — zm. Case b) occurs if/has only one period, in which case/
can assume any value only finitely many times within any period strip.
In that case Exercise 10 applies. Otherwise we have case c).
Chapter Two
Theta Functions and the
Arithmetic-Geometric Mean
Iteration
Abstract. In this chapter we solve the AGM iteration in theta function terms
and derive a variety of useful properties of theta functions. The central tool is
the Poisson summation formula. These results are then applied to produce
quadratically convergent products for it and e* and quadratically convergent
sums for v.
We finish the chapter by discussing the Landen transform and the
relationship between theta and elliptic functions.
2.1 A THETA SERIES SOLUTION TO THE AGM
The basic theta functions are defined for | q\ < 1 by
CO
(2.1.1) 62(q):= 2 q(n+U2)2 8,(0) = 0.
n = — co
CO
(2.1.2) fl,(s):= 2 q"2 <%(0) = 1.
n=— co
CO
(2.1.3) 84(q):= 2 (-1)"/ 04(0) = 1.
These series are more properly viewed as functions of two complex
variables one of which is presently set to zero. (See Section 2.6.) Theta
functions have various number theoretic connections. Note that 03 is a
33
34 The Arithmetic-Geometric Mean Iteration
generating function for squares, while as 64(q) = 0j(-?), % also considers
the parity of the square. Thus
(2.1.4) 0,(4) + 0,(4) = 2 2 qn=283(q4).
n even
Also
CO CO
(2.1.5) e23(q) = 2 r2(n)q" 824(q) = 2 (-l)nr2(ri)q»
n=0 n=0
where r2(n) counts the number of ways of writing n = j + k . Here we
distinguish sign and permutation [so that, for example, r2(5) = 8 since
(±2)2 + (±1)2 = (±1)2 + (±2)2] and set r2(0) := 1. Now it is elementary
that r2(2ri) = r2(n). (See Exercise 2.) It follows that
CO
(2.1.6) 823(q) + 824{q) = 2 2 r2{2n)q2n = 2823{q2) .
n = 0
Also, (2.1.4) and (2.1.6) allow us to solve for 83(q)84(q). We have
%{q)Mq) = \ [<%(*) + o4(q)]2 - \ [e23(q) + e24(q)]
= 2823(q4)-823(q2) = 82(q2).
Thus
(2-1-70 2 = *3(« )
(2.1.7b) V02(?)04(?) = 04(?2)
which bears an obvious resemblance to the AGM. Similarly,
CO CO
0l(q) ~ 023(q2) = 2 r2(n)q" - 2 r2(2n)q2n
CO
= 2 r2(2n + l)q2n+1 .
n = 0
This last term may be rewritten as
2 r2(2n + l)q2"+l = 2 ^
n = 0 k,m——<*>
k+m odd
which, on setting k = i — j and w = i + / + 1, gives
2.1 A Theta Series Solution to the AGM 35
2 (q2f+1'2)2+U+1'2)2 = e22(q2).
Hence
(2.1.8) e\{q2) + e22{q2) = e\{q).
This combines with (2.1.7/) to produce
(2.1.9) el(q2)-el(q2)=e24(q)
and these last two and (2.1.7//) yield Jacobi's identity
(2.1.10) e\{q) = d\(q) + 8\{q).
Now set k:=k(q):=822(q)/el(q). Then (2.1.10) shows that k' =
#4(9)/03(9)- K we return to (2.1.7) and set an : = 823(q2") and bn := 824(q2")
we observe that an and bn satisfy the AGM iteration. Moreover, since
03(O) = 1, the limit is 1. Thus
(2.1.11) M(8l(q),824(q)) = l.
We recast these last observations in:
Theorem 2.1
Let 0 < k < 1 be given. The AGM satisfies
(2.1.12) M(l,k') = 8;2(q) for k'= 024(q)/823(q)
where q is the unique solution in (0,1) to /: = 8\{q)l8\{q). In particular,
(2.1.13) K{k)=\e\{q).
Proof. This follows from the previous discussion and Theorem 1.1. The
uniqueness of q will be obvious from the results of Section 3.1, which will
show that 84 decreases and 03 increases on (0,1). □
The results of Theorem 2.1 remain true more generally in the complex
plane. This is discussed in Exercise 4.
Comments and Exercises
The solution of the AGM in theta function terms can be found in Gauss
36 The Arithmetic-Geometric Mean Iteration
[1866]. The systematic investigation of theta functions in the context of
elliptic function theory originates in Jacobi's masterpiece Fundamenta Nova
Theoriae [1829].
While we have given a number-theoretically motivated development, it is
possible to give a very elegant formal verification. (See Exercise 1.) This
technique, used frequently by Liouville, is discussed in detail in Bell [27].
1. a) Let /: Z X Z-> R. Establish the formal identity
CO CO CO
2 f(m,n)= T f(l + k,l-k)+ 2 f(l + k,l-k-l)
(2.1.14)
valid whenever both terms on the right-hand side converge.
b) Apply (2.1.14) to qm2+"\ (-l)m+nqm2+n\ and (-l)mqm2+"2 to
derive (2.1.8), (2.1.9), and (2.1.7«) respectively.
c) Hence rederive (2.1.7) and (2.1.10).
2. Prove that r2(n) = r2(2n). Hint: 2a2 + 2b2 = (a + b)2 + (a - b)2.
3. a) Show that the downward transformation [Theorem 1.2(b)] sends
k(q) to \(q) := k(q2). Precisely,
,„,« 1-k' _82(q)-82(q) _022(q2)
(2-L15) 1 + *' el(q) + el(q) 023(q2)-k-
b) Show that the corresponding transformation for K [Theorem
1.2(6)] is
(2.1.16) 83{q )- -
0l(q)
2
4. Show that (2.1.12) is valid for complex q with \q\ <1. Precisely,
for such q. [That 83(q) does not vanish will be apparent from Section
3.1.]
2.2 POISSON SUMMATION
A most analytically accessible route to the behaviour of the AGM lies in the
Poisson summation formula, which we now describe. We then give some
examples of its use before returning in the next section to its relationship
with the AGM. The formula we need is:
2.2 Poisson Summation 37
Theorem 2.2
Let / be a nonnegative function, increasing on (-oo, 0] and decreasing on
[0,0°). Assume that JT„ f(x) dx exists as an improper Riemann integral.
Then, for each x in 05,
-j.mKt j,
e at
(2.2.!) ± f(n + x+)+f(n + x-) = £ ^ P ^
n=—oo 2 &=-°o J —oo
each series being absolutely convergent.
Proof. A complete proof may be found in Apostol [74, pp. 332-333]. In
essence one considers the function F(x) •'= S^=_eo /(n + x), which under the
given hypotheses is of bounded variation on compact intervals, and which is,
by construction, periodic. The left-hand side of (2.2.1) is merely the average
[F(x+) + F(x-)]/2, while the right-hand side is obtained by computing the
Fourier coefficients for F and regrouping. □
example 2.1. We apply the formula to / given by
The right-hand side becomes
- ei«ikx ^ 1 . ycos(27rfcc) + (2irk)*ui(2irkx)
fc~. y + 277//: ~y + tx y2 + (2irk)2
and the left-hand side becomes
2 e-*"+*> + fJ
„>_* w otherwise
If x'■=(), we derive
2V-1 y+2yiiy2 + (2^k)2
which yields
1 °° 1
(2.2.2) 77 coth(77*) = - + 2x 2 ^-
„=1 ;*;' + n2
on replacing y by 277*.
If x'-= I, we derive
1 °° (—IV
(2.2.3) 77 cosech (77*) = - + 2x 2 2 :
^ , 2
38 The Arithmetic-Geometric Mean Iteration
By elementary analyticity considerations, (2.2.2) and (2.2.3) remain valid in
the complex plane and produce the classical formulae for cot, cosec, and so
on. (See Exercise 1.)
example 2.2. This time apply the formula with
f{x):=e-sx2" s>0.
Then (2.2.1) becomes
CO 00 /* CO
n=— eo k — ~ <»' J —oo
Now the integral on the right is
2 j\-'2 cos (2Trkt)dt=^F(^k)
where
f 2 A/77" 2
(2.2.4) F(y) : = I e~x cos (2xy) rfx = — e~y .
(See Exercise 2.) Thus we deduce
(2.2.5) i ^-4-^ = ,-172 2 e2'***"
lirikx — irh2/s
r
Again, analyticity considerations show that (2.2.5) holds for re(s) >0. This
is a general form of the theta transformation formula. For future reference
we make the notational agreement that Bt{s) ■= O^e'™) and observe that for
x '■= 0, (2.2.5) can be written as
(2.2.6) ^8,(1) = 8,(1-1).
Note that for large s the sum Vsfl,(s) converges much more rapidly than
03^-1). For example, if $:=100, 03(O.O1) and 10 + 20e~100,r coincide
through more than 500 digits.
Comments and Exercises
The result of (2.2.1) was known to Poisson by 1827. With x :=0 he had
obtained the formula in 1823. Equation (2.2.5) was first obtained by Jacobi
using elliptic function theory in 1828.
2.2 Poisson Summation 39
Establish the formulae
' sin'--^ 2 2
b) * = 2 (-l)"+i(2n + l)
sin(7rz) z „=1 z2-n
\n + l
cos(ttz) ^o z2-(n+§)2
j. / s 1 v1 2z
C) 77 COt (TTZ) =- + 2, "2 2
z „=i z - n
d) sin (77 z) =7rzll(l 2 ) •
«=i ^ n I
Establish (2.2.4) by showing that
F(y) + 2yF(y) = 0
and using F(0) = Vtt/2. (See Exercise 3 of Section 1.6.)
Let I(s, y) := jr. e-*"2-2*" dt. Evaluate I(s, y) for real s, y by
completing the square. By analytic continuation I(s, ik) = s~112 e"""*/s.
Recall that the Laplace transform is defined by
~y'f{t)dt.
F(y):=\0
Provided that f(t)= 0(ebt) as f->°°, F will be analytic for re(;y)>6.
Show that the Laplace transform of (2.2.5) with * := 0 produces (2.2.2).
This entails evaluating integrals of the form
/* CO
F(a,b):=\o
-[a2t+b2lt] .-1/2
r1'2 dt,
This is a special case of a Bessel function transform which can be
evaluated explicitly as F(a, b) = {VJfla) e~2ba by substituting s '■= t and
v'= bis — as. (Various extensions and related matters are discussed in
Bellman [61].) In principle, therefore, one can derive (2.2.5) as an
inverse Laplace transform of the derivative of the product form of sin
given in Exercise Id).
Let / be nonnegative, continuous, decreasing, and Riemann integrable
on [0, oo). Let
S(y) := V- J0 /(*) c°s (xy) dx .
Show that
40
Va
The Arithmetic-Geometric Mean Iteration
1 °° t r 1 °°
~ /(0) + 2 /(ma) = Vfi ~ g(0) + 2 g(n/?)
»1 = 1 J ^-^ n = l
whenever a, /3 >0 satisfy a/3 = 2tt. Deduce that
Va
\+i e-a2-2/2'
•2 OT=i
(Exercises 2 and 5 follow Apostol [74].)
2.3 POISSON SUMMATION AND THE AGM
We commence by specializing (2.2.5). Setting x'-=0 produces
(2.3.1) ya%(e-") = «,(«-")
while a; : = § gives
(2.3.2) V5%(e~") = 04(e-,r")
and dually on setting s = s_1,
(2.3.3) Vs84(e'^) = ^(e-^)
where re(s)>0. On dividing (2.3.2) by (2.3.1) we have
(2.3.4) k(e~") = k'(e~'u).
From Theorem 2.1 we see that
M(l,k) = e;\e~'u) q:=e~™.
Thus
M(l,k') _ eKe-*")
(2.3.5)
IT
M(l,k) «&"")
tts= -log q
on using Theorem 2.1 and (2.3.1) again. This produces the following
fundamental theorem.
Theorem 2.3
For all k in (0,1),
(«)
and so
2.3 Poisson Summation and the AGM
M(l,k') , e\{q)
" M(l,k)=-l°^ k=0l(q) k =
el(q)
41
K'(k)
Proof. We have established (a) and (b) follows from our identification
of M(l, k') and ir/(2.Kik)). This second equation is often written as q =
e-*K ik an(j ^ jg caj^e(j tjje nome associated with k. In principle it solves the
inversion problem for q in terms of k. □
We know that k= 0\iq) 10\iq) = ^Vq~ + Oiq) as q tends to zero from
above. Theorem 2.3(a) and this information show that since M(i, k') tends
to 1 as k tends to zero,
(2.3.6) lim
k-*0+
2M(1
jtrlogQ)
:0
which reproduces the asymptotic of Exercise 4 of Section 1.3. As we shall
see (Exercise 1), we can derive the exact asymptotic (1.3.10) from these
considerations.
Now consider the AGM iteration commencing with a0: = 1 and b0 '■ = k'.
Then as we saw in Section 1, bjan = k'iq2 ) and cjan = kiq2"). Hence
(,3.7, ^-^)-1¾.
In Exercise 2 one establishes that the differential identity
(2.3.8) 2-"b;2dlog(^) = b-0>dlog(^)
holds. It follows from (2.3.5) and (2.3.7) that
and from (2.3.8),
''n"0 ...
while ir0=-T-r-,.
bn^0 . -, 2 dfc
~bj WMe ff«=-* A"
Since 6„ tends to Mil, k'), we have established that
42
(2.3.9)
The Arithmetic-Geometric Mean Iteration
dk
ds
tt kk'2
2 M(l,k'y if
2 kk'2K2
and since , = — it log q,
(2.3.10)
dk
kk'2
2kk'2K2
dq lq M(l,k')2 qir2
Rewriting k, k', and K in (2.3.9) in theta terms produces
03 e2 4 4"
(2.3.11) f - -t = T 01. (w.r.t.,)
Differentiation of (2.3.1) and (2.3.3) yields
(2.3.12)
s%(s) + - 6,(8) = -s-l'%(s-1) (w.r.t.,)
(2.3.13)
,26>4(,) + - 04(,) = -1-11%^1). (w.r.t.,)
On using (2.3.1) and (2.3.2) again we deduce that
(2.3.14) , f (,) + ,- | (,-1) = - \ = , £ (,) + ,-1 £ (,-) .
Now, since s20l(s) = 6\(s-1), this shows that
(2.3.15)
fy _ ^ = 5 «4
»4 03 4 °2
(w.r.t.,)
is equivalent to (2.3.11). Finally, adding (2.3.11) and (2.3.15) gives
(2.3.16)
^ _ A - E a*
e4 e2 4°*
(w.r.t.,)
on using (2.1.10).
We can now also express E in terms of theta functions. We have from
(1.3.13),
E- K=kk"
dK
dk
kK=~K
1 dK . k2K2
2K ds
+
because of (2.3.9). Hence
*Differentiation with respect to s.
2.3 Poisson Summation and the AGM
43
and
(2.3.17)
Similarly
and
(2.3.18)
E-K-l
'83 77
U+4
E=K~^i-
77 TTK' e2
E K ~ 2K + k2 e2
E' = — +
ttK' e4
V2 ft
+ \ »2J (w.r.t.s)
(w.r.t.s)
(w.r.t.s)
(w.r.t.s)
Comments and Exercises
1. a) Observe that k = ^Vqfiq) with / analytic for |q\ < 1 and /(0) = 1.
Standard real reversion arguments show that for 0<k<l,
Vq=\g{k2)
where g is (real) analytic with g(0) = 1. Show that
(2.3.19) K'(k) = - K(k)
log (£) + **)]
where h(0) = 0 and h is analytic,
b) Observe that the right-hand side of (1.3.10) and K' both solve
(1.3.8). Moreover, both can be expressed in the form of the
right-hand side of (2.3.19). Deduce that (1.3.10) holds.
2. a) Show that 2~"b~2d log (an/cn) is independent of n.
b) From a) deduce that 2~nc~2dlog(bJan) is independent of n.
c) Use a) and b) to show that 2~na~2d log (bn/cn) is independent of n.
d) Show that all three coincide with b$2d log (a0/c0). This again is due
to Gauss.
3. a) Establish (2.3.18) by using (2.3.16).
b) Observe that (2.3.18) and (2.3.17) immediately give another proof
of Legendre's relation.
4. a) Show that 02(e-,r) = 04(e-,r).
44 The Arithmetic-Geometric Mean Iteration
.2.
b) Show that 477 = (S:__. e~" ')/(£:._. «V" ').
c) Let r^(n) give the number of distinct representations of n as a sum
of k squares. Show that
477 = ( i *rt(«) «—)/( i "/•*(«) «—) .
Xn=0 '/ \„-0 '
M/tf: Take the kth power of (2.3.1) and differentiate.
All of results in this section can be derived without Poisson summation as
the following exercises show.
5. Derive (2.3.7) by combining Exercise le) of Section 1.4 and Exercise
4b) of Section 1.3. This can also be found in Gauss [1866], but not
apparently in later nineteenth-century authors. (See Borwein and Bor-
wein [84a] and King [24].)
6. Use k~Ayfq to rederive Theorem 2.3 from Theorem 1.1.
7. Observe that Theorem 2.3(a) and Theorem 2.1 show (2.3.1) and
(2.3.4). Hence deduce (2.3.2) and (2.3.3). Thus we have established the
0 transformation formulae [(2.3.1) to (2.3.3)] directly from the AGM.
8. Suppose that in (2.3.7) the 4 is omitted. Show that the convergence,
which was quadratic, is now linear. Of course the limit is unchanged.
2.4 THE DERIVED ITERATION AND SOME CONVERGENCE
RESULTS
It is generally the case with AGM related approximations that it is relatively
easy to establish the convergence rate and much harder to determine to
what the given iteration converges. We now consider various preparatory
convergence results. In the next section these will be used to produce two
surprising algorithms for v.
Motivated by the fact that V2K(l/V2)K(l/V2) = tt (Exercise 2 of
Section 1.6), we commence by computing K. If we consider the AGM
sequence with a0 := 1 and b0:= k, it is apparent that a„ and bn viewed as
functions of k converge uniformly and analytically to M(l, k). It follows that
the derived iterations d„ and bn converge to M(l, k). Since M(l, k) =
irl2K'{k), we see that
(2.4.!) Up,*)-! *|gl.
on noting that (dK'/dk)(k) = -(k/k')(dK/dk)(k'). Equivalent^,
(2-4.2) *(*')- o ,, „2.
77 k[ M(l, k)
2 k M\l,k)
Now the derived iteration is
2.4 The Derived Iteration and Some Convergence Results 45
(2.4.3) an+1 - —2— 6»+i 2 '
Since bjan converges quadratically to 1, it is easy to show directly that dn
and bn converge quadratically to M(l, k) for 0<k<l. Moreover bn
decreases and dn increases, at least eventually (Exercise 1). For our purposes
we will generally consider the Legendre forms xn ■= ajbn and yn '■= bnldn.
Then
^.t.tj xn+1 — 2 yn+\ v +1
where x0 ■= k~ , yx •'= VTQ, and y0 is undefined. Moreover,
(2.4.5) 1 < xn+1 <yn+1<Vx;<xn.
(See Exercise 2.) We also have
2V^(1 + V^)2~8^" 1}
and
(y„ - !)(*„ - 1) 2(*n+1 - 1)
y"+1 {yn + l)^x-n + l) yn + l
^Hyn-i)2 + Uxn-if^Uyn-i)2
so that for n s 1
(2.4.6/) *„+1-l<§(*„-l)2
(2.4.6b) ^1-1=^1(^-1)2.
This establishes the quadratic convergence of xn and ynto 1. From (2.4.1),
(2.4.2), and the previous discussion it is now apparent that
,„ ,^ „ ^M3(l,l/V2)
(2.4.7) 77 = 2V2 -T-H 7=1
V ' M(l,l/V2)
and since both M and M are quadratically computable, so is v. In the next
section we turn this identity into an explicit algorithm.
Comments and Exercises
1. Show that dn and bn defined by (2.4.3) converge quadratically to
M(l, k), by showing that dn increases and bn decreases. Then show
bn - dn < (Vx~l - 1) /2. For this to hold for all n, assume x0 ^ 3.
46 The Arithmetic-Geometric Mean Iteration
2. Establish (2.4.4) and (2.4.5).
3. a) Use *„ = e\{ q") I0\( q2") to show that *„ ~ 1 + 8e '^"'H'o1) as
n—»oo.
b) Show that cn^Aane~2'"i<KIK')(-x°i) :=8n, and that cn~8n as
n—>°°.
4. Show that with a0 := 1 and b0'-= k,
£(*') = £ £lim%.
v ' 2 fc «— ft2
Hence deduce that with x„ and yn, as in (2.4.4),
In particular with x0 := V2 and ^j := 21/4,
CO
n
" r2(l) •
5. Show that
«(l, £)- .-»r>(|) and 44) = 2^-^(¾.
2.5 TWO ALGORITHMS FOR it
The systematic use of the derived AGM leads directly to quadratic
algorithms for 77.
Algorithm 2.1
Let x0 := V2, tt0 := 2 + V2, and y1 := 21/4. Define
l + yn_4r2(|)
„ = 1 it„ 77
(0 W-j(v*; + ^;) "a0
<«) W-^+//^ «l
*« + !
(ui) ^:=^^-^ n>l
Then 7r„ decreases monotonically to 7r. Moreover, for n>0,
2.5 Two Algorithms for n 47
(2-5.1) Uyn+i~xn+1)^TTn-7T^l(yn+1-xn+1)
1 2
(2.5.2) 77„ + 1 - 77 =S — (77„ - 77)
and, for n > 2,
(2.5.3) 77„ - 77 < 1(T2"+1 .
Proof. We first establish the limit. This follows from (2.3.7) (see
Exercise 1) or from (2.4.7) as we now indicate. Let 77„ : = 2V262+1an+1/dn+1
where a0:=l and 60:=1/V2. Then by (2.4.7) of the previous section,
77„—> 77. From (2.4.4) we have
^n _ (t>n+i/bn)\an+1/an)
*■„-! an+1/dn
l + x„
l + y„
Moreover, 770 = 2 + V2 and the algorithm converges to 77 as claimed. Since
y„sx„> 1, it is obvious that 77„ decreases. Let us observe that
(2.5.4) y.+r^+i-i^TTTrr^B^""*")
provided that
Next
2V^(l + ;y„) 8
*" 1<2-V3.
y„-l
(2-5-5) T» - «■» + ! - —j ■
/n + 1 T x
Hence
77 77
~ ~TJ (yn + l ~ Xn + l) =^n~ ^n + l ~ ~f (^„+1 ~ *„ + l)
/n+1 ' x ^
and
48 The Arithmetic-Geometric Mean Iteration
Here we have used (2.5.4) and a geometric estimate. From the above
77
7T„ — 77 S ■
(.^n+l Xn+l) ■
yn+i +1
Thus, for n >0, upon checking the early cases,
(2.5.6) | (yn+1 - xn+1) < tt„ - 77 < | (yn+1 - xn+1)
and with (2.5.4),
77„ + 1 - 77 < I (yn+2 ~ Xn+2) S= & (yn + 1 - Xn + 1f < ^ (77„ - 77)2 .
Finally (2.5.3) follows from (2.5.2). □
The first nine iterations give 1, 3, 8, 19, 41, 83, 170, 345 and 694 digits of
77. The 24th will produce more than 45 million digits at the expense of only a
few hundred arithmetic operations. A more exact asymptotic will be derived
in Chapter 5. (See Exercise 5.) Note also that the number of leading zeros
+1 gives the number of digits of agreement between 77„ and 77 to within
1.
The second algorithm, based on an identity of Gauss [1866], was
discovered by Brent [76a] and Salamin [76] independently.
Algorithm 2.2
Let a0 := 1 and b0 := 1/V2. Define
2a»+i
77„ .=
1 _ y" 2kr2
where c„ :=V^ - b2n = c2n_1/4an and a,„ and bn are computed by the AGM
iteration. Then 77n increases monotonically to 77 and satisfies
(2.5.7) 7T-77„<
77 2 e
M2(l,l/V2)
and
2-(n+l)
(2.5.8) 77 - 77„ + 1 ^ — (77 - 77„)
77
2
Proof. This algorithm is based on the use of the second integral E
rather than K. With k:= 1/V2 Legendre's relation is (2E- K)K= tt/2.
Combine this with Algorithm 1.2 of Section 1.4 to derive
2.5 Two Algorithms for *r 49
2M2(1,1/V2)
(2.5.9) tt = ,
which on truncation shows that irn converges to it. We leave the
convergence estimates as Exercises 3 and 4. □
The first eight iterations produce 0, 3, 8, 19, 41, 84, 171 and 344 digits,
which agrees extraordinarily well with the asymptotic.
Both of these algorithms generalize in many ways. (See Chapter 5.) At
the moment we only exhibit two additional identities.
If we use the differential equation for K equation (1.3.13), we may
rewrite Legendre's identity as
(2.5.10) | = kk'\KK' - KK') .
[See equation (1.6.2).] With (2.4.2) we now derive
r?5in 2 ,,,2 K'(k) M(l,k') 2 K'(k') M(l,k)
( ' ir K(k) M\l,k') KK K(k') M\\,k)
and, as in Algorithm 2.1, we can produce
2
77 :
<-'>!In(i^)+(1-*,|§n(i±§)
(2.5.12)
Here x0'= k'~\ x%'=k~1, yx'-=VTQ, yX'-=^fx*Q, and the iterations are
given by (2.4.4). When /r-=l/V2, (2.5.12) reduces exactly to Algorithm
2.1. Also whenever K'{k)IK{k) = Vr for rational r (a singular value of k),
we can in principle find k algebraically, as we will see in Chapter 4. We
already know of four such values of k. (Exercise 7.)
In a similar fashion we may use Algorithm 1.2 to substitute for both E
and E' in Legendre's relation. This as observed in Salamin [76] produces
/,,,« 2M(1, k)M(l, k')
(2-5-13) ™~~1 ^ o«-l/„2 , „*2x
^:..2-^+0
Again, c„ and c* are computed from complementary AGM iterations. When
k:= 1/V2, this identity reduces to (2.5.9).
From (2.5.2) and (2.5.8) we observe explicitly that the corresponding
algorithms converge quadratically. We are primarily interested in the
50
The Arithmetic-Geometric Mean Iteration
cumulative error given by (2.5.3) and (2.5.7). We will say, informally, that
any such algorithm allows for quadratic or fast computation. In similar
fashion, we talk about mth-order computation if the cumulative error (i.e.,
digits correct) is of order m" after n steps.
Comments and Exercises
Algorithm 2.1 is derived in Borwein and Borwein [84a] by the method of
Exercise 1.
1. a) Differentiate (2.3.7) and apply (2.3.8) to establish that with
a0'-= 1 and &„'•= k',
2 (a*a )2
IT = T—T hill
kk'2 «-~ d*an - dna*n
Here a* as usual denotes the AGM iteration commencing with
b0 '■ = k, and all derivatives are with respect to k.
b) Let k ■= k' '■= 1/V2 and observe that the previous formula
reduces to (2.4.7).
a) Fill in the details in (2.5.3) to (2.5.6).
b) Show that
Note that while Algorithm 2.1 relies on evaluating K, Algorithm
2.2 in fact relies on evaluating E. Indeed (1.3.12) and Algorithm
1.2(6) combine to show that
E{k)-~\ (2 2-^(/:)
a) Use (2.3.7) to show that (c„/4a„) increases monotonically
and quadratically to q. Thisgives a quadratic algorithm for e7' on
using a0 := 1 and b0 ■= 1/Vl
b) Show explicitly that, with a0 := 1 and bQ := k',
(2.5.14) e^W^)=16n(^±i)21".
As in Algorithm 2.2, show that
(2" -2-1) ii
2 J- 20n„2
1T„,,C„, , IT 2 C„
+ 1^+1 _ _ _ _ " ^ '-n+l
=S 7T„ + 1 - 7T„ S
1+2
M2(l,l/V2)
Then deduce that
2.5 Two Algorithms for *r 51
r"+1(^~)C"+1 * * ~ ^- * M\l,llV2) 2
n + l 2
Cn + 1
Now use Exercise 3a) to show (2.5.7) and (2.5.8).
5. As in Exercise 4), show that in Algorithm 2.1 irn — v is of order
2" e_,r2" . Convergence proofs of this type can be found in detail in
Salamin [76] and Borwein and Borwein [84a] with discussion of the
asymptotics.
6. Prove (2.5.12).
7. Recall from Section 1.6 (see also Chapter 4) that
ii) K'(V2 - 1) = V2K(V2 - 1)
iv) K'(3 - 2V2) = V4K(3 - 2V2).
Observe that for these values of k and k' (2.5.12) reduces to an
algebraic combination of two infinite products. Equally (2.5.13)
simplifies.
8. Establish the general identity of Gauss, Brent, and Salamin given as
(2.5.13).
9. Use the quartic transformation to produce a quartically convergent
infinite product for tt. (See Exercise 3e) of Section 1.4.)
10. Show that with xn and yn as in Algorithm 2.1,
(V5+2)H*.(^)=r,(!)^
and
(V2 + 2>Sfix-(^)S
V32
11. There are actually eight natural products implicit in (2.4.7). One can
select either bn or an (bn or d„) for each of the means. While
Algorithm 2.1 is the cleanest of these, it is not the best approximation.
a) Show that there are four decreasing and four increasing products.
b) Show that the two best approximations are given by 2V26^/d„ and
2V2V„/6„.
52 The Arithmetic-Geometric Mean Iteration
c) Show that with xn and yn as in Algorithm 2.1,
2xm .WArt.nfr.^-^+W.
m = l l ^ ym m = l *-
+ ymx„
with both products converging monotonically to v.
d) Similarly, produce a decreasing analogue to Algorithm 2.2.
12. The identity (2.5.14) can be found in Gauss [1866]. Show similarly the
following identity due to Jacobi. (See King [24].)
K(k)
(2.5.15)
= log(^) + 3i2-"log(^) a0:=l b0:= k'
2.6 GENERAL THETA FUNCTIONS
Theta functions are more properly considered as a function of two
variables—a parameter q and an analytic variable z. So far we have considered
only special theta functions (z = 0). In this section we sketch some relevant
parts of the general theory. We write
^(z, q) := ^(z, t) := 2q114 sin z - 2q9'4 sin 3z
+ 2q25'4 sin 5z
02(z, ¢) := 02(z, 0 := 2#1/4 cos z + 2#9/4 cos 3z
+ 2?25/4cos5z+---
03(z, ¢) := 03(z, 0 := 1 + 2q cos 2z + 2#4 cos 4z
+ 2q9 cos 6z + • • •
04(z, ¢) := 04(z, t)'-=l-2q cos 2z + 2¾4 cos 4z
— 2q9 cos 6z + • • •
where q = emt and im (t) > 0.
When z = 0, then 0^0, ¢) = 0 and q(0, q) = 8j(q) for j = 2, 3, 4. When
the precise value of q is unimportant, one writes 0;( q) = 0;. and 0^(z, ¢) =
0;(z). When / is unimportant, one writes 0(z). It is straightforward to
establish the following functional identities (Exercise 1):
(2.6.1)
2.6 General Theta Functions 53
e1(z) = -02[z + |) = -iM8,(z + f + y) = -iM84(z + y)
82(z) = M83(z + y) = M84[z + | + y) = ^(z + |)
03(z) = 84{z + |) = M^z + § + y) = M02(z + y)
04(z) = -/M0! (z + y) = /M02(z + | + y) = 03(z + |)
where the multiplier M is given by
(2.6.3) M:=q1,4eiz.
From (2.6.2) or directly one has
81 (z) = - 81 (z + it) 8x (z) = - qe2% (z + Trf)
^(z) = -02(z + 7t) 02(z) = qe2i\{z + 7r0
(2.6.4)
fl,(z) = fl,(z + tt) J,(z)=?e2'z03(z + 7rO
04(z) = 04(z + it) 04(z) = -qe2i\{z + irt) .
These identities show us that any theta function is entirely determined by its
values on any fundamental parallelogram
P(Z0) ■■= {z\z = Z0 +r1TT+ ^77^,02^,^^1} .
We assume as we may that the given function has no zeros on the boundary
of P(z0). It is obvious that z = 0 is a zero of 0t(z) so that v/2, tt/2 + irtl2,
and TTt/2 are zeros of 02(z), 0j(z) and 84{z), respectively. Moreover, (2.6.4)
shows that z0 + mir + nirt (m, n integral) is a zero of a theta function
whenever z0 is. We now show that each theta function has exactly one zero
in each fundamental parallelogram. Thus we will have specified all the zeros
above.
Consider the integral
2iri JpOo) 8{z)
which gives the number of zeros of 8 inside P(z0). Explicitly,
54
The Arithmetic-Geometric Mean Iteration
0(z) 0(z + TTt)
0(z) 0(z + 77-r)
z0+irr
liriN
-n
-/:
dz
Lfl(z)
0(z + y)
8(Z + 77-)
dz
Now logarithmic differentiation of (2.6.4) shows that
0(z)
(2.6.5)
8(z)
0(z+tt) 8(Z+7Tt)
8{z + 77-) 8{z + ttO '
Thus 27r/iV = 277-f and N = 1 as required.
The identities in (2.6.4) show that each ft/ft, i¥=j, is doubly periodic. In
combination with our knowledge of the zeros of each ft we can painlessly
apply Liouville's principle (bounded entire functions are constant) to
establish many identities. We illustrate this with the following.
Proposition 2.1
(2.6.6)
Proof. Consider
«fc)«4 :
M-=
8\{z)8\
8\{z)8\
8\{z)8\ - 8\{z)8\
By (2.6.4) /(z) is doubly periodic with periods v and 7rf. Moreover 82A{z)
has a double zero at 7rf/2 in P(0). Also (2.6.2) shows that 8\{TrtlT)8\ =
8\{jrtl7)8\. Thus/is elliptic with at most one simple pole in P(0), and by
Exercise 3a) of Section 1.7,/can have no poles. Hence/is constant, being
bounded and analytic. Since 82{irl2) = 0 and 0,(77-/2) = 04, 04(7t72) = 83, the
constant must be 8\ and (2.6.6) follows. □
Note that on letting z = 0 in (2.6.6), we recover (2.1.10): 03 = 02 + 04-
It is a simple matter to recast Example 2.2 as the classical theta
transformation formula.
Theorem 2.4
For z an arbitrary complex number and im (t) >0,
(2.6.7) 03(^0 = (-^)-^^^^3(7,-7)-
Here one takes the principal square root. The proof is left as Exercise 3.
It is equally simple to use the Jacobi triple-product of Chapter 3 to
2.6 General Theta Functions
55
produce product expressions for 0;(z) (Exercise 4). Various other
relationships are indicated in the exercises.
Comments and Exercises
There is a proliferation of notations for theta functions. We follow the most
usual notations (used in Bellman [61]) and note that both Dickson
[71, vol. 3, p. 93] and Whittaker and Watson [27] give tables of alternate
notations. The abuse of functional notation, in particular the distinctions
among 8}, 0;(#), 8j(t), and 0-(z), necessitates some caution. Both Bellman
[61] and Whittaker and Watson [27] provide an accessible introduction to a
considerable amount of material on theta functions.
1. Verify the identities of (2.6.2) and (2.6.4). Note that (2.6.2) shows that
we can restrict attention to one theta function (say, 03) and lose no
information.
2. Show using Liouville's principle that,
a) e\{z)el = e\{z)e\-e\{z)e\
b) e\{z)e\ = e\{z)e\-e\{z)e\
c) e3(z + w)e3(z - w)e23 = el(w)e23(z) + e\{w)e\{z)
d) 84(z + w)e4(z - w)e24 = e24(w)e24(z) - e\{w)e\{z)
e) 8x{z + w)84{z - w)8283 = el(z)e4(z)6l(w)6i(w)
+ 82(z)83(z)81(w)84(w) .
Results of this kind are discussed in detail by Whittaker and Watson.
Many are given in tabular form in Erdelyi et al. [53].
3. Establish Theorem 2.4.
CO
4. Show that with Q0 := FE (1 — q2n),
n = \
CO
i) fl,(z) = Q0 11 (1 + 2q2n-1 cos (2z) + tf4""2)
n = l
CO
ii) B^z) = 2q1,4Q0 sin z U (1 - 2q2n cos (2z) + q4n) .
Establish similar identities for 82(z) and 84(z). Many variations are listed
in Erdelyi et al. [53]. (This relies on Section 3.1.)
5. Use Exercise 4 to show that
a M-4Y (-l)Vsin(2/iz)
1} fl,(z) 4^ \-q2n
56
The Arithmetic-Geometric Mean Iteration
u) logrMgi=4i(zi^!iBiM
'L 6,(0)
i 1-q'
for |im(z)| < \ log \q\.
6. One of the easy but important properties of theta functions is that they
solve the one-dimensional heat or diffusion equation
d26(z, t) _ 4f d8(z, t)
dz2 ~ it dt '
This provides a significant connection between the analytic properties of
theta functions as functions of z and their number theoretic properties
which revolve around the variable q. Various of these matters are
pursued in Bellman [61] and Rademacher [73].
7. Bellman [61] gives an interesting functional equation approach to
Theorem 2.4. Consider entire solutions /to
i) /(Z+T7)=/(Z)
ii) f(z+7Tt) = be-2,zf(z)
where b is an unspecified function of t independent of z. Suppose,
temporarily, that / possesses an absolutely convergent Fourier
expansion valid for all z:
m-= 2 c„e
2niz
a) Show that c„+1 = b xq2ncn. Hence deduce that for \q\ = |e"'| < 1,
f(z) = c0\ 2 ?«(«-D6-V"'z
*-n = ~ eo
is an analytic solution to i) and ii) and is unique up to choice of
constant. When q = b~ , we recover fl,(z).
b) Show that any entire solution to i) and ii) has an absolutely
convergent Fourier series. Hint: Use contour integration to show
that for any integral k,
I/.
it/2
/(z) e~ ,nz dz
—nil
<e-2knTr max \f(w-ik)\.
c) Use the uniqueness of solutions of i) and ii) to rederive Theorem
2.4. Hint: Show that both sides satisfy i) and ii). Then use Exercise
6 to normalize the equation.
There is also a considerable literature on multidimensional theta
functions. (See Bellman [61].) Since they do not impinge on our main
considerations, we say no more.
2.7 The Landen Transformation 57
2.7 THE LANDEN TRANSFORMATION
We finish this chapter by deriving an expression for sn in terms of theta
quotients and by relating this expression to the transformation of incomplete
elliptic integrals. We begin with the classical Landen transformation in theta
form.
Theorem 2.5
For all z and im (t) > 0,
83(z, t)84(z, t) fl,(0, t)84(0, 0 _ 82(z, M(z, 0
(2.7.1)
e4(2z,2t) e4(0,2t) e^izyit)
Proof. As a function of z, 84(2z, 2i) has zeros when 2z = irt + rmr +
2nirt or when z = mv/2 + (2n + l)vt/2. This is exactly where 03(z, t) or
64(z, t) is zero. Again (2.6.4) shows that/,(z):= 63(z, t)84(z, t)/84(2z,2t) is
doubly periodic with periods it and vt. Thus Liouville's theorem shows that
ft{z) is a constant with respect to z. The second equality follows on
substituting z + irtl2 for z and using (2.6.2). □
To establish the existence and analyticity of sn in theta terms we begin
with
( ' dz 64(z) °* 8\{z) ■
(See Exercise 1.) Now let p := 8^)/8^) and observe that
(This relies on Exercises 2a) and 2b) of the previous section.) Then
replacing z by 11822 and p by y = p83/82, we observe that, since k = 82/83,
(2.7.4) (|)'-(1-/X1-*V).
This is solved by
(2.7.5) Bn(ll)ik):=y=J4^Q
or
rr / ix «i(uvl2K)
^ksn^k)=W^I2K)
58 The Arithmetic-Geometric Mean Iteration
and then, in agreement with (1.7.1),
rsn(u,k) dy
r
'» V(l -y2){l-k2y2)
which solves the inversion problem (at least for real k) for the given
integral. We wish to have en + sn2 = 1 and k2 sn +dn =1, and it is
appropriate to define
(2.7.6) cn(«, k) .= - -^-^ = yjT- ^^
(See Exercise 2.)
Finally we wish to recast Theorem 2.5 in elliptic function terms.
Theorem 2.6 (The Descending Landen Transform)
Let 0 < i/f and 0 < k < 1 be given. If
^1 ' 1 + *'
and 0 < i/r < i/rj is given by
. , (1 + fc')sin i/rcos iA
sm i/fj: = T—
Vl - k2 sin2 ^
then
(2j-8) (i+t,)rvrfei=rviT|
2 sin2 0
or
(l + k')F(ilf,k) = F(ilf1,k1)
where
,, f* de
wt):=l-'- *^.
is the incomplete elliptic integral of the first kind.
2.7 The Landen Transformation
59
Proof. From Theorem 2.5 we deduce
^(z,Qfli(z,0_^(2z,20
03(z, 0«4(«. 0 «4(2z, 20 "
In. terms of sn, en, and dn this becomes
where u = zd\{q) and ux = 2zd\(q2) since kx = 02(q2)/83(q2).
Thus ^=(1-/:0/(1 + ^0 and u1 = 2ue23(q2)/823(q) = (l +k')u.
Collecting information, we have
(2.7.9) Bl = (l + *> t,-^ 811(^,^) = (1 + ^)-^-
since kk\xl2 = 1 + k'. The change of variables sin 6 = sn(w, /:) and sin 0t =
sn(«!, kx) now produces (2.7.8). D
Comments and Exercises
A profusion of information on the numerical use and derivation of various
Landen transforms is given in King [24]. More bibliographic information is
available in Watson [33] and in Whittaker and Watson [27].
1. Prove the differential equation (2.7.2) by showing that
^(z)«4(z)-«4(z)^(z)
*(2): =
02(z)03(z)
is doubly periodic with periods ir and irtll and that <f>(z) has simple
poles possibly only at v/2, v/2 + vt/2, and translated points. Then
show that (f>(z + vt/2) = <f>(z), and hence relative to the periods it and
■ntl2, <f> is doubly periodic with a single pole. Thus <£ is constant.
a) Establish (2.7.5).
b) With en and dn defined as in (2.7.6) and (2.7.7) show that
en2 + sn2 = 1, dn2 + k2 sn2 = 1, and that cn(0) = dn(0) = 1.
c) Show that (d/du) sn(«, k) = cn(w, k) dn(«, k).
d) Show that sn is strictly increasing on (0, k) with sn(0, /:) = 0 and
sn(jRT, k) = 1. [You may find it convenient to use the product
formulae for 0j(z) and 04(z).]
e) Show that dn(K) = k'.
f) Establish the double periodicity of sn(«, k), cn(«, k), and dn(«, k)
with respect to u and verify the following table:
60 The Arithmetic-Geometric Mean Iteration
Periods Zeros Poles Residues
sn
en
dn
4K, 2iK'
4K,2(K + iK')
2K, 4iK'
2mK + 2niK'
(2m + 1)*: + 2niK'
(2m + \)K + (2n + Y)iK'
2mK+(2n + \)iK'
2mK+(2n + \)iK'
2mK+(2n + \)iK'
(-
("
(-
-\)mlk
-l)m+1i/k
3. The ascending Landen transform is given by reversing the roles of k and
ku if/ and i/^. Thus if
,. 2Vk
and
then
l + k
sin (2^ - i/f) = k sin i/r fa > if/
(l + k)
W, k)=Fth, k,).
These transforms and their analogues for incomplete second and third
integrals clearly lead to quadratic iterations which are studied in detail
in King [24] without reference to theta functions. Note that if iff = it 12,
we recover the quadratic transformation for K.
a) Show that, in the notation of Theorem 2.6,
cos2 if/ — k' sin2 tp
COS fa = —■ =r- .
Vl - k2 sin2 if,
b) Hence show that
1 — k'
sin (2i/r - i/O = l + k, sin fa = K sin fa .
c) Thus show that, if k = 2\fk[l(\ + kx) and sin (2i/r - fa) = kx sin i/^,
then
^,^) = -^-^^,^).
Compare Exercise 5 of Section 1.4.
a) Show that if u : = 2Kx/tt, then
sn(«, k)=2q1,4k-ln sin (a;) fl f T^
n = l L 1 —
- 2qz" cos (2s) + q™
2n-l /0n , 4n-2
■ 2^-1 cos (2*) + ^
b) Find similar expressions for en and dn
2.7 The Landen Transformation 61
5. Combine Exercise 4a) and Theorem 2.6 to produce a quadratically
converging approximation to sn(w, k) given one for sin x.
6. Use (2.7.5) and Exercise 2 of Section 2.6 to show "the addition
theorem"
sn(w) cn(y) dn(y) + sn(y) cn(w) dn(w)
sn(w + v) = -J—27^—27T •
\-k sn2(«)sn (v)
Obtain similar expressions for cn(« + v) and dn(« + v). (See Whittaker
and Watson [27] and Exercise 11 of Section 1.7.)
Chapter Three
JacobVs Triple Product and Some
Number Theoretic Applications
Abstract. We establish JacobVs triple-product identity and apply it quite
variously. We first use it to derive the fundamental product identities for the
theta functions. We then rederive the triple product via Cauchy's q-binomial
theorem and present Bressoud's beautiful elementary proof of the celebrated
Rogers-Ramanujan identities. After this we derive JacobVs formula for r4(k)
(the representation of k as a sum of four squares) and two partition results
due to Ramanujan. We also establish the Gaussian sum formula and indicate
another proof of the theta transformation formula. Then we briefly discuss the
Mellin transform and use it to give the classical reflection formula for the
Riemann zeta function. Finally, we show how certain reciprocal series can be
evaluated in terms of theta functions. In particular, we give a result due to
Landau on the Fibonacci numbers. We also sum the squares of the reciprocals
of the Fibonacci numbers.
3.1 JACOBVS TRIPLE-PRODUCT IDENTITY
Our first proof of the triple-product identity is:
Theorem 3.1
For each pair of complex numbers x and q, with x ¥=0 and \q\ < 1,
co eo
(3.1.1) n (l + V-1)(i + x~V"_1)(i - i2n) = 2 A"2 •
n = l n--«>
Proof. Let F(x, ¢) : = Id (1 + V-1)(l + *"V""1). Now F(-, q) is
62
3.1 Jacobi's Triple-Product Identity
analytic except at zero and has a Laurent expansion at zero. Observe
F(x, q) = F(x~\ q) and F(xq2, q) = (*?)"¥(*, q). Thus if
CO
F(x,q)= 2 cn(q)xn
then cn(q) = C-n(q) and cn(q) = q2n~1cn_1(q) for n3=0. It follows
cn(q) = q"2c0(q) and
(3.1.2)
F(x,q) = c0(q) 2 A"
"t
I
It remains to evaluate c0(q). Letting x'-=l in (3.1.2) gives c0(#)fl,(
iCi (1 + q2"-1)2 and letting x := -1 gives co(?)04(?) = 11^ (1 - q2n
Since V03(?)04(?) = S4(q2), we deduce that
n = l
c0(q)84(q2)=ll(l-q4n-2)
But on replacing # by q2,
c0(q2Mq2) = U(l-q4"'2f
Hence
n = l
^-mi-c*2)2"-1].
Since c0(0) = 1 and c0 is analytic at zero,
CoiqT1 = 111^1) = 111111-^2^]
k = lLCQ(q )J t=l«-l
= fi(i-?2m).
m = l
This establishes (3.1.1). □
It is convenient to make the following notational abbreviations.
(3.1.3)
Qo-=U(l-q2n) Q1-=U(l + q2")
n=l n-1
22--=11(1 + ^-1) Q3-=fl(l-q2n-1)
n=l n»l
64 Some Number Theoretic Applications
From these definitions one easily verifies Euler's identity QiQ2Q3 = 1,
which may also be written
CO CO
(3.1.4) U(l + q")U(l-q2"~1) = l.
n=l n-1
Also
(3.1.5/) Q0Q1 = Q0(q2)
(3.i.5«) Q0Q3 = Qo(qin)
(3.1.5m) Q2Q3 = Q3(q2)
(3.1.5/v) QiQ2 = Qiiq1'2) ■
We gather the first three specializations of the triple-product identity into:
Corollary 3.1
For \q\ <1, one has
CO
(3.1.6) <%(*) = Q0Q22 = II (1" q2")(l + q2n"f
n = l
CO
(3.1.7) 84(q) = Q0Q23 = II (1 - q2")(l ~ q2n"f
n = l
CO
(3.1.8) e2(q)=2qV4Q0Q\=2q1UY{(l- q2n)(l^ q2nf .
n = l
Proof. These follow on using x-=l, —1, and ¢, respectively, in (3.1.1)
(Exercise 2). □
More generally, let k and I be real numbers, and let q'= qk and * := ±ql
in (3.1.1). Then
(3.1.9) I! (1 ± q2kn+k~')(l ± q2kn+k+')(l - q2kn+2k)
= 2 (±i)nqkn2+ln:
n = —co
When k'.= l and /: = \, this gives
CO CO
(3.1.10) 11(1-?")= 2 (-l)V3n+1)n/
This is Euler's pentagonal number theorem, which he found empirically and
which affords a combinatorial interpretation [Exercise 3b)], as most of these
identities do.
When /::=/:=3, we have
3.1 Jacobi's Triple-Product Identity 65
co co co _ 2n
(3.1.11) I[(l + q")(l-q2")=~ 2 ^^ = 11:^-4=1.
„ = 1 Z n = -«> „_i l — q
Finally, k'-=\ and /:=5,5give two formulae which play a central role in
the Rogers-Ramanujan identities (Section 3.4):
(3.1.12a) U(l-q5n+1)(l~q5n+4)(l-q5"+5)= 2 (-l)V5"+3>"/2
00 00
(3.1.126) U(l-q5"+2)(l-q5n+3)(l-q5n+5)= 2 (-l)V5n+1)n/2 .
n = 0
We finish the section with a slightly less immediate corollary of the triple
product. If q '■= q1'2 and x '■= q1 w, then (3.1.1) becomes
(3.1.13) 11 (1 - qn)(l + q"W)(l + q"^1) = (-^-) f W"q«»^>2 .
The right-hand side is
7 (W +W )i7m(m+l)l2 V -«/* + 1\ m(m+l)/2
If we now let w tend to — 1 from above this gives
co co
(3.1.14) EI (1 - q"f = 2 (-l)m(2m + i)qm(-m+^'2 .
n=l m=0
Thus with (3.1.10),
(3.1.15) [ 2 (-i)-y3»+1>»/2l =2 (-l)m(2m + l)?m(m+1)/2.
Lm = -co J OT=0
Comments and Exercises
The proof of (3.1.2) is due to Gauss [1866]. The AGM identity (2.1.7) was
also in his possession, but Jacobi, by elliptic function techniques, was the
first to publish a proof of the triple-product identity. There are many proofs
in the literature. Given the AGM, none perhaps is as simple as the one
given here.
1. Verify the identities (3.1.4) and (3.1.5).
2. a) Prove Corollary 3.1.
b) Show that
(3.1.16) ft (1 + q2"'1)8 = 11 (1 " q2"'1? + 16? 11(1 + q2"f .
66 Some Number Theoretic Applications
3. a) Establish (3.1.9), (3.1.10), and (3.1.11).
b) A pentagonal number is a number of the form n(3n ± 1)/2. Show
that (3.1.10) implies that every nonpentagonal number can be
partitioned into an even number of distinct parts as often as into an
odd number of distinct parts. Show that for pentagonal numbers
there is a surplus or deficit of 1, depending on whether n is odd or
even. This was first observed by Legendre in 1830. (See Dickson
[71, vol. 2].)
4. Establish (3.1.12) and give an interpretation in terms of partitions.
Ewell [81] observes that Euler's pentagonal number formula (3.1.10)
allows one to establish Jacobi's triple product, given (3.1.2). Thus one
can base the identity on a combinatorial proof of (3.1.10) such as is
given in Hardy and Wright [60].
5. Use (3.1.10) to show that c0(q3) n^=1 (1 - q6n) = 1.
6. A complex analytic approach to Jacobi's triple-product is as follows.
a) Show that (3.1.1) is equivalent to
CO CO
(3.1.17) 04(z, q) = EI [1 - 2q2"~1 cos 2z + q*n~2] Y\{1- q2n)
n=l n=l
where q '■ = emt.
b) From the discussion of Section 2.6 show that both sides of (3.1.17)
are analytic with zeros at z = (n + \ )irt + mir (n, m integral). By
Liouville's theorem they differ only by a multiplicative constant.
c) Use Exercise 5 to show that this constant is 1.
7. a) Justify taking the limit in (3.1.14).
b) Prove that (3.1.15) is equivalent to
(3.1.18) [ 2 (-l)Y6n+1)2l =2 (-ir(2m + l)?3(2m+1)2.
c) Observe that (3.1.18) has the following number theoretic
interpretation due to Catalan (Dickson [71, vol. 2]). The excess in the
number of even values of x + y + z in
(6x ± 1)2 + (6y ± 1)2 + (6z ± 1)2 = 3(2n + 1)2
over the number of odd values of x + y + z is (2n + 1)(-1)" In
particular, any number of the form 3(2n + 1)2 must have at least
(2n + 1)/6 decompositions as a sum of three squares.
Show that 03 and 04 never vanish (\q\ < 1).
Show that 82 and fl, increase monotonically on (0,1) and 04
decreases monotonically on (0,1).
8. a)
b)
3.2 Some Further Theta Function Identities 67
9. Let p(n) denote the number of partitions of a natural number n into
positive integral parts (the order being irrelevant). Thus p{A) = 5 and
,,(5) = 7.
a) Show that
(3.1.19) 1 + 2 p(n)qn = f[(l- q")^ = 1+2 7^-%
k\2
"iXlUil-q*)
For the second equality see also Proposition 3.4 of Section 3.3.
b) Use
\l+l P(n)qn] 2 (-1)V(3"+1)/2 = 1
L „ = 1 J n=-»
to write (and implement) a recursive formula for p (n). This is how
MacMahon (1918) computed />(n), l<n<200; /»(200) =
3972999029388. For more information on partition theory the
reader is referred to Hardy and Wright [60] or Andrews [76].
c) (Euler) Establish that
2 p(n-a(m))= 2 p(n-a(m))
m even m odd
- where a(m):= m(3m + 1)/2 is the mth pentagonal number.
3.2 SOME FURTHER THETA FUNCTION IDENTITIES
In this section we collect a number of definitions and relations, some of
intrinsic interest and some for future reference.
Let r G (0, oo) and define A*(r) := A:(e_W7). As in Chapter 2, we consider
k as a function of q (=e-irV7). Then
(3.2.1) 0£4e-W7/2 -A*(r) = 16e""3W7/2 + 0(e""SW7/2)
SUIV^C /I ^/^ — (72V
(3.2.2)
From Corollary
(3.2.3i) .
e ;/
3.1 we
e2{q)
o3ve
K'
£
lave
= Vfc =
^ vOCC JL
[A*(r)] =
2?i/4 ri
n = l
^ACltlSC J..
Vr.
( W"
\\ + q2n-
)■ "
*)'
68 Some Number Theoretic Applications
(323«) Mi)-vF=nf^^y
when 9 := e^™^.
We next derive another beautiful identity due to Jacobi, in which 8^
denotes the derivative of 8X with respect to z at zero. [See equation (2.6.1).]
Proposition 3.1
For \q\<\,
CO
(3.2.4) 82(q)83(q)84(q) = 2 2 (-l)"(2n + l)?(n+1/2>2 = O^q) .
Proof. Identity (3.1.14) can be rewritten as
OUq) = 2qV4Ql = 2qXI*Q\{Q,Q&,f
= Q-qllAQoQ\)(QoQ2i)(QoQl) = <%<%«4. □
For various applications to lattice sums it is natural to augment our theta
definitions by
CO
(3.2.5) 65{q): = 2 2 (-l)Y2"-1/2)2
and
(3.2.6) 06+(?): = 2i (-l)"("-1)/2(2n + l)?(n+1/2)2.
By similar arguments [see (3.1.14)] one can show that
(3.2.7) 85(q) = Iq^QvQJQziq2) = V2l# VXV)
and
(3.2.8) e+6{q) = 85823(q2) = 0,8,(q2)84(q4) .
We leave these identities as Exercise 6 and note that (3.2.4) and (3.2.8)
have number theoretic interpretations like that of (3.1.15). These three
identities and their remanipulations are among the very few known
reductions of three theta terms to one known theta expression. (See Glasser and
Zucker [80].)
Following Weber [08] and others, it is usual to identify the following
quantities. With q and r as above and t : = iVr,
3.2 Some Further Theta Function Identities
(3.2.9/) i} = i,(V=r) = t,(t) : = ?1/12eo
(3.2.9b) A =/x(V=r) =/x(t) := ?-1/24Q3 = (4*'2/*)1/12
(3.2.9m) /2 =/2(V=r) =/2(r) : = 21/V/12Q1 = (4*2/*')m2
(3.2.9/v) / = /(V=7) =/(r) : = <f 1/24Q2 = (4/kk'f12 .
Then
(3.2.100 /1/2/= V2
(3.2.108) /8 = /?+/2
and jRT satisfies
(3.2.11) K=-r)f -^VJi-^jvfi-
As will be discussed in Chapter 4, whenever r is rational, /1} /2, and /
satisfy algebraic equations (whose degree is determined by the number of
quadratic forms with determinant -4r). Thus once tj and one other of flff2,
f, k, k' is known, all six are determined,
A significant identity which follows from (3.2.9iv) is
(3.2.12) 2. 2 ' 2 = ~vf log [/(V=7)] = — log I —- I .
Here and hereafter, the prime over a summation indicates that the term
m = n = 0 is omitted and summation is over expanding rectangles. One
establishes (3.2.12) by writing
-^iog(Ga)=£i£ tf2q^k
v' v '■ t=ln=0 K
= ir2 —7=r- cosech (irVrk) = 2 2
00 00 / 1\*+«
(-ir_„„w„^_v v tiy
t^\ Vrk ~" "v/ ^xm^1-™ m2 + rk2
[using (2.2.3)], and so
V' ("I) 47T , 77
2, 2, 2 =~ v?log(<22)- —
since S^=1 (-l)m+1/m2 = 7r2/12. This is the desired result. One may
observe that whenever one can evaluate the double sum, one can also evaluate
kk'.
In some future work we will be following Ramanujan rather than Weber.
Ramanujan studied
70 Some Number Theoretic Applications
G„ := (2kk'yin2 = 2~1/4/(V^n)
, ,,2\ 1/12
..-(If) -2-'V,<V=*>.
For some purposes these give slightly cleaner results. For example,
_V5+1 _ V5+1
O25- 2 fto-\ 2 '
For the moment we denote k{ q") by 8 and jK(S) by A. We have
,3
(3.2.14) kk'^f) =4V?Go
so that
(3215) mV6ji- qU12Q°
^■2A5) \88'J VA _ qnn2Q0(q")
We also have
(3.2,6) (M)'(1-2*»)-l-24Ze5±iJ£
\ 7T / „=0 1+ q
(3.2.17) (^)V - k2k>2) = 1 + 240 £ -¾
v t / „=1 1 — 0
and
(3.2.18) (—) (1 - 2*2)(l + \ k2k'2) = 1 - 504 2 ^
\ IT I \ I I n = l 1 "
" -y
12n
These formulae lie considerably deeper, either in elliptic function theory, or
as direct computations. Indeed each formula entails formulae for powers of
theta functions. We prove only (3.2.16) and leave (3.2.17) as an exercise.
We need an identity whose proof we also leave as an exercise.
Lemma 3.1
For \q\ <l,
nqn _ ^ {In + l)q2n+x
(3.2.19) 2^ = 2^
2n+l
9
Proposition 3.2
For \q\<\,
(3.2.20) 6t(q) = 1 + 8 £ ^C + 8 £ ^¾^
„=0 1+9 n = 0 1 — ^
3.2 Some Further Theta Function Identities 71
,„n ,4,, of (2n + l)g2"+1 f (2n + l)g2"+1
(3.2.21) 02(g)-8Z ., 2«+i +8Z—;— 2«+i
„ = 0 1 + ^ n = 0 1 ~ 9
(3.2.22) ^) = 1 + 82^-82^^1^ ■
„=0 1+ ^ n=0 1 + ^
Proof From (2.3.16), on differentiating with respect to q = e~ns, and
Corollary 3.1 we have
*«>~««<H)-*£*(i)+>
which yields (3.2.20). The other two follow similarly. □
Theorem 3.2
For \q\ <1,
(3.2.23) «J(«) = l + 8 2 7¾
n^0(mod4) X 9
nal
and
2n + l
(3.2.24) flj(*) - e\{q) = 1 - 24 2 ^¾
„=o 1 + 9
Proo/. Apply Lemma 3.1 to (3.2.20) to establish (3.2.23). To prove
(3.2.24) one uses the lemma with q and —q. □
Now (3.2.16) is immediate from (3.2.24). Formula (3.2.17) is similarly
derived, if one knows in addition (see Rademacher [73]) that
(3.2.25) 8l(-q) = 8\{q) = 1 + 16 2 ^"^ .
Finally, (3.2.18) follows from (see Rademacher [73])
(3.2.26) 0«(g) = i + 8f JtiL-oS ^^+(0+)4
„=i 1- q „=i 1~9
Of course, implicit in (3.2.23) is a formula for the number of
representations of n as a sum of four squares. We discuss this more fully later.
72
Some Number Theoretic Applications
Comments and Exercises
1. Establish the asymptotic of (3.2.1).
Proposition 3.1 is often established by analytic arguments like those of
Exercise 6 of Section 3.1. Proposition 3.1 may then be used, as it is in
Whittaker and Watson, to establish the triple-product identity. (See
Exercise 6.)
2. a) Show that
n[l-e-(2-+1)'] = 21/8e-
n = 0
b) Show that
2 e-(2"+1>2- = (21/4-i)77-3/42-"/4r(i).
n=0 ^4/
(This was set in Trinity College, Cambridge, in 1881.)
c) Recall that if *:= (V3-1)/V8, then kk'= \ and K'(k) =
y/3K(k). Deduce that
n (l+e-ini"^)=2~i3/24(V3+1)" vi2V3.
n = l
d) Similarly, if k:= V2(3 - V7)/8, then kk' = £ and K'{k) =
VlK(k). Deduce that
CO
11 (1 + e-2nWV7) = 2-5/8(3 + V7)1M e"lnV~7 .
n = \
3. Establish the identities in (3.2.9), (3.2.10), (3.2.11), (3.2.14), and
(3.2.15).
4. a) Justify the derivation and convergence of (3.2.12).
b) Evaluate the left-hand sum when r '■= 1, 2, 3, 4, 7.
c) Prove that as r tends to infinity, the sum converges to —tt2/6. It is
already "close" by the time r = 7.
d) Prove that
Y V' (-1)" 477, r,, „ ir. (4k'2\
m=—n=- n + rm Vr °LJ^ 3Vr \ k /
3.2 Some Further Theta Function Identities 73
5. Show that
a) g*n = 2l"gnGn
b) Gn=GVnwAgnl = gun
c) {gnGn)\Gl~gl) = \.
6. a) Prove Lemma 3.1 by expanding both sides.
b) Prove (3.2.7) and (3.2.8).
c) Establish Theorem 3.2.
d) Prove (3.2.16) and (3.2.17). Try to prove (3.2.18). Exercise 9 will
help.
The next two exercises sketch out some facts about Jacobi's imaginary
quadratic transformations.
7. a) The transformation kx •'= k1 is given in theta terms by
(t)V">=(f)V-"-v
b) K'(kx) = kK' and K(kx) = k(K + iK').
c) The transformation kx:=k'~l gives K'{kx) = k'K and K{kx) =
k'(K' + iK).
d) The transformation kx'-=iklk' gives k[ = -k'~\ K(kx) = k'K,
and K'(kx) = k'{K' - iK), since re(s) > 0.
e) The transformation kx = k'lik gives k'x = k~l, K{kx) = kK', and
K'(kx) = k(K + iK').
8. a) Use sx ■'= i + s~x to derive from (2.3.14) that
_i= Aw. -.to
2 *<%(,) * 04(Sl)
and
Mill = _ • AM + AM + I
S>04(*i) l04(Sl) * 6,(5) + 2-
b) Now use Exercise 7a) and (2.3.17) and (2.3.18) to show that when
kx := k~\ kE{kx) = E(k) - (k')2K(k).
c) Derive similar formulae for E(kx) for the other three imaginary
transformations.
d) Evaluate K'lK when k'-= VZ + V2.
9. a) For r := 1, 2, 3, 4, show that ^(fq) = 83(q4) + 1¾¾4).
b) trrie"3(irq) = 4 i (?k-V)«2(*4)-
&=y(mod 4)
c) Evaluate 03~*0* for A: := 1, 2, 3, 4.
74 Some Number Theoretic Applications
10. Show that
j. ("ircTnc"(r+i)7r=\ ** w - ^
[Let /-:=9, 2kk' = (2 - V5)2 in (3.2.12). See (4.6.10).]
The formula for sums of two squares is derived from Jacobi's identity
CO „
(3.2.27) »3 = l + 4lrru.
n=\ 1 + q
This will be shown in Chapter 9.
11. A comprehensive list of hyperbolic identities can be derived, as was
(3.2.12). We have, for q : = e~",
1
(3.2.28/) -log Q0 = 2 , 2™ n
i n(e - 1)
_ 1 Y coth(7rns) — 1
~2 Y n
its 1 (2K3kk'\
= - n - 6 lo^ [-J-)
(3.2.28«) -ioget=i (2;ir
l n(e - 1)
_ 1 y (-l)"[CQth(ff«8)-l]
2 j n
= _ tts _ J_ / A:2 \
12 12 g \ 16/V I
„ 1 v (-l)"cosech(7rns)
(3.2.28m) -log Q2 = ■= L ^-2 * L
L J ft
ITS
24
- Tilog (*f)
/-^ no' \ i st, ! V cosech (7ms)
(3.2.28*v) -log 03 = -Z -*
3.2 Some Further Theta Function Identities
(3.2.28v) log Q0Q\ = log ^¾ == S
1 °°
= -2
2 ,
2# i n(e +1)
1 y 1 — tanh (jrtis)
ITS
T
1 , (kK\
(3.2.28vi) | log Q0q2 =3 log fl,
y 1 =1, /¾
""f (2n-l)K(2"-1)s + l] 410gW/
(3.2.28v//) - i log e0e2 = - | log 04
i (2n - i)[e<2"-^ - 1]
-i^h-.-.-ib,^)
(3.,28W,.01og[|^] = |_I-)l_
= ly (-!)"[!-tanh(ff/u)]
2 j n
T + 2log VIFV
/o^^o-\ , ^ ^2 v cosech [7r(2n - l)s] 1
(3.2.28**) log ete22 = Z ^^ ° = - 4 log k' .
These are taken from Zucker [84]. Many more identities follow
differentiation or as below.
12. a) Show that
Hmf: Use (3.2.28k). Then replace q by q1'2 so that k' is replac
by (1 — /:)/(1 + k). Then replace # by —q so that /: is replaced
iklk' (as in Exercise 7).
b) Show that
76
Some Number Theoretic Applications
+ 1)tts/p
V (-1)" sech
„t'02n + lSeCtl
f"(2n
tt/8
tt/24
isin_1[(3-V7)/4V2l
= 1
= 3
= 7
3.3 A COMBINATORIAL APPROACH TO THE TRIPLE-PRODUi
IDENTITY
The Gaussian or q-binomial coefficients are polynomials defined by
0/q \n/c
and by
(3.3.1)
(") =-
\m/ q
(?)„ n"=„-m + l(l - 91)
* («)»(«)-» nr=1(i-<7s)
when 0<m<n. Here (^)„ is defined by
(3.3.2)
(*).:= n
In
-q
l-qs+
This allows for any complex value of s, but we will consider only integral
values. Thus for n in N, (q)n = U"=1 (1 - q') and (q)Z\ = 0. For m<0 or
m > n we either define ( ) to be zero, or observe that it is implicit in our
definition of (q)m (for integral m). The next proposition gathers up some
easy facts.
Proposition 3.3
For \q\ <1,
(fl) (m + l)t = (m + l)t + «"'"(m)t
(6) Hm(n) =(")
(c) ( I is a polynomial in q.
\m/ q
Cauchy's binomial theorem is, for n = 1, 2, 3, . . . ,
3.3 A Combinatorial Approach to the Triple-Product Identity 77
(3.3.3) 2 y^-O-iW n) = fl (i + M*}.
m=0 \m/g fc=1
This is easily established inductively by showing that both sides of (3.3.3)
agree for n = 1 and satisfy
(3.3.4) Fn(y, q) = *„_,(>», q) + yqnFn_x{y, q) .
One can also argue, combinatorially, that the coefficient of ysq' on each side
gives the number of partitions of t into s distinct parts not exceeding n.
If we now let y ■ = xq~N and n '■ = 2N, we obtain, after some manipulation,
(3.3.5) £ *v(m+i)/2L2*J =n(i+x^)(i+*-v-1)'
m-~N Xly m/ 1 1
a result also due to Cauchy.
This bears a striking resemblance to the triple product in the form of
( 2iV \_1
(3.1.13). Indeed on letting N tend to oo one has \N_ ) tending to
Q(q) '■= n^=1 (1 - qm), and we are left with (3.1.13) if we can justify the
exchange of limit and summation (Exercise 3). Thus (3.3.5) deserves to be
considered as a finite form of the triple product.
The following Eulerian result will also be used in the next section. For
notational simplicity the empty product equals 1.
Proposition 3.4
For k = 1, ,2,.. . ,
rr 1 y *V2 (k\
fA l-v /-o(i-**)---(i-V) WV
Proof. Inductively one establishes that each side satisfies
xqk
Gk(x, q) = Gk„x{x, q) + y^—• Gk_x{xq, q) .
Obviously the result is true for k = l. Alternatively one can argue that the
coefficient of x'qs on each side counts partitions of s into t parts not
exceeding k. □
Comments and Exercises
Here and in the next section we follow Bressoud [83].
1. Prove Proposition 3.3 and (3.3.3).
78 Some Number Theoretic Applications
2. Verify (3.3.5).
3. a) Prove that for \q\ < 1,
limf 2" ) =Q-\q).
b) Justify the exchange of limit and summation in (3.3.5) by observing
that the terms possess uniform majorants that are summable.
4. a) Prove Proposition 3.4.
b) Deduce that
CO CO
(3.3.6) I! j = 2
,=i 1 - xq' ,=o
*v
(l-xq)(l-q)---(l-xq')(l-q>)
and compare with (3.1.19).
3.4 BRESSOUD'S 'EASY PROOF' OF THE
ROGERS-RAMANUJAN IDENTITIES
The Rogers-Ramanujan identities are
„2
n 1
(3.4.1) 1 + Zj ~7\ vTl ZJ\ /1 „m\ ~ 1 ■*■ 7l „5m+l\/1 5m + 4\
m = l (1-?)(1-? )•••(!-? ) m = o(l-? )(1-9 )
m = l (1-9)(1-9 )---(1-9 ) m = 0(l-9 )(1-9 )
They afford a remarkable combinatorial interpretation (Exercise 1). We
use equations (3.1.12a) and (3.1.126) and the notation of the last section to
rewrite the identities as
(3.4.3) 2 7^r = G_1(9) 2 (-iyy5m2+m)/2
m = 0 \H)m m = -»
00 mz + m <*>
(3.4.4) 2 fT-==G"1(9) 2 (-iyy5m2+3m)/2.
»"=0 K^m m=—<*>
It is this form that we establish. The key observation is the following.
Lemma 3.2
For n positive and integral and any complex a,
3.4 Rogers-Ramanujan Identities 79
m am2 s2 m (a-l)m2
(3.4.5) S, '« -2-^-2"%, ■
Proof. Note that the above sums are all finite. Set k'.— n — m and
x:= #2m in Proposition 3.4. Then multiply each side by (¢)^. We get
(?)n + m=2 T-
'^^ (*)-
Make this substitution for (q)~lm in the left-hand side of (3.4.5). We have
m am2 m am2 j2+2mj, \
y x q _ y x q y q K1)n-m
m KQjn-mKQjn + m m n — m—j
x qK
j +2m
We now sum over m and s-=m + j and exhibit the right-hand side of
(3.4.5). □
m2
The effect of Lemma 3.2 is to reduce the power of q by 1. Repeated
use of the lemma results in an expression which can be handled by the finite
triple-product. As we will see, the k = 2 case of the next result contains the
desired identities.
Theorem 3.3
Given positive integers k and iV, we have
_s\ + s\+---+32. Sk
x —^— — nd+x^xi+x-v1)
m \Jy — ml q
Proof. Commence with applying the lemma to the right-hand side of
(3.4.6). We have
(^2*v(2*+i)m2+m]/2L ) =s(7x}*, x—
- \N-m/q m (q)N-m(q)N+m
N+m
2 - l/2yn <2k-l)m2/2
= V 1 v w ) g
(q)s-m(l)s +
80 Some Number Theoretic Applications
We continue by applying the lemma /:-1 more times and arrive at
/ \sl+s%+---+si / l/2\m m2/2
y (?) v (*g ) g
^,¾.....¾ (#)^-^( g^-JjC #)^^-¾ m \1)sk-m\4)sk+m
We use the finite triple-product (3.3.5) to write
, l/2yn m2/2 s*
2 ffi >« = (¾ n (i+^m)(i+*-v-1) •
m \H)sk-m\<l)sk+m m = l
This completes the proof. □
The theorem is applied by specifying k and/or x. When x:=-1, we
observe that unless sk = 0, the products on the left-hand side are zero. Thus
(3.4.7)
=(^2(-1)-,---(^),.
If we let N tend to », we arrive at
(3.4.8) 2 2
*1>*2 "k-l (¢)^-¾ " ' " (¢)^-2-^-1( ?)»*_!
n = l m = -»
For /::=1, the left-hand side is 1 and the formula recaptures Euler's
pentagonal number theorem. For /::=2, (3.4.8) coincides with the first
Rogers-Ramanujan identity (3.4.3). The second identity is derived similarly
by specifying x '■= —q and /::=2 (Exercise 3).
Comments and Exercises
The Rogers-Ramanujan identities were discovered by Rogers in 1894 and
rediscovered by Ramanujan in 1913 (a letter to Hardy) and Schur in 1917.
They have continued to receive a great deal of interest. As recently as 1979
Basxter rediscovered them in a physical context.
Hardy and Wright say "no proof is really easy (and it would perhaps be
unreasonable to expect an easy proof)." Bressoud's proof certainly comes
close to contradicting this. Much of the early history can be found in
Hardy's footnotes to Ramanujan [62].
3.5 Some Number Theoretic Applications
81
1. Show that (3.4.1) says that the number of partitions of n into parts with
minimal difference 2 is the number of partitions into parts congruent to
1 or 4 modulo 5. Equally, (3.4.2) says that the number of partitions of n
into parts with minimal difference 2 and minimal part 2 is the number of
partitions into parts congruent to 2 or 3 modulo 5.
2. Verify the equivalence of (3.4.1) to (3.4.3) and (3.4.2) to (3.4.4).
3. Derive the second identity (3.4.4) from Theorem 3.3. There is an
interesting continued fraction associated with the identities.
n
(l-x5n+2)(i-x5n+3) _ K=-A-i)nxi5n+1)nn
„io (l - x5n+1)(i - x5n+4) e:__. (-i)Vs"+3),,/2
(3.4.9)
2 3
= 1+ 1+ 1+ 1+"" '
When x'-=e~,r/" (n rational), this is in principle evaluable in closed
form. Ramanujan gives
(3-0, 1+£^-[.-(^.v?ii)]-.
In a recent paper, Bhargava and Chandrashekar Adiga [84] detail
(3.4.9) and other continued fraction identities. The closed form (3.4.10)
and its extensions are elaborated on in Ramanathan [84].
4. Investigate the analogue of Theorem 3.3 in which 2k replaces 2/: + 1.
3.5 SOME NUMBER THEORETIC APPLICATIONS
Our first application is a proof of Jacobi's formula for r4(ri), the number of
representations of n as a sum of four squares [including sign and
permutation so r4(2) = 24 and r4(l) = 8]
Theorem 3.4
For each positive integer n,
(3.5.1) r4(n) = 8 2 d.
d\n
d>^0(mod 4)
In particular every positive integer is the sum of four or fewer squares.
Proof. Expand the right-hand side of formula (3.2.23) for 6\{q) to
obtain
82 Some Number Theoretic Applications
CO CO CO y ,
4-Tm 4fd
Now compare coefficients with 0\(q) written as
+ 00 00
n-=-oo
The subsidiary conclusion is Lagrange's famous result. Our second
application is an analytic proof of Fermat's theorem that any prime of the form
4/: + 1 is the sum of two squares. It is convenient to define
o-j(n) := Zi d
d\n
and
(3.5.2) w(n) := <rx{ri) + o-j(odd n)
where odd(n) is the odd part of n (that is, the largest odd divisor of n). We
set w(n):=0 for n=sO. The proof of the following lemma is left as
Exercise 3.
Lemma 3.3
The value w(n) is divisible by 4 unless n is an odd square.
Theorem 3.S
An odd prime p is the sum of two integral squares if and only if it is
congruent to 1 modulo 4.
Proof. The 'only if is immediate on consideration of residues mod 4.
Now argue as in Proposition 3.2 and Theorem 3.4 to show that
\ri (2n + 1)# \ri n#
2j \ „2n+l -^ 1 _ _"
n=0 I" ? « = 1 A 9
(since Q0G3 = G)- Hence, on expanding these expressions (as in Exercise 12
of Section 3.7)
On multiplying by 04 and comparing terms we have
3.5 Some Number Theoretic Applications 83
(3.5.3) »v(n) + 22(-l)Mn-y2) = {2(01)r V ^^
Suppose now that n = p = 4m + l with p prime. Then w(p) = 2<rx(p) =
8m + 4 and we see that
Am + 2 + 2 (-l)Mf - ft = 0 •
Thus some w(p — j2) is not divisible by 4 and Lemma 3.3 implies that
p = j2 + /:2 for some integers j and /:. □
Our third application is to establish the following reciprocity result for
Gaussian sums. Let
S(p,q):=2e-^'"q
where p and q are nonzero integers.
Theorem 3.6
For positive integers p and q with pq even,
«(/>, ¢) = ^^5(<?>/>)•
(Here the bar represents complex conjugation.)
We will prove the result from the transformation formula for 6^ (2.3.1).
This needs the following:
Lemma 3.4
For pq even and q positive,
liraiVa,(e + ^) = ½ S(P>1)-
e-»o+ \ q I q
Proof. Write 03(e + i/>/?) := £"_ e-™2(«+*/?) j^
fl,(e+ ^) = 1 + 2^ «-**"[2 e-^'"^']
on taking the periodicity of e~niTiplq into account. (Here we have used
e~niq2p,q = 1, which entails pq even.) Thus
84 Some Number Theoretic Applications
q
ds + 0(Ve)
Vee^e + &) = 2 e-"*«[2Ve [ <T(r
q/ r-l
on using the integral test. We now take the limit as e—> 0+ and calculate that
/•CO /*CO
lim Ve e-^sqf™ ds = lim Ve e~£,r?V ds
e^0+ JO e^0+ Jr/g
= lim Ve f e-"**2 <b = -4=- f e~'2 * = T~ •
e^o+ Jo y/Trq Jo 2q
Since S(/>, q) = S*=1 e^"2''7', this gives the desired result. □
Proof of theorem. We now use the theta transform (2.3.1) to write
Thus the lemma shows that
7f S<* *> = '""* W ,¾ ^(6 / - 'j + °(e2))
= e~"'*Vp lim VF03(e' -^ + 0(e'2)) .
By Lemma 3.4 applied to -^ and p we deduce that
(3.5.4) ■^S(p,q) = e-M4~S(-q,p). □
Our final application is a partition result of Ramanujan's which relies on
the triple-product identity. (See Exercise 9 of Section 3.1 for the definition
of P.)
Theorem 3.7
(a) p(5n + 4) is divisible by 5.
(b) p(7n + 5) is divisible by 7.
Proof. With Q(q) = n^=1 (1 - q") as before, we write
CO CO
qQ\q) = qQ(q)Q\q)=Z 2 (-l)"+"(2m + 1)**
m=0 « = — eo
where /::=1 + (3n + l)n/2 + m(m + 1)/2. This uses the triple-product
identities (3.1.10) and (3.1.14) multiplied together. One now considers when k is
3.5 Some Number Theoretic Applications
85
divisible by 5. Since 2(n + 1)2 + (2m + 1)2 = 8fc(mod5), we must have
2(n + 1)2 + (2m + 1)2 divisible by 5. An inspection of residues shows that
this can only happen if 2(n + 1)2 and (2m + 1)2 are both divisible by 5.
Hence, 2m + 1 is divisible by 5 and so is the coefficient of qim 3 in qQ (q).
From the binomial theorem one deduces that
(3.5.5) (l-qy5 = (l-q5y\mod5)
in the sense that all coefficients are congruent. It then follows that Q{q)l
Q5(q) = l(mod5), and hence the coefficient of q5m+5 in
gQ(q5) = qQ\q)Q(q5)
Q(q) Q\q)
is divisible by 5. But
qQ-\q) = qQ^Q-\q5)
_ qQ(q5) rr (v sm«\
so that the coefficient of q5m+5 in qQ~x(q) is divisible by 5. However, by
Exercise 9 of Section 3.1,
qQ'\q) = q+2 p(n-l)q".
n=2
Case (b) is similar, but uses the square of (3.1.14) instead of the product
with Euler's series. (See Exercise 5.) □
Comments and Exercises
The identity (3.5.1) was discovered by Jacobi on April 24th, 1828. He also
subsequently observed similar number theoretic interpretations of the
formulae for 02 and 8\, and he gave an arithmetic proof of his theorem. The
identities can also be found in Gauss's unpublished work. A wealth of this
and similar information can be found in Dickson [71, vol. 2].
An analysis of the components of our proof of (3.5.1) shows that it is
rather simpler than that in Hardy and Wright [60], which proceeds from
rz(n), or in Rademacher [73], which uses elliptic function arguments. We
use only the triple-product identify and the AGM.
1. Show that r4(n) is 8 times the sum of the odd divisors when n is odd,
and r4(n) is 24 times the sum of the odd divisors when n is even.
2. Show that
86
Some Number Theoretic Applications
n=0 J n=0 1 —
1)?
4(2n + l)
8(2n + l)
and deduce that the number of representations of n as a sum of 4 odd
squares is 16 T.4dd,=n d.
d,d' odd
The argument of Theorem 3.5 is due to Ewell [83]. We give the general
formula for r2(n) in Chapter 9.
3. Prove Lemma 3.3. Consider odd and even cases and use the multip-
licativity of crx.
Gaussian sums arise naturally in the study of cyclotomic polynomials and
hence occurred to Gauss while studying constructible polygons. Apparently
this led Gauss to the lemniscate and thence to elliptic functions. The
formula (3.5.4) (with p'-=2) plays a key role in establishing the class
number formula for binary forms. Landau [58] is sufficiently taken by the
result that he presents three proofs. The result we give is due to Dirichlet.
The case with p := 2 is due to Gauss save for the "detail" of determining the
sign of the complex square root. (See Landau for the significance of the
sign.) Our proof follows Bellman [61].
4. Show Gauss's result. For ?>1,
i + '
This, generalized, leads quickly to a proof of quadratic reciprocity. (See
Apostol [76a].)
There is a host of more recondite modular results on partitions. (See
Andrews [76] or Hardy and Wright [60].) The rule of thumb that additive
number theory is generally harder than multiplicative theory is born out by
the relative paucity of partition information.
5. a) Show that {\- q)"p = {\- qpyx{mod p) holds for any prime p.
b) Prove Theorem 3.7(6).
There is a combinatorial proof of the theta transformation formula which
is suggestive of the arguments used in the sections on the Rogers-Ramanu-
jan identities, in that it produces a 'finite theta transform' and moves to the
limit. The proof is due to Polya. Again we follow Bellman [61].
6. a) (zin + z~ll2)2m = Zj ( ,)z for any z and integral m.
3.6 The Mellin Transform and the Zeta Function 87
2mll
b) Let w ■= e m , I a' positive integer. Then
-//2=3fc=3//2 *:=-|m//J v"tT»"'/
(Here [jcJ is the greatest integer less than or equal to *.)
c) Fix s and f with t real and positive. Let 1-= [(mt)ll2\ and z = es".
Then
■ <s+2irtk)l2l , -(i+2ir/fc)/2/-|2in
-ins,ks,in "- ^
2
2=3*
= 2 {
-//2=3*:=3//2 *■
/ i 1 .-/,\2 1 8/2(m/4/2)
//2=3*:=3//2 "- O*
|m//J
= 2
fc=-|m//J
d) Now let / go to infinity and use
i) lim (1 + -) =ex if lim ;c„ = x
. n ( In \ e
u) lim —nr I = —y= if
H^n=x-
e) Deduce from c) and d) the following form of the general theta
transformation:
co t~~— CO
V (s+2irik)2/4t _ /_£. V -/A:2+JA:
3.6 THE MELLIN TRANSFORM AND THE ZETA FUNCTION
We continue our tour through theta function theory with a discussion of the
Riemann zeta function. This also allows us to catalogue a few useful
properties of the Mellin transform for future use.
The Mellin transform is a specialized Laplace transform defined by
/•CO
(3.6.1) M(f):=Ms(f)-\0 f{x)xs-ldx.
For integrable functions with suitable behaviour at zero and infinity, Ms is
analytic in a strip a<re(s)<6. For example, the gamma function,
r(s) '■= $o e~xxs~x dx, is analytic in re(s) >0. A most useful identity is
/•CO /*CO
(3.6.2) jo /fry)**"1 dx = y"s Jq fW1 dx y>0.
88
Some Number Theoretic Applications
Under mild conditions, the Mellin transform is invertible and one can
identify two functions whose transforms agree for re(s) > 0. (Certainly this is
true if the functions are transformable and continuous.) This is an
appropriate place to state the following functional characterization of the gamma
function, whose proof is left as a guided exercise.
Theorem 3.8
The gamma function is the unique function /: (0, oo)—>[0, °°) such that
(1) /(1) = 1
(2) f(x + 1) = xf(x) for x > 0, and
(3 ) log f(x) is convex.
Many otherwise tedious facts are easy consequence of this functional
characterization of the gamma function. We list three whose proofs are left
as exercises. These are the functional relation T(s)r(l — s) = 7;7sin its
(1.6.6), the beta function formula /3(s, t) = T(s)T(t)/T(s + t) (1.6.7), and
the duplication formula
(3.6.3) T(2s) = 77--17¾2^1^)^ + i) .
We now derive the functional equation for the Riemann zeta function
£(s) '•= S^=1 n~s, re(s) > 1. We observe in passing that £ has an immediate
analytic continuation to re(s) > 0 simply by writing
CO CO CO
n=l n=l n=l
so that
(3.6.4) £(s) = -A^S 2 (-1)"+V* re(s) > 0 .
More interestingly, consider g{i) •"= [#,(*) - l]/2. For re(s) > \ we have
Thus
r(§) *«»-'" "JV2-1^)*
=[t^g(t)dt+for^y^dt
+ \fo(t^-iy>^dt.
3.6 The Mellin Transform and the Zeta Function
89
Here we have used the theta transform for 03 (2.3.1) to substitute for g on
[l,oo). This leads to
if we evaluate the third integral and replace t by lit in the second integral.
Since \g(t)\ = 0(e~*') as t—>°°, we see that the integral is an entire
function of s. Thus r(s/2)£(s) is analytic except for simple poles at s = 0, 1.
Since T(s) has a simple pole at 0, we see that £(s) is analytic except for a
simple pole at s = 1. In particular (3.6.5) gives an analytic continuation of
£(s) to the entire complex plane. Moreover, as Riemann discovered, the
right-hand side of (3.6.5) is invariant under the change of variable s '•= 1 — s.
Thus we have
(3.6.6) T(±)„-'nto) = T(±^)v-»-*nC(l-').
This is the celebrated functional equation for the zeta function.
Comments and Exercises
It is also possible to deduce the Poisson summation formula from the
functional equation for the zeta function. (See Bellman [61].)
1. a) Use Holder's inequality to show that T satisfies theorem 3.8.
b) Conversely, g(x) ■= log f(x) satisfies g(n + 1) = log (n!) and
x log n =s g(n + 1 + jc) — g(n + l)^x log (n + 1) .
Thus
0^(*)-log x{x + 1ynX.{x + n^*log(l+l)
and
(3.6.7) r(je) = lim , , "!" , , , = /(*) .
y v ' »-» x(x + 1) • • • (x + n) J v '
This argument is due to Bohr and Mollerup. Interesting extensions
can be found in Askey [80] and in Denninger [84].
2. a) Prove formulae (1.6.6), (1.6.7), and (3.6.3). In the latter case write
the ostensible identity in the form T(x) = f(x) and verify that /
satisfies Theorem 3.8.
b) Establish that T has an analytic continuation to the entire plane
with simple poles at the negative integers and zero, and with no
zeros.
90 Some Number Theoretic Applications
3. Let
e-<'+*2/4'> r1'2 dt.
Verify that f(x) and Vtt e~x have the same Mellin transforms by using
the duplication formula. Hence reprove the result of Exercise 4 of
Section 2.2. Alternatively, establish the duplication formula from that
exercise.
4. a) Use (3.6.4) and the fact that £{s) has a pole with residue 1 at 1 to
show that
b) Show that £(-2n) = 0 for positive integral n. The reflection formula
(3.6.6) at least hints of the centrality of the line re(s) = | in the
behaviour of the zeta function. The factorization, due to Euler,
&)= 11 (l-p'T1 ">(*)> 1
p prime
shows the connection between prime distribution and the zeta
function. The Riemann hypothesis is that all the nontrivial zeros of
£(s) lie on re(s) = \. The asymptotic distribution of the primes is
inextricably tied up with this famous conjecture. (See Rademacher
[73].)
5. Use Theorem 3.8 to establish Gauss's multiplication formula,
T(nx) = (2fl-)(1-"),2«"-1'2 it r(* + -) .
6. (Stirling's formula) The formula is
-^=4= = 1 + 0(1) s-*c°.
Outline: Substitute x = s(\ + u) in Ms(ex) and write
T(s + 1) = /+1 e~s J [(1 + «) e-"]s du .
Now replace u by tVTIs and obtain
T(s + l) = ss e"sVTs J__ ij,s(i) dt
for a (dominated) kernel ifrs(t) which approaches e~' uniformly for
bounded t.
3.7 Reciprocals of Fibonacci Sequences 91
3.7 EVALUATION OF SUMS OF RECIPROCALS OF FIBONACCI
SEQUENCES
Since the theta functions provide quadratic analogues for the geometric
series, it is natural to ask about their relationship to sums of the form
(3.7.1) 2 «„ ' where a„+1 := Man + Na
n-l
with a0 and at given (M, iV^O). The one-term (N = 0) recursion leads to
the geometric series. The two-term recursion leads to theta series and their
relatives. If a ¥^ /3 are the roots of x = Mx + N, then
,~ - ~ («i ~ P«o)"" ~ («i ~ aao)P"
(3.7.2) an = —
(Exercise 1) and
a + /3 = M a/3 = -N .
We will consider only the case in which N = ±1 and M is real. We may write
an+l = (2c)a„ + ea„_j, |e| = 1, and to assure that a, /3 are real, we assume
c> max {0, -e}. Then ap" = ±1 and
(3.7.3) <*:=c + Vc2 + e /3 := c-Vc2 + e.
We must consider summing series of the form
The following proposition provides the key. We define the Lambert series
(3.7.5) L(/3) :-2-7^ IJ31 <1.
Proposition 3.S
For 0 < /3 < a with a/3 = 1,
SO
(») ^^ = ^ = 7^)-1]
~, a2" + /32" „" 1 + /34" 4
and
92 Some Number Theoretic Applications
00 i <» o2n + l i
(»0 2 2„+l4,fl2n+1 = 2 ,; -4n+2 = 7[^3()8)-823(p2)]
„=o a + /3 „=o 1 + /3 4
' =^()62).
Similarly,
(iv) i-7r^ = i-^r = L()8)-L(J82)
so
(v) 2 2n ft2n=21J1^r = L(j82)-L(j84)
„=! a -/3 „=1 1- /3
and
(W) 2 2n+1 * a2n+x = 2 , P "4n+2 = ms) - 2L()82) + L(/34) .
„=o a -jo „=o 1- jo
Proof, (i) The first equality follows on multiplying top and bottom by
/3" and the second now follows from (3.2.27), the formula for 0\. (ii) Now
observe that a2 and /32 satisfy the hypotheses of (i). Then (Hi) follows on
subtraction. We leave (iv), (v), and (vi) as Exercise 2. □
Proposition 3.5 will allow us to sum E"=1 a~l for certain starting values.
Specifically, we can handle e = ±1 and A = ±B in (3.7.3) and (3.7.4).
case 1 (A = -B) Let a0:=0 and a1'=l. [This normalization comes by
setting A := (2Vc2 + e)-1.]
(i) (e = -1) Then a/3 = 1 and c > 1. Now
(3.7.6) 2 a;1 = 2V7^1[L(/3) - L(/32)]
n = l
as follows from (3.7.2) and Proposition 3.5 (iv). Similarly, one can
evaluate
2a-1=2V7^1[L(J82)-L(J84)]
n-l
and
2 a^, =2V7^1[L(/3) -2L(/32) + L(/34)]
3.7 Reciprocals of Fibonacci Sequences 93
(ii) (e = 1) Then a/3 = -1. Now /3 < 0 and
CO CO ^
2 ail =2V7TT 2 2n 2n =2V7TT[L()82) - l(/34)]
n-l n = l a "JO
while
(3.7.7) 2 a^+1 = 2Vc2+l 2 2n+i \ aim+i
n=0 n = l a + |j8|
= ^4^[03(|j8|)-02(j82)].
case 2 (A = B) Let a0 := 1 and at-= c. [This normalization comes from
setting A = \.]
(i) (e = -1) Then a/3 = 1 and c> 1. Now
2j a„ = 2
(3.7.8)
- i2/„2>,
f -1 03(/n-i
2, «2» = o
2
2/o2x /i2/o2x
'i-' fl2n+l 9 O
n-0 z z
f r iv -1 2fl2(/32)-fl2(/3) + l
2 (-1) a„ = = .
(ii) (e = 1) Then )8 < 0 and a | /31 = 1. It follows that
^, _, 02(j82)-l
(3.7.9) 2 a2xn = -^
and
CO
2 a2-„1+1=2[L(|)8|)-2L()82) + L()84)].
In certain cases the theta series involved above are particularly simple to
evaluate. For example, in Case 2(i), we have )8 = c — Vc2 — 1. Thus c =
()8 + j8-1)/2. If )8 := l(Tm, the series 03(/3) and 0,(/32) in (3.7.8) can be
evaluated entirely by writing down sequences of l's and O's. Thus for
94 Some Number Theoretic Applications
1c = 10m + 10-m, the sums in (3.7.8) can be evaluated at the same speed as
one can multiply n-digit numbers. (See Chapter 6.) Moreover, since theta
functions can be fast computed for any algebraic )8, the series are always fast
computable. (See Chapter 7 and Exercise 6.) Hence the series of Exercises
3, 4, and 5 are quadratically computable.
Finally, consider (3.7.7) for c: = sinh (tts) and with s>0. Then
an+1 :=2smh(irs)an + an_1, a0- = 0, ax '■= 1, and
(3.7.10) 5(,) := S a£+1 = ^^- [*&-') - *&-*»)].
If we use Theorem 2.1, we have
S_, cosh (tts) ( 1 - k' \ „,..
n = 0 ?> N z '
where
on using
For singular values of fc (see Section 4.6), this formula becomes particularly
pretty. [See Exercise 6b).]
Comments and Exercises
1. a) Show that when x = Mx + N has distinct roots a and )8, a +1 =
Man + Nan_1 is solved by (3.7.2).
b) If a = )8, show that
an = naxan~ — (n — l)a0a" .
2. Establish (iv), (v), and (vi) of Proposition 3.5.
3. a) Use (3.7.7) to show that for the Fibonacci numbers
Here F0 := 0, Fx := 1, and F„+1 := F„ + F„_j. This result is due to
Landau [1899], as is
b)
3.7 Reciprocals of Fibonacci Sequences
95
This is also discussed in Ribenboim [85].
c) The Lucas numbers are defined by the same recurrence,
Ln+1 := Ln + L„_j but with L0 :=2 and L1 := 1. (See Hardy and
Wright [60].) Show that
s^1-
n=0
3-V5^
e\[ ^-^) + 1
'4.
d) Note that xn '■= F2n+1 satisfies xn+1'-=3xn- xn_1, x0'=l, and
*! := 2. Similarly, xn ■= L2n satisfies xn+1 ■= 3xn - xn_x, x0'=2,
and *! :=3.
a) Verify (3.7.8) and (3.7.9).
b) Show that for an+1 :=4a„ - an_1, a„: = 1, a1 :=2, one has
2j an =
«=i z
and
V ( iv -i 2^(7-4V3)-^(2-V3) + l
2, (-1) a„ = 5
Let a0 := 0, ax := 1, and a„+1 := 9.9a„ + a„_!. Show that
2 2 ^ = 5.05
2f-
3Vio
)-3(^):
a) Recall that d\(q) = (2/ir)K(k), where ^ = e--*'/*(*)_ As shown in
Chapter 7, it is possible to quadratically compute k given q. Since
K is quadratically computable given k, we can fast compute 63(q)
given q.
b) Show that with S(s) given by (3.7.10), we have
5(1) =
cosh(Tr) /V2-1
2V2
M£)
^.g*fe^('-^^),^-¾
^-==^ l^^)K(^i>).
cosh (W3) /2-V2 + V3\ r^V2(V3-l)N
7T
Some Number Theoretic Applications
c) Show that S(s) = cosh (tts) E~=0 sech [(2n + l)irs], so that
V ur/o a.i\ 1 gs(e ) ~ g3(e )
2j sech [(2n + 1)-7T5,J =
and
d) Show that
2 sech (mrs) =
ei(e-") + i
n=0
Zj tt sech [(2n + l)ir] =
(2-V2)r2(|)
16Vw
Equations (3.2.28) and (3.2.29) can be used to derive closed forms for
a host of other reciprocal sums. (F„ and L„ are as in Exercise 3.)
a) Use (3.2.29) to show that
VI
„f0(2n + l)F2n+1 4
(-1)"
arcsin
03()8)
where 0:=(3-VI)/2.
b) Use (3.2.28/v) to show that
h 2^:=12-log
02()82)03()82)
2V)802()82)
where /3 := (VI -1)/2.
Use (3.2.28/) and transformation formulae to show that
i) X cosech2 (n™) = - + — (—-—)K2-KE
„ = 1 0 7T L\ 3 /
k-
11
ii) 2 cosech2 (2mrs) = — -\ 5
„ = 1 O 7T
l + k
/2
K£
~ 1 r/i + &'2\
iii) Z cosech2 [(2n + 1) its] = — (—-— }K2 - KE
„=0 7T L\ 2 I
d) Similarly, use (3.2.28v) to show that
i) Z/ sech2 (rnrs) ■■
n = l
7T
3.7 Reciprocals of Fibonacci Sequences 97
ii) 2j sech2 (2mrs) =
EK + k'K2 1
„ = 1 TT2 2
.... X1 ■ 2 r,~ *s ^ EK — k'K
in) 2j sech [(2n + l)irs\ = 5
e) Show that
i) z, cosech2 (mr) = -: — -r—
n = l 6 2tt
(2n + l)7r
2
J_
2tt"
ii) 2 sech2
n=0
f) Combine results of c) and d) to show that, with /3 := (3 - VI)/2,
CO
(3.7.12) 2 f;2 = & [e*(/8) - e*4(p) +1]
n = l
and
(3.7.13) El;2 = | [e*(/3)-i]
«=1
and deduce similar formulae for E~_x (—1)"F2„2 and for E"=1
(-1)¾.
g) Show that the Lucas numbers satisfy
co / °° \ 1 °°
El;2 = 2 El- h-El-
and show that a similar formula holds for all recursions of the form
an+i = (2c)an + an-i> aQ:=2c, and ^:=1.
h) Show that
co co , co ^2
3Ef„-2 + 5 2l;2 = 4(Ef2~„1+1 .
a) Show that
i (-ir+i #*=i f „-2.
/Zmf: Differentiate (3.2.28m) and compare the result to Exercise
7f), equation (3.7.12).
b) Show that
98 Some Number Theoretic Applications
CO CO
-2 2 (-l)mm cosech (2rmrs) = X sech2 [(2m + 1)tts]
and hence that
vs 2 (-1)-^ = 2/^.
^2n n=0
9. Show that
^ <?" _034(?)-l
«l2
«-i[g" + (-l)"]
y (-l)V _et(q)-l
and
ei(q)-et(q) + i
„-![9" + (-l)"+1]2 24
10. a) Show (by expanding both sides) that
L(,):=E /^ = 2^^-
n = l x ■* n = l x ■*
b) Hence show that all the series considered in this section can be
computed with at most 0(Vn) operations for n digits. (See
Chapter 6.)
A remarkable elementary result is
V 1 =kV5
«=o F2n+1 + Fk 2Fk
k = l,3,5...
This is a specialization of identities established in Backstrom [81]. (Related
results can be found in Carlitz [71].) Thus
^ 1 V5
A related formula is
«=o F2n+1 + 1 2
V 1 =7V5
«=o F2n+1 + 13 . 58
3.7 Reciprocals of Fibonacci Sequences 99
11. Show that
v 1 KE \
where q := (VI - 1)/2. Now k is close to 1, so that KE/2v2 is close to
K = e23(q) = e^e^ioric^-D/gj
2tt2 4tt 4 log [(V5 + 1)/2]
where the last equality follows from the theta transform. One sees that
KEHir1 + g is close to
1 1
+
4 log [(V5 + 1)/2] 8
which explains, in some part, Backstrom's formal manipulation in
Backstrom [81]; and which evaluates (3.7.14) as requested therein.
Almqvist [Pr] treats this sum and some relatives. Note that
gw /iog[( 5-1)/2] _ jq-9 sQ t^at^ as |n ajj ^ese Fibonacci series,
transformation considerably speeds convergence.
Lambert series occur naturally in multiplicative number theory, as the
following exercise shows.
12. a) Show formally that for any real valued function /,
CO fi CO
n = l -1 ■* n = l
where
F(n):=2f(d).
This is due to Laguerre. Hence show that
i) 2 r—r = 2 *»)*"
„=i i X „_j
where r(n) is the number of divisors of n.
ii) E ^Ar = E ^(n)^" fc = 1,2,. . .
n = l L x n-1
where <rfc(n) is the sum of the fcth powers of divisors of n.
b) Let
100 Some Number Theoretic Applications
'. f 1 if n is square
F(n):=\n ., . M
10 if n is nonsqu
; nonsquare.
Show that/(n) = (-I)26'where
m
n = n P;' (in prime decomposition) .
i=i
Hint: Use Mobius inversion. Since F is multiplicative, so is /. Thus
using Liouville's function e(n) : = (-l)s'»ie<
2 e(n) „
c) Show that
«,(*) - 1
n = l ^2« z
where 0:=(3-VI)/2.
Zucker [79] gives general formulae for sums of powers of hyperbolic
functions (in which the coefficients are defined recursively and have been
computed extensively by Ramanujan and Zucker). Using these one can
evaluate X~ml (-l)"+1F;4fc and 2~ml F;4k+2 in terms of 6 when
k := 1,2,3,. . . . There are similar Lucas number results, and if K and E are
used, many more sums are expressible. We give two examples:
i (-i)"+1l;4 = ^ {3 + [e*(/8) -1]2 - [eJ(/3) +1]2}
and
96
2^i = ^<&/8)[1-044(/8)].
Here, as before, j8 : = (3 — VI)/2.
13. Let a0:=0, u1 := 1, and a„+1 := aun + un_1. Let u0: = 2, vx := a, and
u„+i '-=avn + »„_!•
a) Establish that
2 ^¾ = «V +4) 2 ^-^- = | (VTTi- a) .
/fim: The second sum in a) can be made to telescope.
3.7 Reciprocals of Fibonacci Sequences
b) In particular,
°° / i\«+i
I
c) Compare
i) 2^7^ = ^-3)
and
ii) 2^-72=6(3-41082).
«-»1 "jt=i f
d) Show that
„=1 z,^.! a*. * . . . .
Witxi - a
Jt=l "-tit + l
= UV7T4-a) + 2a2 .
^ fc=l M2*a2*: + 1
Chapter Four
Higher Order Transformations
Abstract. We develop algebraic transformations of prime order for the
elliptic integrals. For small numbers this can be managed purely algebraically.
However, the development of modular equations for arbitrary primes is
most comfortably effected via transcendental methods. This requires some
rudimentary modular function theory. The cubic equation is studied in
particular detail.
4.1 A FIRST APPROACH TO HIGHER ORDER TRANSFORMATIONS
The fundamental relation from Theorem 1.2
(4.1.1) KTO = TA^(f^)
is remarkable for a number of reasons. One notable consequence is the ab
initio unlikely observation that when k is algebraic and
(4.1,) «:-?£
the values of the transcendental function K at I and k are algebraically
connected. Equation (4.1.2) is one form of the quadratic modular equation.
It can be rewritten as
(4.1.3) /2(1 + A:)2-4A: = 0.
We will develop a class of algebraic equations (modular equations) that
induce algebraic transformations on K in a similar fashion.
We commence by sketching, a la Cayley [1895], a purely algebraic
102
4.1 A First Approach to Higher Order Transformations
103
approach to the modular equation. It transpires that this approach becomes
unduly complicated for all but a few simple cases and is hard to use as a
rigorous basis for the general theory. Thus in Sections 4.3 and 4.4 we will
derive the general theory using function theoretic techniques.
We are looking for a relation of the form
(4l4) M(l, k) dy = dx
V(i-/)(i-/V) V(i-*2)(i-fc2*2)
where I and k satisfy a polynomial equation O in two variables. Such an
algebraic connection between the two moduli (/ and k) is called a modular
equation or modular transformation for k. Specific but equivalent modular
equations for algebraically related functions are defined in (4.4.2), (4.4.6),
and (4.5.1). The function M will be an algebraic function of k and I and is
called the multiplier. With this in mind, let P and Q be polynomials in x2 so
that deg(P ±xQ)\\±x) = n (n odd) and write
(4.1.5) W_(P-*G)21-*
l + y (P + xQy l + x
The important condition to impose on P, Q, and l-=l(k) is that (4.1.5)
must be invariant when (x, y) is replaced by (1/kx, l/ly). (See Exercise 1.)
We now try to solve for P and Q. Set
U := x(P2 + 2PQ + x2Q2) V : = P2 + 2x2PQ + x2Q2
(4.1.6)
A := P ~xQ B- = P + xQ
and observe that
y = UIV
(4.1.7a) 1 - y = (1 - x)A2IV 1 + y = (1 + x)B2IV.
Also, from the invariance of (4.1.5),
(4.1.76) l-fy = (l-fcc)C2/y 1 + ly = (1 + kx)D2IV
where C and D are polynomials of the same form as P and Q. Thus we
deduce that
(4.1.8) "y - ^-^ *
V(i-/)(i-/V) ABCD V(i -*2)(i-*V)
It can now be computed (Exercise 1) that
104 Higher Order Transformations
1 UV-VU = Q(0)
(4 9) M(/, k) ~ ABCD ~ P(0)
is independent of x. Let us return to (4.1.5). For n = 4p + 1 we must have
(4.1.10) degP = 2p degQ = 2(p-l).
For n = 4p + 3,
(4.1.11) degP = 2p degQ = 2p.
Since P and Q are even functions, we have in either case \{n + 1) degrees of
freedom. It remains to use the invariance in (4.1.5) as (x, y)-*(l/kx, l/ly).
This leads immediately to the equation
(4112) ViMk*) uvl
since y = U/V. Thus we are reduced to solving
(4.1.13) (P2 + 2x2PQ + x2Q2)* = yjj k(n~1)l2{P2 + 2PQ + x2Q2)
where the * operation is defined as follows. If
S'-=a0 + axx + • • • + amxm
then
5* := a0(kx)m + a^kx)"1"1 + ■ ■ ■ + am .
Equating coefficients in (4.1.13) leads to a system of \(n + l) nonlinear
equations. There are \{n + 1) degrees of freedom in the coefficients of P
and Q. Since the equation is homogeneous in P and Q, one of the
coefficients may be assumed to be 1. This leaves one additional condition to
be satisfied by k and I and leads to the desired unique algebraic relation
between k and I. The pitfalls of this approach are now, of course, apparent.
We end up with a large system of nonlinear equations that are virtually
impossible to solve or analyse directly.
We illustrate with the cases 3 and 5.
cubic transformation (n = 3). We have P — l and Q = a. Equation
(4.1.13) becomes
k2x2 + 2a + a2 = ^ k{\ + 2a + a2x2) .
4.1 A First Approach to Higher Order Transformations 105
This leads to two equations,
*« yjl a2
(4.1.14)
2a + a2=yJ- k(l + 2a).
This is solved parametrically by
(4.1.15a) k2-^^- and /»- «(2+«>'
2a + 1 (2a + if
with similar expressions for (k')2 and (/')2, namely,
,2 (!-«)(! +a)3 2 _(! + «)(!-a)3
" 2^n and l (2«+i)3
From this one can deduce that
(4.1.16) Vkl + VkT^l
or, equivalently,
(4.1.17) (k2 - I2)4 = I28k2l\l - k2)(\ - l2){2 - k2 - I2 + 2k2l2).
In the a:
namely,
In the associated variables u : = k114 and i; := I1'4 this has a simpler form,
(4.1.18) u - v4 + 2u»(l - wV) = 0.
(See Section 4.5.) The multiplier M has any of the following forms:
(4.1.19) M =
1 i; 2u3 —w
2a + 1 u + 2w3 3«
(See Exercise 2 of this section and Exercise 2 of Section 4.6.)
Quintic transformation (n = 5). We have P = 1 + j8*2 and Q = a.
Equation (4.1.13) leads to the three equations (A2 := k5ll)
0a-A
(4.1.20) k2(2ap + 2f3 + a2) = A(2a + 2)8 + a2)
A:4(2a + l) = A(j82+2aj8).
This eventually solves, with u := k1'4 and v'—l11*, as
106 Higher Order Transformations
(4.1.21) u - v6 + 5wV(w2 - v2) + 4wv(l - wV) = 0 .
The multiplier M has any of the forms
1 v(l - uv3) u + v5
(4.1.22) M=__»__»__^.
The transformations are called cubic, quintic, and so on, because of the
underlying order of the transformation (and so of convergence).
Comments and Exercises
The very classical approach of the section to cubic and quintic modular
equations is due to Jacobi [1829]. This algebraic approach was extended to
the septic (n = 7) case by Cayley [1874], who also treated the endecadic
(n = 11) case partially. The calculations are formidable. We have followed
Cayley closely in this discussion. The associated variables u'=k114 and
V-—1 /4 of Jacobi's considerably simplify the calculations, as we will see in
Section 4.5.
In order to find the relationship between K(l) and K(k) implicit in (4.1.4)
we must show that the underlying transformation (4.1.5) is one to one and
onto on the interval [0,1]. This can be done directly for n = 3 (Exercise 3)
and other small n. However, as with most of the details, it is easier to use
the general transcendental approach of Sections 4.3, 4.4, and 4.5.
1. a) Show that an equation of the form (4.1.5) that is invariant under
the change of variables (x, y)-*(l/kx, 1/ly) exists. Hint: Set
_ x n * - x2/a2
Ml i 1 - k a,x
Replacing (x, y) by (1 Ikx, 1 lly) gives
n^
,2 2 2
k a,x
ly xMk"UtayiX \-x2la2
which holds, provided that
/ = M2r(n«,)4.
[See also (4.1.13).]
b) Establish (4.1.7b) and exhibit C and D.
c) Establish (4.1.9).
Hint: Let U and V be as in (4.1.6). Let y-=UIV and let
Y(a, b) := (a2 - b2)(a2 - l2b2). Then
4.1 A First Approach to Higher Order Transformations 107
Y(V, U) = (V2 - U2)(V2 - l2Tl2)
and
dy _ UV- UV dx
^/Y(Xy)~ VWTU) '
Now observe that any square factor (x — a)2 of Y(V, U) is a linear
factor of VU - UV Since
and
V2 - U2 = (1- x2)A2B2
V2 - l2U2 = (1- k2x2)C2D2
the fact that (UV- VU)IABCD is independent of x now follows.
The explicit form of the multiplier M(l, k) is derived by setting
x = 0 in 0V - VU) IABCD.
a) Establish the four forms of the cubic modular equations (4.1.15),
(4.1.16), (4.1.17), and (4.1.18).
b) Complete the calculation of the quintic modular equation (4.1.21).
Show that the n = 3 relation between x and y underlying the cubic
transformation is one to one and onto on [0,1], and hence (4.1.4) can
be integrated over [0,1].
(An explicit cubic algorithm for K) The cubic modular equation
(4.1.18) is of degree 4 and, hence, can be solved explicitly for u in terms
of v.
a) Show, for i; e (0,1), that u E. (0, v) is a solution of (4.1.18), where
v3 D-i?
y +
and
S :=^(1-08)
R:=V7+S
,„ 6 „ 4»-2d9
R
b) Show, for u G (0,1), that i; G (u, 1) is a solution of (4.1.18), where
108 Higher Order Transformations
u D + R
„ —y +
and
S:=V4w2(l-w8)
i?:=Vw6 + S
n ■ /o« „ , 4w - 2w9
D •= \/2w - 5 +
c) Show, for uG(0,1), that there is a unique wG(0,u) so that
(4.1.18) is solved by (w, v). We can define an iteration as follows.
For u, G (0,1), let vi+1 G (0, u,) be such that u '= uJ+1 and 1; := vt
satisfy the modular equation (4.1.18). Show, using a) and b), that
(4.1.23) vn+1 = v3n- ]/v6n + y/4v2n(l-v*n) + vn_1
where v0 G (0,1) and v1 G (0, v0) is computed from v0 by Exercise
4a).
d) Show that, for v0 G (0,1),
(4.1.24) *(„;)-|n(1 + M)
(Use Exercise 3.)
e) Show that vt tends to zero cubically. (For further details see
Borwein and Borwein [846].)
5. a) Let «6(0,1) and uG(w, 1) satisfy the cubic modular equation
(4.1.18). Show that
(4.1.25) ^=3"F5V-
K(u ) K(v )
Hint:
b) Let u G (0,1) and 1; G (u, 1) satisfy the quintic modular equation
(4.1.21). Show that
4.2 An Elementary Transcendental Approach 109
(4-L26) K(u*) ~5 K(v*) ■
Hint: Show that if (u, v) : = (k1H, I1'4) satisfies (4.1.21), then so
does (u, v) = (I'1'4, k' /4). Now Exercise 1 of Section 1.5 shows that
the two sides of the above equation differ by a constant. Use the
logarithmic asymptotic at 0 and the relationship between u and i; as
u-»0 to evaluate this constant.
These important identities will be revisited in Section 4.4.
6. a) Verify Schlafii's form of the modular equation of degree 5,
<«"> (f)'+(i)'-^-^)
where u := 2~1/4/(t) and v := 2~1m/(5t), / as in (3.2.9).
b) Compute the corresponding equation for /j.
4.2 AN ELEMENTARY TRANSCENDENTAL APPROACH TO
HIGHER ORDER TRANSFORMATIONS
In terms of the nome q we have, by Theorem 2.3, the identification
(4.2.1) KqY^k^^fi-
(4.2.2)
k'(q)- =
K(k) =
¢ =
k'
■ e~
el(q)
e\(q)
■r,K'(k)IK(k)
(4.2.3)
and
(4.2.4)
From Exercise le) of Section 1.4 and Exercise 5 of Section 4.1 we see that
the quadratic modular equation (4.1.3) is satisfied by l'-—k(q112) and
k:= k(q), while the cubic equation is solved by I := k(q113) and k'-~ k(q)
and the quintic equation is solved by I := k(q /5) and k'-— k{q). [To see this
just observe that (4.2.4) uniquely determines q.] In general we will see in
the next sections that the pth-order modular equation for k is a polynomial
in two variables with integer coefficients that is satisfied by k(qp) and k(q).
We observe from (4.2.4) that for these algebraically connected moduli
110 Higher Order Transformations
(4-2'5) P K(k(q)) K(k(q")) '
Before turning to the general theory we wish to give an elementary
derivation of the cubic transformation in theta function terms. From
(4.1.16) the cubic modular equation for k is
(4.2.6) Vkl + VFF=l.
From the preceding discussion this is seen to be equivalent to:
Theorem 4.1
(4.2.7) d4(q)d4(q3) + 62{q)92{q3) = e3(q)63(q3) .
Proof. From the definitions and Exercise 1 of Section 21 applied to
n2 + 3m2 „_j / 1\n+m n2+3m2 u„.,„
q and (—1) q we have
and
Now
(ft + / + l)2 + 3(ft-/)2 = (2ft-/+i)2 + 3(/+i)2.
Thus subtraction of the two theta identities produces
^)^)-04(4)04(^) = 2 i ^+^3(-^
m,« = —co
m+n even
= 62{q)62{q3)
(as replacing m by 1 — m shows). □
Comments and Exercises
This is as far as we wish to pursue the transformation theory on an ad hoc
basis. The next section introduces enough of the theory of modular
functions to provide a general framework for the development of modular
equations. We have only considered modular equations for p a prime. If we
view the modular equation as the algebraic relation between k(q") and k(q)
and can find this relation for p a prime, then for composite n a relationship
4.2 An Elementary Transcendental Approach 111
can be constructed out of the modular equations corresponding to the prime
factors of n. Rational n are treated similarly.
1. Construct modular equations of order 4, 6, and 8 [that is, construct
algebraic relationships that are satisfied by k(q) and k(q2), k(q6), and
k(q8), respectively].
2. Show, for
that
and that
c:=f Y(k(q))
0<k(qp")<4e-cpn ¢£(0,1)
k(qp )~4e~cp as n-*°°.
Many modular identities follow from:
3. (Schroter's formula) Consider a general theta function written as
CO
T{x, <?):= X x"q"
where x ¥- 0, | q\ < 1 (as in Section 3.1). Let a and b be positive integers,
a) Show that
(4.2.8) T{x,qa)T{y,qb) =
a + b-l
Sk bk2rr, 2bk a + b\rr, a -b 2abk ab(a+b)\
y q T(xyq , q )T(y x q ,qK >).
Hint: Write
x,q )T(y,q )= Z x y q
m,n
Let s be chosen so that n = m + (a + b)s + /:(0 <k<a + b) and let
u'-~ m + bs. Then w and 5 range over Z as m and n do. Also
m n / \u/ -~b a\s k
x y ={xy)(x y)y
and
am2 + bn =(a + b)u2 + 2bku + ab(a + b)s2 + labks + bk2 .
Now rearrange. (See Tannery and Molk [1893].)
112 Higher Order Transformations
b) A form of the seventh-order modular equation is
V03(<7M(<77) - y04(q)64(q7) = \J 62{q)62{q1) .
(See (4.5.4).)
Use part a) with a '■= 7 and b'-=\ to establish this formula.
Hint: Set x-=y-=±l to find a formula for 63(q)63(q1) +
04(q)64(q1). Now set x'= y1''■— ±q7 to similarly write
^.lq)^(q) + 62(-q)S2(-q1)'=02{q)e2(q1). On making simple
rearrangements this yields
0,0,(47) + OM q7) = 2T(l,q8)T(l,q56)
+ 2quT{q8,q8)T{q56,q56)
+ 4q*T(q\q8)T(q28,q56)
and
d2d2{q1) = 2q2T{q8, q8)T(l, q56) + 2quT(l, q8)T(q56, q56)
+ 4q4T(q\q8)T(q28,q56).
Since q2T(q8, q8) = ^(q8) and T(l, q8) = 63(q8), we may write
030,(<77) + 0,6^)-^0^) =
2[03(q8) - 02(q8)][63(q56) - 62(q56)] = 284(q2)d4(qu) .
Thus an application of equation (2.1.7ii) gives
(V0303(<77) - V0404(47))2 = 0202(47)
as required,
c) Establish the cubic modular equation (4.2.7) as above.
4.3 ELLIPTIC MODULAR FUNCTIONS
The theory of elliptic modular functions and more general automorphic
functions is, in part, a natural extension of the theory of elliptic functions.
The basic defining property of elliptic functions is their invariance under a
group of linear transformations. Automorphic functions are functions
meromorphic in the upper half-plane $f: = {im(t) > 0} that are invariant
under a group of linear fractional transformations. We will, of necessity,
explore only the rudiments of this remarkable and difficult theory.
4.3 Elliptic Modular Functions
113
Definition 4.1
(a) The (inhomogeneous) modular group T (J-group) is the set of all
transformations of the form
w ■■
at+ b
ct+ d
a, b, c, d integers, ad — be = 1
(b) The A-group is the subgroup A of T with a, d odd and b, c even.
That both of the above are groups (under composition) is straightforward.
The transformation
at+ b
w --
ct + d
can be represented as either of the two matrices
=(::)
and the group product becomes matrix multiplication (See Exercise 1.). The
(homogeneous) modular group SL(2, Z) distinguishes these matrices. Note
that any element of the modular group fixes the real axis and maps $f onto
itself.
Definition 4.2
(a) The set Fr of t e X* := {im(t) > 0} U {»«>} U {0} is denned by
Fr := {|re(0| < \ and |f| > 1} U {re(t) = - \ and |f| > 1}
U{|f| = land
; re(f) < 0}
(b) The set FA of t E "M* is defined by
*"a := (lre(0l < 1 and \2t ± 1| > 1} U {re(0 = -1} U {\2t + 1| = 1} .
These two sets described in a) and b) are fundamental sets for the T- and
A-groups. The interiors of these two sets (F° and F°x) are fundamental
regions in the following sense.
114 Higher Order Transformations
Theorem 4.2
(a) Every point in "M is the image under some element of T of some point
of Fr.
If A e T is not the identity, then A(F°r) D F°r = 0.
(b) Every point in $f is the image under some element of the A-group of
exactly one point of FA.
If a e A is not the identity, then A(F°X) D F°x = 0.
The proof of this theorem is elementary though not entirely
straightforward. (See Exercise 2.) Any set F (with interior F°) which satisfies (a) or (b)
of the above theorem is also a fundamental set.
Definition 4.3
(a) A T-modular function is a function / which satisfies:
(/) / is meromorphic in "M.
(ii) f(A(t)) = f(t) for all t e $f* and A e T.
(Hi) f(t) tends to a limit [possibly infinite in the sense that 1//(£)-» 0]
as £ tends to the vertices of the fundamental region Fr where the
approach is from within the fundamental region F°. [In the case
of i°° the convergence is uniform in re(x: + iy) as .y-*00.] The
vertices of the fundamental region are (0, 1), (-1/2, V5/2) and
i°°. Since / is meromorphic in "M, this condition is automatically
satisfied at (0, 1) and (—1/2, V5/2) and need only be checked at
(b) A k-modular function is a function / which satisfies (/), (ii), and (Hi)
above with the T-group replaced by the A-group. For condition (Hi)
the vertices of the fundamental region F° are (—1, 0), (0, 0), and /°°.
Our notation is not entirely standard. What we have termed T-modular is
often just called modular or automorphic with respect to the T-group, while
what we have labelled as A-modular is often referred to as automorphic or
modular with respect to the A-group.
The existence of a A-modular function is the content of the following
theorem.
Theorem 4.3
The function
A(0:=*2(0 =
W1 Q-e
is a A-modular function.
4.3 Elliptic Modular Functions 115
Proof. From Corollary 3.1 we have
(4.3.1) A(0 = l6gn(1 , f„-i)
and it is clear that A is meromorphic in $f. For the invariance of A under the
A-group it suffices, by Exercise le), to show that
(4.3.2) A(* + 2) = A(0
and
(4.3.3) <27TT) = ^-
The first equation follows since emt = em(f . The second equation is a
consequence of (2.3.1), (2.3.3), and (2.1.10), which combine to yield
(4.3.4) A(-i) = l-A(0.
(Note that t = is.) Hence
A(27TT) = A(2TT7i) =l~ A(-2- ?) =l~ A(- ?) = A« ■
Finally we observe that, in a limiting sense,
(4.3.5) A(ioo) = 0 A(0) = 1 A(±l) = oo.
The first value is immediate from (4.3.1), while the value at zero can be
calculated from (4.3.4). (Observe that as f-»0 in Fx, l/f-»°° in FA.) The
value at 1 is computed from Jacobi's imaginary transformation
(4.3.6) A(f + 1)= A(f)
A(0 - 1 •
(See Exercise 4.) D
Some additional properties of A are established in Exercises 4, 5, and 10.
From (4.3.4) and (4.3.6) one can prove, as in Exercise 6, the following
theorem.
Theorem 4.4
The function
A_ [1-A(Q + A2(Q]3 = ± {l-[k(t)k'(t)]2}3
U' 27 A2(0[1-A(0]2 27 [k(t)k'(t)]*
is T-modular. J is called Klein's absolute invariant.
116 Higher Order Transformations
The basic result we need is a version of Liouville's theorem.
Theorem 4.5
A T-modular function that is bounded on FT is constant. Similarly, a
A-modular function that is bounded on Fk is constant.
Proof. Suppose/is T-modular and is bounded and nonconstant on FT.
By Theorem 4.2 this implies that / is analytic on $f. Consider f(t) -/(foo).
This function has no poles interior to Fr, and so achieves its maximum
modulus at some finite point on the boundary of Fr. By the invariance of
/— /(i°°) under T this is a global maximum at an interior point of $f, which is
impossible.
For the second part consider the A-modular function
[/(0 - /(0)][/(0 - /(-i)][/(0 - /(*-)] ■ □
This is sufficient theory for our discussion of modular equations in the
next section.
Comments and Exercises
This is only the very tip of the iceberg. We have restricted our attention to
two particular groups where we can directly establish the existence of
modular functions. In general this restriction is unnecessary. Only slightly
further into the theory are results such as: any modular function takes each
complex value the same number of times in the fundamental region. An
important consequence of this is that A takes every value exactly once in Fx
and A has a well-defined inverse that has branch points only at 0, 1, and °°
(Exercise 10). J has similar properties on Fr. One can now prove, much as
for elliptic functions, that two nonconstant functions which are modular with
respect to the same group are algebraically connected. Furthermore, if one
of these functions is univalent on the fundamental region, then the other is a
rational function of it.
This wide-ranging and difficult body of theory that is intimately tied in to
many questions in number theory and algebraic geometry may be pursued in
any number of texts, such as Apostol [76b], Chandrasekharan [85], Lang
[73], Lehner [66], Rankin [77], or Schoeneberg [76].
Two of the seminal papers of the subject, both dating from 1882, are due
to Klein and Poincare (selections of which may be found in Birkhoff [73]).
Poincare was interested in studying linear differential equations with
algebraic coefficients. Klein, who considered this his main field of work, in
keeping with his Erlanger Programme had more algebraic and geometric
interests (Klein and Fricke [1892]).
4.3 Elliptic Modular Functions
117
a) Verify that T and A are groups.
b) Verify that composition of transformations is equivalent to
multiplication of the associated matrices.
c) Show that two transformations represent the same function if and
only if they have the same coefficients (associated matrix up to
sign).
d) Show that T is generated by
Sr:-(J j) and rr:=(J _J).
e) Show that A is generated by
Prove Theorem 4.2.
a) Hint: To prove part (a), fix z E. ~Mand let A = ( j e T. Show
that
im(z)
im A(z) =
cz + d\
Pick any element of Y that minimizes \cz + d\ and let w be the
image of z under this transformation. By considering Tr(w) show
that |h>|>1 and by considering 5^(^) show that for some k,
|re[5w(iv)]|<|. For this k, |5w(iv)|>l. Thus every element of
X is the image of some element of FT.
b) Show that no two elements of F° map to each other under an
element of T. Examine the image of Fr under 5r and Tr.
c) Deduce part (b) of Theorem 4.2 from part (a).
The upper half-plane can be tesselated by images of the fundamental
region under the generating transformations. Sketch pictures of the
tesselations associated with the T-group and the A-group.
a) Show that X(A) is transformed into one of A, 1 - A, 1 /A, 1 /(1 - A),
A/(A-1), 1-1/A by any AST. Hint: Examine X(t+l) and
A(— l/t) using similar arguments to those of Theorem 4.3. Use
Exercise Id).
b) Show that X(m/n) = 0, gcd(m, n) = 1, if and only if m is odd and n
is even.
Show that with q : = e'
16 l 4- Y hn"
118 Higher Order Transformations
where the bn are integers. Show that 1/A(0 is finite at every point of
6. a) Prove Theorem 4.4 from (4.3.4) and (4.3.6).
b) Let q:= elir". Show that
/(0 - 1728/(0= \ + i.cnqn
where the c„ are integers. [In fact, qj{t)= Qf{256q + Q^f.\
c) Show that f\t), -fT(t), and -f2\t) are the roots of
(x - 16)3 - *;(0 = 0 .
[See (3.2.9) for definitions.]
7. a) Show that if / is T-modular then for some integer k and nonzero
constant c
/(0 ~ ce
as f-» foo.
tfinf: Let /(^):=/(0 where q := e2vil for f in Fr. Show, by
modularity of /, that / is meromorphic in q in a neighbourhood of
zero. Show also that/has a pole at zero. Thus/has a convergent
^-expansion at i°° with finite principal part,
b) Show that the T-modular functions form a field. Likewise the
A-modular functions.
8. Establish that /f, f\\ and /24 [see (3.2.9) for definitions] are A-
modular.
9. The Schwarz derivative of any function / is
sm-=2//~3(/)2
Show that if / is modular, then so is 5(/)/(/)2. Show that/is not in
general modular.
10. {On the inverse of A)
a) Show that A maps the set A := {re(z) = -1, 0<im z) one to one
onto (-oo, 0). Hint: X(t ± 1) = A(0/[A(0 - 1]. Now consider A on
the imaginary axis.
b) Show that A maps the semicircle B '■= {\z + \\= \, im(z)>0} one
to one onto (1, oo). Hint: \(-\lt) = 1 - A(0-
c) Show that A maps the interior of Fx one to one onto C-
4.4 The Modular Equations for A and j
119
d)
{(—oo} 0] U [1,0°)}. Hint: The number of zeros of A — c is
Itri h
A'(0
t A(f) - c
dz
where 7 is a contour of the form seen in the accompanying figure.
Use the invariance X(t) = X(t + 2) to estimate the integral on the
sides of the contour. Use the relation A(— 1H) = 1 — X(t) to relate
the integral on BC to the integral on ED. Use the relation
k(t ± 1) = A(f)/[A(f) - 1] to estimate the integral on CD and DC
in terms of BAB and then (by t-*—llt) in terms of EE. Finally
take limits.
Thus with respect to Fr, A has a well-defined analytic inverse with
branch points at 0,1 and 00.
11. (Picard's theorem) Show that a nonconstant entire function assumes
every complex value except possibly one.
Hint: Suppose F does not assume either a or )8, then G(z)-=
[F(z) — a]/(j8 — a) never assumes 0 or 1. If co '■= A-1, then co(G(z))
is entire (by analytic continuation). Show that (i)(G(z)) is constant
since co(G(z)) G {im(z) 5:0} and hence ea(G(z)) is a bounded entire
function.
It is worth observing that the apparently special case analysis of the
function A leads directly to the celebrated general theorem of Picard.
4.4 THE MODULAR EQUATIONS FOR A AND j
A transformation of order p is a matrix
(4.4.1) ( , J a, b, c, d integers, ad-bc = p
or the associated linear fractional transformation. We assume throughout
120
Higher Order Transformations
that p is an odd prime, though for much of the development this is
unnecessary. We will denote the set of all such transformations by Tp. We
say that M is equivalent to N mod G (M = N mod G) for a group of
transformations G if there is an S E.G so that M = SN. We need the
following purely algebraic result.
Lemma 4.1
(a) Every M G Tp is equivalent mod T to one of the p + 1 transformations
of the set si, where
"■-W-Co ?)■ *■■■< ;).<-<u.....,-i}.
(b) The p + 1 elements of si are pairwise inequivalent mod I\
(c) Every B of the form B '■= BtC, where C is in the A-group and Bt G 08,
is equivalent mod A to some element of S3, where
•■■-{>A',, ;)■ *■■< 2;)' '-<m.-.»-i}.
(d) The p + 1 elements of S3 are pairwise inequivalent mod A.
The proof is left as Exercise 1.
Theorem 4.6
(a) The p + 1 functions
/(A,.(0) i = 0,..., p
are permuted by any element of the T-group.
(b) The p + 1 functions
AOB,(0) i = 0,..., p
are permuted by any element of the A-group.
Proof. For part (a) we must show that {/° At ° S}?=1 = {J° A,.}f=1
for any 5 G T. We first observe by the lemma that if S G T, then since
AjS^-^modr for some;
Thus by the modularity of J,
JoA^S^JoAj
4.4 The Modular Equations for A and j
111
which with part (b) of the lemma finishes the proof. The second part is
identical via parts (c) and (d) of the lemma. □
The modular equation for A of order p is the polynomial
p
(4.4.2) W,(*, A): = IT (* - A,) A,: = A ° B, ■
i=0
This is obviously of degree p + 1 in x and has a root at each A,. Note that
Xp(t):= X(pt) and Xt(t) = X((t + 2i)/p), i<p. Thus as functions of q,
Xp(q) = X(qp) and Xt(q) = X(a'qllp), where a" = 1. We now show that
(independent of t) Wp is also a polynomial in A. This relies on two basic
facts. First, any symmetric polynomial in the A, is A-modular and second,
any A-modular function is a rational function of A.
Theorem 4.7
Wp(x, A) is a polynomial of degree p + 1 in x and A with integer coefficients.
The coefficients of xp+1 and Ap+1 are both 1.
Proof. Consider Wp (x, A): = Of=0 ( y - Xt), where A;: = 16 /X, and
y •= 16/*. This is a convenient form to work with. Observe that Wp and Wp
are connected by
(4.4.3) l6p+1Wp(x, A) = \xp+1 J! x]wp(x, A) .
L i'=0 J
Now by Theorem 4.6 any symmetric polynomial in Aq , Aj ,..., A is
left invariant by any element of the A-group. (See Exercise 6 of Section
11.2.) It follows that any such polynomial is A-modular, and by Exercise 4b)
of the last section A, is finite valued in the fundamental set except possibly at
t := ioo. In particular if st is the coefficient of y' in Wp (viewed as a
polynomial in y), then st is A-modular. From Corollary 3.1,
(4.4.4) X(q) = 16qTl{.+ 1-J
„=i \1 + q I
In \ 8
I
<r
and we have integers an such that
- 16 1 v
A:= — = - + 2, a„<5
A 1 n = 0
and
A,. = -7iT7F + Ea„«V/p i<P
a q „=o
1
r „=o
122
Higher Order Transformations
where a is a primitive pth root of unity. It can now be established that there
are integers ci so that
CO
(4.4.5) 5,.= 2 cAl.
/=-(P+i)
It is a consequence of the symmetry that the nonintegral powers of q vanish.
(See Exercise 2.) Next there is a polynomial P of degree at most p + 1 with
integer coefficients so that
CO
5;-P(A) = 2<W
«=0
and 5, — P(A) has zero principal part. This is easily proved. First remove the
^-(p+i) term ^ considering
Tp + l
and then proceed inductively. Obs_erve from (4.4.4) and (4.3.5) that the
only candidate for a pole of st — P( A) is q = 0 but P has been chosen so that
5; —P(A) is finite at q = 0. Thus we see that s, — P(A) is a bounded
A-modular function and is hence, by Theorem 4.5, constant. Since
we have that 5,. is a polynomial of degree at most p + 1 in A: = 16 /A with
integer coefficients. Hence Wp(x, A) is a polynomial of degree p + 1 in 16/*
and 16/A with integer coefficients. We can prove directly (see Exercise 3)
that
ru=A'+i
i=0
Exercise 7 shows that 16p+1 divides every coefficient of Wp(x, A) when
viewed as a polynomial in l/x and 1/A. Thus with (4.4.3),
xp+1xp + 1
Wp(x, A) = w+l Wp{x, A)
is of the required form. □
Analogously we have a modular equation for ;':= 1728 J. (See Exercise
4-)
4.4 The Modular Equations for A and j
123
Theorem 4.8
The modular equation for j of order p
p
(4.4.6) Fp(x, j) :=U(x~ jt) jt : = ;»At
is a polynomial with integer coefficients of degree p + 1 in x and /. The
coefficients of xp+1 and/p+1 are both 1.
The modular equation (4.4.2) is irreducible over C(A) (the rational
functions in A) since any root can be transformed into any other by an
appropriate transformation. (See Exercise 5.) Likewise (4.4.6) is irreducible
over the rational functions in j. The Galois group of Fp over ®p(j) is a
group of order p(p2 — 1), which is nonsolvable for p^5. (See Exercise 10.)
Here Qp is Q adjoin the pth roots of unity. For nonprime n, modular
equations can be constructed from the modular equations corresponding to
the prime factors of n. (See Exercise 8.)
Comments and Exercises
Further properties of modular equations are chronicled in Lang [73],
Schoeneberg [76], and particularly in Weber [08]. We have chosen a path of
limited generality focusing on the modular equations for A and j. It should
however be fairly clear that analogous equations hold for other modular
functions.
1. Prove Lemma 4.1.
Hint: For (a) prove that M is equivalent to an upper triangular matrix
mod T. Then write out the system of equations required for two
triangular matrices to be equivalent. For part (c) show that BtC is
equivalent to a triangular matrix mod A. Note that BtC has
determinant p, and hence the diagonal entries of this equivalent triangular
matrix are ±1, ±p.
2. a) Suppose that v
CO
/(?)••= 2 cnq" c„real.
n--h
Show, for a := e2mlp and p prime, that
p
2 [/(«V/P)f m integer
n = l
has no fractional powers of q in its expansion. (See Exercise 4 of
Section 6.2.)
b) Prove that (4.4.5) holds by applying Newton's formulae to express
st in terms of powers of the roots. (See Exercise 6 of Section 11.2.)
124
Higher Order Transformations
3. Show directly from (4.4.4) that
fU = A'+1.
4. Prove Theorem 4.8 by modifying the proof of Theorem 4.7. Use
Exercise 6b) of section 4.3. The proof is somewhat easier since we can
consider IIf=0 (x — /,) directly.
5. Show that the modular equations (4.4.2) and (4.4.6) are irreducible in
x, over C(A) and C(;'), respectively, by elaborating on the comments
following Theorem 4.8. This requires a minimal knowledge of Galois
theory. Note that the transformations of Lemma 4.1 act transitively on
the roots.
6. Show that
a) W„(*,l) = (*-1)'+1
b) Wp(x,0)~xp+\
Hint: Consider the orbits of 0 and i°° under the A-group.
7. a) Show that
Wp(x,X) = Wp(X,x)
and
Wp(^,A) = Wp(^iy+V+1
and if c(.; is the coefficient of xlx' in Wp, then
Ci,j ~ Cp + l-i,p + l-j ~ Cj,i ~ Cp+l-j,p + l-i •
Thus there is a fourfold symmetry in the coefficients of Wp.
Hint: Wp(X(qp), X{q)) = 0 and Wp(X(q), X(qp)) = 0 and by
Exercise 5, Wp(x, A) is irreducible in x over C(A). Also, Wp(x, A) and
Wp(X,x) have a common root x = X(qp) and are of the same
degree in x. Hence,
Wp(x,X) = R(X)Wp(X,x)
where R is a rational function of A. Show that this implies that
R = l. For the second symmetry use X(t/(t — 1)) = l/X(t).
b) Consider Wp(x, A), as in the proof of Theorem 4.7, as a
polynomial in l/x and 1/A. Let c,_; be the coefficient of x~'X~!. Show
directly that
16''+/'|c,;/ ;+;>p + l
4.4 The Modular Equations for A and j 125
and, by part a), that
162(P + l)-0+;)|-^ i+jsp.
c) Show that the coefficient of x'x' in WJx, A) is divisible bv
16Ip+i-(<-+/)I p }
8. In general, for n not necessarily prime, the modular equation (4.4.2)
has degree i/f(n) := n Up\n(l + 1 lp). Let n '= px • • • pk be a product of
distinct primes. Prove that there exists a two-variable polynomial
Wn(x, y) of degree at most ip(n) that satisfies
Wn(X(q),X(q")) = 0.
Hint: Let n = plp2. Since
Wpi(A( q"), A( q)) = 0
and
WP2(A(^^),A(^)) = 0
we deduce that \(qPlP2) is algebraic over Q(A(q)) and is of degree
ty{PiP2)- Now proceed inductively.
9. Show that the coefficients of W3(x, A) are
1
-256
384
-132
384
-762
384
-132
384
-256
1
Note that by Exercises 6 and 7 is suffices to determine cl l and cx 2.
The cM coefficient is always -16p_1. (See Exercise 6 of the next
section.)
10. {On the Galois group of Fp ) If A G Y then A induces an automorphism
on the ;\ that fixes /. Hence A is an element of the splitting field for F
over 0(/). Any two elements 5 and T of T induce the same
automorphism exactly when
for some a. Equivalently MST~lM G V for all M G Tp. One can show
126
Higher Order Transformations
that T, equivalenced as above, is the Galois group for Fp over Qp(j)
and that it is a group of order p(p2 — 1) which is nonsolvable for p > 5.
(See Klein and Fricke [1892], Schoeneberg [76], and Exercise 8 of
Section 4.5.)
4.5 THE MODULAR EQUATION IN u-v FORM
The actual calculation of the modular relation for A is most readily carried
out in the associated variables u '■= k114 '■= A1/8 and v '■= I114 '■= x118.
Analogously to Theorem 4.7, though with a few substantial additional details (see
Exercises 1 and 2), we have the following theorem.
Theorem 4.9
If p is an odd prime, then the modular equation in u~v form is given by
(4.5.1) ap(v, u):=(v- u0)(v - mx)• • • (v - up)
where
Up := (-1)^-1>/8[A(^)]1/8 := (-l)(p2-1)/8M(¢0 q := e^
uk := [A(«V ^)]178 := u(a8kqVp) k = 0, 2,. .. , p - 1
and a is a primitive pth root of unity. This modular equation is a polynomial
in u and v of degree p + 1 (independent of t) with integer coefficients.
For p = ±l (mod 8),
np(v, u) = ap(u, v) = ap(-v, -a).
The coefficients of vp+1 and up+1 are 1.
For p = ±3 (mod 8),
np(v,i) = (v + iy(v-i)
ap(v,u) = -ap(-u,v) = ap(rv,-u).
The coefficient of up+1 is —1 and the coefficient of vp+1 is 1.
The most striking property of the modular equation in u and v is the
"octicity." Because ui(qp) is of the form a'qll8f(a8'q), where/is analytic in
q, only every eighth coefficient of the modular equation is nonzero. (See
Exercise 3.) We illustrate with some examples which give the nonzero
coefficients and the column sums.
4.5 The Modular Equation in u — v Form
4 3 2 *
V V V V 1
u
3
u
n3 u2
u
1
u
5
u
4
u
u2
u
1
+1
+2
-2
-1
1+2 0-2 -1 = (v + lf(v - 1)
+1
+4
+5
-5
-4
-1
1+4+5 0 -5 -4 -1 =(v + lf(v-l)
u
7
U
u5
3
U
2
U
U
1
0
+1
-8
+28
-56
+70
-56
+28
-8
+1
0
1 -8 +28 -56 +70 -56 +28
+ 1 =(v-
Higher Order Transformations
T
o
1
+
1
+
oo
oo
+
1
o
+
f
5
+
T
o
+
oo
oo
1
1
5
+
1
+
o
+
~3 ~3 ~3 °3 3 3 ^3 "a **S *S ^3 3
4.5 The Modular Equation in u - v Form
%
t
T
o
•
1
-3-
+
1
o
oo
o
+
-3-
1
o
+
o
+
°5
-3-
1
o
oo
o
ts
1
oo
o
+
o
-3-
+
o
1
1
o
-3-
+
00
o
.1
o
+
-3-
1
+
o
-
w
vm
0
0
+1
v"
-256
+272
-34
,16
0
-272
+425
,1S
+ 1632
-2418
v»
-4080
+7140
-
*13
+4896
-13464
*12
-4080
+22644
V11
+ 1632
-33456
v10
-272
+44030
0
V*
+272
-49164
+272
Vs
0
+44030
-272
v7
-33456
+1632
,6
+22644
-4080
v5
-13464
+4896
V*
+7140
-4080
v3
-2448
+uh
v2
425
-272
0
V
-34
+272
-256
1
+ 1
0
0
-18 +153 -816 +3060 -8568 +18564 -31824 +43758 -48520 +43758 -31824 +18564 -8568 +3060 -816 +153 -18 +1 =(i>-1)1!
-^^(0^^3^3^^3^^^^9^^^^^^
„2° „19 „B .." „16 „15
V» v"
v6 v5 v" V3 v2 V 1
0
1
+512
-608
+114
-2432
+2584
-2432
+3344
-114
0
-3952
+6859
+5472
+2280
+2432
+ 10488
+2584
+25536
-2280
+3952
+21242
0
-608
"""
+20748
-3344
-10488
+ 10488
+3344
-20748
+608
0
-21242
-3952
+2280
-25536
-2584
-10488
-2432
-2280
-5472
-6859
+3952
0
+ 114
-3344
+2432
-2584
+2432
-512
-114
*
+608
-1
0
0
1 +18 +152 +798 +2907 +7752 +15504 +23256 +25194 +16796 0 -16796 -25194 -23256 -15504 -7752 -2907 -798 -152 -18. -1 = (v + l)"(v -
132
Higher Order Transformations
u24
ua
u22
un
u2a
u"
u18
u"
H16
«15
u"
u"
«23 „12
u"
«10
9
U
«8
«7
u6
u5
u4
«3
u2
u
0
v24
0
0
0
+1
v°
-2048
+2944
-920
v22
0
-13248
+13524
v2i
-23552
+75072
-53544
v2a
+52992
-124752
+82386
19
V
-138368
+149408
-53544
u"
+334144
-213072
+ 13524
v"
-712448
+367264
-920
v"
1159200
-423729
0
1 -24 +276 -2024 +10626 -42504 +134596 -346104 +735471
The exhibited portion of il23 is sufficient to easily calculate the remainder
of the coefficients because of the symmetries. For p = ±1 mod 8 the table is
symmetric through both diagonals. For p = ±3 mod 8 the reflections through
the diagonals change the sign of the entries according to ct j = (-l)'+1c;, and
c,j = (-i)'cp+1_hp+1_t, where clyj is the coefficient of v'u' in O. (See
Exercise 4.)
The numerical calculation of these modular equations is fairly
straightforward. From the q expansions for the 6 functions we can compute
u0, w1;.. . , up for a variety of q values (for p : = 23 one must use three
values of q). We then use (4.5.1) to calculate the coefficient of v' at these
values. However, we know the form of the coefficient (for p : = 23 and
/:==23 one has, for example, that the coefficient of v' is of the form
au + bu + cu1), and we can easily calculate u at the same q values we
4.5 The Modular Equation in u - v Form
133
used to calculate the coefficient of v\ This leads to a system of linear
equations to solve. We know that the system has an integral solution. We
also know what the column sums are in each case. One can use this
information to reduce the size of the system by 1 or, perhaps more
reasonably, as a check on the solution.
The main limitation is that the size of the entries grows exponentially
with p. The pth modular equation will have entries of size roughly 2P, so for
large examples one must work to a high degree of precision. Note that the
size of the linear system only grows as p/8.
Comments and Exercises
A fairly complete account of modular equations up to 1928 is given in
Hanna [28]. This includes equations of degree 103, 107, 127, 167, 191, and
239.
The u — v modular equations up to degree 20 are presented in Cayley
[1874] as we have presented them. We easily computed il23 by the method
outlined in the section. The others were originally calculated by Sohnke
(1836), whose method roughly parallels the one we have described, except,
that he computed an expansion for u(q)/q118 and computed sufficient
coefficients of um, to calculate the elementary symmetric functions directly.
Then instead of solving a linear system, he compares coefficients. As Cayley
[1874] points out, "The process is a laborious one (although less so than
perhaps might beforehand have been imagined)."
A particularly simple form of the modular equation for p =23, due to
Schroter, is
(4.5.2) (A;/)1'4 + (k'l')1H + 22'\klk'l')1'12 = 1.
For many theoretical purposes modular equations for j are preferable.
However, for calculations the modular equation for u is usually simpler. The
extent of the numerical simplification is quite remarkable. Du Val [73]
exhibits low-order modular equations for the T-modular function /'.= //
(/— 1). The cubic modular equation for I tabulates as a 5 x 5 matrix where
all but one of the 25 entries are either 15- or 16-digit integers. Modular
equations for j up to order 11 have been calculated. (See Kaltofen and Yui
[84].) For the llth-order modular equation the coefficients are enormous.
For example, the coefficient of f is
27090964785531389931563200281035226311929052227303
x292319520ll2-53.
Various of the A and j modular equations are presented in Greenhill [1892].
In particular, clean forms for p =29, 31, 47, and 71 are given for Wp.
134
Higher Order Transformations
Exercises 1 and 2 outline the proof of Theorem 4.9. Some of the details
are rather complicated. The flavour should come through.
1. a) Observe using (3.2.3) that
„=i\l + <7 I
b) Analyze the action of A-group on u. Show that
u°SA8) = u and u°Tk = u .
Observe that u°Sk = j8w where )8 is an eight root of unity.
c) Analyze the action that SA induces on the functions A0,. .. , Ap and
on the functions u0,. . . , up. What is the permutation SA8) induces
on u0,. . . , up1
d) Analyze the permutation that Tx induces on the functions
A0,. . . , Xp and on the functions «„,..., up. Identify Ap(rA) and
A (rj1) in particular.
2. Prove Theorem 4.9.
Hint: Use Theorem 4.7 and Exercise 1. Consider how Wp(v8, u8) splits
over Q(w). (Note that u is invariant with respect to group generated by
Sf and TA.)
3. Prove the "octicity" of the u-v modular equation. That is, for p = ±1
mod 8 the nonzero entries of the table associated with Op lie only on the
main diagonal and every eighth sub and super diagonal, while for
p = ±3 mod 8 the nonzero entries are only in every second entry of
every fourth diagonal.
4. a) Prove the symmetries (or antisymmetries) of the u-v modular
equation with respect to reflection through both diagonals.
b) Evaluate the row sums explicitly from Theorem 4.9.
c) Observe that with the aid of the "octicity" one can read off the
modular equations of degrees 3, 5, and 7. Verify the modular
equations of degrees 11 and 13. This requires either calculating the
ui at a single value of q, or using Exercise 6.
5. Show that il7 can be written as
(4.5.3) (1 - w8)(l - v8) - (1 - uv)8
or as
(4.5.4)
(kl)1'4 + (JfcT)1'4 = 1.
4.5 The Modular Equation in u — v Form 135
6. From our analysis we know that
a.(v,u) = vp+1+ E ct/u' + (-l)(p2-1),8«p+1.
a) Show that Cl, = -(-1)(^-^2^-1^2
Hint: ilp(uQ,'u) = 0 and u = V2q118 + 0(u2) while w0 = V2q1,8p +
O(u20). Thus
0 = (-l)(p2~1);V + clAu0 + O(u0u)
and
Cu = -(-D(p2-1)/8lim,^.
b) Show that if 0 < u < v < 1 and Q,p(v, u) = 0, then
V<2("-1)/2M and lim-=2(p-1)/2.
c) Hence if W (y2, y2+1) = 0 with 1 > yn > yn+l > 0, one has
7„+1^t4 and lim-^-=4
7. In 1858, Hermite and Kronecker separately gave solutions of a general
quintic using quintic modular equations. Hermite's method is
outlined below.
a) Let
¢,. := («5 - ul)(ui+l - m,_1)(m,+2 - w,_2) i = 0,1,. . . , 4
where i + / is chosen mod 5. Then
/ 1 \1/4 1
v 5 ' w(l - u )
are the five roots of the quintic
2 1 + w8
*5 — x ■
r5/4 2/1 8\l/2
C5/4 Z/1 8\1
5 w (1 - u )
b) The quintic modular equation (4.1.21)
6 6 , c 2 2/ 2 2\ 1 ,1 /! 4 4\ «
w — v +5u v (u — v ) + 4uv(l — u v ) = 0
136 Higher Order Transformations
can equivalently be transformed into
5 o4c3 4/1 8\2 /-,6^5/2 3/i 8\2/1 , 8\ rv
x -25u{l-u)x — 25 w(l — w)(l + w) = 0.
The details are formidable,
c) Any quintic can be algebraically reduced (via solution of a quartic
equation) to the Bring form
x5 — x — a
and hence a) and c) provide a solution of the quintic in terms of the
roots of the modular equation. This requires solving
2 1 + w8
55M u\l - u8)in
and leads to a quartic equation in u .
The amount of calculation required above is prohibitive.
However, that some combination of the uf solves a quintic is not overly
surprising since the Galois group of the quintic modular equation
(4.1.21) is 55. The reduction to Bring form in c) is effected via the
Tschirnhaus substitution.
(On the Galois groups for W5 and W1 over <Q>( A))
a) For p = 5, SA and Tx induce the following permutations of the roots
(*' = *,)
5A: (0,1,2,3,4,5)-^(1,2,3,4,0,5)
TA: (0,1, 2, 3, 4, 5)-» (0,5, 3,1,2, 4).
b) For p = 7
5A: (0,1, 2, 3, 4, 5, 6, 7)-* (1, 2, 3, 4, 5, 6, 0, 7)
TA: (0,1,2,3,4,5,6,7)-^(0,3,1,4,6,7,5,2).
c) Show, using a), that the Galois group of W5 contains A5 and is not
solvable.
d) Show, using b), that the Galois group for W7 is not solvable.
In both cases these permutations actually generate the Galois
group. (See Exercise 10 of 4.4, and Exercise 1 of 4.5.)
4.6 THE MULTIPLIER
As we saw in Section 4.4, thepth-order transformation can be considered as
determined by k := k(q) and 1:= k(qllp). We define Mp, the multiplier of
order p, by
4.6 The Multiplier
l2/
137
1(4-6.1)
MM,k):
6i(q) _K(k)
e\(qVp) K(l)
|> and in the future will denote K(l) : = L.
Theorem 4.10
|If k := k(q) and /:= k(qVp), then Wp(/2, A;2) = 0 and
f2_ ll'2 dk v(l-v8) du
1(4.6.2)
pMt
kk'2 dl u(l-u8) dv
In particular, Mp is an algebraic function of k and I.
Proof. Since 7rK'/K= —log q [by Theorem 2.3(6)], we have
1 7T
f (4.6.3a)
A(£.
dk\K
2 A*'2*:
21^2
I on using (2.3.10). Similarly,
f (4-6-36)
1
dl
\LJ 2 U'2L
Now as pL'/L = K'/K [again by Theorem 2.3(6)], we can write the I, k
form of (4.6.2) on dividing (4.6.3a) by (4.6.36). We now verify the u-v
form directly. □
When Mp is given by v(l — v8)/[u(l — w8)] (du/dv) as a function of u and
v, we write Mp(v, u). In this form we see that M2 is rational. In fact
Mp(u, «) is rational. (See also Exercise 1.)
Let us use (4.6.2) to compute M2 and M3. When p'-=2, we have
/ = 2V£/(1 + k) and A: = (1 - /')/(! + /'), by equation (2.1.15). Thus
(4.6.4)
2M2 =
dk ll'2 Vk(l + k)2
kk'2 dl kk'2 1-k
(1 + kY
and
M2(l, k) :
1 + /'
1 + A:
- which corresponds with Theorem 2.6.
138
Higher Order Transformations
When p'-=3, we have Vlk + VVk7 = 1, by equation (4.2.6). This
produces, on differentiating implicitly and using (4.6.2),
3M, = -
l'2^M-l2VFF
VE-l2
1
TO
13 k'WJk-kWTF y/Tk-k2 \-
Let a4 := k3/l. Then I3Ik = [(2 + a) /(1 + 2a)]4 and
(46-5) 3M'=(dn? - m>=2Stt-
(See Exercise 2 of Section 4.1.) But a = ulv so that
M3 =
lai
2v -u
v + 2u
3w
because
3 J2+a
1
2a + 1 \2a + l-
For p : = 5 or 7, similar, but more elaborate, calculation produces
(4.6.6) Ms =
(4.6.7) M7
u + v
v(l - uv3)
5 5m(1 + u3v) v — us
- ^(1 ~ uv)[l - uv + (mi;)2] _ _
w — i;
v — u
7w(l — mu)[1 — uv + (uv) ]
(See Cayley [1895].) Many other multipliers have been calculated and can
be found in Ramanujan's collected works, Cayley [1874], Tannery and Molk
[1893], in Weber [08] and elsewhere. The main technique for larger p is via
manipulation of theta series. Thus one has Ramanujan's form of the
multiplier for 13:
(4.6.8)
»*..M>-ap.(£r-(&r-«(&)
1/3
1
Ml3(k, I)
We list also
l i\in /i'V2 i w V2
(4.6.9) 17M„ft«H*) +(f) +(W)
4.6 The Multiplier
139
We conclude this section by touching on the matter of singular values, kp,
which for us are defined to be the solutions in (0,1) of Wp(k' , k2) = 0.
These are often called singular moduli for the function A. Corresponding
values for J are discussed in Exercise 6.
Then, since K'(k)/K(k) is isotone, Theorem 2.3(b) shows that this is the
unique solution to
Y<*p) = Vp
0 < kp < 1.
In the notation of equation (3.2.1), k = A*(p) and k' = A*(l/p), so that
kp = k(e~7ry/p) and lp : = k'p = k(e~^p). Sophisticated number-theoretic
techniques are available for computing kp for large p, without knowledge of
Wp. This is discussed briefly in Exercise 5. For small p one can solve directly
for kp. Thus
'*! =
_1_
V2
fc, = V2-l
^3 =
V2(V3-1)
4
3-2V2
^7 =
VV5-1-V3-V5
V2(3 - V7)
, _ (V2-31/4)(V3-1)
/c9 _
_1_
V2
= V2V2-2
V2(V3 + 1)
4
= 21/4(2V2-2)
VV5-1 + V3-V5
2
V2(3 + V7)
8
(V2 + 31/4)(V3-1)
2
2^:^ = 1
^3^3 0
2A:5/5 = V5 - 2
ZiKjlj
2A:9/9 = (2 - V3)2
(4.6.10)
A more comprehensive list is given in the next chapter. A profusion of
modular equations of degrees 3, 5, and 7 are given in Chapter 19 of
Ramanujan's Second Notebook.
Comments and Exercises
1. From the results of Section 1.5 (in particular Theorem 1.5 and Exercise
1) we know that G(k) := kV2k'K(k) satisfies
140
Higher Order Transformations
where k and I are solutions of the pth-order modular equation Wp. Use
this to show that
M2(l,k): =
ll'z dk
= c
K(l)
K(k)i " kk'2 dl
where c is a constant. This provides an easy alternate derivation of
Theorem 4.10 up to the evaluation of the constant c.
2. a) Verify the computation of M3 in (4.6.5).
b) Compute Ms in (4.6.6).
Cayley [1874] discusses algebraic methods for computing Wp and Mp at
length. These seem only to be entirely reasonable for 2, 3, 5, 7, and in part,
11. The discussion therein also illuminates the rational nature of Mp(u, v).
3. a) Show that pMp(l, k)Mp{k', V) = 1.
b) Use Theorem 4.10 to show that for all u and v,
[PMp(v,u)Mp((-l)^-l)l8u,v)]2 = l.
Hint: Consider the similar symmetry of Op.
4. a) Cayley observes that given any polynomial identity F(u, v) = 0
which satisfies F(u, v) = F(-u, -v), one can produce a similar
identity G(u2, v2) = 0, with G of the same degree. One uses
(4.6.11) [F(u, v)F(u, -v)F(~u, v)F(-u, -v)]1'2 = 0 .
b) Use this technique to develop modular equations and multipliers in
terms of the u, v2n (n := 1,2, 4) for p : = 2, 3, 5, 7. (See Cayley
[1874].)
5. Verify the singular values in (4.6.10). In each odd case one verifies
t„ -Ikk' first and uses
kn = (VT+7n-VT=tn)/2
ln = (VT+7n + y/T=Tn)/2.
The invariants of (3.2.9) to (3.2.13) lie at the heart of calculating
singular values. Armed with these and either Ramanujan's insight or
some knowledge of group theory, singular values can be calculated in
profusion. Watson, in a long series of papers commencing with Watson
[32], has recreated what he believes to be Ramanujan's procedure,
while Weber [08] explains the classical theory and lists many examples.
Zucker [77] indicates an attractive way of calculating many large
singular values such as Ramanujan's celebrated
4.6 The Multiplier
141
(4.6.12) k2W = (V2 - 1)2(2 - V3)(V7 - V6)2(8 - 3V7)
x (VlO - 3)2(4 - Vl5)2(Vl5 - Vl4)(6 - V35) .
Zucker's methods are described in Section 9.2.
6. In this exercise we sketch the relationship between binary quadratic
forms and singular invariants or values for Fp [solutions of Fp(j, j) = 0].
a) Let
M: $)
where |5| = ad — be = p > 0 and a, b, c, d are integral, and assume
with no loss of generality that c > 0, b^O. We say t with im(t) > 0
satisfies complex multiplication by jit if (/jut, jit) = S(t, 1) so that
t = (at + b) /(ct + d). In particular, f is the solution with im(t) > 0 to
an integral equation Ax + Bx + C = 0 with -D-=B2 - 4AC <0.
[This corresponds to asking for which \x, one can solve
p(lJ,z, w1, w2) = R(p(z, wl, w2)) for some rational function jR. Here
p{z) : = p(z, h>1; w2) is the Weierstrass p function of Section 1.7. Or,
in other words, for which /jl does the lattice L generated by wt and
w2 contain the lattice jitL? Hence, the multiplication.]
b) Suppose that
at + b , „ a*t+b*
ct+d c*t+d*
(4.6.13) ad-bc = p, a*d* - b*c* = 1.
Then ;(f*)= j(t). Also t* solves a quadratic with the same
discriminant as that for t, and t* possesses a multiplication by \l if and
only if t does. Now (4.4.6) implies Fp(x, ;') = 0 is solved by j(t),
because 5 = A,modr for some i and hence j(t) =/,(0- Thus
Fp(i>J)~Q 1S solved by j(t) exactly when t possesses complex
multiplication. Note that Fp{j, j) will have lower degree than
Fp(x, j).
c) Since / is one to one on the fundamental region, f(t) =/(**) if and
only if the associated primitive binary quadratic forms are properly
equivalent. A binary form ax2 + bxy + cy = 0 is primitive if
gcd(a, b, c) = 1. Two forms are properly equivalent if there is an
integral linear transformation of determinant 1 converting the one
into the other. (See Dickson [71, vol. 3].)
d) Thus the study of the degree of Fp(j, j) becomes the study of
h(—D): the class number of primitive forms of negative
discriminant — D. More of this is sketched in Hardy [40, chap. 10], and in
Tannery and Molk [1893].
142
Higher Order Transformations
7. For fixed m and p let y0: = km be the mth singular value and let yn be
the sequence of solutions of Wp(yn, yn+l) = 0 as defined in Exercise 6 of
Section 4.5. Use Theorem 2.3 and (3.2.3) to show that
n—*<» \y /
lip"
Wm/2
e
■TV
This provides an algorithm of order p for computing e~in/m. When
p "- = 2, this reduces to the algorithm in Exercise 3 of Section 2.5.
By the same process as in Section 4.5 one can produce polynomials in
x := M~l{l, k) and k. Cayley [1874], following Joubert, gives
(4.6.14) *4 - 6*2 + 8(1 - 2fc2)* - 3 = 0 p:=3
x6 - 10*5 + 35*4 - 6(k3 + 55*2 + [64(2kk'f - 26]* + 5 = 0 p := 5
(4.6.15)
and
(4.6.16) *8 - 28*6 + 112(1 - 2*2)*5 - 210¾4
+ 224(1 - 2A:2)*3 - [140 + 1344(2A*')2];c2
+ [48 + 512(2A£')2](1 - 2A:2)* + 7 = 0 p := 7 .
He gives a similar expression when p := 11.
8. a) Verify (4.6.14) and observe that 3A/3(1/V2, k9) = V3 + 2V3.
b) Explicitly solve (4.6.14) to produce M^1 as a function of k.
c) Use (4.6.15) to determine ks.
4.7 CUBIC MODULAR IDENTITIES
Ramanujan in his notebooks gives the following remarkable identity:
03( q3)
$5-11
04^9)
1
[See (24.28) and (24.29) of Chapter 18 of his second notebook in Berndt
[Pr].]
An equivalent form and some variants of this formula are established in
the next theorem.
Theorem 4.11
ith
k:
el(q)
0l(q)
. e22(q3)
y'~el(q3)
4.7 Cubic Modular Identities
143
one has
(a)
(b)
(c)
e*(q)
3^5-1
^( ^)
= 4(^-) =9^^-!
V 77' 1 6t(q)
4/ _3>
e:(?)
02(<z)
024(jLl
1(9)
1.
Proof. We establish only (a). Then (6) and (c) follow (Exercise 2). It is
most convenient to use the quintuple-product extension of Jacobi's triple
product, which is developed in Section 9.4. Equation (9.4.3) with/:= 6 and
k: = 1 may be remanipulated to show that
CO
303(<79) -8,(9) = 211(1- g")(l + <72")(1 + q6"'3) ■
n=l
Hence, with (3.1.6),
CO
0s(q)pe3(q9) - e3(q)f = 8 IT tt + q6n+3f
n = 0
00
X I! (1 - q2n)\l - q2"'1)3^ + q2")\l + q2"~l?
and Euler's identity (3.1.4) reduces this to
s fi (l+q6-+y n (l+q2-+y n (-^4^) ■
Now (3.2.9iv) shows
and
EI (1+ q2n + 1f =VZql'\kk'rUA
n(i+?6n+3)3=V2^/8(77'r1M
n = 0
while (3.1.4), (3.1.7), and (3.1.8) combine to yield
In \ 4
/ 1 ~z" \ 4 1
(?)
These identities result in
144 Higher Order Transformations
(4.7.2) e3(q)[303(q9)-e3(q)f=4
606,
(kkTXyy'Y
■40
(kk')3'4
3 / ,\l/4
(77 )
This establishes the first equality for part (a). The second is most easily
seen by using equation (4.1.15), with y'■= k and k'-=l, to write k3/y =
[(2 + a)/(2a + l)]4, k'3/y' = [(1 - a)/(2a + l)]4, and using (4.1.19) or
(4.6.5) to write 043(q3)/643(q) = 1/(2« + 1)2. Then both sides of the second
equality become 4(2 + a)(l - a) /(2a + 1)2. □
These identities have many remarkable consequences some of which we
leave as exercises. Two, however, are worthy of more explicit analysis.
an iteration for the cubic multiplier. In decreasing form, as above, we
consider the multiplier Mn : = Kn+l/Kn where Kn : = K(q3"). Then part a)
of Theorem 4.11 shows that, with mn := 3A/„, we have
(4.7.3)
where
(4.7.4)
m„+, =
mn =
1 + 2V2
(2kk')
(277 01
3/4
Alternatively, part (b) shows that if we set rn '■= 3624(q3
mn = (/■„ + 3)/(rn - 1) (see Exercise 8). We have
)/04(<73")>sothat
(4.7.5)
where
(4.7.6)
[(^-1)^ + 1]
rn-
1+2V2) '
(27/7 )
2^3/4
4
1/2
This latter is more suitable when we begin with an even singular value.
We have written (4.7.4) and (4.7.5) in a form consistent with Ramanujan's
invariants Gn and gn. [See (3.2.13) and Exercise 5, Section 3.2.] In these
terms, the iterations are initialized by
(4.7.7)
and
(4.7.8)
m0:=m(n):=[l + (V2G9„/G^)3]1/2
r0:=r(n):=[l + (V2g9n/g3n)3]U2.
q:=e
-7rVn
We also write M(n) := m(n)/3.
4.7 Cubic Modular Identities
cubic recursions for G„ and gn. We have
9 g'(g3) =1 + 2V2^
03(<7) G«
and, using the theta inversion formula in part (a),
03( 1) _, ,../. G
3
= 1 + 2V2 —£- .
Ot(ql G
Thus
(4.7.9) 9 = (l + 2V2%)(l + 2V2^)
V G„/\ G9„/
Similarly
(4.7.10) 9 = (l + 2V2%)(l-2V2-%-).
on 69«
Thus given G„ or gn, we have a simple equation to solve for G9n or g9n.
example, with n"= 5 we have G3 = G1/3, and with *:= G3 we know
(1 + 2V2V6)2 =9 or G3=21/12. Similarly, g2~/3 = g6 so that (4.7.10) y
of -g6" = 4V2 and g* = V2+T.
A more tractable recursion can be attained by observing that
3^-1 = ^%
and
Thus
^)-1 = V2G'
03(<?y) G
3
81n
(4.7.11) 3 = (V2^ + l)(V2^ + l)
G81«
and, rearranging,
(4.7..2) ei.^^ig
146 Higher Order Transformations
Correspondingly
3 gl + V2g9„
(4.7.13) 881"~89" g9n-^2gl'
From these two formulae, and other singular values given by Ramanujan,
one can give explicit equations for some very large invariants (G\025, G\991,
#4698> f°r example). We illustrate with g4698. Ramanujan gives
2 _ 5 + V29
8s8- 2
'5
6*522
±^)"2(5V5+11V6r(^ + ^M)
Then manipulation of (4.7.13) yields
3 _
64698
/V29-5\1/2(V29 + 5) + V2(llV6+5V29)1/6(V9 + 3V6 + V5 + 3V6)
\ 2./ (V29 -5)- V2(ll V6 - 5V29)1/6(V9 + 3V6 - V5 + 3V6) "
(See also Exercise 13.) Analogous results for quintic and septic multipliers
are treated in Section 9.5.
Comments and Exercises
The quintuple-product identity was certainly known in essence to
Ramanujan so that our derivation of Theorem 4.11 is in all likelihood similar to that
which he had in mind. The proof we give in Section 9.4 is self-contained and
can be read with ease now. Biagioli [Pr] has given a modular function proof
of (4.7.1).
1. a) Show that (4.7.1) is equivalent to
3
03(<f)
03(q)
4/ _3>,
9g;(gj)
ot(q)
l
1/3
(either by modular considerations or by direct inversion of the
underlying quartic polynomial),
b) Show that the AGM iterations in theta form can be written as
e3(q) x rn2/~2
e3(q*) ' I-03(94)
el(q2) ^1
1
4.7 Cubic Modular Identities
147
or as
03(<74)
03(q)
1 =
Ol(q)
1
(Note that in this case the replacement of q by —q does not give a
formula in 04.)
a) Show that Theorem 4.11, parts (b) and (c) are equivalent to part
(a).
b) Derive part (c) directly from (9.4.3).
Use (4.7.9) and (4.7.10) to verify that
i) Gp = (V3 + l)/V2
ii) g?8 = V3 + V2
iii) G^7 = 21/4/(21/3-l).
a) Use Exercise 3 and (4.7.12) and (4.7.13) to verify that
.. 3 (2V3+2)1/3 + l
(2V3-2)1/3-l
.., 3 1 + (2V6 + 4)1;
u) giffl" " —
3
1/3
1-(2V6-4)J
b) Calculate G243 and g54.
Show that, in the notation of Proposition 3.1,
a) 11^(9)-30^)1 = 40^9)
/=2
b) 11^)-0/(9)1 = 40^(99).
/-2
Show that
9\l3
02{ql[e2{q) - Wtf + 0A(qv)[e4(q) - 6A(q*)]
= 0s(q9M(q)-d3(q9)}3.
Use Schlafli's equation (Exercise 6 of Section 4.1) to establish that
i) G5- 2
n) gio = —2—
iii) G25 =
V5+1
Higher Order Transformations
In the notation of (4.7.7) and (4.7.8) show using (4.1.15) that
i) [m(n) - l][r(n) - 1] = 4
and
ii) r(4n) = '\/m(n) + r(ri) + 3 = y/r(ri)m(ri) .
Also (4.7.9) gives
«0 ^)-^(1)-3-.-(1)-
a) Given that
^-(^OW
verify that
__6 /3-V7\/V7 + V3\3/2
G^=\-vr)\~t~ ) and
M-(|) = V3(^1)V^^V3.
b) Use the modular equations of degrees 3 and 7 to verify the value
of G21.
In each case, given the first invariant verify the following ones.
a) ^ = (-^^)(^^)^^3 = (-^-)(^1) and
Ki r~* /3V19-13\/V3-1\3 6 /3V19-13\/V3 + 1\3
b) G57 - ^ ^ )\-^) > Gl9l3 - {—^—)\~Vr)
and
Af^y) = V3V2V19 + 5^(^1)3.
N _6 /39-7V3T\/V3T-3V3\3/2
c) G93=( ^ )( ) ,
'39 - 7V3T\IV31 + 3V3\3/2
'31/3
G:L = t= = and
vr-A—2—J
M_1(|)=^V3T_3V3)3^y^l)3
4.7 Cubic Modular Identities 149
d) g- = (v^-5)(^^)3,g2t/3 = (V^+5)(^1^-)3 and
r(f) = V3-(^|^-)3/2V6 + 34V2 + 15Vl3.
e) g-06 = (VlO - 3)(V5 - 2), g-% = (VlO - 3)(V5 + 2) and
A"(y)=V3(V2-l)(V5 + 2).
11. a) Verify that m($) = V3 and r{§) = V6 + V3.
b) Hence verify that m(§) = 3V2 + 2V3 - V6-3 and that K§) =
V3(V3 + V2). =====
c) Verify that m(l) = V3 + Vl2 and r(4) = V3 + V3 + V9 + 6V3.
d) Verify that a-(2) = V2+V3 and m{\) = V6- V2 + 1. Hence
m(^) = 3(3V6 + 7 - 4V3 - 5V2).
e) Verify that m(|) = (3 - V6)(V2+1) and m(^) = (6V3+ 9)-
(4V6+6V2).
12. a) Show that (4.7.9) can be written as
(j)2 + (^)2 = 2V5[(^)_(^rl]
where x'= G3N and y '■= G9N. Hence compute G3 and Gg.
b) Find the parallel expression for g3N, g39N. [Compare (4.1.27).]
13. One can explicitly solve (4.7.9) and (4.7.10) to obtain the following
formulae.
a) Show that G39N and G3NI9 are the two solutions to
x2 - V2GN[G8N + ^G^+Gl + ljx + G2N[G8N + 1 + ^/gY+G% +
= 0.
b) Show that g39N and ~g3NI9 are the two solutions to
*2 - ^2gN[g8N+y/gN-g8N+i]x - gtigl -1+V^6-^+i]
= 0.
c) Given that G2S = (V5 + 1)/2, show that
G225 = (2 + V^)1/3(^i)[V4TVl5 + 151/4]
and
G25/9 = (2 + V3)1/3(^i)[V4TVl5 - 151/4].
150 Higher Order Transformations
d) Given that g58 = (5 + V29)/2, show that
^ = 1^^)^(5^+11^)1
9 + 3V6 /5 + 3V6
and
#58/
^~^y/2(5V29 + llV6)
1/6
4 + 3V6 9 + 3V6
14. a) Entry 23 in Chapter 18 of Ramanujan's second notebook (Berndt
[Pr]) gives
i) V2 E exp( 2W7r^)cos( ""y2)
M / 2 . 2 . \l/2 V -n2TTX / 2 -,
= (yx + y + x) Zj e cos (n Try)
w=-co
CO
. /« I 2 . 2 \l/2 V -n2!! -/2 \
+ (V* + y - x) Zj e sin (n Try)
W = —00
••x /~ V (-n\x\ . ( n-ny \
u) V2 2, expl 2 2 ) sin t 2 2)
«=-co \* + y / \* +y /
= (yx + y - x) Zj e cos (n 7ry)
/I=—CO
CO
/.. / 2 . 2 . \l/2 V -nVr -/2 \
- (V* + y + x) Zj e sin (n 777) .
/! = — co
Use Exercise 5 of Section 2.2 to prove these when x and y are
complex with re(* ± iy) > 0.
b) Deduce that if re(s) > 0,
S e~"2-cos (nW) =^^-^ S <r"2™sin(n2™')-
/1 = -00 VI S n = -co
c) Use ai) to deduce that
03(e~") = V5V5 - 10 fl,(<r5*) .
d) Use b) with s : = V5/3 to deduce that
(V5 + 3)030-7^3) = (3 + V3)03(e-"3V~5) .
4.7 Cubic Modular Identities 151
e) Use d) to conclude that
M(5) = £V 1 + 2V3 + 2 V5
and
G« = KV5 + 2)3(V5 + V3)4.
f) i) Use b) with s := V3/2 to obtain
kl2 = (V3 - V2)2(V2 - 1)2 and g% = V2(V3 + 1)3.
ii) Use b) with s := Vl5/4 to obtain
jT = (4+ Vl5)(2 + V3)2 and G" = 4(7 + 3V5)
iii) Use b) with s := V7/4 to obtain
^- = 8 + 3V7 and G" = ;
'7
g) Use c) to show that
vnr ^ (^ + 2] r2(')
15. a) As in Ewell [86] show that
04(g)3 1irJy g3""1
3n-l n3"~2
+ q3-2
and so develop a formula for r,(n).
Hint: Use the quintuple-product of Section 9.4 and mimic the
derivation of equation (9.1.14).
b) Evaluate
CO
El/[l + F«,-3]-
Chapter Five
Modular Equations and
Algebraic Approximations
to 71
Abstract. In this chapter we study the algebraic relationships between elliptic
integrals of the first and second kind. This study is applied to produce nth-
order iteratons for ir, rapid series for ir~ , and assorted other algebraic
approximations to ir.
5.1 SINGULAR VALUES OF THE SECOND KIND
In Section 4.6 singular values were introduced. We will call A* the singular
value function (of the first kind) where A*(r) := k(e~ r) as in Section 3.2.
We introduce the singular value function (of the second kind) a which we
define for positive r by
(5.1.1) «W:=f-~2 * := *(*-*"') = A*(r).
Since \*(r) tends to 0 as r tends to °°, we have a(r) converging to tr"1 as r
increases. Indeed, as we shall see, the convergence is exponential which
allows us to use a(r) to approximate 1/tt effectively.
Using Legendre's identity and the fact that K'(h*(r)) = VrK(\*(r)), we
have
(5.1.2) aW_ - m_,)
152
5.1 Singular Values of the Second Kind 153
and k:=X*(r)
(5.1.3) «W = V7f-^-
If in (5.1.2) we use the differential equation for K, equation (1.3.13), we
may also write
(5.1.4) «(r) = ± (^)2 - V?(kk* § - ¢) k:^ A*W ■
Now since A*(l/r) = A*'^), we discover that
(5.1.5) a(,-1) = ^W.
In particular a(l) = \. We may rearrange (5.1.2) and (5.1.3) as follows:
(5.1.6) j = K[V?E - (V7 - a(r))K]
and fc:=A*(r)
7¾ - ^ X'"
(5.1.7) l = K'
which may be viewed as one-sided forms of Legendre's identity. In the next
sections we will give these identities substance by showing that a(r) is
algebraic for rational r and by computing many values. Another useful
equivalent form is
(5.1.8) E'=V?E-8(r)K
where k-=X*(r)
(5.1.9) 5(r):=Vr-2a(r).
Theorem 5.1
The function a is montonically decreasing for rsl.
Proof. Since A* is decreasing, it suffices to show for k<l/V2 that
f'-=(E'K-ir/4)/K2 is an increasing function of k. This we establish by
computing / and using the differential equations for E' and K. (See
154 Modular Equations and Algebraic Approximations to jt
Exercises 2 and 3 of Section 1.3.) We deduce from Legendre's identity that
, = irlljEIK -1) + k2E(K' - E') + (k')2E'(K - E)
i{) ~ k(k')2K2
= k2[(K - E)(E - irl(2K)) + E(K' -E')-E(K- E)] + (k'f(K - E)(E' - irl{2K))
k(k')2K2
Since, for k<l/V2,
K>E>E'>l>irl2K
we finish by observing that, for k < 1/V2,
K'-E'>K-E
and so substitution into the last equality completes the proof. □
We next provide a theta function expression for a. We combine (2.1.13)
and (2.3.17) with (5.1.2) to write
,^ „ „™ , x tt"1- VrAqe.ld,
(5.1.10) a{r) = fl4 (w.r.t.q)
where q := e~ r. Expanding this gives
(5.1.11) 0 < a(r) - tt"1 = 8(V7 - tt"1) e'7^ + 0(re~27r/~r)
^leVfe"^ r>l
and
(5.1.12) 0<«(a-)-7t"1<V7A*2(a-).
One should compare (3.2.1).
Comments and Exercises
The function a is implicit in Ramanujan's work. In the next section we
indicate how. Zucker [79] computes a(n) for n '• = 1, 2, 3, 4, 5, 7, while a(3)
was known to Legendre. Formula (5.1.10) allows one to numerically
compute a very easily.
a) Show that
tt"1 = [a(r) - V?\*Xr)]el(q) + AVfq
-1-r.v.A .^,,2,.^4,^ ,„.,= _ ¢,(9)
03(q)
where q'- = e~'*v~T.
5.2 Calculation of a
155
b) Use a) to reprove Exercise 4b) of Section 2.3:
4v=
K=-~ne-^
2. By Exercise 4 of Section 1.6, A*(2) = V2 - 1. Use Theorem 1.2 to show
that a(2)=V2-l. Hence a(|) = |.
3. Prove (5.1.11) and (5.1.12).
5.2 CALCULATION OF a
We begin with an appropriate generalization of the quadratic transformation
formula for E given in Theorem 1.2.
Propostion 5.1
Letp>0, k:=k(q), and l:=k(qllp) be given. Then
U>2 dMp(l, k)
(5.2.1) pM2Mk)
| (*) - *'2
MM, k) dl
+
Lf <'>-<"
Note: Herein (dMp(l, k)ldl) is the full derivative of Mp{l, k) with respect to
Proof. We have Mp(l, k)K(l) = K(k). We differentiate both sides with
respect to k and use (4.6.2) to write
(5.2.2) pMp{l, k)~^^ (k) = Mp(l, k) ^ (/) + K(/) ^ (/, /:) .
Next we use the differential equation for K to write
dK _ E{k) - k'2K(k) dK E(l) - l'2K(l)
dkW~ kk'2 dl{l)~ ll'2
and we substitute these two identities into (5.2.2). We then have
2 dMM, k)
PM2p(l, k)[E(k) - k'2K(k)] = Mp(l, k)[E(l) - l'2K(l)] + K(l)ll'2 "^
On dividing each side by K(k) we have (5.2.1). □
We now derive the key identity for a.
156 Modular Equations and Algebraic Approximations to n
Theorem 5.2
Forp and r>0 let /: = \*(r) and k: = A*(pV). Then
a(p2r) = Mp2(l, k)a(r) - Vr
(5.2.3.)
W-*+^^
Proof. We suppress variables in the multiplier when convenient. From
(5.1.2) and (5.2.1) we have
a(p2r) = ^(k~)-pV?
Ml
AK\l)
' E
-K
V?
(*)-!_
[!«
-i
-V7
\,2+»'2dMP
I Mp dl
pMlk2
This gives
(5.2.4) a(p2r) = M'2\a(r)~V?
l2 +
ll'2 dMD
Mp dl
-pMlk2
on using (5.1.2) again. Another application of (4.6.2) produces the desired
formula. □
In particular, if we let r := lip above, then / = k', and with (5.1.5) we
derive that
(5.2.5)
a(p) = Vpk2 - | k'k2 j{ Mp(k', k)
or
(5.2.6)
a(p) = Vpk2-Plkk'2^Mp(k',k)
where k-~ A*(p).
Let us observe that, since A*(p) and Mp are algebraic for rational p, this
shows that a(p) is algebraic for rational p.
example 5.1. When p : = 2, Af (/, k) = 1/(1 + fc) and fc = A*(2)=V2-1.
Then dMp(k', k) ldk = -\ and a(2) = V2(V2 - 1)2 + (V2 - 1) = V2 - 1.
(Compare Exercise 2 of Section 1.) Similarly a(3) = (V3 - 1)/2 and also
a(7) = (V7-2)/2. (See Exercise 3.)
5.2 Calculation of a
157
Before continuing to evaluate a it is necessary to connect it to Ramanu-
jan's multiplier (of the second kind).
(5.2.7)
where
(5.2.8)
nfl .y-PPiqi-W)
P(q) :-1-242 nq
1£.
- ~ q
Proposition 5.2
Let p, k, and I be as in Theorem 5.2. Then
Rp(l, k) =p(l ~2k2)Mp(l, *) - (1 - 2l2)M;\l, k) + 3pkk'2 dMjlkk) •
(5.2.9)
Proof. We start with (3.2.15), which gives t}(<7)/tj(<71/p) in the form
l/12p/
(1-^)(1-^)(1-^ P)"-
(5.2.10)
We differentiate logarithmically and obtain
P[,)-1*(,»)-2,S
1 rf(fcfc') 1 rf(fl') rff 3 dMp
kk' dk IV dl dk Mn dk J
Now we use (4.6.2) for dlldk and (2.3.10) for qdkldq and obtain
PP(q)-P(qllp) =
4LK
TT
p(l - 2k2)Mp -(1- 2l2)M;1 + 3pkk'
dMp
dk J'
This gives (5.2.9). □
It is convenient to introduce two additional quantities:
dk
(5.2.11) ^)-.= -^
and
(5.2.12) <r(p):=Rp(k',k)
k-.^e'7^ ,
158 Modular Equations and Algebraic Approximations to jt
In these terms we have:
Theorem 5.3
(a) With r, p, k, and I as above, we have
spV, k) = 3
(5.2.13)
(5.2.14) a(p2r) = Mp"2(/, *:)«(>■) - Vrep{l, k) .
(b) With fc(<TWp) = A*(p), we have
1 + A*(p)2 cr(p)
(5.2.150 «(/>) = VJ> 3 6
(5.2.15,) ,(p),V?[l-2Ay]+a<fl
Prao/. We deduce (5.2.13) and (5.2.14) by comparing (5.2.9) and
(5.2.3). We obtain (5.2.15) on comparing (5.2.9) (with 1 = k') to
(5.2.6). □
Ramanujan has computed Rp for p := 2, 3, 4, 5, 7, 11, 15, 17, 19, 23, 31,
35. From these, many values of a are obtainable. The following table is
taken from Ramanujan [14] with the entry for i?4 corrected.
The verification that Rp has the given form is tedious but straightforward
for small p. [See Exercise lc).] For larger p we rely on Ramanujan.
example 5.2. For p: = 7, the modular equation in the form (kl)1,A +
{k'l')VA = 1 shows that 2k7k'7 = £, k7 := A*(7). Then k7 = (3 - V7)/4V2,
and so o-(7) = R7(k'7, k7) = 3(1 + 2k7k'7) =27/8, a(7) = (V7-2)/2, and
6(7) = 2.
We now make (5.2.14) explicit for p : = 2, 3, 4.
Proposition 5.3
If /:= A*(r) and A: := A*(4r), then
1-/'
(0 k-j^jr
and
(5.2.16) («) a(4r) = (1 + kfa(r) - 2V7ft .
5.2 Calculation of a
159
TABLE 5.1
RP(l, k)
1
11
15
17
19
23
31
35
I' + k
l + kl + k'l'
3(1 + /')(! + k)
2
l + kl + k'l'
(3 + kl + k'l')
3(1 + kl + k'l')
2[2(1 + kl + k'l') + VE + VkT - VkkW]
[1 + (kl)1'4 + (k'l')1'4]4 - (1 + /c/ + /c7')
[44(1 + /c72 + k'T2) + 168(/(:/ + k'l' - kk'll')
-102(1 - kl - k'l')(4kk'll')U3 - 192(4/c/c'//')2/3]1/2
6[(l + /c/ + /c7') + V&/ + VFF - VitFF]
11(1 + kl + /c7') - 16(4/c/c7/')1/6[l + VFl + VTT]
-20(4/c/c7/')1/3
3{3(1 + /c/ + /c7') + 4(V&/ + VkT + VkkW)
- 4(/c/c7/')1/4[l + (W)1M + (/c7')1/4]}
2[Vkl + VkT - VkkW]
+ (4/c/c7/')"V6[l - V&/ - VF77]3
Proo/. Since R2(l,k) = l' + k and Af^/, A:) = 2/(1 + /') = 1 + k, we
have
3e2(/, k) = (1 + fc)z(l + O - 2(1 + k:) + (1 + *:)(/' + A:)
and
v*\
3e2(/, /:) :
4(1 + O + 2(1+ /'")-4(1 + I")
>\2
(! + /'>
since/'+ A: = (1 + /'2)/(! + /')■ Thus
For example,
A*(4) = (V2-1)2 and a(4) = 2(V2 -1)2
160 Modular Equations and Algebraic Approximations to n
Proposition 5.4
If /:= A*(r) and fc:= A*(9r), then
(i) M;\l,k) = yjl + 4^p-=:s(r)
and
(5.2.17) (if) a(9,) = /W«W-^^±^^
where m(r) : = 3 /s(r) satisfies
r~\ fa ^ ([^W ~ if3 + I)2
(ui) m(9r) = ————7^ — .
m(r)
Proof, (i) and («7) were established in Section 4.7. For (h) write
3e3(/, fc) = /(/-)(1 + I2) + 5(r)(l + kl + k'V) - 3(1 + k2) .
Substitution in terms of a, as in Section 4.6, and some easy algebra yield
s(r) = 2a + 1 and
e3(/, fc) = 2a(a +2) = [s(r) - l][s(r) + 3] /2
as claimed. □
A convenient variant of (5.2.17) is
(5.2.18) 8(9r) = s2(r)8(r) + 2VTs(r)
where, as before, 8(r) = V7 - 2a(r).
example 5.3. If /-:=3, then as in Section 4.6, m(r) = s(r) = V3. Then
(5.2.17) gives a(3) = 3a( £ ) - 1. But (5.1.5) shows that a(3) = V3 - 3a( 5).
Thus a(|) = (V3+ 1)/6 and a(3) = (V3-1)/2. Now we have m(3) =
(21/3 + l)2/V3 so that s(3) = (41/3-l)V3 and 6(27) = 3(24/3 - 1). Thus
a(27) = 3[(V3 + 1)/2-2173].
Proposition 5.5
If /:= A*(r) and fc := A*(16r), then
(1) Vk = 77==
1 + ^1-/2
5.2 Calculation of a
161
and
(5.2.19) (ii) a(16r) = (1 + y)*a(r) - 4Vry(l + y + y2)
where y ■ = Vk.
Proof. This may be derived similarly but is easily deduced from
Proposition 5.2 and the quartic iteration of Exercise 3 of Section 1.4. □
example 5.4. For r: = 4 we have a(4) = 2( V2 - 1) and A*(4) = (V2 - 1)2
Thus A/A*(64) = (1 - 25/8VV2-l) /(1 + 25/8VV2 + 1) = (VVl+T - 25/8) I
(VV2 + 1 + 25/8) and a(64) = 8[2(V8 -1)- (21M - 1)4] /(VV2 + 1 + 25/8)4
which gives eight digits of tt~\ Similarly, VA*(16) = (21/4 - 1)/(21/4 + 1)
and a(16) = 4(V8-l)/(21/4 + l)4.
Combining some of these calculations with (5.1.6) we have established
that
i-<**-(^
= k( Vie
k: =
k: =
V3-1
2V2
3-V7
4V2
and
J = K\3V3 E -
V3-1
+ 21I3)]k)
A::=A*(27)
with a similar identity whenever a(r) and \*(r) are known.
Comments and Exercises
Computation of A*(r) will be discussed further in Section 9.2. In many of the
following numerical exercises the algebra is not entirely straightforward.
1. a) Verify that the quadratic case of (5.2.1) coincides with the formula
given in Theorem 1.2.
b) Verify (5.2.6).
c) Verify that R2, R3, and i?4 are as claimed.
2. a) Show that in terms of Ramanujan's Gn and gn of (3.2.13) one can
write
i) A*(„)=iG„-3[VOG„-6-V^-G„-6]
ii) A*(n)=HVl + G„-12-Vl-G;12]
iii) A*(n) = gUVs" + ^12-s£]-
162 Modular Equations and Algebraic Approximations to n
b) Find similar identities for A*'(").
c) Verify the singular values given in (4.6.10).
d) In each following case, given G„ or gn verify kn. Then verify Gn or
8n-
i) G35 = 2~1/4(V5 + 1),
kl5 = (3- V5)(V5 - V3)(2 - V3) /(8V2)
ii) G35 = V5 + 2, k25 = (V5 - 2)(3 - 2 ■ 51/4) /V2
iii) g66 = V2 + 1, fc6 = (V2 + l)(V6-V2-l)
= (2-V3)(V3-V2)
iv) g6l0 = V5 + 2, A:10 = (V5 + 2)(3V2-V5-2)
= (VlO-3)(V2-l)2
v) gi8 = 5 + 2V6, A:18 = (5 + 2V6)(7V2-5-2V6).
3. Use (5.2.6) with p '■ = 3 to compute a(3).
4. Use (5.2.15) to obtain the following values of a (or 6).
i) 6(5) = V2(V5-1)
ii) 6(11) = [2x2 -(*-§) + VlTVl-(*-§)2] /3
Note: G^12 = * - § where *4 - x3 = 2.
iii) a(15) = (VB-V5-1)/2.
5. Use (4.6.8) and (5.2.6) to compute 6(13) = (7 +3VT3)G1"36 where
G1"34 = (Vl3 -3)/2.
6. Show that Rp(l, k) = /?,(*', /'), that Rp-i(k, I) = -p^R^l, k), and
that d(p'l) = -a(p)lp.
7. Use (5.2.14) to show that
i) 6(25) = 10-51/4(7-3V5)
ii) 2A*(49)A*'(49) = (1863 + 704V7) - (810 + 306V7)V273/4
'71/4-V4 + V7y
and
«(49) = I - VV[V273/4(33011 + 12477V7) - 21(9567 + 3616V7)].
8. Use Proposition 5.3 to calculate
i) A*(8) = (V2 + 1)2(1 - VV8-2)2,
a(8) = (20 + 14V2)(1 -VV8-2)2
ii) A*(12) = (V3-V2)2(V2-1)2,
a(12) = 264 + 154V3 - 188V2 - 108V6.
5.2 Calculation of a 163
9. Use Proposition 5.4 to show that
i) a(18) = (21-6V6)^8A*(18)
ii) 6(9) = 33/4(V6 - V2), as m(l) = V3 + V12
iii) 6(81) = 9V2 31/4(V3 + 1)(3 + a)a'\ where
a:=[(2 + Vl2)1/3 + l]2.
10. Prove Proposition 5.5.
11. a) Use (5.1.4) to write
-i- ^,,,2 AG(l,fc') , a(p)-Vpk2
' -^^ X5(l^)5+ AG(1,*')2
where k'-= \*(p) and the derivative is with respect to k.
b) Then (as in Section 2.5) show that
tt = lim
1
Vpk'k2Pi + [a(p)-Vpk2]qi
where x0 : = 1/k', qx '- = x0, yt :=y/~x~0, and pl := x2al{xQ + 1) while
. VxZ + 1 /Vx~
y n+i ■
2
ynYx~ + l/Yx~
P"^-\lTx~JP"
In
Qn + l'-— v
An
c) Show that convergence is quadratic.
d) When p :=1, this is Algorithm 2.1.
The next exercises develop results in J.M. Borwein [85] and Ramanujan
[14]
12. a) Show that with P as in (5.2.8),
pP(e~'vli)+P(e-"vp)
-Wp\ , p/\,-w/VP\ _ ^Vp
7T
Hint: Let q := e-w? in (5.2.10) before differentiating.
Hence show that
7T
164 Modular Equations and Algebraic Approximations to jt
b) Show that
(5.2.20) P(e~^) = ^ AG-2(1, A*'(p)) + ^~=
Thus P(e~m/^) is quadratically computable.
c) Evaluate P(e-,r3).
13. Show that
(5.2.21) - = 1-2 p'
^r^Rpi^k^-ip-l)
where Kt:= K(kt) and A:,. := A*(p2'). In J. M. Borwein [85] these
identities are studied in detail.
14. Show that when p'- = 2, (5.2.21) is equivalent to Gauss's identity
(2.5.9).
15. a) Show that
P(q) =
K(k)
3 | (k) -(1 + k'2)
This identity, which follows from Exercise 7c) of Section 3.7, is
entry 2 in Chapter 18 of Ramanujan's second notebook,
b) Prove that
CO
P(q) = 1-24^ cr^q2".
Here cr^ri) is the sum of the divisors of n, as in Exercise 12 of
Section 3.7.
5.3 FURTHER FORMULAE FOR a
We commence by establishing a multiplication result for o\
Theorem 5.4
Let p, r>0 be given. Let /:= A*(l/pr), y-=\*(plr), and k'-= \*{pr).
Then
(5.3.1) tr(pr) = M~r\y, *)[*„(/, y) + Vf ^(% *)
and
(5.3.2) *(£) = Af,(% *)[*„(/, 7) - Vf *,(?, *)
5.3 Further Formulae for a 165
Proof. Let s : = pr and ql = q1/r. Then y = k(ql) and 1= k(q\/p). We
have
i/p\
flfol)^") J 03(9)
PP(^)-P(^i'P)
+ P
rP(g)-P(g1/r)1 0¾^)
L 03(9)0'3(9l/r)
03(91/S)
so that with r: = K(y),
(5.3.3) /?,(', *) = RPV, y) j; + pK(y, k) j-
For k' = I we have L = Vjp^ and (5.3.3) becomes
<*& = £
RPV,y) + y%Rr(y,k)
This is (5.3.1). Then (5.3.2) follows from (5.3.1) and Exercise 6 of Section
5.2. (See Exercise 1.) □
For p-r, (5.3.1) reduces to
2Rp(l/V2,k)
(5.3.4)
-(P2)=^P
Mp(l/V2,k)
k:=k(e~"p).
Corollary 5.1
Let p > 0 and k ■= A*(2p). Then
¢.3.5) S(2p):=Vf^ + ^«,(^|,S).
Ptoo/. Observe that M~\y, k) = 1 + k and i?2(7, fc) = * + 7', while
y' = (1 -/:)/(1 + *). Then ' 7'
<r(2p) = (1+ fc)[^(/, y)+yfl(k+yl)
and since Rp(l,y) = Rp(y',k),
(5.3.6) .(2/0 = (1 + k)Rp(i^|, /c) + Vf (1 + fc2)
Now (5.3.5) follows from (5.2.15). □
166 Modular Equations and Algebraic Approximations to n
example 5.5. For p: = 3, k = (2 - V3)(V5 - V2) while /?3(y,A:) = l +
7& + 7'fc'. Now
(1 + k)R3(j^j ,kj=2k + k'2 + 2Vkk'
which we rewrite as 2k(l + V2g66 + gf). Similarly \l\k'2 = Vlgfk, and we
can write 6(6) = [V6(V2 + 1)2 + 2V2(V2 + l)]k. Thus
8(6) = ggA*(6)(2V2 + 2V3 + V6)
and
«(6) = g6A*(6)(3-V2) = 5V6 + 6V3-8V2-ll. D
Corollary 5.2
Letp>0and/:=A*(l/3p), y:= \*(p/3), and k-= A*(3p). Then
(5.3.7) cr(3p) = Af 3^(7, *:)[/?,(7', fc) + Vf (1 + 7^ + y'k')]
and
(5.3.8) cr(|) = M3(7, A:)[k„(7', *) - Vf (1 + 7* + ?'*')_
where
3p
"1(|) = Af 3^(7,^) = )/1 + 2^
Pwo/. This follows from Theorem 5.4 and (4.7.9). (See Exercise
3.) □
example 5.6. By piecing together various formulae many more values of a
can be obtained. We illustrate this as follows. Given a(6) and A*(6) from
the previous example, we may use Proposition 5.3 to compute a(|). Then
the functional relation for a, equation (5.1.5), yields a(§) = (5V6 — 6V3 -
8V2 + 11)/3 = 0.5138118 Similarly, given a(3) and A*(3) from
Example 5.3, we can use Proposition 5.3 to find that a( f)= 66 + 47V2 - 38V3 -
27V6 = 0.5138837 ....
Indeed numerical computation of the maximum of a [using (5.1.10) and
Newton's method] shows that it occurs around 0.709 with a value of
approximately 0.514275. [Note that (| + f )/2 is very close to this point.] In
addition, one may observe graphically that a increases up to this value and
then decreases.
5.3 Further Formulae for a 167
Comments and Exercises
From the given formulae and appropriate singular values, many values of a
can be found in closed form. Some of the cleanest are given below.
1. Establish (5.3.2) of Theorem 5.4.
2. Use Corollary 5.1 and the given value of gn to compute a(ri) or S(ri).
i) g?o=V5 + 2, a(10) = g?0A*(10)(3V5-4)
ii) g?8 = (V3 + V2)2, «(18) = g?8A*(18)(21-6V6)
iii) g222=(V2 + l), a(22) = g«2A*(22)(33-17V2)
iv) g28 = (V29 + 5)/2, a(58) = g568A*(58)(99V29-444).
(This requires having a tractable form of W29, which we have not
given but which may be found in Greenhill [1892].)
v) gl42gl4 =2 + V2, 6(14) = gt4A*(14)[(8 + 6V2) + Vl4>n
vi) g3° g3° =6 + 5V2, 6(30) = g*0A*(30)[(56 + 38V2)
6.-6
vii) g46 g46 = 18 + 13V2, 6(46) = g46A*(46)[(200+142V2)
g646A*(46)[i
+ V46g646].
3. Establish Corollary 5.2.
4. Use the values of G3p, Gpl3, and M(p/3) given in Exercises 8 and 9 of
Section 4.7 to prove that:
i) o-(21) = 3G2-16(11 + 6V3 + 2V7 + V21)
ii) o-(|) = G7/63(11-6V3 + 2V7-V2T)
,_6 /2V3 + V11
iii) 0-(33) = 3 G;36tJ (11 + 13V3 + 5VII+V33)
) ^^r.W^^ai + ^-svn-vsj)
,_6 /2VI9 + 5V3
v) o-(57) = 3 G576y (5VB7 + 13V19 + 49V3 + 19)
) .(f)-G^?™^(SV57-13Vl9 + 49V3-19)
and
168 Modular Equations and Algebraic Approximations to n
IV3 +1 \3
vii) o-(93) = 6G~36^——J (15V93 + 13V31 + 201V3 + 217)
viii) o-(y)=2GJ16/3( 32-1) (15V93-13V31 + 201V3-217).
Hint: In each case express the right-hand-side of (5.3.7) or (5.3.8) as a
function of (yy'kk')1'4 by using the cubic modular equation (4.1.16).
5. The values of a given in Exercise 2 are all expressed in the form
a(p):= gpA*(p)ap for a quadratic number ap. This is also true for
some other even p.
a) Since g6p\*{p)~l/(2g6p), deduce that
«,:=-£-«
and estimate the accuracy of the approximation,
b) Show that
_ 14 + 10V2 140 + 26V29
77-22 ~ 33 - 17V2 and ^58 ~ 99V29 - 444
which give four and eight digits of ir, respectively.
6. a) Weber gives G27 + G~2 = (1 + VT7)/2 and gj4 + ^m = (3 +
VT7)/2. Hence evaluate o-(17) and o-(34) as cleanly as possible,
b) Similarly, /(V11!^) =: x solves *3 = 2x + 2 and £0/=38) =: V2x
solves x3 = 2*2 + (2x + 1)(V2 + 1). Attempt to evaluate a(19) and
a(38).
7. Use (5.3.4) to evaluate 6(25). (Compare Exercise 7 of Section 5.2.)
8. Given that G% = V37 + 6, one can show that
6(37) = (101 +21 V37)G
-6
37
Again we have not given a tractable form of W31.
Show that the perimeter of an ellipse with major axis a and eccentricity
k is given by
2a
V?
V +[V?+ 8(r)]K(k)
L2K(k)
where k'-= X*(r). In particular, if k '■ = tan (71-/8)
flV?
r(l) + r(|)
r(i) r(|)J
5.4 Recursive Approximation to tt 169
(See Exercise 4 of Section 1.6.) If k ■ = 1 /V2 = sin (ir/4)
fir
p = a-
r(!) r(|)J
Further such evaluations follow from Table 9.1, which gives K(\*(n))
in T terms for 1 < n < 16.
5.4 RECURSIVE APPROXIMATION TO tt
Theorem 5.3 is easily recast as a pth-order iterative method to compute tr.
A General Iteration 5.1
Let p^l be a positive integer. Let r > 0 and set
aQ:=a(r) and kQ '■= A*(r) .
For n in N compute kn+l by solving Wp(k2n, k2n+l) = 0 and let
mn:==M~\kn,kn+l) rn:=Rp(kn,kn+l)
sn:=[mnrn + m2n(l + k2n)-p(l + k2n+l)]/3
and
(5.4.2) o„+1:=mX-p"Vre„.
Then, for rp2">l,
(5.4.3) 0 < «„ - tt"1 = 8(pnVr - tt"1) e^"^ + 0{p2nr e'""2^)
sl6p"Vrg"i',VF*.
Proof. This is a straightforward consequence of Theorem 5.3 and
(5.1.11) because an : = a(p2"r). D
We may also write this as an identity for v.
Theorem 5.5
Let p ^ 1 be a positive integer and let r > 0. Let m, and e„ be as in (5.4.1).
Let aQ := 1 and an+1 := m~1an. Then
(5.4.4) ,._J^ki^_
a(r) - Vr E„_0 p a„+1e„
170 Modular Equations and Algebraic Approximations to n
Proof. Rewrite (5.4.2) as
(5.4.5) a2n+lan+l = »««« - ^rp"a2n+isn ■
Then summation yields
m
«0)-"i+i«m+i = V7:2 p"a2n+le„
n-0
and, as am converges to it 1 while am converges to AG(1, k'Q) [since
am = K(km)tK(ka)], we obtain (5.4.4). D
EXAMPLE 5.7.
a) When p' = 2, the proof of Proposition 5.3 shows that sn = 2kn+l. In
terms of the AGM: a2n+lsn =4a„+1c„+1/2 = c2nl2, and (5.4.4) gives a
family of identities extending (2.5.9), or Algorithm 2.2 (the case r : = 1).
b) When p : = 3, we similarly use Proposition 5.4 to deduce that a2n+lsn~
(an - fl„+1)(fl„ + 3an+1)/2, where an+l = m~1an can be computed from
the cubic recursion in Proposition 5.4. (See also the cubic iteration
given below.)
We now specialize Iteration 5.1, changing notation as appropriate.
A Quadratic Iteration 5.2
Let r>0. Let a0 := a(r) and k0 ■= A*(r). For n in N let
(5-4.60 k"^:=TTK
and
(5.4.6b) «„+1 := (1 + kn+lfan -2"+lVrkn+l.
Then, for a-22">1,
0< a - rr~l ^16-2"Vr e
-2"v7ir
A Quartic Iteration 5.3
Let r>0. Let a0 := a(r) and y0 : = VA *(/■). For n in N let
l-</\
y
(5-4-70 y„+i:~ 4/l :
and
(5.4.7b) «„+1 : = (1 + yn+l)\ -4"+1Vry„+1(l + j„+1 + y2n+l)
5.4 Recursive Approximation to jt 171
Then, for M2";
-!_..,; All ,— -4"V?1T
0<a„ -ir <16-4"Vre
Iteration 5.3 just peforms two steps of Iteration 5.2 as one, with some
computational saving. The error bound given in each case is very accurate.
A Cubic Iteration 5.4
Let r >0. Let a0:=a(r) and m0: = m(r) = \/l + 2V2G3gr/G9r. For n in N let
(5.4.8j) ,^.^-ir+if ±
and
(5.4.8«) an+l:=s2nan-3"Vr^
Then for a-32">1,
si + 25-3
-1--1^ 1« .r- -3"V7ir
0<a„-7r"1<16-3"V7e
One can also provide a cubic iteration using kn instead of sn. This is
somewhat more inelegant and no easier to initialize. [See Exercise 3b).]
Large numbers of initializations for Iterations 5.1, 5.2, and 5.3 are
available in the previous sections. We collect some of the cleanest in Tables
5.2a) and b). The cubic information is given in Table 5.3. Recall that
[m(N)-l][r(N)-l]=4.
All the information necessary to verify these values lies in Section 4.7 and
the previous sections of this chapter.
When p : = 7 we can write the iteration cleanly in terms of w„ := klJ*.
Indeed
(5.4.9) (l-«„«„+i)8 = (l-«*)(l-«*+i)
while
7"A+i(l - «„«„+i)[l - «„«„+i + ("„"„+i)2]
m„ .=
4
and
r„ : = 3[1 + (uBun+1)4 + (1- «„uB+1)4]
. ml(l+u8n) + mnrn-l(l+u8n+l)
*-•" 3
TABLE 5.2a. Values of G~12 and a(N) for N odd
N 2A*(JV)A*'(iV) = G~12 a(N)
1
2
V3-1
1
3
5
7
9
13
15
25
27
37
1
1
2
V5-2
1
8
(2-V3)2
5V13-18
sm
(V5-2)4
(2l/3 _ 1}4
2
(V37 - 6)3
V5-V2V5-2
2
V7-2
3 - 33/4V2(V3 - 1)
2
Vi3 - V74VI3 - 258
2
VB-V5-1
5(l-2-51/4(7-3V5))
2
3(V3 + l-24/3)
2
37-(171- 25V37)(V37 - 6)1/2
2
TABLE 5.2b. Values of g~12 and a(N) for JV even
N 2\*(N)/\*'2(N) = g-12 a(N)
2 1 V2-1
2(V2 -1)2
6 (V2-1)2 (V5-V2)(2-V3)(3-V2)(V2 + 1)
10 (V5-2)2 (7 + 2V5)(VT0-3)(V2-l)2
18 (V3-V2)4 3(V3+V2)4(V6-l)2(7V2-5-2V6)
22 (V2-1)6 (V2 + l)6(33~17V2)(3V22-7-5V2)
58
1
2V2
(V2-
(V5-
;V5-
(V2-
'V29
-I)2
-2)2
V2)'
-I)6
-^
/V29 +5\6
( = ) (99V29 - 444)(99V2 - 70 - 13V29)
V
172
TABLE 5.3. Selected cubic invariants
m(N) r{N) a(N)
1019 + 416VS - 720V2 - 588V3
6
47V2 + 27V6 - 38V3 - 66
3
8V2-4V6-6V3 + 11
6
V3 + 1
6
1
2
5V6-6V3-8V2 + 11
3
1
2
2[54V6 + 77V3 - 94V2 - 132]
3
V2-1
16(3V6 + 4V2-2)
1/18
1/12
1/6
1/3
1/2
2/3
1
4/3
2
8/3
3(3V6-4V3-5V2 + 7)
6V3+9-4V6-6V2
(3-V6)(V2+l)
V3
V6-V2 + 1
V3(V2-i)(V3 + V2)
V3 + 2V3
V6 + V2-1
— ■
-
(3 + V6)(V2 + l)
3 + 2V3
V5+V2 + 1
V6 + V3
—
V3(2 + V3)
V3 + V2
\/3(V3 + V2)
3[V3 + 1 + V2(V3-V2)]4
[1+21/3]2 _ V3-1
V3 2
4
5
6
7
8
9
18
—
V1+2V3 + 2V5
-
/ 6 + V21 + V27 + 6V2T \ "2
[(2V3 + 2)"3 + i]2(V3-i)
V2-31'4
-
V3 + V3 + V9 + 6V3
[V2(V2 + l)2/3 + l]2(V2-
V3
V(V2 + 1)(2 + V3)
-
1)
(V3-V2)[l + (4 + 2V6)"3]2
6-4V2
V5-V2V5-2
2
5V6 + 6V3-8V2-11
V7-2
2
(2 + V2)3(l-V2V2-2)2
3-33"V2(V3-l)
2
3[721 V2 - 1019 + 588V3 - 416V5]
173
174 Modular Equations and Algebraic Approximations to jt
We present a few illustrations of the behaviour of the algorithms. We
note that an in the quartic algorithm is just a2n in the quadratic algorithm.
r = 2
r = l
Digits Correct in Quadratic Algorithms
n=\ 2 3 4 5 6 7
0
2
5
3
5
12
8
13
26
19
28
55
41
56
112
84
120
227
171
242
458
344
489
919
Digits Correct in Cubic Algorithms
n=\ 2 3 4 5
r=l
r = 7
2
8
10
30
34
93
107
288
327
873
Digits Correct in Septic Algorithms
n = \ 2 3
r = l
7
22
63
173
464
>1000
The updates for a(25r) and a(49r) are studied again in Section 9.5, where
solvable versions of the quintic and septic iterations are given.
Comments and Exercises
Additional information on these iterations can be found in various of our
papers. In particular, Borwein and Borwein [86] indicates the genesis of the
quadratic iterations and [84b] that of the cubic iterations.
1. Verify the claims in Example 5.7.
2. a) Prove that, in the notation of Theorem 5.5,
E
-(/:) = 1-2 p"a2n+lsn k=:kQ.
b) Observe that this extends Algorithm 1.2.
3. a) Verify Iterations 5.2, 5.3, and 5.4.
b) Use (4.1.24) to produce another form of the cubic iteration.
5.4 Recursive Approximation to -n
175
a) Determine the initial values for the cubic iteration with r '■— Nl 3
and N: = l, 11, 19, 31.
b) Compute 6(45), 6(63), 6(243), and 6(54).
Derive the following version of the septic (seventh-order) iteration. Let
a0 := a(r) and u0 '■= A*(r)1/4, and generate un+l decreasingly from
Let
(l-u8n)(l-u8n+l) = (l-unun+l)8
u„u„
ft ••=
' ™n n + 1
8
u — u u ,,
n ™n n + l
and
«„+i : = *n«n +7"V7(7 -sn - tn).
In .
Then for rT" > 1
0<a-w x<\(s-TVre
-7"v7ir
Derive the following version of the quintic (fifth-order) iteration (given
in Hughes [84]). Let a0:=a(r) and u0 = A*(r)1/4, and generate un+l
from u6n+l -un- 5(w„w„+1)2(w„ - u2n+l) - 4w„w„+1[l - (w„w„+1)4] = 0.
Let
yn
an
bm
o 5
™n™n+l
^u2n + 5u2n+l + 2xn
= 5u2n + u2n+l-2yn
6-
dm: =
._ (1 ~ M«+i)[5(w'+i + *„) + c„(y„ - w*+1)]
4a„
+
(1 - u*n)[u2n + xn + 5cn(yn - u2n)}
4b„
176 Modular Equations and Algebraic Approximations to n
Then
satisfies
for r52">l.
«« + 1 := 5cnan + 5" + W^„ + Mn + 1 "~ CnUl)
< a„ — 7r :£ 16 • 5 V7 e
7. Show that, in Iteration 5.1, convergence is indeed pth order.
Observe that for a variety of other values of p we have the information to
make Iteration 5.1 entirely explicit. For example, p '•= 17 is satisfactory since
we have i?17, and M17 is given in (4.6.9). Moreover, (4.6.9) also gives a
form of the modular equation of order 17. Using G\7 + G^12 =40 + 10VT7
and
^(17)= (llV?+45>
(42 + iiVn)V13+25VI7-(W7+277VI7>
we have an algorithm for ir which gives more than VT7 • 17" digits at step n.
8. Let pal, let /:=A*(p) and fc:=A*(49p). Set zp : = (GpG49p)~V
V2.
a) Show that
M7(/, fc) =
Vi-
Vi-
2zp(l
.-,-24
_Cj49p
-4z -
-¾)
-Vi
Vi-
-4z
p
y-,-24
GP
UzJl - z.)
b) Ramanujan [14] gives
and
,_i V4 + V7-71
'49
G147 -2
\2 + V3
Verify that
V7
(28)1
vt^
where zl = G493/V2, and
5.5 Generalized Elliptic Integrals 177
«) M7-(,.(3).A.(14,))-vi^:^/2
where
_ [V7 + V3-2(28)176]3
H 48V3
c) Compute that
i?7(/,fc) = 6(l-zp)2.
9. Verify, from Table 5.2, that
i) a(22) = (V2 + 1)3(33 - 17V2)(3VII - 7V2)(10 - 3VTT)
and
/V29 + 5\3
ii) a(58) = (V2-l)6(13V58-99)(99V29-444)(^^ J .
10. Ramanujan gives the following form of the modular equation of degree
7:
(5.4.10) (04 + (J)' + 7 = 2V2{M3 + (xyy3}
where x : = G49N and y := GN. This is Entry 19(ix) in Chapter 19 of the
Second Notebook (Berndt [Pr]). One should compare Exercise 6 of
Section 4.1 and Exercise 12 of Section 4.7.
a) Verify G7, G49, and G147.
b) Show that
(5.4.11) (j)4 + (^)4-7 = 2V2{(^)3 + (^r3}
where x-= g49N and y := gN.
c) Verify that g2u + gu=V2 + l and that
+ gss1 = 2 {V2 + V14 + 4V14}
5.5 GENERALIZED ELLIPTIC INTEGRALS AND RATIONAL
AND ALGEBRAIC SERIES FOR 1/ir and 1/K
We begin with some results on hypergeometric functions [see (1.3.5)].
Changing notation slightly we write
178 Modular Equations and Algebraic Approximations to n
V (a)n(b)n *"
(5.5.1) JP^a, b; c; x) ■= Z, —tr— -7
n-0 \c)n m
for appropriate values of the variables. Here we use the rising factorial or
Pochhammer symbol (a)„ := T(a + n)/T(a) = a{a + l)(a + 2) • • • (a + n - 1).
Similarly the generalized hyper geometric function 3F2 is defined by
(5.5.2) 3F2(«, 6, c; d, e; *) .= 2 ^^ ^
again where appropriate. (See Slater [66].) We define the generalized
complete elliptic integrals of the first and second kind by .
(5.5.3) K,{k) := f • 2Fl(\ -s,\+s;l; k2)
and
(5.5.4) E,(k) := I • 2FX(- \ - s, \ + s; 1; fc2)
for |s| < I and 0<k^l. We still denote the complement k' ■= Vl — fc2 and
write K;(fc) := £,(*') and E;(fc) := £,(*)• Now K "•= ^o and E := £0 are the
classical elliptic integrals, and each Ks and Es admits many integral
continuations. (See Erdelyi et al. [53, Section 2.12].) Moreover one has
kk'2
(5.5.5) Es = k'2Ks + — Ks.
This may be verified directly or by using Erdelyi et al. [53, Section 2.8].
[Here K, = {dldk)Ks.} Similarly, using Erdelyi et al. [53, vol. 1(13), p. 85]
we have
(5.5.6) E.K> + Kn-K,K>=l?gg.
When s = 0, this is Legendre's relation (Section 1.6). The following
relationships will be helpful.
Proposition 5.6
For 0< h < 1/V2 we have
(") ~Ks{h) = 2Fx{\-^\+S-;l;{2hh'f)
(b)
TT
= ,F2[\- s,\ + s,\;l,l;{2hh'f).
5.5 Generalized Elliptic Integrals
179
Proof, (a) is a special case of Kummer's identity given in Rainville
[60, p. 67] or in Erdelyi et al. [53, Section 2.11]. It may be verified by
showing that both sides satisfy the appropriate hypergeometric differential
equation (given in Exercise 7 of Section 1.3), are analytic, and agree at
zero, (b) is a special case of Clausen's product for hypergeometric functions
given in Slater [66, p. 75] and Exercise 13. □
In the sequel we will again use Ramanujan's invariants of (3.2.13)
(5.5.7) G:=(2kk')~in2 g-={2klk,2yin2
and
2U4gG = (k2/2k'yUl2 ■
We also need Klein's absolute invariant J, which was introduced in
Theorem 4.4. This is
Ysstt (4G24-1)3 (4g24 + l)3
*" j 27G24 27g24 '
Ramanujan [14] talks about "corresponding theories" for Ks, s •"= 3,4, g,
to that for K. For 5:=5,3 this is explained by the next result.
Proposition 5.7
(a) Kll4(h) = (l + k2)U2K(k)
if 2M' = [(g12 + g~12)/2]_1 and 0<fc<l/V2, 0<A:<V2-1.
(b) KU3(h) = [l-(kk'f]U4K(k)
ii2hh' = J~V2 and 0<fc<l/V2, 0<A:==1/V2.
Proof. These may be discovered by piecing together the quadratic and
cubic transformations given in Erdelyi et al. [53, Section 2.11]. They may be
verified by establishing that both sides satisfy the same differential equation
(derived from the appropriate hypergeometric differential equation), and
both functions involved have the same finite value at zero. D
There is a corresponding relation for Kll6. Since it is a little less concise,
we consider it at the end of the section. Combining these last two
propositions leads to a variety of alternate hypergeometric expressions for K and
K2.
Theorem 5.6
2K /1 1 \ 1
(fl) (0 —{k) = 2F\-A,-A;l;{2kk')2) °^^^vf
180 Modular Equations and Algebraic Approximations to jt
(ii) fw^-^H;!;-!)2) 0^V2-1
(Ui) ^^) = k^%Fi}A,\-,l-,-{-^)2) 0^k2^2V2-2
(b) (iv) ^(^) = (1 + ^)-^(^^^(^-^)-2
(") vw=(^2-^2)"1/2^(iI;1;-/Gl2 G"2x"2
0<A:<V2-1
)1
0<A:<
21/4-V2^V2
(c) (,i) ^(k) = [l-(kk')2]-1'\Fl(^,-^;l;r1) O^fcs^
V2
Proo/.
(a) We let 5 := 0 above to deduce (i). Then («) follows on replacing q by
—q in the theta function representations of K and (2kk')2. This is
Jacobi's imaginary transformation of Exercise 7d) in Section 3.2. We
derive (Hi) from (ii) by replacing /c: by kl-= (1- k')/(l + k') and
using the quadratic transformation K(kr) = [(l +k')/2]K(k) of
Theorem 1.2.
(b) (iv) comes from letting s'-=\ above. Then (v) again follows from
Jacobi's imaginary transformation.
(c) (vi) comes from letting 5:=3 above. □
Similarly,
Theorem 5.7
For k restricted as in Theorem 5.6
(a) (i)
(ii)
7T
2K
(k)
= ,F2(\,\,\;lA;(2kk'f)
= Z,'~2
5.5 Generalized Elliptic Integrals
181
(b)
(c)
(iv)
(v)
(vi)
2K
2K
«]-(1^)-^4 f.lu.iKS41-)")
Proof. We combine Theorem 5.6 and Proposition 5.6. □
Thus we have provided series for K and K2 in terms of each of the six
invariants. One can produce other such formulae by further use of
transformation identities. For example, Bailey's formula in Erdelyi et al.
[53, (2), Section 4.5] with a'.= \ and b-= 1 gives
— (k)\ =[l-4(2kk )] 3¾ 6'2^^-4(2/^13J
(5.5.9)
= {k>* + l6ey^F2[\,\,\-,l,l;-21g
3X2\6'6'2'
(r + 4)
+ 4?)
Note also that we may use (5.5.5) with $:=0 and Theorem 5.6 to
produce similar series for E. We are now ready to build our series. Recall
(5.1.4), which we write as
1 AKK AK2
- = VNkNk'N2 — + [«(A0 - VNk2N] — kN := A*(AT)
(5.5.10)
or
1 AK AK
- = VNkNk'N2 — + [«(Af) - VNk2N] — kN := A*(iV) .
(5.5.11)
Thus given a(N) and A*(AT), we can combine (5.5.10) with Theorem 5.7 to
produce series for 1/tt. In like fashion we derive series for 1/K or for the
Gaussian AGM, M(l,k')= ir/2K. In each case we have [(2KI>n)(k)]2 =
m(k)F(<f>(k)) for algebraic m and <f>, while F(<f>) has a hypergeometric-type
power series expansion E~_0 an<f>". Then AKK/tt2 = |mF + jm.(j>F(<f>).
Substitution in (5.5.10) leads to
- = 2 an -~ kk'2m + [a(N) - VNk2]m + -^- m J kk'2
<f>
<t>".
(5.5.12)
182
Modular Equations and Algebraic Approximations to jt
Thus for rational N, the bracketed term is of the form a + nb with a and b
algebraic. We now specialize this for our invariants.
series mGjy: For N>1,
1
(5.5.13) - = 2
where
TT
n-0
(*)■
n\ J
~12\2n
an(N)(G-Nli)
an(N) := [a(N) - VNk2N] + nVN(k'N2 - k%) .
series in gN: For AT s= 2,
(5.5.14)
where
1
^ = 2 (-i)"
T „=0
r/-1
(_a
n! J
12N2n
bn(N)(g-N")
bn(N) := «(A0^~2 + "Viv(^|f) .
series in gm = 2 gNGN: For N > \,
(5.5.15)
^2(-1)-
T „ = o
G)„
n! J
c„(iV)(g4^2)2"
where
c„m-
a(N)~^fk2N
k'N-' + nVN(k'N+ klT1)
series in xN := (g» V*")"1 = (T+W ForAf>2'
(5.5.16)
1_ _ Y (■*)n(2)n(4)n , />A 2n + l
_ _ ^ ,- .^.3 an\ly)XN
TT „_0 (n!)
where
W:=
<*(#)*/ _ ViV _12'
1 + 4 4 8n
+ nVN
8%-g-12
Z)
SERIES in yN:
(5.5.17)
where
5.5 Generalized Elliptic Integrals
■ 4(kNkN)
183
- = 2j (-1) 7-7^ en(N)yN
(my
en(N):
a^y-1 VN
k'J-kl 2
'AT*-7 AT
!(^)
+ nsmiG" + G:"'
series in JNl. For N > 1,
(5.5.18)
where
A_Y (6)n(l)n(6)n ^ /An/ r-l/2y2n + l
H" «=0
w
w)(j-Ni,r
fn(N) := 3^ [ViVVl-G-24 + 2(«(AT) - VNk2N)(4G2N4 - 1)]
+ nVN 3Vf [(8G" + ^Vl-G^24] •
There are many rearrangements of these formulae. In similar fashion we
may deduce that for all N,
(5.5.19) M(l, k},) =
where
n = 0
mn(N) := [«(iV) - VM^] + nlVNk'J
and for N>1,
(5.5.20) M(l, ^) =
where
CO
7T 2 (-l)X(Af)
n-0
L n! J
7**
In
/-1
nn(N):=a(N)k^ l + n2VNk'N
These use the hypergeometric definition of K and (5.5.11). Also using
(5.5.11) and Theorem 5.6(ai) and (aii) leads to, for N>1,
184
Modular Equations and Algebraic Approximations to jt
(5.5.21)
where
and for JV>2,
M(1,^) = tt2 o„(N)
n=0
(U,
n! J
(G^12)2"
on(N) :- [«(AT) - VlM2,] + n2VN(k'N2 - ^)
(5.5.22) M(l,^) = 7r2(-irp„(JV)
where
n! J
-\2\2n
urn
pn(N) = a(N)k'N-1 + n2VN[l + fc^"1 .
Similar formulae exist in the other invariants.
From our formulae for ir-1 and the values of a(N) and X*(N) = kN
previously derived we have explicitly computed all but two of the 14 series
which Ramanujan gives without justification in [14, Section 14]. Ramanujan
gives series of the form (5.5:13) for N '•= 3, 7, 15, of the form (5.5.16) for
N := 6, 10, 18, 22, 58, and of the form (5.5.17) for N := 5, 9, 13, 25, 37. He
gives series of the form (5.5.18) for N:=3 and 7 and two in terms of Kll6
which we derive below. In each case manipulation of the formulae produces
the desired result. Indeed a(37) and a(58) were obtained by calculating
e0(37) and d0(58) to high precision. In fact, with N ■= 58, using (5.5.16) and
Exercise 2 of Section 5.3 produces
(5.5.23) - = 2V2 2 (3)"(2)"(^" (1103 + 26390n)(-^-)
ir „„0 (n!) ^99/
2n + l
irVlV
which adds eight digits a term! Since kN behaves like 16e~"v"', it is very
easy to estimate the convergence rate in each series. For N at all large, the
rate while linear is most impressive. In the exercises we give various other
examples. Bailey [35, p. 96] gives (5.5.14) with AT : = 2 [equivalently (5.5.15)
with N'■= f] and ascribes this to Ramanujan. The series is
(5.5.24)
\ = 2 (-D-
ti)n
n\ J
{An + 1) ,
Correspondingly, (5.5.22) with N'- = 2 yields
(i)J2
(5.5.25) M(l,l/^) = ^-E(-l)"
n\ J
(8« + 1) ,
while for N '•= 1, the series in (5.5.20) diverges.
5.5 Generalized Elliptic Integrals 185
We now return to Kll6. From Goursat's exhaustive list of transformations
(Goursat [1881]) we obtain:
Proposition 5.8
For k < 1 A/2 and h the smaller of the two real solutions of
.2x3
(9-8Ji')J (4G"-1)J
64h6h'2 K ' 21G2
(5-5-26) ^71677^ = ^)-^^4
one has
Proof. Formula (126) in Goursat [1881] and Propositions 5.6 and 5.7
combine to produce (5.5.26) and (5.5.27). □
There is a corresponding formula for the larger solution. This implicit
formula for I in terms of k can be solved explicitly as follows (Exercise 19).
For ft<(V3 + l)/2V2,
(5.5.280 ^:=(MT2 = (^+21/V")2
where
(5.5.28b) x := (^^) + § V3 + 2(71/3 - 1)A - A2
and
(5.5.28m) A: = Vl + J1'3 + J2'3 .
In each case (s '■ = j, \, g ) the transformation from k to h can be
described very simply analytically. Consider the generalized singular value
function A* defined by
(5 5 291 K's(K(N)) rj-
Then it transpires that in all three cases,
/rr^x r7TK's(K(N)) K'( A*(C,JV)) „ „ 2/ x
(5-5-3°) ^ iMm = KindN)) Cs := 4 cos2 (.,).
186
Modular Equations and Algebraic Approximations to jt
(This can be verified from formulae in Goursat [1881].) In other words, the
Mh singular value of K is sent to the Mh, (A/72)th and (A/73)th singular
values of Kll3, Kll4, and Kll6, respectively. Thus
12 _, ~12N -1
(5.5.310 (^2^) =2A?/4(Af)A?;4(A0
and
-1/2 .
(5.5.31«) J-^ = 2X*ll3(N)XX'l3(N),
while various singular values for KV6 are given in Exercise 19b). If we now
define as by
(5.5.32) ^.^2^-^1-0 *:-*:(*>
we may use (5.5.5) and (5.5.6) to write
(5.5.33) ^ = l[as(N) + nbs(N)]
where
(n!)3
G;24"(iV)
(5.5.340 as(N):= [as(N) - VN\f(N)] * + 2"
cos (its)
(5.5.34/0 5!W: = VWl-G;>)^
while
G;12(iV): = 2A:(iV)Ar(iV)
The details are left as Exercise 20. Now, with some perseverance, we can
derive series including Ramanujan's missing formulae, which come with
s:= | and N: = 4 and 5 in (5.5.33) [See Exercise 20b).]
Finally, we observe that the result of Exercise 22 in combination with the
discussion of T values in Section 9.2 shows that the formulae for Kll3, Kll4,
and Kll6 are in essence the only such formulae.
Comments and Exercises
In Section 13 of Ramanujan [14] one finds an explanation of seriesof the
form (5.5.13) without many details. Then in Section 14, with essentially no
5.5 Generalized Elliptic Integrals
187
explanation, he gives his other 14 series. Hardy quoting Mordell (in
Ramanujan [62]) observes that "it is unfortunate that Ramanujan has not
developed in detail the corresponding theories." The explanation as
provided by this section is a bit disappointing, since for all these theories, all we
have are well-concealed versions of the original theory for K. Nonetheless
we can explain all of the beautiful and mysterious series.
1. Prove the generalized Legendre identity of (5.5.6).
2. Prove Proposition 5.7.
3. Verify formulae (5.5.13) to (5.5.23).
4. Show that
-^-iriJ}§f^(im+2imnim)
2n + l
5. Show that
V2 _y
,1/4 ^-1
6. Show that
2
= 2 (-i)"
G)„
OX,
[(3 - V3) + 12n](2 - V3)
4n + l
3tt „=o
Show that in (5.5.18)
/„(2) = (28n + 3)V3/9
[(7 - 2V6) + 28n] (V3 - V2)
8n+2
3\3
while
and
(!)
•C = (£)3
/„(4) = (63n + 5)V6/3
/„(7) = (133n + 8)9V3/4 /^ = (^)3.
8. Show that
9. Show that
4'7iHf0"
n\ J
4'w) = f„!/12"+1>
ti)n
n\ J
188
and
Ml,
Modular Equations and Algebraic Approximations to jt
(ai2,
10. Show that
CO
2V2M(21/4, 2~1/4) = 7T E (-l)"(12n + 1)
n = 0
11. a) Show, using (5.5.13) with n := 7, that
_1 = y (2n\3 42n + 5
* n%\n) 212n+4
This series of Ramanujan's has the property that, as J. Holloway
has observed, it can be used to compute the millionth (binary)
digit of IItr without computing the first half million digits. Note
that the terms are exact binary fractions whose numerators grow
roughly like 26" while the denominators are 16-212^
b) Observe that formula (5.5.23) can be recast as
1 2V2
Siw11103"6390"1^)"-
7T 9801 „=0 44"(n!)4
12. Use Exercise 6 of Section 1.3 to show that when re(c — a — b) > 0,
T(c)T(c -a-b)
2F1(a, b; c;l) =
r(c - a)T(c - b)
[This is easy for re(c)>re(6)>0.]
13. Clausen's product formula is
[2Fx(a, b; a + b + \; z)]2 = 3F2(2a, a + b, 2b; a + b + \, la + 2b; z) .
Prove this by showing that both sides satisfy the same generalized
hypergeometric equation and are analytic at zero, with value 1 there.
14. Verify that
r K(k)dk = r12 r12
J° Vl - k2 J° J° Vl
da d6
(sin2 a sin2 0)
b) ^,>imV(T^)£4(2;)4-]V.
5.5 Generalized Elliptic Integrals 189
Use Exercise 12 and Theorem 5.6(ai) to compute K(l/V2). Similarly
use Theorem 5.6(biv) to compute K(V2 — 1). (Compare Section 1.6.)
a) Verify that
i) sin (tx) = (t sin x)2Fl y -^-, -^-; -; sin xJ
ii) arcsin* = x-2Fl(j, \; \;x )
iii) log (x + Vl + x2) = x ■ 2Ft(k, h \; -x2).
b) Use Clausen's product to deduce that
c) Similarly deduce Euler's formula (Bromwich [26])
. 2 If (2*)2
arcsin x= - 2j
"'»f;)
[This is also the limiting case of b). See also Exercise 16 of Section
11.3.]
d) Find similar formulae for sinh [t log (x + V1 + x2)] and for
log (x + Vl + x2).
e) Establish that
2l0g>(iW!)=i(_-ir'
»f;)
and that
f) Prove that
2 °° 1
7T _ Vi 1
18 «-i 2/2n\
"I J
sin (ttQ _ y (0,,(-0,,
irt «=o (n!)2
a) Prove that
y "'»f;)
190 Modular Equations and Algebraic Approximations to •n
b) Show that the previous integral is £(3)/10. (See Exercises 12d)
and 12e) of Section 11.3.)
18. Let Ga(x):=2Fl(a,(2a + l)/2; 2a + l;x).
a) From Clausen's product, establish that
b) Show that G_in(x) = (1 + Vl - x) 12. Thus
(Note that G_1/2 must be evaluated as a limit.)
19. a) Establish the solution of (5.5.28). Hint: x:=h'2/h2 satisfies a
simpler cubic than h2.
b) Now verify that the following solutions obtain:
i) Gf = 2 gives H24 = l
ii) g12 = (V2 + l)2 gives H? = 2
iii) Gl2 = (2 + V3)2 gives H? = 2(2^f
iv) g^=V2(V3 + l)3 gives HT=Y
N ^12 o/^5+1\4 • rT24 125
v) G is =*{—£—) gives ^" = -4-
12 . /14V3 + 13V2\2
vi) g^ = (V3 + V2)4 gives Hf = [ ^= J .
Hint: For iii) and vi) use the increasing form of (5.5.28), which
gives H3N in terms of GN. This entails changing the central sign in
(5.5.28«).
20. a) Establish the general formula for ir~l given by (5.5.33) and
(5.3.34).
b) Verify the following values of aU6(N) + nbll6(N):
i) N •■= 2 gives (6n + l)/3V3
ii) N : = 4 gives (60n + 8)/27
iii) N := 5 gives (66n + 8)/15V3.
c) Compute the values of aU6(N) and bll6(N) for N'-=3 andN' = 6.
5.6 Other Approximations
191
21. By comparing (5.5.33) with (5.5.16) or (5.5.18), verify the assertions
of (5.5.31) for various N.
22. a) Use Exercise 12 to establish that
1 \ T(\ + s/2)T(i-s/2)
K
(£)-
-V2/ 4Vtt
b) As in Exercise 15, compute K(X*(3)).
cos (tts) .
5.6 OTHER APPROXIMATIONS
We begin with equation (5.2.20), which we rewrite as
(5.6.1) VP
~. v> n 1 3 o-(
MM]2
IT' J
where /::= A*(p). Then this shows that
(5.6.2)
--vp- —
2K(k)
+ 0(k4Vp).
Thus on approximating 2K/ir by an algebraic quantity we produce various
approximations for tr. The simplest [which also follows from (5.2.15/)] is
(5.6.3)
-=Vp-^ + 0(k*Vp)
We do better, however, by using Theorem 5.7(a) to write
2K(k)
TT
= 1 + | (kk'f + 0(k4)
so that
(5.6.4)
-=Vp- ^r [2 + (kk'f] + 0(k4Vp) ■
Ramanujan [14] uses different estimates of 2Klir. Motivated perhaps by
symmetry considerations, he uses (3.2.16) to expand [2K(k)/-n-]2 as
(1 -2k2)f1 + 0(k2) and then (3.2.17) and (3.2.18) to obtain
2K(k)
>\2
1 - (kk')
(l-2k2)[l+\{kk'f]
7757 + 0(k«).
192 Modular Equations and Algebraic Approximations to n
Then his approximations to tr are
(5-6>5) ^):=VF-(p)/[2(l-2*2)]
with error O^xTpe"^), and
3
(5.6.6) 7r2(p) := ^_ _ Kp)(4G24 _ 1)]/[(1 _2k2)(8G2* + 1}]
with error 0(\^pe~2'nV^). (See Exercise 1.) Moreover, (5.6.5) and (5.6.6)
produce very simple approximations. Thus
,e,* Mrs 3(3+ V5) , /OCN 63(17+15 V5)
(5.6.7) ffl(25) = -S—-i and .,(25) = ^^ •
The latter gives 11 digits of tr. Similarly,
84 j rt„ <„„ 145V37 + 1134
ffl(37)=2lV^7-101 ^ Wa(37) = 14722399V^7- 41916"
(5.6.8)
(The values of a and a are not quadratic surds.) For even p it is better to
use
(5.6.9) tt3(p) =
Vp ~ [<r(p)(4g" + 1)]/[(1 + k*)(8g" - 1)]
which is again an 0(Vpe_2,rV?) approximation. Thus we derive
(5.6.10) ,(22).^¾^.
v y 3V y 887 +1045 V2
An even more classical approximation to ir is obtained through taking
logarithms of GN or another invariant. Thus in the notation of the previous
section we may write that ir is approximately equal to
(5.6.110 ^log(8G^)
(5.6.11«) ^=log(8^2)
or
(5.6.120 ^,eg (£
5.6 Other Approximations 193
(5.6.12,7) 3s**{B
or
(5.6.13) ^108(1728/^).
We leave it to the reader to estimate the error in each expression. (See
Exercise 4.) For example, when N'■= 58, (5.6.11«) produces
/V29 + 5V
12 , /V29 + 5\
V58l0gl^72-j
which gives 10 digits of tr. Not surprisingly, Ramanujan [14] gives a host of
examples of this kind. When the invariant is large these give very good
algebraic approximations for e17. Following Shanks [82] one can take this
analysis considerably further. In (3.2.9) we gave ^-product formulae for
various invariants. Thus (3.1.4) and (3.2.9«) yield
nx24
(5.6.14) ^:=/i(V^V)-24 = (^) =qn(l + q")
and there is a similar expression for (kk'/4)2. We may expand this product as
a power series and compute as many terms as we wish of its reversion. This
will produce a series of the form
(5.6.15) q = x - 2Ax2 + 852*3 - 35744*4 + ■ ■ • .
We may also take logarithms in (5.6.14). Then we can write
log x + VNtt = 24 2 log (1 + q")= 24 2 ^- qk(l - q")"1
n=\ k=l &
(5.6.16)
which may be expanded as a power series in q. When we substitute (5.6.15)
into (5.6.16) we will recursively compute
CO
(5.6.17) log(\x\) + VNTT=:^ (-1)""1 — x"
for a fixed sequence {an}. Indeed ax '•= 1, a2 : = 47, a3 '. = 2488, a4 := 138799,
a5 : = 7976456, and a6 :=467232200. In fact one can show (Newman and
Shanks [84]) that 24a„ is the coefficient of q" in 11^ (1 + 02*"1)24".
Moreover, 24a„ <64". A now standard trick of replacing q by —q shows that
(5.6.17) still holds for x := -{kk'IA)2. In Shanks [82] several large invariants
are computed (including g3502 and G2737) to which (5.6.17) may be applied.
194 Modular Equations and Algebraic Approximations to n
Each series adds roughly 7rViV log10e digits a term. Hence when we use g469g
given in (4.7.14), we gain more than 90 digits a term! But of course we have
to compute the logarithm as well. Thus in our context the formula should be
viewed as a rapid series for log (|*|) + VNtt, not as a computation of tr.
Comments and Exercises
Approximations (5.6.5) and (5.6.6) were the reason why Ramanujan
computed Rp. He did not give (5.6.9). Many very large invariants, including
G14155, G19947, and G20i55 are derived in Shanks [82].
1. a) Establish that (5.6.5) and (5^6.6) have the claimed errors without
using the rather deep formulae (3.2.16) to (3.2.18).
b) Show, using (3.2.16) to (3.2.18), that
>rrx(p) - ir ~8ire_w3?(ir\/p - 3)
and
7r2(p) - 7T ~247re~2W?(107rVp - 31) .
2. a) Use (5.6.5) and (5.6.6.) to deduce that
.... 3(3Vl3 + 7) 103V13 + 125
^1 (13) = yj and ir2(13) = ^ •
b) Verify (5.6.7) and (5.6.8).
c) Verify that (5.6.9) has the claimed error.
d) Show that
180 + 52V3
^ ^ ~ 45V93 + 39V3T - 201V3 - 217 "
3. a) For even p, the estimate
-¾
Vp- <r(p) /2(1 + kl)
will often be cleaner than irx{p). It also gives 0(Vpe w?) error
in estimation of tr.
b) Obtain
c) Compute 7r3(58) and tt2(22).
5.6 Other Approximations
195
a) Estimate the error in each of (5.6.11), (5.6.12), and (5.6.13).
[Compare (2.5.15).]
b) Use (5.6.12) to estimate tr by
~= log (396) and -~ log (84V2)
a) Show that
and
g-24 = 64q + 1536<?2 + 19200,7 + • ■
64g = q - 24 + 276<? - 2048^2 +
with similar expressions for G24.
b) Thus
nViV
^(8n + 8n24) + 24 = e^ + 4372c-^ +
and
Vn
-Wn
64(G£ + G~Z4) - 24 = e"" + 4372*-™ +
c) When gN or GN is a quadratic surd, this gives an expression for the
integer part of e17 N (and the proximate 0's or 9's). Thus the
integer part of e17^ is 2,508,951. That of e17^ is 199,148,647, and
that of e"^ is 24,591,257,751.
a) Show as discovered by Beukers that, with {an} as in (5.6.17),
i)
ii)
TT
2K(k)A
2K(k)i
Vl - (2kk'y
1 + 24
= (1 +A:2)
2At)
1+24I(-Wip)
b) Combine ai) and Theorem 5.7(ai) to produce a recursion for {fl„}.
c) Show that an is an integer.
A variety of other approximations to tr and to p, the perimeter of an
ellipse, can be found in Chapter 18 of Ramanujan's second notebook
and in Ramanujan [14]. For example, given an ellipse of major axis a,
minor axis b, and eccentricity k:=(b/a)', he gives
i) p = 21^0, - hi; k1) = 7r(a + b)2Ft(- \, - b i; 0
where t := [(a - b) /(a + b)f. Then, as t-*0,
Modular Equations and Algebraic Approximations to n
ii) p ~ 7r[3(a + b)- V(3a + b)(3b + a)]
and
iii) p ~ <rr(a + b)[l + 3/(10 + V4-37)~1/2]
where the error in ii) is about 2~9t3 and the error in iii) is about
3 • 2-1V. This is pleasantly developed in Almqvist and Berndt [Pr]
and in Berndt [Pr]. Truncating the approximation in iii) leads to
iv) p ~ tt(o + b)[l + yf
-83
with an error about 2 t. This is due to Nyvoll [78].
a) Prove i).
b) Justify the error estimates in ii), iii), and iv).
One may avoid the Landen transform in a) by following Ivory
[1796]. We write
k2
l-y(l-COS20)
1/2
d0
12 dd
p = 4aE(k) = 2a J
= (a + b) Jo (1 + tine2iy\l + tV2e-2W)in
= (a + b) 2 (-lY"+"(~s)";(~2)"f<m+">/2
m,n-o mini
r e2i{m-")e dd .
Jo
a) Combine (5.1.2) and (5.2.14) to derive that
p=4aE(k)~2rra[M;\k,f)-Mn(k,f)en(k,f)] as k -*0
where Wn(k2, f) = 0.
b) Show that the error is roughly of order ak2n.
c) Deduce that to order ak"1,
o (a2 + b2\
and to order ak8,
o ( a + b \2
v~2ir\—- 7=1 n: = 4.
5.6 Other Approximations 197
d) Use the cubic identities to establish that
p ~ ira[3m~l(r) + m(r) - 2]
when k = X*(r). Thus, with order ak6,
k: = V2-l gives 27ra(5V6-6V3-7V2 +9)
and 2kk': = VI - 2 gives 7ra[3(2 + V3)V(2V5 - 2V3 - 1)
+ V(2V5 + 2V3 + l)-2].
9. In the notation of the AGM iteration let k'-= c0/a0 and let x2n '■ =
Cn+Van + l-
a) Establish that
^ 1 (l + il-x*\ , (2\ K'(k)
b) Let x : = {xJ2)A so that x = [\ (1 - VF) /(1 + VF)]4 and
K'(k) 1 , „ 26 2 368 3
"W)=~4logx~2x~Tx -Tx +----
c) Thus with k ■= A*(r) one has another estimate for ttVt with jc of
order <T4,n/~r. When r := 1, x = ^[(21/4 - 1)4/(21M + l)]4, and the
given terms yield 19 digits of ir.
From our point of view possibly the most remarkable result in Chapter 18
of Ramanujan's second notebook is the following continued fraction identity
given in entry 12. Let n>0 be fixed. Define
w flV- a P2 (2«)2 (3^)2 (4«)2
for a, /3 > 0. Then
(5.6.18) a.fefi.Vg)-*■(*"> ;*■<*■>
whenever )8 > a > 0 and
AG(j8,Vj82-«2) = 1
198 Modular Equations and Algebraic Approximations to n
[or, equivalently, whenever K(alfi) = (irl2){i\. Even more surprisingly, a
slight adjustment of the proof given in Berndt [Pr] shows that (5.6.18) holds
for all a, /3 > 0.
10. (The moments of K and E) Let
Kn-=\o knK(k)dk
and
£„:=/„ k"E(k)dk.
a) Use the differential equations for K and E to establish that
nK + E„
and
K"+2 n + 2
„ *C + 1
E =
n + 2
b) Show that
ir/2 n °° 1 rir/2
rir/2 n °° 1 f'2
u Jo sin0 „_0 2n +1 Jo
4"
«=o /2n\ ,
( J(2n + 1)2
c) Use contour integration of 0/sin 6 (on the infinite rectangle above
[0, 7r/2]) to deduce that
*° „4-0(2« + 1)2
which is twice Catalan's constant, j8(2) or G.
d) Establish that Kt = 1, El = §, and E0 = \ + /3(2).
e) Observe that all the odd moments are rational while each even
moment is of the form a + bj8(2) with a and b rational. Thus all
moments lie in <Q>(/3(2)).
f) Analogously define K'n and E'n. Determine recursions for these
conjugate moments. Show that K'0 = tt2IA and E'0= v2/8.
Compute K[ and E[.
5.6 Other Approximations 199
g) Use the quadratic transformations to show that
)o 1 + k dk 2K° 8 •
h) Show that
Chapter Six
The Complexity of
Algebraic Functions
Abstract. The aim of this chapter is to analyze the complexity of algebraic
functions in general; and of multiplication, division, and root extraction in
particular. There are two primary tools, Newton's method and the fast
Fourier transform.
6.1 COMPLEXITY CONCERNS
It is obviously inappropriate to consider the multiplication of two many-
thousand-digit numbers to be of equal difficulty to the multiplication of two
single-digit numbers. A reasonable measure of complexity that takes this
into account is the bit complexity. The bit complexity of an algorithm is the
number of single-digit operations required to terminate the algorithm.
Single-digit operations include addition, multiplication, logical comparison,
and storage and retrieval of single-digit numbers. We are exclusively
interested in how the complexity increases with the size of the problem. For
example, addition of two n-digit integers by the usual algorithm has bit
complexity 0B(n)—the subscript B on the order symbol is for emphasis.
This is a serial notion of complexity in the sense that on a serial machine it is
an appropriate asymptotic measure of the time required for the calculation.
We use the slightly nonstandard notation
an=n(bn)
if
an = 0(bn) and bn = 0(an).
200
6.1 Complexity Concerns 201
If an = 0,(bn), we say that {an} and {bn} are equivalent. Since accessing an
n-digit number requires O(n) bit operations, it is apparent that "usual"
addition of two n-digit integers is in fact O(n). These trivial lower bounds
are a consequence of uniqueness considerations—if we change any digit of
one of the numbers being added, we change the answer. Thus any algorithm
for addition must at least "inspect" every digit. So, up to a constant, usual
additon is asymptotically optimal. As we shall see later, one of the
interesting consequences of this body of theory is that "usual" multiplication is far
from asymptotically optimal.
A detailed approach to complexity requires a model of computation and
is perhaps most readily made rigorous in an analysis of Turing machines.
(See, for example, Aho, Hopcroft, and Ullman [74].) This much detail is
unnecessary for our purposes.
We will usually content ourselves with merely counting single-digit
additions and multiplications. In all the algorithms we present the comparisons
[note that the comparison of two n-digit numbers is ftB(n)] and the storage
concerns will be bounded by the arithmetic operations—provided the
algorithms are sensibly implemented. This is almost always transparent and
will rarely even elicit comment.
Operational complexity counts the number of operations (addition,
multiplication, division, and extraction of kth roots performed to a precision
bounded by the precision of the output). When all the operations in an
algorithm are performed to roughly the same precision, this is a useful
measure. The reasons for the particular choice of operations will be made
apparent in Section 6.4. Thus the algorithms of Chapter 5 compute n digits
of ir with operational complexity 0op(log n)\ once again the subscript on the
order symbol is for emphasis.
Comments and Exercises
One of the primary tools for the analysis of algorithms is the use of recursive
functions. The idea is to divide a problem into smaller subproblems that can
be solved by essentially the same technique and then recurse, a strategy
often called "divide and conquer." The reader unfamiliar with this approach
might like to examine the exercises. One of the lessons of complexity theory
is that many of the usual algorithms of mathematics are far from optimal.
Multiplication, taking Fourier transforms, and matrix multiplication are
but three examples. (See Exercise 3 and the next section.) A second lesson
is that good lower bounds are very difficult to obtain. For the analytic
algorithms we are considering, the only lower bounds we can establish are
the trivial ones. Thus unless the algorithm is of the same order, as is the case
for addition, we cannot achieve exact results.
We shall not discuss combinatorial complexity except to mention that an
introduction to this well-developed and important field may be found in
Aho, Hopcroft, and Ullman [74].
202 The Complexity of Algebraic Functions
1. Prove the following. Let a, b > 0 and c > 1. Suppose that/is monotone
on (0, °°).
a) If f(n) < af(n/c) + bn and /(1) = d, then
f(n) = 0(n) if a < c
f(n) = 0(n log n) if a = c
f(n) = 0(nlogO if a > c .
b) If f(n) < af(n/a) + bn(log n)c~l and /(1) = d, then
/(«) = 0(«(log«r).
Hint: Analyze a) with equality. Then establish the general principle
that the equality solution is the maximal solution.
2. Show that the usual algorithms for multiplying and adding two nx n
matrices have complexity Oop(n3) and Oop(n2), respectively. (We are
counting the number of multiplications and additions of the entries of
the matrices.)
3. (Fast matrix multiplication (Strassen 1969))
a) Show that
An
- -^21
"*M2
-^22-
"fill
- fi21
fil2
fi22-
"Cu
- V21
£-12
^-22-
can be computed from
Mi = (A12-A22)(B21 + B22)
M2 = (An + A22)(Bn + B22)
M3 = (An-A21)(Bn + B12)
M4 = (An + A12)B22
M5 = An(B12-B22)
M6 = A22(B21 - Bn)
M7 = (A2l + A22)Bn
and
Cn = M1+M2-M4 + M6
C12 = M4 + M5
. C21 = M6 + M7
C22 = M2- M3 + M5-M7.
b) Observe that the above method reduces the multiplication of
2m x 2m matrices to 7 multiplications and 18 additions of n x n
6.1 Complexity Concerns 203
matrices. Thus if W(n) is the operational complexity of multiplying
two n x n matrices, then by iterating the procedure in a)
W(2n)<7W(n) + an2 .
The final term comes from using usual matrix addition for the 18
additions. The constant a can be chosen independent of n and can
be used to include the "overhead" of actually breaking the problem
up. Use the above inequality to show that
W(n) = Oop(nl°^).
Note that log27 < 2.81, so the above method is asymptotically faster
than the usual method. [Extensions of this method can reduce the
bound for multiplication down at least to 0(n2'5~), much as in
Knuth [81]. The best lower bound known is the trivial one, en2.]
Suppose A is a nonsingular 2m x In triangular matrix. Write
A = (B °
A \C D
where B, C, and D are n x n matrices,
a) Show that B-1 and D~x exist, and that
B'1 0
-D~lCB~l D~l
A~l =
b) Show that a) iterates to produce an algorithm for inverting 2m x 2m
triangular matrices. Let I(n) be the operational complexity of this
algorithm. Show with W(n) as in Exercise 3 that
7(2«) < 2W(n) + 2I(n) + en2
and hence that
I(n) = 0(W(n)) = 0(nlog27) .
(One can show that in general matrix inversion and matrix
multiplication are asymptotically equivalent. See Aho, Hopcroft, and Ul-
lman [74].)
Show that the bit complexity of calculating n! by multiplying 1 x 2 x 3 x
• • • using usual multiplication is 0((n log nf).
Hint: Analyze the complexity of multiplying an M-digit number by an
m-digit number. Use Stirling's formula to estimate the number of digits
in k\. Exercise 10 of Section 6.4 explores this further.
204
The Complexity of Algebraic Functions
6.2 THE FAST FOURIER TRANSFORM (FFT)
Let w be a primitive (n + l)th root of unity either in C or in a finite field Fm,
that is, w"+1 = 1 and wk ¥= 1 for k < n + 1. In the complex case we may take
w:=;e2^'/("+1). Consider the following two problems.
interpolation problem. Given n + 1 numbers a0,. . . , an, find the
coefficients of the unique polynomial pn{z) '■= a0 + axz H h anzn of degree n
that satisfies
pn(wi) = ai 0<i*<n + l.
evaluation problem. Given the coefficients of a polynomial pn of degree
n, calculate the n + 1 values
pn(wl) 0<i'<n + l.
These are the two directions of the finite or discrete Fourier transform.
The classical approaches to either part of the Fourier transform problem
have operational complexity at least en2. This is the operational complexity,
for example, of evaluating pn at n + 1 points using Horner's rule [writing
pn(x) = (((anx + an-i)x + an_2)x + ■ ■ •)]• We wish to prove that, in fact,
both directions can be solved with complexity Oop(n log n). Actually, we
only treat the case n + 1 := c2m, which is sufficient for our purposes and
somewhat simpler.
Theorem 6.1 (Fast Fourier Transform)
If n + 1 = c2m with c an integral constant, then both the interpolation and
the evaluation problem have operational complexity Oop(nlogn).
Proof. We assume c = 1, that is, n + 1 = 2m, the case for general c is
entirely analogous. We treat evaluation first. Suppose
p(x):=a0 + --- + anxn .
Let
q(x2) '■= a0 + a2x2 + a4x + h an_1xn~1
and
xr(x2) ■= x(al + a3x2 H V anxn~x) .
Then with y'=x2,
6.2 The Fast Fourier Transform (FFT)
205
(6.2.1) P(x) = xr(y) + q(y)
where r and q are both polynomials of degree 2m-1 -1. The observation
that makes the proof work is that for w an (n + l)th root of unity,
(w')2 = (w(n+1)/2+;)2.
Hence, evaluating p(x) at the n + 1 roots of unity in (6.2.1) reduces to
evaluating r and q each at the (n + 1)/2 points (w2)1, (w2)2,..., (w2)(n+1)/2
and amalgamating the results. Observe that w2 is a primitive (2m-1)th root
of unity and we can iterate this process. Let F(2m) be the number of
additions and multiplications required to evaluate a polynomial of degree
2m - 1 at the 2m points wk, k = 1,. . . , 2m, where w is a primitive (2m)th root
of unity. As above,
(6.2.2) F(2m) = 2F(2m~1) + 2-2m F(1) = 0.
The final term comes from the single addition and multiplication required to
calculate eachp(w') from r{w2') and q(w2'). The recursion (6.2.2) solves as
(6.2.3) F(2m) = 2m+1 • m
and the bound for the evaluation problem follows.
The interpolation problem is equivalent to evaluation. This can be seen
as follows. Let w be a primitive (n + l)th root of unity and let
1 1
w w2 ...
(6.2.4) W:=\ 1 w2 w4
Then
.„n In
W w
1
-1 -2 -n
w w ... w
(6.2.5) ^1 = ^Tl( 1 vv-2 w-4 ... w-2"
w~n w-2n ... w-»2
and w-1 is also a primitive {n + l)th root of unity. (See Exercise 4.) The
interpolation problem can be written as: Find (a0,..., an) so that
W(a0,..., an) = (a0,..., an).
206
The Complexity of Algebraic Functions
However, this can be solved by
W~\a0,... ,an) = (a0,. ..,an)
which is exactly the evaluation problem. □
While restricting the FFT to powers of 2 poses no problem for us over C,
it is a nuisance over finite fields. The problem is that Fm has primitive kth
roots if and only if k divides m — 1. Hence, our approach is restricted in the
finite case to considering primes of the form m = c2k + 1, which are not
particularly abundant. There are many ways around this difficulty. This is
discussed in Winograd [80].
Comments and Exercises
The FFT is an enormously useful and widely used algorithm. Depending on
exact form and implementation, it can outperform traditional methods for
values of n well below 100. For a history of the FFT consult Cooley, Lewis,
and Welch [67]. While antecedents for FFT methods are plentiful, Cooley
and Tukey [65] are primarily responsible for introducing the FFT in its
modern form as a complexity-reduced method. More extended discussion of
the FFT and related matters may be found in Aho, Hopcroft, and Ullman
[74] and Winograd [80]. As a theoretical tool the FFT and related methods
are central. They form the basis for the next section's discussion of fast
multiplication. In the exercises we show how these ideas can be used to
construct asymptotically fast polynomial multiplication, division, and
interpolation algorithms. It is even possible to accelerate integer factoring
algorithms by FFT methods. In Chapter 10 an application to the estimation
of certain transcendental functions is provided. Our proof of Theorem 6.1
and some of the exercises follow Borodin and Munro [75].
1. a) (Fast polynomial multiplication) Consider the following algorithm
for multiplying polynomials. Given the coefficients of p and q (both
of degree <« - 1), compute the coefficients of pq as follows:
Step 1: Evaluate p and q at 2m points w1,. . . , w2", where w is a
primitive (2n)th root of unity.*
Step 2: Form the 2m products
p(w')q(w') i = 1,..., In.
Step 3: Solve the interpolation problem for pq to find the
coefficients.
Show, using an FFT, that the above algorithm has operational
complexity
Oop(n log n) .
* Strictly speaking, we should be using (2m)th roots of unity where we have established an
FFT. This can be arranged by padding with leading zero terms, if necessary, without changing
the order of complexity.
6.2 The Fast Fourier Transform (FFT)
207
[The usual convolution product algorithm has operational complexi-
ty 0(«2).]
b) Given
n
p(x)-=U(x-xi)
1=1
show that the coefficients of p can be calculated in Oop(n(log nf).
Hint: Treat the problem recursively and recombine the pieces using
a fast polynomial multiplication.
(Fast polynomial division) Given p of degree n and q 6f degree m < n,
both with integer coefficients, it is possible to find u and r with
deg r < deg q so that
p(x) = u(x)q(x) + r(x)
in Oop(n log n).
Outline: Simplify by observing that it suffices to calculate u since r may
then be computed by Exercise 1. Set x ■= 1/x. Then
p(l/x)= (1\ r(llx)
q(llx) \x) q(llx)
and so
^- = u(x) + xn-m+h^\ v(x):=x^vv(-)
q(x) q(x) v ' \x)
where /i > 1. To calculate « (and hence u) it suffices to calculate the first
n-m (= deg u) Taylor coefficients of II q. This can be done by
Newton's method as follows. Suppose that deg st = j- 1 and that
Establish that
-=^ - [2st(x) - s2t(x)q(x)] = -^ [1 - st(x)q(x)]2 = 0(x2J) .
[Note that we may assume q(0) ¥^ 0.] Now the computation of si+x '■ = 2s,. -
s2q can be performed using an FFT-based polynomial multiplication and
need only be performed using the first 2/ — 1 coefficients of q and 5,. By
starting with an appropriate first estimate of s0 [say, s0(x) '■= l/q(0)] and
proceeding inductively as above (doubling the number of coefficients
utilized at each stage), show that the required number of terms of the
expansion can be calculated in Oop(n log n). (See Section 6.4.)
208
The Complexity of Algebraic Functions
3. (Fast polynomial evaluation) Given p of degree n and n + 1 distinct
points xn,. .. ,xn, show that p(x0),..., p(xn) can all be evaluated in
Oop(«(log«)2).
Hint: Let qx(x) = n"„o * (x - xt) and let rx be the remainder on dividing
p by qv Note that ^1(^) = ^(^,) for i<n/2. Similarly use
q2(x):=Kni2(,x~xi)- Thus two divisions reduces the problem to two
problems of half the size. Use Exercise 2 and evaluate the recursion.
[The other direction of this problem, namely, constructing Lagrange
interpolating polynomials, is also Oop(n(log n) ). See, for example,
Borodin and Munro [75]. In fact, both directions are Oop(n log n).]
4. Show that if w is a primitive (n + l)th root of unity, then
y (, = {" + l / = 0mod(n + l)
,=0 I 0 otherwise.
Show that (6.2.4) and (6.2.5) are inverse to each other.
5. (Reversion of power series) Let f(x) '■= E^„0 akx be a formal power
series, with known coefficients.
a) Show, as in the proof of Exercise 2, that the first n coefficients of
the formal series expansion of 1//00 can be computed in
Oop(n log n).
b) Discuss the complexity of computing the coefficients of the formal
inverse of / by Newton's method.
6. (On calculating xn) The 5-and-X binary method for calculating x" is
the following rule. Suppose n has binary representation 8,,8^2---8^
with S0 = 1. Given symbols 5 and X, define
rsx if s, = i
''IS if 8t = 0
and construct the sequence
SXS2 ■ ■ ■ Sk.
Now let 5 be the operation of squaring and let X be the operation of
multiplying by x. Let the sequence of operations SXS2••• Sk operate
from left to right beginning with x. For example, for n = 27,
8,,8^8384 = 11011
and
5!525354 = (SX)(S)(SX)(SX) .
6.3 Fast Multiplication 209
The sequence of calculations for x21 is then
2 3 6 12 13 26 27
a) Prove that the above method computes x" and observe that it only
requires storing x, n, and one partial product.
b) Show that the number of multiplications is less than 2 [log2 n\.
c) Show that the above method is optimal for the computation of x2™
(considering only multiplications).
d) Show that the 5-and-X method is not optimal for computing x15.
An extended discussion of this interesting and old problem is presented
in Knuth [81].
6.3 FAST MULTIPLICATION
We wish to present a strategy for multiplying very large numbers that is
considerably faster than the usual 0B(n2) method. The idea is to exploit the
FFT. Let a and /3 be two n-digit integers and write
(6.3.1) a(x) := anxn~l + an_xxn~2 + ■ ■ ■ + ax
and
(6.3.2) (3(x) := by1 + bn_xxn~2 + --- + b1.
Then if at and bt are the decimal digits of a and /3, respectively, we have
(6.3.3) a = a(10) and /3 = /3(10).
We can now calculate a • /3 by using the FFT, as in Exercise 1 of the last
section, to compute the coefficients of
(6.3.4) y(x):=a(x)p(x).
Finally we evaluate y(10).
Let T{n) denote the bit complexity of multiplying two n-digit integers (or
equivalently, floating point numbers) by the above method.
The analysis of the complexity of the algorithm that follows assumes that
we are working with complex roots of unity (and that we are working with a
problem of size 2m, as can always be arranged by padding with leading
zeros). This introduces rounding error problems into an intrinsically integer
algorithm; however, this is the setting which we have established an
abundance of values for which the FFT works. Since in practice (and in
210
The Complexity of Algebraic Functions
theory) there are plenty of FFT analogues in a finite setting, one may, if one
prefers, consider the entire algorithm performed mod (O(n)).
Step 1:. Evaluate a(x) and fj(x) to precision 0(log n) at the In points
w1, w ,. .. , w2n with w a primitive (2n)th root of unity. Using the FFT, this
has operational complexity
Oop(n log n)
and bit complexity
(6.3.5) 0B{n{\ogn)T{\ogn)).
[It suffices to use w to precision 0(log n) and thus every multiplication in
Theorem 6.1 is of complexity 0B(r(log «)). Observe that the coefficients of
a and /3 are single-digit numbers and that the coefficients of y are hence
0(log n)-digit numbers. See Exercise 3.]
Step 2: Form the 2n products
y(w') = a(w')/3(w')
computed to 0(log ri) digit precision. This is of bit complexity
(6.3.6) 0B(nT(logn)).
Step 3: Interpolate the coefficients of y to precision 0(log n) using the
FFT. This has operational complexity
Oop(n log n)
and bit complexity, as before,
(6.3.7) 0B{n(\ogn)T(\ogn)).
Step 4: Evaluate y(10). This has bit complexity
(6.3.8) 0B{n).
This step essentially requires only addition. The coefficients of y are closely
related to the digits of a/3 except that they may be too large and a "carry"
must be performed.
The total complexity thus satisfies
6.3 Fast Multiplication
211
(6.3.9) T(n) = 0B(n(log n) T{\og n))
or
(6.3.10) T(n) = 0B(n(log «)2(log(log n)f ■■■)
with the product terminating when the iterated log is less than 1.
This is not optimal. (See Exercise 2.) The same analysis as above using a
base n representation of a and /3 reduces the time for multiplication to
Ob(«(log «)(log log «)•••) .
This is still not quite the asymptotically fastest known algorithm. The best
bound is due to Schonhage and Strassen [71] and is
0B(w(log«)(loglog«)).
We shall call any multiplication that performs with this speed a Schonhage-
Strassen multiplication. It should be emphasized that in all the reduced
complexity multiplications we present, the order estimates include the
overhead additions (and storage concerns).
Comments and Exercises
The observation that multiplication is not intrinsically 0B(n2) was made by
Karatsuba in 1962. He proposed an Ofl(nloS23) algorithm. (See Exercise 1.)
Subsequent refinements and improvements are due to Toom, Cook,
Schonhage, Strassen, and others, culminating in the Schonhage-Strassen
multiplication of 1971. This algorithm is of the same flavour as the one
presented above using size 2^ representations and performing FFT
operations mod(22V" +1). A presentation may be found in Aho, Hopcroft, and
Ullman [74]. Knuth [81] has an extended discussion of multiplication
strategies which includes a discussion of the precision concerns of
performing a fast multiplication over C. Cook and Aanderaa [69] conjecture that
multiplication is not 0B(n). Here one must be careful about the model of
computation allowed. Under some more powerful than usual models 0B(n)
multiplication is possible (see Knuth [81]), while under other more
restrictive than usual models it can be shown to be not possible.
Once again it is possible to implement a fast multiplication that will
outperform traditional methods for n in the several hundred-digit range. For
the many million-digit calculations of it discussed in Chapter 11, use of a fast
multiplication is imperative. For a discussion of the multiplication used in
this setting see Tamura and Kanada [Pr] and Bailey [88].
212 The Complexity of Algebraic Functions
1. (An 0B(n'og23) multiplication) Observe that
(a + 610" )(c + rflO") = ac + [(a - b)(d - c) + ac + bd]10n + bdl02n.
Use this to reduce multiplication of 2/j-digit numbers to three
multiplications of n-digit numbers and some additions. Show that this can be
used to produce a multiplication of
0B(n10^) .
[This method can be refined to produce an 0B(n1+s) algorithm; details
are in Knuth [81].]
2. Construct a multiplication of complexity
0BO(log n)(log log «)•••)•
Hint: Instead of using base 10 representations, use base n
representations so that the polynomials (6.3.1) and (6.3.2) are polynomials of
degree n with coefficients of length log n. Now proceed as in the
algorithm of this section.
3. Discuss the bit complexity of the FFT. Show that if the input is given to
precision 0(m) and the output is required to precision 0(m), the bit
complexity is
0B((n log n)M(m))
where M(m) is the complexity of whatever multiplication is employed.
6.4 NEWTON'S METHOD AND THE COMPLEXITY OF
ALGEBRAIC FUNCTIONS
We wish to show the equivalence, from a complexity point of view, of
multiplication, division, and root extraction. The primary tool is Newton's
method.
Theorem 6.2
Suppose that/is analytic in a complex neighbourhood of z. Suppose/(z) = 0
and /(z) ¥= 0. Then the iteratipn
(6-4.1) xn+1 -=^-^),
converges uniformly quadratically to z for initial values x0 in some
neighbourhood of z.
6.4 Newton's Method
213
The reader unfamiliar with the proof is directed to Exercise 1. Newton's
method is useful for calculating inverse functions. Observe that g~1(y) is a
zero of f(x) = g(x) - y. This is the content of the corollary. The uniform
nature of the convergence is discussed in the exercises.
Corollary 6.1
Suppose that/is analytic and one to one in a neighbourhood of z0. Then
there is a neighbourhood of /(z0) where /-1 can be computed uniformly
quadratically by Newton's method.
The quadratic nature of the convergence is only half the story. The other
half is the "self-correcting" nature of Newton's method. Suppose that
/(z) = 0 and that we are computing z by Newton's method. If the nth iterate
xn is perturbed by an amount 0{\xn - z|), then provided we stay in the
domain of uniform quadratic convergence, computing xn+1 from the
perturbed value of xn will preserve quadratic convergence. In other words, if
xn - z agree through M digits, then the calculation of xn+1 need only be
performed to precision 2M.
Let M(n) denote the bit complexity of multiplication of two n-digit
numbers by some method. We make the following regularity assumptions:
2M(n) ^ M(2ri) < AM{n) and M(n) is nondecreasing .
(6.4.2)
Since two multiplications of length n can be viewed as subproducts of a
single multiplication of length 2n, while four multiplications of length n
comprise one of length 2n, the first part of the assumption is reasonable.
Since multiplications of length n can be padded with leading zeros to
multiplications of length n + k, the second part is also reasonable. Of course
it is easy to imagine a perversely designed multiplication for which (6.4.2)
does not hold.
Let D(ti) and R(n), respectively, denote the bit complexity of division
and extraction of square roots, where the input and output are to precision
n. We say that two operations are equivalent if the complexity of one is
bounded by the complexity of the other and conversely. For example, we
say multiplication and division are equivalent if, given a multiplication with
bit complexity M(n), we can construct a division with bit complexity
D(ri) = 0(M(n)); and conversely, given a division we can so construct a
multiplication. The following remarkable theorem, the first part of which is
due to Cook, may be found in Brent [76c].
Theorem 6.3
Multiplication, division, and root extraction are all equivalent.
214
The Complexity of Algebraic Functions
Proof.
(a) We first construct a division. Applying Newton's method to the
function f(x) '• = 1 Ix — y leads to the iteration
(6.4.3) xk+1: = 2xk-x\y
which employs only multiplication and addition. Note that
l ( V2
(6-4.4) ^+1-- = -3^-
and the quadratic nature of the convergence is manifest. We assume
|_y| < 1 and that y lies in a neighbourhood V bounded away from zero
(which if we are working in floating point, is no restriction). We may
also assume that, by using a usual Os(n ) division performed to a
fixed low precision, we have already computed x0-=x0(y), so that
\x0 — l/v| <1/10. Assume n is a power of 2. Then log2 n
iterations of (6.4.3) will by (6.4.4) produce an error bounded by
_ 1
X0 y
HT"
and hence provide n digits of IIy. Furthermore, by the self-correcting
nature of Newton's method, the fcth step of (6.4.3) requires two
multiplications and two additions of precision only 2k. Thus the total
complexity of the iteration is given by
log2n
(6.4.5) E [2M(2*) + 2-2*]<8M(n)
k~l
since 2M(2*)<M(2*+1). We have shown that lly can be calculated
with complexity 0B{M{n)). Hence since alb = a ■ (lib),
D{n) = 0B{M(tij) .
(b) The equivalence of division and multiplication is now obvious since
u a
ab = TFb-
(c) Square roots can be extracted by Newton's method applied to x — y,
which yields the classical iteration
(6.4.6) **+*:=!(** + i~
6.4 Newton's Method 215
This satisfies
(6.4.7) xk+i-Vy = ^(xk~Vyf
and the quadratic convergence is again apparent. We can proceed
exactly as in (a) to show that
R(n)=0B(D{n)) = 0B(M{n)).
(d) The proof that
M(n)=0B(R(n))
is Exercise 7. □
It is apparent that any time a function may be quadratically computed by
Newton's method from an iteration involving only addition, multiplication,
and division, that function will be of complexity 0B(M(n)). This applies to
any algebraic function over Q(x), that is, any function / satisfying an
equation
(6.4.8) 4>(x,/(x)) = 0
where $ is a polynomial in two variables with rational coefficients. More
precisely:
Theorem 6.4
If/is algebraic over Q(x), then the complexity of calculating n digits of f(x)
is 0B(M(n)).
The preceding results, of course, assume that we are avoiding the branch
points of the function in question.
Comments and Exercises
Newton's method has a host of refinements and variants. See, for example,
Householder [70] and Exercise 3. The iteration for square roots can in some
form be traced back to the Babylonians, who used one or two steps of the
method.
It should be observed that not only is D(n) — 0B(M(n)), but the constant
concealed by the order sign is fairly small. [From (6.4.5) we see that a
constant 8 works. Indeed, for all known multiplications, the additions term
is negligible and a constant of 4 is appropriate.] This is also the case for root
216
The Complexity of Algebraic Functions
extraction. See Brent [76c], where a number of constants for these and
various other equivalences, as in Exercise 7, are established.
Further discussion of the calculation of algebraic functions may be found
in Kung and Traub [78]. Related matters may also be pursued in Lipson
[81].
1. a) Prove Theorem 6.1 by observing that
fan) ="fa) + (*» - *)fo) + 0(Xn ~ zf .
Substitute this into (6.4.1) to get
'fan) ~fo)
* = (*»- z)
fan)
+ 0(xn - zf .
Use explicit estimates to prove uniformity.
b) Show that for real / and real xn, the tangent to / at xn intersects the
x axis at xn+1.
c) Suppose that / is convex and strictly increasing on [a, b] and that
f(x) = 0 for some x in (a, b). Show that if b > x0 > x, then {xn}
decreases to x and convergence is guaranteed.
2. Construct the Newton iteration for y1/p by inverting xp. Write xn+1 —
yllp in terms of (xn — y1/p)2, thus exhibiting explicitly the quadratic
convergence. For which real starting values does the method converge?
3. a) Consider the iteration
(l//)(n)
yk
where the notation indicates that the derivatives are evaluated at
yk. For sufficiently well-behaved (for example, analytic) /, this
method will find a zero of / with (n + 2)th-order convergence
provided various derivatives are nonvanishing. Prove these
assertions. For n'-=0 this is just Newton's method, for n'-=l it is
Halley's method. (See Householder [70].)
b) Let
xk+i :=xk(l + (1- yxk) + (1- yxkf) .
Show that
1\ ..2( 1X3
, :y \xk~
yl ' \ y
and that for x0 sufficiently close to 17y, xk converges cubically to
1/y.
6.4 Newton's Method 217
c) Let
xk+1-=lxk(15-10x2ky + 3y2xk)-
Show that
yx2k+1 -l = h (9/4 - 33y*2 + 64)(y*2 -1)3
and that for x0 sufficiently close to 1/y/y, xk converges cubically to
l/Vy.
d) Show that b) and c) require fewer multiplications than (6.4.3) or
(6.4.6) for computation of reciprocals and square roots. These
are, in practice, very good high precision algorithms.
Let x0 '■ = 1 and let
Kk + l ' *-Xk XfrX
be the iteration (6.4.3) for computing 1/x. Show that xk+l is the
(2 +1 - l)th Taylor polynomial of f(x) '■= 1/x expanded around the
point 1. Thus for division we may think of Newton's method as a
means of accelerating the computation of the Taylor series.
Let x0 : = 1 and let
_ 1 ( x_
xk+i — 2{xk + Xk
be the iteration (6.4.6) for computing Vx. Show that xk+l is a rational
function r(x) with numerator of degree 2 and denominator of degree
2—1 that satisfies
|V3e -r{x)\ = 0{x -1)2
[This implies that r(x) is the (2k, 2k - 1) Pade approximant to Vx at 1.
See Section 10.1.]
6. Invert 1/x — y to calculate Vy without using any divisions.
7. (Other equivalences; Brent [76c])
a) Show that squaring is equivalent to multiplication by considering
(a + bf - (a - bf.
b) Complete the proof of Theorem 6.3 by showing that square root
extraction is equivalent to multiplication.
Hint:
S2
(1 + 2Sx)112 = l + 8x- — x2 + 0((8xf) .
Use 8:=10 m for appropriate m to reduce the computation of x
to root extraction and 0B(n) operations.
218
The Complexity of Algebraic Functions
c) Show that inversion is equivalent to multiplication.
d) Show that pth-root extraction is equivalent to multiplication.
8. a) Suppose that Cf(ri) is the bit complexity of calculating n digits of/.
Assume that Cf(n) is increasing and that Cf(2n)^2Cf(n).
Suppose that / satisfies the conditions of Corollary 6.1. Show that/has
at worst the same bit complexity as /, and that
Cri(n)=0B(Cf(n) + M(n)).
b) Write down Newton's method for computing exp from log and log
from exp. These particularly simple iterations combine with a) to
show that the problems of calculating exp and log are effectively
equivalent, from a bit complexity viewpoint.
9. (Fast base conversion; Schonhage) Let k and / be fixed integers. Show
that an n-"digit" base k number can be converted into base / with bit
complexity
0B(lognM(n)).
Hint: Break the number to be converted in half (base k), convert each
half, and recombine. (See Knuth [81] for a lengthy discussion of this
problem.)
10. a) Show that n\ can be calculated with bit complexity
0B(log n M(n log n)) .
Hint: First calculate 1 x 2, 3 x 4, Then calculate 1x2x3x4,
5 x 6 x 7 x 8, .. . etc.
b) Compare this to calculating n! as in Exercise 3 of Section 6.1.
Show that no multiplication can reduce this method below 0(n2).
[The best known bound for n\ is 0B((log log n)M(n log «)); see
P. B. Borwein [85].]
Chapter Seven
Algorithms for the
Elementary Functions
Abstract. We analyze algorithms for the transcendental elementary functions
based on the transformation theory for elliptic integrals and in particular on
the AGM.
7.1 w AND LOG
All the elementary transcendental functions can be calculated with bit
complexity 0B(lognM(n)). This is a consequence of the fact that log has
operational complexity Oop(logn), and hence has bit complexity
0B(logn M(n)). The approach to log rests most easily on the logarithmic
asymptotic of K at 1. Before proceeding with this analysis it is convenient to
record the complexity of the algorithms for ir, based either on iterating the
modular equation, Wp, or more specially on the AGM.
Theorem 7.1
The initial n digits of tt can be calculated with operational complexity
Oop(log«)
and with bit complexity
Ofl (log «Af(n)).
Proof. Both Algorithms 2.1 and 2.2 as well as most of those of Chapter
5 perform with the above complexity. □
219
220
Algorithms for the Elementary Functions
In all of the above algorithms some of the calculation may be done to
reduced precision. For example, in Algorithm 2.1 the computation of xn and
y„, which both tend to 1 quadratically, can be performed at successively
lower precision. The saving, however, is only in the constant term, reducing
it by a factor of less than 2.
We record the following estimates:
Theorem 7.2
(7.1.1)
(7.1.2)
(7.1.3)
(7.1.4)
*'(*)-log I £
E'{k) -1| =
*'(*)-log
= 0(\k2 log k\)
0(\k2 log k\)
4l ;10|fc2logyt|
\E'(k)-l\<lO\k2logk\
re(Jfc)>0,Jfc-»0
re(fc)>0, Jfc-»0
ike (0,10-3)
yte (0,10-3).
Proof. The relationships for K are in Exercise 4 of Section 1.3. (See
also Exercise 1 of Section 2.3.) For (7.1.2) observe that by (1.3.2), for
0<yt<l
E'(k) -1
1 Vl - (1 - *V - Vl-f2 dt
Vv^T2
k2t2 dt
< k2K'(k).
,0 y[l-(l-k2)t2](l-t2)
The constant in (7.1.4) requires a little additional scrutiny. □
The following approach to calculating log is essentially due to Salamin (in
Beeler et al. [72]).
Algorithm 7.1
For x e(s,l) andn>3,
(7.1.5)
|log x - K'(10-) + K'(1(T"*)| < -~^x
where
£'(10"") =
IT
2AG(1,10-")
and
7.1 it and Log
221
K'(10 nx) 2AG(1; 10-Bje)
are computed from the AGM iteration.
This algorithm has operational complexity 0op(log n) and bit complexity
0B(M(n) log n).
Proof. The estimate (7.1.5) is immediate from Theorem 7.2. The
computation of the two elliptic integrals requires precomputing ir, which has
complexity Oop(logn). The final detail is that AG(1,10~nx) and
AG(1,10~") can be calculated to precision 2m using 0(log n) iterations of
the AGM iteration. (See Exercise 1.) □
A related algorithm that avoids precomputing ir can be established from
Algorithm 1.2 (which provides a direct calculation of K'IE').
Algorithm 7.2
Let R'(k) := K'(k)/E'(k). For* £(1,1) andn>3,
(7.1.6) |log x - R'(10~n) + R'(10~nx)\ < ^rsy
where
(7-1.7) R'(k)= *-n-12
1-E„.02 c„
and c„ is associated with the AGM process commencing with a0: = 1 and
b0 := k. [See equation (1.1.3).]
This algorithm has operational complexity 0op(log ri) and bit complexity
0B(M(n) log n).
The proof is straightforward and is left as an exercise. Both of the above
algorithms are based on an underlying quadratic method for calculating K'
and E'. Algorithms based on pth-order methods can be constructed as in
Chapter 5. A quartic version, for example, can be derived from Exercise 3
of Section 1.4. Instead of calculating R' from the AGM, we use
(7.1.8) R'(k) =
l-E:=04>^[(a2„ + /32„)/2]2]
where
«M:-H^ and ^-J"^±^'"
222
Algorithms for the Elementary Functions
and
a0:=l and po:=kin.
In this, as in other quartic versions of quadratic algorithms, there is a
substantial computational saving (as much as 35%).
Comments and Exercises
The algorithms of this section for log all suffer from the drawback that they
are not truly iterative. Increasing the precision requires choosing new
starting values. This is more of an aesthetic than a computational problem;
even with iterative methods one only computes to a fixed precision, and
increasing the accuracy usually entails starting at least one of the calculations
all over again. While two different AGM or related processes must be
calculated for the initial value of log, subsequent values require computing
only a single AGM since one of the terms can be reused. These methods are
quite stable, requiring only 0(loglog n)-guard digits. They will outcompete
traditional methods—depending enormously on implementation—in the
several-hundred-digit range.
The algorithms of this section are the asymptotically fastest known
algorithms for log (see Section 10.2) and are faster than any known
algorithms based on other methods, although 0B((log n)2M(n)) can be
achieved by techniques of Chapter 10. These types of algorithms were first
examined by Salamin (Beeler et al. [72]) and independently by Brent
[76a, b, and c]. Newman [82] gives a self-contained account, as do Borwein
and Borwein [84a]. The second algorithm is in Borwein and Borwein [84d].
Finally, while we have only presented the algorithms for real k, they
extend naturally into the complex plane; only the error estimates become
slightly more complicated. (See Exercise 1.) Matrix versions due to Stickel
[85] are discussed in Exercise 6.
1. Show that Algorithm 7.1 can be used to calculate log uniformly for
{z£C| |z - l| < j} with operational complexity 0op(log n) and bit
complexity 0B(logn M(n)).
2. Examine the convergence of the AGM for a0:=l and b0-=10~n.
Specifically, estimate the number of iterations required to produce an
answer within 10"" of the limit. For Algorithms 7.1 and 7.2, find a
reasonable bound on the number of iterations of the AGM required to
produce a 1000-digit precision algorithm for log x, xE.( \, 1).
3. (An asymptotic algorithm for ir) Show that, for n >3,
1 1 1
AG(1,10~") AG(1,10~" + 10~2n) J
< «102~2n ,
log (1 + 10-)--
and hence
7.1 it and Log
223
10" 10"
AG(1,10~") AG(1,10~" + 10-Z")J
< «102~
This provides an 0op(log n) algorithm for -rr. (See Newman [82] or
Borwein and Borwein [84a].)
4. Given thepth-order modular equation 0,p (as in Section 4.5), show how
to construct asymptotic algorithms for log analogous to Algorithms 7.1
and 7.2, but with an underlying pth-order iteration.
5. Show how the series expansion for K' and E' of Section 1.3 can be
combined with Algorithms 7.1 and 7.2 to provide 0op(log n) algorithms
for log that provide n digits of log using starting values lQ~nlkx and
10-n/k
6. (The matrix AGM) Let &N denote the N X-N self-adjoint positive
definite matrices and let I denote the N X N identity matrix. Let
Pn,B0:=I,
A0
(7.
(7.
:=A6
1.9/)
1.9«)
a) Show that if Ae.&N, then there exists a unique C e SPN so that
C2 = A. Hint: The iteration C„+1 '•= C„ + \(A - C2n), C0 : = 0
converges to C.
b) Suppose that X0 £ 3PN commutes with AE.SPN. Show that, for X0
sufficiently close to VA, Newton's method Xn+1 ■= \(Xn + AX~l)
converges to VA £ 3PN quadratically.
c) Show that the matrix AGM (7.1.9) converges to a matrix
AG(A, I) £ &N and show that An - Bn converges quadratically to
zero.
d) Show that
J AG(A, By1 = [(x2I+ A2)(x2I+ B2)]~112 dx .
Hint: Imitate the second proof of Theorem 1.1 and use the fact that
A„ and B„ commute.
n n
e) Let
•tt/2
r it
K(Ay.=\o
dd
yi-A2sin2d
and let K'(A) = K(\/l- A2). Suppose that AE. @N and also that
0<a<m||00<6. Show for large n that (7.1.5) holds, namely,
there exists c so that
(7.1.10) \\logA-K'(10-nI) + K'(10-nA)\\ ' C"
102"
224
Algorithms for the Elementary Functions
f) Show that this provides an 0 (log n) algorithm for the matrix
2\
logarithm of A £ &N and, by inversion, an 0op((log n) ) iteration
for the matrix exponential.
For further details, and extensions beyond the positive definite case see
Stickel [85], where computational experience is also indicated.
7.2 THETA FUNCTION ALGORITHMS FOR LOG
We start with the fundamental identity of Chapter 2
and the series expansion
We recall that
(7.2.3) ^ = -¾^
«;(<?)
e:.*c
-» q
The algorithm for log (IIq) is now:
Algorithm 7.3
Fix ^ £ (a, 6) with 0 < a < b < 1.
Step i
Step 2
Step 3
Step 4
Calculate K(k)/ir from (7.2.2).
Calculate k from (7.2.3).
Calculate K(k') using the AGM commencing with 1 and k.
Calculate log (IIq) = Tr[K(k')/K(k)].
This algorithm has operational complexity Oap(Vn) and bit complexity
0B(VnM(n)).
The algorithm's complexity is determined by the series expansions
employed in steps 1 and 2. These "sparse" series yield n-digit accuracy after
Vn nonzero terms. While the asymptotic complexity is far from optimal, the
algorithm has the advantage of not requiring very small starting values for
the AGM iteration (Step 3). Also, only a single AGM iteration is required.
Sasaki and Kanada [82], who proposed and analyzed the above algorithm,
show that it out performs the methods of Section 7.1 for numbers in the
7.2 Theta Function Algorithms for Log
225
3000-digit range. Note that it must be precomputed. As Sasaki and Kanada
observe, the algorithm may be accelerated by using
(7.2.4) mlQg (1) = log (-L)
for various m. This speeds up the series calculation at the expense of the
AGM. For n-digit precision, using m = n effectively reduces this algorithm
to Algorithm 7.1. (See Exercise 1.).
For certain choices of q we get reduction in complexity. If q is any small
integer, then the series expansions are particularly easy to evaluate. (See
Exercise 2.) This leads to a very fast algorithm for ir/log 10 (using base 10
arithmetic) as follows. From (7.2.1), (7.2.2), and (7.2.3) we have
(7.2.5) log (I) = „£!(*)
ql K" AG(dt(q),dt(q))
,2
IT . „/ I V „Z\ I V („ + l/2);
or
Now for q := 1/104 both of the above series are just sequences of 0's and l's
and the starting values for the mean iteration can be calculated very quickly
[0B(M(n))]. The remainder of the work involves calculating a single AGM.
Similarly ir/log p is amenable to very fast computation in base p.
Properly interpreted, (7.2.6) remains valid for matrices and provides a
theta-based computation of the matrix logarithm of a positive definite
matrix. (See Exercise 3.)
Comments and Exercises
Further discussion of material in this section may be found in Sasaki and
Kanada [82] and in Borwein and Borwein [84rf]. Sasaki and Kanada
compare Algorithm 7.3 to algorithms for log based on Taylor series for
log (1 + x) and log [(1 + x) /(1 — x)] and conclude that for more than
(roughly) 100 decimal digits the Taylor series methods are slower.
1. Show that (7.2.4) for various m can be combined with Algorithm 7.3 to
provide algorithms for log of any complexity between Oop(Vn) and
Oop(log n).
2. Discuss the bit complexity of calculating d2(q), 03(#), and d4(q), where
q is the reciprocal of a fixed integer. Show, forp integral, that (7.2.6)
can be used to calculate ir/log p with bit complexity OB(log n M(n)).
(See Exercise 9 of Section 6.4.)
226
Algorithms for the Elementary Functions
3. Consider the matrix AGM of Exercise 6 of the previous section.
Establish a matrix version of (7.2.1) to (7.2.6). Then construct an
algorithm for the matrix log based on (7.2.6).
7.3 THE COMPLEXITY OF ELEMENTARY AND ELLIPTIC
FUNCTIONS
The algorithms of Section 7.1 can be inverted by Newton's method to
provide algorithms for exp with bit complexity Ofl(log n M{n)) and
operational complexity 0op((log nf). From a bit complexity point of view, this is
the best known bound. Since we can invert log in {|z-l|^3}, we can
produce an algorithm for exp in a complex neighbourhood of zero and
hence have 0B(log n M(n)) algorithms for all the trigonometric functions.
Exercises 1 and 2 give some variations. In fact, for any elementary function
we have the following:
Theorem 7.3
Any elementary function / over Q(x) can be calculated uniformly (in
bounded regions where / is single valued and analytic) with bit complexity
Ofl(log n M(n))
and with operational complexity
Oop((log«)')
where s is a constant depending only on /.
Proof. For our purposes the elementary functions are the rational
functions, log and exp, and any function that can be formed from these
functions by a finite number of compositions, multiplications, additions, and
solutions of algebraic equations. (See Davenport [81] or Ritt [48].) The
point of the proof is that such an/is constructed from Q(x) by taking a finite
number of exponentials, logarithms, and solutions of algebraic equations in
these quantities. The number of algebraic equations to be solved determines
the constant s. As in Chapter 6, solution of the algebraic equation in
question can be effected in a time proportional to the complexity of
evaluating the equation. □
A number of comments are in order. First, we can formulate the above
theorem for / algebraic over U(x) if we assume that the requisite real
numbers are given. Second, while multiple solutions of an algebraic
equation pose no theoretical problem, in practice, determining the "correct root"
7.3 Elementary and Elliptic Functions
227
can be a major nuisance. For a multiple-valued function the theorem must
be interpreted as guaranteeing some value of the function. This is inevitable.
It is not even clear what it should mean to compute an infinite-valued
function.
That the operational complexity behaves like (log n)s instead of (log n)
reflects in part that operational complexity is an inappropriate measure
when Newton's method is involved. While the preceding algorithms for log
require most of the operations to be done to full precision, this is no longer
the case for this approach to other transcendental elementary functions.
Since we can calculate elliptic integrals with bit complexity
0B(log n M(n)) (Exercise 5 of Section 1.4 and Exercise 2 of this section), we
can calculate the Jacobian elliptic functions with similar dispatch.
It is not clear which other nonelementary transcendental functions have
bit complexity 0B(lognM(n)). Does, for example, the gamma function?
Nor is it clear whether the bit complexity 0B(log n M(n)) is best possible for
the nonalgebraic elementary functions. The best known lower bound for log
and exp is the virtually trivial bound of 0B(M(n)). (See Exercise 3.)
Comments and Exercises
The algorithms for exp (see also Exercises 1 and 2) require inversion and are
much less satisfactory than the algorithms for log. A direct 0B((log n)2M(n))
algorithm is presented in Chapter 10. There are a number of issues that
remain unresolved in this discussion. The most obvious is a discussion of
lower bounds. Observe that, by Theorem 6.4, showing that exp does not
have complexity 0B(M(n)) would show that exp is a transcendental
function. Likewise, showing that ir.does not have bit complexity 0B(M(n))
would imply the transcendence of it. The gap between the known
operational complexities for log [Oop(log n)] and exp [Oop((log n)2)] is probably
specious, but this also is not known. We will show in Section 8.8 that direct
algorithms for exp and log of the type that we derived for K in Chapter 1
cannot exist. There are no quadratically convergent fixed iterations for these
functions.
1. From (2.3.7) and Exercise 3 of Section 2.5,
lim(^r = exp[flim(%Y
where an and c„ are generated from the AGM commencing with a0 '■ = 1
and b0 '■ = k', while a* is generated from the AGM commencing with
ao:=l and b%'-=k. Show that this leads to an 0B(lognM(n))
algorithm for ex, which begins by solving for anla* = irx/2.
2. (A more direct approach to tan) As in Exercise 5 of Section 1.4 and
Theorem 2.6 of Section 2.6, we have the Landen transform for
228
n&
Algorithms for the Elementary Functions
* de
,k):=l
0 Vl - k2 sin2 d
ike[0,1), ^0.
if
l-K
kn+i:= Y^n? and tan^n+i ~ ^=k'ntan ^"
then, as before,
(7.3.1) F(<pn+1, kn+1) = (1 + k'n)F(<Pn, kn)
and
2
(7.3.2)
^>o> ^o) :
n
l + k'
lim^.
n-»co 2
a) Show that
F(<j)0, k) = <j>0 + 0{k2) as A;-* 0 .
b) Show that
F(0O, A:) = log tan (^+ ^) + 0(1-A:) as k->l
c) Show that if w„ : = tan </>n, then
(! + *>„
*>„_,., =
K"+1 1-^vv2
d) Thus
<£„ = F(<£„, A:„) + 0(k2n)
"frn + kL
or
m=0
tan x w„ =
logtan(^ + |>) + O(A:2„ + (l-A:0))
n
2
it tan w,
log tan ( - + 2
+ 0(^ + (!-*„))
^1 /1 + ^:
m = 0
logS + O(A:2 + (l-A:0))
7.3 Elementary and Elliptic Functions
229
where S •"= yw\ + 1 + w0. Show, by inverting c), that tan-1 can be
calculated from log with complexity 0B(log n M(n)). Show that, by
inverting the above, tan can be calculated from log with complexity
0B(lognM(n)).
This approach, which avoids using complex arithmetic to access
the trigonometric functions, is essentially due to Brent [76a].
3. Let L(n) denote the bit complexity for evaluating n digits of log. Show
that
D(n)=0B(L(n))
and hence, log and exp are at least as complex as multiplication.
Hint: Consider computing the derivative of log.
Chapter Eight
General Means and Iterations
Abstract. In Section 8.1 we define abstract means and discuss their behavior.
In Section 8.2 we discuss equivalent means. In the next three sections we
consider general mean iterations and examine their convergence properties.
Later sections concern Taylor expansions of means, multidimensional means,
and related questions. The final section considers algebraic mean iterations
and the possibility of extracting elementary limits from such iterations.
8.1 ABSTRACT MEANS
There is a large literature on means but little agreement as to what
exactly constitutes a mean. For our purposes we have
Definition 8.1
(a) A mean is a continuous real-valued function M of two strictly positive
real variables a and b such that
(8.1.1) aAb<M(a, b)<avb
for all a>0 and b>0. We denote a a b : = min(a, b) and
a.v b : = max (a, b). (Continuity is not always essential. On occasion
we will refer to possibly discontinuous functions satisfying the above
definition as discontinuous means.)
(b) A mean is strict if, in addition, it is diagonal:
(8.1.2) M(a,b) = a or M(a,b) = b
if and only if a = b.
230
8.1 Abstract Means 231
(c) A mean is homogeneous if
(8.1.3) M(Xa, Xb) = XM(a, b)
for a, b, A>0.
(d) A mean is symmetric if
(8.1.4) M(a, b) = M(b, a)
for a, b>0.
(e) A mean is (strictly) isotone if, for a, b > 0,
(8.1.5) M(a,) and M{-,b) are (strictly) increasing.
We will find it convenient to consider the trace tM of a mean M given by
*m(X): =Af(jc, 1). We gather up some useful properties of means whose
proofs are left as Exercise 1.
Proposition 8.1
(a) Every diagonal continuous (strictly), isotone mapping is a (strict) mean.
(b) Suppose that M is symmetric and homogeneous. Then M is isotone if
and only if its trace tM is isotone.
(c) The isotone, (symmetric), (strict), (homogeneous) means form a
convex set.
(d) The symmetric, (homogeneous) means form a uniformly closed convex
set.
(e) Let M and JV be (strict) means. Then any continuous mapping P such
that
M(a, b) > P(a, b) ^ N(a, b) a,b>0
is a (strict) mean.
Corresponding to each mean M we associate another mean Mp, defined
by
(8.1.6) Mp(a, b):=[M(ap, bp)]llp p^O.
Then M = (M_1)_1 and M_x is strict, symmetric, homogeneous, or
isotone whenever M is, and tM_ (x) = t^(\lx). This is a special way of
building an equivalent mean as we now discuss.
For any strictly monotone (increasing or decreasing) continuous function
/: IR+ -> IR+ we define the mean
Mf(a, b) := f~\M(f(a), /(&))).
232
General Means and Iterations
When f(x) '■ = xp (we write / := lp) we denote Mf by Mp [consistently with
(8.1.6)]. It is easy to check that Mf is a strict, symmetric, or isotone mean
whenever M is. (See Exercise 2.) To give our discussion some flesh, we
introduce four of the most useful classes of means. In this chapter we
reserve the letters M and N for general means.
The Holder Means
ForpeUx let
/a" + bp\Vp
(8.1.7) Hp(a,b):=[——) a,b>0.
Then Hx is just the arithmetic mean A, and thus Hp is a strict,
homogeneous, symmetric, isotone mean. Moreover, limp_0 Hp(a, b) = Vab is the
geometric mean G and may reasonably be denoted by H0. (See Exercise
14.) Since H_x < H0 ^ Hx, Proposition 8.1(e) shows H0 to be a strict mean.
For allp one can unambiguously define Hpfora,b^0. Then the trace of H
satisfies
(8.1.8) VR ^{[0,2-^) p<0-
Note also that (Hp)_1 = H_p and that Hp = Ap.
Another useful way of building means is based on the next proposition.
Proposition 8.2
Let M be an isotone, homogeneous mean. Then, for p £ IR,
(8.1.9) pM(a,b): =
M(ap,bp) _ Mpp{a,b)
M(ap~\ b"'1) MppZ\(a, b)
defines another homogeneous mean, pM, which is strict or symmetric
whenever M is strictly isotone or symmetric.
Proof. This is left as Exercise 4a). □
The Lehmer Means
For p £ IR we let
(8.1.10) V»*):=flP-x + frP-i-
Since Lp = p{Hx), each Lp is a symmetric, homogeneous, strict mean. Since
tL(x) = (xp + l)/(xp~1 + 1), Lp is isotone only for 0<p<l. Moreover,
Li = Hx, Lll2 = H0, L0 = H_x, and (by Exercise 5 of Section 8.6) these are
8.1 Abstract Means 233
the only means common to the Lehmer and Holder classes. Again there is
no difficulty extending Lp to a, b s 0 and
II p>l foo p>0
\ p = l and tLp(oo)= J 2 p = 0
0 p<\ " U p<0
where, here and below, we write /(0°) for lim^^ f(x). To see this, use
(8.1.12) (Lp)_x(a, b) = "^**f = (1_rt(Lx)(a, 6) = L(1_p)(a, 6).
Indeed, generally
(8.1.13) (pM)_x = (1_p)M.
Tfee Gim Means
Let /■ and 5 be given. Consider f=iq where q'= s — r. Then
(8.1.14) r%l(f-Ma),Kb)) =
ar + br
= -G,Aa,b)
defines the Gini mean Gsr(a> b). (See Gini [38].) Moreover (8.1.14) shows
that Gs r is indeed a strict, homogeneous mean.
Proposition 8.3
Let / be a continuous strictly monotone function of a nonnegative variable.
Then
(8.1.15) MSf(a, 6):=/-1
b — a
a¥=b
extends to a symmetric, (generally nonhomogeneous), strict, continuous
mean.
Proof. The integral mean value theorem gives the conclusion, bar the
continuity. This follows from the continuity of the definite integral. □
We immediately apply this to Stolarsky's power means (Stolarsky
[75, 80]).
Stolarsky's Means
Let p £ IR and denote Mjlf-i by Sp. Then Sp is a homogeneous, symmetric,
strict mean given by
234 General Means and Iterations
(8.1.16/) Sp(a,b) =
with
ap-bp
p(a - b) J
i/(P-i)
P^O.l
(8.1.16b) S0(a, b) = lim Sp(a, b) = ^1^. =■ *fi, b)
and
(8.1.16uz) Sx(a, b) = lim S(a, b) = e~\aab~b)ll(a~b) =■ $(a, b) .
p—*l *
The mean iE is the logarithmic mean and 3 is the identric mean. These
means have
p-i'(p-D p>0,p*l
3 p=sO
4PW _ 1 (p-l)xp-pxp~1 + l
p
*s (°) = \ e P = 1 and ts (oo) = oo.
lo
Also
'*,(*) (p-i) (*p-i)(*-i)
so that Sp(a, b) is increasing in (a, 6) for all p in U. This, in part, follows
from the inequality (1 -p)xp + pxp~l si for 0<p ^ 1.
All of the means in these classes are piecewise monotone in the sense that
iM{x) has only finitely many sign changes. Thus there is no difficulty in
defining M(0, b) - bM(0,1) = b limxl0 tM{x) in all these homogeneous
cases, and we can freely consider M defined on IR+ := {(x, y)\x >0, y>0}
and at oo. We will do so from now on. Note also that when M is continuous,
'm([°> °°]) = [°> °°] is equivalent to tM(0) = 0 and tM(*>) = oo.
An elementary but very useful proposition is next. The proof is left for
Exercise 11.
Proposition 8.4 {Composition)
If M is defined by
M(a,b):=M0(M1(a,b),M2(a,b))
where Mt, i = 0,1,2, are means, then M is a mean. If two of M0, Mx, M2 are
strict, so is M. If all three are homogeneous, symmetric, or isotone, then so
is M.
8.1 Abstract Means
235
For example, M v N and M a N are strict means whenever M and N are.
So is any mean between them, in the sense of Proposition 8.1 (e).
There are many highly pathological means, as Exercise 12 shows. This is
particularly so in the absence of continuity. For future reference we will say
that a homogeneous mean is ultimately monotone if M(x, 1) and M(l, x) are
monotone in some neighbourhood of zero and °o. We write M < N if
M(a, b) < N(a, b) for all a, b > 0.
Comments and Exercises
There is a great literature on particular means and very little on means in
general. Much classical information can be found in Hardy, Littlewood, and
Polya [59] and in the other references scattered throughout the chapter.
1. Prove Proposition 8.1.
2. Establish that for any mean M and any strictly monotone /: IR+ —> U+,
Mf{a, b) := /_1M(/(a), f(b)) defines a mean which is strict,
symmetric, or isotone whenever M is. Moreover, if/is ip, p £ IRX, then Mf is
homogeneous whenever M is.
3. a) Show that Hp(a, b) is a continuous increasing function of p with
liny^ Hp = v and limp^_0O Hp = a .
b) Establish the assertions about the Holder means.
c) If M is homogeneous and symmetric, then M0 = G, whenever M0
exists as a limit.
4. a) Prove Proposition 8.2.
b) Establish the assertions about the Lehmer means.
5. (Isotonicity of Mp) Let 4>(a)-'=aloga for a>0, and let M be a
differentiable mean.
a) Show that Mp(a, b) is isotone in p > 0 if and only if
(8.1.17) ¢(,) ^¾^ + ¢(6) ™^> ^ <D(M(a, b)) .
b) Suppose that M is also homogeneous. Then
dM(a,b) , dM(a,b) ... ,.
a—^ + b^r^^M(a,b).
c) Use a), b), and the convexity of $ to show that Hp is increasing for
p rnU.
d) If M is homogeneous and symmetric and if Mp is increasing for
p >0, then Mp is increasing for p in 03, whenever M0 exists.
236 General Means and Iterations
6. Suppose Mp is isotone in p. Then
i) pM>Mp pal
ii) pM<Mp p<l
iii) pM<Mp_x p<0.
7. a) Suppose M is symmetric and homogeneous with tM(0) > 0 or with
'm(°°)<00> then
lim Mn(a, b) = a a b and lim M(a, b) = av b.
b) In particular, this holds for Hp.
c) If, in addition, Mp is isotone in p, then
lim „M(a, b) = a a b and lim „M(a, 6) = a v 6 .
d) In particular, this holds for Lp.
8. a) Show that pAf is isotone in p if g(p): = log M(ap, bp) is always
convex, since then g(p + 1)- g(p) increases withp. (See Becken-
bach [50].)
b) Use .Cauchy's inequality to show that this holds for
pA(a, b) = Lp(a, b) = flP-x + &P-x •
9. a) Establish the assertions about Stolarsky's means,
b) Show that
^Vfl + VPj <<£(fl)6)<#i/2(a6)
c) Show that
log*
1
1 + X
and deduce that
2~* . ,2-*
Jfc-1 Z
10. a) Show that i£p{a, b) is isotone in p by applying Exercise 5a) and
observing that (8.1.17) becomes $>(a, b)^iE{a, b). [This in turn
follows by calculus from (t - 1)2 2= dog t, t > 1.]
8.1 Abstract Means 237
b) Show that 3>p{a, b) is isotone in p This can be done by showing
that (8.1.17) becomes
<B(a) log (a)-¢(6) log (&) | 1:
a - b
which again reduces to (t- 1) s/log t.
c) The generalized mean Er is defined by
(8.1.18) Er>s(a, b) :
¢^)-^6)12
s(ar - br)
r(as - bs) J
V(r-s)
and extended appropriately on the boundary (as in Leach and
Scholander [78]). Show that each Ers - (5 ) for some p and q.
Let A(r) : = log \{d - l)lr\ so that A(r) = log 3>T(a, 1)- BY b) A(r)
is convex. Thus for 5 > /-,
log £^,1)=^:^^108^,1)
and £^ r increases in r and 5 because Err = $r.
d) The mean
„ , ,x „ / ,x a + 6 + Vofc 2 , 1
He(a, 6) := E3l2>ll2(a, b) = ^ = -A+-G
is very classical and is called Heronian mean.
e) Show that if -1 < p < \ or p > 2, then
(8.1.19) 5p(a,6)^i/(p+1)/3(fl,6)
with the inequality reversed whenp < — 1 or ^ ^p ^2. Moreover
(8.1.19) fails if (p + 1)/3 is replaced by any smaller number
(larger in the reversed case). (See Stolarsky [80] for details.)
f) Show that SB and 3>p tend to v as p tends to infinity.
g) Use the condition of Exercise 8a) to show that pSB is isotone in p.
h) Show that Ep p_y= pSB> SBp forp>l and so Ers> SBr for r> s>
0.
i) Show that
i) (£r„)-i = £-
") Er,s = Es,
iii) Er-' = Er,:'E'-'.
Prove Proposition 8.4 on the composition of means.
Let q be a nonnegative function satisfying
i) ¢(1) = 1
ii) 1 a x < q(x) < x v 1 x^l.
~r,~s
238 General Means and Iterations
a) Then
M(x,y): = xq(£)vyq(Z)
M{x,y):=xqy-j/\yq(^
and
are (possibly discontinuous), homogeneous, symmetric, strict
means,
b) Suppose in addition that
Hi) q(x) = xq[-J.
Then
M(l, x) = q(x) = **(*) .
Hence if we take an arbitrary (even analytic) function satisfying iii)
with l<q(x)<x for x>l, there is a strict, homogeneous mean
with tM = q.
c) Let q(x) '■ =
1 + 3x
x > 1, x rational
4
1 + x
otherwise
Then M is a densely discontinuous mean which is not ultimately
monotone. Moreover M lies between two continuous means.
(n \ 1/n n
2ckakbn-k) c,>0, 2ck = l.
1/n n
I c,>o, 2
Then P„ is a homogeneous strict mean which is symmetric if and only if
14. If M is a continuously differentiable mean then M0 : = limp^0 Mp exists
and is given by
M0(a, b) = aqb
«i,i-«
where q:= (dM/da)(l,l).
Hint: Use L'Hopital's rule.
8.2 Equivalence of Means 239
8.2 EQUIVALENCE OF MEANS
Let ¢: IR+ —> IR+ be continuous. We say that a mean M dominates a mean N
if
M(<D(a), ¢(6)) = <$>(N(a, b)) a,b>0
and we write M>^ N or M> N. If M and N dominate each other, we call
M and N equivalent. Since domination is transitive, this is an equivalence
relation which we write ~. Unfortunately most of our results demand that
we consider more restrictive notions of equivalence, so we require that $ be
one to one from now on. Hence 4> is monotone.
Theorem 8.1
Suppose that M and N are two means with M>$N.
(a) Suppose that N is homogeneous. For each t>0, consider
gt{x) ■= <b{t$?~x{x)). Then gt is isotone and
(8.2.1) M(a, b) = g;xM(gt(a), gt(b))
for each a, b in rng (¢).
(b) Suppose that M:=A. Then
¢ = (2(/ + /3 a>0,p^0
and N=Hp, p ^0.
(c) Suppose that M■= G. Then either
or
<& = aefl'P a>0, p^O
and N=Hp, pBU.
(d) It follows that the only homogeneous means equivalent to A are the
Holder p means (Hp, p t^O) and the only homogeneous mean
equivalent to G is G.
Proof
(a) By homogeneity of N we have, for t > 0,
<&_1Af(«&(ta), ¢(/6)) = rfTxAf(<&(«), ¢(6))
240 General Means and Iterations
and (8.2.1) follows. Moreover gt is isotone as a composition of two
co-monotone functions.
(b) Now (8.2.1), with M-= A, becomes
(8.2.2) 2g/(^p) = g,(a) + g,(b) a, b e rng (¢) .
This is solved [Exercise la)] by
(8.2.3) gt(x) = a(t)x + 6(0 xerng(4>)
for some a(i) and b(t) in IR. Thus
4>(fcc) = a(t)$(x) + b(t) x,t>0.
Now Exercise lb) shows that this is solved by
Since 4> is positive, we must have ¢ = atp.+ /3, a >0, and N is as
claimed.
(c) In this case we have
(8.2.5) Vft(«)ft(&) = ft(VS) fl,6erng(4)).
Let ht: = log ( gt ° exp). Then
h(a + b\ht(a) + ht(b)
As above ^,(^) = a(t)x + b(t) and
(8.2.6) <D(fcc) = 2?(0<&(jc)fl(,)
for B(t) positive. This is solved by
(8.2.7) ¢= aP p*0
1 ae
[as in Exercise lc)]. If 4> is of the first type, N= G; while if $ is of the
second type, N = Hp, p¥=0.
(d) Thus all Hp, p ^0, are equivalent and G dominates them all. D
A more general program may be undertaken, based on Theorem 8.1(a).
Unfortunately, without extra hypotheses, solutions of (8.2.1) are very hard
to characterize. The following simple result is accessible.
8.2 Equivalence of Means 241
Proposition 8.5
Suppose M is a homogeneous, strict mean with M>hN for some isotone h,
and suppose ^(0) = 0. Then tM(0) = 0.
Proof. Since h(N(a, 1)) = M(h(a), h{\)), we have A(0) = h(N(0,1)) =
M(h(0), A(l)). This is only possible if A(0) = 0, ft(0) = A(l), or h(0) = °o.
Since & is strictly increasing, only h(0) = 0 can occur. Thus h(0) = 0 =
M(0, /i(l)). Since M is homogeneous, fM(0) = 0 as claimed. □
As an example we see that Hp>h AG, p > 0 (AG is the Gaussian AGM)
is impossible for h isotone, as is Lp >h AG withp > 1. These considerations
suggest that we can say more if h is required to be onto. We will write
M~ N if M>hN for some one-to-one mapping h of IR+ onto IR+ [so that h
must be (strictly) isotone with h(0) = 0, /i(°o) = oo or (strictly) antitone with
h(0) = oo, /i(oo) = 0]. This is an equivalence relation stronger than ~, and we
call it strong equivalence. We will only consider strong equivalence of
homogeneous means. If we let gt '.= h(th~ ), gt is surjective and we observe
that (8.2.1) becomes
gt(M(a, b)) = M(g,(a), g,(b)) a,b>0.
Since M is homogeneous, we may replace gt by ht := gtlgt{\). Then ht{\) =
1 and
(8.2.8) h,(M(a, b)) = M(ht(a), h,(b)) a,b>0.
In the next lemma we give several conditions for (8.2.8) to have only the
trivial solutions (normalized). Under these conditions we have gt = c(t)u and
(8.2.9) h{tx) = c{t)h{x) t,x>0.
Then Exercise lb) shows h = aip, a >0, p ¥0, and it follows that the only
homogeneous means strongly equivalent to M are Mp, p^0. Thus we have
determined the strong equivalence class of M in the cases covered by the
next result.
Lemma 8.1
Suppose M is a homogeneous strict mean.
(a) The following two conditions imply that (8.2.8) only has the trivial
solution when ht{\) = 1.
(i) M is ultimately monotone and
(ii) ht{a) = a for some a¥^\, a >0.
242
General Means and Iterations
(b) (i) (ai) holds if M is isotone (or piecewise monotone), while
" (ii) (aii) holds if M is differentiable and (1) tM has a unique positive
zero, a ^ 1 and h is differentiable; or (2) tM(U+) £ IR+.
Proof.
(a) Given that a and 1 are distinct positive fixed points of ht, it follows
from (8.2.8) and continuity that {c>0\hl(c) = c} is a closed interval
C. We show that s '■ = sup C is °o. If not we argue as follows.
For any c<s in C, M(c,s)<s. Hence. M(c, s + e) .< s for some
e > 0. But M(s, s + e)> s. Thus M(c, s '+ e) = s for .some £.< 5, c in C,
e>0. Then for n = 1,2, 3,..., •
s = h"(s) = M(hnt(c), h"(s + e)) = M(c,hnt(s + e)) .
(Here h" : = hn'1°h denotes iterated composition.)
Let bn■:= c_1/i"(5+ e). Then, as bx ¥=ba (because s+ e0C), bnis
a strictly monotone sequence, since ht is strictly isotone. If {bn} were
bounded above, the limit point would be a member of C larger than s.
Thus bn increases without bound and also tM{bn) = sic. This violates
monotonicity of M at infinity. The proof that inf C = 0 is left as
Exercise 2b).
(b) We consider (ii). If h and M are differentiable, we have from (8.2.8)
'm(^W)^00 = ht(tM(x))iM(x) .
Since ht is strictly increasing, we have iM(ht(a)) = 0 and ht(a) = a. If
on the other hand tM is not surjective, we argue as follows. Suppose
M(0,1)>0. Then
ht(M(0,1)) = Af(A,(0), A,(l)) = M(0,1)
as ^,(0) = 0, and M(0,1) < 1 is a second fixed point of ht. Finally if
M(°o, 1)<°°, we argue with M_x. □
Condition (ai) holds for Hp, Lp, and Sp. Condition (2) of (bii) holds for
Hp, p^O; for Lp, p <0 or p > 1; and for 5p, p >0; while condition (1)
holds for Lp, p > 1. (See Exercise 3.) It is reasonable, at least for
computational purposes, to require that equivalence be defined by differentiable
maps. If this is done, the results are more complete.
Comments and Exercises
Theorem 8.1 can be found in Wimp [84] in slightly different form. This is
partly explained by the fact that exp[(logx + log y)/2] = G(x, y). For
Hardy, Littlewood, and Polya [59] or Wimp [84] this means that A >log G.
8.3 Compound Means
243
In a more formal development this is problematical since means are only
defined on (0$+ ) and log is not always positive. Thus we rule out the second
case in (8.2.4).
1. a) Show, using continuity of gn that (8.2.2) is solved as in (8,2.3).
(Proofs without continuity assumptions can.be found in Wimp [84]
and elsewhere. More general results depend on the measurability of
Sr)
b) Let 4/(x) : = 4>(;c) - ¢(1). Then <// satisfies .
■;*""■ ij/(tx) = a(t)ij/(x) + c(t)
for some c(t) in R. Then (f/(t) = c(t) and thus [a(x) -1] lip(x) =
[tp(tx) - tp(t) - tp(x)] /tp(t)4/(x) is independent of x if t/>(*) ^ 0- Thus
ij/(xy) = aj/(x)il/(y) + 4>(x) + ij/(y). If c = 0, then t^(jt) = Clogx,
while if c^O, A(x) : = c(fi(x) + 1 solves k(xy) = X(x)k(y) whose
solution is tp. Thus (8.2.4) holds.
c) Show that (8.2.6) has solutions only of form (8.2.7). Hint: Consider
A(jc) := log [<&(*)]-log [¢(1)]. Then A(jtf) = a(t)X(x) + k(t). Now
proceed much as in b).
2. a) By considering M_1 show that if M >hN, M strict and
homogeneous, h isotone with fM(°°) < °°, then ^(00) < °°.
b) Complete the proof of Lemma 8.1(a).
c) Complete the proof of Lemma 8.1(6).
3. a) Verify the final claims of this section.
b) Calculate the strong equivalence classes within Grs and Ers.
4. a) If M is a symmetric, homogeneous, strict mean, then either tM(0) =
0 or fM(oo) = oo.
b) Similarly, suppose then that M>hN for some homogeneous N with
h monotone. Show that 0 or oo Hes in mg(h).
8.3 COMPOUND MEANS
We now formalize the notion of a mean iteration. The Gaussian AGM and
Archimedes' method provide the central examples. Let M and ./V be any two
continuous means. Let a > 0 and b >0 be given and consider the iteration
a0'-= a b0'-= b
(8.3.1) an+1:=M(an,bn)
bn+1-=N(an,bn).
Under mild hypotheses, Theorem 8.2 shows that the iterates converge to
a common limit, which we call the compound of M and ./V and denote by
244
General Means and Iterations
M(x)N(a, b). We will also denote the entire iterative process by [M, N]. We
say that M is comparable to N if one of the following holds:
(a) M(a,b)^N(a,b) for a, b>0
(b) M(a,b)<N(a,b) for a, b>0
or
(c) M(a,b)<N(a,b) for 0<a<b
and
N(a,b)<M(a,b) for 0<b<a.
When M and N are symmetric, c) cannot occur, and we say M and N are
comparable. In the nonsymmetric case c) can occur, and then M is
comparable to N but not conversely. In the next result comparability is only needed
to establish monotonicity of the iterates. (See Exercise 1 and Theorem 8.8
of Section 8.7.)
Theorem 8.2
Let M and N be means with M comparable to N.
(a) Suppose that M or N is strict. Then [M, N] converges and M (x) N is a
mean which is strict if both M and N are.
(b) M (x) N is a homogeneous, symmetric, or isotone if each of M and N is.
(c) M®N is continuous, and the convergence is monotone and uniform
on compact subsets of {(a, b)\a, b >0}.
Proof
(a) Suppose that a > b and that M>N. Then ax = M(a, b) > N(a, b) = bv
Inductively suppose that an^bn. Then
(8.3.2) an > M(an, bn) = an+1 > bn+1 = N(an, bn) > bn
and {an} decreases while {bn} increases. Since each bounds the other,
both sequences converge, say, to x and y, respectively. By continuity
we have x = M(x, y) and y = N(x, y). Since M or N is strict, x = y. If
a ^ b, we may have to exchange the roles of an and bn. Thus M®N
exists and satisfies
(8.3.3) M aN<M®N<Mv N.
This finishes (a).
8.3 Compound Means
245
(b) The sequences {an} and {bn} are in fact built by repeated composition
of means. Thus an = Mn(a, b) and bn = Nn(a, b) for means Mn and Nn.
These means are symmetric, homogeneous, or isotone when M and TV
are, and so is the limit mean M®N.
(c) Let a, b > 0 and e > 0 be given. Pick n so that \an - bn\< e/2. Now Mn
and Nn are continuous. So we may find 8 > 0 with \Mn(a, b) -
Mn(a',b')\<e/2 and \Nn(a, b) - Nn(a', b')\ < e/2 if \a'-a\<8 and
\b' - b\ < 8, (a', b'>0). Again assume a > b and M>N. Then
M®N(a', b') < M„(a', 6') < M>, 6) + | < #>, 6) + e
<M®N(a,b) + e
and
M®AT(fl', b')>Nn(a',b')>Nn{a, b)~ | >M„(a, 6)- e
>M®N(a,b)- e .
Thus M (x) A^ is continuous. Finally, Dini's theorem shows that Mn
and Nn must actually converge uniformly on compact subsets, since
convergence is monotone. □
The key observation about M(x) N is the following 'invariance principle'
which we have already used repeatedly in Chapters 1 and 2 to show that
AG(1, k') = irllK.
Theorem 8.3 {Invariance Principle)
Suppose that M®N exists. Then M® N is the unique, (continuous) mean
4> satisfying
(8.3.4) 4>(M(a, b), N(a, b)) = <D(a, b)
for all a, b > 0.
Proof. Iteration of (8.3.4) shows that
Thus
<D(a, b) = <D(M ® N(a, b), M ® JV(a, 6))
and since 4>(c, c) = c, ¢ = M®N. □
246 General Means and Iterations
Observe that we need not verify that 4> is a mean, but only that
4>(;c, x) = x for x > 0 and that 4> is a continuous solution of (8.3.4).
EXAMPLE 8.1
(a) Let M:=HX and N:=H_X. Then HX®H_X = G. Observe that
G(a, b)'-= Vab satisfies
la + b lab , ,
G(Hx(a, b), H_.(a, b)) = yj— —^ = Vol <D(a, b) : = Vdb.
Since Vxx = x, we must have Hx (x) H_x = G.
(b) Let M(a, b) : = 9ab2/(a + 2bf and N(a, b): = (a + 2b) /3. Then
Mv3(a, b)N2l\a, b) = aV3b2'3 <D(a, b) : = aV3b2'3
and again the invariance principle shows that M ®N(a, b) = all3b213.
(c) Let tfx(x)tf0 = AG:=,4(x)G. Then for a > 6,
a ^ ^, . x .,-,/., M air ,, ,. air
A®G(a,b) = aAG{l,-J = ^m ¢(,,6) :=^^^
as Theorem 1.1 (second proof) shows by the invariance principle.
We now distinguish two better structured classes of mean iterations.
Definition 8.2
Let M and N be symmetric means.
(a) Suppose M and N are comparable. Then we write
M®gN:=M®N and [M, N]g:=[M, N] .
We call these Gaussian mean iterations and call (x) the Gaussian
product.
(b) Consider the iteration: a0 : = a > 0, b0 := b > 0, and
an+x:=M(an,bn)
bn+1-=N(an+1,bn).
We denote the iteration by [M, N]a and the limit by M (x)a N. We call
these Archimedean mean iterations and call (x)a the Archimedean
product.
The existence of M®gN is guaranteed by Theorem 8.2. For M®aN
existence comes from the next result.
8.3 Compound Means
247
Proposition 8.6
Let M and N be symmetric means. Suppose that M is strict. Then M(x)aN
exists and
M®aN=M®N*
where N*(a, b) := N(M(a, b), b). So by Theorem 8.3 M®a N is the unique
continuous mapping t/> satisfying
<KM(a,b),N*(a,b))=4,(a,b)
with 4>(x, x) = x for x > 0.
In consequence M (x)a N is a continuous mean which is strict,
homogeneous, or isotone whenever both M and N are.
Proof. This follows from Theorem 8.3 since M is comparable to N*
(See Exercise 4.) □
The theorem makes no use of symmetry of M and N. However, this is
critical to our further analysis of convergence rates. Finally, suppose that we
are given a function M which satisfies (8.1.1) and (8.1.2) or (8.1.3), but only
for 0 < a < b. We may extend M to a symmetric mean M via
(8.3.5) M(a,b):=M(aAb,avb).
Similarly we can extend a function defined only on 0 < b < a by using
M(a v b, a a b). The new mean is homogeneous or strict when M is. Thus
in some of our future iterations we will consider means only defined on the
45° sectors (Exercise 3 among others). Moreover, if we have two comparable
means on 0 < b < a, the extensions are comparable so that our convergence
results apply. Note also that in the definition of comparability we excluded
the possibility that M<./Vfor0<6<a while N ^ M for 0 < a < b. In this
case the iterates oscillate, and combining two steps results in a comparable
iteration. We call such iterates partially comparable.
Comments and Exercises
Our treatment is a synthesis and extension of that in Schoenberg [77,82],
Lehmer [71], and Foster and Phillips [846] among others. The term
compound is due to Lehmer [71]. In general it is very hard to determine
M (x) N, but easy to verify a limit once one has found it. Schoenberg [77]
gives a geometric proof of the limit of the AGM due to Jacobi.
1. a) Let M and JV be strict means. Then [M, JV] converges and M (x) N is
a strict (continuous) mean.
248 General Means and Iterations
Hint: {a„ v bn} and {a„ a 6J are monotone sequences, and so
an v bn decreases to x and an a bn increases to y. One may suppose
x = (M v N)(x, y). Thus x = y by Proposition 8.4 and M®N
exists. Since MaN^M®N-^MvN, the limit is a strict mean,
b) M ® JV is symmetric or homogeneous when M and JV are.
2. a) Let M be a strict mean. Then
M® v = v®M = v .
b) Show that
(M®N)f = Mf®Nf
where as before Mf(a, b) '•— /_1M(/(a), f(b)).
c) In particular a) and b) show that
M® A = A®M= A .
d) If M and JV are symmetric, then
M®N = N®M .
[More generally, see Exercise 6.] Thus
M®gN = N®gM.
3. (Carlson's log) Define means M(a, b): = \ja • (a + b) /2 and
N(a, b): = y/[(a + b)ll\-b = M(b, a). Show that
a — b
Hence [using Exercise 2b)],
■M1I2®N1I2 = 2.
icitly, if
. an + y/anbn
the limit is
and
b — a
bn + l
K + VaJn
log 6 - log a
Note that this is neither a Gaussian nor an Archimedean iteration.
8.4 Convergence Rates and Some Examples
249
4. Prove Proposition 8.6 from Theorem 8.3. Note that in this setting
M < N* for 0 < a < b and M > N* for 0 < b < a.
5. a) Show that given strict symmetric means M and N,
M®a N(b, N(a, b)) = N®a M(a, b).
b) Show that in the special case in which M = N the limit
M ®a M =: 4> is characterized by
<D(a, b) = 0(b, M(a, b)).
c) Show [using b) or otherwise] that
i) A®aA(a,b) = ^~
ii) G®aG(a,b) = aV3b2n
(ap +2bp\Vp
hi) Hp®aHp(a,b) = { 3 j p*0.
6. For any function Q let Q(a, b) '■= Q(b, a). Prove that for any means M
and JV,
M®N=N®M
and
M®aN(b, N(b, a)) = N®aM(a, b).
8.4 CONVERGENCE RATES AND SOME EXAMPLES
In this section we show that Gaussian iterations typically converge quadrati-
cally and Archimedean iterations sublinearly. We also give some more
examples of Gaussian and Archimedean iterations for which we can calculate
the limit.
EXAMPLE 8.2 (ARCHIMEDES' METHOD)
(a) Let an denote the area of a regular m ■ 2"-gon inscribed in a unit circle.
Let bn denote the area of the circumscribed regular m ■ 2"-gon. It is
easily verified that an+1 = ^Janbn and bn+1 = 2a„+16„/(a„+1 + bn) while
a0 = \m sin (2ir/m) and b0 = m tan (rr/m). Thus we geometrically
verify that an and bn tend to it. This gives G®aH_1(a0, b0) = it or, on
using homogeneity and replacing 2ir/m by 0,
250 General Means and Iterations
(8.4.1) G®aH_1{sin 6,2 tan (f )) = « ■
There is no need for m to be integral and (8.4.1) holds for all
0 < 6 < ir.
(b) Consider now the same process but using circumscribed and inscribed
perimeters an and bn. Now we have an+1 = 2anbJ(an + bn) and
fr«+i = Va«+i^„ while a0 = 2m tan (ir/m) and b0 = 2m sin {Trim). Then
a„ and bn tend to 2ir and
(8.4.2) //_x(x)aG(tan0,sin 0) = 0.
This is pursued further in Exercises 1 and 2. □
A complete analysis of the iteration of Schwab, Borchardt, Pfaff, and
Gauss is in Miel [83]. The central observation is:
Theorem 8.4 (Schwab-Borchardt)
(8.4.3) A®aG(a,b)
Vb^V2
0<a<b
arccos(a/6)
a a — b
Va^-b2
arccosh(a/6)
0<b<a
Proof. Schwab established the first case by geometric arguments which
Schoenberg [82] reproduces. Given the formulae, it is simpler to use the
invariance principle. Since a2n+1 - b2+1 = (a2 - b2)/4, this reduces to
showing that
2arccos (—— J = arccos (^-) a„<b„
2 arccosh ( -r^) = arccosh (^-) a„>b„
(8.4.4)
Since an+1/bn+1 = \/(anlbn + 1)/2, this is just the half-angle formula for cos
or cosh. □
We now turn to study rates of convergence for Gaussian and Archimi-
dean means.
Theorem 8.5
Let M and JV both be continuously differentiable symmetric means and
suppose that at least one is strict.
8.4 Convergence Rates and Some Examples 251
(a) Consider the Archimedean iteration [M, N]a. Then, if an ¥= bn,
(8.4.5) lim fl"+1~^"+1 = \ .
»— an-bn 4
(b) Consider the Gaussian iteration [M, N]g for comparable M and N.
(i) Then, if an*bn,
(8.4.6) limfl"+1~^"+1=0.
n n
(ii) Suppose, in addition, that M and N are twice continuously
differentiable. Then, if an¥=bn,
(8.4.7) lim '«■"-»■+'' = 1^(^)-^(^)1
where s '■= M®gN{a, b).
Proof. Since M(c, c) = c for all c, we must have Mx(c, c) + M 2(c, c) =
1 for all c. (Here M,. denotes the partial derivative with respect to the ith
variable.) Since M and N are symmetric, it follows that M^{c, c) =
N/c, c) = \, i = 1, 2.
(a) Let 5 := (M ®a Ar)(a, 6). The mean value theorem gives
(8.4.8) an+t -s=±(an-s)+Hbn-s) + o(an - s) + o(bn - s)
and since bn+1 = N(an+1, bn),
(8.4.9) bn+1 - s = K«n+1 ~s)+Hbn-s) + o(an+1 - s) + o(bn - s)
= *(«„-*)+ !(&„ -^) + o(fl„ -s) + o(bn - s).
Thus
an+1 - bn+1 = \{an - bn) + o(an -s) + o(bn - s)
and (a) follows.
(b) (i) Similarly, (8.4.8) still holds, and also
bn+x ~s=\{an-s)+ \{bn -s) + o(an - s) + o(bn - s).
(8.4.10)
252
General Means and Iterations
Thus
an+i ~ bn+1 = o(an -s) + o(bn -s) = o(an - bn)
(since s lies between an and bn). This gives (b).
(ii) Given one more derivative, we have
«„+i -s=Ha„-s)+ \(bn -s) + \Mu(s, s){an - bnf
+ o{an-bnf
and a similar formula for bn. Subtraction gives (ii). Here we have
used V2M(s, s)(an — s,bn — s)2 = Mn(s, s)(an - bn). [Exercise
7d).] □
Considerably more can be said if M and N are two or three times
continuously differentiable. (See Foster and Phillips [846].) In addition the
convergence in each case is uniform. For our purposes Theorem 8.5 suffices.
In particular, Archimedean iterations characteristically converge linearly
and Gaussian iterations super linearly (quadratically if twice differentiable).
In light of Jacobi-type methods in the theory of equations, this may seem
counterintuitive. Exercise 3c) shows that (8.4.6) may fail if M and N are not
differentiable. The theorem justifies our separating Gaussian and
Archimedean iterations. While symmetry is central to our convergence arguments, it
is not always essential, all that is really needed in (a) and (b)(i) Theorem
8.5 is that the two means have the same gradients on the diagonal. (See
Exercise 11.) Also, (8.4.7) shows that better than quadratic convergence is
possible only if Mn = Nn. (See Exercise 7.) Additional information is given
in Exercise 11.
Finally, let us say that two iterations are equivalent ([M, N] ~h [M', N'])
if M~hM' and N~hN'. Strong equivalence is similarly defined. Clearly
equivalence of iterations implies equivalence of the limits, but not
conversely. Indeed, if the mapping h is continuously bi-differentiable, the rates of
convergence must be the same. (See Exercise 8.)
Comments and Exercises
Some of the Archimedean considerations are discussed again in Chapter 11.
If we take m'-=6 in Example 8.2(6), we have a recursive version of
Archimedes' original method. (See also Edwards [79] and Phillips [81].)
1. a) Show that (8.4.1) and (8.4.2) are consistent with the general
formula of Exercise 5a) of Section 8.3.
b) Show directly that in both (a) and (b) of Example 8.2 we have
an+\ ~ bn+1 ~ \(an - bn). This illustrates why computation of any
large number of digits of it by this method is impractical.
8.4 Convergence Rates and Some Examples 253
c) Show that in both cases (an + 2bn)l3 =: dn satisfies
dn - lim dn 16
2. a) Show that changes of variables cause no problems for Gaussian or
Archimedean iterations, that is,
i) (M®gN)f = Mf®gNf
ii) (M®aN)f = Mf®aNf.
b) Show that (8.4.2) implies that
A®aG(x,l) = - 0<*<1.
a arccosx
Hint: Let x-= cos 0 in (8.4.2) and use aii).
c) Rederive Theorem 8.4 from b).
The next exercise shows a class of means which is closed under the
Gaussian product. Contrast this with the Gaussian product of Holder
means.
3. Let Qt{a, b) := t(a v b) + (1 - t)(a a b) for 0 < t < 1.
a) Show that Qt is a homogeneous, symmetric, isotone mean which is
strict for 0 < t < 1.
b) Show that for t > s,
Qt®gQs = Qs/[s+(i-o] ■
This may easily be done by the invariance principle. Alternatively,
suppose a > b > 0 and observe that the limit must be linear.
c) Show that \an- bn\ = {t- s)n\a-b\.
d) showthate,®aa = e,®gar
4. Let M and JV be symmetric, homogeneous means.
a) Show that M®gN = G if and only if N = M_x.
b) Show that M ®g N = A if and only if N = 2A - M. Note that M is
a mean if and only if 2A - M is.
c) Characterize M®gN= Hp.
d) Show that if M{a, b): = \/(a ~ bf + ab, then
M® G = H2.
Explicitly,
*n+x:=^la2n + b2n-anbn and bn+1: = VaJ~n
converges to V(ao + b0)/2.
e) If M:=(HX + H_X)I2 and N: = (Hx + L2)/2 then M®gJV=A
. Explicitly,
: _a2n + b2n + 6anbn __3a2n + 3b,2 + 2anbn
dn+1[~ 4(a„ + &„) and &-- . 4<«. + &„"). ^
* .-
cbnverges to (a0 + b0)/2.
(A variant of the AGM) Show that
2 /w*\ 2 / //x
Explicitly, the common limit of
vi-—4—+ -T" and *»■» - ^+&;+-r-
is 2AG(a0, 60)2/(a0 + 60). (See P. B. Borwein [Pr].)
a) Show that M(a, b):=(a + b + 2ab)l(a + b + 2) is a strict
symmetric nonhomogeneous mean with M = M_,.
b) Show that M ®a M(a, b) = (a + 2b + 3ab)/(b + 2a + 3).
Hint: If A(f)':= f/(l + 0, then M = Ah.
a) Use Theorem 8.5(6) to show that [Hp,Hq]g, [Hp, Lq]g, and
[Lp, Lq]g converge quadratically, not better, for p¥= q.
b) (The quartic AGM) The iteration [A,N]g, where N(a,b): =
[(a3b + b3a)/2]114, converges with order 4. [See Exercise 3 of
Section 1.4]
c) Reformulate the cubic modular equation as a third-order Gaussian
mean iteration to compute AG.
d) Show that for any symmetric twice differentiable mean,
Mn(c,c) = -M12(c,c).
a) Verify the assertions about equivalent iterations.
b) Show that [Hp, Hq]g~[Hp., Hq,]g if and only if p*q = q*p.
(Suppose that p and q are not both zero.)
Hint: One may suppose p = 0 or 1 and use Theorem 8.1.
c) Thus [Hx, H2]g and [A, G]g are not equivalent.
For 0<yt<l let M(a,b):= (l-l/k)a +(l/k)b and N(a,b): =
fl*~16/[(l-l/ik)fl + (l/ik)6]*~1.
a) Show that M®N(a,b) = a1~Ukbllk.
b) Show directly that convergence is quadratic. (Theorem 8.5 does
not apply, but gradients do coincide on the diagonal.)
8.4 Convergence Rates and Some Examples 255
10. If M and N are homogeneous and L'= M (x) N, show that for x > 0,
L(0,x) = xL[tM(0),tN(0)].
Thus although we do not know an explicit formula for L'.= Hx (x) L2,
we have L(0, x) = xL{\, 1).
11. Suppose that M and N are comparable continuously differentiable,
homogeneous means such that M®N exists.
a) Show that if an ¥^ bn,
lim , + 1~b"t =\M1(s,s)-N ,(s, s)\
n n\
where s: = M (x) N(a, b).
b) What conditions, other than symmetry, in conjunction with part a)
now guarantee linear or superlinear convergence?
12. Suppose that M and JV are continuously differentiable, homogeneous,
symmetric means. Let
M*(a, b) := aa + (1 - a)M(a, b)
0<a<l
N*(a, b):=aa + (l- a)N(a, b).
a) Show that [M*, N*] still converges superlinearly.
b) For M:=A and N:=G [so that M*(a, b) = (3a + b)IA and
N*(a, b) = (a + Vab)/2] show directly that convergence is
quadratic.
13. The ten neo-Pythagorean means are /x := A, f2'-= G, /3 := H, /4 := L2
(also called the contraharmonic mean), and
f5{a,b):=\d+\{d2 + Am2)xn
f6{a,b):=-\d+h{d2 + m2)xn
/7(a, 6):= m + d2/m
—2
f9(a,b):--
m + d
f&{a,b)'-=m- d2lm
f10(a, b):=im + l(m2 + 4md)U2
where m '■= a a b, m '■= a v b, and d'-=m- m.
256
General Means and Iterations
a) Show that /5 to /10 are symmetric, homogeneous means which are
not differentiable on the diagonal. Thus Theorem 8.5(a) does not
apply.
b) Show that/7 ®g /g = A, and that [/7, fs]a converges sublinearly for
a < b but quartically for a > b.
c) Show that /8®g/7(2>l)= 1, and that we have one-step
termination—which cannot happen for strictly comparable means.
These means were originally defined by the Greeks in terms of
proportions. For/:= 1 to 10, respectively, x'-= fj(a, b) solves
CD IF"
(5)
(7)
(9)
m ■
x-
m ■
x -
m -
m -
m
m
- X
- m
- X
- m
- X
- m
— X
- m
m
m
m
m
m
X
m
m
m
m
X
(2)
(4)
(6)
(8)
(10)
x — m m
m - x x
x — m m
rn — x m
x — m m
m - x x
rn — m m
x - m m
m — m x
x - m m
d) Verify that x = fj(a, b) in each case.
e) Verify that these are the only such means.
8.5 CARLSON'S INTEGRALS AND MORE EXAMPLES
This section is largely given to a description of a unified approach to Gauss's
and Borchardt's algorithms due to Carlson [71]. It shows both the
possibilities and the limitations of looking for iterative methods based on a
more general hypergeometric transformation.
Let us consider the integral
ll(«;8,8Vx*/):=]S^/0V,-1(r + ^)-'a + /r'A
(8.5.1)
where a + a' = S + 8'; re(a), re(a') > 0, and /3 is the beta function. (The
prime is not complementation in this context.) Then obviously
(8.5.2) R(a; S, 8'\ x2, y2) = R(a;8', 8; y2, x2)
and R is homogeneous of degree -a'mx2 and v . (See Exercise 1.) We are
interested in
8.5 Carlson's Integrals and More Examples 257
(8.5.3) Cij:=Fi®Fj i,/ = 1,2,3,4
where the means Ft, i = 1,2, 3, 4, are given by
Fx(a,b):=^~ F2{a,b):=Vab
^, ,x la + b „, ,. ja + b ,
F3(a, 6):=y-y-a F4(a,b):= y-y- 6 .
Thus C12 = AG, C34 produces Carlson's log (Exercise 3 of Section 8.3), and
Cu ~ A®aG leads to Borchardt's algorithm (Theorem 8.4).
Theorem 8.6 (Carlson)
Let i, j = 1,2, 3, 4 with j ^/.
(a) [F/; .Fj] converges.
(6) Cjy(a,6) = [i?(a;S,S';a2,62)r1/2a
where (a; S, S') is given by the (i, /)th entry Table 8.1.
(c) Convergence is linear except for C12 = C21, which is AG, the limit of
the Gaussian AGM.
TABLE 8.1
1 /=1 1 = 2 / = 4 / = 3
1 * (2! 2) 2) (2! 2) 1) (4) 3> 2)
2 G;M) * (l; 4,1) (i;M)
3 (i;l,i) (1;1,1) (1;1,1)
4 (i;i,3) (l;i,i) * (1-,1,1)
Since Fx and F2 are symmetric while F3(b, a) = F4(a, b), up to exchange
of 8 and 8' the table is symmetric around the main diagonal because of
(8.5.2). Boxes marked with * correspond to trivial iterations.
Proof.
(a) Since F4 < F2 < Fx < F3 for 0 < b < a, each pair of means is partially
comparable. By Proposition 8.4 on composition, each mean is strict.
Thus Theorem 8.2 establishes (a).
(b) Let us denote Ft(a, b) by ft. Make, in (8.5.1), the substitution
^=5(5+/2)/(5+ /2). Then
258 General Means and Iterations
and
* =s +2sf2l+flfl = (^+/3)(^+/4)
ds (s + flf (s+fl)2
.2 _ (s+fl)2 _t2 {s+flf
t+a=- '-%- t + b =
s + fl s + fl
Thus
(8.5.4)
R(a;S,S';a2,b2)=j^-^[sa'-\s + fir-\s + f2r-'
x (s+fiy-^is+f4y-2s' ds.
If we fix i and /, we can determine values of the parameters for which
fk, k¥= /, /, vanishes in (8.5.4). For example, with /:=3 and7 := 4, we
set a : = a' ■ = 1 and have
(8.5.5) i?(l; 8,8'; a2, b2) = R(l, 28-1,28'- 1; f\, f2)
where 8 + 8'= 2. There is a unique value S:=l so that (8.5.5)
becomes invariant for/3 and/4. By Exercise lb), [R(l; 1,1; a2, b2)]~112
is an invariant for [F3, F4] to which Theorem 8.3 applies. Similarly, for
/:=1 and j: = 2, we set 8 '■ = 8': = \, which gives a + a' = 1 and
R(a; \,\;a2,b2) = R(a;l-a,a; f\, f\).
The only possible invariant has a:=a'= \, and by Exercise lb),
[^(5; 5) \',a2, b2)]~l is an invariant for [Fx, F2]. The rest of the table
is similarly verified. [See Exercise lc).]
(c) The convergence assertions are straightforward. [See Exercise
Id).] □
example 8.3. As observed before, C12, C34, and C14 have been previously
identified. In Exercise 2 we indicate how to recover the previous forms of
the limit. In particular Theorem 8.6 gives integral representations for these
means. Now consider /:=1 and /:=3 (or similarly, /:=1 and /: = 4).
Suppose that a> b. Let
(8.5.6) 1-^:=^.
t + a
8.5 Carlson's Integrals and More Examples 259
Then 54 = (a2 - b2) l(t + a2) and
MA 4-,-1/2 , /2 I.2\l/4/. , 2-,-3/4/. , .2-,-1/2 ,.
4(1 - s ) ds = -(a - b ) (t+ a ) (f + o) af.
Since a := |, S := f, and 8' ■= \, we have a' = 1, /3(a, a') = 4, and
ra-iW4
-(a2-6T1/4J0 d-/rl/2^.
Recall that the arclemniscate sine is
arcslx: = Jo (l-s4)~U2ds x2<l
and gives the arc length of the lemniscate (/-2 = cos 20) from the origin to
the point with radial position x. (See also Theorem 1.7.) Thus
(8-5J) c«(fl'fc)=KiF^fF °^b<a-
Similarly,
(8.5.8) Cx3(a, b)
where the hyperbolic arclemniscate is defined by
arcslhx:= f (1 + s4)~1/2 ds .
[See Exercise 2d).] Observe that V2C^I2(1,0) = K(l/V2), by Theorem
1.7.
example 8.4 (Archimedean means) Let us consider Hp ®a Hq for p, q =
±1, 0. Then the previous exercise and Exercise 5 of Section 8.3 show that it
suffices to consider Hx®aHx, Hx®aHQ, and Hx®aH_x. Exercise 5c) of
Section 8.3 gives Hx ®a Hx(a, b) = (a + 2b) /3 while Hx ®a H0 is Borchardt's
algorithm. It remains to study Hx ®aH_x. With a ■= /3 := \, this is a special
case of the mean iteration a0 ■= a > 0 and b0 : = b > 0,
an+x.= aan + (l-a)bn bn+x - ^^ + (1 _ ^
V^T2 n ,
[arcslh(V/a2-l)l/4]2 U<fl<&
260
General Means and Iterations
where 0 < a < 1 and 0 < /3 < 1. In the notation of Exercise 3 of Section 8.4
this computes G-= Qa®a(Qfi)_1 for a>b. If we let kn'-=bnlan, we
derive
^1-l = (a]8)(C-l) = («i8)"+1(f-
and
Thus
T = i + w(H
= a + (1 - a)kn
and the limit is
G(fl|fc)"fliUi-W(i-«/&)J-
If we let 1 - a/6 =: x and a(3 ='■ q, we have
(8.5.9) g(i,~-) = 11
1- (ax)g"
1-V
n:.0[i-(a*)g"]
which is the ratio of thetalike products occurring in the ^-binomial theorem.
(See Exercise 7 in Section 9.4.) Wimp [84] continues a discussion of similar
calculations, all of which give linear convergence.
If we let a/3 : = 5 and a : = \, we have a product expansion for
Hx ®aH_x. Foster and Phillips [84a] show how this function can be closely
approximated by elementary functions and can be given an elegant
asymptotic expansion.
Comments and Exercises
Many other related algorithms can be found in Wimp [84]. This includes an
extensive discussion of quadratically computable trigonometric integrals (in
Chapter 14).
1. a) Show that R(a; S, 8'; x2, y2) = y~2aF(a; 8, S + 8'; 1 - x2/y2),
where F is the Gaussian hypergeometric series. (See Exercise 6 in
Section 1.3.)
b) Show that R(a; 8,8';x2, y2) is homogeneous of degree -a in x
and y2. Also R{a; 8, 8'; 1,1) = 1 and R(a; 8,8';-,-) is continuous.
c) Verify the entries in Table 8.1.
8.5 Carlson's Integrals and More Examples 261
d) Show that [Fit Fj], i <j, is linearly convergent for all cases except
i '■= 1, j '■= 2. Observe that [F3, F4], which is not Archimedean, has
a convergence rate of 2~" and [F2, F3], which is partially
comparable, has oscillatory iterates.
2. a) Observe that Cn(a, b) = (ir/2)/I(a, b) with
/(«,&) :=/o
r/2 de
Va2 cos2 6 + b2 sin2 0
as before.
2 ,2
a - b
b) Show that C34(a, b) := V2log(fl/fe) (fl^6)-
//mf: Use partial fractions.
c) Show that Cu(a, b) is given by (8.4.3).
Hint: Let (f + a2)l{t + b2)=: cos d or cosh2 0.
d) Show that C13(a, b) is given by (8.5.8) for a < b.
e) Use the invariance principle to verify that
C24(b, a) = C32(a, b) = y/aCu(a, b)
and that
C23(b, a) = C42(a, b) = VbC13(a, b).
f) This completes the analysis of all of Carlson's means in explicit
form. Attempt to verify part e) directly from Theorem 8.6.
Use the invariance of C13 to show that
i) arcsl x = V2 arcslh y
where (1 + /)(1 + Vl-x4) = 2, and
ii) arcsl x = 2 arcsl z
where x =: 2zVl - z4/(l + z4) and z2 < V2 - 1.
This is Jacobi's duplication formula for arcsl (Watson [33]).
Show that with M(a, b) = G(a, Hpl2{a, b))=-N(b, a),
M®N(a,b) = Xp(a,b) = [^^]VP.
(Crrlson [75]) Let a, b > 0 and set
/^ r rrl2
F(a, b):= - log (a sin2 6 + b cos2 6) dd .
General Means and Itei
a) Show that
F
b) Using a) (or directly)
\c:bU]
show that
= 2F(a,b).
F(a, b) = 2 log {^Y^-) = log Hll2(a, b).
6. (Carlson [78]) Let a, b, c, A real be given. Assume a + \, b + X,
c + A> 0, and at most one of these is zero. Consider
/• CO
7/( A) := T(a, b, c; A) := J^ [(a + t)(b + t)(c + t)]~112 dt.
Let
k := A + \/(A + a)(A + b) + \/(A + 6)(A + c) + V(A + a)(A + c).
a) Show that 7/( A) = 2 T(k).
b) Show that T(\0) = limn^cc2n+1k~112, where iteratively k:= kn,
fc„+1 := k, and A = : fc0.
c) Show that for a, b,c, d>0,
/• CO
J0 [(* + a2)(t + 62)(* + c2)(t + rf2)]"1'2 * = T(A, B, C; 0)
where A:=(ab + cdf, B := (ac + 6rf)2 and C:= (arf + be)2.
7. (Tricomi [65]) Let 0< yfc < 1 be given. Set
Rk(a,b):=yk2a2 + k'2b2 k':=Vl-k2
and
M(a,b):=ah + Kfbh\b 0<a<b.
' b + Rk(a, b)
a) Show that M is a strict mean on 0 < a < b.
b) Show that
CO
M®aG(a,b):=bH cn(2-"v0)
where cn(u0) := a0/b0 and 0< v0 <2K.
Hint: Use the half-angle formula for en
8.6 Series Expansions of Certain Means 263
(8.5.10) c„2(iM)=^i
cn(u) + V*:'2 + k2cn\u)
2UJ 1 + \Jk'2 + k2cn\u)
and deduce that cn(u„) := a„/fr„ satisfies vn+1 = vn/2 and bn+1/bn =
c) Recover Borchardt's algorithm by considering A:: = 0.
8. Compute a formula for the MacLaurin series for G(l - x, 1) in (8.5.9).
8.6 SERIES EXPANSIONS OF CERTAIN MEANS
For homogeneous means it is particularly easy to compute Taylor series. It
suffices to expand the trace around 1. In the exercises we list various series
taken from Gould and Mays [84], extending results in Lehmer [71].
//(1 l-^) = l-l^ + ^ll^ + £^l^_(p-l)(p-3)(2p + 5) 4
npyi,i X) i 2x-r g x -r. 16 x 3g4 x
+ ••• .
(8.6.1)
Similarly
-^ = 1- ,
2 4 8 48
Lp(l,l-,) = l^ + ^^ + £^,3_(p-l)^-3);c4 +
(8.6.2)
From these it follows that the only means which are both Holder and
Lehmer means are //_x, A, and G. (See Exercise 5.) Similarly, but slightly
more elaborate, analysis shows that the following theorem holds.
Theorem 8.7 (Lehmer)
(«) Hp®gHq = Hs p*q
if and only if p + q = s = 0.
(b) Lp®gLq = Ls p*q
if and only if p + q = 2s = 0,1,2.
(c) Lp®gLq = Hs p*q
if and only if p + q - 1 = 5 = -1, 0,1.
(A) Hp®gLq = Ls p^q
if and only if p = -q and s= \.
264 General Means and Iterations
Proof. Sufficiency is easy in each case. Necessity is more elaborate and
in some cases somewhat tedious. (See Exercise 6.) In essence it follows from
the invariance principle and the computation of the first few terms of the
Taylor series of the limit. □
Thus only A, G, and H can arise in such compounding. Lehmer continues
by studying in detail the Taylor series of A®gL2, various properties of
which had been previously examined by Stieltjes in a letter to Hermite
(Hermite and Stieltjes [05, letter 323]). (See Exercise 7.)
Comments and Exercises
Theorem 8.7, while very pretty, is of limited ambit, since it cannot diagnose
whether the compounded mean is equivalent to a mean in the class.
1. Show that the Holder means satisfy
CO
H(\,l-x) = JJ A(p,n)xn
where
^=-(-^(^1^1^)
/t=o
for n > 1.
2. (Euler) If g(x): = E^0 anxn and gp(x) =: E:=0 bnxn, then
n
3. Show that the Stolarsky means satisfy
CO
Sp+1(l,l-x) = S D(p,n)xn
n — \
where
4. a) Show that the Gini means satisfy
G,,r(l, 1 - x) = GrtS(l, 1 - x)GStr(h 1 - x)
where
• s\ 1/0-r)
GsAa,b):=(^)
8.6 Series Expansions of Certain Means
b) Show that
CO
6,,,(1,1-*) :=2 G{s,r,n)xn
where
c) Thus Gsr can be computed by convolution. A similar result
for Ers.'
Show that H = Lp if and only if p = 0 and q = -1; p = 1 and q =
p = 5 and q = 0.
Prove Theorem 8.7.
(Lehmer) Let
i?0c):=A®gL2(l,l + 4;c).
a) Show that
i?(;c) = 1 + 2 E
x2"
n-0 °0M'2 '" °b
where the P, are defined recursively by P0(x):=l
//in*: If
P
a0:=l
*n p p . . . p
^.,+2*'
2x2
then
"l°2' '°n
Cq • j^A
,. _ bn+i-an+i
t-M4- 1 «
266 General Means and Iterations
Now observe that
R(x) = a0 + 2 0,+1 - a,)
/=0
and use the invariance principle,
b) Show independently that
R(x) = 1 + 2 gnxn
n = l
where gx := 2 and gm, m >2, satisfies the recursion
|m/2J
._/ 1\m V 0m-2/fc/ m-2 \
Observe that the g, are integers.
//inf: Show that the series for R satisfies
«W = (1 + 2,)^(^).
c) Use b) to compute gm for m < 20. (g20 = 23335660.)
8.7 MULTIDIMENSIONAL MEANS AND ITERATIONS
Most of the results of this chapter have direct analogues for functions of
more than two variables. Often these follow by similar arguments. We
concentrate on mean iterations. Let a '■= (ax,. . . , aN) be any strictly
positive vector.
An iV-dimensional mean is any continuous function M such that
(8.7.1) J\at<M{a)<\/ at
/=1 /=1
for all at > 0. The mean is strict if
(8.7.2) A «,<V «/=> A a,<M(a)<\/ a,.
/=i i=i /=i /=i
Also M is symmetric if M(a) = M(6) for any permutation b of a.
Homogeneity and isotonicity are defined analogously. The Lehmer and
Holder means are defined by
(8.7.3) Lp(a):=-^ x
£f=i<
8.7 Multidimensional Means and Iterations 267
respectively.
Given N iV-dimensional means M\ . . . , MN, we consider the iteration
[M\ ..., MN] defined by a0 : = a > 0 and
(8.7.5) a'n+1:=M'(dn) l^i^N.
We write this vectorially as dn+1'-= M(an) and denote the common limit
when it exists by ®/ii M'. Again, when each Mt is symmetric, we consider
N
this to be a Gaussian iteration with limit (x)g M'. Similarly, if
i=i
(8.7.6) a'n+1 := M\axn+X, a2n+1,..., a'~+\, a'n,. . . , a")
we have an Archimedean iteration (there are other possible generalizations)
N i
with limit (x)a M . Under mild hypotheses the convergence results of Section
8.4 remain valid.
Theorem 8.8
Let M1, M2, ..., MN be strict iV-dimensional means.
(a) Then ®^=i Ml exists and is a strict, continuous mean.
(b) Suppose that the means are symmetric and continuously differentiable.
Let an := V"x a'n and bn := /\?-i <■ Then, if an *bn,
(8.7.7) lim lfl"^~^il = o
and convergence is superlinear in the Gaussian iteration.
(c) If, in fact, the means are twice continuously differentiable,
convergence in the Gaussian iteration is quadratic (uniformly on compact
subsets).
(d) Convergence is linear in the Archimedean iteration for continuously
differentiable symmetric means.
Proof.
(a) Much as before, an > an+1 > 6n+1 > 6n, and we may suppose an
converges to a and bn converges to b. Let c be any cluster point of {an},
which is bounded. Then 6 < A /=i ci — V /=i c,^a, and
a = V M'(c) b = A M'(c).
/=i /=i
and
(8.7.4)
268
General Means and Iterations
Since all the means are strict, we must have ci = a and c, = b for each i.
Thus a = b, and the iteration converges, say, to a. As before, the limit
is a continuous strict mean.
(b) By symmetry we have M'k(ce) = 1/JV for any i, k and any multiple of
the unit vector e. Thus
1 N
a'n + 1 - a = - E [(a* - a) + o(a* -a)].-
iy fc-i
Hence
|aj,+1 - <+1| = o« - a) + o(a'n -a) = o(an - bn)
and (8.7.7) follows.
(c) This is argued as in the two-variable case and relies on the fact that the
Hessian V2M'(a, a,. . . ,a) sums to zero and has all diagonal entries
equal to —(N— \)y\ where each off-diagonal entry equals y'. Thus for
each i and j,
<+1 - <+1 = ^T~ 2 (flj - aknf + o((an - bnf)
z h<k
(and unless all y' coincide, convergence will be at best quadratic). Now
0^ «„+1 - &n+1 ^ 5|a„ - bn\2 + o{\an - bf)
where B is an easily computable constant depending on the means,
initial values, and dimension. This shows (c).
(d) We leave (d) as Exercise 2b). □
example 8.5 (schlomilch) Consider
a„ + b„ + c„
n n
= :M\an,bn,cn)
K+i-=yj~n'n ' V—— =--M\an,bn,cn)
a„b„ + b„c„ + a„c„
3
1/3
cn+i:=OAcJ =-M (an,bn, c„).
This is a quadratically convergent Gaussian iteration, as follows from
Theorem 8.8(c). (Since MX>M> M3, M2 is a strict mean.) Let S(a, b, c)
denote the limit. While we cannot identify 5, we can, following Landau,
observe that if ac = b2, then ancn = b2 and thus
(8.7.8) S(a, Vac, c) = He® G(a, c) =: S(a, c)
8.7 Multidimensional Means and Iterations
269
where He(a, c)-= (a + Vac + c)/3 = E3/21/2(a, c) is the Heronian mean.
While this does not appear to have a closed form, it has an attractive
product expansion,
(8.7.9)
nl + VX~n + Xn
. . . ~f
where x0 '■ = x and
3VX~n
*"+1" l + Vx-n + xn-
While it is difficult to identify two-dimensional compound means, it
appears even harder to identify higher dimensional ones.
There is a satisfactory analogue of Proposition 8.3. Let / be continuous
and strictly monotone as before. Let a0, ay,. . . , aN be distinct positive real
numbers. Let F(N) be any iVth antiderivative for/. We define
(8.7.10) Ms («„,«!,...,«„):=/-
>Nf
AH 2
£ F(N)(ak)
k=o nfc^- {ak - Uj) J
Then Mf is a strict, symmetric A^-dimensional mean. (See Exercise 5.)
We should emphasize that the invariance principle continues to hold:
$'■= ®/Ii M' is the unique continuous diagonal mapping satisfying
(8.7.11) ¢ = ^,^,...,^).
Comments and Exercises
A wealth of information on ./V-dimensional means can be found in Hardy,
Littlewood, and Polya [59]. The Lehmer means are studied in Beckenbach
[50]. The Schlomilch mean and Landau's contribution are discussed in
Schoenberg [77]. Various ./V-dimensional quadratic iterations are exhibited in
Wimp [84] and Arazy et al. [85].
1. a) Show that Hp and Lp are strict, homogeneous, symmetric means.
b) Show that (II?=1 a,.)lw = lim^o Hp(a) =: H0(a).
c) Show that Hp is convex for p>l and that Lp is convex for
l<p<2.
2. a) Show that ®*li M' is a continuous strict mean when each M' is.
b) Show that convergence is linear in the Archimedean iteration.
c) Estimate the rate in the three-variable case.
d) Show that
270
General Means and Iterations
®aHp(a) =
N
Ltf(tf + 1) ~
Zj ka
iip
p^O
and
®a H0(a) =
r N
n.
•Jfc = l
2/N(N+l)
Hint: Use the invariance principle when p : = 1.
3. Show that 5(1, jc) satisfies (8.7.9). Compare this to the product
expansion for SS(l,x).
4. a) Let
M\a,b,c): =
M\a,b,c): =
a + b + c
ab + be + ac
a + b + c
and
M\a,b,c):--
3abc
ac + ab + bc 1/a + l/fe + l/c*
Show that
(x) M'(a, b, c) = (abc)
1/3
and that convergence is quadratic,
b) Let Sk be the kth elementary symmetric mean function of N
variables. That is, 5^ : = fkl( Nk) with fk as in Exercise 6 of Section 11.2.
k ..
Thus, 50 := 1, 5X := Hx, etc. Let MK := Sk/Sk_1 for 1 < A; < w. Show
that Mk > M*+1 and that <g) Ml = H0.
c) In a) replace M by //0. Thus
a„ + &» + c„
*n ~n ~n
B+1* 3
c„
kn+i:=(aAO
1/3
""+1" l/an + l/bn + l/cn '
Show that the limit mean M satisfies
M(a, Vac, c) = He®gHe_x(a, c) = Vac = (abc)1'3
This is not the general limit.
8.7 Multidimensional Means and Iterations
271
d) More generally, whenever M is homogeneous and symmetric,
4> = M (x) G (x) M_x satisfies 4>(a, Vac, c) = Vac. Similar results
hold in N dimensions.
a) Show that Mj of (8.7.10) is uniquely defined and is a strict mean.
Hint: Let
-A Uj7tlc (x - Oj)
PN(x) := Z F(N)(ak) n.^ {Uk _ aj) ■
Then PN is the Lagrange interpolating polynomial for F(N) at
a0, 0,-y, • .. , aN. (See Exercise 1 of Section 10.1.) Thus P^° and /
must agree at some point strictly between A;=i at an(* V;=i a,-
Now show that Ms extends continuously to all strictly positive
variables. The harder part is to show it is strict.
b) Show that the three-dimensional Stolarsky means are appropriately
defined (and are homogeneous) if
S(a,b,c): =
+
Lp(p-l) \(a-b)(a-c) (b-a)(b-c)
+
l/(p-2)
for p ^0, 1, 2. Also
S0(a, b, c): =
(c- a){c - b),
{a - b)(b - c)(c - a)
%b - c) log a + 2(c - a) log b + 2{a - b) log c J
(a - b)(b - c)(a - c)
2(6 - c)a log a + 2(c — a)b log 6 + 2(a — b)c log c
and, using f(x) : = log, that the identric mean may be given as
while
5x(a, b, c): =
^"(a, 6, c) := 52(a, b, c)-= exp
a log a
+
fr2logfr
L (a - b){a -c) (b - a)(6 - c)
+
c log c
(c-a)(c-6) 2J-
c) Show that the iV-dimensional logarithmic mean is given by
°*7VV^l> ^2' • ■ ■ ' ^N) '~~
(-1)N(N-1)2
log a,-
/=i nyV,-(a,-- ay) J
-l/(Af-l)
d) Investigate the isotonicity of 5 (a, b, c) as a function of p.
272 General Means and Iterations
6. There are various multidimensional generalizations of the AGM due to
Borchardt [1888] and others. In four variables one may take
fl„ + b„ + c„ + d„
n n n n
"n + 1
K + l
Cn + 1-
dn + X-
4
y/a„bn + y/cndn
2
._ Va„c„ + VMn
2
y/andn + y/bncn
2
a) Observe that, while the means are not all symmetric, the derivatives
on the diagonal (a = b = c = d) all coincide, and hence establish
quadratic convergence.
b) Show that when a0 ■= b0 and c0-= d0, the iteration reduces to the
AGM. This iteration shares many of AGM's attributes (Arazy et al.
[85]).
7. (An extended convergence result)
a) Observe that the convergence result Theorem 8.8 continues to hold
if the condition that the means are strict is relaxed to
N N N N
M(c) = V c,- and M(c) = A ct implies /\ ct = V ci ■
/-1 i-l /-1 /=1
Here
N __ N
M:=/\ M' and M:=\/M'.
/=i /=i
This clearly holds if N - 1 of the means are strict.
b) More interestingly, use a) to establish the following result. Let A be
an entry-positive N by N matrix. Set A0 : = A and
An+1:=|(*VAJ2
for n > 0. Here *VT" represents entrywise square root. Then A„
converges to a constant matrix with entry e(A).
c) KA:=(" ba) then e(A) = AG(a, b).
d) KA:=(" a. J then e(A) = a(a, b).
e) Similarly consider A0 and B0 entry positive and
8.8 Algebraic Iterations and Functional Relations 273
Show that this iteration converges to a constant matrix. (See Cohen
and Nussbaum [87].)
8.8 ALGEBRAIC ITERATIONS AND FUNCTIONAL RELATIONS
Among the more familiar and fundamental properties of the exponential
function is that it satisfies the algebraic functional relation
(8.8.1) /(z) = [/(z/«)]\
Are there any other algebraic functional relations of the form
(8.8.2) /(z) = /3(/(/,(z),z))
where /3 and /i are algebraic? More precisely, /i is an algebraic function if
there exists a polynomial P in two variables with coefficients in some base
field F (for most of this discussion F '■ = Q or F: = C) such that
(8.8.3) P(/x(z),z) = 0
and O is an algebraic functional relation for / if O is a polynomial in three
variables over F and there exists an algebraic function /jl so that
(8.8.4) O(/(z),/(Mz)),z) = 0.
The function /i is termed an algebraic transformation (for /). We assume
throughout that z is a complex variable and unless otherwise indicated that/
is defined on a region U which contains an open set V so that fjb(V)G U.
[This ensures that (8.8.4) can be continued to any open connected
component of the domain of / that contains U.] The collection of all such
transformations satisfying (8.8.4) for some O we denote by TG(/: F), the
algebraic transformation group of / over F. If
(8.8.5) L(x):=M®N(l,x)
where M and N are homogeneous means, then
(8.8.6) L(x) = M(x)L(ijl(x))
where M(x) := M(l, x) and /jl(x) := iV(l, x)IM{\, x). Thus if both M and N
are algebraic, then L satisfies an algebraic functional relation with algebraic
274
General Means and Iterations
transform (jl. It transpires that the transformation groups of the most
elementary transcendental functions are very simple, too simple in fact, to
support quadratically converging mean iterations. (See Exercise 1.) This, in
part, explains why the algorithms of Chapter 7 for exp and log require the
intermediate use of nonelementary transcendental functions.
We wish now to compute the algebraic transformation groups of some
familiar functions. However, we need first to establish the transcendence
[over C(z)] of the functions exp, log, and exp(exp). This is considerably
easier than the arguments for the transcendence of numbers such as e. The
arguments are roughly the same for all the above functions. For exp it
proceeds as follows. Suppose exp satisfies an equation of the form
k
(8.8.7) 2 at exp (iz) = 0
/=o
where the a{ are rational functions, k is assumed minimal, and a0 = 1. We
differentiate (8.8.7). to get
k
(8.8.8) 2 (dt + iat) exp (iz) = 0 .
;=i
If k > 2 and we divide (8.8.8) by exp (z), we obtain a lower order expression
than (8.8.7) and violate the minimality of k. To dispose of the k = 1 case we
must show that exp is not a rational function. Since exp is entire, if it were
rational it would in fact have to be polynomial, and if it were polynomial it
would have to have a finite Taylor expansion.
Theorem 8.9
(a) TG(exp: Q) = {az: a rational}
(6) TG(log:Q) = {z*: b rational}
(c) TG(exp(exp):Q) = {z}.
Proof. We will prove, by elementary methods, that
(a') TG(exp: Q) = {az + b: a rational, exp (b) and b algebraic}
(b') TG(log: Q) = {azb: log (a) and a algebraic, b rational}
(c') TG(exp (exp): Q) = {z + b: exp (b) and b algebraic}.
To see that (a), (b), (c) and (a'), (b'), (c') are the same is equivalent to
Lindemann's theorem which guarantees that exp (a) is transcendental for
any algebraic a^O. (See Exercise 7 of Section 11.2.)
We first prove part (a1). Suppose that /jl is an algebraic transformation for
exp. Then there exists an expression of the form
k
(8.8.9) 2 ai exp (m^) exp (n,/i.(z)) = 0
i = 0
8.8 Algebraic Iterations and Functional Relations
275
where the at are nonzero algebraic functions in z and the mi and nl are
integers. Equivalently,
k
(8.8.10) 2 ai exp (mtz + n,/x(z)) = 0
1 = 0
where we assume k is minimal, that is, we assume (8.8.10) contains a
minimum number of distinct nonzero exp {mtz + «,/i.(z)) terms. If we divide
(8.8.10) by its last term and differentiate, we get
2 [m, -mk + («,. - nk)ii.][-f-j + (-1 j exp [(m. - mk)z + («,. - nk)fji]
= 0
(8.8.11)
This expression has one less term than (8.8.10) and contradicts the
minimality of (8.8.10) unless (8.8.11) contains no nonzero terms. This
implies that each term of (8.8.10) must be constant. Thus there exist m and
n integral and a nonzero algebraic function a{z) so that
(8.8.12) a{z) exp {mz + nfjb(z)) = constant.
Since exp is transcendental, mz + tifjb(z) must be constant. Call this constant
b; since /i. is algebraic over Q, b must be algebraic. Specializing (8.8.12) at
z = 0 shows that exp (b) must also be algebraic. Part (a') is completed by
observing that /jl(z) = (m/n)z + b is an algebraic transform of exp.
The proof of part (b') is similar. If fju is an algebraic transform for log,
then there exists an expression of the form
(8.8.13) 2 a,[log (z)]m< ■ [log (/i(z))]-' = 0
i = 0
where the ai are algebraic functions and the mt and nt are nonnegative
integers. Let (8.8.13) be minimal in the sense that it has the smallest
maximal degree and contains the fewest distinct terms of maximal degree.
(The degree of a term is mi + nt.) Suppose the term corresonding to i = 0 is
such a maximal term. If we now divide (8.8.13) by a0 and differentiate, we
are left with an expression containing fewer maximal terms, which
contradicts minimality unless all the transcendental terms vanish under
differentiation. In particular (8.8.13) must actually be of the form
(8.8.14) b log (z) + c log (/i(z)) = d
where b, c, and d are algebraic numbers. Thus
276
General Means and Iterations
(8.8.15) ti(z) = exp(d/c)z'blc
and we see that fi is algebraic exactly when exp {die) is algebraic and —blc
is rational. (Otherwise, by Exercise 2, z~blc is transcendental.) This finishes
(&')•
The proof of (c') is again similar to (a'). There are, however, a few
additional wrinkles. Suppose now that /jl(z) is an algebraic transform for
exp (exp (z)). Consider minimal length sums of the form
k
(8.8.16) Z, at exp [mi exp (z) - ni exp (/i.(z))] = 0
;=o
where the at are algebraic functions of z, exp (z), and exp (/a(z)). Dividing
and differentiating (8.8.16) leads, as in (a'), to a contradiciton unless
(8.8.17) mt exp (z) - nt exp (/i.(z)) = constant
and the result follows. One of the wrinkles is that we must establish the
transcendence of exp (exp (z)) over the algebraic functions in z, exp (z), and
exp (/i.(z)). This is necessary for our minimality argument and can be done
analogously to proving that exp is transcendental. (See Exercise 2.) □.
The transformations we are considering are algebraic over Q. This is a
natural choice for computationally related questions. The analogous results
for algebraic transformations over C are similar but easier because the
number theoretic details vanish. We have the following:
Theorem 8.10
(a) TG(exp: C) = {az + b: a 6Q and b eC}.
(6) TG(log: C) = {azb: b e Q and a e C}.
(c) TG(exp (exp): C) = {z + b: exp (b) e Q}.
If/(z) = g(a(z)), where a(z) is an algebraic function, then
(8.8.18) TG(g:F) = aoTG(/:F)oa-1.
example 8.6 Let /jb be an algebraic function and suppose /i.(n) is the
identity. [For example, fju(z) := rz, where r is an nth root of unity.] Then
(8.8.19) /:= exp (/x(1)) + exp (//2)) + ■ • • + exp (/x(n))
is invariant under /jl, where /i,(n) := /i.(/i,(n_1)). Note that TG(/) is not in
general conjugate to TG(exp) since TG(exp) contains no finite elements of
order greater than 2.
8.8. Algebraic Iterations and Functional Relations
277
For qz and log^ (z), where q is an algebraic number distinct from 0 and 1,
we have
(8.8.20) TG(qz: Q) = {az + b: a and b rational}
and
(8.8.21) TG(log^: Q) = {azb: a and b rational} .
To derive these results we first observe that (a') and (b') of Theorem 8.9
hold for the above functions (with exp replaced by qz and log replaced by
log9). The only difficult part is to show that q" and a are simultaneously
algebraic exactly when a is rational. This is the celebrated Gelfond-
Schneider theorem. (See Section 11.2.)
In Exercises 1 and 5 we show that functions like /3(e7(z),z) and
/3(log y(z), z), where /3(-, ■) and y(-) are algebraic, cannot support
functional equations that possess quadratic fixed points in their domain of analytici-
ty. In particular such functions cannot be the limit of quadratically
convergent homogeneous mean iterations. This shows why the elementary
functions are always associated with linearly convergent iterations.
Comments and Exercises
The results of the section underscore the difficulties of analyzing in closed
form the limits of mean iterations. The familiar elementary transcendental
functions can only be limits of fairly trivial iterations. The arguments are
extended in the exercises to cover sin, tan, arccos, and so on. Exercise 6
treats the special case of compounding rational means and shows that the
limit of such a mean iteration is either the kth root of a rational function or
is transcendental.
1. a) Suppose that
L(x) = M ® N(l, x)
as in (8.8.5), where M(l, x) and N(l,x) are algebraic functions
analytic in a neighbourhood of 1. Show that if L can be
quadratically computed by iterating (8.8.6), then fi has a fixed point at 1 of the
form (jl(x) := 1 + 0(1 - xf as x-* 1.
b) Call an algebraic transformation /i. quadratically attractive at c if
fjb(c) = c + 0{c - z)2 as z-> c. Suppose that A is algebraically
conjugate to /i.. That is,
A = a° /jl° a*1
where a is algebraic. Show that A is also quadratically attractive
at c.
278
General Means and Iterations
c) Show that none of exp, log, or exp (exp) nor any function
algebraically conjugate to the above functions is the limit of a quadratically
converging algebraic homogeneous mean iteration.
2. a) Prove that z" (a irrational), log, and sin are transcendental
functions over C(z).
b) Show that exp (exp) is transcendental over C(z, exp (z), exp (/i.(z)))
when /i is algebraic.
3. Prove (8.8.20) and (8.8.21) assuming the Gelfond-Schneider theorem.
4. (On the algebraic transformations of sin, cos, and exponential sums)
Let a(z) be an algebraic function over Fm-=C(z,ez), the field of
rational functions in z and ez. Suppose that a(z) is not algebraic over
C(z). Show that
TG(a : F) C {az + b: a, b £ C}
and calculate TG(cos:C) and TG(sin:C). Outline:
a) Show that if fj, £TG(a :C), then there exists a nontrivial relation
d
Z/ bj(z) exp [kj/Jb(z) + hjz] = 0
/-0
with bj £ C(z) and kj,hjB Z.
b) Consider minimal (in d) expressions of the above type and argue,
as in the proof of the transcendence of exp, that for some /,
exp [kjfjb(z) + hjz] £ C(z).
5. a) Suppose that a(z) is algebraic over C(z, es(z)) where g(z) is
algebraic over C(z), and suppose that a(z) is not algebraic over C(z).
Extend the arguments of Exercises 4 and 1 to show that TG(a, C)
contains no elements that are quadratic at any point where g is
analytic. In particular, such a function cannot be the limit of a
quadratically converging algebraic homogeneous mean iteration,
b) Suppose that a(z) is algebraic over C(z, log (g(z))) where g(z) is
algebraic over C(z) and suppose that a(z) is not algebraic over
C(z). Show that TG(a :C) contains no elements that are quadratic
at any point where g is analytic and nonzero.
6. (On the iteration of rational means)
a) Let M and N be homogeneous rational means. (A rational mean is a
mean that is a rational function.) Show that M®N(l,x) is
convergent in some neighbourhood of 1. Show that either
1) M(x)iV(l, x) is transcendental
or
2) [M(x)iV(l, jc)]' is a rational function for some integer i.
8.8 Algebraic Iterations and Functional Relations 279
Outline: Suppose/(x, y) = M®N(x, y) and suppose that/(1, x) is
algebraic. Then one has a finite sum
2s,(*)[/(i,*)]' = o, SieC(x)
and
(8.8.22) lri(x,y)[f(x,y)]i = 0
where rt(x, y) = x^'s^ylx). Thus, on substituting,
(8.8.23) 2 r,(M, N)[f(M, N)]' = 0
while by inyariance,
(8.8.24) ^ri(x,y)[f(M,N)Y = 0.
Consider a minimal expression of type (8.8.22) and deduce from
(8.8.23) and (8.8.24) that, for some i,
rt(M, N) = r,(x, y)
r0(M, N) r0(x, y) '
Observe that
A'r,(x, y) = rfa, $
and deduce by passing to the limit that
/-,.(1,1) /■„(*, yyUi
f(x,y) =
*-r0(i,i) ri(*>y)
b) Let
and
N(x,y):=(l-l)x+ly
As in Exercise 9 of Section 8.4,
N®M(a, b) = (ak-xb)Vk
280 General Means and Iterations
and the convergence is quadratic. In particular case 2) of a) can
occur,
c) Show that A® L2(l, x) is transcendental. Thus case 1) of a) can
also occur. (See Exercise 7 of Section 8.6.)
7. Calculate
TG(sin ~x: C) and TG(cos ~x: C).
Hint:
sin-1 z = —i log (iz + Vl - z2).
Chapter Nine
Some Additional Applications
Abstract. In Section 9.1 we derive the classical formula for r2(n) and its
theta function equivalent. In Sections 9.2 and 9.3 we consider the summation
of various multidimensional series. Results include an alternating series test in
several dimensions, evaluation of various lattice sums, and related invariants.
Section 9.4 gives Watson's quintuple-product identity, and Ramanufan's 1'VP1
product. Section 9.5 considers quintic and septic multipliers and solvable
iterations.
9.1 SUMS OF TWO SQUARES
We need the following simple lemma on Lambert series whose proof
(Exercise 1) proceeds by expanding both sides of each equation.
Lemma 9.1
If \q\ < 1 and un :- ¢7(1 - q"), then
(9.1.1) 2 um(l + um)=l nun
m = \ n=\
and
CO CO
(9.1.2) 2 (-l)m+1"2,„(l + «2«) = 2 (2» - lK„-2 ■
m—\ n=l
Our development hinges on yet another remarkable identity due to
Ramanujan [62]. (See also Hardy and Wright [60].)
281
282 Some Additional Applications
Proposition 9.1
Let 0 be real with 0 < 0 < <n\ Let
1 / 0\
(9.1.3) T := 7X¢,0):-= 4 cot (^)+ 2 "„sin(«0)
(9.1.4) ^:=7^,0)
1 / 0\1 °°
4COt\27J + ? "„(l + "„)cos(
1 °°
(9.1.5) T2:=T2(q,6)=- 2 >w„[l - cos(n0)]
Then
Proof.
T,+ T2= T2
1 /0
7 COtI -
4 \2
+ 5X + 52
where
and
Now
S\'-= 2 ^ "„ cot (^-j sin (n0)
52: = 2 umun sin (m0) sin («0)
1 I 0\ 1 nl 1
- cot (-) sin (nd) = - + 2 cos (kd) + - cos (nd)
while
Thus
2 sin (md) sin («0) = cos [(m - n)0] - cos [(m + n)6].
t cot ( -) + 2j ak cos (kd)
4 \Z/J k=0
9.1 Sums of Two Squares
for constants ak{q) which we proceed to evaluate. Now
m CO j CO
a0 = -z 2 un + - 2 un
A n = l L n = l
where the \u2n term comes from m = n in S2. Thus
283
(9.1.6)
1 °° 1 °°
fl0 = 9 2 "„(1 + "J = X 2 ""„
by (9.11). For fc>l, S1 contributes to ak,
1
-z uk + A «t
and 5, donates
— Zj u u + — Zj mm X MM
■^ m-
for m, n > 1. Thus
^ m-n=k *• n-m=k ^ m + n=k
k-\
Luckily,
and
1 1
^ 1=1 j-1 L 1=1
"/"*-/= "*(1 + "i + "*-,)
"*+,- + uiuk+i = uk(ui - uk+i) ■
Hence
(9.1.7) ak = uk
= uk(l + uk - - k)
Thfs shows that
*:-!
1 1
~+ 2 (",- - uk+i) --z 2 (l + "i + «**-()
rj-t2
T COt - + 2 K*(l + "J COS (*0) +-2 *«**[! - COS (k6)]
(9.1.8)
which is the desired result. D
284 Some Additional Applications
For 6 '■= it72 this result becomes
1 + 4 2 (-1)'
» 9
2n + l
1-9
2n + l
■i + i6 2 (-i)X,(i + «2»)
n = l
CO
+ 82 (2n + l)(«2„+1 + 4M4„+2)
1 + 8^ (2n + l)M2n+1
n-0
CO
+ 8 2 (4" + 2)m4„+2
on applying (9.1.2). Thus
1 + 4 2 (-1)"
n=0
2n+l
1-9
2n+l
:1 + ;
«-i 1-9
n^0(mod 4)
Now (3.2.23) can be used to show that the right-hand side is d\{q). Thus
(9.1.9)
03 = 1 + 4 2(-1)-:7^-2^
1-q
n = 0
= 1 + 42
n = 0
„4n+l
,4n+3
I"?
4n + l
1-9
4n+3
and this last expression is nonnegative for real q. (Hardy and Wright [60] use
Proposition 9.1 to deduce the formula for d\ from that for d\).
For r = 1 or 3 let dr(k) denote the number of divisors of k congruent to r
modulo 4. Then
_4n + r
CO CO 00
,7=2 2 q(4n+r)d = 2 dr{k)qk
n-0 J- — q n = 0 rf=l Jfc = l
and a comparison of the coefficients in (9.1.9) shows that
(9.1.10) r2(k) = 4[d1(k)-d3(k)].
In other words, the number of representations of a positive integer A: as a
sum of two squares, counting order and sign, is 4 times the surplus of
divisors of k congruent to 1 modulo 4 over those congruent to 3 modulo 4.
This recovers Jacobi's classical result, a result also known to Gauss. (See
Dickson [71].) Since
9.1 Sums of Two Squares 285
°° „m °° I -win 2m + \
£-1 -, , 2m ^-1 * 2m + l
m_l 1 + # m-0 I" #
we also have
CO
(9.1.11) 03(0) = 1 + 4 2 ~
«-1 l + q2m
This is the formula exploited in Section 3.7. Also (9.1.9) shows that
CO
(9.1.12) 02(?) = l + 4 2 {-\fn-x)nqnm
n,m — \
n odd
and
(9.1.13) e2A(q) = l + A S (-1)^^)^-^
n,m = \
t odd
= 1 + 4^ (-l)^"1-1^2"-1).
n.m^l
An alternative recent derivation of (9.1.9), due to Hirschhorn [85], relies
only on the triple-product identity. Begin with (3.1.13) and let a2 : = w. This
gives
CO
(a - a'1) 11 (1 - «V)(1 - «"V)(l - 9")
n = l
00 00
V 4n+l 2n2+n V „4n-l 2n2-n
rt = —00 rt = —CO
Now apply the triple product in form (3.1.1) to each of these sums. This
leads to
CO
(9.1.14) (a - a'1) U(l- a2qn)(l - «_V)(1 - qn)
n = l
=a fi (l+aVn_i)(i+<ry~3)(i - qAn)
n = l
-a-1 n (l + «y-3xi+a-y-'xi - <?4n).
Next differentiate each side with respect to a, at 1. (Use logarithmic
differentiation on each product separately.) This, after rearrangement,
produces
286 Some Additional Applications
1 + 4v (-i)V"'1 n:=1(i-g")3
(9 1 15) "=1 1 + ^ ^=1 (1 + I*"'1*1 + I*"'3*1 ~ ^
The right-hand side of (9.1.15) is
n;=1 (i-gnf n:=1(i-g")2
= el(q)
ni=1 (1 + ^)(1 -<T) n:=1 (l + qnf
where the last equality follows from (3.1.4) and (3.1.7). Thus
0^) = 1 + 4 2¼¾
n=l 1 + ^
which is equivalent to (9.1.9).
In the course of his study of Ramanujan's mock theta functions, Andrews
[86] discovered the following remarkable cubic counterpart of (9.1.11),
which we write as
(9.1.16) 6l(q) = 8 1 1 (l+^-^W-C^'tf.
It is an easy consequence of (9.1.16) that every number is a sum of three
triangular numbers (a fact originally observed by Fermat and proved by
Gauss). This also implies that a number is a sum of three odd squares
exactly when the number is congruent to 3 modulo 8 (Exercise 7]) as
observed by Euler.
Comments and Exercises
A comprehensive account of the development of formulae for sums of 2m
squares, from elliptic considerations and Lambert series, can be found in
Rademacher [73]. Many wonderful related identities are to be found in
Ramanujan [62]. Odd sums are much harder to evaluate. They involve
generating functions of class numbers as shown in Mordell [16] and Watson
[35].
1. Prove Lemma 9.1.
2. Verify (9.1.6) and (9.1.7). Show that these are equivalent to (9.1.8).
3. a) Prove formula (9.1.11) for d\(q).
b) Establish (9.1.12) and (9.1.13).
4. a) Let 6 tend to it in (9.1.8) and evaluate the limit.
b) Evaluate (9.1.8) when 6 := u74, -it/3, 2u73. In particular show that
9.1 Sums of Two Squares
287
°° I „3n+l
3n + 2
•— \ 1 „3n + l 1 „3n+2
= 1+122^
= 1 + 12 2 „ g n,2 - 36 2
00 „3n
5 ^ V ?
5. Two identities due to Lorenz (Dickson [71, vol. 3, p. 29]) are
q"
n=\ i q
where x8(n) is 1 if n = 8k + 1, 8k + 3 and -1 if n = 8k + 5, 8k + 7,
l-<74" 14«
/ 1 — /7n \ / 1
ii) 03(#3(^)=i+2S f4K+4^ t .
„=i \1 - # I „=1 \1- q
Let dai+b(k) be the number of divisors of k of the form ai + 6.
a) Show that the number i?2(fc) of integer solutions of n2 + 2m2 = k is
given by
R2(k) = 2[dw+1(*) + dSi+3(k) - dSi+5(k) - dSi+7(k)].
b) Similarly, the number R3(k) of integer solutions of n2 + 3m2 = k is
given by
R3(k) = 2[d3i+1(k) - d3i+2(k)] + 4[d12i+4(k) - d12i+s(k)].
Note that a) follows from Exercise 12 of Section 3.7.
c) Observe that ii) can be used to show that
q + q
/ 1 — n \
e2(q)e2(q3) = 4 2 (f-^V = 4 2 -^
/i odd
q2n + q~2n
Hence
Fin.x 02(/3)02(/33)
y
£0l + L4n+2 4V5
where /3 := (V5 - 1) /2 and Fn and Ln are the Fibonacci and Lucas
numbers, as in Section 3.7.
6. Use Ramanujan's modular identity of order 3 (Section 5.2) to deduce
that
288 Some Additional Applications
Hence deduce that
V4E?_20?(j8)0?(i83)
«=o L12n+6 3 6v5
where /3 := (3 - V5)/2. Note that | E£.2 0?(?)0?(93) can be written
more compactly as dl(q)dl(q3) + dl(q2)dl(q6). Find a closed form for
the Lambert series above when q'-= e~v/ 3.
7. Use formula (9.1.16) to establish that every positive integer is the sum
of three triangular numbers and that every number of the form 8 k + 3 is
the sum of three odd squares.
9.2 (CHEMICAL) LATTICE SUMS
Sums of the form
(—■\y+i+k
(9.2.1) 63(2,):= 2 ,.2./2, ,2y
arise naturally in chemistry. (Here the prime indicates that we avoid
summing i = j= k- 0.) Indeed, b3(l) can be considered as the potential or
Coulomb sum at the origin of a cubic lattice with alternating unit charges at
all nonzero lattice points. This may be considered as an idealization of a
rocksalt crystal. The quantity 63(1) is called Madelung's constant for NaCl.
Different crystals give rise to different lattice sums. We will also consider its
two-dimensional (laminar) analogue
(9.2.2) b2(2s):= 2 tVAt;
u-» 0 + 7 )
and its four-dimensional form
&4(2*)= 2' 7^-7-4^
i+j + k + l
There are some nontrivial considerations about the sense in which these
sums converge. (See Exercises 1 and 2.) We assume all sums denote limits of
rectangular summations. These will converge for re(s) > 0. The general form
of these rectangular sums is
N n
lim sn := 2 2 (-l)Xm' a(mx, m2,. . . , mN)
9.2 (Chemical) Lattice Sums
289
which, for real-valued a, can be shown to converge by an alternating series
test. (See Exercise 2.) The convergence in bN{\) is 0(n~1/2) so that 10"
terms are needed for 0(n) digits. Obviously direct computation is virtually
impossible.
Fortunately, some beautiful analytic reductions are possible. In this
section we illustrate this for b2 and b4. The idea is to observe that, for
re(s)>0.
(9.2.3) r(s)bN(2s) = Mt(e?-l) q-e"
where AT is the Mellin transform of Section 3.6. Thus
co \ 2 ■
2 (-i)Vj -i
n = —co /
T(s)b2(2s) = Ms(6t-l) = Ms
Now (9.1.13) shows that
CO
b2(2s) = T-\s)4 2 (-l)"+m_1M,tem(2,,~1)]
n,m=*l
= 4 2 (-l)n~1+m[m(2«-l)]-
Thus
(9.2.4)
b2(2s) = 4 2
(-1)-
(-1)"
„f o (2n + 1)' ~x ms
-4/3(s)a(s)
and we have factored b2(2s) into a product of Dirichlet L functions (the
alternating £ function a and /3 := L_4) defined by
^):=1^ = (1-2-
')£(*)
In particular, b2{2) = - ir log 2. Equation (9.2.4) is originally due to
Lorenz. Now any reasonable method of computing 0:(5) and /3(5) will
compute b2(l). Such is possible by various integral or summation
techniques. Correspondingly, from Theorem 3.2,
b4(2s) = T-\s)8Ms
; "*(-<?)m
Ui-(-4)"
-2
4mg4i
1-9
290
Some Additional Applications
and expansion gives
CO co -,
8 2 (-1)-^,(^)-8 2 4mMs(q4mk) .
m,k-l m,k-l
Now we compute these transforms and have
b<(2s) = 8 1 ^y1]%8(l-4'-)i ^
m,*-i (mk) m^=1 (mfc)
on adding and subtracting m(mk)~s in each summation. Thus
CO CO
But E"_0 (2n + 1)~* = (1 - 2~*K(*) and so
(9.2.5) b4(2s) = -8(1 - 22~')(1 - 21^)^)^ - 1) = -8o(*)o(* - 1).
In particular,
64(2) = -41og2=^62(2).
TV
Use of the functional equation for £, equation (3.6.6), allows one to
establish other formulae. In general any identity for any even-dimensional
power of theta functions will convert into a factorization for a matching
lattice sum. (See Exercise 4.) In Exercises 6 through 10 we show how
two-dimensional lattice sums may be used to evaluate elliptic invariants.
Comments and Exercises
An excellent and extensive recent survey of lattice sums can be found in
Glasser and Zucker [80]. This also discusses their chemical origin at some
length. Madelung's constant is analyzed from a mathematical perspective in
Borwein, Borwein, and Taylor [85]. There is ambiguity in the literature as
to whether the constants are positive or negative. We have chosen the sign
as convenient.
1. (Analyticity of lattice sums)
a) Considered as a limit of rectangular sums, show that b2(s) and
b3(s) exist and are analytic for re(s) >0.
Hint: Use the Mellin transform.
b) Show that
62(2S)=i(-l)"^ re(s)>\.
n = l " J
b4(2s) = r~\s)
9.2 (Chemical) Lattice Sums 291
Hint: g2(2s): = E"=1 (-l)nr2(n)nrS converges and is analytic for
re(s) > 5. For re(s) > 1, g2(2s) and b2(2s) coincide,
c) The analogous sum g3(2s) '■= E^=1 (-l)nr3(n)n~s diverges for
(Alternating series test) A mapping a: NN-> U is (N-)monotone if, for
m1, m2,. . . , mw>0,
N
2 2 (-l)*s'a(m1 + s1,m2 + s2,. . . , mN + jn)>0.
i = l i, = 0, 1
Thus 1-monotonicity is just a(m) s: d(m + 1), and 2-monotonicity is
a(m, n) + a(m + 1, n + 1) ^ a(m, n + 1) + a(m + 1, n), while 3-mono-
tonicity demands that the alternating sum over any unit cube be
positive if the bottom corner is. We say that a is fully monotone if a
and all its restrictions are monotone. Consider
N 00
2 2 (-1)^0(1^,1112,...,1^).
i-l m,=0
a) Prove the following alternating series test given in D. Borwein and
J. Borwein [86]. If a is fully monotone and lim^^^ a(m) = 0, then
the rectangular sums converge alternatingly. Hint: (In two
dimensions) show that sn := S"y=0 (-l)l+Ja(i, j) satisfies
V S2n— S2n+2
») 52„+1>52„_x
iii) 52„-52„_1-*0
Draw a picture. (The case of b3 and b2 is spelt out in Borwein,
Borwein, and Taylor [85]. An analogous bounded convergence
test due to Hardy can be found in Bromwich [26].)
b) Suppose that a is N times continuously differentiable on (IR4^.
Show that a is totally monotone if the partial derivatives satisfy
a, s 0, a,, s= 0, a, Ms0 (-1)¾ , , 2= 0
for all partial derivatives with ix < i2 < i3 < • • • < iN.
c) Verify that E„ (-l)'+y+*(i2 + /2 + A:2)"" and E„ (-l)'+y(2i + /)_"
converge for p > 0.
Prove that 1^)6^) = Ms(0< - 1).
a) Show that
00 1
H'7Tr^=^)^) re(5)>l
and -(»+»«)
(-1)"
(n2 + m2X
S't-TT^ =2-'62(2s) re(5)>l.
292 Some Additional Applications
b) Show that
b8(2s) = -16 £(s)a(s - 3) re (s) > 1
is equivalent to the formula (3.2.25)
(9.2.6) gfa) = l + 16S \1)nnq ■
c) Use 020304 = 0X+, equation (3.2.4), to show that
(9.2.7) 2 2 ("ir 2 =2* + 1j3(2^-l).
— [m + n +(p+ f) ]
5. The hexagonal sum is
^(2,): = 2' + g;2'mj. ._.,., re(5)>0
-» [(« + m/2) + 3(m/2) J
where <?(«, m) := f {sin [(n + 1)0] sin [(m + 1)0] - sin («0) sin [(m -
1)0]} and 0: = 2u73. This corresponds to calculating the Coulomb
potential on a regular hexagonal lattice. (See Borwein, Borwein, and
Taylor [85].)
a) For re(s) > 1, show using the cubic modular equation that
h2(2s) =
1-3
l-s
2Zi , 2 . „ 2\s ^-1
(-1)"
(n2 + 3m2Y (n2 + 3m2yj
2
b) A formula of Cauchy (Dickson [71, vol. 3, p. 20]) gives
1 + qn
e4(q)e4(g3) = i + 22 (-l)V
n = l
L 1 + q3n
Use this and a) to deduce that
^(25) = 3(1-3^)^)^3(5)
where
L__3(s) := 1 - 2~' + 4~s - 5~* + 7"* - 8~s + ■ ■ ■ .
c) This provides an analytic continuation of h2(2s). Thus
3(V3-l)(V2 + l)o(i)L_3(i) = A2(l)
is Madelung's constant for the hexagonal lattice. Show that
h2(2) = V3ir log 3 .
9.2 (Chemical) Lattice Sums
293
6. (Evaluating invariants) The factorization of two-dimensional zeta
sums into sums of products of L series can be carried a great deal
further. This is described in Glasser and Zucker [80] and in Zucker and
Robertson [76a,b]. By combining number theoretic and transform
techniques, one can explicitly factor all sums whose discriminants are
disjoint (have one form per genus) and a few others. This leads to
formulae such as
(9.2.8) T (m2 + Pn2)'s = 2W 2 L±ixLXAPI^
and
(9.2.9) T(m2 + 2Pn2ys = 2l-'2 L^L,^ .
n\P
In these two formulae P is an odd square-free number [congruent to
1 modulo 4 in (9.2.8)] with t distinct factors. The right hand sums over
all divisors of P and one has
CO
L±d(s):=2 (±d\n)n-s
n = \
which, are primitive L series modulo d. (These can only exist for d = P,
4P, or 8P, and the sign configuration is in fact uniquely specified.) Here
(k\n) is the Kronecker (generalized Legendre) symbol. L±^ is taken for
/i. = ±1 (mod4). For example, with P'~29, (9.2.8) becomes
Zi (m2 + 29n2)~s = L1L_n6 + L_4L29.
Dickson [29] gives an extensive list of disjoint discriminants. In
particular, there are 18 numbers less than 10,000 to which (9.2.8)
applies and 15 numbers less than 10,000 to which (9.2.9) applies.
Indeed, (9.2.8) holds for type one P: = 5, 13, 21, 33, 37, 57, 85, 93,
105, 133, 165, 177, 253, 273, 345, 357, 385, 1365 and (9.2.9) holds for
type two P:= 1, 3, 5, 11, 15, 21, 29, 35, 39, 51, 65, 95, 105, 165, 231.
There are only finitely many disjoint discriminants. We shall call such
P solvable.
Implicit in (9.2.8) and (9.2.9) are corresponding theta series
identities, and formulae for representations as weighted sums of squares.
a) Show that if q is replaced by -q in (9.2.8) and (9.2.9), we
produce formulae for
i) 1]'(-l)m+n(m2 + Pn2)s
ii) l]'(-l)m(m2 + 2Pn2ys .
Some Additional Applications
Show that the effect of replacing q by -q in (9.2.8), (9.2.9), or
similar formulae is to replace L±d by -[1 - {2\d)2x~s]L±d, unless
the L function was multiplied by some factor involving 2~s, or d is
even, in which cases it is unchanged.
Recall that (3.2.12) and Exercise 4d) of Section 3.2 gave
4ir
2' (-ir+n(m2 + m2)"1 -- ~ log /(V=7)
and
4ir
2' (-l)m(m2 + m2)"1 = - ^ log /x(V=r).
Use these formulae with a) and b) to establish that for the
appropriate P,
i) 2'-1^log/4(V=P) = Li4P(l)log2
+ S[l-(2|/*)]L±M(l)L*4/,/M(l)
Mi'
m#i
ii) 2'"1 ^= log /J(V=2P) = L_8P(l)log2
+ 2 [l-(2U)]L±M(l)L¥8/,/M(l)
The classical Dirichlet class number formulae (Landau [58]) allow
us to write L±d(l) algebraically. One has, for d>0 restricted so
that L±d is primitive,
i) L+d(l) = 2^1oge(<0
... _ ,., 2w h(-d)
u) L-(1)=V5^)"
Here h(d) is the number of (broadly) equivalent primitive classes
of reduced forms with discriminant d — b— 4ac or ideals in
Q(VD) where d is D or AD, depending on whether D = 1 (mod 4)
or D = 2,3 (mod 4); e(d) is the fundamental unit in Q(VD), which
may be computed from the fundamental solution of the
appropriate Pell's equation (see LeVeque [77] and Hua [82]); and w(d) is
a factor which counts the number of automorphs of the form, and
is 2 except that w(3) = 6 and w(4) = 4. From formula (4.12) in
Zucker and Robertson [76a] we have Dirichlet's formulae, for
d>4,
9.2 (Chemical) Lattice Sums
295
-1 d_1
h(-d)=-j- 2 n(-rf|«).
Also for d > 0
A(rf) log e(d) = - ^ t (d | w) log (sin ( ^)) .
Moreover, for disjoint forms, one can observe that the class
number h{—d) must coincide with the number of genera g. The
number of genera is as follows. If d is odd, then g = 2m~x, where d
has m distinct prime factors. If d is even and d/4 has m distinct
prime factors (including 2), then g = 1m when d/4 = 0,1,5 (mod 8)
and g — 2m-1 otherwise.
An excellent brief survey of history and of recent advances
regarding the class number can be found in Goldfeld [85]. Now
observe that, for the appropriate P, both \f and \f\ will be
products of powers of fundamental units from some of the divisors
of 4P or 8P. [For our solvable P, /i(-4P) = 2' when P = 1 (mod 4)
and /i(-8P) = 2'.] In particular, verify that for P:=5, 13, or 37,
since t — 1 in each case
4 - lf*Kf-5\-, „(p\KP)
G>^/t(V-P) = e(P)
Since h(P) = 1 in each case, we see that
Similarly, we may now verify the values of G6P, P-=21, 33,57, 93,
given in Exercise 9 and 10 of Section 4.7.
Establish that for P: = 5 or 29,
and for P:= 3 or 11,
g\P = e(P),
gT = s(2).
(Compare Table 5.2.)
For all the type one numbers P listed above, we can now
explicitly give Gp. Similarly for the type two numbers we can give
g2P. This accounts for most of the square-free and nonprime
invariants given by Weber or Ramanujan.
Show that
82i3o
=mm
Some Additional Applications
(Evaluating singular values) We can proceed further to evaluate
A: := A*(2P) for P of type two. We know that yfc/4 =
f1(V=2P)f;8(VI:8P). Hence
2plog('l) = 2'(-inm2 + 2Pnri
-4S'(-l)m(m2 + 8Pn2)-1.
We already know [Exercise 6c)] the first sum on the right. From some
elementary, but skillful, theta transformations we may deduce that
2' 1' (-l)'n(m-; + 8Pn2ys = 1 {[21^ - 1 + 2-'(2|/i)]L±/1L¥8l./M
On setting 5:=1 and substituting above, we derive
(9-2.10) - -^= logfc-22"' 1 L±V(1)L^(1),
a beautiful simplification.
a) Use (9.2.10) to compute A*(2P) for P:= 3, 5, 11, 29.
b) Use (9.2.10) to compute A*(210) given in (4.6.12).
c) Observe, as Zucker did, that there are computable in this form
two larger singular values: those for 2P : = 330 and 462. Verify that
A*(330) = (2 - V3)3(V2 - 1)2(V33 - 4V2)2(Vl0 - 3)2
x (3V5 - 2VH)2(4 - vl5)(V55 - 3V6)(10 - 3VTI)
(9.2.11)
and that
A*(462) = (V3 - V2)4(2 - V3)2(2V2 - V7)2(8 - 3V7)2
x (3VIT - 7V2)2(V22 - V2l)(10 - 3VTT)(76 - 5V231).
(9.2.12)
(Evaluation of K in terms of T) Selberg and Chowla [67] showed for
all rational numbers r that K(\*(r)) is expressible in closed form using
a finite number of T values. This relied on Kronecker's remarkable
'Grenz-Formel,' which has
lim (2' (m2 + rn2Ys - (5_^)v?j = ^ Py - log (4/-) - 4 log v]
(9.2.13)
9.2 (Chemical) Lattice Sums 297
as a special case. Here 17 : = r](Vzzr) is the eta function of (3.2.9) and
(3.2.11), and y is Euler's constant. Using (9.2.13), Zucker [77] applies
the factorization results described above to explicitly compute K
corresponding to solvable sums in terms of ir, surds, and Y values. This
leads to the following table of evaluations (Table 9.1). Elsewhere
Zucker has actually given £(A*(210)) [which involves T(n/840) for
(n, 840) = 1]. The general formula valid for either r-=P(Pof type
one) or r- = 2P (P of type two) is given by
and
4iog,(v-7) = ^S(-4H«)r(^
-log (877/-)- 2 L±Ai(l)L^(l).
(9.2.14) **< Ar) #
There is a corresponding formula when in Zucker's terms 5(1,1,
(1 + r) /4) is solvable.
a) Verify the contents of Table 9.1, for r = 1,2,. . ., 6.
b) Show that T(m/24) is Oop(log n) computable for integral m.
Hint: Express such T values in terms of K at singular values. It
would be interesting to know if this is possible more generally.
c) Compute £((3 - V7) /4V2). (Compare Exercise 9 of Section 5.2.)
d) Show in general that E(k*(rj) is computable in terms of V values
and algebraic quantities for rational r.
{Conjugate divisors and evaluations of k) Consider P as above and
suppose P-=d1d2 and Q-=d1/d2 for divisors dx and d2. Using
Kronecker's genus character sum formulae one can show that GP and
GQ = GQ-\ must have the same general form for P of type one.
Similarly, g2P and g2Q = g2Q-\ are paired, as are A*(2P) and X*(2Q).
This is best illustrated with examples.
a) Show that
g2190 = (V5 + 2)(VT0 + 3) g23g/5 = (V5 - 2)(VT0 + 3).
The first value may be computed as above. The second is easily
verified from Schlafli's form of the quintic modular equation.
TABLE 9.1. Evaluation of K at the First Sixteen Singular Values
K
[T(i)l
i\i2
9
10
11
12
13
14
15
16
4tt1/2
(V2 + i)1/2r(|)r(|)
013/4 1/2
2 TT
31M[r(j)]3
27/V
(V5 + i)[r(i)]2
07/2 1/2
2 TT
(V5+2)1M[
1Mrr(*)r(*M*M*2
1607T
- (V2 - i)(V3 + V2)(2 + V3)r( h )r( A )r( £ )r( a)
384-tt
iW(f)rO)
(71/4)41
2V2+(1 + 5V2)1/2
4V2
1/2(V2 + i)1/2r(i)r(i)
12tt1/2 L U/J
•(2 + 3V2 + Vs)r(A)r(j,)r(|,)r(^)r(a)r(^)r(a)r(a)
2560tt3
31M(2 + V3)'
[2 + (17 + 3V33)1'3 + (17 - 3V33)"3]2 IlAHp|0HAl
(V2 + 1)(V3 + V2)(2 - V3)1/231/4[r(|)]3
^13/3
(18 + 5V13)1'4
Mrr(A)r(^)r(^)r(M)r(i)r(^)r(ipr(i)r(i)r(M)r(i)r(l)i1/2
L 6656-tt5
[(10 + 6V2)1'2 + (2 + 2V2)1'2 + (3 + V2)1'2]1'2
[r(^)r(^)r(^)r(^)r(i)r(i)r(i)r(j)r(i)r(g)r(i)r(g)]1/4
X 16ttV7
(V5 + i)iU)r( 6)r(A)r(6)i"
240-77
(21/4 + i)2[r(i)]2
298
9.2 (Chemical) Lattice Sums 299
b) Correspondingly
_6 _/V5 + l\3/V3 + l\3/V7 + V3\VV7 + V5
G™-\~2~) \~Vr) \~~2 ) \~^7T~
_6 _/V5-l\3/V3 + l\3/V7+y3\VV7-V5
G^~{~2~) l~V2~/ V 2 ) \ V2
6 _/V5 + l\3/V3-l\3/V7 + V3\3/V7-V5
G21/5 V2/VV2M 2 M.V2
_6 _/V5-l\3/V3-l\3/V7 + V3\3/V7 + V5\
G15/7-^-y-j [-^-) { 2 / I V2 /•
One may verify G35/3 and G21/5 from the corresponding cubic and
quintic equations, and so on.
c) Use the techniques of Section 5.3 to compute 0-(105).
d) Observe that
i) A*(6) = (2 - V3)(V3 - V2)
A*(§)= A*'(|)= (2 - V3)(V3 + V2)
ii) A*(10) = (VT0-3)(V2-1)2
A*(|) = A*'(|) = (VI6- 3)(V2 + 1)2
iii) A*(58) = (13V58-99)(V2-1)6
A*(i)-A*'(f ) = (13V58-99)(V2 + 1)6.
e) Indeed, for all P of type two, A*(2Q) and A*(2P) will be
"conjugate", as we illustrate for A*(210). Let
^••-(V^-i)2
«4:=(4-Vl5)2
m7:=6-V35
M2:=(V7-V6)2
«5:=2-V3
«8: = 8-3V7.
«3:=(VlO-3)2
M6:=Vl5-Vl4
Then A*(210) = Ilm=1 um, and each A*(2Q) is a corresponding
product U8m=1 ue™(Q) where each em is ±1. In compact form one
has em{Q) := 1 -2bm{Q), where bm{Q) is the mth binary digit of
6(2(2) defined by
6(f) = 142 6(f) = 201 6(f) = 45
KH) = 71 &(g) = 163 6(1) = 228
6(^) = 106 6(210) = 0.
These correspond to all the reduced forms with discriminant 840.
300 Some Additional Applications
f) Numerically, find the "conjugate" values for P".= 165 and
P : = 231. [This involves computing 28 products and comparing
them with the theta expansion of k(2Q).]
With these conjugate values and the formulae of Section 5.3 one may
observe that a(2P) is now available in closed form for all P of type two (and
for conjugate divisors Q). Again we illustrate with an exercise.
10. a) Generate a recursive version of formula (5.3.3) and specialize this
expression to produce a formula for a(2d1d2) in terms of Rd ,Rd,
Md , M2 and the appropriate singular values.
b) Apply this formula with dx '■= 35 and d2 '■= 3 to compute a(210).
Observe that this uses only values of A* given in Exercise 9e).
c) Compute a(42). Note: g\2 = (2V2 + V7)'[(V7 + V3)/2]3 is
incorrectly, given in Weber [08].
11. Let {an} and {bn} be given sequences,
a) Show that
if and only if
b) Show that
if and only if
£(s) S ann s = S bnn
2 by.
" 1 - r"
a(s) Sa„" s= S bnn
S bnx".
c) Show that, with the notation of Exercise 12 of Section 3.7,
CO
i) £ <rk(n)n-'= £(s - k)£(s)
and
n) S,W« =^
9.3 Odd-Dimensional Sums and Benson's Formula
301
Also
in
j) f, *<">""'-^r
where ¢ is Euler's totient function, which counts the numbers less
than n and relatively prime to n. Thus ¢(1):=1, ¢(5) = 4, and
¢(6)=2.
9.3 ODD-DIMENSIONAL SUMS AND BENSON'S FORMULA
While even-dimensional sums usually factor, only a few odd-dimensional
ones factor. The reader can, however, produce identities like (9.2.7) from
Jacobi's identity (3.1.15) or from (3.2.8). (See Exercise 1.) There are
nonetheless many theta-based techniques of which we establish one based
on the theta transform.
Theorem 9.1 {Benson (1956))
-h(l) = E ,.2-, -2^,2,112 = 12T- 2 Sech2
-» (i + ] + k )
(9.3.1)
m,n=\
odd
- (m + n )
Proof. By symmetry,
(-2/- -ly' + fc
03(1)-3 2( ..2 , .2 , ,2
,-.2 , .2 , /2-,3/2
0 +7 +* )
and
r(| 63(1) = 3 S (-i)Vm3/2 S (-i)/+V2+y2
where # : = e '. Thus
r( 2 )63(1) = 3M3/2
2 (-dv^jco
The theta transform (2.3.2) leads to
-r^)&3(i) = 3Af3/2
»2 w „?/ it
2,(-1) n q — 02y t
302 Some Additional Applications
and since T( §) = Vtt/2,
00/- 00 /• CO -i
-63(i) = i2V¥E (-i)n+V E [g-^'^^^jr1'2* •
n=l <- y,*:=-o<.JO J
odd
The internal integral I(n, j, k) was evaluated in Exercise 4 of Section 2.2.
We have
and
-63(1) = 48tt E E (_i)»+i„e^nV(2^1)2+(2A:+1)2.
y',Jt=0 n=l
Finally, for a > 0,
4 2(-1^--^^--^(1)
and Benson's formula follows. D
The convergence acceleration is astounding. Summing for 0sj, A: < 3
produces b3(l) '■= -1.74756459 ..., which is correct to eight places. Even
the first approximation, by -12ir sech (ir/V2), gives -1.73 ....
For s¥= \ this manipulation leads to an integral involving Bessel functions
of the second kind, and the closed form is lost.' There is, however, an
extension to N dimensions (among others). One can show that
(9.3.2) -»^-2)-^^2i«h"(f^j).
odd
Thus
(9.3.3) -64(2) = 167r2 S sech2( ^[? + j2 + k2) = 4 log 2
«,M=1 ^
odd
and
00
(9.3.4) -62(0) = 2ir E sech2
n = 0
which coincides with Exercise 7eii) of Section 3.7. (See Exercise 2.) For the
final evaluation it helps to know that for primitive L series
(2n + 1)
9.3 Odd-Dimensional Sums and Benson's Formula 303
V*
L_t(0)= — L_t(l)
as follows from the functional equations for L±k:
L_k(s) = C(s)cos[™)L_k(l-s)
(SIT \
YJL+k(l-s)
where C(s) := 2V-1Jfc~,+1/2r(l - 5).
Comments and Exercises
Our derivation of Benson's formula can be found in Glasser and Zucker
[80]. In that paper, and references therein, one finds much further
discussion of odd-dimensional sums. They also illuminate the relationship between
the multidimensional zeta functions of Epstein and lattice sums.
1.
a)
b)
Show that
00
2S(-l)m
— CO
\2m--J + 2n+2p2
— s
= 2s+1L_8(2s-l)
where L_8(s) :=1 + 3~s-5~s-7~s + • • • .
Show that
CO
g(2s):=2(~l)m+n+p
— CO
= 12s/3
(2s - 1)
[(-^K)2+K)1
and g(l) = Vl2/3(0) = VS.
c) Derive similar identities from Exercise 5 of Section 4.7.
2. Establish the generalization of Benson's formula (9.3.2) and its special
cases (9.3.3) and (9.3.4). Exercise 4b) of Section 9.2 shows that
68(6) = -8f(3).
The remaining exercises examine the n-dimensional Hurwitz zeta
function. Let d>0 and define
(9.3.5) L,.(s,,):.(So)|^L_
where 5,- := sign (a,) and a :— (ax, a2, ■ ■ ■ , aN). Thus
304 Some Additional Applications
».«-i (n + m)
and
t ' ,<u- Y (-1)"
3. a) Show that
"'-'-'(-log*)'-1
^"•^['W-*
//inf: Take a Mellin transform. Then make a logarithmic variable
change and sum the resultant series.
b) Let Ls(s, d)~: AN(s,d) and L_-(s, d)=: A_N(s, d), where e is
the vector (1,1,. . ., 1) in UN. Show that for d > 1
(9.3.6) ^(MH^t^-lM^-^-l)
~A±(N_iy(s- l,d-l)].
Hint: Use integration by parts.
c) Combine integration by partial fractions and the recursion (9.3.6)
to show that every sum of the form (9.3.5) factors into a linear
combination of one-dimensional Hurwitz zeta functions (with
coefficients depending on 5).
4. Let AN(s) := A_N(s, N) and PN(s) := AN(s, N).
a) Show that
1
and
Pn(s) = J^l Pn-x(s -I)" PN-x(s) •
b) Deduce that, for appropriate s,
0.2,1¾-^-««■-«.
ii} 1,(^7 = 2^)-^ = 2^)
) 2
in
Thus
9.3 Odd-Dimensional Sums and Benson's Formula
1
305
„=1 (m + ny
IV
■) r
(-1)"
(\m\ + \n\):
= as -i)- «*).
= 4a(s -1)
o r
= 4^-1).
(H + wr
c) Show that
(#-1)1^)=2^-1 + /0
n = l
where a„ are Stirling numbers of the first kind. Thus
p4(S)=u(s-i)- as -2)+%as-i)~ as)
and
P5(s) = &&-*)- &&-$ + %C(s-2)- $C(s-l) + fa) .
There is a similar formula for ^4^(^).
5. a) Show that
S'(H+H+i^r=2^+4^-2)-
b) Show that
(-1)"
1-2
2 y«(5)+2 /3(5).
n,m=o (2n + m + l)s
c) Show that
°° (—■\\n+m 1
JL fe + < = 3[{2 + 3""><'> - «<' - *> - L~^ Dl
and so L_1>_3(1, 4) =
6. Show that
V ("■*■)
7 7T 1
3L3l0g2 3V3 2.
+ s
n,^-i (« + 2m + 4p)s ~0 (8n + 7)
Hint: (1 + jc)(1 + x2) ■ ■ ■ (1 + j2""1) = (1 - x2")/(l - x)
=8-sas).
306 Some Additional Applications
9.4 THE QUINTUPLE-PRODUCT IDENTITY
Jacobi's triple-product identity has an elegant fivefold analogue due to
Watson [29] and Gordon [61]. This is
CO
(9.4.1) II (1 ~ qn)(l - *?")(! - z"V-1)(l - z2q2n-x)(l - z-2q2n-x)
CO
= 2 (z3m - z-3m-1)gm(3m+1)/2
m = —CO
valid for all complex z and # with \q\ < 1 and z^O. The proof is left as an
exercise. (See Exercise 1.)
If we divide both sides by 1 — z~x and let z tend to 1, we derive
ri(l-g")3(l-g2,,-1)2= E (6m + l)gm(3m+1)/2 .
Now this yields
(?.4.Z) V4{q)— oo m(3m + l)/2
Em=_„(-1) <?
on using Euler's pentagonal formula (3.1.10) and (3.1.7). This can be
used to establish a recurrence formula for r2(n), as in Ewell [82]. (See
Exercise 4.)
Comments and Exercises
The identity, implicit in Ramanujan's work, was discovered by Watson [29]
and rediscovered by Gordon [61]. Further extensions are discussed by
Gordon. Other proofs abound in the literature.
1. Prove (9.4.1). As with the triple-product identity, it is easy to establish
the formula (9.4.1) up to a constant relying on q alone. To evaluate the
constant observe that when z:= -1, (9.4.1) reduces to Euler's
pentagonal identity.
2. Show that (9.4.1) is equivalent to
CO
II (1 - q2")(l - q2"^z)(l - q2n-lz-l){l - q4n-4z2)(l - g4"~4z-2)
= S q3n2~2n[(z3n + z~3n)- (z3n~2 + z-(3n~2))].
9.4 The Quintuple-Product Identity 307
a) Let q'=q' and z := q~k in (9.4.1) and deduce that
(9.4.3) EI (1-<?")= 2 qi^"^-*'"'- qV + W]
neN(j.k) m=-oo
where N(j, k) consists of all integers congruent to 0, ±k, j, j ± k,
j±2k (mod 2/) (repeated as appropriate).
b) Deduce that Euler's identity follows for /:=4 and k ■ = 1. For /:=3
and k : — 1 one gets
CO
II (1 - qn)(l - q6n-5)(l - q6"-1) = g(q) - 3ffi(g9)
n-l
where g(q) : = E:=0 g<"2+">'2. With /(¢) := 11^ (1 - q'y1 this
becomes a formula due to Ramanujan
c) Use g(q) = f(q)/f\q2) to obtain a functional equation for /(¢)
(the partition function).
d) Use (9.4.3) with / := 6 and k := 1 to obtain
2f(q2)f(q3)f(q12) „„„„,_9
/(<7)/(<74)/V)
= 3¾^9) -«,(<,)
Hence obtain a functional equation for 03. Thus note that
3#j(# )- #j(#) never has a solution. (Compare Section 4.7.)
a) Establish equation (9.4.2) and
E;=_0O(6m + l)9(6"1+1)2
el(q24)
£;=_(-iyY6m+1)2
b) Use (9.4.2) to derive a recursion for r2(n).
c) Prove that
CO CO
II (1 - q2")(l - q2n~X)2{l - ?4")2 = E (3m + l)?3m2+2m .
(9.4.4)
//i/tf: Use Exercise 2.
d) Show that
04(<?)_ E:,-.(3/i + l)g(3"+2)"
^(¢) £:„_.„ (-1)-(3/1 + l)g(3"+2)"
308 Some Additional Applications
so that
Observe that this is slightly faster to compute than the original theta
series ratio,
e) Show that
^_M_, 1/4£:=-„(3k + 1)<7(3"+2)"
Vk 63(q) lq E;__„(6,i+ 1)^-
so that
E:=_„(6« + l)^6"+1)2/12'
5. a) Show that
CO
2
/ly+i + k
/,/;*— [24i + 24/ + (6k + 1)T
1 1
= 1
2s-l /£, , r\2s-l
(6/n + ir-1 (6m+ 5)
= (1+21_2*)L_3(2j-1).
b) Replace g by -g in a) to express L_24(2,j - 1) as a lattice sum.
c) Combine (9.4.4) and (3.2.7) to prove that
L.3(2s - 1) = X I£t. ,^2, i,1?
i+j+k
[6(i+ \f + 6(; + |)2 +9(/c+i)2]s •
Observe that at 5 = | this equals 3.
Gordon [61] also gives various congruences, like those given in Chapter 3
for the partition function. For example,
/(<7)/(<74) nJA
has c3n+2 divisible by 3.
6. There is yet another remarkable identity due to Ramanujan, which
includes both the triple-product and the q-binomial theorems. This is
the 1M*1 sum, whose derivation and uses are accessibly described in
Askey [80]. In standard notation one writes
9.5 Quintic and Septic Multipliers and Iterations
309
(9 4 5) y (fl> ?)» x" = (ax'> q)~{qiax; g)~(<?; q)~(b/a; g)„
-„ (b; q)n (x; q)m(b/ax; q)„(b; q)„(q/a; q)„
where (a; q)„ := II (1- aqk) and (a; q)n : = ^-.
t=o 1«? . q)oo
This converges at least for \q\ < 1 and \blq\ < \x\ < 1.
a) Verify that the triple product is contained in (9.4.5).
Hint: Begin by setting a '■= c"1, x •'= ex and b ■ = 0. Now let c := 0.
b) For b'-= q (9.4.5) becomes the q-binominal theorem:
V ia' tin xn= (ax; <?)„
«=o (<?; q)n 0; <?)»
valid for |jc| < 1 and \q\ < 1. Verify that Cauchy's binomial theorem
is a special case. Hint: Begin by setting a '■= q~2".
c) Use the q- binomial theorem to express the limit mean of Example
8.4 as a Taylor series.
9.5. QUINTIC AND SEPTIC MULTIPLIERS AND ITERATIONS
In this section we discuss additional modular and multiplier identities and
give a number of applications. Notations are as in Sections 4.6 and 4.7.
References to entries are all to Chapter 19 in Ramanujan's Second
Notebook (Berndt [Pr]).
Proposition 9.2
(a)
(9.5.1) 5M5 = 1 + 2G25„/G^
(9.5.2) 1/M5 = 1 + 2G„/G^„
(9.5.3) 5M5 + l/M5=2{2 + /c/ + /c7'}.
(b) Let 11 (It + 1): = M5. Then
(9.5.4) 1 - 2/2 = M\{\ - lit - t2)V(l + t2)M5
(9.5.5) 1 - 2k2 = (1 + t - t2)V(l + t2)M5 .
Proof.
(a) These are given in Entry 13. Berndt [Pr] provides proofs which may
also be deduced from (4.1.20).
(b) These may similarly be found in Entry 14. □
310 Some Additional Applications
We will write, as in Section 4.7,
Mp(n):=Mp(kn,kp2n).
Ramanujan also gives the following beautiful counterpart to Theorem 4.11
(Entry 12(iii)).
Theorem 9.2
03(q25)
(9.5.6) ^ = 1 + ^ + ^
where for i = 1 or 2
(9.5.7) ri-=\x{y±yJy2-Ax3)
and
(9.5.8) ^:=5-^^-1, >>:=(*-l)2+7.
This provides a solvable update for M5 despite the nonsolvable nature of Ws
(See Exercise 8 of Section 4.5). Indeed with mn : = 5M5(r52") we have
(9.5.9) mn+1 = (1 + r11'5 + r12'5f/mn
with x: = mn - 1, y and r; as above.
If we combine (9.5.9) with the following formula for e5 we obtain a
remarkably simple solvable 5th-order iteration for it.
Proposition 9.3
If r >0 and s(r) := M^\r) then
(9.5.10) a(25r) = s\r)a{r) - Vr[S (^~5 + Vs(r)(s\r) -2s(r) + 5)} •
Prao/.
We begin with e5, as given by (5.2.13), and the explicit formula for R5-
We use Proposition 9.2(a) to rewrite s(r)R5 and 9.2(b) for the terms
involving /2 and k2. This leads reasonably directly to
e5 =. S-&ll + ys(r)(s\r) - 2s(r) + 5).
Now (5.2.14) becomes (9.5.10). (Exercise 2.) Q
9.5 Quintic and Septic Multipliers and Iterations 311
The identical manipulations to those in Example 5.3 immediately yield
a(5)=i{V5-V2(V5-l)}.
We next list several similar identities for M1.
Proposition
(a)
(9.5.11)
(9.5.12)
(b) Let t := i
9.4
{kl)1/4
, / /' //' ( IV \213
.,,- k k' kk' Jkk'\2'3
. Then
(9.5.13) 1M7 - 1 /M7 = 6 - 16t + 12t2 - 8t3 .
Proof. These are to be found in Entry 19. □
From (9.5.13) and (4.6.7) we may establish that for r>0 and
s{r):=M~\r)
(9.5.14) a(49r) = s\r)a(r) - Vr{^~7 + s(r)(4t2 -4/ + 3)} ,
where tA = kl. (See Exercise 3.)
Ramanujan does not give a septic analogue to Theorem 9.2. He does,
however, give quintic and septic updates for the eta-multiplier. Let 17 be
given by (3.2.9) and (3.2.11). Let
(9.5.15) Np:=JlLti
v\q'P)
so that Np corresponds to Mp. This is the eta-multiplier of order p. Now
(3.2.15) can be written as
(9.5.16) kk'M3p = ll'N3p or Mp = Np{^)
where Wp(l2, k2) = 0.
Theorem 9.3
(a)
(9517) 6=M2(1-M2)
1 -^ ■U) ■ 2 4(2M2-1)
1/3
312
(9.5
.18)
(9.5.19)
(b)
(9.5
.20)
Some Additional Applications
3 M3(l - M\)
3 9M2 -1
3_M5(1-M5)2
5 (5M5-1)2 •
(49M2 - 1)JV7 - (8M7) JV2 + (8M2)iV7 + M7(M27 -1) = 0.
Proof.
(a) In each case one combines (9.5.16) with appropriate multiplier
equations. For p ■ = 5 use (9.5.1) and (9.5.2). For p ■= 3 use the identities
preceding equation (4.7.9). (The details are left as Exercise 6a).)
(b) We use (9.5.11) and (9.5.12) to write
//'M7"2(l + 8JV7) = kl' + Ik' - kk'
and
A*'M2(49 + 8/JV7) = kl' + Ik' - IV .
We now subtract one from the other and divide by IV. □
We finish the section by listing Ramanujan's updates for N5 and N7. Entry
12(i) can be recast as
(9.5.21) 5JV5(<?5) = (M1/5 + v115 - l)2/(5N5(q))
where fi and v are the solutions to
fiv = -l, ix + v = 11 + (5N5(q)f .
Entry 18(ii) becomes
(9.5.22) 7iV7(^7) := (fiin + vin + a>in - l)2/(7iV7(^))
where /jl, v and w are the roots of
x3 - ax - bx + 1 = 0
and a and b are given by
a:=57 + 14[7JV7(<7)]2 + [7JV7(<7)]4
b : = 289 + 126[7JV7(^)]2 + 19[7JV7(^)]4 + [7JV7(^)]6 .
9.5 Quintic and Septic Multipliers and Iterations
313
Theorem 9.2 gives 63(q2S) solvably in terms of 83(q5) and 63{q). Likewise
(9.5.22) gives 63(q49) solvably in terms of 03(<?7) and 63(q). Thus 64(q) =
03(-q) is similarly solvable and since k = 62AI0\ and ;' is solvable in k we see
that for /any of 04, k, g, G or J, f(qp) is solvable over <Q(/(<?), f(ql'")); for
p '■ = 5 and for p ■= 7. In view of the nonsolvability of the quintic or septic
modular equations for A (and hence k and /), this is at first surprising. What
is happening is that the Galois group for Fp, /?>5 and prime, is a
nonsolvable group of order (p - l)p(p + 1). However j(qllp) is a root of Fp
and, since Fp is irreducible, it is of order/? + 1. Thus the splitting field for Fp
over Qp(j(q), j(qVp)) has order dividingp(p - 1). For/? := 3, 5, 7, and 11,
(/? - 1) /2 is prime and the corresponding group is obviously solvable. For
/? : = 7, for example, we expect seventh roots of cube roots to comprise the
solution ((/?-1)/2 = 3). This is consistent with equations (9.5.22) and
(9.5.20).
Comments and Exercises
Knowing that a solution exists and exhibiting it, particularly in simple form,
can be very different matters. The components of the quintic and septic
algorithms for it are far less complicated than one might initially expect.
Both can be packaged very elegantly. (See Exercises 2 and 7, and Borwein
and Borwein [Pr].)
1. a) Combine (9.5.1) and (9.5.2) to obtain Schlafli's form of the quintic
modular equation.
b) Compute that
i) M5"1(1/5) = V5
ii) M5_1(l) = 5(V5-2)
iii) m;\3/5)=£^±
iv) M5_1(2/5) = V5(V5+2)(V2-1)2.
c) Find closed forms for M^l{n) for « := 1, 5, 9, and 25.
d) Use G85 = [(V5 + 1) /2][(9 + V85) /2]1/4 and the conjugate nature
of G17/5 to compute 5M5(-f). Use Theorem 5.4 to compute cr(85),
2. a) Verify the formula (9.5.10) for a(25r).
b) Obtain closed forms for a(n) « = 25, 125, 625, 225, and 1225
(Ramanujan [14] givesG1225).
c) Observe that Exercise lb) (9.5.9) and (9.5.10) combine to give
several explicit iterations for it. For example, we may begin with
r:=l, a(l)=i,*(l) = 5(V5-2).
d) Use Theorem 9.2 to compute G62Sn in terms of Gn and G25n.
314 Some Additional Applications
3. a) Show that (9.5.14) holds and that
,_s2(r) + 2s(r)-7
e7.=
+ 2s(r){VJk + VFF}.
b) Compute the corresponding updates for S(25r) and S(49r). In
particular
8(49) = M~\l){l + 2\fkl + 2VW} .
c) Verify a(J).
d) Establish that R5 and R7 are as given in Table 5.1.
4. Ramanujan also gives (Entry 19)
m- - lV
M
(k'i'y"-(kiy
\7i 1/12
1-41«
- kk'
1Mi=7T7Jn
a) Thus show
(«)1/4-(*:'/')
1M2_(GA9n\1 Gl-2V2G,9n
/Mt~\ Gn I 2V2G -Gl
b) Compute M7( f) and M7( f).
5. Ramanujan in his letters (Hardy [40] p. 353) gives the following
beautiful hybrid identity. Let
(C C \3/2
Q Q5") > P:=(GnG9nG25nG225n)in ■
Then
Ap+1p) = Q+i
Q
a) Verify that G\5 = Qf±^ and G\n = 8^f + *> .
6. a) Establish the eta-multiplier formulae of Theorem 9.3a) and b).
b) Show that Np(k'p,kp) = ^=.
9.5 Quintic and Septic Multipliers and Iterations 315
c) Show that
49M72 = §(/>(>>) + ^p\y) + 196/)
1 IM2 = \ (P(x) + ^P2(x) + 196a;3)
where x : = (kk'/W)113, y :={ll'lkk')113 and
P(x):=l + 8x-8x2-x3.
d) Prove that if x ■ = \/M13 and y : = yN[z
(1 - l?>x2)y3 + (4x)y2 - (4x2)y + x(l -x2) = 0.
Hint: Use (4.6.8).
7. Combine (9.5.20) and (9.5.22) to produce a solvable update for
mn '^IM^rl2"); and so a solvable 7th-order iteration for it.
8. Ramanujan's letters also contain the modular equations of degree 5 for
KllA and K1/6. (See Section 5.5.) For K1/6 one has
(Ik)213 + (I'k'f3 + 3(ll'kk')U3 = 1
and for K1/4 one has
Ik + l'k' + 8(ll'kk')V3{(lk)V3 + (I'k')1'3} = 1.
a) Verify that, in the notation of (5.5.34),
b) Similarly
Gr,?(5) = ^y
G,7.2(5)=5
Hint: The appropriate /?th-order modular transformation for Ks
sends / =: A*(n) to k : = \*(p2n).
Chapter Ten
Other Approaches to the
Elementary Functions
Abstract. We examine some of the standard polynomial and rational
approximations to elementary functions, particularly to exp and log. We discuss
methods for reducing the complexity of calculating these functions based on
accelerating the evaluation of the approximants. While these methods are
usually less than optimal, they are of more general application than those of
Chapters 6 and 7.
10.1 CLASSICAL APPROXIMATIONS
We commence with an analysis of the standard approximations to exp on a
disk Ds := {\z\ s 8}. The notations we will require are as follows. Let Pn
denote the algebraic polynomials of degree at most n with real coefficients.
Let \\f\\A denote the supremum norm of a continuous function/on the set
A, that is,
(10.1.1) H/IL:=sup|/(*)|.
For a continuous / on an infinite compact set A C C, let
(10.1.2) E„(f,A):= rmn\\ f-p\\A
PSPn
and let
(10.1.3) /?„(/, A) :=/?„(/)= min \\f-p/q\\A.
p,q<s.rn
These quantities are, respectively, the error in best uniform polynomial and
316
10.1 Classical Approximations
317
best uniform rational approximations. The existence of the best
approximants is fairly straightforward and is left as Exercise 2.
The most commonly used polynomial approximations to exp are
undoubtedly the partial sums of the Taylor series. This is reasonable since in any
neighbourhood of zero the partial sums are asymptotically optimal. (See
also Exercise 3.)
Theorem 10.1
(a)
d1 (n + 1)!
1 +
n + 1
ion
(b) KpePB, then
\W -/WIU* (^Ti)t(i-^t)
Proof. Part (a) follows from the estimate
1
(n+1)!
1 +
1 +
+
+
n+2-V ' « + 3 (n+ 3)(rc +4)
For part (b) we observe that if
Ik2-p(z)\\D <mmy-sn(z)\
where sn is the nth partial sum to exp at zero, then by Rouche's theorem,
p(z)-s„(z) and ez-sn{z)
have the same number of zeros (counting multiplicity) in Dx. As ez -
s„(z) has a zero of order n + 1 at zero, we deduce the contradiction that
p = sn. To finish the proof we need only observe that
min |e2 - sn(z)\ > , .
|z|=i (n + 1)!
1
n + 1
O
We now turn to the Pade approximants to exp. These are rational
approximations that are a natural extension of the Taylor approximants. We
define
(10.1.4)
j;t\t + z)me-'dt
rm,«W $Z(t-z)ntme-'dt
and observe that rmn is a rational function of z with numerator of degree m
and denominator of degree n. In closed form,
318
Other Approaches to the Elementary Functions
(10.1.5) rmJz)=2
m
yi
vl (-z)'
v=0(n + m\ v u=0//n + «x v-
Theorem 10.2
(a)
(b) Iip,qEPn,then
r (z)|| < 8(^)(^)
rn^)\\Dl- (2/,)!(2/i + l)! •
p(z)\\ ^ (n\)(nl)
qiz)^ 8(2n)!(2n + l)! '
- Q(_Z)U r
(10.1.6) q„(z) := (2n)! X = I 0 - *)"<"«-'
Proof. Let
dr
"=o /2«
i>!
Then
(10.1.7) <7B(z)[ex-rBpB(z)] = J"o (f-z)rg*-'A-Jo tn(t + zye-'
= jZo(t-z)"t"e*-'dt
= z2n+lJo (u - l)"uVl_")x rfu .
Now from (10.1.6), for \z\ < l,
"(2/i)! (2w-l)! (2n-2)!
df
(10.1.8) kB(z)|an!
n! (n-l)! 2!(n-2)!
=^4-5-2½
>(2«)!(2-Ve).
Since
r1 i
(10.1.9) Jo (1 - u)"u" du =■£(/! + 1, n + 1) = tt^J
(2n + l)!
10.1 Classical Approximations 319
we have from (10.1.7) and (10.1.8),
(10.1.10) Ik"-V.MM-^(2,)^ + 1).-
Part (b) requires showing that for |z| = 1,
n\n\
(10.1.11) \e* - rB,B(z)|>
8(2n)!(2/i + l)!
which follows from the estimates (Exercise 5)
4 (2n + l)!
(10.1.12)
and
(10.1.13) \q„(z)\*(2n)lVe.
The rest of the argument is analogous to part (b) of Theorem 10.1. If
there were a rational function/?Iq satisfying (b) with the inequality strictly
reversed, then by (10.1.11) and Rouche's theorem,
plq-rnn and e2 - rnn
would both have the same number of zeros, and from (10.1.7) we would
deduce that plq — rnn has at least 2« + 1 zeros and hence is identically
zero. □
The fact that rn n is the Pade approximant is a consequence of (10.1.7),
which shows that
e2-r„,„(z) = 0(z2"+1) as z^0.
In general the (m, ri) Pade approximant to an / (analytic at zero) is the
unique rational function r = plq, where p E Pm and qE Pn, which satisfies
f(z)-p(z)/q(z)=0(zh)
where h (in nondegenerate cases h = m + n + 1) is as large as possible. For
n = 0 this defines the «th Taylor polynomial.
The following is a partial list of the standard series and continued fraction
expansions for exp and log.
320 Other Approaches to the Elementary Functions
(10.1.14) e2=X ""
(10.1.15) e2 = l +
2z
2 - z + 2zz E"_! [1 /(z2 + (2tt«)2)]
(10.1.16) e2 = l+~-^--^-^-#-#-^-
v ' 1-2+ 3- 2+ 5- 2+ 7-
(10.1.17) Mll+f) =^
\n + l_n
log(l + z)=2^^
(10.1.18) " |z|=sl, Z5*-1
log(z) = 2
z-l\ 1 /z-l\3 1 /z-1n5
+ ^ —T +^ ^-T +
,z + l/ 3 Vz + 1/ 5 \z + l,
(10.1.19) re(z)>0,
, (z + l\ Jl 1 1
log =2-+ —^ + —f +
e\z-l/ \z 3z3 5z5
(10.1.20) 1*1^1, Z5^±l
\ _ z z _£_ ij?_ ^£ 2£ —
8 ( -1 _ IT 2+ 3+ 4+ 5+ 6+ 7+ " "
(10.1.21) z^(-oo,-l]
z'-l
(10.1.22) lim —— =logz.
a io o
Comments and Exercises
Pade approximants derive their name from H. Pade, a student of Hermite,
who was one of the first to systematically study such approximations at the
end of the last century. The theory of Pade approximation may be pursued
in Baker and Graves-Morris [81]. The convergence theory for Pade
approximants is far more complicated than the analogous well-known theory for
Taylor series. (See Exercise 9.) Except in special cases, such as exp or
functions given by Stieltjes transforms [J- da{t)l{x + t)], analysis of region
or rates of convergence is only partially understood. Theorem 10.2 can be
sharpened to show that
10.1 Classical Approximations
321
n\n\
R"(e*,Dl)~ (2n)l(2n + l)\
using Pade approximants centered at z : = 1/(2« + 1). This is due to Tre-
fethen [84]. (See Exercise 10.) The discussion of the approximations of exp
follows Newman [79], as do Exercises 7 and 8. These exercises illustrate the
different rates of convergence on disks and intervals. Exercise 8 is the n = m
case of a conjecture of Meinardus, namely, that
Rn,m(ex,[-l,l])
n\m\
2n+m(n + m + l)!(n + m)\
[1 + 0(1)]
as n + m —»°°. This conjecture has been resolved recently by Braess [84].
(See also Nemeth [77].)
For further discussion of the material of this section the reader is referred
to Cheney [66] or Newman [79]. The various expansions may be found in
Abramowitz and Stegun [64].
In Section 11.3 we will use the Pade approximant to exp to derive a
precise irrationality measure for e.
1. (Lagrange interpolation formula) Given n + 1 points in the plane,
(z,., wt), i = 0,. . . , n, so that zt ^ zj for i^j, show that there exists a
unique p E Pn so that
Show that
where
P(?i) = wt » = 0,. .
n
p(z) = X wtlt(z)
/,(*)--=nz~z*
/t=0 Zi Zk
k¥-i
2. (Existence of best approximants) Prove that En and Rn are well
denned, that is, show that the min is achieved in (10.1.2) and (10.1.3).
Hint: A uniformly bounded sequence of polynomials, all of degree n
has a uniformly convergent subsequence whose limit is a polynomial of
degree at most n.
3. a) Suppose that / is entire and that *n is the «th partial sum of / at
zero. Show that
322 Other Approaches to the Elementary Functions
Hint: Show that if f(z) ='■ E™=0 aLz', then for infinitely many m,
kJ(l-e)^
a,z
<|flm|(l+e) |z| = l
Now use the arguments of Theorem 10.1.
b) Suppose / is analytic in a neighbourhood of zero. For fixed n show
that
lim -Tj-7 J,— = 1 .
«10 \\j-Sn\\Ds
Part a) illustrates that the Taylor approximants behave globally
like best polynomial approximants to entire functions on disks.
The story is different on different shaped regions. Part b) shows
that locally the Taylor approximants are always optimal.
4. Prove that (10.1.4) has the representation (10.1.5).
5. Establish the estimates (10.1.12) and (10.1.13).
6. Establish the expansions (10.1.14) to (10.1.22).
7. (Polynomial approximation to exp on [—1,1]) Let sn be the nth
partial sum of exp at zero.
a) Show that if p(z) is a polynomial of degree n, then p(z)p(l Iz) is a
polynomial of degree n in the variable z + 1 Iz,
b) If x is the real part of z, where |z| = l, then ex = ez,2ez'2 =
ez,2e1,2z. On \z\ = 1 approximate ez'2 by s„(z/2) and approximate
e1,2z by sn(l/2z) and estimate the errors.
c) Use part a) to construct polynomial approximations to exp that
satisfy
F^r-1 n^gW2 + Q(1)
En(e,[ 1,1])^ 2> + 1)! •
Note the approximation is
e'-S^iS^i z: = x + iy.
,=o v2 i=0 v2
d) Modify part c) to show that
(This is in fact asymptotically optimal.) Hint; Consider the method
with approximation centered at IIn.
10.1 Classical Approximations
323
8. (Rational approximation to exp on [-1,1]) Show that
«„(«.L M])^^ (2n)!(2/i + l)! '
Hint: Proceed as in 7). First observe that Exercise 7a) holds for
rational functions of degree n. The approximation is given by
^-^.-(5)^-(27)
where \z\ = 1 and z := x + iy. Use estimates like those in the proof of
Theorem 10.2 to prove the result.
9. The (n, 1) Pade approximant pn to /:=E"=0 a,-z' has denominator
an+1z - an and, provided «„ 5^0, satisfies
/>„-/=0(zB+2).
a) Show that if/is entire, then there is a subsequence of {/?„} that
converges to / uniformly on any given compact subset of C.
b) Show that there exists an entire function so that the full sequence
iPn) does not converge uniformly on any open set in C.
Hint: Show that the poles of the pn can be dense in C.
Exercise 9, due to Beardon and Perron, illuminates some of the problems
inherent in uniform convergence questions for Pade approximants. It was
conjectured that subsequential convergence holds for the sequence of (m, k)
Pade approximants (k fixed) and for the sequence of (m, m) Pade
approximants. Much of the conjecture concerning convergence along rows (k fixed)
was recently settled by Buslaev, Gonchar, and Suetin [84]. They show, for
example, that if /is entire, some subsequence of the (m, k) Pade
approximants (k fixed) converges uniformly to / on compact subsets of C. These
conjectures, due variously to Baker, Gammel, Graves-Morris, Wills, and
others are discussed in Baker and Graves-Morris [81]. A more complete
convergence theory is available if one is prepared to settle for weaker types
of convergence, for example, convergence in measure.
10. (More on the Pade approximants to exp) Let
pm.n(*y-=j~at+z)me-'dt
qm,n^Y-=l (t-z)ntme-'dt.
324 Other Approaches to the Elementary Functions
a) Show, as in the proof of Theorem 10.2, that
qmA*V-pmM =zm+n+1 f0 (« ~ i)W1^2 du.
b) Show that, as m + n—»°°,
„ /".A,,2 „ t-\ _ \*-) m-n- nzl(m + n) m+n + 1 ri . /1 s -i
<lm,n(Z)e -Pm,n(Z)~ (m +„ + !)! e Z [1+°(!)]-
Hint: Observe that (u - l)"um is essentially a "spike" at u:=
ml(m + n) and that
fo(u-iyu"
(-l)"m\n\
du~ — —- —
(m + n + 1)! '
c) Show that
_ •£ m\(n + m — k)\ k
Pm>"{Z)'" to (m~k)\k\ Z
a (Z):=f "Kn + m-k)\ kk
qmA) to (n-k)\k\ ( ]
Recall that J0" t"e ' dt=n\
d) Show that
„ . Pm,n , _ "m,n
Pm>n -=-^- and Qm,n-=^-
are polynomials with integer coefficients of degree m and n,
respectively.
e) Show that, as n, m—><*>,
Pm,n(z) = (n + m)\e[m,("+m)]>[l + o(l)]
and
^,,,(^) = ^ + ^)^-^^^1 + 0(1)].
The convergence is uniform on compact subsets.
Hint: Examine the coefficients of pm n(z)/(n + m)\
f) Show that, as m, «—»°°,
g2 _ Pjnjt) = ("J?"7"1"' „ ^0-^^ +1[! + o(1)] .
qm,n(z) (m + n)\(m + n + 1)\
The convergence is uniform on compact subsets that avoid any
10.1 Classical Approximations 325
zeros of the denominator sequence. Observe that, by e), only
finitely many of these zeros lie in any compact set. Further details
may be found in Trefethen [84] or Braess [84].
g) Show that
Hint: Consider the Pade approximant centered at 2p2/(2« + l).
That is, replace z by z - 1p l(2n + 1) in parts e) and f). This gives
the upper bound. Use Rouche's theorem, as in the proof of
Theorem 10.2, to derive the lower estimate.
11. {On the main diagonal Pade approximants to log)
a) Suppose Sn and Tn are polynomials of degree n and suppose that
Tn(x)logx-Sn(x) = 0(l-x)2n+1.
Show that, if Tn(x) := t0 + txx + • • • + tnx", then
[T„(x) log x]( = B+1 Z ———
x ;=o n
"J
and hence
x"+1[r,(x)io8x]<"^_j,(-i)V_
(-ir«! ;e0 /„• ( L)tAx l)
b) Show that
r.w-.t (;)(">
if we normalize so that t„'-=l. [This is the denominator of the
(n, n) Pade approximant to log at the point 1.] Observe that Tn is
of degree n and has integral coefficients.
c) Show (with the above normalization, t„:=\) that dn ■ Sn has
integer coefficients, where dn := LCM(1, ...,«).
d) Let
^) := log *-|r§-
Show that
326 Other Approaches to the Elementary Functions
Hint: Observe that £„ := 0(1 ~xfn+\ Now differentiate to get
1 C T — T ?
Thus
xT\{x)tn{x) = T\- x(SnTn - t„Sn) = 0(1 - xf .
Since the middle term is a polynomial of degree 2« with lead
coefficient 1, we must have
xT2n(x)tn(x) = (l-xf".
Show that
-Efc_„ [fc!fc!/[(fc + n + l)!()k - n)!]](l -*)B+fc+1
s„W =
^0(1)^
e) Show that
■ ^=(2;)
and that for x a 0,
f) Show that, for x £ (1 - S, 1 + S),
where ca > 0 and ds > 0 depend only on 8, 0 < S < 1.
10.2 REDUCED COMPLEXITY METHODS
We are primarily concerned with methods that accelerate the evaluation of
the elementary function by reducing the complexity of evaluating one of the
standard approximants. Most of the approximants listed in the previous
section evaluated by usual methods (such as, Horner's rule) provide between
0(n) and 0(n log n) digits for n arithmetic operations. The slight difference
comes from the more rapid convergence of the Taylor polynomials for exp
10.2 Reduced Complexity Methods
327
than those for log. (See Exercise 1.) We proceed to examine three methods
of complexity reduction. While none of these methods is as fast as those of
Chapter 7, they all have their own particular advantages. The second
method, based on the fast Fourier transform, for example, applies to most
of the special functions.
10.2.1 Acceleration Based on Functional Equations
The exponential satisfies the functional equation
(10.2.1)
f(2z) = [f(z)]2
This allows us to reduce the calculation of the exponential to a small region
about the origin and then to approximate in that region using a Taylor or a
Pade approximant. From estimates like those of Section 10.1 and (10.2.1)
we have
(10.2.2)
•""['■Wl \*nt2* 1'
and
(10.2.3)
e2-
/zYTi i mm
'"■AW J 1 16"2 (2n)!(2n)!
z si
\z =sl
where sn and r„ „ are, respectively, the nth Taylor polynomial and the (n, n)
Fade approximant to exp at zero. Both of these above estimates provide
Oop(Vn) and 0B(VnM(n))
methods for the evaluation of exp.
The functional relation for log is
(10.2.4) , f(z2)=2f(z)
Combined with (10.1.19), this leads to
- 1 (z2-"-l
(10.2.5) 2"log(z2"") = 2"+1 X
k.02k + l \z' " + i
2/fc+l
Truncation after n terms leads to an approximation on \z -1| < \ that has
error 0(4 ~" ) and yields an algorithm for log with complexity
Oap(Vn) and 0B(VnM(n)).
These are good intermediate range estimates of exp and log. For
328
Other Approaches to the Elementary Functions
n '■= 100, (10.2.3) provides in excess of 12,000-digit accuracy at the expense
of roughly 300 full precision multiplications. (300 terms of the Taylor series
provides roughly 600 digits.)
Variations of the above methods can be used to construct 0op(VH)
algorithms for the circular functions and their inverses. (See Exercise 2.)
Since any elliptic function (Section 1.7) satisfies an algebraic "half-angle"
formula, we can, as above, construct Oop(nll2+s) algorithms for elliptic
functions provided we have expansions available at the origin.
(Unfortunately, convenient closed forms of the Taylor series for sn, for example, are
not available.)
10.2.2 Acceleration Based on the FFT
It is possible, based on FFT methods, to evaluate a polynomial or rational
function of degree n at n + 1 distinct points in Oop(«(log n)2). (See Exercise
3 of Section 6.2.) Our aim is to use this observation to accelerate the
evaluation of partial sums or related approximations. We illustrate with log.
Start with
-log (1 - z) = z + \z2 + ^z3 + • • •
(10.2.6)
and consider
(10.2.7)
We can write
(10.2.8)
zk
J„2(z):=X -r
s„2(z) = 2 zknpn(kn)
/t=0
- z
(10.2.9) Pn(y) = l1
i=ii + y
2\
Now, for fixed z, evaluate p„(kn) at k = 0,1,. . . , n - 1 in Oop(«(log n) )
(Exercise 4) and evaluate sn2 in a further 0{n) operation. Since for \z\ ^ \,
sn2(z) provides D,(n2) digits of log z, we have constructed an algorithm for
log which is Oop(nu (logrc)2).
With care, this idea can be extended to produce
(10.2.10) 0op(rc,/2(logrc)2) and 0B(nv2(logn)2M(n))
algorithms for a variety of nonelementary transcendental functions. Almost
any function with regular Taylor coefficients is susceptible to such analysis.
10.2 Reduced Complexity Methods
329
Algorithms for the gamma function and the hypergeometric functions, of
complexity given by (10.2.10), are presented in Exercises 6 and 7.
We can combine the methods of Section 10.2.1 with the above to
construct an
(10.2.11) Oop(rc,/3(logrc)2) and 0B(nu\logn)2M(n))
algorithm for log. This merely requires truncating (10.2.5), say, after n
2
2\
terms and evaluating the truncation using the FFT method in 0 («(log n) )
3\
steps. Note that this approximation provides 0(n ) digits of log.
Likewise algorithms can be constructed for exp and the trigonometric
functions with complexity given by (10.2.11).
The relative difficulty of implementing these algorithms renders them
largely of theoretical interest.
10.2.3 Acceleration Based on Binary Splitting
If we wish to evaluate the constant e by summing the Taylor series at 1, then
we can and should take advantage of the reduced length of each individual
operation. With this in mind consider
(10.2.12) p(a,b):=b\/a\
and
(10.2.13) c(a, b) : = ,(*+£, b)c(a, £**),<*-)« + c(« + * , ,)
where
By construction,
c(a, a + 1) := (a + l)x.
(10 2 14) C(°'2) = X —
With x-=l, (10.2.14) approximates e with an error of less than e/(2")!, and
hence provides fi(«2") digits of e. We can use (10.2.13) to recursively
evaluate e with bit complexity (9B((log n)M(n)).
Brent [76c] shows how to modify this to provide an (9B((log n)2M(n))
algorithm for exp. This is outlined in Exercise 8.
Modifications and variations lead to OB((log n)2M(n)) algorithms for all
the elementary functions. Particular values of various of the nonelementary
special functions are also amenable to this analysis. So, for example, is
330
Other Approaches to the Elementary Functions
Euler's constant. [See Exercises 10 and 11.] Exercise 9 outlines an
0B((logn)2M(n)) algorithm for it based on recursive evaluation of arctan.
It is perhaps worth underlining the observation that these acceleration
methods apply only to the bit complexity. The operational complexity is not
reduced. Nonetheless, the algorithm for the number e implicit in (10.2.12),
(10.2.13), and (10.2.14) is asymptotically as fast as any known and has the
virtue of being implementable using only integer addition and
multiplication, except for a single final division.
Walz [Pr] has studied classes of asymptotic methods using extrapolation
and elimination techniques. These often outperform AGM-based methods
in the "microcomputer range" (less than 20 digits). Interestingly, for the
complete elliptic integral of the first kind he finds the AGM to be always
superior. For incomplete integrals this is not always so.
Comments and Exercises
Much of the material of this section is due to Brent [76c]. In particular,
Exercises 3 and 8 follow Brent closely.
1. a) Show that truncating the series in (10.1.14) and using Horner's
rule leads to an Oop(n/\ogn) algorithm for exp.
b) Show that (10.1.18), (10.1.19), and (10.1.20) all lead to Oop(n)
algorithms for log using usual methods for evaluating the
polynomials or rational functions in question.
c) Show, for n : = 2" and 8 := 1/2", that (10.1.17) and (10.1.22) lead
to Oop(n) algorithms for exp and log.
d) Analyze the complexity of (10.1.15), (10.1.16), and (10.1.21).
Assume in all parts that the method in question is used on a
compact region bounded away from the boundary of the domain of
convergence.
2. a) Use the functional relation
(10.2.15) 2 arctan ( , , Z J = arctan z
Wz2 + 1 + 17
and the expansion
v (—1) z , ,
arctanz = 2j —-,, , .— -2 <1
^o 2/: + 1
to construct on Oop(n ) algorithm for arctan on \z\ ^17 < 1.
b) Use the functional relation
' z
(10.2.16)
cos,2
2 cos z + 1
to construct an 00p(n1/2) algorithm for cos on Dt.
10.2 Reduced Complexity Methods 331
c) Show, in general, that if
CO
f(z)=lLanzn \z\<l + S
n = 0
and f(z/2) is an algebraic function of f(z), then one can construct
an algorithm of complexity
Oop(nin) and 0B(nU2M(n))
on Dr
Instead of approximating ez/2" by sn(zl2") in (10.2.2), approximate
ezn" - 1 by sn(z/2") - 1. Then repeatedly use the relation
(1 + e)2 - 1 = 2e + e2
to evaluate ez - 1. Show that this avoids requiring 0(nin) guard digits.
This modification and its obvious analogue for the Pade approxim-
ant allow the calculation of exp without loss of significant digits beyond
the 0(log ri) loss inherent in performing n112 operations.
Show that p„(y) °f (10.2.9) can be written as
, x wn(y)
p»{y) = ^Jy)
where w„ and vn are polynomials of degree n. Use FFT methods to
show that the coefficients of w„ and vn can all be calculated in
Oop(«(log rif) and hence, that pn{y) can be evaluated at n points
in Oop(«(log rif). (See Exercise lb) of Section 6.2.)
a) Construct an Oop(«1/2(log rif) algorithm for exp by writing
X ^ = ^(0) + --- + ^-1)
where
x x^_
(Nm)\ Lx Nm + 1 (Nm + l)(Nm + 2) + " '
„A/-1
xNm
tN(m):--
+
(Nm + !)••• (N(m + 1)-1)
and evaluating tN{m) for m = 0,. . . , N— 1. Care must be taken
to compute (Nk)\ in the requisite time,
b) Show how part a) can be used to construct an Oop(«1/3(log«)2)
algorithm for exp.
332 Other Approaches to the Elementary Functions
c) Construct an Oop(«1/2(log rif) algorithm for the Bess el function of
order zero,
°° (--\\k72k
<ia2-17) J^:'■s.Wsr-
6. (An 0op(nll2(log n)2) algorithm for the gamma function)
a) From the definition
r(s):= j~ e-T1 dt
show, by breaking the integral at N and expanding, that
(10.2.18) rw^ifc^-^ + JV,---,,.
b) Show for s G [1,2] that
6N
™-«OT&
s2Ne~N
c) Use b) and FFT methods to construct an algorithm for the gamma
function of complexity
Oop(nu\lognf) and 0B(nu\logn)2M(n)).
7. (0op(n1/2(/ogn)2) algorithms for hypergeometric functions) We now
consider a (general) hypergeometric function to be a function
CO
(10.2.19) f(z):=l+% anz"
n-l
where an/an_1'= R(n) for some fixed rational function R. In this
problem R is assumed to have rational (or precomputed) coefficients
and / is assumed to have a nonzero radius of convergence.
a) Show that the Gaussian hypergeometric series F(a,b;c;z) of
(1.3.5) is hypergeometric by the above definition. Show that,
provided a, b, and c, c ¥=■ 0, -1, -2, ..., are rational, F(a, b; c; z)
satisfies the additional assumptions.
b) Show that sin(Vz), e\ log(l-z), EItt, and Klir are all
hypergeometric functions, up to a rational normalization.
c) For fixed n let
10.2 Reduced Complexity Methods 333
n2-l
Sn*-i(z)-= 2 akzk
k=0
/fc-1
T(k):=UR(i) /2(0) := 1
/ = 0
and
Q(fc) :=/?(£) + /?(£)/?(£ +l)z + ---
+ R(k)R(k + l)---R(k+n- l)zn~l .
Observe that
k
ak = U R(k)
/=0
and show that
5„2_! = 6(0) + z"Q(n)T(n) + z2nQ(2n)T(2n) +■■■
+ zn(n~l)Q(n(n - l))T(n(n - 1)).
d) Show that
T(n), 7X2/1),..., T(n(n - 1))
can all be evaluated in Oop(«(log«)2).
Hint: Consider the rational function of y
V(y):=YlR(i + y)
i-0
and observe that
T(kn) = V(0) ■ V(n) ■ • • V((k ~l)n).
Now first compute the coefficients of V by recursively breaking the
problem in half and using a fast multiplication. Then calculate V at
the points K(0), V(n), V(2n),..., V((n - l)n). (See Exercises 1
and 3 of Section 6.2.)
e) Show that
2(0), Q(n),..., Q((n - l)n)
can all be evaluated in Oop(«(log rif).
334 Other Approaches to the Elementary Functions
Hint: Note that Q is a rational function of k of degree bounded by
n ■ degree(/?). Now show that the coefficients of Q as functions of
k can be calculated in Oop(«(log nf) by proceeding recursively.
To do this consider
R(k) + [R(k)R(k + l)]z + ■ ■ ■
+ [R(k)R(k + l)---R(k + 2n- l)]z2n~l
= {R(k) + ■■■ + [R(k)R(k + 1) • • • R(k + n - l)]zn~1}
+
z" I! R(i)
{R(k + «) + ••• + [R(k + n)
xR(k + n + l)---R{k + 2n-l)\zn~1}
and use a fast polynomial multiplication to recombine the pieces.
Finally evaluate 2(0), Q(n),..., Q((n - l)n) as before,
f) Use the preceding parts to construct
Oop(rc1/2(logrc)2) and 0B(nu\logn)2M(n))
algorithms for any hypergeometric function.
(An 0B((log n)2M(n)) algorithm for exp)
a) Show that (10.2.13) can be used to recursively evaluate e with bit
complexity 0B((\ogn)M(n)).
b) Use (10.2.13) for rational p/q, where p2<q<2N, to compute
eplq. Show that the bit complexity is as in Exercise 8a).
c) Suppose x G [0,1) is a binary 2m-digit number. Show that x can be
written as
V Pk
X = Zj
k-0 <ik
where qk- = 22 and 0<pk<22
d) Write
fl ePk'q"
k = 0
with pk and qk as in c). Show that this gives an Os((log n)2M(n))
algorithm for exp on [0,1).
9. (An 0B((log n)2M(n)) algorithm for it)
a) Consider the expansion
10.2 Reduced Complexity Methods 335
and the recursion
c(a, a + 1) := — x2
c(a,b):=P(^,b)c(a/-^y-xy^
I a + b\ (a + b ,\
+ p{a,-2-)c{—,b)
where, for a < b,
p(a, b) := (2a + 1) • (2a + 3) • • • (2b - 1).
Show that
c(0,2")
*a"+%(l,2")
calculates the (2"+1 -l)th partial sum of arctan(l/x).
b) Show that for a fixed integer x > 1 the above recursion computes
arctan(l/x) in
0B((\o%nfM(n)).
This gives an Os((log n)2M(n)) algorithm for it from
tt = 16 arctan ( 5) - 4 arctan ( 535)
or any similar arctan formula.
10. a) Let / be a hypergeometric function defined as in Exercise 7 (with
the same additional assumptions). Show, for fixed plq rational
inside the region of convergence of the expansion (10.2.19), that
f(plq) can be calculated with bit complexity
0B((lognfM(n)).
b) Show, for fixed rational plq, that T(plq) can be calculated with
bit complexity
0B((\ognfM(n)).
Other Approaches to the Elementary Functions
(On the complexity of Euler's constant) Euler's constant or the
Euler-Mascheroni constant y is denned by
, 1 1
1
H log m
m 6
y '■= lim
m—>«
(10.2.20)
It is related to the gamma function by the formula
= 0.5772156649
(10.2.21) ^
a) Use the recursion
= zeyz
n(i+^>-
L„=1\ nl
r<
> + 1) = zY(z)
to prove that if T has an expansion of form (10.2.21), then y is
given by (10.2.20).
b) The exponential integral E1 is denned by
(10.2.22)
Show that
/•CO —t
dt
-E1(x) = y + \ogx+*Z )-½^. X>Q-
(10.2.23)
c) Use (10.2.23) to construct an
0B((\ognfM(n))
algorithm for y by choosing x roughly of size 6«.
This method, suggested by Sweeney [63], is a reasonably efficient
method for computing y. Brent and McMillan [80] present a
number of algorithms for this computation. They calculate over
29,000 partial quotients of the continued fraction for y. As a
consequence they show that if y is rational the denominator of y
exceeds 1015'000
Chapter Eleven
Pi
i
Abstract. The first section of the chapter deals with the history of the
calculation of ir and related matters, while the second section deals with its
transcendence. The third section looks at irrationality measures and includes a
proof of the irrationality of 1(3). This chapter is largely self-contained and
indeed contains considerable related number theory, especially in the
exercises.
11.1 ON THE HISTORY OF THE CALCULATION OF it
The history of ir presumably begins with man's first attempts at estimating
the perimeter or area of a circle of given radius and as such starts at the
dawn of recorded history. The Egyptian Rhind (or Ahmes) Papyrus which
dates from approximately 2000 B.C., gives a value of (16/9)2 = 3.1604.. . for
ir. Various other early Babylonian and Egyptian estimates include 3, 3g,
and 3¾. Implicit in the Bible (1 Kings 7: 23) is a value 3: "And he made a
molten sea, ten cubits from the one brim to the'other; it was round all
about. .. and a line of thirty cubits did compass it round about."
Mathematical interest in ir comes into sharp focus in the classical Greek
period. The Greeks investigated the problem of "squaring the circle." This
question and its final resolution over two millenia later will be pursued in
the next section. Currently we wish to review the primary Western
developments in the calculation of ir.
Archimedes of Syracuse (287-212 b.c.) provided the first major landmark
in the quest for digits of ir. By considering inscribed and circumscribed
polygons of 96 sides, Archimedes gave the estimate
n 10 n 1
3 — <ir<3 - .
71 7
337
338
Pi
A salient feature of Archimedes' method is that it can, in principle, be used
to provide any number of digits of it.
If an denotes the length of a circumscribed regular 6-2"-gon and bn
denotes the length of an inscribed regular 6 ■ 2"-gon about a circle of radius
1/2, then
(11.1.1) aB+1 =
2a„b
n n
(11.1.2) bn+1 = V^Fn.
This two-term iteration, starting with a0 := 2V3 and b0 •'= 3, can be used to
calculate it. (See also Section 8.4.) The fourth iteration yields a4 = 3.1427. . .
and bA = 3.1410. . . and corresponds to estimating it using polygons with 96
sides.
If we observe that
(1M-3> ••"-*■"- (<.., /C)fe^.)(a--tJ
we again see that the error is decreased by a factor of approximately 4 with
each iteration. Variations of this modern formulation of Archimedes'
method provided the basis for virtually all extended precision calculations of
ir for the next 1800 years, culminating with Ludolph van Ceulen (1540-
1610) who correctly computed 34 digits. The limitations of this method stem
from the relatively slow convergence and from the need to extract square
roots. (See Exercise 1.)
Frangois Vieta (1540-1603) gave the first infinite expansion
which he derived by considering a limit of areas of inscribed 2"-gons. (See
Exercise 2.) John Wallis (1616-1703) through a complicated calculation
demanding prodigious numerical insight derived the infinite product
expansion
(1115) £ = 2-2.4.4-6-6.8-8-
^ ; 2 1-3-3-5.5.7-7.9..- '
This appears in his Arithmetica Infinitorum-oi 1655. A few years later Lord
Brouncker (1620-1684), the first president of the Royal Society, recast this
as the continued fraction.
11.1 On the History of the Calculation of rr
339
4
IT =
i + —L
9
(11.1.6) 2+
25
2+
49
2 +
2 +
The Scottish mathematician James Gregory (1638-1675) in 1671 provided
the underlying method for the next era in the history of the calculation of it.
He showed that
/■.I 1 r,x f* dx x3 xs x1
(11.1.7) arctan* = Jo —? = X-J + J-J + ...
and hence, on setting x'-=l, that
(11.1.8) ^ = l-| + |-i + ...
a formula independently discovered in 1674 by Leibniz (1646-1716). By the
beginning of the eighteenth century Abraham Sharp under the direction of
the English astronomer and mathematician E. Halley had obtained 71
correct digits of it using Gregory's series (11.1.7) with x' = Vl/3, namely,
7T = J_ / _J_ _1 1_
6~V3V 3-3 + 32-5 33-7
It is the techniques of calculus that so expanded the scope for calculating,
and it is perhaps not surprising that Isaac Newton (1642-1727) himself
calculated it to 15 digits sometime in 1665-66. He used the series
3V3 /11 1 1 \
(11.1J0) T=„+24(________...)
which is essentially an arcsin expansion. (See Exercise 4.) Newton was later
to write: "I am ashamed to tell you to how many figures I carried these
computations, having no other business at the time." John Machin (1680-
1752) derived the formula which bears his name:
(11.1.11) ^ = 4 arctan (|) - arctan (^) .
Coupled with Gregory's series for arctan this provides a very attractive
method for calculating it since the first term is well suited to decimal
arithmetic and the second term converges very rapidly. Machin calculated
100 digits this way in 1706. In the same year William Jones published his A
(11-1-9) Z = 77?(l-^ + ^-7-^ +
340
Pi
New Introduction to the Mathematics, where he denoted the ratio of the
circumference to the diameter by the Greek letter ir, presumably for the
first letter of periphery. It was, however, Leonard Euler (1707-1783) who
popularized the use of the symbol. Euler derived numerous series and
products for ir and ir2. Among the best-known are
2
(11.1.12) ^-=2^
and
6 t^\ n
(n-1.13) ib"=^^
=1 n
The explicit summation of (11.1.12) had eluded Liebniz and also the
Bernoulli brothers, Jacques and Jean. The method by which Euler derived
his evaluations of E™=1 \ln2k is outlined in Exercise 7. This is to be found in
Euler's Introducio in Analysin Infinitorum of 1748. The Machin-like formula
(11.1.14) ir = 20 arctan (-J + 8 arctan ( —J
coupled with the expansion
V / 2 2-4 , \
(11.1.15) arctanx = - (1 + x y + j^ y +•••)
where y •'= x2/(l + x2), allowed Euler to compute 20 digits of ir in under an
hour.
The next 200 years saw little change in the methods employed to calculate
ir. In 1844 Johann. Dase (1824-1861), a calculating prodogy, used the
formula
(11.1.16) — = arctan (-) + arctan (-J + arctan (-)
to produce 205 digits of ir. (Dase's arithmetical abilities were awesome—he
could multiply 100-digit numbers together in his head, a feat which took him
roughly 8 hours.)
The zenith (or nadir depending on your perspective) in premachine
calculations was achieved by William Shanks (1812-1882), who published
607 purported digits of ir, of which 527 were correct. Later Shanks
published an extension to 707 digits. This was also incorrect after the 527th
digit. These calculations took Shanks years and were performed in an
entirely straightforward fashion using no tricks or shortcuts. (See W. Shanks
[1853].) The mistakes went unnoticed until 1945, when D. F. Ferguson, in
one of the final hand calculations, produced 530 digits. Ferguson produced
808 digits in 1947, using a desk calculator and the formula
11.1 On the History of the Calculation of rr
341
(11.1.17) I = 3 arctan (~) + arctan (~) + arctan (-^) .
Thus dawns the computer age. In June 1949 ENIAC (Electronic
Numerical Integrator and Computer) was used to evaluate 2037 digits of it using
Machin's formula and 70 hours elapsed time. An analysis of the distribution
of the digits was carried out by Metropolis, Reitwiesner, and von Neumann.
By 1958, Genuys had computed 10,000 digits on an IBM 704 in 100 minutes,
once again using Machin's formula. Felton had performed a 10,000-digit
calculation in 1957; however, due to machine error it was only correct to
7480 digits. In 1961 D. Shanks and Wrench [62] used the identity
(11.1.18) it = 24 arctan ( -J + 8 arctan ( — ) + 4 arctan [^r
and under 9 hours on an IBM 7090 to produce 100,000 digits of it. This was
checked using the formula
(11.1.19) it = 48 arctan (—j + 32 arctan (—j - 20 arctan (^) •
The million-digit mark was set by Guilloud and Bouyer in 1973 on a CDC
7600. The calculation, which took just under a day, used (11.1.19) with
(11.1.18) as a check.
. Kanada, Tamura, Yoshino, and Ushiro [Pr] calculated in excess of 16
million digits using an AGM based algorithm, Algorithm 2.2, and checked
10 million digits using (11.1.19). The 16 million-digit calculation took under
30 hours on a HITAC M-280H and used an FFT-based fast multiplication.
At the end of 1985 the record belonged to W. Gosper. He calculated 17
million terms of the continued fraction expansion for it and so in excess of
this number of decimal digits—after a radix conversion from a binary
computation. His method is based on a very careful evaluation of Ramanu-
jan's series (5.5.23) on a Symbolics 3670. (A remarkable feat considering
the size of the machine.) As is surprisingly often the case with these large
scale calculations, Gosper uncovered subtle design flaws which had not
surfaced in smaller calculations.
In January 1986, D. H. Bailey [88] computed 29,360,000 decimal digits of
tt on the CRAY-2 at the NASA Ames Research Center. This calculation
used only 12 steps of the quartic algorithm (5.4.7) with /-:=4. This results
in computing a(250), which agrees with it-1 to more than 45 million places.
The calculation took less than 28 hours and was verified with a 40-hour
computation of 25 steps of Algorithm 2.1. It is amusing to observe that the
quartic calculation requires well under 100 full precision multiplications,
divisions, and root extractions.
In July 1986, Kanada reclaimed the record with a computation of 225
decimal digits. He again used Algorithm 2.2,. verified in September using
(5.4.7) with r : = 4, but reduced the elapsed time to 5 hours and 56 minutes
on a S-810/20 super computer. This represented a speed-up by a factor of
342
Pi
15. His previous computation now used only 96 minutes of CPU time. Plans
were to compute 227 decimal digits (over 100 million) at the end of 1987.
Nor is the end in sight. It will probably be the case than hundreds or
thousands of millions of digits will be calculated by the end of the century.
(This is now more a matter of will than anything else. Indeed, by the time of
second printing Kanada had similarly computed 201 million digits.) Apart
from observations like "the sequence 314159 appears in the digits of ir
commencing at digit 9,973,760," there is little we care to say about the
digits. They have, however, been subjected to considerable scrutiny. It is an
open question as to whether ir is normal. That is, do all sequences of
integers appear with the same frequency in the digits (are one-tenth of the
digits 7, one-hundredth of the consecutive digit pairs 23, etc.)? On the basis
of the first 30 million digits the answer appears positive. This, of course, is
no great help in deciding the normality issue. (See Wagon [85].)
In terms of utility, even far-fetched applications such as measuring the
circumference of the universe require no more digits than Ludolph van
Ceulen had available—but then utility has had little to do with this
particular story.
Comments and Exercises
This section presents only the highlights of the quest for digits. The matter
may be pursued in detail in Beckmann [77], a most useful though rather
individualistic history, and in Le Petit Archimede [80]. Schepler's chronog-
raphy [50] and Wrench's history [60] are also of interest. Details of the more
recent calculations may be found in Tamura and Kanada [Pr], where a
compendium of Machin-like identities is provided.
There is also a considerable collection of ir-related trivia. For example,
the Indiana House of Representatives attempted to legislate the value of ir
in Bill 246 of 1897. The bill, which appears to proclaim ir to be several
different incredibly inaccurate values, including 4 and 64/25 (see Beckmann
[77], and Singmaster [85]), passed the House and only floundered in the
Senate on the apparently chance intercession of C. A. Waldo, a professor at
Purdue. Keith [86] gives a 402 digit mnemonic for ir.
1. a) Show that the algorithm of (11.1.1) and (11.1.2) calculates ir by
showing that an and bn are as advertised,
b) Prove (11.1.3) and estimate how many iterations of (11.1.1) and
(11.1.2) are required to calculate 35 digits of ir. This should be
compared to Bailey's [88] calculation which uses the same
operations.
2. a) Prove, from the product expansions for sin and cos, that
(1LL20) 6= cos (0/2) cos (0/22) cos (0/23)~ ^<7r'
b) Alternatively deduce (11.1.20) in an elementary fashion by setting
11.1 On the History of the Calculation of rr 343
j)cos(|7 _V2„
ln :=COS I o ) COS \ ^2 J • • • COS ( ^
and showing that
sin 8
2" sin (0/2")-
c) Set 0:=tt/2 and use the formula cos(0/2) = V2 + \ cos 6 to
deduce Vieta's formula (11.1.4)
2 111 /1 1 11 1 /1
^2^2 + 2^2^2 + 2^2 + 2^2
1. a) Prove Wallis's formula (11.1.5) in the form
tt_2-2-4-4-6-6-8-8
2 ~ 1-3-3-5-5-7-7-9 '" '
Hint: Show that
I
■jr/2
o "" ""*" 2 • 4 • 6 • • • 2m
■ 2m , 1-3-5--- (2m - 1) it
sin xdx= —
and
I
vn 2m+i , 2-4-6---2/n
• zm t-1 j
sin xdx =
h l-3-5---(2/n + l)'
b) Establish the corresponding formula for e:
e /2\1/2/2-4\1/4/4-6-6-8N""
2 \1/ V3-3/ V5-5-7-7
c) Show that the volume of the 2«-dimensional unit sphere is ir7n!
while the (2n +1)-dimensional unit sphere has the volume
22n+l[n\l(2n + l)!]ir". Find a unified formula for these two cases.
Deduce (11.1.10) roughly as Newton did. Show that
V3
rl/4
S"32=l V^7*
and that
rl/4 /-1/4
J Vx-x2 dx = Jo V]c(Vl-Jt)
dx
_2 1_ 1
3-23 5-25 28-27
344 Pi
5. a) Deduce Machin's formula
it . . /1\ /_1
-=4 arctan ^ - j - arctan ^ 239
as follows. Let 0 '•= arctan 5. Then
2 tan 0 5
tan 20 =
1 - tan2 0 12
and
Hence
a* 120 -, 1
tan 40 = — = 1 + ^-
ir\ -1 +tan 40 1
tan 40
4 / 1 + tan 40 239
b) Show that arctan satisfies the addition formula
I x + y\
arctan x + arctan y = arctan I -r—— I xy < 1.
c) Show that
(11.1.21) arctan (-) = arctan (—■—) + arctan(-5— )
\p/ \p + q! \p2+pq + l/
and that if 1 + p2 = qr,
(11.1.22) arctan (^) + arctan (^) = arctan (I).
Formula (11.1.21) was known to Euler. Bromwich [26] attributes (11.1.22)
to Charles Dodgson (Lewis Carroll).
6. (Machin-like formulae)
a) Show, for integral a; and bj, that
(bi\ lbi\ (b,
kir = arctan — + arctan —.+••• + arctan —
\a1/ \a2/ \an
where k is an integer if and only if
(a, + ifcJCflj + ib2) •••(«„ + ib„)
has zero imaginary part.
11.1 On the History of the Calculation of rr
345
Hint: Consider (a1 + ib^- ■ -(an + ibn) = re'8, |0| < it, and use the
fact that
1 , (i + iz
arctan z = — log I —
(This gives an algorithmic check for Machin-like formulae.)
b) Show, for positive integral u, v, and k and integral m and n, that
/1\ /1\ kir
m arctan I - I + n arctan I - I = —
if and only if (1 - i)\u + i)m(v + if is real.
c) Verify
^ , /1
— = 4 arctan I -
4 \5
arctan
(±
TT /1 ,
— = arctan I - I + arctan .
V239
1
— = 2 arctan I -
arctan
j = 2 arctan ( - j + arctan ( -
(Machin, 1706)
(Euler, 1738)
(Hermann, 1706)
(Hutton, 1776).
These are, in fact, all the nontrivial solutions of b). This was a
problem of Grave's solved by St0rmer in 1897. The problem can
be reduced to finding integral solutions of 1 + x2 = 2y" or 1 + x2 =
y", n 2 3, n odd. (See Ribenboim [84].) Much related material on
Machin-like formulae occurs in Lehmer [38] and Todd [49].
7. Prove Brouncker's continued fraction by showing that
Hint: If
then
= 1 +
1
2 +
2 +
25
2 +
s '■= a0 + ax + axa2 + a1a2a3 +
s = a0 +
1 + a0
1 + flo
346
Pi
This is a nonsimple continued fraction. The convergents satisfy
a similar recursion to that given in Exercise 2 of Section 11.3.
Apply this to
so that
and
3 5 7
XXX
arctan ^ = ^-- + -^--- +
2 -,2
— X —jX
a0 = 0, a1 = x, a2 = —r~, «3 = —7"
arctan x:
1 +
x2
3-x2 +
9x2
5 - 3x2 +
7 - 5x2 •
Now set x := 1. (According to Beckmann, Brouncker merely announced
his result—the above derivation is essentially due to Euler.)
8. Consider the series
x3 xs
(11.1.23) sin x = x - — + Tj- — ■ • • .
Observe that sin x = 0 exactly when x = ± fcir. Now observe that on
setting y -=x2,
y y2 y3
(11.1.24) 1-^ + ^-^ + ---=0
exactly when y = (fcir)2, k = 1,2,3,. . .. If (11.1.24) were a polynomial,
we would know that the sum of the reciprocals of the roots of (11.1.23)
equals the negative of the coefficient of y and in general the sum of the
reciprocals of the powers would be expressible in terms of the
coefficients and Bernoulli numbers. Thus we would deduce that
tx (kirf 6 3nd £, (kirf 90 '
Use the product expansion for sin [Section 2.2, Exercise Id)] to make
the above argument of Euler's rigorous. (See also Exercise 14 of
Section 11.3.)
9. In computing ir from (11.1.19) one must evaluate arctan(^) and
arctan(^). Use (11.1.15) to observe that
11.2 On the Transcendence of it
347
arctan (h) = 18 (sfe + J^tf + 3^25? + '"')
and
arctan(^) = 57(^ +^^+ ^-^3+••••).
Thus terms of the second series are just decimal shifts of terms of the
first series. (See Ballantine [39].) How does this affect the'complexity
of calculating the two arctans?
10. Prove that the number
0.12345678910111213 • • • /1(/1 + 1) • • •
is normal. A proof may be found in Niven [56].
11.2 ON THE TRANSCENDENCE OF it
The problem of "squaring the circle" is the problem of constructing a square
of the same area as a given circle of radius 1, or alternatively given a line
segment of unit length of constructing a segment of length Vtt. The rules of
construction allow for the use of an unmarked straightedge and an
unmarked compass. A more precise definition of constructible is provided in
Exercise 1. In fact, the constructible numbers are exactly those numbers
which can be obtained from the integers by a finite sequence of rational
operations and extraction of square roots. (See Exercise 1.) Thus
constructible numbers are algebraic and the transcendentality of it shows the
impossibility of the problem.
The Greek notion of number, based on geometric construction, made
consideration of such problems more natural than they perhaps seem today.
Indeed the problem had arisen by the fifth century b.c. Anaxagoras, who
died in 428 B.C., had, according to Plutarch, considered it while in jail. His
contemporary, Hippocrates of Chios, the author of one of the first geometry
texts, also considered the question. The other classical Greek problems of
"duplicating the cube" and "trisecting the angle" also arose in this period.
The "Delian problem" of duplicating the cube (in volume), so named
because the oracle of Apollo at Delos had prescribed duplicating the cubical
altar as a means of halting the plague of 428 b.c, is equivalent to
constructing V2. (The impossibility of solving these problems is also
discussed in Exercise 1.)
By 414 b.c. attempts at constructing it had become so numerous that
Aristrophanes refers to "circle squarers" in his play "The Birds." The term
came to refer to people who attempt the impossible. However, attempting
the futile is not always a waste of time. As Boyer [68, p. 71] points out:
348
Pi
The better part of Greek mathematics, and of much later mathematical
thought, was suggested by efforts to achieve the impossible—or, failing
this, to- modify the rules. The Heroic Age failed in its immediate
objective, under the rules, but the efforts were crowned with brilliant
success in other respects.
It is hard to know whether more energy has been consumed by attempts
at circle squaring or by calculations of it. While doomed attempts to square
the circle flourish to this day, by the eighteenth century it had become
accepted in the mathematical community that the problem was probably
impossible. In 1755 the French Academy of Sciences refused to examine
any more quadratures while as early as 1668 Gregory had attempted to
prove their impossibility. The first substantial step in this direction was due
to Lambert (1728-1777), who proved it irrational in 1761. A subsequent
more rigorous proof was provided by Legendre (1752-1833). Legendre
proved the irrationality of it2 in his Elements de Geometrie of 1794 and
commented:
It is probable that the number it is not even contained among the
algebraic irrationalities . . . But it seems to be very difficult to prove
this strictly.
(See Exercise 3.) This belief was shared by Euler. Liouville established the
existence of transcendental numbers in 1840, and in 1873 Hermite proved e
transcendental. It had been proved irrational by Euler in 1737. Finally in
1882 F. Lindemann [1882] extended Hermite's proof to cover the
transcendence of it, thus laying to rest a 2300-year-old problem. This was simplified by
Weierstrass [67] in 1885, Hilbert [1893] in 1893, and many others.
Lindemann in fact established more generally that
pie"i + --- + pnea»*0
for distinct algebraic numbers alt. . . , an and nonzero algebraic numbers
Plt. .. , Pn. (See Exercise 7.) The transcendentality of it follows since
e,7r -1 = 0. This is the signal achievement of the nineteenth century with
regard to transcendental number theory. Note that Lindemann's theorem
implies the transcendence of cos a, sin a, and tan a for algebraic a^O and
also the transcendence of log p for p algebraic, p 5^0 or 1.
In 1900 Hilbert, as the seventh of his 23 problems posed at the
International Congress in Paris, asked whether a13 is transcendental for a algebraic
(a 5^0,1) and p an algebraic irrational. This was solved independently by
Gelfond and Schneider in 1934. (See Niven [56]). It is interesting to note
that Hilbert had speculated that this problem would probably resist solution
longer than the Riemann hypothesis or Fermat's last theorem. In 1966
Baker substantially generalized the Gelfond-Schneider theorem by showing
11.2 On the Transcendence of rr
349
that any nonvanishing linear combination of logarithms of algebraic
numbers with algebraic coefficients is transcendental. That is, if a;. and /3;. are
nonzero algebraic numbers, then /30 + S"_x a;log j8;.^0. (See Baker [75].)
Note that i' = e-'"72, and so transcendence of e" follows from the Gelfond-
Schneider theorem.
Comments and Exercises
For a discussion of constructibility one might consult Famous Problems in
Elementary Geometry by Klein [1897]. The treatment we give in Exercise 1
follows Clark [71]. The exercises on transcendental numbers follow Baker
[75], Hardy and Wright [60], Hua [82], and LeVeque [77], and for the most
part are simplifications of the original arguments. The particularly simple
proof of the irrationality of it2 (outlined in Exercise 3) is due to Niven [56].
The reader interested in pursuing these matters in depth is directed to Baker
[75], Mahler [67], or Lang [66a],
1. (On constructible numbers, doubling the cube, and trisecting the
angle) Constructible numbers can be defined by:
(i) The points (0, 0) and (0,1) are constructible.
(ii) Lines joining constructible numbers are constructible.
(iii) Circles with constructible centers and constructible radii (that
is, the radius is the distance between two constructible points)
are constructible.
(iv) The points of intersection of constructible lines and circles are
constructible.
It is the points of (iv) that form the constructible numbers.
a) Prove that the constructible numbers are a subset of the algebraic
numbers.
b) Show that perpendicular bisectors are constructible. See the hint
provided in the illustration. Hint:
B
350
Pi
c) Show that a circle through three given points is constructible. Se*
the hint provided in the illustration. Hint:
d) Show that the constructible numbers form a field. See the hint
provided in the illustration. Hint:
e) A number field C is constructible if C = Q(clt. . . , cn), where
c1;. . . , cn are all constructible. Show that if C is constructible,
then C has degree 2m over Q.
f) Show that real extensions of degree 2 are constructible.
g) (The impossibility of doubling the altar) Use e) to show that V2 is
not construcible and so the Delian problem is not solvable.
h) (The impossibility of trisecting the angle) Show that constructing
an angle 8 is equivalent to constructing cos 8 and show that a 60°
angle is constructible. Show that
cos 38 = 4 cos3 8 - 3 cos 8
11.2 On the Transcendence of rr
351
and hence, that cos 20° is a root of
8x3 -6*-l.
Show that the above polynomial is irreducible over Q and hence, by
part e), that a 20° angle is not constructible. Show that the
constructibility of 8, given a rational cos 38, depends only on
whether 4x3 -3x- cos 38 factors over Q. Show that a 30° angle is
constructible.
The following series of exercises is on transcendental and algebraic
numbers. Recall that a is algebraic of degree n if a is the root of an
irreducible polynomial of degree n with integer coefficients. If a satisfies no
such algebraic equation, it is transcendental.
2. (On Liouville numbers)
a) Prove that if a is algebraic of degree n, then for all e > 0 and for all
c>0.
0<
P
a
9
<
<7
has only finitely many solutions with p and q integral.
Hint: Suppose a satisfies
p(a) '■= ana" + ■ ■ ■ + a0 = 0.
Then, provided plq is not a root of p,
So by the mean value theorem, for plq E [a — 1, a + 1],
P
a
q
\p(a)-p(p/q)\ > D
sup \p(x)\ qn
xe[a-l, a + 1]
for some D, and the result follows.
A much deeper result of Roth (see Baker [75]) shows that a)
holds with qn+e replaced by q2+e.
b) Use a) to show that
a:=£w
and
/3:= 2
1
n-l 10
22"
Pi
are transcendental by showing that the partial sum approximations
would violate a).
c) Modify the construction of b) to exhibit uncountably many
transcendental numbers. (In fact, all infinite subseries of a are
transcendental, which exhibits an uncountable set of such numbers.)
d) Show that the algebraic numbers are countable and hence, that
almost all numbers are transcendental. Numbers that can be proved
transcendental because they can be approximated too rapidly by
rationals are called Liouville numbers. More precisely, a is a
Liouville number if for every m there exists a rational/?Iq, q >*1, so
that
1
0<
P
a
<7
<
9"
e) Prove that Liouville numbers are transcendental.
f) Show that the set of Liouville numbers has measure zero.
g) Show that the Liouville numbers have Baire category II. Show, in
fact, that the complement of the Liouville numbers is of Baire
category I. Actually it is the union of Q and a nowhere dense Fa.
(Recall that a set is category I if it is the countable union of
nowhere dense sets and that a set is nowhere dense if its closure has
empty interior. A set is category II if it is not category I. See
Oxtoby [80].)
{On the irrationality of e and it)
a) Prove that e is irrational.
Hint: Suppose e = qlp and consider
,>(ifca:-i
Show that this is an integer strictly between 0 and 1.
b) Show that for any positive integer n,
has the property that / and all its derivatives are integer valued at
x = 0 and x = l and that for x G (0,1),
0</(*)<^.
c) Prove that ep is irrational for integer p ¥=Q.
Outline: Suppose ep = alb with a,b EN. With / as in b), set
F(x) := p2nf{x)-p2n-Yl\x) + ■■■ +f2"\x).
11.2 On the Transcendence of rr 353
Show, by differentiating epxF(x), that
bjo P2n+1epxf(x)dx = aF(l)-bF(0)
is an integer but that the left-hand side above lies strictly between 0
and 1 for large n.
d) Deduce from c) that eq is irrational for all rational q ^ 0.
e) Show that it2 (and hence it) is irrational.
*• Outline: Suppose it2 = alb with a, b G N. Let / be as in b) and
consider
2n~2k
G(x):=b"^(-l)kf2k\x)7r
Then
— {G(x) sin (irx) - irG(x) cos (ttx)} = Tr2a"f(x) sin (ttx)
and
it J anf(x) sin (ttx) dx = G(0) + G(l).
However, G(0) and G(l) are integers while as in c) the integral on
the left is strictly between 0 and 1 for large n.
4. (On the transcendence of e) This exercise outlines Hilbert's proof of
the transcendence of e, a proof which, as LeVeque [77] puts it, is "as
elegant as it is mysterious."
a) Recall that, for k an integer,
/• CO
/fc:=J0 xke~xdx = k\
and thus if/? is polynomial with integer coefficients,
/• CO
I xkp(x)e"xdx^k\p(0)mod(k + l)\
b) Suppose e algebraic. Then there are integers at so that
«„ + a,e + «-,e2 + • ■ • + a„en =0 «„5^0.
354
Pi
ja:=j xm[(x-l)---(x-n)]m+1e-xdx
f CO /*CO /*CO
S--=a0jo +a1eji +--- + anenjn
T^aiefo+--- + ane"fo.
Observe that S+ T = 0.
c) Show that, for infinitely many m, Slm\ is a nonzero integer. To do
this, observe that if y '■= x - k, then
let
and let
r co
^1 =
fc = 0
«oj0 ymp0(y)e y dy
f CO
«*J0 ^^,()-)^^ fcal
where the /?t are particular polynomials with integer coefficients.
Now use a) to see that the first term of S is divisible by m\; that the
rest are divisible by (m + 1)!; and that for infinitely many m the first
term is not divisible by (m + 1)!.
d) Show that limm^„ \T\/m\=0. To do this let
M := max \(x - l)(x - 2) • • • (x - n)\(x + 1)
Osxsn
and show that
flJo
<yc|ajM"
whence
H- = ° —r
/n! V /n!
•0.
e) Observe that, since S + T = 0, c) and d) lead to a contradiction, and
hence e cannot be algebraic.
5. (On the transcendence of ir) The exercise outlines Baker's [75]
synthesis of Hilbert's proof of the transcendence of ir.
11.2 On the Transcendence of rr 355
a) Suppose 8t '■ = 'nr is algebraic. Let / be the lead coefficient of the
minimal polynomial for iir and let 02,, . . , Bd be the other roots of
the minimal polynomial. Since e'n = — 1,
(1 + ^)(1 + ^)---(1 + ^ = 0.
Show that expanding the above yields a nonzero integer n such that
q + e"1 + • • • + ea" = 0
where q ■ = 2d — n and each ai is a nonzero sum of some subset of
the 0,. of the form E ¢5,0,., S; = 0 or 1. Show that any elementary
symmetric function of lal,. . . , lan with integer coefficients is
integer valued. (See Exercise 6.)
b) Let
I(a):=foea~'f(t)dt
where
fity-=inptp~\t-"i)p---(t-a„y
for some prime p. Let
/:=/(^)+ ••• + /(«„).
Show, by using a) and repeated integration by parts, that for
m :=(« + l)p - 1,
m m n
/ = -^/(/)(0)-XS/(j)K).
/=0 ;-0 /t = l
c) Show that / is an integer. To do this, observe that the right-hand
term is a symmetric function of /a1;. . . , lan.
d) Show that for )¥=■ p - 1,
p!|/(/)(0)
while for j-p- 1,
/("_1)(0) = (p -1)1(-/)¾¾ • ••«„)'.
Thus for p sufficiently large,
(/?-l)!|/(p_1)(0) and p\^f(p~1\0).
356 Pi
e) Show that, for ;' ^p — 1, and p sufficiently large
/(/)K) = o.
Show that
Pi
2 2 /">(«,).
/=0 /fc-1
Hence with d)
\J\>(p-l)\
f) Show, however, that for some M independent of/?,
\J\*\XI(at)\*M".
This contradicts e) for large p and finishes the proof.
6. (On symmetric polynomials) A polynomial in n variables is symmetric
if it remains unchanged by any permutation of the variables. An
elementary symmetric polynomial ft in the variables xlt. . . , xn is defined
by
(y-x1)(y-x2)---(y-xn)=y"-fy-1+f2yn
(-1)7,,-
a) Show by induction that any symmetric polynomial in n variables
with integer coefficients can be written as a polynomial with integer
coefficients in the elementary symmetric functions.
b) Let sk : = x\ + • ■ ■ + xk„. Prove Newton's identities:
sk = (-l)k+lkfk + (-l)k+l 2(-l)'/*-A k< n
/=i
and
sk = (~l)k+1 1 (-l)'/*-A
k>n.
7. (Lindemann's theorem) This proof of Lindemann's theorem follows
Baker [75] and assumes some general familiarity with algebraic integers.
(An algebraic number whose minimal polynomial has lead coefficient 1:
observe that if a is an algebraic number and the lead coefficient of the
minimal polynomial is /, then la is an algebraic integer.)
11.2 On the Transcendence of rr
Lindemann's Theorem
If «j,.. . , an are distinct algebraic numbers and pi,. . ., fin a
algebraic numbers, then
)8^+--- + )8,^^0.
a) Suppose that
p1e°1 + --- + p„ea» = 0.
Show that one can assume that
(i) The jSj are ordinary integers,
(ii) There exist integers 0 = n0 < nx < ■ ■ ■ < ns = n so that,
««T + 1» • • • > anr+1
is a complete set of conjugates and
0v>i = -- = 0^.
b) Let / be an integer so that la1}. . . , lan and /^,.
algebraic integers. For p a large prime, let
/■(*)• = — •
For 1 < j < n, let
where
/,(«):= J V-'/KO*.
Show that
np-l „
/, = - S S &/,(/)(a*)
/=0 4 = 1
is an algebraic integer and that
(p-l)\\f\n(ak)
358 Pi
and for p sufficiently large
pH7iy)(«*). } = P-1, k = i
while
p\\f¥\ak) otherwise.
(An algebraic number a is divisible by h if a/h is an algebraic
integer.)
c) Show that |/i • • • /J is a nonzero integer divisible by (p - 1)! and
hence \JX- ■■ Jn\>(p -1)\.
d) Show that there exists C independent of p so that
\h>->Jn\ = 0{C>)
and that this contradicts c).
8. {Lengths and measures)
a) Suppose P is a polynomial with integer coefficients of length L and
degree D. Suppose a is an algebraic number with minimal
polynomial of degree d and length /. (The length is the sum of the absolute
value of the coefficients.) Show that either
P(a) = 0
or
IP, ,, [max(l,|a|)f
1
- Ld~T ■
Hint: Suppose at ■ = a has minimal polynomial
Q(x):=qdxd + --- + q0
and let a2,. . . , ad be the remaining roots of Q. Show that
qDd-P(ai)P(a2)---P(ad)
is a nonzero integer [if P(a1)^0]. Since
|P(aJ|<L-max(l,k|)D
we have d
l^\qDd\\P(ct1)\U[LmSx(l,Wk\)D]
^Z/-1 maxakD^lPK)!
11.2 On the Transcendence of rr 359
where the last inequality requires showing
d
/£kj II max(l,|aj)
k=\
and follows from part b) of this exercise,
b) Let P and Q be polynomials and let
r <-i
/x(P):=exp
fo\og\P(e2"")\dt .
Show that
i) ^e) = M*V(e)
ii) /x(P) =s length (P)
iii) /ui(x - c) = fi<(x- \c\)
iv) f,((x-c")v") = ^x-c)
v) f^(x- c) = max(l, \c\)
vi) f^(cYl (x- a,)) = |c|Il max(l, |a,|).
(fi is called the measure of P.)
The next exercise constructs a form
pn(x)en* + ■ ■ ■+p1(x)ex+p0(x) = oix^1^1^1)
where the pt are polynomials of degree ^m with integer coefficients. Such
forms can be viewed as higher dimensional Pade approximants. Setting
x'-=l in the above leads to a polynomial in e that can, in conjunction with
Exercise 8, be used to prove the transcendence of e. Setting x '■= iir, so that
e,1T = -1, leads to a proof of the transcendence of it. Such forms were
examined by Hermite. It was, however, Mahler [31] who showed how to
base the transcendence proofs on them.
9. {Another proof of the transcendence of e and it)
a) Suppose that we can find Vn so that
Vn,m := Vn := Pne»* + ■ ■ ■ + Ple* +p0 = o(xin+1^+1^)
where each pt is of degree m and pn '■ = xm + ■ • •. Then
Vf+1) = sne"x + ■■■ + s,ex = 0{xn(m+1)-x)
360 Pi
where each s, is of degree m. In particular
y0n + l) = n^Vn_^
and
r(m + \) — xmgx
1
b) Prove that, for sufficiently smooth G, if
G{x):=~\l{x-tTf{t)dt
then G(m+1)(x) =f(x). Use this to show that
\m + l rx
s \m + l rx
^ = ^1^-^-^
dt
(ni)m+1 cx f1 f2 r'"-1
c) Thus
j- x(n+1)(m+l)-1 Jo M{x)
(ml)'
where
K.m = ?lr^+1)(m+1Hl M(x)dt
fo*:=fo-fofo*>>-*i
and where
M(*> :=[a - tj ■ ■ • (l - oro2.:1^2 • • • <r+B-1
X g'l'g'i^g'i'a's* . . . g'1'2—'«*
d) Use c) to show that Vnm, defined as in a), exists and is uniquely
defined for all m and n.
e) Show that if Cs is a circle of radius 8>n, then
= m!(rc!r+1 f e" A
/.
2iri Jcs [t(t -1)---0 -«)]m+1'
7/mf: Use the residue theorem to see that Vnm is of the right form.
11.2 On the Transcendence of rr
361
Observe by expanding e'x and evaluating on circles of large radius
that
Vn,m = 0(x
(n+l)(m + l)-l
)■
f) Show that, with 8 <1,
Pk(x):
/»!(«!)"
I
e'xdt
2iri Jcs [n"=0 (t + k - J)]'
m.
Hint: Show by the residue theorem that pk is a polynomial of
degree m and that, with e),
2 Pkekx = Vn<m .
k=0
Note that
pk(x) = (nl)m+1
' d~
-dt-
m, jx
i*k
- i)m + i
The n '• = 1 case of e) and f) provides an alternate derivation of
the Pade approximant, equation (10.1.15).
g) Use f) to show that
o*
has integer coefficients [dn := LCM(1,...,«)].
h) Let n be fixed. Show from c) that for m sufficiently large,
Vn(x)^0 for x 5^0.
Note that this is trivial for real x^O.
i) Let D > e and let
From c) show that
W»,mW-=d:v„tm(x).
W„>m(x)= 2 akjxUk
■ Osksn
Qsjsm
where the akj are integers. Use c) and the fact that dn < e"(1+e) for
large n (Exercise 6 of Section 11.3) to show that for m + n large,
\Wm,„(x)\*
\x\(n+1)(m+l)-lenMDnm{n\)m+lm\
[(n + l)(m + l)-l]!
362
Pi
Show from f) with 8 -=1/n, that for m + n large,
n2(m+1)Dnm(ml)2n(m + 1)
|a*jl -
(n-l)m+1
j) Let x 5^0 and « be fixed. Then for m sufficiently large,
nm
o<|w„,m(*)|<-3
m
and
l I _- nm m
where cx and c2 are constants, independent of, n and m.
k) Show that e is transcendental.
7/wtf: Let Pm(e) = Wn m(l) for fixed large «. Show that as m^-o°,
Pm(e) is sufficiently small that, by Exercise 8, e must be
transcendental.
1) Show that ir is transcendental.
Hint: Let x'-= iir in i) so that e,1T = -1. Now proceed as in k).
Observe that W (wr) is a polynomial of degree m in ir.
11.3 IRRATIONALITY MEASURES
We examine the rate of approximation of e, ir, and log 2 by rationals. For
example, we show, for p and q integers and q sufficiently large, that
(11.3.1)
<7
>
<7
Estimates such as (11.3.1) are termed irrationality measures. The expected
rate of rational approximation is as follows: If f(x) is a positive nonincreas-
ing function, then
(11.3.2)
<
<7
has infinitely many integer solutions in p and q for almost all a exactly when
CO
(11.3.3) S /(<?) = «>.
¢=1
11.3 Irrationality Measures
Thus, with probability 1, we expect
363
(11.3.4)
a
q
<
1
q log<?
to have infinitely many solutions, while
(11.3.5)
P
a
<7
<
1
<r(iog <?r
usually has only finitely many solutions. This result is due to Khinchin [64].
(See also Exercise 2 of Section 11.2 and Exercise 1 of this section.)
It is standard to the theory of continued fractions that if
(11.3.6)
P
a
<7
<
2q2
p, gEZ
then p/q is a convergent of the simple continued fraction for a, while of any
two consecutive convergents of the continued fraction, at least one of them
satisfies (11.3.6). (See Exercise 2.) Roth's theorem states that for algebraic
a,
(11.3.7)
a
<7
<
1
2+e
e>0
has at most finitely many solutions. (See Baker [75].)
For specific transcendentals exact estimates are usually unknown, but e
can be analyzed very precisely.
Theorem 11.1
If
(11.3.8)
then
(11.3.9)
e~T
y (In - k)\
y (In - k)\
1 log log t„
~2 t>gtn
(-l)k
[1 + 0(1)].
Proof. From Exercise 10 of Section 10.1, with sn'= Pnn(l) and
^ = 6,,.,,(1) we have
364
(11.3.10)
and
(11.3.11)
The result now follows, as an application of Stirling's formula gives
21ogf„
t»
Pi
nlnl r\\ I
(2n)!(2n + 1)! L '
= ^!e-[l + 0(l)].
(11.3.12)
Theorem 11.2
If p, gEZ and
2n + l =
log log f „
[l + o(l)]. D
9
<
4^
then, for some n, plq = sjtn, where sn and f„ are as in Lemma 11.1.
Proof. The continued fraction for (e - 1)/2 is given by
(11.3.13)
e-1
= [0,1, 6,10,14,18,...] := K, «!,...].
If /?„ := (•?„ ~ t„)/2 and q„'-= tn, then one can verify that
(11.3.14) P„ = a„p„-1+pn-2 Po = Q />i = 1
(11.3.15) q„ = an(ln-i + (ln-2 4o = 1 ¢1 = 1-
This, with Theorem 11.1, shows that /?„/<?„ are the convergents for the
simple continued fraction for (e - 1)/2 and that expansion (11.3.13) holds.
(See Exercise 2.) In particular, if
<7
< —2 then
4q
e-1 p-q
2q
<
1_
8q2
and by Exercise 2i), (p - q)llq is a convergent of (11.3.13). □
Corollary 11.1
Let 0 < 8 < 1 and let v be a positive integer. Then
(11.3.16)
<7
1 + S log log g
2u q2log g
11.3 Irrationality Measures
365
has infinitely many integer solutions, while
1 - S log log q
(11.3.17)
2v q log q
has at most finitely many integer solutions.
The above corollary due to Davis [79] shows that ellv is atypical with
respect to the rate of rational approximation [compare (11.3.4) and
(11.3.5)]. For v = l, Corollary 11.1 can be deduced from Theorem 11.1
without reference to Theorem 11.2 on continued fractions. (See Exercise 4.)
For general v the proof is left as Exercise 5.
We now turn to irrationality measures for it, £(2), and £(3). The
approach follows Beuker's [79] elegant treatment of Apery's startling proof
of the irrationality of £(3).
Let
(a)
(b)
Lemma 11.1
r and s be nonnegative integers.
f1 f1 xrysdxdy _
Jo Jo 1 - xy
( n
—2 for some n in Z
U(2)
f1 f1 - xrys log xy dx dy
Jo Jo 1 - xy
■
f n
—5 for some n in Z
■2
k2(
/(3) l3 23 ••• r3
r(3)
r>5
r = s = 0
r> s
r = s>0
r = s = 0
where d,:=LCM(l,2,. . . , r)
Proof. Consider
(11-3.18) I'-\l\lA
1 rl „r+S s+5
^
dx dy-
If we expand (1 - xy) 1 and integrate term by term, we get
(11.3.19)
/=2
;r0 (/c + r + S + l)(/c + s + s + 1)
366 Pi
which on setting 8 = 0 establishes the r = s case of (a). For r > s,
1 1
/-fr1
(11.3.20)
k=o r s
1
k + s + 8+1 k+ r + 8 + 1
r- s
1
•s + l + S
+ ...+
1
r+8
which establishes the rest of (a). If we differentiate I with respect to 8 and
set 8 = 0, v/e get
(11.3.21)
Jo Jo 1-
log*y
•*y
dxdy
and part (6) follows from differentiating (11.3.19) and (11.3.20). D
Theorem 11.3
£(2) = it2/6 is irrational.
Proof. Let pn be the "shifted" Legendre polynomial
(11.3.22) p^y-hiih^1-^
and note that
is a polynomial of degree « with integer coefficients. Consider
rp<w»,w
" Jo Jo
(11.3.24)
xy
dxdy
^ ' Jo Jo (l-xy)n+1
dxdy
where the equality follows on integrating n times by parts with respect to x.
Since for 0<x, y ^1,
(11.3.25)
xy(l-y)(l-x) ^(V5-l\5
1-xy \ 2 /
[with equality for x = y = (V5 - 1)/2] we have, by (11.3.24) and Lemma
11.1(a),
(11.3.26) 0<|/J<(^-^)5
£(2).
11.3 Irrationality Measures 367
On the other hand by the same lemma applied to the first form of /„,
(11.3.27)
where
(11.3.28)
Pn£&)-f
r =Y (n\(n\(n + k)l
Pn £0\k)\kJ n\k\ '
an is an integer and dn = LCM(1,2,. . . , n). It is a simple consequence of
the prime number theorem (see Exercise 6) that, for any 17 > 1,
(11.3.29) dn = 0(ev").
Furthermore, Stirling's formula and a little calculus lead to the estimate
(11.3.30)
c,n
1 + V5
< /3„ < c3nc*
1 + V5
2 I ~<-n -a- v 2
where cx > 0, c2, c3, and c4 are constants. In fact, van der Poorten [79] gives
A,=
[(l + V5)/2]4 [(l + V5)/2]
2irV5 + 2V5 »
5n
l+O
From (11.3.26), (11.3.27), (11.3.29), and (11.3.30) we deduce that for
sufficiently large n,
(11.3.31)
0<
£(2) - ^
in
l
Jn
where yn '■= d\ /3n and an are integers, and where
Thus (11.3.31) proves the irrationality of £(2) = ir2/6. D
Theorem 11.4
For p and g integers and q sufficiently large,
(11.3.33)
and
(11.3.34)
,r2 P
p
IT- -
1
> ^11.86
1
23.72
<7
368
Pi
Proof. From (11.3.30), (11.3.31), (11.3.32), and Exercise 3 we deduce
that, given e>0, if
2
TT p
~6~q
<
1
<7
for sufficiently large q, then plq = ajyn for some n. [Here an, yn, and
8 :=0.092. . . are as in (11.3.31).] Thus we need only verify (11.3.33) for
plq = an/yn. This follows from the observation that, for small 17 > 0 and for
large n,
r /V5-1 \5"
[See (11.3.24) and (11.3.25).] In conjunction with (11.3.27) this leads to
[(V5-1)/2-tj]5"
«2> - 7
in
In
One now finishes the result by using estimates (11.3.29) and (11.3.30) to
show that
[(V5-1)/2-t,]5" 1 1
■v ~~ 3 1:
•n J n J n
3 > ..11.86
for large n and small 17.
The irrationality measure for it follows from the irrationality measure for
it since, for large q,
ir-P-
1
■ + p/q\
9
>
TT + p/q\ q*"-
86.
n
The first irrationality measure for it was due to Mahler [53] (see also
Mahler [67]), who showed that
TV ■
1
>
<7
Q
>2
This was later refined by Mignotte [74] to
TV ■
<7
>
1
<7
<?>2
and by Chudnovsky and Chudnovsky [84] to
9
> ^^65 q large
1
(which can be marginally sharpened).
11.3 Irrationality Measures 369
Theorem 11.5
£(3) is irrational.
Proof. Consider
(11-3.35) /„ := fQ fQ =^- Pn(x)Pn(y) dx dy
where, as in (11.3.22),
We observe that
-log xy _ f1 1_
1-xy Jo 1-(1-.
and rewrite (11.3.35) as
■ xy Jo 1 - (1 - xy)z
" Jo Jo Jo l-(l-*y)z y
An rc-fold integration by parts with respect to x yields
Jo Jo Jo [\-{\-xy)z\n '
which, on substituting w '■= (1 - z)l[\ - (1 - xy)z]> becomes
>W1 - vWn
dx dy dw
■ f f f [x(l~x)w(l-w)y(l -y)]n
Jo Jo Jo [1-(1- xy)w]"+1
where the last equality follows from an rc-fold integration by parts with
respect to y. For 0 < x, y, w ^ 1,
x(l -x)w(l-w)y(l-y) ,^ p*
and from (11.3.36),
(11.3.37) 0</„<2£(3)(V2-l)4".
From Lemma 11.1(6),
(11.3.38)
ft £(3)
dl
where a^, ft, and d\ are integers. Since e3(V2~ 1)4 <1 and ft grows at
most geometrically [see (11.3.42)], we deduce that there exists S' > 0 so that
(11.3.39)
0<
£(3)
K<
(P'ndl)1+S
for large n. This proves the irrationality of £(3). □
Corollary 11.2
For p and q integers and q sufficiently large,
(11.3.40)
£(3)
<7
>
1
<7
13.42 •
Proof. The proof is similar to the proof of Theorem 11.3 and 11.4. One
first calculates ft explicitly from (11.3.35) and (11.3.23),
(11.3.41)
k=22(:
k**0
2 / i i \ 2
n + k
From Stirling's formula one can show that
(11.3.42) Clnc\l + V2)4" < ft < c3rcc*(l + V2)4
for constants c^O, c2, c3, and c4. In fact,
2(1 + V2)2
(11.3.43)
ft
«3/2(2ttV2)3
(1 + V2)4
l + OI-
The remainder of the proof follows much as in Theorem 11.4, and the
details are left as Exercise 8. □
Similar considerations lead to the irrationality measures for log.
Theorem 11.6
Let p, q, and n be integers. Then
log 2
<7
>
(11.3.44)
and for any e > 0, and fixed n> Ne
n + 1 p
1
<7
q large
(11.3.45)
log
<7
>
2+e
q large .
11.3 Irrationality Measures
371
The proof of the theorem is sketched in Exercise 9 or in Alladi and
Robinson [79]. Similar results may be found in Baker [75]. The best known
irrationality estimate for log 2 is also due to Chudnovsky and Chudnovsky
[84], who show that the constant can be reduced to 4.13... in (11.3.44).
Comments and Exercises
Results on the rational approximation of e may be found in Adams [66],
Bundschuh [71], and Davis [79]. Adams shows that the number of relatively
prime integer solutions of
e-P-
< —2 and
<7
\<q<n
behaves like 3 log n/(log log n) (rather than the expected clogn that holds
for almost all numbers).
Transcendence estimates of type
(11.3.46)
|ir-j8|>
1
^cd log d
h large
where /3 is an algebraic number of degree d and height h (the height is the
modulus of the maximum coefficient of the minimal polynomial) due to
Feldman are discussed in Baker [75].
Irrationality measures are often difficult. The best known estimate for ew
is
e —
q
>
1
<7
c log log q
c a constant.
Apery's proof of the irrationality of £(3) does not obviously extend to
other values of £. It is not known whether £(2« + 1) is irrational for n > 1.
Equally, whether (log ir)/ir, e + it, Catalan's constant, and Euler's constant
are irrational is unknown.
1. Let / be a positive nonincreasing function so that
2 f{q)<™.
¢=1
Show that the set of a for which
<
/(<?)
has infinitely many solutions has (Lebesque) measure zero. (The other
half of Khinchin's theorem is more delicate.)
Pi
Hint: Fix N. For each q > N consider intervals of radius f(q)Iq around
the q points 0/q, llq,. . . ,(q-l)lq. The measure of the union of all
these intervals is bounded by
(On simple continued fractions) For integral an a0 > 0, at > 0, i ¥=■ 0,
let
1
[a0, a1,...,an\:=a0+ -
a, H
•i
a2 +
1
«3+ •
and let
1
+ —
a„
[a0, «!,...] := lim[a0, «!,...,«„].
Unless otherwise specified, we assume the continued fraction is
infinite. The nth convergent is defined by
-* := [a0,fl„...,flB].
Hn
a) Show that the convergents satisfy
(11.3.47) Pn = a»Pn-l+P»-2 pt = 0^+1 p0 = «0
(11.3.48) <?" = an<ln-x + <7«-2 qx = «i <7o = 1
(11.3.49) P„q„-i-p„-iq„ = (-l)n-1
(11.3.50) P„9„-2-P»-29» = (-l)Bfl„
/"11 a CI \ ^2" > Pln-2 Pln + 1 < Pln-1
lln 1ln-2 1ln + l 1ln-l
Deduce that pn and qn are relatively prime. Note that {/?„} and
{qn} are increasing sequences.
b) Show that continued fractions are well defined, that is, show that
the limit in the definition always exists.
c) Rational numbers have two representations since
[fl0. •••.«»] = [«<>.•--.0,.-1 + 1] fl« = 1
and
[a0,. . . , a„] = [a0,..., a„_„ a„ ~ 1,1] a„*l.
Show that a number is rational if and only if it has a finite simple
continued fraction. [See e).]
11.3 Irrationality Measures 373
d) Let a'n '■= [an, an+1,. . .] where a := [a0, a1}. . .]. Show that
«-£ = til
<ln 1n(a'n + l<ln+<ln-l)
Thus {p2n/q2n} f a and {p2„+1/?2n + 1} I a.
e) One constructs the simple continued fraction to a as follows
(L J := integer part). Let
«o:= LaJ and ao:= a
and proceed inductively to let
a"+l := a - [a J 3nd a"+1 :== ^+1-1
unless a„ - \_an\ = 0, in which case the algorithm terminates.
Show that
a = [a0,at,. . .]
and if the continued fraction is infinite, then the representation is
unique.
f) Show from d) that
Pn
In
<
1
1
< —
Inln+l q„
g) (Best approximation property) Show that if 0 < q < qn and if
plq^P„lq„, then
Pn
In
- fV
<-.
P
q
- fV
Hint: Prove \pn ~ qna\ < \p - qa\. Show first that one may
assume q„-1<q<q„. Now write q-= uqn +vq„_t, p:=upn +
vPn-u and Pn~ «»« = u(Pn~ 9»«) + "(P„-i ~ ?„-!«)• Show
that u and u are nonzero integers and that u(pn- qna) and
v(Pn-i ~ qn-ia) nave the same sign,
h) Show that if for z>l,
pz + r
qz + s
p, q,r,s<
and
q>s>0 ps — qr—±l
374
Pi
then rls and p/q are consecutive convergents of the continued
fraction for a.
Hint: Write p/q as a continued fraction and show that if
f-"»-
then
a = [a0;. . . ,an,z]
and
z = [an+1,an+2,...].
[There are two representations for p/q [part c)]. Choose the one
for which n satisfies ps — qr = (-1)"- •]
i) Show that if
<7
<
2q2
then p/q is a convergent of a.
Hint; Suppose
P
a
2
<7
0<8<
and
Write
<7
= [a0, au...,an\.
PnZ+Pn-
q»z + ?»-
where pilqi is the /th convergent to /?/^. Show that z > 1 and
apply h).
j) Of two consecutive convergents at least one satisfies
In
<
Hint; By a), if the above fails, then
qn<in
+i
Pn-i
Qn-i
Pn
In
-
Pn
(In
+
Pn + l
(ln +
+
2<ln ' 2^ + I
Hurwitz has shown that of any three consecutive convergents at
least one satisfies
11.3 Irrationality Measures
375
<7„
<
1
V5<?2'
This cannot be sharpened since, for example,
V5-1
1
(Vl-S)q2
8>0
for q sufficiently large. (See, for example, Hua [82].)
k) (Fibonacci numbers) Let
Show that {Fn+1/Fn} are the convergents to (l + V5)/2. Show
that
1 + V5
= [1,1,1,1,...].
1) Show that
Vl + fl2 = [a, 2a, 2a, 2a,. . .].
m) Show that a periodic continued fraction (one where ak+l = ak for
some / and all large k) represents a quadratic irrational (the root
of a quadratic equation with integer coefficients).
n) (Lagrange) Show that a quadratic irrational has a periodic
continued fraction. Thus with m) this characterizes quadratic
irrationals.
Hint: Suppose ra2 + sa + t = 0 and a = [a0, alf a2,. . .]. Then by
d),
P„-l<+Pn-2
a = TT ■
Substitute this into the quadratic equation for a to obtain integers
An, Bn, and Cn with
(11.3.52)
An(a'n)2 + Bna'n+Cn = 0.
Express An, Bn, and Cn in terms of r, s, and t and show that
A „5^0. Show that
^-4,4,,^=/-4^
and that
C„ .A,
376
Pi
Show, using f), that
and
\An\ ^2\ra\ + |r| + |*|
|Cj<2|ra| + |H + UI
\Bn\2<4\AnCn\ + \s2-4rt\.
Hence \An\, \Bn\, and \Cn\ are bounded independently of n. Thus
(Ani, Bni, Cni) = (A„2, B„2, Cn) = (A„3, B„3, Cn) for three distinct
indices n1, n2, and n3, and by (11.3.52) one of
a„. = ot„
or
or
Oil = ot'
«9 n
which implies the periodicity of a.
o) Show, using i), that any integral solution of Pell's equation
n - dm2 = 1
d a positive integer
has nlm a convergent of the continued fraction of Vd.
This outline of the basic theory follows Hardy and Wright [60].
3. Suppose there exists a sequence of rationals {pnlqn} and 8 >0 so that
(11.3.53)
and
(11.3.54)
Then either
Pn
In
<
<ln
+ o(.l)
<ln<<ln + l<<ln
— = — for some n
1 In
or for e > 0 and for q > ce
a —
>
l+i/S+e
Hint: Let \a -p/q\ = l/q1+lls+e , e'>0, and choose n so that
USn + l^q< Wn- Then
11.3 Irrationality Measures
377
and if p/q*pjq„,
l<-3* +
<7„
S ' llS+e'
In <l
The right-hand side, however, is less than 1 for q large.
4. Refine Exercise 3 to deduce Corollary 11.1, with v'=l, directly from
Theorem 11.1.
5. (Irrationality measure for e1/v) Let u be a positive integer. As in
Exercise 10 of section 10.1, let
£0(n-k)lkl\v
and
a) Show that
£0(n-k)\k\ \ v,
ellv--±
t„
n\n\
and that
Hence
t=v
llv an
6 ~Z
1
(2/i +l)!(2n)! " \v
„ (2n)! _,/2„
[1 + 0(1)]
/l!
e-"» [1 + 0(1)].
1 log log f„
[1 + 0(1)].
2u '>§'*
b) As in Exercises 3 and 4, show that if plq^s Jtn for some n then
<7
>
<7
for large g, where cv is a positive constant depending only on v.
c) Deduce Corollary 11.1.
6. Let dn:=LCM(l,...,n).
a) Show that there is a constant C so that
d. < C".
Pi
Hint:
av=rU)
(2n)! . (2n)! .„
v ' and ^-4-<4
where the product is taken over the primes pt, n<pt<2n. Thus
b) Let ir(«) denote the number of primes less than or equal to n. The
prime number theorem asserts that
(See, for example, Hardy and Wright [60].) Use this to prove that
dn = 0(eSn) for any 8 >1.
Hint:
where the product is taken over the primes <n and where at is the
largest integer so that p? ^n.
a) Show that
v> a,a7- ■ ■ at_. 1
fc=I (x + «j) • • • (x + ak) x x(x + «j) • • • (x + aK) '
b) Show that
"v (-l)*"1^-!)!]2 1 2(-l)"_1
1
^i{n-l2)---{n-k2) n /2n
n I
V n
Hint: Use a) with x '■= n and ak '■= —k2.
c) Set
* -=I W)2(n-k)\
"•*" 2 k\n + k)\ '
Show that
)*''[(*-I)']2
(n* - 12) • • ■ (n2 - k2)
(-1) "(s„,/t _ °n~i,k) ~ 73—Ta 73—3^
11.3 Irrationality Measures
379
and
N n-l N 1 N /-1 \«-
A/
= 2(-1)^,,-8,,,)
A/
= 2
(-i)* , i v (-1)^1
/t=l
N + k\fN\ 2 k"1 ,3(2k
2k ' . A*,
d) Let iV^>°° in c) to deduce that
«3) = |2
(-1)-
2«
(This exercise follows van der Poorten [79] and is essentially due
to Apery.)
8. Finish the proof of Corollary 11.2.
9. {Irrationality measures for log)
a) Let p„(x):= {d/dx}V(l - x)". Then as in (11.3.23),
'.w-scx:*)*-1**-
Estimate pn(—m). In particular show that for mE.N there exist
Cj >0, c2, c3, and c4 so that
cInC2(Vm + l + Vmf" <p„(-m) < c3nq(VmTl + Vm)2" .
ifwtf: By Stirling's formula, up to a power of n
n\ln + k
m behaves like
ma(l + a)1+a 1
a\i~a la
— a) a ->
where k-- an. It is now a calculus exercise to maximize the above
by differentiating with respect to a. The maximum occurs at
a '■= yjm/(m + 1).
b) Prove that for large q,
(11.3.55)
log 2
>
1
Pi
Outline: Let pn be as in (11.3.22) and let
(11.3.36) /.-J^*
= r1 pn(x)-pn(-i) dx+r £,,(-1)
Jo 1 + x Jo 1
+ x
dx
= ^+p„(-l)log2
where a„ is an integer and dn is as in Exercise 6. (Note that
(1 + x)\ [pn(x) -p„(-l)].) Furthermore, integrating /n by parts n
times with respect to x yields
(11.3.57) |/J = J-
^^.
(1 + *)
Estimate |/| from above and below by computing that the
maximum of x(l — x)/(l + x) occurs at x '■- V2 - 1. For large n,
(11.3.58) (V2-1-)2"<|/„|<(V2-1+)2" .
From (11.3.56), (11.3.58), and part a) prove (11.3.55).
c) Prove, for e> 0 and n> Ne, that
n + 1 p
log
° n q
> ~2T7 Q lar§e •
<7
Compare (11.3.4) and (11.3.5).
Hint: Consider
Jo 1 + x/m Jo 1 + x/m
and proceed as in b). Note that
■" 1 f1 X"(1~Xr J,~ 1 1
m" Jo /-1 + -/-^+J ax
n /\n
1 "' mn Jo (l + x/m)n+1 mn 4
a) Show that 83(q) and 84(q) are irrational for rational q :=1 In,
n = 1,2, 3,. . . .
b) (Euler) Show that m4 + nA = p2 with m, n, p integral implies
mn = 0.
c) Use b) to* show that 82, 83, and 8A are never all nonzero and
rational.
11.3 Irrationality Measures 381
a) Show that
2 » 1
(11.3.59)
Outline: Let
(-1)"
2n + l
= 2
(2fc + 1)2'
N N / ixm+n A/
i\ i\ / -* \m+n ^v ^
a .= y y (-1) y 1
°N' ~~ (2m+ 1)(2/1 + 1) t^(2)t + l)2
and
%'■= 2
" (-1)"
m=-N m-n
Show that
and that
^ ^, (_ir + n
dN ~~ (2/i + l)(m-/i)
1
JV-rc + 1
Thus prove (11.3.59) by showing that 8N^>0.
b) Evaluate £(2) = ir2/6 from a) and Gregory's formula.
a) Show that for 0 ^ x ^ 1, one has the following functional equation
for the dilogarithm E"=I x7n2:
(11.3.60) log(l-x)logx+2 ^+2 (1~2X)"=^.
n=I n n = I n o
7/mf: Show by taking derivatives that the left-hand side is zero.
Evaluate the constant by letting x-»0.
b) (Euler) Show that
2 1 ™ 1
T2-2^2)2=„?l2V-
c) Let Li3(x) denote the trilogarithm E"=I x7n3. Show that, for
0<x<l, the following identity due to Landen holds:
U3(x) + Li3(l -x) + Li3(y^) - Li3(l)
= y log (1 - x) - ! log (*) log2 (1 - *) + ^ log3 (1 - x)
Pi
d) Deduce that Li3(l) = £(3) and that
i) Li3(|) = ^Li3(l)-^-log2+ilog3(2)
.., T. /3-V5\ 4T. .„, tt\ /3-V5
e) Show that
1 , 3/3-V5
12 log ("1-
log[(V5+i)/2]
£(3) = Li3(l) = 10 J t2 coth (t) dt
and combine this with Exercise 17a) of Section 5.5 to provide
another verification of Exercise 7d). This is discussed in Lewin
[81, Sec. 6.3].
(Euler) Establish
CO
a) ir[cot (irx) - cot (ira)] = 2
a - x
(x - n)(a - n)
1
b) ir3fcot(irx)cosec2 (ttx)] = 2 rr
-»(x-n)
c) 7/
cosec4 (ttx) - - cosec2 (ttx) = 2 r
3 J -„ (x - n)
n=i An -1 2
e)
f)
2
CO
2
CO
V
1
(4rc2-
1
(4rc2-
1
I)2
I)3
tt2-;
16
32-3tt2
64
„=1 (4« - 1)
Hint: Use x := 3 and
77-4 + 3077-2 - 384
768
In - 1 2« + 1 An -I'
11.3 Irrationality Measures 383
(Evaluation of £(2m)) Define {Bn}, the Bernoulli numbers, by
ez-l ' 2 ~0~2m (2m)\
5, :=-¾ and 52B+1 : = 0,/1 = 1,2,3,...
(11.3.61) -^ + | = X B2m -^- \z\ =£2,
a) Show that
/fc=0 \ K /
b) Show, from (11.3.61), that
. . 2iriz
irz cot (irz) = -2^jj—- + iriz
e — 1
= Z (-1) (2ir)
m=o (2m)!
and from the product expansion for sin that
, x 1 V 2z
IT COt (iTZ) = 2j
Z n = :
c) Show that
2 2 •
« - Z
d) Thus
aim)-=1 ^=(-1) 2(2m)! •
«2) = T «4) = ^ «6) = ^
(Evaluation of /3(2m + 1)) Define {£2n}> ^e £u/er numbers, by
\«r _2n
J_ = V (-1) ^2 , . . ir
OS z „=(
a) Show that £0 = 1 and
b) Show that
(1L3'62) c^I = i^l2#- N<2
Z ( o?J£2«-2/t-°
Pi
IT
= X 4k+1p(2k + \)z
2k
cos (irz) k=0
where, as before,
)8(2/: + 1):=2 J"^" +i ■
„=o (2« + 1)
c) Show that
If I , _\ik+i
«2* + ')-5^f(f) ■
d) Thus
3 c 5
■"■ „,„* IT „,„. JIT
)8(1)=- )8(3)=- )8(5) =
4 ^v ' 32 ^w 1536 '
There are similar evaluations for more general L functions.
(Series involving ())
a) Show that
2x arcsin x _ y (2x)2m
vr^5
* m=i /2m
ml
V m
7/mf: Let /:= (arcsin x)l\l - x2. Show that
(l-x2)/=l + ;t/.
Show that
z* ■"=' '2m
ml
V m
also satisfies the above differential equation. (Compare Exercise
16 of Section 5.5.)
b) Show that
» /^ \2m
2(arcsin x)2 = 2 -±-^ .
m=1 2/2m
m
m
11.3 Irrationality Measures
Hint; Use a) and the fact that
d . . .2 2x arcsin*
x -j- (arcsin x) =
2
dx Vl-x
c) Show, by differentiating in a), that
Y (2x)2m _ x2 x arcsin x
«=i (2m\ 1~x (1_* )
\m/
d) Specialize the above series or the derivatives of the above se
show that
Y 1 2ttV3 + 9
^ 2m^= 27
2-^-
m=1 /2m
ml
V m
co 1
E -^-
m
ttV3
2
IT
i 2/2m^ 18
m I
m
m 2
2—= £(W5+9)
m=i /2m x ^7
m
E -^- = ^ + 3
m=1 /2m
\ m,
E 3
2/2m
m
m i
V 3m 4ir „
Z, = -7?+3
—' /2m ^ V3
V
m
(-l)"1"1 _ 1 , 4V5 , V5 + 1
V m
v (-1) i Wi
m-i /2mN 5 25 2
386
Pi
= Alo /Vs + i
-=i /2m\ V5 °gV 2
ml
V m
e) Show that
and that
■
Thus
2 ("ir =2
m=1 2/2m\
V m 1
M*;1)]
2
1 y (2n\ „
Vl - 4* ~0 V n r
01 /l-Vl-4x\
21oH 2x )
„=i n \ n 1
, A v ! /1•3 -••2« ~1 \
f) From e) deduce that
and that
- H 1
V ^ " ' In 1 / • „ x
^ 1 , 1 * = o (arcsin 2x)
„=i 2« + 1 2x v '
■ (2")
ttx (2n + !)16" 3 '
Further material is available in Lewin [81], Lehmer [85], and
Zucker [85]. There is an interesting evaluation of Comtet's, namely,
E
l
17ir4
m i
/2m \ 3240 '
m
g) We conclude with Ramanujan's
E
l
IT ,
(2m + l)2(2m
\ m
-G- 3-log(2 + V3)
with G denoting Catalan's constant.
Hint: Use part a), and parts b) and h) of Exercise 10 in Section
5.6.
Bibliography
[1881] N. H. Abel, Oeuvres Completes (Grondahl and Son, Christiania, 1881).
[64] M. Abramowitz and I. Stegun, Handbook of Mathematical Functions (Dover, New
York, 1964).
[66] W. W. Adams, "Asymptotic Diophantine Approximation to e," Proc. Natl. Acad. Sci.
55 (1966), 28-31.
[66] L. V. Ahlfors, Complex Analysis, 2nd ed. (McGraw-Hill, New York, 1966).
[74] A. V. Aho, J. E. Hopcroft, and J. D. Ullman, The Design and Analysis of Computer
Algorithms (Addison-Wesley, Reading, MA, 1974).
[79] K. Alladi and M. L. Robinson, "On Certain Irrational Values of the Logarithm,"
Lecture Notes in Mathematics 751, (Springer, Berlin, 1966).
[Pr] G. Almkvist, "A Solution to a Tantalizing Poblem," Fiboracci Quart (m press).
[Pr] and B. C. Berndt, "Gauss, Landen, Ramanujan, the Arithmetic-Geometric Mean,
Ellipses, the Ladies Diary, it, and the American Revolution," MAA Monthly (in press).
[76] G. E. Andrews, The Theory of Partitions (Addison-Wesley, Reading, MA, 1976).
[79] , "An Introduction to Ramanujan's 'Lost' Notebook," MAA Monthly 86 (1979),
89-108.
[86] , "The Fifth and Seventh Order Mock Theta Functions," Trans. AMS 273 (1986),
113-134.
[74] T. M. Apostol, Mathematical Analysis, 2d ed. (Addison-Wesley, Reading, MA, 1974).
[76a] , Introduction to Analytic Number Theory (Springer, New York, 1976).
[766] , Modular Functions and Dirichlet Series in Number Theory (Springer, New York,
1976).
[85] J. Arazy, J. Claesson, S. Janson, and J. Peetre, "Means and Their Iterations," in Proc.
19th Nordic Congress of Math., J.R. Stefanson (ed.), Reykjavik (1985).
[80] R. Askey, "Ramanujan's Extensions of the Gamma and Beta Functions," MAA
Monthly 87 (1980), 346-359.
[81] R. Backstrom, "On Reciprocal Series Related to Fibonacci Numbers," Fibonacci Quart.
19 (1981), 14-21.
[88] D. H. Bailey, "The Computation of tr to 29,360,000 Decimal Digits Using Borweins'
Quartically Convergent Algorithm", Math. Comput. 50 (1988), 283-296.
[35] W. N. Bailey, Generalized Hypergeometric Series (Cambridge Univ. Press, London,
1935).
[64] A. Baker, "Approximation to the Logarithms of Certain Rational Numbers," Acta
Arith. 10 (1964), 315-323.
[75] , Transcendental Number Theory (Cambridge Univ. Press, London, 1975).
[81] G. A. Baker and P. Graves-Morris, Fade Approximants, vols. I and II (Addison-Wesley,
Reading, MA, 1981).
387
388
Bibliography
[39] J. P. Ballantine, "The Best (?) Formula for Computing it to a Thousand Places," MAA
Monthly 46 (1939), 499-501.
[50] E. F. Beckenbach, "A Class of Mean Value Functions," MAA Monthly 52 (1950), 1-6.
[61] and R. Bellman, An Introduction to Inequalities (Random House, New York,
1961).
[77] P. Beckmann, A History of Pi, 4th ed. (Golem Press, Boulder, CO, 1977).
[72] M. Beeler, R. W. Gosper, and R. Schroeppel, "Hakmem," MIT Artificial Intelligence
Lab., MIT, Cambridge, MA, 1972.
[27] E. T. Bell, "Algebraic Arithmetic," AMS Colloq. 7, 1927.
[61] R. Bellman, A Brief Introduction to Theta Functions (Holt, Rinehart and Winston, New
York, 1961).
[Pr] B. C. Berndt, The Jacobian Elliptic Functions, Chapter 18 of Ramanujan's Second
Notebook (Springer, to be published).
[Pr] B. C. Berndt, Modular Equations of Degrees 3, 5, and 1 and Associated Theta Function
Identities, Chapter 19 of Ramanujan's Second Notebook (Springer, to be published).
[79] F. Beukers, "A Note on the Irrationality of f(2) and £(3)," Bull. London Math. Soc. 11
(1979), 268-272.
[84] S. Bhargava and Chandrashekar Adiga, "On Some Continued Fraction Identities of
Srinivasa Ramanujan," Proc. AMS 92 (1984), 13-18.
[Pr] A. J. Biagioli, "A Proof of Two Identities of Ramanujan."
[73] G. Birkhoff, A Source Book in Classical Analysis (Harvard Univ. Press, Cambridge,
MA, 1973).
[69] and G. C. Rota, Ordinary Differential Equations, 2d ed. (Blaisdell, Waltham, MA,
1969).
[1888] C. W. Borchardt, Gesammelte Werke (Berlin, 1888).
[66] A. Borel, S. Chowla, C. S. Herz, K. Iwasawa, and J. P. Serre, "Seminar on Complex
Multiplication," Lecture Notes in Mathematics 21 (Springer, Berlin, 1966).
[75] A. Borodin and I. Munro, The Computational Complexity of Algebraic and Numeric
Problems (American Elsevier, New York, 1975).
[86] D. Borwein and J. M. Borwein, "Alternating Series in Several Dimensions," MAA
Monthly (1986) to appear.
[85] , , and K. F. Taylor, "Convergence of Lattice Sums and Madelung's
Constant," J. Math. Phys. 26 (1985), 2299-3009.
[85] J. M. Borwein, "Some Modular Identities of Ramanujan Useful in Approximating ir,"
Proc. AMS 95 (1985), 365-371.
[83] and P. B. Borwein, "A Very Rapidly Convergent Product Expansion for ir," BIT
23 (1983), 538-540.
[84a] and , "The Arithmetic-Geometric Mean and Fast Computation of Elementary
Functions," SIAM Rev. 26 (1984), 351-365.
[846] and , "Cubic and Higher Order Algorithms for it," Can. Math. Bull. 27
(1984), 436-443.
[84c] and , "Explicit Algebraic Nth Order Approximations to Pi," in Approximation
Theory and Spline Functions, NATO ASI Ser., S. P. Singh, J. H. W. Burry, and B.
Watson, Eds. (Reidel, Dordrecht, 1984).
[84rf] , "Reduced Complexity Calculation of Log," Dalhousie Technical Report.
[86] and , "More Quadrau'cally Converging Algorithms for tt," Math Comput. 46
(1986), 247-253.
[Pr] J. M. Borwein and P. B. Borwein, "Approximating it with Ramanujan's Solvable
Modular Equations" (in press).
Bibliography
389
[85] P. B. Borwein, "On the Complexity of Computing Factorials," J. Algorithms 6 (1985),
376-380.
[Pr] , "Problem," SIAM Rev. (in press).
[53] F. Bowman, Introduction to Elliptic Functions (English Universities Press, London,
1953).
[68] C. B. Boyer, A History of Mathematics (Wiley, New York, 1968).
[84] D. Braess, "On the Conjecture of Meinardus on Rational Approximation to e", II," J.
Approx. Theory 40 (1984), 375-379.
[76a] R. P. Brent, "Fast Multiple-Precision Evaluation of Elementary Functions," J. ACM 23
(1976), 242-251.
[766] , "Multiple-Precision Zero-Finding Methods and the Complexity of Elementary
Function Evaluation," in Analytic Computational Complexity, J. F. Traub, Ed.
(Academic Press, New York, 1976).
[76c] , "The Complexity of Multiple-Precision Arithmetic," in Complexity of
Computational Problem Solving, R. S. Anderssen and R. P. Brent, Eds. (Univ. of Queensland
Press, Brisbane, 1976).
[80] and E. M. McMillan, "Some New Algorithms for High-Precision Calculation of
Euler's Constant," Math. Computat. 34 (1980), 305-312.
[83] D. M. Bressoud, "An Easy Proof of the Rogers-Ramanujan Identities," J. Number
Theory 16 (1983), 235-241.
[1875] C. Briot and J. C. Bouquet, Theorie des Fonctions Elliptiques (Paris, 1875).
[26] T. J. I. A. Bromwich, An Introduction to the Theory of Infinite Series, 2d ed.
(Macmillan, London, 1926).
[71] P. Bundschuh, "Irrationalitatsmasse fur e", a#0 rational oder Liouville-Zahl," Math.
Ann. 192 (1971), 229-242.
[84] V. I. Buslaev, A. A. Gonchar, and S. P. Suetin, "On the Convergence of Subsequences
of the nth Row of a Fade Table," Math. U.S.S.R. Sbornik 48 (1984), 535-540.
[71] L. Carlitz, "Reduction Formulae for Fibonacci Summations," Fibonacci Quart. 9
(1971), 449-466, 510.
[71] B. C. Carlson, "Algorithms Involving Arithmetic and Geometric Means," MAA
Monthly 78 (1971), 496-505.
[72] , "The Logarithmic Mean," MAA Monthly 79 (1972), 615-618.
[75] , "Invariance of an Integral Average of a Logarithm," MAA Monthly 82 (1975),
379-382.
[77] , Special Functions of Applied Mathematics (Academic Press, New York, 1977).
[78] , "Short Proofs of Three Theorems on Elliptic Integrals," SIAM J. Math. Anal. 9
(1978), 524-528.
[57] J. W. S. Cassels, An Introduction to Diophantine Approximation (Cambridge Univ.
Press, London, 1957).
[1874] A. Cayley, "A Memoir on the Transformations of Elliptic Functions," Phil. Trans.
164 (1874), 397-456.
[1895] , An Elementary Treatise on Elliptic Functions (Bell and Sons, 1895; republished
Dover, New York, 1961).
[85] K. Chandrasekharan, Elliptic Functions (Springer, Berlin, 1985).
[66] E. W. Cheney, Introduction to Approximation Theory (McGraw-Hill, New York, 1966).
[84] D. V. Chudnovsky and G. V. Chudnovsky, "Pade and Rational Approximation to
Systems of Functions and Their Arithmetic Applications," Lecture Notes in Mathematics
1052 (Springer, Berlin, 1984).
[71] A. Clark, Elements of Abstract Algebra (Wadsworth, Belmont, CA, 1971).
390
Bibliography
J. E. Cohen and R. D. Nussbaum, "Arithmetic-Geometric Means of Positive Matrices"
Math. Proc. Camb. Phil. Soc. 101 (1987), 209-219.
S. A. Cook and S. O. Aandefaa, "On the Minimum Computation of Functions," Trans.
AMS 142 (1969), 291-314.
J. W. Cooley and J. W. Tukey, "An Algorithm for the Machine Calculation of Complex
Fourier Series," Math. Comp. 19 (1965), 297-301.
, P. A. Lewis, and P. D. Welch, "History of the Fast Fourier Transform," Proc.
IEEE 55 (1967), 1675-1677.
D. A. Cox, "The Arithmetic-Geometric Mean of Gauss," VEnseignement Math. 30
(1984), 275-330.
, "Gauss and the Arithmetic-Geometric Mean," Notices of the AMS 32 (1985),
147-151.
J. H. Davenport, "On the Integration of Algebraic Functions," Lecture Notes in
Computer Science 102 (Springer, Berlin, 1981).
C. S. Davis, "A Note on Rational Approximation," Bull. Austral. Math. Soc. 20 (1979),
407-410.
C. Denninger, "On the Analogue of the Formula of Chowla and Selberg for Real
Quadratic Fields," J. Reine Ang. Math. 351 (1984), 171-191.
L. E. Dickson, Introduction to the Theory of Numbers (Dover, New York, 1929).
, History of the Theory of Numbers, vols. I-III (1919, 1920, 1923), reprinted
Chelsea, New York, 1971).
M. Dutta and L. Debnath, Elements of the Theory of Elliptic and Associated Functions
(World Press Private, Calcutta, 1965).
P. Du Val, Elliptic Functions and Elliptic Curves (Cambridge Univ. Press, London,
1973).
A. Eagle, The Elliptic Functions as They Should Be (Galloway and Porter, Cambridge,
1958).
C. H. Edwards, Jr., The Historical Development of the Calculus (Springer, New York,
1979).
A. Erdelyi et al., Higher Transcendental Functions, vols. 1-3 (McGraw-Hill, New York,
1953-1955).
J. A. Ewell, "An Easy Proof of the Triple-Product Identity," MAA Monthly 88 (1981),
270-272.
, "Consequences of Watson's Quintuple Product Identity," Fibonacci Quart. 20
(1982), 256-262.
, "A Simple Proof of Fermat's Two-Square Theorem," MAA Monthly 90 (1983),
635-637.
J. A. Ewell, "On the enumerator for sums of three squares," Fibonacci Quart. 24
(1986), 151-153.
[84a] D. M. E. Foster and G. M. Phillips, "The Arithmetic-Harmonic Mean," Math. Comput.
42 (1984), 183-191.
[846] and , "A Generalization of the Archimedean Double Sequence," J. Math.
Anal. Appl. 101 (1984), 575-581.
[1866] C. F. Gauss, Werke (Gottingen, 1866-1933), vol. 3, pp. 361-403.
[38] C. Gini, "Di una Formula Compressiva delle Medie," Metron. 13 (1938), 3-22.
[80] M. L. Glasser and I. J. Zucker, "Lattice Sums in Theoretical Chemistry," Advances and
Perspectives 5 (1980), 67-139.
[85] D. Goldfeld, "Gauss' Class Number Problem for Imaginary Quadratic Fields," Bull.
AMS 13 (1985), 23-37.
Bibliography
391
[61] B. Gordon, "Some Identities in Combinatorial Analysis," Quart. J. Math. 12 (1961),
285-290.
[84] H. W. Gould and M. E. Mays, "Series Expansions of Means," J. Math. Anal. Appl. 101
(1984), 611-621.
[1881] E. M. Goursat, "Sur l'Equation Differentielle Lineaire qui Admet pour Integrate la
Serie Hypergeometrique," Ann. Sci. I'Ecole Normale Sup. (2) 10 (1881), 3-142.
[80] I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series and Products (Academic
Press, New York, 1980).
[1892] A. G. Greenhill, The Applications of Elliptic Functions (1892; republished Dover, New
York, 1959).
[09] H. Hancock, Theory of Elliptic Functions (1909, republished Dover, New York, 1958).
[28] M. Hanna, "The Modular Equations," Proc. London Math. Soc. 28 (1928), 46-52.
[75] E. R. Hansen, A Table of Series and Products (Prentice-Hall, Englewood Cliffs, NJ,
1975).
[40] G; H. Hardy, Ramanujan (Cambridge Univ. Press, London, 1940).
[59] , J. E. Littlewood, and G. Polya, Inequalities, 2d ed. (Cambridge Univ. Press,
London, 1959).
[60] and E. M. Wright, An Introduction to the Theory of Numbers, 4th ed. (Oxford
Univ. Press, London, 1960).
[77] P. Henrici, Applied and Computational Complex Analysis, vols. 1 and 2 (Wiley, New
York, 1974, 1977).
[12] C. Hermite, Oeuvres, vol. 3 (Paris, 1912), pp. 150-181.
[17] , Oeuvres, vol. 4 (Paris, 1917), pp. 357-377.
[05] and T. J. Stieltjes, Correspondence d'Hermite et de Stieltjes, vol. 2 (Gauthiers-
Villars, Paris, 1905).
[1893] D. Hilbert, Math. Ann. 43 (1893), 216-219.
[85] M. D. Hirschhorn, "A Simple Proof of Jacobi's Two Square Theorem," MAA Monthly
92 (1985), 579-580.
[70] A. S. Householder, The Numerical Treatment of a Single Nonlinear Equation (McGraw-
Hill, New York, 1970).
[82] L. K. Hua, Introduction to Number Theory (Springer, Berlin, 1982).
[84] D. Hughes, Master's Thesis, Dalhousie Univ., Halifax, NS, 1984.
[1796] J. Ivory, "A New Series for the Rectification of the Ellipsis; Together with Some
Observations on the Evolution of the Formula (a2 + b2 -2af>(cos <£)")," Trans. Roy.
Soc. Edinburgh 4 (1796), 177-190.
[1829] G. J. Jacobi, "Fundamenta Nova Theoriae Functionum EUipticarum," 1829; Jacobi's
Gesammelte Werke, vol. 1 (Berlin, 1881-1891), pp. 49-239.
[84] E. Kaltofen and N. Yui, "On the Modular Equation of Order 11," Proc. 1984
MACSYMA User's Conf, pp. 472-485.
[Pr] Y Kanada, Y Tamura, S. Yoshino, and Y Ushiro, "Calculations of it to 10,013,395
Decimal Places Based on Gauss-Legendre Algorithm and Gauss Arctangent Relation,"
preprint (1983).
[57] I. Kaplansky, An Introduction to Differential Algebra (Hermann, Paris, 1957).
[86] M. Keith, "Circle digits a self-referential story," The Mathematical Intelligencer 8
(1986), 56-57.
[64] A. Khinchin, Continued Fractions (Chicago Univ. Press, Chicago, 1964).
[24] L. V. King, On the Direct Numerical Calculation of Elliptic Functions and Integrals
(Cambridge Univ. Press, London, 1924).
392
Bibliography
[1897] F. Klein, Famous Problems of Elementary Geometry, transl. by W. Beman and D. E.
Smith (Ginn and Co., Boston, 1897).
[79] , "Development of Mathematics in the 19th Century," 1928, Trans., Math. Sci.
Press, R. Hermann Ed. (Brookline, MA, 1979).
[1892] and R. Fricke, Vorlesungen uber die Theorie der Elliptischen Modulfunktionen
(Teubner, Leipzig, 1890, 1892).
[81] D. Knuth, The Art of Computer Programming, vol. 2: Seminumerical Algorithms 2nd
ed. (Addison-Wesley, Reading, MA 1981).
[71] A. Krazer, Lehrbuch der Theta-Funktionen (Chelsea, New York, 1971).
[78] H. T. Kung and J. F. Traub, "All Algebraic Functions Can be Computed Fast," J.
ACM 25 (1978), 245-260.
[1899] E. Landau, "Sur la Serie des Invers de Nombres de Fibonacci," Bull. Soc. Math. France
27 (1899), 298-300.
[58] E. Landau, Elementary Number Theory (Chelsea, New York, 1958).
[66a] S. Lang, Introduction to Transcendental Numbers (Addison-Wesley, Reading MA,
1966).
[666] , Introduction to Diophantine Approximation (Addison-Wesley, Reading, MA
1966).
[73] , Elliptic Functions (Addison-Wesley, Reading, MA, 1973).
[78] E. B. Leach and M. C. Sholander, "Extended Mean Values," MAA Monthly 85 (1978),
84-90.
[1811] A. M. Legendre, Exercises de Calcul Integral, vols. 1-3 (Paris, 1811-1819).
[38] D. H. Lehmer, "On Arctangent Relations for it," MAA Monthly 45 (1938),
657-664.
[71] , "On the Compounding of Certain Means," J. Math. Anal. Appl. 36 (1971),
183-200.
[85] , "Interesting Series Involving the Central Binomial Coefficient," MAA Monthly 92
(1985), 449-457.
[66] , J. Lehner, A Short Course in Automorphic Functions (Holt, Rinehart and Winston,
New York, 1966).
[80] Le Petit Archimede, no. 64-65, "Numero Special it" (1980).
[77] W. J. LeVeque, Fundamentals of Number Theory (Addison-Wesley, Reading, MA,
1977).
[81] L. Lewin, Polylogarithms and Associated Functions (North Holland, New York, 1981).
[74] Tung-Po Lin, "The Power Mean and the Logarithmic Mean,' MAA Monthly 81 (1974),
879-883.
[1882] F. Lindemann, "Uber die Zahl tt," Math. Ann. 20 (1882), 213-225.
[1884] J. Liouville, "Sur des Classes Tres Etendues de Quantites dont la Valeur n'est ni
Rationnelle ni meme Reductible a des Irrationnelles Algebriques," Compt. Rend. 18
(1844), 883-885, 910-911.
[81] J. D. Lipson, Elements of Algebra and Algebraic Computing (Addison-Wesley, Reading,
MA, 1981).
[66] W. Magnus, Formulas and Theorems for the Special Functions of Mathematical Physics
(Springer, Berlin, 1966).
[31] K. Mahler, "Zur Approximations der Exponentialfunktion und des Logarithmus, I, II,"
J. Reine Ang. Math. 166 (1931), 118-136, 137-150.
[53] , "On the Approximation of it," Indagiones Math. 15 (1953), 30-42.
Bibliography
393
[67] , "Application of Some Formulae by Hermite to the Approximation of
Exponentials and Logarithms," Math. Ann. 168 (1967), 200-227.
[75] D. Masser, "Elliptic Functions and Transcendence," Lecture Notes in Mathematics 437
(Springer, Berlin, 1975).
[83] G. Miel, "On Calculations Past and Present: The Archimedean Algorithm," MAA
Monthly 90 (1983), 17-35.
[74] M. Mignotte, "Approximations Rationalles de it et Quelques Autres Nombres," Bull.
Soc. Math. France, Mem. 37 (1974), 121-132.
[70] D. S. Mitrinovic, Analytic Inequalities (Springer, New York, 1970).
[16] L. J. Mordell, "Note on Class Relation Formulae," Messenger of Math. 45 (1916),
75-80.
[77] G. Nemeth, "Relative Rational Approximation of the Function e"," Math. Notes 21
(1977), 325-328.
[69] R. Nevanlinna and V. Paatero, Introduction to Complex Analysis (Addison-Wesley,
Reading, MA, 1969).
[79] D. J. Newman, "Approximation with Rational Functions," AMS Regional Conf. Ser. in
Mathematics, no. 41 (1979).
[82] , "Rational Approximation Versus Fast Computer Methods," Lectures on
Approximation and Value Distribution (Presses de l'Universite de Montreal, Montreal, 1982),
pp. 149-174.
[85] , "A Simplified Version of the Fast Algorithms of Brent and Salamin," Math.
Comput. 44 (1985), 207-210.
[84] M. Newman and D. Shanks, "On a Sequence Arising in Series for it" Math. Comput.
42 (1984), 191-217.
[56] I. Niven, "Irrational Numbers," Cams Math. Monograph 11 (MAA, 1956).
[78] M. Nyvoll, 'Tilnoermelsesformler for Ellipsebuer," BIT 25-26 (1978), 70-72.
[80] J. Oxtoby, Measure and Category, 2d ed. (Springer, New York, 1980).
[81] G. M. Phillips, "Archimedes the Numerical Analyst," MAA Monthly 88 (1981),
165-169.
[84] , "Archimedes and the Complex Plane," MAA Monthly 91 (1984), 108-114.
[73] H. Rademacher, Topics in Analytic Number Theory (Springer, Berlin, 1973).
[60] E. D. Rainville, Special Functions (Macmillan, New York, 1960).
[84] K. G. Ramanathan, "On Ramanujan's Continued Fraction," Acta Arith. 43 (1984),
209-225.
[14] S. Ramanujan, "Modular Equations and Approximations to it," Quart. J. Math. 45
(1914), 350-372.
[62] , Collected Papers (Chelsea, New York, 1962).
[77] R. Rankin, Modular Forms and Functions (Cambridge Univ. Press, London, 1977).
[73] H. E. Rauch and A. Lebowitz, Elliptic Functions, Theta Functions and Riemann
Surfaces (Williams and Wilkins, Baltimore, MD, 1973).
[84] P. Ribenboim, "Consecutive Powers," Expo. Math. 2 (1984), 193-221.
[85] , "Representation of Real Numbers by Means of Fibonacci Numbers," L'Enseigne-
ment Mathematiques 3 (1985), 249-259.
[27] J. F. Ritt, "Meromorphic Functions with Addition or Multiplication Theorems," Trans.
AMS 29 (1927), 341-360.
[48] , Integration in Finite Terms (Columbia Univ. Press, New York, 1948).
[72] M. Rosenlicht, "Integration in Finite Terms," MAA Monthly 79 (1972), 963-972.
394
Bibliography
[76] E. Salamin, "Computation of it Using Arithmetic-Geometric Mean," Math. Comput. 30
(1976), 565-570.
[82] T. Sasaki and Y. Kanada, "Practically Fast Multiple-Precision Evaluation of log (*)," J.
Inf. Process. 5 (1982), 247-250.
[50] H. C. Schepler, "The Chronology of Pi," Math. Mag. (1950), 165-170, 216-228,
279-283.
[77] I. J. Schoenberg, "On the Arithmetic-Geometric Mean," Delta 7 (1977), 49-65.
[82] , Mathematical Time Exposures (MAA, Washington, DC, 1982).
[76] B. Schoenberg, Elliptic Modular Functions (Springer, Berlin, 1976).
[71] A. Schonhage and V. Strassen, "Schnelle Multiplikation Grosser Zahlen," Computing 7
(1971), 281-292.
[67] A. Selberg and S. Chowla, "On Epstein's Zeta-Function," J. Reine Ang. Math. Ill
(1967), 86-110.
[82] D. Shanks, "Dihedral Quartic Approximations and Series for it," J. Number Theory 14
(1982), 397-423.
[62] and J. W. Wrench, Jr., "Calculation of it to 100,000 Decimals," Math. Comput. 16
(1962), 76-99.
[1853] W. Shanks, Contributions to Mathematics Comprising Chiefly of the Rectification of the
Circle to 607 Places of Decimals (G. Bell, London, 1853).
[85] D. Singmaster, "The Legal Values of Pi," Math. Intelligencer 7 (1985), 67-72.
[66] L. J. Slater, Generalized Hypergeometric Functions (Cambridge Univ. Press, London,
1966).
[85] E. U. Stickel, "Fast Computation of Matrix Exponential and Logarithms," Analysis 5
(1985), 163-173.
[74] K. B. Stolarsky, Algebraic Numbers and Diophantine Approximation (Marcel Dekker,
New York, 1974).
[75] , "Generalizations of the Logarithmic Mean," Math. Mag. 48 (1975), 87-92.
[80] , "The Power and Generalized Logarithmic Means," MAA Monthly 87 (1980),
545-548.
[63] D. Sweeney, "On the Computation of Euler's Constant," Math. Comput. 17 (1963),
170-178.
[Pr] Y. Tamura and Y Kanada, "Calculation of it to 4,196,393 Decimals Based on
Gauss-Legendre Algorithm," Math Comput. (in press).
[1893] J. Tannery and J. Molk, Fonctions Elliptiques, vols. 1 and 2 (1893; republished Chelsea,
New York, 1972).
[49] J. Todd, "A Problem on Arctangent Relations," MAA Monthly 8 (1949), 517-528.
[75] , "The Leminiscate Constants," Communication of the ACM 18 (1975), 14-19.
[79] , Basic Numerical Mathematics, vol. 1 (Academic Press, New York, 1979).
[84] L. Trefethen, "The Asymptotic Accuracy of Rational Best Approximations to ez on a
Disk," J. Approx. Theory 40 (1984), 380-383.
[65] F. G. Tricomi, "Sull 'Algoritmo Iterativo del Borchardt e su di una sua Generaliz-
zazione," Rend. Arc. Mat. Palermo 14 (1965), 85-94.
[79] A. van der Poorten, "A Proof that Euler Missed.. ., Apery's Proof of the Irrationality
of f(3)," Math. Intelligencer 1 (1979), 195-203.
[85] S. Wagon, "Is tr Normal," The Mathematical Intelligencer 7 (1985), 65-67.
[74] M. Waldschmidt, "Nombres Transcendants," Lecture Notes in Mathematics 402
(Springer, New York, 1974).
Bibliography
395
[69] J. L. Walsh, "Interpolation and Approximation by Rational Functions in the Complex
Domain," AMS Colloq. Publ. 20, 5th ed. (1969).
[Pr] G. Walz, "Effiziente Berechnung spezieller Funktionen mittels asymptotischer Ent-
wicklungen und Eliminationsprozeduren (Mannheim).
[29] G. N. Watson, "Theorems Stated by Ramanujan (VII): Theorems on Continued
Fractions," J. London Math. Soc. 4 (1929), 39-48.
[32] , "Some Singular Moduli (1)," Quart. J. Math. 3 (1932), 81-98.
[33] , "The Marquis and the Land Agent," Math. Gaz. 17 (1933), 5-17.
[35] , "Generating Functions of Class Numbers," Compos. Math. 1 (1935), 39-68.
[08] H. Weber, Lehrbuch der Algebra (1908 republished Chelsea, New York, 1980).
[67] K. Weierstrass, Werke, vol. 2 (republished Frankfurt, 1967), pp. 341-362.
[76] A. Weil, Elliptic Functions According to Eisenstein and Kronecker (Springer, New York,
1976).
[27] E. T. Whittaker and G.N. Watson, A Course of Modern Analysis, 4th ed. (Cambridge
Univ. Press, London, 1927).
[84] J. Wimp, Computation with Recurrence Relations (Pitman, London, 1984).
[80] S. Winograd, "Arithmetic Complexity of Computations," SIAM Reg. Conf. Ser. in
Applied Mathematics 33 (1980).
[60] J. W. Wrench, Jr., "The Evolution of Extended Decimal Approximations to ir," Math.
Teacher 53 (1960), 644-650.
[77] I. J. Zucker, "The Evaluation in Terms of T-Functions of the Periods of Elliptic Curves
Admitting Complex Multiplication," Math. Proc. Cambridge^ Phil. Soc. 82 (1977),
111-118.
[79] , "The Summation of Series of Hyperbolic Functions," SIAM J. Math. Anal. 10
(1979), 192-206.
[84] , "Some Infinite Series of Exponential and Hyperbolic Functions," SIAM J. Math.
Anal. 15 (1984), 406-413., .
/2k\
[85] , "On the Series E~=1 II k~" and Related Sums," J. Number Theory 20 (1985),
92-102.
[76a] and M. M. Robertson, "Some Properties of Dirichlet L-Series," J. Phys. A 9
(1976), 1207-1214.
[766] and •, A Systematic Approach to the Evaluation of S(m„^00)(am2 + bmn +
cn2)~\" J. Phys. A 9 (1976), 1215-1225.
K(k)
Symbol List
Symbol Description Section
AGM Arithmetic-Geometric Mean Iteration 1.1
Af(-,-) Common limit of the AGM 1.1
AG(-, •) Common limit of the AGM 1.1
a„,bn,cn Variables in the AGM 1.1
kn Legendreformofthe AGM 1.1
k' k':=yl~kz, Complementary modulus 1.1
/ the derivative of / 1.1
J "nil in /*1 Jf
o Vl - k2 sin2 9 ~ J" V(l - <2)(1 ~ kY)
complete elliptic integral of the 1st kind
p'2 . r1 vT-HtV
E(k) E(ky-"]o Vl - fc2 sin2 6 M = Jo —r==f- <* 1.3
complete elliptic integral of the 2nd kind
K'(k) K'(k):=K(k') 1.3
E'(k) E'(k):= E(k') 1.3
F(a,b;c,z) Hypergeometric function 1.3
(2i-l)!! (2i-l)!!:=l-3-5-...-(2i-l) 1.3
1(-,-) Ha.by.-r ., 2de 2 , -IW*) 1.4
Jo Va2 cos2 61 + 62 sin2 61 « ™'
/(•,•) /(a,6):=Jo Va2 cos2 6" + 62 sin2 6" <# = aE'^-j 1.4
G G(k)-=klnk'K(k) 1.5
G* G*(jfc) = klnk'K'(k) 1.5
T T(a;):=Jo e~V_1df, Gamma function 1.6
396
Symbol
Description
Section F
p
sn(u, k)
cn(u, k)
dn(u, k)
P
»2
%
%
fo,yy.=^f-\\-ir>dt.
™<«.*) d(
Jo . V(i-<2)(i-^2)
r<--*> dt
h V(i - ?W2 + kV)
(•*.<«,*> dt
h V(l - t2)(t2 - k'2)
P(z)-~ 2 + E (, ,2 J
2 ,61' V(Z - W) W /
Weierstrass function
rt = — ™
rt = — ™
04(<7):= 2 (-l)V2
Beta function 1.6
1.7
2.7
1.7
2.7
1.7
2.7
1.7
2.1
2.1
2.1
r2 r2(n) -~ number of representations of n
as a sum of two squares 2.2
0,(.5) $(.5) := 0,(9) where q:=e~" 2.2
k(q) k(q):=k = e\{q)iel(q) 2.3
*'(*) *'(«):=*'=»J(«)/»U«) 2.3
*W AX*)-f »?(9) 2.3
9 , = «,-*'<*>'«<*> 2.3
yt(s) fc(i):=yt(?) where q:=e~" 2.3
0,(z, g) General theta functions 2.6
0y(<7) Theta functions in q, 2 := 0 2.6
dj(z) Theta functions in 2, g suppressed 2.6
0(2) Theta functions in 2, q and / suppressed 2.6
Qo e0(?):= n (I-?2") 3.1
C, 6,(9):= n(l + <72") 3.1
62 e2(<7):=n(l + <72"-1) 3.1
e3 &(*)=-n a-*ta~!) 3.1
A* A*(r) :=*(?) where 9 = e_w7 3.2
0f 0*(9): = 2E (-l)"(2n + l)9("+!,2)2 3.2
n-0
Symbol
Description
Section F
05 8,(,):-2 2 (-l)V2-"2)2 3.2
e; »6*(?):=2i;(-ir("-i'>+i)?("+"i'! 3.2
/.=0
1? 91/!2Go, Eta function 3.2
/ <r"24e3 3.2
/, . V291,I2e1 3.2
a <r"24e2 3.2
2' 2' am.n-= 2 <*„,„ 3.2
G„ G„:=(2A;A;')~"I2=2~"4/(V=n) 3.2
,«"2^'2
Sn - g,
'-:-(|t) =2-"4/1(V-H) 3.2
) Gaussian or g-binomial coefficients 3.3
(«). («).=- fi(i-<r)/(i-<rm) 3.3
r4 r4(n) := number of representations of n as a sum
of four squares 3.5
<rx ^(/1):=^(1 3.5
w(n) w(n) ••= a-jin) + tr^odd (n)) 3.5
¢-1
S(P,1) S(p, <?):= S^'" 3.5
r-0
Af(/) Af(/):=Afs(/):=Jo f{x)^~ldx, Mellin transform 3.6
I i(s) := 2 n"! , Riemann zeta function 3.6
n = l
g(t) g(t) : = («,(») -1)/2 3.6
/3"
L(/3) L(j3):° E n„ , |/3|<1. Lambert series
n=l 1 P
3.7
F„ F0: = 0, F, :=1, F„+I:=F„ + F„_1, Fibonacci numbers 3.7
(Ex.3)
I<„ L0 := 2, Lj := 1, Ln+1:= L„+ L„_, , Lucas numbers 3.7
ST Sif:={im(T)>0} 4.3
sr sr ••= ar u {;<*>} u {Q> 4.3
T-group Inhomogeneous modular group 4.3
A-group 4.3
Fr Fundamental set of T-group 4.3
FA Fundamental set of A-group 4.3
398
Symbol
Description
Section Formula
Sr
Si
u
(I
c
(J
G
m
/(0
m-
!)
"!)
?)
J)
:=fc2(0 =
._ 4 (1-
' 27 A2(
= 1728/(0
k(«)J '
A(0 + A2(0)3
:o(i-a(o)2 :
{(c bd}-<«l-bc = p;a,b,c,
transformations of order p
V
A,:-
V
Br-
-(!!)
-G i).
■(S3
■(! ?)■
i = 0,l,...
/ = 0,1,..
,:=«"'
Kle
dez]
,,p-l
-,p-i
4,3
(Ex. 1)
4.3
(Ex. 1)
4.3
(Ex. 1)
4.3
(Ex. 1)
4.3
Klein's absolute invariant 4.3
/ /(0:=1728/(0 4.3
Tp {(" ^j:ad~bc = p;a,b,c,d&ZJ 4.4 4.4.1
transformations of order p
v-(5 ?)
A .-4,--(5 p)> / = 0,1,..., p-1 4.4
B, 5,:=(J J'), / = 0,1,..., p-1 4.4
W„ W,(*. A) :=11(^-^), A,:=A = B,. 4.4 4.4.2
i=0
pth order modular equation for A
p
FP Fp(x,j)-Yl(x-Jd, /,:=/-/1,. 4.4 4.4.6
i=0
pth order modular equation for /
Qp Q adjoin the pth roots of unity 4.4
fip np(u, a) := 11 («- «i) 4.5 4.5.1
i=0
pth order modular equation for u
up «„:=(-l)<"2~!)'8(A(9''))"8:=('-l)<''2"I)'8"(9''), <7 = «''"' 4.5 4.5.1
«i:=(A(a8i9'"'))1'8:=«(a8A:91"'), *=0,1,... , p-1 4.5 4.5.1
„ = /»a-'"
Mp MpV<ky-= e*fq"')=lm' MultiPlieroforderP 4-6 4-6-1
k kp := yt(e_w?) pth singular value 4.6
/, /,:=^-^-^) 4.6
a(r) «W:=f-^p, k:=k(e'^) 5.1 5.1.1
Symbol
Description
Section Formula
8(r):=V7-2o(r) 5.1 5.1.9
^¢:=¾¾^ 5.2 ,5.2.7
5.2
5.2
5.4
5.4
5.4
5.4
5.4
5.2.11
5.2.12
5.4.1
5.4.1
5.4.1
5.4.2
P(<7):=1-24S -^-j- • 5.2 5.2.8
/i-i 1- q
nick'2 dM
^•^:=w)"^(/,")+m;2(/,")/2~m2
<r(p):=Rp(k',k), k:=e~"^
For ȣN, compute kn+l by solving Wp{k\, k2n+i) = 0
m„-~M~\kn,kn+l)
rn:-Rp(kn,kn+1)
«. := [«.»■. + «;(1 + *2) -p(l + ^+1)]/3
^,(^:^):=2^=^^, where 5.5 5.5.1
n-0 (C)„ «!
(a)„ := Ye ■) ~o(o + 1) ■ ■ ■ (" + " ~ 1), rising factorial 5.5
*.(*):-f - a^i(§ -*.§ + *;!; **) 5.5
£,(*) =- f ' a^.(- § - *, | + *; 1; **) 5.5
G:=2 7^ rt5 , Catalan's constant 5.6
.-o (2« + D (Ex.10)
b„ = fl(a„) if b„ = 0(a„) and a„ = 0{bn) 6.1
Bit complexity 6.1
Operational complexity 6.1
Fast Fourier Transform 6.2
Bit complexity of multiplication 6.4
6.3
(Ex, 3)
Bit complexity of division 6.4
Bit complexity of root extraction 6.4
a a 6:=min(a, b) 8.1 8.1.1
a v b ■ max(a, b) ' 8.1 8.1.1
tM{x) : = M(x, 1), M a mean, the trace 8.1
R-{0> 8.1
Mf(a,b):=rlM(f(a),f(b)) 8.1
Mp denotes Mf when f(x) = xp 8.1
5.5.3
5.5.4
400
Symbol
Description
Section Form
pM
Lp
Gs,
SP
M>N
M~N
M®N
M®gN
[M,N]g
M®„N
W, N]„
N*
M
R(a;8,8';x\y2)
F2
F%
F*
arcsl(;c)
arcslh(*)
la" + b"\Up
Hp(a,b):=y— J , a,b>0, Holder means
M(ap, b") _ Mp(a,b)
pM(a, b) :=
Lp(a,b) =
M(ap-\bp~l) MpZ\(a,b)
ap + bp
p-\ , ,P~i
+ bp
Lehmer means
[s . ii-jl/i-r
p -\up-i
p(«-bj-
P 7*0,1,
8.1
i,(,t):»«(^«.[-^]
Stolarsky means
j?(a, b) : = S0(a, b) , Logarithmic mean
$(a, b) : = Si (a, b) , Identric mean
„ , L, rs(ar- br)V'ir-s)
M(<t>(a), <t>(b)) = <t>(N(a, b)) , dominance 8.2
if M > 0jiV and N > <j>2M, equivalence 8,2
Compound Mean of M and N 8,3
M comparable to N 8.3
The Gaussian product (Def. 8.2) 8.3
The Gaussian mean iterative process (Def. 8.2) ■ 8.3
The Archimedean product (Def. 8.2) 8.3
The Archimedean mean iterative process (Def. 8.2) 8.3
N*(a,b) = N(M(a,b),b)
M(a, b) := M(a a b, a v b)
«(a;S,S';^/):=-^_/o «■-«(» + x2ys(t + y2)~s' dt 8.
Cv:=Ft®Fj (/,/ = 1,2,3,4)
ul 1.1. a + b
FM b) •= —J—
F2(a, b):=Vab
Ua,b):=^±±7a
F4{a,b):=^±^b
arcslW:=|o'(l-/)-"2
arcslh(*):=Jo (1 + s4)~in ds
ds
8.3
5
8.5
8.5
8.5
8.5
8.5
8.5
8.5
401
Symbol
Description
Section F
a a :=(<Jj,... ,aN) 8.7
LAS) LAS) : = (S af )/(2 af_1) , Lehmer means 8.7
I \ E, \VP
Hp(S) Hp(S):={—Z,a!) , Holder means 8.7
® Af, The common limit (when it exists) oi [M ,.. ., M ] 8.7
[M\ ..., MN] a'„+1 := M'(S„), or vectorially a„+1 = M(S„) 8.7
®gAf' iV-dimensional Gaussian product 8.7
N
®„Af' iV-dimensional Archimedean product 8.7
;-i
• t , % >t / x. a + Vac + c
He(a, c) He(a, c) := , Heronian mean 8.7
S(a,c) S(a,c): = He®sG(a,c) 8.7
MSNf %
(«,,«„....«„)-/ [2oiy^)j 8.7
TG(/: F) The algebraic transformation group of / over F 8.8
bA2s) 6,(2*):- 2' J"1] ' -av 9.2
b2(2s) 6,(2«):- £' -¾¾
64(2»):
a (*)'■ =
«*)■'-
£**(*)
(«; ?).
v' (-
i,y,t,i— 0 +J
y (-1)"
.ti (2»+iy
:=S(±rf|«Ks
t-0
jy+;'+*+/
r2 + yt2 + /2)s
-2^)^),
9.2
64(2*) 6,(2.):- 2 t*l* + tz + #v 9.2
(,y,t,i—« (' +/ + * +' )
«(*) <»(*)■■- 2 ^1¾^ = (1 -2^)^), ' 9.2
m m:-f. /ijPiv. 9.2
L±,(s) L±,(s):=2(±<i|nKs 9.2
(Ex. 6)
(«;?)- (a; 9).:=11(1-V) 9.4
(Ex.7)
«,(») M,(*-*a) 9-5
Np Np := i?2(o)/i?2(o"") , i? given by 3.2.9 9.5
Ds D6:={2eC||z|sS> 10.1
P„ Algebraic polynomials of degree n 10.1
ll/L ll/L:=sup|/W| 10.1
£„(/,/1) e„(/,/l):=rnm||/-/>L 10.1
402
Symbol
Description
Section Form
RnU.A) «.(/,i4)=-«.(/)= miri ||/-p/q\\A 10.1 10.1
P>QePn
= lim 1 + - + - + • • • + log m , Euler's constant 10.2
m-»«L 2 3 m J
y -y : = Urn 1 + - + -+---+ log m , Euler's constant 10.2 10.2.
m-.«.|_ 2 3 m J
d„ d„: = LCM(l,2, ...,n) 11.3
[0,,,0,,...,0,,] [o0, a,,..., o„] := o0+ — ^-^
0, + —— v " ;
'• 1
+ —
simple continued fraction
[a0,a„...] Jim [a0, a,, . .., o„] 11.3
(Ex. 2)
Z 22"1
B. 7^ + 2=2^(2^ ^S2w 1L3 1L3
Bi = ~2 and B2«+i:=0' n = l,2,3,
Bernoulli numbers
F 1 __ y (-l)nE2nz2" „
2" cosz~~£0 (2n)! ' |2|<2>
Euler numbers 11.3 11.3
403
Author Index
Numbers in parenthesis indicate references in the Bibliography. "Pr" indicates references in press
Abel, 5, 28
Abramowitz and Stegun (64), 15, 321
Adams (66), 371
Aho, Hopcroft and Ullman (74), 201, 203, 206,
211
Alladi and Robinson (79), 371
Almqvist (Pr), 99
Almqvist and Berndt (Pr), 3, 196, 371
Andrews (76), 67, 86
Andrews (86), 286
Apery, 365, 371, 379
Apostol (74), 37, 40
Apostol (76a), 86
Apostol (76b), 116
Arazy et al. (85), 269, 272
Archimedes, 243, 249, 337, 338
Aristophanes, 347
Askey (80), 89, 308
Backstrom (81), 98, 99
Bailey, D. (88), 211, 341, 342
Bailey, W., 181
Bailey, W. (35), 184
Baker, 323, 348
Baker (75), 349, 35J, 354, 356, 363, 371
Baker and Graves-Morris (81), 320, 323
Ballantine (39), 347
Baxter, 80
Beckenbach (50), 236, 269
Beckmann, 346
Beckmann (77), 342
Beeler et al. (72), 220, 222
Bell (27), 36
Bellman (61), 39, 55, 56, 86, 89
Benson, 301, 302, 303
Berndt (Pr), 142, 150, 177, 196, 309
Bernoulli, Jacques, 340
Bernoulli, Jean, 340
Beukers, 195
Beukers (79), 365
Bhargava and Chandrashekar Adiga (84), 81
Biagioli (Pr), 146
Birkhoff.(73), 10, 18, 116
Birkhoff and Rota (69), 9
Bohr, 89
Borchardt, 250, 256, 257, 263
Borchardt (1888), 272
Borodin and Munro (75), 206, 208
Borwein, Borwein and Taylor (85), 290, 291,
292
Borwein, D. and Borwein, J. (86), 291
Borwein, J.M. (85), 163, 164
Borwein, P.B. (85), 218
Borwein, P.B. (Pr), 254
Borwein and Borwein (84a), 44, 50, 51, 222,
223
Borwein and Borwein (84b), 108, 174
Borwein and Borwein (84c), 174
Borwein and Borwein (84d), 222, 225
Borwein and Borwein (86), 174
Borwein and Borwein (Pr), 313
Bouyer, 341
Bowman (53), 29
Boyer (68), 347
Braess (84), 325
Brent, 51
Brent (76a), 48, 222, 229
Brent (76b), 222
Brent (76c), 213, 216, 217, 222, 329, 330
Brent and McMillan (80), 336
Bressoud, 78
Bressoud (83), 77
405
406
Author Index
Bring, 136
Bromwich (26), 189, 291, 344
Brouncker, 338, 345
Bundschuh(71), 371
Buslaev, Gonchar and Suetin (84), 323
Carlitz(71), 71
Carlson, 248, 257, 261
Carlson (71), 7, 256
Carlson (75),261
Carlson (78), 262
Catalan, 66
Cauchy, 76, 77, 292
Cayley (1874), 106, 133, 138, 140, 142
Cayley (1895), 22, 102, 138
Chandrasekharan (85), 116
Cheney (66), 321
Chudnovsky and Chudnovsky (84), 368, 371
Clark (71), 349
Clausen, 179, 188, 189, 190
Cohen and Nussbaum (87), 273
Comtet, 386
Cook, 69, 213
Cook and Aanderaa (69), 69
Cooley, Lewis and Welch (67), 65
Cooley and Tuckey (65), 65
Cox (85), 15
Dase, 340
Davenport (81), 226
Davis (79), 365, 371
Denninger (84), 89
Dickson (29), 293
Dickson (71), 55, 66, 85, 141, 284, 287, 292
Dirichlet, 86
DuVal (73), 23, 30, 133
Eagle (58), 29
Edwards (79), 252
Epstein, 303
Erdelyi et al. (53), 10, 30, 55, 178, 179, 181
Euler, 3, 18, 64, 67, 77, 189, 264, 286, 306,
307, 340, 344, 346, 348, 381, 382, 383
Ewell (81), 66
Ewell (82), 306
Ewell (83), 86
Ewell (86), 151
Feldman, 371
Felton, 341
Ferguson, 340
Fermat, 82, 286
Foster and Phillips (84a), 260
Foster and Phillips (84b), 247, 252
Gammel, 323
Gauss, 1, 3, 5, 8, 10, 28, 43, 51, 85, 86, 250,
256, 284, 286
Gauss (1866), 5, 7, 35, 44, 48, 52, 65
Gelfond, 348
Gini (38), 233
Glasser and Zucker (80), 68, 290, 293, 303
Goldfield (85), 295
Gordon (61), 306, 308
Gosper, 341
Gould and Mays (84), 263
Goursat(1881), 185, 186
Gradshteyn and Ryzhik (80), 10
Grave, 345
Graves-Morris, 323
Greenhill (1892), 133, 167
Gregory, 339, 348, 381
Guilloud, 341
Hancock (09), 29
Hanna (28), 133
Hardy, 80, 187
Hardy (40), 141, 314
Hardy, Littlewood and Polya (59), 235, 242,
269
Hardy and Wright, 80
Hardy and Wright (60), 66, 67, 85, 86, 95,
281, 284, 349, 376, 378
Hermite, 135, 264, 348, 359
Hermite and Stieltjes (05), 264
Hilbert, 348, 353, 354
Hilbert (1893), 348
Hirschhorn (85), 285
Holloway, 188
Householder (70), 215, 216
Hua (82), 294, 349, 375
Hughes (84), 175
Ivory (1796), 12, 196
Jacobi, 5, 10, 20, 22, 28, 38, 52, 65, 68, 73,
74, 85, 106, 143, 180, 252, 261, 284, 301,
306
Jacobi (1829), 36, 106
Jones, 339
Joubert, 142
Kaltofen and Yui (84), 133
Kanada, 224, 225, 341
Karatsuba, 211
Keith (86), 342
King (24), 16, 18, 44, 52,59,60
Klein, 116
Klein (1897), 349
Author Index
407
Klein (79), 7
Klein and Fricke (1892), 116, 126
Knuth(81), 203, 211,212, 218
Kronecker, 135, 293, 296, 297
Kummer, 179
Kung and Traub (78), 216
Lagrange, 3, 82, 321, 375
Laguerre, 99
Lambert, 99, 348
Landau, 268, 269
Landau (1899), 94
Landau (58), 86, 294
Landen, 381
Lang (66), 349
Lang (73), 29, 30, 116, 123
Leach and Scholander (78), 237
Legendre, 3, 10, 23, 26, 152, 153, 154, 178,
348
Lehmer, 264, 265
Lehmer (38), 345
Lehmer (71), 247, 263
Lehmer (85), 386
Lehner(66), 116
Leibniz, 339, 340
Le Petit Archimede (80), 342
LeVeque (77), 294, 349
Lewin (81), 382, 386
Lindemann, 274, 348
Lindemann(1882), 348
Liouville, 10, 36, 116, 348
Lipson(81), 216
Lorenz, 287, 289
Machin, 339, 341, 344
MacMahon, 67
Mahler (31), 359
Mahler (53), 368
Mahler (67), 349, 368
Meil (83), 250
Meinardus, 321
Metropolis, 341
Mignotte (74), 368
Mobius, 100
Mollerup, 89
Mordell, 187
Mordell (16), 286
Nemeth (77), 321
Newman (79), 321
Newman (82), 7, 222, 223
Newman (85), 7
Newman, M., and Shanks (84), 193
Newton, 123, 207, 208, 212, 339, 343, 356
Niven (56), 347, 348, 349
Nyvoll (78), 196
Oxtoby (80), 352
Pade, 320
Pfaff, 250
Phillips (81), 252
Picard, 119
Poincarfi, 116
Poisson, 38
Polya, 86
Rademacher (73), 56, 71, 85, 90, 286
Rainville (60), 179
Ramanathan (84), 81
Ramanujan, 69, 80, 81, 84, 100, 138, 140, 144,
146, 150, 154, 157, 158, 161, 164, 179, 184,
186, 187, 188, 194, 195, 197, 287, 295, 306,
307, 308, 309, 310, 311, 312 313, 314, 315,
386
Ramanujan (14), 158, 163, 179, 184, 186, 191,
193, 195, 313
Ramanujan (62), 80, 187, 281, 286
Rankin (77), 116
Reitwiesner, 341
Ribenboim (84), 345
Ribenboim (85), 95
Riemann, 89
Ritt (48), 226
Rogers, 80
Rouche, 317, 319
Salamin, 51, 220, 222
Salamin(76), 48, 49, 51
Sasaki, 225
Sasaki and Kanada (82), 224, 225
Schepler (50), 342
Schlafli, 147, 297
Schlomilch, 268, 269
Schneider, 348
Schoeneberg (76), 116, 123, 126
Schoeneberg (77), 247, 269
Schoeneberg (82), 247, 250
Schonhage, 211
Schonhage and Strassen (71), 211
Schroter, 111, 133
Schur, 80
Schwab, 250
Selberg and Chowla (67), 296
Shanks, D. (82), 193, 194
Shanks, D. and Wrench (62), 341
Shanks, W., 340
Shanks, W. (1853), 340
408
Author Index
Singmaster (85), 342
Slater (66), 178, 179
Stickel (85), 222, 224
Stieljes, 264
Stolarsky (75), 233
Stolarsky (80), 237
Stormer, 345
Strassen, 202, 211
Sweeney (63), 336
Tamura and Kanada (Pr), 211, 342
Tannery and Molk (1893), 138, 141
Todd (49), 345
Todd (75), 394
Todd (79), 7
Toom, 211
Trefethen (84), 321, 325
Tricomi (65), 262
van Ceulen, 342
van der Poorten, 379
Vieta, 338, 343
von Neumann, 341
Wagon (85), 342
Waldo, 342
Wallis, 338, 343
Walz (Pr), 330
Watson (29), 306
Watson (32), 140
Watson (33), 3, 59, 261
Watson (35), 286
Weber, 168, 295
Weber (08), 68, 123, 138, 140, 300
Weierstrass, 29, 30, 141
Weierstrass (67), 348
Whittaker and Watson (27), 10, 26, 27, 29, 55,
59,72
Wills, 323
Wimp (84), 7, 242, 260, 269
Winograd (80), 206
Wrench (60), 342
Wright, 80
Zucker, 100, 141, 296,297
Zucker(77), 140, 297
Zucker (79), 100, 154
Zucker (84), 75
Zucker (85), 386
Zucker and Robertson (76a), 293, 294
Zucker and Robertson (76b), 293
Subject Index
Algebraic addition theorem, 29, 30, 32
Algebraic function, 273, 276
Algebraic functional relation, 273
Algebraic integrals, 5
Algebraic number, 351, 352
Algebraic series for 1/ir and 1/K, 181-190
Algebraic transformation, 273, 274, 278
Algebraic transformation group, 273
Alpha, singular value of second kind, 152
cubic, 160
monotonicity, 153
quadratic, 158
quartic, 161
quintic, 310
recursions, 156
septic, 311
theta function form, 154
Alternating series test, 291
Archimedean mean iterations, 246, 249, 251, 252
Archimedean means, 259
Archimedean product, 246, 249, 250,
253,262
Archimedes' method, 249, 338
Arclemniscate, 25
Arclemniscate sine, 259
Arithmetic-geometric mean (AGM), 1
AGM relation, 1
complex starting values, 15
calculation of it, 341
computational performance, 330
continued fraction, 188
Gaussian, 181, 241, 257
Jacobi's identity, 35
Legendre form, 3, 45
matrix AGM, 223, 226
multidimensional AGM, 272
quartic AGM, 17, 51, 254
theta forms, 146
variant of, 254
Benson's formula, 301, 303
Bernoulli numbers, 383
Bessel function transform, 39
Best approximants:
existence, 321
uniform polynomial, 316
uniform rational, 317
Best approximation property, 373
Beta function, 24, 88
Bit complexity, 200
Borchardt's algorithm, 250, 256, 263
Brouncker's continued fraction, 339, 345
Calculating powers of x, 208
Carlson's integrals, 256
Carlson's log, 248, 257
Catalan's constant, 198, 371, 386
Cauchy's binomial theorem, 76
Class number, 141, 295
Clausen's hypergeometric product, 179,188, 189,
190
Comparable means, 244, 246, 247
Complement, 3, 178
Complementary integral, 8
Complementary modulus, 8
Complete elliptic integrals, 7
complementary integrals, 8
cubic algorithm for K, r07
differential equations, 9
E in terms of theta functions, 42
of first kind (K), 7
higher order transformations, 102ff
homogeneous forms of K and E, 12
K in terms of:
gamma function, 297, 298
theta functions, 35
moments of K and E, 198
quadratic transformations, 12
quadratically convergent algorithms, 14
409
410
Subject Index
Complete elliptic integrals (Continued)
of second kind (£), 7
series expansions of, 8
Complexity:
of algebraic functions, 215
bit, 200
elementary functions, 226
elliptic integral calculations, 227
Jacobian elliptic functions, 227
Lambert series, 98
log, 219
lower bound for log and exp, 227
operational, 201
17, 219
of theta computation, 95
transcendental functions, 219, 226
Complexity of algebraic functions, 215
Compound of means, 243
Conjugate divisors, 297
Constructibility, 347, 349-351
Continued fraction, 372-377
Contra-harmonic mean, 255
Convergents, 372
Cubic invariants, 173
Cubic modular indentities, 142
Cubic recursions for Ramanujah's invariants, 145
Diagonal mapping, 230
Dilogarithm, 381
Dirichlet class number formulae, 294, 295
Dirichlet L functions, 289
Discrete Fourier transform, 204
Discriminants, 294
Disjoint discriminants, 293
Domination, 239
e:
irrationality of, 352
transcendence of, 353, 359
Elementary symmetric mean function, 270
Elementary symmetric polynomial, 356
Ellipse, arc length of, 8
Elliptic functions, 29, 30, 31
en, 29
degenerate elliptic functions, 29
dn, 29
fundamental parallelogram, 29, 53
Jacobian, 29
lattice, 29
odd elliptic functions, 31
order, 30
period of, 29
sn, 29, 57
Elliptic integrals
(see Complete and Incomplete)
Equivalence of operations, 213
Equivalent iterations, 252
Equivalent mean, 231, 239
Estimates of e, 363
Eta-multiplier, 311, 314
of orderp, 311
Euler-Mascheroni constant, 336
Euler numbers, 383
computation of, 336
Euler's addition theorem, 18
Euler's formula, 189
Euler's identity, 64
Euler's pentagonal number theorem, 64, 66, 80,
306
Euler's totient function, 301
Evaluation problem, 204
Exponential:
algorithms for, 227
approximations to, 317, 318, 320, 321, 322,
323, 327, 330, 331, 335
series for, 320
Exp(ir), quadratic algorithm for, 50
Factorial calculation, 218
Fast base conversion, 218
Fast computation, 50
Fast Fourier transform (FFT), 204, 206, 211,212
Fast matrix multiplication, 202
Fast multiplication, 209
Fast polynomial division, 207
Fast polynomial evaluation, 208
Fast polynomial multiplication, 206
Fibonacci numbers, 94, 151, 287, 375
Fibonacci sequences and series, 91, 97. 98, 100
Finite Fourier transform, 204
Fully monotone, 291
Fundamental limit theorem, 5
first proof, 5
second proof, 6
third proof, 12
fourth proof, 21
Fundamental regions, 113
Fundamental sets, 113
Fundamental unit, 294
Galois group:
for Fp, 123
for W., and W7, 136
Gamma function, 24, 27, 28, 87, 332, 336
duplication formula, 88, 90
evaluation of K and £, 25, 27, 189, 191, 297,
298
Subject Index
411
Gauss's multiplication formula, 90
Gaussian binomial coefficients, 76
Gaussian mean iterations, 246, 249, 251, 252
Gaussian product, 246, 248, 253
Gaussian sums, 83, 86
Gelfond-Schneider theorem, 277, 278, 348
Generalized complete elliptic integrals:
of first kind, 178
of second kind, 178
transformations, 179, 180, 185
Generalized Legendre symbol, 293
Generalized singular value function, 185
Genera, 295
Geometric series, 91
Gini's means, 233, 264
Gregory's series, 339
Halley's method, 216
Height, 371
Heronian mean, 237, 269, 270
Hexagonal lattice, 292
Hexagonal sum, 292
Holder's means, 232, 235, 263, 264
Homogeneous mean, 231
Homogeneous multidimensional mean, 266
Horner's rule, 204
Hurwitz zeta function, 303, 304
Hyberbolic arclemniscate, 259
Hyberbolic function identities, 74, 75, 100
Hypergeometric functions, 332-334
Hypergeometric functions and series:
Gaussian hypergeometric series, 8
generalized, 178
hypergeometric differential equation, 11
Identric mean, 234
Incomplete elliptic integrals, 10
Eulers addition theorem, 18
of first kind, 10, 58, 60
of second kind, 10
of third kind, 10
Inhomogeneous modular group, T, 113, 117
Interpolation problem, 204
Invariance principle, 245, 246, 260
Invariants, evaluating, 293
Irrationality measures, 362
for exp(l/v), 363, 364, 371, 377
for exp(ir), 371
for log, 370, 379
for 17, 367, 368, 371
for i72, 367
for 5(3), 369, 370
Irrationality of e, 352
Irrationality of it, 352
Isotone mean, 231
Isotone multimimensional mean, 266
Jacobian elliptic function, 29
en, 29
dn, 29
half angle formula for en, 262
half angle formula for sn, 30
sn, 29, 57
Jacobi's differential equation, 20, 22
Jacobi's duplication formula for arcsl, 260
Jacobi's formula for r4(n), 81
Jacobi's identity, 35
Jacobi's imaginary quadratic transformation, 73,
180
Jacobi's triple-product identity, 62, 65, 66, 72,
301, 306
corollaries, 64, 65
finite form, 77
first proof, 62
Khinchin's theorem, 362, 371
Klein's absolute invariant, 115, 179
Kronecker symbol, 293
Kummer's indentity, 179
Lagrange interpolation formula, 321
Lambert series, 91, 99, 281, 286, 288, 300
Landen transform, 17, 57, 59
in terms of elliptic functions, 58
in terms of theta functions, 57
Laplace transform, 39
Lattice sums, chemical, 288, 290
Ugendre's relation, 24, 26, 27, 49, 152, 153
generalized, 178
theta function proof, 43
Lehmer's means, 232, 235, 263
Lemniscate sine, 5, 28
Length, 358
Lengths and measures, 358
Lindemann's theorem, 348, 357
Liouville numbers, 351, 352
Liouville's function, 100
Liouville.'s principle, 54
Liouville's summation principle, 36
Logarithm:
approximations to, 325, 327, 328, 330
bit complexity, 219, 222
calculation, 220, 221, 222, 223
complex, 222
operational complexity, 219, 222
series for, 320
theta function algorithms for, 224
412
Subject Index
Logarithmic mean, 234, 248, 261
Lucas numbers, 95, 97, 100, 287
Machin's formula, 339, 340, 341, 344
Madelung's constant, 288, 292, 301
Matrix logarithm, 224, 225, 226
Mean, 230, 231
discontinuous, 230, 238
equivalent mean, 231
homogeneous, 231, 266
isotone, 231, 266
strict, 230, 266
symmetric, 231, 266
trace of, 231
Mean iterations, 243
algebraic, 273-280
Archimedean, 246
arithmetic-harmonic, 4
Carlson's, 257
Gaussian, 246
harmonic-geometric, 4
Lehmer's 265
multidimensional, 267
pathology, 253
rates of convergence, 251, 267, 272
Schlomilch's, 268
Tricomi's, 265
Means:
classes:
Gini, 233, 264
Heronian, 237, 269, 270
Holder, 232, 235, 263, 264
identric, 234
Lehmer, 232, 235, 263
logarithmic, 234, 248, 261
Neo-Pythagorean, 255
series expansions of, 263-266
Stolarsky, 233, 236, 264
Measure, of polynomial, 359
Meinardus conjecture, 321
Mellin transform, 87, 90, 289, 290, 301
Modular equations, 103, 315
cubic, 104, 107, 109, 110
degree of, 125
of degree 15, 314
of degree 23, 133
elliptic, 112
endecadic, 106
forj, 123
for K,/(i and K,/4, 315
for X, of orderp, 121, 140
octicity of u-v form, 126, 134
quadratic, 109
quintic, 105, 107, 109, 135, 136, 297, 313
septic, 106, 112, 313
solvable, 310, 311, 313
u-v form, 126, 127-132, 134
Modular functions:
T-modular functions, 114, 118
X-modular function, 114, 118, 121, 133, 134
X-modular group, 113, 117
Modular transformations, 103
Modulus, 8
Multidimensional Archimedean iteration, 267,
269
Multidimensional Gaussian iteration, 267, 268,
270
Multidimensional invariance principle, 269
Multidimensional mean, 266, 269
Multiplier, 103, 105, 106
Multiplier of order p, 136
cubic, 138, 144, 149
of degree 13, 138
of degree 17, 138
quintic, 138, 309
septic, 138, 311
Neo-Pythagorean means, 255
Newton's identities, 356
Newton's method, 212, 214, 215, 216, 217, 218,
223
N-monotone, 291
Nome, 41
Non-prime invariants, 295
Nth convergent, 372
Operational complexity, 201
Order of convergence, 2
Pade approximant, 319, 320, 323, 325, 327, 331,
359
Partially comparable iterates, 247
Partition congruences, 84, 308
Partition function, 67, 307
Partitions of natural number, 67
Pell's equation, 376
Pendulum, period of, 8
Pentagonal number, 66, 67
Perimeter of ellipse, 8, 168, 195-197
Periodic continued fraction, 375
Pi:
AGM identities, 48, 52, 169, 197
algorithms, 46, 48, 170, 171, 175', 222, 310,
315, 335
approximations to, 168, 191, 192, 195, 197
complexity of calculating, 219
computation of, 337-342
cubic iteration, 171, 174
Subject Index
413
general iteration, 169
irrationality, 348, 352
irrationality measure, 367
legislation on, 342
mnemonic for, 342
normality of, 342
quadratic algorithm for exp(ir), 50
quadratic iteration, 170, 174
quartic iteration, 170, 174
quintic iteration, 175,. 313
septic iteration, 171, 174, 175
series for 1/ir, 181-190
transcendence of, 347, 348, 352, 354
Picard's theorem, 119
Piecewise monotone, 234
Pochhammer symbol, 178
Poisson summation formula, 36, 89
Prime number theorem, 378
Primitive binary form, 141
Properly equivalent forms, 141
Q-binomial coefficients, 76
Q-binomial theorem, 308
Quadratically attractive transformation, 277
Quadratic computation, 50
Quadratic modular equation, 102
Quadratic reciprocity, 86
Quintic multipliers, 309
Quintuple-product identity, 143, 146, 306
Ramanujan's continued fraction identity, 81
Ramanujan's invariants, 70, 179
Ramanujan's modular identity:
of order 3, 287
of order 5, 310, 312
of order 7, 312
Ramanujan's ,^, sum, 308
Ramanujan's multiplier of second kind, 157, 159
Rational mean iteration, 278
Recursion, 201
Recursion formula for r2(n), r,(n), 151, 306, 307
Reduced complexity methods, 327
acceleration based on:
binary splitting, 329
on FFT, 328
on functional equations, 327
Reversion of power series, 208
Riemann hypothesis, 90
Rising factorial, 178
Rogers-Ramanujan identities, 65, 78, 80
Bressoud's "easy proof," 78
Roth's theorem, 363
Schlafli's equation, 147
Schonhage-Strassen multiplication, 211
Schroter's formula, 111
Schwarz derivative, 20, 118
Septic multipliers, 309
Simple continued fractions, 372
Singular invariants, 141
Singular moduli, 139
Singular value function, 152
of first kind, 152
generalized, 185
of second kind, 152ff
Singular values, 26,49, 139, 140, 141, 172, 173,
296, 297
Solvability of quintic in modular terms, 135
Solvable modular equations, 310, 311, 313
Solvable numbers:
of type one, 293
of type two, 293
Square-free invariants, 295
Stirling numbers, 305
Stirling's formula, 90
Stolarsky's means, 233, 236, 264
identric, 234, 271
logarithmic, 234, 271
multidimensional, 271
Strict mean, 230
Strict multidimensional mean, 266
Strong equivalence, 241, 243
of iterations, 252
Sums of squares:
of two squares, 82, 285, 290
of three squares, 66, 151, 286
of four squares, 81
of others, 71, 287, 292, 293
Supremum norm, 316
Symmetric mean, 231
Symmetric multidimensional mean, 266
Symmetric polynomial, 356
Tan, computation of, 227
Theta functions, 10, 33, 91, 93
basic identities, 64, 67, 68,70,71, 73,74, 111
finite transformation, 86, 87
general theta functions, 52
one-dimensional heat equation, 56
theta transformation formulae, 38, 44, 54, 87
Theta series, 91
Trace of mean, 231
Transcendence of e, 348, 353, 359
Transcendence of it, 347, 348, 352, 354
Transcendental functions, 274
Transcendental number, 351, 352
Transformation of order p, 119
Transformations of complete elliptic integrals:
414
Subject Index
Transformations of complete elliptic integrals
(Continued)
algebraic transformation for K, 21
of generalized integrals, 179, 180, 185
higher order, 102f
quadratic transformations for E and K, 12, 13
quadratic transformations for K, 36
Triangular numbers, 286, 288
Trilogarithm, 381
Triple-product identity, 62, 65, 66, 72, 306
Tschirnhaus transformation, 136
Turing machines, 201
Vieta's formula, 338, 343
Wallis' formula, 338, 343
Weierstrass function, 27, 30, 141
5(2), irrationality of, 366
5(3), irrationality of, 369
£(3), series for, 190, 379
Zeta function, 87, 88
functional equation for, 89
relation to prime distribution, 90
Ultimately monotone, 235, 241
V
r A
•.\>V
-V .
d W
(.A